UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks

Published in arXiv preprint arXiv:2507.11336, 2025

Recommended citation: Wu, P., Liu, Y., Zhu, Z., Zhou, E. & Shen, J. (2025). UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks. arXiv:2507.11336. https://arxiv.org/abs/2507.11336

Peiran Wu, Yunze Liu, Zhengdong Zhu, Enmin Zhou, Junxiao Shen

An omni captioning model and new benchmarks for detailed captioning of user-generated video content across diverse domains.