CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video

Published in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026, 2026

Recommended citation: Wang, X., Katsenou, A., Shen, J., Bull, D. (2026). CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video. In IEEE/CVF Winter Conference on Applications of Computer Vision. https://arxiv.org/abs/2511.07290

Xinyao Wang, Angeliki Katsenou, Junxiao Shen, David Bull

Novel approach to video quality assessment using caption-embedded multimodal perception for compressed video analysis without reference frames.