MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

International Conference on Learning Representations (ICLR) 2026 · January 2026

Citation

Wu, P., Yu, Z., Liu, Y., Wu, C.H., Zhou, E., Shen, J. (2026). MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding. In International Conference on Learning Representations.

Peiran Wu, Zihong Yu, Yiling Liu, Ching-Hui Wu, Enmin Zhou, Junxiao Shen

This work presents MARC, a novel approach for efficient video understanding through memory-augmented reinforcement learning token compression, enabling better processing of long-form video content.

← Back to publications