VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
arXiv preprint arXiv:2503.12332 · March 2025
Citation
Liu, Y., Wu, P., Liang, C., Shen, J., Wang, L. & Yi, L. (2025). VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining. arXiv:2503.12332.
Yunze Liu, Peiran Wu, Cheng Liang, Junxiao Shen, Limin Wang, Li Yi
Scalable Mamba-based autoregressive pretraining recipe for long-form video understanding.