VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs

April 7, 2026 ยท View on GitHub

Naishan Zheng, Qingpei Guo, Jie Huang, Feng Zhao

University of Science and Technology of China, Ant Group