MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning
Yueqian Wang, Songxiang Liu, Disong Wang, Nuo Xu, Guanglu Wan, Huishuai Zhang, Dongyan Zhao, arXiv:2512.06810 (2025)
LongCat-Flash-Omni Technical Report
Team, Meituan LongCat, arXiv:2511.00279 (2025)
Kimi-VL Technical Report
Team, Kimi, arXiv:2504.07491 (2025)
PARAGRAPH2GRAPH: A GNN-based Framework for Layout Paragraph Analysis
Wei, Shu & Nuo Xu, arXiv:2304.11810 (2023)