• 周一. 11 月 24th, 2025

[文献CS-LVLM-EN-20231116]Video-LLaVA: Learning United Visual Representation by Alignment Before Projection