
[Literature CS-LVLM-EN-20231116] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

November 20, 2025 #LVLM (Large Vision-Language Models)

Literature index number:

https://doi.org/10.48550/arXiv.2311.10122

Download: Video-LLaVA Learning United Visual Representation by Alignment Before Projection


By Li Xinghai (李星海)

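Bio: 2025-present, Zhejiang A&F University | 2022-present, 广州白蓝碗蛋科技有限公司 (a technology company in Guangzhou) | 2022-2024, Guangzhou College of Commerce | 2019-2022, Guangdong Polytechnic of Industry and Commerce | Service motto: "心始至客，行亦致远" (roughly: care for the customer from the heart, and the work will go far).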

[Literature CS-LLM-EN-20241204] Video LLMs for Temporal Reasoning in Long Videos

November 20, 2025

[Literature CS-NEP-EN-20250528] Fostering Video Reasoning via Next-Event Prediction

November 20, 2025

[Literature MLR-ML-EN-20241104] Chronos: Learning the Language of Time Series

November 20, 2025