| PPLLaVA-7B-dpo | PPLLaVA: Varied Video Sequence Understanding With… | 3.73 | 2024-11-04 |
| VLM-RLAIF | Tuning Large Multimodal Models for Videos using R… | 3.49 | 2024-02-06 |
| TS-LLaVA-34B | TS-LLaVA: Constructing Visual Tokens through Thum… | 3.38 | 2024-11-17 |
| PLLaVA-34B | PLLaVA : Parameter-free LLaVA Extension from Imag… | 3.32 | 2024-04-25 |
| PPLLaVA-7B | PPLLaVA: Varied Video Sequence Understanding With… | 3.32 | 2024-11-04 |
| SlowFast-LLaVA-34B | SlowFast-LLaVA: A Strong Training-Free Baseline f… | 3.32 | 2024-07-22 |
| VideoGPT+ | VideoGPT+: Integrating Image and Video Encoders f… | 3.28 | 2024-06-13 |
| IG-VLM-GPT4v | An Image Grid Can Be Worth a Video: Zero-shot Vid… | 3.17 | 2024-03-27 |
| ST-LLM-7B | ST-LLM: Large Language Models Are Effective Tempo… | 3.15 | 2024-03-30 |
| VideoChat2_HD_mistral | MVBench: A Comprehensive Multi-modal Video Unders… | 3.10 | 2023-11-28 |
| CAT-7B | CAT: Enhancing Multimodal Large Language Model to… | 3.07 | 2024-03-07 |
| LITA-13B | LITA: Language Instructed Temporal-Localization A… | 3.04 | 2024-03-27 |
| LLaMA-VID-13B (2 Token) | LLaMA-VID: An Image is Worth 2 Tokens in Large La… | 2.99 | 2023-11-28 |
| Chat-UniVi | Chat-UniVi: Unified Visual Representation Empower… | 2.99 | 2023-11-14 |
| VideoChat2 | MVBench: A Comprehensive Multi-modal Video Unders… | 2.98 | 2023-11-28 |
| LLaMA-VID-7B (2 Token) | LLaMA-VID: An Image is Worth 2 Tokens in Large La… | 2.89 | 2023-11-28 |
| VTimeLLM | VTimeLLM: Empower LLM to Grasp Video Moments | 2.85 | 2023-11-30 |
| BT-Adapter | BT-Adapter: Video Conversation is Feasible Withou… | 2.69 | 2023-09-27 |
| BT-Adapter (zero-shot) | BT-Adapter: Video Conversation is Feasible Withou… | 2.46 | 2023-09-27 |
| Video-ChatGPT | Video-ChatGPT: Towards Detailed Video Understandi… | 2.38 | 2023-06-08 |
| Video Chat | VideoChat: Chat-Centric Video Understanding | 2.29 | 2023-05-10 |
| LLaMA Adapter | LLaMA-Adapter V2: Parameter-Efficient Visual Inst… | 2.16 | 2023-04-28 |
| Video LLaMA | Video-LLaMA: An Instruction-tuned Audio-Visual La… | 1.98 | 2023-06-05 |