ColonGPT (w/ LoRA, w/o extra data)
|
Frontiers in Intelligent Colonoscopy
|
80.18
|
2024-10-22
|
|
MobileVLM-1.7B
(w/ LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
78.03
|
2023-12-28
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
75.25
|
2023-06-01
|
|
Bunny-v1.0-3B
(w/ LoRA, w/ extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
75.08
|
2024-02-18
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
75.07
|
2023-06-01
|
|
MGM-2B
(w/o LoRA, w/ extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
74.30
|
2024-03-27
|
|
MobileVLM-1.7B
(w/o LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
73.14
|
2023-12-28
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
73.05
|
2023-06-01
|
|
LLaVA-v1.5
(w/ LoRA, w/ extra data)
|
Improved Baselines with Visual Instruction Tuning
|
72.88
|
2023-10-05
|
|
MiniGPT-v2
(w/ LoRA, w/o extra data)
|
MiniGPT-v2: large language model as a unified int…
|
72.05
|
2023-10-14
|
|
LLaVA-v1.5
(w/ LoRA, w/o extra data)
|
Improved Baselines with Visual Instruction Tuning
|
70.38
|
2023-10-05
|
|
MiniGPT-v2
(w/ LoRA, w/ extra data)
|
MiniGPT-v2: large language model as a unified int…
|
70.23
|
2023-10-14
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
70.00
|
2023-06-01
|
|
MGM-2B
(w/o LoRA, w/o extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
69.81
|
2024-03-27
|
|
Bunny-v1.0-3B
(w/ LoRA, w/o extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
69.45
|
2024-02-18
|
|
LLaVA-v1
(w/ LoRA, w/o extra data)
|
Visual Instruction Tuning
|
68.11
|
2023-04-17
|
|
LLaVA-v1
(w/ LoRA, w/ extra data)
|
Visual Instruction Tuning
|
46.85
|
2023-04-17
|
|