ColonGPT (w/ LoRA, w/o extra data)
|
Frontiers in Intelligent Colonoscopy
|
83.24
|
2024-10-22
|
|
LLaVA-v1.5
(w/ LoRA, w/ extra data)
|
Improved Baselines with Visual Instruction Tuning
|
80.89
|
2023-10-05
|
|
MobileVLM-1.7B
(w/ LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
80.44
|
2023-12-28
|
|
Bunny-v1.0-3B
(w/ LoRA, w/ extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
79.50
|
2024-02-18
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
79.24
|
2023-06-01
|
|
LLaVA-v1.5
(w/ LoRA, w/o extra data)
|
Improved Baselines with Visual Instruction Tuning
|
79.10
|
2023-10-05
|
|
MGM-2B
(w/o LoRA, w/o extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
78.99
|
2024-03-27
|
|
MobileVLM-1.7B
(w/o LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
78.75
|
2023-12-28
|
|
MGM-2B
(w/o LoRA, w/ extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
78.69
|
2024-03-27
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
78.04
|
2023-06-01
|
|
MiniGPT-v2
(w/ LoRA, w/o extra data)
|
MiniGPT-v2: large language model as a unified int…
|
77.93
|
2023-10-14
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
77.38
|
2023-06-01
|
|
MiniGPT-v2
(w/ LoRA, w/ extra data)
|
MiniGPT-v2: large language model as a unified int…
|
76.82
|
2023-10-14
|
|
Bunny-v1.0-3B
(w/ LoRA, w/o extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
75.50
|
2024-02-18
|
|
LLaVA-v1
(w/ LoRA, w/o extra data)
|
Visual Instruction Tuning
|
72.08
|
2023-04-17
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
66.51
|
2023-06-01
|
|
LLaVA-v1
(w/ LoRA, w/ extra data)
|
Visual Instruction Tuning
|
42.17
|
2023-04-17
|
|