ColonGPT (w/ LoRA, w/o extra data)
|
Frontiers in Intelligent Colonoscopy
|
99.96
|
2024-10-22
|
|
LLaVA-v1.5
(w/ LoRA, w/ extra data)
|
Improved Baselines with Visual Instruction Tuning
|
99.32
|
2023-10-05
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
99.30
|
2023-06-01
|
|
MGM-2B
(w/o LoRA, w/ extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
98.75
|
2024-03-27
|
|
LLaVA-v1.5
(w/ LoRA, w/o extra data)
|
Improved Baselines with Visual Instruction Tuning
|
98.58
|
2023-10-05
|
|
MGM-2B
(w/o LoRA, w/o extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
98.17
|
2024-03-27
|
|
MobileVLM-1.7B
(w/ LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
97.87
|
2023-12-28
|
|
MobileVLM-1.7B
(w/o LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
97.78
|
2023-12-28
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
97.74
|
2023-06-01
|
|
LLaVA-Med-v1.0
(w/o LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
97.35
|
2023-06-01
|
|
Bunny-v1.0-3B
(w/ LoRA, w/o extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
96.61
|
2024-02-18
|
|
Bunny-v1.0-3B
(w/ LoRA, w/ extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
96.02
|
2024-02-18
|
|
MiniGPT-v2
(w/ LoRA, w/o extra data)
|
MiniGPT-v2: large language model as a unified int…
|
94.69
|
2023-10-14
|
|
LLaVA-Med-v1.5
(w/ LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
90.40
|
2023-06-01
|
|
MiniGPT-v2
(w/ LoRA, w/ extra data)
|
MiniGPT-v2: large language model as a unified int…
|
87.65
|
2023-10-14
|
|
LLaVA-v1
(w/ LoRA, w/ extra data)
|
Visual Instruction Tuning
|
86.87
|
2023-04-17
|
|
LLaVA-v1
(w/ LoRA, w/o extra data)
|
Visual Instruction Tuning
|
84.55
|
2023-04-17
|
|