ColonGPT (w/ LoRA, w/o extra data)
|
Frontiers in Intelligent Colonoscopy
|
94.06
|
2024-10-22
|
|
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
93.84
|
2023-06-01
|
|
MobileVLM-1.7B (w/ LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
93.64
|
2023-12-28
|
|
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
93.62
|
2023-06-01
|
|
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
93.52
|
2023-06-01
|
|
LLaVA-v1.5 (w/ LoRA, w/ extra data)
|
Improved Baselines with Visual Instruction Tuning
|
93.33
|
2023-10-05
|
|
MGM-2B (w/o LoRA, w/ extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
93.24
|
2024-03-27
|
|
MobileVLM-1.7B (w/o LoRA, w/ extra data)
|
MobileVLM : A Fast, Strong and Open Vision Langua…
|
93.02
|
2023-12-28
|
|
LLaVA-v1.5 (w/ LoRA, w/o extra data)
|
Improved Baselines with Visual Instruction Tuning
|
92.97
|
2023-10-05
|
|
MGM-2B (w/o LoRA, w/o extra data)
|
Mini-Gemini: Mining the Potential of Multi-modali…
|
92.97
|
2024-03-27
|
|
Bunny-v1.0-3B (w/ LoRA, w/ extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
92.47
|
2024-02-18
|
|
MiniGPT-v2 (w/ LoRA, w/o extra data)
|
MiniGPT-v2: large language model as a unified int…
|
91.49
|
2023-10-14
|
|
Bunny-v1.0-3B (w/ LoRA, w/o extra data)
|
Efficient Multimodal Learning from Data-centric P…
|
91.16
|
2024-02-18
|
|
MiniGPT-v2 (w/ LoRA, w/ extra data)
|
MiniGPT-v2: large language model as a unified int…
|
90.00
|
2023-10-14
|
|
LLaVA-v1 (w/ LoRA, w/ extra data)
|
Visual Instruction Tuning
|
89.61
|
2023-04-17
|
|
LLaVA-v1 (w/ LoRA, w/o extra data)
|
Visual Instruction Tuning
|
87.86
|
2023-04-17
|
|
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)
|
LLaVA-Med: Training a Large Language-and-Vision A…
|
87.22
|
2023-06-01
|
|