ML Research Wiki / Benchmarks / Image Classification / ColonINST-v1 (Unseen)

ColonINST-v1 (Unseen)

Image Classification Benchmark

Performance Over Time

📊 Showing 17 results | 📏 Metric: Accuray

Top Performing Models

Rank Model Paper Accuray Date Code
1 ColonGPT (w/ LoRA, w/o extra data) Frontiers in Intelligent Colonoscopy 83.24 2024-10-22 📦 ai4colonoscopy/intelliscope
2 LLaVA-v1.5 (w/ LoRA, w/ extra data) Improved Baselines with Visual Instruction Tuning 80.89 2023-10-05 📦 huggingface/transformers 📦 haotian-liu/LLaVA 📦 LLaVA-VL/LLaVA-NeXT
3 MobileVLM-1.7B (w/ LoRA, w/ extra data) MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices 80.44 2023-12-28 📦 meituan-automl/mobilevlm
4 Bunny-v1.0-3B (w/ LoRA, w/ extra data) Efficient Multimodal Learning from Data-centric Perspective 79.50 2024-02-18 📦 baai-dcai/bunny
5 LLaVA-Med-v1.5 (w/ LoRA, w/o extra data) LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day 79.24 2023-06-01 📦 microsoft/LLaVA-Med
6 LLaVA-v1.5 (w/ LoRA, w/o extra data) Improved Baselines with Visual Instruction Tuning 79.10 2023-10-05 📦 huggingface/transformers 📦 haotian-liu/LLaVA 📦 LLaVA-VL/LLaVA-NeXT
7 MGM-2B (w/o LoRA, w/o extra data) Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models 78.99 2024-03-27 📦 dvlab-research/MGM 📦 dvlab-research/minigemini
8 MobileVLM-1.7B (w/o LoRA, w/ extra data) MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices 78.75 2023-12-28 📦 meituan-automl/mobilevlm
9 MGM-2B (w/o LoRA, w/ extra data) Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models 78.69 2024-03-27 📦 dvlab-research/MGM 📦 dvlab-research/minigemini
10 LLaVA-Med-v1.0 (w/o LoRA, w/o extra data) LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day 78.04 2023-06-01 📦 microsoft/LLaVA-Med

All Papers (17)