ML Research Wiki / Benchmarks / Visual Question Answering / ViP-Bench

ViP-Bench

Visual Question Answering Benchmark

Performance Over Time

📊 Showing 13 results | 📏 Metric: GPT-4 score (bbox)

Top Performing Models

Rank Model Paper GPT-4 score (bbox) Date Code
1 GPT-4V-turbo-detail:high (Visual Prompt) GPT-4 Technical Report 60.70 2023-03-15 📦 openai/evals 📦 shmsw25/factscore 📦 unispac/visual-adversarial-examples-jailbreak-large-language-models
2 GPT-4V-turbo-detail:low (Visual Prompt) GPT-4 Technical Report 52.80 2023-03-15 📦 openai/evals 📦 shmsw25/factscore 📦 unispac/visual-adversarial-examples-jailbreak-large-language-models
3 LLaVA-NeXT-Inst-IT-Qwen2-7B (Visual Prompt 📚 Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning 50.50 2024-12-04 📦 inst-it/inst-it
4 ViP-LLaVA-13B (Visual Prompt) Making Large Language Models Better Data Creators 48.30 2023-10-31 📦 microsoft/llm-data-creation
5 LLaVA-1.5-13B (Coordinates) Improved Baselines with Visual Instruction Tuning 47.10 2023-10-05 📦 huggingface/transformers 📦 haotian-liu/LLaVA 📦 LLaVA-VL/LLaVA-NeXT
6 Qwen-VL-Chat (Coordinates) Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond 45.30 2023-08-24 📦 qwenlm/qwen-vl 📦 brandon3964/multimodal-task-vector
7 LLaVA-NeXT-Inst-IT-Vicuna-7B (Visual Prompt 📚 Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning 45.10 2024-12-04 📦 inst-it/inst-it
8 LLaVA-1.5-13B (Visual Prompt) Improved Baselines with Visual Instruction Tuning 41.80 2023-10-05 📦 huggingface/transformers 📦 haotian-liu/LLaVA 📦 LLaVA-VL/LLaVA-NeXT
9 Qwen-VL-Chat (Visual Prompt) Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond 39.20 2023-08-24 📦 qwenlm/qwen-vl 📦 brandon3964/multimodal-task-vector
10 InstructBLIP-13B (Visual Prompt) InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning 35.80 2023-05-11 📦 salesforce/lavis 📦 tabtoyou/kollava 📦 pwc-1/Paper-9 📦 MS-P3/code3

All Papers (13)