ML Research Wiki / Benchmarks / Visual Question Answering (VQA) / GRIT

GRIT

Visual Question Answering (VQA) Benchmark

Performance Over Time

📊 Showing 2 results | 📏 Metric: VQA (ablation)

Top Performing Models

Rank Model Paper VQA (ablation) Date Code
1 Unified-IOXL Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks 74.50 2022-06-17 -
2 GPV-2 Webly Supervised Concept Expansion for General Purpose Vision Models 63.20 2022-02-04 -

All Papers (2)