ML Research Wiki / Benchmarks / Visual Question Answering (VQA) / CLEVR

CLEVR

Visual Question Answering (VQA) Benchmark

Performance Over Time

📊 Showing 15 results | 📏 Metric: Accuracy

Top Performing Models

Rank Model Paper Accuracy Date Code
1 NS-VQA (1K programs) Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding 99.80 2018-10-04 📦 kexinyi/ns-vqa 📦 nerdimite/neuro-symbolic-ai-soc
2 MDETR MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding 99.70 2021-04-26 📦 facebookresearch/multimodal 📦 ashkamath/mdetr 📦 thunlp/pevl 📦 b-faye/lightmdetr 📦 AleDella/mdter_eval
3 NeSyCoCo NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization 99.70 2024-12-20 📦 hlr/nesycoco
4 OCCAM (ours) Interpretable Visual Reasoning via Induced Symbolic Space 99.40 2020-11-23 📦 SHI-Labs/Interpretable-Visual-Reasoning
5 TbD + reg + hres Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning 99.10 2018-03-14 📦 davidmascharka/tbd-nets
6 NS-CL The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision 98.90 2019-04-26 📦 vacancy/NSCL-PyTorch-Release 📦 nerdimite/neuro-symbolic-ai-soc
7 MAC Compositional Attention Networks for Machine Reasoning 98.90 2018-03-08 📦 stanfordnlp/mac-network 📦 rosinality/mac-network-pytorch 📦 Glaciohound/VCML
8 CNN + LSTM + RN + HAN Learning Visual Question Answering by Bootstrapping Hard Attention 98.80 2018-08-01 📦 lienchibao1998/new
9 DDRprog* DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer 98.30 2018-03-30 -
10 single-hop + LCGN (ours) Language-Conditioned Graph Networks for Relational Reasoning 97.90 2019-05-10 📦 ronghanghu/lcgn

All Papers (15)