iVQA

Video Question Answering Benchmark

Performance Over Time

Showing 7 results | Metric: Accuracy

Rank	Model	Paper	Accuracy	Date	Code
1	Text + Text (no Multimodal Pretext Training)	Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval	40.20	2022-06-05	📦 xudonglinthu/upgradable-multimodal-intelligence
2	FrozenBiLM 📚	Zero-Shot Video Question Answering via Frozen Bidirectional Language Models	39.60	2022-06-16	📦 antoyang/FrozenBiLM 📦 klauscc/dam 📦 sts-vlcc/sts-vlcc
3	VideoCoCa 📚	VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners	39.00	2022-12-09	-
4	Co-Tokenization	Video Question Answering with Iterative Video-Text Co-Tokenization	38.20	2022-08-01	-
5	Just Ask (fine-tune)	Just Ask: Learning to Answer Questions from Millions of Narrated Videos	35.40	2020-12-01	📦 antoyang/just-ask
6	FrozenBiLM (0-shot)	Zero-Shot Video Question Answering via Frozen Bidirectional Language Models	26.80	2022-06-16	📦 antoyang/FrozenBiLM 📦 klauscc/dam 📦 sts-vlcc/sts-vlcc
7	Just Ask (0-shot)	Just Ask: Learning to Answer Questions from Millions of Narrated Videos	12.20	2020-12-01	📦 antoyang/just-ask

[Chart: Performance Over Time. Accuracy of each model in the table above, plotted by publication year (2020 to 2022).]