📊 Showing 5 results | 📏 Metric: 1/4
Rank | Model | Paper | 1/4 | Date | Code |
---|---|---|---|---|---|
1 | Tem-adapter | Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer | 46.00 | 2023-08-16 | 📦 xliu443/tem-adapter |
2 | Eclipse | SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | 37.05 | 2021-03-29 | 📦 SUTDCV/SUTD-TrafficQA 📦 MarkHershey/arxiv-dl 📦 saccharomycetes/text-based-traffic-understanding |
3 | HCRN | Hierarchical Conditional Relation Networks for Video Question Answering | 36.49 | 2020-02-25 | 📦 thaolmk54/hcrn-videoqa |
4 | TVQA | TVQA: Localized, Compositional Video Question Answering | 35.16 | 2018-09-05 | 📦 jayleicn/TVQA 📦 BM-K/Question-Difficulty-Estimation 📦 mansigoel/TVQA 📦 h19920918/quiz_for_day06 |
5 | VIS+LST | Exploring Models and Data for Image Question Answering | 29.91 | 2015-05-08 | 📦 renmengye/imageqa-public 📦 abhigoyal1997/CS-763-Project 📦 moh833/VQA |