Vinoground

Dataset Information
Modalities
Videos, Texts
Languages
English
Introduced
2024
License
Unknown
Homepage

Overview

A temporal counterfactual dataset composing of 1000 short and natural video-caption pairs.

Variants: Vinoground

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Temporal Relation Extraction Qwen2-VL-7B Qwen2-VL: Enhancing Vision-Language Model's Perception … 2024-09-18
Temporal Relation Extraction Qwen2-VL-72B Qwen2-VL: Enhancing Vision-Language Model's Perception … 2024-09-18
Temporal Relation Extraction LLaVA-OneVision-Qwen2-72B LLaVA-OneVision: Easy Visual Task Transfer 2024-08-06
Temporal Relation Extraction LLaVA-OneVision-Qwen2-7B LLaVA-OneVision: Easy Visual Task Transfer 2024-08-06
Temporal Relation Extraction MiniCPM-2.6 MiniCPM-V: A GPT-4V Level MLLM … 2024-08-03
Temporal Relation Extraction InternLM-XC-2.5 (CoT) InternLM-XComposer-2.5: A Versatile Large Vision … 2024-07-03
Temporal Relation Extraction InternLM-XC-2.5 InternLM-XComposer-2.5: A Versatile Large Vision … 2024-07-03
Temporal Relation Extraction VideoLLaMA2-72B VideoLLaMA 2: Advancing Spatial-Temporal Modeling … 2024-06-11
Temporal Relation Extraction MA-LMM-Vicuna-7B MA-LMM: Memory-Augmented Large Multimodal Model … 2024-04-08
Temporal Relation Extraction Gemini-1.5-Pro (CoT) Gemini 1.5: Unlocking multimodal understanding … 2024-03-08
Temporal Relation Extraction Gemini-1.5-Pro Gemini 1.5: Unlocking multimodal understanding … 2024-03-08
Temporal Relation Extraction VTimeLLM VTimeLLM: Empower LLM to Grasp … 2023-11-30
Temporal Relation Extraction Video-LLaVA-7B Video-LLaVA: Learning United Visual Representation … 2023-11-16
Temporal Relation Extraction LanguageBind LanguageBind: Extending Video-Language Pretraining to … 2023-10-03
Temporal Relation Extraction ImageBind ImageBind: One Embedding Space To … 2023-05-09
Temporal Relation Extraction VideoCLIP VideoCLIP: Contrastive Pre-training for Zero-shot … 2021-09-28

Research Papers

Recent papers with results on this dataset: