ML Research Wiki / Benchmarks / Temporal Relation Extraction / Vinoground

Vinoground

Temporal Relation Extraction Benchmark

Performance Over Time

📊 Showing 16 results | 📏 Metric: Text Score

Top Performing Models

Rank Model Paper Text Score Date Code
1 Qwen2-VL-72B Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution 50.40 2024-09-18 📦 qwenlm/qwen2-vl 📦 qwenlm/qwen2.5-vl 📦 juruobenruo/DexVLA
2 LLaVA-OneVision-Qwen2-72B LLaVA-OneVision: Easy Visual Task Transfer 48.40 2024-08-06 📦 evolvinglmms-lab/lmms-eval 📦 MindSpore-scientific-2/code-14
3 LLaVA-OneVision-Qwen2-7B LLaVA-OneVision: Easy Visual Task Transfer 41.60 2024-08-06 📦 evolvinglmms-lab/lmms-eval 📦 MindSpore-scientific-2/code-14
4 Qwen2-VL-7B Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution 40.20 2024-09-18 📦 qwenlm/qwen2-vl 📦 qwenlm/qwen2.5-vl 📦 juruobenruo/DexVLA
5 Gemini-1.5-Pro (CoT) Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context 37.00 2024-03-08 📦 dlvuldet/primevul
6 VideoLLaMA2-72B VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs 36.20 2024-06-11 📦 damo-nlp-sg/videollama2 📦 damo-nlp-sg/videollama3 📦 damo-nlp-sg/inf-clip
7 Gemini-1.5-Pro Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context 35.80 2024-03-08 📦 dlvuldet/primevul
8 MiniCPM-2.6 MiniCPM-V: A GPT-4V Level MLLM on Your Phone 32.60 2024-08-03 📦 openbmb/minicpm-v 📦 OpenBMB/MiniCPM-o
9 InternLM-XC-2.5 (CoT) InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output 30.80 2024-07-03 📦 internlm/internlm-xcomposer
10 InternLM-XC-2.5 InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output 28.80 2024-07-03 📦 internlm/internlm-xcomposer

All Papers (16)