GPT-4o
|
GPT-4o System Card
|
37.45
|
2024-10-25
|
|
Qwen2.5-VL-7B
|
Qwen2.5-VL Technical Report
|
35.91
|
2025-02-19
|
|
InternVL2.5-26B
|
Expanding Performance Boundaries of Open-Source M…
|
30.50
|
2024-12-06
|
|
Qwen2-VL-7B
|
Qwen2-VL: Enhancing Vision-Language Model's Perce…
|
27.80
|
2024-09-18
|
|
InternVL2.5-8B
|
Expanding Performance Boundaries of Open-Source M…
|
21.24
|
2024-12-06
|
|
LLaVA-Video-7B
|
Video Instruction Tuning With Synthetic Data
|
18.53
|
2024-10-03
|
|
mPLUG-Owl3-7B
|
mPLUG-Owl3: Towards Long Image-Sequence Understan…
|
17.37
|
2024-08-09
|
|
LLaVA-OneVision-7B
|
LLaVA-OneVision: Easy Visual Task Transfer
|
16.60
|
2024-08-06
|
|
LongVA-7B
|
Long Context Transfer from Language to Vision
|
14.29
|
2024-06-24
|
|