ML Research Wiki / Benchmarks / Emotion Interpretation / EIBench

EIBench

Emotion Interpretation Benchmark

Performance Over Time

📊 Showing 13 results | 📏 Metric: Recall

Top Performing Models

Rank Model Paper Recall Date Code
1 Claude-3-haiku Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 63.24 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
2 LLaVA-1.5 (13B) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 54.37 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
3 LLaVA-NEXT (13B) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 54.33 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
4 Claude-3-sonnet Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 54.10 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
5 LLaVA-NEXT (7B) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 53.82 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
6 MiniGPT-v2 Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 52.89 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
7 ChatGPT-4o Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 49.99 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
8 Video-LLaVA Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 49.26 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
9 LLaVA-NEXT (34B) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 49.03 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench
10 ChatGPT-4V Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models 46.86 2025-04-10 📦 Lum1104/MER-Factory 📦 lum1104/eibench

All Papers (13)