LingOly

Logical Reasoning Benchmark

📊 Showing 11 results | 📏 Metric: Delta_NoContext

Top Performing Models

| Rank | Model | Paper | Delta_NoContext | Date | Code |
|------|-------|-------|-----------------|------|------|
| 1 | Claude Opus | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 2 | GPT-4o | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 3 | Gemini 1.5 Pro | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 4 | GPT-4 | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 5 | Command R+ | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 6 | GPT-3.5 | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 7 | Mixtral 8x7B | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 8 | Llama 3 8B | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 9 | Llama 3 70B | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |
| 10 | Gemma 7B | LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | 0.00 | 2024-06-10 | 📦 am-bean/lingOly |

All Papers (11)