📊 Showing 1 result | 📏 Metric: Acc
Rank | Model | Paper | Acc | Date | Code |
---|---|---|---|---|---|
1 | GPT-4 | Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning | 30.30 | 2024-03-06 | 📦 declare-lab/puzzle-reasoning 📦 declare-lab/llm-puzzletest |