📊 Showing 6 results | 📏 Metric: Execution Accuracy
Rank | Model | Paper | Execution Accuracy | Date | Code |
---|---|---|---|---|---|
1 | APOLLO | APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning | 71.07 | 2022-12-14 | 📦 gasolsun36/iter-cot 📦 gasolsun36/dynamicrag 📦 gasolsun36/apollo |
2 | ELASTIC (RoBERTa-large) | ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler | 68.96 | 2022-10-18 | 📦 neurasearch/neurips-2022-submission-3358 |
3 | GPT-4 (8k) | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | 68.79 | 2023-05-10 | - |
4 | FinQANet (RoBERTa-large) | FinQA: A Dataset of Numerical Reasoning over Financial Data | 65.05 | 2021-09-01 | 📦 czyssrs/finqa |
5 | FinQANet (BERT-large) | FinQA: A Dataset of Numerical Reasoning over Financial Data | 57.43 | 2021-09-01 | 📦 czyssrs/finqa |
6 | FinQANet (FinBert ) | FinQA: A Dataset of Numerical Reasoning over Financial Data | 53.71 | 2021-09-01 | 📦 czyssrs/finqa |