ALBERT (Ensemble)
|
Improving Machine Reading Comprehension with Sing…
|
91.40
|
2020-11-06
|
|
Megatron-BERT (ensemble)
|
Megatron-LM: Training Multi-Billion Parameter Lan…
|
90.90
|
2019-09-17
|
|
ALBERTxxlarge+DUMA(ensemble)
|
DUMA: Reading Comprehension with Transposition Th…
|
89.80
|
2020-01-26
|
|
Megatron-BERT
|
Megatron-LM: Training Multi-Billion Parameter Lan…
|
89.50
|
2019-09-17
|
|
DeBERTalarge
|
DeBERTa: Decoding-enhanced BERT with Disentangled…
|
86.80
|
2020-06-05
|
|
B10-10-10
|
Funnel-Transformer: Filtering out Sequential Redu…
|
85.70
|
2020-06-05
|
|
XLNet
|
XLNet: Generalized Autoregressive Pretraining for…
|
84.00
|
2019-06-19
|
|
RoBERTa
|
RoBERTa: A Robustly Optimized BERT Pretraining Ap…
|
83.20
|
2019-07-26
|
|
Orca 2-13B
|
Orca 2: Teaching Small Language Models How to Rea…
|
82.87
|
2023-11-18
|
|
Orca 2-7B
|
Orca 2: Teaching Small Language Models How to Rea…
|
80.79
|
2023-11-18
|
|
HAT (Encoder)
|
Hierarchical Learning for Generation with Long So…
|
67.30
|
2021-04-15
|
|
GPT-3 175B (0-shot)
|
Language Models are Few-Shot Learners
|
58.40
|
2020-05-28
|
|
LLaMA 65B (zero-shot)
|
LLaMA: Open and Efficient Foundation Language Mod…
|
51.60
|
2023-02-27
|
|
PaLM 540B (zero-shot)
|
PaLM: Scaling Language Modeling with Pathways
|
49.10
|
2022-04-05
|
|
LLaMA 33B (zero-shot)
|
LLaMA: Open and Efficient Foundation Language Mod…
|
48.30
|
2023-02-27
|
|
PaLM 62B (zero-shot)
|
PaLM: Scaling Language Modeling with Pathways
|
47.50
|
2022-04-05
|
|
LLaMA 13B (zero-shot)
|
LLaMA: Open and Efficient Foundation Language Mod…
|
47.20
|
2023-02-27
|
|
LLaMA 7B (zero-shot)
|
LLaMA: Open and Efficient Foundation Language Mod…
|
46.90
|
2023-02-27
|
|
GPT-3 175B (zero-shot)
|
Language Models are Few-Shot Learners
|
45.50
|
2020-05-28
|
|
PaLM 8B (zero-shot)
|
PaLM: Scaling Language Modeling with Pathways
|
42.30
|
2022-04-05
|
|
Bloomberg GPT (one-shot)
|
BloombergGPT: A Large Language Model for Finance
|
41.74
|
2023-03-30
|
|
BLOOM 176B (one-shot)
|
BloombergGPT: A Large Language Model for Finance
|
39.14
|
2023-03-30
|
|
OPT 66B (one-shot)
|
BloombergGPT: A Large Language Model for Finance
|
37.02
|
2023-03-30
|
|
GPT-NeoX (one-shot)
|
BloombergGPT: A Large Language Model for Finance
|
34.33
|
2023-03-30
|
|