DeBERTa
|
DeBERTa: Decoding-enhanced BERT with Disentangled…
|
94.50
|
2020-06-05
|
|
T5-XXL 11B
|
Exploring the Limits of Transfer Learning with a …
|
93.20
|
2019-10-23
|
|
XLNet
|
XLNet: Generalized Autoregressive Pretraining for…
|
92.50
|
2019-06-19
|
|
ALBERT
|
ALBERT: A Lite BERT for Self-supervised Learning …
|
91.80
|
2019-09-26
|
|
T5-XL 3B
|
Exploring the Limits of Transfer Learning with a …
|
89.70
|
2019-10-23
|
|
StructBERTRoBERTa ensemble
|
StructBERT: Incorporating Language Structures int…
|
89.70
|
2019-08-13
|
|
HNNensemble
|
A Hybrid Neural Network Model for Commonsense Rea…
|
89.00
|
2019-07-27
|
|
RoBERTa (ensemble)
|
RoBERTa: A Robustly Optimized BERT Pretraining Ap…
|
89.00
|
2019-07-26
|
|
T5-Large 770M
|
Exploring the Limits of Transfer Learning with a …
|
85.60
|
2019-10-23
|
|
HNN
|
A Hybrid Neural Network Model for Commonsense Rea…
|
83.60
|
2019-07-27
|
|
T5-Base 220M
|
Exploring the Limits of Transfer Learning with a …
|
78.80
|
2019-10-23
|
|
BERTwiki 340M (fine-tuned on WSCR)
|
A Surprisingly Robust Trick for Winograd Schema C…
|
74.70
|
2019-05-15
|
|
FLAN 137B (zero-shot)
|
Finetuned Language Models Are Zero-Shot Learners
|
74.60
|
2021-09-03
|
|
BERT-large 340M (fine-tuned on WSCR)
|
A Surprisingly Robust Trick for Winograd Schema C…
|
71.90
|
2019-05-15
|
|
BERT-base 110M (fine-tuned on WSCR)
|
A Surprisingly Robust Trick for Winograd Schema C…
|
70.50
|
2019-05-15
|
|
FLAN 137B (few-shot, k=4)
|
Finetuned Language Models Are Zero-Shot Learners
|
70.40
|
2021-09-03
|
|
T5-Small 60M
|
Exploring the Limits of Transfer Learning with a …
|
69.20
|
2019-10-23
|
|
ERNIE 2.0 Large
|
ERNIE 2.0: A Continual Pre-training Framework for…
|
67.80
|
2019-07-29
|
|
SqueezeBERT
|
SqueezeBERT: What can computer vision teach NLP a…
|
65.10
|
2020-06-19
|
|
BERT-large 340M
|
BERT: Pre-training of Deep Bidirectional Transfor…
|
65.10
|
2018-10-11
|
|
RWKV-4-Raven-14B
|
RWKV: Reinventing RNNs for the Transformer Era
|
49.30
|
2023-05-22
|
|
DistilBERT 66M
|
DistilBERT, a distilled version of BERT: smaller,…
|
44.40
|
2019-10-02
|
|