GPT-2 (small)
|
StereoSet: Measuring stereotypical bias in pretra…
|
72.97
|
2020-04-20
|
|
XLNet (large)
|
StereoSet: Measuring stereotypical bias in pretra…
|
72.03
|
2020-04-20
|
|
GPT-2 (medium)
|
StereoSet: Measuring stereotypical bias in pretra…
|
71.73
|
2020-04-20
|
|
BERT (base)
|
StereoSet: Measuring stereotypical bias in pretra…
|
71.21
|
2020-04-20
|
|
GPT-2 (large)
|
StereoSet: Measuring stereotypical bias in pretra…
|
70.54
|
2020-04-20
|
|
BERT (large)
|
StereoSet: Measuring stereotypical bias in pretra…
|
69.89
|
2020-04-20
|
|
RoBERTa (base)
|
StereoSet: Measuring stereotypical bias in pretra…
|
67.50
|
2020-04-20
|
|
GAL 120B
|
Galactica: A Large Language Model for Science
|
65.60
|
2022-11-16
|
|
XLNet (base)
|
StereoSet: Measuring stereotypical bias in pretra…
|
62.10
|
2020-04-20
|
|
GPT-3 (text-davinci-002)
|
Galactica: A Large Language Model for Science
|
60.80
|
2022-11-16
|
|
OPT 175B
|
Galactica: A Large Language Model for Science
|
60.00
|
2022-11-16
|
|