CrowS-Pairs

Stereotypical Bias Analysis Benchmark

Performance Over Time

[Figure: Gender score over time, 4 results]

Top Performing Models

Rank | Model | Paper | Gender | Date | Code
1 | LLaMA 65B | LLaMA: Open and Efficient Foundation Language Models | 70.10 | 2023-02-27 | huggingface/transformers, ggml-org/llama.cpp, ggerganov/llama.cpp
2 | GAL 120B | Galactica: A Large Language Model for Science | 69.00 | 2022-11-16 | paperswithcode/galai
3 | OPT-175B | OPT: Open Pre-trained Transformer Language Models | 67.80 | 2022-05-02 | facebookresearch/metaseq, pku-alignment/safe-rlhf, liangyuwang/zo2
4 | GPT-3 | OPT: Open Pre-trained Transformer Language Models | 64.40 | 2022-05-02 | facebookresearch/metaseq, pku-alignment/safe-rlhf, liangyuwang/zo2