ML Research Wiki / Benchmarks / Language Modelling / LAMBADA

LAMBADA

Language Modelling Benchmark

Performance Over Time

📊 Showing 34 results | 📏 Metric: Accuracy

Top Performing Models

Rank Model Paper Accuracy Date Code
1 PaLM-540B (Few-Shot) PaLM: Scaling Language Modeling with Pathways 89.70 2022-04-05 📦 lucidrains/CoCa-pytorch 📦 lucidrains/PaLM-pytorch 📦 google/paxml
2 PaLM 2-L (one-shot) PaLM 2 Technical Report 86.90 2023-05-17 📦 eternityyw/tram-benchmark
3 GPT-3 175B (Few-Shot) Language Models are Few-Shot Learners 86.40 2020-05-28 📦 ggml-org/llama.cpp 📦 ggerganov/llama.cpp 📦 karpathy/llm.c
4 LLaMA-65B+CFG (Zero-Shot) Stay on topic with Classifier-Free Guidance 84.00 2023-06-30 -
5 LLaMA-30B+CFG (zero-shot) Stay on topic with Classifier-Free Guidance 83.90 2023-06-30 -
6 PaLM 2-M (one-shot) PaLM 2 Technical Report 83.70 2023-05-17 📦 eternityyw/tram-benchmark
7 LLaMA-13B+CFG (zero-shot) Stay on topic with Classifier-Free Guidance 82.20 2023-06-30 -
8 PaLM-540B (One-Shot) PaLM: Scaling Language Modeling with Pathways 81.80 2022-04-05 📦 lucidrains/CoCa-pytorch 📦 lucidrains/PaLM-pytorch 📦 google/paxml
9 GLaM 62B/64E (One-Shot) GLaM: Efficient Scaling of Language Models with Mixture-of-Experts 80.90 2021-12-13 -
10 PaLM 2-S (one-shot) PaLM 2 Technical Report 80.70 2023-05-17 📦 eternityyw/tram-benchmark

All Papers (34)

Language Models are Few-Shot Learners

2020
GPT-3 175B (Few-Shot)

Stay on topic with Classifier-Free Guidance

2023
LLaMA-65B+CFG (Zero-Shot)

Stay on topic with Classifier-Free Guidance

2023
LLaMA-30B+CFG (zero-shot)

Stay on topic with Classifier-Free Guidance

2023
LLaMA-13B+CFG (zero-shot)

Language Models are Few-Shot Learners

2020
GPT-3 175B (Zero-Shot)

Language Models are Few-Shot Learners

2020
GPT-3 13B (Zero-Shot)

Language Models are Few-Shot Learners

2020
GPT-3 6.7B (Zero-Shot)

Language Models are Few-Shot Learners

2020
GPT-3 2.7B (Zero-Shot)

Broad Context Language Modeling as Reading Comprehension

2016
Gated-Attention Reader (+ features)