This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
Language Modelling | MDLM-Prime | Beyond Masked and Unmasked: Discrete … | 2025-05-24 |
Language Modelling | BD3-LMs | Block Diffusion: Interpolating Between Autoregressive … | 2025-03-12 |
Text Generation | GPT2-Hermite | Polynomial, trigonometric, and tropical activations | 2025-02-03 |
Language Modelling | GPT2-Hermite | Polynomial, trigonometric, and tropical activations | 2025-02-03 |
Language Modelling | GPT2-Tropical | Polynomial, trigonometric, and tropical activations | 2025-02-03 |
Language Modelling | GPT2-Fourier | Polynomial, trigonometric, and tropical activations | 2025-02-03 |
Language Modelling | GPT2-GELU | Polynomial, trigonometric, and tropical activations | 2025-02-03 |
Language Modelling | EDLM-NCE | Energy-Based Diffusion Language Models for … | 2024-10-28 |
Language Modelling | EDLM-coAR | Energy-Based Diffusion Language Models for … | 2024-10-28 |
Text Generation | GPT2-81M-LOOP | Loop Neural Networks for Parameter … | 2024-09-21 |
Language Modelling | ARM | Simple and Effective Masked Diffusion … | 2024-06-11 |
Language Modelling | MDLM | Simple and Effective Masked Diffusion … | 2024-06-11 |
Language Modelling | GenMD4 | Simplified and Generalized Masked Diffusion … | 2024-06-06 |
Language Modelling | SEDD | Discrete Diffusion Modeling by Estimating … | 2023-10-25 |
Recent papers with results on this dataset: