SEDD
|
Discrete Diffusion Modeling by Estimating the Rat…
|
24.10
|
2023-10-25
|
|
MDLM
|
Simple and Effective Masked Diffusion Language Mo…
|
22.98
|
2024-06-11
|
|
GenMD4
|
Simplified and Generalized Masked Diffusion for D…
|
21.80
|
2024-06-06
|
|
EDLM-NCE
|
Energy-Based Diffusion Language Models for Text G…
|
21.52
|
2024-10-28
|
|
BD3-LMs
|
Block Diffusion: Interpolating Between Autoregres…
|
20.73
|
2025-03-12
|
|
EDLM-coAR
|
Energy-Based Diffusion Language Models for Text G…
|
17.58
|
2024-10-28
|
|
ARM
|
Simple and Effective Masked Diffusion Language Mo…
|
17.54
|
2024-06-11
|
|
MDLM-Prime
|
Beyond Masked and Unmasked: Discrete Diffusion Mo…
|
15.36
|
2025-05-24
|
|
GPT2-GELU
|
Polynomial, trigonometric, and tropical activatio…
|
2.95
|
2025-02-03
|
|
GPT2-Fourier
|
Polynomial, trigonometric, and tropical activatio…
|
2.93
|
2025-02-03
|
|
GPT2-Tropical
|
Polynomial, trigonometric, and tropical activatio…
|
2.92
|
2025-02-03
|
|
GPT2-Hermite
|
Polynomial, trigonometric, and tropical activatio…
|
2.91
|
2025-02-03
|
|