📊 Showing 2 results | 📏 Metric: eval_loss
Rank | Model | Paper | eval_loss | Date | Code |
---|---|---|---|---|---|
1 | GPT2-81M-LOOP | Loop Neural Networks for Parameter Sharing | 3.11 | 2024-09-21 | - |
2 | GPT2-Hermite | Polynomial, trigonometric, and tropical activations | 2.91 | 2025-02-03 | 📦 K-H-Ismail/torchortho |