Hutter Prize

Language Modelling Benchmark

Performance Over Time

Showing 18 results | Metric: Bits per Character (BPC)
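
BPC is conventionally the average negative base-2 log-probability that a model assigns to each character of the test text, so lower values are better. The sketch below shows that calculation under the assumption that the per-character probabilities are already available. The function names are illustrative and are not taken from any of the repositories listed on this page.

```python
import math


def bits_per_character(probs):
    """Average negative log2 probability over the observed characters.

    `probs` holds, for each character in the text, the probability the
    model assigned to the character that actually occurred.
    """
    if not probs:
        raise ValueError("need at least one probability")
    return -sum(math.log2(p) for p in probs) / len(probs)


def nats_to_bpc(mean_nll_nats):
    """Convert a mean per-character cross-entropy in nats to bits."""
    return mean_nll_nats / math.log(2)
```

Many training frameworks report cross-entropy in nats, so the second helper covers the common case of converting a logged loss into the unit used in the table.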

Top Performing Models

| Rank | Model | Paper | Bits per Character (BPC) | Date | Code |
|---|---|---|---|---|---|
| 1 | RHN - depth 5 [zilly2016recurrent] | Recurrent Highway Networks | 1.31 | 2016-07-12 | labmlai/annotated_deep_learning_paper_implementations, julian121266/RecurrentHighwayNetworks, jzilly/RecurrentHighwayNetworks |
| 2 | FS-LSTM-4 | Fast-Slow Recurrent Neural Networks | 1.28 | 2017-05-24 | amujika/Fast-Slow-LSTM |
| 3 | Large RHN | Recurrent Highway Networks | 1.27 | 2016-07-12 | labmlai/annotated_deep_learning_paper_implementations, julian121266/RecurrentHighwayNetworks, jzilly/RecurrentHighwayNetworks |
| 4 | Large FS-LSTM-4 | Fast-Slow Recurrent Neural Networks | 1.25 | 2017-05-24 | amujika/Fast-Slow-LSTM |
| 5 | Large mLSTM +emb +WN +VD | Multiplicative LSTM for sequence modelling | 1.24 | 2016-09-26 | astakara48/python_project |
| 6 | 3-layer AWD-LSTM | An Analysis of Neural Language Modeling at Multiple Scales | 1.23 | 2018-03-22 | salesforce/awd-lstm-lm, Han-JD/GRU-D, jb33k/awd-lstm-lm-ThinkNet |
| 7 | Mogrifier LSTM | Mogrifier LSTM | 1.12 | 2019-09-04 | deepmind/lamb, RMichaelSwan/MogrifierLSTM, microcoder-py/mogrifier-lstm |
| 8 | 12-layer Character Transformer Model | Character-Level Language Modeling with Deeper Self-Attention | 1.11 | 2018-08-09 | facebookresearch/code-prediction-transformer |
| 9 | mLSTM + dynamic eval | Dynamic Evaluation of Neural Sequence Models | 1.08 | 2017-09-21 | benkrause/dynamic-evaluation, benkrause/dynamiceval-transformer, sacmehta/PRU |
| 10 | 64-layer Character Transformer Model | Character-Level Language Modeling with Deeper Self-Attention | 1.06 | 2018-08-09 | facebookresearch/code-prediction-transformer |

All Papers (18)