RHN - depth 5 [zilly2016recurrent]
|
Recurrent Highway Networks
|
1.31
|
2016-07-12
|
|
FS-LSTM-4
|
Fast-Slow Recurrent Neural Networks
|
1.28
|
2017-05-24
|
|
Large RHN
|
Recurrent Highway Networks
|
1.27
|
2016-07-12
|
|
Large FS-LSTM-4
|
Fast-Slow Recurrent Neural Networks
|
1.25
|
2017-05-24
|
|
Large mLSTM +emb +WN +VD
|
Multiplicative LSTM for sequence modelling
|
1.24
|
2016-09-26
|
|
3-layer AWD-LSTM
|
An Analysis of Neural Language Modeling at Multip…
|
1.23
|
2018-03-22
|
|
Mogrifier LSTM
|
Mogrifier LSTM
|
1.12
|
2019-09-04
|
|
12-layer Character Transformer Model
|
Character-Level Language Modeling with Deeper Sel…
|
1.11
|
2018-08-09
|
|
mLSTM + dynamic eval
|
Dynamic Evaluation of Neural Sequence Models
|
1.08
|
2017-09-21
|
|
64-layer Character Transformer Model
|
Character-Level Language Modeling with Deeper Sel…
|
1.06
|
2018-08-09
|
|
12-layer Transformer-XL
|
Transformer-XL: Attentive Language Models Beyond …
|
1.06
|
2019-01-09
|
|
18-layer Transformer-XL
|
Transformer-XL: Attentive Language Models Beyond …
|
1.03
|
2019-01-09
|
|
Longformer Small
|
Longformer: The Long-Document Transformer
|
1.00
|
2020-04-10
|
|
24-layer Transformer-XL
|
Transformer-XL: Attentive Language Models Beyond …
|
0.99
|
2019-01-09
|
|
Longformer Large
|
Longformer: The Long-Document Transformer
|
0.99
|
2020-04-10
|
|
Mogrifier LSTM + dynamic eval
|
Mogrifier LSTM
|
0.99
|
2019-09-04
|
|
Compressive Transformer
|
Compressive Transformers for Long-Range Sequence …
|
0.97
|
2019-11-13
|
|
Transformer-XL + RMS dynamic eval
|
Dynamic Evaluation of Transformer Language Models
|
0.94
|
2019-04-17
|
|