Fourier Transformer
|
Fourier Transformer: Fast Long Range Modeling by …
|
26.90
|
2023-05-24
|
|
QG
|
Closed-book Question Generation via Contrastive L…
|
26.40
|
2022-10-13
|
|
BART
|
BART: Denoising Sequence-to-Sequence Pre-training…
|
24.30
|
2019-10-29
|
|
E-MCA
|
Using Local Knowledge Graph Construction to Scale…
|
24.00
|
2019-10-18
|
|
Transformer Multitask + LayerDrop
|
Reducing Transformer Depth on Demand with Structu…
|
23.40
|
2019-09-25
|
|
Multi-Inrerleave
|
Improving Conditioning in Context-Aware Sequence …
|
14.63
|
2019-11-21
|
|