Att
|
End-to-end Speech Recognition with Adaptive Compu…
|
18.70
|
2018-08-30
|
|
CTC/Att
|
A Comparative Study on Transformer vs RNN in Spee…
|
6.70
|
2019-09-13
|
|
BRA-E
|
Beyond Universal Transformer: block reusing with …
|
6.63
|
2023-03-23
|
|
CTC-CRF 4gram-LM
|
CAT: A CTC-CRF based ASR Toolkit Bridging the Hyb…
|
6.34
|
2020-05-27
|
|
BAT
|
BAT: Boundary aware transducer for memory-efficie…
|
4.97
|
2023-05-19
|
|
Paraformer
|
FunASR: A Fundamental End-to-End Speech Recogniti…
|
4.95
|
2023-05-18
|
|
U2
|
Unified Streaming and Non-streaming Two-pass End-…
|
4.72
|
2020-12-10
|
|
UMA
|
Unimodal Aggregation for CTC-based Speech Recogni…
|
4.70
|
2023-09-15
|
|
Lightweight Transducer
|
Lightweight Transducer Based on Frame-Level Crite…
|
4.31
|
2024-09-05
|
|
SE-WSBO With LM
|
Improving Mandarin Speech Recogntion with Block-a…
|
4.10
|
2022-07-24
|
|
CIF-HKD With LM
|
Knowledge Transfer from Pre-trained Language Mode…
|
4.10
|
2023-01-30
|
|
Lightweight Transducer With LM
|
Lightweight Transducer Based on Frame-Level Crite…
|
4.03
|
2024-09-05
|
|
Zipformer+CR-CTC (no external language model)
|
CR-CTC: Consistency regularization on CTC for imp…
|
4.02
|
2024-10-07
|
|
Paraformer-large
|
FunASR: A Fundamental End-to-End Speech Recogniti…
|
1.95
|
2023-05-18
|
|
MMSpeech With LM
|
MMSpeech: Multi-modal Multi-task Encoder-Decoder …
|
1.90
|
2022-11-29
|
|
Qwen-Audio
|
Qwen-Audio: Advancing Universal Audio Understandi…
|
1.29
|
2023-11-14
|
|
Seed-ASR
|
Seed-ASR: Understanding Diverse Speech and Contex…
|
0.68
|
2024-07-05
|
|
FireRedASR-AED
|
FireRedASR: Open-Source Industrial-Grade Mandarin…
|
0.55
|
2025-01-24
|
|