ML Research Wiki / Benchmarks / Speech Recognition / AISHELL-1

AISHELL-1

Speech Recognition Benchmark

Performance Over Time

📊 Showing 18 results | 📏 Metric: Word Error Rate (WER)

Top Performing Models

Rank Model Paper Word Error Rate (WER) Date Code
1 Att End-to-end Speech Recognition with Adaptive Computation Steps 18.70 2018-08-30 -
2 CTC/Att A Comparative Study on Transformer vs RNN in Speech Applications 6.70 2019-09-13 📦 espnet/espnet 📦 MindSpore-scientific-2/code-11
3 BRA-E Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition 6.63 2023-03-23 -
4 CTC-CRF 4gram-LM CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency 6.34 2020-05-27 📦 thu-spmi/cat
5 BAT BAT: Boundary aware transducer for memory-efficient and low-latency ASR 4.97 2023-05-19 📦 alibaba-damo-academy/FunASR
6 Paraformer FunASR: A Fundamental End-to-End Speech Recognition Toolkit 4.95 2023-05-18 📦 alibaba-damo-academy/FunASR
7 U2 Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition 4.72 2020-12-10 📦 PaddlePaddle/PaddleSpeech 📦 TeaPoly/Conformer-Athena 📦 xianchao-wu/wenet-deep-sparse-conformer 📦 joseewei/wenet 📦 Vill-Lab/2023-TMM-Grad-SAS
8 UMA Unimodal Aggregation for CTC-based Speech Recognition 4.70 2023-09-15 📦 Audio-WestlakeU/UMA-ASR
9 Lightweight Transducer Lightweight Transducer Based on Frame-Level Criterion 4.31 2024-09-05 📦 wangmengzhi/Lightweight-Transducer
10 SE-WSBO With LM Improving Mandarin Speech Recogntion with Block-augmented Transformer 4.10 2022-07-24 📦 LeonWlw/asr_blockformer 📦 mininglamp-technology/asr-blockformer

All Papers (18)