AISHELL-2

Dataset Information
Modalities
Speech
Languages
Mandarin Chinese
License
Homepage

Overview

AISHELL-2 contains 1000 hours of clean read-speech data from iOS is free for academic usage.

Source: AISHELL-2: Transforming Mandarin ASR Research Into Industrial Scale
Image Source: https://arxiv.org/pdf/1808.10583v2.pdf

Variants: AISHELL-2, AISHELL-2 Test IOS, AISHELL-2 Test Android, AISHELL-2 Test Mic

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Speech Recognition Paraformer-large FunASR: A Fundamental End-to-End Speech … 2023-05-18
Speech Recognition Paraformer FunASR: A Fundamental End-to-End Speech … 2023-05-18

Research Papers

Recent papers with results on this dataset: