A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
The Audio Signal and Information Processing Lab at Westlake University, in collaboration with AISHELL, has released the Real-recorded and annotated Microphone Array speech&Noise (RealMAN) dataset, which provides annotated multi-channel speech and noise recordings for dynamic speech enhancement and localization.
Variants: RealMAN
This dataset is used in 2 benchmarks:
| Task | Model | Paper | Date |
|---|---|---|---|
| Speech Enhancement | CleanMel-L-map | CleanMel: Mel-Spectrogram Enhancement for Improving … | 2025-02-27 |
| Automatic Speech Recognition (ASR) | CleanMel-L-mask | CleanMel: Mel-Spectrogram Enhancement for Improving … | 2025-02-27 |
Recent papers with results on this dataset: