Overall duration per microphone: about 36 hours (31 hrs train / 2.5 hrs dev / 2.5 hrs test)
Count of microphones: 3 (Microsoft Kinect, Yamaha, Samson)
Count of wave-files per microphone: about 14500
Overall count of participants: 180 (130 male / 50 female)
Variants: TUDA
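The figures above imply a few derived quantities, such as the share of each split and the average length of a recording. The following is a minimal sketch of that arithmetic, using only the approximate numbers listed above. The function name `corpus_summary` and the dictionary keys are hypothetical. The combined total across microphones assumes that each microphone holds the full set of recordings, which the summary suggests but does not state outright.

```python
# Approximate figures copied from the dataset summary above.
HOURS_PER_MIC = {"train": 31.0, "dev": 2.5, "test": 2.5}
NUM_MICROPHONES = 3
WAVE_FILES_PER_MIC = 14_500  # "about 14500" in the summary


def corpus_summary(hours=HOURS_PER_MIC, mics=NUM_MICROPHONES, files=WAVE_FILES_PER_MIC):
    """Derive split fractions and average file length from the summary figures."""
    total_hours = sum(hours.values())
    return {
        "total_hours_per_mic": total_hours,
        "split_fraction": {name: h / total_hours for name, h in hours.items()},
        "mean_seconds_per_file": total_hours * 3600 / files,
        # Assumption: every microphone holds the full set of recordings.
        "total_hours_all_mics": total_hours * mics,
    }


if __name__ == "__main__":
    for key, value in corpus_summary().items():
        print(key, value)
```

With these inputs the training split is roughly 86% of the audio, and the average wave-file is a little under 9 seconds long. Both values are only as precise as the rounded figures they are derived from.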
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Speech Recognition | QuartzNet15x5DE (D37) | Scribosermo: Fast Speech-to-Text models for … | 2021-10-15 |
Speech Recognition | IMS-Speech | IMS-Speech: A Speech to Text … | 2019-08-13 |
Speech Recognition | Kaldi | Open Source Automatic Speech Recognition … | 2018-07-26 |
Recent papers with results on this dataset: