TUDA

Name: TUDA
Published: 2015-12-11
License: CC-BY-4.0

Dataset Information

Modalities

Audio, Speech

Languages

German

Introduced

2015

License

CC-BY-4.0

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

Overall duration per microphone: about 36 hours (31 hrs train / 2.5 hrs dev / 2.5 hrs test)
Count of microphones: 3 (Microsoft Kinect, Yamaha, Samson)
Count of wave-files per microphone: about 14500
Overall count of participations: 180 (130 male / 50 female)

Variants: TUDA

Associated Benchmarks

This dataset is used in 1 benchmark:

Speech Recognition - Metrics: Test WER

Recent Benchmark Submissions

Task	Model	Paper	Date
Speech Recognition	QuartzNet15x5DE (D37)	Scribosermo: Fast Speech-to-Text models for …	2021-10-15
Speech Recognition	IMS-Speech	IMS-Speech: A Speech to Text …	2019-08-13
Speech Recognition	Kaldi	Open Source Automatic Speech Recognition …	2018-07-26

Research Papers

Recent papers with results on this dataset:

Scribosermo: Fast Speech-to-Text models for German and other Languages (2021) -
IMS-Speech: A Speech to Text Tool (2019) -
Open Source Automatic Speech Recognition for German (2018) -

External Links:

TUDA

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview