TIMIT Acoustic-Phonetic Continuous Speech Corpus
The TIMIT Acoustic-Phonetic Continuous Speech Corpus is a standard dataset used for evaluation of automatic speech recognition systems. It consists of recordings of 630 speakers of 8 dialects of American English each reading 10 phonetically-rich sentences. It also comes with the word and phone-level transcriptions of the speech.
Source: Improving neural networks by preventing co-adaptation of feature detectors
Image Source: https://roboticrun.wordpress.com/2016/06/21/timit-introduction-the-official-doc/
Variants: timit PER, TIMIT, TCD-TIMIT corpus (mixed-speech), DARPA TIMIT
This dataset is used in 1 benchmark:
Recent papers with results on this dataset: