VoxForge

Dataset Information
Modalities
Texts, Audio, Speech
Languages
English, French, Spanish, German, Italian, Japanese, Russian
License
Unknown
Homepage

Overview

VoxForge is an open speech dataset that was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines (on Linux, Windows and Mac).
Image Source: http://www.voxforge.org/home

Variants: VoxForge American-Canadian, VoxForge Commonwealth, VoxForge European, VoxForge Indian, VoxForge

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Language Identification ConformerG-P BigSSL: Exploring the Frontier of … 2021-09-27
Keyword Spotting 1D-ConvNet Spoken Language Identification using ConvNets 2019-10-09
Keyword Spotting 2D-ConvNet Spoken Language Identification using ConvNets 2019-10-09

Research Papers

Recent papers with results on this dataset: