VoxCeleb1

Dataset Information
Modalities: Audio
Introduced: 2017
License: Unknown

Overview

VoxCeleb1 is an audio dataset containing over 100,000 utterances from 1,251 celebrities, extracted from videos uploaded to YouTube.

Variants: VoxCeleb, VoxCeleb1, VoxCeleb1 - 1-shot learning, VoxCeleb1 - 8-shot learning, VoxCeleb1 - 32-shot learning

Associated Benchmarks

This dataset is used in 3 benchmarks.

Recent Benchmark Submissions

Task | Model | Paper | Date
--- | --- | --- | ---
Speaker Verification | ReDimNet-B5-SF2-LM (9.2M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B3-LM-ASNorm (3.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B3-LM (3.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B0-LM (1.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B0-LM-ASNorm (1.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B6-SF2-LM (15.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B5-SF2-LM-ASNorm (9.2M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B6-SF2-LM-ASNorm (15.0M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B1-LM (2.2M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B4-LM-ASNorm (6.3M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B1-LM-ASNorm (2.2M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B2-SF2-LM (4.7M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B2-SF2-LM-ASNorm (4.7M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Verification | ReDimNet-B4-LM (6.3M) | Reshape Dimensions Network for Speaker … | 2024-07-25
Speaker Identification | SSAMBA | SSAMBA: Self-Supervised Audio Representation Learning … | 2024-05-20
Speaker Identification | MSM-MAE | Masked Modeling Duo: Towards a … | 2024-04-09
Speaker Identification | M2D/0.6 | Masked Modeling Duo: Towards a … | 2024-04-09
Speaker Identification | M2D/0.7 | Masked Modeling Duo: Towards a … | 2024-04-09
Speaker Recognition | WavLM+ECAPA-TDNN | ESPnet-SPK: full pipeline speaker embedding … | 2024-01-30
Speaker Identification | M2D ratio=0.6 | Masked Modeling Duo: Learning Representations … | 2022-10-26

Research Papers

Recent papers with results on this dataset: