VoxCeleb1

Name: VoxCeleb1
Published: 2017-06-26
License: Unknown

Dataset Information

Modalities

Audio

Introduced

2017

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

VoxCeleb1 is an audio dataset containing over 100,000 utterances for 1,251 celebrities, extracted from videos uploaded to YouTube.

Variants: VoxCeleb, VoxCeleb1, VoxCeleb1 - 1-shot learning, VoxCeleb1 - 8-shot learning, VoxCeleb1 - 32-shot learning

Associated Benchmarks

This dataset is used in 3 benchmarks:

Speaker Identification - Metrics: Top-1 (%), Top-5 (%), Number of Params, Accuracy
Speaker Recognition - Metrics: EER
Speaker Verification - Metrics: EER

Recent Benchmark Submissions

Task	Model	Paper	Date
Speaker Verification	ReDimNet-B5-SF2-LM (9.2M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B3-LM-ASNorm (3.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B3-LM (3.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B0-LM (1.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B0-LM-ASNorm (1.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B6-SF2-LM (15.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B5-SF2-LM-ASNorm (9.2M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B6-SF2-LM-ASNorm (15.0M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B1-LM (2.2M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B4-LM-ASNorm (6.3M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B1-LM-ASNorm (2.2M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B2-SF2-LM (4.7M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B2-SF2-LM-ASNorm (4.7M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Verification	ReDimNet-B4-LM (6.3M)	Reshape Dimensions Network for Speaker …	2024-07-25
Speaker Identification	SSAMBA	SSAMBA: Self-Supervised Audio Representation Learning …	2024-05-20
Speaker Identification	MSM-MAE	Masked Modeling Duo: Towards a …	2024-04-09
Speaker Identification	M2D/0.6	Masked Modeling Duo: Towards a …	2024-04-09
Speaker Identification	M2D/0.7	Masked Modeling Duo: Towards a …	2024-04-09
Speaker Recognition	WavLM+ECAPA-TDNN	ESPnet-SPK: full pipeline speaker embedding …	2024-01-30
Speaker Identification	M2D ratio=0.6	Masked Modeling Duo: Learning Representations …	2022-10-26

Research Papers

Recent papers with results on this dataset:

External Links:

VoxCeleb1

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview