ESC-50

Dataset Information
Modalities
Audio
Introduced
2015
License
Homepage

Overview

The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification. It comprises 2000 5s-clips of 50 different classes across natural, human and domestic sounds, again, drawn from Freesound.org.

Source: The NIGENS General Sound Events Database
Image Source: https://github.com/karolpiczak/ESC-50

Variants: ESC-50

Associated Benchmarks

This dataset is used in 3 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Audio Classification M2D2 AS+ M2D2: Exploring General-purpose Audio-Language Representations … 2025-03-28
Audio Classification MATPAC (SSL model, linear eval) Masked Latent Prediction and Classification … 2025-02-17
Audio Classification LHGNN LHGNN: Local-Higher Order Graph Neural … 2025-01-07
Image Classification SDGM-D Performance of Gaussian Mixture Model … 2024-10-17
Audio Classification M2D-CLAP/0.7 M2D-CLAP: Masked Modeling Duo Meets … 2024-06-04
Audio Classification M2D-AS/0.7 Masked Modeling Duo: Towards a … 2024-04-09
Audio Classification M2D/0.7 Masked Modeling Duo: Towards a … 2024-04-09
Audio Classification InternVideo2 InternVideo2: Scaling Foundation Models for … 2024-03-22
Audio Classification EAT EAT: Self-Supervised Pre-Training with Efficient … 2024-01-07
Audio Classification OmniVec OmniVec: Learning robust representations with … 2023-11-07
Audio Classification DyMN-L Dynamic Convolutional Neural Networks as … 2023-10-24
Audio Classification BEATs BEATs: Audio Pre-Training with Acoustic … 2022-12-18
Audio Classification mn40_as Efficient Large-scale Audio Tagging via … 2022-11-09
Audio Classification SepTr + LeRaC Learning Rate Curriculum 2022-05-18
Audio Classification EAT-S End-to-End Audio Strikes Back: Boosting … 2022-04-25
Audio Classification EAT-M End-to-End Audio Strikes Back: Boosting … 2022-04-25
Audio Classification EAT-S (scratch) End-to-End Audio Strikes Back: Boosting … 2022-04-25
Audio Classification SepTr SepTr: Separable Transformer for Audio … 2022-03-17
Audio Classification HTS-AT HTS-AT: A Hierarchical Token-Semantic Audio … 2022-02-02
Environmental Sound Classification AudioCLIP AudioCLIP: Extending CLIP to Image, … 2021-06-24

Research Papers

Recent papers with results on this dataset: