AVA-Speech

Dataset Information
License
Unknown
Homepage

Overview

Contains densely labeled speech activity in YouTube videos, with the goal of creating a shared, available dataset for this task.

Source: AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

Variants: AVA-Speech

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Activity Detection SG-VAD (ours) SG-VAD: Stochastic Gates Based Speech … 2022-10-28
Activity Detection CNN-BiLSTM_best A Hybrid CNN-BiLSTM Voice Activity … 2021-03-05
Activity Detection CNN-BiLSTM_small A Hybrid CNN-BiLSTM Voice Activity … 2021-03-05

Research Papers

Recent papers with results on this dataset: