Contains densely labeled speech activity in YouTube videos, with the goal of creating a shared, available dataset for this task.
Source: AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies
Variants: AVA-Speech
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Activity Detection | SG-VAD (ours) | SG-VAD: Stochastic Gates Based Speech … | 2022-10-28 |
Activity Detection | CNN-BiLSTM_best | A Hybrid CNN-BiLSTM Voice Activity … | 2021-03-05 |
Activity Detection | CNN-BiLSTM_small | A Hybrid CNN-BiLSTM Voice Activity … | 2021-03-05 |
Recent papers with results on this dataset: