LibriCSS

Dataset Information
Modalities
Speech
License
Unknown
Homepage

Overview

Continuous speech separation (CSS) is an approach to handling overlapped speech in conversational audio signals. A real recorded dataset, called LibriCSS, is derived from LibriSpeech by concatenating the corpus utterances to simulate a conversation and capturing the audio replays with far-field microphones.

Source: https://github.com/chenzhuo1011/libri_css

Variants: LibriCSS

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Speech Recognition TS-SEP TS-SEP: Joint Diarization and Separation … 2023-03-07
Speech Recognition GSS + Transducer GPU-accelerated Guided Source Separation for … 2022-12-10
Speech Separation Conformer (large) Continuous Speech Separation with Conformer 2020-08-13
Speech Separation Conformer (base) Continuous Speech Separation with Conformer 2020-08-13

Research Papers

Recent papers with results on this dataset: