LibriCSS

Name: LibriCSS
License: Unknown

Dataset Information

Modalities

Speech

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

Continuous speech separation (CSS) is an approach to handling overlapped speech in conversational audio signals. LibriCSS is a real-recorded dataset derived from LibriSpeech: utterances from the corpus are concatenated to simulate a conversation, the result is played back, and the playback is captured with far-field microphones.

Source: https://github.com/chenzhuo1011/libri_css

Variants: LibriCSS
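
To make the idea of concatenating utterances into a partially overlapped conversation concrete, here is a minimal sketch in plain Python. It is not the LibriCSS tooling: the function name `mix_with_overlap` is hypothetical, the utterances are plain lists of samples, and the overlap ratio is assumed to mean the overlapped duration divided by the total duration of the mixture. The real dataset was produced by replaying audio and re-recording it, which this digital mix does not model.

```python
def mix_with_overlap(first, second, overlap_ratio):
    """Join two utterances so that roughly `overlap_ratio` of the result is overlapped.

    Assumption: overlap_ratio = overlapped samples / total samples in the mixture.
    Returns the mixed samples and the number of overlapped samples.
    """
    if not 0.0 <= overlap_ratio < 1.0:
        raise ValueError("overlap_ratio must be in [0, 1)")

    # With lengths a and b and an overlap of o samples, the mixture has
    # a + b - o samples, so ratio = o / (a + b - o). Solve for o.
    total = len(first) + len(second)
    overlap = int(round(overlap_ratio * total / (1.0 + overlap_ratio)))
    # The overlap cannot be longer than either utterance.
    overlap = min(overlap, len(first), len(second))

    # The second utterance starts `overlap` samples before the first one ends.
    start = len(first) - overlap
    mixed = list(first) + [0.0] * (len(second) - overlap)
    for k, sample in enumerate(second):
        mixed[start + k] += sample
    return mixed, overlap


# Two toy "utterances" of six samples each, mixed at a 20% overlap ratio.
mixed, overlap = mix_with_overlap([1.0] * 6, [1.0] * 6, 0.2)
```

With an overlap ratio of 0 the function reduces to plain concatenation, which corresponds to the non-overlapped case.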

Associated Benchmarks

This dataset is used in 2 benchmarks:

Speech Recognition - Metrics: Word Error Rate (WER)
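Speech Separation - Metrics: results reported per overlap condition: 0S and 0L (no overlap, with short and long inter-utterance silence), 10%, 20%, 30%, 40% (overlap ratio)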
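
As an illustration of the Word Error Rate metric named above, the following is a minimal sketch that computes WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the sketch assumes whitespace tokenization with no text normalization; benchmark scoring scripts typically normalize text first and may differ in detail.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.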

Recent Benchmark Submissions

Task	Model	Paper	Date
Speech Recognition	TS-SEP	TS-SEP: Joint Diarization and Separation …	2023-03-07
Speech Recognition	GSS + Transducer	GPU-accelerated Guided Source Separation for …	2022-12-10
Speech Separation	Conformer (large)	Continuous Speech Separation with Conformer	2020-08-13
Speech Separation	Conformer (base)	Continuous Speech Separation with Conformer	2020-08-13

Research Papers

Recent papers with results on this dataset:
