ML Research Wiki / Benchmarks / Speech Separation / WHAMR!

WHAMR!

Speech Separation Benchmark

Performance Over Time

📊 Showing 17 results | 📏 Metric: SI-SDRi

Top Performing Models

Rank Model Paper SI-SDRi Date Code
1 TF-Locoformer (M) TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement 18.50 2024-08-06 📦 merlresearch/tf-locoformer
2 TF-Locoformer (S) TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement 17.40 2024-08-06 📦 merlresearch/tf-locoformer
3 SepReformer-L + DM Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation 17.10 2024-06-10 📦 dmlguq456/SepReformer
4 MossFormer (L) + DM MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions 16.30 2023-02-23 📦 modelscope/ClearerVoice-Studio 📦 alibabasglab/mossformer
5 TD-Conformer (XL) + DM On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments 14.60 2023-10-09 📦 jwr1995/pubsep
6 Improved Sudo rm -rf (U=36) Compute and memory efficient universal sound source separation 13.50 2021-03-03 📦 etzinis/sudo_rm_rf 📦 etzinis/unsup_speech_enh_adaptation 📦 udase-chime2023/baseline
7 TD-Conformer (L) + DM On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments 13.40 2023-10-09 📦 jwr1995/pubsep
8 Wavesplit Wavesplit: End-to-End Speech Separation by Speaker Clustering 13.20 2020-02-20 -
9 DPTNET - SRSSN Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain 12.30 2021-10-10 -
10 DPRNN - SRSSN Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain 12.30 2021-10-10 -

All Papers (17)