NaturalSpeech
|
NaturalSpeech: End-to-End Text to Speech Synthesi…
|
4.56
|
2022-05-09
|
|
VITS
|
NaturalSpeech: End-to-End Text to Speech Synthesi…
|
4.43
|
2022-05-09
|
|
Grad-TTS + HiFiGAN (1000 steps)
|
Grad-TTS: A Diffusion Probabilistic Model for Tex…
|
4.37
|
2021-05-13
|
|
Glow-TTS + HiFiGAN
|
Glow-TTS: A Generative Flow for Text-to-Speech vi…
|
4.34
|
2020-05-22
|
|
FastSpeech 2 + HiFiGAN
|
NaturalSpeech: End-to-End Text to Speech Synthesi…
|
4.34
|
2022-05-09
|
|
FastSpeech 2 + HiFiGAN
|
FastSpeech 2: Fast and High-Quality End-to-End Te…
|
4.32
|
2020-06-08
|
|
FastDiff (4 steps)
|
FastDiff: A Fast Conditional Diffusion Model for …
|
4.28
|
2022-04-21
|
|
FastDiff-TTS
|
FastDiff: A Fast Conditional Diffusion Model for …
|
4.03
|
2022-04-21
|
|
Transformer TTS (Mel + WaveGlow)
|
Neural Speech Synthesis with Transformer Network
|
3.88
|
2018-09-19
|
|
FastSpeech (Mel + WaveGlow)
|
FastSpeech: Fast, Robust and Controllable Text to…
|
3.84
|
2019-05-22
|
|
Matcha-TTS
|
Matcha-TTS: A fast TTS architecture with conditio…
|
3.84
|
2023-09-06
|
|
Flowtron
|
Flowtron: an Autoregressive Flow-based Generative…
|
3.67
|
2020-05-12
|
|
Tacotron 2
|
Flowtron: an Autoregressive Flow-based Generative…
|
3.52
|
2020-05-12
|
|
OverFlow
|
OverFlow: Putting flows on top of neural transduc…
|
3.37
|
2022-11-13
|
|
Merlin
|
FastSpeech: Fast, Robust and Controllable Text to…
|
2.40
|
2019-05-22
|
|