- Shanghai (UTC +08:00)
- https://zhikangniu.github.io/
TTS
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
A multi-voice TTS system trained with an emphasis on quality
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Automatically updates text-to-speech (TTS) papers daily using GitHub Actions (updated every 12 hours)
Refactored / updated version of `stable-audio-tools`, the open-source code for audio/music generative models originally by Stability AI.
Open Source Text-To-Speech Portuguese Dataset
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Fine-tuning the UnifiedVoice autoregressor for TortoiseTTS.
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation with Spoken Language Models" (arXiv 2024).
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …