Skip to content
View ZhikangNiu's full-sized avatar
🎯
focus
🎯
focus

Block or report ZhikangNiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

TTS

20 repositories

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

367 26 Updated Nov 5, 2021

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,385 707 Updated Feb 3, 2025

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 5,584 747 Updated Dec 24, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,707 1,898 Updated Nov 19, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,671 654 Updated Aug 13, 2024

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Python 465 110 Updated May 28, 2022

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 143 9 Updated Mar 6, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 365 23 Updated Feb 19, 2025

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 164 11 Updated Jul 25, 2024

Open Source Text-To-Speech Portuguese Dataset

159 17 Updated Feb 2, 2024

[AAAI 2024] Code for CTX-vec2wav in UniCATS

Python 128 16 Updated Jun 11, 2024

Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.

Python 15 Updated Nov 25, 2023

汉字转拼音(pypinyin)

Python 4,977 619 Updated Jan 3, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,761 1,053 Updated Feb 16, 2025
Python 67 8 Updated Sep 3, 2024

Local realtime voice AI

Python 2,224 121 Updated Feb 19, 2025

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 871 113 Updated Feb 17, 2025

LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation with Spoken Language Models" (arXiv 2024).

55 1 Updated Dec 28, 2024

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 4,897 456 Updated Feb 18, 2025