Voice Breaks and Latency in Text-to-Speech Conversion #108
Comments
Hey @Devloper-RG,
@eustlb
I have never tried this setup. There are two possibilities for choppy audio:
What you can do is switch from Parler-TTS to MeloTTS by setting the
Also, can you give me the command you're running?
Also, beware that I don't think the code uses multiple GPUs yet, so 2 T4s is the same as 1.
@eustlb
I'm experiencing issues with breaks in the generated voice output, seemingly caused by latency in the text-to-speech (TTS) conversion process. The audio output has occasional breaks, which disrupt the flow of speech.
Steps I've tried:
Decreasing block size: This helped reduce some latency in delivering TTS audio output, but the issue persists.
Adjusting play_steps_s: I've decreased this parameter to minimize latency. However, setting play_steps_s below 0.5 causes errors, so I’ve kept it at 0.5 for now.
Any suggestions on how to further reduce the latency and improve the smoothness of the audio output would be greatly appreciated.
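One way to reason about the breaks described above: if each audio chunk takes longer to generate than it takes to play, playback runs dry between chunks no matter how small the chunks are. The following is only a minimal, hypothetical simulation of that effect, not code from this project. The function `count_underruns` and its parameters are made up for illustration, and `chunk_s` is assumed to correspond loosely to `play_steps_s`.

```python
def count_underruns(gen_times, chunk_s, prebuffer_chunks=1):
    """Count playback gaps for sequentially generated audio chunks.

    gen_times: seconds needed to generate each chunk (hypothetical measurements).
    chunk_s: seconds of audio in each chunk (assumed analogous to play_steps_s).
    prebuffer_chunks: chunks that must be ready before playback starts.
    """
    # Time at which each chunk becomes available; generation is sequential.
    ready = []
    t = 0.0
    for g in gen_times:
        t += g
        ready.append(t)
    if not ready:
        return 0

    # Playback starts once the requested number of chunks is ready.
    start_idx = min(max(prebuffer_chunks, 1), len(ready)) - 1
    play_clock = ready[start_idx]

    underruns = 0
    for r in ready:
        if r > play_clock:
            # The player finished the previous chunk before this one was ready.
            underruns += 1
            play_clock = r
        play_clock += chunk_s
    return underruns


# Generation slower than real time: a gap before every chunk after the first.
slow = count_underruns([0.6] * 5, chunk_s=0.5)
# Generation faster than real time: no gaps.
fast = count_underruns([0.4] * 5, chunk_s=0.5)
# Same slow generation, but playback waits until all chunks are buffered.
buffered = count_underruns([0.6] * 5, chunk_s=0.5, prebuffer_chunks=5)
print(slow, fast, buffered)
```

In this toy model, shrinking the chunk size does not remove the gaps when generation is slower than real time. Only faster generation or a larger pre-buffer does, and the pre-buffer comes at the cost of a longer initial delay.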