Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GSoC] Dynamic Text-to-Speech Playback #926

Open
danylo-boiko opened this issue Jan 27, 2025 · 0 comments
Open

[GSoC] Dynamic Text-to-Speech Playback #926

danylo-boiko opened this issue Jan 27, 2025 · 0 comments

Comments

@danylo-boiko
Copy link
Contributor

danylo-boiko commented Jan 27, 2025

Feature description

In the current implementation, playback begins only after the entire text has been processed by the Google Text-to-Speech API, which cause inconvenient delays for long texts. Journey Voices provides real-time streaming with low latency for some common languages (search for "Journey" on the voices page). The goal of the project is to implement a flexible approach that supports both streaming and waiting for a complete response, depending on the language.

Expected outcomes

Users can listen to synthesized text in two modes, depending on language support.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant