[GSoC] Dynamic Text-to-Speech Playback #926

danylo-boiko · 2025-01-27T20:35:20Z

Feature description

In the current implementation, playback begins only after the entire text has been processed by the Google Text-to-Speech API, which cause inconvenient delays for long texts. Journey Voices provides real-time streaming with low latency for some common languages (search for "Journey" on the voices page). The goal of the project is to implement a flexible approach that supports both streaming and waiting for a complete response, depending on the language.

Expected outcomes

Users can listen to synthesized text in two modes, depending on language support.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GSoC] Dynamic Text-to-Speech Playback #926

[GSoC] Dynamic Text-to-Speech Playback #926

danylo-boiko commented Jan 27, 2025 •

edited

Loading

[GSoC] Dynamic Text-to-Speech Playback #926

[GSoC] Dynamic Text-to-Speech Playback #926

Comments

danylo-boiko commented Jan 27, 2025 • edited Loading

Feature description

Expected outcomes

danylo-boiko commented Jan 27, 2025 •

edited

Loading