v1.3.15.4

Latest

Latest

Sharrnah released this 02 Feb 00:21

· 6 commits to main since this release

7e2b5b8

Important:

This requires a lot of configuration if run directly. Recommended way is to use UI Application: https://github.com/Sharrnah/whispering-ui which downloads this automatically.

Standalone Release File (3.1 GB):

Download Server:

Changelog (v1.3.15.3)

[FEATURE] Add get_last_generation methods for TTS
[TASK] Add Greek F5 TTS Model
[TASK] Add silence after segments of F5-TTS generation
[TASK] F5 processing estimate for multi-segments
[TASK] Update libraries + fix for nltk
[TASK] Add pyctcdecode library
[TASK] Add normalization to F5 TTS
[TASK] unified tts event call
[BUGFIX] Channel error on MME Audio API with Silero
[BUGFIX] websocket disconnect on receiving generated TTS raw audio
[BUGFIX] TTS Model download not starting on model change

Full Changelog: v1.3.15.2...v1.3.15.4

Assets 2