Important:
This requires a lot of configuration if run directly. Recommended way is to use UI Application: https://github.com/Sharrnah/whispering-ui which downloads this automatically.
Standalone Release File (3.1 GB):
Download Server:
Changelog (v1.3.15.3)
- [FEATURE] Add
get_last_generation
methods for TTS - [TASK] Add Greek F5 TTS Model
- [TASK] Add silence after segments of F5-TTS generation
- [TASK] F5 processing estimate for multi-segments
- [TASK] Update libraries + fix for nltk
- [TASK] Add pyctcdecode library
- [TASK] Add normalization to F5 TTS
- [TASK] unified tts event call
- [BUGFIX] Channel error on MME Audio API with Silero
- [BUGFIX] websocket disconnect on receiving generated TTS raw audio
- [BUGFIX] TTS Model download not starting on model change
Full Changelog: v1.3.15.2...v1.3.15.4