    Repositories list

    • Website and documentation
      HTML
      211812 Updated Dec 23, 2024
    • Automatic Speech Recognition in Unity using Vosk library
      C#
      166630 Updated Dec 21, 2024
    • vosk-tts

      Public
      Text To Speech Synthesis with Vosk
      Python
      Apache License 2.0
      18135170 Updated Dec 17, 2024
    • [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
      Jupyter Notebook
      MIT License
      99100 Updated Dec 12, 2024
    • Resources that make every language unique
      Apache License 2.0
      0600 Updated Nov 24, 2024
    • vosk-api

      Public
      Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
      Jupyter Notebook
      Apache License 2.0
      1.1k8.4k46934 Updated Nov 13, 2024
    • Russian speech technology links
      Apache License 2.0
      1523400 Updated Nov 8, 2024
    • icefall

      Public
      Python
      Apache License 2.0
      306000 Updated Oct 29, 2024
    • Dart
      Apache License 2.0
      4557140 Updated Oct 26, 2024
    • SDDPM

      Public
      [WACV 2024] Spiking Denoising Diffusion Probabilistic Models
      Python
      8000 Updated Oct 9, 2024
    • WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
      Python
      Apache License 2.0
      254955766 Updated Aug 31, 2024
    • kaldi

      Public
      An official git mirror of Kaldi project SVN repo
      Shell
      Other
      5.3k5102 Updated Aug 23, 2024
    • clapack

      Public
      CLAPACK clone for our builds
      C
      Other
      8210 Updated Aug 23, 2024
    • openfst

      Public
      Openfst mirror with some fixes
      C++
      Other
      131020 Updated Aug 23, 2024
    • Faster Whisper ASR transcription with CTranslate2
      Python
      MIT License
      1.1k000 Updated Aug 19, 2024
    • Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
      C++
      Apache License 2.0
      455500 Updated Aug 12, 2024
    • A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
      Apache License 2.0
      228200 Updated Aug 11, 2024
    • Speech Recognition in Asterisk with Vosk Server
      C
      GNU General Public License v2.0
      40107173 Updated Jun 21, 2024
    • RHVoice

      Public
      a free and open source speech synthesizer for Russian and other languages
      C++
      GNU General Public License v2.0
      233200 Updated May 28, 2024
    • Python
      Apache License 2.0
      0000 Updated Apr 24, 2024
    • TTS

      Public
      🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Python
      Mozilla Public License 2.0
      4.5k200 Updated Apr 8, 2024
    • ffmpeg

      Public
      C
      Other
      12k000 Updated Apr 1, 2024
    • 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
      Python
      MIT License
      4.2k100 Updated Mar 20, 2024
    • aiortc

      Public
      WebRTC and ORTC implementation for Python using asyncio
      Python
      BSD 3-Clause "New" or "Revised" License
      773000 Updated Dec 13, 2023
    • Ирина (Irina): a Russian voice assistant for offline use. Supports skills through plugins.
      Python
      Other
      122100 Updated Dec 5, 2023
    • aioice

      Public
      asyncio-based Interactive Connectivity Establishment (RFC 5245)
      Python
      BSD 3-Clause "New" or "Revised" License
      52000 Updated Nov 27, 2023
    • Offline speech recognition for Android with Vosk library.
      Java
      Apache License 2.0
      209760685 Updated Nov 24, 2023
    • Application of MB-iSTFT-VITS components to vits2_pytorch
      Python
      MIT License
      29400 Updated Oct 29, 2023
    • Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
      Python
      16000 Updated Oct 20, 2023
    • OpenAI Whisper Prompt Examples
      Apache License 2.0
      24800 Updated Jul 17, 2023