This is the roadmap for WeNet. WeNet is a community-driven project, and we love your feedback and proposals on where we should be heading.
Please open an issue or discussion on GitHub to write your proposal. Feel free to volunteer if you are interested in trying out some items (they do not have to be on the list).
- ONNX support
- RNN-T support
- Vosk-like models and API for developers
  - Models (Chinese/English/Japanese/Korean/French/German/Spanish/Portuguese)
  - API (Python/C/C++/Go/Java)
- Self-training, streaming
- Lightweight, low-latency, on-device model exploration
- Audio-Visual speech recognition
- Platforms
  - Raspberry Pi
  - Harmony OS
  - ASIC XPU
  - Horizon Journey
  - GraphCore
  - TO ADD
- U2++ framework for better accuracy
- n-gram + WFST language model solution
- Context biasing (hotword) solution
- Very large-scale data training support with UIO
- More dataset support, including WenetSpeech, GigaSpeech, HKUST and so on.
- Streaming solution (U2 framework)
- Production runtime solution with TorchScript training and LibTorch inference.
- Unified streaming and non-streaming model (U2 framework)