-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Want to suggest a wake word? Leave your thoughts here. (AIS-1441) #88
Comments
The Willow team and community would love "Hey Willow". It's our domain name because we've been waiting for this. Thank you very much for offering this option, it's very exciting! |
I'm glad you like this. Since "hey" and "hi" sound pretty similar, sometimes people might not really notice the difference. So, I was thinking, maybe we could support both "hey willow" and "hi willow" for waking up the device. That way, whether you say "hey willow" or "hi willow", it'll still work. Of course, when we release the wake word model, we'll call it like "wn9_heywillow". What do you think about that? |
Good idea! My only concern would be overall reduced accuracy (wake reliability vs false wake). We've noticed quite a bit of false wake with Alexa. From what I've read the automated TTS approach has 90-95% the accuracy of the models trained on human samples. I like "two word" wake words because they tend to improve accuracy, I suspect a 100% "Hey Willow" wake word could result in equivalent or even improved accuracy with the TTS approach vs even human sample trained Alexa? Of course we could always test this, even starting with a pure "Hey Willow" model, a pure "Hi Willow" model, and a merged model. Thanks again for offering this! |
Your concern may indeed happen. We will generate two words and test which model performs better. |
"hey/hi willow" model: Test dataset description: |
Guys, what you are doing is really great. We have created a smart speaker called Homai based on the esp32-s3. We trained the model ourselves, but it is resource-intensive and not so easy to integrate into the pipeline. Could you please add support for our word Homai [ho'mai]? Thank you in advance! |
Hi @AigizK , |
Hi @sun-xiangyu |
I'm sorry that our TTS model cannot specify a syllable to extend its pronunciation at the moment. This means that we cannot generate a large number of accurate “homa ai” phrases. |
Hi! Thank you for this awesome solution! We are developing a smart voice assistant called Sophia. Would it be possible to have the wake word "Hi Sophia"? This would help our user experience drastically. Thank you in advance! |
Hi @PrathamG , I'm glad you like it. "Sophia" sounds like a wake word that can be used directly. I mean, maybe we don't need an extra prefix "Hi". I suggest we start with just "Sophia". If the performance is not satisfactory, then we can train another one with "hi Sophia". What do you think? |
Sure, that sounds like a good plan! We can use only "Sophia" and test the performance first. Thank you |
If possible, I also wanted to request the wake word "Little Sophia". We are still unsure about which wake word to use, and having both options will help us determine this via user testing. |
Now our computing resources are limited. This project can generate about two wake word models in a month. So we will choose some popular wake words. Of course, if we have some free time, "Little Sophia" is also fine. |
No worries, totally understandable! Looking forward to testing out the "Sophia" wake word |
"Sophia" model: wn9_sophia_tts FAR(False Alarm Rate): 1 times / 8 hours |
“小美” or “小美同学” would be a perfect choice. It will suit a lot of use case. We all want wake word like a human name. |
@xygh, “小美同学” sounds good. |
Thank you! We will test it out and report the results by next week |
BTW, “你好小美” is also a perfect choice. |
"小当家" or "Hi 小星" is preferable wake word in our scenario. Thanks a lot! |
The second version "Sophia": Perfromace: Improvement: |
Both of these words sound good. If you have no preference, we will choose "hi 小星". |
"小美同学" FAR(False Alarm Rate): 1 times / 8 hours |
Hello! This is a great opportunity I was hoping would come up, I'm so glad this is now possible! I've seen that the wake-words "Mycroft" and "Hey, Mycroft" are very popular in the community, and it is also the name of my product so would very much improve user experience. Would it be possible to have either of these trained and released for the community? Thank you so much in advance for this! |
@lewardo, I'm glad it could help you. Although "Mycroft" is simpler, it seems there are quite a few words that sound similar, so I'll prioritize training with "Hey Mycroft." |
Yes, as @ayuusweetfish mentioned, you can find the wake word model you want in wakenet_model folder, then overwrite the model you were previously using, and it will be ready to use. |
@Spartan859 Perfromace: |
谢谢你的回复 |
明白了 谢谢 |
请问下您这个唤醒词支持adf里面替换吗,目前有用到adf的唤醒 |
当然可以,adf 也是用esp-sr进行唤醒 |
期待您帮助训练以下唤醒词。 |
“Hi,春风“ 也不错 |
哈哈,让我想起了《剑来》,不错,就是在冬天的时候喊,怪怪的。 |
您好,可否帮忙训练一个叫“小酥肉”的唤醒词?我正在用ESP32S3开发一个面向儿童、学生的语音助手(也支持成人使用),已经接近完成,问了下大家都非常喜欢和期待“小酥肉”这个名称,如果可以使用这个名称,会对提升产品效果有很大的帮助。非常感谢~~ :) |
Although we have implemented some optimizations, children's voices is still a challenge to our current TTS wake word model. |
哦,补充下,不是那种很小的小孩子。一般是小学五六年级和初高中学生,说话连贯性和准确度都类似成人了,我觉得可以用成人的数据。另外,目前调研了一下,也就是用esp-sr的方案最好,用其他方案都会有一些受限于算力和能耗方面的问题。如果可以的话,请帮忙训练一个吧,期待~~ |
@caseylai Perfromace: |
收到了,感谢 ♪(・ω・)ノ |
Hello, thank you so much for your assistance. We are developing an AI toy, and the character of Shin-chan is incredibly popular within our team. Therefore, we would like to train the wake words "Shin-chan", "Hi, Shin-chan!" and "Hello, Shin-chan!"、"小新", "嗨, 小新" and "你好, 小新!". If you feel it would be more efficient, we are also happy to focus solely on training "Shin-chan"and“小新”. This toy is primarily designed to be a companion, aiming to help people de-stress and add some fun to their lives, and will be geared towards a largely adult audience. Lastly, thank you again for your invaluable help, we are all very excited! |
Dear Team, We are currently developing an AI assistant aimed at providing users with an intuitive and intelligent interaction experience. To make our product more user-friendly, we would like to use specific wake words to activate our AI assistant. Here are our requirements: List of Wake Words:
Background Information: We understand that the choice of wake words is crucial for the user experience of the AI assistant, and we are seeking your support to add the above-mentioned wake words to your wake word model library. Please let us know if this request can be accommodated, and what specific information or materials we need to provide to complete this process. We look forward to your response and appreciate your support for the development of our AI assistant. Best regards |
这个 "小明同学" 大家都认识 🤣, 能否添加呢, https://espai.fun |
我认识,esp-ai,2000人以上用呢,必须支持 |
Thank you all for your liking. The server resources are currently quite limited, so there is no specific timeline. The current waiting list is as follows: |
Thank you for your reply! I'm looking forward very much! |
@sun-xiangyu Hello, we are developing a medical project for the elderly. In order to greatly facilitate the elderly to realize the monitoring of their physical condition through voice wake-up, we would like to use the Chinese wake-up words "小康同学" or "小康管家" to realize voice wake-up. If your company can add them, we will be grateful and look forward to your reply! Thank you! |
@sun-xiangyu 您好,我们团队正在开发一个针对环境工程领域问答的项目,可以极大的方便对应研究生同学在试验台上操作时对于步骤不熟悉而询问,我们想通过中文唤醒词:“你好,小问”or“小问小问”,如果贵公司可以添加,真的感激不禁!! |
小宇同学: wakenet9l_tts2h12_小宇同学_3_0.624_0.630 Perfromace: |
@sun-xiangyu 您好! 这是一个带有智能路由能力的 Agent 系统,可以根据用户需求自动调用不同的功能模块 联网搜索模块 选择"小尊同学"作为唤醒词的原因: "尊"字寓意: 引申为尊重、尊贵的含义,暗合 AI 助手提供有价值服务的定位 发音特点: "尊(zūn)"与"遵(zūn)"同音,巧妙暗示了智能助手能够准确理解和执行指令的特性 与项目理念契合: 基于 MCP 协议的标准化接口,确保了每个功能模块都能严格遵循协议规范 关于 MCP (Model Context Protocol) 协议的实现参考:https://github.com/modelcontextprotocol/servers Hello! This is an Agent system with intelligent routing capabilities that can automatically call different functional modules based on user needs Internet search module Reasons for choosing "小尊同学" as the wake word: The meaning of "尊" (Zun): Implies respect and dignity, aligning with our AI assistant's positioning of providing valuable services Phonetic characteristics: "尊(zūn)" is homophonous with "遵(zūn)" (to follow/obey), cleverly suggesting the assistant's ability to accurately understand and execute commands Alignment with project philosophy: The standardized interface based on MCP protocol ensures that each functional module strictly follows protocol specifications MCP (Model Context Protocol) implementation reference: https://github.com/modelcontextprotocol/servers |
Dear Team, We believe these wake words would be particularly appealing for robotics projects and could benefit other developers creating similar Wall-E inspired projects. Thank you for considering our suggestion! Best regards |
We can try to train a model to detect "嗨,瓦力" and "Hi, Wall-E" at the same time. |
你好,我们开发了一款桌面机器人,目前完成了80%的进度,但是试了下唤醒词,感觉都没有合适的, |
小明同学: wakenet9l_tts2h12_小明同学_3_0.624_0.630 Perfromace: |
Thank you for your help. |
Hi all,
We're excited to offer the community more free and high-quality wake word models. Everyone has their own unique wake word preferences. Now, we're ready to regularly release some of the most popular wake words. Please let us know the wake words you want! English and Chinese are both welcome.
In the past, it was an expensive process to collect high-quality human speech data. But now, our team has developed a cost-effective way to train wake word models by using only TTS samples, which reaches 90-95% accuracy compared to models trained by human-recorded samples.
The wake word models and esp-sr have the same license and are free for commercial use. If you want a more accurate and exclusive wake word, please use our wake word customization service.
Currently, we support over 20 wake words. You can choose any one wake word to test. Starting from August 1, 2024, to get a new wake word, you'll need to meet one of these requirements:
We have released Wake Word Training by TTS V2.0, which improves the TTS model and pipeline. Now it can reach 95-98% accuracy compared to models trained by human samples.
The text was updated successfully, but these errors were encountered: