Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Want to suggest a wake word? Leave your thoughts here. (AIS-1441) #88

Open
feizi opened this issue Dec 14, 2023 · 141 comments
Open

Want to suggest a wake word? Leave your thoughts here. (AIS-1441) #88

feizi opened this issue Dec 14, 2023 · 141 comments

Comments

@feizi
Copy link
Collaborator

feizi commented Dec 14, 2023

Hi all,

We're excited to offer the community more free and high-quality wake word models. Everyone has their own unique wake word preferences. Now, we're ready to regularly release some of the most popular wake words. Please let us know the wake words you want! English and Chinese are both welcome.

In the past, it was an expensive process to collect high-quality human speech data. But now, our team has developed a cost-effective way to train wake word models by using only TTS samples, which reaches 90-95% accuracy compared to models trained by human-recorded samples.

The wake word models and esp-sr have the same license and are free for commercial use. If you want a more accurate and exclusive wake word, please use our wake word customization service.

Currently, we support over 20 wake words. You can choose any one wake word to test. Starting from August 1, 2024, to get a new wake word, you'll need to meet one of these requirements:

  • If you've got an ongoing project, kindly attach the project link along with a brief overview when submitting your request.
  • Your wake word has been liked or upvoted by more than five people.

We have released Wake Word Training by TTS V2.0, which improves the TTS model and pipeline. Now it can reach 95-98% accuracy compared to models trained by human samples.

@feizi feizi pinned this issue Dec 14, 2023
@github-actions github-actions bot changed the title Want to suggest a wake word? Leave your thoughts here. Want to suggest a wake word? Leave your thoughts here. (AIS-1441) Dec 14, 2023
@kristiankielhofner
Copy link

The Willow team and community would love "Hey Willow". It's our domain name because we've been waiting for this.

Thank you very much for offering this option, it's very exciting!

@feizi
Copy link
Collaborator Author

feizi commented Dec 14, 2023

The Willow team and community would love "Hey Willow". It's our domain name because we've been waiting for this.

Thank you very much for offering this option, it's very exciting!

I'm glad you like this. Since "hey" and "hi" sound pretty similar, sometimes people might not really notice the difference. So, I was thinking, maybe we could support both "hey willow" and "hi willow" for waking up the device. That way, whether you say "hey willow" or "hi willow", it'll still work. Of course, when we release the wake word model, we'll call it like "wn9_heywillow". What do you think about that?

@kristiankielhofner
Copy link

Good idea!

My only concern would be overall reduced accuracy (wake reliability vs false wake). We've noticed quite a bit of false wake with Alexa. From what I've read the automated TTS approach has 90-95% the accuracy of the models trained on human samples. I like "two word" wake words because they tend to improve accuracy, I suspect a 100% "Hey Willow" wake word could result in equivalent or even improved accuracy with the TTS approach vs even human sample trained Alexa?

Of course we could always test this, even starting with a pure "Hey Willow" model, a pure "Hi Willow" model, and a merged model.

Thanks again for offering this!

@feizi
Copy link
Collaborator Author

feizi commented Dec 14, 2023

Your concern may indeed happen. We will generate two words and test which model performs better.

@feizi
Copy link
Collaborator Author

feizi commented Dec 28, 2023

"hey/hi willow" model:
Model name: wn9_heywillow_tts
FAR(False Alarm Rate): 1 times / 8 hours
RAR(Right Alarm Rate): 88%

Test dataset description:
The FAR dataset: This dataset contains a total of 64 hours of audio data, which includes audio collected from the internet and audio recorded using esp32-korvo boards.
The RAR dataset: This dataset is generated by multiple commercial TTS APIs, with a total of approximately 500 samples. These data and models were not used in the training process. However, due to the differences between TTS samples and human samples, please exercise caution when referring to the test results.

@AigizK
Copy link

AigizK commented Dec 29, 2023

Guys, what you are doing is really great. We have created a smart speaker called Homai based on the esp32-s3. We trained the model ourselves, but it is resource-intensive and not so easy to integrate into the pipeline. Could you please add support for our word Homai [ho'mai]? Thank you in advance!

@sun-xiangyu
Copy link
Contributor

Hi @AigizK ,
The syllable of Homai only has two. It is difficult to reduce the probability of false triggering for monosyllabic and disyllabic phrases. We recommend selecting a 3-5 syllable phrase as the wake word.

@AigizK
Copy link

AigizK commented Jan 3, 2024

Hi @sun-xiangyu
We have already launched a project with this name, so we can't change it significantly. But can we use the variant "homa ai", where the sound 'A' is pronounced long?

@sun-xiangyu
Copy link
Contributor

We have already launched a project with this name, so we can't change it significantly. But can we use the variant "homa ai", where the sound 'A' is pronounced long?

I'm sorry that our TTS model cannot specify a syllable to extend its pronunciation at the moment. This means that we cannot generate a large number of accurate “homa ai” phrases.

@PrathamG
Copy link

PrathamG commented Jan 9, 2024

Hi! Thank you for this awesome solution! We are developing a smart voice assistant called Sophia. Would it be possible to have the wake word "Hi Sophia"? This would help our user experience drastically. Thank you in advance!

@sun-xiangyu
Copy link
Contributor

Hi @PrathamG , I'm glad you like it. "Sophia" sounds like a wake word that can be used directly. I mean, maybe we don't need an extra prefix "Hi". I suggest we start with just "Sophia". If the performance is not satisfactory, then we can train another one with "hi Sophia". What do you think?

@PrathamG
Copy link

PrathamG commented Jan 9, 2024

Sure, that sounds like a good plan! We can use only "Sophia" and test the performance first. Thank you

@PrathamG
Copy link

PrathamG commented Jan 9, 2024

If possible, I also wanted to request the wake word "Little Sophia". We are still unsure about which wake word to use, and having both options will help us determine this via user testing.

@sun-xiangyu
Copy link
Contributor

sun-xiangyu commented Jan 10, 2024

If possible, I also wanted to request the wake word "Little Sophia". We are still unsure about which wake word to use, and having both options will help us determine this via user testing.

Now our computing resources are limited. This project can generate about two wake word models in a month. So we will choose some popular wake words. Of course, if we have some free time, "Little Sophia" is also fine.

@PrathamG
Copy link

No worries, totally understandable! Looking forward to testing out the "Sophia" wake word

@sun-xiangyu
Copy link
Contributor

"Sophia" model: wn9_sophia_tts

FAR(False Alarm Rate): 1 times / 8 hours
RAR(Right Alarm Rate): 97%

@xygh
Copy link

xygh commented Jan 22, 2024

“小美” or “小美同学” would be a perfect choice. It will suit a lot of use case. We all want wake word like a human name.

@sun-xiangyu
Copy link
Contributor

@xygh, “小美同学” sounds good.

@PrathamG
Copy link

"Sophia" model: wn9_sophia_tts

FAR(False Alarm Rate): 1 times / 8 hours RAR(Right Alarm Rate): 97%

Thank you! We will test it out and report the results by next week

@xygh
Copy link

xygh commented Jan 23, 2024

@xygh, “小美同学” sounds good.

BTW, “你好小美” is also a perfect choice.

@Henry586
Copy link

"小当家" or "Hi 小星" is preferable wake word in our scenario. Thanks a lot!

@sun-xiangyu
Copy link
Contributor

The second version "Sophia":
model info: wakenet9l_tts1h8v2_Sophia_3_0.647_0.649

Perfromace:
FAR(False Alarm Rate): 1 times / 8 hours
RAR(Right Alarm Rate): 95%

Improvement:
Add "Sophie" and "Sophy" as hard negatives to reduce false triggers.

@sun-xiangyu
Copy link
Contributor

"小当家" or "Hi 小星" is preferable wake word in our scenario. Thanks a lot!

Both of these words sound good. If you have no preference, we will choose "hi 小星".

@feizi
Copy link
Collaborator Author

feizi commented Jan 30, 2024

"小美同学"
model info: wakenet9l_tts1h8_小美同学_3_0.633_0.644

FAR(False Alarm Rate): 1 times / 8 hours
RAR(Right Alarm Rate): 95%

@lewardo
Copy link

lewardo commented Feb 11, 2024

Hello! This is a great opportunity I was hoping would come up, I'm so glad this is now possible! I've seen that the wake-words "Mycroft" and "Hey, Mycroft" are very popular in the community, and it is also the name of my product so would very much improve user experience. Would it be possible to have either of these trained and released for the community? Thank you so much in advance for this!

@sun-xiangyu
Copy link
Contributor

@lewardo, I'm glad it could help you. Although "Mycroft" is simpler, it seems there are quite a few words that sound similar, so I'll prioritize training with "Hey Mycroft."

@sun-xiangyu
Copy link
Contributor

sun-xiangyu commented Dec 16, 2024

@PoohWoah

当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ 这个位置替换掉了原本的内容,然后在设置里选择 himiaomiao 唤醒词就可以了。

In this folder at the current head commit. The new version has not been published on the Registry, so I temporarily placed these files in the managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ directory at the project root, replacing existing content, and then selected the himiaomiao wake word in the configuration.

Yes, as @ayuusweetfish mentioned, you can find the wake word model you want in wakenet_model folder, then overwrite the model you were previously using, and it will be ready to use.

@sun-xiangyu
Copy link
Contributor

@Spartan859
璃奈板: wakenet9l_tts2h12_Linaiban_3_0.635_0.640

Perfromace:
FAR(False Alarm Rate): 1 times / 12 hours
RAR(Right Alarm Rate): 95%

@PoohWoah
Copy link

@PoohWoah

当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 这个位置替换掉了原本的内容,然后在设置里选择 唤醒词就可以了。managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/``himiaomiao

在此文件夹中的当前 head 提交。新版本还没有在 Registry 上发布,所以我临时把这些文件放在了项目根目录下的目录下,替换了已有的内容,然后在配置中选择了唤醒词。managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/``himiaomiao

谢谢你的回复

@PoohWoah
Copy link

@PoohWoah
当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 这个位置替换掉了原本的内容,然后在设置里选择 唤醒词就可以了。managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/``himiaomiao
在此文件夹中的当前 head 提交。新版本还没有在 Registry 上发布,所以我临时把这些文件放在了项目根目录下的目录下,替换了已有的内容,然后在配置中选择了唤醒词。managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/``himiaomiao

是的,如前所述,您可以在 wakenet_model 文件夹中找到所需的唤醒词模型,然后覆盖您之前使用的模型,它就可以使用了。

明白了 谢谢

@PoohWoah
Copy link

@PoohWoah
当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ 这个位置替换掉了原本的内容,然后在设置里选择 himiaomiao 唤醒词就可以了。
In this folder at the current head commit. The new version has not been published on the Registry, so I temporarily placed these files in the managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ directory at the project root, replacing existing content, and then selected the himiaomiao wake word in the configuration.

Yes, as @ayuusweetfish mentioned, you can find the wake word model you want in wakenet_model folder, then overwrite the model you were previously using, and it will be ready to use.

请问下您这个唤醒词支持adf里面替换吗,目前有用到adf的唤醒

@sun-xiangyu
Copy link
Contributor

@PoohWoah
当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ 这个位置替换掉了原本的内容,然后在设置里选择 himiaomiao 唤醒词就可以了。
In this folder at the current head commit. The new version has not been published on the Registry, so I temporarily placed these files in the managed_components/espressif__esp-sr/model/wakenet_model/wn9_himiaomiao_tts/ directory at the project root, replacing existing content, and then selected the himiaomiao wake word in the configuration.

Yes, as @ayuusweetfish mentioned, you can find the wake word model you want in wakenet_model folder, then overwrite the model you were previously using, and it will be ready to use.

请问下您这个唤醒词支持adf里面替换吗,目前有用到adf的唤醒

当然可以,adf 也是用esp-sr进行唤醒

@l137295
Copy link

l137295 commented Dec 31, 2024

期待您帮助训练以下唤醒词。
“Hi,清风”
清风有清风徐来意思是微风轻轻地、缓慢地吹来,形容风势轻柔、舒缓,蕴含宁静、舒适及大自然的温柔和谐,愿我们每个开发的项目都能如浴清风,也能贴近大自然的温柔和谐。

@sun-xiangyu
Copy link
Contributor

“Hi,春风“ 也不错
遇事不决,可问春风

@l137295
Copy link

l137295 commented Dec 31, 2024

“嗨,春风”也不错 遇事不决,可问春风

哈哈,让我想起了《剑来》,不错,就是在冬天的时候喊,怪怪的。

@caseylai
Copy link

caseylai commented Jan 9, 2025

您好,可否帮忙训练一个叫“小酥肉”的唤醒词?我正在用ESP32S3开发一个面向儿童、学生的语音助手(也支持成人使用),已经接近完成,问了下大家都非常喜欢和期待“小酥肉”这个名称,如果可以使用这个名称,会对提升产品效果有很大的帮助。非常感谢~~ :)

@sun-xiangyu
Copy link
Contributor

您好,可否帮忙训练一个叫“小酥肉”的唤醒词?我正在用ESP32S3开发一个面向儿童、学生的语音助手(也支持成人使用),已经接近完成,问了下大家都非常喜欢和期待“小酥肉”这个名称,如果可以使用这个名称,会对提升产品效果有很大的帮助。非常感谢~~ :)

Although we have implemented some optimizations, children's voices is still a challenge to our current TTS wake word model.

@caseylai
Copy link

哦,补充下,不是那种很小的小孩子。一般是小学五六年级和初高中学生,说话连贯性和准确度都类似成人了,我觉得可以用成人的数据。另外,目前调研了一下,也就是用esp-sr的方案最好,用其他方案都会有一些受限于算力和能耗方面的问题。如果可以的话,请帮忙训练一个吧,期待~~

@sun-xiangyu
Copy link
Contributor

@caseylai
小酥肉: wakenet9l_tts2h12_小酥肉_3_0.625_0.630

Perfromace:
FAR(False Alarm Rate): 1 times / 12 hours
RAR(Right Alarm Rate): 97%

@caseylai
Copy link

@caseylai 小酥肉: wakenet9l_tts2h12_小酥肉_3_0.625_0.630

Perfromace: FAR(False Alarm Rate): 1 times / 12 hours RAR(Right Alarm Rate): 97%

收到了,感谢 ♪(・ω・)ノ

@kongusen
Copy link

Hello, thank you so much for your assistance. We are developing an AI toy, and the character of Shin-chan is incredibly popular within our team. Therefore, we would like to train the wake words "Shin-chan", "Hi, Shin-chan!" and "Hello, Shin-chan!"、"小新", "嗨, 小新" and "你好, 小新!". If you feel it would be more efficient, we are also happy to focus solely on training "Shin-chan"and“小新”. This toy is primarily designed to be a companion, aiming to help people de-stress and add some fun to their lives, and will be geared towards a largely adult audience. Lastly, thank you again for your invaluable help, we are all very excited!

@Rvn0xsy
Copy link

Rvn0xsy commented Feb 7, 2025

Dear Team,

We are currently developing an AI assistant aimed at providing users with an intuitive and intelligent interaction experience. To make our product more user-friendly, we would like to use specific wake words to activate our AI assistant. Here are our requirements:

List of Wake Words:

  • Chinese: 小宇同学, 小羽同学, 小雨同学 (同音)
  • English: Hey Luma

Background Information:
Our AI assistant is designed for Chinese-speaking users, and therefore, Chinese wake words are more intuitive and familiar to our target audience. Additionally, we would also like to offer an English version to accommodate potentially multilingual users.

We understand that the choice of wake words is crucial for the user experience of the AI assistant, and we are seeking your support to add the above-mentioned wake words to your wake word model library.

Please let us know if this request can be accommodated, and what specific information or materials we need to provide to complete this process.

We look forward to your response and appreciate your support for the development of our AI assistant.

Best regards

@wangzongming
Copy link

这个 "小明同学" 大家都认识 🤣, 能否添加呢, https://espai.fun

@haoxueqing259
Copy link

这个 "小明同学" 大家都认识 🤣, 能否添加呢, https://espai.fun

我认识,esp-ai,2000人以上用呢,必须支持

@sun-xiangyu
Copy link
Contributor

Thank you all for your liking. The server resources are currently quite limited, so there is no specific timeline. The current waiting list is as follows:
小宇同学
小明同学
小新/Shin-chan (I want to support Japanese, so some work is still needed).

@Rvn0xsy
Copy link

Rvn0xsy commented Feb 11, 2025

Thank you all for your liking. The server resources are currently quite limited, so there is no specific timeline. The current waiting list is as follows: 小宇同学 小明同学 小新/Shin-chan (I want to support Japanese, so some work is still needed).

Thank you for your reply! I'm looking forward very much!

@positive1210
Copy link

positive1210 commented Feb 14, 2025

@sun-xiangyu Hello, we are developing a medical project for the elderly. In order to greatly facilitate the elderly to realize the monitoring of their physical condition through voice wake-up, we would like to use the Chinese wake-up words "小康同学" or "小康管家" to realize voice wake-up. If your company can add them, we will be grateful and look forward to your reply! Thank you!

@mdjhacker
Copy link

@sun-xiangyu 您好,我们团队正在开发一个针对环境工程领域问答的项目,可以极大的方便对应研究生同学在试验台上操作时对于步骤不熟悉而询问,我们想通过中文唤醒词:“你好,小问”or“小问小问”,如果贵公司可以添加,真的感激不禁!!
@sun-xiangyu Hello, our team is developing a Q&A project for the environmental engineering field, which can greatly help graduate students when they are unfamiliar with steps while operating on experimental benches. We would like to use Chinese wake words: "你好,小问" or "小问小问". If your company could add these, we would be incredibly grateful!!

@sun-xiangyu
Copy link
Contributor

@Rvn0xsy

小宇同学: wakenet9l_tts2h12_小宇同学_3_0.624_0.630

Perfromace:
FAR(False Alarm Rate): 1 times / 12 hours
RAR(Right Alarm Rate): 97%

@lcolok
Copy link

lcolok commented Feb 18, 2025

@sun-xiangyu 您好!
我们正在开发一个基于 MCP 协议的智能助手项目,希望能将"小尊同学"添加为唤醒词。
项目背景:

这是一个带有智能路由能力的 Agent 系统,可以根据用户需求自动调用不同的功能模块
基于 MCP (Model Context Protocol) 协议开发,支持灵活扩展各类功能,如:

联网搜索模块
智能家居控制模块
长期记忆存储模块
其他基于 MCP 协议的扩展工具

选择"小尊同学"作为唤醒词的原因:

"尊"字寓意:

引申为尊重、尊贵的含义,暗合 AI 助手提供有价值服务的定位

发音特点:

"尊(zūn)"与"遵(zūn)"同音,巧妙暗示了智能助手能够准确理解和执行指令的特性
体现了助手既尊重用户意愿,又能遵循指令精准执行的双重含义

与项目理念契合:

基于 MCP 协议的标准化接口,确保了每个功能模块都能严格遵循协议规范
"尊"字既表达了对用户的尊重,又暗示了对指令的严格执行
这个双关含义完美契合了我们的项目理念:打造一个既懂礼貌、又专业可靠的智能助手

关于 MCP (Model Context Protocol) 协议的实现参考:https://github.com/modelcontextprotocol/servers
非常感谢您的帮助和支持!期待您的回复。


Hello!
We are developing an intelligent assistant project based on the MCP protocol and would like to request adding "小尊同学" (Xiao Zun Tongxue) as a wake word.
Project Background:

This is an Agent system with intelligent routing capabilities that can automatically call different functional modules based on user needs
Developed based on MCP (Model Context Protocol), supporting flexible extension of various functionalities, such as:

Internet search module
Smart home control module
Long-term memory storage module
Other extension tools developed with the MCP protocol

Reasons for choosing "小尊同学" as the wake word:

The meaning of "尊" (Zun):

Implies respect and dignity, aligning with our AI assistant's positioning of providing valuable services

Phonetic characteristics:

"尊(zūn)" is homophonous with "遵(zūn)" (to follow/obey), cleverly suggesting the assistant's ability to accurately understand and execute commands
Embodies the dual meaning that the assistant both respects user intentions and precisely follows instructions

Alignment with project philosophy:

The standardized interface based on MCP protocol ensures that each functional module strictly follows protocol specifications
The character "尊" expresses both respect for users and implies strict execution of commands
This double meaning perfectly matches our project philosophy: creating an intelligent assistant that is both polite and professionally reliable

MCP (Model Context Protocol) implementation reference: https://github.com/modelcontextprotocol/servers
Thank you very much for your help and support! Looking forward to your reply.

@wangwd123
Copy link

Dear Team,
We're currently developing a desktop toy robot inspired by Wall-E. It would be amazing if we could use '你好,瓦力' , '嗨,瓦力' and 'Hi, Wall-E' as the wake words to activate our little robot. This would perfectly match the character and make the interaction more immersive for users.

We believe these wake words would be particularly appealing for robotics projects and could benefit other developers creating similar Wall-E inspired projects.

Thank you for considering our suggestion!

Best regards

@sun-xiangyu
Copy link
Contributor

Dear Team, We're currently developing a desktop toy robot inspired by Wall-E. It would be amazing if we could use '你好,瓦力' , '嗨,瓦力' and 'Hi, Wall-E' as the wake words to activate our little robot. This would perfectly match the character and make the interaction more immersive for users.

We believe these wake words would be particularly appealing for robotics projects and could benefit other developers creating similar Wall-E inspired projects.

Thank you for considering our suggestion!

Best regards

We can try to train a model to detect "嗨,瓦力" and "Hi, Wall-E" at the same time.

@welkinchan
Copy link

你好,我们开发了一款桌面机器人,目前完成了80%的进度,但是试了下唤醒词,感觉都没有合适的,
要么寓意与我们的预期不一致,要么唤醒效果很差,
所以现在还差一个比较适配的,比较个性化的唤醒词。
我们现在,很像有一个叫“Hi, Joybeing”,或者直接是“Joybeing”的唤醒词,
我们觉得这个名字很适合桌面机器人,
请问能否帮忙训练一下这个唤醒词?非常感谢!

@sun-xiangyu
Copy link
Contributor

@wangzongming

小明同学: wakenet9l_tts2h12_小明同学_3_0.624_0.630

Perfromace:
FAR(False Alarm Rate): 1 times / 12 hours
RAR(Right Alarm Rate): 97%

@wangzongming
Copy link

@wangzongming

小明同学: wakenet9l_tts2h12_小明同学_3_0.624_0.630

Perfromace: FAR(False Alarm Rate): 1 times / 12 hours RAR(Right Alarm Rate): 97%

Thank you for your help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests