-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Turn off the microfrontend autogain and noise_suppression? #279
Comments
The mWW models do not work well if you just turn off the AGC and NS, as they were trained with them on. Ideally, we would train on audio processed by XMOS's algorithms, but there doesn't seem to be a straightforward way to do this. We tested several different versions of the audio fed into mWW. Only AEC applied, AEC + Noise Supression, AEC + NS + Interference Cancellation, and AEC + NS + IC + Automatic Gain Control. We had the best results with AEC + NS + IC, so that's what the firmware uses. Duplicating the AGC increased the number of false activations. The XMOS NS and IC algorithms don't seem to interfere with the microfeature's NS algorithm. |
I am saying turn off AGC and NS in training so you can turn off AGC and NS of audio fed into mWW. You tested several different versions of the audio fed into mWW and never changed the training ?! Its not just you are wasting the purchased in speech enhancement by forcing it through basic algs to run on a selection of low end microcontrollers you are wasting all those ops on the esp32-S3 as they are being duplicated on the purchased in xmos... |
https://github.com/google-ai-edge/LiteRT/tree/main/tflite/experimental/microfrontend
I wasn't sure if the config_yaml actually disables microfrontend autogain and noise_suppression or remember if they are disabled training params.
The xmos algs have AGC and NS so if active you likely have a duplication and the microfrontend libs are likely inferior
noise_suppression_level: 0
auto_gain: 0 dbfs
https://github.com/google-ai-edge/LiteRT/blob/main/tflite/experimental/microfrontend/python/ops/audio_microfrontend_op.py
Seems to be enabled in the python microfrontend for training so presuming is also in https://github.com/google-ai-edge/LiteRT/blob/main/tflite/experimental/microfrontend/ops/audio_microfrontend_op.cc
Dunno but likely both should be disabled when using the xmos and datasets normalised to xmos levels
The text was updated successfully, but these errors were encountered: