Replies: 9 comments
-
>>> alchemi5t |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
>>> alchemi5t |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
>>> alchemi5t |
Beta Was this translation helpful? Give feedback.
-
>>> lissyx |
Beta Was this translation helpful? Give feedback.
-
>>> alchemi5t |
Beta Was this translation helpful? Give feedback.
-
>>> reuben |
Beta Was this translation helpful? Give feedback.
-
>>> alchemi5t |
Beta Was this translation helpful? Give feedback.
-
>>> alchemi5t
[July 23, 2019, 11:54am]
Continuing the discussion from: slash
github.com/mozilla/DeepSpeech
#### Resampling to 16khz
by alchemi5t on 09:04AM - 23 Jul 19
UTC
1 commits changed 3 files with 19 additions and 5
deletions.
> There could be value in ensuring that training can be done at other
> sample rates than 16kHz, but I'm unsure that resampling is the proper
> solution, to be honest.
I have added a variable(can be changed to a flag) which can be set to
desired target SR. But won't having a single sample rate(same as
training and inference) improve convergence?
I've tested training models on different SR and inferencing on 16khz,
and as expected, the model produces unacceptable results(WER, CER, LOSS
and output put together). But the same model infers much better when
using the same SR test files(Test results after training).
It does retain the original audio characteristics after resampling.
Do you think this would not be useful for training?
[This is an archived TTS discussion thread from discourse.mozilla.org/t/resample-training-input-samples-to-align-with-inference-restraints]
Beta Was this translation helpful? Give feedback.
All reactions