Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

中文语音合成无法对齐 #41

Open
cptbtptp2333 opened this issue May 8, 2022 · 0 comments
Open

中文语音合成无法对齐 #41

cptbtptp2333 opened this issue May 8, 2022 · 0 comments

Comments

@cptbtptp2333
Copy link

我在训练采用标贝数据集 基于pytorch的中文语音合成baseline,模型主要参考了nvidia/Tacotron2,目前尝试了拼音建模(字母编码转sequence),batch size 设为32,采样率为48000。目前训练57k steps后,loss已经不再下降(0.3左右),但完全没有对齐,inference结果也不对。请问可能是什么原因,应该如何调整呢?

In the training, I use the biaobei dataset and modify the sampling rate to 48000. I use Pinyin modeling (from character to sequence), and the batch size is set to 32. At present, after training 57K steps, loss no longer decreases, but there is no alignment at all, and the synthesis speech wave is also wrong. What are the possible reasons and how should they be adjusted?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant