Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请教一些关于数据集的收集和微调问题 #110

Open
954028118 opened this issue Jan 26, 2025 · 0 comments
Open

请教一些关于数据集的收集和微调问题 #110

954028118 opened this issue Jan 26, 2025 · 0 comments

Comments

@954028118
Copy link

1 关于voxceleb2数据集,我下载了不同来源的几份数据后发现视频的分辨率都是224224,作者是在这个224224的基础上resize到某个固定的大小,再通过数据预处理的pipeline得出最终256256的训练数据吗?
2 对于256
256分辨率的中文微调,参照论文及各个issue知道只需微调unet的stage2即可,这部分的微调数据是只需要一些(中文数据集)即可还是说应该是(原始训练集+自定义中文数据集),还有就是微调的数据量大概需要多少个小时的数据呢?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant