Longer sample produce very noisy output #13

Open
ianni67 opened this issue Feb 2, 2017 · 2 comments

Comments

@ianni67

ianni67 commented Feb 2, 2017

Yes, I'm kinda new to TF and still... training, so bear with me if my questions are basic.
I'm experimenting with the demo. It trained and generated correctly with the very short audio sample provided with the code, but then I wanted to try something different. I ran the demo on a short (about 20 seconds) sample from a well-known Beethoven symphony and then generated 300000 samples. Something strange happened: only the first half second is fine; the rest of the generated sound is extremely noisy and barely recognizable.
In the code, I changed only the path of the input audio and the duration of the generated audio.
What am I doing wrong? Thank you for your patience in reading my post (and answering, if possible!)

@tomlepaine
Owner

Hi @ianni67, what are your goals exactly?

This code is designed to demonstrate the fast wavenet generation algorithm.

If you want to learn the structure of music and generate novel samples, that is not what this repo is designed for. Instead try tensorflow-wavenet, which allows you to train on a large body of data.

If you want to memorize a single audio sample, this code should work, though I might have made some assumptions about audio size that break it. If you can fix it, please make a pull request 😸.

Best,
Tom

@ianni67
Author

ianni67 commented Feb 10, 2017

My short-term goal is to experiment with WaveNet.
My long-term goal is to train a net for music generation.
I also tried tensorflow-wavenet and got similar results: the output is very noisy, while the input is not. I'm probably pushing the wrong buttons.
Could you please give me some hints about how the input should be pre-conditioned, or about the kind of output I can expect? Or (even better) some initial pointers on how to tune the parameters?
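
For context on the pre-conditioning question: WaveNet-style models commonly compress the waveform with mu-law companding and quantize it to 256 levels before training, so the network predicts one of 256 classes per sample. Below is a minimal sketch of that step in plain Python. It assumes the samples are already scaled to [-1, 1], and the function names `mu_law_encode` and `mu_law_decode` are hypothetical, not taken from either repository discussed here.

```python
import math

MU = 255  # assumed: 256 quantization levels (classes 0..255)


def mu_law_encode(x, mu=MU):
    """Map a float sample in [-1, 1] to an integer class in [0, mu]."""
    x = max(-1.0, min(1.0, x))  # clip out-of-range samples
    # Compress: quiet samples get finer resolution than loud ones.
    compressed = math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)
    # Shift from [-1, 1] to [0, mu] and round to the nearest level.
    return int(math.floor((compressed + 1) / 2 * mu + 0.5))


def mu_law_decode(index, mu=MU):
    """Approximate inverse of mu_law_encode: class index back to a float."""
    compressed = 2 * (index / mu) - 1
    magnitude = math.expm1(abs(compressed) * math.log1p(mu)) / mu
    return math.copysign(magnitude, compressed)


if __name__ == "__main__":
    # Round-trip a few samples to see the quantization error.
    for sample in (-1.0, -0.1, 0.0, 0.01, 0.5, 1.0):
        code = mu_law_encode(sample)
        print(sample, code, mu_law_decode(code))
```

Round-tripping a clean input through these two functions gives a feel for how much noise quantization alone contributes, which can help separate pre-processing problems from training problems.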


2 participants