-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multiple images #9
Comments
Hi, @cpnovaes, thank you! Indeed, that would require some changes to the code examples, but the current conditional model allows to specify the class PixelDiffusionConditional(PixelDiffusion):
def __init__(self,
train_dataset,
valid_dataset=None,
condition_channels=3, # <- here
batch_size=1,
lr=1e-3): This means that you can potentially set the I am not entirely sure if this is what you're looking for, so let me know (and ideally provide some data examples) if this needs further discussion! Thanks again |
Hi @mikonvergence, thanks a lot for answering my question! I have been trying to modify the code following your suggestion, which is what I was looking for. I figured that, in the case of input and output having a different number of channels, I need to do the following: class PixelDiffusionConditional(PixelDiffusion):
def __init__(self,
train_dataset,
valid_dataset=None,
condition_channels=3, # <- here
generated_channels=3, # <- also here!
batch_size=1,
lr=1e-3): and use: Please, let me know if that make sense or if I may be missing something. In the case of Conditional Latent Diffusion, I am still trying to implement a similar idea, but I am having problems making the autoencoder accept a different number of channels. In principle, I could follow the same idea, right? Thanks! |
Hi @cpnovaes! That's exactly the right approach with the PixelDiffusion type. For the latent diffusion, that will be tricky, because:
However, if your signals are only 64 by 64, there could be less need for a latent diffusion approach. If you later plan to work with larger matrices, then I would suggest to finetune your own autoencoder, but that's outside of the scope of this course. I am always happy to provide hints here though, so feel free to continue this thread. |
Hi @mikonvergence ! Thank you so much, I have learned a lot from all your comments! My signals are 128x128, but I am testing the PixelDiffusion on them. In the meantime, I am also studying a possible implementation of the autoenconder. Thanks again! |
Hi! Thank you so much for providing the code. The video course is amazing, really helpful.
I have a question: is it possible to modify the code so that we can use more than one image as input and condition (target) data? In other words, can we do, e.g., 2images-to-3images, taking 2 images to predict 3 images.
Thanks again!
The text was updated successfully, but these errors were encountered: