Question about output size of D network #38
Comments
Seems like a hastily modified version of some of StyleGAN's modules.
I'm confused, too. Since mapping_tl has no activation function, it seems to be the 3-layer multilinear map mentioned in the paper. The output of mapping_tl is supposed to be used to minimize the coding reconstruction error. However, the style output of the encoder is what is used to minimize the coding reconstruction error, while the output of mapping_tl is used to compute the discriminator loss. I wonder whether this is a bug or whether I'm misunderstanding something.
Yes, just one is used. The others should not affect anything.
@6b5d, yes, that's a bug.
The output of the original StyleGAN discriminator is a scalar predicting whether the given image is real or fake. However, the output shape of your D network is batch x (2 * dlatent_size) at the line below.
ALAE/net.py, line 893 in 5d8362f
You therefore select one of the 2 * dlatent_size elements as the final output of the D network (the value used in the loss function) at the line below (Z_).
ALAE/model.py, line 111 in 5d8362f
I'm curious why the output shape of the D network is batch x (2 * dlatent_size), given that only one element is used for training and the others are never used.
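To make the question concrete, here is a minimal NumPy sketch of the situation described above. It is not the repository's code: the sizes, the single linear layer standing in for the end of D, the choice of column 0 as the selected element, and the softplus loss are all assumptions made for illustration. It shows that when only one of the 2 * dlatent_size outputs enters the loss, only the weights feeding that output receive a nonzero gradient.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for this illustration.
batch, feat_dim, dlatent_size = 4, 8, 3
out_dim = 2 * dlatent_size  # output width described in the issue

# Stand-in for the last linear layer of D: features -> 2 * dlatent_size outputs.
W = rng.normal(size=(out_dim, feat_dim))
b = np.zeros(out_dim)

features = rng.normal(size=(batch, feat_dim))
d_out = features @ W.T + b  # shape: (batch, 2 * dlatent_size)

# Keep a single column as the discriminator score (index 0 is an assumption).
score = d_out[:, 0]  # shape: (batch,)

# Toy loss on the selected column: softplus(-score), averaged over the batch.
loss = np.mean(np.log1p(np.exp(-score)))

# Manual backward pass: d(loss)/d(d_out) is nonzero only in the selected column.
grad_out = np.zeros_like(d_out)
grad_out[:, 0] = -1.0 / (1.0 + np.exp(score)) / batch
grad_W = grad_out.T @ features  # shape: (out_dim, feat_dim)

print(d_out.shape)
print(np.abs(grad_W).sum(axis=1))  # per-output-row gradient magnitude
```

In this sketch the rows of W that produce the unselected outputs get exactly zero gradient, so they stay at their initial values and only cost extra computation.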
Also, I can't understand why the output of the D network is reshaped the way it is at the line below.
ALAE/net.py, line 903 in 5d8362f