Discussion on GAN tricks on modern methods and issues #54
-
Dear all, |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 6 replies
-
As the first question, has anyone used global max pooling or global average pooling, instead of global sum pooling in the discriminator (after all res_blocks)? Do you think the impact will be distinguishable? |
Beta Was this translation helpful? Give feedback.
-
@mingukkang In ContraGAN, if the cosine similarity score of the embedding layer does not show the correct diversity ( showing similar scores for many classes although the images are not that similar), does it mean that we will have mode collapse? If so, should we increase the number of embedding dimensions (hypersphere dim.) or add weight decay to the shared embeddings? |
Beta Was this translation helpful? Give feedback.
-
Hi, I think this is a class confusion problem of ContraGAN. The main difference between ContraGAN and ProjGAN is that ContraGAN considers data-to-data relationships while ProjGAN only contemplates data-to-class relations and regards each class as having independent characteristics. I think that the confusion comes from classes whose features are similar to each other, but 2C_loss can not effectively figure it out. The solution to this problem is finding good hyperparameters for better optimization. In my experience, the following settings helped me to train an unconfused ContraGAN. Higher-dimensional hypersphere, Thank you. |
Beta Was this translation helpful? Give feedback.
-
@mingukkang Hi, I was wondering about sth. What happens to the image-to-image similarity term in the 2c loss, if there will be no common class labels in the batch (no class image occurs twice e.g a batch of size 4 with images from classes 1,2,3,4)? So, in this case, the image-to-image term 1_{y_i=y_k}.exp(l(x_i)l(x_k).T) in the numerator becomes 0? |
Beta Was this translation helpful? Give feedback.
As the first question, has anyone used global max pooling or global average pooling, instead of global sum pooling in the discriminator (after all res_blocks)? Do you think the impact will be distinguishable?