Question about training loss #18

Open
stein-666 opened this issue Jul 26, 2024 · 2 comments

@stein-666

stein-666 commented Jul 26, 2024

Hi, I’m trying to reproduce this work. When I train following the process in this repository, the loss decreases rapidly and appears to approach convergence, yet the model fails to reconstruct images. Is this expected?
[Two images attached]

@llvictorll
Collaborator

Hello,

Based on the information and screenshot you provided:

  • Are you using images of size 128? The provided VQGAN is not robust for images smaller than 256.
  • Although the loss drops quickly, the results you show appear to come after only 7,500 iterations. The model needs many more updates to generate good-quality images. The loss drops quickly at the beginning mainly because the model first learns to copy-paste the unmasked tokens. I don't know the other hyperparameters you are using, but factors like batch size, learning rate, or model size can drastically influence training.

Best,

Victor
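
The copy-paste point above can be made concrete with a small back-of-the-envelope sketch. It assumes, hypothetically, that the cross-entropy is averaged over all token positions (this thread does not confirm how the repository computes its loss), a codebook of 1024 entries, 256 tokens per image, and a 50% mask ratio; the function name and every default value are illustrative only. A model that copies the visible tokens perfectly but still guesses uniformly on the masked ones already reports a much lower average loss than an untrained model, even though it cannot generate anything yet:

```python
import math
import random


def copy_paste_losses(num_tokens=256, codebook_size=1024, mask_ratio=0.5, seed=0):
    """Cross-entropy of a hypothetical model that copies visible tokens
    and guesses uniformly over the codebook on masked tokens.

    Returns (loss averaged over all tokens, loss averaged over masked tokens).
    All default values are illustrative assumptions, not repository settings.
    """
    rng = random.Random(seed)
    num_masked = round(mask_ratio * num_tokens)
    masked = set(rng.sample(range(num_tokens), num_masked))

    # A uniform guess over K codes costs log(K) nats; a perfect copy costs ~0.
    uniform_ce = math.log(codebook_size)
    per_token = [uniform_ce if i in masked else 0.0 for i in range(num_tokens)]

    loss_all = sum(per_token) / num_tokens
    loss_masked = sum(per_token[i] for i in masked) / max(num_masked, 1)
    return loss_all, loss_masked


if __name__ == "__main__":
    loss_all, loss_masked = copy_paste_losses()
    print(f"untrained baseline (uniform everywhere): {math.log(1024):.2f}")
    print(f"copy-paste model, averaged over all tokens: {loss_all:.2f}")
    print(f"copy-paste model, masked tokens only: {loss_masked:.2f}")
```

Under these assumptions the all-token average falls from about 6.93 to about 3.47 while the masked-token loss does not move at all, so tracking the loss on masked positions separately is a more informative signal of whether generation quality is actually improving.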

@stein-666
Author

Thanks for your reply. Indeed, the image size is set to 128 for faster training. I will follow the technical report you released and make another attempt. Thanks again!
