Custom Dalle-2 trained decoder generating random noise #298

Open
rahulmoorthy19 opened this issue Aug 8, 2023 · 1 comment

@rahulmoorthy19

Hi,

Thank you for creating this repository; it is really helpful. I trained a decoder for a custom task, but when I sample from it, the generated images are random noise. A sample image generated by the model is shown below:
[image: trial]

The inference code for the decoder is as follows:
import torch
image_decoder = torch.load(image_decoder_path).cuda()
image_generated = image_decoder.sample(image_embed = image_proj, cond_scale = 2.)
Here, image_proj is a processed embedding from a trained CLIP model.

This is the Unet and Decoder configuration I used for training:

from dalle2_pytorch import Unet, Decoder

unet = Unet(
    dim = 8,
    image_embed_dim = 256,
    cond_dim = 128,
    channels = 3,
    dim_mults = (1, 2, 4, 8)
).cuda()

decoder = Decoder(
    unet = unet,
    image_size = 224,
    timesteps = 100,
    image_cond_drop_prob = 0.1,
    text_cond_drop_prob = 0.5,
    learned_variance = False
).cuda()

The training loss comes out to 0.051178544054353195.
Any help would be really appreciated. Thank you!

@u1ug
u1ug commented Aug 12, 2024

You need to pass an image embedding sampled from the prior, not an embedding taken directly from CLIP.
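
A rough sketch of what that might look like, assuming a trained diffusion prior is available and exposes a `sample` method that takes tokenized text, as the repository's README suggests. The helper names `check_image_embed_shape` and `generate_from_text` are hypothetical and not part of dalle2_pytorch; the default of 256 is taken from `image_embed_dim` in the Unet config above.

```python
def check_image_embed_shape(shape, image_embed_dim=256):
    """Return True if shape looks like (batch, image_embed_dim)."""
    return len(shape) == 2 and shape[1] == image_embed_dim


def generate_from_text(diffusion_prior, decoder, text_tokens, cond_scale=2.0):
    """Sample an image embedding from the prior, then decode it to images.

    Assumptions (not confirmed by the thread): diffusion_prior is a trained
    prior whose sample() accepts tokenized text, and decoder is the trained
    Decoder from the original post.
    """
    # Image embedding produced by the prior rather than taken from CLIP
    image_embed = diffusion_prior.sample(text_tokens, cond_scale=1.0)

    # Catch a dimension mismatch with the Unet's image_embed_dim early
    if not check_image_embed_shape(tuple(image_embed.shape)):
        raise ValueError(f"unexpected image_embed shape: {tuple(image_embed.shape)}")

    return decoder.sample(image_embed=image_embed, cond_scale=cond_scale)
```

The shape check is only a quick sanity test; a correctly shaped embedding can still come from the wrong distribution, so it does not replace comparing samples against a known-good embedding.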
