Not an issue - richer datasets #6
Comments
Hi, I was not aware of these; they are very beautiful!
"so no optimization loop" - fyi - @nerdyrodent
"does that mean there's no 500x iterations to get a good looking image?" Yes.
Following the tweet you mentioned above, here is an example with "deviantart, volcano": https://imgur.com/a/cYMsNo5 with a model currently being trained on Conceptual Captions 12M.
@johndpope I added a bunch of pre-trained models if you want to give them a try.
I had a play with the 1.7 GB cc12m_32x1024 model. I couldn't get the high quality that I was getting with VQGAN-CLIP, but I will keep trying and bump up the dimensions. Maybe the docs could use some pointers on image sizes (256x256, 512x512, etc.). Here are some images I turned out over the weekend: Observations; Los Angeles Album Cover. This didn't quite cut it. Other improvements for newbies: you could consider integrating these downloads into the README.
@johndpope have you considered re-embedding the outputs from the trained
Also, one of the other primary benefits of this approach is that if you'd like to fine-tune from one of the checkpoints, or even train your own from scratch, this can be relatively simple, as all you need are some captions, which can be generated or typed out. You'll want to cover a large-ish corpus, but using something like the provided MIT States captions as a base should be a good start.
Thanks for the extra info. I'm a little busy today, but I think the README might need one or two more things, and possibly a Colab notebook specific to training (if we don't have that already) that would make it easy to customize MIT States.
Edit: real-time updates to your captions, display of the rate of generations, etc. may be outside the scope of the project.
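For anyone wanting to try the "captions can be generated" idea for something like cityscapes, here is a rough sketch of building a caption list by combining a few word lists. Everything in it is an assumption for illustration: the vocabulary, the function names, and the one-caption-per-line file layout are hypothetical and are not the repo's confirmed input format, so check the README before feeding it to training. The style prefix simply mirrors the "deviantart, volcano" prompt shape mentioned above.

```python
import itertools
from pathlib import Path

# Hypothetical vocabulary for cityscape captions; replace with your own terms.
SUBJECTS = ["a city skyline", "a downtown street", "a harbor district"]
MODIFIERS = ["at night", "at sunset", "in the rain"]
STYLES = ["", "deviantart", "album cover"]  # "" means no style prefix


def build_captions(subjects, modifiers, styles):
    """Return every subject/modifier/style combination as a caption string."""
    captions = []
    for subject, modifier, style in itertools.product(subjects, modifiers, styles):
        caption = f"{subject} {modifier}"
        if style:
            # Prefix the style, e.g. "deviantart, a city skyline at night".
            caption = f"{style}, {caption}"
        captions.append(caption)
    return captions


def write_captions(captions, path):
    """Write one caption per line (an assumed layout, not a confirmed format)."""
    Path(path).write_text("\n".join(captions) + "\n", encoding="utf-8")


captions = build_captions(SUBJECTS, MODIFIERS, STYLES)
```

Calling `write_captions(captions, "cityscape_captions.txt")` (a hypothetical file name) would save the list to disk. Three lists of three terms only give 27 captions, so a real corpus would need much larger word lists or a mix with existing captions such as the MIT States ones.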
Are you familiar with this? https://twitter.com/e08477/status/1418440857578098691?s=21
I want to do cityscape shots. Are you familiar with any relevant datasets?
Can this repo help output higher quality images? Or does it help with the prompting?