AI AI agAIn #14

Open
barnoid opened this issue Nov 3, 2024 · 1 comment

barnoid commented Nov 3, 2024

Eight years ago, for NaNoGenMo 2016, I ran image-captioning AI on 5036 stills from the movie A.I. Artificial Intelligence (2001) in an attempt to generate a novelisation of the film.

Now, for better or worse, I'm going to try this again using current AI tools to see how things have changed.

I suspect the text will be considerably more coherent and more clearly based on the images, but I doubt it will be any easier to get an idea of the film from it.

barnoid commented Nov 5, 2024

Progress:

  • I ripped the DVD again; using ffmpeg to extract stills directly from the VOBs seems to fail.
  • Used ffmpeg to extract 1054 images from the video.
  • This time I'm using the chapters on the DVD as the book chapters.
  • I'm running LLaVA 1.5 locally, using the llamafile from https://github.com/Mozilla-Ocho/llamafile.
  • For the first image in a chapter I give a prompt and the image to Llava and record the resulting text.
  • For the rest of the images I give it the prompt, the text generated for the previous image, and the image.
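
The steps above could be sketched roughly like this. It is a minimal illustration, not the actual script: `describe` is a hypothetical callable standing in for whatever sends the prompt, the previous text and the image to the locally running model, and the still-extraction interval is an assumed parameter, since the real sampling rate isn't stated here.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical signature: (prompt, previous_text_or_None, image_path) -> text.
# In practice this would wrap a request to the local LLaVA server.
Describe = Callable[[str, Optional[str], str], str]


def ffmpeg_still_command(video: str, seconds_per_still: int, out_pattern: str) -> List[str]:
    """Build an ffmpeg argv that writes one still every `seconds_per_still` seconds.

    The interval is an assumption for illustration; it uses ffmpeg's `fps` filter.
    """
    return ["ffmpeg", "-i", video, "-vf", f"fps=1/{seconds_per_still}", out_pattern]


def caption_chapter(images: List[str], prompt: str, describe: Describe) -> List[str]:
    """Caption one chapter's stills in order, chaining each text into the next call."""
    texts: List[str] = []
    previous: Optional[str] = None  # the first image in a chapter has no previous text
    for image in images:
        text = describe(prompt, previous, image)
        texts.append(text)
        previous = text  # the next image sees this image's text
    return texts


def caption_book(chapters: Dict[int, List[str]], prompt: str, describe: Describe) -> Dict[int, str]:
    """Caption every chapter independently, in chapter-number order."""
    return {
        number: "\n\n".join(caption_chapter(images, prompt, describe))
        for number, images in sorted(chapters.items())
    }
```

Because the previous text is reset at the start of each chapter, a single chapter can be regenerated without touching the others.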

So far 20 of 32 chapters are done, for a total of 40,824 words. I'm reading it all as I go to make sure it doesn't say anything horrible. If I don't like a chapter's text, I regenerate it.
