You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I found that the default image preprocess method is only for single image input.
in process_images func, we will use process_anyres_image as default preprocessor, which will cause a huge increase for image input tokens(surpass 32768) when we have multi images input. My solution for inference is change to preprocessor used in training
Hi, I found that the default image preprocess method is only for single image input.
in
process_images
func, we will useprocess_anyres_image
as default preprocessor, which will cause a huge increase for image input tokens(surpass 32768) when we have multi images input. My solution for inference is change to preprocessor used in trainingLLaVA-NeXT/llava/train/train.py
Line 1147 in 79ef45a
is that correct?
The text was updated successfully, but these errors were encountered: