Hello. Thank you very much for sharing such great results. I really want to fine-tune and use this model.
As far as I understand, running `finetune_docowl.sh` deadlocks with DeepSpeed ZeRO stage 3 and stage 3 with offload (zero3, zero3-offload), and `finetune_docowl_lora.sh` seems to deadlock the same way with both zero2 and zero3.
I also haven't been able to run `finetune_docowl.sh` with zero2, because it fails with a CUDA out-of-memory (OOM) error.
Am I understanding this correctly? If you have resolved any of these deadlock issues, please share how.