This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
When training LLaVA 1.6, some image files will inevitably be missing, so the dataset has this catching logic:
This actually works fine when using `deepspeed train.py` or `torchrun train.py`. But when using `accelerate launch`, especially with `auto_find_batch_size = True`, training hangs when a file is not found, even though the exception is caught. I am not sure how to make it work with accelerate. I found that accelerate uses less memory than deepspeed, so I have to use it.
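To illustrate the kind of catching logic I mean, here is a minimal sketch. It is not the actual LLaVA code: the class name `SkipMissingDataset`, the `max_retries` parameter, and the "fall back to the next index" strategy are all hypothetical, and it reads raw bytes instead of decoding an image so that it runs without extra dependencies.

```python
class SkipMissingDataset:
    """Hypothetical map-style dataset that skips samples whose file is missing."""

    def __init__(self, image_paths, max_retries=10):
        self.image_paths = list(image_paths)
        self.max_retries = max_retries  # assumed limit, not from the original code

    def __len__(self):
        return len(self.image_paths)

    def _load(self, index):
        # Stand-in for real image loading: read the raw bytes of the file.
        path = self.image_paths[index]
        with open(path, "rb") as f:
            return {"path": path, "data": f.read()}

    def __getitem__(self, index):
        for _ in range(self.max_retries):
            try:
                return self._load(index)
            except FileNotFoundError:
                # File is missing: catch the error and try the next sample instead.
                index = (index + 1) % len(self)
        raise RuntimeError("too many consecutive missing files")
```

With this sketch, indexing a sample whose file is missing returns the next available sample instead of raising, so every call to `__getitem__` still yields one item.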