Issues: huggingface/accelerate
tests/test_cli.py::ModelEstimatorTester::test_no_split_modules fails after 84a67891 in Transformers
#3362
opened Jan 23, 2025 by
dvrogozh
Google Colab TPU notebook_launcher doesn't work
#3358
opened Jan 21, 2025 by
matinmoezzi
2 of 4 tasks
Running TRL example doesn't work when using deepspeed on accelerate
#3354
opened Jan 20, 2025 by
Superskyyy
1 of 4 tasks
AttributeError: _hf_hook caused by delattr in hooks.remove_hook_from_module()
#3342
opened Jan 13, 2025 by
eppaneamd
2 of 4 tasks
Maybe a conflict between accelerate and transformers CLIPVisionModel
#3339
opened Jan 13, 2025 by
striveAgain
2 of 4 tasks
[Feature Request] include a DeepSpeed multi-node config slurm example
Labels: contributions-welcome, deepspeed (DS related issues/PRs)
#3338
opened Jan 13, 2025 by
sayakpaul
How to save self-defined model with deepspeed zero 3?
#3320
opened Jan 2, 2025 by
amoyplane
2 of 4 tasks
openmpi launchers for deepspeed should not set unsupported --num_gpus
#3319
opened Jan 2, 2025 by
chiragjn
virtual environment with torch cpu forces to install torch gpu
#3317
opened Dec 27, 2024 by
mgathena
4 tasks
Accelerate + deepspeed_stage_zero3 Cannot save model checkpoint
#3316
opened Dec 27, 2024 by
wangyu-ustc
2 of 4 tasks
ValueError: Default process group has not been initialized, please make sure to call init_process_group
#3315
opened Dec 27, 2024 by
wangyu-ustc
4 tasks done
distributed GPU inference across multiple machines on the same network
#3313
opened Dec 25, 2024 by
MonkeeMan1
2 of 4 tasks
Accelerate on SLURM: server socket has failed to listen on any local network address
#3312
opened Dec 24, 2024 by
angadsinghsandhu
2 of 4 tasks
project_configuration.automatic_checkpoint_naming synchronization between load_state and save_state
#3306
opened Dec 19, 2024 by
diffunity