Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix memcpy issue on backward for zero-infinity
#6670 opened Oct 26, 2024 by xylian86 Loading…
Add support for H100/sm_90 arch compilation
#6669 opened Oct 25, 2024 by loadams Loading…
Update gaudi2 docker version to latest release
#6648 opened Oct 21, 2024 by raza-sikander Loading…
AIO File Offsets
#6641 opened Oct 18, 2024 by jomayeri Loading…
modify_load_save_model
#6626 opened Oct 15, 2024 by ssklzx Loading…
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
Set shuffle=True by default in data_sampler
#6531 opened Sep 13, 2024 by ranzhejiang Loading…
Adding the new feature of FPDT
#6462 opened Aug 29, 2024 by YJHMITWEB Loading…
Add weights_only=True in torch.load
#6094 opened Aug 17, 2024 by terry-for-github Loading…
[NaN check] Add NaN check to support bfloat16.
#5879 opened Aug 8, 2024 by ys950902 Loading…
Fix circular import in ds_transformer.py
#5804 opened Jul 28, 2024 by sznmelvin Loading…
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
ProTip! Adding no:label will show everything without a label.