-
Notifications
You must be signed in to change notification settings - Fork 6
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
More improvements to FSDP, benchmark against DDP (#13)
* leave root params in mem * fix logic * don't show mem usage all the time * fix * make configurable * add alloc/free for unsharded data * add alloc/free for unsharded grad * fix * record for * revert 69d74c4 - alloc/free for unsharded grad * revert alloc/free trick for unsharded params data * add support for DDP in benchmark * set device ids explicitly * fix * change up how weights are initialized * fix test * Add back alloc/free hack for unsharded data * Revert "Add back alloc/free hack for unsharded data" This reverts commit 0386841. * Handle frozen layers with reshard-only post-backward hook * Revert "Handle frozen layers with reshard-only post-backward hook" This reverts commit 0f408d2. * add to test * add to test * Fixes for frozen modules * Divide grad before and after reducing for stability * Add support for hybrid sharding * make grad clipping optional * clean up * calculate grad norm more efficiently * Revert "calculate grad norm more efficiently" This reverts commit d66a683. * fix
- Loading branch information
Showing
8 changed files
with
280 additions
and
39 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.