
modify_load_save_model #6626

Open · wants to merge 2 commits into master

Conversation

@ssklzx commented Oct 15, 2024

```python
accelerator = Accelerator()
model, optimizer, data = accelerator.prepare(model, optimizer, data)
device_map = {}
model = accelerate.dispatch_model(model, device_map=device_map)
accelerator.save_state(save_path)
```

When I call `accelerate.dispatch_model` after `accelerator.prepare`, an error is raised when saving the model.
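A minimal self-contained version of the repro, for reference (the model, optimizer, `device_map` contents, and save path are illustrative placeholders, not taken from my actual script):

```python
import torch
from accelerate import Accelerator, dispatch_model

# Toy model/optimizer as stand-ins for the real training objects.
model = torch.nn.Linear(8, 8)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

accelerator = Accelerator()
model, optimizer = accelerator.prepare(model, optimizer)

# Illustrative device_map: relocate the whole module after prepare().
device_map = {"": "cuda:1"}
model = dispatch_model(model, device_map=device_map)

# With the DeepSpeed plugin enabled, this call fails, since the engine
# still references the pre-dispatch parameter objects.
accelerator.save_state("checkpoint/")
```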

@tjruwase (Contributor)
@ssklzx, thanks for creating this PR. However, I think you misunderstood my response in #6620 (comment).

What I meant is that we need to debug further to understand why some parameters are missing from `self.param_names`. Are you able to provide a full repro?

@ssklzx (Author) commented Oct 16, 2024

> @ssklzx, thanks for creating this PR. However, I think you misunderstood my response in #6620 (comment).
>
> What I meant is that we need to debug further to understand why some parameters are missing from `self.param_names`. Are you able to provide a full repro?

Because after `self.param_names` is initialized, I change the placement of the parameters, for example moving them from `cuda:0` to `cuda:1`, so the moved parameters can no longer be found in `self.param_names`.

For example:

```python
model, optimizer, data = accelerator.prepare(model, optimizer, data)  # initializes self.param_names
model = accelerate.dispatch_model(model, device_map=device_map)       # changes parameter placement
accelerator.save_state(save_path)                                     # raises the error
```
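A sketch of why the lookup breaks, assuming `self.param_names` is a dict keyed by parameter object identity (a simplification of DeepSpeed's actual bookkeeping):

```python
import torch

model = torch.nn.Linear(4, 4)

# DeepSpeed-style bookkeeping: map each parameter object to its name.
param_names = {param: name for name, param in model.named_parameters()}

# Relocating a parameter produces a new tensor object; here a clone()
# stands in for a cross-device move such as cuda:0 -> cuda:1.
model.weight = torch.nn.Parameter(model.weight.data.clone())

for name, param in model.named_parameters():
    if param not in param_names:
        print(f"{name} is no longer in param_names")  # prints for 'weight'
```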

@tjruwase (Contributor)

> Because after `self.param_names` is initialized, I change the placement of the parameters, for example moving them from `cuda:0` to `cuda:1`, so the moved parameters can no longer be found in `self.param_names`.

@ssklzx, thanks for the clarification. I think the correct solution here is for accelerate and DeepSpeed to coordinate to ensure that DeepSpeed is aware of new parameter locations, including updating `self.param_names`.
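As a rough sketch of what that coordination could look like (`refresh_param_names` is a hypothetical helper, not an existing DeepSpeed or accelerate API):

```python
def refresh_param_names(engine, model):
    # Hypothetical helper: rebuild the engine's parameter-to-name map
    # after parameters have been moved or replaced (e.g. by dispatch_model).
    engine.param_names = {param: name for name, param in model.named_parameters()}

# Usage sketch:
#   model = dispatch_model(model, device_map=device_map)
#   refresh_param_names(engine, model)
```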
