You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I don't quite understand line 524, kai stores the values in tensor corresponding to indices l; after an all_gather, agg_tensor has world_size such kais, but from what I see, line 524 is just put the values in kai from different processes continuously in output tensor, disregarding their real position. What is wrong about my understanding?
Thank you in advance.
The text was updated successfully, but these errors were encountered:
Hi, I have two questions regarding the MSTOP-k implementation in ddp_final:
all_gather
twice.kai
stores the values in tensor corresponding to indicesl
; after anall_gather
, agg_tensor hasworld_size
suchkai
s, but from what I see, line 524 is just put the values inkai
from different processes continuously in output tensor, disregarding their real position. What is wrong about my understanding?Thank you in advance.
The text was updated successfully, but these errors were encountered: