Summary:
Pull Request resolved: #71624
Now that `gather` is available in the NCCL process group, `sharded_tensor.gather` can use `gather_object` instead of `all_gather_object`. This reduces communication overhead because only the destination rank receives the gathered objects, rather than every rank.
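As a rough illustration of the saving, the sketch below counts how many object payloads are received across all ranks for one call of each collective. It is a simplified counting model, not a measurement: the helper `payloads_received` is hypothetical, and it ignores payload size, pickling cost, and the transport algorithm the backend actually uses.

```python
def payloads_received(world_size: int, collective: str) -> int:
    """Count object payloads received across all ranks for one call.

    Hypothetical helper for illustration only; it is not part of
    torch.distributed.
    """
    if world_size < 1:
        raise ValueError("world_size must be at least 1")
    if collective == "all_gather_object":
        # Every rank ends up holding the objects of all other ranks.
        return world_size * (world_size - 1)
    if collective == "gather_object":
        # Only the destination rank receives the other ranks' objects.
        return world_size - 1
    raise ValueError(f"unknown collective: {collective}")


if __name__ == "__main__":
    for n in (2, 8, 64):
        print(
            n,
            payloads_received(n, "all_gather_object"),
            payloads_received(n, "gather_object"),
        )
```

Under this model the received payload count grows quadratically with the number of ranks for `all_gather_object` and linearly for `gather_object`.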
TODO: To further reduce communication overhead, we need to find a way to avoid `gather_object` altogether, since both `gather_object` and `all_gather_object` pickle their inputs, which incurs an extra copy between devices.
ghstack-source-id: 151007578
Test Plan: Wait for CI.
Reviewed By: pritamdamania87
Differential Revision: D33688907
fbshipit-source-id: 2073c5a46c33a7a2640a9e3599dc795d9e4c0a1e
(cherry picked from commit dbc983a)