10BC0 [Distributed][CI] Rework continuous TestCase by kwen2501 · Pull Request #153653 · pytorch/pytorch · GitHub
[go: up one dir, main page]

Skip to content
Closed
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Update
[ghstack-poisoned]
  • Loading branch information
kwen2501 committed May 24, 2025
commit 4b820a884e538cf14834cb0054d172a820d0b222
9 changes: 3 additions & 6 deletions torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1102,12 +1102,9 @@ bool ProcessGroupNCCL::useNonblocking() {
useNonblocking_ = nbEnv;
}
// 3rd priority: automatically use nonblocking if we are in eager init mode
// Note: this automatic selection is disabled in torch 2.7.1 to work around a
// hang in NCCL 2.26 in non-blocking mode. We can revisit if NCCL fixes the
// bug. See https://github.com/pytorch/pytorch/issues/153960
// else if (getBoundDeviceId()) {
// useNonblocking_ = true;
// }
else if (getBoundDeviceId()) {
useNonblocking_ = true;
}
// 4th priority: otherwise, nonblocking = false to preserve old behavior
else {
useNonblocking_ = false;
Expand Down
Loading
0