Refactor CUDAAllocatorConfig to reuse AcceleratorAllocatorConfig #150312

guangyey · 2025-03-31T16:12:35Z

Stack from ghstack (oldest at bottom):

Motivation

Refactor CUDAAllocatorConfig to reuse AcceleratorAllocatorConfig and ConfigTokenizer. We will deprecate the options that overlap with AcceleratorAllocatorConfig in a follow-up PR and keep them only for backward compatibility (BC).

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k

pytorch-bot · 2025-03-31T16:12:39Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150312

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

ghstack-mergeability-check and Check labels failing with 'Resource not accessible by integration'

✅ You can merge normally! (3 Unrelated Failures)

As of commit df9befb with merge base bb67660:

UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:

pull / linux-jammy-py3_9-clang9-xla / test (xla, 1, 1, lf.linux.12xlarge, unstable) (gh) (#158876)
sccache: error: couldn't connect to server
rocm / linux-jammy-rocm-py3.10 / test (default, 1, 6, linux.rocm.gpu.2, unstable) (gh)
inductor/test_max_autotune.py::TestMaxAutotune::test_triton_template_generated_code_caching_mm_plus_mm
rocm / linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.2, unstable) (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

…fig (#150312)" This reverts commit dfacf11. Reverted #150312 on behalf of https://github.com/guangyey due to Static initialization order issue impact the downstream repo ([comment](#150312 (comment)))

[ghstack-poisoned]

pytorchmergebot · 2025-08-05T02:25:57Z

Starting merge as part of PR stack under #156175

pytorchmergebot · 2025-08-05T02:44:43Z

Starting merge as part of PR stack under #156175

pytorchmergebot · 2025-08-05T03:53:52Z

Starting merge as part of PR stack under #156175

pytorchmergebot · 2025-08-05T04:01:37Z

Starting merge as part of PR stack under #156175

@ScottTodd

# Motivation

As @ScottTodd identified in this [comment](#150312 (comment)), using STL containers like `std::string` and `std::unordered_set` at static init time can cause static initialization order issues. This PR is based on and modified from his original PR: #159607. I’m stacking this PR here to help facilitate the landing and validation process.

Co-authored-by: @ScottTodd
Pull Request resolved: #159629
Approved by: https://github.com/ScottTodd, https://github.com/albanD
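
To illustrate the static initialization order concern raised in the commit message above, here is a minimal C++17 sketch of two common ways to avoid a namespace-scope STL container with a dynamic initializer. It is not the code from #159629 or #159607. The names `kKnownKeys`, `isKnownKey`, and `knownKeySet` are hypothetical, and the key strings are only illustrative.

```cpp
#include <array>
#include <string>
#include <string_view>
#include <unordered_set>

// Hypothetical key table, for illustration only.
// A constexpr array of string_view is constant-initialized, so it has no
// dynamic initializer that could run before or after another translation
// unit's globals in an unspecified order.
inline constexpr std::array<std::string_view, 3> kKnownKeys = {
    "max_split_size_mb",
    "garbage_collection_threshold",
    "expandable_segments",
};

// A linear scan is enough for a handful of keys and needs no heap allocation.
constexpr bool isKnownKey(std::string_view key) {
  for (std::string_view known : kKnownKeys) {
    if (known == key) {
      return true;
    }
  }
  return false;
}

// The lookup is usable at compile time, which shows that no runtime
// initialization is involved.
static_assert(isKnownKey("expandable_segments"));

// Alternative: keep the STL container, but construct it on first use.
// A function-local static is initialized the first time control reaches it
// (thread-safe since C++11), not during static initialization.
const std::unordered_set<std::string>& knownKeySet() {
  static const std::unordered_set<std::string> keys = [] {
    std::unordered_set<std::string> s;
    for (std::string_view k : kKnownKeys) {
      s.emplace(k);  // builds a std::string from each string_view
    }
    return s;
  }();
  return keys;
}
```

The first form suits small, fixed tables. The second keeps hashed lookup for larger sets and defers the allocation until the first call.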

…llocatorConfig instead (#156165) Pull Request resolved: #156165 Approved by: https://github.com/albanD ghstack dependencies: #159629, #150312

# Motivation

This PR moves the implementation of `torch.cuda.memory._set_allocator_settings` to `torch._C._accelerator_setAllocatorSettings`. Since the original API was intended as a temporary/internal utility, I am not exposing the new function as a public API.

Pull Request resolved: #156175
Approved by: https://github.com/albanD
ghstack dependencies: #159629, #150312, #156165

…onfig (#150312)" (#161002) Summary: Reverting this diff since it caused S551328. Please see D80217492 for details. Test Plan: NA Rollback Plan: Differential Revision: D80553588 Pull Request resolved: #161002 Approved by: https://github.com/jingsh, https://github.com/izaitsevfb

…locatorConfig (#150312)" (#161002)" This reverts commit a03cc53. Reverted #161002 on behalf of https://github.com/guangyey due to This PR breaks CI TestCudaMallocAsync::test_allocator_settings ([comment](#161002 (comment)))

…fig (#150312)" This reverts commit ae1a706. ghstack-source-id: 641c5bb Pull Request resolved: #161628

…fig (#150312)" This reverts commit ae1a706. ghstack-source-id: 0a5aacc Pull Request resolved: #161628

…fig (#150312)" (#161628) This reverts commit ae1a706. Pull Request resolved: #161628 Approved by: https://github.com/atalman ghstack dependencies: #161625, #161626, #161627

guangyey requested review from eqy and syed-ahmed as code owners March 31, 2025 16:12

This was referenced Mar 31, 2025

Introduce AcceleratorAllocatorConfig as the common class #149601

Closed

Add DeviceAllocator as the base device allocator #138222

Closed

pytorchbot added the open source label Mar 31, 2025

guangyey changed the title ~~Refactor CUDAAllocatorConfig to reuse AllocatorConfig~~ [WIP] Refactor CUDAAllocatorConfig to reuse AllocatorConfig Mar 31, 2025

guangyey added 21 commits March 31, 2025 23:46

Update · 663cdff · [ghstack-poisoned]
Update · f022ed7 · [ghstack-poisoned]
Update · 52ae02b · [ghstack-poisoned]
Update · c951fd9 · [ghstack-poisoned]
Update · 0ea28f9 · [ghstack-poisoned]
Update · 080a3ff · [ghstack-poisoned]
Update · 673db93 · [ghstack-poisoned]
Update · 47d8667 · [ghstack-poisoned]
Update · 1ac29cf · [ghstack-poisoned]
Update · 218b097 · [ghstack-poisoned]
Update · 0c1d6cb · [ghstack-poisoned]
Update · cd9704a · [ghstack-poisoned]
Update · 41a0879 · [ghstack-poisoned]
Update · 6ea72f0 · [ghstack-poisoned]
Update · 0e4b237 · [ghstack-poisoned]
Update · 15c2aae · [ghstack-poisoned]
Update · cc158dd · [ghstack-poisoned]
Update · 931f30f · [ghstack-poisoned]
Update · bd2d75c · [ghstack-poisoned]
Update · 9aecb48 · [ghstack-poisoned]
Update · 5766561 · [ghstack-poisoned]

guangyey added the release notes: cpp and topic: not user facing labels Apr 15, 2025

Update · df9befb · [ghstack-poisoned]

pytorchmergebot closed this in ae1a706 Aug 5, 2025

joshuuuasu mentioned this pull request Aug 14, 2025

[PyTorch][CachingAllocatorConfig] back out D79620246 and D79620264 #160666

Closed

atalman mentioned this pull request Aug 27, 2025

Back out Generalize torch._C._set_allocator_settings to be generic #161620

Closed

guangyey added a commit that referenced this pull request Aug 27, 2025

Revert "Refactor CUDAAllocatorConfig to reuse AcceleratorAllocatorCon…

7606ed3

…fig (#150312)" This reverts commit ae1a706. ghstack-source-id: 641c5bb Pull Request resolved: #161628

guangyey added a commit that referenced this pull request Aug 27, 2025

Revert "Refactor CUDAAllocatorConfig to reuse AcceleratorAllocatorCon…

b3646e5

…fig (#150312)" This reverts commit ae1a706. ghstack-source-id: 0a5aacc Pull Request resolved: #161628

guangyey mentioned this pull request Aug 29, 2025

[Reland] Refactor CUDAAllocatorConfig to reuse AcceleratorAllocatorConfig #161786

Closed

github-actions bot deleted the gh/guangyey/133/head branch September 5, 2025 02:09

9 participants