Enable CUDA 12.8.0, Disable CUDA 12.4 #145570
This will be amazing for testing the new Blackwell.
@tinglvv New cuDNN and TensorRT releases are out.
Can we get CUDA 12.8 support and 2.7 in the nightlies?
They are working on that:
@ofirkris It will be available in the coming days.
Breaking #145557 into two parts. Need to have manylinux-cuda12.8 in order to build magma. Issue: #145570 Pull Request resolved: #145567 Approved by: https://github.com/nWEIdia, https://github.com/atalman
#145570 Builds 12.8 libtorch docker/deprecate 12.1 meanwhile Pull Request resolved: #145789 Approved by: https://github.com/nWEIdia, https://github.com/atalman
#145570 Pull Request resolved: #145765 Approved by: https://github.com/malfet
#145570 Build failure for libtorch wheel `CUDAContext.cpp:(.text+0x157): additional relocation overflows omitted from the output /usr/bin/ld: failed to convert GOTPCREL relocation; relink with --no-relax collect2: error: ld returned 1 exit status` Unsure if this is related, fixing as a start Pull Request resolved: #146019 Approved by: https://github.com/eqy
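For context, this class of link failure typically appears when a very large statically linked binary exceeds the reach of 32-bit GOTPCREL relocations, and the linker's own error message points at disabling relaxation. A hedged sketch of that workaround (the flag comes straight from the error text and the GNU ld manual; whether it is the right fix for this particular build is an assumption, and shrinking the binary is the more durable option):

```shell
# Sketch: pass --no-relax through the compiler driver to GNU ld, as the
# error message suggests. This only disables the relaxation step that is
# failing; it does not address the underlying binary-size growth.
export LDFLAGS="${LDFLAGS} -Wl,--no-relax"
```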
#145570 Adding cuda 12.8.0 x86 builds first Pull Request resolved: #145792 Approved by: https://github.com/nWEIdia, https://github.com/malfet, https://github.com/atalman
Waiting for the Windows build as well :)
Nightly manywheel torch builds are available: https://download.pytorch.org/whl/nightly/cu128/torch
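To confirm that a locally installed nightly actually targets cu128, one can inspect the wheel's version tag. A minimal sketch (the version strings below are hypothetical example tags; in a real session the input would be `torch.__version__`):

```python
import re

def is_cu128_nightly(version: str) -> bool:
    """Return True if a torch version tag looks like a cu128 nightly,
    e.g. '2.7.0.dev20250202+cu128' (hypothetical example tag)."""
    return re.fullmatch(r"\d+\.\d+\.\d+\.dev\d{8}\+cu128", version) is not None

print(is_cu128_nightly("2.7.0.dev20250202+cu128"))  # True
print(is_cu128_nightly("2.6.0+cu124"))              # False
```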
#145570 removing `.ci/pytorch/windows/internal/cuda_install.bat` as it is a duplicate of `.github/scripts/windows/cuda_install.bat`. The latter one is the one in use - https://github.com/pytorch/pytorch/pull/146653/files#diff-613791f266f2f7b81148ca8f447b0cd6c6544f824f5f46a78a2794006c78957bR8 Pull Request resolved: #146653 Approved by: https://github.com/atalman
#145570 windows AMI is deployed to prod today, prepping the windows cuda 12.8 build Pull Request resolved: #147037 Approved by: https://github.com/atalman
Try removing sm50 and sm60 to shrink binary size, and resolve the ld --relink error "Architecture support for Maxwell, Pascal, and Volta is considered feature-complete and will be frozen in an upcoming release." from 12.8 release note. Also updating the runner for cuda 12.8 test to g4dn (T4, sm75) due to the drop in sm50/60 support. #145570 Pull Request resolved: #146265 Approved by: https://github.com/atalman
I really think we should try to keep the lower SM arches in the 12.8 build while they are supported. We just need to fix the …
When will the version for cu128 be released?
pytorch/pytorch#145570 deprecate cuda 12.4 nightly build cc @atalman
Please be patient and kind.
follow up for https://github.com/pytorch/pytorch/pull/146265/files, dropping sm_70 as well, since "Architecture support for Maxwell, Pascal, and Volta is considered feature-complete and will be frozen in an upcoming release." #145570 Pull Request resolved: #147607 Approved by: https://github.com/atalman
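The pruning described in these PRs amounts to filtering a `TORCH_CUDA_ARCH_LIST`-style string. The helper below is an illustrative sketch, not the build system's actual code; the arch set shown is an assumption for the example:

```python
def prune_arch_list(arch_list: str, dropped: set[str]) -> str:
    """Remove frozen architectures (e.g. sm_50/60/70, per the CUDA 12.8
    release note on Maxwell/Pascal/Volta) from a semicolon-separated
    TORCH_CUDA_ARCH_LIST string."""
    kept = [arch for arch in arch_list.split(";") if arch not in dropped]
    return ";".join(kept)

print(prune_arch_list("5.0;6.0;7.0;7.5;8.0;9.0", {"5.0", "6.0", "7.0"}))  # 7.5;8.0;9.0
```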
#145570 breaking #140793 into eager and inductor benchmarks to unblock Pull Request resolved: #148602 Approved by: https://github.com/atalman, https://github.com/malfet Co-authored-by: atalman <atalman@fb.com>
#145570 breaking #140793 into eager and inductor benchmarks to unblock Seems many inductor ymls were added after the initial change was prepared. Pull Request resolved: #148612 Approved by: https://github.com/nWEIdia, https://github.com/atalman Co-authored-by: atalman <atalman@fb.com>
#145570 removes cuda 12.4 nightly builds Pull Request resolved: #148625 Approved by: https://github.com/atalman
cuDNN 9.8.0 just dropped: https://pypi.org/project/nvidia-cudnn-cu12/ It might be good to include it in this release given all the Blackwell performance optimizations that were made to it, especially regarding SDPA, and given FlashAttention 4 is still a ways off.
#145570 redo #148625 Pull Request resolved: #148895 Approved by: https://github.com/atalman
🚀 The feature, motivation and pitch
CUDA 12.8.0 is out, adding to CI/CD.
Docker Images & Windows AMI Update
CD Update
CUDA 12.4 deprecation and CUDA 12.6 CI benchmarks
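With 12.4 being deprecated and 12.6 becoming the CI floor, a quick runtime sanity check is to compare the toolkit version a wheel was built against with the minimum this issue targets. A hedged sketch (in a real session the input would come from `torch.version.cuda`; the "12.6" floor here reflects this issue's CI plan):

```python
def meets_minimum_cuda(built_with: str, minimum: str = "12.6") -> bool:
    """Compare dotted CUDA toolkit versions numerically so that, e.g.,
    '12.8' >= '12.6' but '12.4' is rejected."""
    parse = lambda v: tuple(int(part) for part in v.split("."))
    return parse(built_with) >= parse(minimum)

print(meets_minimum_cuda("12.8"))  # True
print(meets_minimum_cuda("12.4"))  # False
```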
Alternatives
No response
Additional context
No response
cc @atalman @malfet @ptrblck @msaroufim @eqy @nWEIdia