CMake: update FindCUDAToolkit.cmake, use torch::nvtx3 if present, mod… by whitesscott · Pull Request #153339 · pytorch/pytorch

Open: wants to merge 6 commits into base: main

whitesscott

CMake: update FindCUDAToolkit.cmake, use torch::nvtx3 if present, modify torch/__init__.py to include libnvToolsExt.so if CUDA < 12.9

Fixes #152756

CUDA 12.9 removed libnvToolsExt.so.*; NVTX now ships only as the header-only nvtx3.

Modify torch/__init__.py to include libnvToolsExt.so only if the CUDA version is below 12.9.
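As a rough illustration of the version gate described above, here is a minimal sketch. It is not the code from this PR: the helper names `parse_cuda_version` and `libs_to_preload`, and the entries other than the NVTX one, are hypothetical and chosen only for the example.

```python
from __future__ import annotations


def parse_cuda_version(version: str) -> tuple[int, int]:
    """Turn a string such as "12.8" or "12.8.1" into a (major, minor) tuple."""
    # Pad with "0" so a bare major version such as "12" still unpacks.
    major, minor, *_ = version.split(".") + ["0"]
    return int(major), int(minor)


def libs_to_preload(cuda_version: str | None) -> dict[str, str]:
    """Return glob patterns for shared libraries to look for (hypothetical helper)."""
    # Illustrative entries only; the real list in torch/__init__.py may differ.
    libs = {
        "cublas": "libcublas.so.*",
        "cudart": "libcudart.so.*",
    }
    # libnvToolsExt.so.* no longer exists from CUDA 12.9 on (NVTX is header-only),
    # so only ask for it on older toolkits.
    if cuda_version is not None and parse_cuda_version(cuda_version) < (12, 9):
        libs["nvtx"] = "libnvToolsExt.so.*"
    return libs


if __name__ == "__main__":
    for version in ("11.8", "12.8", "12.9", "12.10"):
        print(version, sorted(libs_to_preload(version)))
```

The sketch compares integer tuples rather than raw strings, since a plain string comparison would rank "12.10" below "12.9".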

The copy of FindCUDAToolkit.cmake in pytorch/cmake/Modules is old enough that it has no reference to CUDA::nvtx3, so I updated it to the current version of FindCUDAToolkit.cmake.

…ify torch/__init__.py to include libnvToolsExt.so if cuda < 12.9
pytorch-bot bot commented May 11, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153339

Note: Links to docs will display an error until the docs builds have been completed.

❌ 58 New Failures, 1 Unrelated Failure

As of commit 81860e8 with merge base e4f2282 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla bot commented May 11, 2025

CLA Signed

The committers listed above are authorized under a signed CLA.

@whitesscott
Author

@pytorchbot label "topic: not user facing"

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label May 11, 2025
@@ -1018,22 +1018,24 @@ endif()

 # ---[ nvtx
 if(USE_SYSTEM_NVTX)
-  find_path(nvtx3_dir NAMES nvtx3 PATHS ${CUDA_INCLUDE_DIRS})
+  find_path(nvtx3_dir NAMES nvtx3 PATHS /usr/local/cuda/include ${CUDA_INCLUDE_DIRS})
Collaborator


Why is this hard-coded? Is there not some CMake variable for this?

Author


That was a mistaken testing artifact. Thank you for finding it.

@colesbury colesbury requested a review from malfet May 13, 2025 00:56
@colesbury colesbury added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label May 13, 2025
@Skylion007 Skylion007 requested review from eqy and ngimel May 14, 2025 15:12
Labels: open source; topic: not user facing (topic category); triaged (This issue has been looked at a team member, and triaged and prioritized into an appropriate module)

Successfully merging this pull request may close these issues.

Cuda-12.9 removed libnvToolsExt.so.* and is now purely header nvtx3
4 participants