[XPU] Linux CI/CD has been broken by the intel-deep-learning-essentials-2025.0 online installation · Issue #149995 · pytorch/pytorch · GitHub
Closed
chuanqi129 opened this issue Mar 26, 2025 · 3 comments
Labels: module: ci (Related to continuous integration); module: regression (It used to work, and now it doesn't); module: xpu (Intel XPU related issues); triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

chuanqi129 (Collaborator) commented Mar 26, 2025

Recently, intel-deep-learning-essentials released new package versions that broke the 2025.0 clean installation. As a result, the XPU build and test jobs have been building CPU-only PyTorch for the past two days; see https://github.com/pytorch/pytorch/actions/runs/14053994965/job/39364027978#step:14:1789. This blocks testing and landing of all XPU-related PRs, for example PR #149696.

As a solution, we decided to use offline installation mode to get a more stable CI/CD build environment for XPU.

cc @seemethere @malfet @pytorch/pytorch-dev-infra @gujinghui @EikanWang @fengyuan14 @guangyey

chuanqi129 added the module: xpu and triaged labels Mar 26, 2025
chuanqi129 self-assigned this Mar 26, 2025
malfet (Contributor) commented Mar 26, 2025

Shouldn't Docker configs always be pinned to a particular version?

malfet added the module: ci and module: regression labels Mar 26, 2025
chuanqi129 (Collaborator, Author) commented

> Shouldn't Docker configs always be pinned to a particular version?

Yes, we pinned it. But the online installation has some bugs that led to the wrong versions of dependency packages being installed. To fix this issue and avoid similar ones in the future, PR #149843 switches to offline installation mode.
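
For illustration only, here is a minimal sketch of the kind of post-install check that could catch this failure mode, where the top-level package is pinned but its dependencies resolve to other versions. It is not what PR #149843 does. The function name `find_version_mismatches`, the package names, and the version strings are all hypothetical; how the installed versions are collected from the package manager is left out.

```python
from typing import Dict, List


def find_version_mismatches(pinned: Dict[str, str],
                            installed: Dict[str, str]) -> List[str]:
    """Compare installed package versions against pinned version prefixes.

    A package passes if its installed version equals the pin or extends it
    with a '.' or '-' separator (e.g. pin '2025.0' accepts '2025.0.1-45').
    Returns one human-readable message per missing or mismatched package.
    """
    problems = []
    for name, want in sorted(pinned.items()):
        got = installed.get(name)
        if got is None:
            problems.append(f"{name}: not installed (expected {want})")
        elif not (got == want
                  or got.startswith(want + ".")
                  or got.startswith(want + "-")):
            problems.append(f"{name}: installed {got}, expected {want}")
    return problems


if __name__ == "__main__":
    # Hypothetical data: the dependency drifted to a newer minor release
    # even though the top-level package was pinned.
    pinned = {"example-essentials": "2025.0", "example-dep": "2025.0"}
    installed = {"example-essentials": "2025.0.1-45", "example-dep": "2025.1.0"}
    for line in find_version_mismatches(pinned, installed):
        print(line)
```

A CI step could fail the image build when the returned list is non-empty, instead of silently falling back to a CPU-only build.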

chuanqi129 (Collaborator, Author) commented

The issue has been fixed by the latest patch release of oneAPI Deep Learning Essentials 2025.0.

github-project-automation bot moved this from Cold Storage to Done in PyTorch OSS Dev Infra Apr 5, 2025