-
Notifications
You must be signed in to change notification settings - Fork 24.2k
UNSTABLE pull / cuda12.4-py3.10-gcc9-sm75 / test (pr_time_benchmarks) #149370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hello there! From the UNSTABLE prefix in this issue title, it looks like you are attempting to unstable a job in PyTorch CI. The information I have parsed is below:
Within ~15 minutes, |
this should fix it #149347 |
The above fix did work, but the issue wasn't closed, so there have been a couple more times where it got broken, then got fixed, but stayed in unstable because the issue was never closed. The most recent revert c93e34d is green, so I am closing this now |
How can I address the failures during migration? e.g. #151594 (comment) while moving from cuda12.4 to cuda12.6 |
there has been so much changes this week putting this PR to test and udpdate the results if needed can i get a stamp just in case i need to update results? ![]() |
Hi @laithsakka Do you know how to resolve the following errors while trying to reproduce the instruction count numbers?
Does this require the installation of any perf tools? Thanks! |
are you running on a machine that have access to hardware counter? @nWEIdia |
I believe I have root access to the hardware, but not sure how to enable it [the hardware counters]. Does this have any requirement for the OS? I have provisioned ubuntu 20.04. |
See https://hud.pytorch.org/hud/pytorch/pytorch/main/1?per_page=50&name_filter=pr_time&mergeLF=true <- job passes and fails intermittently with no apparent commit that could have started it
cc @chauhang @penguinwu @seemethere @pytorch/pytorch-dev-infra
The text was updated successfully, but these errors were encountered: