8000 Bf16 fused adam(W) by janeyx99 · Pull Request #147653 · pytorch/pytorch · GitHub
[go: up one dir, main page]

Skip to content

Bf16 fused adam(W) #147653

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 9 commits into
base: gh/janeyx99/225/base
Choose a base branch
from
Draft

Bf16 fused adam(W) #147653

wants to merge 9 commits into from

Conversation

janeyx99
Copy link
Contributor
@janeyx99 janeyx99 commented Feb 22, 2025

Many things do work!

Some things do not:
amsgrad does not work
some checks are removed (not critical imo, but def less safe) to let multidevice work
adam + adamw should work
tensor lr would work if it's on cpu
the float, float, bf16, bf16 pattern is specialized, other mixed precision will not work

Stack from ghstack (oldest at bottom):

Copy link
pytorch-bot bot commented Feb 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/147653

Note: Links to docs will display an error until the docs builds have been completed.

❌ 12 New Failures, 1 Cancelled Job

As of commit 6a74269 with merge base 83bb921 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: foreach_frontend release notes category label Feb 22, 2025
janeyx99 added a commit that referenced this pull request Feb 22, 2025
ghstack-source-id: eab9384
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 22, 2025
ghstack-source-id: 9adfe54
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 22, 2025
ghstack-source-id: 18ed96b
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 24, 2025
ghstack-source-id: 5b6560e
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 25, 2025
ghstack-source-id: 5e557ae
Pull Request resolved: #147653
@janeyx99 janeyx99 changed the title [skip ci] Attempt a mixed precision fused adam Attempt a mixed precision fused adam Feb 25, 2025
janeyx99 added a commit that referenced this pull request Feb 25, 2025
ghstack-source-id: 63ec4a7
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 27, 2025
ghstack-source-id: 71c4ddb
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 27, 2025
ghstack-source-id: 7a14380
Pull Request resolved: #147653
janeyx99 added a commit that referenced this pull request Feb 27, 2025
ghstack-source-id: 94c6a34
Pull Request resolved: #147653
@janeyx99 janeyx99 changed the title Attempt a mixed precision fused adam Bf16 fused adam(W) Mar 12, 2025
Copy link
Contributor

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label May 11, 2025
@janeyx99 janeyx99 removed the Stale label May 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant
0