Bf16 fused adam(W) #147653

janeyx99 · 2025-02-22T00:10:24Z

Many things do work!

Some things do not:
amsgrad does not work
some checks are removed (not critical imo, but def less safe) to let multidevice work
adam + adamw should work
tensor lr would work if it's on cpu
the float, float, bf16, bf16 pattern is specialized, other mixed precision will not work

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2025-02-22T00:10:28Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/147653

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 12 New Failures, 1 Cancelled Job

As of commit 6a74269 with merge base 83bb921 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner-clang / linux-job (gh)
>>> Lint for aten/src/ATen/native/ForeachUtils.h:
pull / linux-focal-cuda12.4-py3.10-gcc9 / test (default, 2, 5, lf.linux.4xlarge.nvidia.gpu) (gh)
test_optim.py::TestOptimRenewedCUDA::test_complex_2d_AdamW_cuda_complex64
pull / linux-focal-cuda12.4-py3.10-gcc9-sm89 / test (default, 2, 5, lf.linux.g6.4xlarge.experimental.nvidia.gpu) (gh)
test_optim.py::TestOptimRenewedCUDA::test_complex_2d_AdamW_cuda_complex64
pull / linux-focal-py3.13-clang10 / test (crossref, 2, 2, lf.linux.2xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64
pull / linux-focal-py3.13-clang10 / test (default, 2, 5, lf.linux.4xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64
pull / linux-focal-py3.13-clang10 / test (dynamo_wrapped, 3, 3, lf.linux.2xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_AdamW_cpu_complex64
pull / linux-focal-py3.9-clang10 / test (crossref, 2, 2, lf.linux.2xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64
pull / linux-focal-py3.9-clang10 / test (default, 2, 5, lf.linux.4xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64
pull / linux-focal-py3.9-clang10 / test (dynamo_wrapped, 3, 3, lf.linux.2xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_AdamW_cpu_complex64
pull / linux-focal-rocm6.3-py3.10 / build (gh)
ninja: build stopped: subcommand failed
pull / linux-jammy-py3.10-clang15-asan / test (default, 5, 6, lf.linux.4xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64
pull / linux-jammy-py3.9-gcc11 / test (default, 2, 5, lf.linux.2xlarge) (gh)
test_optim.py::TestOptimRenewedCPU::test_complex_2d_AdamW_cpu_complex64

CANCELLED JOB - The following job was cancelled. Please retry:

Check Labels / Check labels (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: eab9384 Pull Request resolved: #147653

aten/src/ATen/native/cuda/fused_adam_impl.cu

[ghstack-poisoned]

ghstack-source-id: 9adfe54 Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 18ed96b Pull Request resolved: #147653

aten/src/ATen/native/cuda/MultiTensorApply.cuh

[ghstack-poisoned]

ghstack-source-id: 5b6560e Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 5e557ae Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 63ec4a7 Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 71c4ddb Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 7a14380 Pull Request resolved: #147653

[ghstack-poisoned]

ghstack-source-id: 94c6a34 Pull Request resolved: #147653

github-actions · 2025-05-11T20:35:12Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

[skip ci] Attempt a mixed precision fused adam

8e7e1b2

[ghstack-poisoned]

janeyx99 mentioned this pull request Feb 22, 2025

POC for mixed prec optim frontend #146640

Draft

pytorch-bot bot added the release notes: foreach_frontend release notes category label Feb 22, 2025

janeyx99 added a commit that referenced this pull request Feb 22, 2025

[skip ci] Attempt a mixed precision fused adam

fcf21a8

ghstack-source-id: eab9384 Pull Request resolved: #147653

janeyx99 commented Feb 22, 2025

View reviewed changes

aten/src/ATen/native/cuda/fused_adam_impl.cu Outdated Show resolved Hide resolved

Update on "[skip ci] Attempt a mixed precision fused adam"

a9904cd

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 22, 2025

[skip ci] Attempt a mixed precision fused adam

0d094cb

ghstack-source-id: 9adfe54 Pull Request resolved: #147653

Update on "[skip ci] Attempt a mixed precision fused adam"

ff5235e

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 22, 2025

[skip ci] Attempt a mixed precision fused adam

0301945

ghstack-source-id: 18ed96b Pull Request resolved: #147653

janeyx99 commented Feb 22, 2025

View reviewed changes

aten/src/ATen/native/cuda/MultiTensorApply.cuh Outdated Show resolved Hide resolved

Update on "[skip ci] Attempt a mixed precision fused adam"

94a1c6c

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 24, 2025

[skip ci] Attempt a mixed precision fused adam

bce08b7

ghstack-source-id: 5b6560e Pull Request resolved: #147653

Update on "[skip ci] Attempt a mixed precision fused adam"

76aec8f

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 25, 2025

[skip ci] Attempt a mixed precision fused adam

2899fe1

ghstack-source-id: 5e557ae Pull Request resolved: #147653

janeyx99 changed the title ~~[skip ci] Attempt a mixed precision fused adam~~ Attempt a mixed precision fused adam Feb 25, 2025

Update on "Attempt a mixed precision fused adam"

851b216

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 25, 2025

Attempt a mixed precision fused adam

da5b584

ghstack-source-id: 63ec4a7 Pull Request resolved: #147653

Update on "Attempt a mixed precision fused adam"

59068ff

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 27, 2025

Attempt a mixed precision fused adam

45daa94

ghstack-source-id: 71c4ddb Pull Request resolved: #147653

Update on "Attempt a mixed precision fused adam"

6815585

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 27, 2025

Attempt a mixed precision fused adam

fa90587

ghstack-source-id: 7a14380 Pull Request resolved: #147653

Update on "Attempt a mixed precision fused adam"

6a74269

[ghstack-poisoned]

janeyx99 added a commit that referenced this pull request Feb 27, 2025

Attempt a mixed precision fused adam 8000

1e467d1

ghstack-source-id: 94c6a34 Pull Request resolved: #147653

janeyx99 changed the title ~~Attempt a mixed precision fused adam~~ Bf16 fused adam(W) Mar 12, 2025

This was referenced Apr 28, 2025

DeepSeek: mixed precision optimizers (BF16AdamW) #146542

Open

[DO NOT REVIEW] Attempt a mixed precision fused adam #152477

Draft

github-actions bot added the Stale label May 11, 2025

janeyx99 removed the Stale label May 12, 2025

janeyx99 added the no-stale label May 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bf16 fused adam(W) #147653

Bf16 fused adam(W) #147653

Bf16 fused adam(W) #147653

Are you sure you want to change the base?

Bf16 fused adam(W) #147653

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/147653

❌ 12 New Failures, 1 Cancelled Job