
[ONNX] Produce correct dtypes for bf16/f8 in IR TorchTensor #151259


Closed

Conversation

justinchuby
Collaborator

Split the changes from #151069 to address microsoft/onnxscript#2187, where the output np arrays do not have the expected ml_dtypes dtypes.

pytorch-bot bot commented Apr 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/151259

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3cf10f7 with merge base 46ce8f7:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: onnx label (torch.onnx related changes that should show up in the release notes) on Apr 14, 2025
@justinchuby justinchuby added the module: onnx (Related to torch.onnx) and topic: bug fixes (topic category) labels on Apr 14, 2025
@justinchuby justinchuby force-pushed the justinchu/fix-bfloat16-optimize branch from 52c6def to 3f0e0cb on April 14, 2025 19:50
@justinchuby justinchuby added the ciflow/trunk label (Trigger trunk jobs on your pull request) on Apr 15, 2025
justinchuby added a commit to microsoft/onnxscript that referenced this pull request Apr 15, 2025
Bring changes from pytorch/pytorch#151259 to correctly support bfloat16 and float8* types.
@@ -15,17 +16,17 @@ class TorchTensorTest(common_utils.TestCase):
     @common_utils.parametrize(
         "dtype, np_dtype",
         [
-            (torch.bfloat16, np.uint16),
Collaborator

I don't understand why this is wrong and still passed the test?

Collaborator

Is the test still legit?

Collaborator Author

The test is changed along with the implementation, so they are updated together to reflect the new behavior.
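
For context, a self-contained check of the new behavior might look like the sketch below (hypothetical code, not the actual TorchTensorTest; the updated + side of the hunk is not shown in this thread). It assumes torch, numpy, and the ml_dtypes package are available:

import ml_dtypes
import numpy as np
import torch

# Presumed new expectation: the numpy array produced for a torch.bfloat16
# tensor carries the ml_dtypes.bfloat16 dtype rather than plain uint16.
tensor = torch.tensor([1.0, 2.0], dtype=torch.bfloat16)
array = tensor.view(torch.uint16).numpy(force=True).view(ml_dtypes.bfloat16)
assert array.dtype == ml_dtypes.bfloat16
np.testing.assert_array_equal(array.astype(np.float32), [1.0, 2.0])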

@@ -116,15 +116,17 @@ def __init__(self, tensor: torch.Tensor, name: str | None = None):
     def numpy(self) -> npt.NDArray:
         self.raw: torch.Tensor
         if self.dtype == ir.DataType.BFLOAT16:
-            return self.raw.view(torch.uint16).numpy(force=True)
+            return (
+                self.raw.view(torch.uint16).numpy(force=True).view(self.dtype.numpy())
+            )
Collaborator

Can you explain this workaround? Looks like it's getting around the issue that numpy does not support bfloat16?

Collaborator Author

Sure! Since there is no bfloat16 in numpy, converting a tensor directly from torch would fail. Thus we view it as uint16 first in torch, get the numpy representation, and then re-view it with the ml_dtypes type we get from the ONNX IR dtype.numpy() call, producing the array ONNX IR expects (which has dtype ml_dtypes.bfloat16).
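
A minimal sketch of the round trip described above, assuming torch, numpy, and ml_dtypes are installed (illustrative, not the exact PR code):

import ml_dtypes
import numpy as np
import torch

t = torch.tensor([1.5, -2.25], dtype=torch.bfloat16)

# numpy has no native bfloat16, so t.numpy() would fail. Instead:
# 1. reinterpret the bf16 bits as uint16 inside torch (both are 16-bit),
# 2. cross into numpy with that bit pattern,
# 3. re-view the uint16 bits as ml_dtypes.bfloat16, the dtype ONNX IR expects.
arr = t.view(torch.uint16).numpy(force=True).view(ml_dtypes.bfloat16)

print(arr.dtype)  # bfloat16
print(arr)        # [1.5 -2.25]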

Collaborator

And uint16 is closer to bfloat16 in terms of the number of bits? Maybe worth mentioning somewhere.

Collaborator Author

Will add a comment. Yes, they are both 16-bit dtypes.
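
The same pattern presumably extends to the float8 formats named in the PR title, with uint8 as the 8-bit carrier type. A hedged sketch (assumes a torch build with float8_e4m3fn support and the ml_dtypes package; not the actual PR code):

import ml_dtypes
import torch

# Hypothetical float8 analogue: reinterpret the 8-bit float bits as uint8
# in torch, cross into numpy, then re-view as the matching ml_dtypes type.
t = torch.tensor([0.5, 1.0]).to(torch.float8_e4m3fn)
arr = t.view(torch.uint8).numpy(force=True).view(ml_dtypes.float8_e4m3fn)

print(arr.dtype)  # float8_e4m3fn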

@justinchuby
Collaborator Author

@pytorchbot merge

@justinchuby
Collaborator Author

Will add comments separately since this is ready to merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

timocafe pushed a commit to timocafe/pytorch that referenced this pull request Apr 16, 2025
…151259)

Split the changes from pytorch#151069 to address microsoft/onnxscript#2187, where the output np arrays do not have the correct ml_dtypes types as expected.
Pull Request resolved: pytorch#151259
Approved by: https://github.com/titaiwangms
amathewc pushed a commit to amathewc/pytorch that referenced this pull request Apr 17, 2025
…151259)

Split the changes from pytorch#151069 to address microsoft/onnxscript#2187, where the output np arrays do not have the correct ml_dtypes types as expected.
Pull Request resolved: pytorch#151259
Approved by: https://github.com/titaiwangms
Labels
ciflow/trunk (Trigger trunk jobs on your pull request), Merged, module: onnx (Related to torch.onnx), open source, release notes: onnx (torch.onnx related changes that should show up in the release notes), topic: bug fixes (topic category)
4 participants