[MPS] Extend `torch.mm`/`torch.bmm` to integral types #145809

malfet · 2025-01-28T01:24:03Z

Stack from ghstack (oldest at bottom):

-> [MPS] Extend torch.mm/torch.bmm to integral types #145809

By using naive_mm kernel, but make sure that accumulation is done over int32 for smaller int types (and float for half and bfloat) as well as adding navie_bmm that follows the same pattern.
Remove stale restriction on torch.dot (which works fine on MacOS-14/15)
This also enables integer op flavors for:

addmv
einsum
inner
linalg.multi_dot
matmul
mv
tensordot

[ghstack-poisoned]

pytorch-bot · 2025-01-28T01:24:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/145809

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 49 Pending

As of commit b15f4af with merge base 6b41f31 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

By using naivemm kernel ghstack-source-id: a993090 Pull Request resolved: #145809

[ghstack-poisoned]

By using naivemm kernel ghstack-source-id: a899b05 Pull Request resolved: #145809

[ghstack-poisoned]

malfet · 2025-01-30T14:24:38Z

Debug script

import torch
import os

os.environ["MTL_CAPTURE_ENABLED"]="1"
torch.manual_seed(42)
x = torch.testing.make_tensor([10,], dtype=torch.int16, device="mps")
y = torch.testing.make_tensor([5, 10, 5], dtype=torch.int16, device="mps")
# x = torch.ones((5, 1, 10,), dtype=torch.int16, device="mps")
#x = torch.randint(-1, 3, (10,), dtype=torch.int16, device="mps")
#y = torch.arange(250, dtype=torch.int16, device="mps").reshape(5, 10, 5)
#with torch.device("mps"):
#    x=torch.arange(15, dtype=torch.int16).reshape(3, 5, 1)
#    y=torch.arange(18, dtype=torch.int16).reshape(3, 1, 6)
#    print(x.stride(), y.stride())

#with torch.mps.profiler.metal_capture("bmm"):
#    z = torch.matmul(x, y)

print(x)
print(y)
z = torch.matmul(x, y)
print(x.stride(), y.stride())
print(z)
print(torch.matmul(x.cpu(), y.cpu()))

[ghstack-poisoned]

By using naivemm kernel ghstack-source-id: d298124 Pull Request resolved: #145809

[ghstack-poisoned]

By using naivemm kernel ghstack-source-id: bb04d86 Pull Request resolved: #145809

malfet · 2025-01-30T19:33:40Z

@pytorchbot merge -f "Lint + MPS are green"

pytorchmergebot · 2025-01-30T19:35:10Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Update

305a3d7

[ghstack-poisoned]

malfet requested a review from kulinseth as a code owner January 28, 2025 01:24

malfet mentioned this pull request Jan 28, 2025

[MPS] Add op_math_t #145808

Closed

pytorch-bot bot added ciflow/mps Run MPS tests (subset of trunk) release notes: mps Release notes category labels Jan 28, 2025

malfet requested review from Skylion007 and dcci January 28, 2025 01:27

malfet added the topic: improvements topic category label Jan 28, 2025

Update

ac04666

[ghstack-poisoned]

malfet added a commit that referenced this pull request Jan 28, 2025

[MPS] Extend torch.mm to integral types

1aed168

By using naivemm kernel ghstack-source-id: a993090 Pull Request resolved: #145809

dcci approved these changes Jan 28, 2025

View reviewed changes

Update

43e5393

[ghstack-poisoned]

Update

9739184

[ghstack-poisoned]

malfet added a commit that referenced this pull request Jan 29, 2025

[MPS] Extend torch.mm to integral types

d78b1eb

By using naivemm kernel ghstack-source-id: a899b05 Pull Request resolved: #145809

Update

2d614fe

[ghstack-poisoned]

Update

93136cf

[ghstack-poisoned]

malfet marked this pull request as draft January 30, 2025 14:24

Update

53e5dd8

[ghstack-poisoned]

malfet marked this pull request as ready for review January 30, 2025 16:09

Update

71bc3b2

[ghstack-poisoned]

malfet changed the title ~~[MPS] Extend torch.mm to integral types~~ [MPS] Extend torch.mm/torch.bmm to integral types Jan 30, 2025

Update

94bf096

[ghstack-poisoned]

malfet added a commit that referenced this pull request Jan 30, 2025

[MPS] Extend torch.mm to integral types

3c24e9b

By using naivemm kernel ghstack-source-id: d298124 Pull Request resolved: #145809

Update

b15f4af

[ghstack-poisoned]

malfet added a commit that referenced this pull request Jan 30, 2025

[MPS] Extend torch.mm to integral types

605bb3d

By using naivemm kernel ghstack-source-id: bb04d86 Pull Request resolved: #145809

pytorchmergebot added the merging label Jan 30, 2025

pytorchmergebot closed this in 1fdb4d6 Jan 30, 2025

pytorchmergebot added Merged and removed merging labels Jan 30, 2025

github-actions bot deleted the gh/malfet/152/head branch March 2, 2025 02:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MPS] Extend `torch.mm`/`torch.bmm` to integral types #145809

[MPS] Extend `torch.mm`/`torch.bmm` to integral types #145809

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[MPS] Extend torch.mm/torch.bmm to integral types #145809

[MPS] Extend torch.mm/torch.bmm to integral types #145809

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/145809

⏳ No Failures, 49 Pending

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Merge started

Uh oh!

Uh oh!

[MPS] Extend `torch.mm`/`torch.bmm` to integral types #145809

[MPS] Extend `torch.mm`/`torch.bmm` to integral types #145809