
[inductor] torch.slice_scatter throws AssertionError when meeting internal float32 #147842

Open
shaoyuyoung opened this issue Feb 25, 2025 · 9 comments · May be fixed by #149814 or #151911
Labels
good first issue module: inductor oncall: pt2 triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@shaoyuyoung
Contributor
shaoyuyoung commented Feb 25, 2025

🐛 Describe the bug

Description: when an internal float32 tensor is involved (it's y in my case), eager passes the check and returns 0, while inductor throws an AssertionError.
Device: reproduces on both the Triton and CPP backends.

import torch
from torch._inductor import config

config.fallback_random = True
torch.set_grad_enabled(False)


class Model(torch.nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, x):
        y = torch.Tensor([0])  # y dtype: torch.float32
        x = torch.slice_scatter(y, x, 0)
        return x


model = Model()

x = torch.Tensor([0]).to(torch.int64)  # int64 src scattered into a float32 base

inputs = [x]


def run_test(model, inputs, backend):
    model.eval()
    torch.manual_seed(0)
    if backend != "eager":
        model = torch.compile(model, backend=backend)
    try:
        c_output = model(*inputs)
        print(c_output)
    except Exception as e:
        print(e)


run_test(model, inputs, 'eager')
run_test(model, inputs, 'inductor')

Error logs

tensor([0.])
LoweringException: AssertionError: 
  target: aten.slice_scatter.default
  args[0]: TensorBox(StorageBox(
    Pointwise(
      'cpu',
      torch.float32,
      def inner_fn(index):
          _ = index
          tmp0 = ops.constant(0.0, torch.float32)
          return tmp0
      ,
      ranges=[1],
      origin_node=full_default,
      origins=OrderedSet([full_default])
    )
  ))
  args[1]: TensorBox(StorageBox(
    InputBuffer(name='arg0_1', layout=FixedLayout('cpu', torch.int64, size=[1], stride=[1]))
  ))

Versions

nightly 20250225

cc @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @aakhundov

@Ajay-26
Ajay-26 commented Feb 28, 2025

Hi!
Correct me if I'm wrong, but shouldn't x have dtype torch.float32? It works for me in that case. The AssertionError looks like it is raised because the two tensors have different dtypes.

@shaoyuyoung
Contributor Author

Hi, @Ajay-26

Correct me if I'm wrong, but shouldn't x have dtype torch.float32?

You're right! However, the eager backend performs an implicit dtype conversion: if you print(c_output.dtype) you will find it is torch.float32, even though the original input dtype is torch.int64 (x = torch.Tensor([0]).to(torch.int64)).

Unfortunately, the inductor compiler cannot do this dtype conversion and throws an AssertionError, which violates the expectation for a DL compiler (it should simulate any behavior of eager). So I think this is a behavior inconsistency between PyTorch eager and inductor.

Feel free to continue the discussion if that is helpful to you. :)
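
For reference, a minimal standalone check of this implicit promotion under eager (using only the tensors from the repro above):

import torch

y = torch.Tensor([0])                  # float32 base
x = torch.Tensor([0]).to(torch.int64)  # int64 src
out = torch.slice_scatter(y, x, 0)
print(out, out.dtype)  # tensor([0.]) torch.float32 -- eager casts src to the base dtype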

@Numbers0689

Hi @shaoyuyoung, I'd like to work on this issue! From my understanding, the inductor backend currently does not perform the implicit dtype conversion, which leads to the assertion error. I plan to modify the inductor compiler to align with the eager backend's behavior.

Before proceeding, I wanted to confirm:

  • should the fix involve explicitly converting int64 inputs to float32 within inductor's handling of slice_scatter?
  • are there any existing tests that check for dtype consistency between eager and inductor?

Let me know if this is the right way, thanks!

@shaoyuyoung
Contributor Author
shaoyuyoung commented Mar 1, 2025

Hi, @Numbers0689, thanks for your comment and kindness!

  • should the fix involve explicitly converting int64 inputs to float32 within inductor's handling of slice_scatter?

This solution is enough for this case, but I am not sure whether other dtypes (int32, int8, uint64, etc.) behave similarly to int64. It would be better if we could deal with all of these similar problems at once.

  • are there any existing tests that check for dtype consistency between eager and inductor?

To be honest, I am also not sure (but I think no such tests exist currently). We previously discussed some dtype-inconsistency issues in #147666; maybe you can get some inspiration from #147666 (comment).
Regardless of whether tests already exist, you should write a UT (unit test) to verify that the fix is correct; see the sketch below. :)

I'm not sure if my answer is correct, so feel free to discuss more. Or you can draft a PR first, and then PyTorch developers will take a look at it for code review. :)
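
A minimal sketch of the kind of unit test suggested above (the test name and structure are illustrative, not an existing PyTorch test):

import torch

def test_slice_scatter_int64_src_into_float32_base():
    def f(x):
        y = torch.Tensor([0])  # float32 base
        return torch.slice_scatter(y, x, 0)

    x = torch.tensor([0], dtype=torch.int64)
    eager_out = f(x)
    compiled_out = torch.compile(f, backend="inductor")(x)
    # both outputs should be float32 and numerically equal
    torch.testing.assert_close(eager_out, compiled_out)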

@Numbers0689

Thanks for the clarification! I'll check the dtype behavior across the other integer types and go through #147666.

I'll also add a unit test to verify the fix and start drafting a PR.

@Numbers0689

Hi, @shaoyuyoung, while investigating torch.slice_scatter I came across torch.select_scatter, and upon testing, both of them throw an AssertionError for every integer dtype (int8, int16, int32, int64, uint8, uint16, uint32, uint64).
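
A sketch of the kind of per-dtype sweep that surfaces this (the repro helper is illustrative; the wider unsigned dtypes such as uint16/uint32/uint64 have limited support and may need a recent PyTorch build, so they are omitted here):

import torch

def repro(dtype):
    def f(x):
        y = torch.Tensor([0])  # float32 base
        return torch.slice_scatter(y, x, 0)

    x = torch.tensor([0], dtype=dtype)
    try:
        torch.compile(f, backend="inductor")(x)
        return "ok"
    except Exception as e:
        return type(e).__name__

for dt in (torch.int8, torch.int16, torch.int32, torch.int64, torch.uint8):
    print(dt, repro(dt))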

@shaoyuyoung
Contributor Author
shaoyuyoung commented Mar 2, 2025

Well, that matches my expectations, as I have encountered similar cases before. Dtype processing in inductor seems very fragile with respect to staying consistent with eager's dtype handling; you can find more details in #144362.

Back to fixing this issue: previously I tried adding a manual check for these dtypes one by one, as in #145136. I think that is enough for this issue, but it would be great if we could find some way to solve the problem uniformly; a sketch of the intended semantics follows below. :)
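
For illustration, the eager semantics that a uniform fix would have to match fit in a few lines of user-level Python (a sketch of the expected behavior, not the inductor-internal change):

import torch

def slice_scatter_like_eager(base, src, dim=0, start=None, end=None, step=1):
    # eager semantics: the scattered values adopt the base tensor's dtype,
    # regardless of src's dtype
    return torch.slice_scatter(base, src.to(base.dtype), dim, start, end, step)

out = slice_scatter_like_eager(torch.Tensor([0]), torch.tensor([0], dtype=torch.int64))
print(out, out.dtype)  # tensor([0.]) torch.float32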

@desertfire desertfire added the triaged label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Mar 4, 2025
@Vikramjeetsingh07

@shaoyuyoung
To ensure dtype consistency, I tried replicating the error, and the simple fix below helps:

y = torch.Tensor([0]).to(x.dtype)
x = torch.slice_scatter(y, x, 0)

As you said, this only resolves this particular case; the broader solution is to fix the inductor compiler's type inference. If you want, I can fix it and commit a branch. Hope this helps.

@shaoyuyoung
Contributor Author

Hi, @Vikramjeetsingh07
Thanks for the comment. I think developers are happy to see any PR that fixes issues.
Feel free to draft a PR. :)
