[dynamic shapes] aten.constant_pad_nd meta impl by pianpwk · Pull Request #152129 · pytorch/pytorch


Closed · wants to merge 9 commits

Conversation

pianpwk (Contributor) commented Apr 24, 2025

We know the output shape, and we know this always produces a clone. Avoids data-dependent errors from the decomposition.

Along with #150483, this should fix #123855.

pytorch-bot (bot) commented Apr 24, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152129

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 304fddd with merge base 89c0c3c:

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pianpwk pianpwk changed the title [WIP][dynamic shapes] aten.constant_pad_nd meta impl [dynamic shapes] aten.constant_pad_nd meta impl Apr 25, 2025
@pianpwk pianpwk marked this pull request as ready for review April 25, 2025 00:34
@@ -4269,6 +4269,26 @@ def forward(self, x):
):
_ = export(M(), (torch.tensor([2, 3, 5]),))

@testing.expectedFailureTrainingIRToRunDecomp
Review comment (Contributor):

What's the failure? Since this is the default path, I think we should fix whatever it shows.

pianpwk (Contributor Author) replied:

Fixed with #150483

laithsakka (Contributor) commented:

In the summary you mention "we know this always produces a clone". What is this?

@@ -7322,6 +7323,29 @@ def softmax(x: Tensor, dim: int, half_to_float: bool) -> Tensor:
return res


@register_meta(aten.constant_pad_nd)
@out_wrapper()
def _constant_pad_nd_meta(input, pad, value=0):
Review comment (Contributor):

Shall we add the torch checks from the decomp version in order to fail earlier?

laithsakka (Contributor) left a review comment:

I reviewed this by comparing against the existing decomposition:


# This replicates at::constant_pad_nd, defined in ATen/native/PadNd.cpp
@register_decomposition(aten.constant_pad_nd)
@out_wrapper()
def constant_pad_nd(
    input: TensorLikeType, pad: list[int], value: NumberType = 0
) -> TensorLikeType:
    torch._check(
        len(pad) % 2 == 0,
        lambda: f"Length of pad must be even but instead it equals {len(pad)}",
    )

    input_sizes = input.shape
    l_inp = len(input_sizes)

    l_pad = len(pad) // 2
    l_diff = l_inp - l_pad

    torch._check(
        l_inp >= l_pad,
        lambda: "Length of pad should be no more than twice the number of "
        f"dimensions of the input. Pad length is {len(pad)} while the input has "
        f"{l_inp} dimensions.",
    )

    c_input = input
    for i in range(l_diff, l_inp):
        pad_idx = 2 * (l_inp - i - 1)
        if pad[pad_idx] < 0:
            c_input = c_input.narrow(i, -pad[pad_idx], c_input.shape[i] + pad[pad_idx])

        if pad[pad_idx + 1] < 0:
            c_input = c_input.narrow(i, 0, c_input.shape[i] + pad[pad_idx + 1])

    # If all the pads are negative we can return the result.
    # Avoid early exiting if all pads = 0 to prevent specialization on export.
    # During export, raw if statements are specialized on the input, meaning
    # that we lose a branch depending on the example input used to export.
    # Here, this is either the case where all pads = 0, or the case where at
    # least one pad > 0 and the rest are >= 0.
    # Avoiding the early exit when all pads = 0 ensures we can export
    # constant_pad_nd for cases when all pads >= 0.
    # Note: if any pads are negative, this code specializes due to the if statements above.
    if builtins.all(p < 0 for p in pad):
        return c_input.clone()

    new_shape = list(input_sizes[:l_diff])

    for i in range(l_pad):
        pad_idx = len(pad) - ((i + 1) * 2)
        new_dim = input_sizes[l_diff + i] + pad[pad_idx] + pad[pad_idx + 1]
        torch._check(
            new_dim > 0,
            lambda: f"The input size {input_sizes[l_diff + i]}, plus negative padding "
            f"{pad[pad_idx]} and {pad[pad_idx + 1]} resulted in a negative output size, "
            f"which is invalid. Check dimension {l_diff + i} of your input.",
        )
        new_shape.append(new_dim)

    memory_format = utils.suggest_memory_format(input)
    output = torch.empty(
        new_shape,
        dtype=input.dtype,
        device=input.device,
        requires_grad=input.requires_grad,
        memory_format=memory_format,
    )

    if value == 0 and input.dtype == torch.bool:
        value = False
    # torch.fill isn't typed to allow complex values
    output = torch.fill(output, value)  # type: ignore[arg-type]

    c_output = output
    for i in range(l_diff, l_inp):
        pad_idx = 2 * (l_inp - i - 1)
        if pad[pad_idx] >= 0:
            c_output = c_output.narrow(
                i, pad[pad_idx], c_output.shape[i] - pad[pad_idx]
            )
        if pad[pad_idx + 1] >= 0:
            c_output = c_output.narrow(i, 0, c_output.shape[i] - pad[pad_idx + 1])

    prims.copy_to(c_output, c_input)
    return output

Looks legit. Can we add the torch checks, though?

pianpwk (Contributor Author) commented Apr 30, 2025

In the summary you mention "we know this always produces a clone". What is this?

Just that even if all padding is zero or negative, we'll never alias the original tensor, so this meta kernel isn't semantics-changing.

pianpwk (Contributor Author) commented May 1, 2025

@pytorchbot merge

pytorch-bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label May 1, 2025
pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

Labels
ciflow/trunk (Trigger trunk jobs on your pull request) · Merged · release notes: export

Successfully merging this pull request may close these issues: Dynamo unsupported: dynamic padding