Support narrow() on batch dim for NJT #142063
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142063
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures as of commit dedcddf with merge base 46390e9.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@soulitzer / @cpuhrsch this is ready now - the PR works in eager / compile without graph breaks for contiguous NJTs
Looks great! Some small comments
end_val += inp._values.size(dim)
start_val = max(min(start_val, inp._values.size(dim)), 0)
end_val = max(min(end_val, inp._values.size(dim)), 0)
length_val = max(min(length_val, end_val - start_val), 0)
Unfortunate that we have to duplicate narrow()'s input checking/manipulation logic here, but I guess it's maybe hard to avoid given the as_strided use.
Yeah, agreed - I don't like this at all :( but since we aren't redispatching to narrow(), we can't reuse the checks there.
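For reference, here's a minimal standalone sketch of the narrow()-style argument normalization that ends up duplicated (the function name and free-floating form are illustrative, not the actual ops.py code):

```python
def normalize_narrow_args(start: int, length: int, dim_size: int):
    # Mirror narrow() semantics: a negative start wraps around the dim,
    # like Python indexing.
    if start < 0:
        start += dim_size
    end = start + length
    # Clamp both endpoints into [0, dim_size] so the downstream
    # as_strided call can never read out of bounds, then recompute a
    # non-negative length.
    start = max(min(start, dim_size), 0)
    end = max(min(end, dim_size), 0)
    return start, max(end - start, 0)
```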
if operating_on_batch:
    # batch dim narrowing requires custom logic involving offsets
    out_kwargs = extract_kwargs(inp)
    start_val, length_val = new_kwargs["start"], new_kwargs["length"]
Trying to think of weird edge cases (totally fine if normal narrow doesn't cover this either, but I guess we want parity?)
- What happens if length_val is negative? I didn't see any explicit check, and the docs say that it must be "weakly positive", whatever that means 🤔.
- What happens if start_val is negative, but length_val > the size of the tensor?
good call, will add test cases for these
it must "weakly positive" whatever that means
my understanding is that this allows for length=0? idk for sure though
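To make the parity target concrete, here's how dense-tensor narrow() handles these cases today (a quick sketch; worth re-verifying against current eager behavior):

```python
import torch

x = torch.randn(5, 3)

# length=0 is allowed ("weakly positive" presumably meaning non-negative)
assert x.narrow(0, 2, 0).shape == (0, 3)

# a negative start wraps around: start=-2 is equivalent to start=3 here
assert torch.equal(x.narrow(0, -2, 2), x[3:5])

# a negative length, or start + length running past the end, should raise
for start, length in [(0, -1), (-2, 4)]:
    try:
        x.narrow(0, start, length)
    except RuntimeError as e:
        print(f"narrow(0, {start}, {length}) raised: {e}")
```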
test/test_nestedtensor.py
Outdated
@@ -8495,6 +8567,18 @@ def __torch_dispatch__(self, func, types, args=..., kwargs=None):
op_match_fn=lambda device, op: (op.full_name in {"cdouble", "cfloat", "chalf"}),
name="unimplemented_view_as_real",
),
# narrow(): unbacked SymInt bug with non-contig transposed inputs |
Oof. Just curious - is there an issue filed somewhere? If there is, maybe worth linking.
torch/nested/_internal/ops.py
Outdated
start_val += inp._values.size(dim)
if end_val < 0:
    end_val += inp._values.size(dim)
start_val = max(min(start_val, inp._values.size(dim)), 0)
Hmm, it feels like it shouldn't be possible for this to be negative here (prior to the max with 0).
test/test_nestedtensor.py
Outdated
self.assertEqual(out3_comp, nt_comp)

# length past the end
with self.assertRaisesRegex(RuntimeError, "exceeds dimension size"):
In theory this is the type of thing we could also handle via XFail with sample_match_fn, right?
Is it fair to say that we don't want to clutter that too much with basic input-validity checks, and instead reserve it for actual bugs and features that are not implemented?
Yeah, that's right - I have a mental TODO to use the error_inputs_func feature of OpInfos to handle this type of expected error-checking failure. Ideally, as you mentioned, I'd want to avoid cluttering the xfails with things that we'll never address and that don't actually represent bugs.
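A rough sketch of what that could look like with error_inputs_func (the sample construction below is hypothetical, not actual test-suite code):

```python
import torch
from torch.testing._internal.opinfo.core import ErrorInput, SampleInput

# Hypothetical error_inputs_func for an NJT narrow OpInfo entry.
def error_inputs_narrow(op_info, device, **kwargs):
    nt = torch.nested.nested_tensor(
        [torch.randn(2, 5), torch.randn(3, 5)],
        layout=torch.jagged,
        device=device,
    )
    # length past the end of the batch dim should raise
    yield ErrorInput(
        SampleInput(nt, args=(0, 1, 10)),
        error_type=RuntimeError,
        error_regex="exceeds dimension size",
    )
```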
test/test_nestedtensor.py
Outdated
@torch._dynamo.utils.disable_cache_limit()
@dtypes(torch.float32)
@parametrize("env", ["eager", "compile", "compile_dynamic"])
def test_narrow_on_batch_dim(self, device, dtype, env):
I wonder how much overlap there is between this test and the narrow OpInfo one.
Like, could the "first few", "middle", and "last" batch items be formulated as sample inputs?
And then we could reformulate this one into two smaller tests that do more specific things, e.g. test_narrow_on_batch_dim_input_validation and test_narrow_on_batch_dim_narrow_of_narrow.
> Like, could the "first few", "middle", and "last" batch items be formulated as sample inputs?
Definitely and I should do that :) will fix
Edit: I realized I'm kind of already doing this for the generated sample inputs on non-ragged dims
> And then we could reformulate this one into two smaller tests that do more specific things, e.g. test_narrow_on_batch_dim_input_validation and test_narrow_on_batch_dim_narrow_of_narrow.
Yeah, these are harder to test with OpInfo, so I think it makes sense to break them out.
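For the narrow-of-narrow piece, something shaped like this (a hypothetical sketch of a method inside the existing test class, not the final code):

```python
@dtypes(torch.float32)
def test_narrow_on_batch_dim_narrow_of_narrow(self, device, dtype):
    nt = torch.nested.nested_tensor(
        [torch.randn(n, 8) for n in (2, 3, 4, 5)],
        layout=torch.jagged, device=device, dtype=dtype,
    )
    first = nt.narrow(0, 1, 3)      # batch items 1..3
    second = first.narrow(0, 1, 2)  # items 2..3 of the original
    # each narrowed component should match the original batch item
    for out_c, ref_c in zip(second.unbind(), nt.unbind()[2:4]):
        self.assertEqual(out_c, ref_c)
```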
Oh, I just realized the compile tests for narrow-on-narrow weren't actually compiling :p
Changing them to actually compile makes them fail; investigating.
Okay, I tracked this down to incorrect clamping: the old logic was clamping (start, end) in the inner values-dim space, but we want to clamp in the outer batch-dim space. Fixing this resolved the data-dependent guard errors.
There's still some work to be done for non-contiguous NJTs apparently, but I've been deprioritizing those in general, so I'll just land this as-is.
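In other words, something like this (all names illustrative; a sketch of the idea rather than the actual ops.py change): clamp against the number of batch items, then derive the narrowed values/offsets from the clamped batch range.

```python
import torch

def narrow_batch_dim(values, offsets, start, length):
    batch_size = offsets.numel() - 1  # outer batch dim size, NOT values.size(0)
    if start < 0:
        start += batch_size
    end = start + length
    # clamp (start, end) in batch-dim space
    start = max(min(start, batch_size), 0)
    end = max(min(end, batch_size), 0)
    # the narrowed NJT covers the values rows for batch items [start, end)
    new_values = values[offsets[start] : offsets[end]]
    new_offsets = offsets[start : end + 1] - offsets[start]
    return new_values, new_offsets

# e.g. components of lengths [2, 3, 4] -> keep the last two
values, offsets = torch.randn(9, 5), torch.tensor([0, 2, 5, 9])
v, o = narrow_batch_dim(values, offsets, 1, 2)
assert v.shape == (7, 5) and o.tolist() == [0, 3, 7]
```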
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as
ghstack-source-id: 0f65635
Pull Request resolved: pytorch/pytorch#142063
Stack from ghstack (oldest at bottom):
Requested in #136270
cc @cpuhrsch @bhosmer @drisspg @soulitzer @davidberard98 @YuqingJ @ezyang @SherlockNoMad @EikanWang @jgong5 @wenzhe-nrv