DEP: Deprecate non-tuple nd-indices by eric-wieser · Pull Request #9686 · numpy/numpy

DEP: Deprecate non-tuple nd-indices #9686

Merged (2 commits) on May 27, 2018

Conversation

@eric-wieser (Member) commented on Sep 14, 2017

A reminder of the purpose of this: currently we allow both arr[[None, 0]] and arr[(None, 0)] to mean the same thing, yet arr[[0, 0]] and arr[(0, 0)] mean different things. This makes it very hard to write a compliant subclass or duck array.
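
A minimal sketch of the inconsistency (shapes assume a small 2-D array; this is the pre-deprecation behaviour):

import numpy as np
arr = np.arange(12).reshape(3, 4)
arr[(0, 0)]     # plain nd-index: the single element arr[0, 0]
arr[[0, 0]]     # fancy index: rows 0 and 0, shape (2, 4)
arr[(None, 0)]  # new axis plus row 0, shape (1, 4)
arr[[None, 0]]  # historically treated like the tuple above, shape (1, 4); now emits a FutureWarning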

By deprecating this feature, we force downstream library code to stop using it, which in turn makes that library code more compatible with subclasses and duck types.


This reapplies @seberg's #4434, with the following changes:

  • tuple subclasses are not deprecated, because supporting namedtuples seems possibly useful (see the sketch after this list)
  • All warnings are FutureWarnings, since determining how np.array(...) will treat a sequence is expensive - it's easier to just say "this will error if np.array(seq) would"
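
A rough illustration of the namedtuple case mentioned in the first bullet (the Index name is purely illustrative):

from collections import namedtuple
import numpy as np
Index = namedtuple('Index', ['row', 'col'])
arr = np.arange(12).reshape(3, 4)
arr[Index(row=1, col=2)]  # a tuple subclass, so this keeps behaving like arr[1, 2]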

As of #10618, none of our code relies on this "feature"

@mhvk (Contributor) left a comment

Most of my comments just suggest making a tuple to start with. The many occurrences in our own code do make me wonder slightly whether this is in fact a good idea. I fear that constructing a list of lots of slice(None) plus a real slice is going to be common outside of numpy too...
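
For reference, a minimal sketch of the pattern in question and the fix this PR asks for (arr, axis, and n are placeholders):

import numpy as np
arr = np.zeros((4, 5, 6))
axis, n = 1, 3
index = [slice(None)] * arr.ndim   # the common downstream idiom
index[axis] = slice(0, n)
sub = arr[tuple(index)]            # wrapping in tuple() avoids the new FutureWarning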

@@ -69,13 +69,13 @@ def _raw_fft(a, n=None, axis=-1, init_function=fftpack.cffti,
if s[axis] > n:
index = [slice(None)]*len(s)
@mhvk (Contributor):

Maybe nicer to just fix the creation of the index:

index = (slice(None),) * axis + (slice(0, n),)

then there is no need to use tuple() below.

@@ -21,6 +21,12 @@ New functions
Deprecations
============

* Multidimensional indexing with anything but a base class tuple is
deprecated. This means that code such as ``arr[[slice(None)]]`` has to
@mhvk (Contributor):

How about giving an example with a slightly more useful case, e.g., [slice(None), slice(0, 10)] -> tuple([slice(None), slice(0, 10)]) (or simply the tuple literal (slice(None), slice(0, 10)))?

@@ -126,7 +126,7 @@ def test_arithmetic_drops_references(self):
def test_indexing_drops_references(self):
fp = memmap(self.tmpfp, dtype=self.dtype, mode='w+',
shape=self.shape)
tmp = fp[[(1, 2), (2, 3)]]
tmp = fp[((1, 2), (2, 3))]
@mhvk (Contributor):

Here, the parentheses are not necessary, correct?

@eric-wieser (Member, Author):

Fixed

@@ -69,13 +69,13 @@ def _raw_fft(a, n=None, axis=-1, init_function=fftpack.cffti,
if s[axis] > n:
index = [slice(None)]*len(s)
index[axis] = slice(0, n)
a = a[index]
a = a[tuple(index)]
else:
index = [slice(None)]*len(s)
index[axis] = slice(0, s[axis])
@mhvk (Contributor):

And same here.

index = (slice(None),) * axis + (slice(0, s[axis]),)

@@ -1745,7 +1745,7 @@ def gradient(f, *varargs, **kwargs):
slice4[axis] = slice(2, None)
@mhvk (Contributor):

If it were up to me, I'd just create a base prefix (slice(None),) * axis here and add the 1, 2, 3, 4 slices as needed below - which I think would make for clearer code. But that is getting well beyond the immediate purpose of this PR, so feel free to ignore.
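
A small sketch of that prefix idea (illustrative only; slice(2, None) is taken from the diff above, f and axis are placeholders):

import numpy as np
f = np.arange(24.0).reshape(4, 6)
axis = 1
prefix = (slice(None),) * axis        # common leading slices, built once
slice4 = prefix + (slice(2, None),)   # append the per-stencil slice directly
f[slice4]                             # no list mutation or tuple() conversion needed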

numpy/ma/core.py Outdated
@@ -5483,6 +5483,7 @@ def sort(self, axis=-1, kind='quicksort', order=None,
else:
idx = list(np.ix_(*[np.arange(x) for x in self.shape]))
idx[axis] = sidx
idx = tuple(idx)
@mhvk (Contributor):

Maybe

idx = np.ix_(*[(sidx if i == axis else np.arange(self.shape[i])) for i in range(self.ndim)])

well, maybe not really better at all.

@@ -723,6 +723,7 @@ def _median(a, axis=None, out=None, overwrite_input=False):
# as median (which is mean of empty slice = nan)
indexer = [slice(None)] * asorted.ndim
indexer[axis] = slice(0, 0)
indexer = tuple(indexer)
@mhvk (Contributor):

indexer = (slice(None),) * axis + (slice(0, 0),)

@@ -784,6 +785,7 @@ def replace_masked(s):
if np.issubdtype(asorted.dtype, np.inexact):
# avoid inf / x = masked
s = np.ma.sum([low, high], axis=0, out=out)
print(repr(s.data))
@mhvk (Contributor):

Probably not intended!

@eric-wieser (Member, Author):

Nice catch! Must have been in my working copy or stash or something.

@eric-wieser (Member, Author):

Fixed

@@ -1657,7 +1659,7 @@ def flatnotmasked_contiguous(a):
if not k:
result.append(slice(i, i + n))
i += n
return result or None
return tuple(result) or None
@mhvk (Contributor):

Also update the docstring! (It states a list is returned)

@eric-wieser (Member, Author):

Fixed

result.append(flatnotmasked_contiguous(a[idx]) or None)
return result
result.append(flatnotmasked_contiguous(a[tuple(idx)]) or None)
return tuple(result)
@mhvk (Contributor):

Again, do update the docstring.

@eric-wieser (Member, Author):

Fixed

@charris changed the title from "Rebase of #4434" to "DEP: Deprecate non-tuple nd-indices" on Sep 25, 2017
@charris added this to the 1.14.0 release milestone on Sep 25, 2017
@eric-wieser (Member, Author) commented on Sep 26, 2017

Just some notes from a performance perspective:

  • until #8278 (MAINT: Make the refactor suggested in prepare_index), passing lists was probably slower anyway, because you ended up doing a bunch of heuristics and then calling PyTuple_New on the list on the C side.
  • coercing a list to a tuple seems faster than building a tuple from scratch each time:
>>> a = np.zeros((2,)*10)

>>> sln = (slice(None),) # cache this to try and maximize speed
>>> %timeit a[sln*3 + (1,) + sln*6]
933 ns ± 20 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

>>> sl = list(sln * 10)
>>> %timeit sl[3] = 1; a[tuple(sl)]
858 ns ± 14.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

>>> %timeit sl[3] = 1; a[sl]  # passing a list is still fastest though
748 ns ± 27.3 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

@seberg (Member) commented on Sep 26, 2017 via email

@eric-wieser (Member, Author):

Weirdly, writing it out explicitly is slowest of all:

In [42]: %timeit a[:,:,:,1,:,:,:,:,:,:]
985 ns ± 10.1 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

@seberg (Member) commented on Sep 26, 2017

Dunno, that might be unfair since Python has to create the slice objects inside the timed statement, etc.
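
For what it's worth, a sketch of a fairer comparison that builds the slice objects outside the timed statement (same IPython %timeit style as above):

>>> idx = (slice(None),) * 3 + (1,) + (slice(None),) * 6  # precomputed once
>>> %timeit a[idx]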

@mhvk (Contributor) commented on Sep 26, 2017

The performance aspect probably doesn't matter so much; my comments were more about trying to make the code cleaner, with fewer conversions.

But I must admit that I've wondered why a direct list comprehension is so much faster than feeding a generator to list() or tuple():

In [1]: %timeit [_i for _i in range(100)]
100000 loops, best of 3: 5.75 µs per loop

In [2]: %timeit list(_i for _i in range(100))
100000 loops, best of 3: 9.94 µs per loop

In [3]: %timeit tuple(_i for _i in range(100))
100000 loops, best of 3: 10.1 µs per loop

@charris (Member) commented on Oct 18, 2017

Sounds like this might need more discussion.

@charris (Member) commented on Nov 26, 2017

Pushing off to 1.15.

@eric-wieser (Member, Author):

@mhvk: I'd prefer to leave the different tuple generation strategy to a different PR, and keep this change as small as possible.

@eric-wieser force-pushed the force-tuple branch 2 times, most recently from 397c822 to 8f469b3, on February 17, 2018
@eric-wieser changed the base branch from master to maintenance/1.14.x on March 16, 2018
@eric-wieser changed the base branch from maintenance/1.14.x back to master on March 16, 2018
"interpreted as an array index, `arr[np.array(seq)]`, "
"which will result either in an error or a different "
"result.") < 0) {
return -1;
@eric-wieser (Member, Author) commented on Mar 16, 2018

Think I might have missed some DECREFs here.

@eric-wieser (Member, Author):

The base switch above was to recompute the diff. Shame there's no button to do that.

@eric-wieser (Member, Author):

This may also need a doc change to https://docs.scipy.org/doc/numpy/reference/arrays.indexing.html#advanced-indexing

@charris (Member) commented on May 25, 2018

@eric-wieser Want to make that change, or can I put this in?

@shoyer (Member) commented on Apr 18, 2019

Is there a plan for finishing this deprecation cycle? Would 1.17 be too soon to change this?

@eric-wieser (Member, Author) commented on Apr 18, 2019

My gut feeling is to leave this till (at least) 1.18 - this was quite an intrusive deprecation, and the PR references above suggest that downstream packages are still running into it every few months. Perhaps the deprecation NEP has a clear stance here.

Is there some reason you're hoping to get rid of this sooner? My main goal with this PR was to stop users writing code in this style, so that I could stop trying to replicate it in my own __getitem__, without actually caring whether the change ever happens.

@shoyer (Member) commented on Apr 18, 2019

Right now, numpy_array[duck_array] can fail in some but not all circumstances to be interpreted as numpy_array[np.asarray(duck_array)]. This resulted in an issue for JAX: jax-ml/jax#620

But I agree, it seems that we should wait a bit longer before changing this.

@eric-wieser (Member, Author):

Do the jax objects implement __array_ufunc__, or some other marker we could look for that means "do not use the deprecated behavior"?

@shoyer (Member) commented on Apr 18, 2019

They implement __array__ currently, and will soon implement both __array_ufunc__ and __array_function__ (jax-ml/jax#611).

@eric-wieser (Member, Author):

I think the presence of __array__ is probably a good indication that we should not treat them as a tuple. Perhaps add a check for that attribute to shortcut this path, as long as the type is not a list?
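
A rough Python sketch of the check being proposed (the real logic lives in C in prepare_index; the function name here is purely illustrative):

def looks_like_array_index(index):
    # the proposal: anything advertising __array__ (and not a plain list)
    # should bypass the deprecated sequence-as-tuple handling entirely
    return not isinstance(index, list) and hasattr(type(index), '__array__')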

@rgommers (Member):

> Perhaps the deprecation NEP has a clear stance here.

It won't have an opinion on any specific cases. That was a clear message from review: just express the principles, not current cases. I'm planning to get back to that NEP soon, by the way.

I agree with waiting till 1.18.

@eric-wieser (Member, Author) commented on Apr 18, 2019

We might be able to finish the deprecation now for everything except lists, which I think have been all of the downstream cases that mattered.

We could also consider bumping this to an np.VisibleDeprecationWarning to try and flush out remaining downstream projects.
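
For downstream projects, a hedged sketch of how remaining usages could be surfaced ahead of any such change (assumes a NumPy version where this is still only a warning; the index below is deliberately of the deprecated form):

import warnings
import numpy as np
with warnings.catch_warnings():
    # escalate the current FutureWarning (or a future VisibleDeprecationWarning)
    # so test suites fail loudly on non-tuple nd-indices
    warnings.simplefilter("error", FutureWarning)
    arr = np.zeros((3, 3))
    arr[[slice(None), 0]]  # raises instead of quietly warning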

@rgommers (Member):

> We could also consider bumping this to an np.VisibleDeprecationWarning to try and flush out remaining downstream projects.

I like that idea; if we can do that for 1.17, it will probably flush out many more issues for end users.

hawkinsp added commits to hawkinsp/numpy referencing this pull request (Feb 10-14, 2022): "This behavior has been deprecated since NumPy 1.15 (numpy#9686)."
melissawm pushed a commit to melissawm/numpy referencing this pull request (Apr 12, 2022) with the same message.
seberg pushed a commit to seberg/numpy referencing this pull request (Apr 24, 2022) with the same message.