MAINT: Introduced a couple of fixes for the np.core.fromnumeric functions #74

BvB93 · 2020-04-25T21:33:59Z

#67 Fixes & Updates

Integers and booleans can often be used interchangeably (in fact, bool is an int subclass in builtin Python). Created _ArrayLikeIntOrBool to reflect this compatibility.
Rename _ScalarGeneric to _ScalarGenericDT; add a new _ScalarGeneric TypeVar which is only bound to np.generic.
np.choose() can only accept array-like objects consisting of integers (or booleans); reflect this in its annotation.
np.argpartition() will return an np.integer if a scalar is passed; not an np.ndarray.

Further Notes

It seems np.partition() might actually be bugged in NumPy 1.17.3.
Its default value for axis is set to -1 instead of None. Consequently, it will raise an AxisError if a scalar is passed unless one explicitly passes axis=None to the function.

>>> import numpy as np

>>> np.partition(0, 0, axis=None)
array([0])

>>> np.partition(0, 0)
Traceback (most recent call last):
  ...
AxisError: axis -1 is out of bounds for array of dimension 0

New functions:
* ``argmax()``
* ``argmin()``
* ``searchsorted()``
* ``resize()``
* ``squeeze()``
* ``diagonal()``
* ``trace()``
* ``ravel()``
* ``nonzero()``
* ``shape()``
* ``compress()``

Updates:
* Integers and booleans can often be used interchangeably (in fact, ``bool`` is an ``int`` subclass in builtin Python). Updated ``_ArrayLikeInt`` to reflect this compatibility.
* Rename ``_ScalarGeneric`` to ``_ScalarGenericPlus``; add a new ``_ScalarGeneric`` TypeVar which is *only* bound to ``np.generic``.
* ``np.choose()`` can only accept integers (or booleans); reflect this in its annotation.
* It seems ``np.partition()`` is bugged in NumPy 1.17.3; its default value for ``axis`` is set to ``-1`` instead of ``None``. Consequently, it will raise an ``AxisError`` if a scalar is passed *unless* one explicitly passes ``axis=None`` to the function.
* ``np.argpartition()`` will return a ``np.generic`` if a scalar is passed; not an ``np.ndarray``.

* ``np.searchsorted()`` always takes a 1D array; replace ``Sequence[_ArrayLike]`` with ``Sequence[_Scalar]``.
* Added ``_ArrayLikeNested`` to ``np.swapaxes()``

BvB93 · 2020-04-25T21:51:15Z

numpy-stubs/__init__.pyi

@@ -898,6 +913,23 @@ def partition(
    kind: _PartitionKind = ...,
    order: Union[None, str, Sequence[str]] = ...,
 ) -> ndarray: ...
+@overload


The behaviour of np.argpartition() here is a bit odd.
If an np.ndarray is passed, an np.ndarray is returned, and if an np.generic is passed, an np.generic is returned; so far so good.

However, if a builtin scalar is passed, it also returns an np.ndarray rather than an np.generic.

>>> import numpy as np
>>> type(np.argpartition(0, 0))
numpy.ndarray
>>> type(np.argpartition(np.int64(0), 0))
numpy.int64

Ouch. Sounds like it could potentially be worthy of opening an issue against NumPy.

person142 · 2020-04-26T18:33:29Z

numpy-stubs/__init__.pyi

-_ScalarGeneric = TypeVar(
-    "_ScalarGeneric", bound=Union[dt.datetime, dt.timedelta, generic]
+# Integers and booleans can generally be used interchangeably
+_ScalarInt = TypeVar("_ScalarInt", bound=Union[integer, bool_])


I don't think _ScalarInt is the best name here; seems like it should be _ScalarIntOrBool. I mostly don't want this to accidentally get cargo-culted to other places where NumPy int/bool diverge; e.g. arithmetic.

Based on #74 (comment) I'd argue that it is a suitable name. It can still be changed though if you don't agree.

Besides, for arithmetic it would (most likely) be more convenient to use a typevar of np.number anyway.

person142 · 2020-04-26T18:34:53Z

numpy-stubs/__init__.pyi

+# Integers and booleans can generally be used interchangeably
+_ScalarInt = TypeVar("_ScalarInt", bound=Union[integer, bool_])
+_ScalarGeneric = TypeVar("_ScalarGeneric", bound=generic)
+_ScalarGenericPlus = TypeVar(


Similarly _ScalarGenericPlus is not very descriptive of the type here. Also this definition would allow passing through subclasses of datetime and timedelta; are we confident that will work correctly?

Maybe something like _ScalarGenericDT instead?

About your second concern, some quick testing suggests that subclassing does indeed work:

>>> import datetime as dt
>>> import numpy as np
>>> class A(dt.timedelta): ...
>>> np.take(A(1), 0).__class__
__main__.A
>>> class B(dt.datetime): ...
>>> np.take(B(2000, 1, 1), 0).__class__
__main__.B
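
For the typing side of the same question, here is a minimal sketch of why a bound TypeVar lets a subclass pass through unchanged. The names `_ScalarDT` and `passthrough` are hypothetical and only meant as illustration; they are not the definitions used in the stubs, and the example relies on the standard library alone.

```python
import datetime as dt
from typing import TypeVar, Union

# Hypothetical TypeVar, bound in the same spirit as the datetime/timedelta
# part of the TypeVar discussed above.
_ScalarDT = TypeVar("_ScalarDT", bound=Union[dt.datetime, dt.timedelta])


def passthrough(x: _ScalarDT) -> _ScalarDT:
    # Hypothetical helper: returns its argument unchanged, so a type checker
    # infers the exact subclass that was passed in.
    return x


class A(dt.timedelta):
    pass


result = passthrough(A(1))
print(type(result).__name__)
```

Because the TypeVar is bound rather than constrained, a type checker should infer `A` (not `dt.timedelta`) for `result`, which matches the runtime behaviour shown above.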

person142 · 2020-04-26T18:35:20Z

numpy-stubs/__init__.pyi

 )

 # An array-like object consisting of integers
-_Int = Union[int, integer]
+_Int = Union[int, integer, bool, bool_]


As above about naming.

person142 · 2020-04-26T18:35:40Z

numpy-stubs/__init__.pyi

+_ArrayLikeBoolNested = Any  # TODO: wait for support for recursive types
+
+# Integers and booleans can generally be used interchangeably
+_ArrayLikeInt = Union[


As above about naming.

person142 · 2020-04-26T18:41:39Z

numpy-stubs/__init__.pyi

 @overload
 def choose(
-    a: _ArrayLike,
+    a: _ArrayLikeInt,


So while passing bools technically works, given that the docstring says

"This array must contain integers in [0, n-1], where n is the number of choices"

it seems to me that this is more an accident than the intent. I'm not sure that we should allow this in the types as it seems like not a best practice. @rgommers WDYT about this case? (We could also solicit opinions on the mailing list.)

Accident might not be the best term here.
The general compatibility between int and bool stems from the fact that bool is a subclass of int.

>>> import numpy as np
>>> issubclass(bool, int)
True
>>> True == 1 and False == 0
True
>>> np.bool_(True) == 1 and np.bool_(False) == 0
True

Union[int, bool] is consequently equivalent to just plain int, but as np.bool_ is not a subclass of np.integer this does not hold for np.bool_, even though the latter two classes behave (more or less; arithmetic is a bit of an exception here) identically to int and bool.

This also means that we could remove np.bool_ from the _ArrayLikeInt union, but properly removing bool would require adding a new overload along the lines of
choose(a: bool, ...) -> NoReturn.
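
A rough sketch of what such a NoReturn overload could look like. `choose_scalar` is a hypothetical pure-Python stand-in that only handles a scalar index; it is not np.choose() and not the signature used in the stubs.

```python
from typing import NoReturn, Sequence, overload


# The bool overload comes first: bool is an int subclass, so the more
# specific signature has to be matched before the general int one.
@overload
def choose_scalar(a: bool, choices: Sequence[int]) -> NoReturn: ...
@overload
def choose_scalar(a: int, choices: Sequence[int]) -> int: ...
def choose_scalar(a, choices):
    # Hypothetical runtime counterpart of the overloads above.
    if isinstance(a, bool):
        # Reject bool explicitly, mirroring the NoReturn annotation.
        raise TypeError("bool is not accepted as an index")
    return choices[a]


picked = choose_scalar(1, [10, 20, 30])
print(picked)
```

With this layout a type checker can flag code that follows a `choose_scalar(True, ...)` call as unreachable, while plain int arguments keep resolving to the second overload.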

but as np.bool_ is not a subclass of np.integer this does not hold for np.bool_

And this is why I don't want to include it. Yes, including int gets you bool, but that's intentionally not true for numpy.bool_, so I'm not sure we should make an extra effort to bring it back here. To be clear, when I said "accident" I meant "that np.bool_ works".

Fair enough, I've implemented the name changes in #74.

person142 · 2020-04-26T18:42:39Z

numpy-stubs/__init__.pyi

@@ -898,6 +913,23 @@ def partition(
    kind: _PartitionKind = ...,
    order: Union[None, str, Sequence[str]] = ...,
 ) -> ndarray: ...
+@overload


Ouch. Sounds like it could potentially be worthy of opening an issue against NumPy.

See #74 (comment).

* Renamed ``_ScalarGenericPlus`` to ``_ScalarGenericDT`` (#74 (comment)).
* Renamed ``_Int`` to ``_IntOrBool`` (#74 (comment)).
* Renamed ``_ArrayLikeInt`` to ``_ArrayLikeIntOrBool`` (#74 (comment)).

person142 · 2020-05-01T03:06:17Z

Thanks @BvB93!

Bas van Beek added 4 commits April 24, 2020 15:05

Replace _ArrayLike with _ArrayLikeNested for >= 1D arrays

4e2e6bf

Update __init__.pyi

fcd4e46

Removed the functions introduced in #71

062947b

BvB93 mentioned this pull request Apr 25, 2020

ENH: Add type annotations for the np.core.fromnumeric module: part 2/4 #71

Merged

BvB93 commented Apr 25, 2020

_ArrayLikeNested is redundant in its current implementation; remove it

443b301

person142 added the maintenance label Apr 26, 2020

person142 reviewed Apr 26, 2020

Bas van Beek added 4 commits April 27, 2020 23:30

Removed a leftover from #71

e519d92

Addressed comments from #74

911eed0

Fixed an incorrect variable name

cda1e70

Fixed: Forgot to actually rename _Int to _IntOrBool in 911eed0

45158c1

person142 merged commit 44de2bb into numpy:master May 1, 2020

BvB93 deleted the fromnumeric2 branch May 1, 2020 09:10
