ENH: `__array_function__` support for `np.lib`, part 2/2 #12119

shoyer · 2018-10-08T19:56:20Z

xref #12028

np.lib.npyio through np.lib.ufunclike

xref GH12028 np.lib.npyio through np.lib.ufunclike

shoyer · 2018-10-08T22:47:09Z

There's some weird interaction between iscomplexobj and assert_equal (which internally calls iscomplexobj) that is causing the observed test failures, e.g.,


self = <numpy.core.tests.test_overrides.TestArrayFunctionDispatch object at 0x7f8910047ba8>
    def test_interface(self):
    
        class MyArray(object):
            def __array_function__(self, func, types, args, kwargs):
                return (self, func, types, args, kwargs)
    
        original = MyArray()
        (obj, func, types, args, kwargs) = dispatched_one_arg(original)
        assert_(obj is original)
        assert_(func is dispatched_one_arg)
        assert_equal(set(types), {MyArray})
>       assert_equal(args, (original,))
MyArray    = <class 'numpy.core.tests.test_overrides.TestArrayFunctionDispatch.test_interface.<locals>.MyArray'>
args       = (<numpy.core.tests.test_overrides.TestArrayFunctionDispatch.test_interface.<locals>.MyArray object at 0x7f8910047898>,)
func       = <function dispatched_one_arg at 0x7f8913644e18>
kwargs     = {}
obj        = <numpy.core.tests.test_overrides.TestArrayFunctionDispatch.test_interface.<locals>.MyArray object at 0x7f8910047898>
original   = <numpy.core.tests.test_overrides.TestArrayFunctionDispatch.test_interface.<locals>.MyArray object at 0x7f8910047898>
self       = <numpy.core.tests.test_overrides.TestArrayFunctionDispatch object at 0x7f8910047ba8>
types      = [<class 'numpy.core.tests.test_overrides.TestArrayFunctionDispatch.test_interface.<locals>.MyArray'>]
../builds/venv/lib/python3.6/site-packages/numpy/core/tests/test_overrides.py:190: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
../builds/venv/lib/python3.6/site-packages/numpy/core/overrides.py:151: in public_api
    implementation, public_api, relevant_args, args, kwargs)
../builds/venv/lib/python3.6/site-packages/numpy/core/overrides.py:96: in array_function_implementation_or_override
    return implementation(*args, **kwargs)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
x = 5
    @array_function_dispatch(_iscomplexobj_dispatcher)
    def iscomplexobj(x):
        """
        Check for a complex type or an array of complex numbers.
    
        The type of the input is checked, not the value. Even if the input
        has an imaginary part equal to zero, `iscomplexobj` evaluates to True.
    
        Parameters
        ----------
        x : any
            The input can be of any type and shape.
    
        Returns
        -------
        iscomplexobj : bool
            The return value, True if `x` is of a complex type or has at least
            one complex element.
    
        See Also
        --------
        isrealobj, iscomplex
    
        Examples
        --------
        >>> np.iscomplexobj(1)
        False
        >>> np.iscomplexobj(1+0j)
        True
        >>> np.iscomplexobj([3, 1+0j, True])
        True
    
        """
        try:
>           dtype = x.dtype
E           RecursionError: maximum recursion depth exceeded
x          = 5
../builds/venv/lib/python3.6/site-packages/numpy/lib/type_check.py:311: RecursionError

shoyer · 2018-10-08T22:57:14Z

The problem seems to be that my dummy __array_function__ implementation returns self for all functions, which then breaks assert_equal when iscomplexobj() is overloaded.

I see three possible ways to fix this:

adjust my __array_function__ implementation in test_overrides.py to special case np.iscomplexobj
use assert_(args == (original,)) instead of assert_equal()
add more fallback logic to assert_equal() to handle cases where iscomplexobj() returns a nonsensical result (i.e., a non-boolean)

shoyer · 2018-10-09T02:20:38Z

I decided to go for a mix of 2 and 3:

assert_equal should be OK with np.iscomplexobj raising TypeError.
for the one case where I define __array_function__ for everything, I use assert_ instead of assert_equal.

hameerabbasi

We perhaps should get rid of the hack sometime, but other than that I'm okay with this.

hameerabbasi · 2018-10-09T12:07:55Z

numpy/lib/ufunclike.py

+    return (x, out)
+
+
+@array_function_dispatch(_fix_dispatcher, verify=False)
 @_deprecate_out_named_y


Should this be added to the dispatcher?

Yes, this is a better idea. That way the warning will be raised even if the function is dispatching.

shoyer · 2018-10-11T18:38:09Z

Note: This one still needs more work before it's ready to merge (to fix the deprecation warnings).

shoyer · 2018-10-12T01:31:02Z

OK, inspired by @mhvk's and @hameerabbasi's comments I consolidated a bunch of unnecessarily repeated dispatcher functions here, too.

shoyer · 2018-10-16T16:17:42Z

This is ready for a final review.

hameerabbasi

One minor bug I spotted as I was going through this.

hameerabbasi · 2018-10-16T16:21:53Z

numpy/lib/scimath.py

@@ -309,6 +323,12 @@ def log10(x):
    x = _fix_real_lt_zero(x)
    return nx.log10(x)

+
+def _logn_dispatcher(n, x):


This should return both arguments -- Can have logs of different bases as well.

yes -- I think I missed this because I only read the docstring, which says n is an int.

mhvk

Looks good but some suggestions for using decorators for a group.

Also, more important, the last comment about the change in testing utils.py - that was really tricky to get right; I would suggest not to change it unless truly needed.

mhvk · 2018-10-16T21:09:55Z

numpy/lib/shape_base.py

@@ -712,7 +759,12 @@ def array_split(ary, indices_or_sections, axis=0):
    return sub_arys


-def split(ary,indices_or_sections,axis=0):
+def _split_dispatcher(ary, indices_or_sections, axis=None):


Here you could use a single _split_dispatcher with the function above as well as below (where the function currently gets redefined, but then it does get reused a few times).

good catch -- fixed

mhvk · 2018-10-16T21:11:28Z

numpy/lib/twodim_base.py

@@ -373,6 +386,11 @@ def tri(N, M=None, k=0, dtype=float):
    return m


+def _tril_dispatcher(m, k=None):


Maybe call it triul_dispatcher, since it is used for both?

mhvk · 2018-10-16T21:12:13Z

numpy/lib/twodim_base.py

@@ -812,6 +842,11 @@ def tril_indices(n, k=0, m=None):
    return nonzero(tri(n, m, k=k, dtype=bool))


+def _tril_indices_form_dispatcher(arr, k=None):


Same, either _tri or _triul

mhvk · 2018-10-16T21:13:47Z

numpy/lib/type_check.py

@@ -183,6 +194,11 @@ def imag(val):
        return asanyarray(val).imag


+def _iscomplex_dispatcher(x):


Next few could be common _is_type_dispatcher?

mhvk · 2018-10-16T21:17:34Z

numpy/testing/_private/utils.py

@@ -713,7 +713,7 @@ def func_assert_same_pos(x, y, func=isnan, hasval='nan'):
        # such subclasses, but some used to work.
        x_id = func(x)
        y_id = func(y)
-        if npall(x_id == y_id) != True:
+        if (x_id == y_id).all() != True:


Given the comment above, I think this should not be changed... Alternatively, at least the comment should be adjusted.

Good catch. The problem is that now np.all() may likely not be implemented for an ndarray subclass, if it doesn't define np.all in __array_function__.

I think we can just delete this comment, because it's no longer true that np.all() can work when a subclass implements.all() differently -- we already use getattr() to pull out a all() method to call if one exists.

The reason I worry is that this change is a very recent one, by @charris (#11756), which correctly something that .all() didn't catch. (Sorry, have to run, not sure what the real issue was...)

OK, I see, thanks. I think the comment was a little misleading here -- I think the problem was classes that define equality differently (e.g., to return a boolean) rather than classes that don't define an .all() method.

hameerabbasi · 2018-10-17T08:05:44Z

numpy/testing/_private/utils.py

-        if npall(x_id == y_id) != True:
+        result = x_id == y_id
+        all_equal = (result.all()
+                     if isinstance(result, ndarray)


Will this pass through subclasses? And if so, will that be an issue?

Agree with the worry of @hameerabbasi - also, at least the comment reads a bit weird now: I don't think we have to worry about classed that define __array_function__ but do not override np.all - they're new and they can override easily as part of their trials; maybe just leave it as it was?

Yes, this passes through subclasses -- which is the point.

I don't think we have to worry about classed that define array_function but do not override np.all - they're new and they can override easily as part of their trials; maybe just leave it as it was?

I don't exactly disagree with this, but this did come up with the ndarray subclass I made for unit testing purposes. With the arrival of __array_function__, it will be easier to depend on built-in methods like all() rather than np.all(). I guess we offer no guarantees for subclasses that don't implement the full numpy API, but it still feels like not a great user experience.

Two other ways to do this:

Cast to a NumPy boolean scalar/array, which guarantees that we have an all() method: np.bool_(x_id == y_id).all(). This looks less hacky and would still work for all the identified use-cases.

Give up on duck-typing for assert_array_equal, and instead decorate it with array_function_dispatch for explicit dispatching.

In that case, if this won't cause any problems, then I agree with this design.

Option 1 seems to cover all bases, and is shorter than what you have now.

Hmm. Testing out masked arrays, they no longer seem to return masked arrays from .all() or np.all():

In [16]: x = np.ma.array([1, 2, 3], mask=[False, True, False]) In [17]: x == x Out[17]: masked_array(data=[True, --, True], mask=[False, True, False], fill_value=True) In [18]: (x == x).all() Out[18]: True

That should cause a test failure, and if there is no test like that we should add one. Also note there are test failures with datetime64

That should cause a test failure, and if there is no test like that we should add one. Also note there are test failures with datetime64

I'm not sure. What would you do with a mask at all on a reduction? I mean, it makes sense to only reduce over the non-masked objects, but what is the output mask? For zero nonmasked elements do you set it to the identity of the ufunc or to False? Or do you just mask everything where any element is masked? Likely not the right answer.

mhvk · 2018-10-17T21:20:06Z

Wow, that should cover it! Thanks, @shoyer!

shoyer · 2018-10-19T16:19:38Z

OK, I plan to merge this shortly unless I get more feedback.

…#29317) TestArrayEqual.test_masked_scalar now passes. This case regressed since 7315145 (merged in numpy#12119) due to: - `<masked scalar> == <scalar>` returning np.ma.masked (not a 0-dim masked bool array), followed by - `np.bool(np.ma.masked)` unintentionally converting it to np._False There are a few ways to resolve this; I went with testing the comparison result with `isinstance(bool)` to check if a conversion to array is necessary, which is the same approach already taken in assert_array_compare after evaluating `comparison(x, y)`.

…#29317) TestArrayEqual.test_masked_scalar now passes. This case regressed since 7315145 (merged in numpy#12119) due to: - `<masked scalar> == <scalar>` returning np.ma.masked (not a 0-dim masked bool array), followed by - `np.bool(np.ma.masked)` unintentionally converting it to np._False Note on the modified comment: Confusingly, "isinstance(..., bool) checks" in the previous wording actually incorrectly referred to the ones towards the end of the function, which are not actually related to __eq__'s behavior but to the possibility of `func` returning a bool.

… (#29318) * TST: Add failing test TestArrayEqual.test_masked_scalar The two added test cases fail with: E AssertionError: E Arrays are not equal E E nan location mismatch: E ACTUAL: MaskedArray(3.) E DESIRED: array(3.) and E AssertionError: E Arrays are not equal E E nan location mismatch: E ACTUAL: MaskedArray(3.) E DESIRED: array(nan) * BUG: Fix np.testing utils failing for masked scalar vs. scalar (#29317) TestArrayEqual.test_masked_scalar now passes. This case regressed since 7315145 (merged in #12119) due to: - `<masked scalar> == <scalar>` returning np.ma.masked (not a 0-dim masked bool array), followed by - `np.bool(np.ma.masked)` unintentionally converting it to np._False Note on the modified comment: Confusingly, "isinstance(..., bool) checks" in the previous wording actually incorrectly referred to the ones towards the end of the function, which are not actually related to __eq__'s behavior but to the possibility of `func` returning a bool. * MNT: Improve comments on assert_array_compare nan/inf handling logic - Use same language as elsewhere below to explain `!= True` used to handle np.ma.masked - Clarify committed to support standard MaskedArrays - Restore note lost in 7315145 comment changes about how the np.bool casts towards the end of the function handle np.ma.masked, and expand further. * TST: Expand TestArrayEqual.test_masked_scalar

ENH: __array_function__ support for np.lib, part 2

3baa50f

xref GH12028 np.lib.npyio through np.lib.ufunclike

shoyer mentioned this pull request Oct 8, 2018

Tracking issue for implementation of NEP-18 (__array_function__) #12028

Closed

33 tasks

shoyer changed the title ~~ENH: __array_function__ support for np.lib, part 2~~ ENH: __array_function__ support for np.lib, part 2/2 Oct 8, 2018

shoyer added the component: __array_function__ label Oct 8, 2018

charris added 01 - Enhancement component: numpy.lib labels Oct 8, 2018

Fix failures in numpy/core/tests/test_overrides.py

d019e8f

hameerabbasi approved these changes Oct 9, 2018

View reviewed changes

shoyer mentioned this pull request Oct 9, 2018

ENH: __array_function__ support for most of numpy.core #12115

Merged

charris changed the title ~~ENH: __array_function__ support for np.lib, part 2/2~~ WIP, ENH: __array_function__ support for np.lib, part 2/2 Oct 11, 2018

charris added the 25 - WIP label Oct 11, 2018

shoyer added 4 commits October 11, 2018 18:23

CLN: handle depreaction in dispatchers for np.lib.ufunclike

b25c28f

CLN: fewer dispatchers in lib.twodim_base

51b87d4

CLN: fewer dispatchers in lib.shape_base

6d5af66

CLN: more dispatcher consolidation

eab547e

shoyer changed the title ~~WIP, ENH: __array_function__ support for np.lib, part 2/2~~ ENH: __array_function__ support for np.lib, part 2/2 Oct 12, 2018

shoyer added 3 commits October 12, 2018 08:27

BUG: fix test failure

a301581

Merge branch 'master' into array-function-numpy-lib2

658aff0

Use all method instead of function in assert_equal

ea7f866

hameerabbasi suggested changes Oct 16, 2018

View reviewed changes

DOC: indicate n is array_like in scimath.logn

81216bf

mhvk reviewed Oct 16, 2018

View reviewed changes

shoyer added 2 commits October 16, 2018 14:55

MAINT: updates per review

3254f7b

MAINT: more conservative changes in assert_array_equal

f460afa

MAINT: add back in comment

80319c0

hameerabbasi reviewed Oct 17, 2018

View reviewed changes

shoyer added 2 commits October 17, 2018 08:54

MAINT: casting tweaks in assert_array_equal

93db714

MAINT: fixes and tests for assert_array_equal on subclasses

3844ade

shoyer removed the 25 - WIP label Oct 23, 2018

shoyer merged commit 7315145 into numpy:master Oct 23, 2018

shoyer deleted the array-function-numpy-lib2 branch October 23, 2018 00:40

charris changed the title ~~ENH: __array_function__ support for np.lib, part 2/2~~ ENH: __array_function__ support for np.lib, part 2/2 Nov 10, 2018

		@@ -373,6 +386,11 @@ def tri(N, M=None, k=0, dtype=float):
		return m


		def _tril_dispatcher(m, k=None):

		@@ -812,6 +842,11 @@ def tril_indices(n, k=0, m=None):
		return nonzero(tri(n, m, k=k, dtype=bool))


		def _tril_indices_form_dispatcher(arr, k=None):

		@@ -183,6 +194,11 @@ def imag(val):
		return asanyarray(val).imag


		def _iscomplex_dispatcher(x):

Uh oh!

ENH: __array_function__ support for np.lib, part 2/2 #12119

ENH: __array_function__ support for np.lib, part 2/2 #12119

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ENH: `__array_function__` support for `np.lib`, part 2/2 #12119

ENH: `__array_function__` support for `np.lib`, part 2/2 #12119