ENH: Allow size=0 in numpy.random.choice #11383

mattip · 2018-06-19T22:48:33Z

Replaces lost PR #8717. Fixes #8311. Includes tests.

@MareinK if you wish to continue I can give you permissions to push to the source repo for the PR.

ghost · 2018-06-20T09:14:27Z

Thanks! Since you solved the merge conflict, I'm not sure what else needs to be done?

mattip · 2018-06-20T21:40:04Z

@eric-wieser approved the PR that this replaces, but requested a review from @seberg.

Edit: typo

eric-wieser · 2018-06-20T21:59:26Z

numpy/random/tests/test_random.py

+        assert_equal(np.random.randint(0,0,(3,0,4)).shape, (3,0,4))
+        assert_equal(np.random.randint(0,-10,0).shape, (0,))
+        assert_equal(np.random.choice(0,0).shape, (0,))
+        assert_equal(np.random.choice([],(0,)).shape, (0,))


nit: these tests would be clearer if the size/shape argument was passed by kwarg - too many zeros here

eric-wieser · 2018-06-20T21:59:59Z

numpy/random/mtrand/mtrand.pyx

-            if pop_size is 0:
-                raise ValueError("a must be non-empty")
+            if pop_size is 0 and np.prod(size) != 0:
+                raise ValueError("a cannot be empty unless no samples are taken")


If quotes are being added to all the as, one is missed here

eric-wieser · 2018-06-20T22:00:07Z

doc/release/1.16.0-notes.rst

+Even when no elements needed to be drawn, ``np.random.randint`` and
+``np.random.choice`` raised an error when the arguments described an empty
+distribution. This has been fixed so that e.g.
+``np.random.choice([],0) == np.array([],dtype=float64)``.


nit: spaces after commas

fixed, also replaced all spaces after comments in these three files grep ",[^ ]"

eric-wieser · 2018-06-21T00:04:37Z

numpy/random/tests/test_random.py

-        out1 = np.empty((len(self.seeds),) + sz)
-        out2 = np.empty((len(self.seeds),) + sz)
+        out1 = np.empty((len(self.seeds), ) + sz)
+        out2 = np.empty((len(self.seeds), ) + sz)


I guess (x,) is an exception to the space after comma rule. Let me clarify that to "space after comma not before a closing group, like ] or )"

Fixed.
Shows that I should check the standard before deciding matters of taste preference.

eric-wieser · 2018-06-21T06:53:08Z

Ideally all of the docstring whitespace changes would just go in a separate STY commit (or even PR), rather than muddying the blame.

eric-wieser · 2018-06-21T06:54:01Z

numpy/random/tests/test_random.py

@@ -440,6 +440,14 @@ def test_choice_return_shape(self):
        assert_equal(np.random.choice(6, s, replace=False, p=p).shape, s)
        assert_equal(np.random.choice(np.arange(6), s, replace=True).shape, s)

+        # Check zero-size
+        assert_equal(np.random.randint(0, 0, size=(3, 0, 4)).shape, (3, 0, 4))


A test for randint(10, 10, size=0) might be good too - just to check that we didn't special case 0 somehow

seberg

Seems OK to me, allowing empty probabilities might be nice.

seberg · 2018-06-21T17:18:47Z

numpy/random/mtrand/mtrand.pyx

-            if pop_size <= 0:
-                raise ValueError("a must be greater than 0")
+                raise ValueError("'a' must be 1-dimensional or an integer")
+            if pop_size <= 0 and np.prod(size) != 0:


a bit unintuitive that this works for None, but I guess OK, could also add size is None explicitly.

That also has the advantage that np.prod probably adds a bit of overhead, but maybe negligible in any case.

seberg · 2018-06-21T17:19:28Z

numpy/random/mtrand/mtrand.pyx

-                raise ValueError("a must be greater than 0")
+                raise ValueError("'a' must be 1-dimensional or an integer")
+            if pop_size <= 0 and np.prod(size) != 0:
+                raise ValueError("'a' must be greater than 0 unless no samples are taken")


I somewhat thought it was backticks ;). This is good, I do not think we or python has serious guidelines for errors.

seberg · 2018-06-21T17:24:27Z

numpy/random/mtrand/mtrand.pyx

            if p.size != pop_size:
-                raise ValueError("a and p must have same size")
+                raise ValueError("'a' and 'p' must have same size")
            if np.logical_or.reduce(p < 0):
                raise ValueError("probabilities are not non-negative")
            if abs(kahan_sum(pix, d) - 1.) > atol:


I think if you test for an empty probabilities array, you will see that this check fails also, so might as well allow that too?

seberg · 2018-06-21T17:26:19Z

numpy/random/mtrand/mtrand.pyx

-            raise ValueError("low >= high")
-
+        if ilow >= ihigh and np.prod(size) != 0:
+            raise ValueError("Range cannot be empty (low >= high) unless no samples are taken")


Was suprised for a bit here, but I guess we do it like a python range and allow strange ranges as empty ranges, seems fine to me.

mattip · 2018-06-21T18:43:17Z

IMO handling 'p' should be a separate PR, see issues #11250, #9867, #6132, and PR #11264

seberg · 2018-06-21T18:44:32Z

OK, fine with me, thought it might have been an oversight.

bashtage · 2018-06-24T12:03:11Z

numpy/random/mtrand/mtrand.pyx

        elif a.ndim != 1:
-            raise ValueError("a must be 1-dimensional")
+            raise ValueError("'a' must be 1-dimensional")


I don't think it is normal to escape argument names in error messages, see, e.g.,

https://github.com/numpy/numpy/blob/master/numpy/random/mtrand/mtrand.pyx#L988

My guess is, you can find examples for everything both in numpy and the standard lib. Personally, I think quotes probably make it slightly more discoverable what a refers to, so sounds good to me. (e.g. something says axis is invalid may not refer to an axis argument, something saying `axis` is invalid certainly does).

Anyway, I think the PR looks good and you guys can put it in if you like.

ghost · 2018-06-24T18:13:12Z

Since this issue and PR came from me originally: thanks to al 9F19 l who helped complete it!

ENH: Allow size=0 in numpy.random.choice

95dada6

charris added 01 - Enhancement component: numpy.random labels Jun 20, 2018

eric-wieser reviewed Jun 20, 2018

View reviewed changes

eric-wieser reviewed Jun 21, 2018

View reviewed changes

mattip force-pushed the recreate-8717 branch from 66fd6ad to 1a85409 Compare June 21, 2018 00:25

eric-wieser reviewed Jun 21, 2018

View reviewed changes

mattip force-pushed the recreate-8717 branch from 1a85409 to 7f3b191 Compare June 21, 2018 16:11

fixes from review

9013b0f

mattip force-pushed the recreate-8717 branch from 7f3b191 to 9013b0f Compare June 21, 2018 16:36

eric-wieser approved these changes Jun 21, 2018

View reviewed changes

eric-wieser requested a review from seberg June 21, 2018 16:59

mattip mentioned this pull request Jun 21, 2018

STY: ensure commas are followed by spaces #11401

Open

seberg reviewed Jun 21, 2018

View reviewed changes

bashtage reviewed Jun 24, 2018

View reviewed changes

eric-wieser merged commit 464f79e into numpy:master Jun 24, 2018

mattip deleted the recreate-8717 branch June 24, 2018 21:48

mattip mentioned this pull request Jul 27, 2018

ENH: Allow size=0 in numpy.random.choice, regardless of array #8717

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: Allow size=0 in numpy.random.choice #11383

ENH: Allow size=0 in numpy.random.choice #11383

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ENH: Allow size=0 in numpy.random.choice #11383

ENH: Allow size=0 in numpy.random.choice #11383

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!