bpo-27212: Modify islice recipe to consume initial values preceding start #6195

csabella · 2018-03-22T23:27:52Z

https://bugs.python.org/issue27212

rhettinger · 2018-03-23T18:37:45Z

Doc/library/itertools.rst

@@ -435,8 +435,13 @@ loops that truncate the stream.
          # islice('ABCDEFG', 2, 4) --> C D
          # islice('ABCDEFG', 2, None) --> C D E F G
          # islice('ABCDEFG', 0, None, 2) --> A C E G
+          # it = iter('abcdefghi')


Please omit the three additional comment lines and only include the code changes.

Instead, please update Lib/test/test_itertools.py to add a new class, TestPurePythonRoughEquivalents() that has a test_islice_recipe() method that runs all the test_islice() tests against the pure python version of islice().

While we're at it, let's modify the consume() recipe to have n default to None:

- def consume(iterator): + def consume(iterator, n=None):

Let's also add a test to Lib/test/test_itertools.py libreftest section so that the consume() code gets exercised

>>> it = iter(range(10)) >>> consume(it, 3) >>> next(it) 3 >>> consume(it) >>> next(it, 'Done') 'Done'

EDIT: Upon reflection, I think you want the pure python version to pass all the tests otherwise you wouldn't have suggested the new test class. Your code example is so well written and succinct that I was afraid to mess it up by adding code for it to pass the tests. But I think it would be best for me to take a shot at it first and then I can always remove the change if that wasn't your intent. Sorry for the noise.

Thank you for the suggestions. I have a question about how far the pure python recipe should be changed to get the tests to work. With adding the tests for the islice_recipe, I'm able to do this to avoid duplicating code:

class TestPurePythonRoughEquivalents(unittest.TestCase): @staticmethod def islice(iterable, *args): s = slice(*args) it = iter(range(s.start or 0, s.stop or sys.maxsize, s.step or 1)) try: nexti = next(it) except StopIteration: # Consume *iterable* up to the *start* position. for i, element in zip(range(s.start or 0), iterable): pass return for i, element in enumerate(iterable): if i == nexti: yield element try: nexti = next(it) except StopIteration: return def test_islice_recipe(self): global islice orig_islice = islice islice = self.islice TestBasicOps().test_islice() islice = orig_islice

However, test_islice() in TestBasicOps() has tests that check for invalid arguments with TypeError and ValueError, which the current recipe doesn't have. Should I expand the recipe for these cases?

Also, there's a test for issue 10323 that fails with the pure python version:

c = count() self.assertEqual(list(islice(c, 1, 3, 50)), [1]) self.assertEqual(next(c), 3)

It returns 2 instead of 3.

And the test for pickling fails with TypeError: can't pickle generator objects.

I don't know the way to get around the pickle error, but for the others, should I modify the recipe to pass all the tests?

Thanks!

We don't need to modify the recipe or try to pass all of the regular islice() tests. Instead, we just want to test the recipe to make sure its basic functionality is okay. The pure python rough equivalent is allowed to not be a class, not be pickleable, and to omit error checks, etc, but it does need to be good at slicing :-) The problem we're fixing here arose because the islice(it, n, n) case wasn't tested to see if it advanced t 8000 he iterator. That is core functionality, so it warrants a test.

Also remember, the main purpose of the pure python rough equivalents it to augment the docs to provide additional insights that aren't easily conveyed in English. It needs to be short and expressive. A true exact equivalent, drop-in substitute would likely lose the pithy explanatory value.

There's no need to work very hard on this one. We want to make a minimal change to the recipe to make it correct with regards to basic functionality, and then add more testing to make sure the job was done right. The latter step should be easy as well -- just copy the relevant lines from test_islice().

Thank you for explaining that.

A few notes on what the changes I made.

I changed the recipe for the stop point. This had a test referencing bpo-10323 and, to your point, it affects the slicing. But, it also makes the recipe less pithy, so it may be too much of an edge case to worry about in the recipe? I wanted to highlight it in this commit even though you ultimately may not want to add it.

There were no existing test cases where start = stop in test_islice, so I added one under the comment for consume. I thought maybe it should replace the existing test case for consume since it's more of an edge case?

# Test number of items consumed SF #1171417 it = iter(range(10)) self.assertEqual(list(islice(it, 3)), list(range(3))) self.assertEqual(list(it), list(range(3, 10))) it = iter(range(10)) self.assertEqual(list(islice(it, 3, 3)), []) self.assertEqual(list(it), list(range(3, 10)))

I added the second test to reflect this issue.

Thanks!

rhettinger · 2018-03-23T20:01:52Z

Doc/library/itertools.rst

          s = slice(*args)
-          it = iter(range(s.start or 0, s.stop or sys.maxsize, s.step or 1))
+          it = iter(range(0, s.stop or sys.maxsize, s.step or 1))
+          for i in zip(range(0, s.start or 0), iterable):


I'm thinking the additional code just covers a special case, so it would be better to put the new code inside the exception handler. That way, it doesn't interfere with out understanding of the common case and it doesn't slow down the common case. It also allows us to put in a clarifying comment about why the code is there. Also, the variable on the for-loop should be "i, element" to make clear what the zip components are. Trapping the tuple in a single pair named i is likely confusing because it means a tuple in one place and an index offset in other places:

s = slice(*args) it = iter(range(s.start or 0, s.stop or sys.maxsize, s.step or 1)) try: nexti = next(it) except StopIteration: # consume the iterable up the *start* position: for i, element in zip(range(s.start or 0), iterable): pass return for i, element in enumerate(iterable): if i == nexti: yield element nexti = next(it)

bedevere-bot · 2018-03-23T20:02:04Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

And if you don't make the requested changes, you will be poked with soft cushions!

rhettinger · 2018-03-25T07:41:38Z

Doc/library/itertools.rst

              return
          for i, element in enumerate(iterable):
              if i == nexti:
                  yield element
-                  nexti = next(it)
+                  try:


Let's move the try/except outside the for-loop:

try: for i, element in enumerate(iterable): if i == nexti: yield element nexti = next(it) except StopIteration: # Consume to *stop*. for i, element in zip(range(i + 1, stop), iterable): pass

rhettinger · 2018-03-25T07:51:53Z

Doc/library/itertools.rst

+                  try:
+                      nexti = next(it)
+                  except StopIteration:
+                      # Consume to *stop*.


For now, keep this consume-to-stop code as you currently have it. I'm taking a while to mull over whether this needs to be part of the recipe or whether to count this distracting and exotic corner case as the "rough" part of the rough equivalent.

rhettinger · 2018-03-25T07:52:51Z

Lib/test/test_itertools.py

+        it = iter(range(10))
+        self.assertEqual(list(self.islice(it, 3, 3)), [])
+        self.assertEqual(list(it), list(range(3, 10)))
+        # Test that slice finishes in predictable state.


This is the part that I'm still thinking about.

rhettinger · 2018-03-25T07:53:44Z

Misc/NEWS.d/next/Documentation/2018-03-22-19-23-04.bpo-27212.wrE5KR.rst

@@ -0,0 +1,2 @@
+Modify documentation for the :func:`islice` recipe to consume initial values
+up to start.


up to the start index.

rhettinger

Except for the one little nit, this looks ready to go. Thanks for your work.

rhettinger · 2018-03-26T06:24:19Z

Lib/test/test_itertools.py

+            # Consume to *stop*.
+            for i, element in zip(range(i + 1, stop), iterable):
+                pass
+            return


Let's drop the "return" on the final line so that it doesn't look like an early-out as opposed to a normal end.

csabella · 2018-03-26T14:37:04Z

Raymond, thank you for the reviews and guiding me through this PR.

Were you interested in having the other Python rough equivalent functions added to the new test class? I'd like to contribute those changes if it's something you think should be added.

miss-islington · 2018-03-27T01:29:35Z

Thanks @csabella for the PR, and @rhettinger for merging it 🌮🎉.. I'm working now to backport this PR to: 3.6, 3.7.
🐍🍒⛏🤖

bedevere-bot · 2018-03-27T01:30:52Z

GH-6266 is a backport of this pull request to the 3.7 branch.

…tart (pythonGH-6195) (cherry picked from commit da1734c) Co-authored-by: Cheryl Sabella <cheryl.sabella@gmail.com>

bedevere-bot · 2018-03-27T01:31:44Z

GH-6267 is a backport of this pull request to the 3.6 branch.

…tart (GH-6195) (#GH-6266) (cherry picked from commit da1734c) Co-authored-by: Cheryl Sabella <cheryl.sabella@gmail.com>

…tart (GH-6195) (GH-6267) (cherry picked from commit da1734c) Co-authored-by: Cheryl Sabella <cheryl.sabella@gmail.com>

rhettinger · 2018-03-27T02:29:14Z

This will also need a backport to 2.7 but it won't need the try/except around the next(i) because the StopIteration just bubbles up to end the iteration.
No need to put in more recipe tests unless we're modifying them for some reason.

miss-islington · 2018-03-27T02:29:35Z

Thanks @csabella for the PR, and @rhettinger for merging it 🌮🎉.. I'm working now to backport this PR to: 2.7.
🐍🍒⛏🤖

miss-islington · 2018-03-27T02:30:40Z

Sorry, @csabella and @rhettinger, I could not cleanly backport this to 2.7 due to a conflict.
Please backport using cherry_picker on command line.
cherry_picker da1734c58d2f97387ccc9676074717d38b044128 2.7

rhettinger · 2018-03-27T19:31:13Z

Cheryl, can you work on a Python 2.7 backport?

csabella · 2018-03-28T09:43:42Z

Yes, I'll work on the backport. I had hoped to do it yesterday but just didn't get the time. I'll try to do it today.

csabella · 2018-03-28T12:49:58Z

This will also need a backport to 2.7 but it won't need the try/except around the next(i) because the StopIteration just bubbles up to end the iteration.

I apologize, but I don't quite understand how to consume the iterator up to start without the try/except block. If I remove the try/except, then it doesn't get to the loop after the next():

   nexti = next(it)
   for i, element in zip(xrange(start), iterable):
       pass

>>> i = iter(xrange(10))
>>> list(islice(i, 3, 3))
[]
>>> list(i)
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
>>>

rhettinger · 2018-03-30T16:33:22Z

You're right. The try/excepts need to stay now that we're doing something more interesting than just a return in the except-block. :-)

For Python 2.7, the range() and zip() should become xrange() and izip().

…ding start (pythonGH-6195) (cherry picked from commit da1734c)

bedevere-bot · 2018-04-01T22:53:38Z

GH-6339 is a backport of this pull request to the 2.7 branch.

…ding start (GH-6195) (GH-6339) (cherry picked from commit da1734c)

bpo-27212: Modify islice recipe to consume initial values

0f51d3d

csabella requested a review from rhettinger as a code owner March 22, 2018 23:27

the-knights-who-say-ni added the CLA signed label Mar 22, 2018

bedevere-bot added the awaiting review label Mar 22, 2018

rhettinger requested changes Mar 23, 2018

View reviewed changes

bedevere-bot removed the awaiting review label Mar 23, 2018

bedevere-bot added the awaiting changes label Mar 23, 2018

rhettinger added needs backport to 3.6 docs Documentation in the Doc dir labels Mar 23, 2018

Requested changes and tests.

93504c6

rhettinger requested changes Mar 25, 2018

View reviewed changes

Move try block and update blurb

e73983d

rhettinger requested changes Mar 26, 2018

View reviewed changes

Remove unneeded return

6a064a3

rhettinger approved these changes Mar 27, 2018

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting changes labels Mar 27, 2018

rhettinger merged commit da1734c into python:master Mar 27, 2018

bedevere-bot removed awaiting merge needs backport to 3.7 labels Mar 27, 2018

bedevere-bot removed the needs backport to 3.6 label Mar 27, 2018

rhettinger pushed a commit that referenced this pull request Mar 27, 2018

bpo-27212: Modify islice recipe to consume initial values preceding s…

f328caf

…tart (GH-6195) (#GH-6266) (cherry picked from commit da1734c) Co-authored-by: Cheryl Sabella <cheryl.sabella@gmail.com>

rhettinger pushed a commit that referenced this pull request Mar 27, 2018

bpo-27212: Modify islice recipe to consume initial values preceding s…

c8698cf

…tart (GH-6195) (GH-6267) (cherry picked from commit da1734c) Co-authored-by: Cheryl Sabella <cheryl.sabella@gmail.com>

rhettinger added the needs backport to 2.7 label Mar 27, 2018

csabella deleted the issue27212 branch March 27, 2018 09:52

csabella added a commit to csabella/cpython that referenced this pull request Apr 1, 2018

[2.7] bpo-27212: Modify islice recipe to consume initial values prece…

ebc5ee8

…ding start (pythonGH-6195) (cherry picked from commit da1734c)

csabella mentioned this pull request Apr 1, 2018

[2.7] bpo-27212: Modify islice recipe to consume initial values prece… #6339

Merged

bedevere-bot removed the needs backport to 2.7 label Apr 1, 2018

rhettinger pushed a commit that referenced this pull request Apr 2, 2018

[2.7] bpo-27212: Modify islice recipe to consume initial values prece…

325191b

…ding start (GH-6195) (GH-6339) (cherry picked from commit da1734c)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-27212: Modify islice recipe to consume initial values preceding start #6195

bpo-27212: Modify islice recipe to consume initial values preceding start #6195

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		@@ -0,0 +1,2 @@
		Modify documentation for the :func:`islice` recipe to consume initial values
		up to start.

Uh oh!

bpo-27212: Modify islice recipe to consume initial values preceding start #6195

bpo-27212: Modify islice recipe to consume initial values preceding start #6195

Uh oh!

Conversation

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!