Notifications
Fix npz header incompatibility #5178
Conversation
I am unable to generate a test file with the 'L' addition; it probably requires 32 bits or Windows.
Force-pushed from 095ff53 to edb4536
OK, added a valid test file that has the problem. I think it needs 64-bit numpy running on Windows.
Force-pushed from edb4536 to bd606db
        token_string = token[1]
        if (last_token_was_number and
            token_type == tokenize.NAME and
            token_string == "L"):
whitespace here is weird, though it doesn't really matter
My feeling is that the condition should be easily distinguished from the body. See also PEP8, indentation. The other option is a (useless) comment ;)
Fair enough! It's totally readable as is so IMO it doesn't matter at all.
On Mon, Oct 13, 2014 at 12:30 AM, Charles Harris notifications@github.com wrote:
In numpy/lib/format.py:
    """
    import tokenize
    if sys.version_info[0] >= 3:
        # In Python3 stderr, stdout are text files.
        from io import StringIO
    else:
        from StringIO import StringIO

    tokens = []
    last_token_was_number = False
    for token in tokenize.generate_tokens(StringIO(asstr(s)).read):
        token_type = token[0]
        token_string = token[1]
        if (last_token_was_number and
            token_type == tokenize.NAME and
            token_string == "L"):
My feeling is that the condition should be easily distinguished from the body. See also PEP8, indentation. The other option is a (useless) comment ;)
Reply to this email directly or view it on GitHub
https://github.com/numpy/numpy/pull/5178/files#r18751104.
Nathaniel J. Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org
LGTM, up to you whether you care enough to fix the whitespace thing
Force-pushed from bd606db
The Python2-generated file had long integer literals like '1L' that broke in Python3. The fix here filters out the 'L' and lets safe_eval take care of the integer type when converting the string. The fix comes from Nathaniel Smith with a few added fixups. Closes numpy#5170.
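The approach described in the commit message can be sketched as follows. This is a simplified, hypothetical version (the function name and Python-3-only import are my own); the actual helper in the PR lives in numpy/lib/format.py and also handles Python 2:

```python
import tokenize
from io import StringIO

def strip_long_suffix(header):
    # Sketch of the fix: the Python 3 tokenizer splits a Python 2 long
    # literal like "1L" into NUMBER "1" followed by NAME "L", so we can
    # drop any bare "L" that directly follows a number and rebuild the
    # string; safe_eval then sees a plain integer literal.
    tokens = []
    last_token_was_number = False
    for token in tokenize.generate_tokens(StringIO(header).readline):
        token_type, token_string = token[0], token[1]
        if (last_token_was_number and
                token_type == tokenize.NAME and
                token_string == "L"):
            last_token_was_number = False
            continue  # skip the stray 'L' token
        tokens.append(token)
        last_token_was_number = (token_type == tokenize.NUMBER)
    return tokenize.untokenize(tokens)
```

For example, a Python-2-written shape tuple such as "(1L, 3L)" comes back as a string that evaluates to (1, 3).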
Force-pushed from 53ff0b1 to 8b1f90a
Will this take care of structured array dtypes? Can we fix the write path in a similar way?
@juliantaylor I think the write path should be fixable in the same way. I think the dtype should work, as the strings should not parse as numbers, but without a test file I can't be sure. The test files need to be generated on win64 with 64-bit Python2.
Alright, I'll merge it to simplify testing; if there are issues we can follow up. A write-path fix would be nice. Thanks!
That example dtype should be fine -- the fixup code just blindly strips out any 'L' token that directly follows a number.
On Mon, Oct 13, 2014 at 6:58 PM, Julian Taylor notifications@github.com wrote:
Nathaniel J. Smith
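To illustrate why quoted field names are safe from this blind stripping (my own example, not from the thread): the tokenizer treats a quoted name as a single STRING token, and only a NAME token "L" immediately following a NUMBER token gets dropped.

```python
import tokenize
from io import StringIO

# Hypothetical header fragment: a field name containing 'L' next to a
# Python 2 long literal.
header = "{'names': ['fieldL'], 'num': 2L}\n"
tokens = list(tokenize.generate_tokens(StringIO(header).readline))

# 'fieldL' survives as one STRING token (quotes included), while the
# '2L' literal splits into NUMBER '2' plus a bare NAME 'L' that the
# fixup can safely remove.
strings = [t[1] for t in tokens if t[0] == tokenize.STRING]
```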
I can verify that
I do think there might be problems with unicode field names. Currently, latin1 encoding is assumed for the header, and that might break down. The header should probably be utf-8 encoded.
But unicode field names are not backward compatible with python2 ndarrays.
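A quick illustration of the encoding concern above (the field name here is hypothetical): latin1 can only represent code points up to U+00FF, so an arbitrary unicode field name may not survive a latin1-encoded header, while utf-8 covers all of unicode.

```python
name = "θ_field"  # hypothetical field name; 'θ' (U+03B8) has no latin1 byte

def encodable(s, codec):
    # Report whether the text can be written out under `codec`.
    try:
        s.encode(codec)
        return True
    except UnicodeEncodeError:
        return False

latin1_ok = encodable(name, "latin1")  # fails for non-latin1 names
utf8_ok = encodable(name, "utf-8")     # utf-8 encodes any unicode string
```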
Replicates: numpy/numpy#5178
Fixes #5170.
In addition, *.npy test files produced in both Python2 and Python3 are added so that compatibility between versions can be tested.
Rebased for backport to 1.9.