MAINT: use more conservative integer types for umath linalg #5899

argriffing · 2015-05-21T01:53:12Z

I'm not sure this is useful or even works.

jaimefrio · 2015-05-21T03:57:21Z

numpy/linalg/umath_linalg.c.src


    if (!compute_urows_vtcolumns(jobz, m, n, &u_row_count, &vt_column_count))
        goto error;

-    u_size = ((size_t)u_row_count)*m*sizeof(@ftyp@);
-    vt_size = n*((size_t)vt_column_count)*sizeof(@ftyp@);
+    u_size = ((size_t)u_row_count) * safe_m * sizeof(@ftyp@);


Why cast here? Wouldn't safe_u_row_count and safe_vt_col_count be more consistent choices?
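For context, the size computations in the diff multiply several counts together as size_t. A minimal sketch of how such a product could additionally be checked for wraparound is shown here; `checked_mul_size` is a hypothetical helper for illustration and is not part of umath_linalg.c.src or of this PR.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (an assumption, not numpy code): multiply two
 * size_t values and report whether the product fits in size_t.
 * Returns 1 and stores the product in *out on success, 0 on overflow. */
static int
checked_mul_size(size_t a, size_t b, size_t *out)
{
    /* a * b overflows exactly when b > SIZE_MAX / a (for a != 0). */
    if (a != 0 && b > SIZE_MAX / a) {
        return 0;
    }
    *out = a * b;
    return 1;
}
```

A three-factor size such as rows * m * sizeof(element) could be built by chaining two calls and going to the error path if either returns 0.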

jaimefrio · 2015-05-21T03:58:29Z

Mostly LGTM. It certainly cannot hurt...

argriffing · 2015-05-21T18:07:07Z

The latest commit has addressed some comments about notation consistency.

There is still room for improvement, but that may be out of scope for this PR. For example, there are function calls of the form underlying_lapack_function(..., (low_precision_int_type) high_precision_workspace_size, ...); where the narrowing cast could be checked for overflow. Also, I think the malloc failure branch is weird: as far as I can tell it sets the output values to zero without raising any Python warning or exception. These questions could be related to #3217.
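A minimal sketch of the kind of overflow check meant for those narrowing casts follows. It assumes the LAPACK interface takes a plain int; `checked_size_to_int` is a hypothetical name and does not exist in umath_linalg.c.src.

```c
#include <limits.h>
#include <stddef.h>

/* Hypothetical helper (an assumption, not numpy code): narrow a size_t
 * workspace size to the int that a 32-bit LAPACK interface expects.
 * Returns 1 and stores the value in *out if it fits, 0 otherwise. */
static int
checked_size_to_int(size_t value, int *out)
{
    if (value > (size_t)INT_MAX) {
        return 0;  /* would be truncated by a plain cast */
    }
    *out = (int)value;
    return 1;
}
```

The caller would take its error branch when the helper returns 0 instead of passing a truncated value to LAPACK.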

jaimefrio · 2015-05-21T18:30:24Z

Is this ready to merge then? Or do we want to figure out what exactly is going on with the segfault in #5898 before putting it in?

argriffing · 2015-05-21T19:27:05Z

@jaimefrio I'm pretty sure the reason for the segfault in #5898 is that

1. After working around some overflow problems, the numpy allocation code no longer attempts to allocate a negative number of things, so it no longer prematurely returns from the function call. Therefore it actually reaches the point of calling the underlying LAPACK dgesv function.
2. Some implementations of the underlying LAPACK function (e.g. OpenBLAS, apparently) can deal with input matrices of size NxN, even with an interface that uses int32 for N and even if N*N would overflow 32 bits. For example OpenBLAS uses blasint for its interface (e.g. for passing N) which sounds like it could be 32-bit, whereas the function that does the actual indexing uses BLASLONG which sounds less likely to overflow.
   https://github.com/xianyi/OpenBLAS/blob/develop/interface/lapack/gesv.c
   https://github.com/xianyi/OpenBLAS/blob/develop/lapack/getrf/getrf_single.c
3. But the numpy lapack-lite implementation seems to internally use int32 indices into the flat NxN memory which overflows and segfaults.
   https://github.com/numpy/numpy/blob/maintenance/1.9.x/numpy/linalg/lapack_lite/f2c.h#L10
   https://raw.githubusercontent.com/numpy/numpy/maintenance/1.9.x/numpy/linalg/lapack_lite/dlapack_lite.c

My understanding from #5898 (comment) is that the blas-lite and lapack-lite included with numpy are already known to cause segfaults for these larger matrices (e.g. using dot) because of overflow issues like this.
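To make the suspected index overflow concrete, here is a small sketch that computes the flat offset of an element in an n-by-n matrix in 64 bits and checks whether it would still fit in a 32-bit signed index. The function names are hypothetical, and 0-based column-major indexing is used for simplicity; this is not the lapack-lite code itself.

```c
#include <stdint.h>

/* Hypothetical illustration: flat column-major offset of element (i, j)
 * in an n-by-n matrix, computed in 64 bits so it cannot wrap here. */
static int64_t
flat_index_64(int32_t i, int32_t j, int32_t n)
{
    return (int64_t)i + (int64_t)j * (int64_t)n;
}

/* Returns 1 if that offset is representable as int32, 0 otherwise. */
static int
flat_index_fits_int32(int32_t i, int32_t j, int32_t n)
{
    return flat_index_64(i, j, n) <= (int64_t)INT32_MAX;
}
```

Under these assumptions, the last element of a 46340x46340 matrix still has an offset below 2^31, while the last element of a 46341x46341 matrix does not, so code that forms the offset in int32 would overflow for matrices of that size and larger.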

I'm not 100% sure that this is what's going on with the segfault, but this is my best guess. Maybe wait for comments or a review from @pv or others before merging?

pv · 2015-05-21T19:46:05Z

LGTM (assuming no typos)

jaimefrio · 2015-05-21T20:01:59Z

In it goes then. Thanks!

MAINT: use more conservative integer types for umath linalg

ad4aa25

argriffing mentioned this pull request May 21, 2015

Numpy inverse of very large matrix returns all-zero matrix without error #5898

Closed

jaimefrio reviewed May 21, 2015

MAINT: more consistent notation in umath_linalg

b9f5e85

jaimefrio added a commit that referenced this pull request May 21, 2015

Merge pull request #5899 from argriffing/improve-umath-linalg

9dba7a4

jaimefrio merged commit 9dba7a4 into numpy:master May 21, 2015
