ENH: rewrite ma.median to improve poor performance for multiple dimensions #4760

juliantaylor · 2014-05-30T17:08:07Z

Computing many masked medians along a small dimension is extremely slow
because apply_along_axis iterates entirely in Python. The unmasked
median is about 1000x faster.

Work around this issue by using indexing to select the median element
instead of apply_along_axis.
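
As an illustration of the sort-plus-indexing idea, here is a minimal sketch. It is not the code in this PR: the function name `masked_median_sketch` is hypothetical, it takes plain `data` and `mask` arrays rather than a masked array, it assumes the valid data contains no NaN, and it relies on `np.take_along_axis`, which was added to NumPy after this PR was written.

```python
import numpy as np

def masked_median_sketch(data, mask, axis=-1):
    """Median over unmasked entries along `axis`, without a Python-level loop."""
    data = np.asarray(data, dtype=float)
    mask = np.asarray(mask, dtype=bool)

    # Replace masked entries with +inf so that sorting pushes them to the end
    # of every lane (assumption: the valid data contains no NaN).
    ordered = np.sort(np.where(mask, np.inf, data), axis=axis)

    # Number of valid entries per lane, kept broadcastable against `ordered`.
    n_valid = np.sum(~mask, axis=axis, keepdims=True)

    # Positions of the lower and upper middle elements among the valid ones.
    # They coincide for odd counts; the lower one is clamped to 0 for fully
    # masked lanes.
    lower = np.maximum((n_valid - 1) // 2, 0)
    upper = n_valid // 2

    low = np.take_along_axis(ordered, lower, axis=axis)
    high = np.take_along_axis(ordered, upper, axis=axis)

    median = 0.5 * (low + high)
    # Fully masked lanes have no median; report NaN for them.
    median = np.where(n_valid == 0, np.nan, median)
    return np.squeeze(median, axis=axis)

data = np.array([[1.0, 2.0, 3.0, 4.0],
                 [8.0, 5.0, 7.0, 6.0]])
mask = np.array([[False, False, False, True],
                 [False, False, False, False]])
result = masked_median_sketch(data, mask, axis=1)
```

The whole computation is a sort plus two gathers, so the per-lane cost no longer includes a Python function call.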

Further improvements are possible, e.g. using the current np.nanmedian
approach for masked medians along large dimensions so that partition is
used instead of sort, or extending partition to allow broadcasting over
multiple elements.

Closes gh-4683.

juliantaylor · 2014-05-30T17:09:47Z

I'd like this in 1.9, but it still needs some iterations, e.g. using partition instead of sort for larger dimensions.
This technique should also be applied to the new nanmedian and nanpercentile.
Posted now for comments; maybe I'm overlooking some neat indexing trick that would make this nicer.

juliantaylor · 2014-05-30T17:11:01Z

numpy/ma/core.py

@@ -6188,7 +6188,7 @@ def sort(a, axis= -1, kind='quicksort', order=None, endwith=True, fill_value=Non
    else:
        filler = fill_value
 #    return
-    indx = np.indices(a.shape).tolist()
+    indx = [x for x in np.indices(a.shape)]


@seberg the test suite passes, but I'm not sure this is really equivalent for large dimensions.
If not, an argument to tolist that does not convert to Python longs would be good.

I don't understand. If anything, the old code could misbehave (but it won't unless you plug in a 32-dimensional array).

So when indexing, whether the input is a tuple or an array only matters for the first dimension?

It is wonky. But since you have a list of arrays (or lists), yes, unless the first dimension has more than 32 entries (which is probably a bug for 32-dimensional input in the old code).

Never mind about the bug. I doubt it is there, and you are not changing that part anyway. You would have to pass in a tuple to make it a reliable error. (It doesn't hurt anyway; it is only a tiny bit slower because tuples parse so slowly, the old slow-keyword-arguments issue.)

Actually, a sparse meshgrid seems to give the same result here and uses significantly less memory. I'll update to that.
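
For readers following the indexing discussion, a small sketch of the two variants may help. The helper name `reorder_along_axis` is hypothetical and this is not the code in `numpy/ma/core.py`; it only shows that dense `np.indices` arrays and sparse `np.meshgrid(..., sparse=True, indexing='ij')` arrays select the same elements once the entry for the sorted axis is replaced by an argsort result.

```python
import numpy as np

def reorder_along_axis(a, order, axis, sparse=True):
    """Apply a per-lane index array `order` (e.g. from argsort) along `axis`."""
    if sparse:
        # One small array per dimension, shaped to broadcast against the
        # others: (n0, 1, 1), (1, n1, 1), (1, 1, n2) for a 3-D input.
        index = list(np.meshgrid(*[np.arange(n) for n in a.shape],
                                 sparse=True, indexing='ij'))
    else:
        # Dense variant: every index array has the full shape of `a`.
        index = list(np.indices(a.shape))
    # Replace the index along `axis` with the desired ordering.
    index[axis] = order
    # Index with a tuple so each entry addresses exactly one dimension.
    return a[tuple(index)]

rng = np.random.default_rng(0)
a = rng.random((4, 5, 3))
order = np.argsort(a, axis=2)
dense_result = reorder_along_axis(a, order, axis=2, sparse=False)
sparse_result = reorder_along_axis(a, order, axis=2, sparse=True)
```

The sparse index arrays hold only `sum(a.shape)` integers in total instead of `a.ndim * a.size`, which is where the memory saving comes from.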

juliantaylor · 2014-05-31T11:55:28Z

Added usage of the masked median to nanmedian at a cutoff of 400 elements along the axis. On larger arrays, the faster partition gains more than apply_along_axis costs.

Since sorting NaNs has the same property as a masked sort (moving them to the end), one could probably duplicate the masked median code to squeeze out some more performance, as the masked sort is heavy on indexing and memory.
The masked median could probably also benefit from a threshold for using partition, but that involves quite extensive refactoring, which would probably take too long to still go into 1.9.
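
A rough sketch of the two nanmedian code paths described here, for illustration only. The names `nanmedian_dispatch` and `_lane_nanmedian` are hypothetical, the strict `<` comparison against the cutoff is an assumption, and the merged code in `numpy/lib/nanfunctions.py` differs in detail.

```python
import numpy as np

def _lane_nanmedian(lane):
    """Median of one 1-D lane, ignoring NaNs, via partition instead of a full sort."""
    valid = lane[~np.isnan(lane)]
    n = valid.size
    if n == 0:
        return np.nan
    k = n // 2
    if n % 2:
        # Odd count: a single middle element.
        return np.partition(valid, k)[k]
    # Even count: mean of the two middle elements.
    part = np.partition(valid, [k - 1, k])
    return 0.5 * (part[k - 1] + part[k])

def nanmedian_dispatch(a, axis=-1, cutoff=400):
    """Pick a strategy based on the lane length along `axis`."""
    a = np.asarray(a, dtype=float)
    if a.shape[axis] < cutoff:
        # Short lanes: the vectorized masked median avoids one Python call
        # per lane. NaNs are masked first; fully masked lanes become NaN.
        masked = np.ma.masked_invalid(a)
        return np.ma.filled(np.ma.median(masked, axis=axis), np.nan)
    # Long lanes: the per-lane Python overhead is amortized, and partition
    # does less work than a full sort.
    return np.apply_along_axis(_lane_nanmedian, axis, a)

rng = np.random.default_rng(1)
x = rng.random((3, 4, 6))
x[0, 1, 2] = np.nan
x[2, 3, :4] = np.nan
small_path = nanmedian_dispatch(x, axis=2)            # 6 < 400: masked median
large_path = nanmedian_dispatch(x, axis=2, cutoff=2)  # 6 >= 2: partition per lane
```

Both paths should agree with `np.nanmedian(x, axis=2)`; the cutoff only trades per-call overhead against the cost of sorting.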

I'm probably not going to do the same for percentile for 1.9 either.

charris · 2014-06-02T20:24:08Z

numpy/lib/nanfunctions.py

+def _nanmedian_small(a, axis=None, out=None, overwrite_input=False):
+    """
+    sort + indexing median, faster for small medians along multiple dimensions
+    due to the high overhead of apply_along_axis


Might mention nanmedian for parameter documentation.

charris · 2014-06-02T20:35:09Z

Not sure of all the details, I'll leave that to the tests.

tolist() converts NumPy integers to Python integers, which the indexing then converts back to NumPy integers. meshgrid(indexing='ij') returns the wanted indices directly in the right type. This triples the performance of sorting a size=(200, 200, 50) array along axis 2 and reduces memory usage by almost 40%.

charris · 2014-06-02T20:55:46Z

How much gain does this bring to the small array performance? It might make sense to push this off to the next release so you can spend more time with it.

juliantaylor · 2014-06-02T21:03:09Z

Easily a factor of 500. The performance of many small medians is abysmal because the pure Python logic in apply_along_axis is very slow compared to the median itself.
I have seen many people stumble over this issue. It's pretty common in astronomy, where you often stack a bunch of images using a median, e.g. 1000x1000x100 stacks. It also came up on the mailing list at least once.
The only existing workaround is using bottleneck or writing this yourself, which is not trivial.
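
To make the overhead concrete, here is a simplified sketch of what a per-lane loop looks like. `apply_along_axis_sketch` is a hypothetical stand-in, not NumPy's actual implementation, and it assumes `func1d` returns a scalar. For a 1000x1000x100 stack reduced along the last axis, such a loop makes one million Python-level calls.

```python
import numpy as np

def apply_along_axis_sketch(func1d, axis, arr):
    """Call `func1d` once per 1-D lane; the loop itself runs in Python."""
    arr = np.asarray(arr)
    # Move the reduced axis last so each leading index selects one lane.
    lanes = np.moveaxis(arr, axis, -1)
    out = np.empty(lanes.shape[:-1], dtype=float)
    for idx in np.ndindex(*lanes.shape[:-1]):
        out[idx] = func1d(lanes[idx])
    return out

# A small stand-in for an image stack: 20x30 pixels, 5 exposures.
stack = np.random.default_rng(2).random((20, 30, 5))
looped = apply_along_axis_sketch(np.median, 2, stack)  # 600 Python-level calls
vectorized = np.median(stack, axis=2)                  # a single call
```

Both results are the same; only the number of interpreter round trips differs.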

charris · 2014-06-02T21:14:02Z

Well, that's significant ;) Anything we could do to speed up apply_along_axis?

juliantaylor · 2014-06-02T21:17:24Z

In the linked issue I listed the ideas I had for solving this.
Improving apply_along_axis should be possible and worthwhile by moving it to C code using npyiter; possibly one could also cythonize it.

juliantaylor · 2014-06-02T21:42:58Z

Updated: fixed the minor style issue and added an explicit test case for the two nanmedian code paths.

charris · 2014-06-02T22:32:14Z

Thanks Julian.

juliantaylor reviewed May 30, 2014

charris reviewed Jun 2, 2014

juliantaylor added 2 commits June 2, 2014 22:55

ENH: use masked median for small multidimensional nanmedians

99ff7a7

charris added a commit that referenced this pull request Jun 2, 2014

Merge pull request #4760 from juliantaylor/masked-median

14cc717

ENH: rewrite ma.median to improve poor performance for multiple dimensions

charris merged commit 14cc717 into numpy:master Jun 2, 2014

durack1 mentioned this pull request Jun 19, 2014

Memory bloat using numpy.ma.median (Py 2.7.4, Np 1.7.1) #4814

Closed

taldcroft mentioned this pull request Jun 20, 2015

np.ma.median returns masked_array; doesn't match example in docs #5969

Closed

AmitAronovitch mentioned this pull request Apr 30, 2016

BUG: masked-array median of 1d array should be scalar #7592

Closed

AmitAronovitch mentioned this pull request May 14, 2016

BUG: ma.median alternate fix for #7592 #7635

Merged

charris pushed a commit to charris/numpy that referenced this pull request May 22, 2016

BUG: ma.median of 1d array should return a scalar

c1153b8

Fixes numpy#5969. Performance fix numpy#4760 had caused wrong shaped results in the 1D case. This fix restores the original 1D behavior.

charris mentioned this pull request May 22, 2016

Backport 7635, BUG: ma.median of 1d array should return a scalar #7654

Merged
