FIX _safe_divide should handle zero-division with numpy scalar #27312
Conversation
Given the failure in the example, it seems that the provided fix is not enough. We can end up returning a very large value because the denominator is extremely small. In the next iteration, the numerator and denominator will …
I am not sure what the right fix is here to handle the numerical instability: either clip in the safe division, because we should not assign such a large value in …
I also see that we had potentially another strategy in the past: https://github.com/scikit-learn/scikit-learn/pull/26278/files#diff-dac0eef4868535102a7b9aeb319e1501dfbd10f1256f91d532927e12d30bfb15L732
So this is not enough. The negative gradient vector has some … In the previous loss it seems that …
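One of the options floated above is to clip inside the safe division so that an extremely small denominator cannot inject a huge leaf update. A minimal sketch of that idea, assuming numpy scalar inputs; the helper name `_clipped_safe_divide` and the bound `1e15` are illustrative, not from the PR:

```python
import numpy as np

def _clipped_safe_divide(numerator, denominator, bound=1e15):
    # np.errstate silences the divide/invalid RuntimeWarnings that numpy
    # scalars emit instead of raising ZeroDivisionError.
    with np.errstate(divide="ignore", invalid="ignore"):
        result = numerator / denominator
    # nan (0/0) maps to 0.0; +/-inf and very large quotients are capped.
    return np.clip(np.nan_to_num(result, nan=0.0), -bound, bound)
```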
I'll also have a look, but I need some time. Fortunately, we've plenty of time to fix it.
@lorentzenchr I have numpy 1.25.1 on a Mac M1 and I can reproduce the error when I use the main branch.
As an aside (I don't know how much we care, to be perfectly honest): code based on …
@lorentzenchr @glemaitre @lesteve See scikit-learn/sklearn/ensemble/_gb.py, lines 237 to 244 at c634b8a.
I think we should have `indices = np.nonzero(masked_terminal_regions == leaf)[0]  # of terminal regions` if we compare with the original code.
Yep, this looks much like what we had. It also makes sense: otherwise we would be computing the gradient on data that we should not have :)
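To make the indexing concrete, here is a small sketch of that pattern with toy arrays (the data is illustrative; only the two `masked_terminal_regions` lines mirror `_gb.py`): out-of-bag samples are masked out of the terminal regions so the per-leaf update aggregates only over the in-bag samples.

```python
import numpy as np

# Leaf id assigned to each sample by the fitted tree (toy values).
terminal_regions = np.array([0, 1, 1, 1, 0])
# Subsampling mask: False marks out-of-bag samples (here, sample 2).
sample_mask = np.array([True, True, False, True, True])

# Give out-of-bag samples an impossible leaf id so they drop out.
masked_terminal_regions = terminal_regions.copy()
masked_terminal_regions[~sample_mask] = -1

leaf = 1
indices = np.nonzero(masked_terminal_regions == leaf)[0]
print(indices)  # [1 3] -- sample 2 fell in leaf 1 but is out-of-bag
```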
@lesteve @lorentzenchr Instead of the exception catching, we could use the previous trick that …
I think this would work with Pyodide. I am not too sure to what extent we want to support Pyodide quirkiness, as I mentioned above. Not getting some warnings in Pyodide feels acceptable, but having a gradient boosting algorithm that behaves weirdly because the loss becomes NaN or inf is maybe not that great.
So I applied the fix of @OmarManzoor and changed the error catching back to what it was previously.
I also added a comment to remember why we are not using …
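For context, a minimal sketch of the exception-catching approach described here, assuming the operands are routed through Python floats so that zero division raises instead of relying on numpy's floating-point warnings (which, per the Pyodide discussion above, may not surface there); the actual implementation lives in `sklearn/ensemble/_gb.py`:

```python
import math
import warnings

def _safe_divide(numerator, denominator):
    try:
        # Casting to Python float makes 0.0 / 0.0 raise ZeroDivisionError,
        # whereas dividing two numpy scalars only emits a RuntimeWarning.
        result = float(numerator) / float(denominator)
    except ZeroDivisionError:
        warnings.warn("divide by zero encountered in _safe_divide", RuntimeWarning)
        return 0.0
    if math.isinf(result):
        # A huge numerator over a tiny denominator overflows to inf
        # without raising; flag it the same way.
        warnings.warn("overflow encountered in _safe_divide", RuntimeWarning)
    return result
```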
Don't hold your breath too much, though: the link above says there is no plan to support it right now, so that's at least 3 years away.
LGTM, just some nits.
@glemaitre @OmarManzoor @lesteve Thanks for fixing my bugs.
Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
LGTM. Thanks @glemaitre, @lesteve and @lorentzenchr
FIX _safe_divide should handle zero-division with numpy scalar (scikit-learn#27312) Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
Fixing the `main` branch. This PR should handle the case of zero-division with two numpy scalars.
ping @lorentzenchr @thomasjpfan @OmarManzoor @lesteve
Edit: the CI failure popped up after merging #26278, in the example examples/ensemble/plot_gradient_boosting_regularization.py.
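To illustrate the behavior the title refers to: dividing two Python floats by zero raises ZeroDivisionError, whereas dividing two numpy scalars only emits a RuntimeWarning and returns inf or nan, so purely exception-based handling silently misses it:

```python
import numpy as np

try:
    1.0 / 0.0
except ZeroDivisionError:
    print("Python floats: ZeroDivisionError raised")

# numpy scalars do not raise; they warn and return inf/nan.
with np.errstate(divide="warn", invalid="warn"):
    print(np.float64(1.0) / np.float64(0.0))  # inf  (RuntimeWarning: divide by zero)
    print(np.float64(0.0) / np.float64(0.0))  # nan  (RuntimeWarning: invalid value)
```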