[MRG] SVM: Ensure nonnegative sample weights (#9494) by jondo · Pull Request #9674 · scikit-learn/scikit-learn · GitHub
[MRG] SVM: Ensure nonnegative sample weights (#9494) #9674


Closed · wants to merge 1 commit from the check-negative-SVC-weights branch

Conversation

@jondo (Contributor) commented Sep 2, 2017

As a first step toward dealing with #9494, this PR reproduces the issue in a unit test.

This should fail with the same IndexError as in the original report.
It only fails when all of the weights of the second class are negative.
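A minimal sketch of that triggering condition (the data values here are illustrative assumptions, not copied from the issue or the test):

```python
import numpy as np

# Illustrative data in the spirit of #9494: every sample of the
# second class gets a negative weight, which is the configuration
# under which SVC's fit/predict path raised the IndexError.
y = np.array([0, 0, 0, 1, 1, 1])
sample_weights = np.array([1.0, 1.0, 1.0, -1.0, -1.0, -1.0])

# The failing condition: all weights of the second class are negative.
second_class_all_negative = bool(np.all(sample_weights[y == 1] < 0))
print(second_class_all_negative)  # True
```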

@glemaitre (Member)

Uhm no chances on this one :)

@jondo (Contributor Author) commented Sep 2, 2017

@glemaitre, do you mean that the original issue should be closed without any code change, or that there is something fundamentally wrong with my attempt at test-driven development?
Can you reproduce the error with my test?

@glemaitre (Member)

Oh sorry, actually the CI did not run, and thus I believed that your test did not fail.

@jondo (Contributor Author) commented Sep 2, 2017

I see. Deactivating CI there was a bad late-night idea. I will switch it back on, so that you don't need to test locally.

@jondo force-pushed the check-negative-SVC-weights branch from e9b3329 to f6d6fe3 on September 2, 2017 15:24
@jondo (Contributor Author) commented Sep 2, 2017

FWIW, a Travis build additionally shows the stderr message "warning: class label 1 specified in weight is not found", which might be related and seems to come from here in linear.cpp.

@jondo (Contributor Author) commented Sep 2, 2017

No, I guess the warning must come from the corresponding place in svm.cpp, since the test calls svm.SVC, not svm.LinearSVC (though I am reading all of this code for the first time).

Actually, this warning might just be a side effect (or even totally unrelated), because the code there is about the class-label weights (weight). The sample weights are in W; they were introduced by @fabianp in 2010, and they don't exist in upstream LIBSVM.

@jondo (Contributor Author) commented Sep 4, 2017

(See my comment in the issue.)

@jnothman (Member) commented Sep 5, 2017

Add a fix?

@jondo force-pushed the check-negative-SVC-weights branch from f6d6fe3 to 744862c on September 5, 2017 21:04
@jondo changed the title from "[WIP] Reproduce the IndexError from #9494" to "[MRG] SVM: Ensure nonnegative sample weights (#9494)" on Sep 5, 2017
@jondo force-pushed the check-negative-SVC-weights branch from 744862c to 66060f7 on September 6, 2017 16:30
@jondo (Contributor Author) commented Sep 8, 2017

Ready for review.

@@ -170,6 +170,9 @@ def fit(self, X, y, sample_weight=None):
"boolean masks (use `indices=True` in CV)."
% (sample_weight.shape, X.shape))

if sample_weight.shape[0] > 0 and min(sample_weight) < 0:
Review comment (Member):

use np.min
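A sketch of the validation using the suggested np.min (the helper name and the error message are illustrative assumptions, not the exact code from this diff):

```python
import numpy as np

def _check_sample_weight_nonnegative(sample_weight):
    # Hypothetical helper mirroring the check added in this PR,
    # written with np.min as the reviewer suggested.
    sample_weight = np.asarray(sample_weight, dtype=np.float64)
    if sample_weight.shape[0] > 0 and np.min(sample_weight) < 0:
        raise ValueError("sample_weight must contain only "
                         "nonnegative values")
    return sample_weight

_check_sample_weight_nonnegative([1.0, 0.0, 2.0])  # passes silently
try:
    _check_sample_weight_nonnegative([1.0, -1.0])
except ValueError:
    print("raised ValueError")  # negative weight rejected
```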



def test_negative_weights():
W = np.array([1, 1, 1, -1, -1, -1])
Review comment (Member):

rename: W -> sample_weights

@agramfort (Member)

I feel this should be done with a check_sample_weights function, tested on all estimators via a common test that can be disabled for speed with the new context manager.

@jondo force-pushed the check-negative-SVC-weights branch from 66060f7 to 2af259d on September 10, 2017 20:02
@jondo (Contributor Author) commented Sep 10, 2017

@agramfort:
I have applied the small changes. Would (sample_weight < 0).any() be even better?

I guess sklearn/base.py is the place to call check_sample_weights for all estimators, and utils/validation.py the place to define it? But is there any estimator that supports negative weights?

Should the disabling look as in #7548? E.g.

with sklearn.config_context(assume_positive_sample_weights=True):
    clf.fit(X, Y, sample_weight=sample_weights)
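On the (sample_weight < 0).any() question: the two checks agree on nonempty arrays, and .any() additionally handles the empty case without a shape guard. A small numpy comparison (not code from the PR):

```python
import numpy as np

w = np.array([1.0, 2.0, -0.5])
# Equivalent on nonempty arrays:
assert (np.min(w) < 0) == bool((w < 0).any())

# .any() needs no shape[0] > 0 guard: it is simply False on empty
# input, whereas np.min raises on an empty array.
empty = np.array([])
assert not (empty < 0).any()
print("checks agree")
```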

@jnothman (Member) commented Sep 10, 2017 via email

@agramfort (Member) commented Sep 11, 2017 via email

@jondo (Contributor Author) commented Sep 11, 2017

Besides svm.SVC, are there other estimators that need nonnegative weights? I found no such case in the sklearn docs. (Should I document it for svm.SVC?)

I don't even know about the other SVM estimators. I guess that at least svm.LinearSVC might also need nonnegative weights, since LIBLINEAR's README.weight also says "Please make sure all weights are non-negative". So the common test can be reused at least there.

@jnothman (Member) commented Sep 11, 2017 via email

@amueller (Member)
I think most people assume weights are non-negative, but as @agramfort said, they don't need to be for AdaBoost (I'm not sure about gradient boosting and other tree-based algorithms).

Can you run a test over all_estimators, check which ones take weights and which ones require non-negative ones, and add error messages?
I guess we'd have a common test that checks that either a good error is raised or the result is correct (I think flipping the sign of y would be OK for binary; not sure what negative weights do in multiclass).
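A sketch of the first half of that scan, detecting which estimators accept sample_weight via signature inspection; in scikit-learn this would be run over sklearn.utils.all_estimators(), but the stand-in classes here are hypothetical so the snippet stays self-contained:

```python
import inspect

def accepts_sample_weight(estimator_class):
    # True when the estimator's fit signature exposes sample_weight.
    sig = inspect.signature(estimator_class.fit)
    return "sample_weight" in sig.parameters

# Hypothetical stand-ins for real estimators:
class DummyWithWeights:
    def fit(self, X, y, sample_weight=None):
        return self

class DummyWithoutWeights:
    def fit(self, X, y):
        return self

print(accepts_sample_weight(DummyWithWeights))     # True
print(accepts_sample_weight(DummyWithoutWeights))  # False
```

The second half (deciding which of those require non-negative weights) would then need a fit call with a negative weight and a check that either a clear error is raised or the result matches a known-correct baseline.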

@alexshacked (Contributor)
Actually, the SVC algorithm handles negative weights. It happens in the implementation, in function PREFIX(train) in svm.cpp:2338. The first thing this function does is call remove_zero_weight(), also located in svm.cpp, which removes from the input all samples that have a negative or zero weight, so the training algorithm processes only the samples with positive weights. The problem in the specific example that uncovered bug #9494 is that all samples with the "0" label also received a negative weight. So after removing the samples with negative weights, the training algorithm in svm.cpp was left with only the samples having label "1". This is of course an invalid state, and this is why coef_ was not created: for SVM training, the input must contain samples with label "0" and samples with label "1". Both classes must be present in the training data.
Function PREFIX(train) in svm.cpp:2338 does not test that both labels appear in the input data because it relies on the testing done in the base class, in sklearn.svm.base.BaseLibSVM.fit(). There we call validate_targets() (base.py:147), which tests exactly that. But this function looks at the labels, not at the weights. So in the #9494 scenario, sklearn.svm.base.BaseLibSVM.fit() validates the input since it sees samples with both types of labels, but then in the implementation in svm.cpp, all the samples with negative weights are removed, leaving only samples with label "1" in the training data.
Since BaseLibSVM is inherited by many specialized classes (NuSVC, SVR, NuSVR, OneClassSVM), testing for negative weights in BaseLibSVM.fit() could interfere with the other algorithms that inherit from this class. This issue was mentioned by the participants in pull request #9674.

A more encapsulated fix could be done in the implementation, in PREFIX(train) in svm.cpp: after removing the samples with zero/negative weights, the function should test that both class labels are still present in the remaining training set, and if not, throw an exception.
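The proposed check can be sketched in Python terms (the real fix would live in C++ in svm.cpp; the function name here is illustrative):

```python
import numpy as np

def classes_remaining_after_filter(y, sample_weight):
    # Python-level sketch of what remove_zero_weight() leaves behind:
    # only samples with strictly positive weight survive.
    y = np.asarray(y)
    sample_weight = np.asarray(sample_weight)
    return np.unique(y[sample_weight > 0])

# The #9494 scenario: the whole second class carries negative weights.
remaining = classes_remaining_after_filter(
    y=[0, 0, 0, 1, 1, 1], sample_weight=[1, 1, 1, -1, -1, -1])
print(remaining)  # only class 0 survives
if remaining.size < 2:
    print("invalid training set: fewer than two classes remain")
```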

I see that this issue has not been handled for some time, and I would be happy to take it on and handle the pull request.

@alexshacked (Contributor) commented Jul 1, 2019

@amueller, @agramfort: I saw that issue #9494 (which is handled by this PR) was tagged "help wanted".
I analysed the problem starting from the PR comments and then by debugging the code; my findings are in the comment above. I would like to proceed with the above solution. What do you think?

@jnothman (Member) commented Jul 1, 2019 via email

@alexshacked (Contributor)
Opened PR #14282

@alexshacked (Contributor)
Closed PR #14282
Opened PR #14286

@amueller added the "Superseded (PR has been replaced by a newer PR)" label on Aug 6, 2019
6 participants