8000 [MRG + 2] predict_proba should use the softmax function in the multinomial case by MechCoder · Pull Request #5182 · scikit-learn/scikit-learn · GitHub
[MRG + 2] predict_proba should use the softmax function in the multinomial case #5182


Merged
merged 3 commits into from
Aug 30, 2015

Conversation

MechCoder
Member

Fixes #5176

@@ -238,16 +238,32 @@ def _predict_proba_lr(self, X):
1. / (1. + np.exp(-self.decision_function(X)));
multiclass is handled by normalizing that over all classes.
"""
from sklearn.linear_model.logistic import (
LogisticRegression, LogisticRegressionCV)
Member
That's pretty ugly. I would rather override predict_proba in LogisticRegression.

def predict_proba(self, X):
    if self.multi_class == "multinomial":
        [...]
    else:
        return super(LogisticRegression, self).predict_proba(X)
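The difference between the two code paths the override separates can be sketched in NumPy. This is an illustration of the idea, not the merged implementation; the `scores` array is made up and stands in for `decision_function(X)` output:

```python
import numpy as np

# Made-up decision scores for 2 samples and 3 classes.
scores = np.array([[2.0, 0.5, -1.0],
                   [0.1, 0.2, 0.3]])

# OvR path: a sigmoid per class, then normalize across classes.
sig = 1.0 / (1.0 + np.exp(-scores))
ovr_proba = sig / sig.sum(axis=1, keepdims=True)

# Multinomial path: softmax of the scores.
e = np.exp(scores)
softmax_proba = e / e.sum(axis=1, keepdims=True)
```

Both results are valid probability distributions row by row, but the values differ, which is why a model fit with the multinomial objective must use the softmax to report calibrated probabilities.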

Member

That should get rid of the isinstance on self as well.

@MechCoder
Member Author

@mblondel Sorry for the hasty hack. I've fixed up your comment.

@mblondel
Member

I guess predict_proba's argmax = predict is already tested in the common tests?

LGTM.

@MechCoder
Member Author

Yes, it is.

@MechCoder MechCoder changed the title [BUG] predict_proba should use the softmax function in the multinomial case [MRG + 1] predict_proba should use the softmax function in the multinomial case Aug 30, 2015
@mblondel
Member

Does the test you added fail without the patch?

@MechCoder
Member Author

I was thinking about that. Would it be sufficient to check that the predicted probability values are different in the two cases?

@MechCoder
Member Author

Is it always true that clf.predict_proba(X).max(axis=0) is greater for the multinomial case?

@mblondel
Member

You could try to compute the multinomial log loss:
http://scikit-learn.org/stable/modules/generated/sklearn.metrics.log_loss.html

Hopefully, the loss should be smaller with the right probabilities (although this might not be true due to the l2 regularization term on the coefficients...).
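The property this suggested test relies on can be sketched with a NumPy-only version of the multinomial log loss (the label vector and both probability matrices below are made up for illustration; the actual test would use `sklearn.metrics.log_loss` on fitted models):

```python
import numpy as np

def multinomial_log_loss(y_true, proba):
    # Mean negative log-likelihood of the true class for each sample.
    n = len(y_true)
    return -np.mean(np.log(proba[np.arange(n), y_true]))

# Made-up labels and two candidate probability matrices: one that
# concentrates mass on the true class, one that does not.
y = np.array([0, 2, 1])
good = np.array([[0.8, 0.1, 0.1],
                 [0.1, 0.2, 0.7],
                 [0.2, 0.6, 0.2]])
bad = np.array([[0.5, 0.3, 0.2],
                [0.4, 0.3, 0.3],
                [0.3, 0.4, 0.3]])
```

Probabilities that are better calibrated to the true labels give a strictly smaller log loss, so a model trained on the multinomial objective should score lower with softmax probabilities than with the (incorrect) normalized-sigmoid ones.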

@MechCoder
Member Author

thanks for the tip. I've added the test.

@agramfort
Member

+1 for merge when travis is happy. thanks @MechCoder

@MechCoder
Member Author

Ah, I see. I added that, but I kept the previous test as well because I thought it might be interesting.

@mblondel
Member

Thanks. LGTM now :)

@MechCoder
Member Author

Also fixes #5134

@GaelVaroquaux GaelVaroquaux changed the title [MRG + 1] predict_proba should use the softmax function in the multinomial case [MRG + 2] predict_proba should use the softmax function in the multinomial case Aug 30, 2015
@GaelVaroquaux
Member

Two +1s. We're only waiting for Appveyor (which is currently very slow).

GaelVaroquaux added a commit that referenced this pull request Aug 30, 2015
[MRG + 2] predict_proba should use the softmax function in the multinomial case
@GaelVaroquaux GaelVaroquaux merged commit 4f713ce into scikit-learn:master Aug 30, 2015
@akxlr
akxlr commented Aug 31, 2015

Minor comment: would it be more stable to calculate the log probability (using scipy.misc.logsumexp for the denominator) then exponentiate? At least for predict_log_proba this could be done.
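akxlr's suggestion can be sketched as follows (using `scipy.special.logsumexp`, the modern home of the function referred to above as `scipy.misc.logsumexp`; the scores are made up):

```python
import numpy as np
from scipy.special import logsumexp

# Scores large enough that np.exp(scores) alone would overflow float64.
scores = np.array([[750.0, 749.0, 748.0]])

# Compute log-probabilities first: log softmax(x)_i = x_i - logsumexp(x).
log_proba = scores - logsumexp(scores, axis=1, keepdims=True)

# Exponentiate only at the end, if plain probabilities are needed.
proba = np.exp(log_proba)
```

For `predict_log_proba` the final exponentiation can be skipped entirely, which is both faster and more accurate for very small probabilities.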

@jagapiou
jagapiou commented Sep 7, 2015

This will have overflow issues for large decision_function values (e.g. [750, 749, 748]). This can be fixed by subtracting the max m = max_k{ x_k } from the output of decision_function (750 in this case), since:
exp(x_i) / sum_k{ exp(x_k) } = exp(x_i - m) exp(m) / sum_k{ exp(x_k - m) exp(m) } = exp(x_i - m) / sum_k{ exp(x_k - m) }.
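The max-subtraction fix described above can be sketched in NumPy (the scores are made up; this is an illustration, not the patch that landed in #5225):

```python
import numpy as np

def stable_softmax(scores):
    """Row-wise softmax with the max subtracted for numerical stability."""
    # Subtracting the row max leaves the result mathematically unchanged
    # (the exp(m) factors cancel) but keeps the exponents <= 0.
    z = scores - scores.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

# Scores where a naive np.exp(scores) would overflow to inf.
scores = np.array([[750.0, 749.0, 748.0]])
proba = stable_softmax(scores)
```

With the shift, the largest exponent is exactly 0, so the numerator never overflows and the denominator is at least 1.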

@MechCoder MechCoder deleted the predict_proba_fix branch September 8, 2015 01:32
@MechCoder
Member Author

Thanks for the tip! See #5225
