added multiclass_log_loss metric by ephes · Pull Request #1125 · scikit-learn/scikit-learn · GitHub

added multiclass_log_loss metric #1125

Closed · wants to merge 4 commits

Conversation

@ephes (Contributor) commented Sep 6, 2012

Don't know whether this is helpful, just practicing :)...

@mblondel (Member) commented Sep 7, 2012

Thanks, I do think that's useful. I will try to review the code later.

@kyleabeauchamp (Contributor)

So one (somewhat) related issue is that one cannot optimize this type of metric using GridSearchCV. The problem is that the grid search always assumes that you want to score using model.predict() rather than predict_proba(). It's obviously easy to temporarily hack the code to allow this, but I was wondering if people had any desire for a better implementation of such a feature. Thoughts?
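
For illustration, one possible shape of such a workaround, sketched against the callable-scoring interface that scikit-learn exposes in later versions (GridSearchCV accepts any callable with the signature (estimator, X, y) as its scoring argument); log_loss is used here as a stand-in for the multiclass_log_loss proposed in this PR:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.metrics import log_loss  # stand-in for multiclass_log_loss

def neg_log_loss_scorer(estimator, X, y):
    # score on predict_proba() output instead of predict();
    # negated because GridSearchCV treats larger scores as better
    return -log_loss(y, estimator.predict_proba(X))

X, y = load_iris(return_X_y=True)
search = GridSearchCV(LogisticRegression(max_iter=1000),
                      param_grid={"C": [0.1, 1.0, 10.0]},
                      scoring=neg_log_loss_scorer, cv=3)
search.fit(X, y)

Recent scikit-learn releases also expose this directly as the built-in scoring string "neg_log_loss".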

@kyleabeauchamp (Contributor)

I have two recommendations:

  1. Add multiclass_log_loss to metrics/__init__.py
  2. Re-normalize the probabilities after clipping. Right now, when values are clipped outside the window, the resulting probability vectors are no longer normalized; their sums may not be 1.0, which can subtly change the final result (see the sketch below).
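
A rough sketch of the second point, assuming y_pred is an (n_samples, n_classes) array of predicted probabilities (toy values below):

import numpy as np

# toy predicted probabilities: 2 samples, 3 classes
y_pred = np.array([[1.0, 0.0, 0.0],
                   [0.4, 0.5, 0.1]])
eps = 1e-15
clipped = np.clip(y_pred, eps, 1 - eps)           # avoids log(0), but rows may no longer sum to 1
clipped /= clipped.sum(axis=1, keepdims=True)     # re-normalize so each row sums to 1 again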

@amueller (Member)

@kyleabeauchamp The issue with grid search was discussed here: #1014.

@ephes (Contributor, Author) commented Sep 11, 2012

@kyleabeauchamp thanks for your recommendations, changed the files accordingly...


def multiclass_log_loss(y_true, y_pred, eps=1e-15):
    """Multi class version of Logarithmic Loss metric.
    https://www.kaggle.com/wiki/MultiClassLogLoss
Member (inline review comment):

I would rather use more standard references such as The Elements of Statistical Learning and Wikipedia.
The link to the thread seems pretty out of place. Also, I guess alternative names should be mentioned. This is the multinomial logistic regression loss, right? AKA softmax loss, AKA max entropy?
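
For readers following the diff, a self-contained sketch of the metric under discussion (clip, re-normalize, then average the negative log of the probability assigned to the true class); the exact signature and defaults in the PR's branch may differ:

import numpy as np

def multiclass_log_loss(y_true, y_pred, eps=1e-15):
    """Mean negative log-probability of the true class.

    y_true : integer class labels, shape (n_samples,)
    y_pred : predicted probabilities, shape (n_samples, n_classes)
    """
    y_true = np.asarray(y_true)
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1 - eps)
    y_pred /= y_pred.sum(axis=1, keepdims=True)   # re-normalize after clipping
    rows = np.arange(y_true.shape[0])
    return -np.mean(np.log(y_pred[rows, y_true]))

For example, multiclass_log_loss([0, 2], [[0.8, 0.1, 0.1], [0.2, 0.2, 0.6]]) is roughly 0.37.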

@amueller (Member) commented Oct 1, 2012

Hey @ephes, are you still working on this? Or are you too busy with the competition ;)
I thought that was merged already and kind of forgot about it.

@ephes (Contributor, Author) commented Oct 1, 2012

Yes, I'm too busy. The competition is eating up all of my spare time atm :). But I do plan to work on this again next week, when the competition is over.

@amueller (Member) commented Oct 1, 2012

Ok, no worries! I'll just use your branch until then. Good luck!

@weilinear (Member)

How is this PR going, @ephes? I can try to help if needed :)

@larsmans (Member)

Ping myself: this should get merged.

@amueller (Member)

IIRC the documentation and testing need some work.
I agree, though, that it shouldn't be much work and we should merge this soon.

@arjoly (Member) commented May 21, 2013

Could it be called log_loss and also support binary classification?

@amueller (Member)

no, it should ;)
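
As it turned out, the log_loss that scikit-learn eventually shipped handles both binary and multiclass input; a usage sketch (values are illustrative):

from sklearn.metrics import log_loss

# binary: a 1-d array of positive-class probabilities is enough
log_loss([0, 1, 1], [0.1, 0.9, 0.8])

# multiclass: one column of probabilities per class
log_loss([0, 2, 1], [[0.8, 0.1, 0.1],
                     [0.2, 0.2, 0.6],
                     [0.3, 0.5, 0.2]])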

@larsmans (Member)

I have some time tomorrow, I hope I can finish it then.

@mblondel (Member)

I agree that this PR needs some work.

Binary log loss can also be used for OvR: you just sum up the losses of each class. So we might prefer two different functions, one for binary log loss and one for multiclass log loss.
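
A rough sketch of that one-vs-rest idea (illustrative code, not what this PR implements): treat each class as its own binary problem and sum the binary log losses.

import numpy as np

def binary_log_loss(y_true, p, eps=1e-15):
    # y_true in {0, 1}; p = predicted probability of the positive class
    y_true = np.asarray(y_true)
    p = np.clip(np.asarray(p, dtype=float), eps, 1 - eps)
    return -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))

def ovr_log_loss(y_true, y_pred):
    # y_true: integer labels (n_samples,); y_pred: probabilities (n_samples, n_classes)
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred, dtype=float)
    # one binary problem per class: "does this sample belong to class k?"
    return sum(binary_log_loss((y_true == k).astype(int), y_pred[:, k])
               for k in range(y_pred.shape[1]))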

@arjoly (Member) commented Jul 25, 2013

Closing this one in favour of #2013.
Re-open it if you still want to contribute.

@arjoly closed this Jul 25, 2013
Labels: None yet · Projects: None yet · 7 participants