[MRG + 3] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4] #4295 · scikit-learn/scikit-learn


Merged: ogrisel merged 2 commits into scikit-learn:master from ragv:sigmoid_decision_fn on Mar 15, 2015

Conversation

@raghavrv (Member)

Initially started off addressing #3891 (comment) by using expit (sigmoid) to map the sum_of_confidences to (0, 1)...

Currently uses linear scaling to map the sum_of_confidences to (-0.5, 0.5) before adding it to the votes (scaling factor 0.4 --> 0.5 - np.finfo(sum_of_confidences.dtype).eps).

@mblondel @amueller kindly take a look!
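A minimal sketch of the linear scaling described above (toy values chosen for illustration, not the actual OneVsOneClassifier internals):

import numpy as np

sum_of_confidences = np.array([-7.0, 0.3, 2.0, 11.0])
eps = np.finfo(sum_of_confidences.dtype).eps
scale = (0.5 - eps) / max(abs(sum_of_confidences.max()),
                          abs(sum_of_confidences.min()))
print(sum_of_confidences * scale)  # every entry now lies strictly inside (-0.5, 0.5)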

@raghavrv (Member, Author)

The sigmoid used, with the parameter 0.5:

[plot: expit(0.5 * x)]

raghavrv force-pushed the sigmoid_decision_fn branch from facdfa6 to 7826375 on February 25, 2015
@coveralls

Coverage increased (+0.0%) to 95.09% when pulling 7826375 on ragv:sigmoid_decision_fn into a926626 on scikit-learn:master.

-    return votes + sum_of_confidences * \
-        (0.4 / max(abs(max_confidences), abs(min_confidences)))
+    # Map the sum_of_confidences to (0, 1) using the sigmoid function
+    return votes + expit(0.5 * sum_of_confidences)
@mblondel (Member), commenting on the diff above:

Why do you need a 0.5 here?

@amueller (Member)

@mblondel for numeric reasons the sigmoid is not < 1 for large scores: expit(40) == 1.0, so we'd need to use .9 * expit(scores) to make sure the values stay < 1.0. What do you think?
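A quick check of the saturation described here (assuming scipy is available):

from scipy.special import expit

print(expit(40))        # 1.0 -- rounds to exactly 1.0 in float64
print(0.9 * expit(40))  # 0.9 -- stays strictly below 1.0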

@raghavrv (Member, Author)

The other option would be to use a very small value (0.001 instead of the 0.5) and stretch the sigmoid further until it can handle all sane values of sum_of_confidences (previously scores) :)

@amueller (Member)

well why change one magic value to another?

@raghavrv (Member, Author)

True that ;)

@mblondel (Member)

or maybe np.minimum(0.99, score)?

@raghavrv (Member, Author)

With both options there is a minor problem: when score_1 = 38 and score_2 = 40, both scores map to the same output (0.9 with the first option, 0.99 with the second), so the confidences no longer distinguish them.
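For concreteness, a quick check of this collapse (hypothetical scores, assuming numpy and scipy):

import numpy as np
from scipy.special import expit

score_1, score_2 = 38.0, 40.0
print(0.9 * expit(score_1), 0.9 * expit(score_2))            # 0.9 0.9
print(np.minimum(0.99, score_1), np.minimum(0.99, score_2))  # 0.99 0.99
# Either way both scores land on the same value, so the tie between
# them can no longer be broken.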

There are two more ways we could deal with it:

  • scale to [0, 1] --then--> replace all 1s by the midpoint between the 2nd largest value (say 0.98) and 1, so the result always stays in the range [0, 1) (practically [0, (2nd largest + 1)/2])... i.e. [-9, 0, 9] --> [0, 0.5, 1] --> [0, 0.5, 0.75]
  • scale to [0, 1] --then--> apply a sigmoid mapping to (0, 1)

Both these methods are quite weird, as @amueller pointed out :)
Moreover, the 2nd one is 4 times slower than simply scaling (the current approach).

Perhaps we could simply replace 0.4 by 0.5 - np.finfo(float).resolution in the current approach?

# Scale the sum_of_confidences to (-0.5, 0.5) and add it to the votes
scale_factor = 0.5 - np.finfo(float).resolution
sum_of_confidences *= (scale_factor / max(abs(max_confidences),
                                          abs(min_confidences)))
return votes + sum_of_confidences

EDIT: I am suggesting 0.5 - np.finfo(float).resolution, assuming 0.5 won't be considered a magic number since it's a standard scaling factor (for symmetric scaling).

@ogrisel (Member) commented Mar 6, 2015

If the original 0.4 magic constant is considered too magic then I think a linear scaling to 0.5 - np.finfo(sum_of_confidences.dtype).eps is explicit enough. I don't see the benefit of introducing expit.

raghavrv force-pushed the sigmoid_decision_fn branch from 7826375 to 29af585 on March 6, 2015
@raghavrv (Member, Author) commented Mar 6, 2015

Thanks for the comment! Done!

raghavrv force-pushed the sigmoid_decision_fn branch from 29af585 to 0d41b33 on March 6, 2015
     return votes + sum_of_confidences * \
-        (0.4 / max(abs(max_confidences), abs(min_confidences)))
+        (scaling_factor / max(abs(max_confidences), abs(min_confidences)))
@ogrisel (Member), commenting on the diff above:

I would write that as:

# Scale the sum_of_confidences to (-0.5, 0.5) and add it with votes.
# The motivation is to use confidence levels as a way to break ties in the
# votes without switching any decision made based on a difference of 1
# vote.
eps = np.finfo(sum_of_confidences.dtype).eps
max_abs_confidence = max(abs(max_confidences), abs(min_confidences))
scale = (0.5 - eps) / max_abs_confidence
return votes + sum_of_confidences * scale
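A toy check of the motivation spelled out in that comment: the scaled confidences break ties without overturning a one-vote lead (hypothetical values, not the real OneVsOneClassifier code path):

import numpy as np

votes = np.array([2.0, 1.0, 1.0])
sum_of_confidences = np.array([-4.0, 6.0, 1.5])
eps = np.finfo(sum_of_confidences.dtype).eps
max_abs_confidence = max(abs(sum_of_confidences.max()),
                         abs(sum_of_confidences.min()))
scale = (0.5 - eps) / max_abs_confidence
scores = votes + sum_of_confidences * scale
print(scores)           # approximately [1.667, 1.5, 1.125]
print(scores.argmax())  # 0 -- class 0 keeps its one-vote lead; the tie
                        # between classes 1 and 2 is broken in favour of 1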

raghavrv force-pushed the sigmoid_decision_fn branch from 0d41b33 to d6e389c on March 6, 2015
@raghavrv (Member, Author) commented Mar 6, 2015

@ogrisel done!

raghavrv changed the title from "[MRG] ENH use the sigmoid to map the sum of confidences to (0, 1)" to "[MRG] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]" on Mar 6, 2015
@amueller (Member) commented Mar 6, 2015

Fine with me.

@raghavrv (Member, Author) commented Mar 6, 2015

Should I consider that a +1?

@mblondel Could you also please take a look at this?

amueller changed the title from "[MRG] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]" to "[MRG + 1] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]" on Mar 6, 2015
raghavrv force-pushed the sigmoid_decision_fn branch from d6e389c to 342e6e4 on March 6, 2015
@raghavrv (Member, Author)

@ogrisel Could you take a look at this?

@ogrisel (Member) commented Mar 14, 2015

LGTM as well. @mblondel no objection to using linear scaling instead of a logistic sigmoid? The ranking induced by any monotonic function to the (-0.5, 0.5) range should stay the same anyway.
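A quick sanity check of that monotonicity point (made-up confidences, assuming numpy and scipy):

import numpy as np
from scipy.special import expit

conf = np.array([-3.0, -0.5, 0.2, 4.0])
linear = conf * (0.5 - np.finfo(conf.dtype).eps) / np.abs(conf).max()
sigmoid = expit(conf) - 0.5   # another monotonic map into (-0.5, 0.5)
assert (np.argsort(linear) == np.argsort(sigmoid)).all()  # same ranking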

@mblondel (Member)

Nope, no objection.

raghavrv changed the title from "[MRG + 1] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]" to "[MRG + 3] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]" on Mar 15, 2015
@ogrisel (Member) commented Mar 15, 2015

Alright, merging. Thanks @ragv!

ogrisel added a commit referencing this pull request on Mar 15, 2015: "[MRG + 3] ENH linearly scale to (-0.5, 0.5) instead of [-0.4, 0.4]"
ogrisel merged commit 4be6214 into scikit-learn:master on Mar 15, 2015
raghavrv deleted the sigmoid_decision_fn branch on March 18, 2015