[MRG + 1] BUG: Uses self.scoring for score function #11192

thomasjpfan · 2018-06-03T02:32:45Z

Reference Issues/PRs

Fixes #10998.

What does this implement/fix? Explain your changes.

Adds a score function to LogisticRegressionCV that uses self.scoring.

jnothman

Thanks

jnothman · 2018-06-04T01:16:30Z

sklearn/linear_model/logistic.py

+        if scoring is None:
+            return (super(LogisticRegressionCV, self)
+                    .score(X, y, sample_weight=sample_weight))
+


we should probably raise a ChangedBehaviorWarning here.

I think that scoring should default to "accuracy" in the init so you don't need this branching.

I think that the scoring param in the init should default to "accuracy" so you don't need this branching.

The other sklearn classes that accepts a scoring parameter, stores it into self.scoring without considering a default.

To avoid branching, I added the following to the score function:

scoring = self.scoring or 'accuracy'

This hardcodes accuracy as the default scorer.

jnothman · 2018-06-04T01:18:16Z

sklearn/linear_model/tests/test_logistic.py

@@ -89,6 +91,30 @@ def test_error():
        assert_raise_message(ValueError, msg, LR(max_iter="test").fit, X, Y1)


+def test_logistic_cv_neg_mean_squared_error():


It's a bit awkward that a test by this name does not check that the provided score is maximised. I assume there is an existing test that does. Either merge this test into that one, or make it explicit that this test is about the score method.

RFC: Score emits ChangedBehaviorWarning

thomasjpfan · 2018-06-04T05:00:41Z

I updated this PR with the following:

Adds ChangedBehaviorWarning to the score function.
There did not seem to be a test for custom scoring functions. test_logistic_cv_mock_scorer now uses a mock to test the score function and to check that the provided score is maximized.

amueller · 2018-06-04T17:09:14Z

sklearn/linear_model/tests/test_logistic.py

+    custom_score = assert_warns(ChangedBehaviorWarning, lr.score, X, pred)
+
+    assert_equal(custom_score, mock_scorer.scores[0])
+    assert_equal(mock_scorer.calls, 1)


Should we test that by default there is no warning? Or is that overkill? Otherwise looks good.

I have added test_logistic_cv_score_does_not_warn_by_default to test that there is no warning by default.

thomasjpfan · 2018-06-04T19:02:11Z

I notice how this library is transitioning to using pytest features in its tests. I have updated the original warning check to use pytest:

with pytest.warns(ChangedBehaviorWarning):
    custom_score = lr.score(X, lr.predict(X))

jnothman

Nice work!

jnothman · 2018-06-05T08:00:52Z

sklearn/linear_model/tests/test_logistic.py

+    lr.fit(X, Y1)
+
+    # Cs[2] has the highest score (0.8) from MockScorer
+    assert_equal(lr.C_[0], [Cs[2]])


given the transition to pytest, assert_{equal,true,false} should be avoided in new tests. use bare assert

jnothman · 2018-06-05T08:01:50Z

sklearn/linear_model/logistic.py

+        if self.scoring is not None:
+            warnings.warn("The long-standing behavior to use the "
+                          "accuracy score has changed. The scoring "
+                          "parameter is now used",


Append ". This warning will disappear in version 0.22" so we remember to remove it!

RFC: Add message to warning

thomasjpfan · 2018-06-05T15:35:26Z

I updated this PR with the following:

Uses bare asserts.
Adds "This warning will disappear in version 0.22" to warning.

amueller · 2018-06-15T19:19:41Z

sklearn/linear_model/tests/test_logistic.py

+    # reset mock_scorer
+    mock_scorer.calls = 0
+    with pytest.warns(ChangedBehaviorWarning):
+        custom_score = lr.score(X, lr.predict(X))


you're not asserting that it warns, right?

The pytest.warns context manager asserts that the inner block does warn. In this context, a ChangedBehaviorWarning warning appears when scoring is not None. This assertion is used to test the following lines in the LogisticRegressionCV.score function:

https://github.com/thomasjpfan/scikit-learn/blob/21af4b596023d3b3d66aaae3b3df8952a66a9e6a/sklearn/linear_model/logistic.py#L1816-L1821

thanks, haven't used those much.

amueller · 2018-06-15T20:01:19Z

looks good, needs an entry in whats_new.

thomasjpfan · 2018-06-16T04:15:33Z

I added an entry to the whats_new docs.

qinhanmin2014

Otherwise LGTM

qinhanmin2014 · 2018-06-17T02:12:04Z

sklearn/linear_model/logistic.py

+        X : array-like, shape = (n_samples, n_features)
+            Test samples.
+
+        y : array-like, shape = (n_samples) or (n_samples, n_outputs)


Do we support (n_samples, n_outputs)? Seems that in fit, we only support (n_samples,)

I agree. It should be (n_samples,)

thomasjpfan · 2018-06-17T14:20:31Z

This PR was updated with the documentation fix.

qinhanmin2014

LGTM, thanks @thomasjpfan

thomasjpfan added 2 commits June 2, 2018 22:13

BUG: Uses self.scoring for score function

ec511ba

BUG: Support Python 2

0a8b456

jnothman reviewed Jun 4, 2018

View reviewed changes

RFC: Uses mock to test scoring

bf2b01f

RFC: Score emits ChangedBehaviorWarning

RFC: Default to accuracy

526cb63

amueller reviewed Jun 4, 2018

View reviewed changes

RFC: Uses pytest for warnings

de46280

jnothman reviewed Jun 5, 2018

View reviewed changes

jnothman approved these changes Jun 5, 2018

View reviewed changes

RFC: Uses asserts for pytest

21af4b5

RFC: Add message to warning

amueller reviewed Jun 15, 2018

View reviewed changes

amueller changed the title ~~BUG: Uses self.scoring for score function~~ [MRG + 1] BUG: Uses self.scoring for score function Jun 15, 2018

DOC: Updates whats_new

0f8facc

amueller approved these changes Jun 16, 2018

View reviewed changes

qinhanmin2014 approved these changes Jun 17, 2018

View reviewed changes

thomasjpfan force-pushed the LogisticRegressionCV-scoring branch from e92ba53 to d690a1c Compare June 17, 2018 13:41

DOC: Fix

d690a1c

qinhanmin2014 approved these changes Jun 17, 2018

View reviewed changes

qinhanmin2014 merged commit c75bccf into scikit-learn:master Jun 17, 2018

georgipeev pushed a commit to georgipeev/scikit-learn that referenced this pull request Jun 20, 2018

FIX Uses self.scoring for score function (scikit-learn#11192)

0badbea

This was referenced Aug 5, 2019

LogisticRegressionCV score function should use scoring parameter in the constructor #8274

Closed

[MRG] LogisticCV Score Function Fixed #8529

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG + 1] BUG: Uses self.scoring for score function #11192

[MRG + 1] BUG: Uses self.scoring for score function #11192

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		@@ -89,6 +91,30 @@ def test_error():
		assert_raise_message(ValueError, msg, LR(max_iter="test").fit, X, Y1)


		def test_logistic_cv_neg_mean_squared_error():

Uh oh!

[MRG + 1] BUG: Uses self.scoring for score function #11192

[MRG + 1] BUG: Uses self.scoring for score function #11192

Uh oh!

Conversation

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!