[MRG] scorer: add sample_weight support (+test) by vene · Pull Request #3401 · scikit-learn/scikit-learn · GitHub

Merged 4 commits into scikit-learn:master on Jul 19, 2014

Conversation

@vene (Member) commented on Jul 16, 2014

Wraps up #3098 (a part of #1574), ready for review.

Initial description by @ndawe:

This is part of the larger #1574 and adds support for sample weights in the scorer interface.

@vene (Member, Author) commented on Jul 16, 2014

The test is simple, but the change is equally simple: it just passes the sample_weight parameter through.
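That pass-through can be sketched roughly like this (a simplified illustration of the scorer call path, not scikit-learn's actual implementation; the class name and structure here are made up for the example):

```python
import numpy as np

class PredictScorer:
    """Minimal sketch of a predict-based scorer that forwards
    sample_weight to the underlying metric (illustrative only)."""

    def __init__(self, score_func, sign=1, **kwargs):
        self._score_func = score_func
        self._sign = sign
        self._kwargs = kwargs

    def __call__(self, estimator, X, y_true, sample_weight=None):
        y_pred = estimator.predict(X)
        if sample_weight is not None:
            # the whole change: hand the weights through to the metric
            return self._sign * self._score_func(
                y_true, y_pred, sample_weight=sample_weight, **self._kwargs)
        return self._sign * self._score_func(y_true, y_pred, **self._kwargs)
```

Any metric that accepts sample_weight, e.g. a weighted accuracy like `lambda y, p, sample_weight=None: np.average(y == p, weights=sample_weight)`, then receives the weights unchanged.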


```python
for name, scorer in SCORERS.items():
    try:
        weighted = scorer(estimator[name], X_test, y_test,
                          sample_weight=sample_weight)
```
@amueller (Member) commented on the inline diff:

I would rather have a more explicit test. For example, give the regressor [0, 0, 1, 1, 2, 2] as the target, use weights=[0, 0, 1, 1, 0, 0] and then 1 - weights, and check that the expected thing happens in each case.
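For what it's worth, the explicit check proposed here could be sketched as follows (a hypothetical test, not the one that was merged; it assumes the modern scorer name `neg_mean_squared_error` and that scorers accept a `sample_weight` argument, which is exactly what this PR adds):

```python
import numpy as np
from sklearn.dummy import DummyRegressor
from sklearn.metrics import get_scorer

X = np.zeros((6, 1))
y = np.array([0.0, 0.0, 1.0, 1.0, 2.0, 2.0])
w = np.array([0.0, 0.0, 1.0, 1.0, 0.0, 0.0])

reg = DummyRegressor(strategy="mean").fit(X, y)  # always predicts 1.0
scorer = get_scorer("neg_mean_squared_error")

# With w, only the perfectly predicted middle samples count: error 0.
assert scorer(reg, X, y, sample_weight=w) == 0.0
# With 1 - w, only the mispredicted samples count: squared error 1 each.
assert scorer(reg, X, y, sample_weight=1 - w) == -1.0
```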

@vene (Member, Author) replied:

@amueller The difficulty is that, unlike when testing the metrics directly, for testing the scorer it's tricky to anticipate the output of the classifier / regressor.

@amueller replied:

Not if you use the dummies ;) What is wrong with my suggestion?

@vene replied:

The 'expected thing' would need to be manually computed for each different scorer, right?

That would be just like coming up with a different hand-crafted test for each scorer, which I could do, if you insist.

@amueller replied:

You are right: if you are testing all score functions, that doesn't make any sense.

Quoting the test under review in sklearn/metrics/tests/test_score_objects.py:

```python
sample_weight = np.ones_like(y_test)
sample_weight[:10] = 0

# get sensible estimators for each metric
sensible_regr = DummyRegressor(strategy='median')
sensible_regr.fit(X_train, y_train)
sensible_clf = DecisionTreeClassifier()
sensible_clf.fit(X_train, y_train)
estimator = dict([(name, sensible_regr)
                  for name in REGRESSION_SCORER_NAMES] +
                 [(name, sensible_clf)
                  for name in CLF_SCORER_NAMES])

for name, scorer in SCORERS.items():
    try:
        weighted = scorer(estimator[name], X_test, y_test,
```

@arjoly (Member) commented on Jul 16, 2014

LGTM when travis is green.

@amueller (Member) commented:
lgtm

@ndawe (Member) commented on Jul 16, 2014

Thanks a lot for taking this over @vene! I've had no time to work on this lately.

@ndawe mentioned this pull request on Jul 16, 2014.
```python
ignored = scorer(estimator[name], X_test[10:], y_test[10:])
unweighted = scorer(estimator[name], X_test, y_test)
assert_not_equal(weighted, unweighted,
                 "scorer {} behaves identically when called with "
```
An inline review comment (Member):

Support for Python 2.6 means we can't use {}. Use {0}, {1}, {2}, ...
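For context: auto-numbered fields ({}) were only added in Python 2.7; under 2.6, str.format raises an error for them, so the field index must be spelled out (the message text below is illustrative, not the exact string from the test):

```python
name = "accuracy"
# Python 2.6-compatible: explicit index {0} instead of auto-numbered {}
msg = ("scorer {0} behaves identically when called with "
       "and without sample weights").format(name)
```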

@arjoly (Member) commented on Jul 19, 2014

Not much left to do to get this merged.

@coveralls commented:

Coverage increased (+0.0%) when pulling 6a4aa1d on vene:scorer_weights into 4ec8630 on scikit-learn:master.

@arjoly (Member) commented on Jul 19, 2014

Travis is happy! Merging.

arjoly added a commit referencing this pull request on Jul 19, 2014: "[MRG] scorer: add sample_weight support (+test)"
@arjoly merged commit 70f4d47 into scikit-learn:master on Jul 19, 2014.
@ndawe (Member) commented on Jul 19, 2014

Thanks! 🍻

@jnothman (Member) commented:
I'm glad to see this moving through. Thanks Noel and Vlad. I look forward
to the cross-validation support.

