FIX Fix recall in multilabel classification when true labels are all negative by varunagrawal · Pull Request #19085 · scikit-learn/scikit-learn · GitHub

FIX Fix recall in multilabel classification when true labels are all negative #19085


Merged
merged 11 commits into scikit-learn:main on Mar 25, 2022

Conversation

varunagrawal
Contributor
@varunagrawal varunagrawal commented Dec 31, 2020

Reference Issues/PRs

Fixes #8245

What does this implement/fix? Explain your changes.

When all the y_true labels are negative, precision_recall_curve returns nan because recall is set to nan instead of 1. This happens because of the direct division of the tps vector by tps[-1], which is 0 in this case.

This fix checks whether tps[-1] is 0 and, if so, sets the recall to 1 directly since there are no true positives or false negatives; otherwise recall is calculated as normal.

I updated and added tests to check for this case.
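For reference, a minimal reproduction sketch of the behavior being fixed (the all-negative inputs below are illustrative):

import numpy as np
from sklearn.metrics import precision_recall_curve

# Every ground-truth label is negative for this class, so tps[-1] == 0.
y_true = np.array([0, 0, 0, 0])
y_scores = np.array([0.1, 0.4, 0.35, 0.8])

precision, recall, thresholds = precision_recall_curve(y_true, y_scores)
# Before this fix: recall contains nan (the 0 / 0 division described above).
# With this fix: recall is all ones, since there are no positives to miss.
print(recall)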

Any other comments?

Please refer to the issue thread for more discussion points.


@varunagrawal varunagrawal changed the title from "Fix F1 score when all true labels are negative" to "Fix recall in multilabel classification when true labels are all negative" on Dec 31, 2020
@cmarmo
Contributor
cmarmo commented Jan 1, 2021

Thanks @varunagrawal for following up on #14621. The failing check is unrelated to your PR: it will be resolved by merging #18930.

Base automatically changed from master to main January 22, 2021 10:53
@rayhou0710

Is there a plan to merge this commit?

@cmarmo cmarmo added the Bug label May 18, 2021
@cmarmo cmarmo added this to the 1.0 milestone May 18, 2021
@cmarmo
Contributor
cmarmo commented May 18, 2021

Hi @varunagrawal, thanks for your patience! Please add an entry to the change log at doc/whats_new/v1.0.rst. Like the other entries there, please reference this pull request with :pr: and credit yourself (and other contributors if applicable) with :user:.

@adrinjalali
Member

@amueller @lesteve you've been involved in the original issue, pinging you in case you wanna leave a review. The issue is tagged in the v1.0 milestone.

@varunagrawal
Contributor Author

@cmarmo done and done.

@adrinjalali adrinjalali modified the milestones: 1.0, 1.1 Sep 7, 2021
Member
@thomasjpfan thomasjpfan left a comment

Thank you for the PR @varunagrawal !

@@ -856,7 +856,7 @@ def precision_recall_curve(y_true, probas_pred, *, pos_label=None, sample_weight

     precision = tps / (tps + fps)
     precision[np.isnan(precision)] = 0
-    recall = tps / tps[-1]
+    recall = np.ones(tps.size) if tps[-1] == 0 else tps / tps[-1]
Member

In precision_recall_fscore_support, there is a zero_division parameter that controls how we handle the edge case. This parameter is also used in recall_score and precision_score. If we want to be consistent with precision_recall_fscore_support, we would need to add a zero_division and use the same semantics.

What do you think?
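For context, a small sketch of the zero_division semantics referred to here (the toy labels are made up for illustration):

from sklearn.metrics import recall_score

# No positive ground-truth labels, so recall is 0 / 0 and therefore ill-defined.
y_true = [0, 0, 0, 0]
y_pred = [0, 0, 0, 1]

# zero_division controls the value returned in the ill-defined case.
print(recall_score(y_true, y_pred, zero_division=0))  # 0.0
print(recall_score(y_true, y_pred, zero_division=1))  # 1.0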

Contributor

@thomasjpfan this issue has been sitting unfixed for years. I think this fix should be merged as-is before it bit-rots, with a separate issue opened for semantic consistency.

Contributor Author

I agree with @crypdick on this. It's been over 3 years for what is a very simple bugfix, and we can track semantic consistency via another issue. That issue can be tackled by someone else, potentially from the sklearn team, which would get the ball rolling sooner.

Member

Okay, let's keep the recall set to one. May we update precision_recall_curve's docstring to reflect this behavior?

Member
@thomasjpfan thomasjpfan left a comment

Thank you for your patience and working on this PR.

Comment on lines 967 to 968
y_true = np.array([[1, 0], [0, 1]])
y_score = np.array([[0, 1], [1, 0]])
Member

I think we can remove this, since the input is exactly the same as the one above it.

Comment on lines 626 to 627
- |Fix| Fix recall in multilabel classification when true labels are all negative.
:pr:`19085` by :user:`Varun Agrawal <varunagrawal>`.
Member

We can adjust the what's new entry to state which function is being fixed.

- |Fix| Fixes `average_precision_score` for multilabel classification when true labels are
   all negative. :pr:`19085` by :user:`Varun Agrawal <varunagrawal>`.

There also needs to be an entry for precision_recall_curve to describe the new behavior.

These entries need to be moved to doc/whats_new/v1.1.rst.


@jeremiedbb
Member

I merged main and addressed the comments from the reviews, along with the comments I was going to make myself. During an in-person discussion, @glemaitre suggested warning when there is no positive class in y_true.
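For illustration, a rough sketch of what such a warning could look like (a hypothetical helper, not the exact code that was merged):

import warnings

import numpy as np


def recall_with_warning(tps):
    # Hypothetical helper: when y_true contains no positive class,
    # tps[-1] == 0, so warn and set recall to one for all thresholds.
    if tps[-1] == 0:
        warnings.warn(
            "No positive class found in y_true, "
            "recall is set to one for all thresholds."
        )
        return np.ones(tps.size)
    return tps / tps[-1]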

Member
@jeremiedbb jeremiedbb left a comment

LGTM. @thomasjpfan and @glemaitre, do you want to take a look?

@jeremiedbb
Member

Note that setting the recall to one in that case leads to the following behavior, which seems acceptable and expected.

import numpy as np
from sklearn.metrics import precision_recall_curve
import matplotlib.pyplot as plt
from sklearn.metrics import PrecisionRecallDisplay

y_true = np.array([0, 0, 0, 0])
y_scores = np.array([0.1, 0.4, 0.35, 0.8])

display = PrecisionRecallDisplay.from_predictions(y_true, y_scores)
plt.show()

[screenshot: the resulting precision-recall display]

Member
@thomasjpfan thomasjpfan left a comment

Minor nits. I'm still okay with this solution.

@thomasjpfan thomasjpfan changed the title from "Fix recall in multilabel classification when true labels are all negative" to "FIX Fix recall in multilabel classification when true labels are all negative" on Mar 24, 2022
Member
@glemaitre glemaitre left a comment

LGTM, waiting for the CI to be green.

@jeremiedbb jeremiedbb merged commit ea0571f into scikit-learn:main Mar 25, 2022
@jeremiedbb
Member

Thanks @varunagrawal !

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Apr 6, 2022
…negative (scikit-learn#19085)

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: jeremiedbb <jeremiedbb@yahoo.fr>
seatim pushed a commit to seatim/birdclef2023 that referenced this pull request Apr 12, 2023
Note that scikit-learn>=1.1 is required in order to have valid values for
average precision score.  That is due to version 1.1+ of scikit-learn including
this fix:

    scikit-learn/scikit-learn#19085

NB: version 1.1+ of scikit-learn requires python 3.8+.
Successfully merging this pull request may close these issues.

average_precision_score does not return correct AP when all negative ground truth labels