average_precision_score does not return correct AP when all negative ground truth labels #8245

Closed
varunagrawal opened this issue Jan 30, 2017 · 22 comments · Fixed by #19085

Comments

@varunagrawal
Contributor

Description

average_precision_score does not return correct AP when y_true is all negative labels.

Steps/Code to Reproduce

One can run this piece of dummy code:

import numpy as np
from sklearn.metrics import average_precision_score

average_precision_score(np.array([0, 0, 0, 0, 0]), np.array([0.1, 0.1, 0.1, 0.1, 0.1]))

It returns nan instead of the correct value, along with this warning:

RuntimeWarning: invalid value encountered in true_divide
recall = tps / tps[-1]

Expected Results

As per this Stack Overflow answer, Recall = 1 when FN = 0, since 100% of the TP were discovered, and Precision = 1 when FP = 0, since there were no spurious results.

Actual Results

Current output is:

/usr/local/lib/python3.5/dist-packages/sklearn/metrics/ranking.py:415: RuntimeWarning: invalid value encountered in true_divide
  recall = tps / tps[-1]
Out[201]: nan
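
For reference, a minimal sketch of where the nan comes from, recomputing the counts by hand for this exact input (this is not the library code itself):

import numpy as np

y_true = np.array([0, 0, 0, 0, 0])
y_score = np.array([0.1, 0.1, 0.1, 0.1, 0.1])

# At the single threshold 0.1 every sample is predicted positive,
# but none of them is a real positive.
tp = np.sum((y_score >= 0.1) & (y_true == 1))  # 0
fn = np.sum((y_score < 0.1) & (y_true == 1))   # 0
fp = np.sum((y_score >= 0.1) & (y_true == 0))  # 5

# recall = tp / (tp + fn) is 0 / 0 here, which is exactly the
# `recall = tps / tps[-1]` division that produces the nan above.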

Versions

Linux-4.4.0-59-generic-x86_64-with-Ubuntu-16.04-xenial
Python 3.5.2 (default, Nov 17 2016, 17:05:23)
[GCC 5.4.0 20160609]
NumPy 1.12.0
SciPy 0.18.1
Scikit-Learn 0.18.1

@varunagrawal changed the title from "average_precision_score does not return correct AP when y_true is all negative labels" to "average_precision_score does not return correct AP when all negative ground truth labels" on Jan 30, 2017
@varunagrawal
Contributor Author
varunagrawal commented Jan 30, 2017

If someone can help me understand how to correctly do the False Positive calculation, I can submit a Pull Request.

For this bug, recall will be 1 since there are 0 True Positives and 0 False Negatives. However, to calculate precision, I need to understand the correct way to find False Positives. The above sample results in _binary_clf_curve returning as many False Positives as there are samples. Should I simply use that as the cue and say that if TPs = 0 and FPs = len(y_true), then precision is 1?

I am not sure I quite understand how the threshold calculation is being performed here.
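
For reference, a simplified, hypothetical re-derivation of the cumulative counts that the threshold sweep produces for this input (this is not the actual _binary_clf_curve code, and it ignores the step that collapses tied scores into a single threshold):

import numpy as np

y_true = np.array([0, 0, 0, 0, 0])
y_score = np.array([0.1, 0.1, 0.1, 0.1, 0.1])

# Sort by descending score and accumulate positives and negatives seen so far.
order = np.argsort(-y_score, kind="stable")
y_sorted = y_true[order]

tps = np.cumsum(y_sorted)      # [0 0 0 0 0] -> tps[-1] == 0
fps = np.cumsum(1 - y_sorted)  # [1 2 3 4 5] -> FPs == len(y_true)

# With TP = 0 at every threshold, precision = tps / (tps + fps) is 0 everywhere
# under the usual formula, and recall = tps / tps[-1] is 0 / 0, hence the nan.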

@amueller
Member

Can you check if #7356 fixes this?

@lesteve
Member
lesteve commented Feb 1, 2017

Can you check if #7356 fixes this?

No it doesn't. I think we just need to do something like this to handle this edge case:

recall = np.ones_like(tps) if tps[-1] == 0 else tps / tps[-1]  # keep an array so the downstream code still works

@varunagrawal if you do a PR please add a non-regression test with only zeros in y_true.
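
A rough sketch of what such a non-regression test could look like (the test name is a placeholder, and since the exact value to return in this degenerate case is a convention choice, this version only guards against the nan):

import numpy as np
from sklearn.metrics import average_precision_score


def test_average_precision_score_all_negative_labels():
    # Non-regression test for #8245: an all-negative y_true used to
    # make average_precision_score return nan.
    y_true = np.zeros(5, dtype=int)
    y_score = np.full(5, 0.1)

    ap = average_precision_score(y_true, y_score)

    assert np.isfinite(ap)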

@varunagrawal
Contributor Author

@lesteve can you please also specify what needs to be done for precision? Or should that be as is?

@lesteve
Member
lesteve commented Feb 1, 2017

I believe the code works as it is. You can add a test with only 1s in y_true to make sure that precision is 1 in this case.
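
A sketch of that companion test (again with a placeholder name); with only positive labels there are no false positives at any threshold, so precision is 1 everywhere and the AP should come out as 1:

import numpy as np
import pytest
from sklearn.metrics import average_precision_score


def test_average_precision_score_all_positive_labels():
    # Every prediction is a true positive, so precision is 1 at every
    # threshold and the average precision should be 1.
    y_true = np.ones(5, dtype=int)
    y_score = np.array([0.9, 0.7, 0.5, 0.3, 0.1])

    assert average_precision_score(y_true, y_score) == pytest.approx(1.0)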

@varunagrawal
Contributor Author

@lesteve hoping you can take a look at the PR.

@gvishal
gvishal commented Oct 8, 2017

Any updates on this? Has this been merged?

@varunagrawal
Contributor Author

The current functionality of average_precision has changed. I'm planning to submit a new PR for that. Will close this when the other PR is ready.

@gvishal
gvishal commented Oct 9, 2017

The standard TREC Eval is able to compute AP and other metrics on the same data.

@kangkang59812

So how did you solve it?

@crypdick
Contributor

What's the status on this?

@MarkCarbonell98

Same problem here with sklearn 0.23.2

@varunagrawal
Contributor Author

Finally got the PR in. Sorry about the delay.

@Lorenz92

Any update on this issue?

@mcever
mcever commented Sep 20, 2021

Just updated sklearn and this still appears to be an issue?

@varunagrawal
Contributor Author

It's been 5 years now with this issue. I've opened the PR and updated it countless times, but the only blocker is the approving review.

@Lorenz92

Hi @lesteve, are you planning to approve this PR? Please let us know. Thank you!

@mcever
mcever commented Sep 21, 2021

@varunagrawal thanks for submitting a fix :) Can you link your update so that I can copy it to my own code base? Also, I think this issue also presents itself when a class has no examples. For example, the following y_true will obviously trigger this bug:

   [0., 0., 0., 0., 0.],
   [0., 1., 1., 0., 0.],
   [0., 0., 1., 0., 0.],
   [1., 1., 0., 0., 0.],

But I think it will still be present even after removing the all-zero row, because the final column is all 0s. Do you also find this to be the case? If so, does your PR cover it? If not, perhaps there's something wrong with my own code.
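
In case it helps with debugging, a hedged sketch of how to score each class separately so the offending columns are visible (y_score here is made up just to have something runnable; average=None returns one AP per column):

import numpy as np
from sklearn.metrics import average_precision_score

y_true = np.array([
    [0., 0., 0., 0., 0.],
    [0., 1., 1., 0., 0.],
    [0., 0., 1., 0., 0.],
    [1., 1., 0., 0., 0.],
])
y_score = np.random.RandomState(0).rand(*y_true.shape)

# One AP per class/column; the columns with no positive label at all
# (the last two here) are the ones that hit this issue, whether or not
# the all-zero row is kept.
print(average_precision_score(y_true, y_score, average=None))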

@ekosman
ekosman commented Oct 24, 2021

Hey there. Is this issue going to be fixed?

@redsphinx
redsphinx commented May 2, 2022

Hi, it's 2022 now and this is still an issue. Any plans for fixing it?

This issue persists when y_true is all -1s or all 0s.

@jeremiedbb
Member

Hi, it's 2022 now and this issue has been fixed (see #19085). I encourage you to test the release candidate for version 1.1.0: `pip install scikit-learn==1.1.0rc1`.

@redsphinx

pip install scikit-learn==1.1.0rc1

OK, this works with both all -1 and all 0. Thanks!
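
For anyone landing here later, a quick sanity check that the installed version has the fix; the exact value returned for the all-negative case follows the convention adopted in #19085, so this just prints it and confirms it is no longer nan:

import numpy as np
import sklearn
from sklearn.metrics import average_precision_score

print(sklearn.__version__)  # 1.1.0rc1 or later includes the fix

y_true = np.zeros(5, dtype=int)
y_score = np.full(5, 0.1)

ap = average_precision_score(y_true, y_score)
print(ap, np.isfinite(ap))  # a defined value instead of nan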
