log_loss giving nan when input is np.float32 and eps is default #24315

Closed
gsiisg opened this issue Sep 1, 2022 · 9 comments · Fixed by #24354

gsiisg commented Sep 1, 2022

Describe the bug

When the input is a NumPy array of np.float32, 1 - eps (with the default eps=1e-15) evaluates to 1.0 in float32, so log_loss() ends up computing log(1 - p) with p = 1.0 and returns nan.

Steps/Code to Reproduce

from sklearn.metrics import log_loss
import numpy as np
input = np.array([1],dtype=np.float32)

# when the input is an array of np.float32, explicitly passing eps=np.finfo(np.float32).eps makes log_loss work fine
result = log_loss([[0,1]],input,eps=np.finfo(np.float32).eps)
print('with eps=np.finfo(np.float32).eps:',result)

# with input cast as np.float64, log_loss is also fine with the default eps=1e-15
result = log_loss([[0,1]],input.astype(np.float64))
print('with input as np.float64:',result)

# however, passing the np.float32 array with the default eps=1e-15 gives nan
result = log_loss([[0,1]],input)
print('with eps=1e-15 (default):',result)

Expected Results

not nan

Actual Results

with eps=1e-15 (default): nan

/Users/gso/anaconda3/lib/python3.9/site-packages/sklearn/metrics/_classification.py:2442: RuntimeWarning: divide by zero encountered in log
  loss = -(transformed_labels * np.log(y_pred)).sum(axis=1)
/Users/gso/anaconda3/lib/python3.9/site-packages/sklearn/metrics/_classification.py:2442: RuntimeWarning: invalid value encountered in multiply
  loss = -(transformed_labels * np.log(y_pred)).sum(axis=1)

Versions

System:
    python: 3.9.12 (main, Apr  5 2022, 01:53:17)  [Clang 12.0.0 ]
executable: /Users/gso/anaconda3/bin/python
   machine: macOS-10.16-x86_64-i386-64bit

Python dependencies:
          pip: 21.2.4
   setuptools: 61.2.0
      sklearn: 1.0.2
        numpy: 1.21.5
        scipy: 1.7.3
       Cython: 0.29.28
       pandas: 1.4.2
   matplotlib: 3.5.1
       joblib: 1.1.0
threadpoolctl: 2.2.0

Built with OpenMP: True
gsiisg added the Bug and Needs Triage labels on Sep 1, 2022
gsiisg (Author) commented Sep 1, 2022

I traced the behavior in sklearn.metrics.log_loss to np.clip not giving the correct answer for 1-eps: when the input is a NumPy array of np.float32 and eps is the default 1e-15, 1 - eps evaluates to 1.0, which causes log(1-p) with p=1.0 to give nan. I initially went to report this to NumPy, but they say scikit-learn is not supporting np.float32 arrays properly. Please see:
numpy/numpy#22192
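
For reference, a minimal standalone sketch (plain NumPy, not scikit-learn code) of the rounding behavior that makes the clipping a no-op for float32 inputs:

import numpy as np

eps = 1e-15  # scikit-learn's default at the time of this report

# 1 - 1e-15 is not representable in float32; it rounds back to exactly 1.0
print(np.float32(1.0) - np.float32(eps) == np.float32(1.0))  # True
print(np.float64(1.0) - np.float64(eps) == np.float64(1.0))  # False

# so clipping a float32 prediction of 1.0 into [eps, 1 - eps] changes nothing
y_pred = np.array([1.0], dtype=np.float32)
clipped = np.clip(y_pred, eps, 1 - eps)
print(clipped)  # [1.] -- still exactly 1.0

# and log(1 - p) with p == 1.0 is -inf, which becomes nan once it is
# multiplied by a zero entry of the one-hot label inside log_loss
print(np.log(1.0 - clipped))        # [-inf]
print(0.0 * np.log(1.0 - clipped))  # [nan]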

Micky774 (Contributor) commented Sep 2, 2022

Seems reasonable enough. Would you like to open a PR updating the value to the float32 epsilon (i.e. np.finfo(np.float32).eps)?

Micky774 added the help wanted label and removed the Needs Triage label on Sep 2, 2022
Safikh (Contributor) commented Sep 3, 2022

Hi, Can I take this?

Micky774 (Contributor) commented Sep 4, 2022

Hi, Can I take this?

Yes, go ahead :)

@Safikh
Copy link
Contributor
Safikh commented Sep 4, 2022

@Micky774 We would face the same issue if we used a default epsilon of np.finfo(np.float32).eps and the user provided float16 input.
So should that be handled as well, by having a different eps value for every dtype? Or should we go with the float16 epsilon by default, since that is the largest?

Micky774 (Contributor) commented Sep 4, 2022

@Micky774 We would face the same issue if we used a default epsilon of np.finfo(np.float32).eps and the user provided float16 input.
So should that be handled as well, by having a different eps value for every dtype? Or should we go with the float16 epsilon by default, since that is the largest?

AFAIK we don't generally support FP16, so it would be preferable to stick with the FP32 epsilon.

Edit: The 'auto' option mentioned below handles this well.
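
For reference, the machine epsilons involved (a quick standalone check, not scikit-learn code); 1 - np.finfo(dtype).eps stays strictly below 1.0 in each dtype:

import numpy as np

for dtype in (np.float16, np.float32, np.float64):
    eps = np.finfo(dtype).eps
    below_one = dtype(1.0) - dtype(eps) < dtype(1.0)
    print(dtype.__name__, eps, below_one)

# float16 0.000977 True
# float32 1.1920929e-07 True
# float64 2.220446049250313e-16 True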

gsiisg (Author) commented Sep 4, 2022

pull request: #24357

  • added a check to see if the input is np.float32 or np.float16
  • if detected, change eps to match the input precision
  • added documentation describing the change
  • added a warning message to the user when this happens

The change @Safikh proposed would fix my immediate problem (thank you!), but:

  • as Safikh alluded to, setting a fixed new default does not account for np.float16 (this may not matter if everyone knows sklearn doesn't support float16, but I doubt that's common knowledge)
  • it would drastically decrease accuracy when the input is a plain list of numbers, because internally those are treated as float64 by Python
  • I would leave the default as is and just warn the user when we dynamically change eps because the input lacks 64-bit precision (see the sketch below)
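
A rough sketch of that dynamic-eps idea (hypothetical helper name, not the actual code in the pull request):

import warnings
import numpy as np


def _resolve_eps(y_pred, eps=1e-15):
    # Illustrative only: keep the float64-friendly default unless the
    # prediction array's dtype is too coarse for it, in which case fall
    # back to that dtype's machine epsilon and warn the user.
    y_pred = np.asarray(y_pred)
    if np.issubdtype(y_pred.dtype, np.floating):
        dtype_eps = np.finfo(y_pred.dtype).eps
        if eps < dtype_eps:
            warnings.warn(
                f"eps={eps} is below the precision of {y_pred.dtype}; "
                f"using {dtype_eps} instead."
            )
            return dtype_eps
    # plain Python lists and integer arrays end up as float64 internally
    return eps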

gsiisg (Author) commented Sep 5, 2022

Not sure how to fix this in the pull request
[Screenshot from the pull request: Screen Shot 2022-09-04 at 5 16 02 PM]

glemaitre (Member) commented

Just mentioning what @ogrisel proposed in one of the PRs.

It would be better to introduce an "auto" option that switches eps depending on the dtype, using np.finfo(y_pred.dtype).eps.
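
A minimal sketch of how such an "auto" default could resolve (illustrative only; the actual change landed via #24354 and may differ):

import numpy as np


def _clip_probabilities(y_pred, eps="auto"):
    # Illustrative only: clip predicted probabilities into [eps, 1 - eps],
    # picking eps from the array's own dtype when eps="auto".
    y_pred = np.asarray(y_pred)
    if eps == "auto":
        if np.issubdtype(y_pred.dtype, np.floating):
            eps = np.finfo(y_pred.dtype).eps
        else:
            eps = np.finfo(np.float64).eps
    return np.clip(y_pred, eps, 1 - eps)


# a float32 prediction of exactly 1.0 now clips to a value strictly below 1.0,
# so log(1 - p) stays finite
print(_clip_probabilities(np.array([1.0], dtype=np.float32)))  # [0.9999999]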
