Fix error in `euclidean_distances` when X is float64 and `X_norm_squared` is float32 #27624

jeromedockes · 2023-10-19T18:32:07Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

euclidean_distances discards precomputed squared norms if they are in float32 to avoid numerical precision issues.
But in the current implementation there is a code path where the used squared distances end up being None, when the X and Y are float64 but the squared distances are float32.

This PR implements the following logic

if X_norm_squared is provided in float64: use it
otherwise if X is float64: use it to compute the squared norm
otherwise rely on _euclidean_distances_upcast (as is done ATM when X is float32)

and the same for Y

my impression is that this was the original intention but maybe @jeremiedbb can confirm

Any other comments?

github-actions · 2023-10-19T18:33:58Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: ee65beb. Link to the linter CI: here}

jjerphan

LGTM. Thank you, @jeromedockes.

Can you add a |Fix| changelog entry?

sklearn/metrics/tests/test_pairwise.py

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

jeremiedbb

LGTM. Thanks for the fix

dineshchitlangia

LGTM, non binding.

betatim · 2023-10-26T08:55:19Z

Thanks for the fix!

…red` is float32 (scikit-learn#27624) Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

use _euclidean_distances_upcast when needed in _euclidean_distances

9042ab7

github-actions bot added the module:metrics label Oct 19, 2023

jeromedockes added 2 commits October 19, 2023 20:45

formatting

a57af0b

Merge remote-tracking branch 'upstream/main' into fix_27621

b0d7763

jjerphan added the good first PR to review Simple atomic PR to review label Oct 20, 2023

jjerphan approved these changes Oct 21, 2023

View reviewed changes

sklearn/metrics/tests/test_pairwise.py Outdated Show resolved Hide resolved

jeromedockes and others added 2 commits October 23, 2023 10:11

Update sklearn/metrics/tests/test_pairwise.py

c54f998

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

add whatsnew entry

ee65beb

jeremiedbb approved these changes Oct 24, 2023

View reviewed changes

dineshchitlangia approved these changes Oct 26, 2023

View reviewed changes

betatim merged commit 93fa00c into scikit-learn:main Oct 26, 2023

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Oct 31, 2023

Fix error in euclidean_distances when X is float64 and `X_norm_squa…

4a51f89

…red` is float32 (scikit-learn#27624) Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023

Fix error in euclidean_distances when X is float64 and `X_norm_squa…

aee2fd0

…red` is float32 (scikit-learn#27624) Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix error in `euclidean_distances` when X is float64 and `X_norm_squared` is float32 #27624

Fix error in `euclidean_distances` when X is float64 and `X_norm_squared` is float32 #27624

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Fix error in euclidean_distances when X is float64 and X_norm_squared is float32 #27624

Fix error in euclidean_distances when X is float64 and X_norm_squared is float32 #27624

Uh oh!

Conversation

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

Uh oh!

✔️ Linting Passed

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Fix error in `euclidean_distances` when X is float64 and `X_norm_squared` is float32 #27624

Fix error in `euclidean_distances` when X is float64 and `X_norm_squared` is float32 #27624