MAINT Parameters validation for chi2_kernel with gamma #26153

rand0wn · 2023-04-11T21:08:40Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Added automatic parameter validation for "sklearn.metrics.pairwise.chi2_kernel".

Any other comments?

The validation of "sklearn.metrics.pairwise.additive_chi2_kernel" doesn't include validation of gamma, so even through X and Y are revalidated, gamm 8000 a isn't, additive_chi2_kernel validation can be removed as it may not be needed.

…ling parameter gamma

sklearn/metrics/pairwise.py

…heck for X and Y as duplicate

…heck cannot be skipped as it also checks for sparse matrix

glemaitre · 2023-04-14T12:31:22Z

sklearn/tests/test_public_functions.py

@@ -242,6 +242,7 @@ def _check_function_param_validation(
    "sklearn.tree.export_text",
    "sklearn.tree.plot_tree",
    "sklearn.utils.gen_batches",
+    "sklearn.metrics.pairwise.chi2_kernel",


Can you put this line in alphabetic order

sklearn/metrics/pairwise.py

glemaitre · 2023-04-14T12:46:23Z

sklearn/metrics/pairwise.py

+    {
+        "X": ["array-like", "sparse matrix"],
+        "Y": ["array-like", "sparse matrix", None],
+        "gamma": [Interval(Real, None, None, closed="neither")],


I am also surprised here that it does fail because we don't allow ndarray which would be required by Gaussian processes.

Indeed, here we would trigger a regression:

In [2]: from sklearn.datasets import make_friedman2 ...: from sklearn.gaussian_process import GaussianProcessRegressor ...: from sklearn.gaussian_process.kernels import DotProduct, WhiteKernel, PairwiseKernel ...: X, y = make_friedman2(n_samples=500, noise=0, random_state=0) ...: kernel = DotProduct() + WhiteKernel() + PairwiseKernel(gamma=1.0, metric="chi2") ...: gpr = GaussianProcessRegressor(kernel=kernel, ...: random_state=0).fit(X, y)

InvalidParameterError: The 'gamma' parameter of chi2_kernel must be a float in the range (-inf, inf). Got array([1.]) instead.

So it would be useful to add in test_gpr.py this minimal example to be sure that we have a test triggering this issue.

Here, it means that we need:

Suggested change

"gamma": [Interval(Real, None, None, closed="neither")],

"gamma": [Interval(Real, None, None, closed="neither"), Hidden(np.ndarray)],

@rand0wn please also add a test with what @glemaitre points out. Also, why don't we want to document gamma as an array publicly?

Because it is an array with unique values. We found out that the only case that this happens is internal to the GP models that provide such entry.

We therefore should still accept this type of entry to avoid regression but we should not document it since we don't support gamma being several values in an array.

…, added ndarray in gamma for gaussian processes

…NT-chi2

rand0wn · 2023-04-21T08:19:56Z

Added changes for review again

sklearn/metrics/pairwise.py

github-actions · 2023-06-27T17:00:29Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 12ce33a. Link to the linter CI: here}

jeremiedbb

LGTM. Thanks @rand0wn

Co-authored-by: jeremiedbb <jeremiedbb@yahoo.fr>

MAINT Parameters validation for chi2_kernel with consideration of sca…

e0ffa0e

…ling parameter gamma

github-actions bot added the module:metrics label Apr 11, 2023

rand0wn changed the title ~~MAINT Parameters validation for chi2_kernel with consideration of sca…~~ MAINT Parameters validation for chi2_kernel with gamma Apr 11, 2023

2357juan reviewed Apr 11, 2023

View reviewed changes

sklearn/metrics/pairwise.py Outdated Show resolved Hide resolved

Abhishek added 2 commits April 12, 2023 10:26

MAINT Parameters validation for chi2_kernel - review change removed c…

7c846bd

…heck for X and Y as duplicate

MAINT Parameters validation for chi2_kernel - review change X and Y c…

a367e07

…heck cannot be skipped as it also checks for sparse matrix

jeremiedbb added No Changelog Needed Validation related to input validation labels Apr 12, 2023

glemaitre mentioned this pull request Apr 14, 2023

MAINT make it explicit that additive_chi2_kernel does not accept sparse matrix #26178

Merged

glemaitre reviewed Apr 14, 2023

View reviewed changes

glemaitre self-requested a review April 14, 2023 12:51

rand0wn closed this Apr 21, 2023

rand0wn force-pushed the MAINT-chi2 branch from a7b2627 to a187758 Compare April 21, 2023 07:59

Abhishek added 2 commits April 21, 2023 13:47

Added alphabatical order in public functions and removed sparse check…

1e976b6

…, added ndarray in gamma for gaussian processes

Merge branch 'MAINT-chi2' of github.com:rand0wn/scikit-learn into MAI…

10000

d337429

…NT-chi2

rand0wn reopened this Apr 21, 2023

Merge branch 'scikit-learn:main' into MAINT-chi2

63effac

adrinjalali reviewed Apr 27, 2023

View reviewed changes

sklearn/metrics/pairwise.py Outdated Show resolved Hide resolved

adrinjalali approved these changes May 4, 2023

View reviewed changes

rand0wn and others added 3 commits June 15, 2023 11:45

Merge branch 'main' into MAINT-chi2

7eec166

Merge remote-tracking branch 'upstream/main' into pr/rand0wn/26153

73c0f73

add prefer_skip_nested_validation

12ce33a

jeremiedbb approved these changes Jun 27, 2023

View reviewed changes

jeremiedbb enabled auto-merge (squash) June 27, 2023 17:01

jeremiedbb merged commit d2b9c80 into scikit-learn:main Jun 27, 2023

punndcoder28 pushed a commit to punndcoder28/scikit-learn that referenced this pull request Jul 29, 2023

MAINT Parameters validation for chi2_kernel (scikit-learn#26153)

76b97dc

Co-authored-by: jeremiedbb <jeremiedbb@yahoo.fr>

REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023

MAINT Parameters validation for chi2_kernel (scikit-learn#26153)

cd63957

Co-authored-by: jeremiedbb <jeremiedbb@yahoo.fr>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MAINT Parameters validation for chi2_kernel with gamma #26153

MAINT Parameters validation for chi2_kernel with gamma #26153

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

	"gamma": [Interval(Real, None, None, closed="neither")],
	"gamma": [Interval(Real, None, None, closed="neither"), Hidden(np.ndarray)],

Uh oh!

MAINT Parameters validation for chi2_kernel with gamma #26153

MAINT Parameters validation for chi2_kernel with gamma #26153

Uh oh!

Conversation

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

✔️ Linting Passed

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!