MAINT remove normalize parameter and subsequent clean-up #27855

glemaitre · 2023-11-27T09:38:21Z

Remove the parameter normalize that has been deprecating in OMP and least angle estimators.

remove part of code that was doing dome processing when normalize=True
make _preprocess_data keyword only to be more explicit in terms of parameter names.

github-actions · 2023-11-27T09:39:41Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: cc5465d. Link to the linter CI: here}

lorentzenchr · 2023-11-28T18:16:41Z

@rth @agramfort @maikia @jnothman pinging as possible reviewers, taken from #3020.

ogrisel · 2023-11-30T10:41:12Z

The docstring of _preprocess_data needs a big update. I am working on it and will push a commit.

ogrisel

LGTM.

We should definitely remove all the occurrences of X_scale now that it's useless but we can do that later (it should only be private API changes and no behavioral change).

sklearn/linear_model/_base.py

ogrisel · 2023-12-01T17:16:32Z

sklearn/linear_model/_base.py

@@ -189,45 +109,51 @@ def make_dataset(X, y, sample_weight, random_state=None):
 def _preprocess_data(
    X,
    y,
+    *,


Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

lesteve · 2023-12-04T12:54:57Z

sklearn/linear_model/tests/test_base.py

@@ -415,36 +378,23 @@ def test_preprocess_data(global_random_seed):
    X = rng.rand(n_samples, n_features)
    y = rng.rand(n_samples)
    expected_X_mean = np.mean(X, axis=0)
-    expected_X_scale = np.std(X, axis=0) * np.sqrt(X.shape[0])
+    np.std(X, axis=0) * np.sqrt(X.shape[0])


This line can be removed, right?

Suggested change

np.std(X, axis=0) * np.sqrt(X.shape[0])

I have pushed a commit removing this line and a similar one below

lesteve · 2023-12-04T12:57:05Z

sklearn/linear_model/tests/test_base.py

@@ -586,43 +485,30 @@ def test_sparse_preprocess_data_offsets(global_random_seed, lil_container):
    X = lil_container(X)
    y = rng.rand(n_samples)
    XA = X.toarray()
-    expected_X_scale = np.std(XA, axis=0) * np.sqrt(X.shape[0])
+    np.std(XA, axis=0) * np.sqrt(X.shape[0])


Suggested change

np.std(XA, axis=0) * np.sqrt(X.shape[0])

glemaitre · 2023-12-04T13:50:50Z

Thanks @lesteve for removing the left over.

lesteve

LGTM

MAINT remove normalize parameter in OMP and least angle

6bf3350

glemaitre marked this pull request as draft November 27, 2023 09:38

github-actions bot added the module:linear_model label Nov 27, 2023

glemaitre added the No Changelog Needed label Nov 27, 2023

follow-up remove all normalize

750402c

glemaitre marked this pull request as ready for review November 27, 2023 10:30

glemaitre changed the title ~~MAINT remove normalize parameter in OMP and least angle~~ MAINT remove normalize parameter and subsequent clean-up Nov 27, 2023

ogrisel self-assigned this Nov 30, 2023

Update the _preprocess_data docstring to reflect what it actually does

b68f911

ogrisel approved these changes Dec 1, 2023

View reviewed changes

ogrisel added this to the 1.4 milestone Dec 1, 2023

Update sklearn/linear_model/_base.py

5e58cd3

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

lesteve reviewed Dec 4, 2023

View reviewed changes

lesteve added 2 commits December 4, 2023 14:35

Remove unneeded lines

1cdf4c2

Remove unneeded filterwarnings

cc5465d

lesteve approved these changes Dec 4, 2023

View reviewed changes

lesteve enabled auto-merge (squash) December 4, 2023 13:55

lesteve merged commit ae2c80a into scikit-learn:main Dec 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MAINT remove normalize parameter and subsequent clean-up #27855

MAINT remove normalize parameter and subsequent clean-up #27855

MAINT remove normalize parameter and subsequent clean-up #27855

MAINT remove normalize parameter and subsequent clean-up #27855

Conversation

✔️ Linting Passed

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment