[MRG] separate penalty factors for GLM regressions by xiaowei1234 · Pull Request #22485 · scikit-learn/scikit-learn · GitHub

[MRG] separate penalty factors for GLM regressions #22485


Open
wants to merge 20 commits into main

Conversation

@xiaowei1234 commented Feb 15, 2022

Task list

  • Finished unit tests and added unit test coverage
  • Checked for PEP8 violations and ran black for code style
  • Double-check functionality using statsmodels

Reference Issues/PRs

#11566

What does this implement/fix? Explain your changes.

  1. Added a check for a full-rank design matrix when alpha = 0.0 (reverted due to a comment from lorentzenchr)
  2. Allows alpha to be an iterable of non-negative values, in addition to a scalar value (see the usage sketch below)
  3. If alpha is an iterable, checks are run on every value of alpha, and the length of alpha must equal the second dimension (number of features) of the design matrix
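
A minimal usage sketch of the proposed behaviour (illustrative only; the data and alpha values are made up, and the per-feature alpha relies on this PR):

# Sketch of the proposed API: one non-negative penalty factor per column of X.
import numpy as np
from sklearn.linear_model import PoissonRegressor

rng = np.random.RandomState(0)
X = rng.uniform(size=(100, 3))
y = rng.poisson(lam=np.exp(X @ np.array([0.5, -0.2, 1.0])))

PoissonRegressor(alpha=1.0).fit(X, y)               # scalar alpha (existing behaviour)
PoissonRegressor(alpha=[1.0, 0.0, 10.0]).fit(X, y)  # per-feature alpha (this PR);
                                                    # 0.0 leaves that coefficient unpenalized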

Proof of correctness in script here:
https://github.com/MPCS-51042/final-project-xiaowei1234/blob/3b52d7ee67f2360aeace81b9479161b05c1c0ecb/proof.py

Link to output of proof script:
https://github.com/MPCS-51042/final-project-xiaowei1234/blob/3b52d7ee67f2360aeace81b9479161b05c1c0ecb/files/proof_output.log

@xiaowei1234 xiaowei1234 changed the title [WIP] separate penalty factors for GLM regressions [MRG] separate penalty factors for GLM regressions Feb 21, 2022
@agramfort (Member) left a comment

As I don't see any actual update to the code logic, I am not sure what you are adding here...

min_val=0.0,
include_boundaries="left",
)
self.alpha = np.asarray(self.alpha, dtype=np.float_).ravel()
Member

Rather than np.float_ I would use X.dtype

Author

I will change it to np.float64. I don't like the idea of using X.dtype, on the off chance that X.dtype is an integer dtype. Even if X is all integers, it would be very wrong for the penalty terms to be integers.

@@ -268,7 +264,29 @@ def fit(self, X, y, sample_weight=None):
y_numeric=True,
multi_output=False,
)

if hasattr(self.alpha, "__iter__") and not isinstance(self.alpha, str):
Member

See how testing for array-likes is done elsewhere; looking at private attributes for this is not clean.

Author

All the checks I saw would first convert the variable to a numpy array and then check its size. The problem here is that if the regression only has one variable, the code would get clunky when checking

np.asarray(self.alpha).size == 1

so instead I used the Iterable class from collections. It does result in an additional import, but it is a core Python class.

Member

Internally we use:

def _is_arraylike_not_scalar(array):
    """Return True if array is array-like and not a scalar"""
    return _is_arraylike(array) and not np.isscalar(array)

(The "not np.isscalar" part handles the string case)

@xiaowei1234 (Author)

As I don't see any actual update to the code logic, I am not sure what you are adding here...

The logic for using an array instead of a scalar already works in the code! All that was needed was to allow the class to take an iterable in addition to a scalar.

@glemaitre (Member)

Could you merge main into your branch so that it only contains the changes related to the GLM? There are plenty of changes in CI files that are not linked to this PR.

@xiaowei1234 (Author)

Could you merge main into your branch so that it only contains the changes related to the GLM? There are plenty of changes in CI files that are not linked to this PR.

done!

@agramfort (Member) commented Mar 4, 2022 via email

@xiaowei1234 (Author)

Setting a different alpha per feature can be done by using a different scaling per column. With an L1 reg, if you multiply a feature by 2, it is like dividing its regularization by 2. I would use this idea to have a unit test.

In the proof-of-correctness script you can see that the penalty is actually an L2 penalty and not an L1 penalty.

The gist below, which you can run on your own machine, shows that Lasso regression produces exactly the results you expect, but the GLM does not:

https://gist.github.com/xiaowei1234/08dbd72695e6d263546c973a9a5a330b

Even if the penalty were an L1 penalty, the results could not be reproduced in the same way, because the GLM is fit by maximum likelihood whereas Lasso regression minimizes a penalized least-squares objective.
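
For reference, the L1 scaling relation mentioned above can be checked with a quick standalone sketch (not part of this PR): with Lasso, scaling all of X by 2 while keeping alpha fixed gives the same fit as halving alpha on the original X, with the coefficients halved.

# Sketch: for an L1 penalty, scaling X by c is equivalent to dividing alpha by c
# (for a squared-L2 penalty, alpha has to scale with c ** 2 instead).
from numpy.testing import assert_allclose
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso

X, y = make_regression(n_samples=200, n_features=5, noise=1.0, random_state=0)

lasso_scaled = Lasso(alpha=1.0, tol=1e-12, max_iter=100_000).fit(2 * X, y)
lasso_half = Lasso(alpha=0.5, tol=1e-12, max_iter=100_000).fit(X, y)

# The coefficients on the scaled X are exactly half of those on the original X.
assert_allclose(2 * lasso_scaled.coef_, lasso_half.coef_, atol=1e-6)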

@agramfort (Member) commented Mar 4, 2022 via email

@xiaowei1234 (Author)

PoissonRegressor is using L2 regularization (squared L2 norm), so if you scale X by 2 then you need to multiply alpha by 4, not 2.

Well, I did as you asked and the results aren't matching:
https://gist.github.com/xiaowei1234/b2ebaac57769e2e50df98d62234904ae

I managed to get Ridge OLS to match by multiplying alpha by 2 and X by 4, but that doesn't work for the GLM.

@agramfort (Member)
I don't have time to look too deeply, but to me it's surprising that it does not work with Poisson. Maybe @lorentzenchr sees the issue more immediately...

@xiaowei1234 (Author)

I think the best course forward is to take the toy dataset, run it through statsmodels locally, and record the coefficients, then write a unit test using the toy dataset and the coefficients produced by statsmodels.
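
A rough sketch of what that local statsmodels cross-check might look like (assuming statsmodels' penalized GLM via GLM.fit_regularized with an elastic-net penalty and L1_wt=0.0 for a pure L2 penalty; the two libraries scale their penalties differently, so the alpha values below are illustrative only and would need to be translated before a strict comparison):

# Rough sketch only: fit the same Poisson model in both libraries and record
# the coefficients. The statsmodels alpha here is NOT matched to scikit-learn's.
import numpy as np
import statsmodels.api as sm
from sklearn.linear_model import PoissonRegressor

rng = np.random.RandomState(0)
X = rng.uniform(size=(200, 3))
y = rng.poisson(lam=np.exp(X @ np.array([0.3, -0.5, 1.0])))

skl = PoissonRegressor(alpha=1.0).fit(X, y)

sm_fit = sm.GLM(y, sm.add_constant(X), family=sm.families.Poisson()).fit_regularized(
    method="elastic_net", alpha=1.0, L1_wt=0.0
)

print("scikit-learn:", skl.intercept_, skl.coef_)
print("statsmodels: ", sm_fit.params)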

@thomasjpfan (Member) commented Mar 6, 2022

Here is a quick test that shows the relationship between alpha and scaling X:

import numpy as np
from numpy.testing import assert_allclose
from sklearn.linear_model import Ridge
from sklearn.datasets import make_low_rank_matrix
from sklearn.linear_model import PoissonRegressor

n_features = 10
rng = np.random.RandomState(42)
X = make_low_rank_matrix(n_samples=500, n_features=n_features, random_state=rng)
coef = rng.uniform(low=-2, high=2, size=n_features) / np.max(X, axis=0)
y = rng.poisson(lam=np.exp(X @ coef))

# Comparing coefs in ridge
ridge1 = Ridge(alpha=1).fit(X, y)
ridge4 = Ridge(alpha=4).fit(2*X, y)
assert_allclose(2 * ridge4.coef_, ridge1.coef_, atol=1e-4)

# Comparing coefs in poisson
poiss1 = PoissonRegressor(alpha=1).fit(X, y)
poiss4 = PoissonRegressor(alpha=4).fit(2 * X, y)
assert_allclose(2 * poiss4.coef_, poiss1.coef_, atol=1e-4)

Note the factor of 2 when comparing the coefs. (One can derive this relationship by writing out the minimization problem and seeing the relationship between X, alpha, and the coefficients.)
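
For completeness, a sketch of that derivation for the ridge case (the Poisson case is analogous, with the squared error replaced by the negative log-likelihood):

min_w ||y - (2X) w||^2 + 4 * alpha * ||w||^2

Substituting w = v / 2, so that (2X) w = X v, turns this into

min_v ||y - X v||^2 + 4 * alpha * ||v / 2||^2 = min_v ||y - X v||^2 + alpha * ||v||^2

which is exactly the alpha-penalized problem on the unscaled X. The two minimizers therefore satisfy v = 2 * w, i.e. 2 * ridge4.coef_ == ridge1.coef_.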

@xiaowei1234 (Author)

Here is a quick test that shows the relationship between alpha and scaling X:

Thanks! I didn't think of actually deriving it by hand myself. I included a new unit test following this scaling in my latest push.
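
For reference, a sketch of what such a scaling test could look like, assuming the per-feature alpha API proposed in this PR (the test actually added in the push may differ):

# Sketch, assuming this PR's iterable alpha: scaling column 0 of X by 2 while
# multiplying its penalty factor by 4 should halve coef_[0] and leave the other
# coefficients unchanged.
import numpy as np
from numpy.testing import assert_allclose
from sklearn.datasets import make_low_rank_matrix
from sklearn.linear_model import PoissonRegressor

n_features = 10
rng = np.random.RandomState(42)
X = make_low_rank_matrix(n_samples=500, n_features=n_features, random_state=rng)
coef = rng.uniform(low=-2, high=2, size=n_features) / np.max(X, axis=0)
y = rng.poisson(lam=np.exp(X @ coef))

X_scaled = X.copy()
X_scaled[:, 0] *= 2                 # scale only the first feature

alpha = np.ones(n_features)
alpha_scaled = alpha.copy()
alpha_scaled[0] *= 4                # the square of the column scale, as derived above

ref = PoissonRegressor(alpha=alpha).fit(X, y)
scaled = PoissonRegressor(alpha=alpha_scaled).fit(X_scaled, y)

expected = ref.coef_.copy()
expected[0] /= 2
assert_allclose(scaled.coef_, expected, atol=1e-4)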

@lorentzenchr (Member) left a comment

@xiaowei1234 Thanks for working on this.

@agramfort (Member) left a comment

The diff looks good, but we need to see how to deal with inconsistencies with the Ridge classes.

@xiaowei1234 (Author)

OK, cool. None of the GLMs support multiple targets, so it would require a fundamental rewrite to make them consistent.

@lorentzenchr (Member)

but we need to see how to deal with inconsistencies with the Ridge classes

None of the GLMs support multiple targets, so it would require a fundamental rewrite to make them consistent.

This API discussion is better continued (and hopefully worked out) in issue #11566.
