don't overwrite precompute=True in lassocv by amueller · Pull Request #14591 · scikit-learn/scikit-learn


Merged · 6 commits merged into scikit-learn:master on Aug 22, 2019

Conversation

amueller
Member
@amueller amueller commented Aug 7, 2019

Fixes #11014, alternative to #11021

Unfortunately this can't be tested (in a way I can see) because LassoCV doesn't store the underlying model

Member
@jnothman jnothman left a comment

(could be tested with a mock)

@amueller
Member Author
amueller commented Aug 8, 2019

Oh, you'd mock the model attribute and check that the setting is passed to it correctly? How would that work?
I don't think it's necessary here, but I'd love to learn from your mocking skills ;)

@jnothman
Member
jnothman commented Aug 8, 2019 via email

@amueller
Member Author
amueller commented Aug 9, 2019

cc @agramfort maybe?

model.precompute = False
precompute = getattr(self, "precompute", None)
if isinstance(precompute, str) and precompute == "auto":
    model.precompute = False
Member

Shouldn't there be an,

elif isinstance(precompute, bool):
    model.precompute = precompute

?
Say in Lasso the default for dense is precompute=False, which means that even if precompute=True is passed to LassoCV, it would still not be used. Or am I missing something?

Member

I don't think it's the right fix. The _pre_fit function uses:

precompute = (n_samples > n_features)

I would introduce a private function

def _get_precompute(X, precompute):
    if isinstance(precompute, str) and precompute == "auto":
        n_samples, n_features = X.shape
        return n_samples > n_features
    else:
        return precompute

that gets called in both _pre_fit and here, taking self.precompute as input.

Member

Yes, I agree we should use the same strategy for the path and the single fit at the end. Otherwise it's dangerous, as seen with the reported issue.

Now, this being said, it's quite uncommon to use a Lasso with such n_samples >> n_features, @Meta95; that's why I guess the problem was never reported.

Member Author

@rth model.precompute was already set above to precompute as part of common_params.
@agramfort In Lasso, we removed the "auto" option because it was not helpful according to #3249. So I assume that for a single fit, setting it to False is a better choice than using the heuristic here. That would mean that if it's set to "auto", the path algorithm might use precompute=True while the last fit doesn't. But I think that makes sense, given that the path algorithm reuses results while the single fit doesn't.
That's purely based on the discussion in #3249 though.
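In effect the behavior being discussed amounts to the following (standalone sketch; `FinalModel` and `set_final_precompute` are illustrative names, not the scikit-learn source): explicit True/False from the user is forwarded to the final fit, while "auto" is resolved to False rather than to the path heuristic.

```python
class FinalModel:
    """Toy stand-in for the Lasso instance used for the final fit."""
    def __init__(self):
        self.precompute = None

def set_final_precompute(model, user_precompute):
    # Forward the user's explicit True/False; only the "auto"
    # heuristic is downgraded to False for the single final fit.
    if isinstance(user_precompute, str) and user_precompute == "auto":
        model.precompute = False
    else:
        model.precompute = user_precompute
    return model

print(set_final_precompute(FinalModel(), "auto").precompute)  # False
print(set_final_precompute(FinalModel(), True).precompute)    # True
```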

Member

OK, you're right: it's the right fix, as 'auto' was removed from Lasso. Now I realize that 'auto' could/should have been kept with a rule like (n_samples > 3 * n_features), for example (3 or more based on a benchmark). Anyhow, +1 for MRG as it is. Thx @amueller for taking a stab at this.

TST Adds test to precompute
super().fit(X, y)
assert self.precompute == inner_precompute

monkeypatch.setattr("sklearn.linear_model.coordinate_descent.Lasso",
Member

You humoured me!

However this test can pass if LassoCV does not use Lasso. So you need to assert (or believe coverage) that the mock is used.
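The concern can be illustrated with a self-contained toy (names are illustrative, not the scikit-learn test verbatim): a recording subclass stands in for the inner estimator, and an assertion on the recorded calls proves the mock was actually used — a `LassoCV` that never instantiated `Lasso` would otherwise pass vacuously.

```python
# Toy stand-ins for the real estimators (illustrative only).
class Lasso:
    def __init__(self, precompute=False):
        self.precompute = precompute
    def fit(self, X, y):
        return self

class LassoCV:
    def __init__(self, precompute="auto"):
        self.precompute = precompute
    def fit(self, X, y):
        # The final fit forwards the user's precompute setting; the name
        # "Lasso" is looked up at call time, so it can be patched.
        model = Lasso(precompute=self.precompute)
        model.fit(X, y)
        return self

calls = []

class CheckingLasso(Lasso):
    """Mock that records what precompute value it was fitted with."""
    def fit(self, X, y):
        calls.append(self.precompute)
        return super().fit(X, y)

# pytest's monkeypatch.setattr patches the module attribute the same way.
Lasso = CheckingLasso

LassoCV(precompute=True).fit(None, None)
# A non-empty `calls` proves the mock was really used.
print(calls)  # [True]
```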

Member Author

@thomasjpfan did ;)

Member Author

Surprisingly, monkeypatch doesn't seem to keep track? I could probably make it inherit from Mock or something, but this ugly solution seems to work?

Member
@jnothman jnothman left a comment

What's new?

@amueller
Member Author

@jnothman will add tomorrow. I would love to hear back from @agramfort about whether he agrees with the current behavior.

@amueller
Member Author

merge?

@agramfort agramfort merged commit 17786ae into scikit-learn:master Aug 22, 2019
@agramfort
Member

thx @amueller

Successfully merging this pull request may close these issues.

LassoCV always sets precompute to False before fitting the chosen alpha value
5 participants