Adaptive lasso by henridwyer · Pull Request #4912 · scikit-learn/scikit-learn · GitHub
Adaptive lasso #4912


Closed · wants to merge 24 commits

Conversation

@henridwyer commented Jun 30, 2015

Implements the adaptive Lasso and resolves issue #555.



@amueller (Member) commented Jul 1, 2015

Cool. Can you maybe post the plot from the example? Have you done any real-world comparison / benchmarks?

@amueller (Member) commented Jul 1, 2015

Hm, your references are much older than the NIPS reference. What is the difference in the NIPS paper?

@henridwyer (Author)

I couldn't find the NIPS paper, so I am not sure. Do you have a link to it?

1/n * ||y - X Beta||^2_2 + alpha * ||w * Beta||_1

where w is a weight vector calculated at the previous stage by::

    w_j = alpha / (|Beta_j|^gamma + eps)
Review comment (Member):

you should clarify what cost function you're actually minimizing with this reweighting scheme.

see this old snippet of mine:

https://gist.github.com/agramfort/1610922
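A minimal sketch of the reweighting scheme under discussion, in the spirit of the linked gist: each outer step rescales the columns of X by 1 / w_j and solves an ordinary Lasso, which is equivalent to applying a weighted l1 penalty. The function name, the defaults, and the choice to fold alpha into the Lasso's alpha rather than into the weights are illustrative assumptions, not the PR's code.

import numpy as np
from sklearn.linear_model import Lasso

def adaptive_lasso(X, y, alpha=1.0, gamma=1.0, eps=1e-3, n_iter=5):
    # Iteratively reweighted l1: fold the weights into a column rescaling
    # of X so that every step is a plain Lasso fit.
    n_features = X.shape[1]
    weights = np.ones(n_features)
    coef = np.zeros(n_features)
    for _ in range(n_iter):
        lasso = Lasso(alpha=alpha, fit_intercept=False)
        lasso.fit(X / weights, y)      # penalty becomes alpha * sum_j w_j * |beta_j|
        coef = lasso.coef_ / weights   # map back to the original feature scale
        weights = 1.0 / (np.abs(coef) ** gamma + eps)   # w_j = 1 / (|beta_j|^gamma + eps)
    return coef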

@agramfort (Member)

can you please share a screenshot of the output of the example?

@henridwyer (Author)

Here is the example graph:
[Image: adaptive lasso example graph]

I clarified the objective in the docstring.

I updated the docstring to be compliant with pep257 (I think?). For the fit method, I had copied the docstring from the ElasticNet fit function so I updated that one too.

I also changed the references a bit.

The Adaptive Lasso and Its Oracle Properties
Journal of the American Statistical Association
"""
def __init__(self, n_lasso_iterations=2, gamma=1, alpha=1.0,
Review comment (Member):

n_lasso_iterations=2 seems small. Also is eps=1e-3 a good default? I personally don't use eps (set it to zero) and I discard features once they have been zeroed.

Reply (Author):

Well I was thinking that the adaptive lasso corresponds to 2 steps, but this could be increased to more (5?).

About eps: 0.001 was the value they used in another article (added ref), that's why I figured it was a sensible default.

Reply (Member):

> Well I was thinking that the adaptive lasso corresponds to 2 steps, but this could be increased to more (5?).

You should iterate until the cost function stops decreasing, up to a given tolerance. My experience with neuroscience data is that 5 to 10 iterations is enough.

> About eps: 0.001 was the value they used in another article (added ref), that's why I figured it was a sensible default.

hum ok...

@henridwyer (Author)

I added calculation of the objective function, and iteration now continues until the improvement of the objective falls below ada_tol.

@henridwyer (Author)

I changed n_lasso_iterations to max_lasso_iterations and set the default value to 20

Gasso, G., Rakotomamonjy, A., & Canu, S.
Recovering Sparse Signals With a Certain Family of Nonconvex
Penalties and DC Programming
IEEE Trans. Signal Process., 4686-4698.
Review comment (Member):

what is the year?

@agramfort (Member) commented Feb 23, 2016 via email

@henridwyer reopened this Mar 23, 2016
@henridwyer (Author)

I rewrote the class with a few major changes:

  • changed the parameters. In particular there is now a penalty parameter to define which penalty to use.
  • added the log and SCAD penalties as separate penalties.
  • calculated the loss at each iteration to check for convergence.
  • more complete testing: testing for sparser coefficients and that the loss decreases at each iteration, which it should according to the paper.

I had a bit of trouble with getting _p and _p_prime to work, since the pickling test would fail (originally I had a factory method). Is this the right way to implement those?
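On the pickling point: a common workaround is to implement the penalty derivatives as plain module-level functions, which pickle fine, unlike closures returned by a factory method. The sketch below assumes the usual textbook definitions of the log, lq and SCAD penalties from the cited references; the names, signatures and defaults are illustrative rather than the PR's:

import numpy as np

def _log_prime(t, alpha, eps=1e-3):
    # derivative of the log penalty alpha * log(t + eps), for t >= 0
    return alpha / (t + eps)

def _lq_prime(t, alpha, q=0.5, eps=1e-3):
    # derivative of the lq penalty alpha * t**q, for 0 < q <= 1
    return alpha * q / (t + eps) ** (1.0 - q)

def _scad_prime(t, alpha, a=3.7):
    # piecewise derivative of the SCAD penalty (Fan & Li), with the usual a = 3.7
    return np.where(t <= alpha, alpha, np.maximum(a * alpha - t, 0.0) / (a - 1.0))

Any of these can then supply the weights for the reweighted fit, in the same way as self._p_prime(np.abs(self.coef_)) in the loop quoted further down.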

@agramfort (Member)

Sleeping over it, I feel this should be a metaestimator that allows you to reweight all linear models that expose a .coef_ attribute. It would then work for MultiTaskLasso, sparse logistic regression, Elastic-Net, etc.
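A rough sketch of what such a meta-estimator might look like, assuming a 1-D coef_ and no intercept for brevity; the class name, parameters and the column-rescaling trick are illustrative assumptions, not code from this PR:

import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin, clone
from sklearn.linear_model import Lasso

class ReweightedL1(BaseEstimator, RegressorMixin):
    # Refit any linear estimator exposing coef_ on column-rescaled data,
    # updating the per-feature penalty weights between refits.
    def __init__(self, estimator=None, n_reweightings=5, gamma=1.0, eps=1e-3):
        self.estimator = estimator
        self.n_reweightings = n_reweightings
        self.gamma = gamma
        self.eps = eps

    def fit(self, X, y):
        base = self.estimator if self.estimator is not None else Lasso(fit_intercept=False)
        weights = np.ones(X.shape[1])
        for _ in range(self.n_reweightings):
            est = clone(base).fit(X / weights, y)
            self.coef_ = est.coef_ / weights        # coefficients on the original scale
            weights = 1.0 / (np.abs(self.coef_) ** self.gamma + self.eps)
        self.estimator_ = est
        return self

    def predict(self, X):
        return X @ self.coef_                       # intercept deliberately ignored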

@henridwyer (Author)

So something like RANSACRegressor?

I can change it a bit to take an estimator as a parameter. Should it stay in the same file?

@agramfort (Member) commented Mar 23, 2016 via email

@henridwyer (Author)

Thinking about it a bit more, it makes sense to be able to use a multitask lasso and logistic regression - but I'm not sure about other estimators (I've never seen reweighted elasticnet).

@henridwyer (Author)

@agramfort any update?

@howthebodyworks

Updated the reference request in the attached issue. FWIW, the key point here for me would be simply allowing a "weight" for each observation so that the user can choose their own reweighting scheme, be it the Adaptive Lasso or one of the many others. This is the approach taken in R's glmnet, and people regularly publish new papers testing out new and different weighting schemes.

The optimization objective for the AdaptiveLasso is::

(1 / (2 * n_samples)) * ||y - X Beta||^2_2
+ alpha * \sum_j p(|Beta|_j)
Review comment (Member):

I think that should be |Beta_j| not |Beta|_j

@henridwyer (Author)

@danmackinlay what API are you imagining? You would want more granularity than the penalty and q parameters?


for k in xrange(1, self.max_lasso_iterations):
    self.n_iter_ = k
    weights = self._p_prime(np.abs(self.coef_))

Review comment:

This weighted fitting feels like it could potentially live in the parent class, since weights are not specific to the Adaptive Lasso but occur in other variants of the Lasso; what makes the Adaptive Lasso unique is how and when it chooses its weights, not that it has weights.

@howthebodyworks

@henridwyer sorry, that may not have been clear. For AdaptiveLasso I think the proposed API is great; my concern is rather with the implementation, specifically the relationship between it and Lasso. If I had my brain engaged I would have been clearer. Trying again:

I believe weighted coefficient penalties are useful for many Lasso variants that could subclass Lasso in addition to AdaptiveLasso, e.g. custom robust fitting, or a-priori-given weights because I don't want to penalise some coefficients, etc.
So my suggestion is that the weight calculation naturally belongs in AdaptiveLasso, but the weights should be optionally selectable in the parent Lasso class (as with e.g. the non-adaptive penalty.factor in glmnet), and AdaptiveLasso could then pass those weights to Lasso; a rough sketch of that usage follows below.

(I also think we should allow observation weights, like glmnet, FWIW, but that's a separate question.)
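In the meantime, fixed a-priori penalty weights (the analogue of glmnet's penalty.factor) can already be emulated with the plain Lasso via the same column-rescaling trick as in the sketch further up; the data and weights below are made up purely for illustration, and the weights must be strictly positive for the rescaling to work:

import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.RandomState(0)
X = rng.randn(50, 4)
y = X @ np.array([3.0, 0.0, -2.0, 0.5]) + 0.1 * rng.randn(50)

w = np.array([0.1, 1.0, 1.0, 5.0])       # penalise feature 0 lightly, feature 3 heavily

lasso = Lasso(alpha=0.1, fit_intercept=False).fit(X / w, y)
coef = lasso.coef_ / w                    # coefficients on the original feature scale
print(coef)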

equivalent to a Lasso, solved by the :class:`Lasso`, and
max_lasso_iterations = 2 is equivalent to an adaptive Lasso.

penalty : 'scad' | 'log' | 'lq' (default='lq')

Review comment:

Inconsistency: in the actual constructor the default is 'log'

The Adaptive Lasso and Its Oracle Properties
Journal of the American Statistical Association, 2006.
"""
def __init__(self, max_lasso_iterations=30, penalty='log', q=None,

Review comment:

Given that this is called adaptive Lasso, the log penalty as the default might be surprising; it should probably default to parameters matching the original paper, e.g. penalty='lq', q=1.

@agramfort (Member)

I don't think it deserves to be a core estimator. It's simple to implement and could eventually be a simple example.

@agramfort closed this Jul 16, 2018
----------
coef_ : array, shape (n_features,) | (n_targets, n_features)
parameter vector (w in the cost function formula)

Review comment:

I think this should be (Beta in the cost function formula)

or self.max_lasso_iterations <= 0:
    raise ValueError("Maximum number of Lasso iterations must be"
                     " positive; got (max_iter=%r)" % self.max_iter)

Review comment:

The ValueError message reports the max_iter parameter, but the condition is on max_lasso_iterations.
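A fix would presumably reference the parameter actually being validated, along these lines (a sketch; the elided first half of the original condition is left out here):

if self.max_lasso_iterations <= 0:
    raise ValueError("Maximum number of Lasso iterations must be"
                     " positive; got (max_lasso_iterations=%r)"
                     % self.max_lasso_iterations)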
