Algorithm description for RandomizedLogisticRegression and RandomizedLasso is inaccurate · Issue #6493 · scikit-learn/scikit-learn · GitHub


Closed
hlin117 opened this issue Mar 5, 2016 · 6 comments

Comments

@hlin117
Contributor
hlin117 commented Mar 5, 2016

The algorithm descriptions for RandomizedLogisticRegression and RandomizedLasso are as follows:

Randomized Logistic Regression
Randomized Regression works by resampling the train data and computing a LogisticRegression on each resampling. In short, the features selected more often are good features. It is also known as stability selection.

Randomized Lasso.
Randomized Lasso works by resampling the train data and computing a Lasso on each resampling. In short, the features selected more often are good features. It is also known as stability selection.

I don't think these descriptions are accurate. In the original paper, the randomized lasso (and, by extension, the randomized logistic regression) is defined as follows:

[Screenshot of the Randomized Lasso definition from the paper: a weighted Lasso objective in which the penalty on each coefficient beta_k is scaled by a random per-feature weight W_k.]

(We would then find multiple values of beta-hat using randomly chosen values for W)

In other words, the algorithm randomly perturbs the penalty weights of the features; it does not resample the training set and fit to those samples (i.e., it does not bootstrap).

As the documentation is currently written, it sounds like we are resampling the training set in a bootstrap-like fashion. It should instead clarify that we randomly reweight each feature every time we fit Lasso / LogisticRegression to the data.
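To make the reweighting concrete, here is a minimal sketch of the idea (not the scikit-learn implementation): each feature column is scaled by a random weight drawn from [alpha, 1], an ordinary Lasso is fit to the rescaled data, and selection frequencies are accumulated over repetitions. The regularization strength 0.1 and weakness 0.5 are arbitrary illustrative values.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso

rng = np.random.RandomState(0)
X, y = make_regression(n_samples=100, n_features=20, n_informative=5,
                       random_state=0)

alpha_weak = 0.5        # "weakness" parameter alpha in (0, 1]
n_resamplings = 50
selected = np.zeros(X.shape[1])

for _ in range(n_resamplings):
    # Random per-feature weights W_k; scaling X[:, k] by W_k is
    # equivalent to penalizing beta_k by lambda / W_k.
    W = rng.uniform(alpha_weak, 1.0, size=X.shape[1])
    lasso = Lasso(alpha=0.1).fit(X * W, y)
    selected += lasso.coef_ != 0

scores = selected / n_resamplings   # selection frequency per feature
print(scores)
```

Features with a high selection frequency across the randomized fits are the "stable" ones.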

Thoughts, @agramfort, @GaelVaroquaux ?

@clamus
clamus commented Mar 6, 2016

I thought this was confusing too. After looking briefly at the code it seems that the algorithm actually does randomization via both the W_k as well as subsampling the data. This "double" randomization is described in Remark 4 of Meinshausen & Bühlmann, and this is what they recommend. Maybe the documentation can be extended to mention the double randomization?
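For illustration, the "double" randomization amounts to combining both sources of noise in each iteration: a random subsample of the rows plus random per-feature weights. This is only a sketch under those assumptions, not the library's code; the subsample fraction of one half and the weight range [0.5, 1] are illustrative choices.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso

rng = np.random.RandomState(0)
X, y = make_regression(n_samples=100, n_features=20, n_informative=5,
                       random_state=0)
n, p = X.shape
selected = np.zeros(p)
n_resamplings = 50

for _ in range(n_resamplings):
    idx = rng.choice(n, size=n // 2, replace=False)   # subsample rows
    W = rng.uniform(0.5, 1.0, size=p)                 # reweight features
    coef = Lasso(alpha=0.1).fit(X[idx] * W, y[idx]).coef_
    selected += coef != 0

scores = selected / n_resamplings
print(scores)
```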

@hlin117
Contributor Author
hlin117 commented Mar 6, 2016

I think that's a great idea, @clamus. Thanks for checking the source code. At the very least though, the documentation should mention the randomization of the weights, because that's what Meinshausen & Bühlmann defined as their "Randomized Lasso / Logistic Regression".

@agramfort
Member
agramfort commented Mar 6, 2016 via email

@clamus
clamus commented Mar 6, 2016

I'll do it if it is ok with @hlin117

@hlin117
Contributor Author
hlin117 commented Mar 6, 2016

@clamus. Sure, be my guest.

Can you write the docs so that the equation is embedded in there too? Something similar to the Lasso documentation:
http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Lasso.html

@clamus
clamus commented Mar 6, 2016

yep sure
