Increase execution speed of plot_sparse_logistic_regression_20newsgroup.py #21619
Conversation
Thanks @sveneschlbeck
@@ -82,6 +82,7 @@
     penalty="l1",
     max_iter=this_max_iter,
     random_state=42,
+    warm_start=True,
Are you sure this has an effect? I'm not sure it should in this case.
I am not that deep into how exactly they run these modules/their tests, but this could have an influence depending on how often and in what order they are executed (e.g. multiple runs).
Since you are creating a new `lr` instance each time you call fit, it should not have any impact. `warm_start=True` could have an impact if you moved the creation of the `lr` instance before entering the `for this_max_iter in model_params["iters"]` loop and then only called `lr.set_params(max_iter=this_max_iter)` inside the inner loop prior to calling fit. However, the wall time measurements would then get a different meaning and would need to be readjusted before plotting. I have the feeling that this would be too tricky to achieve or would render the example too complex.
Let's just use the training set size reduction and revert the use of `warm_start=True`.
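For reference, the pattern described above could look roughly like the following. This is a hedged sketch, not the example's actual code: the data and iteration budgets are illustrative, and only the create-once / `set_params` / refit structure reflects the suggestion.

```python
# Sketch of the warm_start pattern discussed above: create the estimator once
# before the loop, then raise max_iter between fits so each call to fit
# resumes from the previous coefficients instead of starting from scratch.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=20, random_state=42)

lr = LogisticRegression(
    solver="saga",
    penalty="l1",
    random_state=42,
    warm_start=True,  # reuse coef_ from the previous call to fit
)

for this_max_iter in [1, 2, 4, 8]:  # illustrative iteration budgets
    lr.set_params(max_iter=this_max_iter)
    lr.fit(X, y)  # continues from the coefficients of the previous fit
    print(this_max_iter, lr.score(X, y))
```

As noted in the review, each `fit` call here only measures the *incremental* cost of the extra iterations, so any wall-time plot would need to accumulate the timings rather than report them per fit.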
@ogrisel Will do
@@ -40,7 +40,7 @@
 solver = "saga"

 # Turn down for faster run time
-n_samples = 10000
+n_samples = 7000
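To illustrate the trade-off behind this change, here is a minimal, hypothetical sketch of capping the training-set size: the dataset, cap value, and variable names are illustrative, not the 20newsgroups example's exact code.

```python
# Hypothetical sketch: capping the number of samples to speed up an example.
# A smaller cap runs faster but can cost accuracy, which is the trade-off
# being tuned in this PR.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, n_features=50, random_state=42)

n_samples = 500  # cap: fewer samples -> faster fit, possibly lower accuracy
X_small, y_small = X[:n_samples], y[:n_samples]

X_train, X_test, y_train, y_test = train_test_split(
    X_small, y_small, random_state=42
)
clf = LogisticRegression(solver="saga", max_iter=200, random_state=42)
clf.fit(X_train, y_train)
print(f"test accuracy with n_samples={n_samples}: {clf.score(X_test, y_test):.2f}")
```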
I'd even go lower, maybe 4000 would still be enough?
I'll look into it, probably 4000 is still enough :)
Yes, the time for the second example is now down to 5 sec (from 9), but accuracy also dropped a bit (from 0.7 to 0.6). I will check out 5000 and 6000.
Okay, 5000 seems to be a good compromise: time is down to under 7 sec with accuracy close to where it was before.
removed ``warm_start=True``
Thank you for working on this PR! The proposed change already appears in scikit-learn/examples/linear_model/plot_sparse_logistic_regression_20newsgroups.py (line 43, as of 3082d95), which was merged in #21773. With that in mind, I am closing this PR.
#21598 @sply88 @adrinjalali @cakiki
Adapted the number of samples and added the `warm_start` option for the logistic regression model. Execution time was reduced from 45 sec to 9 sec.