DOC Increase execution speed of plot_sgd_comparison #21610

ghost · 2021-11-09T20:00:47Z

Adapted a few things in the module examples/linear_model/plot_sgd_comparison to make it a bit faster:

Tuned down the number of rounds/loops by 25 %
Adapted the max_iter parameter to a point where execution is still fast but parameter doesn't kill the optimization process
Added the option of a warm_start to a bunch of classifiers

…n to increase execution speed

adrinjalali · 2021-11-09T21:39:28Z

@sveneschlbeck it'd be useful to post the output of the example (text, and image if it produces any) before and after your change, and the time it takes to run on your machine (you can do time python /path/to/example.py to get the times in linux, and mac I think.

there's also failing lint tests in your PR, you should enable black on your clone. See here for more info, you can also activate git hooks on your clone which would do this automatically.

ghost · 2021-11-09T23:11:12Z

@adrinjalali I ran test using the datetime.datetime.now() measurement function and ran both scripts (original and mine) in an interactive kernel with the exact same conditions. My version took around half the time:

…nto speed_increased_example_modlinear

adrinjalali · 2021-11-10T13:23:04Z

@ogrisel @mblondel you've had opinions on this example in the past, it'd be nice if you could check this.

thomasjpfan

Thank you for the PR @sveneschlbeck !

I think that setting warm_start=True changes the narrative of the example. The model trained on a previous round will influence the model in the next.

examples/linear_model/plot_sgd_comparison.py

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

ghost · 2021-11-23T18:22:37Z

@thomasjpfan Agreed, we also had a similar discussion at another example and concluded that it may change the example's character. Also, some examples are never run multiple times in a row, making the param toothless. Will remove warm_start.

Thanks for the input!

adrinjalali · 2021-11-24T13:15:03Z

Please apply black on your code to pass the linters in the CI

ghost · 2021-11-24T13:16:39Z

@adrinjalali Will do

thomasjpfan · 2021-11-26T22:28:26Z

The error on circleci should be fixed by syncing with main.

thomasjpfan

LGTM

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Small module adaptions such as parameter tuning or parameter extensio…

f3acedc

…n to increase execution speed

sveneschlbeck added 2 commits November 10, 2021 00:13

ran Black against the code

2ef4730

Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…

13f2a0a

…nto speed_increased_example_modlinear

adrinjalali mentioned this pull request Nov 10, 2021

Accelerate slow examples #21598

Closed

41 tasks

adrinjalali changed the title ~~Increase execution speed of plot_sgd_comparison through parameter adaptions~~ Increase execution speed of plot_sgd_comparison.py through parameter adaptions Nov 12, 2021

thomasjpfan reviewed Nov 23, 2021

View reviewed changes

examples/linear_model/plot_sgd_comparison.py Outdated Show resolved Hide resolved

Update examples/linear_model/plot_sgd_comparison.py

91bd69d

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Removing warm_start params

97565cd

sveneschlbeck added 2 commits November 24, 2021 14:30

Ran black against code to pass CI

c8176dc

Ran black against code to pass CI

781eb86

Merge branch 'scikit-learn:main' into speed_increased_example_modlinear

249ead3

adrinjalali approved these changes Dec 8, 2021

View reviewed changes

thomasjpfan approved these changes Dec 9, 2021

View reviewed changes

thomasjpfan changed the title ~~Increase execution speed of plot_sgd_comparison.py through parameter adaptions~~ DOC Increase execution speed of plot_sgd_comparison.py through parameter adaptions Dec 9, 2021

thomasjpfan changed the title ~~DOC Increase execution speed of plot_sgd_comparison.py through parameter adaptions~~ DOC Increase execution speed of plot_sgd_comparison Dec 9, 2021

thomasjpfan merged commit c29fe81 into scikit-learn:main Dec 9, 2021

github-actions bot added the Documentation label Dec 9, 2021

ghost deleted the speed_increased_example_modlinear branch December 9, 2021 18:04

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Dec 24, 2021

DOC Increase execution speed of plot_sgd_comparison (scikit-learn#21610)

ddc2367

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

glemaitre pushed a commit that referenced this pull request Dec 25, 2021

DOC Increase execution speed of plot_sgd_comparison (#21610)

2017d99

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

DOC Increase execution speed of plot_sgd_comparison #21610

DOC Increase execution speed of plot_sgd_comparison #21610

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DOC Increase execution speed of plot_sgd_comparison #21610

DOC Increase execution speed of plot_sgd_comparison #21610

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!