[MRG] accelerate plot_successive_halving_iterations.py example #21598 #21612

sply88 · 2021-11-09T21:44:13Z

Speeds up ../examples/model_selection/plot_successive_halving_iterations.py (Issue #21598) by:

- reducing the size of the dataset (samples and features)
- reducing n_estimators
- reducing max_features

For me, the example now runs in 5 s (previously more than 13 s).
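For readers without the diff at hand, here is a minimal sketch of the kind of setup being tuned. It is not the example's actual code: the values n_samples=400, n_features=12 and the randint(1, 6) range for max_features are assumptions chosen for illustration, and only n_estimators=20 appears in this thread.

```python
import numpy as np
from scipy.stats import randint
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.experimental import enable_halving_search_cv  # noqa: F401
from sklearn.model_selection import HalvingRandomSearchCV

rng = np.random.RandomState(0)

# Assumed (illustrative) dataset size: fewer samples and features mean faster fits.
X, y = make_classification(n_samples=400, n_features=12, random_state=rng)

# Fewer trees also shorten every fit; 20 is the value discussed in this PR.
clf = RandomForestClassifier(n_estimators=20, random_state=rng)

# Assumed search space; a smaller max_features range is cheaper per split.
param_dist = {
    "max_depth": [3, None],
    "max_features": randint(1, 6),
    "min_samples_split": randint(2, 11),
}

rsh = HalvingRandomSearchCV(
    estimator=clf, param_distributions=param_dist, factor=2, random_state=rng
)
rsh.fit(X, y)
print(rsh.n_iterations_, rsh.best_params_)
```

All three knobs reduce the cost of each individual fit, which matters because successive halving refits every surviving candidate at each iteration.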

Reducing the number of samples also reduces the number of iterations during the search (now 5, previously 6). Final figure:

Original figure:
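The drop from 6 to 5 iterations can be checked with a little arithmetic. The sketch below assumes the search uses factor=2, 5-fold cross-validation on a binary problem, and a smallest budget of n_splits * 2 * n_classes samples; the sample counts 700 and 400 are hypothetical values picked to illustrate the effect, not figures taken from this thread.

```python
def n_halving_iterations(n_samples, factor=2, n_splits=5, n_classes=2):
    """Count successive-halving iterations under the assumptions stated above."""
    # Assumed smallest budget: 2 samples per class per CV fold.
    min_resources = n_splits * 2 * n_classes
    # How many times the smallest budget fits into the full dataset.
    ratio = n_samples // min_resources
    n_iter = 1
    # Each iteration multiplies the budget by `factor` until the data runs out.
    while ratio >= factor:
        ratio //= factor
        n_iter += 1
    return n_iter


print(n_halving_iterations(700))  # 6
print(n_halving_iterations(400))  # 5
```

Under these assumptions, any sample count from 320 to 639 gives 5 iterations, so the count only changes when the dataset shrinks past such a threshold.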

adrinjalali · 2021-11-10T13:32:50Z

@sply88 could you please paste both the before and after outputs of the example?

Your change is triggering an issue in the doc build. The CI fails.

sply88 · 2021-11-11T07:18:41Z

> Your change is triggering an issue in the doc build. The CI fails.

It seems that ax = mean_scores.plot(legend=False, alpha=0.6), a line that was already there before this PR, currently causes the example to fail during the doc build. It works in my local environment but not in the CI container.
I'm not sure how best to approach this. Any hints?
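For context, the failing call is a plain pandas line plot over a pivoted score table. The sketch below reproduces that step in isolation; the toy scores and the params_str labels "A" and "B" are made up for illustration, and the column names are assumptions about how the example arranges its results.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend, so no display is needed

import pandas as pd

# Toy stand-in for the search results (values invented for illustration only).
toy_results = pd.DataFrame(
    {
        "iter": [0, 0, 1, 1],
        "params_str": ["A", "B", "A", "B"],
        "mean_test_score": [0.80, 0.75, 0.85, 0.78],
    }
)

# One row per iteration, one column per candidate.
mean_scores = toy_results.pivot(
    index="iter", columns="params_str", values="mean_test_score"
)

# The call quoted above: one line per candidate across iterations.
ax = mean_scores.plot(legend=False, alpha=0.6)
ax.set_xlabel("iterations")
ax.set_ylabel("mean test score")
```

Since the call itself only needs pandas and matplotlib, a failure that shows up in one environment but not another points at installed package versions rather than at the example's logic.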

adrinjalali · 2021-11-12T11:05:54Z

@NicolasHug @rth, could you maybe help here?

thomasjpfan

Syncing with upstream should fix the issue. I think it is related to #21607 and how numpy is being updated in the min-doc build when installing the latest version of PyWavelets.

sply88 · 2021-11-27T20:09:43Z

> Syncing with upstream should fix the issue. I think it is related to #21607 and how numpy is being updated in the min-doc build when installing the latest version of PyWavelets

That works. Thanks for pointing that out, @thomasjpfan.

thomasjpfan · 2021-11-27T23:45:35Z

examples/model_selection/plot_successive_halving_iterations.py


-clf = RandomForestClassifier(n_estimators=20, random_state=rng)
+clf = RandomForestClassifier(n_estimators=15, random_state=rng)


Can we leave this at 20? I like how the last iteration shows a difference between the last two candidates:

I agree, the result looks better with a unique winner. It only takes around 1 s more on my machine. Thanks again for the fine-tuning!

thomasjpfan

LGTM

…earn#21598 (scikit-learn#21612) * accelerate plot_successive_halving_iterations.py example scikit-learn#21598 * n_estimators back to 20

accelerate plot_successive_halving_iterations.py example scikit-learn…

2c06b62

…#21598

sply88 changed the title ~~[MRG] accelerate plot_successive_halving_iterations.py example #21598~~ accelerate plot_successive_halving_iterations.py example #21598 Nov 10, 2021

adrinjalali mentioned this pull request Nov 10, 2021

Accelerate slow examples #21598

Closed

41 tasks

thomasjpfan reviewed Nov 24, 2021

Merge branch 'scikit-learn:main' into accelerate-plot_successive_halv…

42b98c2

…ing_iterations.py

sply88 changed the title ~~accelerate plot_successive_halving_iterations.py example #21598~~ [MRG] accelerate plot_successive_halving_iterations.py example #21598 Nov 27, 2021

thomasjpfan reviewed Nov 27, 2021

sply88 and others added 2 commits November 28, 2021 11:26

Merge branch 'scikit-learn:main' into accelerate-plot_successive_halv…

7ec2acf

…ing_iterations.py

n_estimators back to 20

db4d055

thomasjpfan approved these changes Nov 28, 2021

adrinjalali approved these changes Nov 29, 2021

adrinjalali merged commit 64d2fdb into scikit-learn:main Nov 29, 2021

glemaitre pushed a commit that referenced this pull request Dec 25, 2021

DOC accelerate plot_successive_halving_iterations.py example #21598 (#…

5cca8bf

…21612) * accelerate plot_successive_halving_iterations.py example #21598 * n_estimators back to 20
