[WIP] ENH estimator freezing to stop it being cloned/refit #8374

jnothman · 2017-02-16T14:49:15Z

A whole lot less magic than #8372, but still requires estimator to have a __dict__.

jnothman · 2017-02-16T14:58:24Z

(The semantics of freezing a list/array-like still needs clarification. If we want it to work with pipeline steps, it needs to know how to handle lists of tuples including estimators and strings.)

jnothman · 2017-02-16T15:01:13Z

Any opinion on whether freeze belongs in base or in some new model (sklearn.reuse for instnace)?

codecov · 2017-02-17T00:40:55Z

Codecov Report

Merging #8374 into master will increase coverage by <.01%.
The diff coverage is 95.91%.

@@            Coverage Diff             @@
##           master    #8374      +/-   ##
==========================================
+ Coverage   94.75%   94.75%   +<.01%     
==========================================
  Files         342      342              
  Lines       60801    60847      +46     
==========================================
+ Hits        57609    57653      +44     
- Misses       3192     3194       +2

Impacted Files	Coverage Δ
sklearn/tests/test_base.py	`97.2% <100%> (+0.4%)`	✅
sklearn/base.py	`93.68% <90.47%> (-0.47%)`	❌

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8beaf32...705416b. Read the comment docs.

jnothman · 2017-02-17T01:48:52Z

I can't say I understand why codecov thinks those lines are untested.

jnothman · 2017-02-19T13:46:01Z

I've tried to work out where this might be documented (wrt semi-supervised/transfer learning? calibration etc? pipeline inspection?) and haven't come up with a best home for it yet.

jnothman · 2017-02-19T13:46:15Z

But perhaps I should write the docs first...

amueller · 2017-07-25T19:01:58Z

sklearn/base.py

+    if copy:
+        estimator = deepcopy(estimator)
+    estimator.fit = _FrozenFit(estimator)
+    if hasattr(estimator, 'fit_transform'):


I would just remove fit_transform and fit_predict as well. Downstream users should be able to duck-type around using these.

Or did you consider them required API? The transformer and cluster base classes have those, but they are not really part of the API contract imho.

jnothman · 2017-07-26T00:13:56Z

Take a pipeline with a frozen transformer: either frozen_fit (or whatever we call it) needs to handle the fit_transform-while-frozen case, or pipeline does, since pipeline currently calls fit_transform when available. I don't know that we need to handle other fit_ prefixes, but if it's handled by the metaestimator, why not?

amueller · 2017-07-26T15:04:16Z

calls fit_transform when available

Yeah, so if you remove them, it'll work fine ;)

jnothman · 2017-07-26T22:54:20Z

I don't get it, but i think I've mostly made up my mind.

…

On 27 Jul 2017 1:04 am, "Andreas Mueller" ***@***.***> wrote: calls fit_transform when available Yeah, so if you remove them, it'll work fine ;) — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#8374 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAEz68ea8NiVDKnxyTqlm48Es2Da2ND1ks5sR1V0gaJpZM4MDGar> .

amueller · 2017-07-27T16:22:24Z

My proposal is to remove fit_transform and fit_predict from estimators if they are frozen.

amueller · 2017-07-27T16:29:50Z

Though I thought it was easier to remove a method from an object. It doesn't seem to be possible without hacking getattr so it's probably not worth it :-/

amueller · 2017-07-27T16:34:26Z

You made up your mind in which direction? Not catering to old meta-estimators?

amueller · 2017-07-27T16:36:18Z

Oh, didn't catch up with the other PR, you think this or similar is the right solution, got it.

amueller · 2017-07-27T16:42:56Z

sklearn/tests/test_base.py

+        assert_array_equal(est.scores_, frozen_est2.scores_)
+
+        # scores should be unaffected by new fit
+        assert_true(frozen_est2.fit() is frozen_est2)


This is always true, right? Well, I guess you're testing that fit can be called without arguments?

amueller · 2017-07-27T16:45:40Z

sklearn/base.py

+        estimator = deepcopy(estimator)
+    estimator.fit = _FrozenFit(estimator)
+    if hasattr(estimator, 'fit_transform'):
+        estimator.fit_transform = functools.partial(_frozen_fit_method,


This is because estimator.transform might not exist, and we want to provide an attribute error when fit_transform is called, and not when freeze is called, right?
Maybe write a comment about that or maybe rename it to make that more clear?

amueller · 2017-07-27T16:47:09Z

sklearn/base.py

@@ -523,3 +526,51 @@ def is_classifier(estimator):
 def is_regressor(estimator):
    """Returns True if the given estimator is (probably) a regressor."""
    return getattr(estimator, "_estimator_type", None) == "regressor"
+
+
+class _FrozenFit(object):


I feel the boolean flag is somewhat either to understand / check.

ENH add freeze method which stops an estimator being cloned/refit

efa9a86

jnothman changed the title ~~[WIP] ENH add freeze method which stops an estimator being cloned/refit~~ [WIP] ENH estimator freezing to stop it being cloned/refit Feb 16, 2017

ENH/DOC copy param and caveats

4d06fb0

jnothman added 4 commits February 17, 2017 09:01

FIX case where non-estimator in clone

ecefd05

TST/FIX copy param

bcd4eae

FIX Avoid naming conflicts

3398026

TST/FIX fit_transform in freeze

705416b

amueller mentioned this pull request Jun 6, 2017

Add prefit to VotingClassifier #7382

Open

jnothman mentioned this pull request Jun 8, 2017

API Freezing estimators #8370

Closed

jnothman mentioned this pull request Jul 18, 2017

ENH estimator freezing #9397

Closed

amueller reviewed Jul 25, 2017

View reviewed changes

amueller reviewed Jul 27, 2017

View reviewed changes

Merge branch 'master' into freeze2

347c8ef

jnothman mentioned this pull request Jul 30, 2017

[WIP] ENH estimator FreezeWrap to stop it being cloned/refit #9464

Closed

4 tasks

jnothman closed this Feb 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[WIP] ENH estimator freezing to stop it being cloned/refit #8374

[WIP] ENH estimator freezing to stop it being cloned/refit #8374

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[WIP] ENH estimator freezing to stop it being cloned/refit #8374

[WIP] ENH estimator freezing to stop it being cloned/refit #8374

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!