API Revamp estimator tags #29677

adrinjalali · 2024-08-15T07:53:51Z

Closes #22606
Closes #20804

This PR revamps estimator tags, puts them in dataclasses, and is based on #22606

High level changes from this PR:

(from #22606):

replace MRO mechanism with inheritance
remove _get_tags and _more_tags and introduce __sklearn_tags__

this PR:

dataclasses are introduced to store the tags, and they're scoped in a few dataclasses. This helps users with auto-complete as well.
stateless is removed and now we only use requires_fit. The two were redundant.
only_binary is now replaced with multi_class
multioutput_only is removed and now we have multi_output and single_output
get_tags, default_tags, and Tags are put into public API

github-actions · 2024-08-15T07:55:08Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 88af896. Link to the linter CI: here}

sklearn/utils/_tags.py

glemaitre

A bunch of changes to be consistent for the documentation style mainly.

doc/developers/develop.rst

doc/whats_new/v1.6.rst

sklearn/utils/_tags.py

sklearn/utils/estimator_checks.py

…into estimator-tags

adrinjalali · 2024-09-04T13:05:44Z

It's a green CI @glemaitre

glemaitre

So LGTM.

larsoner · 2024-09-06T14:11:23Z

I had a quick look at the docs and maybe I missed it, but as a consumer of sklearn that subclasses BaseEstimator I'm not sure how to adapt my code. Fore example, I currently have classes that have stuff like:

    def _more_tags(self):
        return {"no_validation": True}

How do I adapt these classes in a way that is backward compatible with previous sklearn versions? I can't just leave _more_tags in there to be picked up by older versions of sklearn because then I fail validation:

mne/decoding/tests/test_search_light.py:373: in test_sklearn_compliance
    check(est)
../virtualenvs/base/lib/python3.12/site-packages/sklearn/utils/estimator_checks.py:3893: in check_estimator_tags_renamed
    assert not hasattr(estimator_orig, "_more_tags"), (
E   AssertionError: ('_more_tags() was removed in 1.6. Please use __sklearn_tags__ instead.',)

But I can't remove it because then it won't be backward compatible.

One solution would be to add an opt-in to have the validator ignore this attribute being present, or maybe change it to ensure that if _more_tags is present then __sklearn_tags__ is also present.

Or is there a simpler way for me to adjust my code?

adrinjalali · 2024-09-06T14:48:37Z

@larsoner you can leave _more_tags there with an @available_if decorator, and the check would be the scikit-learn version.

Something like this:

import numpy as np
import sklearn
from packaging import version
from sklearn.base import BaseEstimator
from sklearn.utils.estimator_checks import parametrize_with_checks
from sklearn.utils.metaestimators import available_if

from sklearn.utils.validation import check_is_fitted, validate_data

def check_version(estimator):
    return version.parse(sklearn.__version__) < version.parse("1.6.dev")

class MyEstimator(BaseEstimator):
    @available_if(check_version)
    def _more_tags(self):
        return {"_skip_test": False}
    
    def fit(self, X, y=None):
        validate_data(self, X, y)
        return self
    
    def predict(self, X):
        check_is_fitted(self)
        validate_data(self, X, reset=False)
        return np.zeros(X.shape[0], dtype=int)
    
@parametrize_with_checks([MyEstimator()])
def test_my_estimator(estimator, check):
    check(estimator)

thomasjpfan and others added 19 commits February 24, 2022 16:18

ENH Uses __sklearn_tags__ for tags instead of mro walking

3ebf1c3

DOC Adds whats new

b06fd10

CI Fix assign/unassign CI

e68f7f0

CI Fix assign/unassign CI

a5e9560

Merge remote-tracking branch 'origin/main' into tags_redesign

bb277da

Update new estimators with __sklearn_tags__

3ae8506

Merge remote-tracking branch 'upstream/main' into tags_redesign

d088cac

Update to fix failing tests

dc59eef

STY Lint

b4deda2

Fixes failing tests

87c113b

Add dataclasses

797dddf

Merge remote-tracking branch 'upstream/main' into tags_redesign

3175a82

move to 1.6

c596d08

Merge remote-tracking branch 'upstream/main' into tags_redesign

f2ae973

simplify bagging __sklearn_tags__

004d5a6

RFE allow nan handling

4edd1da

test fixes

813a8c4

merged with tags_redesign

87e3f25

progress

d073f1a

adrinjalali added 2 commits August 15, 2024 10:02

remove old tags

bdac769

more fixes

dcf6051

glemaitre reviewed Aug 15, 2024

View reviewed changes

sklearn/utils/_tags.py Outdated Show resolved Hide resolved

adrinjalali added 7 commits August 15, 2024 13:57

tune more tests

d93155b

...

41ed204

Merge remote-tracking branch 'upstream/main' into estimator-tags

10519e8

a lot more estimators

177d763

...

8914bf0

...

8000

d823d13

...

220f215

adrinjalali added 3 commits September 2, 2024 16:57

preserves_dtype is not a list of str

9c504dd

add missing required change in _tags.py

99486db

Merge remote-tracking branch 'upstream/main' into estimator-tags

c0869ab

glemaitre self-requested a review September 3, 2024 14:44

glemaitre reviewed Sep 3, 2024

View reviewed changes

glemaitre and others added 5 commits September 4, 2024 11:47

Merge branch 'main' into estimator-tags

0b0865a

Most Guillaume's comments

6ac2b7c

Merge branch 'estimator-tags' of github.com:adrinjalali/scikit-learn …

1df3729

…into estimator-tags

6D40 Merge remote-tracking branch 'upstream/main' into estimator-tags

55a5623

remove dtype map

88af896

glemaitre approved these changes Sep 4, 2024

View reviewed changes

9E88

glemaitre merged commit e04142c into scikit-learn:main Sep 4, 2024
30 checks passed

adrinjalali deleted the estimator-tags branch September 5, 2024 08:38

Remi-Gau mentioned this pull request Sep 6, 2024

Update decoder tags for scikitlearn > 1.5.1 nilearn/nilearn#4533

Closed

larsoner mentioned this pull request Sep 6, 2024

Type incompatibility between sklearn's API and SlidingEstimator mne-tools/mne-python#12748

Closed

This was referenced Sep 7, 2024

TST improve error message on _more_tags and _get_tags deprecation #29801

Merged

RFC Move _more_tags to "developer API" via __sklearn_tags__ #28910

Closed

vnherdeiro mentioned this pull request Sep 11, 2024

[python-package] require scikit-learn>=0.24.2, make scikit-learn estimators compatible with scikit-learn>=1.6.0dev microsoft/LightGBM#6651

Merged

jameslamb mentioned this pull request Sep 15, 2024

[ci] [python-package] scikit-learn compatibility tests fail with scikit-learn 1.6.dev0 microsoft/LightGBM#6653

Closed

thomasjpfan mentioned this pull request Nov 28, 2024

Defines missing base.is_transformer API #30368

Closed

ogrisel mentioned this pull request Nov 29, 2024

API drop Tags.regressor_tags.multi_label #30373

Merged

This was referenced Dec 16, 2024

Upstream sklearn change causes error ('super' object has no attribute '__sklearn_tags__') in test suite AdaptiveMotorControlLab/CEBRA#204

Closed

Add support for new __sklearn_tags__ AdaptiveMotorControlLab/CEBRA#205

Merged

deepyaman mentioned this pull request Dec 19, 2024

feat(metrics): add a classification metrics module ibis-project/ibis-ml#176

Merged

This comment was marked as off-topic.

Sign in to view

felixriese mentioned this pull request Jan 3, 2025

Prevent sklearn of 1.6 and higher felixriese/susi#53

Merged

perib mentioned this pull request Apr 13, 2025

Update for sklearn 1.6 EpistasisLab/tpot#1371

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

API Revamp estimator tags #29677

API Revamp estimator tags #29677

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment was marked as off-topic.

Uh oh!

Uh oh!

Uh oh!

API Revamp estimator tags #29677

API Revamp estimator tags #29677

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

✔️ Linting Passed

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment was marked as off-topic.

Uh oh!

Uh oh!