-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
Refactor tests for sample weights #11316
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I want to clarify I understand this correct. Those 4 tests you mentioned are already in |
What I am suggesting is that we should have the same testing code used for weighted metrics as for weighted model fitting, insofar as this is possible. |
Is this resolved? |
We have
|
In various parts of the code, we have tests for
sample_weight
support, including in metrics, and for individual estimators. we have some common estimator checks forclass_weight
, but not really forsample_weight
functionality (only for weight type invariance).Recent implementations of
sample_weight
include #10933 (KMeans) and #10803 (density estimation). But as well as estimators we have things like common tests for evaluation metrics.Invariance testing for sample weights should include:
sample_weight=np.ones(len(X))
makes the same model assample_weight=None
sample_weight=random
can make a different model tosample_weight=None
sample_weight=s
for integer arrays
makes the same model asX=np.repeat(X, s, axis=0), y=np.repeat(y, s, axis=0)
(although there may be exceptions to this depending on how the estimator defines iteration, convergence, etc., as in Test test_weighted_vs_repeated is somehow flaky #11236)sample_weight=s * k
for arrays
and positive constantk
makes the same model assample_weight=s
I wonder if it is possible to establish a generic test for this, e.g. something like:
The text was updated successfully, but these errors were encountered: