-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
FIX Set TSNE's internal PCA to always use numpy as output #25370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
The internal PCA should always output a numpy array as it is results are only used internally.
311f08d
to
5f2cb81
Compare
Also, I would have expected #24932 to have introduced the common test if we are using t-SNE with the default parameter since we don't change the |
I took a deeper look at the code and forcing the output of
|
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In this particular case, this solution makes sense, but we have many places where we create a clone of a given sub-estimator and expose the fitted version as a part of the public API.
What should be the behavior of those estimators?
cc @scikit-learn/core-devs
In this case, I would expect to have the estimator follow the global config and therefore we should make sure that the inner code works as much as possible whatever the data container provided. |
Do you have specific transformer in mind where this is the case? Is see the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
I guess we'll see. I'm thinking third party estimators which fit estimators internally. Let's see. |
…rn#25370) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Tom Dupré la Tour <tom.duprelatour.10@gmail.com>
…rn#25370) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Tom Dupré la Tour <tom.duprelatour.10@gmail.com>
…rn#25370) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Tom Dupré la Tour <tom.duprelatour.10@gmail.com>
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Tom Dupré la Tour <tom.duprelatour.10@gmail.com>
The internal PCA should always output a numpy array as it is results are only used internally.
Reference Issues/PRs
Fixes #25365
This fix should be backported to the v1.2.x branch.