-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
FIX/ENH?: PCA and RandomizedPCA now have inconsistent meanings of .components_
#3930
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
From a consistency standpoint, definitely. https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/decomposition/pca.py#L599 makes the components have non-unit length, which is not the case in the current Now I am wondering if there are other places where this needs to be changed. |
(While we are at it: The output of the SVD should rather be |
I don't think it's worth breaking the backward compat for this, as long as the shape of the returned arrays is properly documented. |
About @dengemann's report: does this mean that the I am not sure what was the motivation of 3f6c61f anymore and I am not sure it was worth breaking the backward compat (if it was not required to solve a bug). @larsmans you might want to have look at this. |
Alright I remember now that I re-read the discussion at #3107. I agree that this is more intuitive that |
Is this still open? I am happy to take a look at this since I am working of some of those classes. |
Yes, it is still open and needs a fix. |
when I changed the behaviour of PCA, fixing what clearly was a bug, there On Fri, Sep 11, 2015 at 5:46 PM, Andreas Mueller notifications@github.com
|
Actually, if we add the |
It depends if the new PCA will make it to next release. Should I focus on that then? |
I should have fixed this in #5299. Could anyone confirm this is all right now? |
Changes are now in master. @dengemann @eickenberg can anyone confirm we can close this issue? |
Great, will check it out. |
I confirm that we now store the unscaled components.
|
@eickenberg introduced a new API / meaning of the components attribute to PCA: 3f6c61f
The RandomizedPCA code still stores whitened components instead of unit scale components. This should be changed, right?
cc @agramfort @ogrisel
The text was updated successfully, but these errors were encountered: