-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
[RFC] Changes to model_selection? #5053
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
(I'm checking the entries as soon as there is an issue/PR for the same) MAJOR
MODERATE
MINOR
RFC Moved from #5569 to here
|
Defaulting
This was postponed until we have a solution for multiple metrics and/or training scores, which may mandate changing the format, too: namedtuples are user-friendly but inflexible to versioning. Having a clearer delineation of "labels" and "groups" would certainly be nice... |
In new style CV splitters, the constructor no longer differs between stratified and unstratified splitters. Should |
True, changing |
+1 |
A minor change that could be included directly in #4294: drop |
Half kidding: Is now also the time to rename |
is there a confusion about that? I wasn't aware. |
Well, the confusion is only when discussing the models, rather than being an inherent API problem. It's irrelevant; it's a silly thing to change, certainly as long as we call function/constructor arguments "parameters". |
@amueller @vene @jnothman sorry for going back and forth (- wrt to a discussion at gitter) on this, but how does grouping i.e Also if this sounds okay... we should think about how So after that is done what would Or would it be better to simply keep it as such as a separate class as it is right now? |
I think a separate class. It's not just an option. On 13 September 2015 at 14:35, Raghav R V notifications@github.com wrote:
|
I also think a separate class would be good, if nothing else for visibility. |
Okay! Thanks! |
Also could we rename |
or |
Personally, I much prefer |
I think leave |
Pun intended? ;) |
I'm missing the pun. And I make dad jokes all the time. On 13 April 2016 at 22:59, Raghav R V notifications@github.com wrote:
|
Nevermind ;) (I thought |
BTW on a similar note should we also rename |
Also could we rename p to p_samples?
n_samples? I don't understand p_samples? Or maybe n_features?
|
But we shouldn't be renaming things just for cosmetics (consistency is a |
I think the motivation for renaming |
That's my view as well. Groups is much better; though there could be confusion with, say, GroupLasso. Such breakage is annoying on two accounts. (a) user code will need to change: but this will happen anyway because of the move to model_selection. (b) books becoming out of date, as @GaelVaroquaux points out. Now, my first impulse is to say that (b) is also moot because of Still, I think that if you at least know what your intention is, and understand the word "Group", the latter can be easily identified as relevant when tabbing through the imports. (In light of this, I would have preferred |
Re-opening as #6660 only fixed the part on renaming labels to groups. |
I don't think anything remains here that needs to be changed before the release of |
+1 I'll do that tomorrow! |
I think this issue can be closed. @raghavrv are there any urgent things you still see? |
Few more tests that I was planning to raise as a PR soon... Otherwise this can be closed. |
F438
@amueller Do we want to remove It seems to add one more line and some code complexity. I don't feel it's worth it. Reviews on the commit and a quick comment on whether we need that as a PR? |
I think I prefer the version with the auxiliary function, it's much more readable to me at least. |
Yes. Thanks for the comment! |
why is groups passed and not used? |
I am sorry... I don't get you. |
sorry lack of coffee |
I think it can be closed now... The other comments are either addressed / have PRs for them. @amueller |
Once #4294 is merged, we might want to think about changing some of the api / default behaviors of the contents of this module.
There are several breaking changes that I'd like to make, and this might or might not be a good opportunity:
n_folds
to 5, as it is already in some of the cv objects (3 seems really low).grid_scores_
to be consistent withRandomizedSerachCV
_CVScoreTuple
[needs to be done for multiple metrics I think]labels
togroups
(?) in the cv iterators.p
parameter inLeavePOut
was there anything else? ping @rvraghav93
The text was updated successfully, but these errors were encountered: