-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
Name of test-fold results in GridSearchCV.results_ #7205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Well, the question is: how important is it? Is it worth breaking backward
compatibility? For me, renaming is worth it only if a) the name is
terribly confusing and users are getting things wrong, or b) the name is
not consistent across the package, and hence code is not interchangeable.
|
It's inside the
|
Yeah it's not released so we can still change it. |
It's not only inside model_selection, it's part of a new GridSearch results format where the previous grid_scores_ was deprecated. It's certainly up for question and perhaps. I think it'll make sense to have similar output from cross_val_score at some point soon (for multiple metric support and to allow it to return times and training scores). Whether or not this is "development/validation test score" or "final test score" is a matter of context. It's also a matter of difference across academic fields, I suspect. I think test_ is preferable for brevity, but I'm ambivalent, appreciating that we don't want people to misuse validation. We could make it cv_. In terms of the ordering of components (scope/aggregate, train vs test, metric), I think these are the factors to take into consideration:
On the basis of all of these things, I'd be okay seeing the scope/aggregate ("mean", "split0", ...) come first, as you suggest @amueller, and frankly can't recall (and the thread was too long!) why we did not land up choosing that (except perhaps impatience). |
@jnothman so you vote changing the word order, and +0 for keeping test? Maybe @agramfort has a preference? (I don't want to bug @GaelVaroquaux too much with naming things ;) [TLDR this is a new name, we can pick whatever we want]. |
I'm not sure I'm a fan of "cv". Maybe I have to sleep over it. |
I suppose that's my vote. I'm only +1/2 on changing word order! |
I would prefer scores_ to results_ (it's output of our scorers). I am +0 on adding a word eg test_scores_ |
If you are talking about the |
It's not just scores that go in there. There are also standard deviations On 27 August 2016 at 19:55, Raghav RV notifications@github.com wrote:
|
I'll proceed? It would be painful to deprecate the dict keys... Let's decide and do it before 0.18... |
ok so we go ahead with |
Sure, let's do it. On 1 September 2016 at 04:02, Raghav RV notifications@github.com wrote:
|
I forgot if we discussed this before, but I'm slightly uncertain about the naming in
results_
.Before, we used
mean_validation_score
, now it'stest_mean_score
. I'm wondering if it should betest
orvalidation
. I would call this the mean cross-validation score (I guess I named the previous named tuple).After all, it's called cross-validation, not cross-testing ;)
ping @jnothman @raghavrv @GaelVaroquaux
I don't feel very strongly but I think it deserves some consideration (I just saw me using it in the book and it felt strange).
The text was updated successfully, but these errors were encountered: