-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
ENH added RollingWindowCV to sklearn.model_selection #24589
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
fb7e8e8
to
00661e4
Compare
e0d7ebf
to
149da77
Compare
3aee974
to
42486fb
Compare
a960526
to
cf6047e
Compare
Given @glemaitre's comment here, I am not sure more time series cross validators (TSCVs) will be added in the short-term in scikit-learn, however this remains a simpler TSCV than CPCV and it could be great to see it in scikit-learn already. Maybe should RollingWindowCV be renamed RollingTimeSeriesSplit (and thus |
If merging is decided against in favor of having fewer TSCVs as @glemaitre has stated at the link I would be a bit bummed, but I have also discussed the steps required for running the class as a standalone module here (below the image) for those who prefer the appeal of simply passing the number of splits and the training ratio of each window. Though if we are in favor of merging after a few naming or other changes I would be happy to cooperate. |
As stated in the original issue, I would prefer to have I remembered to have started to review #23780. I did not get the chance to go back to it but from my guess, we only needed a couple of unit tests to ensure that we get the expected behaviour. I will try to give a review and finish up this PR. |
closing in favor of #23780 |
Reference Issues/PRs
Addresses #22523, similar to #23780.
Addresses #24243.
Addresses homogeneous variant of #6322.
What does this implement/fix? Explain your changes.
Implementation of rolling window cross validation with support for longitudinal time series data.
Any other comments?
Wrote this about 2 years ago and finally got around to making a PR.