Change the grid search order to optimize caching in the new Pipeline · Issue #8830 · scikit-learn/scikit-learn · GitHub
Change the grid search order to optimize caching in the new Pipeline #8830
Closed
@knarf-hackatal2016

Description

Version: current master branch (19.)

GridSearchCV fits all the classifiers on the first split, then all the classifiers on the second split, etc.

This is suboptimal for caching in the pipeline: when the grid search runs with several jobs, most intermediate transformers are fitted on the same input at the same time, before any of those fits has been cached.

The order should be: fit the first classifier on all splits, then the second classifier on all splits, etc.
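To make the difference concrete, here is a minimal sketch in plain Python. It is not the GridSearchCV implementation. It assumes a toy model in which tasks run in synchronous batches of `n_jobs`, only the final classifier's parameter varies, and a transformer fit is reusable once any earlier batch has fitted it on the same split. The names `split_major`, `candidate_major`, and `transformer_fits` are hypothetical and exist only for this illustration.

```python
from itertools import product

# Toy stand-ins: three CV splits and three classifier settings.
splits = ["split0", "split1", "split2"]
candidates = [{"clf__C": 0.1}, {"clf__C": 1.0}, {"clf__C": 10.0}]


def split_major(candidates, splits):
    """Order described as current: all candidates on split 0, then split 1, ..."""
    return [(s, c) for s in splits for c in candidates]


def candidate_major(candidates, splits):
    """Order proposed: first candidate on all splits, then the second, ..."""
    return [(s, c) for c, s in product(candidates, splits)]


def transformer_fits(order, n_jobs):
    """Count transformer fits under the simplified batch model.

    Tasks in the same batch start together, so none of them can reuse
    a fit produced by another task of that batch. The cache key is the
    split alone, because the transformer parameters are assumed fixed.
    """
    cache = set()
    fits = 0
    for start in range(0, len(order), n_jobs):
        batch = order[start:start + n_jobs]
        for split, _params in batch:
            if split not in cache:
                fits += 1  # cache miss: the transformer is refitted
        # Results become available only after the whole batch finishes.
        cache.update(split for split, _params in batch)
    return fits


fits_current = transformer_fits(split_major(candidates, splits), n_jobs=3)
fits_proposed = transformer_fits(candidate_major(candidates, splits), n_jobs=3)
print(fits_current, fits_proposed)
```

Under these assumptions, the split-major order refits the transformer for every task in a batch, because all concurrent tasks share one split, while the candidate-major order spreads each batch across distinct splits so later batches hit the cache. With a single job the two orders perform the same number of transformer fits in this model.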
