8000 Extend `FeatureUnion` to better handle heterogeneous data · Issue #2034 · scikit-learn/scikit-learn · GitHub
[go: up one dir, main page]

Skip to content
Extend FeatureUnion to better handle heterogeneous data #2034
Closed
@jnothman

Description

@jnothman

FeatureUnion currently passes identical data to each constituent transformer. Often one wants to differentiate between groups of features in how they are transformed. While this is possible by making each stacked transformer a Pipeline consisting of a pre-determined feature selector and another transformer, this is cumbersome.

A parameter should be added to specify which features are routed to which constituents. This is not necessarily trivial to design, particularly because the input X to FeatureUnion.transform need not be a conventional 2d feature array (it may be a list/array of dicts, texts, or other objects).

Metadata

Metadata

Assignees

No one assigned

    Labels

    ModerateAnything that requires some knowledge of conventions and best practicesNew Feature

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0