8000 Is it possible to reduce StandardScaler.fit() memory consumption? · Issue #5651 · scikit-learn/scikit-learn · GitHub
[go: up one dir, main page]

Skip to content
Is it possible to reduce StandardScaler.fit() memory consumption? #5651
Closed
@enoonIT

Description

@enoonIT

The issue:
When applying StandardScaler to a big matrix, the memory requirements are expensive.

Example:

big = np.random.random([495982, 4098]) //this is around 8GB
scaler = StandardScaler()
scaler.fit(big) //this will require nearly another 16GB of RAM

I guess it uses some lookup tables to speed the standard deviation computations, but double the required RAM might be too much in some cases. A flag to enable a, slower, but less memory intensive version, would be nice.
Is there any solutions to reduce memory consumption?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0