mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods #20067

iki77 · 2021-05-09T01:53:21Z

Describe the workflow you want to enable

Current score_func for feature selection methods does not consider multicollinearity between features.

Describe your proposed solution

Introduce mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods.

Variant of mRMR scores in a nutshell:

MID: Mutual Information to target - Mutual Information between features
MIQ: Mutual Information to target / Mutual Information between features
FCD: F Statistic to target - Correlation between features
FCQ: F Statistic to target / Correlation between features

From what I understand Mutual Information and F Statistic already implemented as score_func in scikit-learn, so these mRmR scores are somewhat an extension of it.

glemaitre · 2021-05-09T09:47:31Z

duplicate of #8889

iki77 added the New Feature label May 9, 2021

glemaitre closed this as completed May 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods #20067

mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods #20067

Uh oh!

Uh oh!

mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods #20067

mRMR (Minimum Redundancy and Maximum Relevance) score as score_func for feature selection methods #20067

Comments

Uh oh!

Describe the workflow you want to enable

Describe your proposed solution

Uh oh!