Request more criterion for random forest regression · Issue #5368 · scikit-learn/scikit-learn · GitHub
Closed

boscotsang opened this issue Oct 8, 2015 · 16 comments
Labels
Enhancement, Moderate (anything that requires some knowledge of conventions and best practices)

Comments

@boscotsang

The current random forest regressor only supports 'mse'. Could more criteria, such as mean square of percentage error, be supported by scikit-learn?
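For reference, here is a plain-Python sketch of the requested metric under one common reading of "mean square of percentage error" (the exact definition intended by the request is an assumption here):

```python
def mean_squared_percentage_error(y_true, y_pred):
    """Mean of squared relative errors: mean(((y - yhat) / y) ** 2).

    Only defined when no true target is zero.
    """
    return sum(((t - p) / t) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Each prediction is off by 50% of the true value, so the result is 0.25.
mean_squared_percentage_error([2.0, 4.0], [1.0, 2.0])  # -> 0.25
```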

@amueller
Member
amueller commented Oct 9, 2015

Is that the same as mean squared relative error? Do you have a reference?

@jmschrei
Member

New criteria can be supported fairly easily. They need to be added to sklearn/tree/_criterion.pyx, and then the appropriate dictionaries updated.
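To illustrate what such a criterion computes (this is a conceptual pure-Python sketch, not the actual Cython Criterion API in _criterion.pyx): a regression criterion assigns an impurity to each node's targets, and splits are scored by the weighted impurity decrease.

```python
def mse_impurity(y):
    # Variance of the node's targets: the quantity the built-in "mse"
    # criterion measures for a node.
    mean = sum(y) / len(y)
    return sum((v - mean) ** 2 for v in y) / len(y)


def weighted_impurity_decrease(y_left, y_right):
    # What a candidate split is scored on: parent impurity minus the
    # size-weighted impurity of the two children.
    n = len(y_left) + len(y_right)
    parent = mse_impurity(y_left + y_right)
    children = (len(y_left) * mse_impurity(y_left)
                + len(y_right) * mse_impurity(y_right)) / n
    return parent - children
```

A split separating [1, 1] from [5, 5] removes all of the parent variance (4.0), so its improvement is 4.0.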

@boscotsang
Author

@amueller
Member

Not sure that is the most common name for that (also the formula is not really for percentages).
I think we could add that, it seems interesting. See what @jmschrei said ;)

@glouppe
Contributor
glouppe commented Oct 19, 2015

New criteria can be supported fairly easily.

It is easy when you have understood the codebase, but it is true that people may not know where to look when they first arrive. Maybe we could make it possible to pass a Criterion object directly, as we do for Splitter? Then one would not have to hack around.

@glouppe added the Enhancement and Moderate labels Oct 19, 2015
@jmschrei
Member

This is true. What do you mean pass Criterion object directly?

@Sandy4321

So when will it be done? Let's do it, please!


@jmschrei
Member
jmschrei commented Nov 1, 2015

I think that is kind of the wrong attitude to have. If there's a particular feature you'd like, you should attempt to submit a PR incorporating it, or be very specific about which criterion you'd like and maybe someone will take it up for you.

@betatim
Member
betatim commented Jan 12, 2016

If there is a concrete need/idea for a new criterion I'd be interested in doing the coding work to implement it (for my education about how the decision tree internals work).

@glouppe
Contributor
glouppe commented Jan 12, 2016

There was some attempt to implement MAE at #6039. I think this criterion is really missing at the moment and could be used on many occasions.

@fjanoos
fjanoos commented Jan 26, 2016

As a follow-up, how much effort would it be to implement the set of criteria currently available in GradientBoostingRegressor, namely the lad and huber losses?
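For context, GradientBoostingRegressor already exposes these as losses of the boosting objective (as opposed to tree split criteria). A minimal usage sketch with random data; note that the 'lad' spelling used at the time of this comment was later renamed 'absolute_error' in scikit-learn 1.0:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.RandomState(0)
X = rng.rand(80, 3)
y = rng.rand(80)

# Huber loss blends squared error near the fit with absolute error in the
# tails; the transition point is set via the `alpha` quantile parameter.
gbr = GradientBoostingRegressor(loss="huber", alpha=0.9, n_estimators=20,
                                random_state=0)
gbr.fit(X, y)
pred = gbr.predict(X)
```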

@fjanoos
fjanoos commented Feb 3, 2016

Hi - I've written up a Cython extension for a LAD (L1 norm) criterion that plugs into the tree-based classifiers. Would this be of wider interest - to include back into sklearn?

@raghavrv
Member
raghavrv commented Feb 3, 2016

Could you make a PR? (An attempt at LMAD was also made at #6039; you could refer to that.)

@fjanoos
fjanoos commented Feb 3, 2016

The implementation there is for MAD - namely the L1 norm about the mean of each class. Mine is a LAD implementation that uses medians.
I'll create a PR.

@raghavrv
Member
raghavrv commented Feb 3, 2016

Ah okay! thanks for the clarification :) I'm pretty much new to all this ;)

@lorentzenchr
Member

Meanwhile we have MAE #6667 and the Poisson deviance #17386. More concrete proposals could be opened separately. Therefore closing this issue.
