-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
[MRG + 3] Store values for all nodes instead of just leaves for decision trees #4655
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Can you try on, say MNIST, covertype and / or newsgroups? There should be scripts in the |
MNIST data (best for each). Master
PR
Both actually vary +-1 sec on my machine |
That is nice :) +1 cc @arjoly |
LGTM (but having @arjoly) sign off would obv be nice ;) |
Here some results with ensemble. This looks good. +1
|
node_id * tree.value_stride) | ||
# Store value for all nodes, to facilitate tree/model inspection and interpretation | ||
splitter.node_value(tree.value + | ||
node_id * tree.value_stride) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you can use a single line and still fit in less than 80 columns now that the ident level was reduced by 4 spaces:
splitter.node_value(tree.value + node_id * tree.value_stride)
Let me do the cosmetic fix and merge this. |
Merged as 3594925 |
Store values for all nodes instead of just leaves, as discussed in #4644 and required in #2937.
Not surprisingly, the impact on performance is minimal.
Benchmark:
Master:
PR: