WIP simplify naive Bayes parametrization by amueller · Pull Request #1525 · scikit-learn/scikit-learn · GitHub

WIP simplify naive Bayes parametrization #1525


Closed

Conversation

@amueller (Member) commented Jan 6, 2013

After moving class_prior to class_weight as an __init__ parameter in #1499, the parametrization was a bit weird. I now got rid of fit_prior and let class_weight be 'auto', None (= uniform), or an array. This is a bit more consistent with the other estimators.

I'll try to make it pass the common tests now and I guess I'll also allow dictionaries.

@amueller (Member, Author) commented Jan 6, 2013

Does it make sense to support dictionaries as class_weight? What would the value for the left-out classes be? Hmm...
ping @larsmans.

This is fixing #1511 btw.

@amueller (Member, Author) commented Jan 6, 2013

Making the common tests pass is a bit trickier. I think this is a step forward in the interface, though.

@larsmans (Member) commented Jan 6, 2013

Actually, I'm not so sure things have gotten consistent now; class_weight="auto" is more like fit_intercept=True in the linear models. Whether a dict with missing values makes sense depends on what meaning we want to assign to class_weight: is it the prior (exp(intercept_)), or is the prior derived taking class_weight into account?

I think I'd favor the former for simplicity; Naive Bayes is a typical beginner's tool, so it should be simple. The latter would provide better API compatibility with other estimators, but NB's interpretation of alpha is already quite different as well.

@amueller (Member, Author) commented Jan 6, 2013

I see your point about class_weight='auto' and fit_intercept=True. But adjusting the prior is also the same as duplicating samples in a class, right?

I don't know what you mean by "the prior derived taking class_weight into account". In the current / previous implementation, class_prior (now class_weight) simply specifies the prior, i.e. it overrides fit_prior, if I understand correctly.

That these are mutually exclusive is one of the reasons I think having a single parameter is more natural.

One could also imagine class_weight reweighting the prior estimated from the data, but that is not implemented, right?
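
To make the "duplicating samples" point above concrete, here is a small check. It uses today's class_prior parameter rather than the class_weight name discussed in this PR, and a tiny alpha so that smoothing doesn't interfere: duplicating the samples of one class leaves the per-class feature proportions unchanged and only scales the class counts, so it has the same effect as setting the corresponding prior explicitly.

```python
import numpy as np
from sklearn.naive_bayes import MultinomialNB

X = np.array([[2, 1], [0, 3], [1, 1], [3, 0]])
y = np.array([0, 0, 1, 1])

# Duplicate the class-1 samples: class counts become [2, 4].
X_dup = np.vstack([X, X[y == 1]])
y_dup = np.hstack([y, y[y == 1]])

# Fitting on the duplicated data (empirical prior [1/3, 2/3]) ...
nb_dup = MultinomialNB(alpha=1e-10).fit(X_dup, y_dup)
# ... matches fitting on the original data with that prior set explicitly.
nb_prior = MultinomialNB(alpha=1e-10, class_prior=[1 / 3, 2 / 3]).fit(X, y)

print(np.allclose(nb_dup.predict_proba(X), nb_prior.predict_proba(X)))  # True
```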

@larsmans (Member) commented Jan 6, 2013

In the current / previous implementation, class_prior (now class_weight) simply specifies the prior, i.e. it overrides fit_prior, if I understand correctly.

Yes, but we could decouple them so that unrepresented classes get a weight of 1 and other classes can be up-/downweighted arbitrarily. We'd just have to fill in the ones and normalize, which seems to be what you're implying as well.
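
A minimal numpy sketch of the "fill in the ones and normalize" idea (the helper name is mine, purely for illustration; it is not code from this PR):

```python
import numpy as np

def class_weight_to_prior(classes, class_weight):
    # Classes missing from the dict implicitly get weight 1,
    # then everything is normalized into a proper prior.
    weights = np.array([class_weight.get(c, 1.0) for c in classes], dtype=float)
    return weights / weights.sum()

print(class_weight_to_prior([0, 1, 2], {0: 2.0}))  # [0.5, 0.25, 0.25]
```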

@amueller (Member, Author) commented Jan 6, 2013

OK, but then the meaning of the parameter would change a lot from the previous class_prior.

@amueller (Member, Author) commented Jan 6, 2013

OK, so you would keep fit_prior and then apply a multiplicative (?) modification, based on the class_weight parameter, to the prior that was found?

@larsmans (Member) commented Jan 6, 2013

No, because I agree the API is too clumsy (and the code is more complicated than an NB estimator needs to be). Let me formulate a proposal:

  • When class_weight="auto", the empirical prior is used. This should be the default.
  • When class_weight is a dict or array-like, we normalize it so it becomes a custom prior. Missing values are implicitly 1, so when users compute their own prior, they should provide the value for all classes. [If we didn't allow an unnormalized class_weight, we'd have to validate it, which is just as much work for us but less convenient for the user.]
  • When class_weight=None, a uniform prior is used.

This way, custom priors can still be given explicitly, but the parameter becomes much more flexible.

Does that sound like something we could almost put in a docstring, aside from the [bracketed rationale]?
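
For concreteness, a rough sketch of what the three cases above could look like; the function name and signature are hypothetical, not the PR's actual code:

```python
import numpy as np

def compute_class_prior(y, classes, class_weight="auto"):
    classes = np.asarray(classes)
    if class_weight is None:
        # uniform prior over all classes
        return np.full(len(classes), 1.0 / len(classes))
    if isinstance(class_weight, str) and class_weight == "auto":
        # empirical prior: relative class frequencies observed in y
        counts = np.array([(y == c).sum() for c in classes], dtype=float)
        return counts / counts.sum()
    if isinstance(class_weight, dict):
        # classes missing from the dict implicitly get weight 1
        weights = np.array([class_weight.get(c, 1.0) for c in classes], dtype=float)
    else:
        weights = np.asarray(class_weight, dtype=float)
    # normalize a custom (possibly unnormalized) prior
    return weights / weights.sum()

y = np.array([0, 0, 0, 1])
print(compute_class_prior(y, [0, 1]))            # [0.75, 0.25]
print(compute_class_prior(y, [0, 1], None))      # [0.5, 0.5]
print(compute_class_prior(y, [0, 1], {1: 3.0}))  # [0.25, 0.75]
```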

@amueller (Member, Author) commented Jan 6, 2013

It sounds a lot like my current docstring, doesn't it ;)

It sounds like a good solution to me. I must admit I haven't really thought about the algorithm, just shuffled the bits around. It actually sounds kinda simple the way you put it.

Do you want to give it a shot or should I try?

@amueller (Member, Author) commented Jan 6, 2013

OK, it sounds more like I should have written that docstring, actually ;) but it is what I meant, I just put it less elegantly.

@amueller (Member, Author) commented Jan 6, 2013

We can actually just reuse the function from sklearn.utils.class_weights, right?
Anything else we need to do?

@larsmans (Member) commented Jan 6, 2013

Looks good!

@amueller (Member, Author) commented Jan 7, 2013

Ok, I'll add some more tests for dictionaries and also for using lists in SVM and SGD.

@amueller (Member, Author) commented

OK, so maybe this doesn't make much sense after all:
In NB, fit_prior (surprise, surprise) fits a prior to the classes, so frequent classes will be more likely to be predicted. In the linear models, class_weight='auto' downweights examples from the overrepresented classes.

So I just unified two functions that do opposite things...

I could still use the same function but take the inverse in the 'auto' case, but then we should definitely document that quite explicitly!
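
A tiny numpy illustration of the clash described above (the exact scaling used by the linear models' 'auto' heuristic is glossed over here; the point is only that the two conventions pull in opposite directions):

```python
import numpy as np

y = np.array([0, 0, 0, 0, 1])            # class 0 is heavily overrepresented
counts = np.bincount(y).astype(float)    # [4., 1.]

# NB's fit_prior: empirical prior, so the frequent class gets *more* weight
empirical_prior = counts / counts.sum()  # [0.8, 0.2]

# class_weight='auto' in the linear models: roughly inverse frequencies,
# so the frequent class gets *less* weight
inverse_freq = (1.0 / counts) / (1.0 / counts).sum()  # [0.2, 0.8]

print(empirical_prior, inverse_freq)
```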

@amueller (Member, Author) commented

Hm, reusing the code is not so trivial given the different normalizations.
Also, we should really think about whether we want class_weight='auto' to mean two completely opposite things :-/

Any ideas how to handle this better? Just leave the current parametrization?

@amueller (Member, Author) commented

closing this as it sucks.

@amueller closed this Feb 20, 2013