[MRG] Better convergence warnings for lbfgs solver in LogisticRegression #11767

rth · 2018-08-07T09:38:27Z

When the lbfgs solver in LogisticRegression fails to converge, the resulting ConvergenceWarning is not very informative. This PR increases it's verbosity so we have a better estimation of how bad the convergence is.
This is particularly relevant if lbfgs is to become the default solver (#11476)

The tricky part is that, as far as I understood, scipy.optimize.fmin_l_bfgs_b only returns the evaluated gradient at the minimum, while the convergence criterion is the max |projected gradient|. Still IMO providing at least some information about the final gradient is better than nothing..

Example

from sklearn.linear_model import LogisticRegression
from sklearn.datasets import load_iris


iris = load_iris()

estimator = LogisticRegression(solver='lbfgs', multi_class='ovr', max_iter=20)
estimator.fit(iris.data, iris.target)

Output on master

sklearn/linear_model/logistic.py:723: ConvergenceWarning: lbfgs failed to converge. Increase the number of iterations.
  "of iterations.", ConvergenceWarning)
sklearn/linear_model/logistic.py:723: ConvergenceWarning: lbfgs failed to converge. Increase the number of iterations.
  "of iterations.", ConvergenceWarning)
sklearn/linear_model/logistic.py:723: ConvergenceWarning: lbfgs failed to converge. Increase the number of iterations.
  "of iterations.", ConvergenceWarning)

Output with this PR

sklearn/linear_model/logistic.py:734: ConvergenceWarning: lbfgs failed to converge with max_iter=20. max(|grad|) = 1.610e+00 while pgtol=1.000e-04 (see scipy.optimize.fmin_l_bfgs_b documentation for more information). Increase the number of iterations.
  ConvergenceWarning)
sklearn/linear_model/logistic.py:734: ConvergenceWarning: lbfgs failed to converge with max_iter=20. max(|grad|) = 4.764e+00 while pgtol=1.000e-04 (see scipy.optimize.fmin_l_bfgs_b documentation for more information). Increase the number of iterations.
  ConvergenceWarning)
sklearn/linear_model/logistic.py:734: ConvergenceWarning: lbfgs failed to converge with max_iter=20. max(|grad|) = 4.413e-01 while pgtol=1.000e-04 (see scipy.optimize.fmin_l_bfgs_b documentation for more information). Increase the number of iterations.

LogisticRegression

rth · 2018-08-07T12:18:41Z

(Circle Ci fails due to unrelated mldata download issues)

jnothman · 2018-08-07T23:10:42Z

I'm curious. As a user, how would this new detail inform your behaviour?

rth · 2018-08-13T21:00:12Z

It's probably not very important, but say in a long running system with some log files, the case of the almost converged training (a gradient of a few 1e-3) in some cases, one might not bother retraining. For a moderately low gradient one could indeed increase the number of iterations. If the final gradient is > few 1.0-10.0 even after a significant number of iterations, it may suggest that something is wrong about the pipeline.

I guess generally, I was just generally itchy about an optimization problem that returned "convergence failed, try again" without outputting any useful information to understand what's happening. But maybe the problems considered here are sufficiently nicely convex that there is not reason to worry about it.

Still, something like this can help spot potential issues (related to #11536).

rth · 2018-09-26T21:09:38Z

There doesn't seem too much enthusiasm about this PR. There are larger issues to address anyway. If someone is interested please comment or re-open.

Increase verbosity of convergence warnings for lbfgs solver in

bfd8ad1

LogisticRegression

rth closed this Sep 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] Better convergence warnings for lbfgs solver in LogisticRegression #11767

[MRG] Better convergence warnings for lbfgs solver in LogisticRegression #11767

[MRG] Better convergence warnings for lbfgs solver in LogisticRegression #11767

[MRG] Better convergence warnings for lbfgs solver in LogisticRegression #11767

Conversation