8000 TST / DEBUG: better warnings and tests when facing singular hessian problems by ogrisel · Pull Request #6 · lorentzenchr/scikit-learn · GitHub

Conversation

ogrisel (Collaborator) commented Jun 1, 2022:

Here is an attempt (failed or successful depending on the viewpoint) at improving the tests for the fallback inner solver for the singular Hessian case in the context of the "newton-cholesky" solver introduced in scikit-learn#23314.

I think in its current state it's pretty useless:

  • with either lstsq or indefinite_factorization, the model never converges: all line searches fail, the deviance stays very high, and we issue a ton of useless warnings.
  • lstsq seems to be a bit faster, but that hardly matters since neither method helps the model converge.

Possible things to try:

  • a) do not attempt a line search on the singular-Hessian solution; keep warning, but take a simple gradient step with a small learning rate instead. It is not clear how small the learning rate should be, though. Or maybe we could run the line search along the raw gradient direction?
  • b) when facing a singular Hessian, just stop the solver with a helpful convergence warning that suggests removing collinear features or using stronger regularization
  • c) alternatively, we could warn and automatically refit the linear model with stronger regularization (without ever trying to solve the first encountered singular-Hessian problem)
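Option a) could be sketched roughly as follows. This is a hypothetical illustration, not the actual scikit-learn code: a backtracking line search along the negative gradient, halving the step until the loss decreases.

```python
import numpy as np

def gradient_fallback_step(coef, grad, loss, max_halvings=21):
    """Backtracking step along -grad (hypothetical sketch).

    Intended for when the Newton direction is unusable because the
    Hessian is singular: halve the step size until the loss decreases.
    """
    direction = -grad
    t = 1.0
    f0 = loss(coef)
    for _ in range(max_halvings):
        candidate = coef + t * direction
        if loss(candidate) < f0:
            return candidate
        t *= 0.5
    # No decrease found: the caller could warn and stop,
    # as in options b) / c).
    return coef
```

The open question from option a) remains visible here: the initial step size `t = 1.0` is an arbitrary choice, and nothing guarantees it is at the right scale for the problem.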

Note that the suggestion to slightly increase the regularization works, as shown in the last section of the test.
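Why a small regularization bump helps can be seen on the Hessian itself; here is a minimal numpy illustration (the singular matrix is made up for the example):

```python
import numpy as np

# Perfectly collinear features produce a singular Hessian (rank 1 here).
H = np.array([[1.0, 1.0],
              [1.0, 1.0]])

# Cholesky fails on the singular matrix...
try:
    np.linalg.cholesky(H)
except np.linalg.LinAlgError:
    pass  # expected: H is not positive definite

# ...but even a small L2 penalty alpha shifts the whole spectrum
# away from zero and makes the factorization well defined again.
alpha = 1e-2
L = np.linalg.cholesky(H + alpha * np.eye(2))
```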

- f"Line search of Newton solver {self.__class__.__name__} did not "
- "converge after 21 line search refinement iterations.",
+ f"Line search of Newton solver {self.__class__.__name__} at iteration"
+ f" #{self.iteration} did not converge after 21 line search refinement"
ogrisel (Collaborator, Author) commented:

I think it can be helpful to include the iteration number in the warning, but I am not sure whether we should use the one-based convention "#{self.iteration}" or the zero-based convention used elsewhere in the code, i.e. #{self.iteration - 1}.

@@ -31,37 +31,47 @@
from .._linear_loss import LinearModelLoss


- def _solve_singular(H, g):
+ def _solve_singular(H, g, method="lstsq"):
ogrisel (Collaborator, Author) commented Jun 1, 2022:

I decided to privately expose the method parameter to quickly switch between the two approaches, to make it easier to investigate the singular-Hessian problem. Since neither seems to be helpful, I do not know which one to keep. If we can actually make use of the indefinite_factorization variant, I think we should unit-test it individually.
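For reference, the two fallbacks could be sketched like this. This is a simplified stand-in, not the PR code: the second branch uses eigenvalue clipping, one of the Nocedal & Wright modification strategies, rather than the actual indefinite factorization.

```python
import numpy as np

def solve_singular_sketch(H, g, method="lstsq", eps=1e-8):
    """Solve H @ x = -g when H may be singular (illustrative only)."""
    if method == "lstsq":
        # Minimum-norm least-squares solution; defined even for singular H.
        x, *_ = np.linalg.lstsq(H, -g, rcond=None)
        return x
    if method == "modified_eigh":
        # Modified-Newton strategy (Nocedal & Wright, Ch. 3.4 family):
        # replace tiny or negative eigenvalues so the resulting step
        # is always a descent direction.
        w, V = np.linalg.eigh(H)
        w = np.maximum(np.abs(w), eps)
        return -V @ ((V.T @ g) / w)
    raise ValueError(f"unknown method: {method!r}")
```

Both branches return a descent direction (g @ x < 0) even when H is exactly singular, which is the property the fallback needs for the subsequent line search.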

lorentzenchr (Owner) commented:

Nocedal & Wright write in Chapter 3.4:

As this discussion shows, there is a great deal of freedom in devising modification strategies, and there is currently no agreement on which strategy is best.

Let's see if we find a satisfying solution.

ogrisel (Collaborator, Author) commented Jun 3, 2022:

I moved to a new place and my copy of N & W is still somewhere buried in a cardboard box :)

lorentzenchr (Owner) commented:

I finally went with option a) simple gradient steps, see scikit-learn#23314 (comment).
I think I can close with scikit-learn@8a108bb and scikit-learn@82287af.
