Add routines for updating QR decompositions #4249

ewmoore · 2014-12-09T14:18:50Z

This pull request goes on top of #4021; accordingly, only the last few commits are relevant here.

This PR adds three new functions for modifying QR decompositions: qr_update performs rank-k updates, qr_insert adds row(s) or column(s), and qr_delete removes row(s) or column(s).
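For reference, a minimal sketch of how the three routines are called. It assumes the signatures that ended up in scipy.linalg after this PR was merged; argument names at this point in the review may have differed. The matrix sizes and random data are arbitrary.

```python
import numpy as np
from scipy import linalg

rng = np.random.default_rng(0)
a = rng.standard_normal((5, 3))
q, r = linalg.qr(a)  # full mode: q is (5, 5), r is (5, 3)

# Rank-1 update: factors of a + outer(u, v)
u = rng.standard_normal(5)
v = rng.standard_normal(3)
q1, r1 = linalg.qr_update(q, r, u, v)
assert np.allclose(q1 @ r1, a + np.outer(u, v))

# Insert one row so that it becomes row 2 of the new matrix
row = rng.standard_normal(3)
q2, r2 = linalg.qr_insert(q, r, row, 2, which='row')
assert np.allclose(q2 @ r2, np.insert(a, 2, row, axis=0))

# Delete column 1
q3, r3 = linalg.qr_delete(q, r, 1, which='col')
assert np.allclose(q3 @ r3, np.delete(a, 1, axis=1))
```

In each case the inputs q and r are left untouched because the overwrite flags are left at their defaults.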

The methods used are essentially those in [1], with some input from [2].

I've endeavored to write this so that updates can be performed in place as much as possible. This is, for instance, why the rank-1 update and the rank-k update algorithms are slightly different: it allows the rank-1 update to support updating q and r when they are stored in C order. (In this spirit, it might be worth changing linalg.qr to return a Fortran-ordered r, since at present it always returns a C-ordered r and an F-ordered q. NumPy's qr returns both in C order.)

Some outstanding things (Edit: both are done, 3/18/15):

I suppose I'll need to add a 'check_finite' flag to these.
overwrite_qr should probably default to False instead of True.

References

[1] Golub, G. H. & Van Loan, C. F. Matrix Computations, 3rd Ed. (Johns Hopkins University Press, 1996).
[2] Hammarling, S. & Lucas, C. Updating the QR factorization and the least squares problem. 1-73 (The University of Manchester, 2008). http://eprints.ma.man.ac.uk/1192/

josef-pkt · 2014-12-09T16:51:03Z

I don't have much idea about the details.

One question:
Q in scipy.linalg.qr is (M, M), or (M, K) for mode='economic'.

The update routines require (M,M) according to the docstring. Does this also work for the economic (M, K) Q-matrix?

In stats applications, M is often much larger than K (e.g. M, K = 10000, 50).

Another question: Is there anything about singular matrices? How does it behave? Is it supposed to work or is it ruled out? Is it stable in the near singular case, or is it better to regularize in that case on the user side?
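To make the size difference behind this question concrete, here is a small sketch comparing the full and economic factors from scipy.linalg.qr. M is reduced to 1000 here (an arbitrary choice, smaller than the 10000 mentioned above) so that the full Q stays small.

```python
import numpy as np
from scipy import linalg

m, k = 1000, 50
a = np.random.default_rng(0).standard_normal((m, k))

q_full, r_full = linalg.qr(a)                   # q: (m, m), r: (m, k)
q_econ, r_econ = linalg.qr(a, mode='economic')  # q: (m, k), r: (k, k)

# Both factorizations reproduce a
assert np.allclose(q_full @ r_full, a)
assert np.allclose(q_econ @ r_econ, a)

# Storage for Q grows with m*m in full mode but only m*k in economic mode
print(q_full.shape, q_full.nbytes)
print(q_econ.shape, q_econ.nbytes)
```

For float64 data the full Q takes m/k times the memory of the economic Q, which is a factor of 20 in this sketch and would be 200 for M, K = 10000, 50.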

josef-pkt · 2014-12-09T16:56:06Z

And thanks for working on this. There will be many use cases where this is useful for speeding up calculations, for example in statsmodels.

ev-br · 2014-12-09T20:03:40Z

scipy/linalg/_decomp_update.pyx

+
+    p_subdiag_qr(m, n-p, q, qs, r, rs, k, p, work)
+
+    libc.stdlib.free(work)


Cython docs, http://docs.cython.org/src/tutorial/memory_allocation.html, seem to recommend a try-finally block for free. For my education: is it that it's actually not needed?

I'm not actually sure. It seems to me that if it is needed, there must be tons of wrapped code that can leak memory in certain circumstances (multiple threads etc.), given that this compiles down to a simple C function. Perhaps someone who knows for certain can weigh in?

Depends on whether something in between can raise or not. In this case, since it's a nogil function, no exceptions can be raised.

ewmoore · 2014-12-10T14:08:33Z

Nothing works with economy decompositions right now. I don't know of any reason why it couldn't be made to work though. I stopped here mostly because this is already 2000+ lines and that seemed big enough for a first cut at this.

There is nothing here that does anything in particular with singular matrices. Do you have anything in particular in mind? Or a reference?

In [1]: import numpy as np

In [2]: from scipy import linalg

In [3]: np.set_printoptions(precision=4, suppress=True)

In [4]: # a is singular

In [5]: a = np.array([[1, -1, 2],[2,1,1],[1,1,0]], 'd')

In [6]: qa, ra = linalg.qr(a)

In [7]: # make some update vectors

In [8]: u = np.random.randn(3)

In [9]: v = np.random.randn(3)

In [10]: qau, rau = linalg.qr_update(qa, ra, u, v, False)

In [11]: qau
Out[11]:
array([[ 0.0969, -0.9629, -0.2517],
       [ 0.8605, -0.046 ,  0.5073],
       [ 0.5001,  0.2658, -0.8242]])

In [12]: rau
Out[12]:
array([[ 1.5429,  0.4398,  2.8576],
       [ 0.    ,  2.0214, -3.8075],
       [ 0.    ,  0.    , -0.1387]])

In [13]: qa
Out[13]:
array([[ 0.4082, -0.8729, -0.2673],
       [ 0.8165,  0.2182,  0.5345],
       [ 0.4082,  0.4364, -0.8018]])

In [14]: ra
Out[14]:
array([[ 2.4495,  0.8165,  1.633 ],
       [ 0.    ,  1.5275, -1.5275],
       [ 0.    ,  0.    ,  0.    ]])

In [15]: # b is non singular

In [16]: b = a - np.outer(u, v)

In [17]: b
Out[17]:
array([[ 1.8505, -0.0962,  0.0217],
       [ 2.6723,  1.7145, -0.5639],
       [ 1.2284,  1.2428, -0.5313]])

In [18]: qb, rb = linalg.qr(b)

In [19]: qb
Out[19]:
array([[ 0.5325, -0.7994, -0.2781],
       [ 0.769 ,  0.3198,  0.5535],
       [ 0.3535,  0.5086, -0.7851]])

In [20]: rb
Out[20]:
array([[ 3.4748,  1.7067, -0.6099],
       [ 0.    ,  1.2572, -0.4679],
       [ 0.    ,  0.    ,  0.099 ]])

In [21]: qbu, rbu = linalg.qr_update(qb, rb, u, v, False)

In [22]: qbu
Out[22]:
array([[ 0.4082, -0.8729, -0.2673],
       [ 0.8165,  0.2182,  0.5345],
       [ 0.4082,  0.4364, -0.8018]])

In [23]: rbu
Out[23]:
array([[ 2.4495,  0.8165,  1.633 ],
       [ 0.    ,  1.5275, -1.5275],
       [ 0.    ,  0.    , -0.    ]])

In [24]:
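The session above can be condensed into a script that checks the results rather than printing them. This is a sketch that assumes the merged scipy.linalg.qr_update signature and uses a seeded generator, so the random u and v differ from the ones shown in the session.

```python
import numpy as np
from scipy import linalg

# The same singular matrix as in the session
a = np.array([[1, -1, 2], [2, 1, 1], [1, 1, 0]], dtype='d')
rng = np.random.default_rng(0)
u = rng.standard_normal(3)
v = rng.standard_normal(3)

# Singular -> (generically) nonsingular: update the factors of a
qa, ra = linalg.qr(a)
qau, rau = linalg.qr_update(qa, ra, u, v)
assert np.allclose(qau @ rau, a + np.outer(u, v))
assert np.allclose(qau.T @ qau, np.eye(3))

# Nonsingular -> singular: start from b = a - u v^T and update back to a
b = a - np.outer(u, v)
qb, rb = linalg.qr(b)
qbu, rbu = linalg.qr_update(qb, rb, u, v)
assert np.allclose(qbu @ rbu, a)

# The last diagonal entry of the updated r shows the rank deficiency
print(abs(rbu[2, 2]))
```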

josef-pkt · 2014-12-10T15:00:34Z

> Nothing works with economy decompositions right now. I don't know of any reason why it couldn't be made to work though. I stopped here mostly because this is already 2000+ lines and that seemed big enough for a first cut at this.

Ok, understandable. The increased memory handling might eat away all the advantages of efficient updating for most of our (statsmodels) applications.

> There is nothing here that does anything in particular with singular matrices. Do you have anything in particular in mind? Or a reference?

This was mainly a usage question. Two issues came up when I was reading around (skimming many articles) for this.
First, downdating is numerically less stable than updating; the references mentioned a special rotation for downdating that I didn't try to figure out. My main takeaway was to be careful when using the QR if the downdating results in a matrix that is singular but carries numerical noise.
The second issue is, I think, a pure application issue. IIRC (?) there were some problems with updating singular matrices, and I've seen it handled by adding a small value to the diagonal (a tiny Ridge regularization).
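One way to read the "tiny Ridge" remark, sketched here as an assumption rather than as what any particular reference does: adding lam to the diagonal of A.T @ A is the same as stacking sqrt(lam) times the identity under A, and the stacked matrix always has full column rank. The value of lam and the test matrix are arbitrary.

```python
import numpy as np

# A singular 3x3 matrix (the same one used in the session earlier in this thread)
a = np.array([[1., -1., 2.], [2., 1., 1.], [1., 1., 0.]])
lam = 1e-8

# Stack sqrt(lam) * I under a; this is the least-squares form of ridge
a_ridge = np.vstack([a, np.sqrt(lam) * np.eye(3)])

# The Gram matrix of the stacked matrix is a.T @ a with lam added to the diagonal
gram = a_ridge.T @ a_ridge
assert np.allclose(gram, a.T @ a + lam * np.eye(3), rtol=0, atol=1e-12)

# The stacked matrix has full column rank even though a does not
print(np.linalg.matrix_rank(a), np.linalg.matrix_rank(a_ridge))
```

In terms of the routines in this PR, the stacked rows could be appended to an existing factorization with a row insertion, though whether that is numerically preferable is left open here.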

ewmoore · 2014-12-19T15:03:27Z

I've pushed support for rank-k updates to economic mode decompositions.

Right now, the update is A + uv**T, but it would only be a tiny change to support A + b*uv**T where b is a scalar. Ultimately these are the same, since b can be folded into u or v, but I'm explicitly using a 1 for b right now and could easily expose this.

ev-br · 2014-12-19T15:25:59Z

Can these be generalized to handle A + buv**T with a diagonal matrix b?

ewmoore · 2014-12-19T17:03:21Z

I suppose it could be. In that case I'd end up scaling the rows of u by the diagonal entries of b just about exactly as you would do by hand.

The general approach to performing updates is to reduce q.T.dot(u) to a scalar multiple of the first unit vector using a unitary matrix, then add v times this scalar to the first row of r. This gives an easy place to put a scalar b, but there isn't a similar place for a diagonal matrix b. That being said, if you have a good use case, it's straightforward and might be worth doing.
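The procedure described above can be sketched in plain NumPy as follows. This is only an illustration for real, full-mode (square q) factorizations, using Givens rotations as the unitary transformations. The helper names givens, rotate and qr_rank1_update are made up for this sketch, and it is not the Cython code in this PR.

```python
import numpy as np

def givens(a, b):
    """Return (c, s) with [[c, s], [-s, c]] @ [a, b] == [hypot(a, b), 0]."""
    h = np.hypot(a, b)
    if h == 0.0:
        return 1.0, 0.0
    return a / h, b / h

def rotate(c, s, q, r, k):
    """Rotate rows k, k+1 of r and columns k, k+1 of q; q @ r is unchanged."""
    g = np.array([[c, s], [-s, c]])
    r[k:k + 2, :] = g @ r[k:k + 2, :]
    q[:, k:k + 2] = q[:, k:k + 2] @ g.T

def qr_rank1_update(q, r, u, v):
    """Return factors of q @ r + outer(u, v) for real, square q."""
    q, r = q.copy(), r.copy()
    m, n = r.shape
    w = q.T @ u
    # Step 1: zero w from the bottom up; r becomes upper Hessenberg
    for k in range(m - 2, -1, -1):
        c, s = givens(w[k], w[k + 1])
        w[k], w[k + 1] = c * w[k] + s * w[k + 1], 0.0
        rotate(c, s, q, r, k)
    # Step 2: w is now w[0] * e_1, so the update only touches the first row
    r[0, :] += w[0] * v
    # Step 3: remove the subdiagonal to make r upper triangular again
    for k in range(min(m - 1, n)):
        c, s = givens(r[k, k], r[k + 1, k])
        rotate(c, s, q, r, k)
        r[k + 1, k] = 0.0
    return q, r

rng = np.random.default_rng(0)
a = rng.standard_normal((5, 3))
u, v = rng.standard_normal(5), rng.standard_normal(3)
q0, r0 = np.linalg.qr(a, mode='complete')
q1, r1 = qr_rank1_update(q0, r0, u, v)
assert np.allclose(q1 @ r1, a + np.outer(u, v))
```

A scalar b would enter in step 2 as r[0, :] += b * w[0] * v, which is the "easy place" mentioned above.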

ev-br · 2014-12-19T17:34:01Z

Yup, you're right of course. No point complicating the API IMO for something which is easy to do by hand.
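The "by hand" version for both the scalar and the diagonal case amounts to scaling u before calling the update routine. A quick NumPy check of the two identities, with arbitrary sizes and values:

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.standard_normal((4, 3))
u = rng.standard_normal(4)
v = rng.standard_normal(3)

# Scalar b: A + b*u*v^T equals a plain rank-1 update with b folded into u (or v)
b = 2.5
target = a + b * np.outer(u, v)
assert np.allclose(target, a + np.outer(b * u, v))
assert np.allclose(target, a + np.outer(u, b * v))

# Diagonal b: A + diag(d)*u*v^T equals a rank-1 update with u scaled elementwise
d = rng.standard_normal(4)
target_diag = a + np.diag(d) @ np.outer(u, v)
assert np.allclose(target_diag, a + np.outer(d * u, v))
```

So either case can be handled by passing b * u or d * u as the u argument of the existing rank-1 update.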

rgommers · 2014-12-20T10:33:57Z

Failing test seems to need atol=1e-7.

argriffing · 2015-01-07T16:25:12Z

Is this PR still waiting for the changes suggested in #4021 (comment)?

By the way, I noticed that the code includes a workaround for a numpy 1.5.1 limitation. Now that support for numpy versions < 1.6 has been dropped, this workaround could be removed.

ewmoore · 2015-01-07T16:36:20Z

Mostly. It still needs a little work on updating economic decompositions for column additions. But since, as it is now, it depends on #4021, it can't be merged until that is finished. It does not have to work that way: LAPACK is always available when building scipy, so this could just link directly if desired.

Some code review would also be good. Even though there haven't been any comments, I'll bet there are at least some style issues if not actual code issues.

rgommers · 2015-03-15T09:10:16Z

scipy/linalg/_decomp_update.pyx

+    for 1D arrays these are equivalent.  This is Numpy's gh-2287, fixed in
+    9b8ff38.
+
+    FIXME: Is it worth only applying this for numpy 1.5.1? 


That commit is in numpy 1.6.2, which is now the lowest supported version. So this can go.

ewmoore · 2015-04-06T15:24:03Z

Rebased.

rgommers · 2015-04-12T07:26:34Z

Okay, 6 days since the message on the mailing list that this is about to be merged. Time to green-button it I think, then it gets to sit in master for a while before branching for the next release.

rgommers · 2015-04-12T07:27:55Z

Or better, it could be used inside signal.place_poles already. @I--P, this is now available.

I--P · 2015-04-12T10:05:11Z

@rgommers Nice! I'm not sure I'll have time to update place_poles with it before 0.16 though.

rgommers · 2015-04-12T10:14:08Z

No worries. If you may not be able to do that for a long time, it may be an idea to open an enhancement issue that explains what needs to be done.

argriffing · 2015-04-13T19:31:33Z

Should _decomp_update.pyx be added to .gitignore? I think the input file is named _decomp_update.pyx.in, and _decomp_update.c is already ignored.

rgommers · 2015-04-13T19:45:42Z

Indeed. Want to fix that, or should I?

argriffing · 2015-04-13T19:48:15Z

You can fix it while I let my scipy build recover from the numpy deprecation flag incompatibility :)

rgommers · 2015-04-13T19:55:23Z

done

jseabold mentioned this pull request Dec 9, 2014

Updating QR statsmodels/statsmodels#2126

Open

ev-br reviewed Dec 9, 2014

I--P mentioned this pull request Dec 19, 2014

Pole placement #4295

Closed

rgommers added the enhancement and scipy.linalg labels Dec 20, 2014

ewmoore force-pushed the updateqr2 branch from 26bdba5 to 2bf0031 Compare December 30, 2014 20:34

ewmoore mentioned this pull request Jan 7, 2015

The norm of a size zero array should be zero. numpy/numpy#5420

Closed

ev-br mentioned this pull request Feb 9, 2015

ENH: linalg: Adding wrapper for potentially useful LAPACK function *lasd4. #4491

Merged

ewmoore mentioned this pull request Feb 27, 2015

stable implementation of givens rotation #4571

Closed

rgommers reviewed Mar 15, 2015

ewmoore added 15 commits April 6, 2015 09:30

ENH: Support rank-k updates to thin QR decompositions

85bdf2f

This is implemented by apply k consecutive rank-1 updates rather than attempting to block this algorithm. As part of this, the input validation code in qr_update was also reworked.

ENH: Support check_finite in qr update routines

66778c1

ENH: Support row deletion to thin QR decompositions

a319330

ENH: support column deletions with thin QR decompositions

f7da055

TST: bump test tolerance

5c93e38

Works for me, but travis seems to fail depending on the exact configuration used.

ENH: support row insertion into thin QR decompositions

3fb4357

ENH: Support inserting columns in thin QR decompositions

7f0cd22

This isn't so great yet for inserting a column vector that lies in the span of q.dot(r).

MAINT: Clean up qr_insert entry point.

7d2e7bb

STY: address review comments, improve error messages. etc.

dce80e9

MAINT: Remove numpy 1.5 work arounds

6c08676

MAINT: Add _update_decomp extension to the bento build

85d857c

ENH: add rcond to qr_insert, only used for col insert to thin qrs.

307b4f3

The output is not a QR decomposition if the column inserted lies in the span of Q. This will now fail in that case instead of returning an incorrect result.

BUG: change overwrite default to False in qr update routines

b4c313b

MAINT: Use templates to remove some nearly duplicated code

6a8a428

ENH: release the GIL around computation routines in _decomp_update

d8b8f54

ewmoore force-pushed the updateqr2 branch 2 times, most recently from 7381603 to da0616f Compare April 6, 2015 15:23

MAINT: rebase changes

a31efd9

ewmoore force-pushed the updateqr2 branch from da0616f to a31efd9 Compare April 6, 2015 20:59

rgommers pushed a commit that referenced this pull request Apr 12, 2015

Merge pull request #4249 from ewmoore/updateqr2

02ca7d2

ENH: add routines for updating QR decompositions

rgommers merged commit 02ca7d2 into scipy:master Apr 12, 2015
