A kind of QRD structure based on MGS
Technical field
Present invention relates generally to extensive multi-antenna technology fields, refer in particular to a kind of novel 2 × 2 tile QRD based on MGS
Structure.
Background technique
Extensive multi-antenna technology (multiple input multiple output, MIMO) is next generation communication technology
One of the key technology of (5G).In extensive mimo system, base station end can service tens equipped with up to a hundred antennas simultaneously
User's (for the sake of simplicity, each user only has 1 antenna).This and 4 foundation station antennas in traditional mimo system service 4
The case where single-antenna subscriber (referred to as 4 × 4MIMO system), is compared, and more antennas provide more spatial multiplexing gain and diversity increases
Benefit.Moreover, simple linear signal processing process can reach near-optimization performance in extensive mimo system.
In extensive mimo system, by being accommodated more in a large amount of antenna of base station equipment with providing bigger freedom degree
More information.Therefore, extensive MIMO can preferably improve the availability of frequency spectrum, channel capacity and company compared to traditional mimo system
Connect reliability.Compared with traditional mimo system, several times of the matrix dimension of the base band signal process algorithm of extensive MIMO is even
Tens times.Especially when algorithm is related to matrix inversion operation and QR decomposition etc..In order to overcome extensive matrix complex degree bottleneck, need
The new structure of one kind is designed to accelerate crucial matrix algorithm unit.
QRD matrix decomposition algorithm obtains extensive utilization in mimo systems, and in some known work, QRD
Become an essential component in transmitting terminal.In general, QRD is used to decompose channel response matrix H as one
A unitary matrice Q and upper triangular matrix R.In extensive MIMO, the number of user and the antenna number of base station are very big at one
Variation in range.The dimension of H changes in a very big range, it needs QRD hardware configuration to decompose the square of this multidimensional
Battle array.For now, already present QRD hardware configuration is concentrated mainly on one or several fixed dimensions in the field of wireless communication
Matrix.Therefore, flexible QRD hardware configuration has very great meaning to wireless communication system in future.
As depicted in figs. 1 and 2, traditional MGS (Modified Gram-Schmidt) hardware algorithm structure is broadly divided into
Two modules of DP, TP.Wherein DP module is mainly used to generate qjAnd rjj, TP module be mainly used to generate update matrixWith
rji, DMP is mainly used to generate qj, rjj,rji。
To the matrix column vector a of input in DP modulej, firstly, by each of which element be separately input to squaring module M1,
Squaring module M2, squaring module M3, in squaring module M4, obtained result deposit Buffer1.Then, by obtained result point
It is not input in adder M1, adder M2, obtained result is separately input in adder M3, the result that adder M3 is obtained
It is stored in Buffer2.Result r is obtained as a result, being input to and opening in more number module M1 obtained in Buffer2jj.In Buffer3
The result arrived is as divisor, ajAs dividend each of which element be separately input to divider M1, divider M2, divider M3,
Result q is obtained in divider M4jIt is stored in Buffer4.
To the matrix column vector a of input in TP moduleiAnd qj, wherein (<=3 i) incremented by successively i and each qjIt is corresponding
a0、a1、a2、a3, aiSuccessively as input, with qjIt is separately input to multiplier M1, multiplier M2, multiplier M3, multiplier M4, is obtained
To result deposit Buffer1 in.The knot that result is deposited into adder M4 respectively, adder M5 is obtained is read from Buffer1
Fruit deposit is input to adder M6, in obtained result deposit Buffer2.The column of result and input obtained in the Buffer2
Vector qjIn each element be separately input to multiplier M5, multiplier M6, multiplier M7, multiplier M8, obtained result is deposited
Enter into Buffer3.Using result obtained in Buffer3 as subtrahend aiSubtracter M1 is separately input to as minuend, is subtracted
Musical instruments used in a Buddhist or Taoist mass M2, subtracter M3, subtracter M4, obtain result
In above-mentioned traditional structure, extensive MIMO matrix dimension is larger, causes to be related to significantly increase when QRD algorithm
The computational complexity of base station,
Summary of the invention
The technical problem to be solved in the present invention is that, for technical problem of the existing technology, the present invention provides one
Kind structure is simple, can be improved the QRD structure based on MGS of overall performance.
In order to solve the above technical problems, the invention adopts the following technical scheme:
A kind of QRD structure based on MGS, including the first module CORE_GEQRT and the second module CORE_TTQRT, it is described
First module CORE_GEQRT is used to 2 × 2 matrix [a to input1,a2] pass through MGS algorithm progress QR decomposition output q1,r11,
r22,sign(f(A)),r12,q2, wherein [q1,q2] it is Q matrix, r11,r12,r22Composition upper triangular matrix R, sign (f (A))=
(a21a12-a11a22)/r11;Using obtain two R matrixes as the input and [1,0 of the second module CORE_TTQRT;01],[0,
0;0,0] matrix [a is collectively constituted11,a12,1,0;0,a22,0,1;a31,a32,0,0;0,a42, 0,0], finally obtain 4 × 2 square
Battle array.
As a further improvement of the present invention: the first module CORE_GEQRT includes:
2 × 2 matrixes of input are stored in buffer1, the matrix-vector a being stored by selector first1Element a11
Two inputs, a as multiplier 121Two inputs as multiplier 2.Obtain two are exportedWithAs addition
The input terminal of device 1, adder 1 obtain resultIt is stored in buffer2;The result that adder 1 obtains is read from buffer2 defeated
Enter to carry out out radical sign module 1, result radical sign operation will be opened obtainedIt is stored in buffer4.It will
Vector a1Make dividend, the r of divider11It is separately input in divider 1 and divider 2 as divisor, obtained result q1It deposits
Enter buffer5;
After vector is input to buffer1, vector a is read from buffer11While read vector a2, first by a11Become
For-a11, then by data-a11, a12, a21, a22It is stored in buffer2;WhenWhen being input to out radical sign module 1, a12, a21As multiplying
The input of musical instruments used in a Buddhist or Taoist mass 1, a22,-a11As the input of multiplier 2, the input that obtain two are exported as adder 1 finally will
In obtained output f (A) deposit buffer3;F (A) is read from buffer3 and carries out the operation that takes absolute value, obtained value is defeated
Enter into buffer4;Meanwhile taking the symbol of f (A) as output SignValue, will | f (A) | with r11It is read from buffer4 defeated
Enter in dividing module 3, by operation result r22It is stored in buffer5;
After the completion of f (A) calculating, by element a11, a12It is input to multiplier 1, by element a21, a22Multiplier 2 is inputted, it will
Input of the obtained output as adder 1, the result that adder 1 obtains are stored in buffer4;The result that adder 1 is obtained
It is taken out from buffer4 and r11It is linked into the input terminal of dividing module 4 jointly, obtains result r12It is stored in buffer5;
When obtaining q1It after Sign (f (A)), is entered into and is input in selector 2, wherein Sign (f (A)) is used as item
Part signal.When Sign (f (A)) is timing q21=q12, q22=-q11, the q when Sign (f (A)) is negative21=-q12, q22=q11。
Obtained q2As output.
As a further improvement of the present invention: the second module CORE_TTQRT includes: including third module QR and
Four module Column update;The third module QR is fully pipelined architecture, in first clock cycle first to its input matrix
A_1=[a11,1;a31, 0] and QR operation is carried out, in second clock cycle to its input matrix A_2=[a22,1;a42, 0] and it carries out
QR operation;Firstly, 2 × 2 matrixes of input are stored in buffer1, the matrix-vector a being stored by selector1Element a11
Two inputs, a as multiplier 321As two inputs of multiplier 4, meanwhile, take a21Symbol as output Sign;It will
Two obtained outputsWithAs the input terminal of adder 2, adder 1 obtains resultIt is stored in buffer2;By addition
The input of result that device 2 obtains carries out out radical sign module 2, the result that will be opened radical sign operation and obtain
As output;By vector a1Make dividend, the r of divider 511It is input to divider 6 as divisor, obtained result q1As defeated
Out.Wherein r12=q11, r22=q12;
Obtain Sign and q1Afterwards, by Sign and q1The alternatively input of device, wherein Sign alternatively signal.Work as Sign
It is q for timing output21=q12, q22=-q11, when Sign is negative, output is q21=-q12, q22=q11;Obtained q2As defeated
Out;
After obtaining the Q matrix of A_1, to a12, a32Column, which are carried out, by Column update updates operation;By a11, q12As
The input a of multiplier 521, q22Multiplier 6 is inputted, by a11, q11Multiplier 7 is inputted, by a21, q21Input multiplier 8;Multiplier 5
The result input summer 3 obtained with multiplier 6 obtains resultThe result input summer that multiplier 7 and multiplier 8 obtain
2 obtain resultObtain matrixTake matrixMiddle submatrixProgress is similarly operated with A_1
Afterwards, to vectorIt is updated operation, obtains 2 × 2 R matrix.
Compared with the prior art, the advantages of the present invention are as follows:
QRD structure based on MGS of the invention, structure is simple, can be improved overall performance, solves extensive MIMO square
Battle array dimension is larger, causes to be related to significantly increase this problem of the computational complexity of base station when QRD algorithm.Due to QRD tile
Algorithm is very suitable for future broadband wireless communication systems, and its bottleneck is the calculating of 2 × 2 tiles, therefore, proposed by the invention
2 × 2 tile structures are very meaningful.
Detailed description of the invention
Fig. 1 is the hardware structural diagram of DP in traditional MGS algorithm.
Fig. 2 is the hardware structural diagram of TP in traditional MGS algorithm.
Fig. 3 is the hardware structural diagram of present invention CORE_GEQRT in specific application example.
Fig. 4 is the hardware structural diagram of present invention CORE_TTQRT in specific application example.
Specific embodiment:
The present invention is described in further details below with reference to Figure of description and specific embodiment.
MGS structure of the invention includes: CORE_GEQRT and CORE_TTQRT, and wherein CORE_GEQRT is used to input
2 × 2 matrix [a1,a2] pass through MGS algorithm progress QR decomposition output q1,r11,r22,sign(f(A)),r12,q2, wherein [q1,q2]
For Q matrix, r11,r12,r22Form upper triangular matrix R, sign (f (A))=(a21a12-a11a22)/r11.By obtain two R squares
Input and [1,0 of the battle array as CORE_TTQRT;01],[0,0;0,0] matrix [a is collectively constituted11,a12,1,0;0,a22,0,1;
a31,a32,0,0;0,a42, 0,0], finally obtain 4 × 2 matrix.
In specific application example, as shown in figure 3, the hardware of CORE_GEQRT forms are as follows:
2 × 2 matrixes of input are stored in buffer1, the matrix-vector a being stored by selector first1Element a11
Two inputs, a as multiplier 121Two inputs as multiplier 2.Obtain two are exportedWithAs adding
The input terminal of musical instruments used in a Buddhist or Taoist mass 1, adder 1 obtain resultIt is stored in buffer2.The result that adder 1 obtains is read from buffer2
Input carries out out radical sign module 1, the result that will be opened radical sign operation and obtainIt is stored in buffer4.
By vector a1Make dividend, the r of divider11It is separately input in divider 1 and divider 2 as divisor, obtained result q1
It is stored in buffer5.
After vector is input to buffer1, vector a is read from buffer11While read vector a2, first by a11Become
For-a11, then by data-a11, a12, a21, a22It is stored in buffer2.WhenWhen being input to out radical sign module 1, a12, a21As multiplying
The input of musical instruments used in a Buddhist or Taoist mass 1, a22,-a11As the input of multiplier 2, the input that obtain two are exported as adder 1 finally will
In obtained output f (A) deposit buffer3.F (A) is read from buffer3 and carries out the operation that takes absolute value, obtained value is defeated
Enter into buffer4;Meanwhile taking the symbol of f (A) as the value of output Sign, and incite somebody to action | f (A) | with r11It is read from buffer4 defeated
Enter in dividing module 3, by operation result r22It is stored in buffer5.
After the completion of f (A) calculating, by element a11, a12It is input to multiplier 1, by element a21, a22Multiplier 2 is inputted, it will
Input of the obtained output as adder 1, the result that adder 1 obtains are stored in buffer4.The result that adder 1 is obtained
It is taken out from buffer4 and r11It is linked into the input terminal of dividing module 4 jointly, obtains result r12It is stored in buffer5.
When obtaining q1It after Sign (f (A)), is entered into and is input in selector 2, wherein Sign (f (A)) is used as item
Part signal.When Sign (f (A)) is timing q21=q12, q22=-q11, the q when Sign (f (A)) is negative21=-q12, q22=q11。
Obtained q2As output.
As shown in figure 4, CORE_TTQRT includes two modules of QR and Column update in specific application example.QR
Module is fully pipelined architecture, in first clock cycle first to its input matrix A_1=[a11,1;a31, 0] and QR operation is carried out,
Second clock cycle is to its input matrix A_2=[a22,1;a42, 0] and carry out QR operation.Firstly, 2 × 2 matrixes of input are deposited
Enter buffer1, the matrix-vector a being stored by selector1Element a11Two inputs, a as multiplier 321As multiplying
Two inputs of musical instruments used in a Buddhist or Taoist mass 4, meanwhile, take a21Symbol as output Sign.Obtain two are exportedWithAs adder
2 input terminal, adder 1 obtain resultIt is stored in buffer2.The result input that adder 2 obtains is subjected to out radical sign module
2, result radical sign operation will be opened obtainedAs output.By vector a1Make the quilt of divider 5
Divisor, r11It is input to divider 6 as divisor, obtained result q1As output.Wherein r12=q11, r22=q12。
Obtain Sign and q1Afterwards, by Sign and q1The alternatively input of device, wherein Sign alternatively signal.Work as Sign
It is q for timing output21=q12, q22=-q11, when Sign is negative, output is q21=-q12, q22=q11.Obtained q2As defeated
Out.
After obtaining the Q matrix of A_1, to a12, a32Column, which are carried out, by Column update updates operation.By a11, q12As
The input a of multiplier 521, q22Multiplier 6 is inputted, by a11, q11Multiplier 7 is inputted, by a21, q21Input multiplier 8.Multiplier 5
The result input summer 3 obtained with multiplier 6 obtains resultThe result input summer that multiplier 7 and multiplier 8 obtain
2 obtain resultObtain matrixTake matrixMiddle submatrixProgress is similarly operated with A_1
Afterwards, to vectorIt is updated operation.Obtain 2 × 2 R matrix.
The above is only the preferred embodiment of the present invention, protection scope of the present invention is not limited merely to above-described embodiment,
All technical solutions belonged under thinking of the present invention all belong to the scope of protection of the present invention.It should be pointed out that for the art
For those of ordinary skill, several improvements and modifications without departing from the principles of the present invention should be regarded as protection of the invention
Range.