
CN1165176C - A fast sub-pixel motion estimation method based on prediction direction correction/statistical prediction - Google Patents


Info

Publication number
CN1165176C
Authority
CN
China
Prior art keywords
sub
pixel
point
sad
motion estimation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB021242542A
Other languages
Chinese (zh)
Other versions
CN1396774A (en)
Inventor
何芸
陈志波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
SVA Group Co Ltd
Original Assignee
Tsinghua University
SVA Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University, SVA Group Co Ltd filed Critical Tsinghua University
Priority to CNB021242542A priority Critical patent/CN1165176C/en
Publication of CN1396774A publication Critical patent/CN1396774A/en
Application granted granted Critical
Publication of CN1165176C publication Critical patent/CN1165176C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention belongs to the field of video coding in signal processing and relates to a fast sub-pixel motion estimation method based on prediction direction correction/statistical prediction. In sub-pixel motion estimation at 1/2, 1/4, 1/8 and similar accuracies, the method uses the motion estimation result of the previous level of search accuracy, together with statistical information at the current accuracy, to pre-judge the motion vector of the current level, and it introduces a search cutoff criterion. While maintaining the rate-distortion characteristics of the prior art, it greatly reduces the computational complexity of sub-pixel motion estimation in software and hardware. The method is not limited to the H.264 international standard and can be extended to other international-standard and non-standard video coding applications. The method is also extensible: it can be combined with many integer-pixel motion estimation algorithms, and it allows the trade-off between computational complexity and prediction accuracy to be adjusted.

Description

Fast sub-pixel motion estimation method based on prediction direction correction/statistical prejudgment
Technical field
The invention belongs to the field of video coding in signal processing. In particular, it proposes a new fast sub-pixel motion search method for the latest video coding standard, H.264, which significantly reduces the computation of the sub-pixel motion estimation part in software and hardware implementations while maintaining video coding efficiency.
Background technology
Traditional video coding standards, such as H.261, H.263, H.26L and H.264 formulated by the ITU, and MPEG-1, MPEG-2 and MPEG-4 formulated by the MPEG group of the ISO, are all based on the hybrid coding framework. The hybrid coding framework combines prediction, transform and entropy coding, and has the following main features:
1) prediction is used to remove temporal redundancy;
2) transform is used to remove spatial redundancy;
3) entropy coding is used to remove statistical redundancy.
All of the above video coding standards have intra-coded frames (I frames) and inter-coded frames (P frames), and I frames and P frames are coded differently. An I frame is coded as follows: the original image data undergo a two-dimensional transform (discrete cosine transform or integer transform); the transform coefficients are then quantized in the transform domain; finally entropy coding, such as Huffman coding or arithmetic coding, is applied. A P frame is coded as follows: motion estimation yields a motion vector; motion-compensated inter prediction is then performed; the residual block obtained from inter prediction undergoes a two-dimensional transform; the transform-domain coefficients are quantized; finally entropy coding is applied.
Because video sequences are strongly correlated in the time domain, inter prediction is the key factor in improving coding gain, so motion estimation and motion compensation are very important parts of a video coding scheme.
Motion estimation is divided into two parts: integer-pixel motion estimation and sub-pixel motion estimation. Integer-pixel motion estimation searches a $(2W_x+1)\times(2W_y+1)$ window in the reference frame, positioned at the pixel that corresponds to the current pixel of the current frame, for the matching block with the minimum cost function, where $W_x$ and $W_y$ are the search width and height parameters. Sub-pixel motion estimation then searches the sub-pixel points around the best integer pixel (the one corresponding to the optimal integer-pixel motion vector) to obtain the best sub-pixel point, which corresponds to the optimal sub-pixel motion vector. For example, the commonly used half-pixel search examines the 8 half-pixel points around the best integer pixel to find the best sub-pixel point. Motion compensation with sub-pixel accuracy greatly improves coding efficiency: H.263, which uses half-pixel motion compensation, improves the signal-to-noise ratio by more than about 1 dB at the same bit rate compared with H.261, which uses only integer-pixel motion compensation. Higher sub-pixel accuracy, such as 1/4 or 1/8 pixel, yields higher coding gain, but the complexity of the corresponding filter design and coding also increases. The MPEG-4 standard adopts motion compensation with 1/4-pixel accuracy.
H.264, the video coding standard currently being formulated, absorbs the achievements of many years of video coding development and surpasses previous video coding standards in both coding efficiency and functionality. Its basic framework is still the hybrid coding framework, and its motion estimation accuracy can reach 1/8 pixel. Fig. 1 is a schematic diagram of the sub-pixel positions and their motion search ranges. In the figure, capital letters ($C$, $H_i$, $V_i$, $D_i$) denote integer-pixel positions, Roman numerals (I, II, III, ...) denote half-pixel positions, lowercase letters (a, b, c, ...) denote 1/4-pixel positions, and Arabic numerals (1, 2, 3, ...) denote 1/8-pixel positions. Motion estimation for each macroblock during video coding basically consists of the following steps:
1. perform the integer-pixel motion search to obtain the motion vector of integer-pixel accuracy and the corresponding best integer pixel C;
2. among the 8 half-pixel positions I~VIII around the best integer pixel C, find the best half-pixel point V;
3. among the 8 1/4-pixel positions a~h around the best half-pixel point V, find the best 1/4-pixel point h;
4. among the 8 1/8-pixel positions 1~8 around the best 1/4-pixel point h, find the best 1/8-pixel point 1.
Finding the best matching block in a motion search requires a matching criterion. The cost function generally used is the sum of absolute differences (SAD), defined as:
$$\mathrm{SAD}(P) = \sum_{i=0}^{N-1}\sum_{j=0}^{N-1}\left| f(i,j,t) - f(i-x,\,j-y,\,t-1) \right| \tag{1}$$
Here the matching block is assumed to be of size $N \times N$, $f(i,j,t)$ is the luminance value of the pixel at coordinate $(i,j)$ in the image frame at time $t$, and $(x,y)$ are the two components of the motion vector that points from the position of the current block in the current frame to point $P$ in the reference frame.
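As a concrete illustration of formula (1), the following minimal Python sketch computes the SAD of one block for a given motion vector. The function name `sad`, the list-of-lists frame layout (`frame[i][j]`, with `i` horizontal and `j` vertical as in $f(i,j,t)$) and the block-origin arguments `i0`, `j0` are assumptions made here for illustration; they are not taken from the patent.

```python
def sad(cur_frame, ref_frame, i0, j0, n, x, y):
    """Formula (1) for the n x n block whose origin is (i0, j0).

    cur_frame plays the role of f(., ., t) and ref_frame of f(., ., t-1);
    (x, y) is the candidate motion vector. The caller must keep
    i0 + i - x and j0 + j - y inside ref_frame (no bounds handling here).
    """
    total = 0
    for i in range(n):
        for j in range(n):
            total += abs(cur_frame[i0 + i][j0 + j]
                         - ref_frame[i0 + i - x][j0 + j - y])
    return total


# Tiny usage example with made-up luminance values.
ref = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11], [12, 13, 14, 15]]
cur = [[9, 9, 9, 9], [9, 0, 1, 9], [9, 4, 5, 9], [9, 9, 9, 9]]
cost_shifted = sad(cur, ref, 1, 1, 2, 1, 1)  # compares with ref block at origin (0, 0)
cost_zero_mv = sad(cur, ref, 1, 1, 2, 0, 0)  # compares with ref block at origin (1, 1)
```

In this toy example the current block is an exact copy of the reference block at origin (0, 0), so the motion vector (1, 1) gives a cost of zero, while the zero vector does not.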
It follows that, to obtain a motion vector of 1/8-pixel accuracy, the sub-pixel search part alone requires formula (1) to be evaluated at 24 points, plus 24 additional interpolation computations.
In the overall motion estimation computation, integer-pixel motion vector estimation has traditionally accounted for a very large share of the operations. For example, with a search range of 32, full search at integer-pixel accuracy requires formula (1) to be evaluated at 4225 points. Earlier research on fast motion estimation therefore concentrated on integer-pixel motion estimation and neglected the cost of sub-pixel motion estimation. As research on fast integer-pixel motion estimation has advanced, however, the computation required for integer-pixel estimation has steadily fallen: current results show that the number of points searched in integer-pixel motion estimation can be brought below 10 while quite good coding efficiency is maintained at various bit rates. As a result, sub-pixel motion estimation takes up an increasingly large proportion of the total motion estimation computation, especially when motion vectors of higher pixel accuracy are used, and it is becoming the bottleneck that limits further reduction. Research on fast sub-pixel motion estimation methods is therefore increasingly important.
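The point counts quoted above can be checked with a few lines of arithmetic. The sketch below is only a back-of-the-envelope illustration; the helper names are hypothetical, and the value 10 for a fast integer-pixel search is the upper bound mentioned in the text, not a measured figure.

```python
def full_search_points(w_x, w_y):
    """Number of positions in a (2*Wx + 1) x (2*Wy + 1) search window."""
    return (2 * w_x + 1) * (2 * w_y + 1)


def sub_pixel_points(levels, neighbours=8):
    """Exhaustive sub-pixel search: 8 neighbours at each accuracy level."""
    return levels * neighbours


integer_full = full_search_points(32, 32)   # full search with range 32
sub_pixel = sub_pixel_points(3)             # 1/2, 1/4 and 1/8 pixel levels
fast_integer = 10                           # upper bound quoted in the text

# Share of SAD evaluations spent on the sub-pixel stage in each scenario.
share_with_full_search = sub_pixel / (sub_pixel + integer_full)
share_with_fast_search = sub_pixel / (sub_pixel + fast_integer)
```

With full search the sub-pixel stage is well under 1% of the SAD evaluations; with a fast integer-pixel search of about 10 points it rises to roughly 70%, which is the bottleneck the text describes.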
Summary of the invention
The objective of the invention is to overcome the shortcomings of the prior art by proposing a fast sub-pixel motion search method based on prediction direction correction/statistical prejudgment. It comprises a fast motion estimation method based on prediction and direction correction, and a fast motion estimation method based on statistical prejudgment. While maintaining the rate-distortion characteristics of the prior art, the method greatly reduces the computational complexity of sub-pixel motion estimation in software and hardware. The method is not limited to the H.264 international standard and can be extended to other international-standard and non-standard video coding applications. The method is also extensible: it can be combined with many integer-pixel motion estimation algorithms, and it allows the trade-off between computational complexity and prediction accuracy to be adjusted.
The prediction-based direction correction fast motion estimation method in the sub-pixel motion estimation method proposed by the invention comprises the following three steps:
1/2-pixel motion vector prediction (Prediction), direction correction (Directional Refinement), and the cutoff criterion (Half-Stop). They are described in turn below:
1. 1/2-pixel motion vector prediction (Prediction):
Two assumptions are made. The first is that the cost function around the best integer pixel is a smooth convex function. The second is that the cost functions of the four integer pixels adjacent to the best integer pixel are known (these are positions V1, V2, H1 and H2 in Fig. 1); they are denoted SAD(V1), SAD(V2), SAD(H1) and SAD(H2), and the cost function of the center point is SAD(C). The second assumption rests on the fact that many current fast integer-pixel motion estimation methods are based on a diamond search pattern. The likely direction of the next-step half-pixel motion vector can therefore be predicted from the cost functions of the four adjacent integer pixels.
The specific steps are as follows:
1) Among SAD(V1), SAD(V2), SAD(H1) and SAD(H2), select the minimum value $\mathrm{SAD}_{min}$ to obtain the point of minimum cost (hereinafter the minimum point), and select the second-smallest value $\mathrm{SAD}_{sub}$ to obtain the point of second-smallest cost (hereinafter the second-minimum point). The corresponding pixels are $P_{min}$ and $P_{sub}$ respectively:
$$P_{min} = \arg\min_{P_j} \mathrm{SAD}(P_j), \quad \text{subject to } P_j \in \Omega$$
$$P_{sub} = \arg\min_{P_j} \mathrm{SAD}(P_j), \quad \text{subject to } P_j \in (\Omega - P_{min})$$
$$\Omega = \{V_1, V_2, H_1, H_2\} \tag{2}$$
2) If $P_{min}$ and $P_{sub}$ lie on the same horizontal or vertical line, i.e. $P_{min}(x) = P_{sub}(x)$ or $P_{min}(y) = P_{sub}(y)$, select the two 1/2-pixel points on that line, together with integer pixel C, as the candidate points for the next-step 1/2-pixel motion estimation (for example, in Fig. 1, if V1 and V2 are the minimum point and the second-minimum point, select II and VII as the half-pixel estimation points);
3) If $P_{min}$ and $P_{sub}$ do not lie on the same horizontal or vertical line, i.e. $P_{min}(x) \neq P_{sub}(x)$ and $P_{min}(y) \neq P_{sub}(y)$, the three 1/2-pixel points lying on and between the two line segments that connect these two points to the center point C, together with integer pixel C, form the candidate points for 1/2-pixel estimation (for example, in Fig. 1, if V1 and H1 are the minimum point and the second-minimum point, select I, II and IV as the half-pixel estimation points);
4) Among the candidate points, the one with the smallest cost function is the 1/2-pixel minimum point, and the motion vector corresponding to it is the 1/2-pixel motion vector. The one with the second-smallest cost function is the 1/2-pixel second-minimum point.
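The candidate selection in steps 1) to 3) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the coordinate convention (V1 at offset (0, -1), V2 at (0, 1), H1 at (-1, 0), H2 at (1, 0) relative to C) is an assumption, since Fig. 1 is not reproduced here, and the function name is hypothetical.

```python
def predict_half_pel_candidates(neighbour_sad):
    """Return candidate offsets, in half-pixel units relative to C.

    neighbour_sad maps the integer-pixel offset (dx, dy) of each of the
    four neighbours of C to its SAD. A neighbour at integer offset
    (dx, dy) has the half-pixel point between it and C at the same
    numeric offset (dx, dy) once the unit is read as half pixels.
    """
    ranked = sorted(neighbour_sad, key=neighbour_sad.get)
    p_min, p_sub = ranked[0], ranked[1]
    if p_min[0] == p_sub[0] or p_min[1] == p_sub[1]:
        # Step 2): both points on one line through C, two half-pixel points.
        half_pels = [p_min, p_sub]
    else:
        # Step 3): add the diagonal half-pixel point between the two segments.
        diagonal = (p_min[0] + p_sub[0], p_min[1] + p_sub[1])
        half_pels = [p_min, p_sub, diagonal]
    return [(0, 0)] + half_pels  # integer pixel C is always a candidate


# Assumed layout: V1 above C, V2 below, H1 left, H2 right.
V1, V2, H1, H2 = (0, -1), (0, 1), (-1, 0), (1, 0)
collinear = predict_half_pel_candidates({V1: 10, V2: 12, H1: 30, H2: 40})
corner = predict_half_pel_candidates({V1: 10, H1: 12, V2: 30, H2: 40})
```

Step 4) then evaluates the cost function only at the returned offsets, that is, at 2 or 3 half-pixel points instead of all 8.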
2. Direction correction (Directional Refinement)
The direction correction technique was published by the inventors at the IEEE ISCAS international conference in 2002 and is used for 1/4- and 1/8-pixel motion vectors. Combined with the 1/2-pixel motion vector prediction above and the cutoff criterion described next, it forms a complete sub-pixel motion vector method.
On the basis of the motion estimation result at the previous level, the direction of motion estimation is further corrected to ensure the accuracy of the higher-accuracy motion vector. The specific steps, illustrated in Fig. 2, are as follows:
1) In the motion estimation at the previous level of accuracy, the positions of the minimum point and the second-minimum point are $P_{min}$ and $P_{sub}$. Their relative position falls into one of the two patterns shown in Fig. 2: (a) $P_{min}$ and $P_{sub}$ lie on the same horizontal or vertical line, i.e. $P_{min}(x) = P_{sub}(x)$ or $P_{min}(y) = P_{sub}(y)$; (b) $P_{min}$ and $P_{sub}$ do not lie on the same horizontal or vertical line, i.e. $P_{min}(x) \neq P_{sub}(x)$ and $P_{min}(y) \neq P_{sub}(y)$;
2) In each pattern, the three pixels of the current accuracy between $P_{min}$ and $P_{sub}$ are taken as candidate points. Fig. 2 gives an example for each pattern: the three current-accuracy pixels marked with bold Arabic numerals are chosen as candidate pixels.
3) From the set formed by the selected candidate pixels and the best matching point of the previous-level search, the point with the smallest cost function is selected as the minimum point at this accuracy level; it corresponds to the motion vector at this accuracy level. The point with the second-smallest cost function is the second-minimum point at this accuracy level.
4) Direction correction of the motion vector at both 1/4-pixel and 1/8-pixel accuracy uses steps 1) to 3) above.
3. Cutoff criterion (Half-Stop)
In general inter coding, the motion-compensated residual block undergoes a two-dimensional transform, the transform-domain coefficients are quantized, and entropy coding is applied last. When the residual is smaller than a certain value, all of its transform coefficients become zero after quantization and need not be coded. During motion estimation, therefore, once the estimated cost function has fallen below a certain level there is no need to keep searching for a smaller value, because doing so cannot improve coding efficiency any further. The fast sub-pixel method proposed in the invention therefore adopts the following search cutoff criterion:
When the cost function of an estimated point satisfies SAD < T, the motion estimation process stops. T is a threshold; it may be a fixed value (obtained from experimental results and set as an empirical tolerance on the matching cost), or it may be estimated from the integer transform formula and the quantization scheme in H.264.
The working principle of the prediction-based direction correction fast motion estimation method in the sub-pixel motion estimation method proposed by the invention is as follows:
The prediction and direction correction method relies on the assumption that the cost function is reasonably smooth around the optimal motion vector. It predicts the direction of the motion vector at the next level of accuracy from the cost function values at adjacent positions, and uses the search cutoff criterion to avoid unnecessary computation. The computation of sub-pixel motion estimation is reduced to about 1/3 of the original while the original coding performance is maintained. This also helps reduce computation in hardware implementations, where the complexity of the interpolation operations in sub-pixel motion estimation is likewise reduced to about 1/3.
The fast motion estimation method based on statistical prejudgment in the sub-pixel motion estimation method proposed by the invention performs sub-pixel motion estimation from 1/2-pixel through 1/4- and 1/8-pixel accuracy using one consistent prediction pattern. It can be summarized in three steps: one-dimensional matching estimation prediction, two-dimensional matching estimation computation, and the cutoff criterion. They are described in turn below.
1. One-dimensional matching estimation prediction:
This consists of three steps:
(1) Compute the VSum(P) value of each position used in the one-dimensional matching estimation. These values are obtained by median filtering from the VSum(P) values of the one-dimensional matching estimation at the previous level of search accuracy (for 1/2-pixel accuracy the previous level is integer-pixel accuracy; for 1/4- and 1/8-pixel accuracy the previous levels are 1/2- and 1/4-pixel accuracy respectively);
(2) Perform the one-dimensional matching estimation prediction for all search points according to the formula
$$\mathrm{VSAD}(P) = \sum_{i=0}^{N-1}\left| \sum_{j=0}^{N-1} f(i,j,t) - \sum_{j=0}^{N-1} f(i-x,\,j-y,\,t-1) \right| = \sum_{i=0}^{N-1}\left| \mathrm{VSum}(i,0,t) - \mathrm{VSum}(i-x,\,-y,\,t-1) \right| \tag{4}$$
(3) Using the triangle-inequality decision rule, select the set $\Pi$ of points that require the two-dimensional matching estimation computation:
$$\Pi = \{P_i \ \text{s.t.}\ \mathrm{VSAD}(P_i) \le \alpha \cdot \mathrm{SAD}(P_{min})\} \tag{5}$$
2. Two-dimensional matching estimation computation:
Within the set $\Pi$ obtained from the one-dimensional matching estimation prediction, perform the two-dimensional matching estimation computation and select the best matching point $P_{min}$, which satisfies:
$$P_{min} = \arg\min_{P_j} \mathrm{SAD}(P_j), \quad \text{subject to } P_j \in \Pi \tag{6}$$
3. Cutoff criterion:
In general inter coding, the motion-compensated residual block undergoes a two-dimensional transform, the transform-domain coefficients are quantized, and entropy coding is applied last. When the residual is smaller than a certain value, all of its transform coefficients become zero after quantization and need not be coded. During motion estimation, therefore, once the estimated cost function has fallen below a certain level there is no need to keep searching for a smaller value, because doing so cannot improve coding efficiency any further. The fast sub-pixel method proposed here therefore adopts the following search cutoff criterion:
When the cost function of an estimated point satisfies SAD < T, the motion estimation process stops. T is a threshold; it may be a fixed value, or it may be estimated from the integer transform formula and the quantization scheme in H.264.
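Steps 1 and 2 above amount to a cheap one-dimensional pre-screen followed by a full two-dimensional match on the survivors. The sketch below illustrates that flow under simplifying assumptions: each candidate is represented by an already interpolated reference block (`ref_blocks`, a hypothetical mapping), whereas the method described here obtains the VSum values of sub-pixel positions by filtering previous-level VSum values. Blocks are indexed `block[i][j]` with `i` horizontal and `j` vertical, so `block[i]` is one column. The cutoff criterion of step 3 is left out for brevity, and returning `None` when no point survives is a choice made for this sketch.

```python
def column_sums(block):
    """VSum for each i: the sum of block[i][j] over j."""
    return [sum(column) for column in block]


def vsad(cur_block, ref_block):
    """One-dimensional matching error, as in formula (4)."""
    return sum(abs(a - b)
               for a, b in zip(column_sums(cur_block), column_sums(ref_block)))


def sad(cur_block, ref_block):
    """Two-dimensional matching error between two equally sized blocks."""
    return sum(abs(a - b)
               for cur_col, ref_col in zip(cur_block, ref_block)
               for a, b in zip(cur_col, ref_col))


def prejudge_and_match(cur_block, ref_blocks, sad_prev_min, alpha):
    """Formula (5) pre-screen, then formula (6) on the surviving points."""
    survivors = [p for p, blk in ref_blocks.items()
                 if vsad(cur_block, blk) <= alpha * sad_prev_min]
    if not survivors:
        return None, survivors  # assumption: caller keeps the previous best
    best = min(survivors, key=lambda p: sad(cur_block, ref_blocks[p]))
    return best, survivors


cur = [[1, 2], [3, 4]]
candidates = {"A": [[1, 2], [3, 4]], "B": [[2, 1], [4, 3]], "C": [[9, 9], [9, 9]]}
best, kept = prejudge_and_match(cur, candidates, sad_prev_min=10, alpha=1.0)
```

In this toy data, candidate C is rejected by the one-dimensional test alone, so the full SAD is computed only for A and B. Candidate B also shows that VSAD can be zero while SAD is not, which is why the two-dimensional step is still needed.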
The working principle of the statistical-prejudgment fast motion estimation method in the sub-pixel motion estimation method proposed by the invention is as follows:
Triangle-inequality decision rule (a publicly known technique):
The error matching function commonly used in motion estimation is the sum of absolute differences:
$$\mathrm{SAD}(P) = \sum_{i=0}^{N-1}\sum_{j=0}^{N-1}\left| f(i,j,t) - f(i-x,\,j-y,\,t-1) \right| \tag{7}$$
By first computing the sum of each column of the current block and of the reference prediction block, and then performing a one-dimensional error matching operation on those sums, one obtains:
$$\mathrm{VSAD}(P) = \sum_{i=0}^{N-1}\left| \sum_{j=0}^{N-1} f(i,j,t) - \sum_{j=0}^{N-1} f(i-x,\,j-y,\,t-1) \right| = \sum_{i=0}^{N-1}\left| \mathrm{VSum}(i,0,t) - \mathrm{VSum}(i-x,\,-y,\,t-1) \right| \tag{8}$$
By the triangle inequality:
$$\mathrm{VSAD}(P) \le \mathrm{SAD}(P) \tag{9}$$
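The step from formula (8) to formula (9) can be spelled out in one line. The shorthand $a_{i,j} = f(i,j,t)$ and $b_{i,j} = f(i-x,\,j-y,\,t-1)$ is introduced here only for brevity. For each fixed $i$, the triangle inequality gives

$$\left| \sum_{j=0}^{N-1} a_{i,j} - \sum_{j=0}^{N-1} b_{i,j} \right| = \left| \sum_{j=0}^{N-1} \left(a_{i,j} - b_{i,j}\right) \right| \le \sum_{j=0}^{N-1} \left| a_{i,j} - b_{i,j} \right|,$$

and summing both sides over $i = 0, \dots, N-1$ yields $\mathrm{VSAD}(P) \le \mathrm{SAD}(P)$. Equality holds when, within every column, the differences $a_{i,j} - b_{i,j}$ all have the same sign.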
Motion estimation is the process of selecting, from the set $\Omega$ of all points to be examined, the point $P_{min}$ with the minimum matching error as the best matching point:
$$P_{min} = \arg\min_{P_j} \mathrm{SAD}(P_j), \quad \text{subject to } P_j \in \Omega \tag{10}$$
For a given position $P_j$, if $\mathrm{VSAD}(P_j) > \mathrm{SAD}(P_{min})$ holds, then necessarily:
$$\mathrm{SAD}(P_j) > \mathrm{SAD}(P_{min}) \tag{11}$$
The one-dimensional matching operation can therefore identify the points that certainly cannot be the best match. The two-dimensional matching operation is then carried out only on the remaining points, which may be the best match, and the best matching point is selected among them.
Clearly, the fewer points that remain for two-dimensional matching after the one-dimensional matching operation, the better. This number depends on the statistical distribution of the particular data and on the choice of $\mathrm{SAD}(P_{min})$.
For sub-pixel motion estimation, the method proposed here has the following two technical features:
(1) In the one-dimensional motion estimation at each level, $\mathrm{SAD}(P_{min})$ is replaced by the minimum matching error obtained from the motion estimation at the previous level: half-pixel motion estimation uses the result of integer-pixel motion estimation, 1/4-pixel motion estimation uses the result of half-pixel motion estimation, and 1/8-pixel motion estimation uses the result of 1/4-pixel motion estimation.
In the one-dimensional motion estimation at each sub-pixel level, the position points satisfying
$$\mathrm{VSAD}(P) < \alpha \cdot \mathrm{SAD}(P_{min}) \tag{12}$$
are selected for two-dimensional motion estimation. The parameter $\alpha$ can be used to adjust the trade-off between computational complexity and prediction accuracy.
(2) The VSum(P) values in the one-dimensional motion estimation at each level are all obtained by interpolating the VSum(P) values used in the motion estimation at the previous level, which saves a great deal of computation.
As formula (4) shows, the one-dimensional matching operation converts a two-dimensional data block into a one-dimensional data block by summing each column of data, and then performs the same matching error computation.
The two-dimensional to one-dimensional conversion in formula (4) can be written as:
$$\mathrm{VSum}(i,j,t) = \sum_{l=j}^{j+N-1} f(i,l,t) \tag{13}$$
The motion estimation at every accuracy level (1/2, 1/4 and 1/8 pixel) needs the VSum values of the one-dimensional matching data block. The following two principles are used to compute them quickly:
1. For the values in the one-dimensional matching data block at integer-pixel accuracy, the $\mathrm{VSum}(i,j,t)$ of vertically adjacent positions overlap to a large extent, so a general fast algorithm can be used to compute $\mathrm{VSum}(i,j,t)$, using the formula:
$$\mathrm{VSum}(i,j+1,t) = \sum_{l=j+1}^{j+N} f(i,l,t) = \mathrm{VSum}(i,j,t) - f(i,j,t) + f(i,j+N,t) \tag{14}$$
When combined with certain specific fast integer-pixel motion estimation algorithms, this part of the computation can even be omitted.
2. For 1/2-, 1/4- and 1/8-pixel accuracy, the VSum(P) values in the one-dimensional matching data block are computed by median filtering from the VSum(P) values calculated at the previous level.
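The recurrence of formula (14) in item 1 above is a standard sliding-window sum. A minimal sketch, assuming one column of luminance values is given as a Python list indexed by the vertical coordinate (the function name is hypothetical):

```python
def vsum_sliding(column, n):
    """All window sums VSum(i, j) of one column i, for j = 0 .. len(column) - n.

    The first window is summed directly; every later window reuses the
    previous one, dropping the pixel that leaves the window and adding
    the pixel that enters it, as in formula (14).
    """
    sums = [sum(column[:n])]
    for j in range(len(column) - n):
        sums.append(sums[-1] - column[j] + column[j + n])
    return sums


window_sums = vsum_sliding([1, 2, 3, 4, 5], 3)
```

Each additional vertical position costs one subtraction and one addition instead of N - 1 additions.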
As shown in Fig. 3, suppose the data points marked by circles are the points estimated at the previous level, C is the best matching point obtained by the motion estimation at the previous level, and the other points are the candidate points to be estimated at the current level of resolution; triangles denote points in the horizontal or vertical direction, and diamonds denote points at diagonal positions. If VSum(P) denotes the value after one-dimensional conversion at position P, the one-dimensionally converted values of the pixels currently being estimated can be obtained by interpolation from the values of the previous level:
VSum(1)=(VSum(C)+VSum(V1))>>1
VSum(2)=(VSum(C)+VSum(V2))>>1
VSum(5)=(VSum(C)+VSum(H1))>>1
VSum(6)=(VSum(C)+VSum(H2))>>1
VSum(3)=(VSum(C)+VSum(V1)+VSum(D1)+VSum(H1))>>2
VSum(4)=(VSum(C)+VSum(V1)+VSum(D2)+VSum(H2))>>2
VSum(7)=(VSum(C)+VSum(V2)+VSum(D3)+VSum(H1))>>2
VSum(8)=(VSum(C)+VSum(V2)+VSum(D4)+VSum(H2))>>2
Formula (15)
The interpolation filter used here is a median filter. Experimental results show that using the median filter for prediction in the one-dimensional motion estimation gives results similar to those obtained with the filter defined in H.264, while the complexity is significantly lower.
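Formula (15), as printed, adds two or four previous-level VSum values and right-shifts the sum by 1 or 2 bits. The sketch below applies exactly those expressions. It assumes that each VSum(P) is a one-dimensional data block (a list with one entry per column) and that the shift is applied element by element; the dictionary keys mirror the labels in formula (15), and the function names are hypothetical.

```python
def shift_average(vectors, shift):
    """Element-wise (sum of the given vectors) >> shift."""
    return [sum(values) >> shift for values in zip(*vectors)]


def interpolate_vsum(prev):
    """Formula (15): VSum of the 8 current-level candidate points.

    prev maps the previous-level labels C, V1, V2, H1, H2, D1..D4
    to their VSum vectors.
    """
    c = prev["C"]
    return {
        1: shift_average([c, prev["V1"]], 1),
        2: shift_average([c, prev["V2"]], 1),
        5: shift_average([c, prev["H1"]], 1),
        6: shift_average([c, prev["H2"]], 1),
        3: shift_average([c, prev["V1"], prev["D1"], prev["H1"]], 2),
        4: shift_average([c, prev["V1"], prev["D2"], prev["H2"]], 2),
        7: shift_average([c, prev["V2"], prev["D3"], prev["H1"]], 2),
        8: shift_average([c, prev["V2"], prev["D4"], prev["H2"]], 2),
    }


# Made-up previous-level VSum vectors (two columns each).
prev_level = {
    "C": [8, 16], "V1": [4, 8], "V2": [12, 4], "H1": [6, 2], "H2": [10, 6],
    "D1": [2, 6], "D2": [6, 2], "D3": [14, 10], "D4": [18, 14],
}
current_level = interpolate_vsum(prev_level)
```

Because only additions and shifts on already available sums are involved, no pixel-level interpolation is needed before the one-dimensional pre-screen.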
Features and effects of the invention:
The invention proposes a fast sub-pixel motion search method based on prediction direction correction/statistical prejudgment, comprising a fast motion estimation method based on prediction and direction correction, and a fast motion estimation method based on statistical prejudgment. While maintaining the rate-distortion characteristics of the prior art, the method greatly reduces the computational complexity of sub-pixel motion estimation in software and hardware. The method is not limited to the H.264 international standard and can be extended to other international-standard and non-standard video coding applications. The method is also extensible: it can be combined with many integer-pixel motion estimation algorithms, and it allows the trade-off between computational complexity and prediction accuracy to be adjusted.
Description of drawings:
Fig. 1 is a schematic diagram of the sub-pixel positions and their motion estimation ranges in the H.264 standard.
Fig. 2 is a schematic diagram of the two patterns of the prediction direction correction method of the invention.
Fig. 3 shows the correspondence between the estimation points of two successive levels in the invention.
Embodiment
A specific embodiment of the prediction-based direction correction fast motion estimation method in the sub-pixel motion estimation method proposed by the invention is described as follows:
1. 1/2-pixel motion vector prediction (Prediction):
The specific steps are as follows (pixel positions refer to Fig. 1):
1) among SAD(V1), SAD(V2), SAD(H1) and SAD(H2), select the minimum value $\mathrm{SAD}_{min}$ and the second-smallest value $\mathrm{SAD}_{sub}$;
2) according to the positional relationship between the minimum point and the second-minimum point, i.e. whether they lie on one horizontal/vertical line or on a diagonal, select the candidate pixels for 1/2-pixel motion estimation;
3) among these candidate points, the motion vector corresponding to the smallest cost function is the 1/2-pixel motion vector.
2. Direction correction (Directional Refinement)
1) From the minimum point and the second-minimum point found by the 1/2-pixel accuracy search, determine the candidate points for 1/4-pixel accuracy motion estimation, and select the point of minimum cost function as the best 1/4 pixel;
2) From the minimum point and the second-minimum point found by the 1/4-pixel accuracy search, determine the candidate points for 1/8-pixel accuracy motion estimation, and select the point of minimum cost function as the best 1/8 pixel;
3. Cutoff criterion (Half-Stop)
When the cost function of an estimated point satisfies SAD < T, the motion estimation process stops. T is a threshold, which in this experiment is set to the fixed value 500 for 16x16 macroblocks. For example, when the error matching function value at some point is found to be 400, the search stops and that point is taken as the best matching point.
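The cutoff criterion can be sketched as an early exit from the candidate loop. The function name, the candidate labels and the cost values 900 and 100 below are illustrative assumptions; only the threshold 500 and the example cost 400 come from the text.

```python
def search_with_cutoff(candidates, cost, threshold):
    """Evaluate candidates in order and stop once the best cost is below threshold.

    Returns (best_point, best_cost, number_of_points_evaluated).
    """
    best_point, best_cost, evaluated = None, None, 0
    for point in candidates:
        value = cost(point)
        evaluated += 1
        if best_cost is None or value < best_cost:
            best_point, best_cost = point, value
        if best_cost < threshold:
            break  # Half-Stop: a smaller cost would not improve coding efficiency
    return best_point, best_cost, evaluated


# T = 500 as in the experiment above; the per-point costs are made up.
costs = {"p0": 900, "p1": 400, "p2": 100}
result = search_with_cutoff(["p0", "p1", "p2"], costs.get, 500)
```

Here the search stops at the second point, whose cost of 400 is below T, and the third point is never evaluated even though its cost would have been lower. That is the intended trade-off.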
This embodiment is implemented on the basis of the H.264 test platform JM2.0. Four representative international standard sequences in CIF format and two in QCIF format are used as test sequences. The four CIF sequences are Foreman, which features camera shake; Stefan, which features intense motion; ContainShip; and Carphon, which features moderate motion. The two QCIF sequences are Suzi, a head-and-shoulders sequence, and Salesman, which features object rotation. The parameters in this embodiment are set as follows:
1. Number of reference frames: 1
2. Slice mode: not used
3. Entropy coding mode: CABAC
4. Integer-pixel motion estimation range: 32
5. Rate-distortion optimization: used
6. Hadamard transform: not used
7. Inter motion estimation block mode: 16 × 16 mode only
The embodiment shows that the computation is reduced to about 17.4%~34.7% of that of the original method, and that the interpolation operations in a hardware implementation can be reduced in the same proportion; the amount of interpolation, especially for high-accuracy pixels, is very large. The method significantly increases computation speed and preserves the rate-distortion characteristics of the original encoder well while reducing computation.
The rapid motion estimating method based on statistical prejudgement in the sub-pixel motion method of estimation that the present invention proposes is that a predictive mode with unanimity carries out from the sub-pixel motion method of estimation of 1/2 pixel to 1/4 and 1/8 pixel accuracy.
The specific embodiment step is as follows:
1. Use one-dimensional matching estimation to predict which 1/2-pixel points require two-dimensional matching estimation.
This consists of three sub-steps:
a) As shown in Fig. 3, C is the best matching point obtained by integer-pixel motion estimation. The two-dimensional data blocks at C and at its 8 adjacent integer-pixel positions are converted to one-dimensional data blocks according to formula (4); the computation involved is close to that of one SAD value. The one-dimensional data block of each current sub-pixel search point is then obtained by the median filtering of formula (15);
b) Perform one-dimensional matching estimation prediction on all search points according to formula (4);
c) Using the triangle inequality, select the set Π of points that require two-dimensional matching estimation:
$$\Pi = \{\, P_i \ \text{s.t.}\ \mathrm{VSAD}(P_i) \le \alpha \cdot \mathrm{SAD}(P_{\min}) \,\}$$
2. Two-dimensional matching estimation:
Perform two-dimensional matching estimation on the set Π obtained from the one-dimensional matching estimation prediction, and select the best matching point P_min, which satisfies:
$$P_{\min} = \arg\min_{j} \mathrm{SAD}(P_j), \quad \text{subject to } P_j \in \Pi$$
3. Perform 1/4-pixel motion estimation around the best 1/2-pixel search point P_min. The process is the same as for 1/2-pixel motion estimation, except that the Vsum values of the one-dimensional data blocks from the 1/2-pixel motion estimation are calculated by formula (4), and the Vsum values of the one-dimensional data blocks of the current 1/4-pixel search points are then obtained by median filtering;
4. Perform 1/8-pixel motion estimation around the best 1/4-pixel search point P_min. The process is the same as for 1/4-pixel motion estimation, except that the Vsum values of the one-dimensional data blocks from the 1/4-pixel motion estimation are calculated by formula (4), and the Vsum values of the one-dimensional data blocks of the current 1/8-pixel search points are then obtained by median filtering;
5. Cutoff criterion:
When the cost function SAD of some motion estimation point satisfies SAD < T, the motion estimation process terminates, where T is a threshold; in the present experiment T is fixed at 500 for 16×16 macroblocks.
6. In a practical implementation, the value of α can be adjusted dynamically according to the search precision and the statistical properties of the image sequence itself.
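Steps 1c and 2 above can be sketched as a two-stage filter. Formula (4) is not reproduced in this section, so the sketch assumes that the one-dimensional data block is the vector of column sums of the block and that VSAD is the sum of absolute differences between two such vectors; it also assumes that SAD(P_min) in the definition of Π is the best SAD known before the stage starts (for example, that of the integer-pixel point C). Under these assumptions the triangle inequality gives VSAD ≤ SAD, so with α = 1 a point whose VSAD exceeds the best SAD found so far cannot improve on it. All function names are hypothetical.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Block = Sequence[Sequence[int]]  # rows of pixel values
Point = Tuple[int, int]


def sad(a: Block, b: Block) -> int:
    """Two-dimensional cost: sum of absolute pixel differences."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def column_sums(block: Block) -> List[int]:
    """Assumed 1-D data block (stand-in for formula (4)): one sum per column."""
    return [sum(col) for col in zip(*block)]


def vsad(a: Block, b: Block) -> int:
    """One-dimensional cost; never larger than sad(a, b) by the triangle inequality."""
    return sum(abs(x - y) for x, y in zip(column_sums(a), column_sums(b)))


def prejudge_and_match(
    current: Block,
    references: Dict[Point, Block],
    best_sad: int,
    alpha: float = 1.0,
) -> Tuple[Optional[Point], int, List[Point]]:
    """Return (best point in the shortlist, its SAD, the shortlist itself)."""
    # Step 1c: keep only points whose cheap 1-D cost passes the alpha test.
    shortlist = [p for p, ref in references.items()
                 if vsad(current, ref) <= alpha * best_sad]
    if not shortlist:
        return None, best_sad, shortlist
    # Step 2: full 2-D matching only on the shortlisted points.
    costs = {p: sad(current, references[p]) for p in shortlist}
    p_min = min(costs, key=costs.get)
    return p_min, costs[p_min], shortlist
```

A smaller α shortens the list and saves two-dimensional SAD evaluations at the risk of discarding the true best point; a larger α does the opposite, which is the adjustment step 6 refers to.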
The test conditions of this embodiment are the same as those of the previous embodiment. This embodiment shows that the reduction in computation relative to the original algorithm can be adjusted between about 5% and 80%, with the interpolation operations reduced by the same proportion; interpolation, especially for high-precision sub-pixel positions, is computationally very expensive.
The method of the invention allows the trade-off between computational complexity and prediction accuracy to be adjusted.
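The coarse-to-fine structure of steps 3 and 4 above, where the same procedure is repeated at 1/2-, 1/4- and 1/8-pixel precision around the previous best point, can be sketched as a loop over step sizes. This sketch deliberately omits the one-dimensional prejudgment and evaluates all 8 neighbours at each level, so it shows only the refinement skeleton and the cutoff, not the method's savings. The 1/8-pixel coordinate units, the function name and the `cost` callback are assumptions made for the example.

```python
from typing import Callable, Tuple

Point = Tuple[int, int]  # coordinates in 1/8-pixel units (assumption)


def refine_sub_pixel(
    start: Point,
    cost: Callable[[Point], float],
    threshold: float = 500,
) -> Tuple[Point, float]:
    """Refine an integer-pixel match through 1/2, 1/4 and 1/8 pixel steps."""
    best, best_cost = start, cost(start)
    for step in (4, 2, 1):  # 1/2, 1/4 and 1/8 pixel expressed in 1/8-pixel units
        if best_cost < threshold:
            break  # cutoff criterion: the current match is already good enough
        centre = best  # search around the best point of the previous level
        for dy in (-step, 0, step):
            for dx in (-step, 0, step):
                if dx == 0 and dy == 0:
                    continue  # the centre was evaluated at the previous level
                p = (centre[0] + dx, centre[1] + dy)
                c = cost(p)
                if c < best_cost:
                    best, best_cost = p, c
    return best, best_cost
```

In the method described here, the inner double loop would run only over the points that pass the one-dimensional test, which is where the reduction in computation comes from.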

Claims (1)

1. A fast sub-pixel motion estimation method based on prediction direction correction, characterized in that it comprises three parts: 1/2 motion vector prediction, direction correction, and a cutoff criterion. The implementation steps of each part are as follows:

The 1/2 motion vector prediction comprises the following steps:

1) It is known that the cost function around the best integer-pixel point is a smooth convex function. The cost functions of the four integer-pixel points surrounding the best integer-pixel point are denoted SAD(V1), SAD(V2), SAD(H1) and SAD(H2), and the cost function of the best integer-pixel point is SAD(C);

2) From SAD(V1), SAD(V2), SAD(H1) and SAD(H2), select the minimum value SAD_min to obtain the minimum point of the cost function, and select the second-smallest value SAD_sub to obtain the second-smallest point of the cost function; the corresponding pixel points are P_min and P_sub:

$$P_{\min} = \arg\min_{j} \big(\mathrm{SAD}(P_j)\big), \quad \text{subject to } P_j \in \Omega$$
$$P_{sub} = \arg\min_{j} \big(\mathrm{SAD}(P_j)\big), \quad \text{subject to } P_j \in (\Omega \setminus \{P_{\min}\})$$
$$\Omega = \{V1, V2, H1, H2\}$$

3) If P_min and P_sub lie on the same horizontal or vertical line, i.e. P_min(x) = P_sub(x) or P_min(y) = P_sub(y), the two 1/2-pixel points on that line segment and the integer-pixel point C are selected as the candidate points for the subsequent 1/2-pixel motion estimation;

4) If P_min and P_sub do not lie on the same horizontal or vertical line, i.e. P_min(x) ≠ P_sub(x) and P_min(y) ≠ P_sub(y), then the 1/2-pixel points on the two line segments connecting these two points to the integer-pixel point C, the 1/2-pixel point enclosed between the two segments, and the integer-pixel point C together constitute the candidate points for 1/2-pixel motion estimation;

5) Among these candidate points, the point with the smallest cost function is the 1/2-pixel minimum point, and the motion vector corresponding to it is the 1/2-pixel motion vector; the point with the second-smallest cost function is the 1/2-pixel second-smallest point;

The direction correction is used for 1/4 and 1/8 sub-pixel motion vector estimation, and is implemented in the following steps:

1) In the motion estimation at the previous precision level, the positions of the minimum point and the second-smallest point are P_min and P_sub respectively. Two modes are distinguished: (a) P_min and P_sub lie on the same horizontal or vertical line, i.e. P_min(x) = P_sub(x) or P_min(y) = P_sub(y); (b) P_min and P_sub do not lie on the same horizontal or vertical line, i.e. P_min(x) ≠ P_sub(x) and P_min(y) ≠ P_sub(y);

2) In each mode, the three pixel points of the current precision between P_min and P_sub, together with the minimum point of the previous precision level, are taken as candidate points;

3) Among the selected candidate points, the point with the smallest cost function is the minimum point at the current precision, and it corresponds to the motion vector at the current precision; the point with the second-smallest cost function is the second-smallest point at the current precision;

The cutoff criterion is:

When the cost function of a motion estimation point satisfies SAD < T, the motion estimation process terminates, where T is a threshold. Its value can be set empirically according to the tolerable cost error, or estimated from the integer transform formula and the quantization scheme of H.264.
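The candidate selection of steps 2) to 4) of the 1/2 motion vector prediction can be sketched as follows. The coordinate layout is an assumption made for the illustration: positions are in half-pixel units with C at the origin, V1 above, V2 below, H1 to the left and H2 to the right; the claim itself does not fix this orientation. The function name is hypothetical.

```python
from typing import Dict, List, Tuple

Point = Tuple[int, int]  # (x, y) in half-pixel units, C at (0, 0)

# Assumed layout of the four integer-pixel neighbours of C.
NEIGHBOURS: Dict[str, Point] = {
    "V1": (0, -2),  # above C
    "V2": (0, 2),   # below C
    "H1": (-2, 0),  # left of C
    "H2": (2, 0),   # right of C
}


def half_pixel_candidates(sads: Dict[str, int]) -> List[Point]:
    """Pick the half-pixel candidates from the SADs of V1, V2, H1 and H2."""
    ordered = sorted(NEIGHBOURS, key=lambda name: sads[name])
    p_min, p_sub = NEIGHBOURS[ordered[0]], NEIGHBOURS[ordered[1]]
    # Half-pixel point midway between C and each of the two neighbours.
    h_min = (p_min[0] // 2, p_min[1] // 2)
    h_sub = (p_sub[0] // 2, p_sub[1] // 2)
    candidates = [(0, 0), h_min, h_sub]
    same_line = p_min[0] == p_sub[0] or p_min[1] == p_sub[1]
    if not same_line:
        # The two neighbours are perpendicular: add the diagonal half-pixel
        # point enclosed between the two segments.
        candidates.append((h_min[0] + h_sub[0], h_min[1] + h_sub[1]))
    return candidates
```

Under this layout, when V1 and V2 have the two smallest SADs the search covers 3 points instead of the 9 of a full half-pixel search; when V1 and H2 do, it covers 4.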
CNB021242542A 2002-07-12 2002-07-12 A fast sub-pixel motion estimation method based on prediction direction correction/statistical prediction Expired - Fee Related CN1165176C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB021242542A CN1165176C (en) 2002-07-12 2002-07-12 A fast sub-pixel motion estimation method based on prediction direction correction/statistical prediction

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN 200410000741 Division CN1245031C (en) 2002-07-12 2002-07-12 Rapid sub-pixel motion estimation method based on prediction direction correction / statistic prejudgement

Publications (2)

Publication Number Publication Date
CN1396774A CN1396774A (en) 2003-02-12
CN1165176C true CN1165176C (en) 2004-09-01

Family

ID=4745373

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021242542A Expired - Fee Related CN1165176C (en) 2002-07-12 2002-07-12 A fast sub-pixel motion estimation method based on prediction direction correction/statistical prediction

Country Status (1)

Country Link
CN (1) CN1165176C (en)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040901

Termination date: 20100712