CN100525452C

CN100525452C - Frequency domain fast sub picture element global motion estimating method for image stability

Info

Publication number: CN100525452C
Application number: CN 200710099576
Authority: CN
Inventors: 李波; 于白; 郑锦; 胡叶凤
Original assignee: Beihang University
Current assignee: Beihang University
Priority date: 2007-05-24
Filing date: 2007-05-24
Publication date: 2009-08-05
Anticipated expiration: 2027-05-24
Also published as: CN101068357A

Abstract

The invention discloses a frequency-domain fast sub-pixel global motion estimation method for image stabilization. In this method, first calculate the cross-power spectrum function between two adjacent frames of images in the video sequence, perform inverse Fourier transform on the cross-power spectrum function, generate a motion distribution function, search for its maximum peak point to obtain the global motion vector of the whole pixel; judge and Recording several peaks adjacent to the coordinates of the maximum peak point in the row and column directions of the motion distribution function, and calculating the sub-pixel global motion vector according to the correspondence between the sub-pixel global motion and the distribution of the primary and secondary peaks in the motion distribution function; Approximate processing is performed on the calculated sub-pixel global motion vector to obtain a sub-pixel global motion vector with a specified precision. The invention avoids the image interpolation process of the traditional method, not only greatly reduces the calculation burden, but also improves the precision of global motion estimation, and can effectively support accurate and real-time image stabilization processing.

Description

Frequency-Domain Fast Sub-Pixel Global Motion Estimation Method for Image Stabilization

技术领域 technical field

本发明涉及一种用于图像稳定的频域快速亚像素全局运动估计方法，尤其涉及一种能够减小频域条件下的亚像素全局运动估计时的计算和存储开销、提高亚像素全局运动估计准确性并改善图像稳定性的方法，属于图像处理技术领域。The present invention relates to a frequency-domain fast sub-pixel global motion estimation method for image stabilization, in particular to a method capable of reducing calculation and storage costs of sub-pixel global motion estimation under frequency domain conditions and improving sub-pixel global motion estimation A method for improving accuracy and image stability belongs to the technical field of image processing.

背景技术 Background technique

全局运动估计技术可以用于获得视频序列中相邻两帧图像之间的整体运动情况。它是图像稳定过程中最为关键的技术环节。全局运动估计从以下两个方面影响了图像稳定系统的性能表现。首先，全局运动估计的准确性和精度直接决定了系统的图像稳定质量。其次，由于全局运动估计部分的计算量一般占到系统计算量的90％以上，所以全局运动估计的速度决定了稳像系统的实时处理能力。Global motion estimation technology can be used to obtain the overall motion between two adjacent frames in a video sequence. It is the most critical technical link in the process of image stabilization. Global motion estimation affects the performance of image stabilization systems from the following two aspects. First, the accuracy and precision of global motion estimation directly determine the image stabilization quality of the system. Secondly, since the calculation amount of the global motion estimation part generally accounts for more than 90% of the calculation amount of the system, the speed of the global motion estimation determines the real-time processing capability of the image stabilization system.

参见图1所示，全局运动估计由整像素全局运动估计和亚像素全局运动估计两部分组成。整像素全局运动估计是以直接采样图像为基础，对图像间的整像素级的全局运动矢量进行估计；亚像素全局运动估计则首先通过插值估算出图像中非直接采样点(亚像素点)的值，以此为基础，在已知整像素全局运动矢量的基础上进一步估计亚像素全局运动矢量。有关实验表明，亚像素全局运动估计能够更加准确地描述图像间的全局运动情况，有效改善运动补偿效果。因此亚像素全局运动估计成为当前许多应用领域中的研究热点。Referring to FIG. 1 , the global motion estimation consists of two parts: integer-pixel global motion estimation and sub-pixel global motion estimation. Integer-pixel global motion estimation is based on directly sampling images, and estimates the integer-pixel-level global motion vectors between images; sub-pixel global motion estimation first estimates the non-direct sampling points (sub-pixel points) in the image through interpolation. Based on this value, the sub-pixel global motion vector is further estimated on the basis of the known integer-pixel global motion vector. Relevant experiments show that sub-pixel global motion estimation can more accurately describe the global motion between images and effectively improve the effect of motion compensation. Therefore, sub-pixel global motion estimation has become a research hotspot in many application fields.

亚像素全局运动估计可以分为时域和频域两种实现方式。在申请号为200410073837.5，申请日为2004年9月3日的中国发明专利申请“—种快速亚像素运动估计方法”中，公开了一种基于时域的快速亚像素运动估计方法。与该时域实现方法相比较，基于频域的全局运动估计方法具有适用范围广、抗干扰能力强等优点，因而更加适合于图像稳定应用。但在具体实施时，频域全局运动估计方法需要依据傅里叶变换的平移特性，通过比较视频序列中相邻两帧图像频谱的相位关系得到空域中图像间的全局运动情况。这一过程中存在空域与频域之间的傅立叶变换过程，计算量随着图像大小的变化增长较快。例如进行1/2^k像素精度的亚像素全局运动估计时，插值后的图像大小变为原图的4^k倍，相应的存储开销将增加4^k-1倍，而采用离散傅立叶变换时计算开销接近于整像素情况的16^k倍，即使采用快速傅立叶变换算法，计算量也将达到原有计算量的8^k倍。对于频域全局运动估计方法而言，这种亚像素扩展方式不仅运算复杂度提高，而且随着亚像素精度的增加，存储开销也成指数级增长，因此很难满足存储和计算资源有限的应用环境的实际需求。另一方面，在实施基于频域的全局运动估计方法时，图像边缘内容变化对频域全局运动估计的干扰比较明显，因此必须有针对性地加以克服。Sub-pixel global motion estimation can be divided into two implementations in time domain and frequency domain. In the Chinese invention patent application "A fast sub-pixel motion estimation method" with the application number 200410073837.5 and the filing date of September 3, 2004, a fast sub-pixel motion estimation method based on time domain is disclosed. Compared with the time-domain implementation method, the global motion estimation method based on the frequency domain has the advantages of wide application range and strong anti-interference ability, so it is more suitable for image stabilization applications. However, in actual implementation, the global motion estimation method in the frequency domain needs to obtain the global motion between images in the spatial domain by comparing the phase relationship between the frequency spectra of two adjacent frames of images in the video sequence based on the translation characteristics of Fourier transform. In this process, there is a Fourier transform process between the spatial domain and the frequency domain, and the calculation amount increases rapidly with the change of the image size. For example, when performing sub-pixel global motion estimation with 1/2 ^k pixel precision, the size of the interpolated image becomes 4 ^k times that of the original image, and the corresponding storage cost will increase by 4 ^k -1 times, while the calculation cost when using discrete Fourier transform It is close to 16 ^k times of the whole pixel case, even if the fast Fourier transform algorithm is used, the calculation amount will reach 8 ^k times of the original calculation amount. For the frequency-domain global motion estimation method, this sub-pixel expansion method not only increases the computational complexity, but also increases the storage cost exponentially with the increase of sub-pixel precision, so it is difficult to meet the applications with limited storage and computing resources. actual needs of the environment. On the other hand, when implementing the global motion estimation method based on the frequency domain, the interference of the image edge content change on the global motion estimation in the frequency domain is obvious, so it must be overcome in a targeted manner.

发明内容 Contents of the invention

本发明的首要目的是提供用于图像稳定的频域快速亚像素全局运动估计方法。该方法可以利用图像亚像素全局运动产生的频域相位变化信息，在不进行图像插值的情况下，通过直接计算得到图像亚像素全局运动矢量。The primary purpose of the present invention is to provide a frequency-domain fast sub-pixel global motion estimation method for image stabilization. This method can use the frequency-domain phase change information generated by the image sub-pixel global motion, and obtain the image sub-pixel global motion vector through direct calculation without image interpolation.

本发明的另外一个目的在于提供一种能够有效抑制图像边缘内容变化对频域全局运动估计所造成干扰的方法。该方法能够提高频域全局运动估计的准确性，尤其是频域亚像素全局运动估计的准确性。Another object of the present invention is to provide a method capable of effectively suppressing interference caused by image edge content changes to frequency-domain global motion estimation. The method can improve the accuracy of frequency-domain global motion estimation, especially the accuracy of frequency-domain sub-pixel global motion estimation.

为实现上述目的，本发明采用下述的技术方案。In order to achieve the above object, the present invention adopts the following technical solutions.

一种用于图像稳定的频域快速亚像素全局运动估计方法，其特征在于包括如下步骤：A frequency-domain fast sub-pixel global motion estimation method for image stabilization, characterized in that it comprises the following steps:

(1)计算视频序列中相邻两帧图像的原始互功率谱；(1) Calculate the original cross-power spectrum of two adjacent frames of images in the video sequence;

(2)获得在指定精度范围下各频率点行、列方向对应的最大干扰系数，然后将前一帧图像频谱中各频率点的幅值分别与行、列方向对应的最大干扰系数相乘，得到在指定精度范围下各频率点能够克服的最大干扰分量的大小，将各频率点的最大干扰分量中的最小值与经验阈值进行比较，判断该频率点是否为易受干扰的频率点；(2) Obtain the maximum interference coefficient corresponding to the row and column direction of each frequency point under the specified accuracy range, and then multiply the amplitude of each frequency point in the image spectrum of the previous frame by the maximum interference coefficient corresponding to the row and column direction respectively, Get the size of the maximum interference component that each frequency point can overcome under the specified accuracy range, compare the minimum value of the maximum interference component at each frequency point with the empirical threshold, and judge whether the frequency point is a frequency point that is susceptible to interference;

(3)对于被判断为易受干扰的频率点进行抑制，获得干扰抑制后的互功率谱；(3) Suppress the frequency points that are judged to be susceptible to interference, and obtain the cross-power spectrum after interference suppression;

(4)对所述干扰抑制后的互功率谱进行反傅立叶变换，生成运动分布函数，搜索运动分布函数中的最大峰值，根据最大峰值点的坐标得到整像素全局运动矢量；(4) carry out inverse Fourier transform to the cross power spectrum after described interference suppression, generate motion distribution function, search the maximum peak value in the motion distribution function, obtain the whole pixel global motion vector according to the coordinate of maximum peak point;

(5)判断并记录下所述运动分布函数中行、列方向上与最大峰值点坐标相邻的若干次峰值，依据亚像素全局运动与运动分布函数中主、次峰值分布的对应关系计算出亚像素全局运动矢量；(5) Judging and recording several peaks adjacent to the coordinates of the maximum peak point in the row and column directions in the motion distribution function, and calculating the sub-pixel global motion based on the corresponding relationship between the sub-pixel global motion and the distribution of the main and secondary peaks in the motion distribution function Pixel global motion vector;

(6)对计算得到的亚像素全局运动矢量进行近似处理，得到指定精度下的亚像素全局运动矢量。(6) Perform approximate processing on the calculated sub-pixel global motion vector to obtain a sub-pixel global motion vector with a specified precision.

其中，所述经验阈值的大小与图像的纹理细节丰富程度正向相关，在200～400之间。Wherein, the size of the empirical threshold is positively correlated with the richness of texture details of the image, and is between 200 and 400.

所述步骤(5)分为两个步骤：Described step (5) is divided into two steps:

首先，根据步骤(4)中搜索得到的最大峰值，分别记录下在水平方向和垂直方向上与最大峰值点最邻近的三个坐标点的函数值；First, according to the maximum peak value searched in step (4), record the function values of the three coordinate points closest to the maximum peak point in the horizontal direction and the vertical direction respectively;

其次，根据得到的最大峰值以及记录下的在水平方向和垂直方向上与最大峰值点最邻近的三个坐标点的函数值，分别计算对应的任意精度下的亚像素全局运动矢量。Secondly, according to the obtained maximum peak value and the recorded function values of the three coordinate points closest to the maximum peak point in the horizontal and vertical directions, respectively calculate the corresponding sub-pixel global motion vector with arbitrary precision.

本发明所提供的用于图像稳定的频域快速亚像素全局运动估计方法可以有效减小传统频域亚像素全局运动估计时的计算和存储开销，并对于图像边缘内容变化具有较好的干扰抑制效果，提高了频域亚像素全局运动估计的准确性。有关的测试结果表明，本方法对于图像稳定中各类场景图像的亚像素全局运动估计都能获得较好的效果。The frequency-domain fast sub-pixel global motion estimation method for image stabilization provided by the present invention can effectively reduce the calculation and storage overhead of traditional frequency-domain sub-pixel global motion estimation, and has better interference suppression for image edge content changes As a result, the accuracy of sub-pixel global motion estimation in the frequency domain is improved. Relevant test results show that this method can achieve good results for sub-pixel global motion estimation of various scene images in image stabilization.

附图说明 Description of drawings

下面结合附图和具体实施方式对本发明作进一步的说明。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

图1是二维的亚像素全局运动的像素分布示意图；FIG. 1 is a schematic diagram of pixel distribution of two-dimensional sub-pixel global motion;

图2是本发明所提供的用于图像稳定的频域快速亚像素全局运动估计方法的流程图；Fig. 2 is the flow chart of the frequency-domain fast sub-pixel global motion estimation method for image stabilization provided by the present invention;

图3是一维的亚像素全局运动时运动分布函数峰值分布示意图。FIG. 3 is a schematic diagram of the peak value distribution of the motion distribution function during one-dimensional sub-pixel global motion.

具体实施方式 Detailed ways

在频域条件下，一种好的亚像素全局运动估计方法必须同时考虑运动估计的精度、准确性、运行效率及存储开销的影响，同时能够有效克服图像边缘内容变化所带来的不利影响。为此，本发明的解决思路是：根据空域图像间亚像素全局运动与频域中频谱变化情况的对应关系，在频域整像素全局运动估计的基础上，利用其中间结果，通过计算直接获得亚像素全局运动矢量，实现了快速准确的频域亚像素全局运动估计。In the frequency domain, a good sub-pixel global motion estimation method must simultaneously consider the impact of motion estimation accuracy, accuracy, operating efficiency and storage overhead, and at the same time be able to effectively overcome the adverse effects of image edge content changes. Therefore, the solution idea of the present invention is: according to the corresponding relationship between the sub-pixel global motion in the spatial domain image and the frequency spectrum change in the frequency domain, on the basis of the global motion estimation of the whole pixel in the frequency domain, use the intermediate results to obtain directly by calculation The sub-pixel global motion vector realizes fast and accurate sub-pixel global motion estimation in the frequency domain.

下面结合附图说明本发明的具体实现方式。图2是本发明所提供的用于图像稳定的频域快速亚像素全局运动估计方法的流程图。该方法包括如下的步骤：首先，计算视频序列中相邻两帧图像间的互功率谱函数S₀，并依据图像边缘内容变化所产生干扰的表现形式，对互功率谱函数进行干扰抑制，得到包含更准确全局运动信息的互功率谱函数S₁。对新得到的互功率谱函数S₁进行反傅立叶变换，生成运动分布函数I，搜索其最大峰值点得到整像素全局运动矢量；其次，判断并记录下运动分布函数I中行(列)方向上与最大峰值点坐标相邻的几个次峰值(优选为三次)，依据亚像素全局运动与运动分布函数中主、次峰值分布的对应关系就可以计算出亚像素全局运动矢量；最后，对计算得到的亚像素全局运动矢量进行近似处理，得到指定精度下的亚像素全局运动矢量。The specific implementation of the present invention will be described below in conjunction with the accompanying drawings. Fig. 2 is a flow chart of the frequency-domain fast sub-pixel global motion estimation method for image stabilization provided by the present invention. The method includes the following steps: First, calculate the cross-power spectrum function S ₀ between two adjacent frames of images in the video sequence, and suppress the interference of the cross-power spectrum function according to the form of interference caused by the change of the edge content of the image, and obtain Cross-power spectral function S ₁ containing more accurate global motion information. Perform inverse Fourier transform on the newly obtained cross-power spectrum function _S1 to generate motion distribution function I, search for its maximum peak point to obtain the global motion vector of the whole pixel; secondly, judge and record the relationship between the row (column) direction and For several secondary peaks (preferably three times) adjacent to the coordinates of the maximum peak point, the sub-pixel global motion vector can be calculated according to the correspondence between the sub-pixel global motion and the distribution of the primary and secondary peaks in the motion distribution function; finally, the calculated The sub-pixel global motion vector is approximated, and the sub-pixel global motion vector under the specified precision is obtained.

上述获得整像素全局运动矢量的步骤具体包括如下的三个子步骤：首先，依据互功率谱函数的定义计算视频序列中相邻两帧图像的原始互功率谱；然后，根据选定的精度范围对互功率谱中不同频率点的信息进行筛选和抑制，获得具有更高可靠性的互功率谱；最后，对干扰抑制后得到的新的互功率谱进行反傅立叶变换，生成运动分布函数，搜索运动分布函数中的最大峰值，根据最大峰值点的坐标得到整像素全局运动矢量。The above-mentioned step of obtaining the global motion vector of an entire pixel specifically includes the following three sub-steps: first, calculate the original cross-power spectrum of two adjacent frames of images in the video sequence according to the definition of the cross-power spectrum function; The information of different frequency points in the cross power spectrum is screened and suppressed to obtain a cross power spectrum with higher reliability; finally, the inverse Fourier transform is performed on the new cross power spectrum obtained after interference suppression to generate a motion distribution function and search for motion The maximum peak in the distribution function, according to the coordinates of the maximum peak point to get the whole pixel global motion vector.

具体计算步骤如下：The specific calculation steps are as follows:

(1)原始互功率谱的计算(1) Calculation of the original cross power spectrum

互功率谱表征了两帧图像频谱之间的相互关系。理论上互功率谱中各频率点均包含有两帧图像之间的全局运动信息。在计算互功率谱时，首先采用式(1)和式(2)计算出两帧待估计图像的频谱F₁(u，v)、F₂(u，v)，然后根据式(3)计算确定原始互功率谱函数S₀(u，v)。The cross power spectrum characterizes the mutual relationship between the frequency spectrum of two frames of images. In theory, each frequency point in the cross-power spectrum contains the global motion information between two frames of images. When calculating the cross-power spectrum, first use formula (1) and formula (2) to calculate the spectrum F ₁ (u, v) and F ₂ (u, v) of the two images to be estimated, and then calculate according to formula (3) Determine the original cross-power spectral function S ₀ (u,v).

F₁(u，v)＝F(f₁(x，y)) (1)F ₁ (u, v) = F(f ₁ (x, y)) (1)

F₂(u，v)＝F(f₂(x，y)) (2)F ₂ (u, v) = F(f ₂ (x, y)) (2)

式(1)、(2)中f₁(x，y)、f₂(x，y)分别表示两帧待估计图像，F₁(u，v)、F₂(u，v)则表示两帧待估计图像对应的频谱，符号F表示傅立叶变换。在实际应用中为了提高全局运动估计的速度，通常使用快速傅立叶变换来代替普通的离散傅立叶变换，图像大小取为2的整数次幂。In formulas (1) and (2), f ₁ (x, y) and f ₂ (x, y) represent two frames of images to be estimated respectively, while F ₁ (u, v) and F ₂ (u, v) represent two The frequency spectrum corresponding to the frame image to be estimated, and the symbol F represents Fourier transform. In order to improve the speed of global motion estimation in practical applications, fast Fourier transform is usually used instead of ordinary discrete Fourier transform, and the image size is taken as an integer power of 2.

${S S}_{00} ((u u,, v v)) = = \frac{{F f}_{22} ((u u,, v v))}{{F f}_{11} ((u u,, v v))} - - - - - - ((33))$

式(3)中S₀(u，v)即为两帧待估计图像间的原始互功率谱。In formula (3), S ₀ (u, v) is the original cross-power spectrum between two frames of images to be estimated.

(2)互功率谱的干扰抑制(2) Interference suppression of cross power spectrum

在理想情况下，互功率谱中各频率点所包含的全局运动信息都应是准确的，但实际情况中，由于图像边缘内容变化等因素的影响，会造成互功率谱中信息的失真。互功率谱中信息的可靠性决定了频域全局运动估计的准确性，因此抑制上述因素对互功率谱所产生的干扰，提高互功率谱中信息的可靠性，是实现准确频域全局运动估计的关键。Ideally, the global motion information contained in each frequency point in the cross-power spectrum should be accurate, but in reality, due to the influence of factors such as image edge content changes, the information in the cross-power spectrum will be distorted. The reliability of the information in the cross-power spectrum determines the accuracy of the global motion estimation in the frequency domain. Therefore, suppressing the interference caused by the above factors on the cross-power spectrum and improving the reliability of the information in the cross-power spectrum is the key to achieving accurate global motion estimation in the frequency domain. key.

不同频率点受到图像边缘内容变化的影响程度不同，为了提高全局运动估计的准确性，本发明针对互功率谱中各频率点信息的准确性进行评价，并根据指定的精度范围抑制其中易受干扰影响的频率点，得到新的、包含更为准确的全局运动信息的互功率谱。Different frequency points are affected to different extents by changes in image edge content. In order to improve the accuracy of global motion estimation, the present invention evaluates the accuracy of the information of each frequency point in the cross-power spectrum, and suppresses the information that is susceptible to interference according to the specified accuracy range. A new cross-power spectrum containing more accurate global motion information is obtained.

在对互功率谱中各频率点信息进行评价时，本发明首先用式(4)、(5)分别获得在指定精度范围下各频率点行、列方向对应的最大干扰系数R₁(u)、R₂(v)，该系数描述了在图像内容变化所产生的干扰效果最显著的情况下，图像频谱的模值与干扰分量之间的比例；然后利用式(6)、(7)将前一帧图像频谱中各频率点的幅值分别与行、列方向对应的最大干扰系数相乘，得到在指定精度范围下各频率点能够克服的最大干扰分量的大小V₁(u)、V₂(v)；最后利用式(8)将各频率点的V₁(u)和V₂(v)中的最小值与一个经验阈值进行比较，判断该频率点是否容易受到干扰。被判断为易受干扰的频率点，则需要对其进行抑制。When evaluating the information of each frequency point in the cross-power spectrum, the present invention first uses formulas (4) and (5) to respectively obtain the maximum interference coefficient R ₁ (u) corresponding to the row and column directions of each frequency point under the specified accuracy range , R ₂ (v), this coefficient describes the ratio between the modulus value of the image spectrum and the interference component when the interference effect caused by the change of the image content is the most significant; then use formula (6), (7) to The amplitude of each frequency point in the image spectrum of the previous frame is multiplied by the maximum interference coefficient corresponding to the row and column direction, and the maximum interference component size V ₁ (u), V ₂ (v); Finally, use formula (8) to compare the minimum value of V ₁ (u) and V ₂ (v) at each frequency point with an empirical threshold to determine whether the frequency point is susceptible to interference. Frequency points judged to be susceptible to interference need to be suppressed.

${R R}_{11} ((u u)) = = sin sin ((\frac{22 πu πu}{m m \cdot &Center Dot; n no} \cdot &Center Dot; \frac{11}{S S})) - - - - - - ((44))$

${R R}_{22} ((v v)) = = sin sin ((\frac{22 πv πv}{m m \cdot &Center Dot; n no} \cdot &Center Dot; \frac{11}{S S})) - - - - - - ((55))$

式(4)、(5)中m、n表示图像的行列数，u、v分别表示行、列方向的频率值，

表示指定精度范围(如1/2像素，1/4像素，1/8像素)。In formulas (4) and (5), m and n represent the number of rows and columns of the image, u and v represent the frequency values in the row and column direction respectively,

Indicates the specified precision range (such as 1/2 pixel, 1/4 pixel, 1/8 pixel).

V₁(u)＝|F₁(u，v)|·R₁(u) (6)V ₁ (u)=|F ₁ (u, v)|·R ₁ (u) (6)

V₂(v)＝|F₁(u，v)|·R₂(v) (7)V ₂ (v)=|F ₁ (u, v)|·R ₂ (v) (7)

式(6)、(7)中|F₁(u，v)|表示前一幅图像的频谱在频率点(u，v)处的模值大小。In formulas (6) and (7), |F ₁ (u, v)| represents the modulus value of the spectrum of the previous image at the frequency point (u, v).

${S S}_{11} ((u u,, v v)) = = \{\begin{matrix} {S S}_{00} ((u u,, v v)) & if if min min {{{V V}_{11} ((u u)),, {V V}_{22} ((v v))}} &GreaterEqual; &Greater Equal; Threshold Threshold \\ 00 & if if min min {{{V V}_{11} ((u u)),, {V V}_{22} ((v v))}} < < Threshold Threshold \end{matrix} - - - - - - ((88))$

式(8)中常数Threshold为一个经验阈值，表示图像边缘内容变化引起的干扰分量大小。通过对大量不同图像内容变化情况下干扰抑制实验得到，当域值Threshold的取值在200～400之间时，干扰抑制效果较好。针对纹理细节丰富的图像可以选择稍大一些的阈值，而对于纹理细节较少的图像选择较小的阈值效果较好。The constant Threshold in formula (8) is an empirical threshold, which indicates the size of the interference component caused by the content change of the edge of the image. Through the interference suppression experiment under the condition of a large number of different image content changes, the interference suppression effect is better when the threshold value Threshold is between 200 and 400. For images with rich texture details, a slightly larger threshold can be selected, while for images with less texture details, a smaller threshold is better.

与现有的频域整像素全局运动估计技术相比较，上述技术方案能够有效克服图像边缘内容的变化对全局运动估计准确性所造成的影响，尤其是对后续亚像素全局运动估计准确性的不利影响。Compared with the existing frequency-domain integer-pixel global motion estimation technology, the above-mentioned technical solution can effectively overcome the influence of the change of image edge content on the accuracy of global motion estimation, especially the adverse effect on the accuracy of subsequent sub-pixel global motion estimation. Influence.

(3)整像素全局运动矢量获取(3) Integer pixel global motion vector acquisition

上面对原始互功率谱函数进行干扰抑制后得到新的互功率谱，再对新的互功率谱进行反傅立叶变换得到运动分布函数I。搜索运动分布函数中的最大峰值点P₀，根据其坐标位置得到整像素全局运动矢量。式(9)为整像素全局运动矢量(m₁，m₂)的计算公式。A new cross power spectrum is obtained after interference suppression is performed on the original cross power spectrum function above, and then an inverse Fourier transform is performed on the new cross power spectrum to obtain a motion distribution function I. The maximum peak point P ₀ in the motion distribution function is searched, and the whole pixel global motion vector is obtained according to its coordinate position. Equation (9) is the calculation formula of the integer pixel global motion vector (m ₁ , m ₂ ).

(m₁，m₂)＝arg max(I(x，y)) (9)(m ₁ , m ₂ )=arg max(I(x,y)) (9)

下面进一步介绍在任意精度下计算亚像素全局运动矢量的各个子步骤。与传统的频域亚像素全局运动估计技术不同的是，本发明中，针对采用传统方法导致计算和存储负担激增的问题，将任意精度下亚像素全局运动矢量计算方法分为两个步骤：首先，根据上一步骤中搜索得到的最大峰值点P₀，分别记录下与该峰值点行、列方向坐标位置相邻的三个坐标点的函数值；其次，根据得到的最大峰值以及记录下的与最大峰值水平、垂直方向位置相邻的若干次峰值，分别计算其对应的任意精度下的亚像素全局运动矢量。The sub-steps of calculating the sub-pixel global motion vector at arbitrary precision are further described below. Different from the traditional sub-pixel global motion estimation technology in the frequency domain, in the present invention, in order to solve the problem of a surge in calculation and storage burden caused by the traditional method, the sub-pixel global motion vector calculation method under arbitrary precision is divided into two steps: first , according to the maximum peak point P ₀ searched in the previous step, respectively record the function values of the three coordinate points adjacent to the coordinate position of the peak point in the row and column directions; secondly, according to the obtained maximum peak value and the recorded For several peaks adjacent to the maximum peak horizontally and vertically, their corresponding sub-pixel global motion vectors with arbitrary precision are calculated respectively.

具体实施步骤如下：The specific implementation steps are as follows:

(1)记录次峰值(1) Record the secondary peak value

参见图3所示，令运动分布函数I中最大峰值点P₀的坐标为(x₀，y₀)。记录最大峰值点P₀左、右两侧最邻近的三个坐标点的函数值，分别记为P₁、P₂、P₃；记录最大峰值点P₀上、下两侧最邻近的三个坐标点的函数值，分别记为P₄、P₅、P₆。式(10)、(11)、(12)、(13)为上述行、列方向次峰值的选取公式。Referring to FIG. 3 , let the coordinates of the maximum peak point P ₀ in the motion distribution function I be (x ₀ , y ₀ ). Record the function values of the three nearest coordinate points on the left and right sides of the maximum peak point P ₀ , denoted as P ₁ , P ₂ , and P ₃ respectively; record the three most adjacent coordinate points on the upper and lower sides of the maximum peak point P ₀ The function values of the coordinate points are recorded as P ₄ , P ₅ , and P ₆ respectively. Equations (10), (11), (12), and (13) are the selection formulas for the secondary peak values in the row and column directions above.

$\begin{matrix} \{\begin{matrix} {P P}_{11} = = I I (({x x}_{00},, {y the y}_{00} - - 22)) \\ {P P}_{22} = = I I (({x x}_{00},, {y the y}_{00} - - 11)) \\ {P P}_{33} = = I I (({x x}_{00},, {y the y}_{00} + + 11)) \end{matrix} & if I if i (({x x}_{00},, {y the y}_{00} - - 11)) > > I I (({x x}_{00},, {y the y}_{00} + + 11)) \end{matrix} - - - - - - ((1010))$

$\begin{matrix} \{\begin{matrix} {P P}_{11} = = I I (({x x}_{00},, {y the y}_{00} + + 22)) \\ {P P}_{22} = = I I (({x x}_{00},, {y the y}_{00} + + 11)) \\ {P P}_{33} = = I I (({x x}_{00},, {y the y}_{00} - - 11)) \end{matrix} & if I if i (({x x}_{00},, {y the y}_{00} - - 11)) \leq \leq I I (({x x}_{00},, {y the y}_{00} + + 11)) \end{matrix} - - - - - - ((1111))$

$\begin{matrix} \{\begin{matrix} {P P}_{44} = = I I (({x x}_{00} - - 22,, {y the y}_{00})) \\ {P P}_{55} = = I I (({x x}_{00} - - 11,, {y the y}_{00})) \\ {P P}_{66} = = I I (({x x}_{00} + + 11,, {y the y}_{00})) \end{matrix} & if I if i (({x x}_{00} - - 11,, {y the y}_{00})) > > I I (({x x}_{00} + + 11,, {y the y}_{00})) \end{matrix} - - - - - - ((1212))$

$\begin{matrix} \{\begin{matrix} {P P}_{44} = = I I (({x x}_{00} + + 22,, {y the y}_{00})) \\ {P P}_{55} = = I I (({x x}_{00} + + 11,, {y the y}_{00})) \\ {P P}_{66} = = I I (({x x}_{00} - - 11,, {y the y}_{00})) \end{matrix} & if I if i (({x x}_{00} - - 11,, {y the y}_{00})) \leq \leq I I (({x x}_{00} + + 11,, {y the y}_{00})) \end{matrix} - - - - - - ((1313))$

(2)亚像素全局运动矢量计算(2) Sub-pixel global motion vector calculation

根据得到的最大峰值P₀以及记录下的与最大峰值水平、垂直方向位置相邻的次峰值P₁、P₂、P₃和P₄、P₅、P₆，利用式(14)、(15)计算出行、列方向上的亚像素全局运动矢量m₁和m₂。According to the obtained maximum peak P ₀ and the recorded secondary peaks P ₁ , P ₂ , P ₃ and P ₄ , P ₅ , P ₆ adjacent to the maximum peak horizontally and vertically, using equations (14), (15 ) Calculate sub-pixel global motion vectors m ₁ and m ₂ in row and column directions.

${m m}_{11} = = \frac{{P P}_{22} - - {P P}_{33} + + {22 P P}_{11}}{{P P}_{00} + + {P P}_{11} + + {P P}_{22} + + {P P}_{33}} - - - - - - ((1414))$

${m m}_{22} = = \frac{{P P}_{55} - - {P P}_{66} + + {22 P P}_{44}}{{P P}_{00} + + {P P}_{44} + + {P P}_{55} + + {P P}_{66}} - - - - - - ((1515))$

需要特别说明的是，上述的亚像素全局运动矢量计算公式中，所使用的次峰值为相邻的三个次峰值。这是在Catmull-Rom三阶插值方法的条件下，通过对比图像亚像素全局运动与频谱变化之间的对应关系推导得到的。Catmull-Rom三阶插值是较为常用的亚像素图像插值方法，能够较好表现图像纹理的变化情况。但是，也可以使用其它的成熟插值方法。在使用其它插值方法的前提下，所选择的次峰值也可以是其它的次峰值。It should be noted that in the above formula for calculating the sub-pixel global motion vector, the sub-peaks used are three adjacent sub-peaks. This is derived under the condition of the Catmull-Rom third-order interpolation method by comparing the corresponding relationship between the sub-pixel global motion of the image and the spectral change. Catmull-Rom third-order interpolation is a more commonly used sub-pixel image interpolation method, which can better express the change of image texture. However, other well-established interpolation methods may also be used. On the premise of using other interpolation methods, the selected secondary peak may also be other secondary peaks.

上述的亚像素全局运动矢量计算方法能够在频域整像素全局运动估计的基础上，利用其中间结果，通过计算直接获得图像间的亚像素全局运动矢量，避免了因图像插值而引起的计算和存储负担的增加。The above-mentioned sub-pixel global motion vector calculation method can directly obtain the sub-pixel global motion vector between images by using the intermediate results on the basis of frequency-domain integer-pixel global motion estimation, avoiding the calculation and calculation caused by image interpolation. Increased storage burden.

下面对如何获得指定精度下的亚像素全局运动矢量的方法展开详细的说明。The following describes in detail how to obtain the sub-pixel global motion vector at a specified precision.

前面的计算步骤已经计算出了任意精度下的亚像素全局运动矢量，但在实际应用中亚像素精度并非越高越好，还需要综合考虑应用效果和计算开销等因素。一般较常采用的亚像素精度有1/2，1/4或1/8像素。因此，为了满足实际应用的要求，需要对计算得到任意精度下的亚像素全局运动矢量进行近似处理。当指定亚像素精度为1/2^K时，可以根据式(16)、(17)计算得到所要求精度下的亚像素全局运动矢量。The previous calculation steps have calculated the sub-pixel global motion vector with arbitrary precision, but in practical applications, the sub-pixel precision is not as high as possible, and factors such as application effect and calculation cost need to be considered comprehensively. Generally, the sub-pixel accuracy commonly used is 1/2, 1/4 or 1/8 pixel. Therefore, in order to meet the requirements of practical applications, it is necessary to perform approximate processing on the calculated sub-pixel global motion vector with arbitrary precision. When the specified sub-pixel precision is 1/2 ^K , the sub-pixel global motion vector with the required precision can be calculated according to equations (16) and (17).

m₁′＝round(m₁×2^K)/2^K (16)m ₁ ′=round(m ₁ ×2 ^K )/2 ^K (16)

m₂′＝round(m₂×2^K)/2^K (17)m ₂ ′=round(m ₂ ×2 ^K )/2 ^K (17)

其中round表示四舍五入。由此得到指定精度下的亚像素全局运动矢量。Where round means rounding up. From this, a sub-pixel global motion vector at a specified precision is obtained.

以上公开的仅为本发明的具体实例，根据本发明提供的思想，本领域的技术人员能思及的变化，都应落入本发明的保护范围内。The above disclosures are only specific examples of the present invention. According to the idea provided by the present invention, changes conceivable by those skilled in the art should fall within the protection scope of the present invention.

Claims

1. a frequency domain fast sub picture element global motion estimating method that is used for image stabilization is characterized in that comprising the steps:

(1) the original crosspower spectrum of adjacent two two field pictures in the calculating video sequence;

(2) the maximum interference coefficient of acquisition each Frequency point row, column direction correspondence under the designated precision scope, then that the amplitude of each Frequency point in the former frame image spectrum is corresponding with the row, column direction respectively maximum interference multiplication, obtain the size of the maximum interference component that each Frequency point can overcome under the designated precision scope, minimum value and empirical value in the maximum interference component of each Frequency point are compared, judge whether this Frequency point is the Frequency point that is subject to disturb;

(3) suppress for being judged as the Frequency point that is subject to disturb, obtain to disturb the crosspower spectrum after suppressing;

(4) crosspower spectrum after described interference is suppressed carries out inverse-Fourier transform, generates the distribution of movement function, and the peak-peak in the searching moving distribution function obtains whole picture element global motion vector according to the coordinate of maximal peak point;

(5) judge and note in the described distribution of movement function several times peak value adjacent on the row, column direction, calculate the sub picture element global motion vector according to the corresponding relation of primary and secondary peak Distribution in sub picture element global motion and the distribution of movement function with the peak-peak point coordinates;

(6) the sub picture element global motion vector that calculates is carried out approximate processing, obtain the sub picture element global motion vector under the designated precision.

2. the frequency domain fast sub picture element global motion estimating method that is used for image stabilization as claimed in claim 1 is characterized in that:

It is relevant that the size of described empirical value and the grain details of image are enriched the degree forward, between 200～400.

3. the frequency domain fast sub picture element global motion estimating method that is used for image stabilization as claimed in claim 1 is characterized in that:

Described step (5) is divided into two steps:

At first, according to the peak-peak that search in the step (4) obtains, note respectively in the horizontal direction with vertical direction on the functional value of the most contiguous three coordinate points of maximal peak point;

Secondly, according to the peak-peak that obtains and note in the horizontal direction with vertical direction on the functional value of the most contiguous three coordinate points of maximal peak point, calculate the sub picture element global motion vector under the corresponding arbitrary accuracy respectively.