CN100481963C - Visual difference calculating method for video image in multi-vision point system - Google Patents
- Publication number
- CN100481963C CNB2005100774732A CN200510077473A
- Authority
- CN
- China
- Prior art keywords
- video image
- image
- target video
- frame
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Landscapes
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
- Image Processing (AREA)
Abstract
Description
Technical Field
The present invention relates to a disparity calculation method for video images in a multi-viewpoint system, and belongs to the technical field of video image processing.
Background Art
The block matching method used in multi-viewpoint video systems is the most common way of computing the disparity between two video images. Its basic idea is to partition the target image into blocks of fixed size and to search for the best match of each block in the reference image. The classic search technique is a full search within a search window; with block size N by N, the basic algorithm is as follows:
If the sum of absolute gray-level differences

    D(T) = Σ_{i=1}^{N} Σ_{j=1}^{N} | f_l(i, j) - f_r(i + m, j + n) |

reaches its minimum, so that the correlation between the two blocks reaches its maximum, at T = (m, n), then T' = T,

where

T is the spatial displacement of the image block; T' is the spatial displacement at which the correlation between the two corresponding image blocks in the target image and the reference image reaches its maximum; m and n are the horizontal and vertical components of the displacement T; and f_l, f_r are the pixel gray values of the two corresponding image blocks in the target image and the reference image.
The traditional block-matching disparity estimation algorithm is widely adopted because the algorithm is simple and each match is inexpensive. However, performing a full search over the entire target image requires a large amount of computation.
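The full search can be sketched in plain Python as follows. This is an illustrative sketch, not the patent's implementation: the function and variable names are hypothetical, and the cost is the sum of absolute differences of pixel gray values between corresponding blocks.

```python
def full_search(target_block, reference, top, left, search_range):
    """Exhaustive block matching: try every candidate displacement (m, k)
    inside the search window of the reference image and keep the one with
    the smallest sum of absolute gray-level differences (i.e. the largest
    correlation with the N x N target block)."""
    n = len(target_block)                       # block size N
    h, w = len(reference), len(reference[0])
    best_cost, best_disp = float("inf"), (0, 0)
    for m in range(-search_range, search_range + 1):      # horizontal shift
        for k in range(-search_range, search_range + 1):  # vertical shift
            r, c = top + k, left + m            # candidate block origin
            if 0 <= r <= h - n and 0 <= c <= w - n:
                cost = sum(
                    abs(target_block[i][j] - reference[r + i][c + j])
                    for i in range(n) for j in range(n)
                )
                if cost < best_cost:
                    best_cost, best_disp = cost, (m, k)
    return best_disp, best_cost
```

Copying a block out of a synthetic image and searching for it recovers the displacement exactly, with zero residual cost.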
Summary of the Invention
The purpose of the present invention is to propose a disparity calculation method for video images in a multi-viewpoint system. On the basis of traditional block matching, it introduces an adaptive algorithm based on stereo camera geometry and video characteristics, which improves the efficiency of the disparity calculation.
The disparity calculation method for video images in a multi-viewpoint system proposed by the present invention comprises the following steps:
(1) Use the block matching method to calculate the complete disparity of the first frame of the target video image;
(2) Taking the disparity value of the corresponding image block in the previous frame of the target video image as the current position, perform a limited search of the reference video image with search radius LW;
(3) If an image block corresponding to the one in the target video image is found in the reference video image, go to step (8); otherwise, go to step (4);
(4) Taking the disparity value of the corresponding image block in the previous frame of the target video image as the current position, perform a directional search of the reference video image to the left, with search range LW+1 to LW+DW1;
(5) If an image block corresponding to the one in the target video image is found in the reference video image, go to step (8); otherwise, go to step (6);
(6) Taking the disparity value of the corresponding image block in the previous frame of the target video image as the current position, perform a directional search of the reference video image to the right, with search range LW+1 to LW+DW2;
(7) If an image block corresponding to the one in the target video image is found in the reference video image, go to step (8); otherwise, take the image block with the smallest average pixel gray-level difference as the corresponding block and go to step (8);
(8) From the position of the image block in the target video image and the position of the found reference image block, obtain the disparity between the image block in the target video image and the corresponding image block in the reference video image.
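Steps (2) through (8) can be sketched as follows. This is a minimal sketch under stated assumptions: disparity is treated as a one-dimensional horizontal offset, the matching criterion is simplified to a cost threshold, and `match_at` and all other names are hypothetical, not from the patent.

```python
def adaptive_search(prev_disp, match_at, LW, DW1, DW2, threshold):
    """Sketch of steps (2)-(8): limited search around the previous frame's
    disparity, then directional searches left and right, then a fallback.

    match_at(d) returns the block-matching cost at candidate disparity d;
    a candidate is accepted when its cost is below `threshold`.
    """
    # (2)-(3): limited search of radius LW around the previous disparity.
    candidates = range(prev_disp - LW, prev_disp + LW + 1)
    best = min(candidates, key=match_at)
    if match_at(best) < threshold:
        return best                     # step (8): disparity found
    # (4)-(5): directional search to the left, offsets LW+1 .. LW+DW1.
    left = range(prev_disp - LW - DW1, prev_disp - LW)
    best = min(left, key=match_at)
    if match_at(best) < threshold:
        return best
    # (6)-(7): directional search to the right, offsets LW+1 .. LW+DW2.
    right = range(prev_disp + LW + 1, prev_disp + LW + DW2 + 1)
    best = min(right, key=match_at)
    if match_at(best) < threshold:
        return best
    # (7) fallback: take the candidate with the smallest average
    # gray-level difference among all positions examined.
    all_cands = list(candidates) + list(left) + list(right)
    return min(all_cands, key=match_at)
```

For example, with the true disparity just outside the limited window, the rightward directional search finds it; with a looser threshold, the limited search already succeeds.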
The advantage of the disparity calculation method for video images in a multi-viewpoint system proposed by the present invention is its improved computational efficiency: it obtains disparity results of the same accuracy as the traditional block matching algorithm with less computation.
Detailed Description
In the disparity calculation method for video images in a multi-viewpoint system proposed by the present invention, the complete disparity of the first frame of the target video image is first computed with the block matching method. Then, taking the disparity value of the corresponding image block in the previous frame of the target video image as the current position, a limited search of radius LW is performed over the reference video image. If an image block corresponding to the one in the target video image is found in the reference video image, the disparity between the image block in the target video image and the corresponding image block in the reference video image is obtained from the position of the block in the target video image and the position of the found reference block. If no corresponding block is found, a directional search to the left is performed over the reference video image, again starting from the previous frame's disparity value, with search range LW+1 to LW+DW1; if this search succeeds, the disparity is obtained from the two block positions as above. If it fails, a directional search to the right is performed with search range LW+1 to LW+DW2; if this search succeeds, the disparity is again obtained from the two block positions. If it also fails, the image block with the smallest average pixel gray-level difference is taken as the corresponding block, and the disparity is obtained from the position of the block in the target video image and the position of that reference block.
In a multi-viewpoint camera system, directionality and uniformity are two properties related to disparity. Making full use of these geometric properties during disparity calculation simplifies the disparity matching algorithm and enables fast matching:
1. Directionality:
For the same scene, the perspective projection in the left image is shifted locally, relative to the right image, in the direction opposite to the positive horizontal direction. That is, if the positive horizontal direction points to the right, the projection of a given spatial point lies further to the left in the left image than in the right image. This property provides the basis for the directional search of the algorithm: for example, when searching for the match of a left-image block in the right image, the search can proceed to the right first.
2. Uniformity:
The disparity difference between adjacent image sub-blocks along an epipolar line is in fact very small. Therefore, when performing disparity matching, the disparity vector already computed for a neighboring block can be used as a reference, and the disparity vector of the current block can be searched for within a small range around it, finding the current disparity quickly and accurately. This property is called limited search.
Since the purpose of the disparity calculation is video encoding and decoding, the characteristics of video can be exploited: the disparity correlation between adjacent video frames is very high, which further improves the efficiency of block matching. Moreover, because the algorithm operates on video, the search results of preceding frames can be used to modify the parameter values of the current frame, so that the parameters automatically adapt to the characteristics of the video.
For the first frame of the video, the traditional search algorithm is used to obtain its complete disparity; from the second frame onward, the disparity of each frame is computed on the basis of the disparity result of the previous frame.
The search radii LW, DW1 and DW2 in the method of the present invention must be adjusted continuously according to the search results on the target image in order to improve search efficiency. The three search radius parameters are adjusted in exactly the same way; LW is used as an example below:
An adjustment parameter ALW is set for each frame. LW for frame n+1 is then (ALW of frame n + ALW of frame n-1 + ALW of frame n-2)/3, where the ALW of frame n depends on whether the search within the range LW for that frame satisfies the matching criterion: if it does, the ALW of frame n is the absolute value of the difference between the disparity value found and the disparity value of the previous frame; if it does not, the ALW of frame n is the ALW of frame n-1 plus 9.
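The adjustment rule above can be written as a pair of small update functions. This is a sketch with hypothetical names, assuming integer disparity values:

```python
def update_alw(matched, found_disp, prev_disp, prev_alw):
    """ALW for frame n: |d_n - d_{n-1}| when the limited search of frame n
    satisfied the matching criterion, otherwise frame n-1's ALW plus 9."""
    if matched:
        return abs(found_disp - prev_disp)
    return prev_alw + 9

def next_lw(alw_n, alw_n1, alw_n2):
    """LW for frame n+1: the average of the last three frames' ALW values."""
    return (alw_n + alw_n1 + alw_n2) / 3
```

A sequence of successful matches with stable disparity drives ALW, and hence LW, toward zero, while failed matches grow the window by 9 per frame.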
The matching criterion above may be the mean absolute difference (MAD) criterion or another similarity criterion. Here a dispersion criterion is defined whose performance is better than that of the general MAD criterion. Let f_l(i, j) and f_r(i, j) be the pixel gray levels of the reference video image block and the target video image block, with block size N by N. Under the conditions of a parallel stereo camera system, the dispersion function is defined as:
When B(d) is at its minimum, the two image blocks are matched, and this d is the disparity between the two image blocks.
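As a sketch of matching under such a criterion, the baseline MAD criterion that the dispersion criterion is compared against can be written as follows, for a parallel camera setup where candidate matches differ only by a horizontal shift d. All function and variable names are illustrative, not from the patent.

```python
def mad(block_l, block_r):
    """Mean absolute difference between two N x N blocks of gray levels."""
    n = len(block_l)
    total = sum(abs(block_l[i][j] - block_r[i][j])
                for i in range(n) for j in range(n))
    return total / (n * n)

def match_disparity(target_block, reference, row, col, d_max):
    """Pick the horizontal shift d in [0, d_max] that minimizes MAD
    between the target block and the shifted reference block."""
    n = len(target_block)
    best_d, best_cost = 0, float("inf")
    for d in range(d_max + 1):
        if col + d + n <= len(reference[0]):    # stay inside the image
            cand = [r[col + d: col + d + n] for r in reference[row: row + n]]
            cost = mad(target_block, cand)
            if cost < best_cost:
                best_d, best_cost = d, cost
    return best_d
```

On a synthetic image, a block lifted from a horizontally shifted position is recovered at exactly that shift.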
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2005100774732A CN100481963C (en) | 2005-06-24 | 2005-06-24 | Visual difference calculating method for video image in multi-vision point system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2005100774732A CN100481963C (en) | 2005-06-24 | 2005-06-24 | Visual difference calculating method for video image in multi-vision point system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1688168A CN1688168A (en) | 2005-10-26 |
CN100481963C true CN100481963C (en) | 2009-04-22 |
Family
ID=35306208
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2005100774732A Expired - Fee Related CN100481963C (en) | 2005-06-24 | 2005-06-24 | Visual difference calculating method for video image in multi-vision point system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100481963C (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101627634B (en) * | 2006-10-16 | 2014-01-01 | 诺基亚公司 | Systems and methods for using parallel decodable slices for multiview video coding |
CN101174331B (en) * | 2006-11-01 | 2011-07-27 | 深圳市蓝韵实业有限公司 | Maximum density projection generating method for medical image |
EP2116063B1 (en) * | 2007-01-04 | 2017-03-08 | Thomson Licensing | Methods and apparatus for multi-view information conveyed in high level syntax |
CN102256129B (en) * | 2007-10-15 | 2013-01-02 | 华为技术有限公司 | Method and system for determining corresponding macro block |
CN101415116B (en) * | 2007-10-15 | 2011-08-03 | 华为技术有限公司 | A method and system for determining corresponding macroblocks |
CN101600123B (en) * | 2009-07-06 | 2010-08-25 | 四川大学 | Method for shifting parallel vidicon array for obtaining parallactic image |
CN101754038B (en) * | 2009-12-09 | 2012-05-30 | 青岛海信网络科技股份有限公司 | Parallax extraction method |
CN102075779B (en) * | 2011-02-21 | 2013-05-08 | 北京航空航天大学 | Intermediate view synthesizing method based on block matching disparity estimation |
CN102215417A (en) * | 2011-05-04 | 2011-10-12 | 山东大学 | Parallax prediction method capable of establishing mathematical model based on block matching |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1332430A (en) * | 2001-07-27 | 2002-01-23 | 南开大学 | 3D tracking and measurement method of moving objects by 2D code |
CN1534328A (en) * | 2003-03-21 | 2004-10-06 | Sharp Kk | Parallactic screen and multi-view display
GB2405517A (en) * | 2003-08-30 | 2005-03-02 | Sharp Kk | Multiple view display |
CN1591089A (en) * | 2003-07-28 | 2005-03-09 | 三星电子株式会社 | Image displaying unit of a 3D image system having multi-viewpoints |
CN1607840A (en) * | 2003-08-05 | 2005-04-20 | 三星电子株式会社 | Apparatus and method for generating 3D image signal using space-division method |
JP2005110113A (en) * | 2003-10-01 | 2005-04-21 | Nec Corp | Multi-visual point image transmission system and method, multi-visual point image compression apparatus and method, and multi-visual point image decompression apparatus, method and program |
- 2005-06-24 CN CNB2005100774732A patent/CN100481963C/en not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1332430A (en) * | 2001-07-27 | 2002-01-23 | 南开大学 | 3D tracking and measurement method of moving objects by 2D code |
CN1534328A (en) * | 2003-03-21 | 2004-10-06 | Sharp Kk | Parallactic screen and multi-view display |
CN1591089A (en) * | 2003-07-28 | 2005-03-09 | 三星电子株式会社 | Image displaying unit of a 3D image system having multi-viewpoints |
CN1607840A (en) * | 2003-08-05 | 2005-04-20 | 三星电子株式会社 | Apparatus and method for generating 3D image signal using space-division method |
GB2405517A (en) * | 2003-08-30 | 2005-03-02 | Sharp Kk | Multiple view display |
JP2005110113A (en) * | 2003-10-01 | 2005-04-21 | Nec Corp | Multi-visual point image transmission system and method, multi-visual point image compression apparatus and method, and multi-visual point image decompression apparatus, method and program |
Non-Patent Citations (2)
Title |
---|
Multi-viewpoint video coding scheme based on MPEG-4 multiple auxiliary components. Zhang Hu, Sun Lifeng, Zhong Yuzhuo. Computer Engineering and Applications, 2005 *
Also Published As
Publication number | Publication date |
---|---|
CN1688168A (en) | 2005-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1661384B1 (en) | Semantics-based motion estimation for multi-view video coding | |
CN101193302B (en) | Illumination Compensation Method and Device and Video Codec Method and Device | |
CN102254348B (en) | Virtual viewpoint mapping method based on adaptive disparity estimation | |
US9137518B2 (en) | Method and system for converting 2D image data to stereoscopic image data | |
US9361699B2 (en) | Imaging system and method | |
US20120320986A1 (en) | Motion vector estimation method, multiview video encoding method, multiview video decoding method, motion vector estimation apparatus, multiview video encoding apparatus, multiview video decoding apparatus, motion vector estimation program, multiview video encoding program, and multiview video decoding program | |
EP1932366A2 (en) | Adaptive motion search range | |
WO2009097714A1 (en) | Depth searching method and depth estimating method for multi-viewing angle video image | |
CN108460792B (en) | Efficient focusing stereo matching method based on image segmentation | |
US20150172715A1 (en) | Picture encoding method, picture decoding method, picture encoding apparatus, picture decoding apparatus, picture encoding program, picture decoding program, and recording media | |
JP2007036800A (en) | Video encoding method, video decoding method, video encoding program, video decoding program, and computer-readable recording medium on which these programs are recorded | |
CN100481963C (en) | Visual difference calculating method for video image in multi-vision point system | |
JP4605716B2 (en) | Multi-view image compression encoding method, apparatus, and program | |
JP4944046B2 (en) | Video encoding method, decoding method, encoding device, decoding device, program thereof, and computer-readable recording medium | |
CN104202606B (en) | A method for determining motion estimation starting points based on HEVC | |
CN103139569B (en) | The coding of multi-view point video, coding/decoding method, device and codec | |
Cheung et al. | On modeling the rendering error in 3D video | |
CN102378009B (en) | Fast algorithm for estimating optical parallax | |
An et al. | Theory and experiment analysis of disparity for stereoscopic image pairs | |
CN113610055A (en) | All-optical video sequence intra-frame prediction method based on gradient information | |
CN105187813B (en) | A kind of time domain uniformity deep video method of estimation based on adaptive weight | |
CN102215417A (en) | Parallax prediction method capable of establishing mathematical model based on block matching | |
CN114332188B (en) | Parallax image generation method, device and medium for binocular vision device | |
Zhang et al. | An improved depth map estimation for coding and view synthesis | |
KR101138921B1 (en) | Apparatus and method for encoding multi-view video |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090422 Termination date: 20140624 |
|
EXPY | Termination of patent right or utility model |