CN102790892A

CN102790892A - Depth map coding method and device

Info

Publication number: CN102790892A
Application number: CN2012102322123A
Authority: CN
Inventors: 戴琼海; 汪启扉; 张永兵; 季向阳; 王好谦
Original assignee: Tsinghua University; Shenzhen Graduate School Tsinghua University
Current assignee: Tsinghua University; Shenzhen Graduate School Tsinghua University
Priority date: 2012-07-05
Filing date: 2012-07-05
Publication date: 2012-11-21
Anticipated expiration: 2032-07-05
Also published as: CN102790892B

Abstract

The present invention proposes a depth map encoding method and device, the method comprising: establishing a plurality of division lines and forming a division set, and the plurality of division lines are used for wedge-shaped division of depth macroblocks; performing intra-coding mode on depth macroblocks Encoding to obtain the first rate-distortion cost value; judge whether to encode the depth macroblock in inter-frame coding mode, and if so, encode in inter-frame coding mode to obtain the second rate-distortion cost value; continue to judge whether the depth macroblock contains discontinuity If yes, encode the depth macroblock in the geometric division coding mode, including: select the optimal division line to divide the depth macroblock to obtain the first and second depth sub-regions, and perform predictive coding on the two sub-regions Obtain a third rate-distortion cost value; compare the rate-distortion cost values in different coding modes to select a coding mode pair with the smallest rate-distortion cost for coding. The embodiments of the present invention improve the compression efficiency of the depth map and reduce the coding complexity.

Description

Depth map coding method and device

技术领域 technical field

本发明涉及视频编码技术领域，特别涉及一种深度图编码方法和装置。The present invention relates to the technical field of video coding, in particular to a depth map coding method and device.

背景技术 Background technique

在立体视频以及自由视点视频系统中，由多视点视频和多视点深度图构成的数据格式获得了广泛的应用。其中，深度是指视频帧中每个像素点对应到三维空间中的点到相机平面的距离。在立体视频中，为了通过虚拟视点绘制获得用户期望观看的视点所对应的视频信息，我们需要获得每一个视频帧所对应的深度图。因此，立体视频序列通常包含多路彩色视频信息以及每一路彩色视频所对应的深度图序列。由于深度图中每个像素点的深度信息为0-255之间的一个整数值，因此深度图可以被看作是一系列灰度图像所组成的视频序列。为了有效的存储和传输海量的立体视频数据，立体视频系统通常采用多视点视频编码方法对多视点视频和深度图序列分别进行压缩。通过视点内和视间预测编码，多视点视频编码方法能够有效地压缩多路深度图之间的冗余。In stereoscopic video and free-viewpoint video systems, the data format composed of multi-viewpoint video and multi-viewpoint depth map has been widely used. Wherein, the depth refers to the distance from each pixel in the video frame corresponding to a point in the three-dimensional space to the camera plane. In stereoscopic video, in order to obtain the video information corresponding to the viewpoint that the user expects to watch through virtual viewpoint rendering, we need to obtain the depth map corresponding to each video frame. Therefore, a stereoscopic video sequence usually includes multiple channels of color video information and a depth map sequence corresponding to each channel of color video. Since the depth information of each pixel in the depth map is an integer value between 0 and 255, the depth map can be regarded as a video sequence composed of a series of grayscale images. In order to effectively store and transmit massive stereoscopic video data, stereoscopic video systems usually use multi-view video coding methods to compress multi-view video and depth map sequences respectively. Through intra-view and inter-view predictive coding, multi-view video coding methods can effectively compress the redundancy between multiple depth maps.

传统的多视点视频编码方法是基于H.264/MPEG-4编码标准上的一种多视点视频编码拓展方案。在传统的多视点视频编码中，编码器以宏块为单元对每一帧图像进行编码。每一个16×16大小的宏块又可以被进一步被划分成16×8、8×16和8×8子块以及8×4、4×8和4×4的亚块。不同的子块和亚块被称为预测单元。在编码过程中，编码器对每一个预测单元进行运动估计，获得运动补偿预测的率失真代价。然后进行基于率失真优化的模式选择，获得每个块最优的编码模式和预测残差，并对残差进行变换编码。实验证明，传统的多视点视频编码能够在深度图上获得较好的压缩性能。The traditional multi-view video coding method is a multi-view video coding extension scheme based on the H.264/MPEG-4 coding standard. In traditional multi-view video coding, the encoder codes each frame of image in units of macroblocks. Each 16×16 macroblock can be further divided into 16×8, 8×16 and 8×8 sub-blocks and 8×4, 4×8 and 4×4 sub-blocks. The different sub-blocks and sub-blocks are called prediction units. During the encoding process, the encoder performs motion estimation on each prediction unit to obtain the rate-distortion cost of motion compensation prediction. Then, mode selection based on rate-distortion optimization is performed to obtain the optimal coding mode and prediction residual for each block, and transform coding is performed on the residual. Experiments show that traditional multi-view video coding can achieve better compression performance on depth maps.

区别于传统的彩色视频，深度图仅包含每个像素点的距离信息，不包含任何视频纹理信息。因此，深度图中处于物体内部的编码宏块仅包含均一的深度纹理，处在物体边缘的编码宏块则包含两个或多个不连续的深度区域。传统的宏块划分无法有效地表示物体边缘。尤其是在低编码码率的条件下，通过划分得到的子块和亚块模式较少被选用。然而，采用传统的多视点视频编码方法对深度图编码时，编码器仍然需要对所有的宏块模式进行运动估计和模式决策。该过程需要消耗大量的计算资源，增大了编码端的复杂度。Different from traditional color video, the depth map only contains the distance information of each pixel and does not contain any video texture information. Therefore, the coded macroblocks inside the object in the depth map only contain a uniform depth texture, and the coded macroblocks on the edge of the object contain two or more discontinuous depth regions. Traditional macroblock division cannot effectively represent object edges. Especially under the condition of low coding rate, sub-blocks and sub-block patterns obtained by division are rarely selected. However, when encoding depth maps using traditional multi-view video coding methods, the encoder still needs to perform motion estimation and mode decisions for all macroblock modes. This process consumes a large amount of computing resources, which increases the complexity of the encoding side.

为了获得较高的深度图压缩效率，需要针对深度图的特点设计更加高效的编码方法。为此，本发明针对深度图仅包含物体轮廓信息这一特征，提出一种基于几何划分的深度图编码方法。通过对包含不连续运动场的深度宏块进行自适应几何划分，获得了较好的预测结果。从而提高深度图编码的效率，同时降低深度图编码的复杂度。In order to obtain a higher depth image compression efficiency, it is necessary to design a more efficient encoding method according to the characteristics of the depth image. For this reason, the present invention proposes a depth map encoding method based on geometrical division, aiming at the feature that the depth map only contains object outline information. Better prediction results are obtained by adaptive geometric partitioning of depth macroblocks containing discontinuous motion fields. Thereby, the efficiency of depth map coding is improved, and the complexity of depth map coding is reduced at the same time.

目前可以查到的与本发明比较相关的专利有四项，分别公开了一种应用于3DTV与FTV系统的深度图编码压缩方法，一种立体电视系统中深度图像编码方法,一种立体电视系统中深度图像的编码方法和一种多视点深度视频的编码方法，申请号分别为200810063741.9,200810161597.2,200810120082.8和200910154138.6。尽管所提及的四个专利均涉及立体视频中深度图的编码方法，但是第一、第二以及第三项专利均分别仅提出了一种对不同区域的深度宏块采用不同的量化参数的编码方案。区别在于这三个方案设计了不同的深度宏块分类方法；第四项专利则对不同的区域的深度宏块的编码模式进行了限定，对于物体边缘区域的深度宏块设定较多的编码模式，而对非物体边缘区域的深度宏块设定较少的编码模式。然而该发明中所有的编码模式对应的预测单元仍然是基于传统的H.264/MPEG-4 AVC编码标准中所规定的划分方法获得。因此，所提及的四个专利均未涉及基于几何划分的深度图编码方法。At present, there are four patents related to the present invention that can be found, respectively disclosing a depth image encoding and compression method applied to 3DTV and FTV systems, a depth image encoding method in a stereoscopic television system, and a stereoscopic television system. A coding method for mid-depth images and a coding method for multi-view depth video, the application numbers are 200810063741.9, 200810161597.2, 200810120082.8 and 200910154138.6, respectively. Although the four mentioned patents are all related to the coding method of the depth map in stereoscopic video, the first, second and third patents only propose a method of using different quantization parameters for depth macroblocks in different regions. encoding scheme. The difference is that these three schemes design different depth macroblock classification methods; the fourth patent limits the coding modes of depth macroblocks in different areas, and sets more coding modes for depth macroblocks in object edge areas. mode, and fewer coding modes are set for depth macroblocks in non-object edge regions. However, the prediction units corresponding to all coding modes in this invention are still obtained based on the division method specified in the traditional H.264/MPEG-4 AVC coding standard. Therefore, none of the four mentioned patents deals with geometric partition-based coding methods for depth maps.

发明内容 Contents of the invention

本发明旨在至少解决上述技术问题之一。The present invention aims to solve at least one of the above-mentioned technical problems.

为此，本发明的一个目的在于提出一种能够提高深度图压缩效率且降低深度图的编码复杂度的深度图编码方法。Therefore, an object of the present invention is to propose a depth map encoding method that can improve the compression efficiency of the depth map and reduce the coding complexity of the depth map.

本发明的另一目的在于提出一种深度图编码装置。Another object of the present invention is to provide a depth map encoding device.

为了实现上述目的，本发明第一方面的实施例提出了一种深度图编码方法，包括以下步骤：建立多个划分线并组成划分集合，其中，多个划分线用于对深度宏块进行楔形划分；以帧内编码模式对深度宏块进行编码以获取对应的第一率失真代价值；判断是否以帧间编码模式对所述深度宏块进行编码，如果是则以帧间编码模式对所述深度宏块进行编码以获取对应的第二率失真代价值；继续判断所述深度宏块是否包含不连续的运动向量场，如果所述深度宏块进行帧间编码且包含不连续的运动向量场，则以几何划分编码模式对所述深度宏块进行编码，其中，所述几何划分编码模式进一步包括：根据所述深度宏块的所有像素点的亮度信息从所述划分集合中选择最优划分线对所述深度宏块进行划分以得到第一和第二深度子区域，且分别对所述第一和第二深度子区域进行运动估计和预测编码以获取对应的第三率失真代价值；以及比较不同编码模式下深度宏块的率失真代价值，以根据比较结果选择率失真代价最小的编码模式对所述深度宏块进行编码。In order to achieve the above object, the embodiment of the first aspect of the present invention proposes a depth map encoding method, including the following steps: establishing a plurality of division lines and forming a division set, wherein the plurality of division lines are used to wedge the depth macroblock Divide; encode the depth macroblock in intra-frame coding mode to obtain the corresponding first rate-distortion cost value; judge whether to code the depth macroblock in inter-frame coding mode, and if so, use inter-frame coding mode to encode all The depth macroblock is encoded to obtain the corresponding second rate-distortion cost value; continue to judge whether the depth macroblock contains a discontinuous motion vector field, if the depth macroblock is inter-coded and contains discontinuous motion vectors field, the depth macroblock is coded in a geometric partition coding mode, wherein the geometric partition coding mode further includes: selecting the optimal partition set from the partition set according to the brightness information of all pixels of the depth macroblock The division line divides the depth macroblock to obtain first and second depth sub-regions, and respectively performs motion estimation and predictive coding on the first and second depth sub-regions to obtain corresponding third rate-distortion cost values ; and comparing the rate-distortion cost values of the depth macroblocks in different coding modes, so as to select the coding mode with the smallest rate-distortion cost according to the comparison result to encode the depth macroblocks.

另外，根据本发明上述实施例的深度图编码方法还可以具有如下附加的技术特征：In addition, the depth map coding method according to the above-mentioned embodiments of the present invention may also have the following additional technical features:

在一些示例中，所述划分集合为：In some examples, the set of partitions is:

G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，G={ξ(ρ _i ,θ _i )|i=1,2,…,L},

其中，ξ(ρ_i,θ_i)为所述划分线集合中的第i条划分线，ρ_i为深度宏块至该划分线的距离，θ_i为该划分线与第一方向的夹角，L为划分线的个数。Wherein, ξ(ρ _i , θ _i ) is the i-th dividing line in the set of dividing lines, ρ _i is the distance from the depth macroblock to the dividing line, and θ _i is the angle between the dividing line and the first direction , L is the number of dividing lines.

在一些示例中，所述深度宏块至多个划分线的距离均位于[0,8)之间，多个划分线与第一方向的夹角均位于[0,2π)之间。In some examples, the distances from the depth macroblock to the multiple division lines are all between [0, 8), and the angles between the multiple division lines and the first direction are all between [0, 2π).

在一些示例中，所述判断深度宏块是否包含不连续的运动向量场的步骤进一步包括：获取所述深度宏块中所有像素点的亮度值；计算所述深度宏块中所有像素点的亮度值的方差；以及比较所述方差和预设阈值的大小，如果所述方差大于所述预设阈值，则判断所述深度宏块包含不连续的运动向量场。In some examples, the step of judging whether the depth macroblock contains a discontinuous motion vector field further includes: acquiring brightness values of all pixels in the depth macroblock; calculating brightness values of all pixels in the depth macroblock and comparing the variance with a preset threshold, and if the variance is larger than the preset threshold, it is judged that the depth macroblock contains a discontinuous motion vector field.

在一些示例中，根据所述深度宏块的所有像素点的亮度信息从所述划分集合中选择最优划分线，对所述深度宏块进行划分以得到第一和第二深度子区域的步骤进一步包括：分别以所述划分集合中的每一条划分线对所述深度宏块进行划分，并分别计算划分所得的两部分中各自的所有像素点的平均亮度值以得到对应的第一部分和第二部分的平均亮度值；分别对每一条划分线对应的第一部分和第二部分的平均亮度值做差得到差值；以及获取使差值的绝对值最大的划分线作为所述最优划分线，以对所述深度宏块进行划分以得到第一和第二深度子区域。In some examples, a step of selecting an optimal division line from the division set according to the brightness information of all pixels of the depth macroblock, and dividing the depth macroblock to obtain the first and second depth sub-regions It further includes: dividing the depth macroblock with each division line in the division set, and calculating the average luminance values of all the pixels in the two divided parts to obtain the corresponding first part and the second part respectively. The average luminance value of the two parts; the difference is obtained by making a difference between the average luminance values of the first part and the second part corresponding to each dividing line; and obtaining the dividing line that maximizes the absolute value of the difference as the optimal dividing line , to divide the depth macroblock to obtain first and second depth sub-regions.

在一些示例中，所述以帧内编码模式对深度宏块进行编码以获取对应的第一率失真代价值的步骤包括：根据H.264/MPEG-4 AVC编码标准中的帧内编码模式对所述深度宏块进行帧内编码，并将对应的最优率失真代价作为所述第一率失真代价值；所述以帧间编码模式对深度宏块进行编码以获取对应的第二率失真代价值的步骤包括：根据H.264/MPEG-4AVC编码标准中的帧间编码模式对所述深度宏块进行帧间模式编码，并将对应的最优率失真代价作为所述第二率失真代价值。In some examples, the step of encoding the depth macroblock in the intra-frame encoding mode to obtain the corresponding first rate-distortion cost value includes: according to the intra-frame encoding mode in the H.264/MPEG-4 AVC encoding standard The depth macroblock is intra-frame coded, and the corresponding optimal rate-distortion cost is used as the first rate-distortion cost value; the depth macroblock is coded in an inter-frame coding mode to obtain a corresponding second rate-distortion cost The cost value step includes: performing inter-mode coding on the depth macroblock according to the inter-frame coding mode in the H.264/MPEG-4AVC coding standard, and using the corresponding optimal rate-distortion cost as the second rate-distortion cost cost value.

在一些示例中，所述比较不同编码模式下深度宏块的率失真代价值，以根据比较结果选择率失真代价最小的编码模式对所述深度宏块进行编码的步骤包括：如果判断不对所述深度宏块进行帧间编码，则仅对所述深度宏块进行帧内编码；如果判断所述深度宏块进行帧间编码且没有包含不连续的运动向量场，则根据第一和第二率失真代价值的比较结果选择较小的率失真代价值对应的帧间编码模式或者帧内编码模式对所述深度宏块进行编码；以及如果判断所述深度宏块进行帧间编码且包含不连续的运动向量场，则根据第一至第三率失真代价值的比较结果选择最小的率失真代价值对应的帧间编码模式、帧内编码模式或者几何划分编码模式对所述深度宏块进行编码。In some examples, the step of comparing the rate-distortion cost values of the depth macroblocks in different coding modes, and selecting the coding mode with the smallest rate-distortion cost according to the comparison result to encode the depth macroblocks includes: if it is judged that the If the depth macroblock is inter-coded, then only the depth macroblock is intra-coded; if it is judged that the depth macroblock is inter-coded and does not contain discontinuous motion vector fields, then according to the first and second rates The comparison result of the distortion cost value selects the inter-frame coding mode or the intra-frame coding mode corresponding to the smaller rate-distortion cost value to encode the depth macroblock; and if it is judged that the depth macroblock is inter-frame coding and contains discontinuity motion vector field, then select the inter-frame coding mode, intra-frame coding mode or geometric partition coding mode corresponding to the smallest rate-distortion cost value to encode the depth macroblock according to the comparison results of the first to third rate-distortion cost values .

本发明第二方面的实施例提出了一种深度图编码装置，包括：划分线建立模块，用于建立多个划分线并组成划分集合，其中，多个划分线用于对深度宏块进行楔形划分；帧内编码模块，用于以帧内编码模式对深度宏块进行编码以获取对应的第一率失真代价值；帧间编码判断模块，用于判断是否以帧间编码模式对所述深度宏块进行编码；帧间编码模块，用于在所述帧间编码判断模块判断以帧间编码模式对所述深度宏块进行编码时以帧间编码模式对深度宏块进行编码以获取对应的第二率失真代价值；几何划分判断模块，用于判断所述深度宏块是否包含不连续的运动向量场；几何划分编码模块，用于在所述几何划分判断模块判断所述深度宏块包含不连续的运动向量场时，以几何划分编码模式对所述深度宏块进行编码，其中，所述几何划分编码模式包括：根据所述深度宏块的所有像素点的亮度信息从所述划分集合中选择最优划分线对所述深度宏块进行划分以得到第一和第二深度子区域，且分别对所述第一和第二深度子区域进行运动估计和预测编码以获取对应的第三率失真代价值；以及编码模式选择模块，用于比较所述深度宏块在不同编码模式下的率失真代价值，以根据比较结果选择率失真代价最小的编码模式以便对所述深度宏块进行编码。The embodiment of the second aspect of the present invention proposes a depth map encoding device, including: a division line establishment module, configured to establish a plurality of division lines and form a division set, wherein the plurality of division lines are used to wedge a depth macroblock Division; an intra-frame coding module, used to code the depth macroblock in an intra-frame coding mode to obtain a corresponding first rate-distortion cost value; an inter-frame coding judging module, used to judge whether to use an inter-frame coding mode to code the depth The macroblock is coded; the inter-frame coding module is used to code the depth macroblock in the inter-frame coding mode to obtain the corresponding The second rate-distortion cost value; the geometric division judgment module is used to judge whether the depth macroblock contains a discontinuous motion vector field; the geometric division coding module is used to judge whether the depth macroblock contains in the geometric division judgment module In the case of a discontinuous motion vector field, the depth macroblock is coded in a geometric partition coding mode, wherein the geometric partition coding mode includes: according to the brightness information of all pixels of the depth macroblock, from the partition set Select the optimal dividing line to divide the depth macroblock to obtain the first and second depth sub-regions, and perform motion estimation and predictive coding on the first and second depth sub-regions respectively to obtain the corresponding third A rate-distortion cost value; and a coding mode selection module, used to compare the rate-distortion cost values of the depth macroblock in different coding modes, so as to select the coding mode with the smallest rate-distortion cost according to the comparison result so as to carry out the depth macroblock coding.

另外，根据本发明上述实施例的深度图编码装置还可以具有如下附加的技术特征：In addition, the depth map encoding device according to the above-mentioned embodiments of the present invention may also have the following additional technical features:

G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，G={ξ(ρ _i ,θ _i )|i=1,2,…,L},

在一些示例中，所述几何划分判断模块用于获取所述深度宏块中所有像素点的亮度值，并计算所述深度宏块中所有像素点的亮度值的方差，并比较所述方差和预设阈值的大小，如果所述方差大于所述预设阈值，则判断所述深度宏块包含不连续的运动向量场。In some examples, the geometric division judging module is used to obtain the luminance values of all pixels in the depth macroblock, calculate the variance of the luminance values of all pixels in the depth macroblock, and compare the variance and If the variance is greater than the preset threshold, it is determined that the depth macroblock contains a discontinuous motion vector field.

在一些示例中，所述几何划分编码模块用于分别以所述划分集合中的每一条划分线对所述深度宏块进行划分，并分别计算划分所得的两部分中各自的所有像素点的平均亮度值以得到对应的第一部分和第二部分的平均亮度值，分别对每一条划分线对应的第一部分和第二部分的平均亮度值做差得到差值，并获取使差值的绝对值最大的划分线作为所述最优划分线，以对所述深度宏块进行划分以得到第一和第二深度子区域。In some examples, the geometric partition encoding module is configured to respectively divide the depth macroblock with each dividing line in the division set, and respectively calculate the average value of all pixels in the two divided parts respectively. Brightness value to obtain the corresponding average brightness value of the first part and the second part, respectively make a difference between the average brightness values of the first part and the second part corresponding to each dividing line to obtain the difference, and obtain the maximum absolute value of the difference The dividing line of is used as the optimal dividing line, so as to divide the depth macroblock to obtain the first and second depth sub-regions.

在一些示例中，所述帧内编码模块用于根据H.264/MPEG-4 AVC编码标准中的帧内编码模式对所述深度宏块进行帧内编码，并将对应的最优率失真代价作为所述第一率失真代价值；所述帧间编码模块用于根据H.264/MPEG-4 AVC编码标准中的帧间编码模式对所述深度宏块进行帧间模式编码，并将对应的最优率失真代价作为所述第二率失真代价值。在一些示例中，所述编码模式选择模块用于在不对所述深度宏块进行帧间编码时，仅选择帧内编码模式，在所述深度宏块进行帧间编码且没有包含不连续的运动向量场时，根据第一和第二率失真代价值的比较结果选择较小的率失真代价值对应的帧间编码模式或者帧内编码模式，在所述深度宏块进行帧间编码且包含不连续的运动向量场时，根据第一至第三率失真代价值的比较结果选择最小的率失真代价值对应的帧间编码模式、帧内编码模式或者几何划分编码模式。In some examples, the intra-frame coding module is configured to perform intra-frame coding on the depth macroblock according to the intra-frame coding mode in the H.264/MPEG-4 AVC coding standard, and use the corresponding optimal rate-distortion cost As the first rate-distortion cost value; the inter-frame coding module is used to perform inter-mode coding on the depth macroblock according to the inter-frame coding mode in the H.264/MPEG-4 AVC coding standard, and correspond to The optimal rate-distortion cost of is used as the second rate-distortion cost value. In some examples, the encoding mode selection module is configured to only select an intra encoding mode when the depth macroblock is not inter-encoded, and the depth macroblock is inter-encoded and does not contain discontinuous motion In the vector field, according to the comparison result of the first and second rate-distortion cost values, select the inter-frame coding mode or the intra-frame coding mode corresponding to the smaller rate-distortion cost value, and perform inter-frame coding on the depth macroblock and include not For continuous motion vector fields, the inter-frame coding mode, intra-frame coding mode or geometric partition coding mode corresponding to the smallest rate-distortion cost value is selected according to the comparison results of the first to third rate-distortion cost values.

根据本发明实施例的深度图编码方法及装置，通过设计基于几何划分的深度图编码方法，根据合理的方式从帧间编码模式、帧内编码模式或者几何划分编码模式选择最佳的编码模式对深度宏块进行编码，由此，不但提高了深度图压缩效率、减少压缩时间，也降低深度图编码的复杂度，实现对深度图的高效编码。According to the depth map coding method and device of the embodiments of the present invention, by designing a depth map coding method based on geometric partitioning, the best coding mode pair is selected from the inter-frame coding mode, intra-frame coding mode or geometric partition coding mode in a reasonable manner. The depth macroblocks are coded, thereby not only improving the compression efficiency of the depth map, reducing the compression time, but also reducing the complexity of coding the depth map, and realizing efficient coding of the depth map.

本发明的附加方面和优点将在下面的描述中部分给出，部分将从下面的描述中变得明显，或通过本发明的实践了解到。Additional aspects and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

附图说明 Description of drawings

本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解，其中：The above and/or additional aspects and advantages of the present invention will become apparent and comprehensible from the description of the embodiments in conjunction with the following drawings, wherein:

图1为本发明实施例的深度图编码方法的流程图；FIG. 1 is a flowchart of a depth map encoding method according to an embodiment of the present invention;

图2为本发明一个实施例深度图编码方法的划分线对深度宏块进行划分的原理图；Fig. 2 is a principle diagram of dividing a depth macroblock by dividing lines of a depth map encoding method according to an embodiment of the present invention;

图3为本发明一个实施例的深度图编码方法的详细流程图；FIG. 3 is a detailed flowchart of a depth map encoding method according to an embodiment of the present invention;

图4为本发明一个实施例的深度图编码方法的采用的具有几何划分编码模式进行编码的编码器的工作原理图；Fig. 4 is a working principle diagram of an encoder with a geometric partition encoding mode used in a depth map encoding method according to an embodiment of the present invention;

图5为本发明一个实施例的深度图编码方法采用图4所示的编码器进行编码的对深度宏块预测编码模式的预测结构图；以及FIG. 5 is a prediction structure diagram of a depth macroblock prediction coding mode in which the coding method for a depth map according to an embodiment of the present invention uses the coder shown in FIG. 4 to code; and

图6为本发明实施例的深度图编码装置的结构图。Fig. 6 is a structural diagram of a depth map encoding device according to an embodiment of the present invention.

具体实施方式 Detailed ways

下面详细描述本发明的实施例，所述实施例的示例在附图中示出，其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的，仅用于解释本发明，而不能理解为对本发明的限制。Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

在本发明的描述中，需要理解的是，术语“中心”、“纵向”、“横向”、“上”、“下”、“前”、“后”、“左”、“右”、“竖直”、“水平”、“顶”、“底”、“内”、“外”等指示的方位或位置关系为基于附图所示的方位或位置关系，仅是为了便于描述本发明和简化描述，而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作，因此不能理解为对本发明的限制。此外，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性。In describing the present invention, it should be understood that the terms "center", "longitudinal", "transverse", "upper", "lower", "front", "rear", "left", "right", " The orientations or positional relationships indicated by "vertical", "horizontal", "top", "bottom", "inner" and "outer" are based on the orientations or positional relationships shown in the drawings, and are only for the convenience of describing the present invention and Simplified descriptions, rather than indicating or implying that the device or element referred to must have a particular orientation, be constructed and operate in a particular orientation, and thus should not be construed as limiting the invention. In addition, the terms "first" and "second" are used for descriptive purposes only, and should not be understood as indicating or implying relative importance.

在本发明的描述中，需要说明的是，除非另有明确的规定和限定，术语“安装”、“相连”、“连接”应做广义理解，例如，可以是固定连接，也可以是可拆卸连接，或一体地连接；可以是机械连接，也可以是电连接；可以是直接相连，也可以通过中间媒介间接相连，可以是两个元件内部的连通。对于本领域的普通技术人员而言，可以具体情况理解上述术语在本发明中的具体含义。In the description of the present invention, it should be noted that unless otherwise specified and limited, the terms "installation", "connection" and "connection" should be understood in a broad sense, for example, it can be a fixed connection or a detachable connection. Connected, or integrally connected; it may be mechanically connected or electrically connected; it may be directly connected or indirectly connected through an intermediary, and it may be the internal communication of two components. Those of ordinary skill in the art can understand the specific meanings of the above terms in the present invention in specific situations.

为了对本发明实施例的编码方式进行验证，在本发明的下述示例中，立体视频编码的测试序列采用标清格式的名字为“Book Arrival”的标准测试序列，该标清格式视频序列的分辨率为1024×768，该测试序列包含16个视点的视频和对应的深度图。在本发明的以下示例中，均以该测试序列中的深度图进行测试，采用该序列的第8个视点的深度图序列作为测试序列。该序列包含100帧深度图。如图4所示，编解码器采用H.264/MPEG-4 MVC标准参考软件JMVC 6.0；编码器GOP（Group of Pictures，图像组）的帧数为8；编码的时域预测编码采用Hierarchical B Picture（层次化双向预测编码帧，简称层次化B帧）的预测结构，编码预测结构图如图5所示。每个编码的深度宏块的大小为16×16。运动估计的精度为四分之一像素精度。其他编码参数均采用与H.264/MPEG-4 AVC标准中主档次（MainProfile）中规定的参数设置。In order to verify the encoding method of the embodiment of the present invention, in the following examples of the present invention, the test sequence of stereoscopic video encoding adopts the standard test sequence of "Book Arrival" with the name of SD format, and the resolution of this SD format video sequence is 1024×768, the test sequence contains 16 viewpoint videos and corresponding depth maps. In the following examples of the present invention, the depth map in the test sequence is used for testing, and the depth map sequence of the eighth viewpoint of the sequence is used as the test sequence. The sequence contains 100 frames of depth maps. As shown in Figure 4, the codec adopts the H.264/MPEG-4 MVC standard reference software JMVC 6.0; the number of frames of the encoder GOP (Group of Pictures, Group of Pictures) is 8; the temporal domain predictive encoding of the encoding adopts Hierarchical B The prediction structure of Picture (hierarchical bidirectional predictive coding frame, referred to as hierarchical B frame), the coding prediction structure diagram is shown in Figure 5. Each coded depth macroblock has a size of 16x16. The precision of the motion estimation is quarter-pixel precision. Other encoding parameters are set with the parameters specified in the main profile (MainProfile) of the H.264/MPEG-4 AVC standard.

以下结合附图首先描述根据本发明实施例的深度图编码方法。A method for encoding a depth map according to an embodiment of the present invention will first be described below with reference to the accompanying drawings.

参考图1，根据本发明实施例的深度图编码方法，包括如下步骤：Referring to Fig. 1, the depth map encoding method according to the embodiment of the present invention includes the following steps:

步骤S101，建立多个划分线并组成划分集合，其中，多个划分线用于对深度宏块进行楔形划分。在本发明的一个实施例中，划分集合为：G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，其中，ξ(ρ_i,θ_i)为划分线集合中的第i条划分线，ρ_i为深度宏块至该划分线的距离，θ_i为该划分线与第一方向的夹角，L为划分线的个数。Step S101 , establishing a plurality of division lines and forming a division set, wherein the plurality of division lines are used for wedge-shaped division of depth macroblocks. In one embodiment of the present invention, the division set is: G={ξ(ρ _i ,θ _i )|i=1,2,...,L}, where ξ(ρ _i ,θ _i ) is the division line set The i-th dividing line in , ρ _i is the distance from the depth macroblock to the dividing line, θ _i is the angle between the dividing line and the first direction, and L is the number of dividing lines.

如图2所示，定义楔形几何划分的多个划分线组成的划分集合。定义划分线为一条直线（如图2所示的划分线），该直线通过到深度宏块中心点的距离ρ和该直线与y轴，即第一方向的夹角θ来表示。在该实施例中，多个划分线的距离均位于[0,8)之间，多个划分线与第一方向的夹角均位于[0,2π)之间，即每个划分线的两个参数ρ和θ的取值范围为[0,8)和[0,2π），参数ρ的采样间隔为1，其取值集合为ρ={0,1,2,…,7}，参数θ的采样间隔为π/16，其取值集合为

因此，划分集合G={ξ(ρ_i,θ_i)|i＝1,2,…,L}中共包含256条可选的划分线，即L=256。As shown in FIG. 2 , a division set composed of multiple division lines defining a wedge-shaped geometric division is defined. The dividing line is defined as a straight line (the dividing line shown in FIG. 2 ), which is represented by the distance ρ to the center point of the depth macroblock and the angle θ between the straight line and the y-axis, that is, the first direction. In this embodiment, the distances between the multiple dividing lines are all located between [0, 8), and the angles between the multiple dividing lines and the first direction are all located between [0, 2π), that is, the two points of each dividing line The value ranges of parameters ρ and θ are [0,8) and [0,2π), the sampling interval of parameter ρ is 1, and its value set is ρ={0,1,2,…,7}, the parameter The sampling interval of θ is π/16, and its value set is

Therefore, the division set G={ξ(ρ _i ,θ _i )|i=1, 2, . . . , L} contains 256 optional division lines, ie L=256.

步骤S102，以帧内编码模式对深度宏块进行编码以获取对应的第一率失真代价值。Step S102, encode the depth macroblock in an intra-frame coding mode to obtain a corresponding first rate-distortion cost value.

具体地，根据H.264/MPEG-4 AVC编码标准中的帧内编码模式对深度宏块进行帧内编码，并将对应的最优率失真代价作为第一率失真代价值。更为具体地，编码采用与传统H.264/MPEG-4 AVC编码标准中定义的帧内编码模式相同的方法对该深度宏块，记为B_k进行帧内预测编码。在本实施例中，当前深度宏块在帧内预测编码模式下最优率失真代价为 $J_{k}^{intra} = 5741,$ 即第一率失真代价值为 $J_{k}^{intra} = 5741 .$ Specifically, the depth macroblock is intra-coded according to the intra-frame coding mode in the H.264/MPEG-4 AVC coding standard, and the corresponding optimal rate-distortion cost is used as the first rate-distortion cost value. More specifically, the same method as the intra-frame coding mode defined in the traditional H.264/MPEG-4 AVC coding standard is used for coding to perform intra-frame prediction coding on the depth macroblock, denoted as B _k . In this embodiment, the optimal rate-distortion cost of the current depth macroblock in the intra prediction coding mode is $J_{k}^{intra} = 5741,$ That is, the first rate-distortion cost is $J_{k}^{intra} = 5741 .$

步骤S103，判断是否以帧间编码模式对深度宏块进行编码，如果是则以帧间编码模式对深度宏块进行编码以获取对应的第二率失真代价值。Step S103, judging whether the depth macroblock is coded in the inter-frame coding mode, and if yes, the depth macroblock is coded in the inter-frame coding mode to obtain a corresponding second rate-distortion cost value.

具体地，根据H.264/MPEG-4 AVC编码标准中的帧间编码模式对所述深度宏块进行帧间模式编码，并将对应的最优率失真代价作为所述第二率失真代价值。在该实例中，上述帧间编码模式为传统的帧间编码模式。Specifically, perform inter-mode coding on the depth macroblock according to the inter-frame coding mode in the H.264/MPEG-4 AVC coding standard, and use the corresponding optimal rate-distortion cost as the second rate-distortion cost value . In this example, the above-mentioned inter-frame coding mode is a conventional inter-frame coding mode.

步骤S104，继续判断深度宏块是否包含不连续的运动向量场，如果深度宏块进行帧间编码且包含不连续的运动向量场，则以几何划分编码模式对深度宏块进行编码，其中，几何划分编码模式进一步包括：根据深度宏块的所有像素点的亮度信息从划分集合中选择最优划分线对深度宏块进行划分以得到第一和第二深度子区域，且分别对第一和第二深度子区域进行运动估计和预测编码以获取对应的第三率失真代价值。Step S104, continue to judge whether the depth macroblock contains a discontinuous motion vector field, if the depth macroblock is inter-coded and contains a discontinuous motion vector field, then encode the depth macroblock in the geometric partition coding mode, wherein, the geometric The division coding mode further includes: selecting an optimal division line from the division set according to the luminance information of all pixels of the depth macroblock to divide the depth macroblock to obtain the first and second depth sub-regions, and the first and second depth sub-regions respectively Motion estimation and predictive coding are performed on the two-depth sub-regions to obtain corresponding third rate-distortion cost values.

具体地，在一些示例中，判断是否以几何划分模式对深度宏块进行编码，即判断深度宏块是否包含不连续的运动向量场的步骤包括：Specifically, in some examples, the step of judging whether to encode the depth macroblock in a geometric division mode, that is, judging whether the depth macroblock contains a discontinuous motion vector field includes:

1、获取深度宏块中所有像素点的亮度值。例如当前编码的深度宏块B_k的全部像素点集合为

其中N代表该深度宏块中像素点的个数，在本实施例中，N=256。假设该深度宏块B_k为当前帧的第529个深度宏块。设B_k的全部像素点的亮度值为

{Y_{n}^{k} | n = 1,2,3, \cdot \cdot \cdot, N},

其中，

的数值如下述矩阵所示：1. Obtain the brightness values of all pixels in the depth macroblock. For example, the set of all pixels of the currently coded depth macroblock B _k is

Where N represents the number of pixels in the depth macroblock, and in this embodiment, N=256. Assume that the depth macroblock B _k is the 529th depth macroblock of the current frame. Let the brightness value of all pixels of B _k be

{Y_{no}^{k} | no = 1,2,3, \cdot \cdot \cdot, N},

in,

The values of are shown in the following matrix:

$[\begin{matrix} 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 & 108108 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 & 108108 & 108108 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 105105 & 108108 & 108108 & 108108 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 105105 & 105105 & 108108 & 108108 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 105105 & 105105 & 105105 & 105105 & 105105 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 & 105105 & 105105 & 105105 & 108108 & 105105 & 105105 & 105105 & 105105 \\ 3838 & 3838 & 3838 & 3838 & 3838 & 3838 & 108108 & 108108 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 & 105105 \end{matrix}]$

2、计算深度宏块中所有像素点的亮度值的方差。通过计算可以知道，该深度宏块Bk的所有像素点的亮度值的方差为838。2. Calculate the variance of the luminance values of all pixels in the depth macroblock. It can be known through calculation that the variance of the luminance values of all pixels in the depth macroblock Bk is 838.

3、比较方差和预设阈值的大小，如果方差大于预设阈值，则判断深度宏块包含不连续的运动向量场。例如，该预设阈值为T_S=100，则B_k所有像素点的亮度值的方差大于T_S，因此判断深度宏块包含不连续的运动向量场。3. Compare the magnitude of the variance with the preset threshold, and if the variance is greater than the preset threshold, it is determined that the depth macroblock contains a discontinuous motion vector field. For example, if the preset threshold is T _S =100, then the variance of the luminance values of all pixels in B _k is greater than T _S , so it is determined that the depth macroblock contains a discontinuous motion vector field.

根据深度宏块的所有像素点的亮度信息从划分集合中选择最优划分线对所述深度宏块进行划分以得到第一和第二深度子区域的步骤进一步包括：Selecting an optimal dividing line from the dividing set according to the luminance information of all pixels of the depth macroblock to divide the depth macroblock to obtain the first and second depth sub-regions further includes:

1、分别以划分集合中的每一条划分线对深度宏块进行划分，并分别计算划分所得的两部分中各自的所有像素点的平均亮度值以得到对应的第一部分和第二部分平均亮度值。例如，编码器通过遍历集合G中的每一条划分线来搜索B_k所对应的最优划分线。在遍历过程中，对每一条划分线ξ(ρ_i,θ_i)，将B_k划分成两部分。计算两个部分中各自所有像素点的平均亮度值，例如当参数ρ和参数θ的取值均为0时，此时划分线为位于宏块B_k中心的垂直线段。该划分线将宏块B_k分成左右大小相等的两部分，P¹(ρ₀,θ₀)和P²(ρ₀,θ₀)。此时，区域P¹(ρ₀,θ₀)中所有像素点的平均亮度值为40.2；区域P²(ρ₀,θ₀)中所有像素点的平均亮度值为68.1。1. Divide the depth macroblock with each division line in the division set, and calculate the average luminance values of all the pixels in the two divided parts respectively to obtain the corresponding average luminance values of the first part and the second part . For example, the encoder searches for the optimal dividing line corresponding to B _k by traversing each dividing line in the set G. During the traversal process, for each dividing line ξ(ρ _i ,θ _i ), divide B _k into two parts. Calculate the average luminance values of all pixels in the two parts, for example, when the values of the parameters ρ and θ are both 0, the division line is a vertical line segment at the center of the macroblock B _k . The division line divides the macroblock B _k into two parts, P ¹ (ρ ₀ ,θ ₀ ) and P ² (ρ ₀ ,θ ₀ ), which are equal in size on the left and right. At this time, the average luminance value of all pixels in the area P ¹ (ρ ₀ , θ ₀ ) is 40.2; the average luminance value of all pixels in the area P ² (ρ ₀ , θ ₀ ) is 68.1.

2、分别对每一条划分线对应的第一部分和第二部分的平均亮度值做差得到差值。根据步骤1中的示例，P¹(ρ₀,θ₀)和P²(ρ₀,θ₀)之间的平均亮度值做差结果为：d=40.2-68.1=-27.92. The difference is obtained by making a difference between the average luminance values of the first part and the second part corresponding to each dividing line. According to the example in step 1, the difference between the average brightness values between P ¹ (ρ ₀ ,θ ₀ ) and P ² (ρ ₀ ,θ ₀ ) is: d=40.2-68.1=-27.9

3、获取使差值的绝对值最大的划分线作为所述最优划分线，以对所述深度宏块进行划分以得到第一和第二深度子区域。根据步骤2中的示例，差值的绝对值为：Δd=|40.2-68.1|=27.9。3. Obtain a dividing line that maximizes the absolute value of the difference as the optimal dividing line, so as to divide the depth macroblock to obtain the first and second depth sub-regions. According to the example in step 2, the absolute value of the difference is: Δd=|40.2-68.1|=27.9.

同理，对划分线集合G中的所有划分线均以步骤1-3的方式求取其划分后对应的深度划分差，将获得最大深度划分差的划分线作为所述当前宏块最优的划分线，最优划分线记为ξ(ρ^opt,θ^opt)。在本实施例中，对于深度宏块B_k的最大深度划分差为Δd_max=65.6。其对应的最优划分线ξ(ρ^opt,θ^opt)的参数取值分别为ρ^opt=4， In the same way, for all the division lines in the division line set G, the corresponding depth division difference after division is obtained in the manner of steps 1-3, and the division line with the largest depth division difference is taken as the optimal value of the current macroblock. The dividing line, the optimal dividing line is denoted as ξ(ρ ^opt ,θ ^opt ). In this embodiment, the maximum depth division difference for the depth macroblock B _k is Δd _max =65.6. The parameters of the corresponding optimal dividing line ξ(ρ ^opt ,θ ^opt ) are respectively ρ ^opt =4,

根据上述的示例，分别对第一和第二深度子区域进行运动估计以获取对应的第三率失真代价值，具体为：对B_k采用最优的划分线ξ(ρ^opt,θ^opt)划分成第一和第二深度子区域，记为P¹(ρ^opt,θ^opt)和P²(ρ^opt,θ^opt)。然后对第一和第二深度子区域分别进行运动估计，并计算其对应的率失真代价，在该示例中，B_k在几何划分编码模式下的第三率失真代价为According to the above example, perform motion estimation on the first and second depth sub-regions to obtain the corresponding third rate-distortion cost value, specifically: use the optimal dividing line ξ(ρ ^opt ,θ ^opt ) to divide B _k into first and second depth sub-regions, denoted as P ¹ (ρ ^opt ,θ ^opt ) and P ² (ρ ^opt ,θ ^opt ). Then perform motion estimation on the first and second depth sub-regions respectively, and calculate their corresponding rate-distortion costs. In this example, the third rate-distortion cost of B _k in the geometric partition coding mode is

${J J}_{k k}^{GEO GEOs} = = 14381438 . .$

步骤S105，比较不同编码模式下深度宏块的率失真代价值，以根据比较结果选择率失真代价最小的编码模式对深度宏块进行编码。具体而言，包括如下步骤：Step S105 , comparing the rate-distortion cost values of the depth macroblocks in different coding modes, so as to encode the depth macroblocks in the coding mode with the smallest rate-distortion cost according to the comparison result. Specifically, it includes the following steps:

1、如果判断不对深度宏块进行帧间编码，则仅对深度宏块进行帧内编码。1. If it is determined that inter-coding is not performed on the depth macroblock, only intra-coding is performed on the depth macroblock.

2、如果判断深度宏块进行帧间编码且没有包含不连续的运动向量场，则根据第一和第二率失真代价值的比较结果选择较小的率失真代价值对应的帧间编码模式或者帧内编码模式对深度宏块进行编码。2. If it is judged that the depth macroblock is inter-frame coded and does not contain a discontinuous motion vector field, then select the inter-frame coding mode corresponding to the smaller rate-distortion cost value according to the comparison result of the first and second rate-distortion cost values or Intra coding mode codes depth macroblocks.

3、如果判断深度宏块进行帧间编码且包含不连续的运动向量场，则根据第一至第三率失真代价值的比较结果选择最小的率失真代价值对应的帧间编码模式、帧内编码模式或者几何划分编码模式对深度宏块进行编码。3. If it is judged that the depth macroblock is inter-coded and contains discontinuous motion vector fields, then select the inter-frame coding mode and intra-frame coding mode corresponding to the smallest rate-distortion cost value according to the comparison results of the first to third rate-distortion cost values. A coding mode or a geometric partition coding mode encodes depth macroblocks.

作为一个具体的示例，以上述的B_k为例：对B_k进行基于率失真优化的模式选择。在本实施例中，由于编码器对B_k进行了几何划分模式的预测，因此需要在几何划分编码模式，帧间编码模式、帧内编码模式这三种模式中选择率失真代价最小的模式作为B_k的最优编码模式。对于B_k，最小的率失真代价为： $\min (J_{k}^{GEO}, J_{k}^{inter}, J_{k}^{intra}) = J_{k}^{GEO} = 1438 .$ As a specific example, take the above-mentioned B _k as an example: the rate-distortion optimization-based mode selection is performed on B _k . In this embodiment, since the encoder predicts the geometric partition mode of _Bk , it is necessary to select the mode with the smallest rate-distortion cost among the three modes of geometric partition coding mode, inter-frame coding mode, and intra-frame coding mode as The optimal coding mode for B _k . For B _k , the minimum rate-distortion penalty is: $\min (J_{k}^{GEOs}, J_{k}^{inter}, J_{k}^{intra}) = J_{k}^{GEOs} = 1438 .$

因此，B_k的最优编码模式为几何划分编码模式。则该B_k采用几何划分编码模式进行编码。Therefore, the optimal coding mode of B _k is the geometric partition coding mode. Then the B _k is coded in the geometric partition coding mode.

参见图3，作为一个具体的例子，本发明实施例的深度图编码方法可按照如下步骤进行：Referring to FIG. 3, as a specific example, the depth map encoding method in the embodiment of the present invention can be performed according to the following steps:

步骤S110，设置几何划分模式中的划分线集合。Step S110, setting a set of dividing lines in the geometrical dividing mode.

步骤S120，开始编码当前宏块B_k。Step S120, start encoding the current macroblock B _k .

步骤S130，判断B_k是否为帧内编码宏块，如果是则转至步骤S140，否则转至步骤S220。Step S130, judging whether B _k is an intra-coded macroblock, if yes, go to step S140, otherwise go to step S220.

步骤S140，对B_k进行帧内预测编码并计算该模式下的率失真代价，并转入步骤S150。Step S140, perform intra-frame predictive coding on B _k and calculate the rate-distortion cost in this mode, and turn to step S150.

步骤S150，判断B_k是否进行帧间模式编码，如果是则转至步骤S160，否则转至步骤S200。Step S150, judging whether B _k is encoded in the inter-frame mode, if yes, go to step S160, otherwise go to step S200.

步骤S160，对B_k采用传统帧间预测模式进行编码并计算该模式下的率失真代价，并转入步骤S170。In step S160, encode B _k in a conventional inter-frame prediction mode and calculate the rate-distortion cost in this mode, and proceed to step S170.

步骤S170，判断B_k是否包含不连续运动场。如果是则转至步骤S180，否则转至步骤S200。Step S170, judging whether B _k contains a discontinuous motion field. If yes, go to step S180, otherwise go to step S200.

步骤S180，根据B_k中每个像素的亮度信息搜索对应的最优划分线。Step S180, searching for the corresponding optimal dividing line according to the brightness information of each pixel in _Bk .

步骤S190，对最优划分线所划分成的两个部分进行运动估计和编码，并计算其对应的率失真代价。Step S190, perform motion estimation and encoding on the two parts divided by the optimal dividing line, and calculate the corresponding rate-distortion cost.

步骤S200，选取率失真代价最小的模式作为B_k的最优编码模式。Step S200, select the mode with the smallest rate-distortion cost as the optimal coding mode of B _k .

步骤S210，采用最优模式编码B_k。Step S210, using the optimal mode to encode B _k .

步骤S220，判断B_k是否为当前帧的最后一个宏块。如果是则编码结束，否则转至步骤S120中以便对下一个深度宏块进行编码。Step S220, judging whether B _k is the last macroblock of the current frame. If yes, the encoding ends; otherwise, go to step S120 to encode the next depth macroblock.

参考图6，本发明进一步实施例提出了一种深度图编码装置600，包括划分线建立模块610、帧内编码模块620、帧间编码判断模块630、帧间编码模块640，几何划分判断模块650、几何划分编码模块660和编码模式选择模块670。其中：Referring to FIG. 6 , a further embodiment of the present invention proposes a depth map encoding device 600, including a division line establishment module 610, an intra-frame encoding module 620, an inter-frame encoding judgment module 630, an inter-frame encoding module 640, and a geometric division judgment module 650. , a geometric partition encoding module 660 and an encoding mode selection module 670. in:

划分线建立模块610用于建立多个划分线并组成划分集合，其中，多个划分线用于对深度宏块进行楔形划分。帧内编码模块620用于以帧内编码模式对深度宏块进行编码以获取对应的第一率失真代价值。帧间编码判断模块630用于判断是否以帧间编码模式对所述深度宏块进行编码。帧间编码模块640用于在帧间编码判断模块630判断以帧间编码模式对深度宏块进行编码时以帧间编码模式对深度宏块进行编码以获取对应的第二率失真代价值。几何划分判断模块650用于判断深度宏块是否包含不连续的运动向量场。几何划分编码模块660用于在几何划分判断模块650判断深度宏块包含不连续的运动向量场时，以几何划分编码模式对深度宏块进行编码，其中，几何划分编码模式包括：根据深度宏块的所有像素点的亮度信息从所述划分集合中选择最优划分线对深度宏块进行划分以得到第一和第二深度子区域，且分别对第一和第二深度子区域进行运动估计和预测编码以获取对应的第三率失真代价值。编码模式选择模块670用于比较深度宏块在不同编码模式下的率失真代价值，以根据比较结果选择率失真代价最小的编码模式以便对所述深度宏块进行编码。The dividing line establishment module 610 is used to establish a plurality of dividing lines and form a division set, wherein the plurality of dividing lines are used for wedge-shaped division of depth macroblocks. The intra-frame coding module 620 is used for coding the depth macroblock in an intra-frame coding mode to obtain a corresponding first rate-distortion cost value. The inter-frame coding judging module 630 is used for judging whether to code the depth macroblock in an inter-frame coding mode. The inter coding module 640 is configured to code the depth macroblock in the inter coding mode to obtain the corresponding second rate-distortion cost when the inter coding judging module 630 determines to code the depth macro block in the inter coding mode. The geometric division judging module 650 is used for judging whether the depth macroblock contains discontinuous motion vector fields. The geometric partition coding module 660 is used to code the depth macroblock in the geometric partition coding mode when the geometric partition judging module 650 judges that the depth macroblock contains a discontinuous motion vector field, wherein the geometric partition coding mode includes: according to the depth macroblock The luminance information of all pixels in the division set is selected from the division set to select the optimal division line to divide the depth macroblock to obtain the first and second depth sub-regions, and perform motion estimation and Predictive encoding to obtain a corresponding third rate-distortion cost value. The encoding mode selection module 670 is used to compare the rate-distortion cost values of the depth macroblocks in different encoding modes, so as to select the encoding mode with the smallest rate-distortion cost according to the comparison result to encode the depth macroblocks.

在一些示例中，划分集合为：In some examples, the set is partitioned as:

G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，G={ξ(ρ _i ,θ _i )|i=1,2,…,L},

在一些示例中，深度宏块至多个划分线的距离均位于[0,8)之间，多个划分线与第一方向的夹角均位于[0,2π)之间。In some examples, the distances from the depth macroblock to the multiple dividing lines are all between [0, 8), and the angles between the multiple dividing lines and the first direction are all between [0, 2π).

在一些示例中，几何划分判断模块650用于获取深度宏块中所有像素点的亮度值，并计算深度宏块中所有像素点的亮度值的方差，并比较方差和预设阈值的大小，如果方差大于所述预设阈值，则判断深度宏块包含不连续的运动向量场。In some examples, the geometric division judging module 650 is used to obtain the brightness values of all pixels in the depth macroblock, and calculate the variance of the brightness values of all pixels in the depth macroblock, and compare the size of the variance with the preset threshold, if If the variance is greater than the preset threshold, it is determined that the depth macroblock contains a discontinuous motion vector field.

进一步地，几何划分编码模块660用于分别以所述划分集合中的每一条划分线对所述深度宏块进行划分，并分别计算划分所得的两部分中各自的所有像素点的平均亮度值以得到对应的第一部分和第二部分的平均亮度值，分别对每一条划分线对应的第一部分和第二部分的平均亮度值做差得到差值，并获取使差值的绝对值最大的划分线作为所述最优划分线，以对所述深度宏块进行划分以得到第一和第二深度子区域。Further, the geometric partition encoding module 660 is used to divide the depth macroblock with each division line in the division set, and calculate the average luminance values of all the pixels in the two divided parts to obtain Get the corresponding average luminance values of the first part and the second part, respectively make a difference between the average luminance values of the first part and the second part corresponding to each dividing line, and obtain the dividing line that maximizes the absolute value of the difference As the optimal division line, the depth macroblock is divided to obtain the first and second depth sub-regions.

在一些示例中，帧内编码模块620用于根据H.264/MPEG-4 AVC编码标准中的帧内编码模式对所述深度宏块进行帧内编码，并将对应的最优率失真代价作为所述第一率失真代价值。In some examples, the intra-frame coding module 620 is configured to perform intra-frame coding on the depth macroblock according to the intra-frame coding mode in the H.264/MPEG-4 AVC coding standard, and use the corresponding optimal rate-distortion cost as The first rate-distortion cost value.

在一些示例中，帧间编码模块640用于根据H.264/MPEG-4 AVC编码标准中的帧间编码模式对所述深度宏块进行帧间编码，并将对应的最优率失真代价作为所述第二率失真代价值。In some examples, the inter-frame coding module 640 is configured to perform inter-frame coding on the depth macroblock according to the inter-frame coding mode in the H.264/MPEG-4 AVC coding standard, and use the corresponding optimal rate-distortion cost as the second rate-distortion cost value.

在一些示例中，编码模式选择模块670用于在不对所述深度宏块进行帧间编码时，仅选择帧内编码模式，在所述深度宏块进行帧间编码且没有包含不连续的运动向量场时，根据第一和第二率失真代价值的比较结果选择较小的率失真代价值对应的帧间编码模式或者帧内编码模式，在所述深度宏块进行帧间编码且包含不连续的运动向量场时，根据第一至第三率失真代价值的比较结果选择最小的率失真代价值对应的帧间编码模式、帧内编码模式或者几何划分编码模式以便对所述深度宏块进行编码。In some examples, the encoding mode selection module 670 is configured to only select an intra encoding mode when the depth macroblock is not inter-encoded, and the depth macroblock is inter-encoded and does not contain discontinuous motion vectors In the frame time, according to the comparison result of the first and second rate-distortion cost values, select the inter-frame coding mode or the intra-frame coding mode corresponding to the smaller rate-distortion cost value, and perform inter-frame coding on the depth macroblock and include discontinuity When the motion vector field is selected, according to the comparison results of the first to third rate-distortion cost values, select the inter-frame coding mode, intra-frame coding mode or geometric partition coding mode corresponding to the smallest rate-distortion cost value so as to carry out the depth macroblock coding.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施例，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同限定。Although the embodiments of the present invention have been shown and described, those skilled in the art can understand that various changes, modifications, substitutions and modifications can be made to these embodiments without departing from the principle and spirit of the present invention. The scope of the invention is defined by the claims and their equivalents.

Claims

1. A depth map coding method, comprising:

establishing a plurality of dividing lines and forming a dividing set, wherein the dividing lines are used for carrying out wedge-shaped division on the depth macro block;

coding the depth macro block in an intra-frame coding mode to obtain a corresponding first rate distortion cost value;

judging whether the depth macro block is coded in an inter-frame coding mode, if so, coding the depth macro block in the inter-frame coding mode to obtain a corresponding second rate distortion cost value;

continuing to determine whether the depth macroblock contains a discontinuous motion vector field, and if the depth macroblock contains a discontinuous motion vector field and is inter-coded, coding the depth macroblock in a geometric partition coding mode, wherein the geometric partition coding mode further comprises: selecting an optimal dividing line from the dividing set according to the brightness information of all pixel points of the depth macro block to divide the depth macro block to obtain a first depth sub-area and a second depth sub-area, and respectively performing motion estimation and predictive coding on the first depth sub-area and the second depth sub-area to obtain a corresponding third rate distortion cost value; and

and comparing the rate distortion cost values of the depth macro blocks under different coding modes, and selecting the coding mode with the minimum rate distortion cost according to the comparison result to code the depth macro blocks.

2. The method of claim 1, wherein the partition set is:

G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，

where ξ (ρ)_i,θ_i) Is the ith dividing line, rho, in the dividing line set_iIs the distance, θ, of the depth macroblock to the dividing line_iIs the angle between the dividing line and the first direction, and L is the number of the dividing lines.

3. The method of claim 2, wherein distances from the depth macroblock to the plurality of dividing lines are all between [0,8), and angles from the first direction are all between [0,2 π).

4. The method of claim 1, wherein the step of determining whether the depth macroblock contains a discontinuous motion vector field further comprises:

acquiring the brightness values of all pixel points in the depth macro block;

calculating the variance of the brightness values of all pixel points in the depth macro block; and

and comparing the variance with a preset threshold, and if the variance is greater than the preset threshold, judging that the depth macro block contains a discontinuous motion vector field.

5. The depth map encoding method of claim 1, wherein the step of selecting an optimal partition line from the partition set to partition the depth macroblock into the first and second depth sub-regions according to luminance information of all pixel points of the depth macroblock further comprises:

dividing the depth macro block by each dividing line in the dividing set, and calculating the average brightness value of all pixel points in the two divided parts respectively to obtain the average brightness value of the corresponding first part and the second part;

respectively carrying out difference on the average brightness values of the first part and the second part corresponding to each dividing line to obtain a difference value; and

obtaining a dividing line that maximizes an absolute value of the difference as the optimal dividing line to divide the depth macroblock to obtain first and second depth sub-regions.

6. The method of claim 1, wherein the step of encoding the depth macroblock in intra coding mode to obtain the corresponding first rate-distortion cost value comprises:

carrying out intra-frame coding on the depth macro block according to an intra-frame coding mode in an H.264/MPEG-4AVC coding standard, and taking the corresponding optimal rate distortion cost as the first rate distortion cost value;

the step of encoding the depth macroblock in the inter-coding mode to obtain the corresponding second rate-distortion cost value comprises:

and performing inter-mode coding on the depth macro block according to an inter-coding mode in the H.264/MPEG-4AVC coding standard, and taking the corresponding optimal rate distortion cost as the second rate distortion cost value.

7. The method of claim 1, wherein the step of comparing the rate-distortion cost values of the depth macroblocks in different coding modes to select a coding mode with the lowest rate-distortion cost according to the comparison result to code the depth macroblock comprises:

if the depth macro block is judged not to be subjected to inter-frame coding, only the depth macro block is subjected to intra-frame coding;

if the depth macro block is judged to be inter-coded and does not contain discontinuous motion vector fields, selecting an inter-coding mode or an intra-coding mode corresponding to a smaller rate distortion cost value according to a comparison result of the first rate distortion cost value and the second rate distortion cost value to code the depth macro block; and

and if the depth macro block is judged to be inter-coded and comprises discontinuous motion vector fields, selecting an inter-coding mode, an intra-coding mode or a geometric partition coding mode corresponding to the minimum rate distortion cost value according to the comparison result of the first rate distortion cost value to the third rate distortion cost value to code the depth macro block.

8. A depth map encoding apparatus, comprising:

the dividing line establishing module is used for establishing a plurality of dividing lines and forming a dividing set, wherein the dividing lines are used for carrying out wedge-shaped division on the depth macro block;

the intra-frame coding module is used for coding the depth macro block in an intra-frame coding mode to obtain a corresponding first rate distortion cost value;

the inter-frame coding judging module is used for judging whether the depth macro block is coded in an inter-frame coding mode;

the inter-frame coding module is used for coding the depth macro block in the inter-frame coding mode to acquire a corresponding second rate distortion cost value when the inter-frame coding judging module judges that the depth macro block is coded in the inter-frame coding mode;

a geometric partitioning judgment module, configured to judge whether the depth macroblock includes a discontinuous motion vector field;

a geometric partition coding module, configured to code the depth macroblock in a geometric partition coding mode when the geometric partition determining module determines that the depth macroblock is inter-coded and contains a discontinuous motion vector field, where the geometric partition coding mode includes: selecting an optimal dividing line from the dividing set according to the brightness information of all pixel points of the depth macro block to divide the depth macro block to obtain a first depth sub-area and a second depth sub-area, and respectively performing motion estimation and predictive coding on the first depth sub-area and the second depth sub-area to obtain a corresponding third rate distortion cost value; and

and the coding mode selection module is used for comparing the rate distortion cost values of the depth macro block under different coding modes so as to select the coding mode with the minimum rate distortion cost according to the comparison result so as to code the depth macro block.

9. The depth map coding apparatus of claim 8, wherein the partition set is:

G={ξ(ρ_i,θ_i)|i＝1,2,…,L}，

10. The depth map coding apparatus of claim 9, wherein distances from the depth macroblock to the plurality of division lines are each between [0,8), and angles from the first direction are each between [0,2 pi ].

11. The apparatus according to claim 8, wherein the geometric partition determining module is configured to obtain luminance values of all pixels in the depth macroblock, calculate a variance of the luminance values of all pixels in the depth macroblock, compare the variance with a preset threshold, and determine that the depth macroblock includes a discontinuous motion vector field if the variance is greater than the preset threshold.

12. The depth map encoding device of claim 8, wherein the geometric partition encoding module is configured to divide the depth macro block by each partition line in the partition set, calculate average luminance values of all pixel points in the two divided portions to obtain average luminance values of the corresponding first portion and second portion, difference the average luminance values of the first portion and second portion corresponding to each partition line to obtain a difference value, and obtain the partition line with the largest absolute value of the difference value as the optimal partition line to divide the depth macro block to obtain the first and second depth sub-regions.

13. The depth map coding apparatus of claim 8,

the intra-frame coding module is used for carrying out intra-frame coding on the depth macro block according to an intra-frame coding mode in an H264/MPEG-4 AVC coding standard and taking the corresponding optimal rate distortion cost as the first rate distortion cost value;

and the inter-frame coding module is used for carrying out inter-frame mode coding on the depth macro block according to an inter-frame coding mode in the H264/MPEG-4 AVC coding standard and taking the corresponding optimal rate distortion cost as the second rate distortion cost value.

14. The depth map encoding apparatus of claim 8, wherein the encoding mode selection module is configured to select only an intra-frame encoding mode when the depth macroblock is not inter-encoded, select an inter-frame encoding mode or an intra-frame encoding mode corresponding to a smaller rate-distortion cost value according to a comparison result between a first rate-distortion cost value and a second rate-distortion cost value when the depth macroblock is inter-encoded and does not include a discontinuous motion vector field, and select an inter-frame encoding mode, an intra-frame encoding mode, or a geometric partition encoding mode corresponding to a smallest rate-distortion cost value according to a comparison result between the first rate-distortion cost value and the third rate-distortion cost value when the depth macroblock is inter-encoded and includes a discontinuous motion vector field.