CN115379240A

CN115379240A - A MMVD prediction method and system for video coding based on direction extension

Info

Publication number: CN115379240A
Application number: CN202210792570.3A
Authority: CN
Inventors: 蒋先涛; 张纪庄; 郭咏梅; 郭咏阳
Original assignee: Kangda Intercontinental Medical Devices Co ltd
Current assignee: Kangda Intercontinental Medical Devices Co ltd
Priority date: 2022-07-07
Filing date: 2022-07-07
Publication date: 2022-11-22

Abstract

The invention discloses a video coding MMVD prediction method and system based on direction extension, which relates to the technical field of image processing, comprising the steps of: acquiring a motion vector candidate list based on a Merge mode; selecting an initial motion in the motion vector candidate list according to a preset selection rule Vector; set the starting point with the initial motion vector as the benchmark, and use the regular hexagon as the frame to expand the motion vector search direction; take the starting point as the origin, and perform the motion vector search under the reduced step size range according to the expanded search directions; Calculate the rate-distortion cost under the minimum rate-distortion criterion according to the motion vector set obtained by searching; select the motion vector with the smallest rate-distortion cost as the best prediction motion vector of the current prediction unit. In the present invention, the search directions in the basic MMVD technology of the prior art are expanded to six based on the regular hexagonal frame, which can better predict object motion information and obtain better coding effects.

Description

A MMVD prediction method and system for video coding based on direction extension

技术领域technical field

本发明涉及图像处理技术领域，具体涉及一种基于方向扩展的视频编码MMVD预测方法与系统。The present invention relates to the technical field of image processing, in particular to a video coding MMVD prediction method and system based on direction extension.

背景技术Background technique

H.266/VVC是由ITU-T视频编码专家组（VCEG）和ISO/IEC运动图像专家组（MPEG）共同协作，历经近4年完善，于2020年7月联合敲定的最新国际视频编码标准。随着视频分辨率的提高，VVC作为最先进的视频编码标准，与其前身高效视频编码（HEVC）标准相比，相同画质情况下，VVC可以节省50%左右编码码率。为提高帧间预测的准确性，VVC的帧间模式不仅对原本的Merge模式做了优化扩展调整，而且增添了一些在之前视频编码标准中没有出现过的帧间预测工具，如扩展的Merge模式（Extended merge prediction）、仿射运动补偿预测（Affine motion compensated prediction），双向光流技术（Bi-directional opticalflow）和带运动矢量差的合并模式（Merge mode with MVD，MMVD）等。但是，这些新提出的工具仍有许多不完善地方。H.266/VVC is the latest international video coding standard jointly finalized in July 2020 by ITU-T Video Coding Experts Group (VCEG) and ISO/IEC Moving Picture Experts Group (MPEG) after nearly 4 years of improvement. . With the improvement of video resolution, VVC is the most advanced video coding standard. Compared with its predecessor High Efficiency Video Coding (HEVC) standard, under the same picture quality, VVC can save about 50% of the coding bit rate. In order to improve the accuracy of inter-frame prediction, VVC's inter-frame mode not only optimizes and expands the original Merge mode, but also adds some inter-frame prediction tools that have not appeared in previous video coding standards, such as the extended Merge mode (Extended merge prediction), Affine motion compensated prediction (Affine motion compensated prediction), Bi-directional optical flow technology (Bi-directional optical flow) and merge mode with motion vector difference (Merge mode with MVD, MMVD), etc. However, these newly proposed tools still have many imperfections.

现如今，随着科技的发展、物体运动复杂化，设备移动灵活性不断增加，物体能够进行运动的方向也逐渐增多，仅针对上下左右四个方向上的平移预测已经很难准确预测大多数物体运动的情况，故此需要设计一种新型能预测更多物体运动特性的方向模型来进行优化。Nowadays, with the development of science and technology and the complexity of object movement, the flexibility of equipment movement is increasing, and the directions in which objects can move are gradually increasing. It is difficult to accurately predict most objects only for translation prediction in the four directions of up, down, left, and right. Therefore, it is necessary to design a new type of direction model that can predict the motion characteristics of more objects for optimization.

发明内容Contents of the invention

为了更好的适应视频变化的需求，同时避免码率的增加，本发明提出了一种基于方向扩展的视频编码MMVD预测方法，包括步骤：In order to better adapt to the needs of video changes, while avoiding the increase of bit rate, the present invention proposes a video coding MMVD prediction method based on direction extension, including steps:

S1：基于Merge模式获取运动矢量候选列表；S1: Obtain a motion vector candidate list based on the Merge mode;

S2：根据预设选取规则选取运动矢量候选列表中的初始运动矢量；S2: Select an initial motion vector in the motion vector candidate list according to a preset selection rule;

S3：以初始运动矢量为基准进行起点的设置，并以正六边形为框架进行运动矢量搜索方向扩展；S3: Set the starting point based on the initial motion vector, and use the regular hexagon as the frame to expand the search direction of the motion vector;

S4：以起点为原点，根据扩展后的各搜索方向进行步长范围缩小下的运动矢量搜索；S4: With the starting point as the origin, search for motion vectors with a reduced step size range according to the expanded search directions;

S5：根据搜索获得的运动矢量集进行最小率失真准则下的率失真代价计算；S5: Calculate the rate-distortion cost under the minimum rate-distortion criterion according to the motion vector set obtained by searching;

S6：选取率失真代价最小的运动矢量作为当前预测单元的最佳预测运动矢量。S6: Select the motion vector with the smallest rate-distortion cost as the best predicted motion vector of the current prediction unit.

进一步地，所述S3步骤中，起点为初始运动矢量在当前帧间图像的前一帧方向和后一帧方向的指向位置。Further, in the step S3, the starting point is the pointing position of the initial motion vector in the direction of the previous frame and the direction of the next frame of the current inter-frame image.

进一步地，所述S3步骤中，正六边形为水平方向上的正六边形，起点为正六边形的中心点。Further, in the step S3, the regular hexagon is a regular hexagon in the horizontal direction, and the starting point is the center point of the regular hexagon.

进一步地，所述S3步骤中，扩展后的搜索方向为正六边形中心点与正六边形上各角点的连线方向。Further, in the step S3, the expanded search direction is the direction of the line connecting the center point of the regular hexagon and each corner point of the regular hexagon.

进一步地，所述S4步骤中，运动矢量搜索是在初始像素大小到预设像素大小的步长范围内，进行步长2倍数递增的运动矢量搜索。Further, in the step S4, the motion vector search is performed within the step size range from the initial pixel size to the preset pixel size, and the step size is increased by a multiple of 2 to search for the motion vector.

本发明还提出了一种基于方向扩展的视频编码MMVD预测系统，包括：The present invention also proposes a video coding MMVD prediction system based on direction extension, including:

初始选取单元，用于基于Merge模式获取运动矢量候选列表，并根据预设选取规则选取运动矢量候选列表中的初始运动矢量；An initial selection unit, configured to obtain a motion vector candidate list based on the Merge mode, and select an initial motion vector in the motion vector candidate list according to a preset selection rule;

方向扩展单元，用于以初始运动矢量为基准进行起点的设置，并以正六边形为框架进行运动矢量搜索方向扩展；The direction expansion unit is used to set the starting point based on the initial motion vector, and to expand the motion vector search direction with the regular hexagon as the frame;

矢量搜索单元，用于以起点为原点，根据扩展后的各搜索方向进行步长范围缩小下的运动矢量搜索；The vector search unit is used for taking the starting point as the origin, and performing the motion vector search under the reduced step size range according to each search direction after expansion;

率失真计算单元，用于根据搜索获得的运动矢量集进行最小率失真准则下的率失真代价计算；A rate-distortion calculation unit, configured to calculate the rate-distortion cost under the minimum rate-distortion criterion according to the motion vector set obtained by searching;

矢量选取单元，用于选取率失真代价最小的运动矢量作为当前预测单元的最佳预测运动矢量。The vector selection unit is configured to select the motion vector with the smallest rate-distortion cost as the best prediction motion vector of the current prediction unit.

进一步地，所述方向扩展单元中，起点为初始运动矢量在当前帧间图像的前一帧方向和后一帧方向的指向位置。Further, in the direction extension unit, the starting point is the pointing position of the initial motion vector in the direction of the previous frame and the direction of the next frame of the current inter-frame image.

进一步地，所述方向扩展单元中，正六边形为水平方向上的正六边形，起点为正六边形的中心点。Further, in the direction extension unit, the regular hexagon is a regular hexagon in the horizontal direction, and the starting point is the center point of the regular hexagon.

进一步地，所述方向扩展单元中，扩展后的搜索方向为正六边形中心点与正六边形上各角点的连线方向。Further, in the direction extension unit, the extended search direction is the direction of the line connecting the center point of the regular hexagon and each corner point of the regular hexagon.

进一步地，所述矢量搜索单元中，运动矢量搜索是在初始像素大小到预设像素大小的步长范围内，进行步长2倍数递增的运动矢量搜索。Further, in the vector search unit, the motion vector search is a motion vector search in which the step size is increased by a multiple of 2 within the step size range from the initial pixel size to the preset pixel size.

与现有技术相比，本发明至少含有以下有益效果：Compared with the prior art, the present invention at least contains the following beneficial effects:

（1）本发明所述的一种基于方向扩展的视频编码MMVD预测方法与系统，在现有技术的基础上将MMVD技术中的搜索方向从4个扩展为6个，从而使之能够更好预测物体运动信息，以此来获得更优的编码效果；(1) The method and system for video coding MMVD prediction based on direction extension described in the present invention expands the search directions in MMVD technology from 4 to 6 on the basis of the prior art, so that it can better Predict object motion information to obtain better coding effect;

（2）在对搜索方向进行扩增的同时，通过步长搜索范围的缩减，降低由于方向扩增导致运动矢量搜索获得过多运动矢量引起的编码码率上升；(2) While expanding the search direction, through the reduction of the step size search range, the increase in the encoding bit rate caused by the motion vector search obtained by too many motion vectors due to the direction amplification is reduced;

（3）采用水平方向下的正六边形作为方向扩增的框架，保留了原十字模型水平方向运动预测的基础上，通过搜索方向的扩增进一步提高了相对垂直方向运动更剧烈的水平方向上的预测性能。(3) The regular hexagon in the horizontal direction is used as the framework for direction amplification. On the basis of retaining the horizontal movement prediction of the original cross model, the expansion of the search direction further improves the horizontal direction, which is more intense than the vertical movement. predictive performance.

附图说明Description of drawings

图1为一种基于方向扩展的视频编码MMVD预测方法的步骤图；Fig. 1 is a kind of step diagram of the video coding MMVD prediction method based on direction expansion;

图2为一种基于方向扩展的视频编码MMVD预测系统的结构图；Fig. 2 is a kind of structural diagram of the video coding MMVD prediction system based on direction expansion;

图3为水平方向上正六边形运动矢量搜索方向示意图；Fig. 3 is a schematic diagram of the search direction of a regular hexagonal motion vector in the horizontal direction;

图4为垂直方向上正六边形运动矢量搜索方向示意图。Fig. 4 is a schematic diagram of a search direction of a regular hexagonal motion vector in the vertical direction.

具体实施方式Detailed ways

以下是本发明的具体实施例并结合附图，对本发明的技术方案作进一步的描述，但本发明并不限于这些实施例。The following are specific embodiments of the present invention and in conjunction with the accompanying drawings, the technical solutions of the present invention are further described, but the present invention is not limited to these embodiments.

实施例一Embodiment one

HEVC编码标准中首次提出了名为Merge的帧间预测技术。通过为当前的预测单元（PU）块建立一个大小为5的运动矢量（MV）候选列表，利用空域和时域上运动矢量的相关性，对空域相临编码块和时域相临编码块进行运动信息的分析，获取候选运动矢量，继而进行帧间预测。而VVC在HEVC的基础上进行了改进，对Merge模式进行了优化。首先，扩展了运动矢量候选列表的长度，由最初的5个候选运动矢量扩展到6个候选运动矢量；然后对Merge技术做出了改进，保留了对当前与预测单元进行时域和空域相临编码块的候选运动向量选取方法，同时增加了几种新的帧间候选列表选择方式用以填补候选列表，如：基于历史信息构建MVP（HMVP）、逐对的平均MVP；最后，依旧利用最小率失真（RD）准则进行率失真代价计算，比较后选择一个最优MV作为当前PU最佳预测MV。An inter-frame prediction technology called Merge was proposed for the first time in the HEVC coding standard. By establishing a motion vector (MV) candidate list with a size of 5 for the current prediction unit (PU) block, using the correlation of motion vectors in the space domain and the time domain, the adjacent coding blocks in the spatial domain and the adjacent coding blocks in the temporal domain are processed. Analysis of motion information, obtaining candidate motion vectors, and then performing inter-frame prediction. VVC has improved on the basis of HEVC and optimized the Merge mode. First, the length of the motion vector candidate list is extended from the initial 5 candidate motion vectors to 6 candidate motion vectors; then the Merge technology is improved, and the time domain and space domain adjacent to the current prediction unit are retained. The candidate motion vector selection method of the coding block, and several new inter-frame candidate list selection methods are added to fill the candidate list, such as: constructing MVP (HMVP) based on historical information, pairwise average MVP; finally, still using the minimum The rate-distortion (RD) criterion is used to calculate the rate-distortion cost, and after comparison, an optimal MV is selected as the best predicted MV for the current PU.

然而，Merge模式获得的运动信息可能不准确，因为相临编码块的运动信息可能不是待预测编码块真正的运动趋势，所以MMVD技术被提出了。MMVD技术是在VVC编码标准中第一次被提出，是编码端和解码端都归属于Merge技术的一种帧间预测技术，其与Merge模式最大的差别就在于MMVD模式需要向解码端传递MVD。同时，考虑到算法效率的问题，目前暂且最多仅使用Merge候选列表中前四个候选作为初始运动向量。在原始的MMVD技术中，取1/4像素到32像素为步长范围进行运动矢量搜索，其步长随着搜索次数的增多而以2为倍数进行递增，达到32像素时停止增加；所以原始MMVD模式具体实现过程如下：首先基于Merge模式得到运动矢量候选列表，根据编码码率要求选取一定数量的初始运动矢量，以该初始运动矢量为基准，选取其在当前帧间图像的前一帧方向和后一帧方向的指向位置作为起点；其次，在起点上下左右四个方向上（也即是十字模型）进行可变步长的搜索；最后经过运动补偿得到预测值，通过预测值计算率失真代价，从而比较各搜索点位处的率失真代价，来获取最佳的一个运动向量、方向和步长的组合进行传递。However, the motion information obtained in Merge mode may be inaccurate, because the motion information of adjacent coding blocks may not be the real motion trend of the coding block to be predicted, so MMVD technology was proposed. The MMVD technology was proposed for the first time in the VVC coding standard. It is an inter-frame prediction technology that belongs to the Merge technology at the encoding end and the decoding end. The biggest difference between it and the Merge mode is that the MMVD mode needs to transmit MVD to the decoding end. . At the same time, considering the problem of algorithm efficiency, only the first four candidates in the Merge candidate list are used as the initial motion vector for the time being. In the original MMVD technology, the motion vector search is carried out with a step size ranging from 1/4 pixel to 32 pixels, and the step size increases with a multiple of 2 as the number of searches increases, and stops increasing when it reaches 32 pixels; so the original The specific implementation process of the MMVD mode is as follows: firstly, the motion vector candidate list is obtained based on the Merge mode, and a certain number of initial motion vectors are selected according to the coding bit rate requirements, and the direction of the previous frame of the current inter-frame image is selected based on the initial motion vector and the pointing position of the next frame direction as the starting point; secondly, search the variable step length in the four directions (that is, the cross model) of the starting point; finally, the predicted value is obtained through motion compensation, and the rate distortion is calculated through the predicted value cost, so as to compare the rate-distortion cost at each search point to obtain the best combination of motion vector, direction and step size for transmission.

MMVD技术相较于Merge模式已经有所改进，在运动矢量周围进行了精细化搜索。然而在实际生活中，由于物体运动灵活性不断提升，许多物体运动方向已经逐渐从简单的平移运动转向更复杂多变方向运动，仅仅是上下左右四个方向的平移运动难以满足日常视频编码需求。所以，为了更好处理物体复杂的运动，需要对MMVD技术进行搜索方向上的扩展，以更好的适应视频变化的需求。基于此，如图1所示，本发明提出了一种基于方向扩展的视频编码MMVD预测方法，包括步骤：Compared with the Merge mode, the MMVD technology has been improved, and a refined search is performed around the motion vector. However, in real life, due to the continuous improvement of the flexibility of object movement, the movement direction of many objects has gradually changed from simple translational movement to more complex and changeable direction movement. Only the translational movement in the four directions of up, down, left, and right cannot meet the daily video coding needs. Therefore, in order to better handle the complex motion of objects, the MMVD technology needs to be extended in the search direction to better adapt to the needs of video changes. Based on this, as shown in Figure 1, the present invention proposes a video coding MMVD prediction method based on direction extension, including steps:

对于更复杂多变方向运动的帧间图像编码，增加搜索方向显然是应对运动复杂化最好的方法，所以本发明先是想到了通过增加搜索方向的手段。在此基础上，考虑到运动矢量具有方向性，且在自然情况下，视频逐帧图像中图像水平运动的剧烈程度明显要高于垂直方向上的运动。根据该特性，在避免过多增加搜索方向的基础上，本发明提出以水平方向正六边形为框架进行运动矢量搜索方向的扩展。其中，如图3所示，S3步骤中所述的起点即为正六边形的中心点，扩展后的搜索方向为正六边形中心点与正六边形上各角点的连线方向，也即是以X轴正半轴为起始线，在0°、60°、120°、180°、240°、300°这六个方向上进行一定步长范围内的运动矢量搜索，以便获取更好的编码效果，并节省编码码率。如图4所示，与垂直方向正六边形为框架进行运动矢量搜索相比，水平方向上正六边形不但保留了原十字模型的水平方向上的运动预测，还将垂直方向上两个运动预测更替为四个指向不同方向的运动预测，更符合自然情况下图像运动的特性。For inter-frame image coding with more complex and changeable direction motion, increasing the search direction is obviously the best way to deal with the complexity of the motion, so the present invention first thinks of the means of increasing the search direction. On this basis, considering that the motion vector has directionality, and under natural circumstances, the intensity of horizontal motion in video frame-by-frame images is obviously higher than that in vertical direction. According to this characteristic, on the basis of avoiding too many search directions, the present invention proposes to expand the search direction of the motion vector with a regular hexagon in the horizontal direction as the frame. Wherein, as shown in Figure 3, the starting point described in the step S3 is the center point of the regular hexagon, and the search direction after expansion is the direction of the line connecting the center point of the regular hexagon and each corner point on the regular hexagon, that is Based on the positive semi-axis of the X-axis as the starting line, the motion vector search within a certain step range is carried out in six directions of 0°, 60°, 120°, 180°, 240°, and 300° in order to obtain better The encoding effect and save the encoding bit rate. As shown in Figure 4, compared with the regular hexagon in the vertical direction for motion vector search, the regular hexagon in the horizontal direction not only retains the motion prediction in the horizontal direction of the original cross model, but also predicts two motions in the vertical direction. It is replaced by four motion predictions pointing in different directions, which is more in line with the characteristics of image motion in natural situations.

可以看出，相较于原始的MMVD模型中仅结合常规十字模型的上下左右4个方向进行运动矢量的搜索，再根据搜索结果进行帧间预测最佳预测运动矢量的判定。本发明提出的以正六边形为框架进行的六向运动矢量搜索，覆盖范围更加广泛，因此预测结果更为准确，能够处理更灵活多变的图像运动情况。It can be seen that compared with the original MMVD model, the motion vector is searched only in four directions of the conventional cross model, up, down, left, and right, and then the inter-frame prediction optimal prediction motion vector is determined according to the search results. The six-direction motion vector search based on the regular hexagon proposed by the present invention has a wider coverage, so the prediction result is more accurate, and it can handle more flexible and changeable image motion situations.

由于MMVD模型中有前一帧和后一帧两个方向上的参考图像，而每个参考图像都扩展为四个方向，每个方向八个步长搜索点，这会导致搜索时算法的复杂度较高。而随着搜索方向的增加，如果依旧按照原始MMVD模型中的方式进行运动矢量搜索，可以预知的，搜索时计算的复杂度会再次增加。而降低算法复杂度最直接的方式就是减少搜索点数。搜索范围的大小直接影响搜索点数的多少，一个更大的搜索范围能够覆盖更多的搜索点，这在一定程度上可以提高解码后的视频质量。但过大的搜索范围会增加视频编码的计算复杂度，损失大量时间从而影响视频的实时性。而过小的搜索范围可能造成局部最优解的问题，无法很好的解决帧间预测的问题，所以搜索范围的大小需要在算法性能和算法时间复杂度上，根据实际使用需求进行平衡。Since there are reference images in two directions of the previous frame and the next frame in the MMVD model, and each reference image is expanded into four directions, and each direction has eight search points, which will lead to the complexity of the search algorithm higher degree. With the increase of the search direction, if the motion vector search is still performed in the original MMVD model, it can be predicted that the computational complexity of the search will increase again. The most direct way to reduce the complexity of the algorithm is to reduce the number of search points. The size of the search range directly affects the number of search points. A larger search range can cover more search points, which can improve the decoded video quality to a certain extent. However, an excessively large search range will increase the computational complexity of video encoding, and will lose a lot of time, thereby affecting the real-time performance of the video. A search range that is too small may cause problems with local optimal solutions, and cannot solve the problem of inter-frame prediction well. Therefore, the size of the search range needs to be balanced according to actual usage requirements in terms of algorithm performance and algorithm time complexity.

基于上述，为了降低算法的计算复杂度，本发明提出了缩减搜索步长范围的改进方式，在水平方向正六边形搜索方向扩增的基础上进行二次改进。根据运动的中心偏置特性，在原有的MMVD模型上保留了两个参考图像进行前一帧方向和后一帧方向两个不同方向帧间预测的基础上，缩小了搜索步长的范围。在一优选实施例中，让每个方向仅搜索四个搜索点，即仅搜索步长在1/4像素到2像素范围内的部分，以此来降低搜索复杂度。Based on the above, in order to reduce the computational complexity of the algorithm, the present invention proposes an improved way to reduce the range of the search step, and performs a secondary improvement on the basis of expanding the regular hexagonal search direction in the horizontal direction. According to the center bias characteristic of motion, on the basis of retaining two reference images in the original MMVD model for inter-frame prediction in two different directions of the previous frame direction and the next frame direction, the range of the search step is reduced. In a preferred embodiment, only four search points are searched in each direction, that is, only the part whose step size is within the range of 1/4 pixel to 2 pixels is searched, so as to reduce the search complexity.

MMVD技术在Skip和Merge模式中使用起始点、运动步长、运动方向三个参数对运动向量进行表示，它的候选列表选取过程也和VVC标准中的Merge运动矢量候选列表生成方式相同。本发明提出的MMVD在优化时保留了这些特性，仅在推测选择最终运动矢量过程上做了改进，增加了可选方向。本发明整个MMVD的帧间编码过程大致可以被分为三步：In the Skip and Merge modes, the MMVD technology uses three parameters: the starting point, the motion step size, and the motion direction to represent the motion vector. Its candidate list selection process is also the same as the Merge motion vector candidate list generation method in the VVC standard. The MMVD proposed by the present invention retains these characteristics during optimization, and only improves the process of guessing and selecting the final motion vector, adding optional directions. The interframe coding process of the whole MMVD of the present invention can roughly be divided into three steps:

Step1：基于Merge模式获取运动矢量候选列表，选择其中合适的运动矢量作为初始运动矢量（本实施例中为候选列表中的前两个运动矢量）；Step1: Acquire the motion vector candidate list based on the Merge mode, and select the appropriate motion vector as the initial motion vector (the first two motion vectors in the candidate list in this embodiment);

Step2：对选取的初始运动矢量在运动步长缩减和搜索方向扩展的情况下，构建新的MMVD运动矢量集；Step2: Construct a new MMVD motion vector set for the selected initial motion vector in the case of motion step reduction and search direction expansion;

Step3：基于最小率失真准则，在运动矢量集中进行最佳预测运动矢量的选取并执行帧间编码。Step3: Based on the minimum rate-distortion criterion, select the best predicted motion vector in the motion vector set and perform inter-frame coding.

其中，新的MMVD运动矢量集构建步骤如下：Among them, the new MMVD motion vector set construction steps are as follows:

首先，选择Merge运动矢量候选列表里的前两位进行检查后将其作为初始运动矢量。对于初始运动矢量，选择该运动矢量前后参考帧中所指向的位置作为起点进行搜索。然后，选择一个方向进行一定步长范围内的运动矢量搜索。在同一方向上，每次遇到一个步长点，形成一个候选运动矢量。重复该过程，直至对两个初始运动矢量分别6个方向4个步长遍历完成，得到相邻两个前后帧参考图像的各24个新的运动矢量作为运动矢量集。First, select the first two digits in the Merge motion vector candidate list to check and use them as the initial motion vector. For the initial motion vector, the positions pointed to in the reference frames before and after the motion vector are selected as the starting point for searching. Then, select a direction to search the motion vector within a certain step range. In the same direction, each time a step point is encountered, a candidate motion vector is formed. This process is repeated until the two initial motion vectors are traversed in 6 directions and 4 steps, respectively, and 24 new motion vectors of two adjacent frames of reference images before and after are obtained as motion vector sets.

实施例二Embodiment two

本发明的效果可通过以下仿真实验进一步说明：该实验基于H.266编码标准测试软件VTM，所用计算机配置为AMD Ryzen R75800H CPU，3.2GHz，运行内存为16GB，操作系统为Windows 10（64位），运行环境为Microsoft Visual Studio 2019。测试材料选自标准动态范围（SDR）视频的通用测试条件（CTC）。这里的测试序列被分为五个等级，分别为B、C、D、E、F，其中B类、C类和D类分别代表1920×1080、832×480和416×240分辨率视频。本次仿真测试条件模式为低延迟B（LPB）方式，使用四个量化参数（QP）22、27、32、37对标准测试视频序列进行编码。本实验所有的结果都是通过对每个视频序列进行50帧编码得到，编码性能是通过广泛采用的BD-Rate值来衡量的，而编解码时间由EncT、DecT表示。The effect of the present invention can be further illustrated by the following simulation experiment: the experiment is based on the H.266 encoding standard test software VTM, the computer used is configured as AMD Ryzen R75800H CPU, 3.2GHz, the running memory is 16GB, and the operating system is Windows 10 (64 bits) , the operating environment is Microsoft Visual Studio 2019. The test material was selected from Common Test Conditions (CTC) for Standard Dynamic Range (SDR) Video. The test sequences here are divided into five levels, namely B, C, D, E, and F, where B, C, and D represent 1920×1080, 832×480, and 416×240 resolution videos, respectively. The simulation test condition mode is low-latency B (LPB) mode, and four quantization parameters (QP) 22, 27, 32, 37 are used to encode the standard test video sequence. All the results of this experiment are obtained by encoding 50 frames of each video sequence. The encoding performance is measured by the widely used BD-Rate value, and the encoding and decoding time is represented by EncT and DecT.

BD-Rate值表示了图像质量PSNR一致情况下码率的变化情况。当其值为负数时，表示编码性能提升，节省了码率；否则，表示码率有所损失，性能下降。而EncT、DecT值表示了改进算法的总编解码时间占标准算法总编解码时间的比率。本次仿真实验结果如表3和表4所示：The BD-Rate value indicates the change of the bit rate when the image quality PSNR is consistent. When its value is negative, it means that the encoding performance is improved and the code rate is saved; otherwise, it means that the code rate is lost and the performance is degraded. The values of EncT and DecT represent the ratio of the total encoding and decoding time of the improved algorithm to the total encoding and decoding time of the standard algorithm. The results of this simulation experiment are shown in Table 3 and Table 4:

表3显示了所提出的方法在编码效率和编码复杂性方面的总体性能比较。Table 3 shows the overall performance comparison of the proposed methods in terms of coding efficiency and coding complexity.

表3：基于本发明所述方法不同序列的性能和时间结果Table 3: Performance and time results of different sequences based on the method of the present invention

表4：基于本发明所述方法不同序列性能结果Table 4: Performance results of different sequences based on the method of the present invention

本发明提出的方法总体上对MMVD技术编码性能进行了提高，同时对时间复杂度进行了平衡。从性能角度上进行分析，通过表3中YUV三列数据变化，YUV的大多数值都得到了降低；而从平均值上来看，平均的性能都得到了改善。在所有的测试序列中，亮度分量Y平均节约0.24%的码率，最高为Class D，节约了0.49%的码率，最低为Class B，也节约了0.15%的码率；而色度分量U能平均节约0.26%的码率，虽然在Class E中性能表现不是那么良好，但是对于Class B、Class D和Class F都得到了0.31%~0.44%的性能提升；色度分量V虽然仅平均节约0.01%的码率，但是单独看Class D得到了0.62%的性能提升，相对而言单项性能提升幅度是较大的；在性能提升的同时，对时间复杂度变化进行分析，从表3中EncT、DncT两列数据可以看出，编解码时间也有了轻微的降低，分别平均降低了6%和5%。而从分视频序列组的角度上来看，虽然对于视频尺寸较大的B、C两组，时间复杂度的结果不是很好，但是对于小尺寸测试序列，都提升了10%~15%的时间复杂度。The method proposed by the invention generally improves the coding performance of the MMVD technology, and at the same time balances the time complexity. From the perspective of performance, through the changes of the three columns of YUV data in Table 3, most of the values of YUV have been reduced; and from the average point of view, the average performance has been improved. In all test sequences, the luminance component Y saves an average of 0.24% of the code rate, the highest is Class D, saving 0.49% of the code rate, and the lowest is Class B, which also saves 0.15% of the code rate; while the chrominance component U It can save an average bit rate of 0.26%. Although the performance in Class E is not so good, it has achieved a performance improvement of 0.31%~0.44% for Class B, Class D and Class F; although the chroma component V only saves on average The code rate is 0.01%, but looking at Class D alone, it has achieved a performance improvement of 0.62%. Relatively speaking, the single performance improvement is relatively large; while the performance is improving, the change in time complexity is analyzed. From Table 3 EncT It can be seen from the two columns of data of DncT and DncT that the encoding and decoding time has also been slightly reduced, with an average reduction of 6% and 5% respectively. From the perspective of sub-video sequence groups, although the time complexity results are not very good for groups B and C with larger video sizes, but for small-sized test sequences, the time is increased by 10% to 15%. the complexity.

表4显示所提出方法在编码效率上分不同序列上的性能比较，其提供的数据是在表3基础上对性能YUV测试结果进行了细化，从表4中可以看出，亮度分量在Class F的亮度分量Y性能除了在ChinaSpeed序列中改善结果不那么显著，其余序列均有提升；通过分析Class E结果，可以发现本方案在FourPeople、KristenAndSara序列中仅对Y分量进行了部分提升，对于色度分量U和V，改善结果不太明显，而Class E组剩余RaceHorses序列对U分量的改善结果不明显；除此之外，其余几个测试序列组的YUV分量中基本上都有两个或两个以上的数值有所下降，意味着均获得编码增益，性能得到加强。值得注意的是，所提出的方法能在某些视频序列保持较为稳定的编码效率提升，如BasketballPass、BlowingBubbles、PartyScene，这些序列亮度分量和色度分量测试结果均得到了大于0.30%的增益。Table 4 shows the performance comparison of the proposed method in different sequences in terms of coding efficiency. The data provided is based on Table 3 and refines the performance YUV test results. It can be seen from Table 4 that the brightness component is in Class The Y performance of the luminance component of F is not so significant in the ChinaSpeed sequence, and the other sequences have improved; by analyzing the results of Class E, it can be found that this scheme only partially improves the Y component in the FourPeople and KristenAndSara sequences. Degree components U and V, the improvement results are not obvious, while the remaining RaceHorses sequences of the Class E group have no obvious improvement results on the U component; in addition, there are basically two or more YUV components in the remaining test sequence groups. A decrease of more than two values means that coding gain is obtained and performance is enhanced. It is worth noting that the proposed method can maintain a relatively stable coding efficiency improvement in some video sequences, such as BasketballPass, BlowingBubbles, and PartyScene. The test results of the luminance component and chrominance component of these sequences have achieved gains greater than 0.30%.

实施例三Embodiment Three

为了更好的对本发明的技术内容进行理解，本实施例通过系统间结构的形式来对本发明进行阐述，如图2所示，一种基于方向扩展的视频编码MMVD预测系统，包括：In order to better understand the technical content of the present invention, this embodiment illustrates the present invention in the form of an inter-system structure, as shown in Figure 2, a video coding MMVD prediction system based on direction extension, including:

进一步地，方向扩展单元中，起点为初始运动矢量在当前帧间图像的前一帧方向和后一帧方向的指向位置。Further, in the direction extension unit, the starting point is the pointing position of the initial motion vector in the direction of the previous frame and the direction of the next frame of the current inter-frame image.

进一步地，方向扩展单元中，正六边形为水平方向上的正六边形，起点为正六边形的中心点。Further, in the direction extension unit, the regular hexagon is a regular hexagon in the horizontal direction, and the starting point is the central point of the regular hexagon.

进一步地，方向扩展单元中，扩展后的搜索方向为正六边形中心点与正六边形上各角点的连线方向。Further, in the direction extension unit, the extended search direction is the direction of the line connecting the center point of the regular hexagon and each corner point of the regular hexagon.

进一步地，矢量搜索单元中，运动矢量搜索是在初始像素大小到预设像素大小的步长范围内，进行步长2倍数递增的运动矢量搜索。Further, in the vector search unit, the motion vector search is performed within the step size range from the initial pixel size to the preset pixel size, and the step size is increased by a multiple of 2.

综上所述，本发明所述的一种基于方向扩展的视频编码MMVD预测方法与系统，在现有技术的基础上将MMVD技术中的搜索方向从4个扩展为6个，从而使之能够更好预测物体运动信息，以此来获得更优的编码效果。在对搜索方向进行扩增的同时，通过步长搜索范围的缩减，降低由于方向扩增导致运动矢量搜索获得过多运动矢量引起的编码码率上升。In summary, a method and system for MMVD prediction of video coding based on direction extension described in the present invention expands the search directions in MMVD technology from 4 to 6 on the basis of the prior art, so that it can Better predict the motion information of the object, so as to obtain better coding effect. While amplifying the search direction, by reducing the search range of the step size, the encoding bit rate increase caused by too many motion vectors obtained by the motion vector search due to the direction amplification is reduced.

采用水平方向下的正六边形作为方向扩增的框架，保留了原十字模型水平方向运动预测的基础上，通过搜索方向的扩增进一步提高了相对垂直方向运动更剧烈的水平方向上的预测性能。The regular hexagon in the horizontal direction is used as the framework for direction amplification, and on the basis of retaining the horizontal movement prediction of the original cross model, the prediction performance in the horizontal direction, which is more severe than the vertical movement, is further improved through the expansion of the search direction .

需要说明，本发明实施例中所有方向性指示（诸如上、下、左、右、前、后……）仅用于解释在某一特定姿态（如附图所示）下各部件之间的相对位置关系、运动情况等，如果该特定姿态发生改变时，则该方向性指示也相应地随之改变。It should be noted that all directional indications (such as up, down, left, right, front, back...) in the embodiments of the present invention are only used to explain the relationship between the components in a certain posture (as shown in the figure). Relative positional relationship, movement conditions, etc., if the specific posture changes, the directional indication will also change accordingly.

另外，在本发明中如涉及“第一”、“第二”、“一”等的描述仅用于描述目的，而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本发明的描述中，“多个”的含义是至少两个，例如两个，三个等，除非另有明确具体的限定。In addition, in the present invention, descriptions such as "first", "second", "one" and so on are used for descriptive purposes only, and should not be understood as indicating or implying their relative importance or implicitly indicating the indicated technical features quantity. Thus, the features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present invention, "plurality" means at least two, such as two, three, etc., unless otherwise specifically defined.

在本发明中，除非另有明确的规定和限定，术语“连接”、“固定”等应做广义理解，例如，“固定”可以是固定连接，也可以是可拆卸连接，或成一体；可以是机械连接，也可以是电连接；可以是直接相连，也可以通过中间媒介间接相连，可以是两个元件内部的连通或两个元件的相互作用关系，除非另有明确的限定。对于本领域的普通技术人员而言，可以根据具体情况理解上述术语在本发明中的具体含义。In the present invention, unless otherwise specified and limited, the terms "connection" and "fixation" should be understood in a broad sense, for example, "fixation" can be a fixed connection, a detachable connection, or an integral body; It can be a mechanical connection or an electrical connection; it can be a direct connection or an indirect connection through an intermediary, and it can be an internal communication between two elements or an interaction relationship between two elements, unless otherwise clearly defined. Those of ordinary skill in the art can understand the specific meanings of the above terms in the present invention according to specific situations.

另外，本发明各个实施例之间的技术方案可以相互结合，但是必须是以本领域普通技术人员能够实现为基础，当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在，也不在本发明要求的保护范围之内。In addition, the technical solutions of the various embodiments of the present invention can be combined with each other, but it must be based on the realization of those skilled in the art. When the combination of technical solutions is contradictory or cannot be realized, it should be considered as a combination of technical solutions. Does not exist, nor is it within the scope of protection required by the present invention.

Claims

1. A video coding MMVD prediction method based on direction extension, characterized in that, comprising steps:

S1: Obtain a motion vector candidate list based on the Merge mode;

S2: Select an initial motion vector in the motion vector candidate list according to a preset selection rule;

S3: Set the starting point based on the initial motion vector, and use the regular hexagon as the frame to expand the search direction of the motion vector;

S4: With the starting point as the origin, search for motion vectors with a reduced step size range according to the expanded search directions;

S5: Calculate the rate-distortion cost under the minimum rate-distortion criterion according to the motion vector set obtained by searching;

S6: Select the motion vector with the smallest rate-distortion cost as the best predicted motion vector of the current prediction unit.

2. a kind of video coding MMVD prediction method based on direction extension as claimed in claim 1, is characterized in that, in described S3 step, starting point is initial motion vector in the previous frame direction of current inter-frame image and next frame The pointing position of the direction.

3. a kind of video coding MMVD prediction method based on direction extension as claimed in claim 1, is characterized in that, in described S3 step, regular hexagon is the regular hexagon on horizontal direction, and starting point is the center of regular hexagon point.

4. a kind of video coding MMVD prediction method based on direction extension as claimed in claim 3, it is characterized in that, in described S3 step, the search direction after extension is each corner point on regular hexagon central point and regular hexagon direction of connection.

5. a kind of video coding MMVD prediction method based on direction extension as claimed in claim 1, is characterized in that, in described S4 step, motion vector search is in the step size range of initial pixel size to preset pixel size, Carry out the motion vector search with the step size increasing by 2 times.

6. A video coding MMVD prediction system based on direction extension, characterized in that, comprising:

An initial selection unit, configured to obtain a motion vector candidate list based on the Merge mode, and select an initial motion vector in the motion vector candidate list according to a preset selection rule;

The direction expansion unit is used to set the starting point based on the initial motion vector, and to expand the motion vector search direction with the regular hexagon as the frame;

The vector search unit is used for taking the starting point as the origin, and performing the motion vector search under the reduced step size range according to each search direction after expansion;

A rate-distortion calculation unit, configured to calculate the rate-distortion cost under the minimum rate-distortion criterion according to the motion vector set obtained by searching;

The vector selection unit is configured to select the motion vector with the smallest rate-distortion cost as the best prediction motion vector of the current prediction unit.

7. a kind of video coding MMVD prediction system based on direction extension as claimed in claim 6, is characterized in that, in described direction extension unit, starting point is initial motion vector in the previous frame direction of current inter-frame image and next frame direction The pointing position in the frame direction.

8. a kind of video coding MMVD prediction system based on direction extension as claimed in claim 6, is characterized in that, in described direction extension unit, regular hexagon is the regular hexagon on the horizontal direction, and starting point is regular hexagon center point.

9. a kind of video coding MMVD prediction system based on direction extension as claimed in claim 8, is characterized in that, in described direction extension unit, the search direction after extension is regular hexagon center point and each angle on regular hexagon The connection direction of the points.

10. A kind of video coding MMVD prediction system based on directional extension as claimed in claim 6, characterized in that, in the vector search unit, the motion vector search is within the step size range from the initial pixel size to the preset pixel size , carry out the motion vector search with step size 2 times increasing.