CN107027029B

CN107027029B - High-performance video coding improvement method based on frame rate conversion

Info

Publication number: CN107027029B
Application number: CN201710117876.8A
Authority: CN
Inventors: 何小海; 武其达; 滕奇志; 卿粼波; 陶青川; 林宏伟
Original assignee: Sichuan University
Current assignee: Sichuan University
Priority date: 2017-03-01
Filing date: 2017-03-01
Publication date: 2020-01-10
Anticipated expiration: 2037-03-01
Also published as: CN107027029A

Abstract

The invention provides a high-performance video coding method combined with frame rate conversion aiming at the HEVC coding standard. The method mainly comprises the steps of carrying out self-adaptive frame extraction on an original video at an encoding end to reduce the transmission data volume; the low frame rate video carries out motion vector estimation on a lost frame by HEVC coding and combining a reference frame motion vector extracted from a code stream and intra-frame block information, and the lost frame is reconstructed by combining the motion vector at a decoding end and an improved bidirectional motion compensation inserted frame based on block coverage. The invention reduces the data amount of transmission and saves the transmission bandwidth due to the self-adaptive frame extraction; the process of motion estimation for the decoded video is saved, so that a large amount of time is saved; the improved block coverage based bi-directional motion compensated frame interpolation guarantees the quality of the reconstructed video. The experimental result shows that compared with HEVC standard coding, the algorithm saves the code rate by about 15% and saves the coding time by about 40% on the premise of ensuring that the PSNR of the reconstructed video is close to the consistency at the medium and low code rate end.

Description

An Improved Method for High Performance Video Coding Based on Frame Rate Conversion

技术领域technical field

本发明涉及图像通信领域中的降低视频编码码率的问题，主要涉及一种高性能视频编码标准HEVC与视频帧率变换相结合来降低编码码率的方法。The invention relates to the problem of reducing the video coding code rate in the field of image communication, and mainly relates to a method for reducing the coding code rate by combining the high-performance video coding standard HEVC and video frame rate conversion.

背景技术Background technique

随着高清数字电视(HDTV)和高端多媒体系统的普及和发展，人们对于视频的主观享受要求也越来越高，这就对视频节目源的质量提出了更高的要求，节目源的质量越高，视频的数据量越多，传输所需的带宽也就越大。与此同时视频编码标准在不断更新，经过MPEG、H.264/AVC等视频编码标准的发展与完善，新一代视频编码标准HEVC也已经发布。并且与H.264/AVC相比，HEVC编码标准节约了近50％的编码码率，但是对于高清视频，传输数据量仍然很大。因此，在有限带宽下，如何在保证解码视频质量的前提下，减少传输数据量，实时传输编码码流，尽可能降低传输带宽的问题仍然是视频编码领域研究的热点。With the popularization and development of high-definition digital television (HDTV) and high-end multimedia systems, people have higher and higher requirements for the subjective enjoyment of video, which puts forward higher requirements for the quality of video program sources. The higher the data volume of the video, the greater the bandwidth required for transmission. At the same time, video coding standards are constantly being updated. After the development and improvement of video coding standards such as MPEG and H.264/AVC, a new generation of video coding standards, HEVC, has also been released. And compared with H.264/AVC, the HEVC encoding standard saves nearly 50% of the encoding bit rate, but for high-definition video, the amount of transmitted data is still huge. Therefore, under the limited bandwidth, how to reduce the amount of transmitted data, transmit the encoded code stream in real time, and reduce the transmission bandwidth as much as possible on the premise of ensuring the quality of the decoded video is still a hot topic in the field of video coding.

视频压缩过程中运动信息对于视频重建至关重要。本发明基于帧率变换的HEVC改进视频压缩编码框架中，对于运动信息的获取以及运动信息后处理作出了改进。在标准的HEVC编码框架中，并没有对丢失帧部分的信息恢复做深入的研究，因而，找到一种高效的视频插帧恢复方法对于提高压缩效率有很大的意义。目前，学术界使用较为广泛的丢失帧恢复的方法多为对解码后低帧率视频进行重新运动估计，来获得插帧过程中使用的运动矢量，且也有一定效果。Motion information during video compression is crucial for video reconstruction. The present invention improves the acquisition of motion information and the post-processing of motion information in the HEVC improved video compression coding framework based on frame rate conversion. In the standard HEVC coding framework, there is no in-depth research on the information recovery of lost frames. Therefore, it is of great significance to find an efficient video interpolation frame recovery method for improving the compression efficiency. At present, the most widely used methods of lost frame recovery in academia are re-motion estimation of the decoded low frame rate video to obtain the motion vector used in the frame insertion process, and it also has certain effects.

帧率变换方法作为提高压缩率的技术，与标准HEVC编码相结合，可以有效降低编码码率。Sung-Hee Lee等人提出在编码端降低视频的帧率，以减少编码帧数，降低传输码流数据，然后在解码端插帧恢复成原始帧率的视频。传统的帧率提升技术分为两种：非运动补偿插帧和运动补偿插帧。非运动补偿插帧恢复过程中不考虑原视频中物体的运动信息，只是通过前一帧和当前帧两参考帧对应位置的像素值来恢复待插帧像素值，如帧重复法和帧平均法；运动补偿插帧则是考虑了场景中物体的运动，通过结合运动矢量信息和参考帧中像素值来重建待插帧，且该方法可以有效地降低帧重复法引入的抖动和帧平均法引入的模糊现象。Choi等人专门提出了适用于帧率上转换技术的双向运动补偿内插(BMCI),根据插帧过程中使用的运动矢量信息的来源不同，运动补偿插帧可以分为两种，一类是基于压缩域的，该方法通过从编码后的码流中提取相邻帧的运动矢量信息，用于运动补偿插帧，这样就省去了对编码后视频重新运动估计以获取运动矢量的过程，大大减少了计算量和时间；第二类是基于像素域，即对解码后的视频重新进行运动估计，再用获得的运动矢量信息进行插帧重建，与前者相比，第二种方法由于引进了运动估计过程，增加编码复杂度，所以会消耗大量的时间。尽管运动补偿插帧方法效果要明显优于非运动补偿方法，但是，传统的运动补偿插帧方法插帧重建后的视频会有空洞和块效应。The frame rate conversion method, as a technique to improve the compression rate, can effectively reduce the coding rate when combined with the standard HEVC coding. Sung-Hee Lee et al. proposed to reduce the frame rate of the video at the encoding end to reduce the number of encoded frames, reduce the transmission stream data, and then insert frames at the decoding end to restore the video to the original frame rate. There are two traditional frame rate enhancement techniques: non-motion-compensated frame insertion and motion-compensated frame insertion. The motion information of the object in the original video is not considered in the non-motion compensation interpolation process, but the pixel value of the frame to be inserted is restored by the pixel value of the corresponding position of the two reference frames of the previous frame and the current frame, such as the frame repetition method and the frame average method. ; Motion compensation frame interpolation considers the motion of objects in the scene, and reconstructs the frame to be inserted by combining the motion vector information and the pixel values in the reference frame, and this method can effectively reduce the jitter introduced by the frame repetition method and the introduction of the frame averaging method. ambiguity. Choi et al. specially proposed Bidirectional Motion Compensation Interpolation (BMCI) suitable for frame rate up-conversion technology. According to the source of motion vector information used in the frame insertion process, motion compensation frame insertion can be divided into two types. One is Based on the compression domain, this method extracts the motion vector information of adjacent frames from the encoded code stream for motion compensation frame insertion, which saves the process of re-motion estimation of the encoded video to obtain the motion vector. The amount of calculation and time is greatly reduced; the second type is based on the pixel domain, that is, the motion estimation of the decoded video is performed again, and then the obtained motion vector information is used for frame interpolation reconstruction. Compared with the former, the second method is due to the introduction of The motion estimation process increases the coding complexity, so it will consume a lot of time. Although the effect of the motion-compensated frame interpolation method is obviously better than that of the non-motion-compensated method, the video reconstructed by the traditional motion-compensated frame interpolation method will have holes and block effects.

发明内容SUMMARY OF THE INVENTION

为解决传统帧率提升过程重建帧中存在空洞和块效应的问题，改善视频压缩编码的性能提高重建视频质量，本发明提出了一种结合帧率变换的高性能视频压缩编码改进方法，在保证视频质量的前提下，降低了视频编码码率，在带宽受限的实时传输系统中具有较好的应用前景。In order to solve the problem of holes and block effects in the reconstructed frames in the traditional frame rate boosting process, improve the performance of video compression coding and improve the quality of the reconstructed video, the present invention proposes an improved method for high-performance video compression coding combined with frame rate conversion, which ensures that Under the premise of video quality, the video coding rate is reduced, and it has a good application prospect in the real-time transmission system with limited bandwidth.

本发明的基本思想是在编码端对原视频进行自适应抽帧，对低帧率视频进行编解码，从码流信息中提取运动矢量信息及HEVC编码过程中特殊的帧内分块信息，在解码端，结合运动矢量对解码视频进行帧率上采恢复到原始帧率，从而达到减少编码帧数降低编码码率的目的。The basic idea of the present invention is to perform adaptive frame extraction on the original video at the encoding end, encode and decode the low frame rate video, extract motion vector information and special intra-frame block information in the HEVC encoding process from the code stream information, and At the decoding end, the frame rate of the decoded video is up-sampled and restored to the original frame rate in combination with the motion vector, so as to achieve the purpose of reducing the number of encoded frames and the encoding bit rate.

本发明针对HEVC视频编码标准提出了一种结合帧率变换的改进HEVC视频压缩编码框架。主要包括对经过自适应抽帧后的低帧率视频进行HEVC标准视频编码，并从传输的码流信息中提取相关的运动矢量信息以及相应的帧内分块信息，由于HEVC特殊的分块模式，运动矢量信息是以PU块的方式存储在码流信息中的，即同一个PU块中的像素位置具有相同的运动矢量，在IPPP帧间配置编码模式下，编码器会同时进行帧内预测和帧间预测来获取残差信息，尽管大部分编码块的预测模式为帧间预测，但当帧间预测获取的残差信息不够准确时，编码器将自动使用帧内预测获得的残差用于编码，因此若从码流信息中提取的块信息对应的是帧内编码块，则运动矢量为0，用于插帧的话就会引起空洞或者块效应，所以，在进行插帧恢复之前，对这些帧内块运动信息进行估计，以确保帧中每个编码块都有相应的运动矢量信息，本发明利用帧内块相邻块以及相邻帧中对应位置块的运动矢量信息来对其进行填充；对获得的运动矢量再进行平滑滤波处理，结合处理后的运动矢量信息对解码后的视频通过基于块覆盖的双向运动补偿插帧，恢复原始帧率。具体主要包括以下步骤：According to the HEVC video coding standard, the present invention proposes an improved HEVC video compression coding framework combined with frame rate conversion. It mainly includes HEVC standard video encoding for the low frame rate video after adaptive frame extraction, and extracts the relevant motion vector information and the corresponding intra-frame block information from the transmitted code stream information. Due to the special block mode of HEVC , the motion vector information is stored in the code stream information in the form of PU blocks, that is, the pixel positions in the same PU block have the same motion vector. In the IPPP inter-frame configuration encoding mode, the encoder will perform intra-frame prediction at the same time. and inter prediction to obtain residual information. Although the prediction mode of most coding blocks is inter prediction, when the residual information obtained by inter prediction is not accurate enough, the encoder will automatically use the residual obtained by intra prediction. Therefore, if the block information extracted from the code stream information corresponds to an intra-coded block, the motion vector is 0, which will cause holes or block effects if it is used for frame insertion. Estimate the motion information of these intra-frame blocks to ensure that each coding block in the frame has corresponding motion vector information. Filling is performed; smooth filtering is performed on the obtained motion vector, and the decoded video is interpolated by bidirectional motion compensation based on block coverage in combination with the processed motion vector information to restore the original frame rate. Specifically, it mainly includes the following steps:

(1)编码前对原始帧率视频进行自适应抽帧，并设定相邻两帧间最大抽帧数为2，最小抽帧数为0；(1) Adaptive frame extraction is performed on the original frame rate video before encoding, and the maximum number of frames extracted between two adjacent frames is set to 2, and the minimum number of frames to be extracted is 0;

(2)编码抽帧后视频，传到解码端，并从传输的码流信息中提取相关的运动矢量信息以及HEVC中特殊的帧内分块信息；(2) The video after encoding the frame extraction is transmitted to the decoding end, and the relevant motion vector information and the special intra-frame block information in HEVC are extracted from the transmitted code stream information;

(3)通过步骤(2)中获得的运动矢量信息对丢失帧进行双向运动估计获得初始运动矢量；(3) carrying out bidirectional motion estimation to the lost frame by the motion vector information obtained in step (2) to obtain an initial motion vector;

(4)结合码流信息中获取的帧内分块信息确定一帧中帧内编码块的位置，由于帧内编码块的运动信息不可用于插帧重建，因此，本发明对于帧内块运动矢量信息是通过该帧内块相邻块以及相邻两参考帧中同一位置块的运动矢量的加权平均值来代替，得到丢失帧修正运动矢量；(4) The position of the intra-frame coding block in a frame is determined in combination with the intra-frame block information obtained from the code stream information. Since the motion information of the intra-frame coding block cannot be used for the interpolation frame reconstruction, the present invention is suitable for the intra-frame block motion. The vector information is replaced by the weighted average value of the motion vectors of the adjacent blocks in the frame and the blocks at the same position in the adjacent two reference frames, so as to obtain the corrected motion vector of the lost frame;

(5)通过对帧内块运动矢量的运动估计，使得每个编码块都有了自己的运动矢量信息，对每个4*4的块进行运动矢量平滑处理；(5) Through the motion estimation of the motion vector of the intra-frame block, each coding block has its own motion vector information, and each 4*4 block is subjected to motion vector smoothing processing;

(6)结合前面得到丢失帧的最终运动矢量信息，通过基于块覆盖的双向运动补偿插帧方法对低帧率解码视频进行插帧重建；(6) In combination with the final motion vector information of the lost frame obtained above, the low frame rate decoded video is reconstructed by inserting a frame by a bidirectional motion compensation frame interpolation method based on block coverage;

在本发明的上述技术方案中，所述的自适应抽帧方法判断两相邻参考帧间最多可以连续丢弃两帧，最小丢帧数为0，具体判断一帧是否为可丢弃帧的标准为：In the above technical solution of the present invention, the adaptive frame extraction method judges that at most two frames can be dropped continuously between two adjacent reference frames, and the minimum number of dropped frames is 0. The specific criterion for judging whether a frame is a dropable frame is: :

其中，f_i-1(i,j)表示前一帧在位置(i,j)处的像素值，f_i(i,j)表示当前帧在位置(i,j)处的像素值，m和n分别表示视频的宽度和高度，MSAD_i则表示相邻两帧对应位置像素值绝对误差和的平均值，R表示相邻帧的相对变化率。设定相邻两帧之间最大连续抽帧数阈值为2，MSAD阈值为35，R阈值为3。Among them, f _i-1 (i, j) represents the pixel value of the previous frame at position (i, j), f _i (i, j) represents the pixel value of the current frame at position (i, j), m and n represent the width and height of the video respectively, MSAD _i represents the average value of the absolute error sum of the pixel values at the corresponding positions of the adjacent two frames, and R represents the relative change rate of the adjacent frames. Set the threshold of the maximum number of consecutive frames drawn between two adjacent frames to 2, the MSAD threshold to 35, and the R threshold to 3.

在本发明的上述技术方案中，传统的双向运动补偿插帧计算方法为：In the above-mentioned technical scheme of the present invention, the traditional bidirectional motion compensation frame insertion calculation method is:

其中F_t(x,y)代表待插帧在位置(x,y)处的重建值，F_t-1和F_t+1分表表示前一参考帧和当前参考帧，(V_x,V_y)为码流中获得的待插帧的前参考帧和后参考帧之间的运动矢量，

则表示由参考帧运动矢量信息计算得到的待插帧相对于前后参考帧的初始运动矢量。where F _t (x, y) represents the reconstructed value of the frame to be inserted at position (x, y), F _t-1 and F _t+1 sub-tables represent the previous reference frame and the current reference frame, (V _x , V _y ) is the motion vector between the front reference frame and the back reference frame of the frame to be inserted obtained in the code stream,

It represents the initial motion vector of the frame to be inserted relative to the reference frame before and after, calculated from the motion vector information of the reference frame.

在本发明的上述技术方案中，结合帧内分块信息对帧内编码块运动矢量进行修正的具体方法为:In the above-mentioned technical scheme of the present invention, the concrete method for revising the motion vector of the intra-frame coding block in conjunction with the intra-frame block information is:

其中，MV_intra表示帧内块运动信息估计值，MV_prev表示前一参考帧与该帧内块相同位置处的运动矢量，MV_curr表示后一参考帧中与该帧内块相同位置处的运动矢量，MV_i表示与该帧内块相邻的块运动矢量，w_i表示补偿权重，n为相邻块的个数，对于补偿权重w_i的设定规则是：若相邻块的大小与该帧内块相同则权重为1，否则，按照相邻块与帧内块的在HEVC中的划分深度的差值来确定，相差1，则权重为0.5，相差2，权重为0.25，以此类推。如果相邻块中存在帧内块则不算在内，若周围全是帧内块，则该帧内块运动矢量设为0。Among them, MV _intra represents the estimated value of the motion information of the intra-frame block, MV _prev represents the motion vector at the same position as the intra-block in the previous reference frame, and MV _curr represents the motion at the same position as the intra-block in the next reference frame vector, MV _i represents the motion vector of the block adjacent to the intra-frame block, _wi represents the compensation weight, n is the number of adjacent blocks, and the setting rule for the compensation weight _wi is: if the size of the adjacent block is equal to If the intra-frame blocks are the same, the weight is 1. Otherwise, it is determined according to the difference between the division depths of adjacent blocks and intra-frame blocks in HEVC. If the difference is 1, the weight is 0.5. If the difference is 2, the weight is 0.25. analogy. If there are intra-frame blocks in adjacent blocks, it is not included, and if all surrounding blocks are intra-frame blocks, the motion vector of the intra-frame block is set to 0.

在本发明的上述技术方案中，所述对于修正运动矢量进行平滑滤波的方法为：In the above technical solution of the present invention, the method for smoothing and filtering the corrected motion vector is:

在本发明的上述技术方案中，所述的通过基于块覆盖的双向运动补偿插帧方法对低帧率解码视频进行插帧重建方法为：In the above-mentioned technical scheme of the present invention, the described method for performing frame interpolation and reconstruction on low frame rate decoded video through the bidirectional motion compensation frame interpolation method based on block coverage is as follows:

其中，F(x,y)表示待插帧在位置(x,y)的块重建值，n表示该待插值块所覆盖参考块的个数，

表示待插块所覆盖参考块的运动矢量。Among them, F(x, y) represents the block reconstruction value of the to-be-interpolated frame at position (x, y), n represents the number of reference blocks covered by the to-be-interpolated block,

Represents the motion vector of the reference block covered by the block to be inserted.

根据本发明的上述方法可以编制执行上述基于帧率变换的高性能视频编码改进方法的HEVC视频编解码器。The above-mentioned method according to the present invention can formulate an HEVC video codec that performs the above-mentioned improved method for high-performance video coding based on frame rate conversion.

本发明是基于以下思路分析而完成的：The present invention is completed based on the analysis of the following ideas:

帧率变换技术作为可以提高视频编码效率的有效手段，可以与标准视频编码方法相结合，本发明中自适应抽帧具体方法为：判断一帧是否为可丢弃帧的依据是两帧的平均绝对误差值(MSAD_i)和相对变化率(R)，当MSAD_i和R的值均大于给定的阈值，则判断第i帧为场景跳变帧，则对该帧进行保留，不进行抽帧处理；反之，则将该帧视为可抽弃帧，判断完一帧后，继续判断相邻下一帧是否为可抽弃帧，这样就存在一个问题，若连续的几帧都为可抽弃帧，则由于抽弃帧数过多，对于视频重建时获取的运动信息准确性有较大影响，所以视频重建质量不佳，因此，为了保证插帧重建视频的质量，本发明对于相邻帧之间的最大抽帧数也设置了一个阈值。通过实验比较，当设定相邻两帧之间最大连续抽帧数阈值为2帧，MSAD阈值为35，R阈值为3时，效果较好。As an effective means that can improve video coding efficiency, frame rate conversion technology can be combined with standard video coding methods. The specific method of adaptive frame extraction in the present invention is: the basis for judging whether a frame is a discardable frame is the average absolute value of the two frames. Error value (MSAD _i ) and relative rate of change (R), when the values of MSAD _i and R are both greater than the given threshold, it is judged that the i-th frame is a scene jump frame, then the frame is reserved and no frame sampling is performed Processing; otherwise, the frame is regarded as a discardable frame, and after judging one frame, it continues to judge whether the next adjacent frame is a discardable frame, so there is a problem, if several consecutive frames are discardable. If the frame is discarded, the number of discarded frames is too large, which has a great influence on the accuracy of the motion information obtained during video reconstruction, so the video reconstruction quality is not good. The maximum number of frames drawn between frames also sets a threshold. Through experimental comparison, when the maximum number of consecutive frames drawn between two adjacent frames is set to 2 frames, the MSAD threshold is 35, and the R threshold is 3, the effect is better.

自适应抽帧后的低帧率视频通过HEVC编码框架进行编码，从编码后的码流中获取运动矢量信息，用来初步估算丢失帧的运动矢量，并对该初始运动矢量进行修正，用于双向运动补偿插帧，以提高插帧恢复后视频的质量。The low frame rate video after adaptive frame extraction is encoded by the HEVC encoding framework, and the motion vector information is obtained from the encoded code stream to initially estimate the motion vector of the lost frame, and the initial motion vector is corrected for Bidirectional motion-compensated frame insertion to improve the video quality after frame insertion.

由于HEVC特殊的分块模式，运动矢量信息是以PU块的方式存储在码流信息中的，即同一个PU块中的像素位置具有相同的运动矢量，在IPPP帧间配置编码模式下，编码器会同时进行帧内预测和帧间预测来获取残差信息，尽管大部分编码块的预测模式为帧间预测，但当帧间预测获取的残差信息不够准确时，编码器将自动使用帧内预测获得的残差用于编码，因此若从码流信息中提取的块信息对应的是帧内编码块，则运动矢量为0，用于插帧的话就会引起空洞或者块效应，所以，在进行插帧恢复之前，对这些帧内块运动信息进行估计。视频中不仅序列帧之间存在时域相关性，每一帧内部还存在空域相关性，所以插帧过程中待插值块的相邻块的运动信息同样可以作为参考，因此对于帧内编码块，借用待插块空域相邻块以及参考帧中同一位置块的运动信息来估算其运动矢量。Due to the special block mode of HEVC, the motion vector information is stored in the code stream information in the form of PU blocks, that is, the pixel positions in the same PU block have the same motion vector. In the IPPP inter-frame configuration encoding mode, the encoding The encoder will perform intra-frame prediction and inter-frame prediction at the same time to obtain residual information. Although the prediction mode of most coding blocks is inter-frame prediction, when the residual information obtained by inter-frame prediction is not accurate enough, the encoder will automatically use frame The residual obtained by intra-prediction is used for coding. Therefore, if the block information extracted from the code stream information corresponds to an intra-frame coding block, the motion vector is 0. If it is used for frame insertion, it will cause holes or block effects. Therefore, The motion information of these intra-frame blocks is estimated before inter-frame restoration. In the video, there is not only temporal correlation between sequence frames, but also spatial correlation within each frame, so the motion information of adjacent blocks of the block to be interpolated during frame interpolation can also be used as a reference. Therefore, for intra-frame coding blocks, The motion vector of the block to be inserted is estimated by borrowing the motion information of the adjacent blocks in the spatial domain of the block to be inserted and the block at the same position in the reference frame.

经过对帧内块运动矢量的估计，每个编码块都有了自己的运动信息，在对待插帧中每个4*4块进行插值恢复之前，取该块相邻八个块与自身运动矢量的平均值作为插帧恢复过程中的最终运动矢量，如果待插值的块位于边界区域，则取其边界内参考块的运动矢量进行参考，比如若此时我们需要插值图像右下角的块，则在参考帧中取其对应位置块，其左边块、上边块以及左上方三个块进行参考。其他角落的边界块插值以此类推。After estimating the motion vector of the intra-frame block, each coding block has its own motion information. Before performing interpolation and restoration on each 4*4 block in the frame to be interpolated, take the eight adjacent blocks of the block and its own motion vector. The average value is used as the final motion vector in the interpolation frame restoration process. If the block to be interpolated is located in the boundary area, the motion vector of the reference block in the boundary is taken for reference. For example, if we need to interpolate the block in the lower right corner of the image at this time, then The corresponding position block is selected in the reference frame, and the left block, the upper side block and the upper left three blocks are used for reference. Boundary block interpolation in other corners and so on.

通过基于块覆盖的双向运动补偿插帧方法对低帧率解码视频进行插帧重建实现过程如下：由于HEVC中的大部分信息如运动矢量都是以不同划分块的格式存储和传输的，在视频的重建过程中也是以块的像素值重建的方式完成的，所以在双向插帧的基础上，充分考虑视频帧中空域上的相关性，对待插值块分区域进行插值，对不同区域内的像素值采用不同的方法进行插值，即丢失帧中的一个待插值块覆盖着含有不同运动信息的参考块，当这种情况出现时，本发明使用改进的基于块覆盖的双向运动补偿插帧方法。The process of interpolating and reconstructing low frame rate decoded video through the bidirectional motion compensation interpolating method based on block coverage is as follows: Since most of the information in HEVC, such as motion vectors, are stored and transmitted in the format of different divided blocks, in the video The reconstruction process is also completed in the way of pixel value reconstruction of blocks. Therefore, on the basis of bidirectional frame interpolation, the correlation in the spatial domain of the video frame is fully considered, and the interpolation block is divided into regions. The values are interpolated using different methods, that is, a block to be interpolated in a lost frame covers a reference block containing different motion information. When this happens, the present invention uses an improved block coverage-based bidirectional motion-compensated frame interpolation method.

与标准HEVC视频编码方法相比，本发明方法首先采用自适应抽帧方法处理待编码序列，然后从码流信息中提取相关的运动矢量信息，用以估算丢失帧的运动矢量，并结合码流中的帧内分块信息对丢失帧运动矢量进行修正，再通过对每个4*4重建块的运动矢量进行平滑滤波得到最终用于视频重建的运动矢量，最后通过给予块覆盖的双向运动补偿插帧方法进行视频重建。通过本发明方法，在中低码率段，保证重建视频质量的前提下，不仅可以降低码流数据量，节省传输带宽，而且由于省去了对丢失帧的重新运动估计过程大大提高了编码时间效率。Compared with the standard HEVC video coding method, the method of the present invention first adopts the adaptive frame extraction method to process the sequence to be coded, and then extracts the relevant motion vector information from the code stream information to estimate the motion vector of the lost frame, and combines the code stream. The intra-frame block information in the frame is used to correct the motion vector of the lost frame, and then the motion vector for the final video reconstruction is obtained by smoothing the motion vector of each 4*4 reconstructed block. Interpolation method for video reconstruction. Through the method of the present invention, under the premise of ensuring the quality of the reconstructed video in the middle and low bit rate segments, not only the amount of code stream data can be reduced, the transmission bandwidth can be saved, but also the coding time can be greatly improved because the re-motion estimation process for lost frames is omitted. efficiency.

附图说明Description of drawings

图1为本发明的基于帧率变换的高性能视频编码改进方法的流程图。FIG. 1 is a flowchart of an improved method for high-performance video coding based on frame rate conversion according to the present invention.

图2为本发明中自适应抽帧示意图。FIG. 2 is a schematic diagram of adaptive frame extraction in the present invention.

图3本发明中双向运动补偿插帧方法示意图。FIG. 3 is a schematic diagram of a frame insertion method for bidirectional motion compensation in the present invention.

图4为本发明中对初始运动矢量平滑处理方法示意图。FIG. 4 is a schematic diagram of a method for smoothing an initial motion vector in the present invention.

图5为本发明中基于块覆盖的运动补偿插值示意图。FIG. 5 is a schematic diagram of motion compensation interpolation based on block coverage in the present invention.

图6～图9为本发明算法与HM16.0标准编码方法率失真曲线对比图，其中图6～图9分别为序列PartyScene、FourPeople、ParkScene、PeopleOnStreet在QP为22、27、32、37、42、44、46、48、50的情况下的率失真曲线的对比图。Figures 6 to 9 are the comparison diagrams of the rate-distortion curves between the algorithm of the present invention and the HM16.0 standard encoding method, wherein Figures 6 to 9 are the sequences PartyScene, FourPeople, ParkScene, and PeopleOnStreet at QPs of 22, 27, 32, 37, and 42 respectively. , 44, 46, 48, 50 comparison of rate-distortion curves.

具体实施方式Detailed ways

下面结合实例对本发明作进一步的详细说明，有必要指出的是，以下的实例只用于对本发明做进一步的说明，不能理解为对本发明保护范围的限制，所属领域技术熟悉人员根据上述发明内容，对本发明做出一些非本质的改进和调整进行的具体实施，应仍属于本发明的保护范围。The present invention will be described in further detail below in conjunction with examples. It is necessary to point out that the following examples are only used to further illustrate the present invention and should not be construed as limiting the protection scope of the present invention. The specific implementation of some non-essential improvements and adjustments to the present invention should still fall within the protection scope of the present invention.

本发明的基于帧率变换的高性能视频改进编码算法，与HEVC标准测试模型HM16.0的编码方法比较过程如下：The high-performance video improved coding algorithm based on frame rate conversion of the present invention is compared with the coding method of the HEVC standard test model HM16.0 as follows:

1、打开标准HM16.0测试模型，配置文件为lowdelay_P_main，在量化参数分别取22、27、32、37、42、46、48、50的情况下对标准视频测试序列进行编解码，记录标准HEVC视频编码时的码率及重建视频PSNR；1. Open the standard HM16.0 test model, the configuration file is lowdelay_P_main, encode and decode the standard video test sequence when the quantization parameters are 22, 27, 32, 37, 42, 46, 48, and 50, and record the standard HEVC Bit rate and reconstructed video PSNR during video encoding;

2、本发明方法将与上述1中的HEVC视频编码标准的参考软件HM16.0的码率及重建视频PSNR作比较。同时打开本发明和标准方法的程序，设置好相同的配置文件，在编码前通过自适应抽帧方法对视频序列进行帧率下采，标准算法对原始帧率视频序列进行编解码，本发明算法对帧率下采后的低帧率视频进行编解码，并在码流信息中提取相关的运动信息和帧内分块信息。对三种视频编码性能：峰值信噪比(PSNR)、码率、编码时间(其中PSNR体现重建视频的客观质量，码率体现数据传输量，编码时间体现视频编码处理的时间效率)进行对比分析，比较性能的差距用以下两个指标进行评价：2. The method of the present invention will be compared with the bit rate of the reference software HM16.0 of the HEVC video coding standard and the PSNR of the reconstructed video in the above 1. Open the programs of the present invention and the standard method at the same time, set the same configuration file, down-sampling the video sequence through the adaptive frame extraction method before encoding, and the standard algorithm encodes and decodes the original frame rate video sequence. The algorithm of the present invention It encodes and decodes the low frame rate video after the frame rate down-sampling, and extracts the relevant motion information and intra-frame block information from the code stream information. Comparative analysis of three video coding performances: peak signal-to-noise ratio (PSNR), bit rate, and encoding time (where PSNR reflects the objective quality of the reconstructed video, the bit rate reflects the amount of data transmission, and the encoding time reflects the time efficiency of video encoding processing). , the gap of comparative performance is evaluated by the following two indicators:

其中，△码率表示本发明算法和标准算法的码率差值相对于标准码率的百分比，即降低码率的百分率，△T则表示本发明算法与标准编解码时间之差相对于标准编码时间的百分比Among them, Δ code rate represents the percentage of the code rate difference between the algorithm of the present invention and the standard algorithm relative to the standard code rate, that is, the percentage of reduced code rate, and ΔT represents the difference between the algorithm of the present invention and the standard codec time relative to the standard code. percentage of time

3、编码对象为标准的HEVC测试视频，它们的名称、分辨率和帧率分别为：PartyScene(832x480，50帧/秒)、FourPeople(1280x720，60帧/秒)、ParkScene(1920x1080，24帧/秒)、PeopleOnStreet(2560x1600，30帧/秒)。3. The encoding objects are standard HEVC test videos, their names, resolutions and frame rates are: PartyScene (832x480, 50 frames/sec), FourPeople (1280x720, 60 frames/sec), ParkScene (1920x1080, 24 frames/sec) seconds), PeopleOnStreet (2560x1600, 30 fps).

4、利用HM16.0标准方法对原始帧率视频序列在HEVC方式下进行编码；4. Use the HM16.0 standard method to encode the original frame rate video sequence in HEVC mode;

5、利用本发明方法对同一序列的低帧率视频进行编码，并从码流中提取相关的运动矢量信息以及帧内分块信息；5. Use the method of the present invention to encode the same sequence of low frame rate videos, and extract relevant motion vector information and intra-frame block information from the code stream;

6、利用本发明方法对解码后的低帧率视频进行帧率上采；6. Using the method of the present invention to perform frame rate upsampling on the decoded low frame rate video;

7、分别记录两个方法编解码过程中的码率、PSNR、时间等相关实验数据；其中在重建视频客观质量(PSNR)接近一致的情况下本发明方法相对于标准方法节省码率如表1，本发明与标准算法在时间效率方面的对比如表2。统计结果显示，在中低码率段，本发明方法与HEVC标准方法相比在保证重建视频PSNR接近一致的前提下节省码率约15％；而在编码时间效率方面也节省时间近40％。7. Record the relevant experimental data such as code rate, PSNR, time in the encoding and decoding process of the two methods respectively; wherein the method of the present invention saves the code rate relative to the standard method under the condition that the objective quality of the reconstructed video (PSNR) is close to the same as in Table 1 , the comparison between the present invention and the standard algorithm in terms of time efficiency is shown in Table 2. Statistical results show that, compared with the HEVC standard method, the method of the present invention saves about 15% of the code rate on the premise of ensuring that the PSNR of the reconstructed video is close to the same in the medium and low code rate segment; and also saves nearly 40% in encoding time efficiency.

表1本发明算法与HM16.0标准算法之间编码码率的比较Table 1 Comparison of the coding rate between the algorithm of the present invention and the HM16.0 standard algorithm

表2本发明算法与HM16.0标准算法之间编解码时间效率的比较Table 2 Comparison of encoding and decoding time efficiency between the algorithm of the present invention and the HM16.0 standard algorithm

Claims

1. A high-performance video coding improvement method based on frame rate conversion is mainly characterized by comprising the following process steps:

(1) before coding, carrying out self-adaptive frame extraction on an original frame rate video, and setting the maximum frame extraction number between two adjacent reference frames to be 2 and the minimum frame extraction number to be 0;

(2) coding the video after frame extraction, decoding by a decoding end, and extracting related motion vectors and special intra-frame block information in HEVC from transmitted code stream information;

(3) performing bidirectional motion estimation on the lost frame through the motion vector obtained in the step (2) to obtain an initial motion vector;

(4) determining the position of an intra-frame coding block in a frame by combining intra-frame block information acquired from code stream information, wherein the motion information of the intra-frame coding block cannot be used for frame interpolation reconstruction, so that the motion vector information of the intra-frame coding block is replaced by the weighted average value of the motion vectors of adjacent blocks of the intra-frame coding block and blocks at the same position in two adjacent reference frames to obtain a lost frame correction motion vector;

(5) by motion estimation of motion vectors of the intra blocks, each coding block has own motion vector information, and motion vector smoothing processing is carried out on each 4 x4 block;

(6) and combining the final motion vector information of the lost frame obtained in the previous step, and performing interpolation reconstruction on the low frame rate decoded video by a bidirectional motion compensation interpolation method based on block coverage.

2. The method as claimed in claim 1, wherein the step (1) of adaptively extracting frames from the original frame rate video, and the specific method for determining whether a frame is a discardable frame is as follows:

wherein f is_i-1(i, j) represents the pixel value of the previous reference frame at position (i, j), f_i(i, j) represents the pixel value of the current reference frame at position (i, j), m and n represent the width and height of the video, respectively, MSAD_iThen representing the average value of the absolute errors of the pixels at the corresponding positions of two adjacent frames, R representing the relative change rate of the adjacent frames, if MSAD_iIf the value of R is greater than the given threshold value, the ith frame is judged to be a scene jump frame, and the frame is reserved without frame extraction; and otherwise, the frame is regarded as an extractable frame, and after one frame is judged, whether the next adjacent frame is an extractable frame or not is continuously judged.

3. The method of claim 2 wherein the MSAD is set during adaptive frame decimation_iThe threshold is 35, the R threshold is 3, the maximum continuous frame number of two adjacent reference frames is 2, and the minimum is 0.

4. The improved frame rate transform-based high performance video coding method as claimed in any of claims 1 to 3, wherein the motion estimation method for the lost frame is:

wherein (V)_x,V_y) For the motion vector between the front reference frame and the back reference frame of the frame to be inserted obtained in the code stream,

it represents the initial motion vector of the frame to be interpolated relative to the previous and next reference frames calculated from the reference frame motion vector information.

5. The improved frame rate transform-based high performance video coding method as claimed in claim 4, wherein the method for filling the intra block motion vector in combination with the intra block information comprises:

MV_intrarepresenting intra block motion information estimate, MV_prevAnd MV_currRespectively representing motion vector information, MV, at the same position as the intra block in the previous and subsequent reference frames_iRepresenting block motion vectors, w, adjacent to the intra block_iRepresents the compensation weight, and n is the number of adjacent blocks.

6. The method as claimed in claim 5, wherein the compensation weight w is_iThe setting rule of (1) is: if the size of the adjacent block is the same as that of the intra-frame block, the weight is 1, otherwise, the difference is determined according to the difference value of the division depths of the adjacent block and the intra-frame block in HEVC, the weight is 0.5 if the difference is 1, the difference is 2, the weight is 0.25, and the like; if there is an intra block in the neighboring blocks, the intra block motion vector is set to 0 if all the surrounding blocks are intra blocks.

7. The improved frame rate conversion based high performance video coding method of claim 6, wherein a smoothing filtering process is performed for each 4 x4 block of motion vector information before the lost frame is recovered.

8. The improved method of high-performance video coding based on frame rate conversion as claimed in claim 7, wherein the lost frame is recovered by an improved bidirectional motion compensation method based on block covering, the specific method is:

wherein F (x, y) representsBlock reconstruction value of frame to be interpolated at position (x, y), n represents number of reference blocks covered by block to be interpolated, F_t-1(x, y) denotes a frame of time preceding the frame to be interpolated, F_t+1(x, y) represents a frame at a time subsequent to the frame to be interpolated,

representing the block motion vector of the frame to be interpolated at position (x, y).