[go: up one dir, main page]

CN107155112A - A kind of compressed sensing method for processing video frequency for assuming prediction more - Google Patents

A kind of compressed sensing method for processing video frequency for assuming prediction more Download PDF

Info

Publication number
CN107155112A
CN107155112A CN201710375057.3A CN201710375057A CN107155112A CN 107155112 A CN107155112 A CN 107155112A CN 201710375057 A CN201710375057 A CN 201710375057A CN 107155112 A CN107155112 A CN 107155112A
Authority
CN
China
Prior art keywords
key frame
prediction
key frames
side information
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710375057.3A
Other languages
Chinese (zh)
Inventor
武明虎
陈瑞
赵楠
刘敏
孔祥斌
刘聪
饶哲恒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei University of Technology
Original Assignee
Hubei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei University of Technology filed Critical Hubei University of Technology
Priority to CN201710375057.3A priority Critical patent/CN107155112A/en
Publication of CN107155112A publication Critical patent/CN107155112A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/58Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

本发明公开了一种多假设预测的压缩感知视频处理方法,所述处理方法的结构框架包括编码端和解码端;在所述编码端,视频中的帧分成关键帧和非关键帧,根据压缩感知理论,关键帧和非关键帧均通过测量矩阵Φ获得测量值;在所述解码端,所述关键帧进行BCS‑SPL重构,然后分别进行多预测假设和残差重构;所述非关键帧进行残差重构,并根据关键帧产生的边信息进行解码。本发明中提出的一种基于MH预测的新的分布式压缩视频感测框架,可以在低复杂度编码器处捕获和压缩视频,并且在解码器处有效地重建视频,改MH‑BCS‑SPL框架能够提供更好的图像重建质量。

The invention discloses a multi-hypothesis prediction compression sensing video processing method, the structural framework of the processing method includes an encoding end and a decoding end; at the encoding end, frames in the video are divided into key frames and non-key frames, according to the compression Perceptual theory, both key frames and non-key frames obtain measured values through the measurement matrix Φ; at the decoding end, the key frames undergo BCS-SPL reconstruction, and then perform multi-prediction assumptions and residual reconstruction respectively; the non-key frames Keyframes are residually reconstructed and decoded according to the side information generated by the keyframes. A new distributed compressed video sensing framework based on MH prediction proposed in this invention can capture and compress video at low-complexity encoder and reconstruct video efficiently at decoder, improving MH‑BCS‑SPL Frames can provide better image reconstruction quality.

Description

一种多假设预测的压缩感知视频处理方法A compressive sensing video processing method based on multi-hypothesis prediction

技术领域technical field

本发明涉及压缩感知视频技术领域,尤其涉及一种多假设分块视频压缩感知结合平滑滤波实现图像重构技术。The present invention relates to the technical field of compressed sensing video, in particular to a multi-hypothesis segmented video compression sensing combined with smoothing filtering to realize image reconstruction technology.

背景技术Background technique

目前,CS(压缩感知)已通过集成压缩和感测而革新了信号采样和处理系统。对于从不同方面研究用于视频的CS的应用,称为压缩视频感知(CVS)。在编码器处,输入视频帧被分组为由关键帧和多个非关键帧组成的图像组。Currently, CS (compressed sensing) has revolutionized signal sampling and processing systems by integrating compression and sensing. For the application of studying CS for video from different aspects, it is called Compressed Video Sensing (CVS). At the encoder, input video frames are grouped into picture groups consisting of a key frame and multiple non-key frames.

传统框架中使用MPEG/H.264编码对关键帧进行编码,CS测量矩阵来感测非关键帧,并使用从相邻重构的关键帧生成的边信息来重构关键帧。这个框架的缺点是仍然需要复杂的MPEG/H.264编码。在其后的改进中,CS测量应用于关键帧和非关键帧。使用用于稀疏重建的梯度投影(GPSR)来重构关键帧,并且将使用从解码的关键帧产生的边信息来重构非关键帧。MPEG/H.264 encoding is used in the traditional framework to encode keyframes, CS measurement matrix to sense non-keyframes, and reconstruct keyframes using side information generated from neighboring reconstructed keyframes. The downside of this framework is that complex MPEG/H.264 encoding is still required. In subsequent refinements, CS measurements are applied to both keyframes and non-keyframes. Keyframes are reconstructed using Gradient Projection for Sparse Reconstruction (GPSR), and non-keyframes will be reconstructed using side information generated from decoded keyframes.

随着技术的不断进步,为了减轻巨大的计算和存储器负担,L.Gan提出了一种基于块与块之间具有独立性的CS假设(BCS)用于2D图像,Do等使用珍贵解码帧中的相邻块来表示当前帧中的块,以提高边信息的准确度,并开发了残差重建方法。S.Mun扩展了Gan的BCS,并在最近的变换域中进行重建,特征在于高度方向分解。这些方法被称为单假设运动补偿(SH-MC)方案,其具有一些缺点。在编码器处,由于运动估计搜索,除了增加编码器侧的计算复杂度之外,还施加了发送块运动矢量的传输开销。此外,SH-MC隐含地假设在视频帧中发生的运动是均匀的块平移模型。由于这个假设不总是成立,所以块伪影出现在恢复的帧中。With the continuous advancement of technology, in order to reduce the huge computational and memory burden, L.Gan proposed a CS hypothesis (BCS) based on the independence between blocks for 2D images, Do et al. used precious decoding frames Neighboring blocks in the current frame are represented to improve the accuracy of side information, and a residual reconstruction method is developed. S. Mun extends Gan's BCS and reconstructs it in the nearest transform domain, characterized by a height-oriented decomposition. These methods are called Single Hypothesis Motion Compensation (SH-MC) schemes, which have some disadvantages. At the encoder, due to the motion estimation search, in addition to increasing the computational complexity on the encoder side, a transmission overhead of sending the block motion vector is imposed. Furthermore, SH-MC implicitly assumes that motion occurring in video frames is a uniform block translation model. Since this assumption does not always hold, block artifacts appear in recovered frames.

为了解决这些问题,E.Taramel等人提出了一种用于将多假设运动补偿(MH-MC)合并到BCS中的策略,并将平滑滤波Landweber用于视频重建(MH-BCS-SPL),其通过找到搜索窗口中所有块或假设的线性组合来寻找更多精准假设。MH-MC技术以解码器处更复杂的代价来提高恢复性能。其后又提出基于弹性网络使用MH和SH重建组合方案,其以与Tikhonov正则化重建相比更复杂的代价实现可接受的性能。之后又出现如假设集合更新和动态参考帧选择算法。测量域和像素域中连续部署MH预测的方法,以开发两级MH重建方案,并且R Li等呈现空时量化和运动对准重建以改善性能的CVS系统等等。但是,仍然有一些问题需要解决,因为释放了编码器的计算负担,我们可以通过简单算法得到边信息(SI),但非关键帧重构处理不能利用粗略预测有效地执行。To address these issues, E.Taramel et al. proposed a strategy for incorporating Multi-Hypothesis Motion Compensation (MH-MC) into BCS and smoothing filtering Landweber for video reconstruction (MH-BCS-SPL), It finds more precise hypotheses by finding a linear combination of all blocks or hypotheses in the search window. The MH-MC technique improves recovery performance at the cost of more complexity at the decoder. A combined scheme based on elastic networks using MH and SH reconstruction was later proposed, which achieves acceptable performance at a more complex cost compared to Tikhonov regularized reconstruction. Then came algorithms such as hypothesis set updating and dynamic reference frame selection. Methods for sequentially deploying MH prediction in the measurement and pixel domains to develop two-level MH reconstruction schemes, and R Li et al. presenting space-time quantization and motion-aligned reconstruction for improved performance of CVS systems, among others. However, there are still some problems to be solved, because the computational burden of the encoder is released, we can get the side information (SI) through simple algorithms, but the non-keyframe reconstruction processing cannot be performed efficiently with coarse prediction.

发明内容Contents of the invention

基于背景技术存在的技术问题,本发明目的之一在于提供一种新的基于多假设预测的分布式图像压缩感知视频图像处理框架,其中计算三个侧面信息候选以选择改进传统的MH预测算法,在解端利用双向估算计算出候选边信息,并引进新算法计算相关系数,选取关键边信息,恢复非关键帧。Based on the technical problems existing in the background technology, one of the purposes of the present invention is to provide a new distributed image compression sensing video image processing framework based on multi-hypothesis prediction, in which three side information candidates are calculated to select and improve the traditional MH prediction algorithm, At the solution end, bidirectional estimation is used to calculate candidate side information, and a new algorithm is introduced to calculate correlation coefficients, key side information is selected, and non-key frames are restored.

一种多假设预测的压缩感知视频处理方法,所述处理方法的结构框架包括编码端和解码端;在所述编码端,为提高视频重建质量,并根据实时性要求,视频序列帧被分为关键帧和非关键帧,每两帧构成一个图像组(GOP,Group Of Picture),即GOP等于2。通常奇数帧为关键帧,偶数帧为非关键帧。根据压缩感知理论,关键帧和非关键帧均通过测量矩阵Φ获得测量值,不同的是,关键帧的测量率高,非关键帧的测量率低;在所述解码端,关键帧经过基于块平滑投影的Landweber(BCS SPL)重建算法进行解码,然后经过多假设预测算法和残差重建后,得到重建后的关键帧并存储;非关键帧进行残差重建后,与根据关键帧产生的边信息一起联合解码,得到重建后的非关键帧。最后,将解码后的关键帧和非关键帧按照帧顺序整合成视频序列并输出。A compressive sensing video processing method for multi-hypothesis prediction, the structural framework of the processing method includes an encoding end and a decoding end; at the encoding end, in order to improve the quality of video reconstruction and according to real-time requirements, video sequence frames are divided into The key frame and the non-key frame form a group of picture (GOP, Group Of Picture) every two frames, that is, the GOP is equal to 2. Usually odd frames are key frames and even frames are non-key frames. According to compressed sensing theory, both key frames and non-key frames obtain measurement values through the measurement matrix Φ, the difference is that the measurement rate of key frames is high, and the measurement rate of non-key frames is low; The smooth projected Landweber (BCS SPL) reconstruction algorithm is used for decoding, and then after multi-hypothesis prediction algorithm and residual reconstruction, the reconstructed key frame is obtained and stored; Information is jointly decoded together to obtain reconstructed non-keyframes. Finally, the decoded key frames and non-key frames are integrated into a video sequence according to frame order and output.

优选的,所述边信息根据已解码的相邻关键帧经过多假设预测MH算法求得。Preferably, the side information is obtained through a multi-hypothesis prediction MH algorithm based on decoded adjacent key frames.

多假设预测MH算法的方法步骤如下:The method steps of the multi-hypothesis prediction MH algorithm are as follows:

(1)运用双向运动估算计算出三个候选边信息SIi(i=0,1,2);(1) Use bidirectional motion estimation to calculate three candidate side information SIi (i=0,1,2);

(2)分别计算非关键帧与三个候选边信息的相关系数,选取相关性最高SI信息。(2) Calculate the correlation coefficients between non-key frames and three candidate side information, and select the SI information with the highest correlation.

(3)在图像重构时,在测量域,利用SI信息的多假设预测生成一种信号残差,并计算假设的最佳线性组合,用改进的多尺度分块压缩感知MS-BCS-SPL技术重构图像。(3) During image reconstruction, in the measurement domain, use the multi-hypothesis prediction of SI information to generate a signal residual, and calculate the best linear combination of hypotheses, and use the improved multi-scale block compression sensing MS-BCS-SPL Techniques for reconstructing images.

优选的,根据加权正则化Tikhonov矩阵计算假设最佳线性组合。Preferably, the assumption optimal linear combination is calculated according to the weighted regularized Tikhonov matrix.

一种多假设预测的压缩感知视频处理方法应用于视频图像处理。A compressive sensing video processing method based on multi-hypothesis prediction is applied to video image processing.

与现有技术相比,本发明具有的有益效果在于:Compared with the prior art, the present invention has the beneficial effects of:

本发明中提出的一种基于MH预测的新的分布式压缩视频感测框架,可以在低复杂度编码器处捕获和压缩视频,并且在解码器处有效地重建视频。所提出的框架可以通过MH预测和BiME估计初始边信息。根据相关系数选择边信息,并用于恢复非关键帧。实验模拟结果表明,本发明所提出的框架可以提供比原始的MH-BCS-SPL算法更好的重建质量。A novel distributed compressed video sensing framework based on MH prediction proposed in this invention can capture and compress video at low-complexity encoder and efficiently reconstruct video at decoder. The proposed framework can estimate initial side information via MH prediction and BiME. Side information is selected according to the correlation coefficient and used to recover non-keyframes. Experimental simulation results show that the proposed framework can provide better reconstruction quality than the original MH-BCS-SPL algorithm.

附图说明Description of drawings

附图用来提供对本发明的进一步理解,并且构成说明书的一部分,与本发明的实施例一起用于解释本发明,并不构成对本发明的限制。在附图中:The accompanying drawings are used to provide a further understanding of the present invention, and constitute a part of the description, and are used together with the embodiments of the present invention to explain the present invention, and do not constitute a limitation to the present invention. In the attached picture:

图1为本发明提出的一种多假设预测的压缩感知视频处理方法的的CVS编解码器的框图;Fig. 1 is the block diagram of the CVS codec of a kind of multi-hypothesis prediction compressive perception video processing method that the present invention proposes;

图2为一种多假设预测的压缩感知视频处理方法基于多预测假设生成边信息框图;Fig. 2 is a kind of compressive sensing video processing method of multi-hypothesis prediction based on multi-prediction assumption to generate side information block diagram;

图3为Akiyo序列的非关键帧采用两种算法在不同采样速率下得出的PSNR平均值;Figure 3 shows the average PSNR obtained by using two algorithms at different sampling rates for non-key frames of the Akiyo sequence;

图4为Coastguard序列的非关键帧采用两种算法在不同采样速率下得出的PSNR平均值;Figure 4 shows the average PSNR obtained by using two algorithms at different sampling rates for non-key frames of the Coastguard sequence;

图5为Foreman序列的非关键帧采用两种算法在不同采样速率下得出的PSNR平均值;Figure 5 shows the average PSNR obtained by using two algorithms at different sampling rates for non-key frames of the Foreman sequence;

图6为Stefan序列的非关键帧采用两种算法在不同采样速率下得出的PSNR平均值。Figure 6 shows the average PSNR of the non-key frames of the Stefan sequence using two algorithms at different sampling rates.

具体实施方式detailed description

下面结合具体实施例对本发明作进一步解说。The present invention will be further explained below in conjunction with specific embodiments.

在传统MH-BC-SPL结构中,压缩感测通过使用维度M×N的一些基底Φ测量维度N的信号x的投影来组合信号捕获和维数降低,其中M<<N。测量向量y被获得为:In the traditional MH-BC-SPL structure, compressive sensing combines signal capture and dimensionality reduction by measuring the projection of a signal x of dimension N using some basis Φ of dimension M×N, where M<<N. The measurement vector y is obtained as:

y=Φx (1)y=Φx (1)

其中x∈RN,y∈RM。如果在一些变换基Ψ中x足够稀疏,则通过y优化重构x,如下:where x∈R N , y∈R M . If x is sufficiently sparse in some transformation base Ψ, x is optimally reconstructed by y as follows:

其中Ψ和Φ充分不相干,M足够大。where Ψ and Φ are sufficiently irrelevant, and M is large enough.

压缩采样率定义为R=M/N。The compression sampling rate is defined as R=M/N.

通常,Φ是随机矩阵,使得它与任何选择的Ψ不相干。在实际应用中,大多数自然信号在任何变换基Ψ中都不是真正稀疏的。In general, Φ is a random matrix, making it incoherent to any choice of Ψ. In practical applications, most natural signals are not truly sparse in any transform basis Ψ.

然后,x的重构问题的可以由公式(2)变化为对于边界的等式:Then, the reconstruction problem of x can be changed from formula (2) to the equation for the boundary:

为了解决公式(3)中的松弛重建的问题,L.Gan提出了一种通过密集的Φ去除x的全局采样,并且用块对角测量矩阵来代替的一种基于块的CS(BCS)算法。当对于每个块使用相同的ΦB时,Φ可采取块对角线形式如下:In order to solve the problem of relaxed reconstruction in formula (3), L.Gan proposed a block-based CS (BCS) algorithm that removes the global sampling of x by dense Φ and replaces it with the block diagonal measurement matrix . When using the same ΦB for each block, Φ can take the block diagonal form as follows:

可以以逐块的方式yi=ΦBxi重写公式(1),其中xi是图像的块i。ΦB是大小为MB×B2的测量矩阵,BCS算法的测量速率为RB=MB/B2Equation (1) can be rewritten in a block-wise manner y iB x i , where xi is block i of the image. Φ B is a measurement matrix whose size is M B ×B 2 , and the measurement rate of the BCS algorithm is R B =M B /B 2 .

在本发明中,我们将使用改进BCS-SPL进行图形恢复重建。In this invention, we will use the improved BCS-SPL for graph recovery reconstruction.

1、基于多预测假设分布式压缩感知的视频编解码器结构框架1. A video codec structure framework based on multi-prediction assumptions and distributed compressed sensing

如图1所示,虚线左边代表编码端,右边代表解码端。在编码端分拆分视频流,分块关键帧与非关键帧。As shown in Figure 1, the left side of the dotted line represents the encoding end, and the right side represents the decoding end. Split the video stream at the encoding end, and block key frames and non-key frames.

让x1表示关键帧。在解码器处关键帧的图像重构的方法是:Let x 1 denote a keyframe. The method for image reconstruction of keyframes at the decoder is:

(1)采用基于平滑投影Landweber的分块CS图像重构方法(BCS-SPL)对关键帧x1进行图像初始重构;(1) Using the block CS image reconstruction method based on smooth projection Landweber (BCS - SPL) to perform initial image reconstruction on the key frame x1;

(2)通过对图像关键帧的测量值y1初始重构图像,通过多假设性预测获得预测帧 (2) Initially reconstruct the image through the measurement value y 1 of the key frame of the image, and obtain the predicted frame through multiple hypothesis prediction

(3)关于多假设预测生成x1之间信号残差 (3) Regarding the multi-hypothesis prediction generation x 1 and signal residual

(4)因为测量值y1可以简单通过关键帧信息x1与其测量矩阵Φ1的行向量內积获得,将残差信号R1映射到测量基中得到测量值D1(4) Because the measured value y 1 can be simply obtained by the inner product of key frame information x 1 and its measurement matrix Φ 1 , the measured value D 1 is obtained by mapping the residual signal R 1 into the measurement basis:

运用基于平滑投影Landweber的分块CS图像重构方法(BCS-SPL)算法重建测量值D1得到初始残差信号 Using the block CS image reconstruction method based on smooth projection Landweber (BCS-SPL) algorithm to reconstruct the measured value D 1 to obtain the initial residual signal

关键帧x1可以通过预测帧与初始残差信号残差信号R1近似表示 Keyframe x 1 can be predicted by frame Residual signal R 1 approximate representation with the initial residual signal

令x2表示非关键帧,它可以由关键帧x1产生的边信息进行解码。Let x2 denote a non-keyframe, which can be decoded by the side information produced by keyframe x1.

如图2所示,由多假设预测MH算法得出边信息SI,则非关键帧残差信号R2与非关键帧x2及边信息SI的关系是:As shown in Figure 2, the side information SI is obtained by the multi-hypothesis prediction MH algorithm, then the relationship between the non-key frame residual signal R 2 and the non-key frame x 2 and the side information SI is:

D2=Φ2R2=Φ2(x2-SI)=y22·SI (6)D 22 R 22 (x 2 -SI)=y 22 ·SI (6)

其中,Φ2和y2分别表示非关键帧x2的测量矩阵与测量值。Among them, Φ2 and y2 represent the measurement matrix and measurement value of the non - keyframe x2 , respectively.

同理,在运用BCS-SPL算法重建测量值D2到重建的残差边信息 Similarly, when using the BCS-SPL algorithm to reconstruct the measured value D 2 to the reconstructed residual side information

非关键帧x2可由边信息与残差边信息重建: Non-key frame x 2 can be reconstructed from side information and residual side information:

2、运用多预测假设在测量域估计边信息2. Using multiple prediction assumptions to estimate side information in the measurement domain

如图1中所示,非关键帧的图像重构质量极大地依赖于生成的边信息的质量。为了充分利用两个连续关键帧和非关键帧之间的相似性,提出的基于测量域中的MH的边信息生成算法如图2所示。As shown in Figure 1, the image reconstruction quality of non-keyframes strongly depends on the quality of generated side information. To make full use of the similarity between two consecutive keyframes and non-keyframes, the proposed side information generation algorithm based on MH in the measurement domain is shown in Fig. 2.

在时间域上相邻的两个重构的关键帧,Sn是非关键帧,算法如下:make with Two reconstructed key frames adjacent to each other in the time domain, Sn is a non-key frame, the algorithm is as follows:

(1)通过双向运动估算(BiME)得出初始边信息SI。(1) Obtain the initial side information SI by Bidirectional Motion Estimation (BiME).

(2)由初始边信息SI与测量出的非关键帧Sn做多预测假设,得出边信息SI0 (2) Make multiple prediction assumptions from the initial side information SI and the measured non-key frame Sn, and get the side information SI 0

(3)同理,由初始边信息SI与关键帧分别做多预测假设得出边信息SI1和SI2.。(3) Similarly, from the initial side information SI and the key frame with Side information SI 1 and SI 2 are obtained by making multiple prediction assumptions respectively.

(4)由获得的三个候选边信息SIi(i=0,1,2)分别与非关键帧Sn计算其相似性,选出相似性最高的SIi负责重构非关键帧。(4) Calculate the similarity between the obtained three candidate side information SI i (i=0, 1, 2) and the non-key frame Sn, and select the SI i with the highest similarity to reconstruct the non-key frame.

(5)采用相关系数函数r(y1,y2)来表示两帧之间的相关性,函数如下所示:(5) Use the correlation coefficient function r(y 1 ,y 2 ) to represent the correlation between two frames, the function is as follows:

其中y1和y2是分块测量的不同块的测量向量,N是测量向量的长度。where y1 and y2 are the measurement vectors of different blocks measured by the block, and N is the length of the measurement vector.

3、对于运用边信息SI做多假设预测重构非关键帧步骤,做如下详细解答。3. For the step of using side information SI to make multiple assumptions, predict and reconstruct non-key frames, give the following detailed answers.

(1)令x表示原始图像,表示预测图像。则关于x与之间的残差信号R可以表示为 (1) Let x denote the original image, Represents the predicted image. Then about x and The residual signal R between can be expressed as

(2)在测量域,残差信号R可由公式计算出。(2) In the measurement domain, the residual signal R can be given by the formula Calculate.

(3)通过一种图像压缩感知重建算法R(D,Φ),由公式得出近似重构图像因此重构图像恢复质量在很大层度上依赖于预测图像的精准度。(3) Through an image compression sensing reconstruction algorithm R(D,Φ), the formula get an approximate reconstructed image Therefore, the reconstructed image restoration quality depends heavily on the predicted image the accuracy.

预测与原始图像最相似的图像相似度的问题可以表示为:The problem of predicting the image similarity that is most similar to the original image can be formulated as:

其中p(Xref)是相邻的关键帧或通过运动估计生成的边信息。where p(X ref ) is adjacent keyframes or side information generated by motion estimation.

由于原始图像在解码器处未知,我们用近似来替换x,并且可以(8)可以重写为Since the original image is unknown at the decoder, we use approximate to replace x, and (8) can be rewritten as

近似图像可以转换到测量域并且计算为:approximate image can be converted to the measurement domain and computed as:

由于测量值y在解码器处可以测得,因此我们可以提高预测的准确度。等式(10)可以通过多假设预测来求解。Since the measurement y is available at the decoder, we can improve the prediction accuracy. Equation (10) can be solved by multi-hypothesis forecasting.

需要预测的每个块被认为是关键帧中的边信息或多个关键帧种块的最佳线性组合,记为 Each block that needs to be predicted is considered to be the side information in the key frame or the best linear combination of multiple key frame seed blocks, denoted as

其中其中ω是最优线性组合系数,是由所有候选块组成的矩B2×MB阵,M是假设预测块的总数,中列向量是每个假设预测块的列表示,将(10)代入(9)并得到:where ω is the optimal linear combination coefficient, is a matrix B 2 ×MB matrix composed of all candidate blocks, M is the total number of assumed prediction blocks, The middle column vector is the column representation of each hypothesized prediction block, substituting (10) into (9) and getting:

其中是惩罚项,λ是拉格朗日参数。Γ是对角矩阵表示为:which is Penalty term, λ is the Lagrangian parameter. Γ is a diagonal matrix expressed as:

h1,...,hk是的的列元素。对于每个块,则ω可以通过通常的Tikhonov解直接计算:h1,...,hk yes column elements. For each block, then ω can be directly computed by the usual Tikhonov solution:

通过将(13)代入(11),可以获得预测块最后,将所有预测块与边信息SI放在一起重建非关键帧。By substituting (13) into (11), the prediction block can be obtained Finally, all predicted blocks are put together with side information SI to reconstruct non-key frames.

验证实验:为了评估本发明提出的新框架及算法,在http://trace.eas.asu.edu/yuv/网站中做具有QCIF格式的标准测试视频序列检测实验。Verification experiment: In order to evaluate the new framework and algorithm proposed by the present invention, a standard test video sequence detection experiment with QCIF format is done in the http://trace.eas.asu.edu/yuv/ website.

关键帧的采样率为0.7,非关键帧的采样率为0.1至0.5;本实验中每个图像块B大小为16×16,参考帧相对应得图像搜索区域:空间窗口大小w(图像块及其周围)±15个像素范围内。The sampling rate of key frames is 0.7, and the sampling rate of non-key frames is 0.1 to 0.5; in this experiment, the size of each image block B is 16×16, and the image search area corresponding to the reference frame is: the spatial window size w (image block and its surroundings) within ±15 pixels.

使用本发明所提出的算法和原始MH-BCS-SPL算法,针对四个序列(即Akiyo,Coastguard,Foreman和Stefan)的不同采样速率测出平均PSNR性能的平均值,结果见表1。Using the algorithm proposed by the present invention and the original MH-BCS-SPL algorithm, the average value of the average PSNR performance is measured for the different sampling rates of the four sequences (ie Akiyo, Coastguard, Foreman and Stefan), and the results are shown in Table 1.

表1.采用平均PSNR(dB)描述的不同采样率下的非关键帧重构质量Table 1. Non-keyframe reconstruction quality at different sampling rates described by average PSNR (dB)

(单位:dB).(Unit: dB).

结论:表1中的数据描述了不同采样率下不同视频的非关键帧重建质量。本发明提出的算法和MH-BCS-SPL算法相比,重建质量有0.3-1dB的提高。对于运动平缓的Akiyo序列和运动不太剧烈的Coastguard序列,本发明提出的算法提高了1dB左右;对运动剧烈的Foreman和Stefan序列,本发明提出的算法提高0.3dB左右。Conclusions: The data in Table 1 describe the non-keyframe reconstruction quality of different videos at different sampling rates. Compared with the MH-BCS-SPL algorithm, the algorithm proposed by the invention has 0.3-1dB improvement in reconstruction quality. For the Akiyo sequence with gentle motion and the Coastguard sequence with less violent motion, the algorithm proposed by the present invention improves about 1dB; for the Foreman and Stefan sequences with severe motion, the algorithm proposed by the present invention improves about 0.3dB.

从图3-6中可以看出我们的改进MH-BCS-SPL框架在整个测试范围内提供更好的图像重建质量。对于具有快速或复杂运动的序列,例如Coastguard和Foreman序列,我们所提出的方法显示出显着的性能增益;对于具有低运动的Akiyo序列,性能也所提高。From Fig. 3-6, we can see that our improved MH-BCS-SPL framework provides better image reconstruction quality in the whole test range. For sequences with fast or complex motions, such as Coastguard and Foreman sequences, our proposed method shows significant performance gain; for Akiyo sequences with low motion, the performance is also improved.

以上所述,仅为本发明较佳的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,根据本发明的技术方案及其发明构思加以等同替换或改变,都应涵盖在本发明的保护范围之内。The above is only a preferred embodiment of the present invention, but the scope of protection of the present invention is not limited thereto, any person familiar with the technical field within the technical scope disclosed in the present invention, according to the technical solution of the present invention Any equivalent replacement or change of the inventive concepts thereof shall fall within the protection scope of the present invention.

Claims (5)

1. a kind of compressed sensing method for processing video frequency for assuming prediction more, the structural framing of the processing method include coding side and Decoding end, it is characterised in that:In the coding side, video sequence frame is divided into key frame and non-key frame, according to compressed sensing Theory, key frame and non-key frame obtain measured value by calculation matrix Φ;In the decoding end, key frame, which passes through, is based on block The Landweber algorithm for reconstructing smoothly projected is decoded, and after then being rebuild through excessive hypothesis prediction algorithm and residual error, obtains weight Key frame and storage after building;Non-key frame is carried out after residual error reconstruction, combines solution together with the side information produced according to key frame Code, the non-key frame after being rebuild.Finally, decoded key frame and non-key frame are integrated into video sequence according to frame sequential Arrange and export.
2. a kind of compressed sensing method for processing video frequency for assuming prediction according to claim 1, it is characterised in that described more Side information assumes that prediction MH algorithms are tried to achieve according to decoded adjacent key frame through excessive.
3. a kind of compressed sensing method for processing video frequency for assuming prediction according to claim 1, it is characterised in that described more The method and steps for assuming prediction MH algorithms as follows more:
(1) three candidate side information SIi (i=0,1,2) are calculated with bidirectional-movement estimation;
(2) coefficient correlation of non-key frame and three candidate side information is calculated respectively, chooses correlation highest SI information;
(3) in Image Reconstruction, in measurement field, a kind of signal residual error is generated using many hypothesis prediction of SI information, and calculate vacation If optimum linear combination, perceive MS-BCS-SPL technology reengineering images with improved multiple dimensioned splits' positions.
4. a kind of compressed sensing method for processing video frequency for assuming prediction according to claim 3 more, it is characterised in that according to Weight the optimum linear combination that regularization Tikhonov matrix computations are assumed.
5. a kind of compressed sensing method for processing video frequency for assuming prediction according to claim 1-4 are applied to video image more Processing.
CN201710375057.3A 2017-05-24 2017-05-24 A kind of compressed sensing method for processing video frequency for assuming prediction more Pending CN107155112A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710375057.3A CN107155112A (en) 2017-05-24 2017-05-24 A kind of compressed sensing method for processing video frequency for assuming prediction more

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710375057.3A CN107155112A (en) 2017-05-24 2017-05-24 A kind of compressed sensing method for processing video frequency for assuming prediction more

Publications (1)

Publication Number Publication Date
CN107155112A true CN107155112A (en) 2017-09-12

Family

ID=59793339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710375057.3A Pending CN107155112A (en) 2017-05-24 2017-05-24 A kind of compressed sensing method for processing video frequency for assuming prediction more

Country Status (1)

Country Link
CN (1) CN107155112A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108111861A (en) * 2017-12-25 2018-06-01 辽宁师范大学 Video elastic movement method of estimation based on 2bit depth pixels
CN108259916A (en) * 2018-01-22 2018-07-06 南京邮电大学 Best match interpolation reconstruction method in frame in a kind of distributed video compressed sensing
CN109040757A (en) * 2018-07-20 2018-12-18 西安交通大学 A kind of compressed sensing multilayer residual image coding method
CN110933429A (en) * 2019-11-13 2020-03-27 南京邮电大学 Video Compressed Sensing and Reconstruction Method and Device Based on Deep Neural Network
CN112616052A (en) * 2020-12-11 2021-04-06 上海集成电路装备材料产业创新中心有限公司 Method for reconstructing video compression signal
CN116634209A (en) * 2023-07-24 2023-08-22 武汉能钠智能装备技术股份有限公司 Breakpoint video recovery system and method based on hot plug

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103247028A (en) * 2013-03-19 2013-08-14 广东技术师范学院 Multi-hypothesis prediction block compressed sensing image processing method
CN105338357A (en) * 2015-09-29 2016-02-17 湖北工业大学 Distributed video compressed sensing coding technical method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103247028A (en) * 2013-03-19 2013-08-14 广东技术师范学院 Multi-hypothesis prediction block compressed sensing image processing method
CN105338357A (en) * 2015-09-29 2016-02-17 湖北工业大学 Distributed video compressed sensing coding technical method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张奔: "基于压缩感知的分布式视频编码技术研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108111861A (en) * 2017-12-25 2018-06-01 辽宁师范大学 Video elastic movement method of estimation based on 2bit depth pixels
CN108111861B (en) * 2017-12-25 2021-06-11 辽宁师范大学 Video elastic motion estimation method based on 2bit depth pixel
CN108259916A (en) * 2018-01-22 2018-07-06 南京邮电大学 Best match interpolation reconstruction method in frame in a kind of distributed video compressed sensing
CN108259916B (en) * 2018-01-22 2019-08-16 南京邮电大学 Best match interpolation reconstruction method in frame in a kind of distributed video compressed sensing
CN109040757A (en) * 2018-07-20 2018-12-18 西安交通大学 A kind of compressed sensing multilayer residual image coding method
CN109040757B (en) * 2018-07-20 2020-11-10 西安交通大学 Compressed sensing multilayer residual image coding method
CN110933429A (en) * 2019-11-13 2020-03-27 南京邮电大学 Video Compressed Sensing and Reconstruction Method and Device Based on Deep Neural Network
CN110933429B (en) * 2019-11-13 2021-11-12 南京邮电大学 Video compression sensing and reconstruction method and device based on deep neural network
CN112616052A (en) * 2020-12-11 2021-04-06 上海集成电路装备材料产业创新中心有限公司 Method for reconstructing video compression signal
CN116634209A (en) * 2023-07-24 2023-08-22 武汉能钠智能装备技术股份有限公司 Breakpoint video recovery system and method based on hot plug
CN116634209B (en) * 2023-07-24 2023-11-17 武汉能钠智能装备技术股份有限公司 Breakpoint video recovery system and method based on hot plug

Similar Documents

Publication Publication Date Title
CN107155112A (en) A kind of compressed sensing method for processing video frequency for assuming prediction more
KR101366242B1 (en) Method for encoding and decoding motion model parameter, and method and apparatus for video encoding and decoding using motion model parameter
KR100664929B1 (en) Method and apparatus for efficiently compressing motion vectors in multi-layered video coder
CN110519600B (en) Intra-frame and inter-frame joint prediction method and device, coder and decoder and storage device
US20060012719A1 (en) System and method for motion prediction in scalable video coding
KR20140037056A (en) Coding method and image coding device for the compression of an image sequence
JP2006174415A (en) Image decoding apparatus, image decoding program, image decoding method, image encoding apparatus, image encoding program, and image encoding method
CN101467456A (en) Method and apparatus for encoding/decoding fgs layers using weighting factor
KR20060109249A (en) Method and apparatus for predicting motion vector for video block using motion vector of base layer picture in encoding / decoding video signal
JPH08307874A (en) Video signal encoder
JP5689291B2 (en) Image decoding apparatus, image decoding method, image decoding program, image encoding apparatus, image encoding method, and image encoding program
KR20060088461A (en) Method and apparatus for deriving motion vector for video block from motion vector of base layer picture in encoding / decoding video signal
KR100694050B1 (en) Motion prediction method and device
CN114531596A (en) Image processing method and device
CN112601095A (en) Method and system for creating fractional interpolation model of video brightness and chrominance
Abou-Elailah et al. Fusion of global and local motion estimation using foreground objects for distributed video coding
KR101431046B1 (en) Video data encoding method of encoding video data for fruc, apparatus and storage medium thereof
JP2012513141A (en) Method and apparatus for predicting and encoding motion parameters of video picture sequences
CN108769675B (en) Distributed video self-adaptive reconstruction method based on two-order multi-hypothesis prediction
KR100703751B1 (en) Method and apparatus for encoding and decoding by referring to image of virtual region
Chen et al. Compressed video sensing with multi-hypothesis prediction
Li et al. Learning-based video compression framework with implicit spatial transform for applications in the internet of things
Ratnottar et al. Comparative study of motion estimation & motion compensation for video compression
KR20060069227A (en) Method and apparatus for deriving motion vector for video block from motion vector of base layer picture in encoding / decoding video signal
Ebrahim et al. Block Compressive Sensing (BCS) Based Multi-phase Reconstruction (MPR) Framework for Video

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170912

WD01 Invention patent application deemed withdrawn after publication