[go: up one dir, main page]

CN103065336A - Directional pyramid coding method of image - Google Patents

Directional pyramid coding method of image Download PDF

Info

Publication number
CN103065336A
CN103065336A CN2013100244321A CN201310024432A CN103065336A CN 103065336 A CN103065336 A CN 103065336A CN 2013100244321 A CN2013100244321 A CN 2013100244321A CN 201310024432 A CN201310024432 A CN 201310024432A CN 103065336 A CN103065336 A CN 103065336A
Authority
CN
China
Prior art keywords
pyramid
image
tree
important
frequency
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013100244321A
Other languages
Chinese (zh)
Other versions
CN103065336B (en
Inventor
孙继平
王洪俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Mining and Technology Beijing CUMTB
Original Assignee
China University of Mining and Technology Beijing CUMTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Mining and Technology Beijing CUMTB filed Critical China University of Mining and Technology Beijing CUMTB
Priority to CN201310024432.1A priority Critical patent/CN103065336B/en
Publication of CN103065336A publication Critical patent/CN103065336A/en
Application granted granted Critical
Publication of CN103065336B publication Critical patent/CN103065336B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a directional pyramid coding method of an image. By changing a data structure of a traditional set partitioning in hierarchical tree (SPIHT) algorithm, a directional pyramid of coding of the algorithm is possible to realize. Because a directional pyramid SPIHT algorithm is characterized by being high in redundancy and practical, the directional pyramid SPIHT algorithm is put forward and is preferable, and therefore rebuilding effects of the whole, especially rebuilding effects of image edges and other features are better improved. The algorithm can effectively code and compress images applied to the field of machine visions.

Description

图像方向金字塔的编码方法Coding Method of Image Orientation Pyramid

技术领域 technical field

本发明涉及图像压缩领域,具体地说,是涉及一种图像的方向金字塔压缩和编码的方法。  The invention relates to the field of image compression, in particular to a method for image directional pyramid compression and encoding. the

背景技术 Background technique

1.信号变换与图像金字塔  1. Signal transformation and image pyramid

信号是信息的载体。如何实现信号的有效表示是信号处理领域中的一个核心问题。从数学的观点看,图像是一个灰度值的二维矩阵。一幅图像中,通常看到的是相连接的纹理与灰度级相似的区域,它们相结合形成物体。如果物体尺寸很大或对比很强(图像的整体),通常只需要采用较低的分辨率(图1(a),(b));如果物体的尺寸很小或对比度不高(图像的细节),则需要采用较高的分辨率观察(图1(c),(d))。如果物体尺寸有大有小,或对比有强有弱的情况同时存在,以若干分辨率对它们进行研究将具有优势。当然这也就是多分辨率处理的魅力所在。  Signals are carriers of information. How to achieve effective representation of signals is a core issue in the field of signal processing. From a mathematical point of view, an image is a two-dimensional matrix of gray values. In an image, it is common to see regions of connected textures and similar gray levels that combine to form objects. If the object size is large or the contrast is strong (the whole of the image), usually only a lower resolution is required (Fig. 1(a), (b)); if the object size is small or the contrast is not high (the details of the image ), you need to use a higher resolution observation (Figure 1 (c), (d)). If objects vary in size, or in contrast, it is advantageous to study them at several resolutions. Of course, this is where the charm of multi-resolution processing lies. the

当信号的采样率满足Nyquist要求时,归一化频带必须限制在[-π,π]之间。此时可以分别用理想低通和理想高通滤波器L(ω)和H(ω)将它分解成(对正频率部分而言)频带在[0,π/2]的低频部分和频带在[π/2,π]的高频部分,分别反映信号的概貌和细节,如图2所示。因为频带之间不交叠,处理后两路输出信号正交。而且由于两种输出的带宽均减半,因此采样率可以减半而不致引起信息的丢失。这就是在滤波后可以引入“二抽取”的原因(图2中↓2符号表示“二抽取”操作),所谓二抽取就是将输入序列每隔一个输出一次,组成长度缩短一半的新序列。类似的过程对每次分解后的低频部分可重复进行下去,即:每一级分解把该级输入信号分解成一个低频的粗略逼近(概貌)和一个高频的细节部分。而且每级的输出采样率都可以再减半,这样就将原始信号x(n)进行了多分辨率分解。  When the sampling rate of the signal meets the Nyquist requirement, the normalized frequency band must be limited between [-π, π]. At this time, ideal low-pass and ideal high-pass filters L(ω) and H(ω) can be used to decompose it into (for the positive frequency part) the low-frequency part with the frequency band in [0, π/2] and the frequency band in [ π/2, π] The high-frequency part reflects the overview and details of the signal respectively, as shown in Figure 2. Because the frequency bands do not overlap, the two output signals are orthogonal after processing. And because the bandwidth of both outputs is halved, the sampling rate can be halved without loss of information. This is the reason why "double decimation" can be introduced after filtering (the symbol ↓2 in Figure 2 represents the "two decimation" operation). The so-called double decimation is to output the input sequence every other time to form a new sequence whose length is shortened by half. A similar process can be repeated for each decomposed low-frequency part, that is, each level of decomposition decomposes the input signal of this level into a low-frequency rough approximation (overview) and a high-frequency detail part. Moreover, the output sampling rate of each stage can be further halved, so that the original signal x(n) is decomposed into multiple resolutions. the

以多分辨率来解释图像的一种有效但概念简单的结构就是图像金字塔。图像金字塔最初用于机器视觉和图像压缩,一幅图像的金字塔就是一系列以金字塔形排列的分辨率逐步降低的图像集合。如图3所示,金字塔的底部是待处理图像的高分辨率表示,顶部是低分辨率的近似。当向金字塔的上层移动时,尺寸和分辨率降低。金字塔第J层的上面一层(即第(J-1)层)的尺寸为第J层的1/4。通常金字塔的低分辨率图像用于分析大的结构或图像的整体内容,而高分辨率图像用于分析单个物体的特性。这样的由粗糙到精细的分析策略在模式识别中特别适用。  An efficient but conceptually simple structure for interpreting images at multiple resolutions is the image pyramid. Image pyramids were originally used in machine vision and image compression. An image pyramid is a series of images that are arranged in a pyramid and whose resolution is gradually reduced. As shown in Figure 3, the bottom of the pyramid is a high-resolution representation of the image to be processed, and the top is a low-resolution approximation. As one moves to the upper levels of the pyramid, the size and resolution decrease. The size of the upper layer of the J-th layer of the pyramid (that is, the (J-1)th layer) is 1/4 of the J-th layer. Usually low-resolution images of pyramids are used to analyze large structures or the overall content of images, while high-resolution images are used to analyze the characteristics of individual objects. Such a coarse-to-fine analysis strategy is especially applicable in pattern recognition. the

金字塔结构是信号多尺度/多分辨率分析的重要方法。高斯金字塔(Guassian Pyramid)和拉普拉斯金字塔(Laplacian Pyramid)结构,主要用于数字图像的压缩编码算法。高斯金字塔是通过原图像不断向下采样而获得的,在频域相当于不断地低通滤波。不同层高斯金字塔在同一尺度下展开并相减,相减的量频域中相当于高通滤波,这样可以生成拉普拉斯金字塔。原则上从输入信号中去除低通平滑后的信号就可以得到该尺度(即该层)的细节信号,但是由于平滑信号的采样率比输入信号低,两信号不能直接相减。为了得到细节信号,需要对上一步的输出信号进行内插,内插M-1个像素点,并进行滤波,生成与输入信号等分辨率的图像,再与输入信号相减,得到细节信号。由于对平滑信号进行了插值运算,插值滤波器就决定了预测值与输入值之间的相似程度。为了重构输入信号,只要把平滑信号经过同样的低通滤波器处理得到近似信号,再和细节信号相加,就可以实现原始图像的完全重构。  Pyramid structure is an important method for signal multiscale/multiresolution analysis. Gaussian Pyramid (Guassian Pyramid) and Laplacian Pyramid (Laplacian Pyramid) structures are mainly used in compression coding algorithms for digital images. The Gaussian pyramid is obtained by continuously downsampling the original image, which is equivalent to continuous low-pass filtering in the frequency domain. Different layers of Gaussian pyramids are expanded and subtracted at the same scale, and the amount of subtraction is equivalent to high-pass filtering in the frequency domain, so that a Laplacian pyramid can be generated. In principle, the detail signal of this scale (that is, this layer) can be obtained by removing the low-pass smoothed signal from the input signal, but since the sampling rate of the smoothed signal is lower than that of the input signal, the two signals cannot be directly subtracted. In order to obtain the detail signal, it is necessary to interpolate the output signal of the previous step, interpolate M-1 pixels, and perform filtering to generate an image with the same resolution as the input signal, and then subtract it from the input signal to obtain the detail signal. Since the interpolation operation is performed on the smooth signal, the interpolation filter determines the similarity between the predicted value and the input value. In order to reconstruct the input signal, as long as the smooth signal is processed by the same low-pass filter to obtain an approximate signal, and then added to the detail signal, the original image can be completely reconstructed. the

2.小波变换  2. Wavelet transform

图像的二维正交小波变换是另一种与多分辨率分析相关的重要图像技术,也是图像金字塔的一种重要表现形式,它可以得到图像在水平、垂直和对角线方向上的细节信息。小波变换的应用范围较广,主要包括图像的压缩编码、图像恢复、图像去噪、特征值提取等方面。可分离的二维正交小波变换构成的多分辨率分析是应用最广的一类二维小波变换。  The two-dimensional orthogonal wavelet transform of the image is another important image technology related to multi-resolution analysis, and it is also an important form of image pyramid, which can obtain the detailed information of the image in the horizontal, vertical and diagonal directions . Wavelet transform has a wide range of applications, mainly including image compression coding, image restoration, image denoising, feature value extraction and so on. The multi-resolution analysis composed of separable two-dimensional orthogonal wavelet transform is the most widely used type of two-dimensional wavelet transform. the

从数字滤波器的角度看,图4给出了二维小波系数分解的过程。其中L和H分别表示一维低通和高通滤波器,下标x和y分别表示对矩阵沿行方向和列方向进行滤波。可分离情况的特点就是可以沿x和y两个方向分先后两步作处理(先对x方向进行滤波在对y方向进行滤波),由于进行带通滤波处理,带宽比原始图像的带宽小,子带可以进行无信息损失的抽样,因此每一级处理都要经过两次二抽取。其中LL经过x和y两个方向的低通,对应原始离散图像在下一尺度(即下一层)上的低频成分;LH经过x方向上的低通、y方向上的高通,对应原始图像水平方向的低频成分和垂直方向的高频成分;相应的,HL表示的是x方向的高频成分和y方向的低频成分;HH表示的是x和y两个方向的高频成分。图5给出了二维小波变换的频谱表示,将初始输入矩阵看做一个二维离散图像,经过一次分解后得到的四部分输出分别经过不同的滤波器,代表了原始图像的概貌(即低频)信息和垂直、水平、对角线三个方向的细节信息。经过一次小波变换后,总的输出数据量与输入数据量相同,只是根据频率信息不同,将各分量进行了分类,便于信号处理。如果将一次分解后的概貌部分继续进行小波分解,就可以得到离散图像的多尺度分析,即它在不同尺度上的细节和概貌。  From the perspective of digital filters, Figure 4 shows the process of two-dimensional wavelet coefficient decomposition. Among them, L and H represent one-dimensional low-pass and high-pass filters, respectively, and the subscripts x and y represent filtering the matrix along the row direction and column direction, respectively. The feature of the separable situation is that it can be processed in two steps along the x and y directions (first filter the x direction and then filter the y direction). Due to the band-pass filter processing, the bandwidth is smaller than that of the original image. Subbands can be sampled without loss of information, so each level of processing has to go through two binary samplings. Among them, LL passes through the low-pass in the x and y directions, corresponding to the low-frequency components of the original discrete image on the next scale (ie, the next layer); LH passes through the low-pass in the x-direction and the high-pass in the y-direction, corresponding to the original image level The low frequency components in the direction and the high frequency components in the vertical direction; correspondingly, HL represents the high frequency components in the x direction and the low frequency components in the y direction; HH represents the high frequency components in the x and y directions. Figure 5 shows the spectral representation of the two-dimensional wavelet transform. The initial input matrix is regarded as a two-dimensional discrete image, and the four parts of the output obtained after a decomposition pass through different filters respectively, representing the general appearance of the original image (that is, the low-frequency ) information and detailed information in vertical, horizontal and diagonal directions. After a wavelet transform, the total output data volume is the same as the input data volume, but each component is classified according to the frequency information, which is convenient for signal processing. If we continue to decompose the overview part after the first decomposition, we can get the multi-scale analysis of the discrete image, that is, its details and overview at different scales. the

同样,经过内插、滤波和叠加单个子带的处理,加上不同尺度上的细节信息,可以重构出任意细尺度上的概貌成分,直到最终完全重构出原始图像。  Similarly, after interpolation, filtering, and superposition of individual subbands, together with detailed information at different scales, the overview components at any fine scale can be reconstructed until the original image is finally completely reconstructed. the

3方向金字塔  3-way pyramid

离散正交小波变换因其良好的时频分析特性,成为多尺度信号和图像分析中常用的表示方法,但它存在的一个缺点是由于对平移参数均匀采样导致缺少平移不变性,当输入信号发生较小的位移变化时,小波子带系数能量会发生较大的变化。同时对于二维图像的旋转和尺度变化,小波变换系数能量也不稳定。小波变换存在的另一个缺点就是它的方向选择性有限,在每一个尺度空间只能被分解为水平、竖直和对角三个方向,很难满足图像对连续方向的要求。  Discrete orthogonal wavelet transform has become a commonly used representation method in multi-scale signal and image analysis because of its good time-frequency analysis characteristics, but it has a shortcoming that it lacks translation invariance due to uniform sampling of translation parameters. When the input signal occurs When the displacement changes small, the wavelet subband coefficient energy will change greatly. At the same time, for the rotation and scale changes of the two-dimensional image, the energy of the wavelet transform coefficients is also unstable. Another shortcoming of wavelet transform is that its direction selectivity is limited. In each scale space, it can only be decomposed into three directions: horizontal, vertical and diagonal. It is difficult to meet the continuous direction requirements of images. the

方向金字塔可以被看作是可以选择方向的拉普拉斯金字塔,是由Eero Simoncelli于1993年发明的。它是一种线性的多尺度多方向的图像表示框架,应用于图像处理和计算机视觉等领域。它能够将图像分解成不同尺度、多方向的一系列子带,不仅可以保持平移和旋转不变性,而且方向可控,从而相对于小波分析提供了更为丰富的方向信息。  The Orientation Pyramid can be seen as a Laplace Pyramid that can choose an orientation and was invented by Eero Simoncelli in 1993. It is a linear multi-scale and multi-directional image representation framework, which is applied in the fields of image processing and computer vision. It can decompose the image into a series of sub-bands of different scales and directions. It can not only maintain translation and rotation invariance, but also control the direction, thus providing more abundant direction information than wavelet analysis. the

图6给出了图像的一层方向金字塔分解和重构的系统框图,方向金字塔分解得到的子带都是极性可分的。其中H0(ω)为高通滤波器,L0(ω),L1(ω)为不同尺度的低通滤波器,Bi(ω),i=0,...,K为不同方向带通滤波器;↓2和↑2分别表示下采样和上采样的过程;X(ω)和

Figure BSA00000845759300031
分别为原始图像和重构图像;空心圆圈表示该系统的嵌套。由图6可知,首先该算法被分解为高通和低通两个子带,信号经过互补的高低通滤波器并且不进行下抽样计算。随后低频子带被进一步分解为一组带通子带的图像和一个(更)低通的子带图像。经过带通滤波器Bi(ω)处理得到的带通子带具有不同的方向性。把这些作为基函数子带,通过调整它们的频率响应的线性组合就可以得到任意方向的方向子带。带通子带不进行下抽样计算。实际应用中,可以设计个数K的基函数方向子带,这些基函数的方向分别为
Figure BSA00000845759300032
i=0,...,K-1,例如,若K=4,则基函数的4个方向分别为0°,45°,90°,135°。分解后的(更)低通子带进行下采样后再进行分解,重复以上过程,实现多分辨率分解算法。同样,经过内插、滤波和叠加每个子带的处理,可以恢复不同尺度上的细节信息,可以完全重构出原始图像。  Fig. 6 shows the system block diagram of one-layer directional pyramid decomposition and reconstruction of an image, and the subbands obtained by directional pyramid decomposition are polarity separable. Among them, H 0 (ω) is a high-pass filter, L 0 (ω), L 1 (ω) are low-pass filters of different scales, B i (ω), i=0, ..., K are bands in different directions pass filter; ↓2 and ↑2 represent the process of downsampling and upsampling respectively; X(ω) and
Figure BSA00000845759300031
Original and reconstructed images, respectively; open circles indicate the nesting of this system. It can be seen from Figure 6 that the algorithm is firstly decomposed into two sub-bands, high-pass and low-pass, and the signal passes through complementary high- and low-pass filters without down-sampling calculation. The low frequency subbands are then further decomposed into a set of bandpass subband images and a (more) low pass subband image. The band-pass sub-bands obtained by processing the band-pass filter B i (ω) have different directivities. Using these as basis function subbands, direction subbands in arbitrary directions can be obtained by adjusting the linear combination of their frequency responses. Bandpass subbands are not computed for downsampling. In practical applications, a number K of basis function direction subbands can be designed, and the directions of these basis functions are
Figure BSA00000845759300032
i=0, . . . , K−1, for example, if K=4, the four directions of the basis function are 0°, 45°, 90°, and 135° respectively. The decomposed (more) low-pass sub-bands are down-sampled and then decomposed, and the above process is repeated to realize the multi-resolution decomposition algorithm. Similarly, after interpolation, filtering, and superposition of each sub-band, detailed information at different scales can be recovered, and the original image can be completely reconstructed.

图7给出了方向金字塔三个尺度(即三个层)四个方向分解的频谱表示。其中频率轴为[-π,π]。在频域中,方向金字塔分解的各子带都是极性可分的。第一步预处理分解,高频子带对应于空间频率域的四个角,低频子带对应于大圆区域;第二步的分解过程,阴影区域对应其中一个方向的基函数带通子带;最中心的小圆区域对应于分解之后第三尺度上的最终的低频子带。  Fig. 7 shows the spectrum representation of the four direction decompositions of the three scales (ie three layers) of the direction pyramid. where the frequency axis is [-π, π]. In the frequency domain, each subband of the directional pyramid decomposition is polarity separable. In the first step of preprocessing decomposition, the high-frequency subbands correspond to the four corners of the spatial frequency domain, and the low-frequency subbands correspond to the great circle area; in the second step of the decomposition process, the shaded area corresponds to the basis function bandpass subband in one direction; The centermost small circle area corresponds to the final low-frequency subband on the third scale after decomposition. the

图8显示了对于一个输入(e),通过计算得到该图像的方向金字塔表示(a)-(d),并通过(a)-(d)重构出(f)的过程。图(a)-(c)为三尺度四方向(0°,45°,90°,135°)的方向金字塔分解的子带,(d)为低通量,高频量没有显示。  Fig. 8 shows that for an input (e), the direction pyramid representation (a)-(d) of the image is obtained through calculation, and the process of reconstructing (f) through (a)-(d). Figures (a)-(c) are the subbands of the three-scale four-direction (0°, 45°, 90°, 135°) direction pyramid decomposition, and (d) is the low flux, and the high frequency is not shown. the

方向金字塔主要的不足是它的过完备性,它的过完备度为

Figure BSA00000845759300041
其中K为方向子带的个数。这相当于,若输入一副6x6的图像,金字塔的方向个数为4,不限制分解的尺度个数(即层数),输出的方向金字塔将含有192个像素点,远超出原输入图像的36个像素点。尽管过完备性会限制金字塔算法的计算效率和储存成本,但也给许多图像处理方法提供了方便。  The main shortcoming of the direction pyramid is its over-completeness, and its over-completeness is
Figure BSA00000845759300041
where K is the number of direction sub-bands. This is equivalent to, if a 6x6 image is input, the number of directions of the pyramid is 4, and the number of decomposition scales (that is, the number of layers) is not limited, the output direction pyramid will contain 192 pixels, far exceeding the original input image. 36 pixels. Although over-completeness will limit the computational efficiency and storage cost of the pyramid algorithm, it also provides convenience for many image processing methods.

4基于嵌入式小波变换的图像编码方法  4 Image coding method based on embedded wavelet transform

由于小波变换的多分辨率性质,图像经小波变换后得到的系数在空域和频域都有良好的分布特性,因此各种基于小波变换的图像压缩技术取得了很大成功。目前比较有效的小波变换压缩方法有两种:一种是1993年由J.M.Shapiro根据一副图像的小波变换在不同级之间的相似性,提出了嵌入式零树小波编码方法(EZW:Embedded Zerotree Wavelet);另一种是1996年A.Said根据EZW算法的基本思想,提出了一种新的实现方法,即多级树集合分裂算法(SPIHT:Set Partitioning in Hierarchical Trees)。  Due to the multi-resolution nature of wavelet transform, the coefficients obtained after wavelet transform have good distribution characteristics in space and frequency domains, so various image compression techniques based on wavelet transform have achieved great success. At present, there are two effective wavelet transform compression methods: one is that in 1993, J.M.Shapiro proposed an embedded zero tree wavelet coding method (EZW: Embedded Zerotree) based on the similarity between different levels of wavelet transform of an image. Wavelet); the other is that in 1996, A. Said proposed a new implementation method based on the basic idea of the EZW algorithm, that is, the multilevel tree set splitting algorithm (SPIHT: Set Partitioning in Hierarchical Trees). the

EZW编码思想是基于不同级小波系数之间仍然有很强相关性的假设,这种相关性以小波树的父子系数关系表现出来。实际上,如果位于较低频率层的小波系数小于某一阈值,则位于较高频率层同方向和空间位置的小波系数小于该阈值的可能性极大。如果在较低频率层上的一个系数小于某阈值,而下一较高频率层以及更高频率层上该系数所对应的后代系数集内有若干系数大于该阈值,就把较低频率层上的这个系数定义为零树,然后阈值减半,再对图像扫描,如此往复下去,不断生成零树。在整个逐次逼近量化过程中,通过不断减半当前阈值,重复扫描和符号编码,直到满足目标比特率需要。  The idea of EZW coding is based on the assumption that there is still a strong correlation between wavelet coefficients at different levels, and this correlation is shown in the parent-child coefficient relationship of the wavelet tree. In fact, if the wavelet coefficients in the lower frequency layer are smaller than a certain threshold, the wavelet coefficients in the same direction and spatial position in the higher frequency layer are more likely to be smaller than the threshold. If a coefficient on the lower frequency layer is smaller than a certain threshold, and there are several coefficients in the next higher frequency layer and the descendant coefficient set corresponding to the coefficient on the higher frequency layer are greater than the threshold, the lower frequency layer This coefficient is defined as a zero tree, and then the threshold is halved, and then the image is scanned, and so on, and the zero tree is continuously generated. Throughout the successive approximation quantization process, scanning and symbol encoding are repeated by continually halving the current threshold until the target bitrate requirement is met. the

EZW是一种构建在小波变换基础上的嵌入式零树编码方案。其编码器可以将待编码的比特流按重要性的不同进行排序,根据目标码率或失真度大小要求随时结束编码;同样,对于给定码流解码器随时结束解码,并可以得到相应码流截断处的目标码率的重建图像。  EZW is an embedded zerotree coding scheme based on wavelet transform. Its encoder can sort the bit streams to be encoded according to the importance, and end the encoding at any time according to the target code rate or distortion level; similarly, for a given code stream, the decoder can end the decoding at any time, and can get the corresponding code stream The reconstructed image at the target bitrate where the truncation is performed. the

小波系数的分布特点是越往低频子带系数值越大,包含信息量越多,越往高频子带系数值越小,包含信息量越少。EZW算法充分利用了小波分解后系数分布特点,先传较低频的系数的重要比特,然后传输较高频系数的重要比特。  The distribution characteristics of the wavelet coefficients are that the lower the sub-band coefficient value is, the greater the information content is, and the higher the high-frequency sub-band coefficient value is, the smaller the information content is. The EZW algorithm makes full use of the distribution characteristics of the coefficients after wavelet decomposition, first transmits the important bits of the lower frequency coefficients, and then transmits the important bits of the higher frequency coefficients. the

EZW首先对零树结构进行了定义:任选一个小波变换后的图像系数Ci,j,对于给定门限T,如果有|Ci,j|≥T,则称小波系数Ci,j是重要系数,反之定义为不重要系数。如果一个小波系数在一个粗的尺度上关于给定门限T不重要,在较细的尺度上与该系数对应的同样空间位置 中的所有小波系数关于门限T也不重要,则称这些小波系数形成了一个零树,并规定粗尺度上的小波系数为父亲或零树根。较细尺度上相应位置上的小波系数为子孙。  EZW first defines the zero tree structure: choose an image coefficient C i, j after wavelet transformation, for a given threshold T, if |C i, j| ≥ T, then the wavelet coefficient C i, j is called An important coefficient, otherwise it is defined as an unimportant coefficient. If a wavelet coefficient is insignificant with respect to a given threshold T on a coarse scale, and all wavelet coefficients in the same spatial position corresponding to the coefficient are also insignificant with respect to the threshold T on a finer scale, these wavelet coefficients are said to form A zero tree is constructed, and the wavelet coefficients on the coarse scale are defined as the parent or root of the zero tree. The wavelet coefficients at corresponding positions on a finer scale are descendants.

SPIHT(Set Partitioning in Hierarchical Trees)算法也是通过构建零树来实现小波图像压缩的,利用渐进式传输的理论进行编码。渐进式传输理论是将数值的绝对值由大到小排列,然后将最重要的数值先传输,还原时图像的恢复质量将渐渐变好。SPIHT的输出是一连串比特流,并且SPIHT可以在任意点处停止传输而不影响解码的正确性。SPIHT使用的系数重要性的判断标准和EZW类似。它有阈值Tl,表示矩阵元素的最大值,但其采用的是位编码,先编码重要元素的高比特位,其编码参数nl=[log2Tl。  The SPIHT (Set Partitioning in Hierarchical Trees) algorithm also realizes wavelet image compression by constructing a zero tree, and uses the theory of progressive transmission for encoding. The progressive transmission theory is to arrange the absolute values of the values from large to small, and then transmit the most important values first, and the restored image quality will gradually become better during restoration. The output of SPIHT is a series of bit streams, and SPIHT can stop transmission at any point without affecting the correctness of decoding. The criterion of coefficient importance used by SPIHT is similar to that of EZW. It has a threshold T l , which represents the maximum value of matrix elements, but it adopts bit coding, first encodes high-order bits of important elements, and its coding parameter n l =[log 2 T l .

SPIHT使用叫做空间方向树的结构生成和分解系数集合,该结构利用了图像金字塔不同层间小波系数的空间关系:经验表明,金字塔每一层的子带都呈现出空间相似性,任何特别之处,如直线边缘或均匀区域,在所有层的同一位置都可以看到。  SPIHT generates and decomposes sets of coefficients using a structure called a spatially oriented tree, which exploits the spatial relationship of the wavelet coefficients at different levels of the image pyramid: experience shows that the subbands at each level of the pyramid exhibit spatial similarity, and any particular , such as straight edges or uniform areas, are visible at the same location on all layers. the

SPIHT使用的数据结构(即空间方向树)如图9所示。图中LL1和HL2标识分别代表第一层的LL子带和第二层的HL子带,其他标识以此类推。图中画除了第二层(高通)和第一层(低通)两层,每一层又分成四个子带(LL、LH、HL、HH),子带LL1又被分成四个系数组,其中阴影区域部分是位于左上角的那组。每组4个系数中的每一个(除了左上角的用黑色实心圆点所在的正方型区域外)都变成空间方向树的树根。箭头示意了这些数的不同层是如何关联在一起的,箭头指向的是该子带的孩子。图像中位置(i,j)的系数的孩子的位置是(2i,2j),(2i+1,2j),(2i,2j+1),(2i+1,2j+1)。由于靠近左上角的数据代表着重要的低频量,所以应该优先编码。  The data structure (i.e., spatial orientation tree) used by SPIHT is shown in Fig. 9. In the figure, LL1 and HL2 marks respectively represent the LL subband of the first layer and the HL subband of the second layer, and the other marks can be deduced by analogy. In addition to the second layer (high pass) and the first layer (low pass) in the figure, each layer is divided into four subbands (LL, LH, HL, HH), and the subband LL1 is divided into four coefficient groups. The shaded area is the group located in the upper left corner. Each of the 4 coefficients in each group (except the square area in the upper left corner where the black solid circle is located) becomes the root of the spatial orientation tree. The arrows show how the different layers of these numbers are related together, and the arrows point to the children of the subband. The positions of the children of the coefficients at position (i, j) in the image are (2i, 2j), (2i+1, 2j), (2i, 2j+1), (2i+1, 2j+1). Since the data near the upper left corner represents significant low-frequency volume, it should be encoded first. the

发明内容 Contents of the invention

本发明的目的在于,针对图像的方向金字塔数据结构极高的冗余性给传输和储存带来的困难,研究一种方法进行有效的编码存储。  The purpose of the present invention is to study a method for effective encoding and storage in view of the difficulties brought about by the extremely high redundancy of the image orientation pyramid data structure to transmission and storage. the

本发明的图像方向金字塔的编码方法的实现步骤如下:  The implementation steps of the encoding method of the image direction pyramid of the present invention are as follows:

第一步,将图像分解成任意给定方向个数和尺度个数的方向金字塔的形式,图像金字塔按照方向个数向和尺度个数分层储存;  The first step is to decompose the image into the form of a direction pyramid with any given number of directions and scales, and the image pyramid is stored in layers according to the number of directions and the number of scales;

第二步,可以任意给定在第一步分解完成的方向金字塔的一个方向基函数,在第三步中会对该方向所对应的子带优先的编码;  In the second step, a direction basis function of the direction pyramid decomposed in the first step can be arbitrarily given, and in the third step, the subband corresponding to the direction will be preferentially encoded;

第三步,采用SPIHT算法进行图像方向金字塔的低频和带频分量进行编码,但对低频和带频量采用不同的阈值。  In the third step, the SPIHT algorithm is used to encode the low-frequency and band-frequency components of the image-oriented pyramid, but different thresholds are used for the low-frequency and band-frequency quantities. the

本发明的有益效果是:  The beneficial effects of the present invention are:

1.通过改变传统SPIHT算法的传统数据结构,使得该算法编码方向金字塔成为可能。  1. By changing the traditional data structure of the traditional SPIHT algorithm, it is possible to encode the direction pyramid of the algorithm. the

2.针对方向金字塔这种冗余度高的数据结构,对SPIHT进行了优化,使得方向导数的编码和整体的重建效果有了较大的提升。  2. For the highly redundant data structure of the directional pyramid, SPIHT is optimized, which greatly improves the coding of the directional derivative and the overall reconstruction effect. the

3.可以选定一个优先编码方向,减少表达冗余信息的比特位数,从而达到更好的数据压缩效果。  3. A priority encoding direction can be selected to reduce the number of bits expressing redundant information, thereby achieving a better data compression effect. the

附图说明 Description of drawings

图1是自然图像及其局部变化直方图;  Figure 1 is a histogram of natural images and their local changes;

图2是频带的理想划分;  Figure 2 is an ideal division of frequency bands;

图3是图像金字塔的结构;  Figure 3 is the structure of the image pyramid;

图4是一级二维小波分解流程图;  Fig. 4 is a first-level two-dimensional wavelet decomposition flow chart;

图5是多级小波分解频谱图;  Fig. 5 is a multilevel wavelet decomposition spectrogram;

图6是拥有K个方向的方向金字塔系统框图;  Fig. 6 is a block diagram of the direction pyramid system with K directions;

图7是一个3层,4个方向的理想方向性分解的频谱图;  Figure 7 is a 3-layer, 4-direction ideal directional decomposition spectrum diagram;

图8是图像的金字塔分解和重构例子;原图像为(e),通过计算得到该图像的方向金字塔表示(a)-(d),通过(a)-(d)重构图像(f)。  Figure 8 is an example of pyramid decomposition and reconstruction of an image; the original image is (e), and the direction pyramid representation (a)-(d) of the image is obtained through calculation, and the image (f) is reconstructed through (a)-(d) . the

图9是SPIHT数据结构示意图;  Figure 9 is a schematic diagram of the SPIHT data structure;

图10是本发明的可编码图像方向金字塔的数据结构示意图;  Fig. 10 is a schematic diagram of the data structure of the codable image direction pyramid of the present invention;

图11是本发明的带有优先编码方向的数据结构的示意图;  Fig. 11 is the schematic diagram of the data structure with priority encoding direction of the present invention;

图12是本发明测试SPIHT算法采用的原图像(a),重构的图像(b),以及编码及量化后方向金字塔(c),第一排是第一层的带通量,第二排是第二层的带频分量;  Fig. 12 is the original image (a) used by the present invention to test the SPIHT algorithm, the reconstructed image (b), and the direction pyramid (c) after encoding and quantization, the first row is the band throughput of the first layer, the second row is the band-frequency component of the second layer;

图13是本发明测试的原图像(a),SPIHT重构效果(b),使用方向金字塔SPIHT算法重构效果(c),使用优先方向金字塔SPIHT算法重构效果(d),其中左面为重构的图像,右面为重构的方向金字塔,第一排是第一层的带通量,第二排是第二层的带通量;  Figure 13 is the original image (a) tested by the present invention, the SPIHT reconstruction effect (b), the reconstruction effect (c) using the direction pyramid SPIHT algorithm, and the reconstruction effect (d) using the priority direction pyramid SPIHT algorithm, where the left side is the reconstruction The reconstructed image, the right side is the reconstructed direction pyramid, the first row is the band flux of the first layer, and the second row is the band flux of the second layer;

图14是不同算法重构图像的效果比较;  Figure 14 is a comparison of the effect of reconstructing images with different algorithms;

图15是使用三种不同数据结构重构图像的效果比较,总体重构效果(a)以及对不同层,不同方向的方向导数的重构(b)。  Figure 15 is a comparison of the effects of reconstructing images using three different data structures, the overall reconstruction effect (a) and the reconstruction of directional derivatives for different layers and directions (b). the

具体实施方式 Detailed ways

为了使本发明技术方案的内容和优势更加清楚明了,以下结合附图,对本发明的针对图像方向金字塔的编码系统及方法进行进一步的详细说明。  In order to make the content and advantages of the technical solution of the present invention more clear, the encoding system and method for the image orientation pyramid of the present invention will be further described in detail below in conjunction with the accompanying drawings. the

本发明的针对方向金字塔图像分解的编码系统及方法,是一种基于SPIHT的图像方向金字塔的编码系统及方法,主要针对SPIHT在图像方向金字塔编码部分的空白,以及处理图像的过冗余表示方面的缺陷,基于图像方向金字塔数据结构特点,提出一种新的数据结构和一种结合该数据结构构成的图像方向金字塔的编码系统。  The coding system and method for image decomposition of direction pyramids of the present invention is a coding system and method for image direction pyramids based on SPIHT. Based on the defects of the data structure of the image orientation pyramid, a new data structure and a coding system for the image orientation pyramid combined with the data structure are proposed. the

从SPIHT的数据结构图(图9)可以看出,阴影区域(即最高层LL子带)的树只能沿着元素2,3,4对应的三个方向伸展,从而无法满足对编码任意个数的方向金字塔的要求。需要对这种数据结构进行修改。  From the data structure diagram of SPIHT (Figure 9), it can be seen that the tree in the shaded area (that is, the highest-level LL subband) can only be extended along the three directions corresponding to elements 2, 3, and 4, which cannot satisfy the requirements for encoding any number of elements. Number of orientation pyramid requirements. This data structure needs to be modified. the

本算法首先去掉对图像整体效果和特征提取贡献较小的高频部分,只编码低频和带频部分。图10是本发明的可编码图像方向金字塔的数据结构示意图,该数据结构突破了传统SPIHT编码将不同层系数都放在一个正方型的不同区域的数据结构,将不同的层、不同的方向的系数分别放置到大小不同的正方型中。并且为了方便表示,在图10中定义“第一层”为图像金字塔数据结构的低频分量(对应图7中圆L内部);“第二层”为图像金字塔频率最低的带频分量,对应图7中B1所在的圆环内部;“第三层”对应图7中B2所在的圆环内部,以此类推。对每一层,每一方向的系数逐一进行编码。低频量位于第一层,含有图像的重要信息,需要先进行编码。然后再编码第二层、第三层,以此类推。每一层对应的下一层均是该层的孩子,孩子系数储存在另一个正方型里。例如,第一层(低频部分)中左上部分系数在两个方向上的孩子用第二层中点填充的正方型表示。在所示的孩子中,右下角系数的孩子(也就是低频左上部分系数的孙子)用第三层中线填充的正方型表示。实际应用中,可以设计个数K的基函数方向子带,这些基函数的方向分别为i=0,...,K-1,例如,若K=4,则基函数的4个方向分别为0°,45°,90°,135°。该数据结构可以编码任意数量的方向带频分量,因此可以满足编码方向金字塔的需要。  This algorithm firstly removes the high-frequency part that contributes little to the overall image effect and feature extraction, and only codes the low-frequency and band-frequency parts. Fig. 10 is a schematic diagram of the data structure of the codable image orientation pyramid of the present invention. This data structure breaks through the data structure of traditional SPIHT coding that puts the coefficients of different layers in different areas of a square shape, and the data structure of different layers and different directions The coefficients are placed in squares of different sizes. And for convenience of representation, in Fig. 10, define " the first layer " to be the low-frequency component of image pyramid data structure (corresponding to the inside of circle L in Fig. 7); "Second layer" is the lowest band frequency component of image pyramid frequency, corresponding The inside of the ring where B1 is located in Figure 7; the "third layer" corresponds to the inside of the ring where B2 is located in Figure 7, and so on. For each layer, the coefficients in each direction are encoded one by one. The low-frequency volume is located in the first layer, which contains important information of the image and needs to be encoded first. Then encode the second layer, the third layer, and so on. The next layer corresponding to each layer is the child of this layer, and the child coefficient is stored in another square. For example, children of coefficients in the upper left part of the first layer (low frequency part) in two directions are represented by squares filled with dots in the middle of the second layer. In the children shown, the children of the coefficients in the lower right corner (that is, the grandchildren of the coefficients in the upper left part of the low frequency) are represented by squares filled with the middle line of the third layer. In practical applications, a number K of basis function direction subbands can be designed, and the directions of these basis functions are i=0, . . . , K−1, for example, if K=4, the four directions of the basis function are 0°, 45°, 90°, and 135° respectively. This data structure can encode any number of directional band-frequency components, so it can meet the needs of encoding directional pyramids.

图11是本发明的带有优先编码方向的数据结构的示意图。该数据结构改变原来的带频分量的数据结构关系,先选择一个优先编码的方向子带B(本例中为方向1)的作为金字塔的第二层,然后将其余的带频分量以及B的更高一层作为第三层,以此类推。实验表示,编码前事先选择优先方向,可以节约用来表示不重要信息的比特位。  Fig. 11 is a schematic diagram of the data structure with priority encoding direction of the present invention. This data structure changes the data structure relationship of the original band-frequency components, first selects a preferentially coded direction sub-band B (direction 1 in this example) as the second layer of the pyramid, and then uses the rest of the band-frequency components and B's A higher layer acts as the third layer, and so on. Experiments show that selecting the priority direction before encoding can save bits used to represent unimportant information. the

此外,针对方向金字塔分解的图像的方向带频部分的值会比低频分量小很多的特点,将低频分解的阈值与带频分解的阈值分开,使用两个阈值Tl、Th分别表示,Tl、Th分别表示低频 量和带频量的最大值。从而确定出两个编码参数

Figure BSA00000845759300081
Figure BSA00000845759300082
In addition, in view of the fact that the value of the directional band-frequency part of the image decomposed by the directional pyramid is much smaller than the low-frequency component, the threshold of the low-frequency decomposition is separated from the threshold of the band-frequency decomposition, using two thresholds T l and T h respectively, T l and T h represent the maximum value of the low-frequency amount and the band-frequency amount, respectively. Thus two encoding parameters are determined
Figure BSA00000845759300081
and
Figure BSA00000845759300082

最后,该算法生成一个比特位流(即一连串的比特位)表示该图像,如果接收的位流在任意点被中断,都可以无错误地解码并重构图像。  Finally, the algorithm generates a bitstream (ie, a sequence of bits) representing the image that can be decoded and reconstructed without error if the received bitstream is interrupted at any point. the

该算法引入的有序表和集合标记与SPIHT算法相同。编码时先选择一个优先编码的方向子带B的作为金字塔的第二层,然后将其余的带频分量以及B的更高一层作为第三层。  The ordered list and set marks introduced by this algorithm are the same as the SPIHT algorithm. When encoding, first select a direction sub-band B that is preferentially coded as the second layer of the pyramid, and then use the remaining frequency components and a higher layer of B as the third layer. the

我们称使用图10的数据结构的图像方向金字塔编码算法为方向金字塔SPIHT算法,把使用图11的数据结构的图像方向金字塔编码算法为优先方向金字塔SPIHT算法。  We call the image orientation pyramid encoding algorithm using the data structure in Figure 10 the orientation pyramid SPIHT algorithm, and the image orientation pyramid encoding algorithm using the data structure in Figure 11 as the priority orientation pyramid SPIHT algorithm. the

由于方向金字塔SPIHT算法和优先方向金字塔SPIHT算法只有数据结构的不同,具体实现的伪代码都是相同的。它们的伪代码如下:  Since the directional pyramid SPIHT algorithm and the priority directional pyramid SPIHT algorithm are only different in data structure, the pseudo codes for specific implementation are the same. Their pseudo codes are as follows:

(1)初始化  (1) Initialization

输入初始两个阈值Tl、Th,分别表示低频量和带频量的最大值。从而确定出两个编码参数nl和nh;  Input two initial thresholds T l and Th h , respectively representing the maximum value of the low frequency amount and the band frequency amount. Thereby, two encoding parameters n l and n h are determined;

初始坐标集为LIP={(r,c)|(r,c)∈H},LIS={Dr,c|(r,c)∈H}。  The initial coordinate set is LIP={(r,c)|(r,c)∈H}, LIS={D r,c |(r,c)∈H}.

(2)排序扫描  (2) sort scan

1)扫描LIP队列  1) Scan the LIP queue

对LIP队列的每个表项(r,c)判断重要性;  Determine the importance of each entry (r, c) of the LIP queue;

若(r,c)重要  If (r, c) is important

向Sn输出‘r’和(r,c)的符号位;  Output the sign bit of 'r' and (r, c) to Sn;

将(r,c)从LIP队列中删除,添加到LSP队列的尾部。  Delete (r, c) from the LIP queue and add it to the end of the LSP queue. the

若(r,c)不重要  If (r, c) is not important

向Sn输出‘0’。  Output '0' to Sn. the

2)扫描LIS队列  2) Scan the LIS queue

若是‘D’型表项,即Dr,c,判断重要性;  If it is a 'D' type entry, that is, D r, c , determine the importance;

若Dr,c重要  If D r, c is important

向Sn输出‘1’;  Output '1' to Sn;

对每个(rO,cO)∈Or,r,判断重要性  For each (rO, cO) ∈ O r, r , judge the importance

若重要,向Sn输出‘1’和(rO,cO)的符号位,并将Or,c添到LIS尾部;  If important, output the sign bit of '1' and (rO, cO) to Sn, and add O r, c to the end of LIS;

若不重要,则将(rO,cO)添加到LIP的尾部。  If not important, add (rO, cO) to the end of LIP. the

判断Lr,c是否为空集  Determine whether L r, c is an empty set

若非空,则将Lr,c添加到LIS的尾部;  If not empty, add L r, c to the end of LIS;

若为空集,则将Dr,c从LIS中删除。  If it is an empty set, delete D r, c from LIS.

若Dr,c不重要  If D r, c is not important

向Sn输出‘0’。  Output '0' to Sn. the

若是‘L’型表项,即Lr,c,判断重要性;  If it is an 'L' type entry, that is, L r, c , judge the importance;

若Lr,c重要  If L r, c is important

向Sn输出‘1’并将DrO,cO添加到LIS的尾部,将Lr,c从LIS中删除;  Output '1' to Sn and add D rO, cO to the tail of LIS, remove L r, c from LIS;

若Lr,c不重要  If L r, c is not important

向Sn输出‘0’。  Output '0' to Sn. the

(3)精细扫描  (3) Fine scanning

将LSP中系数的绝对值转换为二进制表示并输出第n个最重要的位到Rn。  Convert the absolute values of the coefficients in LSP to binary representation and output the nth most significant bit to Rn. the

(4)更新阈值指数  (4) Update the threshold index

将阈值指数nl减至nl-1,nh减至nh-1,返回到步骤(2)进行下一级编码扫描。  Reduce the threshold index n l to n l -1, n h to n h-1 , and return to step (2) for the next level of encoding scanning.

实际效果:  actual effect:

由于方向金字塔是过完备的,为防止误解,我们不用压缩率作为实施和评价指标,而是限制其位流的比特数。我们首先使用标准SPIHT算法利用107比特对测试图像Girl(256×256)图12(a)进行编码得到的重构的图像(图12(b))与相应的方向金字塔(图12(c))。  Because the direction pyramid is over-complete, in order to prevent misunderstanding, we do not use the compression rate as the implementation and evaluation index, but limit the number of bits of its bit stream. We first use the standard SPIHT algorithm to encode the test image Girl (256×256) in Figure 12(a) with 10 7 bits to obtain the reconstructed image (Figure 12(b)) and the corresponding orientation pyramid (Figure 12(c) ).

编码后的比特分配为:RnList:25930比特;SnList:975672比特。其中总比特数比107多了一些是由于比特流的分隔符造成的。可见到图像的恢复效果不错,但是耗费的比特太多。  The encoded bit allocation is: RnList: 25930 bits; SnList: 975672 bits. The fact that the total number of bits is more than 10 7 is caused by the delimiter of the bit stream. It can be seen that the restoration effect of the image is good, but it consumes too many bits.

使用的测试图像Lena(64×64)(图13(a)),经计算Tl=816,Th=105。对三种算法:SPIHT算法,方向金字塔SPIHT算法和优先方向金字塔SPIHT算法分别进行测试,利用不同的算法均使用5×104比特进行编码,得到的重构图像与方向金字塔如图13(b)-(d)所示,其中算法II的优先方向为0°。由于0°为优先编码的方向,所以与其他方向相比,使用算法II对0°的重建效果比使用算法I有了更大提升,图像整体的重建效果也有了很大程度的提高。  The used test image Lena (64×64) ( FIG. 13( a )) is calculated to have T l =816 and T h =105. Three algorithms: SPIHT algorithm, directional pyramid SPIHT algorithm and priority directional pyramid SPIHT algorithm are tested respectively, and different algorithms are used to encode with 5×10 4 bits. The reconstructed image and directional pyramid are shown in Figure 13(b) - As shown in (d), where the priority direction of Algorithm II is 0°. Since 0° is the direction of priority encoding, compared with other directions, the reconstruction effect of 0° using Algorithm II has a greater improvement than that of Algorithm I, and the overall reconstruction effect of the image has also been greatly improved.

对三种算法对Lena(64×64)分别使用不同的比特开销进行比较(比特开销从5×103bit直到8×104bit),使用算法II的优先的编码方向为0°。具体的PSNR曲线见图14。可以看到,算法I,算法II相对SPIHT对于方向金字塔的重建效果,尤其是对于方向导数的重建效果的提升是显著的。另外,由于0°为优先方向,算法II优先将其编码,所以可以发现低比特率时算法II对0°方向的重构效果要明显高于测试的其他数据结构。  Comparing the three algorithms with different bit overheads for Lena (64×64) (the bit overheads range from 5×10 3 bit to 8×10 4 bits), the preferred coding direction for Algorithm II is 0°. The specific PSNR curve is shown in Figure 14. It can be seen that compared with SPIHT, Algorithm I and Algorithm II have significantly improved the reconstruction effect of the direction pyramid, especially the reconstruction effect of the direction derivative. In addition, since 0° is the priority direction, Algorithm II encodes it first, so it can be found that the reconstruction effect of Algorithm II on the 0° direction is significantly higher than other data structures tested at low bit rates.

Claims (4)

1.一种图像方向金字塔的编码方法,其特征在于,包括以下步骤:1. an encoding method of an image direction pyramid, is characterized in that, comprises the following steps: A.将图像分解成任意给定方向个数和尺度个数的金字塔的形式,对个数K的基函数方向子带,这些基函数的方向分别为
Figure FSA00000845759200011
i=0,...,K-1;
A. Decompose the image into pyramids with any given number of directions and scales. For the direction subbands of the basis functions with the number K, the directions of these basis functions are respectively
Figure FSA00000845759200011
i=0,...,K-1;
B.图像金字塔按照方向个数和尺度个数分层储存;B. The image pyramid is stored hierarchically according to the number of directions and the number of scales; C.从步骤A中的K个基函数方向中确立一个优先编码的基函数方向;C. establish the basis function direction of a priority encoding from the K basis function directions in step A; D.采用优先方向金字塔SPIHT算法,对低频和带频分量采用不同的阈值;D. Using the priority direction pyramid SPIHT algorithm, using different thresholds for low-frequency and band-frequency components; E.利用位流进行编码,生成一个比特位流表示该图像的低频量和带频量。E. Using the bit stream to encode, generating a bit stream to represent the low frequency and band frequency of the image.
2.根据权利要求1所述的方法,在步骤A把所述的方向金字塔把图像分解成高频量、低频量和不同层的若干带频分量;生成方向金字塔的每一层需要下列滤波器:高通滤波器,不同尺度的低通滤波器,不同尺度、不同方向的带通滤波器。2. method according to claim 1, in step A described direction pyramid is decomposed into some band frequency components of high frequency amount, low frequency amount and different layers; Each layer of generation direction pyramid needs following filter : High-pass filter, low-pass filter of different scales, band-pass filter of different scales and directions. 3.根据权利要求1所述的方法,所述的优先方向金字塔SPIHT算法引入了两个阈值Tl、Th;使用三个有序表来存放重要信息:重要系数表LSP;不重要系数表LIP;不重要子集表LIS;并且引入下面的集合符号:所有树根的坐标集H,为金字塔的低频以及最接近低频段的一层;树为树根的子孙且树必须拥有孩子;节点(r,c)所有孩子的集合Or,c;节点(r,c)所有子孙的集合Dr,c,称为D类树;节点(r,c)所有非直系子孙的集合Lr,c,称为L类树;如果上述的树集合中有至少一个系数是重要的,则称该树是重要的;SPIHT的嵌入位流分为排序位流Sn和精细位流Rn。3. the method according to claim 1, described priority direction pyramid SPIHT algorithm has introduced two thresholds T l , T h ; Use three ordered tables to store important information: important coefficient table LSP; Unimportant coefficient table LIP; the unimportant subset table LIS; and introduce the following collection symbols: the coordinate set H of all tree roots is the low frequency of the pyramid and the layer closest to the low frequency band; the tree is the descendant of the tree root and the tree must have children; node (r, c) the set O r of all children, c ; the set D r, c of all descendants of the node (r, c ), which is called a D-type tree; the set L r of all non-direct descendants of the node (r, c), c is called L-type tree; if at least one coefficient in the above-mentioned tree set is important, the tree is said to be important; the embedded bit stream of SPIHT is divided into sorted bit stream Sn and refined bit stream Rn. 4.根据权利要求3的方法,其中所述的优先方向金字塔SPIHT算法的步骤如下:4. according to the method for claim 3, the step of wherein said preferential direction pyramid SPIHT algorithm is as follows: 第一步:设定阈值Tl、Th;令LIP为所有树根坐标集的系数;LIS为D类树;LSP为空集;The first step: set the threshold T l , T h ; let LIP be the coefficient of all tree root coordinate sets; LIS be the D-type tree; LSP be the empty set; 第二步:检测LIP中所有系数的重要性;如果重要,则输出1和符号位,并将系数移入LSP;如果不重要,则输出0;Step 2: Detect the importance of all coefficients in LIP; if important, output 1 and sign bit, and move the coefficients into LSP; if not important, output 0; 第三步:根据树的类型检查LIS中所有树的重要性,具体如下:Step 3: Check the importance of all trees in the LIS according to the type of the tree, as follows: 对于D类树:For a class D tree: 如果重要,则输出1,并编码它的孩子;output 1 if it matters, and encode its children; 如果孩子重要,则输出1,再输出一位比特表示该系数的符号,并将这个孩子加入到LSP中;If the child is important, output 1, and then output a bit to represent the sign of the coefficient, and add this child to the LSP; 如果孩子不重要,则输出0,将其加到LIP的末尾;If the child is not important, output 0, add it to the end of LIP; 如果该孩子有子孙,则将该D类树更改为L类树移到LIS的最后,否则从LIS中删除;If the child has descendants, change the D-type tree to an L-type tree and move it to the end of the LIS, otherwise delete it from the LIS; 如果不重要,则输出0;If not important, output 0; 对于L类树:For L-like trees: 如果重要,则输出1,并将每个孩子的D类树加到LIS末尾,并从LIS中移去原来的L类树;If it is important, output 1, and add each child's D-type tree to the end of the LIS, and remove the original L-type tree from the LIS; 如果不重要,则输出0;If not important, output 0; 第四步:改变阈值Tl、Th,判定是否符合停机条件,如果不符合则转移到第二步。Step 4: Change the thresholds T l and T h to determine whether the shutdown condition is met, and if not, move to the second step.
CN201310024432.1A 2013-01-23 2013-01-23 Directional pyramid coding method of image Active CN103065336B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310024432.1A CN103065336B (en) 2013-01-23 2013-01-23 Directional pyramid coding method of image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310024432.1A CN103065336B (en) 2013-01-23 2013-01-23 Directional pyramid coding method of image

Publications (2)

Publication Number Publication Date
CN103065336A true CN103065336A (en) 2013-04-24
CN103065336B CN103065336B (en) 2015-05-27

Family

ID=48107953

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310024432.1A Active CN103065336B (en) 2013-01-23 2013-01-23 Directional pyramid coding method of image

Country Status (1)

Country Link
CN (1) CN103065336B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109308723A (en) * 2017-07-26 2019-02-05 合肥图鸭信息科技有限公司 A kind of method for compressing image and device
CN109308724A (en) * 2017-07-26 2019-02-05 合肥图鸭信息科技有限公司 A kind of method for compressing image and device
CN111861931A (en) * 2020-07-27 2020-10-30 北京金山云网络技术有限公司 Model training, image enhancement method and device, electronic device and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101594532A (en) * 2009-06-23 2009-12-02 上海电力学院 Region Growth Compression Method Based on Quadtree Partition
US20100046849A1 (en) * 2008-08-20 2010-02-25 Microsoft Corporation Image Restoration By Vector Quantization Utilizing Visual Patterns
CN102637302A (en) * 2011-10-24 2012-08-15 北京航空航天大学 Image coding method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100046849A1 (en) * 2008-08-20 2010-02-25 Microsoft Corporation Image Restoration By Vector Quantization Utilizing Visual Patterns
CN101594532A (en) * 2009-06-23 2009-12-02 上海电力学院 Region Growth Compression Method Based on Quadtree Partition
CN102637302A (en) * 2011-10-24 2012-08-15 北京航空航天大学 Image coding method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王相海等: ""基于DCT分层结构的遥感图像分级多描述编码算法"", 《遥感学报》 *
白静等: ""基于提升方向波变换域的SAR图像压缩"", 《红外与毫米波学报》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109308723A (en) * 2017-07-26 2019-02-05 合肥图鸭信息科技有限公司 A kind of method for compressing image and device
CN109308724A (en) * 2017-07-26 2019-02-05 合肥图鸭信息科技有限公司 A kind of method for compressing image and device
CN111861931A (en) * 2020-07-27 2020-10-30 北京金山云网络技术有限公司 Model training, image enhancement method and device, electronic device and storage medium

Also Published As

Publication number Publication date
CN103065336B (en) 2015-05-27

Similar Documents

Publication Publication Date Title
CN110087092B (en) Low-bit-rate video coding and decoding method based on image reconstruction convolutional neural network
DE19534943A1 (en) Embedded wavelet type data compression device e.g. for facsimile transmission of documents
CN108810534B (en) Image compression method based on direction boosting wavelet and improved SPIHT under the Internet of Things
CN105825530B (en) Distributed lossy encoding and decoding method for coastal hyperspectral images based on region of interest
CN113962882B (en) JPEG image compression artifact eliminating method based on controllable pyramid wavelet network
CN113706641A (en) Hyperspectral image compression method based on space and spectral content importance
CN103065336B (en) Directional pyramid coding method of image
CN112991169A (en) Image compression method and system based on image pyramid and generation countermeasure network
CN1284120C (en) Synthetic aperture radar complex numeric image data real time automatic compression method
JP2743208B2 (en) Image complement coding device
JP4726040B2 (en) Encoding processing device, decoding processing device, encoding processing method, decoding processing method, program, and information recording medium
Nagamani et al. EZW and SPIHT image compression techniques for high resolution satellite imageries
CN114449248B (en) Spherical image compression method based on spherical wavelet transformation
Singh et al. Performance comparison of arithmetic and huffman coder applied to ezw codec
EP2254338B1 (en) Coding and decoding methods and devices, computer program and information carrier enabling the implementation of such methods
Rashid et al. Sub-band coding of hexagonal images
Shu et al. Arbitrarily shape object-based video coding technology by wavelet-based contourlet transform
Tyagi et al. Enhanced EZW Technique for Compression of Image by Setting Detail Retaining Pass Number
Senthilkumar et al. A performance analysis of EZW, SPIHT wavelet based compressed images
Zhao Maximum Energy Subsampling: A General Scheme For Multi-resolution Image Representation And Analysis
Rammohan et al. An Efficient Image Compression Technique with Dead Zone Quantization Through Wavelet-Based Contourlet Transform with Modified SPIHT Encoding
CN119277092A (en) Dual-branch remote sensing image compression method guided by global context features
CN116320458A (en) Distillation training method and system for deep learning image coding and decoding network
Huo et al. Beamlet coder: A tree-based, hierarchical contour representation and coding method
Tamboli et al. Weighted denoising with multispectral decomposition for image compression

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant