JPH0817492B2

JPH0817492B2 - Variable rate video coding device

Info

Publication number: JPH0817492B2
Application number: JP21923690A
Authority: JP
Inventors: 勝政恩田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1990-08-20
Filing date: 1990-08-20
Publication date: 1996-02-21
Anticipated expiration: 2011-02-21
Also published as: JPH04100494A

Description

【発明の詳細な説明】産業上の利用分野本発明は、デレビ電話やテレビ会議システム等に使用
する可変レート動画像符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a variable rate moving image coding apparatus used for a Delevi telephone, a video conference system or the like.

従来の技術第４図は、従来の可変レート動画像符号化装置の構成
を示している。第４図において、40はフレーム単位に入
力される動画像信号、40aは、この動画像信号と、後述
するフレームメモリ48に蓄えられた前フレーム再生信号
に動き補償等を施すことにより得た、次のフレームであ
る現フレームの予測信号とが入力される減算器であり、
これらフレーム単位の動画像信号40と現フレームの予測
信号との減算を行って、Ｎ×Ｎの予測誤差信号を出力す
る。2. Description of the Related Art FIG. 4 shows the configuration of a conventional variable rate moving image coding apparatus. In FIG. 4, 40 is a moving image signal input in frame units, 40a is obtained by subjecting this moving image signal and a previous frame reproduction signal stored in a frame memory 48 described later to motion compensation and the like, A subtractor to which the predicted signal of the current frame, which is the next frame, is input,
The frame-by-frame moving image signal 40 is subtracted from the current frame prediction signal to output an N × N prediction error signal.

41は、減算器40aから入力されるＮ×Ｎの予測誤差信
号を離散コサイン変換して、変換係数を出力する離散コ
サイン変換回路、42は、離散コサイン変換回路41から入
力される変換係数を量子化して、Ｎ×Ｎの量子化代表値
を出力する量子化回路である。Reference numeral 41 denotes a discrete cosine transform circuit that performs a discrete cosine transform on the N × N prediction error signal input from the subtractor 40a and outputs a transform coefficient. Reference numeral 42 denotes a transform coefficient input from the discrete cosine transform circuit 41. Is a quantization circuit that outputs the N × N quantized representative value.

また、43は、量子化回路42からの量子化代表値が入力
され、予測誤差信号の低周波成分を含む部分（以下、MS
Pと記す）を分離して可変レート高優先度チャネル44で
伝送すると共に、予測誤差信号の高周波成分を含む部分
（以下、LSPと記す）を分離して、可変レート低優先度
チャネル45で伝送する階層化回路である。Further, 43 is a portion to which the quantized representative value from the quantization circuit 42 is input and which includes a low frequency component of the prediction error signal (hereinafter, referred to as MS
P) is separated and transmitted on the variable rate high priority channel 44, and the part containing the high frequency component of the prediction error signal (hereinafter referred to as LSP) is separated and transmitted on the variable rate low priority channel 45. It is a hierarchical circuit.

46は、可変レート高優先度チャネル44を介して階層化
回路43から入力されるMSPを逆量子化する、局部複号の
ための逆量子化回路、47は、逆量子化回路46から入力さ
れる、逆量子化された変換係数を逆離散コサイン変換す
る逆離散コサイン変換回路であり、これも局部復号のた
めのものである。46 is a dequantization circuit for local decoding, which dequantizes the MSP input from the layering circuit 43 via the variable rate high priority channel 44, and 47 is input from the dequantization circuit 46. Is an inverse discrete cosine transform circuit for performing inverse discrete cosine transform on the inversely quantized transform coefficient, which is also for local decoding.

40bは、逆離散コサイン変換回路47から出力される逆
離散コサイン変換された変換係数と、先に前記フレーム
メモリ48から出力された前フレーム再生信号に動き補償
等を施して得た現フレームの予測信号とを加算して、現
フレーム再生信号としてフレームメモリ48に出力する加
算器である。40b is the inverse discrete cosine transform output from the inverse discrete cosine transform circuit 47, and the prediction of the current frame obtained by subjecting the previous frame playback signal output from the frame memory 48 to motion compensation and the like. It is an adder that adds the signal and outputs it to the frame memory 48 as a current frame reproduction signal.

次に、上記従来例の動作について説明する。第４図に
おいて、フレームメモリ48に蓄えられた前フレーム再生
信号には動き補償等が施され、これにより、現フレーム
の予測信号が得られる。この現フレームの予測信号とフ
レーム単位の動画像信号40とは減算器40aで減算されて
予測誤差信号となり、以下、Ｎ×Ｎの大きさのブロック
毎に処理が施される。Next, the operation of the above conventional example will be described. In FIG. 4, motion compensation or the like is applied to the previous frame reproduction signal stored in the frame memory 48, whereby a prediction signal of the current frame is obtained. The prediction signal of the current frame and the moving image signal 40 in frame units are subtracted by a subtractor 40a to become a prediction error signal, and hereinafter, processing is performed for each block of size N × N.

このＮ×Ｎの予測誤差信号は、離散コサイン変換回路
41において離散コサイン変換されて変換係数となり、量
子化回路42に入力される。This N × N prediction error signal is a discrete cosine transform circuit.
In 41, discrete cosine transform is performed to obtain transform coefficients, which are input to the quantization circuit 42.

量子化回路42は、離散コサイン変換された変換係数を
量子化し、その量子化代表値は階層化回路43に入力され
る。階層化回路43は、Ｎ×Ｎの量子化代表値を予測誤差
信号のMSPと、予測誤差信号のLSPに分離し、MSPは可変
レート高優先度チャネル44で、LSPは可変レート低優先
度チャネル45で伝送する。The quantization circuit 42 quantizes the transform coefficient obtained by the discrete cosine transform, and the quantized representative value is input to the layering circuit 43. The layering circuit 43 separates the N × N quantized representative value into the MSP of the prediction error signal and the LSP of the prediction error signal, where MSP is the variable rate high priority channel 44 and LSP is the variable rate low priority channel. Transmit at 45.

この可変レート高優先度チャネル44と可変レート低優
先度チャネル45とは、共に任意の伝送レートに対応可能
な可変レートチャネルであり、次世代通信網として現在
研究されている広帯域ISDN（ディジタルサービス総合ネ
ットワーク）で実現される。The variable rate high-priority channel 44 and the variable rate low-priority channel 45 are both variable rate channels capable of supporting arbitrary transmission rates, and are the wideband ISDN (digital service integrated services) currently being studied as a next-generation communication network. Network).

広帯域ISDNでは、優先度の異なる複数の可変レートチ
ャネルを用意し、通信網（以後、網と記す）の輻輳時に
は、優先度の低いチャネルのパケットから廃棄するよう
な制御が行われる予定であり、これに対応して、上記可
変レート高優先度チャネル44は優先度の高い、即ち網の
輻輳時においてもパケットが廃棄されることのないチャ
ネルであり、可変レート低優先度チャネル45は網の輻輳
時においては、パケットが廃棄される可能性のあるチャ
ネルとしてある。In wideband ISDN, a plurality of variable rate channels with different priorities are prepared, and when congestion occurs in a communication network (hereinafter referred to as network), it is planned to perform control such that packets of low priority channels are discarded. Correspondingly, the variable rate high priority channel 44 has a high priority, that is, the packet is not discarded even when the network is congested, and the variable rate low priority channel 45 is the network congestion. At times, it is as a channel on which packets may be dropped.

局部復号はMSPのみを用いて行い、受信側での復号
は、MSPを前フレームの値に加算した後、LSPの復号信号
を加算することにより行われる。Local decoding is performed using only MSP, and decoding on the receiving side is performed by adding MSP to the value of the previous frame and then adding the decoded signal of LSP.

従って、網の輻輳時においては、LSPは確率的に廃棄
されるが、LSP自体画質への影響が小さく、また時間的
に波及しないため、画質の劣化は少なく抑えることが可
能である。Therefore, when the network is congested, the LSP is stochastically discarded, but the LSP itself has a small influence on the image quality and does not spread over time, and therefore deterioration of the image quality can be suppressed to a small extent.

また、上記MSPは逆量子化回路46で逆量子化され、そ
の逆量子化された変換係数は逆離散コサイン変換回路47
に送られ、そこで逆離散コサイン変換される。The MSP is inversely quantized by an inverse quantization circuit 46, and the inversely quantized transform coefficient is an inverse discrete cosine transform circuit 47.
To the inverse discrete cosine transform.

この逆離散コサイン変換された変換係数は加算器40b
に送られ、ここで、フレームメモリ48から出力される現
フレームの予測信号との加算が行われ、その加算結果が
フレームメモリ48に出力される。This inverse discrete cosine transformed transform coefficient is added by the adder 40b.
Is added to the prediction signal of the current frame output from the frame memory 48, and the addition result is output to the frame memory 48.

発明が解決しようとする課題しかしながら、上記従来の可変レート動画像符号化装
置では、MSPとLSPとの符号量の比、あるいはブロック内
のS/N比（信号対雑音比）を閾値として、MSPとLSPとの
分離を行っていたため、LSPが廃棄された時の画質劣化
が視覚に及ぼす影響を考慮して、MSPとLSPとの分離が行
われていないという問題があった。DISCLOSURE OF THE INVENTION Problems to be Solved by the Invention However, in the above-mentioned conventional variable rate moving image coding device, the MSP and LSP code amount ratio, or the S / N ratio (signal to noise ratio) in a block is used as a threshold value, Since the LSP and the LSP are separated from each other, there is a problem that the MSP and the LSP are not separated from each other in consideration of the influence of the image quality deterioration when the LSP is discarded on the visual sense.

本発明は、このような従来の問題を解決するものであ
り、網の輻輳時によりブロック内のある領域のパケット
が廃棄された時に生じる画質劣化量を、予め視覚の空間
周波数特性を考慮して各領域毎に設定された画質劣化量
にすることができ、よって、視覚の空間周波数特性で重
み付けしたブロック内誤差を用いて、ブロック内をＭ個
の領域に分割・階層化し、パケット化バッファによりそ
れぞれ優先度の異なる可変レートチャネルによって伝送
することができる可変レート動画像符号化装置を提供す
ることを目的とする。The present invention is to solve such a conventional problem, in consideration of the visual spatial frequency characteristics, the image quality deterioration amount that occurs when a packet in a certain area within a block is discarded due to network congestion. It is possible to set the image quality deterioration amount set for each area. Therefore, by using the intra-block error weighted by the visual spatial frequency characteristic, the inside of the block is divided into M areas and hierarchized, and the packetization buffer is used. It is an object of the present invention to provide a variable rate moving image coding apparatus that can be transmitted via variable rate channels having different priorities.

課題を解決するための手段上記目的を達成するために本発明は、フレームメモリ
に蓄えられた前フレーム再生信号に予測係数を乗じて得
られる現フレーム予測信号とフレーム単位の動画像信号
との減算結果から得られる予測誤差信号を、Ｎ×Ｎの大
きさのブロック毎に離散コサイン変換して変換係数を出
力する離散コサイン変換回路と、この離散コサイン変換回路から出力される上記変換係
数を量子化して、量子化された変換係数とこの量子化さ
れた変換係数の量子化代表値とを出力する量子化回路
と、この量子化回路から出力される上記量子化代表値を逆
量子化して変換係数を出力する逆量子化回路と、この逆量子回路から出力される上記変換係数を逆離散
コサイン変換する逆離散コサイン変換回路と、この逆離散コサイン変換回路を出力と上記現フレーム
予測信号とを加算して、この加算結果を上記フレームメ
モリに蓄積させる加算器と、上記離散コサイン変換回路で各ブロック単位に離散コ
サイン変換を施して得られるＮ×Ｎ個の変換係数を任意
のＭ個の領域に分割、階層化する際に、このＭ個の領域
の所定の一領域の情報が廃棄された場合に生じる画質劣
化量を、視覚の空間周波数特性に基づいて算出し、この
画質劣化量が予め各領域毎に設定した画質劣化量の許容
値に近似的に等しくなるように上記変換係数の分割、階
層化を行う階層化回路と、この階層化回路によってＭ個の領域に分割、階層化さ
れた変換係数の各領域のデータを、優先度の異なる可変
レートチャネルで伝送するバケット化バッファを備える
構成とした。Means for Solving the Problems In order to achieve the above object, the present invention subtracts a current frame prediction signal obtained by multiplying a previous frame reproduction signal stored in a frame memory by a prediction coefficient and a moving image signal in frame units. A predictive error signal obtained from the result is subjected to a discrete cosine transform for each block of size N × N to output a transform coefficient, and the transform coefficient output from the discrete cosine transform circuit is quantized. A quantization circuit for outputting the quantized transform coefficient and the quantized representative value of the quantized transform coefficient; and the transform coefficient obtained by dequantizing the quantized representative value output from the quantization circuit. And an inverse discrete cosine transform circuit for inverse discrete cosine transform of the transform coefficient output from the inverse quantum circuit, and an inverse discrete cosine transform circuit. Force and the current frame prediction signal are added, and the adder for accumulating the addition result in the frame memory, and N × N number of N × N obtained by performing the discrete cosine transform for each block in the discrete cosine transform circuit. Based on the spatial frequency characteristic of the visual sense, the image quality deterioration amount that occurs when information of a predetermined area of the M areas is discarded when the conversion coefficient is divided into M areas and hierarchized A layering circuit that calculates and divides and hierarchizes the conversion coefficient so that the image quality deterioration amount is approximately equal to the allowable value of the image quality deterioration amount set in advance for each area, and M The data in each region of the transform coefficient, which is divided into a plurality of regions and hierarchized, is provided with a bucketing buffer that transmits the variable rate channels with different priorities.

作用本発明は上記構成により、網の輻輳によりブロック内
のある領域のパケットが廃棄された時に生じる画質劣化
量を、予め視覚の空間周波数特性を考慮して各領域毎に
設定された画質劣化量にすることができ、よって、視覚
の空間周波数特性で重み付けしたブロック内誤差を用い
て、ブロック内をＭ個の領域に分割・階層化し、パケッ
ト化バッファによりそれぞれ優先度の異なる可変レート
チャネルによって伝送することができる。Effect of the Invention With the above-described configuration, the image quality deterioration amount that occurs when a packet in a certain area in a block is discarded due to network congestion is set in advance for each area in consideration of visual spatial frequency characteristics. Therefore, by using the intra-block error weighted by the visual spatial frequency characteristic, the block is divided into M regions and hierarchized, and the variable rate channels with different priorities are transmitted by the packetization buffer. can do.

実施例以下、本発明の実施例を図面に基づいて説明する。Embodiment An embodiment of the present invention will be described below with reference to the drawings.

第１図は本発明の一実施例による可変レート動画像符
号化装置の構成を示すブロック図である。第１図におい
て、10はフレーム単位に入力される動画像信号、10a
は、この動画像信号と、後述するフレームメモリ18に蓄
えられた前フレーム再生信号に予測係数乗算器19で予測
係数ａを乗算して得た現フレームの予測信号とが入力さ
れる減算器であり、これらフレーム単位の動画像信号10
と現フレームの予測信号との減算を行って、Ｎ×Ｎの予
測誤差信号を出力する。FIG. 1 is a block diagram showing the configuration of a variable rate moving image coding apparatus according to an embodiment of the present invention. In FIG. 1, 10 is a moving image signal input in frame units, and 10a
Is a subtracter to which this moving image signal and the prediction signal of the current frame obtained by multiplying the reproduction signal of the previous frame stored in the frame memory 18 described later by the prediction coefficient a in the prediction coefficient multiplier 19 are input. Yes, these video signals in frame units 10
And the prediction signal of the current frame are subtracted to output an N × N prediction error signal.

11は、減算器10aから入力されるＮ×Ｎの予測誤差信
号を離散コサイン変換して、Ｎ×Ｎの変換係数Ｄ（u,
v）を出力する離散コサイン変換回路、12は、離散コサ
イン変換回路11から入力される変換係数Ｄ（u,v）を量
子化して、量子化された変換係数Dq（u,v）と、この量
子化された変換係数Dq（u,v）の量子化代表値Ix（u,v）
とを出力する量子化回路である。Reference numeral 11 denotes a discrete cosine transform of the N × N prediction error signal input from the subtractor 10a to obtain an N × N transform coefficient D (u,
The discrete cosine transform circuit for outputting v), 12 quantizes the transform coefficient D (u, v) input from the discrete cosine transform circuit 11, and quantizes the transformed coefficient Dq (u, v) Quantized representative value Ix (u, v) of the quantized transform coefficient Dq (u, v)
This is a quantization circuit that outputs and.

13は、離散コサイン変換回路11からの変換係数Ｄ（u,
v）と、量子化回路12からの、量子化された変換係数Dq
（u,v）及び量子化代表値Ix（u,v）とが入力される階層
化回路である。13 is a transform coefficient D (u, from the discrete cosine transform circuit 11
v) and the quantized transform coefficient Dq from the quantization circuit 12
(U, v) and the quantized representative value Ix (u, v) are input to the hierarchical circuit.

この階層化回路13は、Ｎ×Ｎの変換係数Ｄ（u,v）の
ブロック内のある領域Lmの情報が廃棄された時に生じる
画質劣化が、視覚の空間周波数特性を考慮した許容値TH
_Lmに近似的に等しくなるように、量子化された変換係数
Dq（u,v）と量子化代表値Ix（u,v）とを用いて、上記Ｎ
×Ｎの変換係数Ｄ（u,v）をＭ個の領域（L1〜LM）に分
割、階層化するものである。In the layering circuit 13, the image quality deterioration caused when the information of a certain area Lm in the block of the N × N conversion coefficient D (u, v) is discarded is an allowable value TH in consideration of the visual spatial frequency characteristic.
Transform coefficients quantized to be approximately equal to _Lm
Using Dq (u, v) and the quantized representative value Ix (u, v), the above N
The conversion coefficient D (u, v) of × N is divided into M areas (L1 to LM) and hierarchized.

14は、階層化回路13で階層化されたデータを、各階層
に応じた可変レートチャネル15を用いて伝送するため
に、各階層毎にパケット化するパケット化バッファであ
る。Reference numeral 14 is a packetizing buffer that packetizes the data layered by the layering circuit 13 for each layer in order to transmit the data using the variable rate channel 15 corresponding to each layer.

15は、パケット化バッファ14で各階層毎にパケット化
されたデータを伝送するための、任意の伝送レートに対
応可能な可変レートチャネルであり、優先度の異なる複
数のチャネルを有している。Reference numeral 15 is a variable rate channel for transmitting data packetized for each layer in the packetization buffer 14 and capable of supporting an arbitrary transmission rate, and has a plurality of channels having different priorities.

16は、上記量子化回路12から入力される量子化代表値
Ix（u,v）を逆量子化変換して、逆量子化された変換係
数を出力する、局部復号のための逆量子化回路、17は、
局部復号のための回路であり、逆量子化回路16から入力
される逆量子化された変換係数を逆離散コサイン変換し
て、逆離散コサイン変換された変換係数を出力する逆離
散コサイン変換回路である。16 is the quantized representative value input from the quantization circuit 12
An inverse quantization circuit for local decoding, which inversely quantizes Ix (u, v) and outputs an inversely quantized transform coefficient, 17,
A circuit for local decoding, which is an inverse discrete cosine transform circuit that outputs an inverse discrete cosine transformed transform coefficient by performing an inverse discrete cosine transform on the inverse quantized transform coefficient input from the inverse quantization circuit 16. is there.

10bは、上記逆離散コサイン変換回路17から出力され
る逆離散コサイン変換された変換係数と、上記予測係数
乗算器19から出力される現フレームの予測信号とを加算
して、現フレーム再生信号としてフレームメモリ18に出
力する加算器である。10b is the inverse discrete cosine transform circuit output transform coefficient of the inverse discrete cosine transform circuit 17, and adds the current frame prediction signal output from the prediction coefficient multiplier 19 as a current frame reproduction signal. It is an adder that outputs to the frame memory 18.

次に、上記実施例の動作について説明する。第１図に
おいて、フレームメモリ18に蓄えられた前フレーム再生
信号には動き補償等が施され、さらに、予測係数乗算器
19によって予測係数ａ（０＜ａ≦１）が乗じられ、これ
により、現フレームの予測信号が得られる。Next, the operation of the above embodiment will be described. In FIG. 1, the previous frame reproduction signal stored in the frame memory 18 is subjected to motion compensation and the like, and further, a prediction coefficient multiplier
The prediction coefficient a (0 <a ≦ 1) is multiplied by 19 to obtain the prediction signal of the current frame.

この現フレームの予測信号とフレーム単位の動画像信
号10とは減算器10aで減算されて予測誤差信号となり、
以下、Ｎ×Ｎの大きさのブロック毎に処理が施される。This current frame prediction signal and the frame-by-frame moving image signal 10 are subtracted by a subtractor 10a to become a prediction error signal,
Hereinafter, processing is performed for each block of size N × N.

このＮ×Ｎの予測誤差信号は、離散コサイン変換回路
11において離散コサイン変換された後、変換係数Ｄ（u,
v）として量子化回路12に入力されると共に、階層化回
路13にも入力される。This N × N prediction error signal is a discrete cosine transform circuit.
After the discrete cosine transform in 11, the transform coefficient D (u,
It is input to the quantization circuit 12 as v) and also to the layering circuit 13.

量子化回路12は、離散コサイン変換された変換係数Ｄ
（u,v）を量子化し、これにより得られた、量子化され
た変換係数Dq（u,v）とその量子化代表値Ix（u,v）とは
階層化回路13に入力される。これと同時に、上記量子化
された変換係数Dq（u,v）の量子化代表値Ix（u,v）は、
局部復号のために逆量子化回路16にも入力される。The quantization circuit 12 uses a transform coefficient D that has been discrete cosine transformed.
(U, v) is quantized, and the quantized transform coefficient Dq (u, v) and the quantized representative value Ix (u, v) thus obtained are input to the layering circuit 13. At the same time, the quantized representative value Ix (u, v) of the quantized transform coefficient Dq (u, v) is
It is also input to the inverse quantization circuit 16 for local decoding.

逆量子化回路16による量子化代表値Ix（u,v）の逆量
子化により得られた変換係数は、逆離散コサイン変換回
路17で逆離散コサイン変換された後、予測係数乗算器19
から出力される前記現フレームの予測信号と加算器10b
で加算れ、現フレーム再生信号としてフレームメモリ18
に蓄積される。The transform coefficient obtained by the inverse quantization of the quantized representative value Ix (u, v) by the inverse quantization circuit 16 is inverse discrete cosine transformed by the inverse discrete cosine transform circuit 17, and then the prediction coefficient multiplier 19
The predicted signal of the current frame output from the adder 10b
In the frame memory 18 as the current frame playback signal.
Is accumulated in

一方、階層化回路13は、量子化回路12の出力である量
子化された変換係数Dq（u,v）及び量子化代表値Ix（u,
v）を用いて、離散コサイン変換回路11の出力であるＮ
×Ｎの変換係数Ｄ（u,v）のブロック内を、ブロック単
位でＭ個の領域に分割し、階層化する。On the other hand, the layering circuit 13 outputs the quantized transform coefficient Dq (u, v) and the quantized representative value Ix (u,
v) is used to output N of the output of the discrete cosine transform circuit 11.
The block of the conversion coefficient D (u, v) of × N is divided into M areas in block units and hierarchized.

パケット化バッファ14は、階層化回路13で階層化され
たデータを各階層に応じたチャネルを用いて伝送するた
めに、データを各階層ごとにパケット化し、パケット化
されたデータを順次可変レートチャネル15に出力する。The packetizing buffer 14 packetizes the data layered in each layer in order to transmit the layered data in the layering circuit 13 using a channel corresponding to each layer, and sequentially packetizes the packetized data into a variable rate channel. Output to 15.

次に、階層化回路13において、各ブロック内をＭ個の
領域に分割、階層化する方法について、第２図、第３図
を併用して説明する。Next, a method of dividing each block into M areas and hierarchizing them in the hierarchization circuit 13 will be described with reference to FIGS. 2 and 3.

第２図は、Ｎ×Ｎの変換係数Ｄ（u,v）をＭ個の領域
に分割する方法を示す説明図であり、第３図は、離散コ
サイン変換領域における誤差を視覚の空間周波数特性を
考慮して評価するために、誤差に乗じる重み係数をブロ
ックサイズ８×８について求める一数値例を示す説明図
である。FIG. 2 is an explanatory diagram showing a method of dividing an N × N transform coefficient D (u, v) into M regions, and FIG. 3 is a diagram showing a visual spatial frequency characteristic of an error in a discrete cosine transform region. FIG. 8 is an explanatory diagram showing a numerical example of obtaining a weighting coefficient for multiplying an error for a block size of 8 × 8 in order to evaluate in consideration of

処理単位となるブロックのサイズをＮ×Ｎとし、離散
コサイン変換回路11の出力である変換係数をＤ（u,
v）、この変換係数Ｄ（u,v）を量子化回路12で量子化し
た後の変換係数をDq（u,v）、その量子化代表値をIx
（u,v）とする。The size of the block as the processing unit is N × N, and the transform coefficient output from the discrete cosine transform circuit 11 is D (u,
v), the transform coefficient after quantizing the transform coefficient D (u, v) by the quantizing circuit 12 is Dq (u, v), and the quantized representative value is Ix.
(U, v).

さらに、離散コサイン変換領域における誤差を視覚の
空間周波数特性を考慮して評価するために、誤差に乗じ
る重みを係数をＷ（u,v）とする。ただし、ｕ＝０、１、２、…、Ｎ−１ｖ＝０、１、２、…、Ｎ−１である。Further, in order to evaluate the error in the discrete cosine transform region in consideration of the visual spatial frequency characteristic, the weight for multiplying the error is W (u, v). However, u = 0, 1, 2, ..., N-1 v = 0, 1, 2, ..., N-1.

また、簡単のため第２図に示すように、ブロック内を
３領域に分割、階層化する場合について説明する。Further, for simplification, as shown in FIG. 2, a case where a block is divided into three areas and hierarchized will be described.

まず、パケットの廃棄が発生しない場合について考え
る。パケットの廃棄が発生しない場合、ブロック内で発
生する変換係数Ｄ（u,v）の誤差は量子化誤差のみであ
る。First, consider the case where no packet discard occurs. When the packet is not discarded, the error of the transform coefficient D (u, v) generated in the block is only the quantization error.

ブロック内で発生する誤差を視覚の空間周波数特性Ｗ
（u,v）で重み付けして算出した平均２乗誤差をWMSEと
定義し、パケットの廃棄が発生しない場合の平均２乗誤
差（WMSE）をWMSE_minとすると、WMSE_minは次式によって
算出される。The error occurring in the block is determined by the visual spatial frequency characteristic W
If the mean squared error calculated by weighting with (u, v) is defined as WMSE and the mean squared error (WMSE) when packet discard does not occur is WMSE _min , WMSE _min is calculated by the following formula. It

また、WMSEを用いて算出するS/N比をWSNRと定義し、
パケットの廃棄が発生しない場合のS/N比（WSNR）をWSN
R_maxとすると、WSNR_maxは次式に従って算出される。 In addition, the S / N ratio calculated using WMSE is defined as WSNR,
WSN is the signal-to-noise ratio (WSNR) when no packet drops occur
Assuming R _max , WSNR _max is calculated according to the following equation.

次に、第２図に示す領域L3のパケットP₁（α，β）が
廃棄された場合について考える。 Next, consider a case where the packet P ₁ (α, β) in the area L3 shown in FIG. 2 is discarded.

領域L3のパケットP₁（α，β）が廃棄された場合、ブ
ロック内で発生する変換係数Ｄ（u,v）の誤差は、廃棄
された情報による誤差と廃棄されなかった領域L1、L2の
量子化誤差の和となる。When the packet P ₁ (α, β) in the area L3 is discarded, the error of the transform coefficient D (u, v) generated in the block is the error due to the discarded information and the error in the areas L1 and L2 that are not discarded. It is the sum of the quantization errors.

従って、この場合のWMSEをWMSE_L31oSとすると、WMSE
_L31oSは次式に従って算出される。Therefore, if the WMSE in this case is WMSE _L31oS , WMSE
_L31oS is calculated according to the following formula.

となる。ただし、上記（３）式のΣ算出の際における
ｕ、ｖの走査は例えば第２図のようなジグザグ走査によ
って行う。 Becomes However, the scanning of u and v in the calculation of Σ in the equation (3) is performed by zigzag scanning as shown in FIG. 2, for example.

ここで、とおき、領域L3のパケットP₁（α，β）が廃棄された場合のWS
NRをWSNR_L31oSとすると、上記（２）式及び（４）式よ
り、となる。here, WS when the packet P ₁ (α, β) in the region L3 is discarded
_Assuming that NR is WSNR _L31oS , from the above equations (2) and (4), Becomes

上記（５）式の右辺第２項は、領域L₃のパケットがP₁
（α，β）廃棄されることによって生じる画質劣化量を
表しており、これをΔD_L3とおくと、 γ＝EXP10（ΔD_L3/10） …（６）となる。ただし、EXP10（Ｘ）＝10^Xとする。In the second term on the right side of the above equation (5), the packet in the region L ₃ is P ₁
(Α, β) represents the amount of image quality deterioration caused by discarding. When this is set to ΔD _L3 , γ = EXP10 (ΔD _L3 / 10) (6) However, EXP10 (X) = 10 ^X.

一方、上記（３）式は次式のように書き換えることが
できる。On the other hand, the above equation (3) can be rewritten as the following equation.

さらに、（４）、（６）、及び（７）式より、を得る。 Furthermore, from equations (4), (6), and (7), Get.

従って、領域L3のパケットP₁（α，β）が廃棄された
ときの画質劣化量が、予め設定した視覚の空間周波数特
性を考慮したある許容値TH_L3となるためには、（８）式
において、ΔD_L3＝TH_L3とおき、これを満たす（α、
β）をもって領域L2と領域L3との境界点とすればよい。Therefore, in order that the image quality deterioration amount when the packet P ₁ (α, β) of the region L3 is discarded becomes a certain allowable value TH _{L3 in} consideration of the preset visual spatial frequency characteristic, equation (8) is used. , ΔD _L3 = TH _L3 , and this is satisfied (α,
β) may be defined as a boundary point between the regions L2 and L3.

ただし、一般的には、（８）式において、左辺と右辺
とが完全に一致するような（α、β）は得られないの
で、左辺の値と右辺との値が最も近くなるような（α、
β）を境界点とする。However, in general, in the expression (8), it is not possible to obtain (α, β) such that the left side and the right side completely match, so that the value on the left side and the value on the right side are the closest ( α,
Let β) be the boundary point.

領域L2の分割についても同様に考えると、領域L1と領
域L2との境界点は、領域L2のパケットをＰ（ζ、η）と
すると、次式を満たす点（ζ、η）となる。Considering the division of the region L2 in the same manner, the boundary point between the regions L1 and L2 is a point (ζ, η) that satisfies the following equation, where P (ζ, η) is the packet of the region L2.

このようにして分割された領域L1、L2、L3は、領域L1
が予測誤差信号の低周波成分、領域L3が高周波成分、領
域L2が、領域L1と領域L2との中間の周波数成分を含んで
いる。そこで、各領域L1、L2、L3の順で高い優先度の変
化レートチャネルで伝送する。 Regions L1, L2, and L3 divided in this way are divided into regions L1
Indicates a low frequency component of the prediction error signal, the region L3 includes a high frequency component, and the region L2 includes an intermediate frequency component between the regions L1 and L2. Therefore, the regions L1, L2, and L3 are transmitted in the order of high priority change rate channels.

また、視覚の空間周波数特性を考慮した許容値TH_L2、
TH_L3を変化させると、各領域L1、L2、L3で発生する情報
量が変化する。Also, the allowable value TH _L2 considering the spatial frequency characteristics of vision,
When TH _L3 is changed, the amount of information generated in each area L1, L2, L3 changes.

例えば、視覚の周波数特性の許容値TH_L2を小さくする
と、（９）式から決定される領域L2は小さくなり、従っ
て、領域L2で発生する情報量は減少し、領域L1で発生す
る情報量は増加する。For example, if the permissible value TH _L2 of the visual frequency characteristic is reduced, the area L2 determined by the equation (9) becomes smaller, so that the amount of information generated in the region L2 decreases and the amount of information generated in the region L1 decreases. To increase.

この結果、パケット化バッファ14から階層化回路13へ
フィードバック制御を行い、視覚の空間周波数特性の許
容値TH_L2、TH_L3を適当に変化させることによって、各可
変レートチャネルの発生情報量を制御することができ
る。As a result, feedback control is performed from the packetization buffer 14 to the layering circuit 13, and the allowable values TH _L2 and TH _L3 of the visual spatial frequency characteristics are appropriately changed to control the amount of information generated in each variable rate channel. be able to.

よって上記実施例によれば、網の輻輳時には、優先順
位の低い領域L3のパケットから順に廃棄されるが、廃棄
されたときに生じる画質劣化量を、予め設定した視覚の
空間周波数特性を考慮した許容値TH_Lmにすることが可能
である。Therefore, according to the above-mentioned embodiment, when the network is congested, the packets in the region L3 having a lower priority are discarded in order, but the image quality deterioration amount caused when the packets are discarded is considered in consideration of the preset visual spatial frequency characteristic. It is possible to set the allowable value TH _Lm .

また、視覚の空間周波数特性を考慮した許容値TH_L2、
TH_L3を適当に変化させることによって、各可変レートチ
ャネルにおける発生情報量を制御することができる。Also, the allowable value TH _L2 considering the spatial frequency characteristics of vision,
By appropriately changing TH _L3 , the amount of information generated in each variable rate channel can be controlled.

また、ある領域のパケットが廃棄されることによっ
て、送受の間で画像データに相違が生じるが、この影響
は予測係数ａによって時間と共に減少する。Further, the packet data in a certain area is discarded, which causes a difference in image data between transmission and reception, but this influence decreases with time due to the prediction coefficient a.

尚、上記実施例ではブロック内を３つの領域に分割・
階層化する場合について説明したが、同様に考えれば、
これをＭ領域に分割・階層化するのは容易である。In the above embodiment, the block is divided into three areas.
I explained the case of layering, but if you think in the same way,
It is easy to divide this into M areas and divide them into layers.

また、上記実施例においては各式のΣ算出の際のｕ、
ｖの走査順序として第２図に示すようなジグザグ走査を
用いたが、本発明はこの走査順序に制限されるものでは
ない。Further, in the above embodiment, u when calculating Σ of each equation,
Although the zigzag scanning as shown in FIG. 2 is used as the v scanning order, the present invention is not limited to this scanning order.

さらに、離散コサイン変換領域における誤差を視覚の
空間周波数特性を考慮して評価するために、誤差に乗じ
る重み係数の例として、本実施例ではブロックサイズが
８×８の場合について第３図に示すような数値を挙げた
が、本発明はこの数値に制定されるものではない。Further, in order to evaluate the error in the discrete cosine transform domain in consideration of the visual spatial frequency characteristic, FIG. 3 shows an example of the weighting coefficient by which the error is multiplied, in the case of the block size 8 × 8 in the present embodiment. Although such a numerical value is mentioned, the present invention is not established to this numerical value.

発明の効果上述のように本発明によれば、フレームメモリに蓄え
られた前フレーム再生信号に予測係数を乗じて得られる
現フレーム予測信号とフレーム単位の動画像信号との減
算結果から得られる予測誤差信号を、Ｎ×Ｎの大きさの
ブロック毎に離散コサイン変換して変換係数を出力する
離散コサイン変換回路と、この離散コサイン変換回路から出力される上記変換係
数を量子化して、量子化された変換係数とこの量子化さ
れた変換係数の量子化代表値とを出力する量子化回路
と、この量子化回路から出力される上記量子化代表値を逆
量子化して変換係数を出力する逆量子化回路と、この逆量子化回路から出力される上記変換係数を逆離
散コサイン変換する逆離散コサイン変換回路と、この逆離散コサイン変換回路の出力と上記現フレーム
予想信号とを加算して、この加算結果を上記フレームメ
モリに蓄積させる加算器と、上記離散コサイン変換回路で各ブロック単位に離散コ
サイン変換を施して得られるＮ×Ｎ個の変換係数を任意
のＭ個の領域に分割、階層化する際に、このＭ個の領域
の所定の一領域の情報が廃棄された場合に生じる画質劣
化量を、視覚の空間周波数特性に基づいて算出し、こと
画質劣化量が予め各領域毎に設定した画質劣化量の許容
値に近似的に等しくなるように上記変換係数の分割、階
層化を行う階層化回路と、この階層化回路によってＭ個の領域に分割、階層化さ
れた変換係数の各領域のデータを、優先度の異なる可変
レートチャネルで伝送するバケット化バッファを備える
構成とした。As described above, according to the present invention, the prediction obtained from the subtraction result of the current frame prediction signal obtained by multiplying the previous frame reproduction signal stored in the frame memory by the prediction coefficient and the moving image signal in frame units A discrete cosine transform circuit that outputs a transform coefficient by performing a discrete cosine transform for each block of size N × N, and the transform coefficient output from the discrete cosine transform circuit is quantized and quantized. And a quantization circuit that outputs the quantized representative value of the quantized transform coefficient, and an inverse quantum that inversely quantizes the quantized representative value output from the quantization circuit and outputs a transform coefficient. Circuit, an inverse discrete cosine transform circuit for performing an inverse discrete cosine transform of the transform coefficient output from the inverse quantizer circuit, an output of the inverse discrete cosine transform circuit and the current frame An adder for adding the expected signal and accumulating the addition result in the frame memory, and N × N transform coefficients obtained by performing the discrete cosine transform for each block in the discrete cosine transform circuit When dividing or hierarchically dividing into M areas, the image quality deterioration amount that occurs when information of a predetermined area of the M areas is discarded is calculated based on the visual spatial frequency characteristic. A hierarchical circuit that divides and hierarchizes the conversion coefficients so that the deterioration amount is approximately equal to the allowable value of the image quality deterioration amount set in advance for each area, and the hierarchy circuit divides the area into M areas. The bucketed buffer for transmitting the data in each region of the layered transform coefficients through the variable rate channels having different priorities is adopted.

このため、網の輻輳によりブロック内のある領域のパ
ケットが廃棄された時に生じる画質劣化量を、予め視覚
の空間周波数特性を考慮して各領域毎に設定された画質
劣化量にすることができ、よって、視覚の空間周波数特
性で重み付けしたブロック内誤差を用いて、ブロック内
をＭ個の領域に分割・階層化し、パケット化バッファに
よりそれぞれ優先度の異なる可変レートチャネルによっ
て伝送することができる。Therefore, the image quality deterioration amount that occurs when a packet in a certain area in a block is discarded due to network congestion can be set to the image quality deterioration amount that is set in advance for each area in consideration of visual spatial frequency characteristics. Therefore, by using the intra-block error weighted by the visual spatial frequency characteristic, the inside of the block is divided into M areas and hierarchized, and the packetized buffer can transmit the variable rate channels having different priorities.

[Brief description of drawings]

第１図は、本発明の一実施例における可変レート動画像
符号化装置の概略構成を示すブロック図、第２図は、第
１図の階層化回路における、Ｎ×Ｎ個の変換係数をＭ個
の領域に分割する方法を示す説明図、第３図は、第１図
の可変レート動画像符号化装置の離散コサイン変換領域
において、平均２乗誤差算出の際に量子化誤差を重み付
けるための重み係数を示す説明図、第４図は従来の可変
レート動画像符号化装置の概略ブロック図である。 10……動画像信号、10b……加算器、11……離散コサイ
ン変換回路、12……量子化回路、13……階層化回路、14
……パケット化バッファ、15……可変レートチャネル、
16……逆量子化回路、18……フレームメモリ、ａ……予
測係数、L1、L2、L3……領域。FIG. 1 is a block diagram showing a schematic configuration of a variable rate moving picture coding apparatus according to an embodiment of the present invention, and FIG. 2 is a block diagram showing N × N transform coefficients in the hierarchical circuit of FIG. FIG. 3 is an explanatory diagram showing a method of dividing the region into individual regions, and FIG. 3 is for weighting the quantization error when calculating the mean square error in the discrete cosine transform region of the variable rate moving image coding apparatus of FIG. 4 is a schematic block diagram of a conventional variable rate moving image coding apparatus. 10 ... Moving image signal, 10b ... Adder, 11 ... Discrete cosine transform circuit, 12 ... Quantization circuit, 13 ... Hierarchical circuit, 14
...... Packetizing buffer, 15 …… Variable rate channel,
16 ... Inverse quantization circuit, 18 ... Frame memory, a ... Prediction coefficient, L1, L2, L3.

フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｎ 7/32 9466−5ＫＨ０４Ｌ 11/20 １０２ＥＧ０６Ｆ 15/66 ３３０Ｄ (56)参考文献特開平２−15791（ＪＰ，Ａ) 特開昭63−73789（ＪＰ，Ａ) ＩＣＣ 88，ＩｎｔｌＳｙｍｐｏｎＣｏｍｍｕｎｉｃａｔｉｏｎ（1988) Ｐ．1257−1261 ＳＰＩＥ，ＶｉｓｕａｌＣｏｍｍｕｎｉｃａｔｉｏｎａｎｄＩｍａｇｅＰｒｏｃｅｓｓｉｎｇ 88（1988）Ｐ．991 −998Continuation of the front page (51) Int.Cl. ⁶ Identification number Office reference number FI Technical indication location H04N 7/32 9466-5K H04L 11/20 102 E G06F 15/66 330 D (56) References -15791 (JP, A) JP 63-73789 (JP, A) ICC 88, Intl Symp on Communication (1988) P. 1257-1261 SPIE, Visual Communication and Image Processing 88 (1988) P.M. 991 −998

Claims

[Claims]

1. A N × N prediction error signal obtained from a subtraction result of a current frame prediction signal obtained by multiplying a previous frame reproduction signal stored in a frame memory by a prediction coefficient and a moving image signal in frame units. A discrete cosine transform circuit that outputs a transform coefficient by performing a discrete cosine transform for each block of size, the above transform coefficient output from this discrete cosine transform circuit is quantized, and the quantized transform coefficient and this quantized transform coefficient are quantized. A quantization circuit that outputs the quantized representative value of the transformed coefficient, an inverse quantization circuit that inversely quantizes the quantized representative value output from the quantization circuit, and outputs the transformed coefficient, and an inverse quantization circuit The inverse discrete cosine transform circuit for performing the inverse discrete cosine transform of the transform coefficient output from the digitizing circuit, the output of the inverse discrete cosine transform circuit and the current frame prediction signal are added, and the addition is performed. The adder for accumulating the result in the frame memory and the N × N transform coefficients obtained by performing the discrete cosine transform for each block in the discrete cosine transform circuit are divided into arbitrary M regions and hierarchized. At this time, the image deterioration amount that occurs when information of a predetermined one of the M regions is discarded is calculated based on the visual spatial frequency characteristic, and this image deterioration amount is set in advance for each region. A layering circuit that divides and hierarchizes the conversion coefficient so that it is approximately equal to the allowable value of the image quality deterioration amount, and each region of the conversion coefficient that is divided into M areas by this layering circuit. Variable-rate video encoding device having a packetization buffer for transmitting the data of 1. by a variable-rate channel with different priority.