JPH02198273A

JPH02198273A - Picture encoding system

Info

Publication number: JPH02198273A
Application number: JP1017305A
Authority: JP
Inventors: Yoshihiko Tokunaga; 吉彦徳永; Satoshi Furukawa; 聡古川; Tei Satake; 禎佐竹
Original assignee: Matsushita Electric Works Ltd
Current assignee: Panasonic Electric Works Co Ltd
Priority date: 1989-01-26
Filing date: 1989-01-26
Publication date: 1990-08-06
Anticipated expiration: 2011-09-11
Also published as: JP2533631B2

Abstract

PURPOSE:To prevent the degradation of picture quality by changing a method for truncation to limit an output bit length after orthogonal transformation based on the statistical characteristic quantity of the signal power, etc., of a signal sequence to be inputted. CONSTITUTION:For an original picture signal to be inputted, either the signal power by one screen or a dynamic range to approximately express the signal power is obtained, and corresponding control information is generated by a signal level detecting means 1. In a discrete cosine transforming means 11a, after discrete cosine transformation is executed, a transformation coefficient sequence, for which a bit length is limited by rounding a low-order bit and cutting a high-order bit, is outputted. Here, the bit length, for which the truncation is executed, is set based on the control information obtained by the means 1. The transformation coefficient sequence is quantized by a quantizing means 12, a quantized index is given, the quantized index is encoded in an encoding means 13, the control information is also encoded as additional information, and a code word is obtained.

Description

[Detailed description of the invention] [Industrial application field]

本発明は、画像信号に直交変換を施した後、符号化する
画像符号化方式に関するものである。The present invention relates to an image encoding method that performs orthogonal transformation on an image signal and then encodes the signal.

[Conventional technology]

従来より、直交変換を利用して画像信号の冗長性を低減
することにより画像を高能率に符号化する方式として離
散コサイン変換符号化方式、外挿予測−離散サイン変換
符号化方式等が知られている。これらの符号化方式では
、入力画像信号を所定の大きさのブロックに分割し、各
ブロックごとに入力画像信号列を与える。離散コサイン変換符号化方式では、第８図に示すように
、入力画像信号列（χ１）に対して、離散コサイン変換
手段１１ａにより離散コサイン変換を施し、得られた変
換係数列（Ｘｌ）を量子化手段１２により量子化して量
子化インデクスＱを付与し、この量子化インデクスＱを
符号化手段１３により符号化するのである。ここに、上
記ブロックはＮＸＮ画素を含む正方形のブロックとする
のが普通であり、行列表現するのが一般的であるが（図
では添字をｉｊとしている）、説明の簡略化のために列
ベクトルで表現しである。符号化された信号から画像を
再生するには、復号化手段２１により量子化インデクス
Ｑを再生し、逆量子化手段２４により変換係数列（文、
）を再生し、逆離散コサイン変換手段２５ａにより逆離
散コサイン変換を施すことにより再生画像信号列（ｉｔ
）を得るのである。一方、外挿予測−離散サイン変換符号化方式では、第９
図に示すように、符号化後すでに再生された再生画像信
号列（、ｉ胎１に基づいて予測手段１７により次の入力
画像信号列を予測して予測画像信号列（足、）を求める
。入力画像信号列（χ工）は、予測画像信号列（足よ）
との差をとり、この差である予測誤差信号列（ｙ　＋ｌ
に対して離散サイン変換手段１１ｂにより離散サイン変
換を施し、得られた変換係数列（ｙ、ｌに対して量子化
手段１２により量子化インデクスＱを付与し、この量子
化インデクスＱを符号化手段１３により符号化するので
ある。予測手段１７に入力される再生画像信号列（ｋｌ
）は、量子化インデクスＱを逆量子化手段１４により逆
量子化して再生した変換係数列（掌、）に、逆離散サイ
ン変換手段１５によって逆離散サイン変換を施して予測
誤差信号列（９，）を再生し、予測手段１７によって予
測されている予測画像信号列（足□）と加算することに
より得られるのである。再生画像信号列（ｋ、）は、以
後の予測に備えて遅延メモリ１６に記憶され、遅延メモ
リ１６に記憶されている再生画像信号列（牙、）が予測
手段１７での予測に利用される。符号化された信号から
画像を再生するには、第１０図に示すように、復号化手
段２１によって量子化インデクスＱを再生し、この量子
化インデクスＱから逆量子化手段５によって変換係数列
（室、）を再生し、変換係数列（♀１）に対して逆離散
サイン変換手段２４により逆離散サイン変換を施して予
測誤差信号列（夛、）を再生し、さらに、予測手段２７
によって予測されている予測画像信号列（足、）と加算
して再生画像信号列（ｋ、）を得る。再生画像信号列（
ｋ、）は、以後の予測に備えて遅延メモリ２６に記憶さ
れ、遅延メモリ２６に記憶されている再生画像信号列（
ｋｌ）が予測手段２７での予測に利用されるのである。ここに、画像の再生は、復号化手段２１を除けば、符号
化の際に予測画像信号列（足、）を得た構成と同じであ
る。Conventionally, discrete cosine transform coding, extrapolation prediction-discrete sine transform coding, etc. have been known as methods for highly efficient image coding by reducing the redundancy of image signals using orthogonal transform. ing. In these encoding methods, an input image signal is divided into blocks of a predetermined size, and an input image signal sequence is provided for each block. In the discrete cosine transform encoding method, as shown in FIG. The quantization means 12 quantizes and gives a quantization index Q, and the encoding means 13 encodes this quantization index Q. Here, the above block is normally a square block containing NXN pixels, and is generally expressed as a matrix (in the figure, the subscripts are ij), but for the sake of simplifying the explanation, it is expressed as a column vector. It is expressed as. To reproduce an image from a coded signal, the decoding means 21 reproduces the quantization index Q, and the dequantization means 24 reproduces the transformation coefficient sequence (text,
) is reproduced and subjected to inverse discrete cosine transformation by the inverse discrete cosine transform means 25a to obtain a reproduced image signal sequence (it
) is obtained. On the other hand, in the extrapolation prediction-discrete sine transform coding method, the ninth
As shown in the figure, the prediction means 17 predicts the next input image signal sequence based on the reproduced image signal sequence (1) that has already been reproduced after encoding to obtain a predicted image signal sequence (1). The input image signal sequence (χ) is the predicted image signal sequence (foot)
This difference is the prediction error signal sequence (y + l
The discrete sine transformation means 11b performs a discrete sine transformation on the resulting transform coefficient sequence (y, l, and the quantization means 12 gives a quantization index Q. This quantization index Q is then applied to the encoding means. 13. The reproduced image signal sequence (kl
) is a prediction error signal sequence (9, ) and add it to the predicted image signal sequence (leg □) predicted by the prediction means 17. The reproduced image signal sequence (k,) is stored in the delay memory 16 in preparation for future prediction, and the reproduced image signal sequence (k,) stored in the delay memory 16 is used for prediction by the prediction means 17. . To reproduce an image from a coded signal, as shown in FIG. 10, a quantization index Q is reproduced by a decoding means 21, and a transform coefficient sequence ( ), performs inverse discrete sine transformation on the transform coefficient sequence (♀1) by the inverse discrete sine transformation means 24 to reproduce the prediction error signal sequence (夛, ), and furthermore, the prediction means 27
A reproduced image signal sequence (k,) is obtained by adding the predicted image signal sequence (k, ) predicted by . Playback image signal sequence (
k,) is stored in the delay memory 26 in preparation for future prediction, and the reproduced image signal sequence (k,) stored in the delay memory 26 is
kl) is used for prediction by the prediction means 27. Here, except for the decoding means 21, the reproduction of the image is the same as the configuration in which the predicted image signal sequence (leg) was obtained during encoding.

[Problem to be solved by the invention]

以上のように、いずれの符号化方式においても入力画像
信号列の持つ冗長性を低減するために、入力画像信号列
に対して、次式で表わされる直交変換を施している。［Ｘ　、］＝［ａ　、、］［χ□］ここに、Ｘｌは変換係数列の成分、χ、は入力画像信号
列もしくは予測誤差信号列の成分、ａ、Ｊは直交変換行
列の成分である。このような直交変換を施すと、変換前
の行列に比較して変換後の行列では、成分の最大値と最
小値との差が大きくなる。すなわち、直交変換によりダイナミックレンジが大きく
なるから、入力ビット長よりも出力ビット長の方が大き
くなる。また、上述の説明では入力画像信号列を１次元
として説明したが、画像は２次元の相関を有しているか
ら、実際には入力画像信号列は第１１図のような２次元
のブロックになっており、入力画像信号列に対する直交
変換は２次元直交変換となる。２次元直交変換における
演算では、列方向（行列の横方向）の１次元直交変換を
列数の回数行い、次に行方向（行列の縦方向）の１次元
直交変換を行数の回数行う。このように２回の直交変換
を行うから、最終的な出力ビット長は、入力ビット長に
比較して相当に大きくなる。そこで、処理するデータのビット長を小さくしてハード
ウェアを簡略にするには、データのビット長を制限する
必要がある。ビット長を制限するには、第１２図に示す
ように、下位ビットを丸めたり、上位ビットをカットす
ることが考えられる。すなわち、第１２図では、入力画像信号列のダイナミッ
クレンジが１０ビツト長であって、直交変換を施すこと
によりダイナミックレンジが１５ビツト長になるところ
を１０ビツト長に制限する例を示している。第１２図（
ａ）では、下位５ビツトを丸めて出力を１０ビツト長に
制限しているのであり、第１２図（ｂ）では、下位４ビ
ツトを丸めるとともに、上位１ビツトをカットすること
により出力を１０ビツト長に制限しているのである。こ
のように、下位ビットを丸めたり、上位ビットをカット
する処理を行うと、トランケーションエラーが生じる。すなわち、第１２図（ａ）のように処理すると、下位ビ
ットを丸めているから、丸めの誤差が大きいという問題
がある。一方、第１２図（ｂ）のように処理すると、下
位ビットについては第１２図（ｓｔ＞の場合よりも丸め
の誤差は小さくなり、また、上記ビットも１ビツトのみ
カットしているから、入力画像信号列の信号電力が比較
的小さいときには誤差の発生が小さいものである。しか
しながら、入力画像信号列の信号電力が比教的大きいと
、上位ビットの切り取りによる誤差が多く発生し再生画
像の画質が大きく劣化するという問題が生じる。入力画像信号列は、一般に多種多様であるから、第１２
図（ａ）と第１２図（ｂ）とのいずれか一方の処理によ
って出力ビット長を制限すると、入力画像信号列の持つ
多様性に対応できないという問題が生じる。本発明は上記問題点の解決を目的とするものであり、直
交変換後の出力を所定のビット長に制限しながらも、ト
ランケーションエラーによる画質の劣化を防止できるよ
うにした画像符号化方式を提供しようとするものである
。As described above, in any encoding method, in order to reduce the redundancy of the input image signal sequence, the input image signal sequence is subjected to orthogonal transformation expressed by the following equation. [X,]=[a,,][χ□] Here, Xl is a component of the transformation coefficient sequence, χ is a component of the input image signal sequence or prediction error signal sequence, and a and J are the components of the orthogonal transformation matrix. be. When such orthogonal transformation is performed, the difference between the maximum and minimum values of the components becomes larger in the matrix after transformation than in the matrix before transformation. That is, since the dynamic range increases due to orthogonal transformation, the output bit length becomes larger than the input bit length. Also, in the above explanation, the input image signal sequence was explained as being one-dimensional, but since images have two-dimensional correlation, the input image signal sequence is actually divided into two-dimensional blocks as shown in Figure 11. The orthogonal transformation for the input image signal sequence is a two-dimensional orthogonal transformation. In calculations in two-dimensional orthogonal transformation, one-dimensional orthogonal transformation in the column direction (horizontal direction of the matrix) is performed a number of times equal to the number of columns, and then one-dimensional orthogonal transformation in the row direction (vertical direction of the matrix) is performed the number of times equal to the number of rows. Since the orthogonal transformation is performed twice in this way, the final output bit length becomes considerably larger than the input bit length. Therefore, in order to simplify the hardware by reducing the bit length of data to be processed, it is necessary to limit the bit length of data. To limit the bit length, as shown in FIG. 12, it is possible to round off the lower bits or cut the upper bits. That is, FIG. 12 shows an example in which the dynamic range of the input image signal sequence is 10 bits long, and the dynamic range becomes 15 bits long by performing orthogonal transformation, but is limited to 10 bits long. Figure 12 (
In a), the output is limited to 10 bits by rounding the lower 5 bits, and in Fig. 12(b), the output is limited to 10 bits by rounding the lower 4 bits and cutting the upper 1 bit. It is limited in length. In this way, when the lower bits are rounded or the upper bits are cut, a truncation error occurs. That is, when processing as shown in FIG. 12(a), there is a problem in that the lower bits are rounded, resulting in a large rounding error. On the other hand, when processing as shown in Fig. 12(b), the rounding error for the lower bits becomes smaller than in the case of Fig. 12 (st>), and since only one bit is cut from the above bits, the input When the signal power of the image signal sequence is relatively small, the error is small. However, when the signal power of the input image signal sequence is relatively large, many errors occur due to the cutting of the upper bits, which affects the image quality of the reproduced image. A problem arises in that the input image signal sequence is generally diverse, so the 12th
If the output bit length is limited by either the processing shown in FIG. 12(a) or FIG. 12(b), a problem arises in that it cannot cope with the diversity of input image signal sequences. The present invention aims to solve the above problems, and provides an image encoding method that prevents deterioration of image quality due to truncation errors while limiting the output after orthogonal transformation to a predetermined bit length. This is what I am trying to do.

[Means to solve the problem]

本発明では、上記目的を達成するために、入力される信
号列の信号電力のような特性量に応じて直交変換後の出
力ビット長を制限する方法を切り換えるのである。また、原画像を複数の領域に分割し、各領域単位で信号
電力のような特性量に基づいて直交変換後の出力ビット
長を制限する方法を切り換えるようにしてもよい。In order to achieve the above object, the present invention switches the method of limiting the output bit length after orthogonal transformation depending on the characteristic quantity such as the signal power of the input signal sequence. Alternatively, the original image may be divided into a plurality of regions, and the method for limiting the output bit length after orthogonal transformation may be switched for each region based on a characteristic quantity such as signal power.

[Effect]

上記構成によれば、入力される信号列の信号電力のよう
な特性量に基づいて直交変換後の出力ビット長を制限す
る方法（以後、トランケーションの方法と称する）を変
えるから、直交変換の後に出力ビット長を制限する際に
入力の信号列に対応して最適なトランケーションの方法
が選択できるのであり、トランケーションエラーがほと
んど生じないようにすることができ、トランケーション
エラーによる画質の劣化を抑制することができるのであ
る。また、原画像を複数の領域に分割して、各領域単位でト
ランケーションの方法を切り換えるようにすれば、画像
の局所的な特性量の変化に対応した符号化が行えること
になり、さらに画質の劣化を抑制できるのである。According to the above configuration, the method for limiting the output bit length after orthogonal transformation (hereinafter referred to as the truncation method) is changed based on the characteristic quantity such as the signal power of the input signal sequence, so after orthogonal transformation, When limiting the output bit length, the optimal truncation method can be selected according to the input signal sequence, making it possible to almost eliminate truncation errors and suppress deterioration of image quality due to truncation errors. This is possible. Furthermore, by dividing the original image into multiple regions and switching the truncation method for each region, encoding can be performed in response to local changes in image characteristics, further improving image quality. Deterioration can be suppressed.

【実施例１】第１図は離散コサイン変換符号化方式に、本発明を適用
した実施例を示している。入力される原画像信号は、信号レベル検出手段１によっ
て、１画面分の信号電力（画素の輝度）の合計値、ある
いは信号電力の最大値と最小値との差（ダイナミックレ
ンジ）が求められ、対応した制御情報が生成される。次に、１画面を所定の大きさを有する複数のブロックに
分割し、各ブロックを入力画像信号列として従来と同様
の離散コサイン変換手段１１ａに入力する。離散コサイ
ン変換手段１１ａでは、離散コサイン変換を行った後、
下位ビットを丸めたり、上位ビットをカットしたりして
、ビット長を制限した変換係数列を出力する。ここに、
トランケーションを行うビット長は、信号レベル検出手
段１より得られた制御情報に基づいて設定される。出力された変換係数列は量子化手段１２により量子化さ
れて量子化インデクスが付与され、量子化インデクスは
符号化手段１３により符号化される。ここに、離散コサイン変換手段１１ａにより変換係数列
のビット長が入力画像信号列の信号レベルにより変化す
るから、変換係数列のスケールも変化する。そこで、信
号レベル検出手段１で生成された制御情報に応じて量子
化ステップ幅のスケールも調節する。符号化手段３では
量子化インデクスとともに、制御情報も付加情報として
符号化され、符号語が得られる。上記符号語から画像信号を再生するには、復号化手段２
１により符号語から量子化インデクスと制御情報とを再
生し、逆量子化手段２４では量子化ステップ幅を上記制
御情報に基づいてスケールを設定した後、逆量子化を行
い変換係数列を再生する。次いで逆離散コサイン変換手
段２５ａでは制御情報に応じてトランケーションされた
ビット数を元に戻して逆離散コサイン変換を行ない再生
画像信号を得るのである。上記実施例では、原画像信号の信号電力の測定を画面単
位で行っているが、原画像信号を複数のブロックを含む
ような複数の領域に分割して各領域毎に信号電力を測定
して、この領域単位で原画像信号の信号レベルを調節し
たり、１領域を１ブロツクで構成し、ブロック単位で信
号電力を測定して、ブロック単位で原画像信号の信号レ
ベルを調節するようにしてもよい。ブロック単位でレベ
ルを調節する場合には、信号レベル検出手段１に入力さ
れる信号がブロックに分割されているから、信号レベル
を調節した後にブロックに分割する必要はない。また、
領域単位、あるいはブロック単位で信号レベルを調節す
れば、画像の局所的な信号電力の変化に適応した符号化
を行なうことができる。しかも、信号レベル検出手段１
と信号レベル調節手段２との処理を並行させることがで
きるから、パイプライン化され、処理時間の短縮につな
がるのである。ただし、制御情報も各領域毎あるいは各
ブロック毎に付加情報として符号化する必要があるので
、制御情報をランレングス符号化するなどして付加情報
を圧縮するのが望ましい。Embodiment 1 FIG. 1 shows an embodiment in which the present invention is applied to a discrete cosine transform encoding method. For the input original image signal, the signal level detection means 1 determines the total value of signal power (pixel brightness) for one screen or the difference between the maximum value and minimum value of signal power (dynamic range), Corresponding control information is generated. Next, one screen is divided into a plurality of blocks having a predetermined size, and each block is input as an input image signal sequence to the conventional discrete cosine transform means 11a. In the discrete cosine transform means 11a, after performing the discrete cosine transform,
Outputs a transform coefficient sequence with a limited bit length by rounding the lower bits or cutting the upper bits. Here,
The bit length for truncation is set based on control information obtained from the signal level detection means 1. The outputted transform coefficient sequence is quantized by the quantization means 12 and given a quantization index, and the quantization index is encoded by the encoding means 13. Here, since the bit length of the transform coefficient sequence by the discrete cosine transform means 11a changes depending on the signal level of the input image signal sequence, the scale of the transform coefficient sequence also changes. Therefore, the scale of the quantization step width is also adjusted according to the control information generated by the signal level detection means 1. In the encoding means 3, the control information is also encoded as additional information along with the quantization index, and a code word is obtained. In order to reproduce an image signal from the code word, the decoding means 2
1, the quantization index and control information are reproduced from the code word, and the dequantization means 24 sets the scale of the quantization step width based on the control information, and then performs dequantization and reproduces the transform coefficient sequence. . Next, the inverse discrete cosine transform means 25a restores the truncated bit number according to the control information and performs inverse discrete cosine transform to obtain a reproduced image signal. In the above embodiment, the signal power of the original image signal is measured for each screen, but the original image signal is divided into multiple areas including multiple blocks and the signal power is measured for each area. , the signal level of the original image signal can be adjusted in units of areas, or one area can be configured with one block, the signal power can be measured in units of blocks, and the signal level of the original image signals can be adjusted in units of blocks. Good too. When adjusting the level in units of blocks, since the signal input to the signal level detecting means 1 is divided into blocks, there is no need to divide the signal into blocks after adjusting the signal level. Also,
By adjusting the signal level on a region-by-area or block-by-block basis, it is possible to perform encoding that adapts to local changes in signal power of the image. Moreover, the signal level detection means 1
Since the processing by the signal level adjusting means 2 and the signal level adjusting means 2 can be performed in parallel, pipeline processing is possible, which leads to a reduction in processing time. However, since the control information also needs to be encoded as additional information for each region or each block, it is desirable to compress the additional information by run-length encoding the control information.

【実施例２】本実施例では、第２図に示すように、外挿予測離散サイ
ン変換符号化方式に本発明を適用している。外挿予測−離散サイン変換符号化方式では、原画像信号
を複数のブロックに分割して得た入力画像信号列が入力
される。離散サイン変換部７への入力は、量子化された
後すでに再生された再生画像信号列に基づいて予測手段
１７により予測した予測画像信号列と入力画像信号列と
の差である予測誤差信号列であるので、信号レベル検出
手段１では、予測誤差信号列の信号電力を反映する特性
量（たとえば相関係数等）に基づいて、実施例１と同様
に制御情報を生成する。離散サイン変換手段１１ｂでは
信号レベル検出手段１で生成された制御情報に応じて出
力される変換係数列のトランケーションの方法を変化さ
せて入力画像信号列に対して離散サイン変換を行ない、
変換係数列を出力する。量子化手段１２では変換係数列
に対し上記制御情報に基づいてステップ幅のスケールを
設定して量子化を行い、量子化インデクスを出力する。符号化手段１３では量子化インデクスを可変長符号に符
号化するとともに信号レベル検出手段１で生成された制
御情報を付加情報として符号化し、符号語を出力する゛
。ところで、量子化インデクスは逆量子化手段１４によ
り上記制御情報に基づいてスケールを設定したステップ
幅を用いて逆量子化されて変換係数列が再生され、この
変換係数列に対して逆離散サイン変換手段１５により上
記制御情報に基づいてトランケーションされたビット数
が元にもどされて逆離散サイン変換が施され、予測誤差
信号列が再生される。予測誤差信号列は、予測手段１７
からの予測信号列と加算されて再生画像信号列が得られ
、再生画像信号列は、遅延メモリ１６に記憶され以降の
予測に使用される。符号語から画像信号を再生するには、第３図に示すよう
に、復号化手段２１により量子化インデクスおよび付加
情報としての制御情報を復号化し、逆量子化手段２４に
より上記制御情報に基づいてスケールを設定したステッ
プ幅を用いて逆量子化を行って変換係数列を再生する。さらに、変換係数列に対して逆離散サイン変換手段２５
ｂにより上記制御情報に応じてトランケーションされた
ビット数を元に戻して逆離散サイン変換を行なって予測
誤差信号列を再生した後、予測手段２７からの予測信号
列と加算して再生画像信号を得るのである。また、再生
画像信号列は、遅延メモリ２６に記憶され以降の予測に
使用される。実施例１および実施例２においてトランケーションのビ
ット数をどのように切り換えるべきかの評価のための符
号量とＳＮ比との関係を第４図に示す。ここに、トラン
ケーションの方法は第１２図（ａ）　（ｂ）に示した処
理を切り換えて行っている（図中Ｏ印を通る線が第１２
図（ａ）に対応し、口中を通る線が第１２図（ｂ）に対
応している）、また、入力画像は、一般的な自然画像（
女性の人物像）であり、原画像信号の相関が強いので予
測誤差信号の信号電力は比較的小さく、トランケーショ
ンの方法が第１２図（ａ）と第１２図（ｂ）のいずれの
方法でも上位ビットのカットは発生しない。したがって
丸めの誤差が小さい第１２図（ｂ）の方法の方がＳＮ比
が高くなっている。また、印刷文字を含むコントラストの強い画像に対して
は下表のように、部分的に予測誤差信号の信号電力が大
きくなることがあり、第１２図（ｂ）の方法のほうが上
位ビットの切り取りが多数発生しており、上記ビットの
切り取りによる誤差の影響により第１２図（ａ）の方法
に比べＳＮ比が低くEmbodiment 2 In this embodiment, as shown in FIG. 2, the present invention is applied to an extrapolation predictive discrete sine transform coding method. In the extrapolation prediction-discrete sine transform encoding method, an input image signal sequence obtained by dividing an original image signal into a plurality of blocks is input. The input to the discrete sine conversion unit 7 is a prediction error signal sequence that is the difference between the input image signal sequence and the predicted image signal sequence predicted by the prediction means 17 based on the reproduced image signal sequence that has been quantized and already reproduced. Therefore, the signal level detection means 1 generates control information in the same way as in the first embodiment, based on characteristic quantities (for example, correlation coefficients, etc.) that reflect the signal power of the prediction error signal sequence. The discrete sine transformation means 11b performs discrete sine transformation on the input image signal sequence by changing the method of truncating the output transform coefficient sequence according to the control information generated by the signal level detection means 1,
Output the conversion coefficient sequence. The quantization means 12 performs quantization by setting a step width scale for the transform coefficient sequence based on the control information, and outputs a quantization index. The encoding means 13 encodes the quantization index into a variable length code, encodes the control information generated by the signal level detection means 1 as additional information, and outputs a code word. Incidentally, the quantization index is dequantized by the dequantization means 14 using a step width whose scale is set based on the above control information to reproduce a transform coefficient sequence, and the transform coefficient sequence is subjected to inverse discrete sine transformation. The means 15 restores the truncated number of bits based on the control information and performs inverse discrete sine transformation to reproduce the prediction error signal sequence. The prediction error signal sequence is transmitted to the prediction means 17
A reproduced image signal sequence is obtained by adding the predicted signal sequence from , and the reproduced image signal sequence is stored in the delay memory 16 and used for subsequent prediction. In order to reproduce an image signal from a code word, as shown in FIG. Inverse quantization is performed using the step width with the scale set to reproduce the transform coefficient sequence. Further, the inverse discrete sine transform means 25
After restoring the number of bits truncated according to the control information and performing inverse discrete sine transformation to reproduce the prediction error signal sequence in step b, the prediction error signal sequence is added to the prediction signal sequence from the prediction means 27 to obtain the reproduced image signal. You get it. Further, the reproduced image signal sequence is stored in the delay memory 26 and used for subsequent prediction. FIG. 4 shows the relationship between the amount of code and the SN ratio for evaluating how to switch the number of truncation bits in the first and second embodiments. Here, the truncation method is performed by switching the processes shown in Figures 12(a) and (b) (the line passing through the O mark in the figure is the 12th
(corresponds to Figure 12(a), and the line passing through the mouth corresponds to Figure 12(b)), and the input image is a general natural image (
The signal power of the prediction error signal is relatively small because the correlation with the original image signal is strong, and the truncation method is ranked high in both the methods shown in Fig. 12 (a) and Fig. 12 (b). No bit cutting occurs. Therefore, the method shown in FIG. 12(b), which has a smaller rounding error, has a higher signal-to-noise ratio. In addition, for images with strong contrast that include printed characters, the signal power of the prediction error signal may increase partially as shown in the table below, so the method shown in Figure 12 (b) is better for cutting out the upper bits. A large number of errors occur, and the S/N ratio is lower than the method shown in Figure 12 (a) due to the influence of errors caused by cutting out the bits mentioned above.

【実施例３】本実施例では、外挿予測−離散サイン変換符号化方式に
本発明を適用した他の例を示す。入力信号列は、原画像信号を分割した所定の大きさのブ
ロックであって、予測誤差信号列は、離散サイン変換手
段１１ｂにより離散サイン変換が施される。Ｎ散すイン
変換手段１１ｂにおけるトランケーションの方法を切り
換えるための制御情報は、すでに符号化したブロックに
ついて再生した予測誤差信号に基づいて得られる。すなわち、第７図に示すように、これから符号化するブ
ロックＸに対してブロックＸの周辺ですでに符号化され
ているブロックＡ−Ｄについて再生した予測誤差信号の
信号電力を参照することにより、多数決や算術平均等の
一定の規則にしたがってブロックＸのトランケーション
の方法を決定する。このように符号化する対象となって
いるブロックＸの周辺ですでに符号化されたブロックＡ
−Ｄの予測誤差信号に基づいた制御情報を参照して、対
象ブロックＸの制御情報を生成しているから、ブロック
単位の処理によって、トランケーションの方法の切り換
えをブロック単位で行なうことができ、画像の局所的な
特性に適応した切り換えを行なうことができるのはもち
ろんのこと、符号化ずみの情報を参照しているので新た
に制御情報を符号化する必要がなく符号語に付加情報が
生じないのである。信号レベル検出手段１では、符号化が完了したブロック
の再生された予測誤差信号の信号電力を測定し、信号電
力の大小を示す制御情報を生成し記憶手段１８に記憶す
る。記憶手段１８には、符号化ずみのブロックの信号電
力の情報が蓄積される。離散サイン変換手段１１ｂでは
、記憶手段１８に蓄積された情報のうち符号化の対象と
なるブロックの周辺のブロックの情報を参照してトラン
ケーションの方法を決定して制御情報を生成し、これに
基づいて離散サイン変換を行うのである。符号語から画像信号を再生するには、第６図に示すよう
に、すでに再生したブロックにおける予測誤差信号の信
号電力を信号レベル検出手段１ｂにより測定し、この信
号電力に基づく制御情報を記憶手段２９に蓄積しておく
、対象となるブロックの周辺のブロックの情報を参照し
てトランケーションされたビット数を決定して制御情報
を生成し、この制御情報に基づいて離散サイン変換を行
なうのである。他の構成は実施例２と同様であるから説
明を省略する。Embodiment 3 In this embodiment, another example in which the present invention is applied to an extrapolation prediction/discrete sine transform encoding method will be described. The input signal sequence is a block of a predetermined size obtained by dividing the original image signal, and the prediction error signal sequence is subjected to discrete sine transformation by the discrete sine transformation means 11b. Control information for switching the truncation method in the N-dispersion in conversion means 11b is obtained based on the prediction error signal reproduced for the already encoded block. That is, as shown in FIG. 7, by referring to the signal power of the prediction error signal reproduced for blocks A to D that have already been encoded around the block X for the block X to be encoded, The method of truncating block X is determined according to certain rules such as majority voting or arithmetic mean. Block A that has already been encoded in the vicinity of block X that is the target of encoding in this way
Since the control information for the target block Not only is it possible to perform switching that adapts to the local characteristics of the code, but since the encoded information is referenced, there is no need to encode new control information, and no additional information is generated in the code word. It is. The signal level detection means 1 measures the signal power of the reproduced prediction error signal of the encoded block, generates control information indicating the magnitude of the signal power, and stores it in the storage means 18. The storage means 18 stores information on the signal power of encoded blocks. The discrete sign conversion means 11b determines a truncation method by referring to information on blocks surrounding the block to be encoded among the information stored in the storage means 18, and generates control information based on this. Then, the discrete sine transform is performed. To reproduce an image signal from a code word, as shown in FIG. 6, the signal power of the prediction error signal in the already reproduced block is measured by the signal level detection means 1b, and the control information based on this signal power is stored in the storage means. The number of bits to be truncated is determined by referring to the information of blocks surrounding the target block stored in the block 29, control information is generated, and discrete sine conversion is performed based on this control information. The other configurations are the same as those in the second embodiment, so their explanation will be omitted.

【Effect of the invention】

本発明は上述のように、入力される信号列の信号電力の
ような特性量に応じて直交変換後の出力ビット長を制限
する方法を切り換えるものであり、入力される信号列の
信号電力のような特性量に基づいてトランケーションの
方法を変えるから、直交変換の後に出力ビット長を制限
する際に入力の信号列に対応して最適なトランク−ジョ
ンの方法が選択できるのであり、トランケーションエラ
ーがほとんど生じないようにすることができ、トランケ
ーションエラーによる画質の劣化を抑制することができ
るという利点を有するのである。また、原画像を複数の領域に分割して、各領域単位でト
ランケーションの方法を選択すれば、画像の局所的な特
性量の変化に対応した符号化が行えることになり、さら
に画質の劣化を抑制できるという効果がある。As described above, the present invention switches the method of limiting the output bit length after orthogonal transformation according to the characteristic quantity such as the signal power of the input signal string, and Since the truncation method is changed based on such characteristic quantities, the optimal truncation method can be selected according to the input signal sequence when limiting the output bit length after orthogonal transformation, and truncation errors can be reduced. This has the advantage that it can be prevented from occurring, and deterioration in image quality due to truncation errors can be suppressed. Furthermore, by dividing the original image into multiple regions and selecting a truncation method for each region, encoding can be performed that corresponds to changes in local characteristics of the image, further reducing image quality deterioration. It has the effect of suppressing

[Brief explanation of the drawing]

第１図は本発明の実施例１を示す概略構成図、第２図は
本発明の実施例２を示す符号化処理を行う部分の概略構
成図、第３図は同上の復号化処理を行う部分の概略構成
図、第４図は本発明の動作説明図、第５図は本発明の実
施例３を示す符号化処理を行う部分の概略構成図、第６
図は同上の復号化を行う部分の概略構成図、第７図は同
上における制御情報を得る処理の概念を示す説明図、第
８図は従来例を示す概略構成図、第９図は他の従来例を
示す概略構成図、第１０図は同上における復号化処理を
行う部分の概略構成図、第１１図は同上における直交変
換の計算過程の概念を示す動作説明図、第１２図は同上
におけるトランケーションの方法の一例を示す説明図で
ある。１．１ｂ・・・信号レベル検出手段、２・・・信号レベ
ル調節手段、ｌｌａ・・・離散コサイン変換手段、１１
ｂ・・・離散サイン変換手段、１２・・・量子化手段、
１３・・・符号化手段、１４・・・逆量子化手段、１５
・・・逆離散サイン変換手段、１６・・・遅延メモリ、
１７・・・予測手段、１８・・・信号レベル逆調節手段
、１つ・・・記憶手段、２１・・・復号化手段、２４・
・・逆量子化手段、２５ａ・・・逆離散コサイン変換手
段、２５ｂ・・・逆離散サイン変換手段、２６・・・遅
延メモリ、２７・・・予測手段、２８・・・信号レベル
逆調節手段、２９・・・記憶手段。代理人　　弁理士　　石　１）長　七第７図第８図ｂａと９第１０図手続補正書く自発）１．事件の表示平成１年特許願第１７３０５号２、発明の名称画像符号化方式３、補正をする者事件との関係　　特許出願人住　所　大阪府門真市大字門真１０４８番地名称（５８
Ｂ）松下電工株式会社代表者　　三　　好　　俊　　夫４、代理人郵便番号　５３０５、補正命令の日付自　　発６、補正により増加する請求項の数　なし７、補正の対
象明細書及び図面［１］本願明細書の特許請求の範囲の記載を下記の通り
補正する。［（１）画像信号より得られた信号列に直交変換を施し
た後に符号化する画像符号化方式において、入力される
信号列の信号電力見血筺肚蓋特性量に応じて直交変換後
の出力ビット長を制限する方法を切り換えることを特徴
とする画像符号化方式。（２）原画像を複数の領域に分割し、各領域単位で信号
電力１ａ致■旬特性量に基づいて直交変換後の出力ビッ
ト長を制限する方法を切り換えることを特徴とする請求
項１記載の画像符号化方式、」［２］同上第８頁第４行
、第８頁第８行の「信号電力のような特性量」を、それ
ぞれ「信号電力等の統計的特性量」と訂正する。［３］同上第９頁第１１行乃至第１３行の全文を削除し
、以下の文を挿入する。「１によって、１画面分の信号電力（画素の輝度の２乗
平均値）、あるいはこれを近似的に表現するダイナミッ
クレンジ（画素の輝度の最大値と最小値との差）が求め
られ、対応した制」［４］同上第１０頁第１２行の「符
号化手段３」を、ンケーションを行うビット長を設定し
て」と訂正「符号化手段１３」と訂正する。［５］同上第１１頁第１行乃至第２行の「トランケーシ
ョンされたビット数を元に戻して」を、ｒトランケーシ
ョンを行うビット長を設定して」と訂正する。［６］同上第１１頁第７行乃至第８行、第１１頁第１０
行乃至第１１行の「原画像信号の信号レベルを調節」を
、それぞれ「トランケーションの方法を変化」と訂正す
る。［７コ同上第１１頁第１６行の「信号レベルを調節」を
、「トランケーションの方法を変化」と訂正する。［８］同上第１１頁第１８行の「しかも」がら第１２頁
第１行の「である。」までを削除する。［１０］同上第１３頁第１４行乃至第１５行の「トラン
ケーションされたビット数が元にもどされて」を、「ト
ランケーションを行うビット長が設定されて」と訂正す
る。［１１］同上第１４頁第７行乃至第８行の「トランケー
ションされたビット数を元に戻、して１１」を、「トラ
する。［１２］同上第１８頁第１行乃至第２行の「トランケー
ションされたＪを、「トランケーションを行う」と訂正
する。［１３］同上第１８頁第７行乃至第８行、第１８頁第１
０行の「信号電力のような特性量」を、「信号電力等の
統計的特性量」と訂正する。［１４］同上第２０頁第３行の「信号レベル逆調節手段
、１９」を削除する。［１５］同上第２０頁第７行の「２８・・・信号レベル
逆調節手段」を削除する。［１６コ添付図面中第１図、第２図、第３図、第５図、
第６図、第９図を別紙のように訂正する。代理人　弁理士　石　１）長　七FIG. 1 is a schematic block diagram showing a first embodiment of the present invention, FIG. 2 is a schematic block diagram of a part that performs encoding processing, showing a second embodiment of the present invention, and FIG. 3 is a block diagram showing the same decoding process. 4 is an explanatory diagram of the operation of the present invention; FIG. 5 is a schematic diagram of a portion that performs encoding processing according to the third embodiment of the present invention; FIG.
The figure is a schematic block diagram of the decoding part of the same as above, FIG. 7 is an explanatory diagram showing the concept of the process of obtaining control information in the above, FIG. 8 is a schematic block diagram of a conventional example, and FIG. 9 is a diagram of another A schematic configuration diagram showing a conventional example, FIG. 10 is a schematic configuration diagram of a part that performs decoding processing in the same as above, FIG. 11 is an operation explanatory diagram showing the concept of the calculation process of orthogonal transform in the same as above, and FIG. FIG. 2 is an explanatory diagram showing an example of a truncation method. 1.1b...Signal level detection means, 2...Signal level adjustment means, lla...Discrete cosine transformation means, 11
b...Discrete sine conversion means, 12...Quantization means,
13... Encoding means, 14... Inverse quantization means, 15
... Inverse discrete sine conversion means, 16... Delay memory,
17... Prediction means, 18... Signal level reverse adjustment means, one... Storage means, 21... Decoding means, 24.
... Inverse quantization means, 25a... Inverse discrete cosine transform means, 25b... Inverse discrete sine transform means, 26... Delay memory, 27... Prediction means, 28... Signal level inverse adjustment means , 29...memory means. Agent Patent Attorney Ishi 1) Chief 7 Figure 8 BA and 9 Figure 10 Voluntary writing of procedural amendments) 1. Display of the case 1999 Patent Application No. 17305 2, Name of the invention Image encoding method 3, Person making the amendment Relationship to the case Patent applicant address 1048 Kadoma, Kadoma City, Osaka Prefecture Name (58
B) Matsushita Electric Works Co., Ltd. Representative: Toshio Miyoshi 4, Agent postal code: 530 5, Date of amendment order: 6, Number of claims increased by amendment: None 7, Specification and drawings subject to amendment [1] The claims in the specification of this application are amended as follows. [(1) In an image encoding method that performs orthogonal transformation on a signal sequence obtained from an image signal and then encodes the signal sequence, the value after orthogonal transformation is determined according to the signal power characteristics of the input signal sequence. An image encoding method characterized by switching the method of limiting the output bit length. (2) The original image is divided into a plurality of regions, and the method for limiting the output bit length after orthogonal transformation is switched based on the signal power 1a characteristic quantity for each region. ``Image encoding system,'' [2] ``Characteristic quantities such as signal power'' on page 8, line 4 and page 8, line 8 of the same are corrected to ``statistical characteristic quantities such as signal power,'' respectively. . [3] Delete the entire text of page 9, line 11 to line 13 of the same as above, and insert the following sentence. 1 determines the signal power for one screen (the root mean square value of pixel brightness) or the dynamic range that approximately expresses this (the difference between the maximum and minimum pixel brightness values), and [4] "Encoding means 3" on page 10, line 12 of the above is corrected to "setting the bit length for encoding" and "encoding means 13". [5] In the 1st and 2nd lines of page 11, ``Restore the number of truncated bits'' is corrected to ``Set the bit length for r-truncation''. [6] Same as above, page 11, lines 7 to 8, page 11, line 10
"Adjust the signal level of the original image signal" in lines 1 to 11 is corrected to "change the truncation method." [7. Correct ``Adjust the signal level'' on page 11, line 16 of the same page to ``change the truncation method.'' [8] Delete the words "Mosakamo" on page 11, line 18 of the same as above to "desu" on page 12, line 1. [10] On page 13, lines 14 and 15 of the same page, the phrase "the number of truncated bits is restored" is corrected to "the bit length for truncation is set." [11] "Restore the truncated number of bits and 11" in the 7th to 8th lines of the 14th page of the same page. [12] The 1st to 2nd lines of the 18th page of the same page ``Correct the truncated J to ``perform truncation.'' [13] Same as above, page 18, lines 7 to 8, page 18, line 1
Correct "characteristic quantity such as signal power" in line 0 to "statistical characteristic quantity such as signal power." [14] Delete "Signal level inverse adjustment means, 19" on page 20, line 3 of the same. [15] Delete "28...Signal level reverse adjustment means" on page 20, line 7 of the same. [Of the 16 attached drawings, Figures 1, 2, 3, 5,
Figures 6 and 9 are corrected as shown in the attached sheet. Agent Patent Attorney Ishi 1) Choshichi

Claims

[Claims]

(1) In an image encoding method in which a signal sequence obtained from an image signal is encoded after orthogonal transformation, the output bit length after orthogonal transformation is determined according to the characteristic quantity such as the signal power of the input signal sequence. An image encoding method characterized by switching a method of limiting.

(2) The original image is divided into a plurality of regions, and the method of limiting the output bit length after orthogonal transformation is switched for each region based on a characteristic quantity such as signal power. Image encoding method.