JPH06164939A

JPH06164939A - Encoder for picture signal

Info

Publication number: JPH06164939A
Application number: JP30545592A
Authority: JP
Inventors: Kazuhiro Suzuki; 一弘鈴木; Yutaka Koshi; 裕越; Setsu Kunitake; 節國武; Shunichi Kimura; 俊一木村; Isao Uesawa; 功上澤
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1992-11-16
Filing date: 1992-11-16
Publication date: 1994-06-10
Anticipated expiration: 2012-11-19
Also published as: JP2679769B2

Abstract

PURPOSE:To realize the adaptive encoding with a high efficiency and less in picture quality deterioration by employing the waveform analysis method in a picture element space analyzing a waveform of each input block based on a direction of a gradation change and a characteristic in the amplitude direction. CONSTITUTION:A picture signal is divided into an input block comprising mXn picture elements by a block extract section 2 and an orthogonal transformation section 100 applies orthogonal transformation to the input block to obtain a transformation coefficient. An area analysis section 7 analyzes a waveform of the input block. A quantization matrix storage section 104 stores plural sets of quantization matrices suitable for each waveform in advance and a corresponding quantization matrix is read in response to the result of analysis of waveform by an area analysis section 7. A quantization section 102 quantizes the transformation coefficient with a quantization matrix read from the quantization matrix storage section 104 to obtain a quantization coefficient. A variable length coding means 106 applies variable length encoding to the quantization coefficient and a multiplexer section 108 multiplexes the encoding result to obtain code data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、画像信号の符号化装置
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image signal coding apparatus.

【０００２】[0002]

【従来の技術】画像信号の符号化方式として、ファクシ
ミリの標準方式の一つであるＪＰＥＧ方式（ＩＳＯ−Ｉ
ＥＣ／ＣＤ１０９１８−１，“ＤｉｇｉｔａｌＣｏ
ｍｐｒｅｓｓｉｏｎａｎｄＣｏｄｉｎｇｏｆＣ
ｏｎｔｉｎｕｏｕｓ−ｔｏｎｅＳｔｉｌｌＩｍａｇｅ
ｓＰａｒｔ１Ｒｅｑｕｉｒｅｍｅｎｔａｎｄｇ
ｕｉｄｅｌｉｎｅ”参照）で採用されているような、直
交変換の一種である離散コサイン変換（Ｄｉｓｃｒｅｔ
ｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）に基づく手法
が知られている。例えば、８次の２次元離散コサイン変
換の変換は、（１）式で与えられ、逆変換は（２）式と
なる。2. Description of the Related Art As an image signal encoding method, a JPEG method (ISO-I
EC / CD 10918-1, "Digital Co
compression and Coding of C
ongoing-toneStill Image
s Part 1 Requirements andg
Discrete Cosine Transform (Discrete Transform), which is a type of orthogonal transform, such as that used in the
A method based on e Cosine Transform is known. For example, the transformation of the 8th-order two-dimensional discrete cosine transformation is given by the equation (1), and the inverse transformation becomes the equation (2).

【０００３】[0003]

【数１】ここで、[Equation 1] here,

【数２】また、ｆ（ｉ，ｊ）は、画素ブロックの各要素を表し、
ｉ，ｊは要素の位置を表す。Ｆ（ｕ，ｖ）は、変換係数
の各要素を表し、ｕ，ｖは要素の位置を表す。[Equation 2] Also, f (i, j) represents each element of the pixel block,
i and j represent the position of the element. F (u, v) represents each element of the transform coefficient, and u, v represents the position of the element.

【０００４】人物、風景等の自然画像と呼ばれる画像信
号では、隣接する画素どうしが近い画素値をとる傾向が
あり、相関性の高いことが知られている。このような相
関性の高い信号は、周波数軸上で見ると、ある特定の周
波数成分に信号電力が集中的に分布していることを意味
する。この信号電力が集中して分布する成分の係数のみ
を符号化すれば、全体としての情報量削減が可能とな
る。自然画像の場合には、離散コサイン変換を行うこと
により、大部分の信号電力が低周波領域に集中する。It is known that in an image signal called a natural image of a person, a landscape or the like, adjacent pixels tend to have pixel values close to each other, and have a high correlation. Such a highly correlated signal means that the signal power is concentratedly distributed on a specific frequency component when viewed on the frequency axis. If only the coefficient of the component in which the signal power is concentrated and distributed is encoded, it is possible to reduce the information amount as a whole. In the case of a natural image, most of the signal power is concentrated in the low frequency region by performing the discrete cosine transform.

【０００５】以下、図１０によって従来例の構成につい
て説明する。The configuration of the conventional example will be described below with reference to FIG.

【０００６】図において、１は入力画像、３はブロック
抽出部２によって入力画像より切り出された入力ブロッ
ク、１０１は直交変換部１００によって入力ブロック３
に（１）式に示す離散コサイン変換を施して得られる変
換係数、１０３は量子化マトリクス格納部１０４に格納
された量子化マトリクス、１０５は量子化部１０２によ
って変換係数１０１を量子化マトリクス１０３で量子化
することによって得られる量子化係数、１０７は量子化
係数１０５を可変長符号化して得られる可変長符号、１
０９は可変長符号１０７を多重化した符号データであ
る。また、２は入力画像１から画素の矩形領域である入
力ブロック３を抽出するブロック抽出部、１００は入力
ブロック３に対して離散コサイン変換を施す直交変換
部、１０４は量子化マトリクスを記憶する量子化マトリ
クス格納部、１０２は変換係数１０１に対して量子化マ
トリクス１０３を用いて量子化を行う量子化部、１０６
は量子化係数１０６を可変長符号化する可変長符号化
部、１０８は可変長符号１０７を多重化して符号データ
１０９を構成する多重化部である。In the figure, 1 is an input image, 3 is an input block cut out from the input image by the block extraction unit 2, 101 is an input block 3 by the orthogonal transformation unit 100.
Is a transform coefficient obtained by performing the discrete cosine transform shown in the equation (1), 103 is a quantization matrix stored in the quantization matrix storage unit 104, and 105 is a quantization matrix 103 that transforms the transform coefficient 101 into a quantization matrix 103. Quantized coefficient obtained by quantizing, 107 is a variable length code obtained by variable length encoding the quantized coefficient 105, 1
Reference numeral 09 is code data in which the variable length code 107 is multiplexed. Further, 2 is a block extraction unit that extracts an input block 3 that is a rectangular region of pixels from the input image 1, 100 is an orthogonal transformation unit that performs a discrete cosine transform on the input block 3, and 104 is a quantum that stores a quantization matrix. A quantization matrix storage unit 102, a quantization unit 106 that quantizes the transform coefficient 101 using the quantization matrix 103,
Is a variable length coding unit for variable length coding the quantized coefficient 106, and 108 is a multiplexing unit for multiplexing the variable length code 107 to form code data 109.

【０００７】以下、図１０に基づいて動作を説明する。The operation will be described below with reference to FIG.

【０００８】ブロック抽出部２では、図１１に示すよう
に入力画像１から、画素の矩形領域である入力ブロック
３が抽出される。図は８×８画素の領域の場合であり、
以下実施例では、８×８画素のサイズについて説明す
る。The block extracting section 2 extracts an input block 3 which is a rectangular area of pixels from the input image 1 as shown in FIG. The figure shows the case of the area of 8 × 8 pixels,
In the following example, a size of 8 × 8 pixels will be described.

【０００９】続いて直交変換部１００においては、入力
ブロック３に対して（１）式に示した離散コサイン変換
が施される。離散コサイン変換の結果、変換係数１０１
は８×８のサイズのマトリクスとして得られる。この
時、変換係数１０１は図１２のようにマトリクス内をジ
グザグに走査した一次元の係数列として出力される。Then, in the orthogonal transform unit 100, the discrete cosine transform shown in the equation (1) is applied to the input block 3. As a result of the discrete cosine transform, the transform coefficient 101
Is obtained as a matrix of size 8 × 8. At this time, the transform coefficient 101 is output as a one-dimensional coefficient string in which the matrix is zigzag scanned as shown in FIG.

【００１０】量子化部１０２における量子化処理は、変
換係数１０１と量子化マトリクス格納部１０４に格納さ
れた量子化マトリクス１０３を用いて行われる。量子化
は次式で定義される丸め処理である。The quantization processing in the quantization unit 102 is performed using the transform coefficient 101 and the quantization matrix 103 stored in the quantization matrix storage unit 104. Quantization is a rounding process defined by the following equation.

【００１１】 C(u,v)=(F(u,v)+(Q(u,v)/2))/Q(u,v) (F(u,v)≧0) ・・・・（４） C(u,v)=(F(u,v)-(Q(u,v)/2))/Q(u,v) (F(u,v)＜0) ・・・・（５）ここで、Ｆ（ｕ，ｖ），Ｑ（ｕ，ｖ）は、それぞれ変換
係数、量子化マトリクスの各要素を表す。ｕ，ｖは要素
の位置を表す。図１３に量子化マトリクス１０３の例を
示す。以下、係数位置ごとのそれぞれの値を量子化ステ
ップ値と呼ぶ。C (u, v) = (F (u, v) + (Q (u, v) / 2)) / Q (u, v) (F (u, v) ≧ 0) ... (4) C (u, v) = (F (u, v)-(Q (u, v) / 2)) / Q (u, v) (F (u, v) <0) ... (5) Here, F (u, v) and Q (u, v) represent the transform coefficient and each element of the quantization matrix, respectively. u and v represent the position of the element. FIG. 13 shows an example of the quantization matrix 103. Hereinafter, each value for each coefficient position is called a quantization step value.

【００１２】量子化マトリクス１０３は、量子化マトリ
クス格納部１０４に、図１２に示すジグザグスキャンの
順に格納される。これにより、量子化部１０２において
は、ジグザグスキャンの順に、変換係数１０１と対応す
る位置の量子化ステップ値が読み込まれ、（４），
（５）式の量子化処理が順次実行される。The quantization matrix 103 is stored in the quantization matrix storage unit 104 in the zigzag scan order shown in FIG. As a result, the quantization unit 102 reads the quantization step value at the position corresponding to the transform coefficient 101 in the zigzag scan order, and (4),
The quantization process of the equation (5) is sequentially executed.

【００１３】従来例においては、画質、符号化効率は量
子化処理によって決定される。符号化における情報削減
効果は、変換係数のビット精度の低減によって実現され
る。通常は、変換係数の電力分布には偏りがあるため、
電力の集中する係数のビット精度を高く、電力の集中し
ない係数のビット精度を粗く設定することによって、再
現画質と符号化効率の両立を計っている。図１３に示す
量子化マトリクスでは、低周波成分の係数には多くのビ
ットが、高周波成分の係数には少ないビットが配分され
る設定となっている。In the conventional example, the image quality and the coding efficiency are determined by the quantization process. The information reduction effect in encoding is realized by reducing the bit precision of transform coefficients. Normally, the power distribution of the conversion coefficient is biased, so
By setting the bit precision of the coefficient on which the power is concentrated and the bit precision of the coefficient on which the power is not concentrated roughly, the reproduction image quality and the coding efficiency are compatible with each other. In the quantization matrix shown in FIG. 13, many bits are allocated to the coefficient of the low frequency component, and few bits are allocated to the coefficient of the high frequency component.

【００１４】しかしながら、上記したＪＰＥＧ方式にお
いては、入力される画像の波形を分析する手法を持た
ず、また、一つの画像（カラー画像の場合は、色成分ご
と）に対して１種類の量子化特性しか用いることができ
ないことから、原稿内容に対する適応化ができず、再現
画質と符号化効率の改善を十分に両立させることが困難
であるという問題があった。However, the above-mentioned JPEG method does not have a method of analyzing the waveform of an input image, and one kind of quantization is performed for one image (in the case of a color image, each color component). Since only the characteristics can be used, there is a problem in that it cannot be adapted to the content of the original document, and it is difficult to sufficiently achieve both the reproduction image quality and the improvement of the coding efficiency.

【００１５】そこでこの問題を解決するものとして、画
像ごとの特性に合わせて、各変換係数の分散に基づいて
最適なビット配分を決定することにより、画質及び圧縮
効率の双方を改善する手法が、尾上守夫、「画像処理ハ
ンドブック（９．３濃淡静止画像の符号化）」，株式
会社昭晃堂発行，１９８７，ｐ２２１に開示されてい
る。この適応符号化におけるビット数の配分は、次式の
ように表せる。To solve this problem, a method of improving both image quality and compression efficiency by determining the optimal bit distribution based on the variance of each transform coefficient in accordance with the characteristics of each image is as follows. Morio Onoe, "Image Processing Handbook (9.3 Encoding of still image of grayscale)", published by Shokoido Co., Ltd., 1987, p221. The distribution of the number of bits in this adaptive encoding can be expressed as the following equation.

【００１６】[0016]

【数３】ここで、ｂ（ｕ，ｖ）は変換係数Ｆ（ｕ，ｖ）に割当て
るビット数、σ（ｕ，ｖ）²は変換係数Ｆ（ｕ，ｖ）の
分散、θは平均のビット数である。[Equation 3] Here, b (u, v) is the number of bits assigned to the transform coefficient F (u, v), σ (u, v) ² is the variance of the transform coefficient F (u, v), and θ is the average number of bits. .

【００１７】これにより、割当てるビット数ｂ（ｕ，
ｖ）と変換係数のダイナミックレンジＬ（ｕ，ｖ）か
ら、量子化ステップ値Ｑ（ｕ，ｖ）は次式によって決定
することができる。As a result, the number of allocated bits b (u,
v) and the dynamic range L (u, v) of the transform coefficient, the quantization step value Q (u, v) can be determined by the following equation.

【００１８】 Q(u,v)=Int［L(u,v)/2^b(u,v)］・・・・（８）ここで、Ｉｎｔ［］は整数化することを意味する。Q (u, v) = Int [L (u, v) / 2 ^{b (u, v)} ] (8) Here, Int [] means to be an integer.

【００１９】図１０に示す可変長符号化部１０６では、
量子化係数１０５に対してハフマン符号化等の可変長符
号化を行い、割当てられた可変長符号１０７が出力され
る。In the variable length coding unit 106 shown in FIG.
Variable length coding such as Huffman coding is performed on the quantized coefficient 105, and the assigned variable length code 107 is output.

【００２０】多重化部１０８では、可変長符号を多重化
して符号データ１０９を構成することにより、符号化動
作が完了する。The multiplexer 108 completes the encoding operation by multiplexing the variable length codes to form the code data 109.

【００２１】[0021]

【発明が解決しようとする課題】通常、スキャナ等で入
力される画像中には、文字、写真などの異種領域が混在
することが予想される。このような異なる領域に変換符
号化を適用する場合、領域によって変換係数の電力分布
は大きく異なる。Normally, it is expected that different types of regions such as characters and photographs will coexist in an image input by a scanner or the like. When transform coding is applied to such different regions, the power distribution of transform coefficients greatly differs depending on the regions.

【００２２】画像の特性に合わせて最適なビット配分を
行う技術自体は、上記した「画像処理ハンドブック」に
開示されているが、この文献に開示されている従来技術
では、符号化すべき画像全体の平均的な特性に基づいて
ビット配分を決定していたので、異種領域ごとの特性の
違いについては考慮されていなかった。したがって、写
真画像中に文字部が含まれている場合においても、画像
全体に対して求めた平均的なビット配分が適用されるこ
とになる。文字部では、エッジによって発生する高周波
成分を保存するために多くのビット配分が必要となり、
さらにエッジの方向によって高周波成分の分布が異な
る。平均的なビット配分のみでは、これらの特性の違い
に対応できず、文字画質の劣化を引き起こすという問題
があった。The technique itself for performing the optimal bit allocation according to the characteristics of the image is disclosed in the above-mentioned "Image Processing Handbook", but in the conventional technique disclosed in this document, the entire image to be encoded is Since the bit allocation was determined based on the average characteristics, the difference in characteristics between different areas was not considered. Therefore, even when the photographic image includes a character portion, the average bit allocation calculated for the entire image is applied. In the character part, a lot of bit allocation is required to save the high frequency component generated by the edge,
Furthermore, the distribution of high frequency components differs depending on the direction of the edge. There is a problem in that the difference in these characteristics cannot be dealt with only by the average bit allocation, and the character image quality is deteriorated.

【００２３】図１４〜図１６は、水平方向に階調変化を
有する画素ブロック（図１４）、垂直方向に階調変化を
有する画素ブロック（図１５）、斜め方向に階調変化を
有する画素ブロック（図１６）のそれぞれについて、画
素分布（図１４（ａ）〜図１６（ａ））と変換係数の電
力分布（図１４（ｂ）〜図１６（ｂ））の対応を示すも
のである。図１４〜図１６から判るように、入力ブロッ
ク中の階調変化の方向、振幅変化の大きさによって係数
電力の分布が異なっている。したがって、変換符号化の
場合には、入力ブロックの波形を分析することによっ
て、変換係数の量子化特性を決定する適応符号化が画
質、効率の観点から有望と考えられる。14 to 16 are pixel blocks having gradation changes in the horizontal direction (FIG. 14), pixel blocks having gradation changes in the vertical direction (FIG. 15), and pixel blocks having gradation changes in the diagonal direction. FIG. 16 shows the correspondence between the pixel distribution (FIGS. 14A to 16A) and the conversion coefficient power distribution (FIGS. 14B to 16B) for each of FIGS. As can be seen from FIGS. 14 to 16, the distribution of coefficient power differs depending on the direction of gradation change and the magnitude of amplitude change in the input block. Therefore, in the case of transform coding, adaptive coding that determines the quantization characteristic of transform coefficients by analyzing the waveform of the input block is considered promising from the viewpoint of image quality and efficiency.

【００２４】本発明においては、入力ブロックごとの波
形を階調変化の方向と振幅方向の特徴から分析する画素
空間での波形分析手法を用いることにより効率が高く且
つ画質の劣化が少ない適応符号化を実現することを目的
とする。According to the present invention, the adaptive coding with high efficiency and little deterioration in image quality is achieved by using the waveform analysis method in the pixel space for analyzing the waveform of each input block from the characteristics of the gradation change direction and the amplitude direction. The purpose is to realize.

【００２５】[0025]

【課題を解決するための手段】本発明の画像信号の符号
化装置においては、画像信号を複数の画素から成るｍ×
ｎ画素（ｍ，ｎは正整数）の入力ブロックに分割するブ
ロック抽出手段と、前記入力ブロックに直交変換を施し
て変換係数を得る直交変換手段と、量子化の特性を記憶
する量子化特性格納手段と、前記変換係数を前記量子化
特性格納手段に記憶した特性で量子化して量子化係数を
得る量子化手段と、前記量子化係数を可変長符号化する
符号化手段と、可変長符号化の結果を多重化して符号デ
ータを得る多重化手段を備えた画像信号の符号化装置に
おいて、前記入力ブロックの波形を分析する波形分析手
段を備えたことを特徴とする。In the image signal coding apparatus of the present invention, the image signal is composed of a plurality of pixels m ×
Block extraction means for dividing the input block into n pixels (m and n are positive integers), orthogonal transformation means for subjecting the input block to orthogonal transformation to obtain transformation coefficients, and quantization characteristic storage for storing the characteristics of quantization. Means, quantizing means for quantizing the transform coefficient with a characteristic stored in the quantizing characteristic storing means to obtain a quantized coefficient, coding means for variable-length coding the quantized coefficient, and variable-length coding The image signal coding apparatus having a multiplexing unit for multiplexing the result of (1) to obtain coded data is provided with a waveform analysis unit for analyzing the waveform of the input block.

【００２６】また、前記波形分析手段においては、前記
入力ブロック内の各画素から平均値を減算する平均値分
離手段と、前記平均値分離手段によって得られる平均値
分離ブロックの階調変化の方向の特徴量を分析する第１
の分析手段と、前記平均値分離ブロックの振幅方向の特
徴量を分析する第２の分析手段と、前記第１の分析手段
の分析結果と第２の分析手段の分析結果に基づいて前記
入力ブロックの波形の特徴を判定する判定手段を備えた
ことを特徴とする。Further, in the waveform analysis means, an average value separating means for subtracting an average value from each pixel in the input block, and a direction of gradation change of the average value separating block obtained by the average value separating means The first to analyze the feature quantity
Analyzing means, second analyzing means for analyzing the feature value in the amplitude direction of the average value separation block, and the input block based on the analysis results of the first analyzing means and the second analyzing means. It is characterized in that it is provided with a judging means for judging the characteristics of the waveform.

【００２７】[0027]

【作用】ブロック抽出手段では、画像信号を複数の画素
から成るｍ×ｎ画素（ｍ，ｎは正整数）の入力ブロック
に分割する。直交変換手段では、前記入力ブロックに直
交変換を施し、変換係数を得る。波形分析手段では、前
記入力ブロックの波形を分析する。量子化特性格納手段
では、波形の分析結果に対応する量子化特性が設定され
る。量子化手段では、前記変換係数を量子化特性格納手
段に設定された特性で量子化して量子化係数を得る。符
号化手段では、前記量子化係数を可変長符号化し、多重
化手段では、符号化結果を多重化して符号データを得
る。The block extracting means divides the image signal into input blocks of m × n pixels (m and n are positive integers) each consisting of a plurality of pixels. The orthogonal transform means performs orthogonal transform on the input block to obtain transform coefficients. The waveform analysis means analyzes the waveform of the input block. The quantization characteristic storage means sets the quantization characteristic corresponding to the waveform analysis result. The quantizing means quantizes the transform coefficient with a characteristic set in the quantizing characteristic storing means to obtain a quantized coefficient. The coding means performs variable length coding on the quantized coefficient, and the multiplexing means multiplexes the coding result to obtain coded data.

【００２８】[0028]

【実施例】図１は、本発明の実施例の構成を示す図であ
る。図において、図１０の従来例と対応する部分には同
一符号を付している。FIG. 1 is a diagram showing the configuration of an embodiment of the present invention. In the figure, parts corresponding to those of the conventional example of FIG. 10 are designated by the same reference numerals.

【００２９】図において、１は入力画像、３はブロック
抽出部２によって入力画像より切り出された入力ブロッ
ク、１３は後述する領域分析部７による入力ブロック３
の分析結果である領域情報、１０１は直交変換部１００
によって入力ブロック３に（１）式に示す離散コサイン
変換を施して得られる変換係数、１０３は量子化マトリ
クス格納部１０４に格納された量子化マトリクスのセッ
トから領域情報１３によって選ばれた量子化マトリク
ス、１０５は量子化部１０２によって変換係数１０１を
量子化マトリクス１０３で量子化することによって得ら
れる量子化係数、１０７は量子化係数１０５を可変長符
号化して得られる可変長符号、１０９は可変長符号１０
７を多重化した符号データである。また、２は入力画像
１から画素の矩形領域である入力ブロック３を抽出する
ブロック抽出部、７は入力ブロック３の波形及び利得に
関する分析を行い、結果を領域情報１３として出力する
領域分析部、１００は入力ブロック３に対して離散コサ
イン変換を施す直交変換部、１０４は量子化マトリクス
を記憶し、領域情報１３に対応する量子化マトリクス１
０３を出力する量子化マトリクス格納部、１０２は変換
係数１０１に対して量子化マトリクス１０３を用いて量
子化を行う量子化部、１０６は量子化係数１０５を可変
長符号化する可変長符号化部、１０８は可変長符号１０
７を多重化して符号データ１０９を構成する多重化部で
ある。In the figure, 1 is an input image, 3 is an input block cut out from the input image by the block extracting unit 2, 13 is an input block 3 by an area analyzing unit 7 which will be described later.
Area information which is the analysis result of the orthogonal transformation unit 100
Is a transform coefficient obtained by subjecting the input block 3 to the discrete cosine transform shown in equation (1), and 103 is a quantization matrix selected by the region information 13 from the set of quantization matrices stored in the quantization matrix storage unit 104. , 105 is a quantized coefficient obtained by quantizing the transform coefficient 101 with the quantization matrix 103 by the quantizer 102, 107 is a variable length code obtained by variable length encoding the quantized coefficient 105, and 109 is a variable length Code 10
7 is coded data. Further, 2 is a block extraction unit that extracts an input block 3 that is a rectangular region of pixels from the input image 1, 7 is an area analysis unit that analyzes the waveform and gain of the input block 3, and outputs the result as region information 13. Reference numeral 100 denotes an orthogonal transform unit that performs a discrete cosine transform on the input block 3, 104 denotes a quantization matrix, and the quantization matrix 1 corresponding to the area information 13 is stored.
03 is a quantization matrix storage unit, 102 is a quantization unit that quantizes the transform coefficient 101 using the quantization matrix 103, and 106 is a variable length coding unit that performs variable length coding on the quantized coefficient 105. , 108 is a variable length code 10
7 is a multiplexing unit that multiplexes 7 to form coded data 109.

【００３０】図２は、図１に示す領域分析部７の構成を
説明する図である。FIG. 2 is a diagram for explaining the structure of the area analysis unit 7 shown in FIG.

【００３１】図において、３はブロック抽出部２によっ
て切り出されたｍ×ｎ画素（ｍ，ｎは正整数）の入力ブ
ロック、６は平均値分離部４によって入力ブロック３の
平均値を各画素から減じた平均値分離ブロック、１８は
波形分析部１４によって平均値分離ブロック６の波形を
分析した結果である波形情報、２７は利得分析部２０に
よって平均値分離ブロック６の利得を分析した結果であ
る利得情報、１３は、領域判定部１２において、波形情
報１８と利得情報２７に基づく領域判定の結果である領
域情報である。また、４は入力ブロック３の平均値を計
算し、各画素から平均値を減じて平均値分離ブロック６
を得る平均値分離部、１４は平均値分離ブロック６の波
形を分析して波形情報１８を出力する波形分析部、２０
は平均値分離ブロック６の利得情報を分析して利得情報
２７を出力する利得分析部、１２は波形情報１８と利得
情報２７から入力ブロック３の領域を判定して領域情報
１３を出力する領域判定部である。In the figure, 3 is an input block of m × n pixels (m and n are positive integers) cut out by the block extraction unit 2, and 6 is an average value separation unit 4 which calculates the average value of the input block 3 from each pixel. The subtracted average value separation block, 18 is waveform information that is the result of analyzing the waveform of the average value separation block 6 by the waveform analysis unit 14, and 27 is the result of analyzing the gain of the average value separation block 6 by the gain analysis unit 20. Gain information 13 is area information which is a result of area determination based on the waveform information 18 and the gain information 27 in the area determination unit 12. Also, 4 calculates the average value of the input block 3 and subtracts the average value from each pixel to obtain the average value separation block 6
Is a mean value separation unit, 14 is a waveform analysis unit that analyzes the waveform of the mean value separation block 6 and outputs waveform information 18, 20
Is a gain analysis unit which analyzes the gain information of the average value separation block 6 and outputs the gain information 27, and 12 is a region determination which determines the region of the input block 3 from the waveform information 18 and the gain information 27 and outputs the region information 13. It is a department.

【００３２】図３は、図２に示す波形分析部１４の構成
図である。図において、１６は形状分析部１５によって
選択された代表ベクトルを表す形状インデックス、１８
は波形マッピング・テーブル１７が出力する波形情報で
ある。また、１５は代表的な波形情報を有する代表ベク
トルのセットとｍ×ｎ画素から成るブロックである平均
値分離ブロック６とのパターン・マッチングを行い、最
も近い波形情報を持つ代表ベクトルを選ぶ形状分析部、
１７は形状インデックス１６から波形情報１８を得るた
めの波形マッピング・テーブルである。FIG. 3 is a block diagram of the waveform analyzer 14 shown in FIG. In the figure, 16 is a shape index representing the representative vector selected by the shape analysis unit 15, and 18 is a shape index.
Is the waveform information output from the waveform mapping table 17. Further, 15 is a shape analysis for performing pattern matching between a set of representative vectors having representative waveform information and an average value separation block 6 which is a block consisting of m × n pixels, and selecting a representative vector having the closest waveform information. Department,
Reference numeral 17 is a waveform mapping table for obtaining the waveform information 18 from the shape index 16.

【００３３】図３の形状分析部１５は、例えば図４に示
すように、予め用意された代表的な波形情報を有する代
表ベクトル・セットと、分析対象ブロック（以後分析ブ
ロックと呼ぶ）すなわち平均値分離ブロック６とのパタ
ーン・マッチングにより波形情報分析を行う。波形情報
分析により、分析ブロックの階調変化の方向として形状
インデックス１６が得られる。As shown in FIG. 4, the shape analysis unit 15 in FIG. 3 sets a representative vector set having typical waveform information prepared in advance and an analysis target block (hereinafter referred to as an analysis block), that is, an average value. Waveform information analysis is performed by pattern matching with the separation block 6. By the waveform information analysis, the shape index 16 is obtained as the direction of the gradation change of the analysis block.

【００３４】ｍ×ｎ画素の分析ブロックをｘ＝｛ｘ_i｜
ｉ＝１，２，．．．，ｍ×ｎ｝、ｋ個の代表ベクトルか
らなる代表ベクトル・セットをＹ＝｛ｙ_i｜ｉ＝１，
２，．．．，ｋ｝とすると、パタン・マッチングは以下
の式で定義できる。The analysis block of m × n pixels is x = {x _i │
i = 1, 2 ,. ．． , M × n}, a representative vector set consisting of k representative vectors is Y = {y _i | i = 1,
2 ,. ．． , K}, the pattern matching can be defined by the following equation.

【００３５】ｄ（ｘ，ｙ_p）＝ｍｉｎ｛ｄ（ｘ，ｙ_i）｝（全ての
ｉに対して）（ｉ＝１，２，．．．．，ｋ）ここで、ｄ（ｘ，ｙ_i）はｘとｙ_iとの歪み測度であ
り、２乗歪み等で定義される。ｐは代表ベクトルのイン
デックスすなわち形状インデックス１６であり、ｐの表
す代表ベクトルｙ_pが、分析ブロックに最も近い波形情
報を持つ代表ベクトルとして選択されたことを示してい
る。D (x, y _p ) = min {d (x, y _i )} (for all i) (i = 1, 2, ..., k) where d (x, y _i ) is a distortion measure between x and y _i, and is defined by square distortion or the like. p is the index of the representative vector, that is, the shape index 16, and indicates that the representative vector y _p represented by _p is selected as the representative vector having the waveform information closest to the analysis block.

【００３６】以下、波形分析に関する動作について説明
する。The operation relating to the waveform analysis will be described below.

【００３７】代表ベクトルのセットは、水平、垂直、そ
の他の方向をもつ階調変化に対して主成分分析を行うこ
とによって設計する。代表ベクトルのセットを記憶する
メモリの削減のために、パターンマッチングは部分ブロ
ックに分割して行われる。例えば、入力ブロック３が８
×８のサイズであれば、４×４画素の４つの部分ブロッ
クごとにパターンマッチングが行われる。部分ブロック
ごとに得られた４つの形状インデックスは、８×８画素
の入力ブロック中の４×４画素ブロックの２次元の波形
の特徴を表している。これらの４つのインデックスは、
波形マッピングテーブル１７において８×８画素ブロッ
ク全体の波形を示す情報にマッピングされ、波形情報１
８として出力される。この時、４つの部分ブロックの波
形の方向のばらつき（複雑度）が考慮される。例えば、
４つの部分ブロックの波形の方向がすべて一致すれば複
雑度は低く、４つの部分ブロックの波形の方向がすべて
異なる場合には複雑度は高いものとする。The set of representative vectors is designed by performing a principal component analysis on gradation changes having horizontal, vertical, and other directions. In order to reduce the memory for storing the set of representative vectors, pattern matching is performed by dividing it into partial blocks. For example, the input block 3 is 8
If the size is × 8, pattern matching is performed for every four partial blocks of 4 × 4 pixels. The four shape indexes obtained for each partial block represent the characteristics of the two-dimensional waveform of the 4 × 4 pixel block in the input block of 8 × 8 pixels. These four indexes are
Waveform information 1 is mapped in the waveform mapping table 17 to information indicating the waveform of the entire 8 × 8 pixel block.
It is output as 8. At this time, variations (complexity) in the waveform directions of the four partial blocks are taken into consideration. For example,
The complexity is low if the directions of the waveforms of the four partial blocks are all the same, and the complexity is high if the directions of the waveforms of the four partial blocks are all different.

【００３８】図５は、図２に示す利得分析部２０の構成
図である。図において２３は分散算出器２２が出力する
分散値、２５はヒストグラム計数器２４が出力するヒス
トグラム情報、２７は利得情報、３１は平均値分離ブロ
ック６内の最大値と最小値の比率であるダイナミックレ
ンジ比である。２２は平均値分離ブロック６のｍ×ｎ画
素の値の分散値を算出する分散算出器、２４は平均値分
離ブロック６のｍ×ｎ画素の値の頻度分布を計数するヒ
ストグラム計数器、３０は平均値分離ブロック６内の最
大値と最小値を検出し、最大値と最小値の比率を算出す
る最大最小検出器、２６は分散値２３、ヒストグラム情
報２５、およびダイナミックレンジ比３１から利得情報
２７を得るための利得マッピング・テーブルである。FIG. 5 is a block diagram of the gain analyzer 20 shown in FIG. In the figure, 23 is a variance value output from the variance calculator 22, 25 is histogram information output from the histogram counter 24, 27 is gain information, and 31 is a dynamic ratio which is a ratio between the maximum value and the minimum value in the average value separation block 6. It is a range ratio. 22 is a variance calculator that calculates the variance of the m × n pixel values of the average value separation block 6, 24 is a histogram counter that counts the frequency distribution of the m × n pixel values of the average value separation block 6, and 30 is A maximum / minimum detector that detects the maximum value and the minimum value in the average value separation block 6 and calculates the ratio of the maximum value and the minimum value, 26 is a variance value 23, histogram information 25, and dynamic range ratio 31 to gain information 27. Is a gain mapping table for obtaining

【００３９】以下、利得分析に関する動作について説明
する。The operation relating to the gain analysis will be described below.

【００４０】図５の利得分析部２０は、平均値分離ブロ
ック６の振幅、画素値の頻度分布、最大値と最小値の比
率を分析し、その結果から入力ブロック３が、文字部の
ブロックであるか写真領域のブロックであるかを判定す
る。利得情報分析は、平均値分離ブロック６を構成する
ｍ×ｎ画素の値の分散値σ²の計算、ヒストグラムの計
数、最大値と最小値の比率によって行われる。The gain analysis unit 20 of FIG. 5 analyzes the amplitude of the average value separation block 6, the frequency distribution of pixel values, and the ratio of the maximum value and the minimum value. From the results, the input block 3 is a block of the character part. It is determined whether there is a block in the photograph area. The gain information analysis is performed by calculating the variance value σ ² of the values of m × n pixels forming the average value separation block 6, counting the histogram, and the ratio between the maximum value and the minimum value.

【００４１】図５の分散算出器２２は、平均値分離ブロ
ック６を構成するｍ×ｎ画素の値の分散値σ²すなわち
分散値２３を算出する。平均値を分離したｍ×ｎ画素の
分散値は次式で定義される。The variance calculator 22 of FIG. 5 calculates the variance value σ ² of the values of m × n pixels forming the average value separation block 6, that is, the variance value 23. The variance value of m × n pixels obtained by separating the average value is defined by the following equation.

【００４２】[0042]

【数４】あるいは、[Equation 4] Alternatively,

【数５】ここでは、以後、分散値σを用いて説明する。[Equation 5] Here, the description will be given below using the variance value σ.

【００４３】図５のヒストグラム計数器２４は、図６に
示すように、分散値σにより平均値分離ブロック６を閾
値処理して頻度を計数する。すなわち、閾値を±σ／ａ
に設定し、−σ／ａ未満、−σ／ａ以上かつσ／ａ以
下、σ／ａより大きい範囲の３か所で頻度を計数する。
ここでａは０でない正の実数であり、本実施例では、例
えばａ＝３とする。３か所で計数した頻度値をそれぞれ
Ｈ_-1、Ｈ₀、Ｈ₁とする。図６に示すように、Ｈ_-1、Ｈ
₀、Ｈ₁から、ヒストグラムが単峰分布（同図（ａ）参
照）かあるいは双峰分布（同図（ｂ）参照）かを判断し
結果をヒストグラム情報２５として得る。例えば、Ｈ_-1
≦Ｈ₀かつＨ₀≧Ｈ₁かつの場合に単峰分布であり、そ
の他の場合に双峰分布であると判断する。As shown in FIG. 6, the histogram counter 24 of FIG. 5 thresholds the average value separation block 6 with the variance value σ to count the frequency. That is, the threshold is ± σ / a
The frequency is counted at three points in the range of less than −σ / a, greater than −σ / a and less than or equal to σ / a, and greater than σ / a.
Here, a is a positive real number that is not 0, and in this embodiment, for example, a = 3. Frequency values counted in three places each H _-1, and H _0, H _1. As shown in FIG. 6, H _-1 , H
_{From 0} and H ₁ , it is determined whether the histogram is a unimodal distribution (see FIG. 11A) or a bimodal distribution (see FIG. 11B), and the result is obtained as histogram information 25. For example, H _-1
If ≦ H ₀ and H ₀ ≧ H ₁ , it is determined that the distribution has a single peak, and in other cases, the distribution has a double peak.

【００４４】一般的に文字領域の場合には、文字色と背
景色に相当する位置にヒストグラムのピークが現れるこ
とから、双峰分布の場合は文字領域と判定することがで
きる。Generally, in the case of a character area, peaks of the histogram appear at the positions corresponding to the character color and the background color, and therefore, in the case of the bimodal distribution, it can be determined as a character area.

【００４５】最大最小検出器３０では、最大値と最小値
の比率であるダイナミックレンジ比ｒが次式に基づいて
計算される。In the maximum / minimum detector 30, the dynamic range ratio r, which is the ratio of the maximum value and the minimum value, is calculated based on the following equation.

【００４６】ｒ＝ｍａｘ｛ｘ_ij｝／ｍｉｎ｛ｘ_ij｝，（ｉ＝１，・・
・，ｍ，ｊ＝１，・・・，ｎ）・・・（１１）文字領域において、ブロック境界に文字等のエッジの一
部がかかっている場合には、背景色の画素数と文字色の
画素数が著しく異なるため、分散、ヒストグラムによっ
て判定を誤る場合がある。ダイナミックレンジ比ｒを導
入することにより、ｒが大きい場合には、ブロック境界
に文字等のエッジの一部がかかっていると判定される。
これにより写真領域との判定誤りを回避できる。R = max {x _ij } / min {x _ij }, (i = 1, ...
., M, j = 1, ..., N) (11) In the character area, when part of the edge of the character or the like is applied to the block boundary, the number of pixels of the background color and the character color Since the number of pixels of is significantly different, the determination may be erroneous depending on the variance and the histogram. By introducing the dynamic range ratio r, when r is large, it is determined that a part of an edge such as a character overlaps the block boundary.
As a result, it is possible to avoid an erroneous determination as to the photograph area.

【００４７】図５の利得マッピング・テーブル２６は、
分散値２３、ヒストグラム情報２５、およびダイナミッ
クレンジ比３１から、文字領域、写真領域の識別結果で
ある利得情報２７を得る。The gain mapping table 26 of FIG.
From the variance value 23, the histogram information 25, and the dynamic range ratio 31, the gain information 27, which is the identification result of the character area and the photograph area, is obtained.

【００４８】図７に利得マッピング・テーブル２６にお
ける領域判定の木構造の例を示す。それぞれの節におい
て、ヒストグラム情報２５、分散値２３、およびダイナ
ミックレンジ比３１に対して閾値処理を行い、分岐の判
定を行う。すなわち、ヒストグラムが双峰の分布を示す
場合には文字と判定し、ヒストグラムが単峰の分布でブ
ロック内分散が大と判定された場合には、写真領域と判
定する。ヒストグラムが単峰の分布であり、ブロック内
の分散が小と判定された場合には、更にダイナミックレ
ンジの比の大小により、文字と写真領域とを区別する。
判定基準となる各節での閾値は、入力される画像の特性
に対して設定する。なお、ここでは、ヒストグラム情報
２５、分散値２３、およびダイナミックレンジ比３１の
それぞれを単一の閾値で判定する場合について説明した
が、閾値の数はこれに限るものではない。したがって、
各節での分岐の数も２に限定されるものではなく、さら
に多くの分岐を持つ木構造を構成することも可能であ
る。FIG. 7 shows an example of a tree structure for area determination in the gain mapping table 26. In each section, threshold processing is performed on the histogram information 25, the variance value 23, and the dynamic range ratio 31, and branch determination is performed. That is, when the histogram shows a bimodal distribution, it is determined to be a character, and when it is determined that the histogram has a unimodal distribution and the intra-block variance is large, it is determined to be a photographic region. When the histogram has a unimodal distribution and it is determined that the variance within the block is small, the character and the photo area are further distinguished by the ratio of the dynamic range.
The threshold value in each section, which is the criterion, is set for the characteristics of the input image. Note that, here, the case where each of the histogram information 25, the variance value 23, and the dynamic range ratio 31 is determined by a single threshold value has been described, but the number of threshold values is not limited to this. Therefore,
The number of branches in each clause is not limited to two, and it is possible to construct a tree structure having more branches.

【００４９】利得分析部２０で得られた利得情報２７
は、先に説明した波形分析部１４からの波形情報１８と
ともに図２の領域判定部１２に供給される。Gain information 27 obtained by the gain analysis unit 20
Is supplied to the area determination unit 12 of FIG. 2 together with the waveform information 18 from the waveform analysis unit 14 described above.

【００５０】領域判定部１２では、上述した波形分析と
利得分析の結果である波形情報１８と利得情報２７から
領域情報１３を決定する。The area determination unit 12 determines the area information 13 from the waveform information 18 and the gain information 27 which are the results of the above-mentioned waveform analysis and gain analysis.

【００５１】領域情報１３は、文字領域、写真領域の区
別を表す情報と、それぞれの領域での階調変化の方向を
示す情報から構成される。The area information 13 is composed of information indicating the distinction between the character area and the photograph area, and information indicating the direction of gradation change in each area.

【００５２】次に、領域情報１３に基づく適応符号化に
ついて説明する。Next, the adaptive coding based on the area information 13 will be described.

【００５３】図１に示す実施例においては、上述した手
順によって得られた領域情報１３によって量子化マトリ
クス１３が切り替えられる。これは、量子化マトリクス
格納部１０４において行われる。In the embodiment shown in FIG. 1, the quantization matrix 13 is switched by the area information 13 obtained by the procedure described above. This is performed in the quantization matrix storage unit 104.

【００５４】量子化マトリクス格納部１０４に格納され
る量子化マトリクスは、事前に作成されている必要があ
る。これは例えば、従来例で説明した「画像処理ハンド
ブック」に開示されている手法を、分離された領域ごと
に適用すればよい。The quantization matrix stored in the quantization matrix storage unit 104 needs to be created in advance. For this, for example, the method disclosed in the "Image processing handbook" described in the conventional example may be applied to each separated area.

【００５５】図８には、実際の画像に対して振幅方向の
特徴に基づいて写真領域と文字領域の分離を行い、文字
領域については階調変化の方向でさらに垂直、水平、斜
め、その他の４通りに分離する例を示す。図９は、図８
のように写真領域と４通りの文字領域に分割されたそれ
ぞれの領域に対して設計した５種類の量子化マトリクス
の例である。図９（ａ）が写真領域に対応し、同図
（ｂ），（ｃ），（ｄ），（ｅ）がそれぞれ垂直、水
平、斜め、その他の各方向の階調変化を有する文字領域
に対応している。ここでは量子化マトリクス１３をブロ
ックあたりに配分されるビット数の総計が等しくなるよ
うに設計した。また、直流に対する量子化ステップ値は
いずれも等しく設定している。In FIG. 8, the photographic area and the character area are separated from each other on the basis of the characteristics in the amplitude direction with respect to the actual image, and the character area is further vertical, horizontal, diagonal, and other in the direction of gradation change. An example of separating into four ways will be shown. FIG. 9 corresponds to FIG.
5 is an example of the quantization matrix of 5 types designed for each area divided into the photograph area and the four character areas as described above. FIG. 9A corresponds to a photograph area, and FIGS. 9B, 9C, 9D, and 9E show character areas having gradation changes in vertical, horizontal, diagonal, and other directions, respectively. It corresponds. Here, the quantization matrix 13 is designed so that the total number of bits distributed per block becomes equal. Further, the quantization step values for direct current are set equal to each other.

【００５６】このように、予め領域に対応する量子化マ
トリクス（量子化特性）を決定しておくことにより、実
際に符号化するときに、入力ブロックの領域判定結果に
応じて量子化特性を切り替える適応符号化方式が実現で
きる。In this way, by determining the quantization matrix (quantization characteristic) corresponding to the area in advance, the quantization characteristic is switched according to the area determination result of the input block when actually encoding. An adaptive coding method can be realized.

【００５７】なお、量子化後の可変長符号化の手順につ
いては従来技術と同様であるので省略する。The procedure of variable length coding after quantization is the same as that of the prior art, and will be omitted.

【００５８】復号側においては、この適応化の情報が必
要であるため、図１の多重化部１０８において、領域情
報１３は符号データ１０９に多重化される。８通りの適
応化の場合、識別のために１ブロックあたり３ビットを
追加すればよい。Since information on this adaptation is required on the decoding side, the area information 13 is multiplexed on the coded data 109 in the multiplexing section 108 in FIG. In the case of eight types of adaptation, 3 bits may be added per block for identification.

【００５９】以上、実施例においては、領域分析の結果
に基づいて量子化マトリクスを変更する手法について説
明してきたが、他の適応化についても可能である。In the above, in the embodiment, the method of changing the quantization matrix based on the result of the area analysis has been described, but other adaptations are possible.

【００６０】例えば、従来例では図１２に示すジグザグ
スキャンのみによって、係数列の一次元化を行っていた
が、分離された領域ごとにスキャンの順序を設定するこ
とも可能である。とくに階調変化の方向によって領域分
離された場合には、電力の集中する係数が異なるので、
電力の集中する係数を優先的にスキャンする経路を与え
ることが符号化効率の点で望ましい。For example, in the conventional example, the coefficient string is made one-dimensional by only the zigzag scan shown in FIG. 12, but it is also possible to set the scan order for each separated area. Especially when the regions are separated according to the direction of gradation change, the power concentration coefficient is different,
It is desirable from the viewpoint of coding efficiency to provide a path for preferentially scanning the power concentration coefficient.

【００６１】また、可変長符号化に用いる符号表につい
ても、分離された領域ごとに設定することが可能であ
る。The code table used for variable length coding can also be set for each separated area.

【００６２】復号側においては、スキャン順序、可変長
符号表とも、符号データ中の領域情報に基づいて切り替
えることができる。On the decoding side, both the scan order and the variable length code table can be switched based on the area information in the code data.

【００６３】[0063]

【発明の効果】以上、本発明においては、符号化すべき
画像信号に対して領域分析を行い、領域分析の結果に対
応する量子化マトリクスを符号化に用いるようにしてい
る。これにより、写真画像の一部に文字が存在するよう
な場合でも、文字に対しては文字の特性に適した量子化
特性を用いて符号化が行われるので、文字のエッジ部の
画質を大幅に改善することができる。As described above, in the present invention, the area analysis is performed on the image signal to be encoded, and the quantization matrix corresponding to the result of the area analysis is used for the encoding. As a result, even when characters are present in a part of the photographic image, the characters are encoded using the quantization characteristics suitable for the characteristics of the characters. Can be improved.

[Brief description of drawings]

【図１】本発明の実施例の構成図である。FIG. 1 is a configuration diagram of an embodiment of the present invention.

【図２】領域分析部の構成図である。FIG. 2 is a configuration diagram of a region analysis unit.

【図３】波形分析部の構成図である。FIG. 3 is a configuration diagram of a waveform analysis unit.

【図４】波形情報分析の説明図である。FIG. 4 is an explanatory diagram of waveform information analysis.

【図５】利得分析部の構成図である。FIG. 5 is a configuration diagram of a gain analysis unit.

【図６】利得情報分析の説明図である。FIG. 6 is an explanatory diagram of gain information analysis.

【図７】領域分離の判定の木構造を示す説明図であ
る。FIG. 7 is an explanatory diagram showing a tree structure for determining a region separation.

【図８】領域分割を説明する図である。FIG. 8 is a diagram illustrating area division.

【図９】領域別に設計した量子化マトリクスを示す図
である。FIG. 9 is a diagram showing a quantization matrix designed for each region.

【図１０】従来例の構成図である。FIG. 10 is a configuration diagram of a conventional example.

【図１１】ブロック抽出の説明図である。FIG. 11 is an explanatory diagram of block extraction.

【図１２】ジグザグスキャンを説明する図である。FIG. 12 is a diagram illustrating zigzag scanning.

【図１３】量子化マトリクスの例を示す図である。FIG. 13 is a diagram showing an example of a quantization matrix.

【図１４】水平方向に階調変化を有する画素ブロック
の画素の分布と係数電力分布の対応を示す図である。FIG. 14 is a diagram showing the correspondence between the pixel power distribution and the coefficient power distribution of a pixel block having a gradation change in the horizontal direction.

【図１５】垂直方向に階調変化を有する画素ブロック
の画素の分布と係数電力分布の対応を示す図である。FIG. 15 is a diagram showing a correspondence between a pixel power distribution and a coefficient power distribution of a pixel block having a gradation change in the vertical direction.

【図１６】斜め方向に階調変化を有する画素の分布と
係数電力分布の対応を示す図である。FIG. 16 is a diagram showing a correspondence between a pixel distribution having a gradation change in an oblique direction and a coefficient power distribution.

[Explanation of symbols]

１…入力画像、２…ブロック抽出部、３…入力ブロッ
ク、４…平均値分離部、６…平均値分離ブロック、７…
領域分析部、１２…領域判定部、１３…領域情報、１４
…波形分析部、１５…形状分析部、１６…形状インデッ
クス、１７…波形マッピング・テーブル、１８…波形情
報、２０…利得分析部、２２…分散算出器、２３…分散
値、２４…ヒストグラム計数器、２５…ヒストグラム情
報、２６…利得マッピング・テーブル、２７…利得情
報、３０…最大最小検出器、３１…ダイナミックレンジ
比、１００…直交変換部、１０１…変換係数、１０２…
量子化部、１０３…量子化マトリクス、１０４…量子化
マトリクス格納部、１０５…量子化係数、１０６…可変
長符号化部、１０７…可変長符号、１０８…多重化部、
１０９…符号データDESCRIPTION OF SYMBOLS 1 ... Input image, 2 ... Block extraction part, 3 ... Input block, 4 ... Average value separation part, 6 ... Average value separation block, 7 ...
Area analysis unit, 12 ... Area determination unit, 13 ... Area information, 14
... Waveform analysis unit, 15 ... Shape analysis unit, 16 ... Shape index, 17 ... Waveform mapping table, 18 ... Waveform information, 20 ... Gain analysis unit, 22 ... Variance calculator, 23 ... Variance value, 24 ... Histogram counter , 25 ... Histogram information, 26 ... Gain mapping table, 27 ... Gain information, 30 ... Maximum / minimum detector, 31 ... Dynamic range ratio, 100 ... Orthogonal transform unit, 101 ... Transform coefficient, 102 ...
Quantization unit, 103 ... Quantization matrix, 104 ... Quantization matrix storage unit, 105 ... Quantization coefficient, 106 ... Variable length coding unit, 107 ... Variable length code, 108 ... Multiplexing unit,
109 ... Code data

───────────────────────────────────────────────────── フロントページの続き (72)発明者木村俊一神奈川県海老名市本郷2274番地富士ゼロックス株式会社内 (72)発明者上澤功神奈川県海老名市本郷2274番地富士ゼロックス株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Shunichi Kimura 2274 Hongo, Ebina City, Kanagawa Prefecture, Fuji Xerox Co., Ltd.

Claims

[Claims]

1. A block extracting means for dividing an image signal into an input block of m × n pixels (m and n are positive integers) composed of a plurality of pixels, and orthogonal for obtaining a transform coefficient by performing orthogonal transform on the input block. Transforming means, quantizing characteristic storing means for storing a quantizing characteristic, quantizing means for quantizing the transform coefficient with the characteristic stored in the quantizing characteristic storing means to obtain a quantizing coefficient, and the quantizing coefficient In a video signal coding apparatus having a coding means for variable-length coding and a multiplexing means for multiplexing the results of variable-length coding to obtain coded data, a waveform analysis means for analyzing the waveform of the input block. An image signal coding apparatus, comprising: and switching the quantization characteristic of the quantization characteristic storage means based on a waveform analysis result.

2. The characteristic of the direction of gradation change of the mean value separation block, wherein the waveform analysis means subtracts the mean value from each pixel in the input block, and the mean value separation block obtained by the mean value separation means. A first analyzing means for analyzing a quantity, a second analyzing means for analyzing a feature quantity in the amplitude direction of the average value separation block, an analysis result of the first analyzing means and an analysis result of the second analyzing means The image signal coding apparatus according to claim 1, further comprising: a determining unit that determines a characteristic of a waveform of the input block based on the above.

3. The first analysis means is mx obtained in advance.
n pixels (m and n are positive integers) or their positive integer ratio j
The degree of approximation between each of a plurality of sets of blocks having a representative shape composed of pixels divided by (j is a positive integer) and the average value separation block is obtained, and the index of the representative shape block having the highest degree of approximation or j The index set of the block of the representative shape having the highest degree of approximation for each of the divided blocks is output as the first feature amount in the tone change direction of the input block, and at least the index or the obtained j The ratio in which the index sets match each other is used as a parameter indicating the complexity of the shape of the average value separation block, and this complexity is determined as the second value in the gradation change direction.
The image signal encoding device according to claim 2, wherein the image signal encoding device outputs the image signal as the feature amount.

4. The second analysis means sets a root mean square value of each pixel value in the mean value separation block or a value obtained by averaging absolute values as a variance value of the input block, and the variance value is 1 The result of comparison with a threshold value of at least one kind is output as the first feature amount in the amplitude direction, the cumulative frequency distribution of each pixel value in the average value separation block is obtained, and the shape of this cumulative frequency distribution is preset. After correcting one or a plurality of normalized distributions corresponding to the variance value, the results are compared,
The image signal encoding device according to claim 2, wherein the index of the distribution that is the same or closest is output as the second feature amount in the amplitude direction.

5. The second analyzing means detects a maximum value and a minimum value in the average value separation block, calculates a ratio between the maximum value and the minimum value, and compares the ratio with one or more threshold values. The image signal encoding device according to claim 2, wherein the result is output as a third characteristic amount in the amplitude direction.

6. The quantization characteristic stored in the quantization characteristic storage means divides an image into a plurality of partial images according to the analysis result by the waveform analysis means, and performs orthogonal transformation on each partial image. The image signal coding apparatus according to claim 1, wherein the image signal coding apparatus is determined based on a variance or a standard deviation of transform coefficients obtained by applying the transform coefficient.