JPH11225330A

JPH11225330A - Digital image compressing method

Info

Publication number: JPH11225330A
Application number: JP31185098A
Authority: JP
Inventors: S Taubman David; ディビッド・エス・タウブマン
Original assignee: Hewlett Packard Co
Current assignee: HP Inc
Priority date: 1997-11-06
Filing date: 1998-11-02
Publication date: 1999-08-17

Abstract

PROBLEM TO BE SOLVED: To efficiently compress/expand a synthetic digital image while integrating lossless and loss compression by optimizing subband decomposition, continuous approximation, encoding and a bit stream operation. SOLUTION: A compressor 110 compresses an uncompressed image 105, produces a bit stream 115 in a scalable way and produces an expanded image 135 from the bit stream that applies a prescribed number scaling operation 120 by an expander 130. The bit stream operation selects the number of quantization layers which should be included in each subband. Continuous approximation and an encoding element produce plural layers of quantized subband symbols which are encoded by using context adaptive arithmetic coding, decide an encoding context based on an adjacent sample of a sample within the range of the same subband together with a quantizer that should be used for an optional specific subband sample and design in such a manner that information is concentrated on the neighborhood of an area where activity is high. The subband and the quantization layer are encoded at the same time.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、デジタル画像処理
に関するもので、特にデジタル画像の圧縮および伸張に
関するものである。The present invention relates to digital image processing, and more particularly to compression and decompression of digital images.

【０００２】[0002]

【従来の技術】自然画像を圧縮する現在の標準はＪＰＥ
Ｇである。実際には、この標準は、いわゆるベースライ
ン方式だけが一般的に実施されている種々の関連圧縮方
式をカバーする。ベースラインＪＰＥＧは、最新技術の
圧縮性能を達成してはいないが、量子化およびハフマン
符号化テーブルの設計において顕著な柔軟性を与えてい
る。計算およびメモリ資源に関して比較的低コストを実
現しつつ高ビット率の圧縮効率および媒体を取得するた
め、このハフマン符号化を使用すことができる。事実、
自然画像圧縮効率が唯一の考慮点であるとすれば、標準
ベースラインＪＰＥＧに代わるものとして、いかなる一
層先進的圧縮方式をも使用する可能性は少ない。しかし
ながら、最近になって、一層広範囲の機能を必要とする
アプリケーションを提供することができるような単一の
圧縮方式に対する必要性が出てきた。そのような機能の
例には以下が含まれる。2. Description of the Related Art The current standard for compressing natural images is JPE.
G. In practice, this standard covers various related compression schemes in which only the so-called baseline scheme is commonly implemented. Baseline JPEG has not achieved state-of-the-art compression performance, but provides significant flexibility in the design of quantization and Huffman coding tables. This Huffman coding can be used to obtain a high bit rate compression efficiency and medium with relatively low cost in terms of computation and memory resources. fact,
Given that natural image compression efficiency is the only consideration, it is unlikely to use any more advanced compression scheme as an alternative to standard baseline JPEG. However, recently there has been a need for a single compression scheme that can provide applications that require a wider range of functions. Examples of such functions include:

【０００３】1)各々のケース毎に完全に異なる圧縮方式
に頼ることなく自然および人工の画像を効率的に圧縮す
る能力。人工画像ソースはグラフィックスおよびテキス
トを含む。これら画像ソースはスキャナの広範囲の普及
に伴って特に重要になった。スキャナは、走査したデー
タを効率的に保存し送出できるような圧縮を必要とする
が、写真画像から単調な文書まですべてを無差別に走査
するように使用されることがある。単一の圧縮方式が自
然および人工両方の画像ソース領域から成る合成画像に
対してうまく機能することができることが理想である。1) The ability to efficiently compress natural and artificial images without resorting to completely different compression schemes in each case. Artificial image sources include graphics and text. These image sources have become particularly important with the widespread use of scanners. Scanners require compression so that scanned data can be stored and transmitted efficiently, but are sometimes used to indiscriminately scan everything from photographic images to monotonous documents. Ideally, a single compression scheme can work well for composite images consisting of both natural and artificial image source regions.

【０００４】2)画像材料の複数解像度階層への自然な構
成能力。この場合、容易に識別可能なコンポーネントは
圧縮されたビットストリーム内で比較的低い解像度特性
を表現し、一方それ以外のコンポーネントは比較的高い
解像度特性を表現する。このような特性を本明細書は解
像度スケーラビリティと呼ぶ。この能力は、各々が異な
る解像度制約を持つ種々の異なるクライアントに単一の
圧縮画像を分配することを必要とするアプリケーション
にとって特に重要である。多くのクライアントがそれを
表現することができず単にネットワークを経由してそれ
を取りだすことができるにすぎないほどオリジナル画像
が非常に大きい場合、この能力は特に関心が高い。2) The ability to naturally construct image materials into multiple resolution hierarchies. In this case, easily identifiable components will exhibit relatively low resolution characteristics in the compressed bitstream, while other components will exhibit relatively high resolution characteristics. Such a property is referred to herein as resolution scalability. This capability is particularly important for applications that require distributing a single compressed image to a variety of different clients, each with different resolution constraints. This capability is of particular interest if the original image is so large that many clients cannot represent it and can simply retrieve it over the network.

【０００５】3)逐次高い再現性に対応する複数コンポー
ネントへ圧縮ビットストリームを自然に構成する能力。
このような能力を本明細書はビット率スケーラビリティ
と呼ぶ。解像度スケーラビリティの場合と同様に、単一
の圧縮ビットストリームを使用して種々のユーザの要求
を異なるバンド幅制約で満たさなければならないアプリ
ケーションにとってこの能力は重要である。この能力
は、限られたバンド幅のネットワーク接続を経由して次
第に多くの圧縮ビットストリームが受け取られるにつれ
画像再現性が逐次向上するような圧縮画像データの累進
的伝送(すなわちプログレッシブ伝送)の場合特に役に立
つ。3) Ability to naturally compose a compressed bit stream into a plurality of components corresponding to sequentially high reproducibility.
This capability is referred to herein as bit rate scalability. As in the case of resolution scalability, this ability is important for applications where a single compressed bitstream must be used to meet the needs of different users with different bandwidth constraints. This capability is especially useful for progressive transmission of compressed image data (i.e., progressive transmission) where image reproducibility is progressively improved as more and more compressed bit streams are received over a limited bandwidth network connection. Useful.

【０００６】4)損失圧縮と無損失圧縮の統合。それぞれ
の理由から真の無損失圧縮を必要とする多数のアプリケ
ーションが存在する。そのようなアプリケーションの１
つは医療画像である。この場合、無損失圧縮に対する動
機は、医学的重要性は別として、主として、データ損失
に反対する法令上の示唆である。そのようなアプリケー
ションは、損失および無損失両圧縮を実施する単一の圧
縮方式あるいは単一のスケーラブル・ビット率ストリー
ムの存在によって利益を得るであろうことは明らかであ
る。4) Integration of lossy and lossless compression. There are numerous applications that require true lossless compression for each reason. One of such applications
One is a medical image. In this case, the motivation for lossless compression, apart from its medical significance, is primarily statutory suggestions against data loss. Obviously, such an application would benefit from the presence of a single compression scheme or a single scalable bit rate stream that implements both lossy and lossless compression.

【０００７】[0007]

【発明が解決しようとする課題】上述のような諸機能が
ソフトウェアおよびＡＳＩＣを含む組み込み実施形態に
適する単一の統合された圧縮方式の範囲内でサポートさ
れるべきことは重要な点である。さもなければ、そのよ
うな機能が、大きなインパクトを持つことができるよう
に広範囲にあるいは完全に活用される可能性は少ない。
結果として、計算処理の複雑性、メモリ使用およびメモ
リ・バンド幅要件のような課題が重要な関心事となる。It is important that features such as those described above be supported within a single, unified compression scheme suitable for embedded embodiments, including software and ASICs. Otherwise, such features are unlikely to be extensively or fully exploited so that they can have a significant impact.
As a result, issues such as computational complexity, memory usage and memory bandwidth requirements are of significant interest.

【０００８】このように、合成画像、解像度およびビッ
ト率スケーラビリティの限界および損失と無損失圧縮の
統合化の無能力は、デジタル画像圧縮伸張装置に制約を
課し、多くのアプリケーションにおけるこれら装置の使
用を妨げる。従って、解像度ならびにビット率スケーラ
ビリティを提供し、無損失ならびに損失圧縮を統合しな
がら、合成デジタル画像を効率的に圧縮／伸張すること
ができる統合デジタル画像圧縮および伸張の技術に対す
る未解決の必要性が存在する。[0008] Thus, the limitations of synthetic images, resolution and bit rate scalability and the inability to integrate lossy and lossless compression impose limitations on digital image compression and decompression devices and the use of these devices in many applications. Hinder. Thus, there remains an unmet need for integrated digital image compression and decompression techniques that provide resolution and bit rate scalability, and that can efficiently compress / decompress composite digital images while integrating lossless and lossy compression. Exists.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するた
め、解像度ならびにビット率スケーラビリティを提供
し、無損失ならびに損失圧縮を統合しながら、合成デジ
タル画像を効率的に圧縮／伸張することができる方法お
よび装置が提供される。圧縮方法は、サブバンド分解、
連続近似ならびに符号化、およびビットストリーム操作
という３つの主要な要素を含む。ビットストリーム操作
は、特定のビット率または品質目標を達成するため各サ
ブバンドに関して含まれるべき量子化層の数を選択する
ことを含む。すべての３つのエレメントは、中間記憶バ
ッファ必要容量を最小にするため同時に動作することが
できる。SUMMARY OF THE INVENTION In order to solve the above problems, a method for providing resolution and bit rate scalability and efficiently compressing / expanding a composite digital image while integrating lossless and lossy compression. And an apparatus are provided. The compression method is subband decomposition,
Includes three main components: continuous approximation and coding, and bitstream manipulation. Bitstream operations involve selecting the number of quantization layers to be included for each subband to achieve a particular bit rate or quality goal. All three elements can operate simultaneously to minimize intermediate storage buffer requirements.

【００１０】連続近似ならびに符号化エレメントは、Ｃ
ＡＳＡＣ処理手順の中核である(ＣＡＳＡＣはContext A
daptive Successive Approximation and Codingの簡略
表記であって、文脈適応型連続近似並びに符号化を意味
する)。これは、組み込みデッドゾーン量子化器セット
を含む。文脈適応型算術符号化を使用して符号化される
べき量子化されたサブバンド・シンボルの層(複数)を生
成する。任意の特定のサブバンド標本に関して使用され
るべき量子化器とともに、符号化文脈は同じサブバンド
の範囲内にある当該標本の空間的にすぐ隣にある標本に
基づいて決定される。適応方式は、単純であり、活動の
高い領域の近辺に情報を集中するように設計される。算
術符号化は、さもなければ計算処理の複雑性に大きな影
響を持つので、ランレングス符号化のバリエーションを
使用して、圧縮性能を犠牲にすることなく、符号化の必
要なシンボルの総数を減少させることができる。The continuous approximation as well as the coding element is C
The core of the ASAC processing procedure (CASAC is Context A
Shorthand for daptive Successive Approximation and Coding, meaning context-adaptive continuous approximation and coding). This includes a built-in dead zone quantizer set. Generate the layer (s) of quantized subband symbols to be encoded using context-adaptive arithmetic coding. With the quantizer to be used for any particular subband sample, the encoding context is determined based on the samples that are spatially immediately adjacent to that sample within the same subband. Adaptive schemes are simple and are designed to focus information near high activity areas. Arithmetic coding has a significant impact on the computational complexity otherwise, so use a variation of run-length coding to reduce the total number of symbols that need to be coded without sacrificing compression performance Can be done.

【００１１】高性能なＡＳＩＣの実施にとって特に重要
であるとみなされる点であるが、バッファ要件を最小に
し並列性に対する機会を最大にするため、すべてのサブ
バンドおよびすべての量子化層が同時に符号化または復
号されることが可能となるようにアルゴリズムが設計さ
れる。Although deemed particularly important for high performance ASIC implementations, to minimize buffer requirements and maximize opportunities for parallelism, all subbands and all quantization layers are simultaneously encoded. The algorithm is designed to be able to be encrypted or decoded.

【００１２】すべてのサブバンド標本は、組み込みスカ
ラー・デッドゾーン量子化器を使用して量子化される
が、この場合、デッドゾーンは、量子化ステップ・サイ
ズの２倍の大きさで、ゼロの周囲に集中する。標本は、
連続的に一層精細となる量子化器を適応することによっ
て連続的に近似されるが、この場合、ステップ・サイズ
は各回毎に２で割られる。量子化器はスカラーであるけ
れども、量子化方針を適応することは可能である。なぜ
ならば、組み込み(または層化された)階層の範囲内の特
定の量子化器の選択は量子化されている標本の空間上の
隣接標本(または文脈)に依存するからである。All subband samples are quantized using a built-in scalar dead zone quantizer, where the dead zone is twice the quantization step size and zero. Concentrate around. The specimen is
A continuous approximation is made by adapting successively finer quantizers, where the step size is divided by two each time. Although the quantizer is scalar, it is possible to adapt the quantization strategy. This is because the choice of a particular quantizer within the embedded (or stratified) hierarchy depends on the neighboring samples (or context) in space of the sample being quantized.

【００１３】圧縮および伸張プロセスは、サブバンド統
合および分析が複雑さの点で匹敵し、適応型算術符号化
器のための文脈形成が符号化器および復号器に対して同
一であるので、基本的には対称的である。The compression and decompression processes are fundamental because the subband integration and analysis are comparable in complexity and the context formation for the adaptive arithmetic encoder is the same for the encoder and the decoder. Is symmetrically.

【００１４】プロセスは解像度の点で階層的であると共
に再現性に関して累進的である。所望の解像度を選択
(すなわち不必要なサブバンドを破棄)し、所望のビット
率を選択(すなわち該当サブバンド内から不必要な量子
化層を破棄)するため、圧縮ビットストリームは、圧縮
および伸張のいかなる点においてもフィルタされること
ができる(これは例えばビットストリームに組み込まれ
た識別子に従って特定のエレメントを破棄する単純な動
作である)。The process is both hierarchical in terms of resolution and progressive with respect to reproducibility. Select desired resolution
The compressed bitstream is compressed and decompressed at any point in It can be filtered (this is a simple action of discarding certain elements according to, for example, an identifier embedded in the bitstream).

【００１５】[0015]

【発明の実施の形態】1. アーキテクチャの概要図１は、概念的システム・アーキテクチャを示す。概念
的に述べれば、画像は、画像コンテントの異なる面を保
有する一層小さいコンポーネントへと次第に減少され、
最終的にそのようなコンポーネントはすべて単一の圧縮
ビットストリームへ符号化されパッケージ化される。図
１は、ＣＡＳＡＣ(すなわち文脈適応型連続近似並びに
符号化のことでContext Adaptive Successive Approxim
ation andCodingの簡略表記)に基づく統合スケーラブル
画像圧縮／伸張システム１００の実施形態を示すブロッ
ク図である。要約すれば、初期的に未圧縮の画像１０５
は圧縮器１１０によって圧縮され、スケーラブルな圧縮
されたビットストリーム１１５が生成され、次に、この
圧縮されたビットストリーム１１５に対して、必要な回
数スケーリング動作１２０が適用される。その後スケー
リングされたれビットストリームに伸張器１３０を適用
して、伸張された画像１３５が作成される。DETAILED DESCRIPTION OF THE INVENTION 1. Architecture Overview FIG. 1 shows a conceptual system architecture. Conceptually, the image is progressively reduced to smaller components that carry different aspects of the image content,
Ultimately, all such components are encoded and packaged into a single compressed bitstream. FIG. 1 is a diagram showing CASAC (ie, context-adaptive continuous approximation and encoding).
1 is a block diagram illustrating an embodiment of an integrated scalable image compression / decompression system 100 based on (simplified notation of ation and Coding). In summary, the initially uncompressed image 105
Is compressed by a compressor 110 to generate a scalable compressed bitstream 115, which is then subjected to a required number of scaling operations 120. The decompressor 130 is then applied to the scaled bit stream to create a decompressed image 135.

【００１６】カラー画像は、通常、輝度および色度成分
(C₁,C₂,...,C_K)に分割される。一般に、カラー成分はい
くつもの数が可能であるが、本発明はC_k+1に比較してC_k
が一層重要なカラー成分であるとみなす。A color image is usually composed of luminance and chromaticity components.
(C ₁ , C ₂ ,..., C _K ). In general, the color component is susceptible to several of a number, the present invention is C _k compared to C _{k + 1}
Is considered to be a more important color component.

【００１７】次に、複解像度変換を使用して、各カラー
成分はそれぞれ独立して解像度レベルl=O,1,2,...,Lに
分解される。但し、l=0は当該カラー成分の最も低い解
像度表現を表わし、一方、後続レベルlの各々はレベルl
-1を基にして使用可能となる解像度の２倍の解像度を持
つ画像を復元するために必要とされる追加の細部を表現
する。パラメータLを複解像度変換におけるレベルの数
と呼ぶ。L=0は、オリジナル画像自体が使用可能な最も
低い解像度である場合に対応する。Next, using the multi-resolution conversion, each color component is independently decomposed into resolution levels l = O, 1, 2,..., L. Where l = 0 represents the lowest resolution representation of the color component, while each of the subsequent levels l
Represents the additional detail needed to recover an image with twice the resolution that would be available based on -1. The parameter L is called the number of levels in the multi-resolution conversion. L = 0 corresponds to the case where the original image itself has the lowest available resolution.

【００１８】本発明の１つの実施形態において、実施を
２つのタイプの複解像度変換のみに制限し、圧縮システ
ムが画像の異なる領域においてこれらの異なる変換の間
で切り替えることを可能とされる。変換の２つのタイプ
は次の通りである。 (1)オーバーコンプリート(overcomplete)変換。より低
い解像度画像を補間し、次に補間された画像および実際
のより高い解像度画像の間の差分を作り上げることによ
って、レベルlに関連するより高い解像度画像がより低
い解像度画像(レベルl-1)から取得される。この場合、
各解像度レベルlにおいてその標本が差分画像を表す単
一バンドC_k,l,0が存在する。この単一バンドは、より高
い解像度画像と全く同じ数の標本を含む。 (2)非オーバーコンプリート変換。これは、多分、標準
線形サブバンド／ウエーブレット変換に制限されるだろ
う。この場合、C_k,l,1、C_k,l,2、C_k,l,3という３つのバ
ンドが存在する。それらは、通常、ＬＨバンド(すなわ
ち垂直にローパスで水平にハイパスなバンド)、ＨＬバ
ンド(すなわち垂直にハイパスで水平にローパスなバン
ド)、およびＨＨバンド(すなわち垂直にハイパスで水平
にハイパスなバンド)として解釈される。これらの３つ
のバンドの各々は、より低い解像度画像と同じ数の標
本、すなわち高解像度画像の標本数の３分の１を含む。
最も低い周波数画像は、それ自体、レベルl=0およびb=0
すなわちC_k,0,0というバンド・インデックスを持つ単一
基底バンドとして表される。In one embodiment of the present invention, the implementation is limited to only two types of multi-resolution transforms, allowing the compression system to switch between these different transforms in different regions of the image. The two types of transformation are as follows. (1) Overcomplete conversion. By interpolating the lower resolution image and then building up the difference between the interpolated image and the actual higher resolution image, the higher resolution image associated with level l becomes the lower resolution image (level 1-1). Obtained from in this case,
At each resolution level l there is a single band C _{k, l, 0} whose sample represents the difference image. This single band contains exactly the same number of specimens as the higher resolution image. (2) Non-overcomplete conversion. This will probably be limited to the standard linear subband / wavelet transform. In this case, there are three bands, C _{k, l, 1} , C _{k, l, 2} , and C _{k, l, 3} . They are usually LH bands (ie, vertically low-pass, horizontally high-pass bands), HL bands (ie, vertically high-pass, horizontally low-pass bands), and HH bands (ie, vertically high-pass, horizontally high-pass bands). Is interpreted as Each of these three bands contains the same number of samples as the lower resolution image, ie, one third of the number of samples in the high resolution image.
The lowest frequency image itself has levels l = 0 and b = 0
That is _{, it} is represented as a single base band having a band index of C _{k, 0,0} .

【００１９】次に、各バンドC_k,l,bを表現するために使
用される個々の標本は、組み込みスカラー量子化器の集
合を使用して、独立して量子化され、その結果、量子化
シンボルのQ個の層C_k,l,b,1,C_k,l,b,2,...,C_k,l,b,Qが
生成される。連続的層は当該バンドの連続的近似におけ
る再現忠実性の高いコンポーネントとして解釈される。
具体的には、量子化シンボルC_k,l,b,1が最も粗い表現を
搬送し、その後、各連続的解像度C_k,l,b,qが当該バンド
の標本値をその前の量子化層によって表現された２倍の
精度で表現するために必要とされる追加情報を提供す
る。組み込み量子化器の詳細は後述される。Next, the individual samples used to represent each band C _{k, l, b} are independently quantized using a set of built-in scalar quantizers, resulting in a quantum Q _{k, l, b, 1} , C _{k, l, b, 2} , ..., C _{k, l, b, Q} are generated for the Q symbols of the coded symbol. A continuous layer is interpreted as a component with high reproduction fidelity in a continuous approximation of the band.
Specifically, the quantization symbol C _{k, l, b, 1} carries the coarsest representation, after which each successive resolution C _{k, l, b, q} quantizes the sample values of the band Provides the additional information needed to represent with twice the precision represented by the layers. Details of the built-in quantizer will be described later.

【００２０】全般的システム・アーキテクチャにおける
次の概念上のステップは、各量子化層C_k,l,b,qをビット
・ストリングへ符号化し、可能な限り効率的に冗長情報
を活用することである。本明細書はこのようなビット・
ストリングをThe next conceptual step in the overall system architecture is to encode each quantization layer C _{k, l, b, q} into a string of bits and utilize the redundant information as efficiently as possible. is there. This document describes such a bit
String

【００２１】[0021]

【数１】として、またその総バイト数に最も近い数に丸められた
長さを(Equation 1) And the length rounded to the nearest number of bytes

【００２２】[0022]

【数２】と表す。符号化のプロセスの詳細は後述される。(Equation 2) It expresses. Details of the encoding process will be described later.

【００２３】最後に、各バンドの各量子化層に関連する
ビット・ストリングが適切な記号ならびにヘッダ情報を
含む単一のビットストリームに集約され、カラー成分の
再現忠実性、解像度および数がビットストリームを後に
伸張するアプリケーションにとって最も適切なサブセッ
トへ該ビットストリームが縮小されることを可能にす
る。圧縮の後のそのような動作をサポートするビットス
トリームはスケーラブル(scalable)ビットストリームと
呼ばれる。ビットストリームをパッケージする際、通常
なんらかの目標ビット率に合致させるため初期ビットス
トリームの全体サイズを限定するため、なんらかのスケ
ーリングを直ちに適用する場合がある。Finally, the bit strings associated with each quantization layer of each band are aggregated into a single bit stream containing the appropriate symbols and header information, and the reproduction fidelity, resolution and number of the color components are Allows the bitstream to be reduced to the most appropriate subset for applications that decompress later. A bitstream that supports such an operation after compression is called a scalable bitstream. When packaging a bitstream, some scaling may be applied immediately to limit the overall size of the initial bitstream, usually to meet some target bit rate.

【００２４】実際の圧縮システムは図１が示すような線
形シーケンス動作を実施すべきではない点に注意する必
要がある。特に、概念上のアーキテクチャの各中間段階
で全画像表現をバッファ記憶することは非実用的であ
り、不必要である。本明細書は、各カラー成分が画像の
上部から開始して底部へ向け１回に１走査行ずつ画像ソ
ースから到来すると仮定する。メモリ効率のすぐれた実
施形態は、すべてのバンドを基本的には同時に生成しな
がら各バンドの走査行を順次生成すべきである。同様
に、すべての量子化層を同時に生成しながら各バンドの
量子化および符号化は順次プロセスでなければならな
い。そのようなメモリ効率的実施を可能にするため、特
定の量子化層に関する量子化および符号化方式は、別の
バンドからの情報またはその画像においてずっと以前に
出現したものと同じバンドの別の量子化層からの情報に
基づくものではない方がよいかもしれない。このような
理由から、実際、後述される特定の符号化方式は、あら
ゆるカラー成分のあらゆるバンドを完全に独立して取り
扱う。更に、任意の特定のバンドの層qからの量子化シ
ンボルは、そのバンドの１走査行以上下がった層q-1か
らの情報を参照することなく符号化されることができ
る。It should be noted that an actual compression system should not implement a linear sequence operation as shown in FIG. In particular, buffering the entire image representation at each intermediate stage of the conceptual architecture is impractical and unnecessary. This specification assumes that each color component comes from the image source one scan line at a time, starting at the top of the image and working towards the bottom. A memory efficient embodiment should generate scan lines for each band sequentially while generating all bands essentially simultaneously. Similarly, the quantization and encoding of each band must be a sequential process while simultaneously generating all quantization layers. To enable such a memory-efficient implementation, the quantization and coding scheme for a particular quantization layer may require information from another band or another quantum of the same band that appeared much earlier in the image. May not be based on information from the chemical layer. For this reason, in fact, the particular coding scheme described below treats every band of every color component completely independently. Further, the quantized symbols from layer q of any particular band can be encoded without reference to information from layer q-1 that has gone down one or more scan lines of that band.

【００２５】2. 複解像度変換図２は、変換レベルが２レベルある場合の複解像度画像
変換サブシステムの基本的特徴を示すブロック図であ
る。複解像度階層における各レベルは、オーバーコンプ
リート変換または非オーバーコンプリート変換、あるい
はそれらの混合を用いる。混合変換は合成ドキュメント
を圧縮する際に特に役立つと期待される。オーバーコン
プリート変換のみがなんらかのレベルで使用されるとす
れば、圧縮ビットストリームには単一バンドC_k,l,0だけ
が含まれる。非オーバーコンプリート変換のみが使用さ
れるとすれば、圧縮ビットストリームには３つのバンド
C_k,l,1、C_k,l,2、C_k,l,3が含まれる。混合変換が使用さ
れれば、上記４つのバンドすべてが含まれる。2. Multi-Resolution Conversion FIG. 2 is a block diagram showing basic features of the multi-resolution image conversion subsystem when there are two conversion levels. Each level in the multi-resolution hierarchy uses an overcomplete transform or a non-overcomplete transform, or a mixture thereof. Mixed transformations are expected to be particularly useful in compressing composite documents. If only the overcomplete transform is used at some level, the compressed bitstream contains only a single band C _{k, l, 0} . Assuming that only non-overcomplete transforms are used, the compressed bitstream contains three bands.
C _{k, l, 1} , C _{k, l, 2} , C _{k, l, 3} are included. If a mixed transform is used, all four bands are included.

【００２６】2.1. バンド解像度上述のように、複解像度変換における各レベルは、前の
レベルの半分の解像度で新しい画像を生成する。ここ
で、画像生成の記述に進む前に、該当解像度が２で除す
ることができない場合やカラー成分が異なる解像度を持
つ場合が発生するのではないかという疑問を解くため、
種々のカラー成分解像度と変換バンドの関係を先ず記述
する。H(C_k,l,b)をバンドC_k,l,bの高さ(すなわち標本走
査行総数)を表すものとする。同様に、W(C_k,l,b)をこの
バンドの幅(すなわち走査行当たりの標本数)を表すもの
とする。また、H(C_k,l)およびW(C_k,l)をレベルl=0,
1,...,lから再構築されることができるカラー成分の高
さおよび幅を表し、H(C_k)およびW(C_k)をカラー成分kの
高さおよび幅を表すものとする。最後に、L_kはk番目の
カラー成分と関連した複解像度階層におけるレベル数を
表すものとする。従って、H(C_k,L_k) = H(C_k)でありW
(C_k,L_k) = W(C_k)である。所与のカラー成分kに関して複
解像度階層の範囲内において利用できる種々の解像度
は、次の数３および数４の式によって関連づけられる。2.1. Band Resolution As described above, each level in the multi-resolution conversion produces a new image at half the resolution of the previous level. Here, before proceeding to the description of image generation, in order to answer the question of whether a case where the resolution cannot be divided by 2 or a case where color components have different resolutions may occur,
First, the relationship between various color component resolutions and conversion bands will be described. Let H (C _{k, l, b} ) represent the height of band C _{k, l, b} (that is, the total number of sample scanning rows). Similarly, let W (C _{k, l, b} ) represent the width of this band (ie, the number of samples per scan line). Also, H (C _{k, l} ) and W (C _{k, l} ) are changed to level l = 0,
Represent the height and width of the color components that can be reconstructed from 1, ..., l, let H (C _k ) and W (C _k ) represent the height and width of the color component k . Finally, let L _k denote the number of levels in the multi-resolution hierarchy associated with the kth color component. Therefore, H (C _k , L _k ) = H (C _k ) and W
(C _k , L _k ) = W (C _k ). The various resolutions available within a multi-resolution hierarchy for a given color component k are related by the following equations:

【００２７】[0027]

【数３】 (Equation 3)

【００２８】[0028]

【数４】上式において、(Equation 4) In the above formula,

【００２９】[0029]

【数５】 (Equation 5)

【００３０】は最も近い整数へ丸められることを表す。
１つの実施形態において、基底バンドC_k,0,0はすべての
カラー成分kに関して同じ解像度を持ち、第１のカラー
成分が最も高い解像度を持つ。これは、一層低い解像度
カラー成分C_kがRepresents rounding to the nearest integer.
In one embodiment, the basis bands C _{k, 0,0} have the same resolution for all color components k, with the first color component having the highest resolution. This is because the lower resolution color component C _k

【００３１】[0031]

【数６】および(Equation 6) and

【００３２】[0032]

【数７】を満たす場合に限り、複数カラー成分が異なる解像度を
持つことがあることを意味する。(Equation 7) It means that a plurality of color components may have different resolutions only when.

【００３３】オーバーコンプリート変換の場合、単一の
差分バンドC_k,1,0は、再構築するために使用される解像
度レベルと同じ解像度を持つ。すなわち、H(C_k,l,0)=H
(C_k,l)およびW(C_k,l,0)=W(C_k,l)である。一方、非オー
バーコンプリート変換は、各々が多くの標本の約４分の
１の解像度を持つ３つの詳細バンドを生成する。具体的
には次の３バンドが得られる。In the case of an overcomplete transform, a single difference band C _{k, 1,0} has the same resolution as the resolution level used to reconstruct. That is, H (C _{k, l, 0} ) = H
(C _{k, l} ) and W (C _{k, l, 0} ) = W (C _{k, l} ). On the other hand, the non-overcomplete transform produces three detail bands, each with about one quarter the resolution of many samples. Specifically, the following three bands are obtained.

【００３４】[0034]

【数８】 (Equation 8)

【００３５】これらの３つのバンドにおける標本の総数
は、常に、より低い解像度レベルおよびより高い解像度
レベルC_k,lおよびC_k,l-1における標本数の間の相違に等
しい点は容易に観察される。It is easy to observe that the total number of specimens in these three bands always equals the difference between the number of specimens at the lower and higher resolution levels C _{k, l} and C _{k, l-1} . Is done.

【００３６】2.2. オーバーコンプリート変換ここに記述される統一画像圧縮システムにおいて、オー
バーコンプリート変換オプションは、振幅レベルの少な
い画像データを圧縮することを意図している。すなわ
ち、カラー成分値の周波数グラフは、少数の独立した切
れ込みから構成されなければならない。この特例は、テ
キストおよび線画を走査することによって得られる２層
またはほぼ２層のデータである。このようなタイプのソ
ースに関しては、複解像度変換を適用して、線形変換の
場合のように振幅レベルを連続的に分散させるのではな
くむしろオリジナルの振幅数を保持することが望まし
い。より一般的に述べれば、変換の対象となるピクセル
のレベル・セットを保持する変換に重点がある。そのよ
うな変換は、形態上の演算子がレベル・セットを保持す
る演算子であるので、形態的変換として知られている。
これらの必要性に対処するため、本発明は、代替的関連
変換の使用を考慮しながら、非常に特定のオーバーコン
プリート形態的変換を提案する。2.2. Overcomplete Conversion In the unified image compression system described herein, the overcomplete conversion option is intended to compress image data with low amplitude levels. That is, the frequency graph of color component values must be composed of a small number of independent cuts. A special case of this is a double or near double layer of data obtained by scanning text and line art. For these types of sources, it is desirable to apply a multi-resolution transform to preserve the original number of amplitudes rather than scatter the amplitude levels continuously as in the case of a linear transform. More generally, the emphasis is on transforms that maintain a level set of pixels to be transformed. Such a transformation is known as a morphological transformation because the morphological operator is an operator that holds a level set.
To address these needs, the present invention proposes a very specific overcomplete morphological transformation, while considering the use of alternative related transformations.

【００３７】C_k,l[m,n]をレベルを、図２のレベル１に
示されるように、カラー成分kの行mおよび列n上の標本
値を示すものとする。C_k,l-1[m,n]=M(C_k,l[2m,2n],C_k,l
[2m,2n-1],C_k,l[2m,2n+1],C_k,l[2m-1,2n],C_k,l[2m+1,2
n])を設定することによって次に低い解像度レベルが形
成される。上式において、M(a,b,c,d,e)は５個の引数a,
b,c,d,eの中位数を示す。Let C _{k, l} [m, n] denote the level and the sample values on row m and column n of the color component k, as shown at level 1 in FIG. C _{k, l-1} [m, n] = M (C _{k, l} [2m, 2n], C _{k, l}
[2m, 2n-1], C _{k, l} [2m, 2n + 1], C _{k, l} [2m-1,2n], C _{k, l} [2m + 1,2
n]), the next lower resolution level is formed. In the above equation, M (a, b, c, d, e) is the five arguments a,
Indicates the median of b, c, d, e.

【００３８】このように、低解像度画像は、非線形中位
数フィルタリング演算によって引き出される。この方法
はテキストおよび図形に関して線形フィルタリングより
一層適切なものである。As described above, the low-resolution image is extracted by the nonlinear median filtering operation. This method is more appropriate for text and graphics than linear filtering.

【００３９】差分バンドC_k,l,0の標本は、C_k,lとC_k,l-1
の形態的に補間されたバージョンの差分から形成され
る。具体的には、The samples of the difference band C _{k, l, 0} are C _{k, l} and C _{k, l−1}
From the morphologically interpolated version of the difference. In particular,

【００４０】[0040]

【数９】である。上式において、P_k,l-1[m,n] {-1,1}は、符号
予測語と解釈されるべきものであり、(Equation 9) It is. In the above equation, P _{k, l-1} [m, n] {-1,1} is to be interpreted as a code prediction word,

【００４１】[0041]

【数１０】は次の数１1の形態的補間演算によって得られる。(Equation 10) Is obtained by the following morphological interpolation operation of Expression 11.

【００４２】[0042]

【数１１】 [Equation 11]

【００４３】ここで、形態的演算子A(a,b,...)(α,
β,...)は、第２のリストα,β,...における引数の平均
に最も近い第１のリストa,b,...nにおける引数の値を返
す。これらのサブ標本化および補間演算は図３に図示さ
れている。Here, the morphological operator A (a, b, ...) (α,
..) returns the value of the argument in the first list a, b,... n closest to the average of the arguments in the second list α, β,. These sub-sampling and interpolation operations are illustrated in FIG.

【００４４】符号予測語P_k,l-1[m,n]は、小さい数のレ
ベル・セットを持つ画像標本に存在する冗長度という重
要な形式を活用する際の重要な役割を果たす。具体的に
述べれば、補間誤差The code predictor P _{k, l-1} [m, n] plays an important role in exploiting the important form of redundancy that exists in image samples with a small number of level sets. Specifically, the interpolation error

【００４５】[0045]

【数１２】が大きい場合は必ず、この誤差の符号は、(Equation 12) Whenever is large, the sign of this error is

【００４６】[0046]

【数１３】の符号と一致する可能性が高いと予想される。ただし(Equation 13) Is likely to match the sign of However

【００４７】[0047]

【数１４】 [Equation 14]

【００４８】は利用できるレベル・セットの局所重心の
測定値である。これは、画像データが本質的に２レベル
であるかを理解する最も容易な方法である(レベル・セ
ットの周囲の変動は小さいかもしれないが、頻度グラフ
は２つの明確な切れ込みを持つ)。この場合、大きい補
間誤差の符号は、予測誤差がより低いレベル・セットへ
より高いレベル・セットを運び、またより高いレベル・
セットへより低いレベル・セットを運ぶという点におい
て、完全に予測できる。この場合上記の方向は、常に、
２つのレベル・セットの重心方向と同じである。このよ
うな理由に基づいて、次の予測語(数１５)は本発明の理
念の合理的実施形態を表現するものである。しかしなが
ら、代替的な予測語を適応することも可能である。Is a measure of the local centroid of the available level set. This is the easiest way to understand if the image data is essentially two levels (the fluctuations around the level set may be small, but the frequency graph has two distinct cuts). In this case, the sign of the large interpolation error will carry the higher level set to the lower level set and the higher level error
It is completely predictable in that it carries a lower level set to the set. In this case, the above direction is always
Same as the direction of the center of gravity of the two level sets. For these reasons, the following predictive word (Equation 15) represents a rational embodiment of the idea of the present invention. However, it is also possible to adapt alternative predictors.

【００４９】[0049]

【数１５】但し、(Equation 15) However,

【００５０】[0050]

【数１６】は、局所的に隣接する最小レベル・セットの実行時推定
を表し、一方、(Equation 16) Represents a run-time estimate of a set of locally adjacent minimum levels, while

【００５１】[0051]

【数１７】 [Equation 17]

【００５２】は、局所的に隣接する最大レベル・セット
の実行時推定を表す。メモリおよび計算処理要件を最小
限にしながら、このような推定を形成する効率的な技術
は多数存在する。単純な例をあげれば、いくつかの隣接
走査行における同じ位置の標本から最小値、すなわちRepresents the run-time estimate of the locally adjacent maximum level set. There are a number of efficient techniques for forming such estimates while minimizing memory and computational requirements. To give a simple example, the minimum value from the sample at the same position in several adjacent scan lines, that is,

【００５３】[0053]

【数１８】を取り出し、この値を使用して(Equation 18) And use this value to

【００５４】[0054]

【数１９】に関して次のような再帰的更新手順を導出することがで
きる。[Equation 19] , The following recursive update procedure can be derived.

【００５５】[0055]

【数２０】但し、α (0,1)は予測語がレベル・セットにおける変
化に適応することができる率を制御する指数減衰係数で
ある。(Equation 20) Where α (0,1) is an exponential decay coefficient that controls the rate at which predicted words can adapt to changes in the level set.

【００５６】上述された形態的変換が有損失および無損
失両方の画像圧縮と本質的に互換性を持つ点に注意する
必要がある。更に、この変換は、標本当たり１ビットだ
けを持つ画像を圧縮するため特に適している。なぜなら
ば、この場合、符号予測語が、後述の5.3項において記
述する符号ビット符号化段階においてビットを全く費消
しないことを保証するからである。このようにして、画
像材料の特性とは無関係に、後述の4項および5項におい
て記述するものと全く同じ量子化および符号化技術を使
用することができる。It should be noted that the morphological transformation described above is essentially compatible with both lossy and lossless image compression. Furthermore, this transform is particularly suitable for compressing images having only one bit per sample. This is because, in this case, the code prediction word guarantees that no bits are consumed in the code bit coding step described in section 5.3 below. In this way, independent of the properties of the image material, exactly the same quantization and coding techniques as described in sections 4 and 5 below can be used.

【００５７】2.3. 非オーバーコンプリート変換既述のように、非オーバーコンプリート変換は、単に、
従来技術の完全再構築(perfect reconstruction略してP
R)または準完全再構築(near-perfect reconstruction略
してNPR)線形サブバンド変換である。本発明の実施形態
においては、Adelson氏その他による９タップNPRフィル
タ法を使用したが(参照:Proc.SPIE(Cambridge, MA), Oc
t. 1987, pp.50-58所載のE.H.Adelson,E.Simoncelliお
よびR.Hingorami３氏著"Orthogonal pyramid transform
s for image coding")、種々のフィルタ・セットまたは
完全にプログラム可能なフィルタ係数がサポートされな
い理由はない。標準的対称的拡張技術を使用して、完全
再構築特性を保存するため余分な標本をサブバンドに詰
める必要がないことを保証することができる。2.3. Non-over-Complete Transformation As mentioned above, the non-over-complete transform is simply
Perfect reconstruction (P for short)
R) or near-perfect reconstruction (NPR) linear subband transformation. In the embodiment of the present invention, a 9-tap NPR filter method by Adelson et al. Was used (see: Proc. SPIE (Cambridge, MA), Oc.
"Orthogonal pyramid transform" by EHAdelson, E. Simoncelli and R. Hingorami3, t. 1987, pp. 50-58.
s for image coding "), there is no reason why different filter sets or fully programmable filter coefficients are not supported. Using standard symmetric extension techniques, extra samples may be used to preserve the full reconstruction properties. It can be ensured that there is no need to pack into subbands.

【００５８】変換フィルタ係数に関する重要な問題は、
それらがどのように正規化されるかである。なぜなら
ば、正規化は各バンドにおける標本に適用される量子化
の解釈に影響を及ぼすからである。この点に鑑み、ロー
パス・サブバンド・フィルタが単位DC利得を持つことす
なわちその係数の合計が１となることを確実にするため
すべてのフィルタが正規化されることを以下仮定する。
これは、画像標本値に関する名目範囲R(例えば８ビット
・データの場合R=255)が、すべての解像度レベルにおけ
る標本値の名目範囲でもあることを意味する。このよう
な正規化手法は、上記2.2項に記述された線形サブバン
ド変換と形態的変換の間の互換性を保証する。An important problem with the transform filter coefficients is that
How they are normalized. This is because normalization affects the interpretation of the quantization applied to the samples in each band. In view of this, it is assumed below that all filters are normalized to ensure that the low-pass sub-band filter has unity DC gain, that is, its coefficients sum to one.
This means that the nominal range R for image sample values (eg, R = 255 for 8-bit data) is also the nominal range for sample values at all resolution levels. Such a normalization technique guarantees compatibility between the linear subband transform and the morphological transform described in section 2.2 above.

【００５９】NPRフィルタおよび大部分のPRサブバンド
・フィルタは、無損失画像圧縮に対して適していない点
は注意する必要がある。しかしながら、有損失および無
損失両方の圧縮と互換性を持つサブバンド変換が知られ
ている。これらは、厳密再構築システムとして知られて
いる。そのような変換を全く同じフレームワークの範囲
内で使用することができない理由はない。It should be noted that NPR filters and most PR subband filters are not suitable for lossless image compression. However, subband transforms that are compatible with both lossy and lossless compression are known. These are known as exact reconstruction systems. There is no reason such a transformation cannot be used within the exact same framework.

【００６０】2.4. メモリおよびバンド幅要件この項において、全般的システム・メモリ要件およびメ
モリバンド幅使用に対する複解像度変換の意義を考察す
る。この分析においては単に線形サブバンド変換のみを
考慮する。なぜならば、オーバーコンプリート形態的変
換のカーネルが非常に小さい領域をサポートし、従って
メモリ要件を減少させるからである。2.4. Memory and Bandwidth Requirements This section discusses the overall system memory requirements and the significance of multi-resolution conversion for memory bandwidth usage. In this analysis, only the linear subband transform is considered. This is because the kernel of the overcomplete morphological transformation supports very small areas, thus reducing memory requirements.

【００６１】圧縮の間、画像データの到着の方法は、画
像の左上角から開始して、下方へ、１走査行ずつ１度に
１標本が辞書順に到着する。このことは、３つの最高頻
度サブバンドの各々および２分の１解像度画像に関する
走査行を生成するため、Tが垂直的低域または高域フィ
ルタのいずれかのタップの最大数を表すとして(例えば
2.3項におけるフィルタの場合T=9)、各カラー成分のT-1
走査行がバッファ記憶されなければならないことを意味
する。各標本が到着すると、過去のT-1標本をメモリか
ら読み取り、フィルタリングを実行し、関連サブバンド
に１標本を書き出す。従って、複解像度変換の最初のレ
ベルは、オリジナルの画像標本につきT個の標本トラン
ザクションを必要とする。カラー成分k-1に関する複解
像度階層における後続のレベルl<L_kは、解像度レベルC
_k,lにおいてT-1走査行がバッファ記憶され、C_k,lにおけ
る各標本に対してT個の標本トランザクションが実行さ
れるべきことを必要とする。しかしながら、C_k,lは、C
_k,l+1に対して数の点で４分の１の標本および長さの点
で半分の走査行を持つにすぎない。従って、圧縮に関す
るメモリ要件の上限は、オリジナル画像標本当たり2(T-
1)画像走査行および(4/3)T標本トランザクションであろ
う。During compression, the method of image data arrival is such that one sample arrives at a time, one scan line at a time, in dictionary order, starting from the upper left corner of the image. This means that T represents the maximum number of taps in either the vertical low-pass or high-pass filter to generate scan rows for each of the three highest frequency subbands and the half resolution image (e.g.,
For the filter in section 2.3, T = 9), T-1 of each color component
This means that the scan line must be buffered. As each sample arrives, the previous T-1 samples are read from memory, filtered, and one sample written to the relevant subband. Thus, the first level of the bi-resolution conversion requires T sample transactions per original image sample. Subsequent levels l <L _k in the multi-resolution hierarchy for color component k-1 are the resolution levels C
_{At k, l} , T-1 scan rows are buffered, requiring that T sample transactions be performed for each sample at C _{k, l} . However, C _{k, l} is
It only has a quarter sample in number and half scan lines in length for _{k, l + 1} . Therefore, the memory requirement for compression is limited to 2 (T-
1) Will be image scan lines and (4/3) T specimen transactions.

【００６２】しかしながら、多くの画像は標本当たり８
ビットだけで表現されのに対して、本実施形態はサブバ
ンド標本に関して標本当たり最高１６ビットまでを用い
ることを望む点に注意する必要がある。その場合、バッ
ファ・メモリ要件は、However, many images are 8 per sample
It should be noted that the present embodiment desires to use up to 16 bits per sample for subband samples, while being represented in bits only. In that case, the buffer memory requirements are:

【００６３】[0063]

【数２１】として表されるであろう。但し、メモリ・バンド幅は(Equation 21) Would be represented as However, the memory bandwidth is

【００６４】[0064]

【数２２】である。(Equation 22) It is.

【００６５】伸張に関しても、分析はやや直観的ではな
いが、状況は非常に類似している。詳細は省略して結果
だけを示せば、圧縮の間、画像標本あたりWith respect to decompression, the analysis is somewhat less intuitive, but the situation is very similar. If you omit the details and show only the results, during compression

【００６６】[0066]

【数２３】 (Equation 23)

【００６７】の中間記憶標本および(4/3)T標本トランザ
クションを必要とする。しかしながら、すべてのバッフ
ァ記憶される標本は、この場合の画像標本というよりむ
しろサブバンド標本である。画像標本は深さ８ビットだ
けであるがすべてのサブバンド標本は深さ１６ビットで
あると再び仮定すれば、伸張のために必要なバッファ・
メモリ要件は、Requires an intermediate storage sample and a (4/3) T sample transaction. However, all buffered samples are sub-band samples rather than image samples in this case. Assuming again that the image samples are only 8 bits deep but all subband samples are 16 bits deep, the buffer needed for decompression is
The memory requirements are

【００６８】[0068]

【数２４】として表わされ、メモリ・バンド幅は標本あたり-1+(8/
3)Tバイトである。(Equation 24) And the memory bandwidth is -1+ (8 /
3) T bytes.

【００６９】3. 圧縮ビットストリームの構成最終的圧縮ビットストリームは量子化および符号化の後
に構成されるが、圧縮システム全般の動作を明確するた
め、構成の詳細をまず記述する。図４は、本発明の１つ
の特定の実施形態に従ったビットストリーム自体の範囲
内の構成情報の構造の１例を示す。図４に示されるよう
に、ビットストリームは４つの主要な区画に分割され
る。3. Configuration of Compressed Bitstream Although the final compressed bitstream is configured after quantization and encoding, the details of the configuration are first described to clarify the operation of the overall compression system. FIG. 4 shows an example of the structure of the configuration information within the bitstream itself according to one particular embodiment of the invention. As shown in FIG. 4, the bitstream is divided into four main partitions.

【００７０】3.1. 第１ヘッダ区画最初の区画は以下の単純なエレメントからなる。 1.５バイトのマジック文字列"CASAC"。 2.ゼロと等しい１つのヌル終端子バイト。これは、ヘッ
ダにおける追加情報の包含をサポートするため将来修正
される場合もある。 3.基底バンド幅W(C_k,0,0)=W(C_k,0)を表す１６ビット符
号なし整数。これは、2.1項に記述されたように、すべ
てのカラー成分kに関して同一である。 4.基底バンド高さH(C_k,0,0)=H(C_k,0)を表す１６ビット
符号なし整数。これは、すべてのカラー成分kに関して
同一である。 5.基底バンドC_k,0の第１の(最も粗い)量子化層に関連す
る量子化ステップのサイズを表すもので、各カラ成分k
に関する１６ビット符号なし整数。このステップ・サイ
ズの特定の解釈は後段の4項において記述される。3.1. First Header Section The first section consists of the following simple elements: 1.5-byte magic string "CASAC". 2. One null terminator byte equal to zero. This may be modified in the future to support the inclusion of additional information in the header. 3. A 16-bit unsigned integer representing the base bandwidth W (C _{k, 0,0} ) = W (C _{k, 0} ). This is the same for all color components k, as described in section 2.1. 4. A 16-bit unsigned integer representing the base band height H (C _{k, 0,0} ) = H (C _{k, 0} ). This is the same for all color components k. 5. Represents the size of the quantization step associated with the first (coarse) quantization layer of the basis band C _{k, 0} , where each color component k
16-bit unsigned integer for. The specific interpretation of this step size is described in section 4 below.

【００７１】3.2. バンド・ヘッダビットストリームの第２の区画は、ビットストリームの
範囲内で表現されるすべての分解バンドを識別すると共
に、各バンドについて利用できる符号化された量子化層
の数およびサイズを記述する追加ヘッダ情報を含む。こ
こで記述する特定の実施形態に関して、各バンドに対す
るヘッダ情報は、以下の規則に従った特定の順序で記載
される。・l=0,1,2..としてレベルlに関するすべてのバンドはレ
ベルl+1に関するいかなるバンドより前に現れる。・上記の条件に従って、k=1,2,...としてカラー成分kに
関するすべてのバンドはカラー成分k+1に関するすべて
のバンドの前に現れる。・上記の条件に従って、b=0,1,2,...として、バンドbは
常にバンドb+1の前に現れる。これらの規則は、より重
要ではない情報に対してより重要な情報優先順位を与え
るだけではなく、バンドをユニークに識別するため、各
バンドのヘッダが始まる１バイト識別子によって運ばれ
なければならない情報の量を減らす。3.2. Band Header The second section of the bitstream identifies all the decomposition bands represented within the bitstream and the number of coded quantization layers available for each band. Contains additional header information describing the size. For the particular embodiment described herein, the header information for each band is written in a particular order according to the following rules. All bands for level l appear before any bands for level l + 1 as l = 0,1,2 .. According to the above conditions, all bands for color component k appear before all bands for color component k + 1 as k = 1,2,. According to the above conditions, band b always appears before band b + 1, as b = 0, 1, 2,. These rules not only give more important information priority to less important information, but also to uniquely identify the band, the information of the information that must be carried by a one-byte identifier where the header of each band begins. Reduce the amount.

【００７２】特定のバンドC_k,l,bに関するヘッダは以下
のエントリからなる。 1.１バイト識別子：最下位２ビットがバンド・インデッ
クスbを表し、次の４ビットが最高１６までの異なるカ
ラー成分kを表し、次の１ビットは幅W(C_k,l,b)が奇数で
あれば1を保持し、最上位１ビットは高さH(C_k,l,b)が奇
数であれば1を保持する。この情報はすべてのバンドの
解像度および解像度レベルとともに複解像度階層を再構
築するのに十分な情報であることは容易に認めることが
できるであろう。 2.バンドに利用できる量子化層を識別する１バイト。こ
のバイトの下位４ビットは、スキップされる最初の量子
化層の数を識別し、一方、上位４ビットは情報が利用で
きる最も精細な量子化層のインデックスを識別する。こ
のようにして、多くとも１５の量子化層がすべてのサブ
バンドについて符号化されることができる。最初の量子
化層をスキップする理由は、最も粗い量子化層(q=1)に
関連する量子化ステップ・サイズが、ヘッダの最初の部
分に記録される基底量子化ステップ・サイズΔ_k,0,1に
対して固定されるからである(この点は後の4項で詳述さ
れる)。しかしながら、そのような粗い量子化ステップ
・サイズのバンドは役に立つ情報を含まないかもしれな
い。 3.利用できる量子化層qの各々について１または複数バ
イトであって、その層すなわちThe header for a particular band C _{k, l, b} consists of the following entries: 1. 1-byte identifier: the least significant 2 bits represent band index b, the next 4 bits represent up to 16 different color components k, and the next 1 bit has width W (C _{k, l, b} ). If the number is odd, 1 is held. If the height H (C _{k, l, b} ) is odd, the 1 most significant bit holds 1. It will be readily appreciated that this information is sufficient to reconstruct the multi-resolution hierarchy with the resolution and resolution level of all bands. 2. One byte that identifies the quantization layer available for the band. The lower 4 bits of this byte identify the number of the first quantization layer to be skipped, while the upper 4 bits identify the index of the finest quantization layer for which information is available. In this way, at most 15 quantization layers can be coded for all subbands. The reason for skipping the first quantization layer is that the quantization step size associated with the coarsest quantization layer (q = 1) is the base quantization step size Δ _{k, 0} recorded in the first part of the header. _{, 1} (this point is described in detail in section 4 below). However, such a coarse quantization step size band may not contain useful information. 3. One or more bytes for each of the available quantization layers q,

【００７３】[0073]

【数２５】 (Equation 25)

【００７４】を符号化するために使用されたバイトの総
数を標示する。この最初のバイトの下位７ビットは、上
記数２５の下位ビットを保持し、追加バイトが必要とさ
れる場合最上位１ビットは1にセットされる。最上位ビ
ットが1にセットされる場合、次のバイトは、上記数２
５の次の７ビットを表し、必要に応じて以下同様に複数
のバイトが使用される。Indicates the total number of bytes used to encode the The lower 7 bits of this first byte hold the lower 25 bits of Equation 25 above, and the most significant 1 bit is set to 1 if additional bytes are needed. If the most significant bit is set to 1, the next byte is
Represents the next 7 bits of 5, and a plurality of bytes are used in the same manner as necessary.

【００７５】3.3. レベル特有情報ビットストリームの第３の区画は、複解像度階層の各レ
ベルで使用される特定の変換を識別する。各カラー成分
毎に、C_k,1を関連基底バンドに減少するために使用され
る変換を識別する情報がまず現れ、続いてC_k,2を減少す
るための情報が現れ、以下次々の情報が続く。変換情報
は同じレベルでのすべてのカラー成分に対しては共通で
あるが、レベルからレベルへと異なる可能性がある。ビ
ットストリームのこの部分に埋め込まれる情報は、他の
実施形態においては修正されることも可能であるが、本
実施形態においては、垂直方向にのみ変更が可能とされ
るように、変換は画像の幅全体にわたって整合するよう
に保たれる。3.3. Level Specific Information The third section of the bitstream identifies the particular transform used at each level of the multi-resolution hierarchy. For each color component, information identifying the transform used to reduce C _{k, 1} to the relevant base band first appears, followed by information to reduce C _{k, 2,} and so on. Followed by The conversion information is common for all color components at the same level, but can vary from level to level. The information embedded in this part of the bitstream can be modified in other embodiments, but in this embodiment, the transformation is performed on the image so that it can only be changed in the vertical direction. It is kept aligned over the entire width.

【００７６】各レベルに関するヘッダ情報は、特定の変
換およびその変換が使用される対象の連続的走査行の数
を識別する１または複数バイトのシーケンスからなる。
各バイトの最下位ビットは、オーバーコンプリート変換
(ビットがゼロ)または非オーバーコンプリート変換(ビ
ットが1)のいずれかを識別する。The header information for each level consists of a sequence of one or more bytes identifying the particular transform and the number of successive scan lines for which the transform is to be used.
Least significant bit of each byte is overcomplete conversion
Identifies either a (zero bit) or a non-overcomplete conversion (one bit).

【００７７】残りの７ビットは、オーバーコンプリート
変換の場合にバンドC_k,l,0に関してあるいは非オーバー
コンプリート変換の場合にバンドC_k,l,1、C_k,l,2および
C_k,l _,3に関して標本を生成するためこの変換が適用され
る対象の解像度成分C_k,1の連続走査行の数を識別する。
該当する解像度成分のすべてのH(C_k,l)走査行が処理さ
れるまで後続のバイトが同じ解釈を持って登場する。変
換が画像の１つの垂直セグメントと別の垂直セグメント
の間で切り替わると、それらをサブ画像とみなして、関
連セグメントの範囲内で適切な変換が独立して適用され
る。再構築される画像に目立つ欠陥ができることを避け
るため、そのような切り替えは、できれば、例えばテキ
スト部分と自然画像の間のような円滑で空白の領域に限
定されることが推奨される。The remaining 7 bits are for bands C _{k, l, 0 for} overcomplete conversion or for bands C _{k, l, 1} , C _{k, l, 2} and C _{k, l, 1 for} non-overcomplete conversion.
Identify the number of consecutive scan rows of the resolution component C _{k, 1} to which this transform is applied to generate a sample for C _{k, l} _{, 3} .
Subsequent bytes appear with the same interpretation until all H (C _{k, l} ) scan lines of the relevant resolution component have been processed. When the transformation switches between one vertical segment of the image and another, it considers them sub-images and applies the appropriate transformation independently within the relevant segment. To avoid creating noticeable defects in the reconstructed image, it is recommended that such switching be limited, if possible, to smooth, blank areas, for example, between text portions and natural images.

【００７８】このヘッダ情報は、レベルl=1,2,...L₁に
対して順に出現するが、b=0またはb≠0のいずれの場合
のバンド識別子がそのレベルのバンド・ヘッダ区画に現
れない限り、いかなる特定のレベルlもスキップされ
る。そうでなければ、特定の変換は固定的であって、b=
0またはb≠0のいずれかの省略によって特定の変換が暗
示される。[0078] The header information is level l = 1, 2, ... but appear in the order with respect to L _1, b = 0 or b ≠ any band header partition band identifier of that level in the case of the 0 , Any particular level l is skipped. Otherwise, the particular transformation is fixed and b =
Omission of either 0 or b ≠ 0 implies a particular transformation.

【００７９】3.4. 符号化される量子化層ビットストリームの最終区画は、バンドの各々から符号
化される量子化層を表すすべてのビット文字列を含む。
各ビット文字列は、上述のように個々のバンド・ヘッダ
にすでに記録されているバイトの数を示す整数で埋めら
れ、それによって、伸張およびビットストリーム・フィ
ルタリング・アプリケーションが関連コード・バイトを
容易に探り当てるインデックスが提供される。本実施形
態においては、量子化層q=1におけるすべてのバンドに
関する符号バイトがまず現れ、引き続き後続のq=2,3,..
に関する符号バイトが現れる。更に、このような順序に
従って、適切な量子化層qにおける情報を含むすべての
バンドに関する符号バイトが、ビットストリームの第２
の区画で現れたバンドに関するヘッダの順序と全く同じ
順序で現れる。このような単純な順序づけ手法は、符号
バイトが重要度の高い方から出現するので、プロクレッ
シブ伝送アプリケーションにおいてビットストリームは
そのまま使用できることを示唆する。これは、また、ビ
ットストリームを精密に解析することを必要とせずに効
果的エラー防止方式を実施することを可能にする。なぜ
ならば、雑音が多い経路を経由して圧縮画像を分配する
際にビットストリームの最初の部分は、明らかに、後方
の部分より十分強く保護されなければならないからであ
る。3.4. Quantized Layer to be Encoded The last section of the bitstream contains all the bit strings representing the quantized layer encoded from each of the bands.
Each bit string is padded with an integer indicating the number of bytes already recorded in the individual band header, as described above, so that decompression and bitstream filtering applications can easily associate the relevant code bytes. An index to find is provided. In this embodiment, the code bytes for all bands in the quantization layer q = 1 first appear, followed by subsequent q = 2,3,.
The sign byte for appears. Furthermore, according to such an order, the code bytes for all bands containing information in the appropriate quantization layer q are stored in the second bitstream.
Appear in exactly the same order as the headers for the bands that appeared in the section. Such a simple ordering scheme suggests that the bitstream can be used as is in progressive transmission applications, since the code bytes appear from the most important one. This also allows an effective error prevention scheme to be implemented without having to analyze the bitstream precisely. This is because, when distributing the compressed image via a noisy path, the first part of the bitstream must obviously be protected more strongly than the rear part.

【００８０】3.5. ビットストリームのスケーリング上述のような表現形態を有するビットストリームを所与
とすれば、特定の伸張アプリケーションにとって重要で
ないコンポーネントを破棄することは比較的大きな問題
ではない。例えば、白黒の情報が要求される場合、単に
カラー成分k>1に関連するすべての情報を破棄すればよ
い。解像度限度があれば、適切な解像度レベルlを越え
るすべてのバンド・ヘッダおよび関連符号バイトを単に
破棄するだけでよい。最も重要な考慮すべきスケーリン
グのタスクは、伸張アプリケーションに渡されるべきビ
ットストリームの全サイズに関して制限が課せられる場
合である。これは、ビット率スケーリングと呼ばれる。
この場合、符号バイトに関して利用可能なバイト数すな
わち3.5. Bitstream Scaling Given a bitstream having the above representation, discarding components that are not important for a particular decompression application is not a major problem. For example, if black and white information is required, all information relating to color component k> 1 may simply be discarded. If there is a resolution limit, all band headers and associated code bytes beyond the appropriate resolution level l need only be discarded. The most important scaling task to consider is when restrictions are imposed on the total size of the bitstream to be passed to the decompression application. This is called bit rate scaling.
In this case, the number of bytes available for the sign byte, ie

【００８１】[0081]

【数２６】および層サイズ情報すなわち(Equation 26) And layer size information ie

【００８２】[0082]

【数２７】 [Equation 27]

【００８３】に関する限度B_maxに到達するため、すべて
の必須ヘッダ情報に関連したバイトの数をまず減ずる。
B_k,l,b,qをC_k,l,b,qとこのサイズを表すために必要とさ
れるヘッダ・バイトの数の合計を表すものとする。従っ
て、B_k,l,b,qは、フィルタされたビットストリームにお
いてC_k,l,b,qを含むために必要とされるバイトの数であ
る。このような表記法を用いれば、以下のようなビット
率スケーリング・アルゴリズムが推奨される。ステップ１：次の数２８のような最大量子化層インデッ
クスq₀を発見する。To reach the limit B _max for, the number of bytes associated with all the required header information is first reduced.
_Let B _{k, l, b, q} denote the sum of C _{k, l, b, q} and the number of header bytes needed to represent this size. Thus, B _{k, l, b, q} is the number of bytes needed to contain C _{k, l, b, q} in the filtered bitstream. Using such notation, the following bit rate scaling algorithm is recommended. Step 1: Find the maximum quantization layer index q ₀ as shown in the following Expression 28.

【００８４】[0084]

【数２８】 [Equation 28]

【００８５】B = B_max-B_minとする。これは、q₀を越え
るすべての量子化層が破棄される場合、使われなくなる
利用可能バイトの数である。ステップ２：q=q₀とする。ステップ３：ビットストリームに出現する順序とまった
く同じ順序ですべてのバンドC_k,l,bを検証する。そのよ
うなバンドの各々に関して、q_k,l,bを情報が利用可能な
最も精細な量子化層のインデックスを示すものとする。ケース１：q_k,l,b<qであれば、なすべきことはなにもな
い。ケース２：q_k,l,b>qであるがB_k,l,b,q>Bであれば、この
バンドに関して層qを越えるすべての量子化層を破棄す
る。すなわち、q_k,l,b=qとする。ケース３：q_k,l,b>qであってB_k,l,b,q≦Bであれば、Bか
らB_k,l,b,qを差し引く。ステップ４：q=15であれば、処理は完了である。さもな
ければ、q=q+1として、ステップ２へ戻る。このアルゴ
リズムは本質的にはほぼ等しい数の量子化層をすべての
バンドに割り当てようとしている点に注意する必要があ
る。4.2項で記述する量子化の方針は、等しい数の量子
化層をすべてのバンドに割り当てる方針が良好なビット
率割り当て方針であることを保証するように、基底量子
化ステップ・サイズを種々のバンドに割り当てることを
試みている。[0085] and B = B _max -B _min. This is the number of available bytes that will be unused if all quantization layers beyond q ₀ are discarded. Step 2: with q = q _0. Step 3: Verify all bands C _{k, l, b} in exactly the same order as they appear in the bitstream. For each such band, let q _{k, l, b} denote the index of the finest quantization layer for which information is available. Case 1: If q _{k, l, b} <q, there is nothing to do. Case 2: If q _{k, l, b} > q but B _{k, l, b, q} > B, discard all quantization layers beyond layer q for this band. That is, q _{k, l, b} = q. Case 3: If q _{k, l, b} > q and B _{k, l, b, q} ≦ B, subtract B _{k, l, b, q} from B. Step 4: If q = 15, the process is completed. Otherwise, set q = q + 1 and return to step 2. It should be noted that this algorithm essentially assigns approximately equal numbers of quantization layers to all bands. The quantization strategy described in Section 4.2 is to set the base quantization step size to different bands so that the strategy of assigning an equal number of quantization layers to all bands is a good bit rate assignment strategy. Trying to assign to.

【００８６】3.6. ビットストリームのバッファ記憶本明細書においては、ビットストリームが一般にオリジ
ナルの画像より非常に小さくなければならないので、圧
縮されるビットストリーム自体のメモリ使用およびメモ
リ・バンド幅要件にさほどの関心は払われていない。汎
用ＣＰＵ上で本システムを実施する場合、画像が圧縮さ
れている間圧縮ビット文字列すなわち3.6. Buffer Storage of the Bitstream In this specification, the bitstream must generally be much smaller than the original image, so the memory usage and memory bandwidth requirements of the compressed bitstream itself are very small. Interest is not paid. When implementing the present system on a general-purpose CPU, the compressed bit string, ie, while the image is compressed,

【００８７】[0087]

【数２９】 (Equation 29)

【００８８】を単にメモリにバッファ記憶し、画像が完
全に圧縮された後一度だけ最終的ビットストリームを構
成し書き出せば通常は全く満足できる。しかしながら、
高いビット率の場合あるいは極めて大きい画像を処理す
る場合、それでは満足できないこともある。類似した問
題は、特別仕様のＡＳＩＣ実施形態においても発生する
可能性がある。この点が問題であれば、以下の記述は問
題解決に役立つと判明するかもしれない。It is usually quite satisfactory to simply buffer the memory in memory and construct and write out the final bitstream only once after the image has been fully compressed. However,
At high bit rates or when processing very large images, this may not be satisfactory. A similar problem can occur in custom ASIC embodiments. If this is a problem, the following may prove useful in solving the problem.

【００８９】はじめに、注意すべきことではあるが、画
像全体が圧縮された後までは最終的ビットストリームを
生成する効率的手段はない。なぜならば、圧縮システム
のいかなるメモリ効率的実施形態も、実質的に並列的に
すべてのバンドのすべての量子化層を生成している必要
があるが、一方、ビットストリームは、関連ビット列は
１つずつ逐次出現し、各量子化層を圧縮するために使用
される符号バイトの数を反映するようにヘッダ情報が生
成されることを必要とする。First, it should be noted that there is no efficient means to generate the final bitstream until after the entire image has been compressed. Because, any memory efficient embodiment of the compression system needs to generate all quantization layers of all bands substantially in parallel, while the bit stream has one associated bit stream. And requires that header information be generated to reflect the number of code bytes used to compress each quantization layer.

【００９０】ビットストリームは、また、一定のサイズ
制限を満たすため3.5項において記述されたアルゴリズ
ムを使用してフィルタリングされる必要があるかもしれ
ない。従って、ビットストリームは、なんらかのランダ
ム・アクセス能力を持つメモリに初期的にバッファ記憶
されなければならない。しかしながら、幸いにも、メモ
リ機構は、特に速い必要はなく、バイト単位のアドレス
機能を持つ必要はない。１つの適切な手法は、例えば各
々が64バイトの記憶ブロックのリンクされたリストを作
成し、各バンドの各量子化層について１つのリストを維
持するものである。8量子化層、3カラ成分、および、7
解像度レベルの各々について完全な4バンドを仮定する
ならば、圧縮器は672個のそのようなブロックすなわち4
3Ｋバイトにのみアクセスするだけでよく、画像圧縮の
間に発生する必要性に応じてブロックを任意の順序で比
較的大きな記憶機構に書き込むことができる。当然のこ
とながら、画像が圧縮されたならば、ブロックはその大
きいメモリ機構から取り出され、新たなビットストリー
ムに再編成される必要がある。当然、実際のビットスト
リームではなく、このような中間記憶ブロックのバッフ
ァリングは、なんらかのオーバーヘッドを伴い、当該記
憶機構は最終的圧縮ビットストリームに必要とされるも
のより多いデータを保有するに足る大きさでなければな
らない。The bitstream may also need to be filtered using the algorithm described in section 3.5 to meet certain size restrictions. Therefore, the bitstream must be initially buffered in a memory with some random access capability. Fortunately, however, the memory mechanism need not be particularly fast and need not have byte-by-byte addressing capabilities. One suitable approach is to create a linked list of storage blocks, each of 64 bytes, for example, and maintain one list for each quantization layer in each band. 8 quantization layers, 3 color components, and 7
Assuming a complete 4 bands for each of the resolution levels, the compressor would have 672 such blocks or 4
Only 3 Kbytes need to be accessed, and the blocks can be written to a relatively large storage in any order, as needed during image compression. Of course, once the image has been compressed, the blocks need to be fetched from its large memory facility and rearranged into a new bitstream. Of course, buffering of such intermediate storage blocks, rather than the actual bitstream, involves some overhead, and the storage mechanism is large enough to hold more data than is required for the final compressed bitstream. Must.

【００９１】4. 組み込み量子化上記1項で言及したように、各バンドC_k,l,bは量子化層C
_k,l,b,1,C_k,l,b,2,...C_k,l,b,qという集合に分解され、
バンドの標本の連続的に高い忠実性の表現が提供され
る。各カラー成分kの基底バンドC_k,0,0以外のすべての
バンドは、組み込みデッドゾーン量子化器を使用して量
子化される。なぜなら、これらのバンドのすべては、値
ゼロが特別な重要性を持つ高い周波数情報を表現するか
らである。従って、中心の量子化間隔(すなわちデッド
ゾーン)がゼロの周囲におかれることは重要である。一
方、基底バンドは、値ゼロが特別の意味を持たない低頻
度の情報を表現する。これらのバンドは4.2項で記述さ
れるように一様に量子化される。4. Built-in quantization As mentioned in the above section 1, each band C _{k, l, b} is a quantization layer C
_{k, l, b, 1} , C _{k, l, b, 2} , ... C _{k, l, b, q}
A continuously high fidelity representation of the band samples is provided. All bands except the base band C _{k, 0,0 of} each color component k are quantized using a built-in dead zone quantizer. Because all of these bands, the value zero represents high frequency information of special importance. Therefore, it is important that the center quantization interval (ie, the dead zone) be around zero. The base band, on the other hand, represents infrequent information where the value zero has no special meaning. These bands are uniformly quantized as described in Section 4.2.

【００９２】4.1. 組み込みデッドゾーン量子化器図５は、基底バンドC_k,0,0以外のバンドに関して量子化
層を生成するために使用される複数組み込みデッドゾー
ン量子化器の間の関係を示す。量子化器の集合全体は、
最も粗い量子化ステップ・サイズを識別する単一のパラ
メータΔ_k,l,bによって特徴づけられる。注意すべき
は、これらのバンドによって表現される高周波数情報が
ゼロの周囲に群がる傾向があるという利点を生かして、
ゼロの周囲のデッドゾーンは2Δ_k,l,bの幅を持つ。実際
問題として、これらの組み込み量子化器は、最初に関連
標本値を符号つき大きさ表現に変換し、次に適切な数の
下位ビットを破棄することによって非常に効率的に実施
される。S_k,l,b[m,n]をバンドC_k,l,bの走査行mおよび列
n上の標本C_k,l,b[m,n]に関する符号ビット(1は負0は正)
を表し、M_k,l,b[m,n]はスケーリングの大きさ(整数)を
示すものとする。すなわち、4.1. Embedded Dead Zone Quantizer FIG. 5 shows the relationship between multiple embedded dead zone quantizers used to generate the quantization layer for bands other than the base band C _{k, 0,0.} Show. The whole set of quantizers is
It is characterized by a single parameter Δk _{, l, b} that identifies the coarsest quantization step size. Note that, taking advantage of the fact that the high frequency information represented by these bands tends to swarm around zero,
The dead zone around zero has a width of 2Δk _{, l, b} . As a practical matter, these built-in quantizers are implemented very efficiently by first converting the relevant sample values to a signed magnitude representation and then discarding the appropriate number of lower bits. Let S _{k, l, b} [m, n] be the scan row m and column of band C _{k, l, b}
Sign bit for sample C _{k, l, b} [m, n] on n (1 is negative, 0 is positive)
And M _{k, l, b} [m, n] indicates the magnitude of scaling (integer). That is,

【００９３】[0093]

【数３０】 [Equation 30]

【００９４】である。但し、Rはオリジナルの画像標本
値の名目範囲であり(8ビットの画像標本の場合R=255)、
Q_k,l,bは、バンドC_k,l,bに関する生成を行う場合に本実
施形態が関心を持つ最も精細な量子化層のインデックス
である。次に、量子化層q<q_k,l,bに関する量子化シンボ
ルが、M_k,l,b,q[m,n]のゼロ以外の値に関する符号ビッ
トS_k,l,b[m,n]と共に、次の数３１のようなシフト下げ
された大きさによって特徴づけられる。Is as follows. Where R is the nominal range of the original image sample values (R = 255 for an 8-bit image sample),
Q _{k, l, b} is the index of the finest quantization layer that is of interest in the present embodiment when generating the band C _{k, l, b} . Next, the quantization symbol for the quantization layer q <q _{k, l, b} becomes the code bit S _{k, l, b} [m, m for the non-zero value of M _{k, l, b, q} [m, n]. n] together with the shifted magnitude as in the following equation 31.

【００９５】[0095]

【数３１】 (Equation 31)

【００９６】このようにして、M_k,l,b[m,n]を取得する
ため、M_k,l,b[m,n]からQ_k,l,b-q+1-σ_k,l,b,q[m,n]下位
ビットを破棄すればよい。但し、σ_k,l,b,q[m,n]は量子
化適用パラメータであって、値はゼロまたは１である。
σ_k,l,b,q[m,n]=0の時、効率的量子化ステップ・サイズ
が(R/2¹⁵)(Δ_k,l,b/2^q-1)であることは容易にわかる。
これは、名目範囲が2¹⁵に等しくなるまで標本値のスケ
ーリングを上げ、次に第１の量子化層に関するステップ
・サイズとしてΔ_k,l,bを、第２の量子化層に関するス
テップ・サイズとしてΔ_k,l,b/2を、以下の各層に同様
な値を使用するものとして解釈される。σ_k,l,b,q[m,n]
=1の時、効果的量子化ステップ・サイズは半分すなわち
(R/2¹⁵)(Δ_k,l,b/2^q)である。[0096] In this _{manner, M k, l, b [} m, n] for _{acquiring, M k, l, b [} m, n] from _{Q k, l, b -q +} 1-σ k, _{l, b, q} [m, n] lower bits may be discarded. Here, σ _{k, l, b, q} [m, n] is a quantization application parameter, and its value is zero or one.
When σ _{k, l, b, q} [m, n] = 0, it is easy for the efficient quantization step size to be (R / 2 ¹⁵ ) (Δ _{k, l, b} / 2 ^q-1 ) I understand.
This increases the scaling sample values up to a nominal range is equal to 2 ^15, then delta _{k, l} as a step size for the first quantization _layer, the _b, step size for the second quantized layer as delta _{k, l,} the _b / 2, be construed as using the same values for the following layers. σ _{k, l, b, q} [m, n]
When = 1, the effective quantization step size is half, ie
(R / 2 ¹⁵ ) (Δ _{k, l, b} / 2 ^q ).

【００９７】次に、量子化適用パラメータσ_k,l,b,q[m,
n]について記述する。このパラメータがどのように形成
されるか理解するため、各量子化層が辞書の順序すなわ
ち走査行毎に生成され、符号化された後に復号されるこ
と、および、量子化qが利用可能な時はいつでも常に量
子化層q-1が利用可能であることを理解することは重要
である。更に、量子化層q-1に関する復号プロセスは、
常に量子化層qに関する復号プロセスより１走査行先行
していて、これは、M_k,l,b,q-1[m,n+1],M_k,l,b,q-1[m+
1,n-1],M_k,l,b,q-1[m+l,n]およびM_k,1,b,q-1[m+1,n+1]
が、M_k,l,b,q[m n]を復号する時までには既知となって
いることを意味する。当然、M_k,l,b,q[m,n],M_k,l,b,q[m
-1,n-1],M_k,1,b,q[m-1,n]およびM_k,l,b,q[m-1,n]も既知
であり、これら８つの量子化された隣接標本の大きさの
すべては、M_k,l,b,q[m,n]の量子化および符号化を条件
付けるために使用される。この隣接情報は図６に示され
ている。ここで、σ'_k,l,b,q[m,n]はこれら８つの隣接
標本の大きさがゼロに等しければゼロに等しいものとす
る。従って、影響する過去において標本の４つの最近接
標本が層qにおいて非ゼロであると判明した場合あるい
は残りの最近接標本が層q-1において非ゼロであると判
明した場合、σ'_k,l,b,q[m,n]=1である。次に、近接パ
ラメータσ'_k,l,b,q[m,n]の観点からσ_k,l,b,q[m,n]を
定義する。具体的に述べれば、Next, the quantization application parameter σ _{k, l, b, q} [m,
n]. To understand how this parameter is formed, each quantization layer is generated in dictionary order, i.e., every scan row, decoded after being encoded, and when quantization q is available. It is important to understand that the quantization layer q-1 is always available. Further, the decoding process for the quantization layer q-1
It is always one scan line ahead of the decoding process for the quantization layer q, which is M _{k, l, b, q-1} [m, n + 1], M _{k, l, b, q-1} [m +
1, n-1], M _{k, l, b, q-1} [m + 1, n] and M _{k, 1, b, q-1} [m + 1, n + 1]
Is known by the time M _{k, l, b, q} [mn] is decoded. Of course, M _{k, l, b, q} [m, n], M _{k, l, b, q} [m
-1, n-1], M _{k, 1, b, q} [m-1, n] and M _{k, l, b, q} [m-1, n] are also known, and these eight quantized All of the adjacent sample sizes used are used to condition the quantization and coding of M _{k, l, b, q} [m, n]. This neighbor information is shown in FIG. Here, σ ′ _{k, l, b, q} [m, n] is assumed to be equal to zero if the size of these eight adjacent samples is equal to zero. Thus, if the four closest samples of the sample were found to be non-zero in layer q in the affected past, or if the remaining closest samples were found to be non-zero in layer q-1, σ ′ _{k, l, b, q} [m, n] = 1. Next, σ _{k, l, b, q} [m, n] is defined from the viewpoint of the proximity parameter σ ′ _{k, l, b, q} [m, n]. To be specific,

【００９８】[0098]

【数３２】 (Equation 32)

【００９９】である。上記の定義は機能的というよりむ
しろプロシージャ上の観点から理解し実施する方が簡単
である。具体的に述べれば、M_k,l,b,q[m,n]を評価する
際以下のステップが実行される。 1.すでに符号化された情報に依存するσ'_k,l,b,q[m,n]
の値をチェックする。σ'_k,l,b,q[m,n]が1であればすな
わち隣接値が非ゼロであれば、σ_k,l,b,q[m,n]=1に関連
する精細量子化しきい値に基づいてM_k,l,b,q[m,n]を直
ちに計算する。 2.大きさが前の量子化層において既に非ゼロであると判
明していればすなわちM_k,l,b,q-1[m,n]≠0であれば、σ
_k,l,b,q[m,n]は１に等しいにちがいない。事実、σ
_k,l,b,q[m,n]は、qの非減少関数であるように観察され
る。 3.σ'_k,l,b,q[m,n]=0でM_k,l,b,q-1[m,n]=0であれば、現
在時標本の周囲の３×３窓の範囲内にはゼロ以外の大き
さを持つものは何もないことが判明する。この場合、σ
_k,l,b,q[m,n]がゼロに等しいという仮定に基づいてM
_k,l,b,q[m,n]をまず計算する。これがゼロという量子化
された大きさを生むならば、仮定は正しい。そうでない
場合、M_k,l,b[m,n]>2Xであり(但しX=Q_k,l,b-q+1)、これ
は、σ_{k,l,b,q[m,n]=1}であり、M_k,l,b,q[m,n]がこの新
しい情報に基づいて再計算されなければならないことを
意味する。Is as follows. The above definitions are easier to understand and implement from a procedural perspective than a functional one. Specifically, the following steps are performed when evaluating M _{k, l, b, q} [m, n]. 1. σ ' _{k, l, b, q} [m, n] depending on already encoded information
Check the value of. If σ ' _{k, l, b, q} [m, n] is 1, that is, if the adjacent value is non-zero, fine quantization related to σ _{k, l, b, q} [m, n] = 1 Calculate M _{k, l, b, q} [m, n] immediately based on the threshold. 2. If the magnitude has already been found to be non-zero in the previous quantization layer, that is, if M _{k, l, b, q-1} [m, n] ≠ 0, then σ
_{k, l, b, q} [m, n] must be equal to 1. In fact, σ
_{k, l, b, q} [m, n] is observed to be a non-decreasing function of q. 3. If σ ' _{k, l, b, q} [m, n] = 0 and M _{k, l, b, q-1} [m, n] = 0, 3 × 3 window around the current sample Turns out to have no non-zero magnitude in the range. In this case, σ
M based on the assumption that _{k, l, b, q} [m, n] is equal to zero
First _{, k, l, b, q} [m, n] are calculated. If this yields a quantized magnitude of zero, the assumption is correct. Otherwise, M _{k, l, b} [m, n]> 2X (where X = Q _{k, l, b} -q + 1), which is σ _{k, l, b, q [m, n ] = 1,} which means that M _{k, l, b, q} [m, n] must be recalculated based on this new information.

【０１００】ソフトウェアの実施においては、これらの
手続きステップは、マスク、テストおよびシフト演算の
単純なセットを用いて実施されことができ、それらセッ
トは5項において記述される符号化方式を実施するため
に必要な演算と同様である。In a software implementation, these procedural steps can be performed using a simple set of mask, test, and shift operations, which are performed to implement the encoding scheme described in Section 5. Is the same as the operation required for

【０１０１】上記のσ_k,l,b,q[m,n]の定義は、特定の標
本が粗いステップ・サイズ(R/2¹⁵)(Δ_k,l,b/2^q-1)を越
える大きさを持つことが判明すると直ちに、ステップ・
サイズ(R/2¹⁵)(Δ_k,l,b/2^q)を持つ一層精細な量子化器
の使用に切り替えて、その標本およびその近隣のすべて
を量子化し符号化することを意味する。結果として、す
べての隣接標本をゼロに量子化しながら分離された標本
値がしきい値を上まわり、特に不快な欠陥を生成するこ
ととなるような非適応型量子化方式で遭遇された共通の
問題が回避される。この方式は、また、高いコントラス
トのエッジの近辺の量子化ステップ・サイズを自然に減
少させる傾向を持ち、これによって再構築された画像に
おける「環状」欠陥が減少する。The above definition of σ _{k, l, b, q} [m, n] indicates that a specific sample has a coarse step size (R / 2 ¹⁵ ) (Δ _{k, l, b} / 2 ^q-1 ). As soon as it turns out to have a size that exceeds
Switching to the use of a finer quantizer with size (R / 2 ¹⁵ ) (Δ _{k, l, b} / 2 ^q ) implies quantizing and encoding the sample and all of its neighbors. As a result, the common sample encountered in non-adaptive quantization schemes, where the separated sample values exceed the threshold value while quantizing all adjacent samples to zero, creating particularly unpleasant defects The problem is avoided. This scheme also tends to naturally reduce the quantization step size near high contrast edges, thereby reducing "ring" defects in the reconstructed image.

【０１０２】4.2. 組み込み均一量子化器 K基底バンドC_k,0,0は、図７に示されるように、組み込
み均一量子化器の単純なセットを使用してすべて量子化
される。１つの実施形態において、これら組み込み均一
量子化器は、4.1項に記述された組み込みデッドゾーン
量子化器の場合と全く同じ方法で実施されるが、標本値
の符号つき大きさ表現ではなくこの場合は２の補数表現
を用いて量子化を開始する点が相違する。具体的に述べ
れば、V_k,0,0[m,n]を4.2. Embedded Uniform Quantizer The K basis bands C _{k, 0,0} are all quantized using a simple set of embedded uniform quantizers, as shown in FIG. In one embodiment, these built-in uniform quantizers are implemented in exactly the same way as the built-in dead zone quantizer described in Section 4.1, but instead of using a signed magnitude representation of the sample values, Differs in that the quantization is started using the two's complement representation. Specifically, V _{k, 0,0} [m, n] is

【０１０３】[0103]

【数３３】 [Equation 33]

【０１０４】によって定義される２の補数整数を表すも
のとすれば(但し、Q_k,0,0はこの場合もカラー成分kの基
底バンドに関して生成することに関心を持たれる最も精
細な量子化層のインデックスである)、層q≦Q_k,0,0に関
する量子化シンボルは、(2) where Q _{k, 0,0} is again the finest quantization that is of interest to generate with respect to the base band of color component k. The quantization symbol for layer q ≦ Q _{k, 0,0} is

【０１０５】[0105]

【数３４】 (Equation 34)

【０１０６】によって定義される符号付き２の補数整数
V_k,0,0[m,n]と１対１の対応関係にある。すなわち、V
_k,0,0[m,n]から下位Q_k,0,0-q+1ビットが単に破棄され
る。層qに関する効果的量子化ステップ・サイズが(R/2
¹⁵)(Δ_k,0,0/2^q-1)であることは容易に認められる。こ
れは、標本値をそれらの範囲が2¹⁵に等しくなるまでス
ケーリングをあげ、次に第１の量子化層に対してΔ
_k,0,0を、第２の量子化に対して1/2Δ_k,0,0を、後続の
層に同様のサイズを使用するものとして解釈される。他
のすべてのバンドと違って、基底バンドには量子化適合
方針は一切使用されない。すなわちその量子化ステップ
・サイズは、量子化されつつある標本の空間文脈とは無
関係である。Signed two's complement integer defined by
There is a one-to-one correspondence with V _{k, 0,0} [m, n]. That is, V
_The lower Q _{k, 0,0} -q + 1 bits from _{k, 0,0} [m, n] are simply discarded. The effective quantization step size for layer q is (R / 2
¹⁵ ) (Δ _{k, 0,0} / 2 ^q-1 ). This raised the scaling sample values to their range is equal to 2 ^15, then Δ relative to the first quantization layer
_{k, 0,0} is interpreted as using 1 / 2Δk _{, 0,0} for the second quantization, using a similar size for subsequent layers. Unlike all other bands, no quantization adaptation policy is used for the base band. That is, the quantization step size is independent of the spatial context of the sample being quantized.

【０１０７】4.3. 異なるバンド間の関係ここまでは、量子化方針は、各バンドに関連する基底量
子化ステップ・サイズΔ_k,l,bに依存していた。これら
の量子化ステップ・サイズがどのように関連するのかと
いうことは自然で重要な疑問である。3.1項に記述され
たビットストリーム構文が、各カラー成分に関して１つ
の基底だけの仕様を与えるという事実の観点において、
上記疑問は特に重要である。１つの実施形態において、
この疑問に対する解答は非常に単純に、4.3. Relationship Between Different Bands So far, the quantization strategy has been dependent on the base quantization step size Δ _{k, l, b} associated with each band. It is a natural and important question how these quantization step sizes are related. In view of the fact that the bitstream syntax described in Section 3.1 gives a specification of only one basis for each color component,
The above question is particularly important. In one embodiment,
The answer to this question is very simple,

【０１０８】[0108]

【数３５】 (Equation 35)

【０１０９】と定義される。すなわち、対応する量子化
層に関連する量子化ステップ・サイズは、複解像度階層
におけるすべてのバンドについて同一であって、次のよ
り高い解像度レベルへ階層を移動する度毎に２倍され
る。少なくとも線形のサブバンド／ウェーブレット変換
の場合、すべてのバンドが常に量子化層の同じ数を割り
当てられることを前提として、再構築される画像の平均
平方誤差の最小化という点からこの方針がほぼ最適であ
るとみなされる。実際、3.5項で記述されたスケーリン
グ・アルゴリズムは、通常、一層低い頻度バンドに一層
多くの加重を与える。なぜならば、それら低頻度バンド
は、一旦すべてのバンドが量子化層の初期的に同じ数を
割り当てられた後、残りのバイトに対する候補として高
い優先度を持つからである。また、人間の視覚システム
がより低い空間頻度に対して一般に比較的敏感であると
いう事実を考慮すれば、これは非常に合理的である。実
際、非公式の主観的な実験では、異なるバンドに関する
量子化ステップ・サイズの上記の関係がほぼ最適に近い
ことが示唆されている。Is defined as That is, the quantization step size associated with the corresponding quantization layer is the same for all bands in the multi-resolution hierarchy, and is doubled each time the hierarchy is moved to the next higher resolution level. For at least the linear subband / wavelet transform, this strategy is almost optimal in terms of minimizing the mean square error of the reconstructed image, assuming that all bands are always assigned the same number of quantization layers. Is assumed to be In fact, the scaling algorithm described in section 3.5 usually gives more weight to lower frequency bands. This is because those low frequency bands have high priority as candidates for the remaining bytes once all bands are initially assigned the same number of quantization layers. This is also very reasonable considering the fact that the human visual system is generally relatively sensitive to lower spatial frequencies. In fact, informal subjective experiments suggest that the above relationship of quantization step sizes for different bands is nearly optimal.

【０１１０】5. 量子化層の符号化各バンドC_k,l,b,qに関連する量子化層C_k,l,b,qを符号化
するために使用される技術を以下に記述する。この実施
形態の場合、符号化アルゴリズムは、メモリおよび計算
処理の資源を圧縮性能の犠牲なしに最小に保つように設
計されている。主として許容可能な限度内に中間バッフ
ァ・メモリを保つため以下の主要点が決定された。・各バンドを別々に符号化する。C_k,l,b,q[m,n]を符号
化するために使用される条件付け文脈において前の解像
度における同じバンドからの情報を含むことによってい
くらかの利点が得られるが、圧縮効率の改善が約５％以
上となることはほとんどない。複解像度階層における連
続するレベルの間でいくつかの走査行の自然の遅れがあ
るという事実に起因する中間記憶域容量の増加という観
点からこの程度の改善を正当化することは困難である。
また、あらゆるバンドを独立して符号化することは、ハ
ードウェアでの圧縮方式の実施を単純化し、そのような
実施に伴うメモリ・バンド幅やチップ搭載メモリ需要を
減少させる。更に、そのような符号化は、3.5項で記述
されたビットストリーム・スケーリング・アルゴリズム
に対する制約の付与を回避する。・任意の特定のバンドに関して、走査行mの層qから量子
化シンボルを符号化するため条件付け文脈を生成する
際、より粗い量子化層q-1を越えて現れるあるいは現在
時量子化層qにおける走査行m-1より前に現れる情報に本
実施形態の符号化方式は依存しない。このことは、Q
_k,l,b+1を圧縮の際生成すべき(あるいは伸張の際再構築
すべき)最精細量子化層のインデックスであるとすれ
ば、バンドC_k,l,bからの多くともQ_k,l,b+1の走査行が中
間記憶域におかれなければならないことを意味する。こ
の点を明らかにするため、図８は、バッファ記憶された
走査行とこのバッファ記憶されたデータから符号化ある
いは復号される量子化層の関係を示している。5. Quantization Layer Coding The technique used to encode the quantization layers C _{k, l, b, q} associated with each band C _{k, l, b, q} is described below. . For this embodiment, the encoding algorithm is designed to keep memory and computational resources at a minimum without sacrificing compression performance. The following key points have been determined primarily to keep the intermediate buffer memory within acceptable limits. -Encode each band separately. _Inclusion of information from the same band at the previous resolution in the conditioning context used to encode C _{k, l, b, q} [m, n] offers some advantages, but improves compression efficiency Is hardly more than about 5%. It is difficult to justify this improvement in terms of increased intermediate storage capacity due to the fact that there is a natural delay of several scan lines between successive levels in the multi-resolution hierarchy.
Also, independently encoding every band simplifies the implementation of compression schemes in hardware and reduces the memory bandwidth and on-chip memory demands associated with such implementations. Further, such encoding avoids imposing constraints on the bitstream scaling algorithm described in Section 3.5. For any particular band, appear above the coarser quantization layer q-1 or in the current quantization layer q when generating a conditioning context to encode quantization symbols from layer q of scan row m. The encoding method of the present embodiment does not depend on information appearing before the scanning line m-1. This means that Q
_{If k, l, b} + 1 is the index of the finest quantization layer to be generated during compression (or reconstructed during expansion), then at most Q _k from band C _{k, l, b} _{, l, b} +1 scan lines must be placed in intermediate storage. To illustrate this point, FIG. 8 shows the relationship between a buffered scan row and a quantization layer encoded or decoded from the buffered data.

【０１１１】符号化の詳細の説明の前に、基底バンド量
子化層C_k,0,0,qは特定の符号化方式に全く左右されない
点を注記しておきたい。各シンボルV_k,0,0,q[m,n](但し
q>1)に関して単一ビットが送信された後固定長符号ワー
ドを直接使用して初期量子化値V_k,0,0,1が送信され、V
_k,0,0,q[m,n]が2V_k,0,0,q-1または2V_k,0,0,q-1+1のいず
れに等しいのか識別される。以下の記述において、各量
子化層q=1,2,...Q_k,l,bに関してデッドゾーン量子化さ
れたバンドC_k,l,bの標本の大きさM_k,l,b,q[m,n]および
符号S_k,l,b,q[m,n]を効果的に符号化するために必要と
される手順が考察される。符号化ステップは、後述され
るように多数のカテゴリに分割されるであろうが、本実
施形態は、層qに関する各サンプルの量子化された値を
符号化するために必要とされるステップが、前の量子化
層における標本がゼロであったか否かの判別に基づいて
２つの主要グループに判別できる点を始めに指摘する。
M_k,l,b,q-1[m.n]≠0であれば、符号S_k,l,b[m,n]は前の
量子化層において既に符号化されており、また、層qに
おいてより精細な量子化サイズと整合する大きさを導出
する知識を改良するだけでよい。これを改良符号化と呼
び、単純化のためそれをまず記述する。しかし、M
_k,l,b,q-1[m,n]=0の場合、層qにおける主要な処理はM
_k,l,b,q-1[m,n]もまたゼロで有るのか否かを符号化する
ことである。これはゼロ符号化と呼ばれ、5.2項で対応
するステップが記述される。M_k,l,b,q-1[m,n]≠0である
と判明すれば、符号ビットS_k,l,b[m,n]および文脈から
明らかとなっていない場合実際の大きさを符号化するた
め付加情報が取得される。これらの点は5.3項および5.4
項において説明される。ここで概説した符号化は複雑に
見えるかもしれないが、比較的少ない労力でソフトウェ
アおよびハードウェアでそれらを実施する効率的メカニ
ズムが存在する。図９に示される流れ図は、以下の項の
記述を理解する際に特に役立つものであろう。Before describing the details of the coding, it should be noted that the baseband quantization layer C _{k, 0,0, q} is completely _independent of the particular coding scheme. Each symbol V _{k, 0,0, q} [m, n] (however
q> 1), the initial quantization value V _{k, 0,0,1} is transmitted directly using the fixed-length codeword after a single bit is transmitted, and V
It is identified whether _{k, 0,0, q} [m, n] is equal to 2V _{k, 0,0, q-1} or 2V _{k, 0,0, q-1} +1. In the following description, the sample size M _{k, l, b, of} the band C _{k, l, b} subjected to the dead zone quantization for each quantization layer q = 1,2, ... Q _{k, l, b} _The procedure required to effectively encode _q [m, n] and the code S _{k, l, b, q} [m, n] is considered. Although the encoding step will be divided into a number of categories as described below, this embodiment requires that the steps required to encode the quantized value of each sample for layer q be First, it is pointed out that two main groups can be determined based on whether or not the sample in the previous quantization layer is zero.
If M _{k, l, b, q-1} [mn] ≠ 0, the code S _{k, l, b} [m, n] has already been coded in the previous quantization layer, and in the layer q It is only necessary to improve the knowledge to derive a magnitude that matches the finer quantization size. This is called improved coding, and is described first for simplicity. But M
_{If k, l, b, q-1} [m, n] = 0, the main processing in layer q is M
It is to encode whether _{k, l, b, q-1} [m, n] is also zero. This is called zero encoding, and the corresponding steps are described in Section 5.2. If M _{k, l, b, q-1} [m, n] ≠ 0, then the actual size if not clear from the sign bit S _{k, l, b} [m, n] and the context Is obtained in order to encode. These points are described in Section 5.3 and 5.4
Section. Although the encodings outlined here may seem complicated, there are efficient mechanisms to implement them in software and hardware with relatively little effort. The flowchart shown in FIG. 9 will be particularly helpful in understanding the description in the following sections.

【０１１２】5.1. 改良符号化改良符号化は極めて容易に理解できる。このケースで
は、既にM_k,l,b,q-1[m,n]がゼロでないことはわかって
いて、前の層で符号化ビットが既に符号化されているの
で、5.1. Improved Coding Improved coding is very easy to understand. In this case, we already know that M _{k, l, b, q-1} [m, n] is not zero, and the coded bits have already been coded in the previous layer, so

【０１１３】[0113]

【数３６】という２つのケースを区別するバイナリ値シンボルを送
付するだけでよい。[Equation 36] It is only necessary to send a binary value symbol that distinguishes the two cases.

【０１１４】量子化適用パラメータは、4.1項で記述さ
れたように、量子化された大きさがゼロ以外であるとわ
かったすべての層において１に等しいすなわちσ
_k,l,b,q-1=σ_k,l,b,q=1であることが保証されているの
でこのような２つのケースしか確かに存在しない点注意
されるべきである。条件付け文脈R_k,l,b,q[m,n] {0,1,
2,3}に対応して適切なシンボルが符号化される。次のよ
うな４つの状態に区分される。状態０：M_k,l,b,q-1[m,n]が可能な最小値すなわち1を持
ち、上下左右の直近隣接標本のいずれもなおゼロである
すなわちM_k,l,b,q[m,n-1]=M_k,l,b,q[m-1,n]=M_k,l,b,q-1
[m,n+1]=M_k,l,b,q-1[m+1,n]=Oであるとすれば、この文
脈に応じて改良シンボルが符号化される。これらの条件
の下では、改良シンボルもゼロである確立が高い。状態１：M_k,l,b,q-1[m,n]が可能な最小値すなわち1を持
つが、上下左右の直近隣接標本の少なくとも１つが非ゼ
ロであるか、層qにおいて影響のある過去の２つの隣接
標本あるい層q-1において影響のあある将来の２つの隣
接標本のいずれかが非ゼロであると判明すれば、この文
脈に応じて改良シンボルが符号化される。状態２：M_k,l,b,q-1[m,n]=2であれば、この文脈に応じ
て改良シンボルが符号化される。この場合、M
_k,l,b,q[m,n]=4またはM_k,l,b,q[m,n]=5であるかを識別
するため改良シンボルが使用される。状態３：M_k,l,b,q-1[m,n]≧3であれば、この文脈に応じ
て改良シンボルが符号化される。ここで、形態的変換に
よって生成された標本が符号化されていない限り、値が
明確に異なる標本の数が限定されている画像領域におい
ては、改良シンボルの統計分布は均一分布に接近し始め
ているにちがいない。これらの文脈状態に応じてバイナ
リ値シンボルを符号化する精密な条件付き算術符号化ア
ルゴリズムは5.7項において記述される。The quantization application parameter is equal to 1 or σ for all layers whose quantized magnitude is found to be non-zero, as described in Section 4.1.
It should be noted that there are certainly only two such cases, since it is guaranteed that _{k, l, b, q-1} = σ _{k, l, b, q} = 1. Conditioning context R _{k, l, b, q} [m, n] (0,1,
2,3}, an appropriate symbol is encoded. It is divided into the following four states. State 0: M _{k, l, b, q-1} [m, n] has the smallest possible value, namely 1, and any of the nearest neighbors in the vertical, horizontal, and right directions is still zero, ie, M _{k, l, b, q} [m, n-1] = M _{k, l, b, q} [m-1, n] = M _{k, l, b, q-1}
If [m, n + 1] = M _{k, l, b, q-1} [m + 1, n] = O, the improved symbol is encoded according to this context. Under these conditions, there is a high probability that the improved symbol is also zero. State 1: M _{k, l, b, q-1} [m, n] has the smallest possible value, ie, 1, but at least one of the nearest neighbors (up, down, left, right) is non-zero, or If either of the two neighboring samples affected in a past two neighboring samples or layer q-1 is found to be non-zero, the refined symbol is encoded according to this context. State 2: If M _{k, l, b, q-1} [m, n] = 2, the improved symbol is encoded according to this context. In this case, M
_An improved symbol is used to identify whether _{k, l, b, q} [m, n] = 4 or M _{k, l, b, q} [m, n] = 5. State 3: If M _{k, l, b, q−1} [m, n] ≧ 3, the improved symbol is encoded according to this context. Here, as long as the samples generated by the morphological transformation are not coded, in the image region where the number of samples with distinct values is limited, the statistical distribution of the improved symbols is beginning to approach a uniform distribution. It must be. A precise conditional arithmetic coding algorithm that encodes binary-valued symbols according to these context states is described in Section 5.7.

【０１１５】5.2. ゼロ符号化標本がゼロ以外であるといまだ判明していない場合すな
わちM_k,l,b,q-1[m,n]=0である時ゼロ符号化が必要であ
る。この場合、２つのケースすなわちM_k,l,b,q[m,n]=0
(シンボル0)とM_k,l,b,q[m.n]≠0(シンボル1)を区別する
ため単一バイナリ・シンボルが符号化されなければなら
ない。後者のケースでは、非ゼロの大きさを持つと判明
した標本の符号を含め、付加情報が符号化されなければ
ならない。しかし、これはここで記述するゼロ符号化ス
テップの一部ではない。ゼロ符号化は、最精細量子化層
以外のすべての層に関して圧縮ビット率全体の大部分を
説明するものであるので、それは全般的圧縮効率にとっ
て最も重要である。ゼロ符号化はまた符号化されたシン
ボルの大部分を説明するので、圧縮システムの全般的計
算処理の複雑性に重要な影響を与える。このような理由
から、１つの実施形態において、図６に示されるよう
に、標本の8個の最隣接標本の量子化された大きさに基
づいて、ゼロ符号化タスクは２つのケースに分けられ
る。実際には、4.1項において記述されたバイナリ値文
脈変数σ'_k,l,b,q[m,n]を使用して、これらの２つのケ
ースが区別される。5.2. Zero Coding Zero coding is required when the sample is not yet known to be non-zero, ie when M _{k, l, b, q-1} [m, n] = 0. In this case, two cases, ie, M _{k, l, b, q} [m, n] = 0
A single binary symbol must be encoded to distinguish between (symbol 0) and M _{k, l, b, q} [mn] ≠ 0 (symbol 1). In the latter case, the additional information must be encoded, including the sign of the sample that was found to have a non-zero magnitude. However, this is not part of the zero encoding step described here. It is of paramount importance to overall compression efficiency because zero coding accounts for most of the overall compression bit rate for all layers except the finest quantization layer. Since zero encoding also accounts for most of the encoded symbols, it has a significant effect on the overall computational complexity of the compression system. For this reason, in one embodiment, the zero-encoding task is divided into two cases based on the quantized magnitude of the eight nearest neighbors of the sample, as shown in FIG. . In practice, these two cases are distinguished using the binary context variables σ ′ _{k, l, b, q} [m, n] described in section 4.1.

【０１１６】隣接が非ゼロであるゼロ符号化最初のケースは、σ'_k,l,b,q[m,n]=1によって識別され
るもので、これは、標本の8個の直近隣接標本のうちの
少くとも１つがゼロ以外の大きさを持つと判明している
条件に対応する。影響のある過去の4隣接に関して利用
できる量子化された大きさは、残りの4隣接標本と共
に、図６に示されている。この場合、これら8隣接標本
の大きさに基づいて32状態の条件付け文脈Z_k,l,b,q[m,
n]が構築され、5.7項において記述される適応型算術符
号化器を使用して、この文脈に応じて、ゼロ符号化シン
ボルが符号化される(M_k,l,b,q=0なら0、それ以外は1)。
σ_k,l,b,q-1[m,n]=1という場合には、最初の16の状態Z
_k,l,b,q[m,n] {0,1,...,15}が使用される。この場合、
前の量子化層は既に符号化され、M_k,l,b[m,n]<2X(但しX
=Q_k,l,b-q+1)であることは既知であるが一方ゼロ符号化
シンボルはM_k,l,b[m,n]<2Y(但しY=Q_k,l,b-q)であるか否
かを識別する。σ_k,l,b,q-1[m,n]=0という場合には、第
２の16の状態Z_k,l,b,q[m,n] {16,17,...,31}が使用さ
れる。この場合、大きさM_k,l,b[m,n]は既知でなない。
両方のケースにおいて、ルックアップ・テーブルを256
個のインデックスσ'_k,l,b,q[m,n]という比較的大きい
セットに適用することによって特定の文脈状態が間接的
に決定される。このインデックスの形成を図６の8個の
隣接標本の大きさの観点から次に記述する。 Zero-coding with non-zero neighbors The first case is identified by σ ′ _{k, l, b, q} [m, n] = 1, which is the eight nearest neighbors of the sample. This corresponds to a condition in which at least one of the samples has been found to have a non-zero magnitude. The quantized magnitudes available for the affected past four neighbors are shown in FIG. 6, along with the remaining four neighbor samples. In this case, based on the size of these 8 adjacent samples, a 32-state conditioning context Z _{k, l, b, q} [m,
n] is constructed and, according to this context, a zero-coded symbol is coded using the adaptive arithmetic coder described in Section 5.7 (if M _{k, l, b, q} = 0 0, otherwise 1).
If σ _{k, l, b, q-1} [m, n] = 1, the first 16 states Z
_{k, l, b, q} [m, n] {0,1, ..., 15} are used. in this case,
The previous quantization layer has already been coded and M _{k, l, b} [m, n] <2X (where X
= Q _{k, l, b} -q + 1), while the zero-coded symbol is M _{k, l, b} [m, n] <2Y (where Y = Q _{k, l, b-} q) or not. If σ _{k, l, b, q-1} [m, n] = 0, the second 16 states Z _{k, l, b, q} [m, n] {16,17, ..., 31} is used. In this case, the size M _{k, l, b} [m, n] is not known.
In both cases, the lookup table is 256
The specific context state is indirectly determined by applying to a relatively large set of the indices σ ′ _{k, l, b, q} [m, n]. The formation of this index will now be described in terms of the size of the eight adjacent samples in FIG.

【０１１７】8ビット・インデックスZ'_k,l,b,q[m,n]は
次のような4つのサブワードから成る。ビット0-2：これらは、標本の水平隣接標本の大きさの
平均に関する現在時点の情報をほぼ標示するように意図
されている。具体的には、これら3ビットにThe 8-bit index Z ′ _{k, l, b, q} [m, n] is composed of the following four subwords. Bits 0-2: These are intended to roughly indicate the current point in time information about the average horizontal adjacent sample size of the sample. Specifically, these three bits

【０１１８】[0118]

【数３７】 (37)

【０１１９】のような合計が記憶される。この明きらか
に複雑な表現は、ソフトウェアにおいてさえ計算上特に
効率的であることわかる。数３７が8つの異なる値をと
ることができることは容易に検証できる。それは次のよ
うな解釈を持つ。 0: 両方の水平隣接標本はゼロである。しかし、左側の
隣接標本はM_k,l,b,q[m,n-1]≧2Y(但しY=Q_k,l,b-q)を満
たすことは可能である。両方の水平隣接標本はゼロであ
り、1: M_k,l,b,q[m,n-1]<2Y(但しY=Q_k,l,b-q)であるこ
とは既知である。 2: M_k,l,b,q[m,n-1]=1およびM_k,l,b,q-1[m,n+1]=O。 3: M_k,l,b,q[m,n-1]≧2およびM_k,l,b,q-1[m,n+1]=O、ま
たは、M_k,l,b,q[m,n-1]=0およびM_k,l,b,q-1[m,n+1]=1の
いずれか。 4: M_k,l,b,q[m,n-1]=1およびM_k,l,b,q-1[m,n+1]=1。 5: M_k,l,b,q[m,n-1]=0およびM_k,l,b,q-1[m,n+1]≧2、ま
たは、M_k,l,b,q[m,n-1]≧2およびM_k,l,b,q-1[m,n+1]=1
のいずれか。 6: M_k,l,b,q[m,n-1]=1およびM_k,l,b,q-1[m,n+1]≧2。 7: M_k,l,b,q[m,n-1]=≧2およびM_k,l,b,q-1[m,n+1]≧2。ビット3-5：これらは、標本の垂直隣接標本の大きさの
平均に関する現在時点の情報をほぼ標示するように意図
されている。これら3ビットにThe total is stored as follows. This apparently complex representation turns out to be particularly computationally efficient, even in software. It can be easily verified that Equation 37 can take eight different values. It has the following interpretation. 0: Both horizontal neighbors are zero. However, the adjacent sample on the left side can satisfy M _{k, l, b, q} [m, n−1] ≧ 2Y (where Y = Q _{k, l, b} −q). It is known that both horizontal neighbors are zero and 1: M _{k, l, b, q} [m, n-1] <2Y, where Y = Q _{k, l, b} -q. 2: M _{k, l, b, q} [m, n-1] = 1 and M _{k, l, b, q-1} [m, n + 1] = O. 3: M _{k, l, b, q} [m, n-1] ≧ 2 and M _{k, l, b, q-1} [m, n + 1] = O or M _{k, l, b, q} One of [m, n-1] = 0 and M _{k, l, b, q-1} [m, n + 1] = 1. 4: M _{k, l, b, q} [m, n-1] = 1 and M _{k, l, b, q-1} [m, n + 1] = 1. 5: M _{k, l, b, q} [m, n-1] = 0 and M _{k, l, b, q-1} [m, n + 1] ≧ 2 or M _{k, l, b, q} [m, n-1] ≧ 2 and M _{k, l, b, q-1} [m, n + 1] = 1
One of 6: M _{k, l, b, q} [m, n-1] = 1 and M _{k, l, b, q-1} [m, n + 1] ≧ 2. 7: M _{k, l, b, q} [m, n-1] = ≧ 2 and M _{k, l, b, q-1} [m, n + 1] ≧ 2. Bits 3-5: These are intended to roughly indicate the current point in time information about the average size of the vertical neighbors of the sample. These three bits

【０１２０】[0120]

【数３８】という総和が記憶される。(38) Is stored.

【０１２１】[0121]

【数３９】によって仮定される8個の異なる値が、水平ではなく垂
直隣接標本に関して[Equation 39] The eight different values assumed by

【０１２２】[0122]

【数４０】に対する対応するものと同じ解釈を持つ。(Equation 40) Has the same interpretation as the corresponding one for.

【０１２３】ビット6：前の走査行上の標本の斜めの隣
接標本のいずれかがゼロ以外であるすなわちM_k,l,b,q[m
-1,n-1]≠0またはM_k,l,b,q[m-1,n+1]≠0のいずれかある
いは両方であると判明した場合に限って、このビットは
1にセットされる。ビット7：次の走査行上の標本の斜めの隣接標本のいず
れかがゼロ以外であるすなわちM_k,l,b,q-1[m+1,n-1]≠0
またはM_k,l,b,q-1[m+1,n+1]≠0のいずれか両方であると
判明した場合に限って、このビットは1にセットされ
る。Bit 6: Any of the diagonally adjacent samples of the sample on the previous scan line is non-zero, ie, M _{k, l, b, q} [m
-1, n-1] ≠ 0 or M _{k, l, b, q} [m-1, n + 1] ≠ 0 only if this bit is known to be
Set to 1. Bit 7: Any of the diagonal adjacent samples of the sample on the next scan line is non-zero, ie, M _{k, l, b, q-1} [m + 1, n-1] ≠ 0
Or, this bit is set to 1 only if it is found that both of M _{k, l, b, q-1} [m + 1, n + 1] ≠ 0.

【０１２４】最終的ゼロ符号化状態Z_k,l,b,q[m,n]は、The final zero encoding state Z _{k, l, b, q} [m, n] is

【０１２５】[0125]

【数４１】 [Equation 41]

【０１２６】のように、各々が0から15までの範囲の整
数を生成する２つの256エレメント・ルックアップ・テ
ーブルのうちの１つを適用することによって、中間イン
デックスZ'_k,l,b,q[m,n]から作成される。By applying one of the two 256-element look-up tables, each producing an integer in the range from 0 to 15, as in the intermediate index Z ′ _{k, l, b, q} Created from [m, n].

【０１２７】ルックアップ・テーブルの目的は文脈状態
の総数を減らすことであり、これは、圧縮方式のメモリ
要件に大きな影響を及ぼすと共に、適応型算術符号化に
関連する適応コストを減少させる。後者の効果は、比較
的小さい画像を圧縮している時に特に重要である。上述
の実施形態においてあらゆるバンドあるいはあらゆる量
子化層について別々のルックアップ・テーブルを維持し
ていないという点に注意されるべきである。実際、異な
るタイプのバンドb=0,1,2,3の各々について２つの異な
るルックアップ・テーブルだけが使用される。The purpose of the look-up table is to reduce the total number of context states, which has a significant effect on the memory requirements of the compression scheme and reduces the adaptation costs associated with adaptive arithmetic coding. The latter effect is particularly important when compressing relatively small images. It should be noted that in the embodiments described above, a separate look-up table is not maintained for every band or every quantization layer. In fact, only two different look-up tables are used for each of the different types of bands b = 0,1,2,3.

【０１２８】bの異なる値が異なる方向特性を持つ画像
特徴に対応するものであるので、bに対する依存性は重
要であり、これは最適状態減少対応付けに対して大きな
効果を持つ。これらのルックアップ・テーブルのコンテ
ントは、すべての画像について固定されることが期待さ
れるが、本明細書は、それらコンテントをカテゴリ別に
指定しない。本発明人の経験によれば、訓練画像の集合
から得られた統計量に対して単一セットのテーブルが最
適化された。その特定の実施形態に関する限り、最適化
アルゴリズムは、枝の多いツリーの刈り込みと反復的Ｌ
ＢＧ(Lind-Buzo-Gray)最適化手法の比較的単純な組み合
せである。Since different values of b correspond to image features having different directional characteristics, the dependence on b is important and has a significant effect on optimal state reduction association. Although the content of these look-up tables is expected to be fixed for all images, this specification does not specify them by category. According to the inventor's experience, a single set of tables has been optimized for the statistics obtained from the set of training images. As far as that particular embodiment is concerned, the optimization algorithm is a pruned tree pruning and iterative L
A relatively simple combination of BG (Lind-Buzo-Gray) optimization techniques.

【０１２９】隣接標本がすべてゼロの場合のゼロ符号化標本の直近8個の隣接標本のいずれもが非ゼロと判明し
ていない場合のゼロ符号化手順を以下に記述する。この
比較的頻繁な事象を利用すれば、１つのタイプのランレ
ングス符号化方式を用いることによって、符号化される
べきシンボルの総数を減らし、従って、全般的算術符号
化の複雑さを減少させることができる。考え方は、既に
符号化された情報(すなわち復号器で現在利用できる情
報)を所与として、まず、その直近隣接標本が潜在的に
ゼロのままである可能性のある同一走査行上の連続的標
本の最大数C_k,l,b[m,n'](但しN'≧n)を決定することで
ある。具体的には、そのような標本の各々は、σ"
_k,l,b,q[m,n']=0を満たさなければならない。但し、The zero-encoding procedure when none of the eight nearest neighbors of the zero-encoded sample is known to be non-zero when the adjacent samples are all zero is described below. Taking advantage of this relatively frequent event, by using one type of run-length coding scheme, the total number of symbols to be coded is reduced, thus reducing the complexity of the overall arithmetic coding. Can be. The idea is that given information that has already been encoded (i.e. the information currently available at the decoder), the first step is to make successive samples on the same scan row whose nearest neighbors can potentially remain zero. The purpose is to determine the maximum number of samples C _{k, l, b} [m, n ′] (where N ′ ≧ n). Specifically, each such sample has a σ "
_{k, l, b, q} [m, n '] = 0 must be satisfied. However,

【０１３０】[0130]

【数４２】 (Equation 42)

【０１３１】である。n'_maxをσ"_k,l,b,q[m,n']=0に関
する最大連続列インデックスを示すものとする。本実施
形態のランレングス符号化方式はほぼ次の２つのステッ
プから成る。 1)最大ランレングスn'_max-nを条件にバイナリ値シンボ
ルを符号化し、この間隔における任意のn'についてM
_k,l,b,q[m,n']≠0であるか否かを識別する。 2)そのようなゼロ以外の大きさが最大ランレングスの範
囲内に発生すれば、固定長符号ワードを使用してインデ
ックスn'-nを送る。Is as follows. Let n ′ _max denote the maximum continuous sequence index for σ ″ _{k, l, b, q} [m, n ′] = 0. The run-length encoding scheme of this embodiment consists of the following approximately two steps: 1) Encode a binary value symbol subject to a maximum run length n ' _max -n and use M for any n' in this interval.
_{k, l, b, q} [m, n '] ≠ 0 is identified. 2) If such non-zero magnitude occurs within the maximum run length, send the index n'-n using a fixed length codeword.

【０１３２】上述の通り、この方式はたくさんの落し穴
を持つ。第１に、実際的な理由から考慮される最大ラン
レングスになんらかの限度を設ける必要がある。第２
に、符号化適応コストをメモリ資源とともに最小にする
ため、最大ランレングスn'_max-nによって仮定される可
能性のある大きいセットの値から条件付け状態の比較的
限定されたセットを導出する必要がある。最後に、その
隣接標本がすべてゼロではない標本に非常に近い標本の
ため、それらが異なる統計を持つ可能性が高いので、ゼ
ロ符号化方針を修正する必要がある。上記のような問題
に対処するように設計された実際の符号化方式を次に記
述する。As described above, this method has many pitfalls. First, some limit must be placed on the maximum run length considered for practical reasons. Second
In order to minimize the coding adaptation cost along with the memory resources, it is necessary to derive a relatively limited set of conditioning states from a large set of values that may be assumed by the maximum run length n ' _max -n. is there. Finally, because the samples whose neighbors are very close to non-zero samples are likely to have different statistics, the zero encoding strategy needs to be modified. The actual coding scheme designed to address the above problems will now be described.

【０１３３】第１に、First,

【０１３４】[0134]

【数４３】 [Equation 43]

【０１３５】に従って、n"_maxを定義する。上式におい
て、n'_max=W(C_k,l,b)は、走査行の終わりに至るまです
べての標本n'≧nはσ"_k,l,b,q[m,n']=0を持つことを意
味する点に注意する必要がある。基本的には、この定義
は、最大ランレングスが127を上回らず、右側直近隣接
標本が非ゼロであるとわかっている標本で終了すること
を確実にするため最大ランレングスが切り捨てられるこ
とを意味する。According to the above equation, n ′ _max is defined as follows: n ′ _max = W (C _{k, l, b} ) indicates that all samples n ′ ≧ n are σ ″ _k, Note that it means that we have _{l, b, q} [m, n '] = 0. Basically, this definition means that the maximum run length is truncated to ensure that the maximum run length does not exceed 127 and that the right nearest neighbor sample ends with a sample known to be non-zero. I do.

【０１３６】次に、下記のような解釈を持つ8状態ラン
レングス文脈U_k,l,b,q[m,n]が定義される。状態０：n"_max<n+1またはσ'_k,l,b,q[m,n-1]≠0であれ
ば、この状態が使用される。この場合、この標本は、そ
の隣接標本がすべてゼロではない別の標本に非常に接近
しているとみなされ、実際にはランレングス符号化は使
用されない。その代わりに、この文脈状態は、M_k,l,b,q
[m,n]がゼロに等しいか否かを符号化するために使用さ
れる。状態１：n"_max=1ならこの状態を使用して、n' {n,n+1}
に関してM_k,l,b,q[m,n']が非ゼロか否かを符号化する。
そうであれば、最初のそのようなn'がnまたはn+1に等し
いか否かを識別する単一ビット(適応型算術符号化では
ない)を送る。状態２：n"_max {2,3}であればこの状態を使用して、n'
{n,n+1,n+2,n+3}に関してM_k,l,b,q[m,n']が非ゼロか
否かを符号化する。そうであれば、最初のそのようなn'
を識別する２ビット固定長符号ワードを送る。 ... 状態７：n"_max {64,65,..,127}であればこの状態を使
用して、n' {n,n+1,n+2,...,n+127}に関してM
_k,l,b,q[m,n']が非ゼロか否かを符号化する。そうであ
れば、最初のそのようなn'を識別する7ビット固定長符
号ワードを送る。Next, an 8-state run-length context U _{k, l, b, q} [m, n] having the following interpretation is defined. State 0: This state is used if n " _max <n + 1 or σ ' _{k, l, b, q} [m, n-1] ≠ 0. In this case, this sample is its neighbor Are considered very close to another sample that is not all zeros, and run-length encoding is not actually used. Instead, this context state is represented by M _{k, l, b, q}
Used to encode whether [m, n] is equal to zero. State 1: If n " _max = 1, use this state and n '{n, n + 1}
Encode whether M _{k, l, b, q} [m, n ′] is non-zero.
If so, send a single bit (not adaptive arithmetic coding) identifying whether the first such n 'is equal to n or n + 1. State 2: If n " _max {2,3}, use this state and n '
For {n, n + 1, n + 2, n + 3}, encode whether M _{k, l, b, q} [m, n ′] is non-zero. If so, first such n '
Send a 2-bit fixed-length codeword identifying ... State 7: If n " _max {64,65, .., 127}, use this state and n '{n, n + 1, n + 2, ..., n + 127} With respect to M
_Encode whether _{k, l, b, q} [m, n '] is non-zero. If so, send the first 7-bit fixed-length codeword identifying such n '.

【０１３７】5.3. 符号ビット符号化標本が量子化層qにおいて非ゼロである、すなわち、M
_k,l,b,q-1[m,n]=0であるがM_k,l,b,q[m,n]≠0と最初に判
明する時、最初のタスクは符号ビットS_k,l,b,q[m,n]を
符号化することである。符号化される必要がある情報が
他にもあるかもしれないが、その点は5.4で記述するこ
とにする。符号ビットを符号化する必要が全くない１つ
の特殊なケースがある。それは、オリジナル画像が２つ
のレベルだけを持つこと(すなわち標本当たり１ビット)
が既知で、2.2項で記述された形態的変換が使用されて
いる場合である。この場合、形態的変換の符号予測コン
ポーネントは、符号ビットが常にゼロであるべきことを
保証する。しかしながら、適応型算術符号化器は、この
事実を符号化器に具体的に通知するのではなく、そのよ
うな極度に偏った統計量を発見し、可能な限り少ないビ
ットで決定性情報を符号化することができるようにすれ
ば十分である。5.7項に記述される特定の算術符号化
は、そのような条件の下でシンボル当たり2-15ビット程
度の低率を達成することができる。5.3. Sign bit encoding The sample is non-zero in the quantization layer q, ie, M
_{When k, l, b, q-1} [m, n] = 0 but M _{k, l, b, q} [m, n] ≠ 0 is first found, the first task is the sign bit S _{k, l, b, q} [m, n]. There may be other information that needs to be encoded, but that will be described in 5.4. There is one special case where there is no need to encode the sign bit. It means that the original image has only two levels (ie one bit per sample)
Is known and the morphological transformation described in Section 2.2 is used. In this case, the sign prediction component of the morphological transformation ensures that the sign bit should always be zero. However, rather than specifically signaling the encoder to this fact, the adaptive arithmetic encoder discovers such extremely biased statistics and encodes deterministic information with as few bits as possible. It is enough to be able to. The particular arithmetic coding described in section 5.7 can achieve rates as low as 2-15 bits per symbol under such conditions.

【０１３８】符号ビットは、下記のような４ビット表現
形態の１６状態文脈G_k,l,b,q[m,n]に対応して符号化さ
れる。ビット０：水平に隣接する標本のいずれかが非ゼロの大
きさおよび正符号を持つと判明したならば、このビット
は１に設定される(符号ビットはゼロに等しい)。ビット１：垂直に隣接する標本のいずれかが非ゼロの大
きさおよび正符号を持つと判明したならば、このビット
は１に設定される(符号ビットは１に等しい)。ビット２：垂直に隣接する標本のいずれかが非ゼロの大
きさおよび正符号を持つと判明したならば、このビット
は１に設定される(符号ビットはゼロに等しい)。ビット３：水平に隣接する標本のいずれかが非ゼロの大
きさおよび負符号を持つと判明したならば、このビット
は１に設定される(符号ビットは１に等しい)。The sign bit is encoded corresponding to the 16-state context G _{k, l, b, q} [m, n] in the following 4-bit representation form. Bit 0: If any of the horizontally adjacent samples are found to have non-zero magnitude and positive sign, this bit is set to 1 (sign bit equals zero). Bit 1: If any of the vertically adjacent samples are found to have a non-zero magnitude and positive sign, this bit is set to 1 (sign bit equals 1). Bit 2: If any of the vertically adjacent samples are found to have non-zero magnitude and positive sign, this bit is set to 1 (sign bit equals zero). Bit 3: If any of the horizontally adjacent samples are found to have non-zero magnitude and negative sign, this bit is set to 1 (sign bit equals 1).

【０１３９】5.4. 初期的大きさ符号化量子化層qにおいて最初に非ゼロである(すなわちM
_k,l,b,q-1[m,n]=0)と判明する標本の実際の大きさM
_k,l,b,q[m,m]を符号化する方法を以下記述する。本実施
形態の適応型量子化方針の結果として、驚くほど多数の
考慮されるべきケースが存在する。以下の記述は図９の
流れ図の特に下の部分を参照する。5.4. Initial magnitude coding In the quantization layer q, it is first non-zero (ie, M
_{k, l, b, q-1} [m, n] = 0) actual sample size M
_The method of encoding _{k, l, b, q} [m, m] is described below. As a result of the adaptive quantization strategy of this embodiment, there are a surprising number of cases to consider. The following description refers in particular to the lower part of the flow chart of FIG.

【０１４０】本題に入る前に、いくつかの新しい表記法
を紹介する。Before going into the subject, some new notations are introduced.

【０１４１】[0141]

【数４４】 [Equation 44]

【０１４２】を復号器によって受け取られるようなバン
ドC_k,l,b,qに関してビットストリームに実際に含められ
る最初の量子化層を示すものとする。この値は、3.2項
で記述されたビットストリーム・ヘッダに記録される。
更に、Let denote the first quantization layer that is actually included in the bitstream for bands C _{k, l, b, q} as received by the decoder. This value is recorded in the bitstream header described in section 3.2.
Furthermore,

【０１４３】[0143]

【数４５】を符号化器がこのバンドに関して符号化しようとする最
初の量子化層を示すものとする。[Equation 45] Denote the first quantization layer that the encoder attempts to encode for this band.

【０１４４】[0144]

【数４６】の値は画像とは無関係であり、符号化器および復号器に
は事前に既知のものであると仮定する。最も簡単なケー
スでは、すべてのバンドについて、[Equation 46] Is independent of the image and is assumed to be known a priori to the encoder and decoder. In the simplest case, for all bands,

【０１４５】[0145]

【数４７】 [Equation 47]

【０１４６】とされる。一層具体的に述べれば、符号化
器の計算処理およびメモリ要件は、いずれにせよ一層粗
い量子化層が役立つとは期待されない一層大きい解像度
層lに対して上記数４5という一層大きい値を使用するこ
とによって減少させることができる。このケースにおい
て、符号化器および復号器の両方が事前に上記数４5の
値を知っているという本実施形態の仮定は、この情報が
(上記3項に記述のビットストリーム構文でカバーされて
ない)ビットストリームに組み込まれないか、あるいは
例えば層インデックスlと上記数４５の間の固定的関係
が普遍的に与えられなければならないことを意味する。Is set. More specifically, the computational processing and memory requirements of the encoder use the larger value of (45) for a larger resolution layer l where a coarser quantization layer is not expected to be useful anyway. Can be reduced. In this case, the assumption of this embodiment that both the encoder and the decoder know in advance the value of Equation 45 above is that this information
It must not be embedded in the bitstream (not covered by the bitstream syntax described in paragraph 3 above), or that, for example, a fixed relationship between layer index l and equation 45 above must be universally given. means.

【０１４７】上記数４５とは対照的に、画像はメモリ必
要量の減少のため到着の都度漸増的に符号化されると本
実施形態は仮定しているので、符号化器は、画像全体を
符号化するまで、In contrast to Equation 45 above, since the present embodiment assumes that the images are coded incrementally on arrival as the memory requirements are reduced, the encoder must encode the entire image. Until encoding,

【０１４８】[0148]

【数４８】 [Equation 48]

【０１４９】を知ることができない。具体的に述べれ
ば、量子化器は、量子化される大きさM_k,l,b,q[m,n]が
非ゼロであると判明した最初の層(上記数４８)を知るこ
とができた後、各バンドC_k,l,bに対してすべての量子化
層q=1,2,...,Q_k,l,bを符号化することを試みる。量子化
器は次に前のすべての層を破棄し、その結果、復号器に
よって受け取られる最終ビットストリームはそれら層Cannot be known. Specifically, the quantizer knows the first layer (Equation 48 above) where the quantized magnitude M _{k, l, b, q} [m, n] was found to be non-zero. After that, an attempt is made to encode all quantization layers q = 1, 2,..., Q _{k, l, b} for each band C _{k, l, b} . The quantizer then discards all previous layers, so that the final bit stream received by the decoder

【０１５０】[0150]

【数４９】 [Equation 49]

【０１５１】に関してのみ符号化されたビットを含むこ
ととなる。たとえ符号化器が上記数４８を知らないとし
ても、符号化器が任意の特定のバンドを符号化している
間、層Only the encoded bits are included. While the encoder is not aware of equation (48), while the encoder is encoding any particular band, the layer

【０１５２】[0152]

【数５０】 [Equation 50]

【０１５３】において、M_k,l,b,q-1[m,n]=Oである非ゼ
ロの大きさM_k,l,b,q[m,n]に出会う度毎に、層q-1が復号
器にとって実際に利用できるか否かに関係なく、復号器
はM_k,l,b,q-1[m,n]=0という事実に気付くことは確実で
ある。この点は、標本が最初に非ゼロであるとわかる時
符号化されなければならない情報の量を減らすことに役
立つ。Each time a non-zero magnitude M _{k, l, b, q} [m, n] where M _{k, l, b, q-1} [m, n] = O is encountered, the layer q Regardless of whether -1 is actually available to the decoder, it is certain that the decoder will notice the fact that M _{k, l, b, q-1} [m, n] = 0. This helps reduce the amount of information that must be encoded when the sample is first found to be non-zero.

【０１５４】従って、層q-1においてその量子化された
大きさが非ゼロであった標本の初期に量子化される大き
さM_k,l,b,q[m,n]を符号化する時区別されるべき２つの
主要なケースが存在する。それらのケースはqが上記数
４５に等しいか否かによって区別される。q>数４５であ
れば、量子化される大きさM_k,l,b,q[m,n]の最大値に関
してなんらかの限界が存在するので、付加的情報の符号
化の必要性はない。このケースは5.5項で更に取り上げ
られる。しかしながら、q=数４５であれば、大きさM
_k,l,b,q[m,n]に関して上限を仮定することはできない。
このケースでは、上限を設定するため１または複数のバ
イナリ値シンボルをまず符号化し、次に固定長符号を使
用して適切な範囲内でその大きさ自体を表現する。この
ケースは5.6項で更に取り上げられる。その前に、符号
化文脈状態のうちの１つが両方のケースで使用されるの
で、それら状態の使用について記述する。Accordingly, the size M _{k, l, b, q} [m, n] that is quantized in the early stage of the sample whose quantized size is non-zero in the layer q-1 is encoded. There are two main cases that need to be differentiated. These cases are distinguished by whether q is equal to Equation 45 above. If q> 45, there is no limit on the maximum value of the quantized magnitude M _{k, l, b, q} [m, n], so there is no need to encode additional information. This case is further addressed in Section 5.5. However, if q = Equation 45, the size M
_No upper bound can be assumed for _{k, l, b, q} [m, n].
In this case, one or more binary-valued symbols are first encoded to set an upper limit, and then the magnitude itself is represented within a suitable range using fixed-length codes. This case is further addressed in Section 5.6. Before that, the use of one of the encoding context states is described since they are used in both cases.

【０１５５】新たな非ゼロ標本値の初期的大きさに関す
る符号化限度について用意される合計４つの環境状態が
ある。それらの解釈および用法は次のとおりである。状態０：この状態を使用してM_k,l,b,q[m,n]=1(シンボル
0)またはM_k,l,b,q[m,n]≧2(シンボル1)か否かを符号化
する。状態１：この状態を使用してM_k,l,b,q[m,n] {2,3}(シ
ンボル0)またはM_k,l,b,q[m,n]≧4(シンボル1)か否かを
符号化する。状態２：この状態を使用してM_k,l,b,q[m,n] {4,5,6,7}
(シンボル0)またはM_k,l,b,q[m,n]≧8(シンボル1)か否か
を符号化する。状態３：この状態は5.6項で記述される大きさM
_k,l,b,q[m,n]に対する上限識別に必要なものとして使用
される。４つの文脈状態という独立セットは、より精細な量子化
層で使用されことはほどんどないが、各バンドの各量子
化層に対して維持される。There are a total of four environmental conditions that are provided for the encoding limits for the initial magnitude of a new non-zero sample value. Their interpretation and usage are as follows. State 0: Using this state, M _{k, l, b, q} [m, n] = 1 (symbol
0) or M _{k, l, b, q} [m, n] ≧ 2 (symbol 1). State 1: M _{k, l, b, q} [m, n] {2,3} (symbol 0) or M _{k, l, b, q} [m, n] ≧ 4 (symbol 1 ) Or not. State 2: Using this state, M _{k, l, b, q} [m, n] {4,5,6,7}
(Symbol 0) or M _{k, l, b, q} [m, n] ≧ 8 (symbol 1) is encoded. State 3: This state has the size M described in section 5.6.
_{k, l, b, q} Used as necessary for upper limit identification for [m, n]. An independent set of four context states is rarely used in finer quantization layers, but is maintained for each quantization layer in each band.

【０１５６】5.5. q>数４５の場合の初期的大きさの符
号化このケースにおいては、符号化器および復号器の両方は
_Mk,l,b,q[m,n]=0という事実を知っていて、可能なケー
スは以下の３つだけである。ケース１：現在時量子化層qおよび前の量子化層q-1の両
方における標本の直近８標本のうちの少なくとも１つが
非ゼロであるすなわちσ'_k,l,b,q[m,n]=σ'
_k,l,b,q-1[m,n]=1とわかっている。このケースでは、M
_k,l,b[m,n]<2X(但しX=Q_k,l,b-q+1)すなわちM_k,l,b,q[m,
n]<2であることが前の量子化層から判明するので、新た
な非ゼロの大きさは1であるに違いなく、符号化の必要
はない。ケース２：現在時量子化層qにおける標本の直近８標本
のうちの少なくとも１つが非ゼロであるが前の量子化層
q-1における標本のすべてはゼロであるすなわちσ'
_k,l,b,q[m,n]=１でσ'_k,l,b,q-1[m,n]=0であるとわかっ
ている。このケースでは、M_k,l,b,q[m,n]<2X(但しX=Q
_k,l,b-q+2)すなわちM_k,l,b,q[m,n] {1,2,3}であること
が前の量子化層から判明する。実際の大きさを符号化す
るため、このシンボルの符号化を条件付ける大きさ文脈
状態０を使用して、大きさが１に等しい(シンボル0)か1
より大である(シンボル1)か識別するようにバイナリ・
シンボルを符号化する。値が1より大きければ、改良符
号化プロセスに進み、上記5.1項で記述したものと全く
同じ手順および文脈状態を使用してM_k,l,b,q[m,n]を符
号化する。ケース３：標本の直近8標本のうちのいずれも非ゼロで
あると判明していないすなわちσ'_k,l,b,q[m,n]=0であ
る。このケースでは、M_k,l,b,q[m,n] {2,3}であり、や
はり改良符号化プロセスに進み、上記5.1項で記述した
ものと全く同じ手順および文脈状態を使用してM_k,l,b,q
[m,n]の最下位ビットを符号化する。以上の諸ステップ
は、図９の流れ図の右下部分に示されている。5.5. Initial Size Encoding for q> Equation 45 In this case, both the encoder and the decoder
Knowing the fact that _{Mk, l, b, q} [m, n] = 0, there are only three possible cases: Case 1: At least one of the eight most recent samples of the samples in both the current quantization layer q and the previous quantization layer q-1 is nonzero, ie, σ ′ _{k, l, b, q} [m, n ] = σ '
It is known that _{k, l, b, q-1} [m, n] = 1. In this case, M
_{k, l, b} [m, n] <2X (where X = Q _{k, l, b} -q + 1), that is, M _{k, l, b, q} [m,
Since the previous quantization layer reveals that n] <2, the new non-zero magnitude must be 1 and does not need to be coded. Case 2: at least one of the last eight samples of the current quantization layer q is non-zero but the previous quantization layer
All of the samples at q-1 are zero, ie σ '
It is known that σ ′ _{k, l, b, q−1} [m, n] = 0 when _{k, l, b, q} [m, n] = 1. In this case, M _{k, l, b, q} [m, n] <2X (where X = Q
_{k, l, b} -q + 2), that is, M _{k, l, b, q} [m, n] {1,2,3} is found from the previous quantization layer. To encode the actual magnitude, use magnitude context state 0, which conditions the encoding of this symbol, with magnitude equal to 1 (symbol 0) or 1
Binary to identify larger (symbol 1)
Encode the symbol. If the value is greater than 1, proceed to the improved encoding process and encode M _{k, l, b, q} [m, n] using exactly the same procedure and context state as described in section 5.1 above. Case 3: None of the last eight samples are known to be non-zero, ie, σ ′ _{k, l, b, q} [m, n] = 0. In this case, M _{k, l, b, q} [m, n] {2,3}, again proceeding to the improved encoding process, using exactly the same procedure and context state as described in section 5.1 above. M _{k, l, b, q}
Encode the least significant bit of [m, n]. These steps are shown in the lower right part of the flowchart in FIG.

【０１５７】5.6. q=数４５の場合の初期大きさの符号
化 qが復号器が受ける最初の量子化層であるかどうかまだ
わからない場合、大きさに関する上限を識別するため付
加情報を符号化しなければならない。符号化手順は次の
２つの主要ステップを含む。ステップ１：5.6. Encoding of Initial Size When q = Equation 45 If it is not yet known whether q is the first quantization layer received by the decoder, the additional information is encoded to identify the upper limit on the size. There must be. The encoding procedure involves two main steps: Step 1:

【０１５８】[0158]

【数５１】 (Equation 51)

【０１５９】として表記されるM_k,l,b,q[m,n]の最下位
以外のすべてのビットの値を符号化する。ステップ２：上記5.1項で記述したものと全く同じ手順
および文脈状態を使用してM_k,l,b,q[m,n]の最下位ビッ
トを符号化する。Encode the values of all but the least significant bits of M _{k, l, b, q} [m, n], denoted as Step 2: Encode the least significant bits of M _{k, l, b, q} [m, n] using exactly the same procedure and context state as described in section 5.1 above.

【０１６０】M'_k,l,b,q[m,n]を符号化する前に考慮すべ
き２つのケースがある。ケース１：標本の直近8標本のうちのいずれも非ゼロで
あると判明していないすなわちσ'_k,l,b,q[m,n]=0であ
る。このケースでは、M_k,l,b,q[m,n]≧2であり、従って
M'_k,l,b,q[m,n]≠0であることが既に判明している。ケース２：標本の直近8標本のうちの１つまたは複数が
非ゼロであるすなわちσ'_k,l,b,q[m,n]=0とわかってい
る。このケースでは、このシンボルの符号化を条件付け
る大きさ文脈状態を使用してM_k,l,b,q[m,n]=1(シンボル
0)であるかまたはM_k,l,b,q[m,n]≧2(シンボル1)である
かをまず符号化しなければならない。M_k,l,b,q[m,n]=1
であれば、これ以上の符号化の必要はなく、それ以外は
M'_k,l,b,q[m,n]≠0としなければならない。There are two cases to consider before encoding M ′ _{k, l, b, q} [m, n]. Case 1: None of the eight most recent samples is known to be non-zero, ie, σ ′ _{k, l, b, q} [m, n] = 0. In this case, M _{k, l, b, q} [m, n] ≧ 2, and therefore
It has already been found that M ′ _{k, l, b, q} [m, n] ≠ 0. Case 2: One or more of the last eight samples of the sample are known to be non-zero, ie, σ ′ _{k, l, b, q} [m, n] = 0. In this case, M _{k, l, b, q} [m, n] = 1 (symbol
0) or M _{k, l, b, q} [m, n] ≧ 2 (symbol 1) must be encoded first. M _{k, l, b, q} [m, n] = 1
Then no further encoding is needed, otherwise
M ′ _{k, l, b, q} [m, n] ≠ 0.

【０１６１】M'_k,l,b,q[m,n]≠0であると判断したなら
ば、M'_k,l,b,q[m,n]<2^eのような最小指数eをまず符号化
する。大きさ文脈状態を使用して、e=1(シンボル=0)で
あるかあるいはe≧2(シンボル1)であることを標示する
バイナリ値シンボルを符号化する。後者のケースである
と判明すれば、文脈状態２を使用して、e=2(シンボル=
0)であるかあるいはe≧3であることを標示するバイナリ
値シンボルを符号化する。更に後者のケースであると判
明すれば、文脈状態３を使用して、e=3(シンボル=0)で
あるかあるいはe≧3であることを標示するバイナリ値シ
ンボルを符号化する。このようにeの値が完全に符号化
されるまでe=3,4,..か否かの符号化を反復する。このよ
うな方式はしばしばコンマ符号と呼ばれる。eの値が符
号化されたなら、値M'_k,l,b,q[m,n]-2^eを表すためe-1ビ
ットの固定長符号ワードを送り出す。[0161] _{M 'k, l, b,} q [m, n] if it is determined that ≠ is _{0, M' k, l,} b, q [m, n] < minimum exponent e as 2 ^e Is first encoded. The magnitude context state is used to encode a binary value symbol indicating that e = 1 (symbol = 0) or e ≧ 2 (symbol 1). If it turns out to be the latter case, using context state 2, e = 2 (symbol =
0) or a binary value symbol indicating that e ≧ 3. If the latter proves to be the case further, context state 3 is used to encode a binary-valued symbol indicating that e = 3 (symbol = 0) or that e ≧ 3. In this way, the encoding of e = 3, 4,... Is repeated until the value of e is completely encoded. Such a scheme is often called a comma code. If the value of e is coded, the value M _{'k, l, b,} sends the _{q [m, n] e-} 1 -bit fixed length code words to represent -2 ^e.

【０１６２】5.7. 適応型算術符号化器前述の多数の固定長符号ワードは別として、任意の所与
の量子化層からの情報の大部分は、適応型算術符号化器
を適用してなんらかの文脈状態に対応してバイナリ値シ
ンボル(0または1)を符号化することによって作り上げら
れる。各状態に関連した分布は、同じ文脈状態の範囲内
で以前に符号化されたすべてのシンボルの統計を反映す
るように別々に更新される。P(1)をシンボルが1に等し
い確率を表すものとする。これは、シンボルがバイナリ
分布を特徴づけるのに十分なものである。実際には、実
際の確率はわからず、むしろ、P(1)は、同じ文脈状態で
以前に符号化されたシンボルに完全に基づいた確率推定
を表す。本明細書では、所与の分布に応じてバイナリ値
シンボルを符号化するために使用される基本的算術符号
化アルゴリズムを記述しない。なぜならそれはよく知ら
れていて、比較的単純であるからである。しかし、注意
すべき点であるが、確率およびこのアルゴリズムの必要
不可欠な部分を形成する中間アキュムレータに対して正
確に同じ固定小数点表現を使用することは符号化器およ
び復号器両方にとって重要である。本実施形態におい
て、アキュムレータは３１ビット精度を持ち、確率はソ
フトウェアの実施にとって特に便利な量であると認めら
れる１５ビット精度を使用して表現される。残る問題は
P(1)に関して適切な推定を効率的に計算するアルゴリズ
ムである。5.7. Adaptive Arithmetic Encoder Apart from the large number of fixed length codewords described above, most of the information from any given quantization layer is applied to the adaptive arithmetic coder to apply some Created by encoding a binary value symbol (0 or 1) corresponding to the context state. The distribution associated with each state is updated separately to reflect the statistics of all previously encoded symbols within the same context state. Let P (1) denote the probability that the symbol is equal to one. This is enough for the symbols to characterize a binary distribution. In practice, the actual probabilities are not known; rather, P (1) represents a probability estimate based entirely on previously coded symbols in the same context state. This document does not describe the basic arithmetic coding algorithm used to encode binary-valued symbols according to a given distribution. Because it is well known and relatively simple. It should be noted, however, that using the exact same fixed-point representation for the probabilities and the intermediate accumulators that form an integral part of the algorithm is important for both the encoder and the decoder. In this embodiment, the accumulator has 31-bit precision, and the probabilities are represented using 15-bit precision, which is perceived to be a particularly convenient quantity for software implementation. The remaining problem is
An algorithm that efficiently calculates an appropriate estimate for P (1).

【０１６３】極端に小さい値のP(1)は正確に表現される
必要があるかもしれないが、P(1)は実質的には1に極め
て近いという事実を活用するように以下に記述するアル
ゴリズムは設計されている。なぜなら、上述のすべての
シンボル定義が最もありそうなシンボルとしてゼロの選
択の方へ偏っているが、２，３の例外ケースとしていず
れのシンボルも別のシンボルより一層ありそうであると
いうことを支持できる先験的理由はないからである。ア
ルゴリズムは、また、すべての状態情報を単一１６ビッ
ト・ワードで保持するように設計されている。この情報
は効率的に更新され、ソフトウェアでの実施の単純なル
ックアップ・テーブルか、ハードウェアでの実施のため
の小さいルックアップ・テーブルと低精度乗数の効率的
組合せを使用して、更新情報に基づいたP(1)の推定が取
り出される。注意すべき点であるが、ＪＰＥＧおよびＪ
ＢＩＧ両方の算術符号化オプションに適応さられたＩＢ
Ｍの有名なＱ符号化器は、文脈状態情報を記憶するため
8ビットのみを必要とするが、この符号化器に依存する
と一般になにがしかの圧縮効率が犠牲にされる。Although extremely small values of P (1) may need to be accurately represented, the following is described to take advantage of the fact that P (1) is practically very close to 1. The algorithm is designed. Because all the above symbol definitions are biased towards the choice of zero as the most likely symbol, we support a few exceptional cases where any symbol is more likely than another symbol. There is no a priori reason that can be done. The algorithm is also designed to hold all state information in a single 16-bit word. This information is updated efficiently, using either a simple look-up table in software implementation or an efficient combination of a small look-up table and low precision multiplier for hardware implementation. An estimate of P (1) based on is derived. Note that JPEG and J
IB exposed to both BIG arithmetic coding options
M's famous Q encoder is used to store context state information.
Although only 8 bits are required, relying on this encoder generally sacrifice some compression efficiency.

【０１６４】16ビット状態ワードは次のように３つのグ
ループに区分される。 S:これは、上述されたシンボル数の正規表現を保持する
7ビット量である。 N:これは、上述された1の数の正規表現を保持する6ビッ
ト量である。 E:これは、Nの値がゼロと等しい時、非常に小さい値のP
(1)を推定するために使用される3ビットのべき数であ
る。The 16-bit status word is divided into three groups as follows. S: This holds the regular expression for the number of symbols mentioned above
7-bit quantity. N: This is a 6-bit quantity that holds the 1 regular expression described above. E: This is a very small value of P when the value of N is equal to zero.
A 3-bit power number used to estimate (1).

【０１６５】これらの３つの量は次のように更新され
る。シンボル1を符号化した後：E=0とセットし、次にSおよ
びNを一緒に増分する。この演算が量のオーバーフロー
を引き起こすならば、両方の量を2で除し、それらを増
分する前に剰余を切り捨てる。シンボル0を符号化した後：Sだけを増分する。この演算
がSのオーバーフローを引き起こすならば、N、S両方の
量を2で除し、Sを増分する前にそれら除算の剰余を切り
捨てる。除算の前にゼロに等しい場合、指数Eが既に最
大値7でない限り、指数Eを増分する。These three quantities are updated as follows. After encoding symbol 1: set E = 0, then increment S and N together. If this operation causes a quantity overflow, divide both quantities by two and truncate the remainder before incrementing them. After encoding symbol 0: increment S only. If this operation causes an overflow of S, divide both the N and S quantities by two and truncate the remainder of the division before incrementing S. If equal to zero before division, increment exponent E unless exponent E is already at maximum 7.

【０１６６】以下にP(1)がE、NおよびSからどのように
計算されるかを記述して、適応型符号化器の記述を完了
する。N≠0であれば、推定値は明らかにP(1)=N/Sであ
る。一方、N=0の時、Nは、最後には1に等しかったので
あるから少なくともE+1回シフトダウンされたにちがい
ない。従って、推定値はP(1)=1/(S*2^E+1)である。もち
ろん、この確率表現は１５ビット固定小数点である。更
に、重要な点であるが、この表現はハードウェアおよび
ソフトウェアでの計算が容易である。ソフトウェアにお
ける最も単純な手法は、NおよびSを一緒に使用してルッ
クアップ・テーブルへの１３ビット・インデックスを作
成し、その結果をE回シフトダウンするものである。し
かしながら、ハードウェアにおいては、そのような大き
いルックアップ・テーブルは高価で正当化できない。ハ
ードウェアでの好ましい実施形態は、非常に小さいルッ
クアップ・テーブルを使用して適当な精度を定めこの量
にNを乗ずるものである。両方の実施形態ともすべての
状況の下で全く同じ結果を生成しなければならないの
で、本発明は、ハードウェア実施形態でP(1)を計算する
アルゴリズムを特定し、次に互換性のある量で一層大き
いルックアップ・テーブルを埋めるソフトウェア実施形
態を残す。１５ビット固定小数点量としてのP(1)の正確
な定義は次の数５２の通りである。The following describes how P (1) is calculated from E, N and S to complete the description of the adaptive encoder. If N ≠ 0, the estimate is clearly P (1) = N / S. On the other hand, when N = 0, N must have been shifted down at least E + 1 times since N was finally equal to one. Therefore, the estimated value is P (1) = 1 / (S * ^{2E + 1} ). Of course, this probability expression is a 15-bit fixed point. Furthermore, importantly, this representation is easy to compute in hardware and software. The simplest approach in software is to use N and S together to create a 13-bit index into the look-up table and shift the result down E times. However, in hardware, such large look-up tables are expensive and cannot be justified. The preferred embodiment in hardware uses a very small look-up table to determine the appropriate precision and multiply this quantity by N. Since both embodiments must produce exactly the same result under all circumstances, the present invention specifies an algorithm that computes P (1) in a hardware embodiment, and then uses a compatible quantity. Leave a software embodiment that fills in a larger lookup table. The exact definition of P (1) as a 15-bit fixed-point quantity is as follows:

【０１６７】[0167]

【数５２】 (Equation 52)

【０１６８】この量が常に1≦2¹⁵P(1)≦2¹⁵-1の範囲に
あることを検証することは極めて単純な事柄であり、こ
れは、ビットストリームが正しく符号化されるであろう
ことを確認するために十分な範囲である。It is a very simple matter to verify that this quantity is always in the range 1 ≦ 2 ¹⁵ P (1) ≦ 2 ¹⁵ −1, which means that the bitstream is correctly encoded. It is enough range to confirm that it is waxy.

【０１６９】5.8. メモリ必要容量中間バッファ・サイズ上述した符号化方式を実施するために必要とされる中間
バッファ記憶の全量を計算するのは簡単である。図８に
示されるように、各バンドについて多くともQ_k,l,b+1走
査行が維持されなければならない。この場合Q_k,l,b+1は
符号化に関連する最精細量子化層である。これを単純な
表現に翻訳するため、すべてのバンドについてQ_k,l,b=8
であると仮定する。実際、８つの量子化層は大部分の目
的にとって十分適切であることが認められる。複解像度
ピラミッドを下方に移動する度毎に各走査行のサイズは
半分にされることを考慮すれば、オーバーコンプリート
形態的変換を使用して生成されるすべての差分バンドに
ついて、5.8. Required Memory Intermediate Buffer Size It is straightforward to calculate the total amount of intermediate buffer storage required to implement the encoding scheme described above. As shown in FIG. 8, at most Q _{k, l, b} +1 scan rows must be maintained for each band. In this case, Q _{k, l, b} +1 is the finest quantization layer related to coding. To translate this into a simple expression, for all bands Q _{k, l, b} = 8
Suppose that In fact, it has been found that eight quantization layers are adequate enough for most purposes. Given that each scan row is halved in size each time the multi-resolution pyramid is moved down, for all difference bands generated using the overcomplete morphological transform,

【０１７０】[0170]

【数５３】 (Equation 53)

【０１７１】という合計標本値がバッファ記憶されなけ
ればならず、非オーバーコンプリート・サブバンド変換
を使用して生成される高周波数サブバンドについて、For high frequency subbands generated using a non-overcomplete subband transform, the total sample values

【０１７２】[0172]

【数５４】 (Equation 54)

【０１７３】という合計標本値がバッファ記憶されなけ
ればならない。形態的変換およびサブバンド変換に関す
る記憶容量は、画像の異なる領域での２つの変換の間の
切り替えには関心があるとしても、加算されない。なぜ
ならば、適応型符号化のために使用された文脈状態は再
初期化されないという唯一の例外はあるとしても、切り
替えが起こるときは必ず各領域は独立の画像として圧縮
されるからである。The total sample value must be buffered. The storage capacity for morphological and sub-band transforms is not added, even if interested in switching between two transforms in different areas of the image. This is because each region is compressed as a separate image whenever a switch occurs, with the only exception that the context state used for adaptive coding is not reinitialized.

【０１７４】これに加えて、すべてのバンドのすべての
量子化層に関して文脈状態情報を記憶することができな
ければならない。計算すれば、各バンドの各量子化層に
関して、各々が2バイトの正確に64の異なる文脈状態が
備えられなければならない。複解像度分解において各カ
ラー成分に関して8量子化層および最高7レベルを仮定
し、また、各レベルにおいてすべての4バンドが使用さ
れる(すなわち形態的変換およびサブバンド変換を同時
にサポートすることが望まれる)と仮定すれば、合計K＊
28.672kバイトの文脈状態メモリが必要とされる。In addition, it must be possible to store context state information for all quantization layers of all bands. If calculated, exactly 64 different context states of 2 bytes each must be provided for each quantization layer in each band. Assuming 8 quantization layers and up to 7 levels for each color component in the bi-resolution decomposition, and all 4 bands are used at each level (i.e. it is desirable to support morphological and sub-band transforms simultaneously) ), The total K *
28.672k bytes of context state memory are required.

【０１７５】中間バッファ全体の必要容量に影響する変
換バッファと符号化バッファを組み合わせて考察するた
め、2.4項の結果を上記の数と組み合わせ、バンド標本
値は１６ビット精度に表現され一方画像標本は８ビット
精度でよいと仮定すれば、合計メモリ必要量は、To consider the combination of the transform buffer and the encoding buffer, which affect the required capacity of the entire intermediate buffer, the results of Section 2.4 are combined with the above numbers, and the band sample values are represented with 16-bit precision, while the image samples are Assuming 8-bit precision, the total memory requirement is

【０１７６】[0176]

【数５５】 [Equation 55]

【０１７７】となる。画像が、白黒(K=1)であって、走
査行につき6000ピクセルを持つとし、サブバンド・フィ
ルタのサポートためT=9が使用されるとすれば、メモリ
必要量は合計281kバイトとなる。形態的変換は使用せ
ず、サブバンド変換だけをサポートするとすれば、この
合計値は247kバイトに減じる。メモリ要件は、大きな数
の組み込み量子化層にもかかわらず、符号化段階ではな
く、変換によって実際には支配される点に注意する価値
がある。代替的には、それよりいくぶん小さいサブバン
ド・フィルタを使用することができる。例えば、リコー
社によって開発された正確な再構築フィルタ(これは損
失／無損失統合圧縮にとって適切なものである)はT=6を
持ち、サブバンドだけの場合中間バッファ必要容量は19
3kバイトに減少する。比較すれば、標準のベースライン
ＪＰＥＧ圧縮は48kバイト以上程度を必要とする。Is obtained. Assuming that the image is black and white (K = 1), has 6000 pixels per scan line, and uses T = 9 to support subband filters, the total memory requirement is 281 kbytes. This sum would be reduced to 247k bytes if no morphological transforms were used and only sub-band transforms were supported. It is worth noting that the memory requirements are actually governed by the transform, not the coding stage, despite the large number of embedded quantization layers. Alternatively, a somewhat smaller sub-band filter can be used. For example, the exact reconstruction filter developed by Ricoh (which is appropriate for lossy / lossless joint compression) has T = 6 and the intermediate buffer requirement for subbands alone is 19
Reduce to 3k bytes. By comparison, standard baseline JPEG compression requires about 48k bytes or more.

【０１７８】ハードウェア実施の場合のメモリ・バンド
幅およびチップ搭載ＲＡＭ上述の符号化方式の単純なハードウェア実施形態は各バ
ンドの各量子化層の単一走査行を逐次処理する。この場
合、チップ搭載ＲＡＭ必要容量は、以下の項目によって
支配される。・処理されているバンドの適切な層について各々が16ビ
ット幅の64個の文脈状態ワードのために必要なメモリ。
２重バッファが必要であるかもしれず、これは合計256
バイトを必要とする。・上述のような２つのルックアップ・テーブルのため、
更に256バイトのチップ搭載メモリが必要とされる。・上述されたランレングス符号化方式を実施するために
必要とされるいくらかの付加的チップ搭載メモリが必要
となる。これは、文脈情報を取得するために使用される
３走査行からの最高65の標本サンプル(各16ビット幅)の
内部バッファ記憶を要求するであろう。これは、更に39
0バイトをチップ搭載メモリ必要容量に加える。しか
し、このコンポーネントを劇的に最小にすることができ
る多数の技術が存在する。特に、これにチップ搭載メモ
リ全体のうちの無視できるほどのコンポーネントを提供
するためランレングス符号化技術をＡＳＩＣ自体の上に
含めることができる。 Memory Band for Hardware Implementation
Width and On-Chip RAM A simple hardware embodiment of the above-described encoding scheme sequentially processes a single scan row of each quantization layer in each band. In this case, the required capacity of the on-chip RAM is governed by the following items. Memory required for 64 context state words, each 16 bits wide for the appropriate layer of the band being processed.
A double buffer may be required, a total of 256
Requires bytes. • For the two lookup tables as described above,
An additional 256 bytes of on-chip memory is required. -Requires some additional on-chip memory required to implement the run-length encoding scheme described above. This would require internal buffer storage of up to 65 sample samples (16 bits wide each) from three scan lines used to obtain contextual information. This is an additional 39
Add 0 bytes to the required memory on chip. However, there are a number of techniques that can dramatically minimize this component. In particular, run-length encoding techniques can be included on the ASIC itself to provide it with negligible components of the overall on-chip memory.

【０１７９】6. 性能本明細書に記述された圧縮システムは、自然画像、グラ
フィックス、合成ドキュメントおよび２レベル画像ソー
スを含む多種多様な画像タイプに対して競争力のある圧
縮性能を達成することができる。更に、これら画像ソー
スのすべてに関して、該システムは解像度およびビット
率に関して並列的にスケーリング可能な高度にスケーリ
ング可能な圧縮されたビットストリームを生成する。6. Performance The compression system described herein achieves competitive compression performance for a wide variety of image types, including natural images, graphics, synthetic documents, and two-level image sources. Can be. Furthermore, for all of these image sources, the system produces a highly scalable compressed bit stream that can be scaled in parallel with respect to resolution and bit rate.

【０１８０】4項で記述された適応型量子化方式および5
項で記述された関連符号化器は、ともにＣＡＳＡＣ(す
なわち文脈適応型連続近似および符号化)アルゴリズム
を含むが、2.3項で記述された複レベル・サブバンド変
換に関連してテストされている。自然画像処理に関し
て、本実施形態の圧縮性能は、それ以外の最新技術水準
の圧縮方式と比較して、メモリ必要容量は大幅に少な
く、競争的であることが観察された。ほとんどすべての
既存のスケーラブルな画像圧縮方式で使用される量子化
方針を組み込みながら、適応型量子化特性をほとんど無
修正で取り出すことができることは留意されるべきであ
る。この利点を生かして、適応型量子化方針が視覚的に
一層心地よい画像を実際に生成することを検証すること
は可能であった。適応型量子化で圧縮された画像が非適
応型組み込み量子化器で圧縮されたものより若干貧弱な
(約0.1から0.2dB悪い)再構築ＭＳＥを一般的には持つと
いう事実にかかわらず、上記の点は真実であるというこ
とが観察されている。The adaptive quantization method described in section 4 and 5
The relevant encoders described in section include both the CASAC (ie, context-adaptive continuous approximation and coding) algorithm, but have been tested in conjunction with the multi-level subband transform described in section 2.3. With regard to natural image processing, it has been observed that the compression performance of the present embodiment requires much less memory and is competitive compared to other state-of-the-art compression schemes. It should be noted that the adaptive quantization characteristics can be retrieved with almost no modification, while incorporating the quantization strategies used in almost all existing scalable image compression schemes. Taking advantage of this advantage, it was possible to verify that the adaptive quantization strategy actually produced a more visually pleasing image. Images compressed with adaptive quantization are slightly poorer than those compressed with non-adaptive embedded quantizer
Despite the fact that it typically has a reconstructed MSE (approximately 0.1 to 0.2 dB), the above points have been observed to be true.

【０１８１】7. 注記以上の記述から本発明の多くの特徴および利点は明白で
ある。更に、当業者が多数の修正および変更を行うこと
は可能であるので、例示し記述したものと正確に同じ構
成および動作に本発明を制限することは望まれてない。
従って、すべての適切な修正は本発明の理念を逸脱する
ものではないと見なされるべきである。7. Notes Many features and advantages of the invention are apparent from the above description. In addition, many modifications and changes will occur to those skilled in the art, so it is not desired to limit the invention to the exact same construction and operation as illustrated and described.
Accordingly, all appropriate modifications should not be deemed to depart from the spirit of the invention.

【０１８２】本発明には、例として次のような実施様態
が含まれる。（１）デジタル画像を圧縮する方法であって、デジタル
画像に関して空間適応型組み込み量子化を実行するステ
ップと、上記ステップの結果生成される量子化シンボル
を符号化して圧縮された画像を形成するステップと、を
含むデジタル画像圧縮方法。（２）上記組み込み量子化ステップに先行してデジタル
画像の変換を行うステップを含む、上記（１）に記載の
デジタル画像圧縮方法。（３）上記画像変換ステップが形態的変換を含む、上記
（２）に記載のデジタル画像圧縮方法。（４）上記形態的変換が符号予測演算子を含む、上記
（３）に記載のデジタル画像圧縮方法。（５）上記画像変換ステップが、サブバンドおよびウェ
ーブレット変換のセットから選択される変換を含む、上
記（２）に記載のデジタル画像圧縮方法。（６）上記組み込み量子化ステップが組み込みデッドゾ
ーン量子化を含む、上記（１）に記載のデジタル画像圧
縮方法。（７）上記デッドゾーン量子化が、最も粗い量子化ステ
ップ・サイズを識別する単一のパラメータによって特徴
づけられる、上記（６）に記載のデジタル画像圧縮方
法。（８）上記組み込みデッドゾーン量子化が、画像の標本
を符号値表現に変換するステップと、その符号値表現の
適切な数の下位ビットを破棄するステップと、を含む、
上記（６）に記載のデジタル画像圧縮方法。The present invention includes the following embodiments as examples. (1) A method for compressing a digital image, wherein spatially adaptive embedded quantization is performed on the digital image, and a quantization symbol generated as a result of the step is encoded to form a compressed image. And a digital image compression method. (2) The digital image compression method according to the above (1), comprising a step of performing a digital image conversion prior to the embedded quantization step. (3) The digital image compression method according to (2), wherein the image conversion step includes morphological conversion. (4) The digital image compression method according to (3), wherein the morphological transformation includes a sign prediction operator. (5) The digital image compression method according to (2), wherein the image conversion step includes a conversion selected from a set of a subband and a wavelet transform. (6) The digital image compression method according to (1), wherein the embedded quantization step includes embedded dead zone quantization. (7) The digital image compression method according to (6), wherein the dead zone quantization is characterized by a single parameter identifying a coarsest quantization step size. (8) wherein the embedded dead zone quantization comprises: converting a sample of the image into a code value representation; and discarding an appropriate number of lower bits of the code value representation.
The digital image compression method according to the above (6).

【０１８３】（９）標本が粗いステップ・サイズを上回
る大きさを持つならば、一層精細なステップ・サイズを
使用して当該標本およびその隣接標本が量子化され符号
化される、上記（１）に記載のデジタル画像圧縮方法。（１０）変換の独立バンドが独立して量子化され復号さ
れる、上記（２）に記載のデジタル画像圧縮方法。（１１）任意の特定のバンドに関して走査行mの量子化
層を符号化するため条件付け文脈を生成する時、より粗
い量子化層q-1における走査行m+1を越えて出現する情報
は一切使用されず、また、現在時量子化層qにおける走
査行m-1に先行して出現する情報は一切使用されない、
上記（１）に記載のデジタル画像圧縮方法。（１２）組み込み量子化シンボルを符号化するステップ
が、標本が前の量子化層において非ゼロであると認めら
れていれば改良符号化を実行するステップと、標本が前
の量子化層においてゼロであると認められていればゼロ
符号化を実行するステップと、を含む、上記（１）に記
載のデジタル画像圧縮方法。（１３）標本が前の量子化層においてゼロであると認め
られる場合ゼロ符号化を実行する上記ステップが、符号
の符号化を実行するステップと、初期大きさ符号化を実
行するステップと、を含む、上記（１２）に記載のデジ
タル画像圧縮方法。（１４）前にゼロに量子化された標本の連続ランレング
スを条件にして、ランレングス符号化方式を使用してゼ
ロに量子化された標本が連続して符号化される、上記
（１２）に記載のデジタル画像圧縮方法。(9) If the sample has a size greater than the coarse step size, the sample and its neighboring samples are quantized and encoded using a finer step size. 2. The digital image compression method according to 1. (10) The digital image compression method according to (2), wherein independent bands of the transform are independently quantized and decoded. (11) When generating a conditioning context to encode the quantization layer of scan row m for any particular band, any information appearing beyond scan row m + 1 in coarser quantization layer q-1 Is not used, nor is any information that precedes the scan line m-1 in the current quantization layer q used.
The digital image compression method according to the above (1). (12) performing the improved encoding if the sample is recognized as non-zero in the previous quantization layer, and encoding the embedded quantization symbol; Performing the zero-encoding if it is determined that the digital image compression is performed. (13) If the sample is found to be zero in the previous quantization layer, performing the zero encoding comprises performing a code encoding and performing an initial magnitude encoding. The digital image compression method according to the above (12). (14) given the continuous run length of the sample previously quantized to zero, the sample quantized to zero is continuously encoded using a run-length encoding scheme, as described in (12) above. 2. The digital image compression method according to 1.

【０１８４】（１５）上記（１）に記載の圧縮された画
像に対して伸張を行う方法。（１６）圧縮された画像のビットストリーム編集を行う
ステップを含む、上記（１）に記載のデジタル画像圧縮
方法。（１７）上記圧縮画像のビットストリーム編集を行うス
テップが、ある解像度を選択するため圧縮画像から不必
要なサブバンドを破棄するステップを含む、上記（１
６）に記載のデジタル画像圧縮方法。（１８）上記圧縮画像のビットストリーム編集を行うス
テップが、再現忠実性を選択するため圧縮画像から関連
サブバンド内の不必要な量子化層を破棄するステップを
含む、上記（１６）に記載のデジタル画像圧縮方法。（１９）デジタル画像の圧縮を実行するプロセッサであ
って、デジタル画像に関して空間適応型組み込み量子化
を実行する手段と、上記量子化の結果生成される量子化
シンボルを符号化して圧縮された画像を形成する手段
と、を備えるデジタル画像圧縮プロセッサ。（２０）上記（１９）に記載のプロセッサによって圧縮
された画像のサイズを選択的に減少させるビットストリ
ーム・エディタ。（２１）上記（１９）に記載のプロセッサによって圧縮
された画像を伸張する伸張器。(15) A method of decompressing the compressed image described in (1). (16) The digital image compression method according to (1), including a step of performing bit stream editing of the compressed image. (17) The bit stream editing of the compressed image includes the step of discarding unnecessary subbands from the compressed image to select a certain resolution.
The digital image compression method according to 6). (18) The step of (16), wherein the step of performing the bitstream editing of the compressed image includes a step of discarding unnecessary quantization layers in related subbands from the compressed image to select reproduction fidelity. Digital image compression method. (19) A processor for performing compression of a digital image, a means for performing spatial adaptive built-in quantization on the digital image, and encoding a compressed symbol generated as a result of the quantization into a compressed image. Means for forming a digital image compression processor. (20) A bitstream editor for selectively reducing the size of an image compressed by the processor according to (19). (21) An expander for expanding an image compressed by the processor according to (19).

【０１８５】[0185]

【発明の効果】本発明によって、解像度ならびにビット
率スケーラビリティを提供しかつ無損失ならびに損失圧
縮を統合しながら、複合デジタル画像を効率的に圧縮／
伸張することが可能となる。The present invention efficiently compresses / compresses composite digital images while providing resolution and bit rate scalability and integrating lossless and lossy compression.
It is possible to stretch.

[Brief description of the drawings]

【図１】本発明に従ったＣＡＳＡＣ型統合スケーラブル
画像圧縮／伸張システムの実施形態を示すブロック図で
ある。FIG. 1 is a block diagram illustrating an embodiment of a CASAC integrated scalable image compression / decompression system according to the present invention.

【図２】本発明の実施形態に従った２レベル複数解像度
変換を示すブロック図である。FIG. 2 is a block diagram illustrating a two-level multi-resolution conversion according to an embodiment of the present invention.

【図３】本発明の実施形態に従ったデジタル画像の形態
上のサブ標本化および補間を示すブロック図である。FIG. 3 is a block diagram illustrating morphological sub-sampling and interpolation of a digital image according to an embodiment of the present invention.

【図４】本発明の実施形態に従った圧縮ビットストリー
ムの構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of a compressed bit stream according to an embodiment of the present invention.

【図５】本発明の実施形態に従った組み込みデッドゾー
ン量子化器の例を示すブロック図である。FIG. 5 is a block diagram illustrating an example of a built-in dead zone quantizer according to an embodiment of the present invention.

【図６】本発明の実施形態に従った量子化層におけるバ
ンドを量子化し符号化するために使用される隣接情報を
示すブロック図である。FIG. 6 is a block diagram illustrating neighbor information used to quantize and encode bands in a quantization layer according to an embodiment of the present invention.

【図７】本発明の実施形態に従った組み込み均一量子化
器の例を示すブロック図である。FIG. 7 is a block diagram illustrating an example of an embedded uniform quantizer according to an embodiment of the present invention.

【図８】３層バンドのすべての層を符号化あるいは復号
するため４つの走査行がバッファ記憶される本発明の実
施形態を示すブロック図である。FIG. 8 is a block diagram illustrating an embodiment of the present invention in which four scan rows are buffered to encode or decode all layers of a three-layer band.

【図９】本発明の実施形態に従って、量子化のためバン
ド標本と関連した情報の符号化を示す流れ図である。FIG. 9 is a flowchart illustrating encoding of information associated with a band sample for quantization, according to an embodiment of the invention.

[Explanation of symbols]

１００統合スケーラブル画像圧縮／伸張システム１０５未圧縮画像１１０圧縮器１１５、１２５スケーラブル・ビットストリーム１３０伸張器１３５伸張画像 100 Integrated Scalable Image Compression / Decompression System 105 Uncompressed Image 110 Compressor 115, 125 Scalable Bitstream 130 Decompressor 135 Decompressed Image

Claims

[Claims]

1. A method for compressing a digital image, comprising the steps of: performing spatial adaptive embedded quantization on the digital image; and encoding the quantized symbols generated as a result of the step to form a compressed image. And a digital image compression method.