JPWO1998042135A1

JPWO1998042135A1 - Image encoding device, image decoding device, image encoding method, image decoding method, and image encoding/decoding system

Info

Publication number: JPWO1998042135A1
Application number: JP10-540333A
Authority: JP
Inventors: 俊一関口; 光太郎浅井; 篤道村上; 博文西川; 慎一黒田; 芳美井須; 由里長谷川
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1997-03-17
Filing date: 1997-10-23
Publication date: 1999-07-06
Anticipated expiration: 2017-10-23

Abstract

(57)【要約】平行移動以外の画像移動を比較的小規模で処理時間も短縮して高い精度で予測できる画像符号化装置、復号装置を得る。このため、入力画像を所定のブロックに分割して、ブロックのフレーム間の動き検出による動き補償予測手段を備えて入力画像を圧縮符号化する構成であって、動き検出用の参照画像の対応する部分領域に存在する実標本点である整数画素のみを所定の形式に変形化して切り出し、入力画像の上記ブロックの整数画素と比較する変形ブロックマッチング部を含んで、上記座標指定して抽出した最小誤差を与える動きベクトルを出力する動き検出部と、この変形ブロックマッチング部を含めた比較出力から得られる動きパラメータに従って参照画像のブロックを対応し座標指定して変形して決める対応点決定部を含んで予測部分画像を出力する動き補償部とを備えた。また、画像復号装置は、符号化装置と対応する構成とした。また、更に、画像復号装置は、動き補償手段において、参照画像の半画素の座標値も用い、複数の動きベクトルで座標計算し、得られた画素値を変形パターンにより所定の形式に変形処理するようにした。 (57) [Abstract] An image encoding device and a decoding device are provided that can predict image movements other than translational movements on a relatively small scale with reduced processing time and high accuracy. To this end, the device divides an input image into predetermined blocks and includes a motion compensation prediction means for detecting motion between blocks, compressing and encoding the input image. The device includes a motion detection unit that transforms and extracts only integer pixels, which are actual sample points in a corresponding partial region of a reference image for motion detection, into a predetermined format, and compares them with the integer pixels of the block in the input image, outputting a motion vector that provides the minimum error extracted by specifying the coordinates. The device also includes a corresponding point determination unit that transforms and determines corresponding coordinates of blocks in the reference image according to motion parameters obtained from the comparison output including the transformed block matching unit, outputting a predicted partial image. The image decoding device also has a configuration corresponding to the encoding device. Furthermore, the image decoding device further includes a motion compensation means that calculates coordinates using multiple motion vectors, also using coordinate values of half pixels of the reference image, and transforms the obtained pixel values into a predetermined format using a transformation pattern.

Description

【発明の詳細な説明】画像符号化装置及び画像復号装置及び画像符号化方法及び画像復号方法及び画像符号化復号システム技術分野この発明は、画像の高能率符号化あるいは復号において、既存の画像から符号化すべき画像もしくは復号すべき画像の動き補償予測を行い、予測誤差を符号化する、もしくは予測誤差と参照画像データとの加算により復号を行う装置とシステムに関するものである。[Detailed Description of the Invention] Image Encoding Apparatus, Image Decoding Apparatus, Image Encoding Method, Image Decoding Method, and Image Encoding/Decoding System Technical Field This invention relates to an apparatus and system for highly efficient image encoding or decoding, which performs motion-compensated prediction of an image to be encoded or decoded from an existing image, and encodes the prediction error or decodes by adding the prediction error to reference image data.

背景技術画像の高能率符号化における動き補償予測方式の従来技術を説明する。Background Art This paper describes the prior art of motion compensation prediction methods for highly efficient image coding.

従来の動き補償予測方式の第１の例として、平行移動によるブロックマッチングを用いた動き補償予測方式がある。例えばＩＳＯ／ＩＥＣ１１１７２−２（ＭＰＥＧ１ビデオ規格）では、ブロックマッチングを用いた前方向／後方向／内挿動き補償予測方式について解説している。また、従来の動き補償予測方式の第２の例として、アフィン変換を用いた動き補償がある。例えば、「アフィン変換を用いた動き補償予測に関する検討」（電子情報通信学会技術報告ＩＥ９４−３６）では、画像の任意形状領域の動き量をアフィン動きパラメータでモデル化し、そのパラメータを検出することにより動き補償予測を行う方式について解説している。A first example of a conventional motion compensation prediction method is one that uses block matching based on translation. For example, ISO/IEC 11172-2 (MPEG-1 Video Standard) describes forward, backward, and interpolation motion compensation prediction methods using block matching. A second example of a conventional motion compensation prediction method is motion compensation using affine transformation. For example, "Study on Motion Compensation Prediction Using Affine Transformation" (IEICE Technical Report IE94-36) describes a method that models the amount of motion in an arbitrarily shaped region of an image using affine motion parameters and performs motion compensation prediction by detecting these parameters.

以下、これらの解説を基にして従来の平行移動による動き補償方式及びアフィン変換を用いた動き補償方式について説明する。Based on these explanations, the conventional motion compensation method using translation and the motion compensation method using affine transformation will be explained below.

ブロックマッチングによる動き補償予測の概念を図４２に示す。The concept of motion compensation prediction using block matching is shown in FIG.

同図において、ｉは、動き補償予測の単位となるブロックの画面内位置、ｆｉ（ｘ，ｙ，ｔ）は、画面内位置ｉ、時間ｔにおけるブロックの位置（ｘ，ｙ）における画素値、Ｒは、動きベクトル探索範囲、ｖは、動きベクトル（∈Ｒ）である。ブロックマッチングは、同図のように参照画像２０１の探索範囲Ｒの中で、被予測画像２０２中の予測対象のブロックｉの画素値ｆｉ（ｘ，ｙ，ｔ）に最も近いブロック、即ち式（１）に示す誤差電力Ｄｖを最小化するブロックｆi+ v（ｘ，ｙ，ｔ−１）を見つける処理に相当する。In the figure, i is the on-screen position of the block that serves as the unit of motion compensation prediction, fi(x,y,t) is the pixel value at the block position (x,y) at on-screen position i and time t, R is the motion vector search range, v is the motion vector (∈R). Block matching corresponds to the process of finding the block closest to the pixel value fi(x,y,t) of the block i to be predicted in the predicted image 202 within the search range R of the reference image 201, as shown in the figure, i.e., the block fi+v(x,y,t-1) that minimizes the error power Dv shown in equation (1).

Ｄｖを最小にするｖの値が動きベクトルとなる。図中、参照画像中の実標本点である実画素だけを用いて適合するブロックを探索する方法を整数画素精度探索、整数画素に加えて整数画素の中間の半画素も用いる探索方法を半画素精度探索と呼ぶ。一般に、同一探索範囲の条件下では半画素精度探索の方が整数画素精度探索より探索点が多くなり、予測効率が高まる。 The value of v that minimizes Dv becomes the motion vector. In the figure, a method of searching for a matching block using only real pixels, which are real sample points in the reference image, is called integer-pixel precision search, while a search method that uses half-pixels, which are intermediate between integer pixels, in addition to integer pixels, is called half-pixel precision search. Generally, under the same search range conditions, half-pixel precision search results in more search points than integer-pixel precision search, resulting in higher prediction efficiency.

図４３は、例えば、ＭＰＥＧ１ビデオ規格などで採用されている動き補償予測方式を用いる画像符号化装置の動き補償予測部（ブロックマッチング部）の構成例を示す図である。Figure 43 shows an example of the configuration of a motion compensation prediction unit (block matching unit) in an image coding device that uses a motion compensation prediction method, such as that adopted in the MPEG-1 video standard.

図において、２０７は水平方向移動量カウンタ、２０８は垂直方向平行移動量カウンタ、２１１はメモリ読み出しアドレス生成部、２１３はパターンマッチング部、２１６は最小予測誤差電力判定部である。また、２０３は水平方向平行移動量探索範囲指示信号、２０４は垂直方向平行移動量探索範囲指示信号、２０５は被予測ブロックデータ、２０６は被予測ブロックの画像内位置信号、２０９は水平方向平行移動量探索点データ、２１０は垂直方向平行移動量探索点データ、２１２は読み出しアドレス、２１４は読み出し画像データ、２１５は予測誤差電力信号、２１７は動きベクトル、２１８は最小予測誤差電力信号である。２１９は参照画像を記憶するフレームメモリである。In the figure, 207 is a horizontal displacement counter, 208 is a vertical translation displacement counter, 211 is a memory read address generator, 213 is a pattern matching unit, and 216 is a minimum prediction error power determiner. Also, 203 is a horizontal translation displacement search range indication signal, 204 is a vertical translation displacement search range indication signal, 205 is predicted block data, 206 is a predicted block intra-image position signal, 209 is horizontal translation displacement search point data, 210 is vertical translation displacement search point data, 212 is a read address, 214 is read image data, 215 is a prediction error power signal, 217 is a motion vector, and 218 is a minimum prediction error power signal. 219 is a frame memory for storing a reference image.

一方、図４４は、上記図４３に示す構成の動き補償予測部の動作を表す動作フローチャートである。On the other hand, FIG. 44 is an operational flowchart showing the operation of the motion compensation prediction unit having the configuration shown in FIG.

図において、ｄｘは、水平方向平行移動量探索点、ｄｙは、垂直方向平行移動量探索点、ｒａｎｇｅｈｍｉｎは、水平方向平行移動量探索範囲下限値、ｒａｎｇｅｈｍａｘは、水平方向平行移動量探索範囲上限値、ｒａｎｇｅｖｍｉｎは、垂直方向平行移動量探索範囲下限値、ｒａｎｇｅｖｍａｘは、垂直方向平行移動量探索範囲上限値、Ｄｍｉｎは、最小予測誤差電力、（ｘ，ｙ）は、マクロブロック内画素位置を表す座標、Ｄ（ｄｘ，ｄｙ）は、ｄｘ，ｄｙ探索時の予測誤差電力、ｆ（ｘ，ｙ）は、被予測画像の画素（ｘ，ｙ）の値、ｆｒ（ｘ，ｙ）は、参照画像の画素（ｘ，ｙ）の値、Ｄ（ｘ，ｙ）は、ｄｘ，ｄｙ探索時の（ｘ，ｙ）における予測誤差、ＭＶｈは、動きベクトル（平行移動量）水平成分、ＭＶｖは、動きベクトル（平行移動量）垂直成分である。 In the figure, dx is the horizontal translation amount search point, dy is the vertical translation amount search point, range h min is the lower limit of the horizontal translation amount search range, h max is the upper limit of the horizontal translation amount search range, v min is the lower limit of the vertical translation amount search range, v max is the upper limit of the vertical translation search range, D min is the minimum prediction error power, (x, y) are coordinates representing the pixel position in the macroblock, D(dx, dy) is the prediction error power during dx, dy search, f(x, y) is the value of pixel (x, y) in the predicted image, fr(x, y) is the value of pixel (x, y) in the reference image, D(x, y) is the prediction error at (x, y) during dx, dy search, MV h is the horizontal component of the motion vector (parallel movement amount), MV v is the vertical component of the motion vector (translation amount).

以下、図４３、図４４をもとに、ブロックマッチングの動作について詳しく説明する。The operation of block matching will be described in detail below with reference to FIGS.

１）動きベクトル探索範囲の設定水平方向平行移動量探索範囲指示信号２０３及び垂直方向平行移動量探索範囲指示信号２０４より、水平方向移動量カウンタ２０７にｒａｎｇｅｈｍｉｎ／ｒａｎｇｅｈｍａｘを、垂直方向平行移動量カウンタ２０８にｒａｎｇｅｖｍｉｎ／ｒａｎｇｅｖｍａｘを設定する。また、カウンタ初期値をそれぞれｄｘ＝ｒａｎｇｅｈｍｉｎ、ｄｙ＝ｒａｎｇｅｖｍｉｎにセットする。最小予測誤差電力判定部２１６において最小予測誤差電力Ｄｍｉｎを最大の値ＭＡＸＩＮＴ（例えば０ｘＦＦＦＦＦＦＦＦ）にセットする。これは、図４４のＳ２０１に相当する。1) Setting of motion vector search range The horizontal translation amount search range instruction signal 203 and the vertical translation amount search range instruction signal 204 are used to set the range of the horizontal translation amount counter 207. h min /range h max is set to the vertical translation amount counter 208 v min/range v Also, set the counter initial values as dx = range h min, dy=range v The minimum prediction error power determination unit 216 sets the minimum prediction error power D min is set to the maximum value MAXINT (for example, 0xFFFFFFFF). This corresponds to S201 in FIG.

２）予測画像候補画像の読み出し被予測マクロブロックの画素位置（ｘ，ｙ）から（ｄｘ，ｄｙ）だけ離れた位置にある参照画像中の位置（ｘ＋ｄｘ，ｙ＋ｄｙ）の画素を、フレームメモリから取り出す。図４３におけるメモリ読み出しアドレス生成部２１１が水平方向移動量カウンタ２０７からｄｘの値を、垂直方向平行移動量カウンタ２０８からｄｙの値を受け取り、フレームメモリ中のアドレスを生成する。2) Reading the Candidate Predicted Image The pixel at position (x + dx, y + dy) in the reference image, which is a distance (dx, dy) from the pixel position (x, y) of the predicted macroblock, is retrieved from the frame memory. The memory read address generator 211 in Figure 43 receives the value dx from the horizontal translation counter 207 and the value dy from the vertical translation counter 208, and generates an address in the frame memory.

３）予測誤差電力の算出まず、動きベクトルが（ｄｘ，ｄｙ）の時の予測誤差電力Ｄ（ｄｘ，ｄｙ）をゼロに初期化する。これは、図４４のＳ２０２に相当する。２）で読み出された画素値と、被予測マクロブロック内の位置（ｘ，ｙ）の画素値との差をとり、その絶対値をＤ（ｄｘ，ｄｙ）に累積していく。この処理をｘ＝ｙ＝１６になるまで繰り返し、（ｄｘ，ｄｙ）時の予測誤差電力Ｄ（ｄｘ，ｄｙ）、即ち式（１）におけるＤｖを得る。この処理は、図４３におけるパターンマッチング部２１３が行い、パターンマッチング部２１３はＤ（ｄｘ，ｄｙ）を予測誤差電力信号２１５によって、最小予測誤差電力判定部２１６に受け渡す。ここでの処理は、図４４におけるＳ２０３〜Ｓ２０９の処理に相当する。3) Calculating Prediction Error Power First, the prediction error power D(dx, dy) for a motion vector (dx, dy) is initialized to zero. This corresponds to S202 in Figure 44. The difference between the pixel value read in 2) and the pixel value at position (x, y) within the predicted macroblock is calculated, and the absolute value is accumulated as D(dx, dy). This process is repeated until x = y = 16 to obtain the prediction error power D(dx, dy) at (dx, dy), i.e., Dv in equation (1). This process is performed by the pattern matching unit 213 in Figure 43, which then passes D(dx, dy) to the minimum prediction error power determination unit 216 via the prediction error power signal 215. This process corresponds to S203 to S209 in Figure 44.

４）最小予測誤差電力値の更新３）の結果得られたＤ（ｄｘ，ｄｙ）が、それまでの探索結果の中で最小の誤差電力を与えるかどうかを判定する。判定は、図４３における最小予測誤差電力判定部２１６が行う。また、図４４におけるＳ２１０がこの判定処理に相当する。最小予測誤差電力判定部２１６は、内部に持つ最小予測誤差電力Ｄｍｉｎの値と、予測誤差電力信号２１５によって受け渡されるＤ（ｄｘ，ｄｙ）の大小を比較し、Ｄ（ｄｘ，ｄｙ）の方が小さいときに限りＤｍｉｎの値をＤ（ｄｘ，ｄｙ）で更新する。また、そのときの（ｄｘ，ｄｙ）の値を動きベクトル候補（ＭＶｈ，ＭＶｖ）として保持しておく。これらの更新処理は、図４４におけるＳ２１１に相当する。4) Updating the minimum prediction error power value It is determined whether D(dx, dy) obtained as a result of 3) provides the minimum error power among the search results up to that point. This determination is made by the minimum prediction error power determination unit 216 in FIG. 43. S210 in FIG. 44 corresponds to this determination process. The minimum prediction error power determination unit 216 updates the minimum prediction error power D The value of min is compared with the magnitude of D(dx, dy) delivered by the prediction error power signal 215, and only when D(dx, dy) is smaller, D The value of min is updated with D(dx, dy). The value of (dx, dy) at that time is used as the motion vector candidate (MV h, MV v) These update processes correspond to S211 in FIG.

５）動きベクトル値の決定上記２）〜４）を動きベクトル探索範囲Ｒ中のすべての（ｄｘ，ｄｙ）について繰り返し（図４４のＳ２１２〜Ｓ２１５）、最終的に最小予測誤差電力判定部２１６内に保持されている計算値（ＭＶｈ，ＭＶｖ）を動きベクトル２１７として出力する。5) Determination of the motion vector value The above steps 2) to 4) are repeated for all (dx, dy) in the motion vector search range R (S212 to S215 in FIG. 44), and the calculated value (MV) stored in the minimum prediction error power determination unit 216 is finally determined. h, MV v) is output as a motion vector 217.

図４５は、ＭＰＥＧ１ビデオ規格で採用されている動き補償予測方式の概要を示した図である。Figure 45 shows an overview of the motion compensation prediction method adopted in the MPEG-1 video standard.

ＭＰＥＧ１ビデオ規格では動画像の１枚１枚のフレームをピクチャと呼び、ピクチャをマクロブロックという１６×１６画素（色差信号は８×８画素）のブロックに分割して、各マクロブロックについてブロックマッチングによる動き補償予測を行う。その結果得られる動きベクトルと予測誤差信号とを符号化する。In the MPEG1 video standard, each frame of a moving image is called a picture, and the picture is divided into blocks of 16x16 pixels (8x8 pixels for color difference signals) called macroblocks. Motion compensation prediction is performed on each macroblock using block matching, and the resulting motion vector and prediction error signal are then coded.

ＭＰＥＧ１ビデオ規格では異なるピクチャごとに動き補償の方式が変えられるようになっており、図中Ｉピクチャでは動き補償予測を行わずにピクチャ内で閉じた符号化を行うが、Ｐピクチャでは時間的に前に表示される画像から予測を行う前方向動き補償予測を行い、Ｂピクチャでは前方向動き補償予測のほか、時間的に後に表示される画像から予測を行う後方向動き補償予測と、前方向動き補償予測及び後方向動き補償予測から得られる２つの予測画像の加算平均によって予測を行う内挿動き補償予測が許される。ただし、前方向／後方向／内挿の各動き補償予測は、予測に用いる参照画像の違いだけで、基本的にすべてブロックマッチングによる動き補償予測である。The MPEG-1 video standard allows different motion compensation methods to be used for different pictures. In the diagram, I-pictures perform closed coding within the picture without motion compensation prediction. P-pictures perform forward motion compensation, which predicts from a temporally earlier image. B-pictures allow not only forward motion compensation prediction, but also backward motion compensation, which predicts from a temporally later image, and interpolated motion compensation, which predicts by averaging two predicted images obtained from forward and backward motion compensation predictions. However, forward, backward, and interpolated motion compensation are all essentially block-matching motion compensation predictions; the only difference is the reference image used for prediction.

ブロックマッチングは上述のごとく、現在のビデオ符号化方式における動き補償予測の主たる実現手法として確立されている。しかしながら、ブロックマッチングの処理は「輝度の同じ領域は同一物体である」という等輝度仮定に立脚して、マクロブロックの様な正方ブロック単位に物体の平行移動量を求めていることに相当する。よって、正方ブロック形状方向への移動以外の動きを検出することは原理上不可能であり、回転や拡大、縮小、カメラのズーミング、３次元的な物体の動きなど、平行移動で十分に説明できない動きが発生する領域では予測精度が落ちる。As mentioned above, block matching has been established as the primary method for implementing motion-compensated prediction in current video coding systems. However, the block matching process is based on the isoluminance assumption that "areas of equal luminance represent the same object," and is equivalent to calculating the translational displacement of an object in square block units such as macroblocks. Therefore, it is theoretically impossible to detect movement other than in the direction of the square block shape. Prediction accuracy drops in areas where movement that cannot be adequately explained by translation, such as rotation, enlargement, reduction, camera zooming, and three-dimensional object movement, occurs.

このようなブロックマッチングによる動き検出の問題点を解消し、より正確な動き量を検出することを目指して、平行移動量だけでなく、回転やスケーリングといった動き量を含めて精度の高い動き補償予測を行おうというのが、アフィン変換を用いた動き補償予測である。この方式では、予測対象の画素値（ｘ，ｙ）が以下の式（２）に示すアフィン変換によって参照画像中の画素値（ｘ’，ｙ’）に変換されるという仮定に基づき、アフィン変換の各パラメータを動きパラメータとして検出する。「アフィン変換を用いた動き補償予測に関する検討」（電子情報通信学会技術報告ＩＥ９４− ３６）では、任意の形状の予測画像領域に対してアフィン動きパラメータを検出し、動き補償予測を行う手法を提案している。Affine transformation-based motion compensation prediction aims to resolve these problems with block-matching-based motion estimation and achieve more accurate motion estimation by incorporating not only translation but also rotation and scaling motions. This method estimates the parameters of the affine transformation as motion parameters, based on the assumption that the pixel value (x, y) to be predicted is transformed to the pixel value (x', y') in the reference image by the affine transformation shown in Equation (2) below. "Study on Motion Compensation Prediction Using Affine Transformation" (IEICE Technical Report IE94-36) proposes a method for detecting affine motion parameters for a prediction image region of arbitrary shape and performing motion compensation prediction.

ここで、θ、（Ｃｘ，Ｃｙ）、（ｔｘ，ｔｙ）の定義は下記に示す。 Here, the definitions of θ, (Cx, Cy), and (tx, ty) are shown below.

図４６は、アフィン変換を用いた動き補償予測処理の概念を示したものである。Figure 46 shows the concept of motion compensation prediction processing using affine transformation.

同図において、ｉは、動き補償予測の単位となる領域の画面内位置、ｆｉ（ｘ，ｙ，ｔ）は、画面内位置ｉ、時間ｔとして領域の位置（ｘ，ｙ）における画素値、Ｒｖは、平行移動量探索範囲、Ｒｒｏｔ，ｓｃａｌｅは、回転／スケール量探索範囲、ｖは、平行移動パラメータ（＝（ｔｘ，ｔｙ））を含む動きベクトル、ｒｏｔは、回転パラメータ（＝回転角θ）、ｓｃａｌｅは、スケールパラメータ（＝（Ｃｘ，Ｃｙ））である。アフィン動き補償予測では、動きベクトルに相当する平行移動パラメータ（ｔｘ，ｔｙ）に加え、回転角θ、スケール（Ｃｘ，Ｃｙ）の計５パラメータから成るアフィン動きパラメータを検出しなければならない。最適解は全パラメータの全探索で与えられるが非常に膨大な演算量となるため、ここでは平行移動量が支配的であるとの仮定に基づき、２段階の探索アルゴリズムを採用している。まず、第１段階では領域の平行移動量（ｔｘ，ｔｙ）を探索する。第２段階では、第１段階で決定された（ｔｘ，ｔｙ）の近傍で回転角θ、スケール（Ｃｘ，Ｃｙ）の探索を行い、さらに平行移動量の微調整を行うという手順を踏む。探索候補中、最小の予測誤差電力を与える予測領域と現在の領域との差分をとり、予測誤差を符号化する。アフィン変換方式の予測誤差電力は、以下の式（３）で示される。In the figure, i is the on-screen position of the region that serves as the unit of motion compensation prediction, fi(x,y,t) is the pixel value at the region position (x,y) at on-screen position i and time t, Rv is the translation search range, Rrot,scale is the rotation/scale search range, v is the motion vector including the translation parameter (=(tx,ty)), rot is the rotation parameter (=rotation angle θ), scale is the scale parameter (=(Cx,Cy)). Affine motion compensation prediction requires the detection of affine motion parameters, consisting of five parameters: the translation parameter (tx,ty) equivalent to the motion vector, the rotation angle θ, and the scale (Cx,Cy). The optimal solution would be found by exhaustively searching all parameters, which would require an enormous amount of computation. Therefore, a two-stage search algorithm is adopted here, based on the assumption that translation is dominant. In the first step, the translation amount (tx, ty) of the region is searched for. In the second step, the rotation angle θ and scale (Cx, Cy) are searched for near the (tx, ty) determined in the first step, and the translation amount is further fine-tuned. The difference between the current region and the prediction region that yields the smallest prediction error power is calculated and the prediction error is coded. The prediction error power for the affine transformation method is expressed by the following equation (3):

図４７は、アフィン変換を用いた動き補償予測部の構成例を示す図である。 FIG. 47 is a diagram showing an example of the configuration of a motion compensation prediction unit using affine transformation.

同図において、２２０は平行移動微調整量探索範囲指示信号、２２１は回転量探索範囲指示信号、２２２はスケール量探索範囲指示信号、２２３は平行移動量探索範囲指示信号、２２４は被予測領域画面内位置信号、２２５は被予測領域データ、２２６は水平方向平行移動量カウンタ、２２７は垂直方向平行移動量カウンタ、２２８は平行移動量加算部、２２９は第１段最小予測誤差電力判定部、２３０はメモリ読み出しアドレス生成部、２３１は補間演算部、２３２は半画素生成部、２３３は回転量カウンタ、２３４はスケール量カウンタ、２３５は平行移動／回転／スケール量加算部、２３６は第２段最小予測誤差電力判定部、２３７は平行移動微調整量カウンタ、２３８は平行移動微調整量加算部、２３９は最終最小予測誤差電力判定部である。In the figure, 220 denotes a translation fine-adjustment amount search range indication signal, 221 denotes a rotation amount search range indication signal, 222 denotes a scale amount search range indication signal, 223 denotes a translation amount search range indication signal, 224 denotes a predicted region intra-screen position signal, 225 denotes predicted region data, 226 denotes a horizontal translation amount counter, 227 denotes a vertical translation amount counter, 228 denotes a translation amount adder, 229 denotes a first-stage minimum prediction error power determiner, 230 denotes a memory read address generator, 231 denotes an interpolation calculation unit, 232 denotes a half-pixel generator, 233 denotes a rotation amount counter, 234 denotes a scale amount counter, 235 denotes a translation/rotation/scale amount adder, 236 denotes a second-stage minimum prediction error power determiner, 237 denotes a translation fine-adjustment amount counter, 238 denotes a translation fine-adjustment amount adder, and 239 denotes a final minimum prediction error power determiner.

図４８は、従来の装置のその動作フローチャートである。また、図４９は、図４８中のＳ２２４で示されるアフィン動きパラメータ検出行程の詳細を示すフローチャートである。FIG. 48 is a flowchart showing the operation of the conventional device, and FIG. 49 is a flowchart showing the details of the affine motion parameter detection step shown in S224 in FIG.

これらの図において、ＭＶｈ［４］は、動きベクトル水平成分（４候補）、ＭＶｖ［４］は、動きベクトル垂直成分（４候補）、Ｄｍｉｎは、最小予測誤差電力、 θは、回転量［ｒａｄｉａｎ］、Ｃｘ，Ｃｙは、スケール量、ｔｘ，ｔｙは、動きベクトル微調整量で、更に、Ｄ（θ［ｉ］，Ｃｘ［ｉ］，Ｃｙ［ｉ］，ｔｘ［ｉ］，ｔｙ［ｉ］）は、ＭＶｈ［ｉ］，ＭＶｖ［ｉ］選択時におけるアフィン動きパラメータ検出の結果得られる最小予測誤差電力、ｄθは、回転量探索点、ｄＣｘは、水平方向スケール量探索点、ｄＣｙは、垂直方向スケール量探索点、ｄｔｘは、水平方向平行移動微調整量探索点、ｄｔｙは、垂直方向平行移動微調整量探索点、ｒａｎｇｅｒａｄiaｎｍｉｎは、回転量探索範囲下限値、ｒａｎｇｅｒａｄｉａｎｍａｘは、回転量探索範囲上限値、ｒａｎｇｅｓｃａｌｅｍｉｎは、スケール量探索範囲下限値、ｒａｎｇｅｓｃａｌｅｍａｘは、スケール量探索範囲上限値、ｒａｎｇｅｔｈｍｉｎは、水平方向平行移動微調整量探索範囲下限値、ｒａｎｇｅｔｈｍａｘは、水平方向平行移動微調整量探索範囲上限値、ｒａｎｇｅｔｖｍｉｎは、垂直方向平行移動微調整量探索範囲下限値、ｒａｎｇｅｔｖｍａｘは、垂直方向平行移動微調整量探索範囲上限値、Ｄｍｉｎは、最小予測誤差電力、（ｘ，ｙ）は、被予測領域内画素位置、ｆ（ｘ，ｙ）は、被予測画像の画素（ｘ，ｙ）の値、ｆｒ（ｘ，ｙ）は、参照画像の画素（ｘ，ｙ）の値、ａｘは、水平方向アフィン変換値、ａｙは、垂直方向アフィン変換値、Ｄ（ａｘ，ａｙ）は、ａｘ，ａｙ探索時の予測誤差電力、Ｄ（ｘ，ｙ）は、ａｘ，ａｙ探索時の（ｘ，ｙ）における予測誤差である。 In these figures, MV h[4] is the horizontal component of the motion vector (4 candidates), MV v[4] is the vertical component of the motion vector (4 candidates), D min is the minimum prediction error power, θ is the rotation amount [radian], Cx, Cy are the scale amounts, tx, ty are the motion vector fine adjustment amounts, and D(θ[i], Cx[i], Cy[i], tx[i], ty[i]) is the MV h[i],MV The minimum prediction error power obtained as a result of affine motion parameter detection when v[i] is selected, dθ is a rotation amount search point, dCx is a horizontal scale amount search point, dCy is a vertical scale amount search point, dtx is a horizontal translation fine adjustment amount search point, dty is a vertical translation fine adjustment amount search point, range radian min is the lower limit of the rotation amount search range, radian max is the upper limit of the rotation amount search range, scale min is the lower limit of the scale amount search range, scale max is the upper limit of the scale amount search range, t h min is the lower limit of the horizontal translation fine adjustment amount search range, t h max is the upper limit of the horizontal translation fine adjustment amount search range, t v min is the lower limit of the vertical translation fine adjustment amount search range, t v max is the upper limit of the vertical translation fine adjustment amount search range, D min is the minimum prediction error power, (x, y) is the pixel position within the predicted region, f(x, y) is the value of pixel (x, y) in the predicted image, fr(x, y) is the value of pixel (x, y) in the reference image, ax is the horizontal affine transformation value, ay is the vertical affine transformation value, D(ax, ay) is the prediction error power when searching ax, ay, and D(x, y) is the prediction error at (x, y) when searching ax, ay.

以下、図４７〜図４９をもとに、アフィン変換を用いた動き補償予測処理の動作について詳しく説明する。The operation of the motion compensation prediction process using affine transformation will be described in detail below with reference to FIGS.

これらの図において、前記の図と同一の符号を付した要素もしくはステップについては、同一の動作もしくは処理を行うものとする。In these figures, elements or steps with the same reference numerals as those in the previous figures perform the same operations or processes.

１）第１段階第１段階として、従来のものでは、まず、前述のブロックマッチング相当の処理により、領域ごとに与えられた探索範囲内で平行移動パラメータ（＝動きベクトル）の検出を行う。1) First Stage In the first stage, conventional methods first use a process similar to the block matching described above to detect translation parameters (i.e., motion vectors) within a given search range for each region.

図４７において、平行移動量探索範囲指示信号２２３より、水平方向移動量カウンタ２２６及び垂直方向平行移動量カウンタ２２７に探索範囲を設定し、探索点を変化させていく。平行移動量加算部２２８で、このカウント値に被予測画像領域における現在の領域位置を加算し、その結果がメモリ読み出しアドレス生成部２３０に渡され、予測画像候補の画素値がフレームメモリ２１９から読み出される。読み出された画素値はパターンマッチング部２１３に渡され、ブロックマッチングと同様の誤差計算がなされる。このマッチング結果が第１段最小予測誤差電力判定部２２９に送られ、予測誤差の小さい方から４候補の平行移動パラメータを得る。これらをＭＶｈ［４］（水平成分）及びＭＶｖ［４］（垂直成分）と表記する。第１段最小予測誤差電力判定部２２９の動作は、最小予測誤差電力判定部２１６と同様である。この処理過程は、図４８のＳ２２１、Ｓ２２２に相当する。 In Fig. 47, a translation search range instruction signal 223 sets a search range in a horizontal translation counter 226 and a vertical translation counter 227, and changes the search point. A translation amount adder 228 adds the current area position in the predicted image area to this count value, and the result is passed to a memory read address generator 230, which reads out pixel values of predicted image candidates from the frame memory 219. The read pixel values are passed to a pattern matching unit 213, which performs an error calculation similar to block matching. This matching result is sent to a first-stage minimum prediction error power determiner 229, which obtains translation parameters for four candidates with the smallest prediction errors. These are then used to calculate the MV h[4] (horizontal component) and MV The operation of the first stage minimum prediction error power determination unit 229 is the same as that of the minimum prediction error power determination unit 216. This processing step corresponds to S221 and S222 in FIG.

２）第２段階２−１）準備（探索範囲の設定、最小予測誤差電力値の初期化）各ＭＶｈ［ｉ］／ＭＶｖ［ｉ］（０≦ｉ≦３）について、その近傍の微小空間において回転量／スケール量を探索する。これは、図４８のＳ２２４に相当し、図４９に詳細な処理過程を示す。図４７の装置の動作と関連付けながら動作を説明する。2) Second stage 2-1) Preparation (setting the search range, initializing the minimum prediction error power value) h[i]/MV For v[i] (0≦i≦3), the rotation amount/scale amount is searched for in the small space nearby. This corresponds to S224 in Fig. 48, and the detailed processing steps are shown in Fig. 49. The operation will be explained in relation to the operation of the device in Fig. 47.

まず、回転量探索範囲指示信号２２１及びスケール量探索範囲指示信号２２２より、回転量カウンタ２３３、スケール量カウンタ２３４にそれぞれ探索範囲を設定する。また、平行移動微調整量探索範囲指示信号２２０より平行移動微調整量カウンタ２３７にも探索範囲の設定を行う。第２段最小予測誤差電力判定部２３６は、内部に持つ最小予測誤差電力Ｄｍｉｎの値をＭＡＸＩＮＴに設定する。これは、図４９のＳ２２９に相当する。 First, search ranges are set in the rotation amount counter 233 and the scale amount counter 234 by the rotation amount search range instruction signal 221 and the scale amount search range instruction signal 222. Also, a search range is set in the translation fine adjustment amount counter 237 by the translation fine adjustment amount search range instruction signal 220. The second-stage minimum prediction error power determination unit 236 determines the minimum prediction error power D The value of min is set to MAX_INT. This corresponds to S229 in FIG.

２−２）回転量の探索以下、各ＭＶｈ［ｉ］／ＭＶｖ［ｉ］（０≦ｉ≦３）について同じ処理を繰り返すため、ＭＶｈ［０］／ＭＶｖ［０］のケースについてのみ説明する。スケール量Ｃｘ，Ｃｙ及び平行移動微調整量ｔｘ，ｔｙの値を固定し、回転量 θの値を探索範囲内で変化させ、以下のアフィン変換値ａｘ，ａｙを得る。2-2) Search for the amount of rotation For each MV, h[i]/MV Repeat the same process for v[i] (0≦i≦3), and then use MV h[0]/MV We will only explain the case of v[0]. The values of the scale amounts Cx and Cy and the translation fine-tuning amounts tx and ty are fixed, and the value of the rotation amount θ is varied within the search range to obtain the following affine transformation values ax and ay.

参照画像中の（ａｘ，ａｙ）における画素値ｆｒ（ａｘ，ａｙ）とｆ（ｘ，ｙ）との差分絶対値を求め、これをＤ（ａｘ，ａｙ）に累積していく。 The absolute difference between pixel values fr(ax, ay) at (ax, ay) in the reference image and f(x, y) is calculated and accumulated in D(ax, ay).

以上の処理は、図４７において、スケール量カウンタ２３４及び平行移動微調整量カウンタ２３７のカウント値を固定し、回転量カウンタ２３３のカウント値に応じて平行移動／回転／スケール量加算部２３５で式（４）のａｘ，ａｙを求め、メモリ読み出しアドレス生成部２３０を介してｆｒ（ａｘ，ａｙ）を算出するために必要な画素をフレームメモリ２１９から読み出し、次いで補間演算部２３１において、これらの画素からｆｒ（ａｘ，ａｙ）を算出して、パターンマッチング部２１３において被予測画素値ｆ（ｘ，ｙ）との差分絶対値を求める動作によって実行される。図４９では、Ｓ２３１〜Ｓ２３４に相当する。The above processing is performed by fixing the count values of the scale amount counter 234 and the translation fine adjustment amount counter 237 in Fig. 47, determining ax and ay in equation (4) in the translation/rotation/scaling amount adder 235 according to the count value of the rotation amount counter 233, reading the pixels necessary to calculate fr(ax, ay) from the frame memory 219 via the memory read address generator 230, then calculating fr(ax, ay) from these pixels in the interpolation calculation unit 231, and calculating the absolute difference between the calculated value and the predicted pixel value f(x, y) in the pattern matching unit 213. This corresponds to steps S231 to S234 in Fig. 49.

以上の処理を回転量探索範囲全域に渡って行い、第２段最小予測誤差電力判定部２３６において、回転量探索範囲内での最小予測誤差を与える回転量θが決定される。The above process is performed over the entire rotation search range, and the second-stage minimum prediction error power determination unit 236 determines the rotation θ that results in the minimum prediction error within the rotation search range.

２−３）スケール量の探索回転量の探索と同様、平行移動微調整量カウンタ２３７のカウント値を固定し、回転量として２−２）で決定された回転量θを式（４）に代入して、スケール量Ｃｘ，Ｃｙの値を探索範囲内で変化させ、式（４）のアフィン変換値ａｘ，ａｙを得る。2-3) Searching for the Scale Amount As with the search for the rotation amount, the count value of the translation fine adjustment amount counter 237 is fixed, and the rotation amount θ determined in 2-2) is substituted into equation (4) as the rotation amount. The values of the scale amounts Cx and Cy are varied within the search range to obtain the affine transformation values ax and ay of equation (4).

以下、回転量の探索と同様の処理を行って、Ｄ（ａｘ，ａｙ）を最小にするスケール量Ｃｘ，Ｃｙを得る。スケール量探索点のカウントは、スケール量カウンタ２３４が行う。Thereafter, a process similar to that for searching for the rotation amount is performed to obtain the scale amounts Cx and Cy that minimize D(ax, ay). The scale amount search points are counted by the scale amount counter 234.

２−４）平行移動微調整量の探索２−２）及び２−３）で決定された回転量θ／スケール量Ｃｘ，Ｃｙを用いて、平行移動微調整量ｔｘ，ｔｙの値を探索範囲内で変化させ、式（４）のアフィン変換値ａｘ，ａｙを得る。2-4) Search for the Fine Translation Adjustment Amount Using the rotation amount θ and scale amounts Cx and Cy determined in 2-2) and 2-3), the values of the fine translation adjustment amounts tx and ty are changed within the search range to obtain the affine transformation values ax and ay of equation (4).

以下、回転量／スケール量の探索と同様の処理を行う。平行移動微調整量探索点のカウントは、平行移動微調整量カウンタ２３７が行う。ただし、ｔｘ，ｔｙは半画素精度で探索されるので、パターンマッチング部２１３に送られる前に必要に応じて半画素生成部２３２において半画素値が計算される。半画素値の計算は、図５０に示すように、整数画素との空間的な位置関係に基づいて以下の式（５）のように計算される。ただし、ｘ，ｙは共に０から計数し、整数画素位置は共にＥＶＥＮとする。The following process is similar to the rotation/scale search. The translation fine adjustment amount search Point counting is performed by the translation fine adjustment amount counter 237. However, since tx and ty are searched with half-pixel accuracy, half-pixel values are calculated as necessary in the half-pixel generation unit 232 before being sent to the pattern matching unit 213. The half-pixel values are calculated using the following equation (5) based on the spatial relationship with integer pixels, as shown in Figure 50. However, both x and y are counted from 0, and the integer pixel positions are both even.

以上で、図４９の処理フローを終了する。 This completes the processing flow of FIG.

２−５）最終アフィン動きパラメータの決定すべてのＭＶｈ［ｉ］／ＭＶｖ［ｉ］について、上記２−２）から２−４）のパラメータ探索を行った結果得られるθ［ｉ］，Ｃｘ［ｉ］，Ｃｙ［ｉ］，ｔｘ［ｉ］，ｔｙ［ｉ］を用いて得られる予測画像との予測誤差を求め、最も小さい誤差値を与える領域位置ｉと、そのパラメータセットを最終的な探索結果とする。これは図４８におけるＳ２２５〜Ｓ２２８に相当する。2-5) Final affine motion parameter determination All MVs h[i]/MV For v[i], the prediction error between v[i] and the predicted image obtained using θ[i], Cx[i], Cy[i], tx[i], and ty[i] obtained as a result of the parameter search of 2-2) to 2-4) above is calculated, and the region position i that gives the smallest error value and its parameter set are determined as the final search results. This corresponds to S225 to S228 in Figure 48.

以上のように、アフィン動きパラメータ探索は非常に多くの処理過程を要するだけでなく、探索の際の演算負荷も大きい。As mentioned above, affine motion parameter search not only requires a large number of processing steps, but also imposes a large computational load during the search.

図５１は、回転量及びスケール量を探索する過程で生じる非整数画素値の算出方法、即ち補間演算部２３１におけるｆｒ（ａｘ，ａｙ）の算出方法を示す図である。Figure 51 shows how to calculate non-integer pixel values that arise during the process of searching for rotation and scale amounts, i.e., how the interpolation calculation unit 231 calculates fr(ax, ay).

図において、○は画像の実標本点、●は演算によって生成する画素値である。In the figure, circles represent actual sample points of the image, and black circles represent pixel values generated by calculation.

ｆｒ（ａｘ，ａｙ）は、参照画像上で計算した、以下の式（６）のＩハット（ｘ，ｙ）（ただし、ｘ＝ａｘ，ｙ＝ａｙ）の値で表わされる。fr(ax, ay) is expressed as the value of I(x, y) (where x = ax, y = ay) in the following equation (6), calculated on the reference image.

即ち、アフィン動きパラメータ探索では、画素間マッチングをとり誤差電力最小のものを選ぶため、上記５パラメータを変化させるたびに予測画像候補となる画像領域を再度、生成しなければならない。回転やスケーリングは非整数画素値を発生させるので、式（６）の演算が探索処理中何度も繰り返される。これにより、アフィン動きパラメータ探索処理は非常に負荷が大きく、時間もかかるものとなる。 That is, in the affine motion parameter search, pixel-to-pixel matching is performed to select the one with the smallest error power, so that the image region serving as the predicted image candidate must be generated again each time the above five parameters are changed. Because rotation and scaling generate non-integer pixel values, the calculation of equation (6) is repeated many times during the search process. This makes the affine motion parameter search process very time-consuming and burdensome.

単純な拡大または縮小画像に対してマッチングを行って動き補償を得る方法として、特開平６−１５３１８５号公報に示された動き補償装置及びこれを用いた符号化装置が開示されている。これは参照画像となるフレームメモリ中の画像を、間引き回路または補間回路を設けて、画像を縮小または拡大した後、動きベクトルを検出する構成となっている。この構成では、アフィン変換のような複雑な演算はしないが、参照画像から固定ブロックを取り出して補間または間引き演算をしている。つまり、固定の画面領域を切り出し、予め設定した処理を施して後、入力画像と比較するので、処理の内容が固定的で事実上単純な拡大、縮小等に限定される。Japanese Patent Application Laid-Open No. 6-153185 discloses a motion compensation device and an encoding device using the same, as a method of performing matching on a simple enlarged or reduced image to obtain motion compensation. This device uses a reference image stored in a frame memory, which is then enlarged or reduced using a thinning or interpolation circuit, and motion vectors are then detected. This configuration does not perform complex operations such as affine transformations, but instead extracts fixed blocks from the reference image and performs interpolation or thinning operations. In other words, a fixed screen area is extracted, subjected to predetermined processing, and then compared with the input image. Therefore, the processing is fixed and effectively limited to simple enlargement or reduction.

従来の画像符号化装置の動き予測方式は、以上のように構成され動作する。The motion prediction method of the conventional image coding device is configured and operates as described above.

従って、第１の従来例においては、予測画像領域の形成は参照画面の切り出し領域を平行移動して行うので単純な、平行移動の動きしか予測できず、回転や拡大、縮小、カメラのズーミングなど、平行移動以外の移動の場合には、性能劣化が激しいという課題がある。Therefore, in the first conventional example, the predicted image area is formed by translating the extracted area of the reference image, so only simple translational movements can be predicted. There is a problem in that performance deteriorates significantly when movements other than translational movements, such as rotation, enlargement, reduction, and camera zooming, occur.

一方、第２の例においては、予測画像の生成をアフィン変換によって行っているため、回転など、予測可能対象の種類は多くなるが、演算の処理が複雑になり、装置規模が大きくなるという課題がある。On the other hand, in the second example, the predicted image is generated using an affine transformation, which increases the number of predictable objects, such as rotation, but also increases the complexity of the calculation process and the size of the device.

以上、処理を単純化すれば予測しきれない場合が多く、アフィン変換を用いれば予測できる場合が増えるものの、処理が大変になるというジレンマがあった。As mentioned above, simplifying the process often results in incomplete predictions, while using affine transformations increases the number of predictable cases, making the process more difficult.

復号処理については、従来装置の構成を保ちながら複雑な処理を行うものの具体的な提案がなかった。Regarding the decoding process, there were no concrete proposals for performing complex processing while maintaining the configuration of the conventional device.

発明の開示この発明は、上記のような課題を解消するためになされたもので、参照画面上の既存の画素、もしくは簡易なフィルタ処理によって得られた画素を用いて、予測すべき画像領域とは異なる形状もしくは異なる大きさの予測画像領域を形成して、比較的簡単な処理によって、様々な種類の動きや時間変化に対応できる画像の動き補償予測方式を用いた符号化装置を得ることを目的とする。Disclosure of the Invention This invention was conceived to address the above-mentioned problems. It aims to provide a coding device using a motion compensation prediction method that can accommodate various types of motion and temporal changes with relatively simple processing by using existing pixels on a reference screen or pixels obtained through simple filtering to form a prediction image area of a different shape or size from the image area to be predicted.

また、比較的簡単な符号化処理に対応する復号を行う復号装置と、同様構成で精密でよりスムーズな動きを再現する画像復号装置を得ることを目的とする。Another object of the present invention is to provide a decoding device that performs decoding corresponding to a relatively simple encoding process, and an image decoding device with a similar configuration that reproduces precise and smoother motion.

この発明に係る画像符号化装置は、入力画像を所定のブロックに分割して、このブロックのフレーム間の動き検出による動き補償予測手段を備えて、入力画像を圧縮符号化する構成であって、動き検出用の参照画像の対応する部分領域に存在する実標本点である整数画素のみを所定の形式に変形化して座標指定して抽出し、入力画像の上記ブロックの整数画素と比較する変形ブロックマッチング部を含んで上記座標指定して抽出した最小誤差を与える動きベクトルを出力する動き検出部と、この変形ブロックマッチング部を含めた比較出力から得られる動きパラメータに従って参照画像のブロックを対応し座標指定して変形して決める対応点決定部を含んで予測部分画像を出力する動き補償部とを備えた。The image coding device according to the present invention is configured to divide an input image into predetermined blocks, and includes motion compensation prediction means for detecting inter-frame motion between these blocks, and compression-encodes the input image. The motion detection unit includes a modified block matching unit that transforms only integer pixels, which are actual sample points present in a corresponding partial region of a reference image for motion detection, into a predetermined format, specifies coordinates, extracts them, and compares them with the integer pixels of the block in the input image, outputting the motion vector extracted using the specified coordinates that provides the minimum error; and a motion compensation unit that includes a corresponding point determination unit that transforms corresponding blocks of the reference image using the specified coordinates in accordance with motion parameters obtained from the comparison output including the modified block matching unit, and outputs a predicted partial image.

また更に、変形ブロックマッチング部は、参照画像の部分領域の所定の形式の変形化に際しては、整数画素と、整数画素の中点である半画素とを使用して変形化するようにした。Furthermore, when transforming a partial region of the reference image in a predetermined format, the transformation block matching unit performs the transformation using integer pixels and half pixels that are the midpoints of the integer pixels.

また更に、入力画像を符号化の対象領域として画像オブジェクトの部分領域に分離する前処理部を付加して、分離したこれら各画像オブジェクトをブロックに分割して動き検出及び動き補償をするようにした。Furthermore, a pre-processing unit is added that separates the input image into subregions of image objects as target regions for encoding, and each of these separated image objects is divided into blocks for motion detection and motion compensation.

また更に、変形ブロックマッチング部及び対応点決定部は、整数画素または半画素を座標指定する際に、隣接または所定数倍した隣接点を座標指定して抽出して比較する変形ブロックマッチング部と、同様に参照画像を処理して出力する対応点決定部とした。Furthermore, the modified block matching unit and corresponding point determination unit are configured such that, when specifying the coordinates of integer pixels or half pixels, the modified block matching unit specifies the coordinates of adjacent points or adjacent points that are a predetermined multiple of the coordinates, extracts them, and compares them, and the corresponding point determination unit similarly processes and outputs a reference image.

また更に、変形ブロックマッチング部及び対応点決定部は、整数画素または半画素を所定の角度方向に回転した座標指定をして抽出し比較する変形ブロックマッチング部と、同様に参照画像を処理して出力する対応点決定部とした。Furthermore, the modified block matching unit and corresponding point determination unit are configured as follows: the modified block matching unit extracts and compares coordinates designated by rotating integer pixels or half pixels in a predetermined angle direction, and the corresponding point determination unit similarly processes and outputs a reference image.

所定の角度方向の回転は、正負４５度、９０度、１３５度または１８０度とした。The rotation in the predetermined angular direction was plus or minus 45 degrees, 90 degrees, 135 degrees, or 180 degrees.

また更に、変形ブロックマッチング部及び対応点決定部は、平行移動後の参照画像の部分領域が示す領域を探索して、この探索領域を拡大または縮小、または所定の角度方向の回転を組み合わせて動かして比較する変形ブロックマッチング部と、同様に参照画像を処理して出力する対応点決定部とした。Furthermore, the deformation block matching unit and the corresponding point determination unit are configured as follows: The deformation block matching unit searches for the area indicated by the partial area of the reference image after translation, and then enlarges or reduces this search area, or moves it by combining rotations in a predetermined angular direction, and compares it; The corresponding point determination unit similarly processes and outputs the reference image.

また更に、変形ブロックマッチング部は、参照画像の部分領域を変形加工して比較するための変形パターンテーブルを備え、この変形パターンテーブルから引き出された変換値に基づく部分領域の画像を入力画像のブロックの整数画素または半画素と比較する変形ブロックマッチング部とし、対応点決定部も、同様に参照画像を処理して出力する対応点決定部とした。Furthermore, the modified block matching unit is provided with a deformation pattern table for modifying and processing a partial region of the reference image for comparison, and the image of the partial region based on the transformation values extracted from this deformation pattern table is compared with integer pixels or half pixels of the block of the input image. Similarly, the corresponding point determination unit is a corresponding point determination unit that processes the reference image and outputs the result.

また更に、変形ブロックマッチング部は、対応評価のために抽出された参照画像の特定画素を選択的にフィルタ処理をして比較するようにした。Furthermore, the modified block matching unit selectively filters specific pixels of the reference image extracted for correspondence evaluation and performs comparison.

動き検出のためのフレームは、時間的に前のまたは後ろのフレームとし、参照画像は、上記時間的に前のまたは後ろのフレームを記憶して入力画像と比較するようにした。The frame for motion detection is the previous or next frame in time, and the reference image is the previous or next frame stored and compared with the input image.

この発明に係る画像復号装置は、動き補償手段を備えた入力情報の画像圧縮符号を伸張再生する構成であって、入力情報中の動きパラメータを抽出して動きの方向と量を表す動きベクトルと、変形処理の指示内容を表す変形パターン情報とを得るエントロピー復号部と、このエントロピー復号部出力の動きパラメータにより、フレームに対応して記憶されている参照画像の部分領域の整数画素の座標値を入力情報中の変形パターン情報に基づいて所定の形式に変形処理して得られた画素値で入力の被予測画像に加算するための画像を生成する動き補償手段とを備えた。An image decoding device according to the present invention is configured to decompress and reproduce image compression codes of input information and includes a motion compensation means. The image decoding device includes an entropy decoding unit that extracts motion parameters from the input information to obtain motion vectors representing the direction and amount of motion and transformation pattern information representing instructions for transformation processing. The motion compensation means uses the motion parameters output by the entropy decoding unit to transform integer pixel coordinate values of a subregion of a reference image stored corresponding to a frame into a predetermined format based on the transformation pattern information in the input information, thereby generating an image to be added to the input predicted image using the resulting pixel values.

また更に、動き補償手段は、参照画像の半画素の座標値も用いて座標計算して、得られた画素値を所定の形式に変形処理するようにした。Furthermore, the motion compensation means calculates coordinates using coordinate values of half pixels of the reference image, and transforms the obtained pixel values into a predetermined format.

この発明に係る画像符号化方法は、入力のディジタル画像の圧縮符号化のために参照画像を記憶し所定ブロックに分割してフレーム間の動き検出をする動き補償予測手段を備えて、参照画像の部分領域の整数画素を所定の形式に変形化して座標指定して抽出し、予測部分画像を生成して入力画像の上記ブロックと比較する変形ブロックマッチングステップと、上記変形ブロックマッチングを含んで選ばれた最小誤差を与える動きベクトルから上記座標指定により上記部分領域を対応点決定して動き補償出力とする対応点決定ステップとを備えた。The image coding method according to the present invention includes a motion compensation prediction means for storing a reference image for compression coding of an input digital image, dividing the reference image into predetermined blocks, and detecting motion between frames. The method also includes a modified block matching step for transforming integer pixels of a partial region of the reference image into a predetermined format, extracting the transformed integer pixels by specifying coordinates, generating a predicted partial image, and comparing the resulting predicted partial image with the block of the input image. The method also includes a corresponding point determination step for determining corresponding points of the partial region by specifying the coordinates from a motion vector that provides the smallest error and is selected using the modified block matching, thereby producing a motion-compensated output.

また更に、変形ブロックマッチングステップは、参照基準として参照画像の部分領域の整数画素の他にその中点の半画素も加えて所定の形式に変形化し座標指定して抽出し、予測部分画像を生成して比較する変形ブロックマッチングステップとした。Furthermore, the modified block matching step is a step in which a partial region of the reference image is transformed into a predetermined format using not only integer pixels but also half pixels at its midpoint as a reference standard, and then the coordinates are specified and extracted to generate a predicted partial image for comparison.

また更に、参照画像の部分領域を変形加工する変形パターンテーブルを備えて、変形ブロックマッチングに際して、この変形パターンテーブルを参照して対応アドレスを読み出した変換値に基づく部分領域の画像を入力画像と比較する変形ブロックマッチングステップとした。Furthermore, a deformation pattern table is provided for deforming a partial region of the reference image, and during deformation block matching, the deformation pattern table is referenced, and the image of the partial region based on the transformation value read from the corresponding address is compared with the input image.

この発明に係る画像復号方法は、動き補償を行い入力情報の画像圧縮符号を伸張再生するために、入力情報中の動きパラメータを抽出して動きの方向と量を表す動きベクトルと、変形処理の指示内容を表す変形パターン情報とを得るエントロピー復号ステップと、このエントロピー復号ステップで得られた動きパラメータにより、フレームに対応して記憶されている参照画像の部分領域の整数画素の座標値を入力情報中の変形パターン情報に基づいて所定の形式に変形処理して得られた画素値で入力の被予測画像に加算するための画像を生成する動き補償ステップとを備えた。The image decoding method according to the present invention includes an entropy decoding step for extracting motion parameters from input information to obtain motion vectors representing the direction and amount of motion and transformation pattern information representing transformation processing instructions, in order to perform motion compensation and decompress and reproduce the image compression code of input information; and a motion compensation step for generating an image to be added to the input predicted image using pixel values obtained by transforming integer pixel coordinate values of a partial region of a reference image stored corresponding to a frame into a predetermined format based on the transformation pattern information in the input information, using the motion parameters obtained in the entropy decoding step.

また更に、動き補償ステップは、参照画像の半画素の座標値も用いて座標計算して、得られた画素値を所定の形式に変形処理するようにした。Furthermore, the motion compensation step also uses coordinate values of half pixels of the reference image to perform coordinate calculations, and transforms the obtained pixel values into a predetermined format.

この発明に係る画像符号化復号システムは、入力画像を所定のブロックに分割して、上記入力画像を圧縮符号化するため、動き検出用の参照画像の対応する部分領域に存在する実標本点である整数画素のみを所定の形式に変形化して座標指定して抽出し、入力画像の上記ブロックの整数画素と比較する変形ブロックマッチング部を含んで、上記座標指定して抽出した最小誤差を与える動きベクトルを出力する動き検出部と、上記変形ブロックマッチング部を含めた比較出力から得られる動きパラメータに従って上記参照画像のブロックを対応して変形し座標指定して決める対応点決定部を含んで予測部分画像を出力する動き補償部とを備えた画像符号化装置と、フレーム間の動き検出による動き補償予測手段を備えて入力情報の画像圧縮符号を伸張再生するため、動き補償予測手段には、入力情報中の動きパラメータに基づいて対応する部分領域の予め用意された整数画素を所定の形式に座標指定して抽出する機構を備え、上記所定の形式に処理した部分領域の画像信号を出力加算するようにした画像復号装置、とで構成される。An image coding/decoding system according to the present invention comprises an image coding device that divides an input image into predetermined blocks and, in order to compress and encode the input image, includes a motion estimation unit that includes a modified block matching unit that transforms only integer pixels, which are actual sample points present in corresponding partial regions of a reference image for motion estimation, into a predetermined format, specifies coordinates, and extracts them, and compares them with the integer pixels of the blocks of the input image, outputting a motion vector that provides the minimum error extracted using the specified coordinates; a motion compensation unit that includes a corresponding point determination unit that transforms corresponding blocks of the reference image in accordance with motion parameters obtained from the comparison output including the modified block matching unit, and determines the corresponding points using specified coordinates, and outputs a predicted partial image; and an image decoding device that includes motion compensation prediction means that detects motion between frames and, in order to decompress and reproduce image compression codes of input information, the motion compensation prediction means includes a mechanism that specifies coordinates and extracts pre-prepared integer pixels of corresponding partial regions based on the motion parameters in the input information, and outputs and adds the image signals of the partial regions processed in the predetermined format.

図面の簡単な説明図１は、本発明の画像符号化装置の基本構成図である。BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 shows the basic configuration of the image encoding device of the present invention.

図２は、図１中の動き検出部８の内部構成図である。FIG. 2 is a diagram showing the internal configuration of the motion detection unit 8 in FIG.

図３は、図２の構成による動き検出部８の動作を表すフローチャート図である。FIG. 3 is a flowchart showing the operation of the motion detection unit 8 in the configuration of FIG. 2.

図４は、実施の形態１における変形ブロックマッチング部２１における動作の概要を説明する図である。FIG. 4 is a diagram illustrating an overview of the operation of the modified block matching unit 21 in embodiment 1.

図５は、変形ブロックマッチング部２１の内部構成図である。FIG. 5 is a diagram showing the internal configuration of the modified block matching unit 21. As shown in FIG.

図６は、変形ブロックマッチング部２１の動作を表すフローチャート図である。FIG. 6 is a flowchart showing the operation of the modified block matching unit 21.

図７は、図１中の動き補償部９の内部構成図である。FIG. 7 is a diagram showing the internal configuration of the motion compensation unit 9 in FIG.

図８は、動き補償部９の動作の様子を表すフローチャート図である。FIG. 8 is a flowchart showing the operation of the motion compensation unit 9.

図９は、前処理部２による画像オブジェクトの分離動作を説明する図である。FIG. 9 is a diagram illustrating the image object separation operation performed by the preprocessing unit 2. In FIG.

図１０は、実施の形態２における動き検出部８ｂの別の内部構成図である。FIG. 10 is another internal configuration diagram of the motion detection unit 8b according to the second embodiment.

図１１は、実施の形態３における動き検出部８ｃの内部構成図である。FIG. 11 is a diagram showing the internal configuration of the motion detection unit 8c according to the third embodiment.

図１２は、変形ブロックマッチング部４２における動作の概要を説明する図である。FIG. 12 is a diagram illustrating an overview of the operation of the modified block matching unit 42.

図１３は、変形ブロックマッチング部４２の内部構成図である。FIG. 13 is a diagram showing the internal configuration of the modified block matching unit 42.

図１４は、変形ブロックマッチング部４２の動作を表すフローチャート図である。FIG. 14 is a flowchart illustrating the operation of the modified block matching unit 42.

図１５は、実施の形態４における変形ブロックマッチング部４２ｂの動作の概要を説明する図である。FIG. 15 is a diagram illustrating an overview of the operation of the modified block matching unit 42b in embodiment 4.

図１６は、実施の形態４における変形ブロックマッチング部４２ｂの内部構成図である。FIG. 16 shows the internal configuration of the modified block matching unit 42b in the fourth embodiment.

図１７は、変形ブロックマッチング部４２ｂの動作を表すフローチャート図である。FIG. 17 is a flowchart illustrating the operation of the modified block matching unit 42b.

図１８は、実施の形態４における他の変形ブロックマッチングを説明する図である。FIG. 18 illustrates another modified block matching method according to the fourth embodiment.

図１９は、実施の形態４における他の変形ブロックマッチングを説明する図である。FIG. 19 illustrates another modified block matching method according to the fourth embodiment.

図２０は、実施の形態５における対応点決定部３４の別の内部構成図である。FIG. 20 is another internal configuration diagram of the corresponding point determination unit 34 according to the fifth embodiment.

図２１は、実施の形態６における変形ブロックマッチングを説明する図である。FIG. 21 illustrates modified block matching in embodiment 6.

図２２は、実施の形態６における予測画像を構成する整数画素に対して施すフィルタリングの例を示した図である。FIG. 22 illustrates an example of filtering applied to integer pixels constituting a predicted image in the sixth embodiment.

図２３は、変形ブロックマッチング部４２ｃの動作の概要を説明する図である。Figure 23 is a diagram illustrating the operation of the modified block matching unit 42c.

図２４は、変形ブロックマッチング部４２ｃの内部構成図である。FIG. 24 is a diagram showing the internal configuration of the modified block matching unit 42c.

図２５は、変形ブロックマッチング部４２ｃの動作を表すフローチャート図である。FIG. 25 is a flowchart illustrating the operation of the modified block matching unit 42c.

図２６は、実施の形態６における動き補償部９ｂの内部構成図である。FIG. 26 is a diagram showing the internal configuration of the motion compensation unit 9b according to the sixth embodiment.

図２７は、実施の形態６における動き補償部９ｂの動作を表すフローチャート図である。FIG. 27 is a flowchart showing the operation of the motion compensation unit 9b in the sixth embodiment.

図２８は、実施の形態７における画像復号装置の構成を示した図である。FIG. 28 is a diagram showing a configuration of an image decoding apparatus according to the seventh embodiment.

図２９は、実施の形態７における動き補償部９の内部構成図である。FIG. 29 is a diagram showing the internal configuration of the motion compensation unit 9 according to the seventh embodiment.

図３０は、図２９の動き補償部９の動作を表すフローチャート図である。FIG. 30 is a flowchart showing the operation of the motion compensation unit 9 in FIG.

図３１は、図２９の動き補償部９が行う座標点移動動作を説明する図である。FIG. 31 is a diagram for explaining the coordinate point moving operation performed by the motion compensation unit 9 in FIG.

図３２は、図２９の動き補償部９が行う変形処理の例を説明する図である。FIG. 32 is a diagram illustrating an example of the transformation process performed by the motion compensation unit 9 in FIG.

図３３は、座標点演算が半画素を求める演算を示す図である。FIG. 33 is a diagram showing a coordinate point calculation for obtaining a half pixel.

図３４は、変形処理が回転拡大である場合の動作を説明する図である。Figure 34 illustrates the operation when the transformation process is rotation and enlargement.

図３５は、実施の形態８における画像復号装置の構成を示す図である。FIG. 35 is a diagram showing a configuration of an image decoding apparatus according to the eighth embodiment.

図３６は、実施の形態８における動き補償部９０の内部構成図である。FIG. 36 is a diagram showing the internal configuration of the motion compensation unit 90 according to the eighth embodiment.

図３７は、図３６の動き補償部９０の動作を表すフローチャート図である。FIG. 37 is a flowchart showing the operation of the motion compensation unit 90 in FIG.

図３８は、図３６の動き補償部９０が行う変形処理の例を説明する図である。FIG. 38 is a diagram illustrating an example of the transformation process performed by the motion compensation unit 90 in FIG.

図３９は、図３６の動き補償部９０が行う座標点演算の例を示す図である。FIG. 39 is a diagram showing an example of coordinate point calculation performed by the motion compensation unit 90 in FIG.

図４０は、実施の形態９における動き補償部中の対応点決定部３７ｃの動作を表すフローチャート図である。FIG. 40 is a flowchart illustrating the operation of the corresponding point determination unit 37c in the motion compensation unit according to the ninth embodiment.

図４１は、実施の形態９における動き補償部が行う変形処理の例を説明する図である。FIG. 41 illustrates an example of the transformation process performed by the motion compensation unit in the ninth embodiment.

図４２は、従来例１のブロックマッチングによる動き補償予測の概念を説明する図である。FIG. 42 is a diagram illustrating the concept of motion compensation prediction using block matching according to Conventional Example 1.

図４３は、従来例１の画像符号化装置の動き補償予測部（ブロックマッチング部）の構成を示す図である。FIG. 43 shows the configuration of the motion compensation prediction unit (block matching unit) of the image coding device of Prior Art 1.

図４４は、従来例１の動き補償予測部の動作を表すフローチャート図である。FIG. 44 is a flowchart showing the operation of the motion compensation prediction unit of the first conventional example.

図４６は、従来例２のアフィン変換を用いた動き補償予測の概念を説明する図である。FIG. 46 illustrates the concept of motion compensation prediction using affine transformation in Conventional Example 2.

図４７は、従来例２のアフィン変換を用いた動き補償予測部の構成を示す図である。FIG. 47 shows the configuration of a motion compensation prediction unit using affine transformation in Conventional Example 2.

図４８は、従来例２の動き補償予測部の動作を表すフローチャート図である。FIG. 48 is a flowchart showing the operation of the motion compensation prediction unit of the second conventional example.

図４９は、図４８中のアフィン動きパラメータ検出ステップの詳細を示すフローチャート図である。FIG. 49 is a flowchart showing details of the affine motion parameter detection step of FIG.

図５０は、半画素生成部２３２における半画素値の計算方法を説明する図である。FIG. 50 is a diagram illustrating a method for calculating half-pixel values in the half-pixel generation unit 232.

図５１は、補間演算部２３１における回転／スケール量の探索ステップで生じる非整数画素値の算出方法を説明する図である。FIG. 51 is a diagram illustrating a method for calculating non-integer pixel values that occur in the rotation/scale amount search step in the interpolation calculation unit 231.

発明を実施するための最良の形態実施の形態１．本発明の符号化装置、復号装置は、具体的には衛星や地上波、優先通信網を介して行うディジタル画像伝送システム、ディジタル画像記録装置、ディジタル画像蓄積データベースと検索・閲覧システムなどに使用される。BEST MODE FOR CARRYING OUT THE INVENTION Embodiment 1. The encoding and decoding devices of the present invention are specifically used in digital image transmission systems via satellite, terrestrial, or wired communication networks, digital image recording devices, digital image storage databases, and search/viewing systems.

図１は、画像符号化装置の基本構成図である。FIG. 1 is a diagram showing the basic configuration of an image encoding device.

図において、１は入力ディジタル画像信号、２は前処理部、３及び１３はイントラ（フレーム内）／インター（フレーム間）符号化選択部、４は直交変換部、５は量子化部、６は逆量子化部、７は逆直交変換部、８は動き検出部、９は動き補償部、１０はフレームメモリ（参照画像）、１１は動きベクトルを含む動きパラメータ、１２は予測画像データ、１４は符号化制御部、１５は強制モード指示フラグ、１６はイントラ／インター符号化指示フラグ、１７は量子化ステップ・パラメータ、１８はエントロピー符号化部、１９は圧縮画像データである。本発明の重要な要素は動き検出部８及び動き補償部９である。In the figure, reference numeral 1 denotes an input digital image signal, 2 denotes a preprocessing unit, 3 and 13 denote intra (within-frame)/inter (between-frame) coding selection units, 4 denotes an orthogonal transform unit, 5 denotes a quantization unit, 6 denotes an inverse quantization unit, 7 denotes an inverse orthogonal transform unit, 8 denotes a motion estimation unit, 9 denotes a motion compensation unit, 10 denotes a frame memory (reference image), 11 denotes motion parameters including motion vectors, 12 denotes predicted image data, 14 denotes a coding control unit, 15 denotes a forced mode indication flag, 16 denotes an intra/inter coding indication flag, 17 denotes a quantization step parameter, 18 denotes an entropy coding unit, and 19 denotes compressed image data. The motion estimation unit 8 and motion compensation unit 9 are key elements of the present invention.

以下、発明の実施の形態１における画像符号化装置の動作を説明する。The operation of the image coding device according to the first embodiment of the invention will now be described.

本装置は、カラー動画像系列の構成要素である各フレームの画像信号１を入力とし、入力画像信号１はディジタル化されて、前処理部２において前処理とフォーマット変換、ブロックデータへの切り出しを行う。本実施形態では、ここで切り出されるブロックデータは輝度信号成分及びそれに空間的に対応する色差信号成分のペアから構成されるものとし、以降、輝度成分を輝度ブロック、色差成分を色差ブロックと呼ぶ。This device receives as input image signals 1 for each frame, which are components of a color video sequence. The input image signals 1 are digitized, and then preprocessed, formatted, and segmented into block data in a preprocessing unit 2. In this embodiment, the segmented block data consists of a pair of a luminance signal component and its spatially corresponding color difference signal component. Hereinafter, the luminance component will be referred to as a luminance block, and the color difference component will be referred to as a color difference block.

次いで、各ブロックデータをフレーム内符号化するかフレーム間符号化するかをイントラ／インター符号化選択部３において決定する。イントラ（フレーム内）符号化が選択された場合は、前処理部２から出力された原画像データから構成されるブロックデータを直交変換部４に入力し、インター（フレーム間）符号化が選択された場合は、前処理部２から出力された原画像データと動き補償部９から出力される予測画像データ１２との差分から構成される予測誤差ブロックデータを直交変換部４に入力する。このイントラ／インター・フレーム符号化の選択は、符号化制御部１４からの強制モード指示フラグ１５によって強制的に行われることもある。選択された符号化モードは、イントラ／インター符号化指示フラグ１６としてエントロピー符号化部１８に送られ、符号化ビットストリーム１９に多重化される。Next, the intra/inter coding selection unit 3 determines whether each block of data is intraframe coded or interframe coded. If intraframe coding is selected, block data composed of the original image data output from the preprocessing unit 2 is input to the orthogonal transform unit 4. If interframe coding is selected, prediction error block data composed of the difference between the original image data output from the preprocessing unit 2 and the predicted image data 12 output from the motion compensation unit 9 is input to the orthogonal transform unit 4. This selection of intraframe/interframe coding may be forced by a forced mode instruction flag 15 from the coding control unit 14. The selected coding mode is sent to the entropy coding unit 18 as the intraframe/interframe coding instruction flag 16 and multiplexed into the coded bitstream 19.

直交変換部４には、例えば、離散コサイン変換（ＤＣＴ）などが用いられる。The orthogonal transform unit 4 uses, for example, a discrete cosine transform (DCT).

直交変換係数は、量子化部５において符号化制御部１４で算出された量子化ステップ・パラメータ１７を用いて量子化され、量子化後の直交変換係数はエントロピー符号化部１８で冗長度を削減した後、符号化ビットストリーム１９に多重される。同時に、逆量子化部６で逆量子化され、さらに逆直交変換部７で逆直交変換されて予測誤差信号が復元される。これに動き補償部９から出力される予測画像データ１２を加算して、局所復号画像が生成される。ただし、イントラ／インター符号化指示フラグ１６がイントラモードの場合は、符号化選択部１３で０信号が選択され、予測誤差信号の加算は行われない。局所復号画像は、次フレーム以降の動き補償予測の参照画像として用いるため、その内容がフレームメモリ１０に書き込まれる。The orthogonal transform coefficients are quantized in the quantization unit 5 using the quantization step parameter 17 calculated by the encoding control unit 14. The quantized orthogonal transform coefficients are then multiplexed into an encoded bitstream 19 after redundancy is reduced in the entropy encoding unit 18. At the same time, the coefficients are inversely quantized in the inverse quantization unit 6 and then inversely orthogonally transformed in the inverse orthogonal transform unit 7 to restore a prediction error signal. A locally decoded image is generated by adding predicted image data 12 output by the motion compensation unit 9 to this locally decoded image. However, if the intra/inter encoding instruction flag 16 indicates intra mode, the encoding selection unit 13 selects a 0 signal, and no prediction error signal is added. The locally decoded image is used as a reference image for motion-compensated prediction of the next frame and thereafter, and its contents are written to the frame memory 10.

以下、本実施の形態の装置の最も重要な要素の１つである動き補償予測の動作について説明する。Below we will explain the operation of motion compensation prediction, which is one of the most important elements of the device of this embodiment.

本実施形態においては、前処理部２において切り出されるブロックを動き補償予測における被予測ブロックとする。動き補償予測処理は、動き検出部８及び動き補償部９において行われ、動き検出部８において被予測ブロックの動きベクトルを含む動きパラメータ１１が検出され、動き補償部９が動きパラメータ１１を用いてフレームメモリ１０から予測画像データ１２を取り出す。動き検出処理は、輝度ブロックを用いて行い、色差ブロックの動き補償予測は輝度ブロックの動き検出結果を利用する。以下では、輝度ブロックの動き補償予測の動作に限定して説明する。In this embodiment, the block extracted by the preprocessing unit 2 is used as the block to be predicted in motion-compensated prediction. Motion-compensated prediction processing is performed by a motion estimation unit 8 and a motion compensation unit 9. The motion estimation unit 8 estimates motion parameters 11, including a motion vector for the block to be predicted, and the motion compensation unit 9 uses the motion parameters 11 to extract predicted image data 12 from a frame memory 10. The motion estimation processing is performed using the luminance block, and motion-compensated prediction of the chrominance block utilizes the motion estimation results for the luminance block. The following explanation is limited to the operation of motion-compensated prediction of the luminance block.

まず、動き検出処理から説明する。First, the motion detection process will be described.

動き検出処理は、動き検出部８で行われる。動き検出部８は、参照画像中の所定の範囲内で被予測ブロックの輝度ブロックに最も類似する領域を探索し、被予測ブロックの画面内位置からの変化を表すパラメータを検出する。従来例で述べたブロックマッチングでは、被予測ブロックの輝度ブロックに最も類似するブロックを探索し、被予測ブロックの画面内位置からの平行移動量を動きベクトルとして検出する。The motion detection process is performed by motion detection unit 8. Motion detection unit 8 searches within a specified range of the reference image for an area that is most similar to the luminance block of the block to be predicted, and detects parameters that represent the change in the block from its on-screen position. In the conventional block matching, a block that is most similar to the luminance block of the block to be predicted is searched for, and the amount of translation from the on-screen position of the block to be predicted is detected as a motion vector.

本実施形態の動き検出部８は、従来の正方ブロックに基づくブロックマッチングと、後述する変形ブロックを用いたブロックマッチングの両方を実行し、より予測精度の高い方を選択する構成をとる。The motion estimation unit 8 of this embodiment is configured to perform both conventional block matching based on square blocks and block matching using modified blocks, which will be described later, and select the one with the higher prediction accuracy.

以下、本実施の形態における動き検出部８の動作を説明する。The operation of the motion detection unit 8 in this embodiment will now be described.

図２は、図１中の動き検出部８の詳細構成図、図３は、その動作の様子を示すフローチャートである。FIG. 2 is a detailed block diagram of the motion detection unit 8 in FIG. 1, and FIG. 3 is a flowchart showing its operation.

図２において、２０はブロックマッチング部、２１は変形ブロックマッチング部、２２は動き補償予測モード判定部、２３は変形ブロックマッチングによる動きベクトル、２４は変形ブロックマッチングによる最小予測誤差値、２５は最終動きベクトル、２６は動き補償予測モード信号である。最終動きベクトル２５及び動き補償予測モード信号２６をひとまとめで表現したものが動きパラメータ１１であるとする。In FIG. 2, 20 denotes a block matching unit, 21 denotes a modified block matching unit, 22 denotes a motion compensation prediction mode determination unit, 23 denotes a motion vector obtained by modified block matching, 24 denotes a minimum prediction error value obtained by modified block matching, 25 denotes a final motion vector, and 26 denotes a motion compensation prediction mode signal. The final motion vector 25 and the motion compensation prediction mode signal 26 are collectively represented as motion parameters 11.

ブロックマッチング部２０の内部構成及び動作フローチャートは、従来例で示した図４３及び図４４と同様である。また、図３において、ＤＢＭはブロックマッチングによる最小予測誤差電力の値、ＤＤＥＦは変形ブロックマッチングによる最小予測誤差電力の値を表す。 The internal configuration and operation flowchart of the block matching unit 20 are the same as those shown in FIGS. 43 and 44 in the conventional example. BM is the minimum prediction error power value by block matching, D DEF represents the value of the minimum prediction error power by modified block matching.

図４は、本発明の最重要部位である変形ブロックマッチング部２１における動作の概要説明図、図５は、変形ブロックマッチング部２１の詳細な内部構成図、図６は、変形ブロックマッチング部２１の動作を示すフローチャートである。Figure 4 is a diagram outlining the operation of the modified block matching unit 21, which is the most important part of this invention. Figure 5 is a detailed diagram of the internal configuration of the modified block matching unit 21. Figure 6 is a flowchart showing the operation of the modified block matching unit 21.

図５において、２９は水平方向平行移動量探索範囲指示信号、３０は垂直方向平行移動量探索範囲指示信号、３１は水平方向移動量カウンタ、３２は垂直方向移動量カウンタ、３３は新要素である回転量カウンタ、３４は同じく新要素である対応点決定部、３５はメモリ読み出しアドレス生成部である。パターンマッチング部２１３、最小予測誤差電力判定部２１６は、図４７に示す構成の対応要素と同一の動作を行う。In Figure 5, 29 denotes a horizontal translation amount search range indication signal, 30 denotes a vertical translation amount search range indication signal, 31 denotes a horizontal translation amount counter, 32 denotes a vertical translation amount counter, 33 denotes a rotation amount counter (a new element), 34 denotes a corresponding point determination unit (also a new element), and 35 denotes a memory read address generation unit. The pattern matching unit 213 and minimum prediction error power determination unit 216 perform the same operations as their corresponding elements in the configuration shown in Figure 47.

また、図６において、ｄｘは、水平方向平行移動量探索点、ｄｙは、垂直方向平行移動量探索点、ｒａｎｇｅｈｍｉｎは、水平方向探索範囲下限値、ｒａｎｇｅｈｍａｘは、水平方向探索範囲上限値、ｒａｎｇｅｖｍｉｎは、垂直方向探索範囲下限値、ｒａｎｇｅｖｍａｘは、垂直方向探索範囲上限値、Ｄｍｉｎは、最小予測誤差電力、Ｄ（ｄｘ，ｄｙ）は、ｄｘ，ｄｙ探索時の予測誤差電力（ｘ，ｙ）は、被予測ブロック内画素位置、（ｒｘ，ｒｙ）は、（ｘ，ｙ）に対する参照画像中の対応点、（ｒｄｘ，ｒｄｙ）は、回転パラメータ、Ｄ（ｄｘ，ｄｙ）は、ｄｘ，ｄｙ探索時の（ｘ，ｙ）における予測誤差、ｆ（ｘ，ｙ）は、被予測画像中の画素（ｘ，ｙ）の値、ｆｒ（ｘ，ｙ）は、参照画像中の画素（ｘ，ｙ）の値、ＭＶｈは、動きベクトル水平成分、ＭＶｖは、動きベクトル垂直成分、ｉｘは、水平方向オフセット値（定数）、ｉｙは、垂直方向オフセット値（定数）、ｂｌｏｃｋｓｉｚｅは、被予測ブロックサイズである。 In addition, in FIG. 6, dx is a horizontal translation amount search point, dy is a vertical translation amount search point, and range h min is the lower limit of the horizontal search range, h max is the upper limit of the horizontal search range, v min is the lower limit of the vertical search range, v max is the upper limit of the vertical search range, min is the minimum prediction error power, D(dx, dy) is the prediction error power during dx, dy search, (x, y) is the pixel position in the block to be predicted, (rx, ry) is the corresponding point in the reference image for (x, y), (rdx, rdy) is the rotation parameter, D(dx, dy) is the prediction error at (x, y) during dx, dy search, f(x, y) is the value of pixel (x, y) in the image to be predicted, fr(x, y) is the value of pixel (x, y) in the reference image, MV h is the horizontal component of the motion vector, v is the vertical component of the motion vector, ix is the horizontal offset value (constant), iy is the vertical offset value (constant), block size is the predicted block size.

１）ブロックマッチングによる動きベクトルの検出ブロックマッチング部２０において、従来例で示した手順と動作で被予測ブロックに対する動きベクトルを求める。この結果、動きベクトル２１７、ブロックマッチング部２０における最小予測誤差電力ＤＢＭ２１８を得る。これは、図３におけるＳ１に相当する。1) Detection of motion vector by block matching In the block matching unit 20, a motion vector for a block to be predicted is calculated by the procedure and operation shown in the conventional example. As a result, the motion vector 217 and the minimum prediction error power D BM218 is obtained, which corresponds to S1 in FIG.

２）変形ブロックマッチングによる動きベクトルの検出次いで、変形ブロックマッチング部２１において、変形ブロックマッチングの処理を行う（図３のＳ２）。2) Motion Vector Detection by Modified Block Matching Next, the modified block matching unit 21 performs modified block matching (S2 in Figure 3).

以下、この動作についてさらに詳しく説明する。なお、以下の説明においては、整数画素を単位とする８×８画素ブロックを被予測ブロックとして説明を進める。This operation is explained in more detail below. Note that in the following explanation, the prediction block will be an 8x8 pixel block in integer pixel units.

２−１）処理概要変形ブロックマッチング部２１における処理の概要を図４に示す。2-1) Processing Overview Figure 4 shows an overview of the processing performed by the modified block matching unit 21.

同図において、被予測画像２７は、動き補償予測によって符号化される。例えば、前処理部２中にあるフレーム（ピクチャ）、参照画像２８は、被予測画像２７より以前に符号化されてフレームメモリ１０に蓄えられている局所復号フレーム（ピクチャ）画像とする。各画像内の○は、フレーム内に実際に存在する輝度信号の実標本点である整数画素を、×は実標本点間の中点画素である半画素を示す。被予測画像２７の８×８（整数画素）からなる部分領域を被予測ブロック（の輝度ブロック部分）とし、参照画像２８の□の画素からなるグループが予測画像候補の変形ブロックを構成するものとする。即ち、図１、図２の（）表示のフレームメモリ１０出力と前処理部２出力の一部が切り出されて、動き検出部８内の変形ブロックマッチング部２１で比較される。In the figure, the predicted image 27 is coded using motion-compensated prediction. For example, the frame (picture) in the preprocessing unit 2, the reference image 28, is a locally decoded frame (picture) image coded before the predicted image 27 and stored in the frame memory 10. The circles in each image represent integer pixels, which are actual sample points of the luminance signal that actually exist within the frame, and the crosses represent half-pixels, which are midpoints between the actual sample points. A subregion of 8x8 (integer pixels) in the predicted image 27 is considered the predicted block (the luminance block portion), and a group of square pixels in the reference image 28 constitutes the modified block of the predicted image candidate. That is, portions of the frame memory 10 output and the preprocessing unit 2 output, shown in parentheses in Figures 1 and 2, are extracted and compared in the modified block matching unit 21 in the motion estimation unit 8.

本実施の形態では、参照画像の輝度ブロックを右もしくは左４５度に回転させ、各辺のスケールを√２倍した、つまり、参照画像の大きさは被予測画像（入力画像）と比較する際は、１／√２の距離として、フレームの入力ディジタル画像１の水平垂直方向の標本点距離と合致させた領域を変形ブロックとして定義する。この領域は、参照画像２８の整数画素間隔の画素点のみから構成されることに特徴がある。即ち、本実施形態における変形ブロックマッチングは、与えられた探索範囲内で、図４に示す８×８整数画素からなる被予測ブロックの輝度ブロックに、最も類似した同図の変形ブロック領域を参照画像２８中から見つける処理に相当する。In this embodiment, the luminance block of the reference image is rotated 45 degrees right or left, and each side is scaled by √2. In other words, when comparing the size of the reference image with the predicted image (input image), the distance is set to 1/√2. The region that matches the horizontal and vertical sampling point distance of the input digital image 28 is defined as a modified block. This region is characterized by being composed only of pixel points in the reference image 28 spaced at integer pixel intervals. In other words, modified block matching in this embodiment corresponds to the process of finding, within a given search range, the modified block region shown in Figure 4 that most closely resembles the luminance block of the predicted block, which consists of 8x8 integer pixels.

２−２）初期設定（探索範囲の設定、初期値の設定）被予測ブロックと予測画像候補領域の形状とが異なるため、探索に際しては、検出される動きベクトルがどこを起点としているかを特定する必要がある。即ち、あらかじめ被予測ブロックの輝度ブロックの各構成点と予測画像候補領域の変形ブロックの各構成点とを１対１に対応させる。2-2) Initial Settings (Search Range and Initial Value Settings) Because the shapes of the predicted block and the candidate predicted image area are different, it is necessary to determine the origin of the detected motion vector during the search. That is, each component point of the luminance block of the predicted block is first matched one-to-one with each component point of the transformation block of the candidate predicted image area.

以下では、図４の点線矢印に示すように、あらかじめ被予測ブロックの左上隅の画素位置と、変形ブロックの左側頂点とを対応させるものとする。つまり、予測画像候補画像は参照画像２８中の変形ブロックを右４５度回転させ、各辺を１／√２倍の長さに修正した部分画像ということになる。この対応付けを変えれば、回転の方向が変わることになる。このように取り決めておくことにより、他の各構成点は一意に対応がとれる。被予測ブロックの各構成点と予測画像候補の各構成点が１対１に対応付けられているので、動き検出はブロックマッチングと同様に実行することができる。In the following, as shown by the dotted arrow in Figure 4, we assume that the pixel position of the upper left corner of the predicted block corresponds to the left vertex of the transformed block. In other words, the candidate predicted image is a partial image obtained by rotating the transformed block in reference image 28 45 degrees to the right and modifying each side to a length of 1/√2. Changing this correspondence changes the direction of rotation. By defining it in this way, each of the other construction points can be uniquely matched. Because each construction point of the predicted block corresponds one-to-one to each construction point of the candidate predicted image, motion estimation can be performed in the same way as block matching.

即ち、図４の参照画像２８の比較のための部分領域の取り出し型をパターン化してアドレッシング（座標）で指定しておき、しかもこの場合には、整数画素が選ばれるよう指示しておき、このアドレッシング指示された画素と対応する被予測画像２７である原画像データ中の画素との誤差を累積して、最小誤差電力を判定している。従って、アドレッシングの指示だけで演算を伴わないので高速の比較ができ、しかもアドレッシング（座標指定）の仕方で単純な拡大、縮小だけでなくて回転も、また、回転と拡大、縮小の同時処理等、フレキシブルな抽出指示ができる具体的には、水平方向平行移動量探索範囲指示信号２９及び垂直方向平行移動量探索範囲指示信号３０より、水平方向移動量カウンタ３１及び垂直方向移動量カウンタ３２に対して変形ブロックマッチングの探索範囲を設定する。最小予測誤差電力判定部２１６において、最小予測誤差電力Ｄｍｉｎを最大値のＭＡＸＩＮＴ（例えば、０ｘＦＦＦＦＦＦＦＦ）にセットする。これは、図６のＳ４に相当する。 4 is patterned and specified by addressing (coordinates), and in this case, integer pixels are specified to be selected, and the error between the pixel specified by this addressing and the corresponding pixel in the original image data, which is the predicted image 27, is accumulated to determine the minimum error power. Therefore, since only addressing instructions are required and no calculation is involved, high-speed comparison is possible, and the addressing (coordinate specification) method allows flexible extraction instructions, such as not only simple enlargement and reduction but also rotation, or simultaneous processing of rotation and enlargement and reduction, etc. Specifically, the horizontal translation amount search range instruction signal 29 and the vertical translation amount search range instruction signal 30 set the search ranges for modified block matching for the horizontal translation amount counter 31 and the vertical translation amount counter 32. The minimum prediction error power determination unit 216 calculates the minimum prediction error power D min is set to the maximum value MAX INT (for example, 0xFFFFFFFF). This corresponds to S4 in FIG.

２−３）ブロック変形パラメータの設定本実施の形態においては、ブロック変形パラメータとして、図６のＳ６，Ｓ８に示すｒｄｘ，ｒｄｙを用いる。このパラメータの設定は、回転量カウンタ３３が行う。つまり、図４の参照画像を４５度右回転させる関係を定義する。これらの初期値としてｙの値を与え、以下、ｘがインクリメントされるたびにｒｄｘをインクリメント、ｒｄｙをデクリメントする。これらの処理は、図６におけるＳ６〜Ｓ８に相当する。なお、この設定は右回転の設定であり、Ｓ６でｒｄｙ＝− ｙ、Ｓ８でｒｙ＝ｉｙ＋（ｒｄｙ＋＋）と設定すると、左回転の変形を意味する。なお、Ｓ８は、ｒｘ＝ｉｘ＋（ｒｄｘ＋１）、ｒｙ＝ｉｙ＋（ｒｄｙ−１）とも表現される。2-3) Setting Block Transformation Parameters In this embodiment, the rdx and rdy parameters shown in S6 and S8 of Figure 6 are used as block transformation parameters. These parameters are set by the rotation amount counter 33. In other words, they define the relationship for rotating the reference image in Figure 4 by 45 degrees to the right. The initial value of these is the y value, and rdx is incremented and rdy is decremented each time x is incremented. These processes correspond to S6 through S8 in Figure 6. Note that this setting is for right rotation; setting rdy = -y in S6 and ry = iy + (rdy++) in S8 implies a left rotation transformation. Note that S8 can also be expressed as rx = ix + (rdx + 1), ry = iy + (rdy - 1).

即ち、Ｓ８では、参照画像２８から抽出する画素のアドレッシングを指示しており、次の画素であるｒｘ，ｒｙのアドレスが４５度右下方向の次の整数画素を指示している。これをＳ１２でｘのブロックサイズまで繰り返し、Ｓ１４でｙのブロックサイズまで繰り返していることである。このように、Ｓ８のアドレッシングによる抽出画素をＳ９で誤差比較し、Ｓ１０で累積しているので、図６の動作フローにおいては、一切の演算をしておらず、高速動作ができる。なお、Ｓ１０も、Ｄ（ｄｘ，ｄｙ）＝Ｄ（ｄｘ，ｄｙ）＋Ｄ（ｘ，ｙ）とも表現される。同様に、Ｓ１１，Ｓ１３，Ｓ１７，Ｓ１９は、ｘ＝ｘ＋１、ｙ＝ｙ＋１とも表現される。これは、以後のフローチャートでも同様である。That is, S8 specifies the addressing of the pixel to be extracted from the reference image 28, and the address of the next pixel, rx and ry, specifies the next integer pixel 45 degrees downward to the right. This is repeated in S12 up to the block size x, and in S14 up to the block size y. In this way, the extracted pixel addressing in S8 undergoes error comparison in S9 and accumulation in S10, so the operational flow of FIG. 6 does not require any calculations, enabling high-speed operation. Note that S10 can also be expressed as D(dx, dy) = D(dx, dy) + D(x, y). Similarly, S11, S13, S17, and S19 can also be expressed as x = x + 1 and y = y + 1. This also applies to subsequent flowcharts.

２−４）予測画像候補画像の読み出しまず、被予測ブロックの輝度ブロック内の位置（ｘ，ｙ）に対応する参照画像中の対応点ｒｘ，ｒｙを決定する。つまり、図４の最初の位置間の対応付けを行う。これは対応点決定部３４で行われる。図６のＳ８に示すように、ｒｘ，ｒｙは、あらかじめ与えられるオフセット値ｉｘ，ｉｙに、２−３）で得られたｒｄｘ，ｒｄｙを加算することによって得られる。次いで、参照画像から（ｒｘ＋ｄｘ，ｒｙ＋ｄｙ）だけ離れた位置にある参照画像中の画素をフレームメモリから取り出す。図５におけるメモリ読み出しアドレス生成部３５が水平方向移動量カウンタ３１からｄｘの値を、垂直方向移動量カウンタ３２からｄｙの値を、対応点決定部３４からｒｘ，ｒｙを受け取り、フレームメモリ中のアドレスを生成する。2-4) Reading Candidate Predicted Images First, determine the corresponding points rx and ry in the reference image that correspond to the position (x, y) in the luminance block of the predicted block. In other words, the initial position matching shown in Figure 4 is performed. This is performed by the corresponding point determination unit 34. As shown in S8 of Figure 6, rx and ry are obtained by adding the rdx and rdy obtained in 2-3) to the pre-specified offset values ix and iy. Next, the pixel in the reference image that is located a distance (rx + dx, ry + dy) from the reference image is retrieved from the frame memory. The memory read address generation unit 35 in Figure 5 receives the value dx from the horizontal movement amount counter 31, the value dy from the vertical movement amount counter 32, and rx and ry from the corresponding point determination unit 34, and generates an address in the frame memory.

２−５）予測誤差電力の算出まず、動きベクトルが（ｄｘ，ｄｙ）の時の予測誤差電力Ｄ（ｄｘ，ｄｙ）をゼロに初期化する。これは、図６のＳ５に相当する。２−４）で読み出された画素値と、被予測ブロックの輝度ブロックの対応する位置の画素値との差をとり、その絶対値をＤ（ｄｘ，ｄｙ）に累積していく。この処理をｘ＝ｙ＝ｂｌｏｃｋｓｉｚｅ（ここでは、ｂｌｏｃｋｓｉｚｅ＝８）になるまで繰り返し、（ｄｘ，ｄｙ）時の予測誤差電力Ｄ（ｄｘ，ｄｙ）を得る。この処理は、図５におけるパターンマッチング部２１３が行い、パターンマッチング部２１３は、Ｄ（ｄｘ，ｄｙ）を予測誤差電力信号２１５によって最小予測誤差電力判定部２１６に受け渡す。2-5) Calculation of prediction error power First, the prediction error power D(dx, dy) when the motion vector is (dx, dy) is initialized to zero. This corresponds to S5 in Fig. 6. The difference between the pixel value read in 2-4) and the pixel value at the corresponding position in the luminance block of the predicted block is calculated, and the absolute value is accumulated in D(dx, dy). This process is repeated for x = y = block size (here, block This process is repeated until the minimum prediction error power D(dx, dy) is obtained at the time of (dx, dy). This process is performed by the pattern matching unit 213 in FIG. 5, and the pattern matching unit 213 passes D(dx, dy) to the minimum prediction error power determination unit 216 via a prediction error power signal 215.

以上の処理は、図６におけるＳ９〜Ｓ１４の処理に相当する。The above processing corresponds to the processing in steps S9 to S14 in FIG.

２−６）最小予測誤差電力値の更新２−５）の結果得られたＤ（ｄｘ，ｄｙ）が、それまでの探索結果の中で最小の誤差電力を与えるかどうかを判定する。判定は、図５における最小予測誤差電力判定部２１６が行う。また、図６におけるＳ１５がこの判定処理に相当する。2-6) Update of Minimum Prediction Error Power Value 2-5) determines whether D(dx, dy) obtained as a result of the search provides the minimum power error among the search results up to that point. This determination is made by the minimum prediction error power determination unit 216 in Figure 5. S15 in Figure 6 corresponds to this determination process.

最小予測誤差電力判定部２１６は、内部に持つ最小予測誤差電力Ｄｍｉｎの値と、予測誤差電力信号２１５によって受け渡されるＤ（ｄｘ，ｄｙ）の大小を比較し、Ｄ（ｄｘ，ｄｙ）の方が小さいときに限りＤｍｉｎの値をＤ（ｄｘ，ｄｙ）で更新する。また、そのときの（ｄｘ，ｄｙ）の値を動きベクトル候補（ＭＶｈ，ＭＶｖ）として保持しておく。これらの更新処理は、図６におけるＳ１６に相当する。The minimum prediction error power determination unit 216 determines the minimum prediction error power D The value of min is compared with the magnitude of D(dx, dy) delivered by the prediction error power signal 215, and only when D(dx, dy) is smaller, D The value of min is updated with D(dx, dy). The value of (dx, dy) at that time is used as the motion vector candidate (MV h, MV v) These update processes correspond to S16 in FIG.

２−７）動きベクトル値の決定上記２−２）〜２−６）を探索範囲中のすべての（ｄｘ，ｄｙ）について繰り返し（図６のＳ１７〜Ｓ２０）、最終的に最小予測誤差電力判定部２１６内に保持されている（ＭＶｈ，ＭＶｖ）を動きベクトル２３として出力する。2-7) Determination of Motion Vector Values The above steps 2-2) to 2-6) are repeated for all (dx, dy) in the search range (S17 to S20 in FIG. 6), and finally the (MV h, MV v) is output as a motion vector 23.

以上のようにして、被予測ブロックに誤差電力最小の意味で最も類似した予測画像を探し出す。探索の結果、選ばれた予測画像の起点からの偏移量が変形ブロックマッチングの結果としての動きベクトル２３として得られ、その時の予測誤差電力ＤＤＥＦ２４も保持される。 In this way, a predicted image that is most similar to the predicted block in the sense of having the smallest error power is found. As a result of the search, the displacement of the selected predicted image from the starting point is obtained as a motion vector 23 as a result of modified block matching, and the prediction error power D DEF24 is also retained.

３）最終動き補償予測モードの判定次に、動き補償予測モード判定部２２において、ブロックマッチング部２０で得られた最小予測誤差電力ＤＢＭ２１８と、変形ブロックマッチング部２１で得られた最小予測誤差電力ＤＤＥＦ２４とを比較し、ブロックマッチングか変形ブロックマッチングかいずれか小さいほうを最終的な動き補償モードとして選択する。これは、図３のＳ３に相当する。3) Determination of Final Motion Compensation Prediction Mode Next, the motion compensation prediction mode determination unit 22 determines the minimum prediction error power D obtained by the block matching unit 20. BM218 and the minimum prediction error power D obtained by the modified block matching unit 21 DEF24 and the smaller of either block matching or modified block matching is selected as the final motion compensation mode. This corresponds to S3 in FIG.

動き補償予測モード判定部２２は、最終的に選択した動き補償予測モード信号２６及び最終動きベクトル２５を動きパラメータ１１として動き補償部９及びエントロピー符号化部１８に送る。The motion compensation prediction mode determination unit 22 sends the finally selected motion compensation prediction mode signal 26 and the final motion vector 25 as motion parameters 11 to the motion compensation unit 9 and the entropy coding unit 18.

次に、動き補償処理について説明する。Next, the motion compensation process will be described.

動き補償処理は、動き補償部９で行われる。動き補償部９は、動き検出部８において得られた動きパラメータ１１に基づいて、参照画像中から予測画像を抽出する。本実施形態の動き補償部９は、従来の正方ブロックに基づくブロックマッチングと、特定の変形ブロックを用いたブロックマッチングのいずれの動き補償処理もサポートし、動きパラメータ１１中の動き補償予測モードによってこれらの処理を切り替える構成をとる。Motion compensation processing is performed by the motion compensation unit 9. The motion compensation unit 9 extracts a predicted image from a reference image based on the motion parameters 11 obtained by the motion estimation unit 8. The motion compensation unit 9 of this embodiment supports both conventional block matching based on square blocks and block matching using specific distorted blocks, and is configured to switch between these types of motion compensation processing depending on the motion compensation prediction mode in the motion parameters 11.

以下、本実施の形態における動き補償部９の動作を説明する。The operation of the motion compensation unit 9 in this embodiment will now be described.

図７は、図１中の動き補償部９の構成図、図８は、その動作の様子を示すフローチャートである。FIG. 7 is a block diagram of the motion compensation unit 9 in FIG. 1, and FIG. 8 is a flowchart showing its operation.

図７において、３７は新要素である対応点決定部、３８はメモリ読み出しアドレス生成部である。In FIG. 7, reference numeral 37 denotes a corresponding point determination unit, which is a new element, and 38 denotes a memory read address generation unit.

１）対応点の決定図８のＳ２１に相当する処理で、被予測ブロックの画面内位置指示信号２０６と動き検出部８から送られてくる動きパラメータ１１とから、参照画像２８中の予測画像に対応する標本点を決定する。この処理は、図７における対応点決定部３７において行われる。動きパラメータ１１に含まれる動き補償予測モードがブロックマッチングを示している時は、対応点は被予測ブロックの画面内位置信号２０６から動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点となる。この処理は図４４におけるＳ２０４で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｘ＋ｄｘ，ｙ＋ｄｙ）を決定する動作に相当する。動きパラメータ１１に含まれる動き補償予測モードが変形ブロックマッチングを示している時は、動き検出部８の説明における２−４）で述べたように、被予測ブロックの画面内位置信号２０６に各画素位置に応じた回転量分を加算した後、動きベクトルで指示される量だけ平行移動させた標本点となる。この処理は図６におけるＳ９で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｒｘ＋ｄｘ，ｒｙ＋ｄｙ）を決定する動作に相当する。1) Determining Corresponding Points This process corresponds to S21 in Figure 8. A sample point in the reference image 28 corresponding to the predicted image is determined based on the intra-screen position indication signal 206 of the predicted block and the motion parameters 11 sent from the motion estimation unit 8. This process is performed by the corresponding point determination unit 37 in Figure 7. When the motion compensation prediction mode included in the motion parameters 11 indicates block matching, the corresponding point is a sample point included in an area translated by the amount indicated by the motion vector from the intra-screen position signal 206 of the predicted block. This process corresponds to S204 in Figure 44, which determines the position (x + dx, y + dy) in the reference image 28 when (dx, dy) is the motion vector. When the motion compensation prediction mode included in the motion parameters 11 indicates modified block matching, as described in 2-4) of the description of the motion detection unit 8, the in-screen position signal 206 of the predicted block is added with a rotation amount corresponding to each pixel position, and then the sample point is translated by the amount indicated by the motion vector. This process corresponds to the operation of S9 in Figure 6, in which the position (rx + dx, ry + dy) in the reference image 28 is determined when (dx, dy) is the motion vector.

２）予測画像データの読み出し図８のＳ２２〜Ｓ２５に相当する処理で、対応点決定部３４の結果を受けて、メモリ読み出しアドレス生成部３８がフレームメモリ１０に蓄積される参照画像２８中の予測画像位置を特定するメモリアドレスを生成し、予測画像を読み出す。2) Reading Predicted Image Data In the process corresponding to steps S22 through S25 in Figure 8, the memory read address generator 38 receives the results of the corresponding point determiner 34, generates a memory address that identifies the position of the predicted image in the reference image 28 stored in the frame memory 10, and reads the predicted image.

この場合に、予測画像が半画素精度の画素を含んでいると、動き補償部９から出力される前に半画素生成部２３２によって半画素値が生成される。これは、図８のＳ２３、Ｓ２４に相当する処理で、予測画像が半画素精度の画素を含むか否かは、対応点決定部３７が動きパラメータ１１中の動きベクトル値をもとに識別し、選択スイッチ３６に知らせる。In this case, if the predicted image contains pixels with half-pel accuracy, half-pel values are generated by the half-pel generator 232 before it is output from the motion compensation unit 9. This process corresponds to steps S23 and S24 in FIG. 8. The correspondence point determiner 37 determines whether the predicted image contains pixels with half-pel accuracy based on the motion vector values in the motion parameters 11, and notifies the selection switch 36.

図５の変形ブロックマッチング部２１の構成では、図４の説明に対応するように実標本点のみの対応点を生成した。しかし、半画素がある場合の構成は、後に説明するように図１３の半画素生成部２３２を持つ変形ブロックマッチング部４２となる。In the configuration of the modified block matching unit 21 in FIG. 5, corresponding points are generated only for actual sample points, as explained in FIG. 4. However, in the case of a configuration in which half pixels are present, the modified block matching unit 42 has a half pixel generation unit 232 as shown in FIG. 13, as will be explained later.

以上の処理過程を経て、最終的な予測画像データ１２が出力される。なお、上記実施の形態での変形ブロックマッチングとしての回転は、４５度の例を説明したが、９０度、１３５度、１８０度などは勿論、ｄｘとｄｙのとり方で他の回転も実現できる。After the above processing steps, the final predicted image data 12 is output. Note that while the rotation for modified block matching in the above embodiment is an example of 45 degrees, other rotations such as 90 degrees, 135 degrees, and 180 degrees can also be achieved by changing the settings of dx and dy.

また、本実施の形態では、画像フレームを単位とする画像符号化装置を説明したが、前処理部２において入力ディジタル画像系列を画像オブジェクト（動きや絵柄などの特徴を同じくする部分領域、一つの被写体など）に分離する処理を行わせ、各画像オブジェクトをそれを包含するブロック群として定義するようにしておけば、画像オブジェクトを単位として符号化する装置であってもこの発明を適用することができる。Furthermore, although this embodiment describes an image coding device that uses image frames as units, this invention can also be applied to devices that code image objects as units by configuring the preprocessing unit 2 to separate the input digital image sequence into image objects (subregions with similar characteristics, such as movement or pattern, or a single subject), and defining each image object as a group of blocks that contain it.

例えば、図９に示すように、静止した背景の前に人物像が存在するようなシーンにおいて、人物像を画像オブジェクトとして、図のようにそれを取り囲む外接四角形内の領域を小ブロックに分割し、画像オブジェクトを含むブロックを有効ブロックとして符号化するような場合が考えられる。この場合は、これら有効ブロックに対し、上記実施の形態で述べた変形ブックマッチングと動き補償に関して同様の処理を適用する。これは、以下の実施の形態においても同様である。For example, in a scene with a human figure in front of a stationary background, as shown in Figure 9, the human figure may be treated as an image object, the area within the circumscribing rectangle surrounding the figure may be divided into small blocks, and the block containing the image object may be coded as a valid block. In this case, the same processing as for the modified book matching and motion compensation described in the above embodiment is applied to these valid blocks. This also applies to the following embodiments.

本実施形態では、直交変換符号化による符号化装置を説明したが、動き補償予測誤差信号を別の符号化方式を用いて符号化する装置であってもこの発明を適用することができるのは言うまでもない。これは、以下の実施の形態においても同様である。In this embodiment, a coding device using orthogonal transform coding has been described, but it goes without saying that the present invention can also be applied to a device that codes a motion compensation prediction error signal using a different coding method. This also applies to the following embodiments.

実施の形態２．平行移動による動きベクトルの値から、変形ブロックマッチング処理の対象となる部分領域の大まかな移動量が把握できる。変形ブロックマッチングの部分領域の設定先を、ブロックマッチング部２０の探索結果である動きベクトル２１７が示す領域情報を受け、この付近に限定して変形して比較すると、処理ステップ及び処理時間を短縮することができる。本実施の形態では、この構成について説明する。なお、このことは、以後の他の実施の形態においても同様である。Embodiment 2. The value of the translational motion vector allows for a rough understanding of the amount of movement of the subregion to be subjected to the modified block matching process. By receiving the region information indicated by the motion vector 217, which is the search result of the block matching unit 20, and limiting the destination of the modified block matching subregion to this vicinity for comparison, the number of processing steps and processing time can be reduced. This configuration will be described in this embodiment. Note that this also applies to the other embodiments described below.

本実施の形態は動き検出部８の別の実施形態を示すものである。This embodiment shows another embodiment of the motion detection unit 8.

図１０は、本実施形態における動き検出部８ｂの内部構成図で、３９は変形ブロックマッチング部、４０は加算部、４１は探索初期位置指示信号である。なお、変形ブロックマッチング部３９は入力２０６の代わりに探索初期位置指示信号４１を用いるだけで、その他の動作は実施の形態１における変形ブロックマッチング部２１と全く同じである。FIG. 10 shows the internal configuration of the motion detection unit 8b in this embodiment, with reference numeral 39 representing a modified block matching unit, 40 representing an adder, and 41 representing a search initial position indication signal. Note that the modified block matching unit 39 operates in exactly the same way as the modified block matching unit 21 in embodiment 1, except that it uses the search initial position indication signal 41 instead of input 206.

大まかな値を得る装置の具体回路を図１０に示す。A specific circuit for obtaining a rough value is shown in FIG.

図１０において、変形ブロックマッチング部３９に、被予測ブロックの画面内位置信号２０６の代わりに、被予測ブロックデータ２０５にブロックマッチング部２０の結果得られた動きベクトル２１７を加算部４０によって加算し、その加算結果を探索初期位置指示信号４１として入力する。また、水平方向平行移動量探索範囲指示信号２９及び垂直方向平行移動量探索範囲指示信号３０から設定する探索範囲は実施の形態１よりも小さめに設定しておく。これにより、図６におけるＳ１７〜Ｓ２０における反復処理を短縮することができる。In FIG. 10 , instead of the intra-screen position signal 206 of the predicted block, an adder 40 adds a motion vector 217 obtained as a result of the block matching unit 20 to the predicted block data 205, and the result of this addition is input to the modified block matching unit 39 as a search initial position indication signal 41. Furthermore, the search range set by the horizontal translation amount search range indication signal 29 and the vertical translation amount search range indication signal 30 is set smaller than in the first embodiment. This allows the iterative processing from S17 to S20 in FIG. 6 to be shortened.

実施の形態３．先の実施の形態では、変形ブロック領域が参照画像２８中の整数画素間隔の画素点のみから構成される場合を説明した。本実施の形態では、変形ブロック領域が参照画像２８中の半画素間隔の画素点をも含めて構成される場合を説明する。Embodiment 3. In the previous embodiment, we described a case where the deformation block region is composed only of pixel points at integer pixel intervals in the reference image 28. In this embodiment, we describe a case where the deformation block region is composed also of pixel points at half-pixel intervals in the reference image 28.

本実施形態では、図１における動き検出部８及び動き補償部９の内部構成が実施の形態１と異なる。また、動作が異なるのは、動き検出部中の変形ブロックマッチング部及び動き補償部中の対応点決定部だけであり、その他の部材及び動作は実施の形態１と全く同じである。よって以下では、変形ブロックマッチング部の動作とそれに対応する動き補償部の動作についてのみ詳しく説明する。実施の形態１と同様、動き検出部８ｃと動き補償部９とに分けて動作を説明する。In this embodiment, the internal configurations of the motion estimation unit 8 and the motion compensation unit 9 shown in FIG. 1 differ from those of the first embodiment. Furthermore, the only differences in operation are the distorted block matching unit in the motion estimation unit and the corresponding point determination unit in the motion compensation unit; the other components and operations are identical to those of the first embodiment. Therefore, the following will only describe in detail the operation of the distorted block matching unit and the corresponding operation of the motion compensation unit. As in the first embodiment, the operations of the motion estimation unit 8c and the motion compensation unit 9 will be described separately.

図１１は、本実施形態における動き検出部８ｃの内部構成図、図１２は、本発明の最重要部位の１つである変形ブロックマッチング部４２における動作の概要説明図、図１３は、変形ブロックマッチング部４２の詳細な内部構成図、図１４は、変形ブロックマッチング部４２の動作を示すフローチャートである。Figure 11 shows the internal configuration of the motion detection unit 8c in this embodiment. Figure 12 shows an overview of the operation of the modified block matching unit 42, one of the most important components of this invention. Figure 13 shows a detailed internal configuration of the modified block matching unit 42. Figure 14 is a flowchart showing the operation of the modified block matching unit 42.

これらの図において、前記までの図面と同一の番号を付した要素、ステップは同一の要素、動作を意味するものとする。In these figures, elements and steps with the same numbers as in the previous figures represent the same elements and operations.

まず、変形ブロックマッチング部４２の動作について説明する。First, the operation of the modified block matching unit 42 will be described.

１）処理概要変形ブロックマッチング部４２における処理の概要を図１２に示す。1) Processing Overview Figure 12 shows an overview of the processing performed by the modified block matching unit 42.

同図において、被予測画像２７及び参照画像２８は実施の形態１で定義した通りである。各画像内の○はフレームの輝度信号の実標本点（整数画素）を、×は実標本点間の中点画素（半画素）を示す。被予測画像２７の８×８（整数画素）からなる部分領域を被予測ブロック（の輝度ブロック部分）とし、参照画像２８の□の画素からなるグループが予測画像候補の変形ブロックを構成するものとする。In the figure, the predicted image 27 and reference image 28 are as defined in embodiment 1. In each image, the circles represent actual sample points (integer pixels) of the luminance signal of the frame, and the crosses represent midpoint pixels (half pixels) between the actual sample points. A subregion consisting of 8x8 (integer pixels) in the predicted image 27 is defined as the predicted block (its luminance block portion), and a group of square pixels in the reference image 28 constitutes the transformed block of the predicted image candidate.

本実施の形態では、輝度ブロックを右もしくは左４５度に回転させ、各辺のスケールを１／√２倍した、つまり、参照画像の大きさは√２の距離としてフレームの入力ディジタル画像１の水平垂直方向の標本点距離と合致させた領域を変形ブロックとして定義する。この領域は、参照画像２８の半画素間隔の画素点をも含んで構成されることに特徴がある。即ち、本実施形態における変形ブロックマッチングは、与えられた探索範囲内で、図１２に示す８×８サンプル（以下サンプルは、整数画素または半画素の意味である）からなる被予測ブロックの輝度ブロックに最も類似した同図の変形ブロック領域を参照画像２８中から見つける処理に相当する。In this embodiment, the luminance block is rotated 45 degrees to the right or left and each side scaled by 1/√2. In other words, the size of the reference image is a distance of √2, and the region that matches the horizontal and vertical sample point distances of the input digital image 1 of the frame is defined as the deformation block. This region is characterized by including pixel points at half-pixel intervals in the reference image 28. In other words, deformation block matching in this embodiment corresponds to the process of finding, within a given search range, the deformation block region shown in Figure 12 in the reference image 28 that most closely resembles the luminance block of the prediction block, which consists of 8 x 8 samples (hereinafter, "sample" refers to either integer pixels or half pixels).

２）初期設定（探索範囲の設定、初期値の設定）実施の形態１と同様の論理で、あらかじめ被予測ブロックの輝度ブロックの各構成点と予測画像候補領域の変形ブロックの各構成点とを１対１に対応させる。2) Initial Settings (Search Range Setting, Initial Value Setting) Using the same logic as in Embodiment 1, a one-to-one correspondence is established between each component point of the luminance block of the predicted block and each component point of the transformation block of the candidate predicted image region.

以下では、図１２の点線矢印に示すように、あらかじめ被予測ブロックの左上隅の画素位置と、変形ブロックの左側頂点とを対応させるものとする。同図では、変形ブロック側の左側頂点が半画素位置にのっているが、これは動きベクトルが半画素成分を含む場合を示している。つまり、予測画像候補画像は参照画像２８中の変形ブロックを右４５度回転させ、各辺を√２倍の長さに修正した部分画像ということになる。この対応付けを変えれば回転の方向が変わることになる。このように取り決めておくことにより、他の各構成点は一意に対応がとれる。被予測ブロックの各構成点と予測画像候補の各構成点が１対１に対応付けられているので、動き検出はブロックマッチングと同様に実行することができる。実際の装置における探索範囲の設定の動作は、実施の形態１と同じで、図１３における必要な要素を用いて設定する。この動作は、図１４ではＳ２６のステップに相当する。In the following, we assume that the pixel position of the upper left corner of the predicted block corresponds to the left vertex of the transformation block, as indicated by the dotted arrow in Figure 12. In this figure, the left vertex of the transformation block is located at a half-pixel position, indicating that the motion vector includes a half-pixel component. In other words, the candidate predicted image is a partial image obtained by rotating the transformation block in reference image 28 by 45 degrees clockwise and correcting each side to a length that is √2 times longer. Changing this correspondence changes the direction of rotation. This arrangement ensures that each of the other composition points corresponds uniquely. Because each composition point of the predicted block corresponds one-to-one to each composition point of the candidate predicted image, motion estimation can be performed in a similar manner to block matching. The search range setting operation in the actual device is the same as in embodiment 1, and is set using the necessary elements in Figure 13. This operation corresponds to step S26 in Figure 14.

３）ブロック変形パラメータの設定本実施形態においては、実施の形態１と同様、ブロック変形パラメータとして図１４に示すｒｄｘ，ｒｄｙを用いる。このパラメータの設定は、回転量カウンタ４５が行う。これらの初期値としてｙの値を与え、以下ｘが１ずつインクリメントされるたびに、ｒｄｘを０．５ずつインクリメント、ｒｄｙを０．５ずつデクリメントする。これらの処理は、図１４におけるＳ２８〜Ｓ３０に相当する。3) Setting Block Transformation Parameters In this embodiment, as in embodiment 1, the rdx and rdy shown in Figure 14 are used as block transformation parameters. These parameters are set by the rotation amount counter 45. The value of y is given as the initial value, and rdx is incremented by 0.5 and rdy is decremented by 0.5 each time x is incremented by 1. These processes correspond to steps S28-S30 in Figure 14.

この設定は右回転の設定となり、Ｓ２８でｒｄｙ＝−ｙ、Ｓ３０でｒｙ＝ｉｙ＋（ｒｄｙ＋＝０．５）と設定すると、左回転の変形を意味する。This setting is a right rotation setting, and setting rdy = -y in S28 and ry = iy + (rdy + = 0.5) in S30 means a left rotation deformation.

４）予測画像候補画像の読み出しまず、被予測ブロックの輝度ブロック内の位置（ｘ，ｙ）に対応する参照画像中の対応点ｒｘ，ｒｙを決定する。これは、対応点決定部４６で行われる。図１４のＳ３０に示すように、ｒｘ，ｒｙはあらかじめ与えられるオフセット値ｉｘ，ｉｙに３）で得られたｒｄｘ，ｒｄｙを加算することによって得られる。4) Reading the Candidate Predicted Image First, determine the corresponding points rx and ry in the reference image that correspond to the position (x, y) in the luminance block of the predicted block. This is performed by the corresponding point determination unit 46. As shown in S30 of Figure 14, rx and ry are obtained by adding the rdx and rdy obtained in 3) to the pre-specified offset values ix and iy.

次いで、参照画像から（ｒｘ＋ｄｘ，ｒｙ＋ｄｙ）だけ離れた位置にある参照画像中の画素をフレームメモリから取り出す。図１３におけるメモリ読み出しアドレス生成部４７が水平方向移動量カウンタ３１からｄｘの値を、垂直方向平行移動量カウンタ３２からｄｙの値を、対応点決定部４６からｒｘ，ｒｙを受け取り、フレームメモリ中のアドレスを生成する。また、図１４のＳ３１において読み出されたデータは、必要に応じて半画素生成部２３２において半画素値を生成するために使用される。Next, a pixel in the reference image located a distance (rx + dx, ry + dy) from the reference image is retrieved from the frame memory. The memory read address generator 47 in FIG. 13 receives the value of dx from the horizontal displacement counter 31, the value of dy from the vertical translation counter 32, and rx and ry from the corresponding point determiner 46, and generates an address in the frame memory. The data read in S31 in FIG. 14 is also used by the half-pixel generator 232 to generate half-pixel values as needed.

５）予測誤差電力の算出まず、動きベクトルが（ｄｘ，ｄｙ）の時の予測誤差電力Ｄ（ｄｘ，ｄｙ）をゼロに初期化する。これは、図１４のＳ２７に相当する。４）で読み出された画素値と、被予測ブロックの輝度ブロックの対応する位置の画素値との差をとり、その絶対値をＤ（ｄｘ，ｄｙ）に累積していく。この処理をｘ＝ｙ＝ｂｌｏｃｋ＿ｓｉｚｅ（ここではｂｌｏｃｋ＿ｓｉｚｅ＝８）になるまで繰り返し、（ｄｘ，ｄｙ）時の予測誤差電力Ｄ（ｄｘ，ｄｙ）を得る。この処理は、図１３におけるパターンマッチング部２１３が行い、パターンマッチング部２１３はＤ（ｄｘ，ｄｙ）を予測誤差電力信号２１５によって最小予測誤差電力判定部２１６に受け渡す。ここでの処理は、図１４におけるＳ３２〜Ｓ３７の処理に相当する。5) Calculating Prediction Error Power First, the prediction error power D(dx, dy) for the motion vector (dx, dy) is initialized to zero. This corresponds to S27 in Figure 14. The difference between the pixel value read in 4) and the pixel value at the corresponding position in the luminance block of the predicted block is calculated, and the absolute value is accumulated as D(dx, dy). This process is repeated until x = y = block_size (here, block_size = 8) to obtain the prediction error power D(dx, dy) for (dx, dy). This process is performed by the pattern matching unit 213 in Figure 13, which then passes D(dx, dy) to the minimum prediction error power determination unit 216 via the prediction error power signal 215. This process corresponds to S32 through S37 in Figure 14.

６）最小予測誤差電力値の更新５）の結果得られたＤ（ｄｘ，ｄｙ）が、それまでの探索結果の中で最小の誤差電力を与えるかどうかを判定する。判定は、図１３における最小予測誤差電力判定部２１６が行う。また、図１４におけるＳ３８がこの判定処理に相当する。判定処理は実施の形態１と全く同じであり、そのときの（ｄｘ，ｄｙ）の値を動きベクトル候補（ＭＶ＿ｈ，ＭＶ＿ｖ）として保持しておく。この更新処理は、図１４におけるＳ３９に相当する。6) Update of Minimum Prediction Error Power Value It is determined whether D(dx, dy) obtained as a result of 5) provides the smallest error power value among the search results up to that point. This determination is performed by the minimum prediction error power determination unit 216 in Figure 13. S38 in Figure 14 corresponds to this determination process. The determination process is exactly the same as in Embodiment 1, and the (dx, dy) value at that time is stored as the motion vector candidate (MV_h, MV_v). This update process corresponds to S39 in Figure 14.

７）動きベクトル値の決定上記２）〜６）を探索範囲中のすべての（ｄｘ，ｄｙ）について繰り返し（図１４のＳ４０〜Ｓ４３）、最終的に最小予測誤差電力判定部２１６内に保持されている（ＭＶ＿ｈ，ＭＶ＿ｖ）を動きベクトル４３として出力する。7) Determining the Motion Vector Value Steps 2) through 6) above are repeated for all (dx, dy) values in the search range (S40 through S43 in Figure 14), and the final (MV_h, MV_v) value stored in the minimum prediction error power determination unit 216 is output as the motion vector 43.

以上のようにして、被予測ブロックに誤差電力最小の意味で最も類似した予測画像を探し出す。探索の結果、選ばれた予測画像の起点からの偏移量が変形ブロックマッチングの結果としての動きベクトル４３として得られ、その時の予測誤差電力Ｄ＿ＤＥＦ４４も保持される。In this way, the predicted image that is most similar to the predicted block in terms of minimizing error power is found. As a result of the search, the displacement of the selected predicted image from the origin is obtained as a motion vector 43 resulting from modified block matching, and the prediction error power D_DEF 44 at that time is also retained.

上記動きベクトル４３、予測誤差電力Ｄ＿ＤＥＦ４４が最終的な動き補償モード判定に用いられ、最終的な動き補償モードが決定される。この決定方法は、実施の形態１と全く同じである。The motion vector 43 and prediction error power D_DEF 44 are used for the final motion compensation mode determination, which is exactly the same as in the first embodiment.

動き補償処理は、動き補償部９で行われる。本実施の形態では、対応点決定部３７の動作のみが実施の形態１と異なるので、その部分だけを説明する。動き補償の全体的なフローチャートは、図８に準ずる。The motion compensation process is performed by the motion compensation unit 9. In this embodiment, only the operation of the corresponding point determination unit 37 differs from that of the first embodiment, and only this part will be described. The overall flowchart of the motion compensation process is as shown in Figure 8.

本実施形態においては、対応点の決定は以下のように行う。In this embodiment, the corresponding points are determined as follows.

動きパラメータ１１に含まれる動き補償予測モードがブロックマッチングを示している時は、対応点は被予測ブロックの画面内位置信号２０６から動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点となる。この処理は、図４４におけるＳ２０４で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｘ＋ｄｘ，ｙ＋ｄｙ）を決定する動作に相当する。When the motion compensation prediction mode included in the motion parameters 11 indicates block matching, the corresponding point is a sample point included in an area translated by the amount indicated by the motion vector from the intra-screen position signal 206 of the predicted block. This process corresponds to the operation of determining the position (x + dx, y + dy) in the reference image 28 when (dx, dy) is the motion vector in S204 in Figure 44.

動きパラメータ１１に含まれる動き補償予測モードが変形ブロックマッチングを示している時は、動き検出部８の説明における４）で述べたように、被予測ブロックの画面内位置信号２０６に各画素位置に応じた回転量分を加算した後、動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点となる。この処理は、図１４におけるＳ３２で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｒｘ＋ｄｘ，ｒｙ＋ｄｙ）を決定する動作に相当する。When the motion compensation prediction mode included in the motion parameters 11 indicates modified block matching, as described in section 4) of the motion estimation unit 8, the in-screen position signal 206 of the predicted block is added with a rotation amount corresponding to each pixel position, and then the resulting sample point is located within an area translated by the amount indicated by the motion vector. This process corresponds to the operation of determining the position (rx + dx, ry + dy) in the reference image 28 when (dx, dy) is the motion vector in step S32 of FIG. 14.

以下の予測画像データの読み出し、予測画像の生成については、実施の形態１に準ずる。The following reading of predicted image data and generation of predicted images conform to the first embodiment.

実施の形態４．本実施の形態は、被予測ブロックの面積が単純に縮小される変形ブロックを用いる場合について説明する。また、説明は省略するが単純な拡大も同じである。Embodiment 4. This embodiment describes the use of a modified block, in which the area of the predicted block is simply reduced. Although not explained here, the same applies to simple enlargement.

こうして、より単純な変形ブロックマッチングと動き補償について述べる。Thus, we describe simpler modified block matching and motion compensation.

以下では、上記実施形態と動作の異なる動き検出部中の変形ブロックマッチング部４２ｂ及び動き補償部中の対応点決定部の動作についてのみ、図１６を参照しながら詳しく説明する。なお、説明の混乱を避けるため、変形ブロックマッチング部４２ｂは、図１３における変形ブロックマッチング部４２のバリエーションであるとし、その入力は全く同じであり、出力は動きベクトル４３ならびに予測誤差電力４４のバリエーションであるものとする。また、動き補償部９中の対応点決定部についても、図７における対応点決定部３７のバリエーションであるものとする。よって、以下では、本実施形態の変形ブロックマッチング部の番号は４２ｂとして、対応点決定部の番号は３７として説明を進める。The following describes in detail, with reference to FIG. 16, only the operation of the distorted block matching unit 42b in the motion estimation unit and the corresponding point determiner in the motion compensation unit, which operate differently from the above-described embodiment. To avoid confusion, the distorted block matching unit 42b is assumed to be a variation of the distorted block matching unit 42 in FIG. 13, with the same inputs and variations of the motion vector 43 and prediction error power 44 as its outputs. The corresponding point determiner in the motion compensation unit 9 is also assumed to be a variation of the corresponding point determiner 37 in FIG. 7. Therefore, the following description will be given assuming that the distorted block matching unit in this embodiment is numbered 42b and the corresponding point determiner is numbered 37.

図１５は、本実施の形態における変形ブロックマッチング部４２ｂにおける動作の概要説明図、図１６は、変形ブロックマッチング部４２ｂの詳細な内部構成図、図１７は、変形ブロックマッチング部４２ｂの動作を示すフローチャートである。FIG. 15 is a diagram outlining the operation of the modified block matching unit 42b in this embodiment, FIG. 16 is a detailed diagram of the internal configuration of the modified block matching unit 42b, and FIG. 17 is a flowchart illustrating the operation of the modified block matching unit 42b.

まず、変形ブロックマッチング部４２ｂの動作について説明する。First, the operation of the modified block matching unit 42b will be described.

１）処理概要変形ブロックマッチング部４２ｂにおける処理の概要を図１５に示す。被予測画像２７及び参照画像２８、各画像内の印の説明は前述の通りである。本実施の形態では、輝度ブロックの各辺を単純に１／２倍した縮小領域を変形ブロックとして定義する。本実施の形態における変形ブロックマッチングは、与えられた探索範囲内で、図１５に示す８×８サンプルからなる被予測ブロックの輝度ブロックに最も類似した同図の変形ブロック領域を参照画像２８中から見つける処理に相当する。1) Processing Overview Figure 15 shows an overview of the processing performed by the modified block matching unit 42b. The predicted image 27, reference image 28, and the marks within each image are as described above. In this embodiment, a modified block is defined as a reduced area obtained by simply halving each side of a luminance block. Modified block matching in this embodiment corresponds to the process of finding, within a given search range, the modified block area shown in Figure 15 in the reference image 28 that is most similar to the luminance block of the predicted block consisting of 8x8 samples.

本実施形態では、図１５の点線矢印に示すように、あらかじめ被予測ブロックの左上隅の画素位置と、変形ブロックの左上隅の画素位置とを対応させる。被予測ブロックの各構成点と予測画像候補の各構成点が１対１に対応付けられているので、動き検出はブロックマッチングと同様に実行することができる。実際の装置における探索範囲の設定の動作は実施の形態１と同じで、図１６における必要な要素を用いて設定する。この動作は、図１７では、Ｓ４４のステップに相当する。In this embodiment, as shown by the dotted arrow in Figure 15, the pixel position of the upper left corner of the predicted block is previously associated with the pixel position of the upper left corner of the transformed block. Because there is a one-to-one correspondence between each component point of the predicted block and each component point of the candidate predicted image, motion estimation can be performed in the same manner as block matching. The search range setting operation in the actual device is the same as in embodiment 1, and is set using the necessary elements in Figure 16. This operation corresponds to step S44 in Figure 17.

３）予測画像候補画像の読み出し本実施の形態においては、特定のブロック変形パラメータは用いず、図１７のＳ４７に示すように、水平垂直各成分のオフセット値ｉｘ，ｉｙに、ｘ／２，ｙ／２の値を加算することによってｘ，ｙの対応点ｓｘ，ｓｙを得る。この対応点は対応点決定部４８で行われる。次いで、参照画像から（ｓｘ＋ｄｘ，ｓｙ＋ｄｙ）だけ離れた位置にある参照画像中の画素をフレームメモリから取り出す。図１６におけるメモリ読み出しアドレス生成部４９が水平方向移動量カウンタ３１からｄｘの値を、垂直方向平行移動量カウンタ３２からｄｙの値を、対応点決定部４８からｓｘ，ｓｙを受け取り、フレームメモリ中のアドレスを生成する。また、図１７のＳ４８で読み出されたデータは必要に応じて半画素生成部２３２において半画素値を生成するために使用される。3) Reading Candidate Predicted Images In this embodiment, no specific block transformation parameters are used. Instead, as shown in S47 of FIG. 17, the corresponding points sx and sy of x and y are obtained by adding the values x/2 and y/2 to the offset values ix and iy of the horizontal and vertical components. This corresponding point is determined by the corresponding point determination unit 48. Next, a pixel in the reference image located a distance (sx + dx, sy + dy) from the reference image is retrieved from the frame memory. The memory read address generation unit 49 in FIG. 16 receives the value dx from the horizontal translation amount counter 31, the value dy from the vertical translation amount counter 32, and sx and sy from the corresponding point determination unit 48, and generates an address in the frame memory. Furthermore, the data read in S48 of FIG. 17 is used as needed to generate half-pixel values in the half-pixel generation unit 232.

４）予測誤差電力の算出まず、動きベクトルが（ｄｘ，ｄｙ）の時の予測誤差電力Ｄ（ｄｘ，ｄｙ）をゼロに初期化する。これは図１７のＳ４５に相当する。３）で読み出された画素値と、被予測ブロックの輝度ブロックの対応する位置の画素値との差をとり、その絶対値をＳ５０でＤ（ｄｘ，ｄｙ）に累積していく。この処理をｘ＝ｙ＝ｂｌｏｃｋ＿ｓｉｚｅ（ここでは、ｂｌｏｃｋ＿ｓｉｚｅ＝８）になるまでＳ５２，Ｓ５４で繰り返し、（ｄｘ，ｄｙ）時の予測誤差電力Ｄ（ｄｘ，ｄｙ）を得る。4) Calculating Prediction Error Power First, the prediction error power D(dx, dy) for the motion vector (dx, dy) is initialized to zero. This corresponds to S45 in Figure 17. The difference between the pixel value read in 3) and the pixel value at the corresponding position in the luminance block of the predicted block is calculated, and the absolute value is accumulated as D(dx, dy) in S50. This process is repeated in S52 and S54 until x = y = block_size (here, block_size = 8), thereby obtaining the prediction error power D(dx, dy) for (dx, dy).

この処理は、図１６におけるパターンマッチング部２１３が行い、パターンマッチング部２１３は、Ｄ（ｄｘ，ｄｙ）を予測誤差電力信号２１５によって最小予測誤差電力判定部２１６に受け渡す。ここでの処理は、図１７におけるＳ４９〜Ｓ５４の処理に相当する。This process is performed by the pattern matching unit 213 in FIG. 16 , which passes D(dx, dy) to the minimum prediction error power determination unit 216 via the prediction error power signal 215. This process corresponds to steps S49 to S54 in FIG. 17 .

５）最小予測誤差電力値の更新４）の結果得られたＤ（ｄｘ，ｄｙ）が、それまでの探索結果の中で最小の誤差電力を与えるかどうかを判定する。判定は、図１６における最小予測誤差電力判定部２１６が行う。また、図１７におけるＳ５５がこの判定処理に相当する。5) Update of minimum prediction error power value It is determined whether D(dx, dy) obtained as a result of step 4) provides the minimum power error among the search results up to that point. This determination is made by the minimum prediction error power determination unit 216 in Figure 16. This determination process corresponds to S55 in Figure 17.

判定処理は、実施の形態１と全く同じであり、そのときの（ｄｘ，ｄｙ）の値を動きベクトル候補として保持しておく。この更新処理は、図１７におけるＳ５６に相当する。The determination process is exactly the same as in embodiment 1, and the (dx, dy) values at that time are stored as motion vector candidates. This update process corresponds to S56 in Figure 17.

６）動きベクトル値の決定上記２）〜５）を図１７のＳ５７〜Ｓ６０で探索範囲中のすべての（ｄｘ，ｄｙ）について繰り返し、最終的に最小予測誤差電力判定部２１６内に保持されている（ｄｘ，ｄｙ）を動きベクトル４３として出力する。6) Determining the Motion Vector Value Steps 2) through 5) above are repeated for all (dx, dy) values in the search range in steps S57 through S60 of Figure 17, and the final (dx, dy) value stored in the minimum prediction error power determination unit 216 is output as the motion vector 43.

動き補償処理は動き補償部９で行われる。本実施形態では、対応点決定部３７の動作のみが実施の形態１と異なるので、その部分だけを説明する。動き補償の全体的なフローチャートは、図８に準ずる。Motion compensation processing is performed by the motion compensation unit 9. In this embodiment, only the operation of the corresponding point determination unit 37 differs from that of embodiment 1, so only that part will be described. The overall flowchart for motion compensation is as shown in Figure 8.

動きパラメータ１１に含まれる動き補償予測モードが変形ブロックマッチングを示している時は、被予測ブロックの画面内位置信号２０６に各画素位置に応じた編移量分を加算した後、動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点となる。この処理は、図１７におけるＳ４７で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｓｘ＋ｄｘ，ｓｙ＋ｄｙ）を決定する動作に相当する。以下の予測画像データの読み出し、予測画像の生成については、実施の形態１に準ずる。When the motion compensation prediction mode included in the motion parameters 11 indicates modified block matching, the in-screen position signal 206 of the predicted block is added with a displacement amount corresponding to each pixel position, and then the sample point is included in the area translated by the amount indicated by the motion vector. This process corresponds to the operation of determining the position (sx + dx, sy + dy) in the reference image 28 when (dx, dy) is the motion vector in S47 of Figure 17. The following reading of predicted image data and generation of predicted images are similar to those in embodiment 1.

上記各実施の形態における変形ブロックは、１）被予測ブロックと予測画像の各構成画素位置の１対１の対応付けが行われている、２）参照画像側の対応画素点が整数画素間隔で構成されるという２つの前提のもとであれば、どのような形状をもとり得る。例えば、図１８や図１９に示すような形状を考えることもできる。更に、片方のみ半分に縮小というだけでなく、それぞれの辺を独立に任意の比率で縮小、拡大すれば各種の形状に変形してブロックマッチングができる。こうして、あらかじめ様々な形状を定義しておくことにより、最も良好な予測結果が得られる変形ブロックを選択するように構成することができる。このときは、選択された変形ブロックの種類を動きパラメータ１１の中に含めてエントロピー符号化部１８に送ればよい。In each of the above embodiments, the deformed blocks can have any shape, provided that two assumptions are met: 1) there is a one-to-one correspondence between the pixel positions of the predicted block and the predicted image, and 2) corresponding pixels in the reference image are spaced at integer pixel intervals. For example, shapes such as those shown in Figures 18 and 19 are possible. Furthermore, block matching can be achieved by deforming blocks into various shapes by independently scaling each side by any ratio, rather than simply shrinking only one side by half. By defining various shapes in this way, it is possible to select the deformed block that provides the best prediction results. In this case, the type of selected deformed block is simply included in the motion parameters 11 and sent to the entropy coding unit 18.

上記各実施の形態によれば、半画素精度の補間画素値の生成だけで、アフィン変換のように複雑な演算による補間画素値を生成することなく回転及び縮小スケーリングを含む動き補償を行うことができ、平行移動量である動きベクトルだけでは予測誤差を最小にできない、つまり、予測がうまく的中しないような部分画像についても、良好な予測を行うことができる。According to the above embodiments, motion compensation, including rotation and downscaling, can be performed simply by generating interpolated pixel values with half-pel accuracy, without generating interpolated pixel values through complex calculations such as affine transformation. This allows for accurate prediction even for partial images where prediction error cannot be minimized using motion vectors alone, which are translational translations.

なお、上記の各実施の形態では、予め用意される固定点として、整数画素、または半画素の場合を説明したが、例えば、１：３等、他の割合の中間点の画素を比較対象用として用意してもよい。この場合でも、従来のアフィン変換の場合と異なり、比較処理動作中の補間処理が不要であり、それだけ処理規模を小さくでき、高速処理が可能となる。In the above embodiments, the fixed points prepared in advance are described as integer pixels or half pixels. However, intermediate pixels with other ratios, such as 1:3, may also be prepared for comparison. Even in this case, unlike conventional affine transformations, no interpolation is required during the comparison process, thereby reducing the processing scale and enabling faster processing.

実施の形態５．上記各実施の形態においては、画素ごとにブロック変形パラメータのカウント処理、もしくはそれに相当する座標変換処理を行う構成となっていたが、この画素ごとの座標変換値をあらかじめＲＯＭなどの変形パターンテーブルとして用意しておき、被予測ブロックの各画素位置に応じてテーブルから引き出した変換値をもとに対応点を決定する構成をとることもできる。こうすることで、演算式では表現しにくい任意の対応関係を持つ変形ブロックマッチングと動き補償が効果的にできる。Embodiment 5. In the above embodiments, the block transformation parameter counting process or the corresponding coordinate transformation process is performed for each pixel. However, it is also possible to prepare these coordinate transformation values for each pixel in advance as a transformation pattern table in ROM or other storage, and determine corresponding points based on the transformation values retrieved from the table for each pixel position in the predicted block. This enables effective transformation block matching and motion compensation for arbitrary correspondences that are difficult to express using mathematical formulas.

例えば、実施の形態１を例にとる。For example, take the first embodiment as an example.

図２０は、図５における対応点決定部３４の別の内部構成図であり、本実施の形態を実現する構成（対応点決定部３４ｂ）を示している。実施の形態１の具体的な動作を示す図６におけるＳ８で、パラメータｒｄｘ，ｒｄｙの値をインクリメントあるいはデクリメント演算する代わりに、ｘ，ｙに対応するｒｄｘ，ｒｄｙの値をＲＯＭとして持っておき、そこからｘ，ｙの値に応じて対応点ｒｘ，ｒｙを引き出すことによって求めることができる。この場合、図５における回転量カウンタ３３は不要となり、図２０に示すように、対応点決定部３４ｂ内にＲＯＭテーブル（変形パターンテーブル１００）を持たせる構成で実現できる。対応点決定部３４ｂは、被予測ブロックの各画素位置（ｘ，ｙ）によって変形パターンテーブル１００から変形パラメータｒｄｘ，ｒｄｙの値を引き出し、これを加算部１１０で加算することによって対応点を決定する。そして、メモリ読み出しアドレス生成部３５に向けて出力する。これは、上記の他の実施の形態でも同様である。こうして、若干のＲＯＭメモリ（変形パターンテーブル１００）への追加だけで、対応点の演算処理を行う要素を削除して回路を簡略化し、かつ対応点演算処理量を削減することができる。また、図２１に示すような簡単な数式では表現できない変形をサポートすることも可能になり、より豊富な変形パターンライブラリが考えられることになる。FIG. 20 is another internal block diagram of the corresponding point determiner 34 in FIG. 5, showing a configuration (corresponding point determiner 34b) for implementing this embodiment. Instead of incrementing or decrementing the values of the parameters rdx and rdy in S8 in FIG. 6, which shows the specific operation of embodiment 1, the values of rdx and rdy corresponding to x and y can be stored in ROM, and the corresponding points rx and ry can be obtained by retrieving them from there according to the values of x and y. In this case, the rotation amount counter 33 in FIG. 5 is unnecessary, and as shown in FIG. 20, this can be implemented by providing a ROM table (deformation pattern table 100) within the corresponding point determiner 34b. The corresponding point determiner 34b retrieves the values of the deformation parameters rdx and rdy from the deformation pattern table 100 according to each pixel position (x, y) of the predicted block, and determines the corresponding points by adding them in the adder 110. The result is then output to the memory read address generator 35. This is also true for the other embodiments described above. Thus, by simply adding a small amount of ROM memory (deformation pattern table 100), it is possible to eliminate elements that perform corresponding point calculations, simplifying the circuit and reducing the amount of corresponding point calculation processing. It also becomes possible to support deformations that cannot be expressed with simple mathematical formulas such as those shown in Figure 21, allowing for the creation of a richer deformation pattern library.

実施の形態６．本実施の形態では、上記各実施の形態で示したような方法によって変形ブロックとして切り出される予測画像中の周波数特性を均一にし、被予測ブロックの予測を行う際のミスマッチを低減する符号化装置について説明する。Embodiment 6. This embodiment describes a coding device that equalizes the frequency characteristics in a predicted image extracted as a modified block using the methods described in the above embodiments, thereby reducing mismatches when predicting a predicted block.

予測画像が整数画素空間及び半画素空間に存在する画素値から構成される場合、整数画素と半画素では空間周波数特性が異なる。一方、被予測ブロックはすべて整数画素空間の画素から構成されているので、この特性の違いが予測時のミスマッチの要因になることが考えられる。そこで本実施の形態では、上記各実施の形態で述べた変形ブロックの定義と同様の定義を行った後、整数画素空間上の画素に対してフィルタリングを行う。When a predicted image is composed of pixel values existing in integer pixel space and half pixel space, the spatial frequency characteristics differ between integer pixels and half pixels. On the other hand, since the predicted block is composed entirely of pixels in integer pixel space, this difference in characteristics can potentially cause mismatches during prediction. Therefore, in this embodiment, after defining a distorted block in the same way as in the previous embodiments, filtering is performed on pixels in integer pixel space.

半画素空間上の画素は、周辺の整数画素に対して［１／２、１／２］のフィルタリングを行うことによって生成される。即ち、ｃｏｓ（ωｔ／２）の特性を持つローパスフィルタが施されることになる。上記各実施の形態で定義した予測画像は、フィルタの施されていない整数画素と、上記フィルタリングによって生成される半画素精度の画素とが混在しており、予測画像内の空間周波数特性にばらつきがある。このばらつきが原因で予測精度が落ちる場合には、以下に述べるように、整数画素に対しても同等の特性を持つフィルタを施せば効果的である。Pixels in the half-pel space are generated by filtering the surrounding integer pixels by [1/2, 1/2]. In other words, a low-pass filter with cos(ωt/2) characteristics is applied. The predicted image defined in each of the above embodiments contains a mixture of unfiltered integer pixels and half-pel-accurate pixels generated by the filtering, resulting in variations in the spatial frequency characteristics within the predicted image. If this variation reduces prediction accuracy, it is effective to apply a filter with equivalent characteristics to the integer pixels, as described below.

図２２は、フィルタリングの例を示したもので、ここでは整数画素について式（７）に示す、［１／８、６／８、１／８］のローパスフィルタＦを施す例を示している。Figure 22 shows an example of filtering, where a low-pass filter F of [1/8, 6/8, 1/8] is applied to integer pixels as shown in Equation (7).

このフィルタの特性は｛ｃｏｓ（ωｔ／２）｝２であり、予測画像内の空間周波数特性のばらつきが緩和される。このようなフィルタ処理の後、上記各実施の形態と同様、被予測ブロックの各点と予測画像の各点との１対１対応付け、探索、動きベクトルの決定、モード判定を行う。 The filter characteristic is {cos(ωt/2)}2, which reduces variations in spatial frequency characteristics within the predicted image. After this filtering process, similar to the above embodiments, one-to-one correspondence is established between each point of the predicted block and each point of the predicted image, search is performed, motion vectors are determined, and mode determination is performed.

具体的な装置構成と動作について説明する。The specific configuration and operation of the device will be described below.

本実施の形態では、これまでの実施形態とは、変形ブロックマッチング部と動き補償部が異なる。以下では、変形ブロックの定義は実施の形態４に基づく単純縮小パターンとし、変形ブロックマッチング部の内部構成は動き検出部８ｃの中の変形ブロックマッチング部４２のバリエーション、動き補償部も動き補償部９のバリエーションとして考える。したがって、以下の説明においては、変形ブロックマッチングの番号は４２ｃとし、動き補償部の番号は９ｂとして説明を進める。In this embodiment, the modified block matching unit and motion compensation unit differ from those of the previous embodiments. In the following, the modified block is defined as a simple downsized pattern based on the fourth embodiment, the internal configuration of the modified block matching unit is considered to be a variation of the modified block matching unit 42 in the motion estimation unit 8c, and the motion compensation unit is considered to be a variation of the motion compensation unit 9. Therefore, in the following explanation, the modified block matching unit will be numbered 42c and the motion compensation unit will be numbered 9b.

図２３は、本実施形態における変形ブロックマッチング部４２ｃの動作の概要説明図、図２４は、変形ブロックマッチング部４２ｃの詳細な内部構成図、図２５は、本実施の形態における変形ブロックマッチング部４２ｃの動作を示すフローチャートである。Figure 23 is a diagram outlining the operation of the modified block matching unit 42c in this embodiment, Figure 24 is a detailed internal configuration diagram of the modified block matching unit 42c, and Figure 25 is a flowchart illustrating the operation of the modified block matching unit 42c in this embodiment.

これらの図において、前記までの図面と同一の番号を付した要素、ステップは同一要素、動作を意味するものとする。In these figures, elements and steps with the same numbers as in the previous figures represent the same elements and operations.

まず、変形ブロックマッチング部４２ｃの動作について説明する。実施の形態４と同じ動作の記述は省略する。First, the operation of the modified block matching unit 42c will be described. Description of operations that are the same as those in the fourth embodiment will be omitted.

１）処理概要変形ブロックの定義については実施の形態４と全く同じであるが、本実施の形態では、整数画素位置の画素に対してフィルタリングを行うことが異なる。即ち、図２３のように、参照画像中にフィルタ処理対象画素として△の画素が定義されており、変形ブロックは△及び□で示す画素から構成される。1) Processing Overview The definition of a transformation block is exactly the same as in embodiment 4, but this embodiment differs in that filtering is performed on pixels at integer pixel positions. That is, as shown in Figure 23, pixels marked with a triangle are defined in the reference image as pixels to be filtered, and the transformation block is composed of the pixels marked with a triangle and a square.

２）初期設定（探索範囲の設定、初期値の設定）実施の形態４と全く同じである。2) Initial settings (search range setting, initial value setting) Identical to embodiment 4.

３）予測画像候補画像の読み出し被予測ブロック内の位置ｘ，ｙの画素に対応する対応点ｓｘ，ｓｙを得る方法は、実施の形態４と全く同じである。次いで、参照画像から（ｓｘ＋ｄｘ，ｓｙ＋ｄｙ）だけ離れた位置にある参照画像中の画素をフレームメモリから取り出す。この際、ｓｘ＋ｄｘ，ｓｙ＋ｄｙに対応する画素位置が整数画素空間上にあるか半画素空間上にあるかを判定する。これは、単にｓｘ＋ｄｘ，ｓｙ＋ｄｙがそれぞれ半画素成分を持つかどうかで判定できる。この判定は、図２４における対応点決定部４８において行う。図２５では、Ｓ６１のステップに相当する。ここで、半画素空間にあると判定された場合は、半画素生成部２３２において半画素値が生成される。また、整数画素空間にあると判定された場合は、フィルタ部５０において図２２に示したフィルタリングを施す。これは、図２５におけるＳ６２のステップに相当する。3) Reading the Candidate Predicted Image The method for obtaining the corresponding points sx and sy corresponding to the pixel at position x and y in the predicted block is exactly the same as in embodiment 4. Next, a pixel in the reference image located a distance (sx + dx, sy + dy) from the reference image is retrieved from the frame memory. At this time, it is determined whether the pixel position corresponding to sx + dx and sy + dy is in integer pixel space or half pixel space. This can be determined simply by checking whether sx + dx and sy + dy each have half pixel components. This determination is made by the corresponding point determination unit 48 in Figure 24. In Figure 25, this corresponds to step S61. If it is determined to be in half pixel space, the half pixel generation unit 232 generates half pixel values. If it is determined to be in integer pixel space, the filter unit 50 performs the filtering shown in Figure 22. This corresponds to step S62 in Figure 25.

４）予測誤差電力の算出５）最小予測誤差電力値の更新６）動きベクトル値の決定実施の形態４と全く同じである。4) Calculating the prediction error power 5) Updating the minimum prediction error power value 6) Determining the motion vector value This is exactly the same as in embodiment 4.

動き補償処理は動き補償部９ｂで行われる。The motion compensation process is carried out by the motion compensation unit 9b.

図２６は、本実施形態における動き補償部９ｂの内部構成図、図２７は、本実施の形態における動き補償部９ｂの動作を示すフローチャートである。FIG. 26 is a diagram showing the internal configuration of the motion compensation unit 9b in this embodiment, and FIG. 27 is a flowchart showing the operation of the motion compensation unit 9b in this embodiment.

本実施の形態では、図７に示す動き補償部９に比べ、フィルタ部５０が加えられていることに特徴がある。対応点決定部３７は、実施の形態４で示したものと全く同じ動作をする。動きパラメータ１１に含まれる動き補償予測モードがブロックマッチングを示している時は、対応点は被予測ブロックの画面内位置信号２０６から動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点とする。この処理は、図４４におけるＳ２０４で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｘ＋ｄｘ，ｙ＋ｄｙ）を決定する動作に相当する。This embodiment is characterized by the addition of a filter unit 50 compared to the motion compensation unit 9 shown in FIG. 7. The corresponding point determination unit 37 operates in exactly the same way as in embodiment 4. When the motion compensation prediction mode included in the motion parameters 11 indicates block matching, the corresponding point is determined to be a sample point included in an area translated by the amount indicated by the motion vector from the intra-screen position signal 206 of the predicted block. This process corresponds to the operation of S204 in FIG. 44, which determines the position (x + dx, y + dy) in the reference image 28 when (dx, dy) is the motion vector.

動きパラメータ１１に含まれる動き補償予測モードが変形ブロックマッチングを示している時は、被予測ブロックの画面内位置信号２０６に各画素位置に応じた編移量分を加算した後、動きベクトルで指示される量だけ平行移動させた領域に含まれる標本点となる。この処理は、図１７におけるＳ４７で、（ｄｘ，ｄｙ）を動きベクトルとした時の参照画像２８中の位置（ｓｘ＋ｄｘ，ｓｙ＋ｄｙ）を決定する動作に相当する。いずれの場合でも、各画素ごとに半画素空間上にあるか否かを判定し、整数画素空間上にある画素については、上述の変形ブロックマッチング部の予測画像生成処理と全く同じように、図２２に示すフィルタリングを施す。フィルタリングは、フィルタ部で行う。以下の予測画像データの読み出し、予測画像の生成については、実施の形態１に準ずる。When the motion compensation prediction mode included in the motion parameters 11 indicates modified block matching, the in-screen position signal 206 of the predicted block is added with a displacement amount corresponding to each pixel position, and then the resulting sample point is located in a region translated by the amount indicated by the motion vector. This process corresponds to the operation of determining the position (sx + dx, sy + dy) in the reference image 28 when (dx, dy) is the motion vector in S47 of FIG. 17. In either case, each pixel is determined to be in half-pixel space, and pixels in integer pixel space are subjected to filtering shown in FIG. 22, exactly as in the predicted image generation process of the modified block matching unit described above. Filtering is performed by the filter unit. The following reading of predicted image data and generation of predicted images conform to those in embodiment 1.

本実施の形態における変形ブロックマッチング部４２ｃは、フィルタを施さない場合の予測画像、及びフィルタＦを施した場合の予測画像のそれぞれの場合について独立に探索を行って、その結果を動き補償予測モード判定部２２に送ってもよいし、フィルタＦを施さない場合のみ探索を行い、その結果だけに対してフィルタＦを施して良好な結果を選択するようにしてもよい。The modified block matching unit 42c in this embodiment may perform a search independently for the predicted image without filter application and the predicted image with filter F applied, and send the search results to the motion compensation prediction mode determination unit 22, or may perform a search only for the case without filter F application, and apply filter F only to the search results to select the best result.

このように、フィルタＦを適応的にＯＮ／ＯＦＦする機構を設ける場合は、動きパラメータ１１の中にフィルタＯＮ／ＯＦＦの情報も含める。In this way, when a mechanism for adaptively turning the filter F on and off is provided, the motion parameters 11 also include information on whether the filter is on or off.

本実施の形態によれば、整数画素値へのフィルタリングだけで予測画像内の空間周波数のばらつきを除くことができ、平行移動量である動きベクトルだけでは予測誤差を最小にできない、つまり、予測がうまく的中しないような部分画像についても、良好な予測を行うことができる。According to this embodiment, it is possible to eliminate spatial frequency variations within the predicted image simply by filtering integer pixel values, and it is possible to perform good predictions even for subimages where prediction error cannot be minimized solely by using motion vectors (translation amounts), i.e., where prediction accuracy is poor.

実施の形態７．図２８は、この実施の形態における画像の予測方式を用いて圧縮符号化されたディジタル画像を伸長再生する画像復号装置の構成を示したものである。ここでは、実施の形態１に示す画像符号化装置によって生成される圧縮符号化データ（以下、ビットストリーム）１９を受信して伸長再生を行う画像復号装置として説明する。Embodiment 7. Figure 28 shows the configuration of an image decoding device that decompresses and plays back digital images that have been compression-encoded using the image prediction method of this embodiment. This image decoding device receives and decompresses the compression-encoded data (hereinafter, "bitstream") 19 generated by the image encoding device shown in Embodiment 1.

図２８において、５１はエントロピー復号部、６は逆量子化部、７は逆直交変換部、５３は復号加算部、５４はフレームメモリ、５６は表示制御部である。In FIG. 28, reference numeral 51 denotes an entropy decoding unit, 6 denotes an inverse quantization unit, 7 denotes an inverse orthogonal transform unit, 53 denotes a decoding and adding unit, 54 denotes a frame memory, and 56 denotes a display control unit.

本発明の復号装置は、動き補償部９の構成と動作に特徴があり、動き補償部９以外の上記の各要素について構成とその動作は既に知られているので、詳細説明は省略する。動き補償部９は、図１における動き補償部９と同一であることを示す。つまり、その内部構成図は、図７に示した内部構成図と同一であり、その動作フローチャートは、図８に示した動作フローチャートと同一である。The decoding device of the present invention is characterized by the configuration and operation of the motion compensation unit 9. Since the configuration and operation of each of the above-mentioned elements other than the motion compensation unit 9 are already known, detailed description will be omitted. The motion compensation unit 9 is identical to the motion compensation unit 9 in FIG. 1. In other words, its internal configuration diagram is identical to the internal configuration diagram shown in FIG. 7, and its operational flowchart is identical to the operational flowchart shown in FIG. 8.

以下、上記構成の装置の動作を説明する。The operation of the device having the above configuration will now be described.

まず、エントロピー復号部５１においてビットストリームが解析され、個々の符号化データに切り分けられる。量子化直交変換係数５２は逆量子化部６に送られ、逆量子化ステップ・パラメータ１７を用いて逆量子化される。この結果が逆直交変換部７において逆直交変換され、復号加算部５３に送られる。逆直交変換部は、ＤＣＴ等、符号化装置で用いるものと同じものを用いる。First, the bitstream is analyzed in the entropy decoding unit 51 and separated into individual pieces of coded data. The quantized orthogonal transform coefficients 52 are sent to the inverse quantization unit 6, where they are inversely quantized using the inverse quantization step parameter 17. The result is then inversely orthogonally transformed in the inverse orthogonal transform unit 7 and sent to the decoding and adding unit 53. The inverse orthogonal transform unit uses the same techniques as those used in the coding device, such as DCT.

動き補償部９には、動きパラメータ１１として、次の３種の情報が送られる。The motion compensation unit 9 receives the following three types of information as motion parameters 11:

即ち、エントロピー復号部５１でビットストリームから復号された動きベクトル２５、変形パターン情報２６ａと、被予測画像領域（本実施の形態では、固定サイズブロック）の画面内位置を示す情報２７ａが入力される。この際、動きベクトル２５、被予測画像領域の画面内位置２７ａは、被予測画像領域毎に固有の値であるが、変形パターン情報２６ａは、被予測画像領域毎に固有の値であっても、被予測画像領域を複数まとめたより大きな画像（例えば、画像フレームやＩＳＯ／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１１に開示されるＶＯＰなど）毎に符号化されていて、その単位に含まれる全ての被予測画像領域について同じ変形パターン情報を用いるように符号化されていてもよい。動き補償部９は、これらの３種類の情報に従ってフレームメモリ５４中の参照画像から予測画像データ１２を取り出す。予測画像生成の処理については、動き補償部９の動作説明の箇所で述べる。That is, the entropy decoding unit 51 inputs the motion vector 25, transformation pattern information 26a, and information 27a indicating the on-screen position of the predicted image region (in this embodiment, a fixed-size block) decoded from the bitstream. The motion vector 25 and the on-screen position 27a of the predicted image region are unique to each predicted image region. However, the transformation pattern information 26a may be unique to each predicted image region, or may be coded for a larger image (e.g., an image frame or a VOP as disclosed in ISO/IEC JTC1/SC29/WG11) that combines multiple predicted image regions, with the same transformation pattern information being used for all predicted image regions included in that unit. The motion compensation unit 9 extracts predicted image data 12 from the reference image in the frame memory 54 based on these three types of information. The process of generating a predicted image will be described in the section explaining the operation of the motion compensation unit 9.

動き補償部９には、エントロピー復号部５１で復号された動きパラメータ１１が送られる。The motion compensation unit 9 receives the motion parameters 11 decoded by the entropy decoding unit 51.

動き補償部９は、これらの動きパラメータ１１にしたがってフレームメモリ５４中の参照画像から予測画像データ１２を取り出す。この発明による画像の予測方式は、被予測ブロックを構成する画素と予測画像を構成する画素が１対１に対応しているので、従来のブロックマッチングにおける動き補償と同様、動きパラメータ１１によって予測画像領域が一意に決定される。The motion compensation unit 9 extracts predicted image data 12 from the reference image in the frame memory 54 in accordance with these motion parameters 11. In the image prediction method of this invention, the pixels constituting the predicted block correspond one-to-one to the pixels constituting the predicted image, so the predicted image area is uniquely determined by the motion parameters 11, just like in conventional block matching motion compensation.

復号加算部５３は、イントラ／インター符号化指示フラグ１６の値に基づいて、イントラ符号化ブロックならば、逆直交変換部の出力をそのまま復号画像５５として出力し、インター符号化ブロックなら、逆直交変換部の出力に予測画像データ１２を加算して復号画像５５として出力する。復号画像５５は表示制御部５６に送られ、図示していない表示デバイスに出力されるともに、以降のフレームの復号処理において参照画像として用いるために、フレームメモリ５４に書き込まれる。Based on the value of the intra/inter coding indication flag 16, the decoding and adding unit 53 outputs the output of the inverse orthogonal transform unit as a decoded image 55 if the block is intra-coded, or adds the predicted image data 12 to the output of the inverse orthogonal transform unit and outputs the result as a decoded image 55 if the block is inter-coded. The decoded image 55 is sent to the display control unit 56 and output to a display device (not shown), and is also written to the frame memory 54 for use as a reference image in the decoding process of subsequent frames.

次に、動き補償部９における予測画像生成処理について説明する。Next, the predicted image generation process in the motion compensation unit 9 will be described.

本実施の形態では、画像の予測方式は、被予測画像領域を構成する画素と予測画像を構成する画素の位置の対応が予め変形パターン情報２６ａによって規定されているので、動きベクトル２５による変位と変形パターン情報２６ａによる位置補正に基づく簡単なアドレス計算と内挿処理とによって予測画像が生成される。In this embodiment, the image prediction method uses transformation pattern information 26a to predetermine the positional correspondence between pixels constituting the predicted image area and pixels constituting the predicted image. Therefore, a predicted image is generated by simple address calculation and interpolation based on the displacement caused by motion vector 25 and the position correction caused by transformation pattern information 26a.

動き補償部９の内部構成を図２９に示す。The internal configuration of the motion compensation unit 9 is shown in FIG.

同図において、３７は対応点決定部、３８はメモリ読み出しアドレス生成部である。In the figure, 37 is a corresponding point determination unit, and 38 is a memory read address generation unit.

また、図３０は、その動作を示すフローチャートである。FIG. 30 is a flowchart showing the operation.

また、図３１は、動きベクトルにより参照画像から切り出されて指定された量だけ、被予測画像の座標位置に移動する動きを説明する図、図３２は、移動先で更に指定された変形パターンでアドレッシングを行う動作を説明する図である。FIG. 31 illustrates the movement of a reference image extracted by a motion vector and moved to a coordinate position in a predicted image by a specified amount, and FIG. 32 illustrates the operation of addressing the destination image using a further specified transformation pattern.

いずれの図においても、○は整数画素、×は半画素の位置を示すものとする。In all the figures, ◯ indicates an integer pixel position, and × indicates a half pixel position.

以下、図２９及び図３０をもとに、本実施の形態における動き補償部９の動作を説明する。The operation of the motion compensation unit 9 in this embodiment will be explained below with reference to Figures 29 and 30.

１）対応点の決定まず、対応点決定部３７において、入力される動きベクトル２５、変形パターン情報２６ａに基づき、被予測が増量域内の各画素に対応する予測画像のサンプル位置を算出する。まず、動きベクトル２５に基づき、被予測画像の現在位置に対する予測画像の基準位置を決定する。この処理は、図３１に示すように、被予測画像の画面内位置２７ａを（ｉ，ｊ）、動きベクトル２５を（ｄｘ，ｄｙ）としたとき、（ｉ’，ｊ’）＝（ｉ＋ｄｘ，ｊ＋ｄｙ）を定めることに相当する（図３０のＳ７１）。1) Determining Corresponding Points First, the corresponding point determination unit 37 calculates the sample position of the predicted image corresponding to each pixel in the predicted image's increased area based on the input motion vector 25 and transformation pattern information 26a. First, the reference position of the predicted image relative to the current position of the predicted image is determined based on the motion vector 25. As shown in Figure 31, this process corresponds to determining (i', j') = (i + dx, j + dy) when the in-screen position 27a of the predicted image is (i, j) and the motion vector 25 is (dx, dy) (S71 in Figure 30).

次いで、変形パターン情報２６ａに基づいて座標点（ｉ’，ｊ’）を補正し、最終的な予測画像のサンプル位置を求める。図３２は、変形パターン情報２６ａが「縦横１／２縮小」を示す場合の例を示している。この変形パターンによれば、予測画像の実行面積は、被予測画像領域の画面中に占める実行面積の１／４になる。つまり、予測画像が被予測画像領域に対して縮小される形となり、これにより、画面中で拡大を伴う動きなどの予測を効率化できる。具体的な位置補正の処理としては、参照画像中の位置（ｉ’，ｊ’）に対応する補正位置（ｉ”，Ｊ ”）を求める。これは、次の演算で実現できる（図３０のＳ７２）。Next, the coordinate point (i', j') is corrected based on the deformation pattern information 26a, to determine the final sample position of the predicted image. Figure 32 shows an example where the deformation pattern information 26a indicates "1/2 reduction vertically and horizontally." According to this deformation pattern, the effective area of the predicted image is 1/4 of the effective area of the image region to be predicted on the screen. In other words, the predicted image is reduced relative to the image region to be predicted, which improves the efficiency of prediction of movements that involve enlargement on the screen. Specifically, the position correction process determines the corrected position (i", J") corresponding to the position (i', j') in the reference image. This can be achieved by the following calculation (S72 in Figure 30):

なお、図３２では、ｂｌｏｃｋ＿ｗｉｄｔｈ＝ｂｌｏｃｋ＿ｈｅｉｇｈｔ＝４の場合、つまり、画素数が４×４を１ブロックとした場合を示しているが、これらは任意の正の整数、つまり、任意の画素数の高さと幅のブロックを取り得る。 Note that Figure 32 shows the case where block_width = block_height = 4, that is, where one block has 4 x 4 pixels, but these can be any positive integers, that is, blocks with height and width of any number of pixels.

以上によって求めた座標点（ｉ”，ｊ”）が、（ｉ，ｊ）に対応する予測画像サンプル位置として出力される。The coordinate point (i", j") obtained in this way is output as the predicted image sample position corresponding to (i, j).

２）予測画像生成用データの読み出し対応点決定部３７から出力される予測画像サンプル位置をもとに、メモリ読み出しアドレス生成部３８がフレームメモリ５４に蓄積されている参照画像中の予測画像生成に必要な画像データの位置を特定するメモリアドレスを生成し、予測画像生成用データを読み出す。2) Reading Data for Predicted Image Generation Based on the predicted image sample positions output by the corresponding point determination unit 37, the memory read address generation unit 38 generates a memory address that identifies the location of the image data required to generate the predicted image within the reference image stored in the frame memory 54, and reads the data for generating the predicted image.

３）予測画像の生成予測画像を生成する画素の内、整数画素位置の座標値のみをアドレッシングする場合は、予測画像生成用データがそのまま予測画像構成画素となる。一方、半画素精度の位置の座標値がアドレッシングされた場合、半画素生成部２３２によって予測画像生成用データの内挿処理がされて、半画素値が生成される。具体的に、半画素値の生成は、図３３による。図３３の方法は、単に加算２分演算であり、符号化装置の実施の形態１で説明した半画素生成部２３２のフロー図である図８のＳ２４を、再び説明したものである。3) Generation of Predicted Image When addressing only integer-pel coordinate values of pixels for generating a predicted image, the data for generating the predicted image directly becomes the pixels that make up the predicted image. On the other hand, when addressing coordinate values of positions with half-pel accuracy, the half-pel generator 232 interpolates the data for generating the predicted image to generate half-pel values. Specifically, half-pel values are generated as shown in Figure 33. The method shown in Figure 33 is a simple addition/division operation, and is a re-explanation of S24 in Figure 8, the flow diagram for the half-pel generator 232 described in the first embodiment of the encoding device.

なお、上記図３２による動き補償部９の動作を説明したが、変形パターン情報が図３２とは異なる内容を含んでいる場合には、変形処理が異なってくる。Although the operation of the motion compensation unit 9 has been described above using FIG. 32, if the transformation pattern information contains content different from that shown in FIG. 32, the transformation process will be different.

他の変形パターンの例として、図３４の場合を説明する。この場合、変形後の（ｉ”，ｊ”）は、以下のようにして求められる。As an example of another transformation pattern, the case of Figure 34 will be explained. In this case, (i", j") after transformation can be calculated as follows:

このように、変形パターン情報が変形ブロックをどのように切り出すかを取り決めておけば、それに基づいて簡単なアドレッシングで変形処理した動き補償を行った復号ができる。 In this way, if the transformation pattern information determines how to extract the transformation blocks, it is possible to perform the transformation-processed motion-compensated decoding using simple addressing based on the transformation pattern information.

以上のように、本実施の形態の画像復号装置によれば、予め変形パターンを用意しておき、対応するモード情報に従って簡単なサンプル位置の計算を行うだけで、平行移動では追跡しきれない複雑な動きを効率よく予測して符号化されたビットストリームから再生画像を得ることができる。As described above, the image decoding device of this embodiment can efficiently predict complex motion that cannot be tracked by translation alone, and generate a reconstructed image from the coded bitstream, simply by preparing transformation patterns and performing simple calculations of sample positions according to the corresponding mode information.

本実施の形態では、直交変換符号化以外の別の符号化方式によって予測誤差信号を符号化したビットストリームであっても、動き補償部９以外の予測誤差信号復号処理のための要素を変更することで、同様の効果を得ることができる。In this embodiment, even if the bitstream is one in which the prediction error signal is coded using a coding method other than orthogonal transform coding, the same effect can be achieved by changing elements for the prediction error signal decoding process other than the motion compensation unit 9.

また、本実施の形態では、固定サイズブロックを単位として復号処理を行う例について述べたが、これは通常のテレビ信号のフレームを単位とする復号装置に適用できるだけでなく、固定サイズブロックから構成される任意形状画像オブジェクト（例：ＩＳＯ／ＩＥＣＪＴＣＩ／ＳＣ２９／ＷＧ１１／Ｎ１７９６で開示されるＶｉｄｅｏＯｂｊｅｃｔＰｌａｎｅなど）を単位とする復号装置にも適用可能である。例えば、実施の形態１で述べた図９に示すように、静止した背景の前に人物像が存在するようなシーンにおいて、人物像を１つの画像オブジェクトとして、それを取り囲む外接四角形内の領域を小ブロックに分割し、画像オブジェクトを含むブロックを有効ブロックとして符号化されたビットストリームを復号する場合が考えられる。この場合は、これら有効ブロックに対して同様の処理を適用すればよい。Furthermore, while this embodiment describes an example of decoding processing in units of fixed-size blocks, this is applicable not only to decoding devices that process frames of regular television signals as units, but also to decoding devices that process arbitrary-shaped image objects composed of fixed-size blocks (e.g., the Video Object Plane disclosed in ISO/IEC JTCI/SC29/WG11/N1796). For example, in a scene with a human figure in front of a stationary background, as shown in Figure 9 described in embodiment 1, the human figure may be treated as a single image object, the area within the circumscribing rectangle surrounding the figure may be divided into small blocks, and the block containing the image object may be treated as a valid block, and the encoded bitstream may be decoded. In this case, similar processing may be applied to these valid blocks.

実施の形態８．実施の形態７の画像復号装置は、実施の形態１ないし６の画像復号装置に対応した整数画素又は半画素のみを用いてアドレッシング（座標指定）するだけで、予め決められた変形処理をして動き補償を行う装置を説明した。本実施の形態では、アドレッシングの際に、半画素生成以外の演算を行って、より精密な動き補償を行う画像復号装置を説明する。Embodiment 8. The image decoding device of Embodiment 7 corresponds to the image decoding device of Embodiments 1 through 6. It performs motion compensation by performing predetermined transformations using only integer or half-pixel addressing (coordinate specification). In this embodiment, we describe an image decoding device that performs more precise motion compensation by performing calculations other than half-pixel generation during addressing.

図３５は、本実施の形態における圧縮符号化されたディジタル画像を伸長再生する画像復号装置の構成を示したものである。FIG. 35 shows the configuration of an image decoding device for decompressing and reproducing compressed and encoded digital images in this embodiment.

同図において、９０は動き補償部、２５ｂは０〜４本の動きベクトル、６０は内挿処理精度指示情報である。In the figure, 90 denotes a motion compensation unit, 25b denotes zero to four motion vectors, and 60 denotes interpolation processing precision instruction information.

また、図３６は、動き補償部９０の内部構成図である。FIG. 36 is a diagram showing the internal configuration of the motion compensation unit 90.

図において、３７ｂは動きパラメータとして、図３５に示された動きベクトル２５ｂ、変形パターン情報２６ａ、被予測画像領域の画面内位置２７ａ及び内挿処理精度指示情報６０を入力として対応点を決める対応点決定部であり、２３２ｂは演算によって内挿した座標位置を求める内挿処理部である。この際、被予測画像領域の画面内位置２７ａは、被予測画像領域毎に固有の値であるが、動きベクトル２５ｂと変形パターン情報２６ａは、被予測画像領域毎に固有の値であっても、被予測画像領域を複数まとめたより大きな画像（例えば、画像フレームやＩＳＯ／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１１に開示されるＶＯＰなど）毎に符号化されていて、その単位に含まれる全ての被予測画像領域について同じ動きベクトルと変形パターン情報を用いるように符号化されていてもよい。In the figure, reference numeral 37b denotes a corresponding point determination unit that determines corresponding points using motion parameters, such as the motion vector 25b shown in FIG. 35, deformation pattern information 26a, the on-screen position 27a of the predicted image region, and interpolation processing accuracy instruction information 60, as input. Reference numeral 232b denotes an interpolation processing unit that calculates the interpolated coordinate position. In this case, the on-screen position 27a of the predicted image region is a unique value for each predicted image region. However, the motion vector 25b and deformation pattern information 26a may be unique values for each predicted image region. Alternatively, they may be coded for a larger image (e.g., an image frame or a VOP as disclosed in ISO/IEC JTC1/SC29/WG11) that combines multiple predicted image regions, and the same motion vector and deformation pattern information may be used for all predicted image regions included in that unit.

また、図３７は、図３６の動き補償部の動作フローチャート、図３８は、同じく動作を説明する図である。FIG. 37 is a flowchart illustrating the operation of the motion compensation unit of FIG. 36, and FIG. 38 is a diagram illustrating the same operation.

本実施の形態においては、従来の動きベクトルが該当ブロックを代表する１本のみであったのに対して、参照画像のブロックの四角の頂点４本までが入力され、それに対応して、まず、座標位置が後に対応点の決定の動作で説明する演算で求められる。更に、その求まった座標位置を内挿処理指示情報で丸め込んで座標位置を確定する。In this embodiment, whereas conventionally only one motion vector was used to represent the block, up to four vertices of the rectangle of the reference image block are input, and corresponding coordinate positions are first determined by a calculation described later in the operation for determining corresponding points. The determined coordinate positions are then rounded off using interpolation processing instruction information to determine the coordinate positions.

動き補償部９０以外の部分の動作は、実施の形態７の装置と同様である。即ち、エントロピー復号部５１において、ビットストリームが解析され、個々の符号化データに切り分けられる。量子化直交変換係数５２は、量子化ステップ・パラメータ１７を用いて逆量子化部６、逆直交変換部７で復号処理され、復号加算部５３に送られる。復号加算部５３は、イントラ／インター符号化指示フラグ１６の値に基づいて、イントラ符号化ブロック、インター符号化ブロックの区別に応じて予測画像データ１２をそのまま又は加算して復号画像５５として出力する。復号画像５５は、表示制御部５６に送られ、表示デバイスに出力され、また、参照画像としてフレームメモリ５４に書き込まれる。The operation of the components other than the motion compensation unit 90 is the same as in the device of embodiment 7. That is, the entropy decoding unit 51 analyzes the bitstream and separates it into individual coded data. The quantized orthogonal transform coefficients 52 are decoded by the inverse quantization unit 6 and the inverse orthogonal transform unit 7 using the quantization step parameter 17 and sent to the decoding and summing unit 53. The decoding and summing unit 53 outputs the predicted image data 12 as is or adds it to the predicted image data 12 depending on whether it is an intra-coded block or an inter-coded block, based on the value of the intra/inter coding indication flag 16, as a decoded image 55. The decoded image 55 is sent to the display control unit 56, output to a display device, and written to the frame memory 54 as a reference image.

以下、動き補償部９０における予測画像生成処理について説明する。The predicted image generation process in the motion compensation unit 90 will be described below.

本実施の形態では、変形パターン情報２６ａに従って、必要な本数の動きベクトル２５ｂを用いて変形に必要な変換式を得て、その変換式によって被予測画像領域の各画素に対応する予測画像構成画素のサンプル位置を決定した後、内挿処理精度指示情報で定められた画素制度に従った簡単な内挿処理によって予測画像が生成される。In this embodiment, the transformation formula required for transformation is obtained using the required number of motion vectors 25b in accordance with the transformation pattern information 26a, and the sample positions of the predicted image constituent pixels corresponding to each pixel in the predicted image region are determined using the transformation formula. After that, the predicted image is generated by a simple interpolation process in accordance with the pixel precision determined by the interpolation process precision instruction information.

以下、図３６ないし図３８をもとに、本実施の形態における動き補償部９０の動作を説明する。The operation of the motion compensation unit 90 in this embodiment will be described below with reference to Figures 36 to 38.

１）対応点の決定対応点決定部３７ｂにおいて、入力される動きベクトル２５ｂ、変形パターン情報２６ａに基づき、被予測画像領域内の各画素に対応する予測画像のサンプルすべき座標位置を算出する。図３８のように、ここでは動きベクトル２５ｂは、被予測画像領域の外接四角形の各頂点の４つの動きベクトルとする。まず、変形パターン情報２６ａに対応して変形に必要な変換式を得る。例えば、以下のような変換式を用いる。1) Determining Corresponding Points The corresponding point determiner 37b calculates the coordinate positions at which to sample the predicted image corresponding to each pixel in the predicted image area based on the input motion vector 25b and transformation pattern information 26a. As shown in Figure 38, the motion vectors 25b are the four motion vectors at the vertices of the circumscribing rectangle of the predicted image area. First, the transformation formula required for transformation is obtained in accordance with the transformation pattern information 26a. For example, the following transformation formula is used:

１−１）動きがなく、静止状態（必要な動きベクトルの本数：０本）（ｉ’，ｊ’）＝（ｉ，ｊ）（９）１−２）平行移動（必要な動きベクトルの本数：１本）（ｉ’，ｊ’）＝（ｉ＋ｄｘ０，ｊ＋ｄｙ０）（１０）１−３）等方変換（必要な動きベクトルの本数：２本）但し、（ｘ０，ｙ０）：被予測画像領域の外接四角形の左上隅頂点座標（ｘ１，ｙ１）：被予測画像領域の外接四角形の右上隅頂点座標（ｘ０’，ｙ０’）：第１の動きベクトル（ｄｘ０，ｄｙ０）によって（ｘ０，ｙ０）を変位させた座標（ｘ１’，ｙ１’）：第２の動きベクトル（ｄｘ１，ｄｙ１）によって（ｘ１，ｙ１）を変位させた座標Ｗ：ｘ１−ｘ０１−４）アフィン変換（必要な動きベクトルの本数：３本）但し、（ｘ０，ｙ０）：被予測画像領域の外接四角形の左上隅頂点座標（ｘ１，ｙ１）：被予測画像領域の外接四角形の右上隅頂点座標（ｘ２，ｙ２）：被予測画像領域の外接四角形の左下隅頂点座標（ｘ０’，ｙ０’）：第１の動きベクトル（ｄｘ０，ｄｙ０）によって（ｘ０，ｙ０）を変位させた座標（ｘ１’，ｙ１’）：第２の動きベクトル（ｄｘ１，ｄｙ１）によって（ｘ１，ｙ１）を変位させた座標（ｘ２’，ｙ２’）：第３の動きベクトル（ｄｘ２，ｄｙ２）によって（ｘ２，ｙ２）を変位させた座標Ｗ：ｘ１−ｘ０Ｈ：ｙ２−ｙ０１−５）透視変換（必要なベクトルの本数：４本）但し、（ｘ０，ｙ０）：被予測画像領域の外接四角形の左上隅頂点座標（ｘ１，ｙ１）：被予測画像領域の外接四角形の右上隅頂点座標（ｘ２，ｙ２）：被予測画像領域の外接四角形の左下隅頂点座標（ｘ３，ｙ３）：被予測画像領域の外接四角形の右下隅頂点座標（ｘ０’，ｙ０’）：第１の動きベクトル（ｄｘ０，ｄｙ０）によって（ｘ０，ｙ０）を変位させた座標（ｘ１’，ｙ１’）：第２の動きベクトル（ｄｘ１，ｄｙ１）によって（ｘ１，ｙ１）を変位させた座標（ｘ２’，ｙ２’）：第３の動きベクトル（ｄｘ２，ｄｙ２）によって（ｘ２，ｙ２）を変位させた座標（ｘ３’，ｙ３’）：第４の動きベクトル（ｄｘ３，ｄｙ３）によって（ｘ３，ｙ３）を変位させた座標Ｗ：ｘ１−ｘ０Ｈ：ｙ２−ｙ０変形パターン情報２６ａの形式としては、上記の変換式である式（９）ないし式（１３）を直接識別するビットでもよいし、各変換が動きベクトルの本数に対応していることから、動きベクトルの本数を表現するビットでもよい。以上の変換式によって、被予測画像領域の点（ｉ，ｊ）が参照画像中の（ｉ’，ｊ’）に対応付けられる。また、対応点位置計算の際に、予測画像のサンプル位置は、内挿処理精度指示情報６０で定められる精度の値まで取り得るようにする。例えば、半画素精度までに丸め込むとすれば、上記変換式によって得られた（ｉ’，ｊ ’）は、半画素精度の値に丸められる。１／４画素情報までとすれば、（ｉ’，ｊ’）は、１／４画素精度の値に丸め込められる。このサンプル位置精度を表す情報は、ビットストリーム中から抽出する。1-1) No motion, stationary state (number of required motion vectors: 0) (i', j') = (i, j) (9) 1-2) Translation (number of required motion vectors: 1) (i', j') = (i + dx0, j + dy0) (10) 1-3) Isotropic transformation (number of required motion vectors: 2) where (x0, y0): coordinates of the upper left vertex of the circumscribing rectangle of the image area to be predicted (x1, y1): coordinates of the upper right vertex of the circumscribing rectangle of the image area to be predicted (x0', y0'): coordinates obtained by displacing (x0, y0) by the first motion vector (dx0, dy0) (x1', y1'): coordinates obtained by displacing (x1, y1) by the second motion vector (dx1, dy1) W: x1 - x0 1-4) Affine transformation (number of motion vectors required: 3) where (x0, y0): coordinates of the upper left vertex of the circumscribing rectangle of the image area to be predicted (x1, y1): coordinates of the upper right vertex of the circumscribing rectangle of the image area to be predicted (x2, y2): coordinates of the lower left vertex of the circumscribing rectangle of the image area to be predicted (x0', y0'): coordinates obtained by displacing (x0, y0) by the first motion vector (dx0, dy0) (x1', y1'): coordinates obtained by displacing (x1, y1) by the second motion vector (dx1, dy1) (x2', y2'): coordinates obtained by displacing (x2, y2) by the third motion vector (dx2, dy2) W: x1 - x0 H: y2 - y0 1-5) Perspective transformation (number of vectors required: 4) however, (x0, y0): Coordinates of the upper left vertex of the circumscribing rectangle of the image area to be predicted (x1, y1): Coordinates of the upper right vertex of the circumscribing rectangle of the image area to be predicted (x2, y2): Coordinates of the lower left vertex of the circumscribing rectangle of the image area to be predicted (x3, y3): Coordinates of the lower right vertex of the circumscribing rectangle of the image area to be predicted (x0', y0'): Coordinates obtained by displacing (x0, y0) by the first motion vector (dx0, dy0) (x1', y1'): Coordinates obtained by displacing (x1, y1) by the second motion vector (dx1, dy1) (x2', y2'): Coordinates obtained by displacing (x2, y2) by the third motion vector (dx2, dy2) (x3', y3'): Coordinates obtained by displacing (x3, y3) by the fourth motion vector (dx3, dy3) W: x1-x0 H: y2 - y0 The format of the transformation pattern information 26a may be bits that directly identify the above transformation formulas (9) to (13), or bits that represent the number of motion vectors, since each transformation corresponds to the number of motion vectors. The above transformation formulas associate point (i, j) in the predicted image area with point (i', j') in the reference image. Furthermore, when calculating the corresponding point position, the sample position of the predicted image can take on values up to the accuracy specified by the interpolation processing accuracy instruction information 60. For example, if rounding is to be performed to half-pixel accuracy, (i', j') obtained by the above transformation formulas is rounded to a value with half-pixel accuracy. If quarter-pixel information is used, (i', j') is rounded to a value with quarter-pixel accuracy. This information representing the sample position accuracy is extracted from the bitstream.

以上のように、本実施の形態では、動きベクトル２５ｂからダイレクトに対応点決定ルールを定め、これに基づいて予測画像のサンプル位置を決定する。As described above, in this embodiment, a corresponding point determination rule is determined directly from the motion vector 25b, and the sample positions of the predicted image are determined based on this rule.

２）予測画像生成用データの読み出し対応点決定部３７ｂから出力される予測画像サンプル位置をもとに、メモリ読み出しアドレス生成部３８ｂがフレームメモリ５４に蓄積されている参照画像中の予測画像生成に必要な画像データの位置を特定するメモリアドレスを生成し、予測画像生成用データを読み出す。2) Reading Data for Predicted Image Generation Based on the predicted image sample positions output by the corresponding point determination unit 37b, the memory read address generation unit 38b generates a memory address that identifies the location of the image data required for predictive image generation within the reference images stored in the frame memory 54, and then reads the predicted image generation data.

３）予測画像の生成予測画像を構成する画素の内、整数画素位置の座標値のみをアドレッシングする場合は、予測画像生成用データがそのまま予測画像構成画素となる。本実施の形態では、予測画像をアドレッシングしてサンプルする位置は、上記のように予め定められた精度、例えば、半画素、１／４画素の値を取り得る。実数精度の位置の画素の場合は、内挿処理部２３２ｂにおいて、内挿処理精度指示情報６０で定められる整数精度とする指示に基づき、予測画像の整数画素値が生成される。3) Generation of Predicted Image When addressing only the coordinate values of integer pixel positions of the pixels that make up the predicted image, the data used to generate the predicted image becomes the pixels that make up the predicted image. In this embodiment, the positions at which the predicted image is addressed and sampled can have a predetermined precision, such as half-pixel or quarter-pixel values, as described above. For pixels located at real-number precision, the interpolation processing unit 232b generates integer pixel values of the predicted image based on the integer precision specified in the interpolation processing precision instruction information 60.

本実施の形態では、対応点決定部において、既に最終的なサンプル位置を内挿処理精度指示情報６０で指定される精度で丸めるが、内挿処理は、図３９のように、次の式（１５）の処理をする。なお、半画素精度の位置であれば、実施の形態１に述べた半画素生成部２３２と全く同じ処理となる。In this embodiment, the corresponding point determination unit already rounds the final sample positions to the precision specified by the interpolation processing precision instruction information 60, and the interpolation processing is performed using the following equation (15), as shown in Figure 39. Note that if the position is at half-pixel precision, the processing is exactly the same as that performed by the half-pixel generation unit 232 described in embodiment 1.

以上のように、本実施の形態の画像復号装置によれば、ゼロ又は複数本の動きベクトルを用いて簡単なサンプル位置計算を行うことで、複雑度の異なる動きを効率よく予測して符号化されたビットストリームから再生画像を得ることができる。 As described above, according to the image decoding device of this embodiment, by performing simple sample position calculations using zero or multiple motion vectors, it is possible to efficiently predict motion of different degrees of complexity and obtain a reconstructed image from an encoded bitstream.

実施の形態１ないし実施の形態６における画像符号化装置及び実施の形態７における画像復号装置は、整数画素及び半画素のアドレッシングのみで変形処理した動き補償を用いて高速で複雑な画像符号化、復号を行っている。The image coding apparatuses according to the first through sixth embodiments and the image decoding apparatus according to the seventh embodiment perform high-speed, complex image coding and decoding using motion compensation with modified processing using only integer-pixel and half-pixel addressing.

これに対して、本実施の形態における画像復号装置は、同様の構成を用いて、しかし、対応点決定の演算を参照画像と被予測画像の対象ブロックがよりマッチングし、従って、より適切な動きを得るため強化したものである。これにより、よりスムーズな動きを得ることができる。In contrast, the image decoding device in this embodiment uses a similar configuration, but the calculation for determining corresponding points has been strengthened to ensure a better match between the reference image and the current block in the predicted image, thereby obtaining more appropriate motion. This allows for smoother motion.

本実施の形態では、直交変換符号化以外の別の符号化方式によって予測誤差信号を符号化したビットストリームであっても、動き補償部９０以外の予測誤差信号復号処理のための要素を変更することで、同様の効果を得ることができるのは、実施の形態７と同じである。In this embodiment, as in the seventh embodiment, even if the bitstream is one in which the prediction error signal is coded by a coding method other than orthogonal transform coding, the same effect can be obtained by changing elements for the prediction error signal decoding process other than the motion compensation unit 90.

また、本実施の形態では、固定サイズブロックを単位として復号処理を行う例について述べたが、これは通常のテレビ信号のフレームを単位とする復号装置に適用できるだけでなく、固定サイズブロックから構成される任意形状画像オブジェクト（ＶｉｄｅｏＯｂｊｅｃｔＰｌａｎｅなど）を単位とする復号装置にも適用可能であるのも、実施の形態７と同じである。Furthermore, while this embodiment describes an example of decoding processing performed in units of fixed-size blocks, this is applicable not only to decoding devices that use frames of regular television signals as units, but also to decoding devices that use arbitrary-shape image objects (e.g., video object planes) composed of fixed-size blocks as units, as in embodiment 7.

実施の形態９．上記各実施の形態では、動きを検出する被予測画像の１ブロックを構成する画素数については言及しなかった。言い換えれば、任意の高さ（Ｈ）と幅（Ｗ）の画素を対象と考えてきた。本実施の形態では、このＨとＷの画素数を２のべき乗に制限して座標演算を簡略化する場合を説明する。こうすることで、対応点決定部の負荷が減り、演算を高速化できる。Embodiment 9. In the above embodiments, no mention was made of the number of pixels constituting one block of a predicted image for which motion is to be detected. In other words, we considered pixels of any height (H) and width (W). In this embodiment, we explain a case in which the number of pixels in H and W is limited to a power of two, thereby simplifying coordinate calculations. This reduces the load on the corresponding point determination unit and speeds up calculations.

本実施の形態では、図３６に示した実施の形態８における動き補償部９０の内、３７ｃとして対応点決定部の動作のみが異なるので、対応点決定部の動作についてのみ説明する。In this embodiment, only the operation of the corresponding point determination unit 37c of the motion compensation unit 90 in the eighth embodiment shown in FIG. 36 is different, and therefore only the operation of the corresponding point determination unit will be described.

図４０は、対応点決定部３７ｃの動作の様子を示すフローチャートである。FIG. 40 is a flowchart showing the operation of the corresponding point determining unit 37c.

また、図４１は、対応点決定部３７ｃの動作を説明する図である。FIG. 41 is a diagram for explaining the operation of the corresponding point determining unit 37c.

以下、図４０をもとに、本実施の形態における対応点決定部３７ｃの動作を説明する。The operation of the corresponding point determination unit 37c in this embodiment will be described below with reference to FIG.

本実施の形態における対応点決定部３７ｃは、動きベクトル２５ｂ、変形パターン情報２６ａ、内挿処理精度指示情報９１、被予測画像領域の画面内位置２７ａを入力とし、被予測画像領域内の各画素に対応する予測画像のサンプル位置を以下の式に基づいて算出して出力する。この際、被予測画像領域の画面内位置２７ａは、被予測画像領域毎に固有の値であるが、動きベクトル２５ｂと変形パターン情報２６ａは、被予測画像領域毎に固有の値であっても、被予測画像領域を複数まとめたより大きな画像（例えば、画像フレームやＩＳＯ／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１１に開示されるＶＯＰなど）毎に符号化されていて、その単位に含まれる全ての被予測画像領域について同じ動きベクトルと変形パターン情報を用いるように符号化されていてもよい。以下では、動きベクトルを最大３本使用する場合の例について説明する。In this embodiment, the corresponding point determination unit 37c receives the motion vector 25b, transformation pattern information 26a, interpolation processing accuracy instruction information 91, and the on-screen position 27a of the predicted image area as input, and calculates and outputs the sample position of the predicted image corresponding to each pixel in the predicted image area based on the following formula. In this case, the on-screen position 27a of the predicted image area is a unique value for each predicted image area. However, the motion vector 25b and transformation pattern information 26a may be unique values for each predicted image area. Alternatively, they may be coded for a larger image (e.g., an image frame or a VOP as disclosed in ISO/IEC JTC1/SC29/WG11) that combines multiple predicted image areas, and the same motion vector and transformation pattern information may be used for all predicted image areas included in that unit. The following describes an example in which up to three motion vectors are used.

動きベクトル２５ｂは、（ｘ０，ｙ０）と、図４１のように、被予測画像領域の外接四角形の左上隅及び右下隅の頂点を、左上隅の頂点から２のべき乗で表現可能な距離まで延長した点（ｘ０＋Ｗ’，ｙ０）（Ｗ’≧Ｗ，Ｗ’＝２^m）及び（ｘ０，ｙ０＋Ｈ’）（Ｈ’≧Ｈ，Ｈ’＝２ⁿ）の動きベクトルであるとする。 Motion vector 25b is defined as the motion vectors of (x0, y0) and the points (x0+W', y0) (W'≧W, W'=2 ^m ) and (x0, y0+H') (H'≧H, H'=2 ⁿ ) obtained by extending the vertices of the upper left and lower right corners of the circumscribing rectangle of the predicted image area from the upper left vertex to a distance that can be expressed as a power of 2, as shown in Figure 41.

これらの動きベクトルに基づいて、変形パターン情報２６ａに対応して、以下の変形に必要な変換式である式（１６）ないし式（１９）を得る。Based on these motion vectors, the transformation formulas (16) through (19) necessary for the following transformation are obtained in accordance with the transformation pattern information 26a.

１−１）動きなし（必要なベクトルの本数：０本）（ｉ’，ｊ’）＝（ｉ，ｊ）（１６）１−２）平行移動（必要なベクトルの本数：１本）（ｉ’，ｊ’）＝（ｉ＋ｄｘ０，ｊ＋ｄｙ０）（１７）１−３）等方変換（必要なベクトルの本数：２本）但し、（ｘ０，ｙ０）：被予測画像領域の外接四角形の左上隅頂点座標（ｘ１，ｙ１）：被予測画像領域の外接四角形の右上隅頂点座標（ｘ０’，ｙ０’）：第１の動きベクトル（ｄｘ０，ｄｙ０）によって（ｘ０，ｙ０）を変位させた座標（ｘ１’，ｙ１’）：第２の動きベクトル（ｄｘ１，ｄｙ１）によって（ｘ０＋Ｗ’，ｙ０）を変位させた座標１−４）アフィン変換（必要な動きベクトルの本数：３本）但し、（ｘ０，ｙ０）：被予測画像領域の外接四角形の左上隅頂点座標（ｘ１，ｙ１）：被予測画像領域の外接四角形の右上隅頂点座標（ｘ２，ｙ２）：被予測画像領域の外接四角形の左下隅頂点座標（ｘ０’，ｙ０’）：第１の動きベクトル（ｄｘ０，ｄｙ０）によって（ｘ０，ｙ０）を変位させた座標（ｘ１”，ｙ１”）：第２の動きベクトル（ｄｘ１，ｄｙ１）によって（ｘ０＋Ｗ’，ｙ０）を変位させた座標（ｘ２”，ｙ２”）：第３の動きベクトル（ｄｘ２，ｄｙ２）によって（ｘ０，ｙ０＋Ｈ’）を変位させた座標変形パターン情報２６ａの形式としては、上記の変換式である式（１６）ないし式（１９）を直接識別するために表記した複数のビットで構成されたビット列でもよいし、各変換が動きベクトルの本数に対応していることから、動きベクトルの本数を表現するビットでもよい。1-1) No movement (number of vectors required: 0) (i', j') = (i, j) (16) 1-2) Translation (number of vectors required: 1) (i', j') = (i + dx0, j + dy0) (17) 1-3) Isotropic transformation (number of vectors required: 2) where (x0, y0): coordinates of the upper left vertex of the circumscribing rectangle of the image area to be predicted; (x1, y1): coordinates of the upper right vertex of the circumscribing rectangle of the image area to be predicted; (x0', y0'): coordinates obtained by displacing (x0, y0) by the first motion vector (dx0, dy0); (x1', y1'): coordinates obtained by displacing (x0 + W', y0) by the second motion vector (dx1, dy1); 1-4) Affine transformation (number of motion vectors required: 3) where (x0, y0): coordinates of the upper left vertex of the circumscribing rectangle of the image area to be predicted (x1, y1): coordinates of the upper right vertex of the circumscribing rectangle of the image area to be predicted (x2, y2): coordinates of the lower left vertex of the circumscribing rectangle of the image area to be predicted (x0', y0'): coordinates obtained by displacing (x0, y0) by the first motion vector (dx0, dy0) (x1", y1"): coordinates obtained by displacing (x0 + W', y0) by the second motion vector (dx1, dy1) (x2", y2"): coordinates obtained by displacing (x0, y0 + H') by the third motion vector (dx2, dy2) The format of the transformation pattern information 26a may be a bit string composed of a plurality of bits expressed in order to directly identify the above transformation formulas (16) to (19), or may be bits representing the number of motion vectors, since each transformation corresponds to the number of motion vectors.

以上の変換式によって、被予測画像領域の点（ｉ，ｊ）が参照画像中の（ｉ’ ，ｊ’）に対応付けられる。また、対応点位置計算の際に、予測画像のサンプル位置は、ある定められた精度の値まで取り得るようにする。例えば、半画素精度までに丸め込むとすれば、上記変換式によって得られる（ｉ’，ｊ’）は、半画素精度の値となり、１／４画素精度に丸め込む指示とすれば、（ｉ’，ｊ’）は、１／４画素精度の値となる。このサンプル位置精度を表す情報は、ビットストリーム中から抽出する。The above conversion formula associates point (i,j) in the predicted image region with point (i',j') in the reference image. Furthermore, when calculating the corresponding point position, the sample position of the predicted image can take on values up to a certain precision. For example, if rounding is performed to half-pixel precision, (i',j') obtained by the above conversion formula will have half-pixel precision; if rounding is performed to quarter-pixel precision, (i',j') will have quarter-pixel precision. Information indicating this sample position precision is extracted from the bitstream.

２）予測画像生成用データの読み出し３）予測画像の生成に関しては、実施の形態８と全く同じ動作をするので、詳細記述は省略する。2) Reading data for generating a predicted image 3) Generating a predicted image The operations are exactly the same as in embodiment 8, so detailed description is omitted.

以上のように、本実施の形態の画像復号装置によれば、ゼロ又は複数本の動きベクトルを用いてサンプル位置計算を行う際に、Ｗ’又はＨ’による除算演算を全てビットシフト演算に置き換えて計算できるので、より高速にサンプル位置の決定を行うことができるとともに、複雑度の異なる動きを効率よく予測して符号化されたビットストリームから再生画像を得ることができる。As described above, the image decoding device of this embodiment can replace all division operations by W' or H' with bit shift operations when calculating sample positions using zero or multiple motion vectors. This allows for faster determination of sample positions and efficient prediction of motion with different degrees of complexity to produce a reconstructed image from the coded bitstream.

本実施の形態の動き補償を、他の符号化方式に基づく画像復号装置に用いる場合も、対応する要素を変更することで同様の効果を得ることができる。また、固定サイズブロックから構成される任意形状画像オブジェクト（ＶｉｄｅｏＯｂｊｅｃｔＰｌａｎｅなど）を単位とする復号装置にも適用可能であることは、実施の形態７と同じである。When the motion compensation of this embodiment is applied to an image decoding device based on another encoding method, similar effects can be achieved by changing the corresponding elements. Also, as with the seventh embodiment, it can be applied to a decoding device that uses an arbitrary-shape image object (such as a video object plane) composed of fixed-size blocks as a unit.

なお、本発明の画像符号化装置と、画像復号装置は、組にして特徴のある画像符号化復号システムを構成する。The image encoding device and image decoding device of the present invention are combined to form a distinctive image encoding/decoding system.

また、各動作フローチャートで表される動作を行うことにより、即ち、変形ブロックマッチングステップと、対応点決定ステップと動き補償画像生成ステップと復号加算ステップを備えることにより、特徴ある画像符号化方法、画像復号方法を得ることができる。Furthermore, by performing the operations shown in each operational flowchart, i.e., by including a modified block matching step, a corresponding point determination step, a motion-compensated image generation step, and a decoding and addition step, a distinctive image encoding method and an image decoding method can be obtained.

産業上の利用可能性以上のように、この発明によれば、実標本点の整数画素またはその中間の半画素を用いて、座標指定のみで得られる変形ブロックで画像の動きの予測を行うため、動きベクトルのような平行移動量だけでは予測がうまくいかない部分画像でも、アフィン変換のような複雑な演算なしに効率よく予測できる効果がある。また、回転やスケーリングなどの数式で記述可能な変形だけでなく、数式で簡単に記述できない、即ち、演算による実現が困難な変形にも対応できる効果がある。INDUSTRIAL APPLICABILITY As described above, this invention predicts image motion using integer pixels or intermediate half-pixels of actual sample points, using deformation blocks obtained by coordinate specification alone. This effectively enables efficient prediction without complex calculations such as affine transformations, even for partial images where prediction is difficult using only translational displacements such as motion vectors. Furthermore, this invention can handle not only transformations that can be described mathematically, such as rotation and scaling, but also transformations that cannot be easily described mathematically, i.e., transformations that are difficult to achieve through calculations.

対応復号装置でも、効率よく優れた画像を再現できる効果がある。This has the effect of enabling efficient reproduction of high-quality images even on compatible decoding devices.

また更に、アフィン変換のような複雑な画素補間演算を行うことなく、対応点の決定による座標指定のみによって回転と縮小または拡大を組合せた動きをうまく予測できる効果がある。Furthermore, it has the advantage that it can effectively predict movements that combine rotation and shrinking or enlarging by simply specifying the coordinates of corresponding points, without the need for complex pixel interpolation operations such as affine transformation.

また更に、平行移動によるブロックマッチングの動きベクトルを利用することで、変形ブロックマッチングの探索範囲を効果的に削減することができ、動き補償予測全体の演算量を低減できる効果がある。Furthermore, by using motion vectors from translational block matching, the search range for modified block matching can be effectively reduced, thereby reducing the overall computational complexity of motion compensated prediction.

また更に、アフィン変換のような複雑な画素補間演算を行うことなく、座標指定のみによって単純縮小または拡大スケーリングによる動きを効率良く予測できる効果がある。Furthermore, there is an advantage that the motion due to simple scaling can be efficiently predicted by specifying coordinates only, without performing complex pixel interpolation operations such as affine transformation.

また更に、変形パターンテーブルを参照するだけで対応点が決定できるので、アフィン変換のような簡単な数式では表現できないような任意の変形に伴う動きをもうまく予測できる効果がある。Furthermore, since corresponding points can be determined simply by referencing a deformation pattern table, it has the effect of being able to effectively predict movements associated with arbitrary deformations that cannot be expressed by simple mathematical formulas such as affine transformations.

また更に、フィルタを用いて変形ブロック内の空間周波数特性をフラットにすることができ、予測のミスマッチを低減できる効果がある。Furthermore, the spatial frequency characteristics within the transformed block can be flattened using a filter, which has the effect of reducing prediction mismatch.

画像符号化装置の変形ブロックマッチングと動き予測に対応した復号装置を構成したので、高速で最適な動き予測を行った画像データを復号再生できる効果がある。The decoder is designed to support the modified block matching and motion prediction of the image coding device, enabling high-speed decoding and playback of image data with optimal motion prediction.

また更に、画像復号装置のアドレッシングにおいて、自由度が高い動き予測を復号できるので、動きのスムーズな画像を再生できる効果がある。Furthermore, the addressing of the image decoding device allows for highly flexible motion prediction, which has the effect of enabling smooth image reproduction.

───────────────────────────────────────────────────── フロントページの続き (72)発明者西川博文東京都千代田区丸の内２丁目２番３号三菱電機株式会社内 (72)発明者黒田慎一東京都千代田区丸の内２丁目２番３号三菱電機株式会社内 (72)発明者井須芳美東京都千代田区丸の内２丁目２番３号三菱電機株式会社内 (72)発明者長谷川由里東京都千代田区丸の内２丁目２番３号三菱電機株式会社内（注）この公表は、国際事務局（ＷＩＰＯ）により国際公開された公報を基に作成したものである。なおこの公表に係る日本語特許出願（日本語実用新案登録出願）の国際公開の効果は、特許法第１８４条の１０第１項（実用新案法第４８条の１３第２項）により生ずるものであり、本掲載とは関係ありません。───────────────────────────────────────────────────── Continued from the front page (72) Inventor: Hirofumi Nishikawa 2-2-3 Marunouchi, Chiyoda-ku, Tokyo Ryo Electric Co., Ltd. (72) Inventor: Shinichi Kuroda 2-2-3 Marunouchi, Chiyoda-ku, Tokyo Ryo Electric Co., Ltd. (72) Inventor: Yoshimi Isu 2-2-3 Marunouchi, Chiyoda-ku, Tokyo Ryo Electric Co., Ltd. (72) Inventor: Yuri Hasegawa 2-2-3 Marunouchi, Chiyoda-ku, Tokyo Ryo Electric Co., Ltd. (Note) This publication is based on the publication published internationally by the International Bureau of International Patent Publication (WIPO). Please note that the effect of the international publication of the Japanese-language patent application (Japanese-language utility model registration application) related to this publication arises pursuant to Article 184-10, Paragraph 1 of the Patent Act (Article 48-13, Paragraph 2 of the Utility Model Act), and is unrelated to this publication.

Claims

[Claims]

1. An image coding device that divides an input image into predetermined blocks, includes a motion compensation prediction means for detecting inter-frame motion between the blocks, and performs compression coding of the input image. The image coding device includes: a motion detection unit that transforms only integer pixels, which are actual sample points present in corresponding partial regions of a reference image for motion detection, into a predetermined format, extracts them by specifying coordinates, and compares them with the integer pixels of the blocks of the input image, outputting a motion vector that provides the minimum error extracted by specifying coordinates; and a motion compensation unit that transforms the blocks of the reference image correspondingly according to motion parameters obtained from the comparison output including the transformed block matching unit, and determines the corresponding points by specifying coordinates, outputting a predicted partial image.

2. The image coding device according to claim 1, wherein the modified block matching unit, when performing a predetermined format of modification on a partial region of the reference image, performs the modification using integer pixels and half pixels that are midpoints of the integer pixels.

3. The image coding device according to claim 1 or 2, further comprising a preprocessing unit that separates an input image into subregions of image objects as target regions for coding, and divides each of the separated image objects into blocks for motion estimation and motion compensation.

4. The image coding device according to claim 1 or 2, wherein the modified block matching unit and the motion compensation unit are configured as follows: when specifying integer pixel or half pixel coordinates, the modified block matching unit specifies the coordinates of adjacent points or adjacent points that are a predetermined integer multiple of the coordinates, extracts them, and compares them; and the motion compensation unit similarly processes and outputs a reference image.

5. The image coding device according to claim 1 or 2, wherein the modified block matching unit and the motion compensation unit are configured as a modified block matching unit that extracts and compares integer pixels or half pixels by specifying coordinates rotated in a predetermined angle direction, and a motion compensation unit that similarly processes and outputs a reference image.

6. The image encoding device according to claim 5, wherein the rotation in the predetermined angular direction is positive or negative 45 degrees, 90 degrees, 135 degrees, or 180 degrees.

7. The image coding device according to claim 1, wherein the modified block matching unit and the motion compensation unit are a modified block matching unit that searches for an area indicated by a partial area of the reference image after translation, and moves and compares the search area by enlarging or reducing it or by rotating it in a predetermined direction, and a motion compensation unit that similarly processes and outputs the reference image.

8. The image coding device according to claim 1 or 2, wherein the modified block matching unit includes a transformation pattern table for transforming and comparing a partial region of a reference image, and the modified block matching unit compares an image of the partial region based on transformation values extracted from the transformation pattern table with integer pixels or half pixels of the block of the input image, and the motion compensation unit similarly processes and outputs the reference image.

9. The image coding device according to claim 1 or 2, wherein the modified block matching unit selectively filters and compares specific pixels of the reference image extracted for correspondence evaluation.

10. The image coding device according to claim 1 or 2, characterized in that the frame used for motion detection is a temporally preceding or succeeding frame, and the reference image is a stored temporally preceding or succeeding frame that is compared with the input image.

11. An image decoding device that decompresses and reproduces image compression codes of input information and is equipped with a motion compensation prediction means that detects motion between frames, wherein the motion compensation prediction means includes a mechanism for manually extracting pre-prepared integer pixels of corresponding partial regions based on motion parameters in the input information by specifying coordinates in a predetermined format, and the image signals of the partial regions that have been transformed into the predetermined format are output and added.

12. The image decoding device according to claim 11, wherein the motion compensation prediction means also has a mechanism for specifying and extracting coordinates for half pixels, and performs processing corresponding to the enlargement, reduction, or rotation of the modified block matching of the motion estimation means of the corresponding image encoding device.

13. An image coding method comprising: a motion compensation prediction means for storing a reference image for compression coding of an input digital image, dividing the reference image into predetermined blocks, and detecting motion between frames; a modified block matching step for transforming integer pixels of a partial region of the reference image into a predetermined format, extracting the transformed pixels by specifying coordinates, generating a predicted partial image, and comparing it with the block of the input image; and a corresponding point determination step for determining corresponding points of the partial region by specifying coordinates from a motion vector that provides the smallest error, selected using the modified block matching, to produce a motion-compensated output.

14. The image coding method according to claim 13, wherein the modified block matching step extracts a reference block by specifying coordinates, transforms the reference image into a predetermined format by adding half pixels at its midpoint in addition to the integer pixels of the partial region of the reference image, extracts the reference image by specifying coordinates, and generates a predicted partial image for comparison.

15. The image coding method according to claim 13 or 14, further comprising a transformation pattern table for transforming a partial region of a reference image, and a transformation block matching step for comparing an image of a partial region based on transformation values read from corresponding addresses by referencing the transformation pattern table with the input image during the transformation block matching.

16. An image decoding method comprising: a motion compensation prediction means for storing a reference image and dividing it into predetermined blocks to perform inter-frame motion compensation in order to decompress and reproduce an input image compression code; a motion compensation image generation step for transforming pre-prepared integer pixels of a partial region of the reference image corresponding to a partial region based on reference parameters of the input image code into a predetermined format corresponding to the image encoding method of the transmitting side, extracting the pixels by specifying coordinates, and generating a predicted partial image; and a decoding and addition step for adding the predicted partial images to obtain a reproduced image.

17. The image decoding method according to claim 16, wherein the motion-compensated image generating step is a step of transforming a partial region of the reference image into a predetermined format by adding a half pixel at its midpoint in addition to integer pixels of the partial region to generate a predicted image, specifying and extracting the coordinates, and generating a predicted partial image.

18. An image encoding/decoding system comprising: an image encoding device for dividing an input image into predetermined blocks and compressing and encoding the input image; a motion estimation unit including a modified block matching unit that transforms only integer pixels, which are actual sample points present in corresponding partial regions of a reference image for motion estimation, into a predetermined format, extracts them by specifying their coordinates, and compares them with the integer pixels of the blocks of the input image, outputting a motion vector that minimizes the error extracted by specifying the coordinates; a motion compensation unit including a corresponding point determination unit that transforms corresponding blocks of the reference image in accordance with motion parameters obtained from the comparison output including the modified block matching unit, and determines the corresponding coordinates, outputting a predicted partial image; and an image decoding device including motion compensation prediction means for detecting motion between frames, for decompressing and reproducing image compression codes of input information, wherein the motion compensation prediction means includes a mechanism for extracting pre-prepared integer pixels of corresponding partial regions by specifying their coordinates in a predetermined format based on the motion parameters in the input information, and outputs and adds the image signals of the partial regions processed in the predetermined format.