JP2006311589A

JP2006311589A - Moving image decoding method and apparatus

Info

Publication number: JP2006311589A
Application number: JP2006155050A
Authority: JP
Inventors: Shinichiro Koto; 晋一郎古藤; Takeshi Nakajo; 健中條; Yoshihiro Kikuchi; 義浩菊池
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2006-06-02
Filing date: 2006-06-02
Publication date: 2006-11-09

Abstract

<P>PROBLEM TO BE SOLVED: To perform fast forwarding reproduction with high encoding efficiency and a higher degree of freedom at a decoding side when decoding a moving image using motion compensated prediction inter-frame encoding. <P>SOLUTION: Identification information indicating a group of a layer to which an encoding target frame included in encoded data is distributed, and a reference frame used for motion compensated prediction inter-frame encoding and side information including information indicating a picture type of the encoding target frame are decoded (S21), a reference frame belonging to a group of layers lower than the layer to which the encoding target frame is distributed, is selected according to the decoded identification information (S22), and the selected reference frame is used to decode a result of motion compensated prediction inter-frame encoding included in the encoded data (S23). <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、動き補償予測フレーム間符号化を用いた動画像復号化方法及び装置に関する。 The present invention relates to a moving picture decoding method and apparatus using motion compensated prediction interframe coding.

動画像の圧縮符号化技術として、ＭＰＥＧ１(ISO/IEC11172-2)，ＭＰＥＧ２(ISO/IEC13818-2)，ＭＰＥＧ４(ISO/IEC14496-2)などが広く実用化されている。これらの動画像符号化方式では、フレーム内符号化（イントラ符号化）、前方予測フレーム間符号化及び両方向予測フレーム間符号化の組み合わせによる符号化が行われ、これらの符号化モードで符号化されるフレームはそれぞれＩピクチャ、Ｐピクチャ及びＢピクチャと呼ばれる。Ｐピクチャは直前のＰまたはＩピクチャを参照フレームとして符号化され、Ｂピクチャは直前及び直後のＰまたはＩピクチャを参照フレームとして符号化される。前方予測フレーム間符号化及び両方向予測フレーム間符号化は、動き補償予測フレーム間符号化と呼ばれる。 MPEG1 (ISO / IEC11172-2), MPEG2 (ISO / IEC13818-2), MPEG4 (ISO / IEC14496-2), and the like have been widely put into practical use as moving image compression coding techniques. In these moving image encoding systems, encoding is performed by a combination of intra-frame encoding (intra encoding), forward prediction inter-frame encoding, and bidirectional prediction inter-frame encoding, and encoding is performed in these encoding modes. Each frame is called an I picture, a P picture, and a B picture. The P picture is encoded using the immediately preceding P or I picture as a reference frame, and the B picture is encoded using the immediately preceding and immediately following P or I picture as a reference frame. Forward prediction interframe coding and bidirectional prediction interframe coding are called motion compensated prediction interframe coding.

ＭＰＥＧ方式による動画像の符号化データを早送り再生する場合は、参照フレームを必要としないＩピクチャのみを再生するか、あるいはＢピクチャが参照フレームとして用いられないという性質を利用し、Ｂピクチャを飛ばしてＩピクチャ及びＰピクチャのみを復号化する方法が一般的である。しかし、Ｉピクチャのみを再生する場合、Ｉピクチャの周期が長いと、高速な早送りは実現できるものの、滑らかな早送り再生が出来ない。Ｉピクチャ及びＰピクチャを用いた早送りでは、Ｐピクチャにフレーム間予測符号化が用いられているため、全てのＩピクチャ及びＰピクチャの復号化を行う必要があり、早送り速度を自由に変更することが困難となる。 When fast-forwarding reproduction of encoded data of a moving picture according to the MPEG system, only an I picture that does not require a reference frame is reproduced, or a B picture is skipped by utilizing the property that a B picture is not used as a reference frame. In general, a method of decoding only the I picture and the P picture is used. However, when reproducing only an I picture, if the period of the I picture is long, high-speed fast-forwarding can be realized, but smooth fast-forwarding reproduction cannot be performed. In fast-forward using I and P pictures, inter-frame predictive coding is used for P pictures, so it is necessary to decode all I and P pictures, and the fast-forward speed can be freely changed. It becomes difficult.

また、従来のＭＰＥＧ方式の動画像符号化では、Ｂピクチャは参照フレームとして用いられないため、複数のＢピクチャが連続する予測構造の場合、Ｂピクチャの符号化において時間的に離れたＰピクチャを参照フレームとせざるを得ず、Ｂピクチャの符号化効率が低下するという問題がある。一方、復号化されたＢピクチャをＰピクチャにおける参照フレームとして用いる構成とすると、上述のＢピクチャを飛ばした早送り再生時にも、Ｂピクチャを含む全てのフレームを復号化することが必要となり、効率的に早送り再生を行うことが困難となる。 In addition, in the conventional MPEG video coding, B pictures are not used as reference frames. Therefore, in the case of a prediction structure in which a plurality of B pictures are continuous, P pictures that are separated in time are encoded in the B picture coding. There is a problem that the encoding efficiency of the B picture decreases because it must be a reference frame. On the other hand, when the decoded B picture is used as a reference frame in the P picture, it is necessary to decode all the frames including the B picture even during the fast-forward playback with the B picture skipped. It becomes difficult to perform fast forward playback.

上述したように、ＭＰＥＧのように動き補償予測フレーム間符号化を含む符号化によって得られた動画像符号化データについて早送り再生を行う場合、Ｉピクチャのみを再生すると滑らかな早送りを自由な再生速度で行うことが難しく、またＢピクチャを復号化せずにスキップした早送り再生を行う場合は、復号化されたＢピクチャを参照フレームとして用いることが困難であり、Ｂピクチャが連続した予測構造では、符号化効率が低下するという問題があった。 As described above, when fast-forward playback is performed on moving image encoded data obtained by encoding including motion-compensated prediction inter-frame encoding as in MPEG, smooth playback can be freely performed when only I pictures are played back. In the case of performing fast forward playback skipped without decoding the B picture, it is difficult to use the decoded B picture as a reference frame. In a prediction structure in which the B picture is continuous, There was a problem that the encoding efficiency was lowered.

本発明の目的は、動き補償予測フレーム間符号化を用いた動画像の符号化及び復号化において、符号化効率が高く、また復号化側でより自由度の高い早送り再生を可能とすることにある。 An object of the present invention is to enable high-speed playback with high coding efficiency and higher degree of freedom on the decoding side in coding and decoding of moving images using motion compensated prediction interframe coding. is there.

上記の課題を解決するため、本発明は動画像の符号化対象フレームに対して、少なくとも一つの復号化済みフレームを参照フレームとして用いる動き補償予測フレーム間符号化を含む符号化処理を行う動画像符号化において、フレーム間予測構造を複数の階層化された予測グループ構造とし、符号化対象フレームを複数の参照フレームがそれぞれ属する複数階層の予測グループのうちの何れかの階層の予測グループに振り分け、符号化対象フレームが振り分けられた階層以下の少なくとも一つの階層の予測グループに属する参照フレームを用いて動き補償予測フレーム間符号化を行う。 In order to solve the above-described problem, the present invention provides a moving image in which a coding process including a motion compensated prediction inter-frame coding using at least one decoded frame as a reference frame is performed on a coding target frame of a moving image. In encoding, the inter-frame prediction structure is set to a plurality of hierarchical prediction group structures, and the encoding target frame is allocated to a prediction group of any one of a plurality of prediction groups to which a plurality of reference frames belong, Motion compensation prediction interframe coding is performed using reference frames belonging to a prediction group of at least one layer below the layer to which the encoding target frame is distributed.

また、符号化対象フレームが振り分けられた階層の予測グループを示す第１の識別情報及び動き補償予測フレーム間符号化に使用された参照フレームを示す第２の識別情報を、該符号化対象フレームに対する動き補償予測フレーム間符号化の結果と共に符号化データとして出力する。 Also, the first identification information indicating the prediction group of the layer to which the encoding target frame is distributed and the second identification information indicating the reference frame used for the motion compensated prediction interframe encoding are set for the encoding target frame. The result is output as encoded data together with the result of the motion compensation prediction interframe encoding.

一方、動画像の符号化対象フレームに対して動き補償予測フレーム間符号化を含む符号化処理を行って得られた符号化データを復号化して動画像を再生する動画像復号化においては、符号化データに含まれる符号化対象フレームが振り分けられた階層の予測グループを示す第１の識別情報及び動き補償予測フレーム間符号化に使用された参照フレームを示す第２の識別情報を復号化し、復号化された第１の識別情報及び第２の識別情報に従って、符号化対象フレームが振り分けられた階層の予測グループ又はそれより下位の階層の予測グループに属する少なくとも一つの参照フレームを選択し、選択した参照フレームを用いて符号化データに含まれる動き補償予測フレーム間符号化の結果を復号化する。 On the other hand, in the moving picture decoding in which the encoded data obtained by performing the encoding process including the motion compensated prediction inter-frame encoding on the encoding target frame of the moving picture is decoded to reproduce the moving picture, Decoding the first identification information indicating the prediction group of the layer to which the encoding target frame included in the encoded data is distributed and the second identification information indicating the reference frame used for the motion compensated prediction interframe encoding In accordance with the first identification information and the second identification information, the at least one reference frame belonging to the prediction group of the layer to which the encoding target frame is distributed or the prediction group of the lower layer is selected and selected. The result of the motion compensation prediction interframe coding included in the coded data is decoded using the reference frame.

このように各階層の予測グループに振り分けられた符号化対象フレームは、該階層以下の予測グループに属する参照フレームを用いて動き補償予測フレーム間符号化が行われることにより、復号化に際しては上位階層の予測グループに属する符号化対象フレームの符号化結果を復号化することなしに、正常に復号化することが可能となる。従って、復号化する予測グループの最上位階層を変化させることで、復号化されるフレーム数を変化させることが可能となり、階層に応じて再生されるフレームレートを可変としたり、あるいは復号化したフレームの表示フレームレートを変更することで、速度可変の早送り再生を行うことなどが容易に実現できる。また、特定の階層以下の符号化データのみを選択して送出することで、伝送帯域に応じたビットレート可変のストリーミングを行うことも可能となる。 In this way, the encoding target frames allocated to the prediction groups of each layer are subjected to motion compensation prediction interframe encoding using reference frames belonging to the prediction groups below the layer, so that an upper layer is used for decoding. It is possible to normally decode without decoding the encoding result of the encoding target frame belonging to the prediction group. Therefore, it is possible to change the number of frames to be decoded by changing the highest layer of the prediction group to be decoded. The frame rate to be reproduced can be made variable according to the layer or the decoded frame can be changed. By changing the display frame rate, fast-forward playback with variable speed can be easily realized. In addition, by selecting and transmitting only encoded data below a specific layer, it is possible to perform streaming with variable bit rate according to the transmission band.

この場合、複数階層の予測グループにそれぞれ属する前記参照フレームの最大フレーム数を予め前記予測グループ毎に個別に定めておき、この最大参照フレーム数に従って、各階層の予測グループの参照フレームのメモリ管理を行うようにしてもよい。 In this case, the maximum number of reference frames belonging to each of the prediction groups of a plurality of hierarchies is determined separately for each prediction group in advance, and memory management of the reference frames of the prediction groups of each hierarchy is performed according to the maximum reference frame number. You may make it perform.

このようにすると、復号化に必要となる参照フレームの最大フレーム数を、復号化する最上位階層の予測グループ以下の予測グループにおける最大参照フレーム数の総和から一意に決定できる。従って、動画像復号化において限られたメモリ資源の中で、復号化可能な予測グループの最上位階層が一意に求められ、また上述したように復号化する予測グループの最上位階層を変化させて早送り再生や伝送ビットレートを変更する際に、復号化で用いる参照メモリの必要最小限の量を一意に決定することができ、復号化時の必要最小限のメモリ確保を容易に行うことか可能となる。 In this way, the maximum number of reference frames required for decoding can be uniquely determined from the sum of the maximum number of reference frames in the prediction group below the prediction group of the highest hierarchy to be decoded. Accordingly, the highest hierarchy of a predictable prediction group is uniquely determined in a limited memory resource in moving picture decoding, and the highest hierarchy of the prediction group to be decoded is changed as described above. When changing fast-forward playback or transmission bit rate, the minimum amount of reference memory used for decoding can be uniquely determined, and it is possible to easily secure the minimum memory required for decoding. It becomes.

さらに、複数階層の予測グループにそれぞれ属する参照フレームの最大フレーム数の総和を一定とし、該最大フレーム数を示す情報を第１の識別情報と第２の識別情報及び動き補償予測フレーム間符号化の結果と共に符号化データとして出力し、各階層の予測グループの参照フレームのメモリ管理を行うようにしてもよい。 Furthermore, the total sum of the maximum number of reference frames belonging to each of the prediction layers of a plurality of hierarchies is made constant, and information indicating the maximum number of frames is used for the first identification information, the second identification information, and the motion compensated prediction interframe coding. It may be output as encoded data together with the result, and memory management of reference frames of prediction groups in each layer may be performed.

このようにすると、総容量が固定の参照メモリを各階層の予測グループに動的に配分させることが可能となるため、動画像信号の性質の変化に応じて各階層の予測グループの参照メモリ数の比が最適になるように動的にフレームメモリを配分することで、符号化効率を向上させることが可能となる。また、総参照フレーム数が一定であるため、符号化及び復号化に必要な参照フレーム数は常に固定量を確保すればよく、メモリ管理を容易にすることができる。さらに、各階層の予測グループに属する参照フレーム数を示す情報をヘッダ情報として符号化することで、符号化側と復号化側で各階層の予測グループの参照フレーム数を一致させることが可能となり、各階層の予測グループの動的な参照フレーム数の変化が発生しても、破綻無く復号化することが可能となる。 In this way, the reference memory having a fixed total capacity can be dynamically allocated to the prediction groups in each hierarchy, so the number of reference memories in the prediction group in each hierarchy according to the change in the nature of the video signal By dynamically allocating the frame memory so that the ratio is optimal, it is possible to improve the coding efficiency. In addition, since the total number of reference frames is constant, the number of reference frames necessary for encoding and decoding only needs to be secured at all times, and memory management can be facilitated. Furthermore, by encoding information indicating the number of reference frames belonging to the prediction group of each layer as header information, it is possible to match the number of reference frames of the prediction group of each layer on the encoding side and the decoding side, Even if a dynamic change in the number of reference frames in the prediction group of each layer occurs, decoding can be performed without failure.

さらに、符号化処理として符号化対象フレームのフレーム毎にフレーム内符号化、前方予測フレーム間符号化及び両方向予測フレーム間符号化に切り替えて行い、フレーム内符号化及び前方予測フレーム間符号化を行う符号化対象フレーム及び該符号化対象フレームに対応する参照フレームを第１階層の予測グループに振り分け、両方向予測フレーム間符号化を行う符号化対象フレーム及び該符号化対象フレームに対応する参照フレームを該第１階層よりも上位の第２階層の予測グループに振り分けるようにしてもよい。 Further, as the encoding process, intra-frame encoding, forward prediction inter-frame encoding, and bidirectional prediction inter-frame encoding are switched for each frame of the encoding target frame, and intra-frame encoding and forward prediction inter-frame encoding are performed. An encoding target frame and a reference frame corresponding to the encoding target frame are allocated to a prediction group of the first layer, and an encoding target frame for bi-directional prediction interframe encoding and a reference frame corresponding to the encoding target frame are You may make it distribute to the prediction group of the 2nd hierarchy higher than the 1st hierarchy.

このようにすると、従来のＭＰＥＧ等の符号化と同様に、Ｉピクチャのみ、あるいはＩピクチャ及びＰピクチャのみを復号化する早送り再生を、Ｂピクチャを復号化すること無しに行うことが可能となる。さらに、Ｂピクチャにおいては復号化されたＩピクチャまたはＰピクチャに加えて、さらに１つまたは複数の復号化されたＢピクチャも参照フレームとして用いることが可能となり、Ｂピクチャの予測効率を従来よりも改善することが可能となる。 In this way, as with conventional encoding such as MPEG, it is possible to perform fast-forward playback for decoding only I pictures or only I pictures and P pictures without decoding B pictures. . Further, in addition to the decoded I picture or P picture, in addition to the decoded I picture or P picture, one or more decoded B pictures can be used as a reference frame. It becomes possible to improve.

さらに、本発明によると上述した動画像符号化及び動画像復号化の処理をコンピュータに行わせるためのプログラムを提供することができる。 Furthermore, according to the present invention, it is possible to provide a program for causing a computer to perform the above-described video encoding and video decoding processes.

本発明によればフレーム間予測構造を複数の階層化されたグループ構造とし、上位階層の予測グループの参照フレームからのフレーム間予測を禁止し、また総参照フレーム数が一定の下で各階層の予測グループの参照フレーム数を動的に変化させることで、符号化効率を従来より向上させ、かつより自由度の高い早送り再生を可能とすることができる。 According to the present invention, the inter-frame prediction structure is formed into a plurality of hierarchized group structures, inter-frame prediction from reference frames of a higher-level prediction group is prohibited, and each layer has a fixed total number of reference frames. By dynamically changing the number of reference frames in the prediction group, it is possible to improve the encoding efficiency as compared with the prior art and enable fast-forward playback with a higher degree of freedom.

以下、図面を参照して本発明の一実施形態について説明する。
（符号化側について）
図１は、本実施形態に係る動画像符号化装置の構成を示すブロック図である。図２は、動き補償予測フレーム間符号化に関する概略的な手順を示すフローチャートである。図１に示す動画像符号化装置は、ハードウェアで実現してもよいし、コンピュータを用いてソフトウェアにより実行してもよい。一部の処理をハードウェアで実現し、他の処理をソフトウェアにより行ってもよい。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
(About encoding side)
FIG. 1 is a block diagram showing a configuration of a video encoding apparatus according to the present embodiment. FIG. 2 is a flowchart showing a schematic procedure for motion compensated prediction interframe coding. The moving picture encoding apparatus shown in FIG. 1 may be realized by hardware, or may be executed by software using a computer. Some processing may be realized by hardware, and other processing may be performed by software.

本実施形態は、従来のＭＰＥＧ方式に代表されるような、動き補償予測と直交変換及び可変長符号化を組み合わせた動画像符号化をベースとしている。以下の説明では、予測グループが２階層の場合について説明する。 The present embodiment is based on video coding that combines motion compensated prediction, orthogonal transform, and variable length coding, as represented by the conventional MPEG system. In the following description, a case where the prediction group has two layers will be described.

フレーム毎に入力される動画像信号１００（符号化対象フレーム）は、まず動き補償予測器１１１によって２階層の予測グループのうちの何れかの階層の予測グループに振り分けられる（ステップＳ１１）。次に、符号化対象フレームが振り分けられた階層以下の少なくとも一つの階層の予測グループに属する少なくとも一つの参照フレームとして動き補償予測フレーム間符号化が行われる（ステップＳ１２）。 The moving image signal 100 (encoding target frame) input for each frame is first distributed to a prediction group of any one of the two layers of prediction groups by the motion compensation predictor 111 (step S11). Next, motion compensated prediction interframe coding is performed as at least one reference frame belonging to a prediction group of at least one layer below the layer to which the encoding target frame is distributed (step S12).

各階層の予測グループに対する符号化対象フレームの振り分けは、例えば偶数フレームは第１階層の予測グループ、奇数フレームは第２階層の予測グループといったように、時間方向で変化するように行われる。各階層の予測グループに属する参照フレームも、参照フレームとなる符号化済みフレームの元となった符号化対象フレームが属する予測グループに応じて決定される。すなわち、ある符号化対象フレームがある階層の予測グループに振り分けられるとすると、その符号化対象フレームを符号化し局部復号化して得られた符号化済みフレームも、同じ階層の予測グループに属する。ステップＳ１１〜Ｓ１２の処理は、具体的には次のようにして行われる。 The allocation of the encoding target frame to the prediction group of each layer is performed so as to change in the time direction, for example, an even frame is a first layer prediction group and an odd frame is a second layer prediction group. The reference frames that belong to the prediction group of each layer are also determined according to the prediction group to which the encoding target frame that is the source of the encoded frame that becomes the reference frame belongs. That is, if a certain encoding target frame is allocated to a prediction group in a certain hierarchy, an already encoded frame obtained by encoding and local decoding the encoding target frame also belongs to a prediction group in the same hierarchy. Specifically, the processes of steps S11 to S12 are performed as follows.

上述のように第１階層及び第２階層の予測グループには、予め複数の符号化済みフレームが参照フレームとして属している。符号化済みフレームを参照フレームとして一時保存するために、二組の参照メモリセット１１８，１１９が用意されている。第１の参照メモリセット１１８には、既に符号化され且つ復号化された複数の動画像フレーム（これらを符号化済みフレームという）のうち、第１階層の予測グループに属する複数の符号化済みフレームが参照フレームとして一時保存されている。第２の参照メモリセット１１９には、複数の符号化済みフレームのうち、第２階層の予測グループに属する複数の符号化済みフレームが参照フレームとして一時保存されている。 As described above, a plurality of encoded frames belong to the first layer and second layer prediction groups in advance as reference frames. In order to temporarily store the encoded frame as a reference frame, two sets of reference memory sets 118 and 119 are prepared. The first reference memory set 118 stores a plurality of encoded frames belonging to the prediction group of the first hierarchy among a plurality of already encoded and decoded moving image frames (these are referred to as encoded frames). Is temporarily stored as a reference frame. In the second reference memory set 119, among the plurality of encoded frames, a plurality of encoded frames belonging to the second layer prediction group are temporarily stored as reference frames.

第１階層の予測グループに振り分けられた符号化対象フレームは、第１の参照メモリセット１１８に保存されている、第１階層の予測グループに属する参照フレームを用いて動き補償予測フレーム間符号化が行われる。一方、第２階層の予測グループに振り分けられた符号化対象フレームは、第１及び第２の参照メモリセット１１８，１１９に保存されている、第１及び第２階層の予測グループ両方に属する参照フレームを用いて動き補償予測フレーム間符号化が行われる。 The frame to be encoded distributed to the prediction group of the first layer is subjected to motion compensation prediction interframe encoding using the reference frame belonging to the prediction group of the first layer, which is stored in the first reference memory set 118. Done. On the other hand, the encoding target frame allocated to the second layer prediction group is a reference frame that is stored in the first and second reference memory sets 118 and 119 and belongs to both the first and second layer prediction groups. Is used to perform motion-compensated prediction interframe coding.

動き補償予測フレーム符号化について具体的に説明すると、まず入力動画像信号１００である符号化対象フレームが第１階層の予測グループに属する場合、第１の参照メモリセット１１８に一時保存されている一つまたは複数の参照フレームが読み出され、動き補償予測器１１１に入力される。このときスイッチ１２０はオフ状態にあり、第１の参照メモリセット１１９からの参照フレームは動き補償予測器１１１に入力されない。動き補償予測器１１１では、第１の参照メモリセット１１８から読み出される一つまたは複数の参照フレームを用いて動き補償予測が行われることにより、予測画像信号１０４が生成される。予測画像信号１０４は減算器１１０に入力され、ここで入力動画像信号１００に対する予測画像信号１０４の誤差信号である予測誤差信号１０１が生成される。 The motion compensated prediction frame encoding will be described in detail. First, when the encoding target frame that is the input moving image signal 100 belongs to the prediction group of the first hierarchy, the first temporarily stored in the first reference memory set 118 is stored. One or more reference frames are read out and input to the motion compensated predictor 111. At this time, the switch 120 is in an OFF state, and the reference frame from the first reference memory set 119 is not input to the motion compensation predictor 111. The motion compensated predictor 111 generates a predicted image signal 104 by performing motion compensation prediction using one or a plurality of reference frames read from the first reference memory set 118. The predicted image signal 104 is input to the subtractor 110, where a predicted error signal 101 that is an error signal of the predicted image signal 104 with respect to the input moving image signal 100 is generated.

入力動画像信号１００である符号化対象フレームが第２階層の予測グループに属する場合、スイッチ１２０はオン状態にあり、第１の参照メモリセット１１８及び第２の参照メモリセット１１９に一時保存されている一つまたは複数の参照フレームが読み出され、動き補償予測器１１１に入力されることにより、上記と同様に動き補償予測器１１１により予測画像信号１０４が生成され、さらに減算器１１０によって予測誤差信号１０１が生成される。 When the encoding target frame that is the input moving image signal 100 belongs to the prediction group of the second hierarchy, the switch 120 is in the on state, and is temporarily stored in the first reference memory set 118 and the second reference memory set 119. One or a plurality of reference frames are read out and input to the motion compensated predictor 111, so that the motion compensated predictor 111 generates the predicted image signal 104 in the same manner as described above, and the subtractor 110 further generates a prediction error. A signal 101 is generated.

予測誤差信号１０１は、ＤＣＴ変換器１１２により離散コサイン変換され、これによって得られたＤＣＴ係数が量子化器１１３により量子化される。量子化されたＤＣＴ係数データ１０２は二分岐され、一方において可変長符号化器１１４により符号化される。量子化され二分岐されたＤＣＴ係数データ１０２は、他方において逆量子化器１１５及び逆ＤＣＴ変換器１１６を経て予測誤差信号として再生される。この再生された予測誤差信号が予測画像信号１０４と加算されることにより、局部復号化画像信号１０３が生成される。 The prediction error signal 101 is subjected to discrete cosine transform by the DCT transformer 112, and the DCT coefficient obtained thereby is quantized by the quantizer 113. The quantized DCT coefficient data 102 is bifurcated and is encoded by the variable length encoder 114 on one side. On the other hand, the quantized and bifurcated DCT coefficient data 102 is reproduced as a prediction error signal through an inverse quantizer 115 and an inverse DCT transformer 116. The reproduced prediction error signal is added to the predicted image signal 104, whereby a locally decoded image signal 103 is generated.

局部復号化画像信号１０３である符号化済みフレームは、該符号化済みフレームの元となった入力動画像信号１００である符号化対象フレームに振り分けられた階層の予測グループに応じて第１及び第２の参照メモリセット１１８，１１９のいずれかに一時保存される（ステップＳ１３）。すなわち、符号化済みフレームは該符号化済みフレームの元となった符号化対象フレームが第１階層の予測グループに属する場合には、第１の参照メモリセット１１８に一時保存され、該符号化対象フレームが第２階層の予測グループに属する場合には、第２の参照メモリセット１１９に一時保存される。 The encoded frame that is the locally decoded image signal 103 is the first and the first according to the prediction group of the hierarchy allocated to the encoding target frame that is the input moving image signal 100 that is the source of the encoded frame. Temporarily stored in one of the two reference memory sets 118 and 119 (step S13). That is, the encoded frame is temporarily stored in the first reference memory set 118 when the encoding target frame that is the source of the encoded frame belongs to the first layer prediction group, and the encoding target If the frame belongs to the second layer prediction group, it is temporarily stored in the second reference memory set 119.

動き補償予測器１１１からは、動き補償予測のために用いた動きベクトルと、符号化対象フレームが所属する予測グループを識別するインデックス（第１の識別情報）、及び動き補償予測フレーム間符号化に使用した参照フレームを特定するインデックス（第２の識別情報）を含むいわゆるサイド情報１０５が出力され、可変長符号化器１１４によって符号化される（ステップＳ１４）。この場合、予測グループを識別するインデックスは、例えば予測構造を表すピクチャタイプとして符号化され、参照フレームを特定するインデックスはマクロブロック毎に符号化される。 From the motion compensation predictor 111, a motion vector used for motion compensation prediction, an index (first identification information) for identifying a prediction group to which an encoding target frame belongs, and motion compensation prediction interframe coding are used. So-called side information 105 including an index (second identification information) specifying the used reference frame is output and encoded by the variable length encoder 114 (step S14). In this case, an index for identifying a prediction group is encoded as, for example, a picture type representing a prediction structure, and an index for specifying a reference frame is encoded for each macroblock.

これらのサイド情報は、動き補償予測フレーム間符号化の結果である量子化されたＤＣＴ係数データと共に可変長符号の符号化データ１０６として出力される（ステップＳ１５）。例えば、サイド情報は符号化データ１０６にヘッダ情報として符号化される。さらに、各階層の予測グループに属する参照フレーム数の総和を予め規定して、各階層の予測グループに割り当てる参照フレームの最大フレーム数を設定する後述の第２の参照フレーム数設定方法を採用した場合には、該最大フレーム数を示す情報も符号化データ１０６のヘッダ情報として符号化される。符号化データ１０６は、図示しない蓄積媒体や伝送媒体に送出される。 The side information is output as encoded data 106 of a variable length code together with quantized DCT coefficient data that is a result of motion compensation prediction interframe coding (step S15). For example, the side information is encoded in the encoded data 106 as header information. Further, when a second reference frame number setting method (to be described later) that predefines the total number of reference frames belonging to the prediction group of each layer and sets the maximum number of reference frames to be allocated to the prediction group of each layer is adopted In addition, information indicating the maximum number of frames is also encoded as header information of the encoded data 106. The encoded data 106 is sent to a storage medium or transmission medium (not shown).

参照メモリセット１１８，１１９では、新しい復号化済みフレームが参照フレームとして順次書き込まれ、また時間的に最も古い参照フレームから順次削除される、いわゆるＦＩＦＯ(First-In First-Out)型の制御がフレーム単位で行われる。ただし、参照フレームが読み出されるときには、各参照メモリセット１１８，１１９内の任意の参照フレームへのランダムアクセスが行われる。 In the reference memory sets 118 and 119, a so-called FIFO (First-In First-Out) type control in which new decoded frames are sequentially written as reference frames and sequentially deleted from the oldest reference frame in time is used as a frame. Done in units. However, when a reference frame is read, random access to any reference frame in each reference memory set 118, 119 is performed.

参照メモリセット１１８，１１９にそれぞれ一時保存される参照フレームの数（言い換えれば、参照メモリセット１１８，１１９にそれぞれ含まれる参照メモリの数）は、次に示す２つの方法の何れかにより設定される。 The number of reference frames temporarily stored in the reference memory sets 118 and 119 (in other words, the number of reference memories included in the reference memory sets 118 and 119, respectively) is set by one of the following two methods. .

第１の参照フレーム数設定方法では、符号化の方法あるいはプロファイルやレベルといった符号化仕様に応じて、予め各階層の予測グループに属する参照フレームの最大フレーム数を個別に定める。動画像符号化装置及び動画像復号化装置においては、こうして予め定められた最大フレーム数の参照フレームを予測グループ毎に確保して符号化及び復号化を行う。この場合、符号化仕様を動画像符号化装置及び動画像復号化装置で一致させることにより、必要な数の参照フレームを自動的に確保することが可能である。 In the first reference frame number setting method, the maximum number of reference frames belonging to the prediction group of each layer is individually determined in advance according to the encoding method or encoding specifications such as profile and level. In the moving picture coding apparatus and the moving picture decoding apparatus, the reference frame having the maximum number of frames determined in this way is ensured for each prediction group to perform coding and decoding. In this case, it is possible to automatically secure the necessary number of reference frames by matching the encoding specifications between the video encoding device and the video decoding device.

第２の参照フレーム数設定方法では、符号化の方法あるいはプロファイルやレベルといった符号化仕様に応じて、各階層の予測グループに属する参照フレーム数の総和を予め規定し、各階層の予測グループにどれだけの参照フレーム数を割り当てるかという配分に関する情報、すなわち最大フレーム数を示す情報を符号化データ１０６のヘッダ情報として符号化する。 In the second reference frame number setting method, the total number of reference frames belonging to the prediction group of each layer is defined in advance according to the encoding method or encoding specifications such as profile and level, Information relating to the distribution of whether to allocate only the number of reference frames, that is, information indicating the maximum number of frames is encoded as header information of the encoded data 106.

このように第２の参照フレーム数設定方法では、符号化側で各階層の予測グループにそれぞれ最適な参照フレームの最大フレーム数を動的に割り当て、その割り当てた最大フレーム数を示す情報を符号化することで、符号化側と復号化側で各階層の予測グループに属する参照フレームの最大フレーム数を動的に一致させることが可能となる。従って、各階層の予測グループに属する参照フレームの最大フレーム数の割合を入力動画像信号１００の画像の性質の変化に応じて最適に変更することにより、符号化効率を向上させることが可能となる。 As described above, in the second reference frame number setting method, the encoding side dynamically assigns the optimum maximum number of reference frames to the prediction group of each layer, and encodes information indicating the assigned maximum number of frames. By doing so, it is possible to dynamically match the maximum number of reference frames belonging to the prediction group of each layer on the encoding side and the decoding side. Therefore, it is possible to improve the encoding efficiency by optimally changing the ratio of the maximum number of reference frames belonging to the prediction group of each layer according to the change in the image properties of the input moving image signal 100. .

（復号化側について）
図３は、本実施形態に係る図１に示した動画像符号化装置に対応する動画像復号化装置の構成を示すブロック図である。図４は、動き補償予測フレーム間符号化に対応する復号化に関する概略的な手順を示すフローチャートである。図３に示す動画像復号化装置も、ハードウェアで実現してもよいし、ソフトウェアにより実行してもよく、また一部の処理をハードウェアで実現し、他の処理をソフトウェアにより行ってもよい。 (About decryption side)
FIG. 3 is a block diagram showing a configuration of a moving picture decoding apparatus corresponding to the moving picture encoding apparatus shown in FIG. 1 according to the present embodiment. FIG. 4 is a flowchart showing a schematic procedure for decoding corresponding to motion compensated prediction interframe coding. The moving picture decoding apparatus shown in FIG. 3 may also be realized by hardware or may be executed by software, or some processes may be realized by hardware and other processes may be performed by software. Good.

図３に示す動画像復号化装置には、図１に示した動画像符号化装置から出力された符号化データ１０６が、図示しない蓄積媒体または伝送媒体を介して入力される。入力された符号化データ２００は、可変長復号化器２１４により可変長符号の復号化が行われ、量子化ＤＣＴ係数データ２０１とサイド情報２０２が出力される。量子化ＤＣＴ係数データ２０１は、逆量子化器２１５及び逆ＤＣＴ変換器２１６を経て復号されることにより、予測誤差信号が再生される。 The encoded data 106 output from the moving image encoding apparatus illustrated in FIG. 1 is input to the moving image decoding apparatus illustrated in FIG. 3 via a storage medium or a transmission medium (not illustrated). The input encoded data 200 is subjected to variable length code decoding by a variable length decoder 214, and quantized DCT coefficient data 201 and side information 202 are output. The quantized DCT coefficient data 201 is decoded through an inverse quantizer 215 and an inverse DCT transformer 216, thereby reproducing a prediction error signal.

一方、サイド情報２０２としてマクロブロック毎に符号化された動きベクトルと、符号化対象フレーム毎に所属する予測グループを識別するインデックス（第１の識別情報）、及び参照フレームを特定するインデックス（第２の識別情報）が復号化される（ステップ２１）。これらのサイド情報に従って、符号化時と同様に参照フレームの選択及び動き補償が行われることにより予測画像信号２０３が生成される。 On the other hand, a motion vector encoded for each macroblock as the side information 202, an index for identifying a prediction group belonging to each encoding target frame (first identification information), and an index for identifying a reference frame (second Are identified (step 21). According to these pieces of side information, the prediction image signal 203 is generated by selecting a reference frame and performing motion compensation in the same manner as in encoding.

すなわち、符号化対象フレーム毎に所属する予測グループを識別するインデックス（第１の識別情報）と参照フレームを特定するインデックス（第２の識別情報）に従って参照フレームが選択され（ステップＳ２２）、この選択された参照フレームを用いて動き補償予測フレーム間符号化の結果が復号化される（ステップＳ２３）。さらに、予測画像信号２０３と逆ＤＣＴ変換器２１６からの予測誤差信号が加算されることにより、復号化画像信号２０４が生成される
復号化画像信号２０４である復号化済みフレームは、該復号化済みフレームの元となった符号化済みフレームが所属する予測グループに応じて、第１及び第２の参照メモリセット２１９の何れか一方に一時保存され、参照フレームとして用いられる（ステップＳ２４）。これらの参照メモリセット２１８，２１９は、動画像符号化装置と同様にＦＩＦＯ型の制御が行われる。ここで、各階層の予測グループに属する参照フレームの数については、先の動画像符号化装置において説明した第１または第２の参照フレーム数設定方法に従って設定される。 That is, a reference frame is selected according to an index (first identification information) for identifying a prediction group belonging to each encoding target frame and an index (second identification information) for identifying a reference frame (step S22). The result of the motion compensation prediction interframe coding is decoded using the reference frame thus set (step S23). Further, the decoded image signal 204 is generated by adding the prediction image signal 203 and the prediction error signal from the inverse DCT converter 216. The decoded frame, which is the decoded image signal 204, is decoded. Depending on the prediction group to which the encoded frame that is the source of the frame belongs, it is temporarily stored in one of the first and second reference memory sets 219 and used as a reference frame (step S24). These reference memory sets 218 and 219 are subjected to FIFO type control in the same manner as the moving picture coding apparatus. Here, the number of reference frames belonging to the prediction group of each layer is set according to the first or second reference frame number setting method described in the previous video encoding apparatus.

すなわち、第１の参照フレーム数設定方法に従って符号化仕様に応じて予め各階層の予測グループに属する参照フレームの最大フレーム数を個別に定めている場合には、各階層の予測グループに属する参照フレームの数は符号化仕様毎に固定の値とされる。また、第２の参照フレーム数設定方法に従って符号化仕様に応じて各階層の予測グループに属する参照フレーム数の総和を予め規定し、各階層の予測グループに参照フレームの最大フレーム数を割り当てている場合には、参照フレームの総和のみ固定で、符号化データのヘッダ情報から復号化される参照フレームの最大数を示す情報に基づいて動的に各階層の予測グループに属する参照フレームの数が制御される。 That is, when the maximum number of reference frames belonging to the prediction group of each layer is determined in advance according to the encoding specification according to the first reference frame number setting method, the reference frame belonging to the prediction group of each layer Is a fixed value for each encoding specification. Further, according to the second reference frame number setting method, the total number of reference frames belonging to the prediction group of each layer is defined in advance according to the coding specification, and the maximum number of reference frames is assigned to the prediction group of each layer. In this case, only the sum of reference frames is fixed, and the number of reference frames belonging to the prediction group of each layer is dynamically controlled based on information indicating the maximum number of reference frames decoded from header information of encoded data. Is done.

図５は、図１に示した動画像符号化装置における動き補償予測器１１１、及び図３に示した動画像復号化装置における動き補償予測器２１１として用いられる動き補償予測器の詳細な構成を示したものである。前述した通り、符号化あるいは復号化すべきフレームが属する階層の予測グループに応じて、使用可能な参照フレームが異なる。図５におけるフレームメモリ３０２から３０４は、１つの階層の予測グループに属する符号化フレームに対する参照フレームとして使用可能な参照フレームを保存しいるものとする。 FIG. 5 shows a detailed configuration of the motion compensated predictor 111 used as the motion compensated predictor 111 in the moving picture encoding apparatus shown in FIG. 1 and the motion compensated predictor 211 in the moving picture decoding apparatus shown in FIG. It is shown. As described above, usable reference frames differ depending on the prediction group of the hierarchy to which the frame to be encoded or decoded belongs. Assume that frame memories 302 to 304 in FIG. 5 store reference frames that can be used as reference frames for encoded frames belonging to a prediction group of one layer.

この動き補償予測器では、マクロブロック毎に使用可能な参照フレームの中から１つを選択するか、あるいは線形予測器３０１により使用可能な参照フレームの線形和をとってその線形和による予測の何れかを選択し、動き補償を行って予測マクロブロックを生成する。 In this motion compensation predictor, either one of the reference frames that can be used for each macroblock is selected, or a linear sum of the reference frames that can be used by the linear predictor 301 is taken and prediction based on the linear sum is performed. Is selected and motion compensation is performed to generate a prediction macroblock.

動画像符号化装置では、予測誤差が小さく符号化効率の最も高い予測マクロブロックが選択されるように、参照フレーム及び動きベクトルがマクロブロック毎に選択される。選択された参照フレームの情報及び動きベクトルの情報は、マクロブロック毎に符号化される。 In the moving picture coding apparatus, the reference frame and the motion vector are selected for each macroblock so that the prediction macroblock with the smallest prediction error and the highest coding efficiency is selected. The selected reference frame information and motion vector information are encoded for each macroblock.

動画像復号化装置では、受信した動きベクトル及び参照フレームの情報に応じて動き補償器で予測マクロブロックを生成し、復号化を行う。線形和による予測を行う場合は、線形予測係数に関する情報を符号化データのヘッダ情報として符号化を行い、符号化と復号化で線形予測係数を一致させる。 In the moving picture decoding apparatus, a prediction macroblock is generated by a motion compensator in accordance with the received motion vector and reference frame information, and decoding is performed. In the case of performing prediction by linear sum, encoding is performed using information on the linear prediction coefficient as header information of the encoded data, and the linear prediction coefficient is matched between encoding and decoding.

図６及び図７は、従来のＭＰＥＧ動画像符号化におけるフレーム間予測構造及び参照メモリ制御の例を示す図である。横軸は時間を示しており、Ｉ０，Ｐ１，Ｐ２等は表示順のフレームを示している。例えば、Ｉ０はＩピクチャでフレーム番号が０番、Ｐ１はＰピクチャでフレーム番号が１番、Ｂ２はＢピクチャでフレーム番号が２番ということをそれぞれ示している。図中の矢印はフレーム間の予測構造を示しており、参照フレームから符号化対象フレームへの向きを示している。例えば、図６において符号化対象フレームＰ1 に対して、Ｉ０が参照フレームであることを示している。以下、図６及び図７のそれぞれの例について詳しく説明する。
まず、図６はＩピクチャ及びＰピクチャのみから構成される予測構造を示している。ＭＰＥＧ符号化では、Ｐピクチャは直前に符号化されたＩピクチャまたはＰピクチャのみを参照フレームとする。図中ＦＭ１及びＦＭ２は、復号化における参照メモリ（フレームメモリ）の使い回しを示している。ここで、各フレームは１フレーム期間かけて復号化するものとする。 6 and 7 are diagrams showing an example of an inter-frame prediction structure and reference memory control in conventional MPEG moving image coding. The horizontal axis indicates time, and I0, P1, P2, etc. indicate frames in display order. For example, I0 indicates an I picture and the frame number is 0, P1 indicates a P picture and the frame number is 1, and B2 indicates a B picture and the frame number is 2, respectively. The arrows in the figure indicate the prediction structure between frames, and indicate the direction from the reference frame to the encoding target frame. For example, FIG. 6 shows that I0 is a reference frame for the encoding target frame P1. Hereinafter, each example of FIGS. 6 and 7 will be described in detail.
First, FIG. 6 shows a prediction structure composed of only an I picture and a P picture. In MPEG coding, only the I picture or P picture coded immediately before is used as a reference frame for the P picture. FM1 and FM2 in the figure indicate how the reference memory (frame memory) is used in decoding. Here, each frame is decoded over one frame period.

Ｉ０の復号化は、図中Ｉ０フレームからＰ１フレームまでの間の期間に行われるものとし、復号化画像信号は参照メモリＦＭ１に順次書き込まれ、次にＰ１の復号化が終了するまで参照メモリＦＭ１に保存される。Ｐ１の復号化は、参照メモリＦＭ１に保存されたＩ０フレームを参照フレームとして行われ、復号化されたＰ１フレームは順次参照フレームＦＭ２に書き込まれ、次にＰ２の復号化が終了するまでＦＭ２に保存される。Ｐ２の復号化は、参照メモリＦＭ２に保存されたＰ１フレームを参照フレームとして行われ、復号化されたＰ２フレームは参照メモリＦＭ１に既に保存されたＩ０フレームを上書きしながら書き込まれる。以上のように、Ｉピクチャ及びＰピクチャから構成される符号化データの復号化は２フレーム分の参照メモリを用いて行われる。 It is assumed that the decoding of I0 is performed in the period from the I0 frame to the P1 frame in the figure, and the decoded image signal is sequentially written in the reference memory FM1, and then the reference memory FM1 until the decoding of P1 is completed. Saved in. Decoding of P1 is performed using the I0 frame stored in the reference memory FM1 as a reference frame, and the decoded P1 frame is sequentially written in the reference frame FM2, and then stored in FM2 until the decoding of P2 is completed. Is done. P2 is decoded using the P1 frame stored in the reference memory FM2 as a reference frame, and the decoded P2 frame is written while overwriting the I0 frame already stored in the reference memory FM1. As described above, decoding of encoded data composed of an I picture and a P picture is performed using a reference memory for two frames.

図７は、従来のＭＰＥＧ動画像符号化でＢピクチャを含む場合の例である。Ｂピクチャは、後方フレームからの予測も用いられるため、表示順と異なる順序にフレームの並べ替えが行われ、符号化及び復号化が行われる。図７の上段は表示順のフレーム及びフレーム間予測構造を示し、下段は符号化及び復号化におけるフレーム順を示している。ＦＭ１及びＦＭ２は復号化における参照メモリの使い回しを示している。 FIG. 7 is an example in the case of including a B picture in the conventional MPEG moving image coding. Since the B picture is also used for prediction from the rear frame, the frames are rearranged in an order different from the display order, and encoded and decoded. The upper part of FIG. 7 shows the frames in the display order and the inter-frame prediction structure, and the lower part shows the frame order in encoding and decoding. FM1 and FM2 indicate the use of the reference memory for decoding.

図７に示すフレーム間予測構造を持つ符号化データの復号化時には、まずＩ０フレームが復号化され、参照メモリＦ１に書き込まれる。続いて、参照メモリＦＭ１に保存された復号化されたＩ０フレームを参照フレームとしてＰ３フレームの復号化が行われ、復号化されたＰ３フレームは参照メモリＦＭ２に書き込まれる。次に、参照メモリＦＭ１に保存された復号化されたＩ０フレームを前方予測の参照フレームとし、また参照メモリＦＭ２に保存された復号化されたＰ３フレームを後方予測の参照フレームとして、ＢピクチャＢ１及びＢ２の復号化が順次行われる。次に、Ｐ６フレームが参照メモリＦＭ２に保存されたＰ３フレームを参照フレームとして復号化され、復号化されたＰ６フレームは参照メモリＦＭ１に既に保存されたＩ０フレームを上書きしながら書き込まれる。Ｂピクチャは、参照フレームとしては用いられないため、Ｂピクチャの復号化画像は参照メモリには保存されず、順次出力して表示される。 When decoding the encoded data having the inter-frame prediction structure shown in FIG. 7, the I0 frame is first decoded and written to the reference memory F1. Subsequently, the P3 frame is decoded using the decoded I0 frame stored in the reference memory FM1 as a reference frame, and the decoded P3 frame is written in the reference memory FM2. Next, the decoded I0 frame stored in the reference memory FM1 is used as a reference frame for forward prediction, and the decoded P3 frame stored in the reference memory FM2 is used as a reference frame for backward prediction. Decoding of B2 is sequentially performed. Next, the P6 frame is decoded using the P3 frame stored in the reference memory FM2 as a reference frame, and the decoded P6 frame is written while overwriting the I0 frame already stored in the reference memory FM1. Since the B picture is not used as a reference frame, the decoded picture of the B picture is not stored in the reference memory, but is sequentially output and displayed.

Ｉピクチャ及びＰピクチャの復号化時は、参照メモリに保存されている１つ前のＩピクチャまたはＰピクチャが出力され表示される。例えば、Ｐ３のデコード時には参照メモリＦＭ１に保存されたＩ０が表示され、Ｐ６のデコード時には参照メモリＦＭ２に保存されたＰ３が表示される。このようにＩピクチャ及びＰピクチャの表示を１周期遅らせることで、復号化順が正しい表示順に並べ替えられる。以上のように、Ｂピクチャを含む場合においても、２フレームの参照フレームで復号化が行われる。 When decoding an I picture and a P picture, the previous I picture or P picture stored in the reference memory is output and displayed. For example, I0 stored in the reference memory FM1 is displayed when P3 is decoded, and P3 stored in the reference memory FM2 is displayed when P6 is decoded. Thus, by delaying the display of the I picture and P picture by one cycle, the decoding order is rearranged in the correct display order. As described above, even when a B picture is included, decoding is performed with two reference frames.

図８から図１３は、本実施形態におけるフレーム間予測構造及び参照メモリ制御の例を示す図である。以下、それぞれの例について説明する。図８は、図６と同様にＩピクチャ及びＰピクチャから構成されるが、各フレームを予測グループａと予測グループｂとの交互に切り替える例であり、予測グループｂは予測グループａの上位階層とする。また、予測グループａ及び予測グループｂの参照メモリ数はそれぞれ１フレームであるとする。 8 to 13 are diagrams illustrating examples of the inter-frame prediction structure and the reference memory control in the present embodiment. Each example will be described below. FIG. 8 is composed of an I picture and a P picture as in FIG. 6, but is an example in which each frame is alternately switched between the prediction group a and the prediction group b. The prediction group b is an upper layer of the prediction group a. To do. Further, it is assumed that the number of reference memories of the prediction group a and the prediction group b is 1 frame.

図中Ｉa0，Ｐa2，Ｐa4で示されるように、サフィックスａの付加されたピクチャは予測グループａであることを示し、また、Ｐb1，Ｐb3，Ｐb5のようにサフィックスｂの付加されたピクチャは予測グループｂであることを示している。これらの予測グループの属性は、ピクチャタイプの拡張、または独立したインデックスとして符号化対象フレームのヘッダ情報として符号化される。予測グループａに属する符号化対象フレームは、予測フレームａに属する既に復号化されたフレームのみを参照フレームとして用いることが可能であり、また、上位階層の予測フレームｂでは、既に復号化された予測グループａ及び予測グループｂの何れかに属する１フレームまたは両方の復号化フレームの線形和を用いて予測画像を生成することが可能である。 As shown by Ia0, Pa2, and Pa4 in the figure, the picture to which the suffix a is added indicates the prediction group a, and the pictures to which the suffix b is added such as Pb1, Pb3, and Pb5 are prediction groups. b. The attributes of these prediction groups are encoded as header information of the encoding target frame as an extension of the picture type or as an independent index. For the encoding target frame belonging to the prediction group a, it is possible to use only the already decoded frame belonging to the prediction frame a as a reference frame, and in the upper layer prediction frame b, the already decoded prediction is used. It is possible to generate a predicted image using a linear sum of one frame or both decoded frames belonging to either group a or prediction group b.

各階層の予測グループともに１フレームの参照メモリを持っているため、予測グループａの符号化対象フレームの参照フレーム数は最大１フレームとなり、予測グループｂの符号化対象フレームの参照フレーム数は最大２フレームが使用可能である。例えば、予測グループａに属するＰa2フレームは、復号化されたＩa0フレームのみを参照フレームとして用いるが、予測グループｂに属するＰb3は、予測グループａに属する復号化されたＰa2フレームと、予測グループｂに属する復号化されたＰb1フレームの２フレームを参照フレームとして用いる。 Since each prediction group in each layer has one frame of reference memory, the reference frame number of the encoding target frame of the prediction group a is 1 frame at maximum, and the reference frame number of the encoding target frame of the prediction group b is 2 at maximum. The frame is usable. For example, the Pa2 frame belonging to the prediction group a uses only the decoded Ia0 frame as the reference frame, but the Pb3 belonging to the prediction group b is assigned to the decoded Pa2 frame belonging to the prediction group a and the prediction group b. Two of the decoded Pb1 frames to which it belongs belong to the reference frame.

図８において、ＦＭ１，ＦＭ２及びＦＭ３は図６あるいは図７と同様の物理的な参照メモリの使い回しを示している。また、ＤＥＣ，ＲＥＦa及びＲＥＦbはそれぞれ論理的な参照メモリの使い回しを示している。換言すると、ＤＥＣ，ＲＥＦa及びＲＥＦbは仮想アドレスで表現したフレームメモリであり、ＦＭ１，ＦＭ２及びＦＭ３は同フレームメモリを物理アドレスで表現したものである。仮想アドレス表現では、ＤＥＣは現在復号化中のフレームを一時保存するためのフレームメモリであり、またＲＥＦa及びＲＥＦbはそれぞれ予測グループａ及び予測グループｂの参照メモリを示している。従って、ＲＥＦaには復号化された予測グループａに属するフレームが順次一時保存され、ＲＥＢbには復号化された予測グループｂに属するフレームが順次一時保存される。 In FIG. 8, FM1, FM2, and FM3 indicate the use of physical reference memory as in FIG. 6 or FIG. In addition, DEC, REFa, and REFb indicate the usage of the logical reference memory. In other words, DEC, REFa, and REFb are frame memories expressed by virtual addresses, and FM1, FM2, and FM3 are the frame memories expressed by physical addresses. In the virtual address expression, DEC is a frame memory for temporarily storing a frame currently being decoded, and REFa and REFb indicate reference memories of the prediction group a and the prediction group b, respectively. Therefore, the frames belonging to the decoded prediction group a are temporarily stored in REFa, and the frames belonging to the decoded prediction group b are temporarily stored in REBb.

図８の例では、例えば上位階層の予測グループｂに属する符号化対象フレームを破棄して、予測グループａに属するフレームのみを復号化することが可能である。この場合、必要な参照メモリ数は、現在復号化中のフレームを一時保存するためのフレームメモリＤＥＣと、予測グループａの参照メモリＲＥＦaの２フレーム分があれば復号化が可能である。 In the example of FIG. 8, for example, it is possible to discard only the frames belonging to the prediction group a by discarding the encoding target frame belonging to the upper layer prediction group b. In this case, the required number of reference memories can be decoded if there are two frames of the frame memory DEC for temporarily storing the currently decoded frame and the reference memory REFa of the prediction group a.

予測グループａに属するフレームのみを復号化することで、フレーム周期を半分にした復号化を予測構造に破綻をきたすことなく行うことができる。例えば、予測グループａに属する復号化フレームを２倍のフレームレートで再生することで、滑らかな早送り再生を行うことが可能である。また、映像ストリーミング等において伝送路の帯域幅が時間変動する場合に、通常は全ての符号化データを送出し、伝送路の有効帯域幅が低下した場合は、予測グループｂに属する符号化データを破棄して、下位階層の予測グループａ属する符号化データのみを送出しても、受信側では破綻無く再生することが可能である。 By decoding only the frames belonging to the prediction group a, decoding with half the frame period can be performed without causing a failure in the prediction structure. For example, it is possible to perform smooth fast-forward playback by playing back decoded frames belonging to the prediction group a at twice the frame rate. In addition, when the bandwidth of the transmission path fluctuates with time in video streaming or the like, normally all encoded data is sent out, and when the effective bandwidth of the transmission path decreases, the encoded data belonging to the prediction group b is Even if it is discarded and only the encoded data belonging to the prediction group a in the lower layer is transmitted, the reception side can reproduce it without failure.

図９は、図８の例を変形した例であり、予測グループａに属するフレームの間に予測グループｂに属するフレームが２フレーム挿入された予測構造であり、また各階層の予測グループの参照メモリ数は何れも１フレームである。この場合も、図８と同様に３フレーム分のフレームメモリを使いまわすことで、図６と同様の復号化を行うことが可能である。図９の例では、例えば予測グループａのフレームのみを復号化し、符号化されたフレームを本来のフレームレートで再生することで、滑らかな３倍速再生を行うことも可能である。 FIG. 9 is a modified example of the example of FIG. 8, and is a prediction structure in which two frames belonging to the prediction group b are inserted between frames belonging to the prediction group a, and the reference group reference memory of each layer Each number is one frame. Also in this case, it is possible to perform the same decoding as in FIG. 6 by using the frame memory for three frames as in FIG. In the example of FIG. 9, for example, only the frame of the prediction group “a” is decoded, and the encoded frame is reproduced at the original frame rate, whereby smooth 3 × speed reproduction can be performed.

図１０では、図６と同様にＩピクチャ及びＰピクチャから構成され、また、予測グループはａ，ｂ，ｃの３階層とし、入力フレーム４フレーム毎に予測グループａのフレームが割り当てられ、予測グループａのフレーム間に予測グループｂの１フレームと予測グループｃの２フレームが配置された予測構造としている。 In FIG. 10, similarly to FIG. 6, the prediction group is composed of an I picture and a P picture, and the prediction group has three hierarchies a, b, and c. The prediction structure is such that one frame of the prediction group b and two frames of the prediction group c are arranged between the frames of a.

各階層の予測グループａ，ｂ，ｃの参照フレーム数はぞれぞれ１フレームであり、ａ，ｂ，ｃの順で階層が上がるものとする。つまり、予測グループａに属するのフレームは、復号化された予測グループａの１フレームのみを参照フレームとし、予測グループｂに属するのフレームは、復号化された予測グループａ及び予測グループｂの２フレームを参照フレームとし、予測グループｃに属するのフレームは、復号化された予測グループａ，ｂ及びｃの３フレームを参照フレームとして用いることができる。 It is assumed that the number of reference frames of the prediction groups a, b, and c in each hierarchy is one frame, and the hierarchy goes up in the order of a, b, and c. That is, the frame belonging to the prediction group a uses only one frame of the decoded prediction group a as a reference frame, and the frame belonging to the prediction group b includes two frames of the decoded prediction group a and prediction group b. Can be used as the reference frame, and the three frames of the decoded prediction groups a, b and c can be used as the reference frame.

図１０において、ＤＥＣ，ＲＥＦa，ＲＥＦb及びＲＥＦcは復号化フレームの一時保存フレームメモリ、予測グループａの参照フレーム、予測グループｂの参照フレーム、及び予測グループｃの参照フレームを示す論理的なフレームメモリの使い回しを示すものであり、またＦＭ１，ＦＭ２，ＦＭ３及びＦＭ４は、上記４フレーム分のフレームメモリの物理的な使い回しを示している。各階層の予測グループ毎に、直前に復号化された１フレームが、参照メモリＲＥＦa，ＲＥＦb及びＲＥＦcに一時保存され、また、現在復号中のフレームは復号化フレームメモリＤＥＣに書き込まれる。 In FIG. 10, DEC, REFa, REFb and REFc are logical frame memories indicating temporarily stored frame memories of decoded frames, reference frames of the prediction group a, reference frames of the prediction group b, and reference frames of the prediction group c. Reuse is indicated, and FM1, FM2, FM3, and FM4 indicate physical reuse of the frame memory for the four frames. For each prediction group in each layer, one frame decoded immediately before is temporarily stored in the reference memories REFa, REFb, and REFc, and the frame currently being decoded is written in the decoded frame memory DEC.

図１０の構成では、予測グループが３階層で構成されるため、予測グループｃ以下の全ての符号化フレームを復号化すると通常の再生が行われ、また予測グループｂ以下の符号化フレームを復号化すると、通常の１／２のフレームが復号化され、また予測グループａの符号化フレームのみを復号化すると、通常の１／４のフレームが復号化されることになる。また、上記何れの復号化においても、予測構造の破綻は発生せずに、正常に復号化画像を生成することが可能である。復号化する階層を動的に制御することで、滑らかな可変速の早送り再生を実現したり、あるいは送出する階層を動的に制御することで、送出ビットレートを動的に変更することが可能となる。 In the configuration of FIG. 10, since the prediction group is composed of three layers, normal decoding is performed when all the encoded frames below the prediction group c are decoded, and the encoded frames below the prediction group b are decoded. Then, a normal half frame is decoded, and if only a coded frame of the prediction group a is decoded, a normal quarter frame is decoded. In any of the above decoding, a decoded image can be normally generated without causing a failure of the prediction structure. It is possible to realize smooth variable-speed fast-forward playback by dynamically controlling the decoding layer or dynamically changing the sending bit rate by dynamically controlling the sending layer. It becomes.

図１１では、図７と同様にＩピクチャ、Ｐピクチャ及びＢピクチャから構成され、Ｉピクチャ及びＰピクチャを予測グループａとし、Ｂピクチャを予測グループｂとしている。予測グループｂは予測グループａの上位階層とする。また、予測グループａの参照メモリ数は２フレーム、予測グループｂの参照メモリ数は１フレームである。図１１の例では、予測グループａのＩピクチャ及びＰピクチャの参照メモリ数を２フレームとしているので、Ｐピクチャにおいて、直前に符号化あるいは復号化されたＩピクチャまたはＰピクチャと、さらにその前に符号化された符号化あるいは復号化されたＩピクチャまたはＰピクチャの２フレームを参照フレームとして用いることが可能である。また、Ｂピクチャにおいては、予測グループｂが参照フレームを１フレーム持つため、直前に符号化あるいは復号化されたＢピクチャ１フレームを参照フレームとし、さらに下位階層の予測グループである過去２フレームのＩピクチャ及びＰピクチャと合わせて、３フレームの参照フレームを用いることが可能である。 In FIG. 11, similar to FIG. 7, the picture is composed of an I picture, a P picture, and a B picture. The I picture and the P picture are set as a prediction group a, and the B picture is set as a prediction group b. The prediction group b is a higher hierarchy than the prediction group a. The number of reference memories in the prediction group a is 2 frames, and the number of reference memories in the prediction group b is 1 frame. In the example of FIG. 11, the number of reference memories of the I picture and P picture of the prediction group a is two frames. Therefore, in the P picture, the I picture or P picture coded or decoded immediately before, and further before that Two frames of an encoded or decoded I picture or P picture can be used as a reference frame. In addition, in the B picture, since the prediction group b has one reference frame, the B picture 1 frame that has been encoded or decoded immediately before is used as a reference frame, and the I of the past two frames that are prediction groups in the lower layers. It is possible to use a reference frame of 3 frames together with a picture and a P picture.

図８から図１０と同様に、ＦＭ１，ＦＭ２，ＦＭ３及びＦＭ４が物理的なフレームメモリの使い回しを示しており、ＤＥＣ，ＲＥＦa1，ＲＥＦa2及びＲＥＦbが論理的なフレームメモリの使い回しを示している。ＤＥＣは復号化中のフレームの一時保存フレームメモリであり、ＲＥＦa1及びＲＥＦa2は予測グループａの２フレーム分の参照メモリであり、ＲＥＦbは予測グループｂの１フレーム分の参照メモリを示している。 As in FIGS. 8 to 10, FM1, FM2, FM3, and FM4 indicate physical frame memory usage, and DEC, REFa1, REFa2, and REFb indicate logical frame memory usage. . DEC is a temporarily stored frame memory of a frame being decoded, REFa1 and REFa2 are reference memories for two frames of the prediction group a, and REFb is a reference memory for one frame of the prediction group b.

図１１におけるＩdx0，Ｉdx1及びＩdx2は、復号化中のフレームの参照フレームを特定するためのインデックスを示している。例えば、Ｐa6フレームの復号化においては、予測グループａに属する直前の２フレームＰa3，Ｉa0が参照フレームの候補であり、参照フレームのインデックスは符号化対象フレームに時間的に近いものから順次番号を割り当てる。参照フレームを示すインデックスは、マクログロック毎に符号化するものとし、マクロブロック毎に参照フレームの選択を行われる。インデックス０のマクロブロックは、直前のＩピクチャまたはＰピクチャから予測画像が生成され、インデックス１のマクロブロックは、２つ前のＩピクチャまたはＰピクチャから予測画像を生成する。また、直前のＩピクチャまたはＰピクチャ、及び２つ前のＩピクチャまたはＰピクチャの線形和によって予測画像を生成する場合、インデックス０及びインデック１の組み合わせであることを識別するインデックスがマクロブックのヘッダ情報として符号化される。 Idx0, Idx1, and Idx2 in FIG. 11 indicate indexes for specifying the reference frame of the frame being decoded. For example, in the decoding of the Pa6 frame, the immediately preceding two frames Pa3 and Ia0 belonging to the prediction group a are reference frame candidates, and the reference frame index is assigned sequentially from the one closest in time to the encoding target frame. . The index indicating the reference frame is encoded for each macro block, and the reference frame is selected for each macro block. The macroblock with index 0 generates a predicted image from the immediately preceding I picture or P picture, and the macroblock with index 1 generates a predicted image from the previous I picture or P picture. In addition, when a predicted image is generated by a linear sum of the immediately preceding I picture or P picture and the immediately preceding I picture or P picture, an index for identifying a combination of index 0 and index 1 is a macrobook header. It is encoded as information.

さらに、図１１におけるＢＷrefはＢピクチャにおける後方予測の参照フレームを示している。図１１の例では、例えばＢb1及びＢb2の後方参照フレームはＰa3，Ｂb4及びＢb5の後方参照フレームはＰa6となる。後方予測の参照フレームは、フレーム並べ替えの制約から、直前に符号化あるいは復号化されたＩピクチャまたはＰピクチャに制限されるため、参照フレームは一意に決定される。従って、後方予測の参照フレームＢＷrefについては、ヘッダ情報として符号化する必要はない。 Further, BWref in FIG. 11 indicates a reference frame for backward prediction in a B picture. In the example of FIG. 11, for example, the backward reference frames of Bb1 and Bb2 are Pa3, and the backward reference frames of Bb4 and Bb5 are Pa6. The reference frame for backward prediction is limited to the I picture or the P picture that has been encoded or decoded immediately before due to the restriction of the frame rearrangement, so that the reference frame is uniquely determined. Therefore, it is not necessary to encode the backward prediction reference frame BWref as header information.

また、Ｂピクチャにおける前方予測については、図１１の例では最大２フレームの中から選択可能である。例えば、Ｂb4の符号化及び復号化においては、時間的に直前のフレームであり予測グループａに属するＰa3及び時間的にさらにその前のフレームであり予測グループｂに属するＢb2を参照フレームとして用いることが可能であり、マクロブロック毎に何れの参照フレームを選択されたか、あるいは両者の線形和に予測を行うかを示すインデックスが符号化される。Ｂb5についても同様に、Ｂb4及びＰa3の２種類が参照フレームとして用いられる。 Further, forward prediction in a B picture can be selected from a maximum of two frames in the example of FIG. For example, in encoding and decoding of Bb4, Pa3 that is the previous frame in time and belonging to the prediction group a and Bb2 that is temporally previous and that belongs to the prediction group b are used as reference frames. This is possible, and an index indicating which reference frame is selected for each macroblock or whether a prediction is performed on the linear sum of both is encoded. Similarly for Bb5, two types Bb4 and Pa3 are used as reference frames.

参照フレームのインデックスは、符号化対象フレーム毎に前方予測の参照フレームとして使用可能な参照フレームに対して、時間的に近いものから順次番号を振る。図１１の例では、Ｐピクチャの符号化及び復号化においては参照メモリに保存されているＩまたはＰピクチャを時間順に並べて番号を振る。Ｂピクチャの符号化及び復号化においては、後方予測の参照フレームとして用いられる直前に符号化あるいは復号化されたＩピクチャまたはＰピクチャを除く、参照メモリに保存されている全ての参照フレームを時間順に並べて番号を振る。図１１におけるＩdx0及びＩdx1は、上述ルールに従って生成したインデックスである。 For reference frame indexes, numbers are assigned sequentially from the closest to the reference frame that can be used as a reference frame for forward prediction for each encoding target frame. In the example of FIG. 11, in encoding and decoding of a P picture, I or P pictures stored in the reference memory are arranged in time order and numbered. In the encoding and decoding of a B picture, all reference frames stored in the reference memory except for an I picture or a P picture encoded or decoded immediately before being used as a reference frame for backward prediction are sorted in time order. Number them side by side. Idx0 and Idx1 in FIG. 11 are indexes generated according to the above rules.

図１２は図１１の例の拡張であり、予測グループｂつまりＢピクチャについても参照フレーム数を２フレームに設定し、総フレームメモリ数を５フレームとした場合を示している。ＦＭ1〜ＦＭ５は、物理的な参照フレームの使い回しを示しており、ＤＥＣは復号化時の一時保存バッファ、ＲＥＦa1及びＲＥＦa2は予測グループａ、すなわちＩピクチャ及びＰピクチャの参照メモリ、ＲＥＦｂ１及びＲＥＦｂ２は予測グループｂ、すなわちＢピクチャの参照メモリの論理的な使い回しをそれぞれ示している。また、Ｉdx0，Ｉdx1及びＩdx2は前方予測における、参照フレームインデックスの割り付け、ＢＷrefはＢピクチャにおける後方予測の参照フレームをそれぞれ示している。図１１の例と同様に前方予測における参照フレームインデックスは、マクログロック毎にヘッダ情報として符号化される。 FIG. 12 is an extension of the example of FIG. 11 and shows a case where the reference frame number is set to 2 frames and the total frame memory number is 5 frames for the prediction group b, that is, the B picture. FM1 to FM5 indicate the reuse of physical reference frames, DEC is a temporary storage buffer at the time of decoding, REFa1 and REFa2 are prediction groups a, that is, reference memories of I and P pictures, and REFb1 and REFb2 are The logical use of the reference memory of the prediction group b, that is, the B picture is shown. Idx0, Idx1, and Idx2 are reference frame index assignments in forward prediction, and BWref is a reference frame for backward prediction in a B picture. Similar to the example of FIG. 11, the reference frame index in the forward prediction is encoded as header information for each macro clock.

図８から図１２の例では、各階層の予測グループの参照メモリ数は固定としたが、参照フレーム数の総数が一定の下で、各階層の予測グループの参照メモリ数の配分を動的に変更する構成としてもよい。例えば、図８の構成であるタイミングで予測グループｂの参照メモリ数を０として、同時に予測グループａの参照メモリ数を２とする再配分を行い、その配分変更を符号化データのヘッダ情報で符号化側から復号化側へ通知する構成とすればよい。その際、符号化側では予測グループaのフレームについては、予測グループaの過去２フレームからの予測を使用可能とし、また、予測グループｂのフレームについては、予測グループｂの過去のフレームからの予測は禁止し、予測グループaの過去２フレームからの予測を行うように、動き補償予測の選択を制御する。 In the examples of FIGS. 8 to 12, the number of reference memories in the prediction group in each layer is fixed. However, the distribution of the reference memory numbers in the prediction group in each layer is dynamically distributed under a fixed total number of reference frames. It is good also as a structure to change. For example, at the timing shown in FIG. 8, the redistribution is performed by setting the number of reference memories of the prediction group b to 0 and simultaneously the number of reference memories of the prediction group a to 2, and the distribution change is encoded with the header information of the encoded data. The configuration may be such that notification is made from the encryption side to the decoding side. At this time, on the encoding side, prediction from the past two frames of the prediction group a can be used for the frame of the prediction group a, and prediction from the past frames of the prediction group b for the frame of the prediction group b. Is prohibited and the selection of motion compensation prediction is controlled so that prediction from the past two frames of the prediction group a is performed.

図１３は、図８の例に対して上記のように参照メモリ数の配分を変化させた場合の予測構造及びフレームメモリの使い回しを示している。このようにすることで、限られた参照フレーム数の中で入力動画像に適した最適な予測構造を動的に設定することが可能となり、予測効率を向上させて高能率の符号化を行うことが可能となる。 FIG. 13 shows the prediction structure and the reuse of the frame memory when the distribution of the number of reference memories is changed as described above with respect to the example of FIG. In this way, it becomes possible to dynamically set an optimal prediction structure suitable for an input moving image within a limited number of reference frames, and to improve prediction efficiency and perform highly efficient encoding. It becomes possible.

本発明の一実施形態に係る動画像符号化装置の構成を示すブロック図The block diagram which shows the structure of the moving image encoder which concerns on one Embodiment of this invention. 動画像符号化における動き補償予測フレーム間符号化に関する主要な処理の流れを示す図The figure which shows the flow of the main processes regarding the motion compensation prediction inter-frame coding in moving image coding. 本発明の一実施形態に係る動画像復号化装置の構成を示すブロック図The block diagram which shows the structure of the moving image decoding apparatus which concerns on one Embodiment of this invention. 動画像復号化における動き補償予測フレーム間符号化結果の復号化に関する主要な処理の流れを示す図The figure which shows the flow of the main processes regarding decoding of the motion compensation prediction inter-frame encoding result in moving image decoding. 同実施形態に係る動画像符号化装置及び動画像復号化装置で用いられる動き補償予測器の構成例を示すブロック図FIG. 3 is a block diagram showing a configuration example of a motion compensated predictor used in the video encoding device and the video decoding device according to the embodiment. 従来のＭＰＥＧ動画像符号化におけるフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the prediction structure between frames in the conventional MPEG moving image encoding, and reference memory control 従来のＭＰＥＧ動画像符号化におけるフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the prediction structure between frames in the conventional MPEG moving image encoding, and reference memory control 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention 本発明の一実施形態に係るフレーム間予測構造及び参照メモリ制御の例を示す図The figure which shows the example of the inter-frame prediction structure and reference memory control which concern on one Embodiment of this invention

Explanation of symbols

１００…入力動画像信号
１０１…予測誤差信号
１０２…量子化ＤＣＴ係数データ
１０３…局部復号化画像信号
１０４…予測画像信号
１０５…サイド情報
１０６…符号化データ
１１１…動き補償予測器
１１２…ＤＣＴ変換器
１１３…量子化器
１１４…可変長符号化器
１１５…逆量子化器
１１６…逆ＤＣＴ器
１１７…参照フレーム書き込み制御スイッチ
１１８，１１９…参照メモリセット
１２０…参照フレーム読み出し制御器
２００…符号化データ
２０１…量子化ＤＣＴ係数データ
２０２…サイド情報
２０３…予測画像信号
２０４…復号化画像信号
２１１…動き補償予測器
２１４…可変長復号化器
２１５…逆量子化器
２１６…逆ＤＣＴ変換器
２１７…参照フレーム書き込み制御スイッチ
２１８，２１９…参照メモリセット
２２０…参照フレーム読み出し制御スイッチ
３００…予測マクロブロック選択器
３０１…線形予測器
３０２，３０３，２０４…参照メモリ DESCRIPTION OF SYMBOLS 100 ... Input moving image signal 101 ... Prediction error signal 102 ... Quantized DCT coefficient data 103 ... Local decoded image signal 104 ... Predicted image signal 105 ... Side information 106 ... Encoded data 111 ... Motion compensation predictor 112 ... DCT converter DESCRIPTION OF SYMBOLS 113 ... Quantizer 114 ... Variable length encoder 115 ... Inverse quantizer 116 ... Inverse DCT device 117 ... Reference frame write control switch 118, 119 ... Reference memory set 120 ... Reference frame read controller 200 ... Encoded data 201 Quantized DCT coefficient data 202 ... Side information 203 ... Predicted image signal 204 ... Decoded image signal 211 ... Motion compensation predictor 214 ... Variable length decoder 215 ... Inverse quantizer 216 ... Inverse DCT converter 217 ... Reference frame Write control switch 218, 219 ... Reference memory set 22 ... reference frame read control switch 300 ... prediction macroblock selector 301 ... linear predictor 302,303,204 ... reference memory

Claims

A moving picture decoding method for reproducing a moving picture by decoding coded data obtained by performing a coding process including motion compensated prediction interframe coding on a coding target frame of a moving picture,
First identification information indicating a group of layers to which the encoding target frame included in the encoded data is distributed, second identification information indicating a reference frame used for the motion compensation prediction interframe encoding, and Decoding information indicating a picture type of the encoding target frame;
In accordance with the decoded first identification information and second identification information, select at least one reference frame belonging to a group of layers below the layer to which the encoding target frame is distributed,
A moving picture decoding method for decoding a result of the motion compensated prediction interframe coding included in the coded data using a selected reference frame.

A moving image decoding apparatus that decodes encoded data obtained by performing an encoding process including motion compensated prediction inter-frame encoding on a moving image encoding target frame and reproduces a moving image,
First identification information indicating a group of layers to which the encoding target frame included in the encoded data is distributed, second identification information indicating a reference frame used for the motion compensation prediction interframe encoding, and Means for decoding information indicating a picture type of the encoding target frame;
Means for selecting at least one reference frame belonging to a group of layers below the layer to which the encoding target frame is distributed according to the decoded first identification information and second identification information;
A moving picture decoding apparatus comprising: means for decoding a result of the motion compensated prediction interframe coding included in the coded data using a selected reference frame.

A program for causing a computer to perform a process of reproducing a moving image by decoding encoded data obtained by performing an encoding process including motion compensated prediction inter-frame encoding on a moving image encoding target frame. There,
First identification information indicating a group of layers to which the encoding target frame included in the encoded data is distributed, second identification information indicating a reference frame used for the motion compensation prediction interframe encoding, and A process of decoding information indicating a picture type of the encoding target frame;
A process of selecting at least one reference frame belonging to a group in a layer to which the encoding target frame is distributed or a group in a lower layer according to the decoded first identification information and second identification information; ,
A program for causing the computer to perform a moving picture decoding process including a process of decoding a result of the motion compensated prediction interframe encoding included in the encoded data using a selected reference frame.