JP2004274732A

JP2004274732A - Video encoding method and video decoding method

Info

Publication number: JP2004274732A
Application number: JP2004027829A
Authority: JP
Inventors: Yoji Notoya; 陽司能登屋; Shinya Sumino; 眞也角野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-02-19
Filing date: 2004-02-04
Publication date: 2004-09-30
Anticipated expiration: 2024-02-04
Also published as: JP4405272B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a moving picture coding method and a moving picture decoding method, capable of editing in any place of the picture even though it is not an IDR picture which is a special intra picture. <P>SOLUTION: The motion picture coding device comprises a flag information generating unit 112 for generating a flag indicative of discontinuation of display order information POC when the display order information POC becomes discontinuous by editing, etc., and a variable length coding unit 113 for performing a variable length coding etc. in response to an inputted coded signal, and for generating a coded stream Str by adding information such as the flag etc. generated by the flag information generating unit 112. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

本発明は、動画像信号をピクチャ単位で符号化する動画像符号化方法、及び、上記符号化された動画像信号を復号化する動画像復号化方法、並びにそれをソフトウェアで実施するためのプログラムに関する。 The present invention relates to a moving picture coding method for coding a moving picture signal on a picture basis, a moving picture decoding method for decoding the coded moving picture signal, and a program for implementing the same with software About.

近年、音声、画像、その他の画素値を統合的に扱うマルチメディア時代を迎え、従来からの情報メディア、つまり新聞、雑誌、テレビ、ラジオ、電話等の情報を人に伝達する手段がマルチメディアの対象として取り上げられるようになってきた。一般に、マルチメディアとは、文字だけでなく、図形、音声、特に画像等を同時に関連づけて表すことをいうが、上記従来の情報メディアをマルチメディアの対象とするには、その情報をディジタル形式にして表すことが必須条件となる。 In recent years, the multimedia era, in which audio, images, and other pixel values are integrated, has been approached, and the traditional information media, that is, means for transmitting information such as newspapers, magazines, televisions, radios, and telephones to humans, has It has been taken up as an object. Generally, multimedia means not only characters, but also figures, sounds, and especially images, etc., that are simultaneously associated with each other. Is an essential condition.

ところが、上記各情報メディアの持つ情報量をディジタル情報量として見積もってみると、文字の場合１文字当たりの情報量は１〜２バイトであるのに対し、音声の場合１秒当たり64Kbits（電話品質）、さらに動画については１秒当たり100Mbits（現行テレビ受信品質）以上の情報量が必要となり、上記情報メディアでその膨大な情報をディジタル形式でそのまま扱うことは現実的では無い。例えば、テレビ電話は、64Kbit/s〜1.5Mbits/sの伝送速度を持つサービス総合ディジタル網（ISDN : Integrated Services Digital Network）によってすでに実用化されているが、テレビ・カメラの映像をそのままISDNで送ることは不可能である。 However, when the amount of information of each information medium is estimated as a digital information amount, the amount of information per character is 1-2 bytes in the case of characters, whereas 64 Kbits per second in the case of voice (telephone quality). In addition, for a moving image, an information amount of 100 Mbits per second (current television reception quality) or more is required, and it is not realistic to handle the vast amount of information in the above-mentioned information medium as it is in a digital format. For example, videophones have already been put into practical use by the Integrated Services Digital Network (ISDN), which has a transmission speed of 64 Kbit / s to 1.5 Mbits / s. It is impossible.

そこで、必要となってくるのが情報の圧縮技術であり、例えば、テレビ電話の場合、ITU-T（国際電気通信連合電気通信標準化部門）で勧告されたH.261やH.263規格の動画圧縮技術が用いられている。また、MPEG-１規格の情報圧縮技術によると、通常の音楽用CD（コンパクト・ディスク）に音声情報とともに画像情報を入れることも可能となる。
ここで、MPEG（Moving Picture Experts Group）とは、ISO/IEC（国際標準化機構国際電気標準会議）で標準化された動画像信号圧縮の国際規格であり、MPEG-１は、動画像信号を１．５Mbpsまで、つまりテレビ信号の情報を約１００分の１にまで圧縮する規格である。また、MPEG-１規格では対象とする品質を伝送速度が主として約１．５Mbpsで実現できる程度の中程度の品質としたことから、さらなる高画質化の要求をみたすべく規格化されたMPEG-２では、動画像信号を２〜１５MbpsでＴＶ放送品質を実現する。さらに現状では、MPEG-１，MPEG-２と標準化を進めてきた作業グループ（ISO/IEC JTC1/SC29/WG11）によって、MPEG-１，MPEG-２を上回る圧縮率を達成し、更に物体単位で符号化・復号化・操作を可能とし、マルチメディア時代に必要な新しい機能を実現するMPEG-４が規格化された。MPEG-４では、当初、低ビットレートの符号化方法の標準化を目指して進められたが、現在はインタレース画像も含む高ビットレートも含む、より汎用的な符号化に拡張されている。更に、現在は、ISO/IECとITU-Tが共同でより高圧縮率の次世代画像符号化方式として、MPEG-4 AVCおよびITU H.264 の標準化活動が進んでいる。2002年8月の時点で、次世代画像符号化方式はコミッティー・ドラフト（CD）と呼ばれるものが発行されている（また、例えば、非特許文献１参照。）。 Therefore, it is necessary to use information compression technology. For example, in the case of videophones, videos based on the H.261 and H.263 standards recommended by the ITU-T (International Telecommunication Union Telecommunication Standardization Sector) Compression technology is used. Further, according to the information compression technology of the MPEG-1 standard, it is possible to store image information together with audio information in a normal music CD (compact disc).
Here, MPEG (Moving Picture Experts Group) is an international standard for moving image signal compression standardized by ISO / IEC (International Organization for Standardization), and MPEG-1 is used for moving image signals. It is a standard that compresses information of television signals to 5 Mbps, that is, to about 1/100. In addition, since the target quality is set to a medium quality that can be realized at a transmission speed of about 1.5 Mbps mainly in the MPEG-1 standard, MPEG-2 standardized to meet the demand for higher image quality. Then, the TV broadcast quality of the moving image signal is realized at 2 to 15 Mbps. Furthermore, at present, the working group (ISO / IEC JTC1 / SC29 / WG11), which has been working on the standardization of MPEG-1 and MPEG-2, has achieved compression ratios exceeding those of MPEG-1 and MPEG-2, and furthermore, on a per-object basis. MPEG-4, which enables encoding, decoding, and operation, and realizes new functions required in the multimedia age, has been standardized. MPEG-4 was initially aimed at standardizing a low bit rate coding method, but is now expanded to more general-purpose coding including high bit rates including interlaced images. Further, at present, ISO / IEC and ITU-T are jointly working on standardization of MPEG-4 AVC and ITU H.264 as next-generation image coding systems with higher compression ratios. As of August 2002, a next-generation image coding system called a committee draft (CD) has been issued (for example, see Non-Patent Document 1).

一般に動画像の符号化では、時間方向および空間方向の冗長性を削減することによって情報量の圧縮を行う。そこで時間的な冗長性の削減を目的とする画面間予測符号化では、前方または後方のピクチャを参照してブロック単位で動きの検出および予測画像の作成を行い、得られた予測画像と符号化対象ピクチャとの差分値に対して符号化を行う。ここで、ピクチャとは1枚の画面を表す用語であり、プログレッシブ画像ではフレームを意味し、インタレース画像ではフレームもしくはフィールドを意味する。ここで、インタレース画像とは、１つのフレームが時刻の異なる２つのフィールドから構成される画像である。インタレース画像の符号化や復号化処理においては、１つのフレームをフレームのまま処理したり、２つのフィールドとして処理したり、フレーム内のブロック毎にフレーム構造またはフィールド構造として処理したりすることができる。 Generally, in coding of a moving image, the amount of information is compressed by reducing redundancy in the time direction and the space direction. Therefore, in inter-picture predictive coding for the purpose of reducing temporal redundancy, motion detection and a predicted image are created in block units with reference to a forward or backward picture, and the obtained predicted image and coding are performed. Encoding is performed on the difference value from the target picture. Here, a picture is a term representing one screen, and means a frame in a progressive image and a frame or a field in an interlaced image. Here, an interlaced image is an image in which one frame is composed of two fields at different times. In encoding and decoding of an interlaced image, one frame may be processed as a frame, processed as two fields, or processed as a frame structure or a field structure for each block in the frame. it can.

参照画像を持たず画面内予測符号化を行うものをＩピクチャと呼ぶ。また、１枚のピクチャのみを参照し画面間予測符号化を行うものをＰピクチャと呼ぶ。また、同時に２枚のピクチャを参照して画面間予測符号化を行うことのできるものをＢピクチャと呼ぶ。Ｂピクチャは表示時間が前方もしくは後方から任意の組み合わせとして２枚のピクチャを参照することが可能である。参照画像（参照ピクチャ）は符号化および復号化の基本単位であるブロックごとに指定することができるが、符号化を行ったビットストリーム中に先に記述される方の参照ピクチャを第１参照ピクチャ、後に記述される方を第２参照ピクチャとして区別する。ただし、これらのピクチャを符号化および復号化する場合の条件として、参照するピクチャが既に符号化および復号化されている必要がある。 A picture that does not have a reference picture and performs intra-picture prediction coding is called an I picture. A picture that performs inter-picture predictive encoding by referring to only one picture is called a P picture. A picture that can perform inter-picture predictive encoding by simultaneously referring to two pictures is called a B picture. The B picture can refer to two pictures as an arbitrary combination of display time from the front or the rear. The reference picture (reference picture) can be specified for each block, which is a basic unit of encoding and decoding, but the reference picture described earlier in the encoded bit stream is replaced with the first reference picture. , The one described later is distinguished as the second reference picture. However, as a condition for encoding and decoding these pictures, the picture to be referred to must be already encoded and decoded.

Ｐピクチャ又はＢピクチャの符号化には、動き補償画面間予測符号化が用いられている。動き補償画面間予測符号化とは、画面間予測符号化に動き補償を適用した符号化方式である。動き補償とは、単純に参照フレームの画素値から予測するのではなく、ピクチャ内の各部の動き量（以下、これを動きベクトルと呼ぶ）を検出し、当該動き量を考慮した予測を行うことにより予測精度を向上すると共に、データ量を減らす方式である。例えば、符号化対象ピクチャの動きベクトルを検出し、その動きベクトルの分だけシフトした予測値と符号化対象ピクチャとの予測残差を符号化することによりデータ量を減らしている。この方式の場合には、復号化の際に動きベクトルの情報が必要になるため、動きベクトルも符号化されて記録又は伝送される。 Motion-compensated inter-picture prediction coding is used for coding a P picture or a B picture. The motion compensated inter-picture predictive coding is a coding method in which motion compensation is applied to the inter-picture predictive coding. Motion compensation does not simply predict from the pixel value of a reference frame, but detects the amount of motion of each part in a picture (hereinafter referred to as a motion vector) and performs prediction in consideration of the amount of motion. Is a method for improving the prediction accuracy and reducing the amount of data. For example, a data amount is reduced by detecting a motion vector of a current picture to be coded and coding a prediction residual between a predicted value shifted by the motion vector and the current picture to be coded. In the case of this method, since information of a motion vector is required at the time of decoding, the motion vector is also encoded and recorded or transmitted.

動きベクトルはマクロブロック単位で検出されており、具体的には、符号化対象ピクチャ側のマクロブロックを固定しておき、参照ピクチャ側のマクロブロックを探索範囲内で移動させ、基準ブロックと最も似通った参照ブロックの位置を見つけることにより、動きベクトルが検出される。
図１９は、従来の動画像符号化装置の構成を示すブロック図である。 The motion vector is detected in units of macroblocks. Specifically, the macroblock on the picture to be coded is fixed, and the macroblock on the reference picture is moved within the search range, so that it is most similar to the reference block. The motion vector is detected by finding the position of the reference block.
FIG. 19 is a block diagram showing a configuration of a conventional moving picture encoding device.

動画像符号化装置は、動き検出部１０３、減算演算部１０４、符号化部１０５、動き補償部１０６、可変長符号化部１０７、復号化部１０８、加算演算部１０９、およびメモリ１１０、１１１を備えている。
動画像信号Vinは、減算演算部１０４と、動き検出部１０３に入力される。
動き検出部１０３は、メモリ１１０から読み出した符号化済みの復号化画像データを参照ピクチャとして用いて、そのピクチャ内の探索領域において最適と予測される位置を示す動きベクトルMVの検出し、動き補償部１０６へ出力する。 The moving picture coding apparatus includes a motion detection unit 103, a subtraction calculation unit 104, a coding unit 105, a motion compensation unit 106, a variable length coding unit 107, a decoding unit 108, an addition calculation unit 109, and memories 110 and 111. Have.
The moving image signal Vin is input to the subtraction operation unit 104 and the motion detection unit 103.
Using the encoded decoded image data read from the memory 110 as a reference picture, the motion detection unit 103 detects a motion vector MV indicating a position predicted to be optimal in a search area in the picture, and performs motion compensation. Output to the unit 106.

動き補償部１０６は、動き検出部１０３により検出された動きベクトルMVを用いて動き補償画像信号MCRefを生成し、減算演算部１０４および加算演算部１０９へ出力する。
減算演算部１０４は、入力された動画像信号Vinと、動き補償部１０６より入力された動き補償画像信号MCRefとの差分を演算し、差分信号Difを符号化部１０５へ出力する。
符号化部１０５は、入力された差分信号Difに対して周波数変換や量子化等の符号化処理を行い、符号化信号を生成し、可変長符号化部１０７および復号化部１０８へ出力する。可変長符号化部１０７は、入力された符号化信号に対して可変長符号化等を行い、さらに動き補償部１０６から入力された動きベクトルMV等を付加することにより符号化ストリームStrを生成し、動画像符号化装置の外部へ出力する。 The motion compensating unit 106 generates a motion-compensated image signal MCRef using the motion vector MV detected by the motion detecting unit 103, and outputs the motion-compensated image signal MCRef to the subtraction operation unit 104 and the addition operation unit 109.
The subtraction operation unit 104 calculates the difference between the input moving image signal Vin and the motion compensated image signal MCRef input from the motion compensation unit 106, and outputs a difference signal Dif to the encoding unit 105.
The coding section 105 performs coding processing such as frequency conversion and quantization on the input difference signal Dif, generates a coded signal, and outputs the coded signal to the variable-length coding section 107 and the decoding section 108. The variable-length coding unit 107 generates a coded stream Str by performing variable-length coding and the like on the input coded signal and further adding the motion vector MV and the like input from the motion compensation unit 106. Output to the outside of the video encoding device.

復号化部１０８は、入力された符号化信号に対して逆量子化や逆周波数変換等の復号化処理を行い、復号した差分信号RecDifを加算演算部１０９へ出力する。
加算演算部１０９は、復号化部１０８より入力された差分信号RecDifと、動き補償部１０６より入力された画像信号RecMCRefとを加算し、ローカル復号画像LocalReconを生成する。生成されたローカル復号画像LocalReconは、メモリ１１１へ出力される。 The decoding unit 108 performs decoding processing such as inverse quantization and inverse frequency conversion on the input encoded signal, and outputs the decoded difference signal RecDif to the addition operation unit 109.
The addition operation unit 109 adds the difference signal RecDif input from the decoding unit 108 and the image signal RecMCRef input from the motion compensation unit 106 to generate a local decoded image LocalRecon. The generated local decoded image LocalRecon is output to the memory 111.

ローカル復号画像は、動画像復号化装置で復号化された結果と一致する画像であり、次の時刻の動画像信号Vinを符号化する際には、参照画像として使われる。よって、メモリ１１１に書き込まれたローカル復号画像LocalReconは、次の動画像信号Vinを入力するまでにメモリ１１０へコピーされるか、あるいは、メモリ１１０とメモリ１１１は交換される。 The local decoded image is an image that matches the result decoded by the video decoding device, and is used as a reference image when coding the video signal Vin at the next time. Therefore, the local decoded image LocalRecon written in the memory 111 is copied to the memory 110 before the next moving image signal Vin is input, or the memory 110 and the memory 111 are exchanged.

図２０はJVTの表示順情報（Picture Order Count:POC)とフレーム番号（Frame Number:FN)の概念を説明する図である。表示順情報POCはピクチャの表示の順序を示している。ただし、実際の表示時間を意味しているのではない。例えば、図中のピクチャIDR19の表示順情報POCは“０”であり、次のピクチャB20のPOCは“１”であるので、ピクチャB20はピクチャIDR19の次に表示すべきことはわかるが、どれくらいの時間を経過した後に表示すべきかはわからない。実際の表示時間は、各ピクチャに関連づけられた、ビデオ以外のデータから得られ、ビデオデコーダ（動画像復号化装置）の関与しない装置で管理される。表示順情報POCは特別なイントラピクチャであるIDRピクチャで常に“０”にリセットされ、表示順でピクチャ単位に１ずつ増加するように付与される。あらかじめ定めた最大値になると、再び“０”にリセットされる。図の例では、IDRピクチャであるピクチャIDR19とピクチャIDR29で表示順情報POCが“０”に戻る様子を示し、また、表示順情報POCの最大値を“４”と設定して、ピクチャB24で循環して“０”に戻る様子を示している。 FIG. 20 is a diagram for explaining the concept of the display order information (Picture Order Count: POC) and the frame number (Frame Number: FN) of the JVT. The display order information POC indicates the display order of the pictures. However, this does not mean the actual display time. For example, since the display order information POC of the picture IDR19 in the figure is “0” and the POC of the next picture B20 is “1”, it is known that the picture B20 should be displayed next to the picture IDR19. I do not know if it should be displayed after the time has passed. The actual display time is obtained from non-video data associated with each picture, and is managed by a device that does not involve a video decoder (video decoding device). The display order information POC is always reset to “0” in an IDR picture, which is a special intra picture, and is added so as to increase by one for each picture in the display order. When it reaches a predetermined maximum value, it is reset to "0" again. In the example of the figure, the display order information POC returns to “0” in the picture IDR pictures IDR19 and IDR29, and the maximum value of the display order information POC is set to “4”. The state of circulating and returning to “0” is shown.

FNは被参照ピクチャに付与される番号である。図中（A）は、ピクチャB21をデコードする前のメモリの状態を示していて、３枚の参照ピクチャが格納されている。図中（B）は、ピクチャB21をデコードし、メモリに格納した後の状態を示している。ここでピクチャB21のFNは、次にデコードするピクチャP25と同じ値を持っているが、このようにデコード順で連続する複数のピクチャが同じFNを持つ場合、デコード順で最後のピクチャが参照ピクチャであり、それ以外のピクチャは参照ピクチャではないことを意味している。この例では、ピクチャB21は参照ピクチャではないので、メモリに格納されると、“参照ピクチャとして不使用”とマークされる（マークされた状態をunusedと略する）。参照ピクチャがメモリに格納される場合は、“参照ピクチャとして使用”とマークされる（マークされた状態をusedと略する）。なお、図では“unused”だけを記載している。また、参照ピクチャであるか否かは符号化ストリーム中のnal_ref_idcというフィールドからも分かるが、本発明の説明に直接関わらないのでここでは説明しない。また、フレーム番号FNも表示順情報POCと同様にIDRピクチャで常に“０”にリセットされ、あらかじめ定めた最大値になると、再び“０”に戻される。この例では、ピクチャIDR19とピクチャIDR29で“０”にリセットされ、ピクチャB24で“０”に戻されている様子を示している。 FN is a number assigned to the referenced picture. (A) in the figure shows the state of the memory before decoding the picture B21, in which three reference pictures are stored. (B) in the figure shows a state after the picture B21 is decoded and stored in the memory. Here, the FN of the picture B21 has the same value as the picture P25 to be decoded next, but when a plurality of pictures consecutive in the decoding order have the same FN as described above, the last picture in the decoding order is the reference picture. And that the other pictures are not reference pictures. In this example, since the picture B21 is not a reference picture, when it is stored in the memory, it is marked as "unused as a reference picture" (the marked state is abbreviated as unused). When the reference picture is stored in the memory, it is marked as “used as a reference picture” (the marked state is abbreviated as used). In the figure, only “unused” is described. Also, whether or not the picture is a reference picture can be seen from a field called nal_ref_idc in the encoded stream, but is not described here because it does not directly relate to the description of the present invention. Also, the frame number FN is always reset to “0” in the IDR picture, similarly to the display order information POC, and is returned to “0” again when it reaches a predetermined maximum value. In this example, it is shown that the picture IDR19 and the picture IDR29 are reset to “0” and the picture B24 is reset to “0”.

図２１と図２２を用いて、メモリから、空き領域を確保するために、ピクチャを消去する動作を、説明する。図２１はunusedされたピクチャがある場合の、消去動作を説明する図である。ピクチャP23をデコードする直前のメモリにはピクチャIDR19、P22、B20、B21をデコードしたピクチャが格納され、ピクチャB20は参照されないピクチャなので、あらかじめunusedされているとする（図中（A）参照）。次に、メモリ管理を行うMMCO(Memory management control operation)あるいは古いものから順に不用であるものとするスライディングウィンドウなどの手法を用いて必要に応じてピクチャをunusedする。これら操作を本明細書では不使用マーキング処理と呼ぶ。ここでは、ピクチャP22がunusedされたとする（図中（B）参照）。次に、空き領域を確保するためにピクチャを消去するが、このようにunusedピクチャがある場合には、unusedピクチャの中で、表示順(POC)の最も早いピクチャを消去する。ここでは、ピクチャP22の表示順が“３”で、ピクチャB20の表示順が“１”なので、ピクチャB20を消去する（図中（C）参照）。この消去して空いた領域へピクチャP23を格納する（図中（D）参照）。 The operation of erasing a picture in order to secure a free area from the memory will be described with reference to FIGS. 21 and 22. FIG. 21 is a diagram for explaining an erasing operation when there is an unused picture. Immediately before decoding the picture P23, pictures obtained by decoding the picture IDs R19, P22, B20, and B21 are stored. Since the picture B20 is not referenced, it is assumed that the picture B20 has been previously unused (see (A) in the figure). Next, the pictures are unused as necessary using a technique such as an MMCO (Memory management control operation) for performing memory management or a sliding window that is determined to be unnecessary from the oldest one. These operations are referred to herein as unused marking processing. Here, it is assumed that the picture P22 has been unused (see (B) in the figure). Next, the picture is erased in order to secure a free area. If there is such an unused picture, the picture with the earliest display order (POC) among the unused pictures is erased. Here, since the display order of the picture P22 is "3" and the display order of the picture B20 is "1", the picture B20 is deleted (see (C) in the figure). The picture P23 is stored in the empty area after the deletion (see (D) in the figure).

なお、ピクチャにはフレームとフィールドとがあり、本明細書ではピクチャとして説明しているが、メモリに格納する際は、フレーム単位（同一時刻の奇数フィールドと偶数フィールド）に格納してもよい。また、メモリに空き領域を確保するために消去する際も、フレーム単位に消去してもよい。
なお、図中stageで示した番号は、メモリの遷移段階を示していて、stage1はそのピクチャの処理において不使用マーク処理をする前の段階、stage2は不使用マーク処理がされた後の段階、stage3は空き領域を確保した後の段階、stage4はピクチャを格納した後の段階を意味している。 Note that a picture includes a frame and a field, and is described as a picture in this specification. However, when storing in a memory, the picture may be stored in a frame unit (an odd field and an even field at the same time). Also, when erasing to secure a free area in the memory, erasing may be performed in frame units.
In the figure, the number indicated by stage indicates the transition stage of the memory, stage 1 is the stage before performing the unused mark processing in the processing of the picture, stage 2 is the stage after the unused mark processing is performed, stage3 means a stage after securing a free area, and stage4 means a stage after storing a picture.

図２２はunusedされたピクチャがメモリに無い場合の、消去動作を説明する図である。図に示すようにピクチャは、ピクチャIDR19、P22、B20、B21、P23の順でデコードされる。図中（A）に示すように、ピクチャP23をデコードする前の段階で、メモリにIDR19、P22、B20、B21のピクチャが格納されて、いずれもunusedでないとする。そして、図中（B）に示すように、不使用マーキング処理においても、いずれもunusedされなかったとする。このように、unusedされたピクチャが無い場合に、空き領域を確保する際は、メモリに格納されているピクチャの中で、最初にデコードしたピクチャを消去する。図中（C）に示すように、ここでは、メモリに格納されているピクチャの中ではIDR19が最初にデコードしたピクチャなのでIDR19を消去する。最後に図中（D）に示すように、空いた領域へ、デコードしたピクチャP23を格納する。 FIG. 22 is a view for explaining an erasing operation when there is no unused picture in the memory. As shown in the figure, the pictures are decoded in the order of picture IDR19, P22, B20, B21, P23. As shown in (A) in the figure, it is assumed that the pictures of IDR19, P22, B20, and B21 are stored in the memory before the picture P23 is decoded, and none of them is unused. Then, as shown in (B) in the figure, it is assumed that none of the unused marking processes have been unused. As described above, when there is no unused picture, in order to secure a free area, the picture decoded first among the pictures stored in the memory is deleted. As shown in (C) in the figure, since the IDR19 is the first decoded picture among the pictures stored in the memory, the IDR19 is deleted. Finally, as shown in (D) in the figure, the decoded picture P23 is stored in the empty area.

図２３は従来の動画像復号化装置の構成を示すブロック図である。
動画像復号化装置は、可変長復号化部４０２、画像復号化部２０２、MMCO復号化部２０４、メモリ２０６、およびメモリ管理部４０１を備えている。
動画像符号化信号Strを入力とし、可変長復号化部４０２にて可変長復号化を行い、符号化されているピクチャデータcomp_picを画像復号化部２０２にて復号化し復号画像信号Reconをメモリ２０６へ格納する。画像復号化部２０２は、ピクチャが画面間符号化されている場合には、復号化の際、動き情報MVをメモリ２０６へ送り動き補償済み参照画像MCPicを作成し、動き補償を行う。ピクチャの格納領域の決定、空き領域の確保等の、メモリ管理の指示mctrlはメモリ管理部４０１により出力される。表示順情報POCが可変長復号化部４０２からメモリ管理部４０１へ出力され保持される。また、前述した不使用マーキング処理の一つであるMMCOコマンド MMCOは可変長復号化部４０２からMMCO復号化部２０４へ入力され、デコードされメモリ管理部４０１へunusedの指示が入力される。また、メモリ２０６から表示される復号画像信号Voutが出力される。 FIG. 23 is a block diagram showing a configuration of a conventional video decoding device.
The moving picture decoding apparatus includes a variable length decoding unit 402, an image decoding unit 202, an MMCO decoding unit 204, a memory 206, and a memory management unit 401.
The video encoded signal Str is input, the variable-length decoding unit 402 performs variable-length decoding, the encoded picture data comp_pic is decoded by the image decoding unit 202, and the decoded image signal Recon is stored in the memory 206. To store. When the picture is inter-coded, the image decoding unit 202 sends the motion information MV to the memory 206 to create a motion-compensated reference image MCPic and performs motion compensation at the time of decoding. The memory management unit 401 outputs a memory management instruction mctrl for determining a picture storage area, securing a free area, and the like. The display order information POC is output from the variable length decoding unit 402 to the memory management unit 401 and held. Further, the MMCO command MMCO, which is one of the above-described non-use marking processes, is input from the variable length decoding unit 402 to the MMCO decoding unit 204, decoded, and an instruction of unused is input to the memory management unit 401. Further, the decoded image signal Vout displayed from the memory 206 is output.

図２４は従来の動画像復号化装置のメモリ関連の動作のフロー図である。本フローはピクチャ単位の動作をステップS1からステップS2で示している。不使用マーキング処理を行い、メモリの各ピクチャに対して必要に応じてunusedとマーキングする（ステップS１３）。次に、空き領域確保処理を行い、空き領域をメモリに確保する（ステップS14）。次に、空き領域へ復号画像信号Voutを格納する（ステップS15）。 FIG. 24 is a flowchart of a memory-related operation of the conventional video decoding device. In this flow, the operation for each picture is shown in steps S1 to S2. Unused marking processing is performed, and each picture in the memory is marked as unused as necessary (step S13). Next, a free area securing process is performed to secure a free area in the memory (step S14). Next, the decoded image signal Vout is stored in the empty area (step S15).

図２５は従来の動画像復号化装置の空き領域確保処理の動作のフロー図であり、図２４のステップS14を詳細に説明するフロー図である。空き領域を確保する処理（ステップS14）は、unusedとマークされたピクチャがメモリ２０６にあるかを調べ（ステップS141）、ある場合には、メモリ２０６に格納されたunusedとマークされたピクチャの中で表示順の最も古いピクチャを削除し（ステップS143）、ない場合には、メモリ２０６に格納されたピクチャの中で最初にデコードしたピクチャを削除する（ステップS142）。 FIG. 25 is a flow chart of the operation of the free space securing process of the conventional moving picture decoding apparatus, and is a flow chart for explaining step S14 of FIG. 24 in detail. The process of securing a free area (step S14) checks whether a picture marked as unused is present in the memory 206 (step S141). Deletes the oldest picture in the display order (step S143), and if not, deletes the first decoded picture among the pictures stored in the memory 206 (step S142).

図２７は無効ピクチャ処理の動作を説明する概念図である。JVTでは動画像復号化装置に入力されるシーケンスの一部のピクチャがロストした時、ロストした枚数分だけ無効ピクチャを挿入するというメモリ管理の動作が規定されている。この動作はシーケンスパラメータセット内のrequired_frame_num_update_behaviour_flagが“１”のとき動画像復号化装置において行われる。無効ピクチャとは実際の復元画像信号を持たず、特別にマーキングされたピクチャのことであり、参照ピクチャとして参照してはいけないことになっている。同図に示すような、ピクチャI19、P20、P21、P22、P23をデコードした後のメモリの状態は同図（A）に示す状態であったとする。次にピクチャB24をデコードする際、デコード順で新しいunusedでないピクチャに参照インデックス ref_idxの値が小さくなるように、参照ピクチャを特定するために用いる参照インデックスを割り当てる。この割り当ては一例であり、ピクチャタイプ等によって方法が異なるが、メモリに格納されているピクチャに依存して参照関係のインデックスが割り当てられるという、依存性質があることは同じである。この図の例では、最後にデコードしたunusedでないピクチャP22をref_idx=0、その前にデコードしたunusedでないピクチャP21をref_idx=1というように割り当てる。 FIG. 27 is a conceptual diagram illustrating the operation of the invalid picture process. In the JVT, a memory management operation is defined in which, when some pictures of a sequence input to a moving picture decoding apparatus are lost, invalid pictures are inserted by the number of lost pictures. This operation is performed in the video decoding device when required_frame_num_update_behaviour_flag in the sequence parameter set is “1”. An invalid picture is a picture that has no actual restored image signal and is specially marked, and is not to be referred to as a reference picture. It is assumed that the state of the memory after decoding the pictures I19, P20, P21, P22, and P23 as shown in the figure is as shown in FIG. Next, when decoding the picture B24, a reference index used for specifying a reference picture is assigned to a new non-unused picture in the decoding order so that the value of the reference index ref_idx becomes smaller. This assignment is an example, and the method differs depending on the picture type and the like, but the assignment has the dependency property that the index of the reference relationship is assigned depending on the picture stored in the memory. In the example of this figure, the unused decoded picture P22 decoded last is assigned as ref_idx = 0, and the previously decoded non-used picture P21 decoded as ref_idx = 1.

ここで、ピクチャP21とピクチャP23が伝送の途中等で失われ、デコーダに入力されなかった場合、無効ピクチャを挿入しないと、ピクチャB24をデコードする際は、同図（B）に示すように、参照インデックス ref_idxが割り当てられる。本来、ピクチャB24の参照するピクチャP22とピクチャP20はそれぞれ、ref_idx=0と、ref_idx=2と割り当てられるところが、ref_idx=0はピクチャP22へ、ref_idx=2はピクチャI19へ割り当てられるため、ピクチャP20と間違えてピクチャI19を参照してしまうという問題がある。これを回避するために、無効ピクチャが挿入される。 Here, if the picture P21 and the picture P23 are lost during transmission or the like and are not input to the decoder, and if an invalid picture is not inserted, when decoding the picture B24, as shown in FIG. Reference index ref_idx is assigned. Originally, reference picture P22 and picture P20 of picture B24 are respectively assigned ref_idx = 0 and ref_idx = 2.However, ref_idx = 0 is assigned to picture P22, and ref_idx = 2 is assigned to picture I19. There is a problem that the picture I19 is referred to by mistake. To avoid this, invalid pictures are inserted.

無効ピクチャを挿入した場合の、ピクチャB24をデコードする前のメモリの様子を示したのが同図（C）である。フレームナンバー FNの不連続を検出したならば、不連続の枚数分だけ無効ピクチャを挿入する。この例では、FN=3であるピクチャP22をデコードする際、その直前にデコードしたピクチャP20はFN=1であるので、本来１以上増えないところが２増えているので、１枚ロストしたことがわかる。したがって、ピクチャP22をデコードする前に、無効ピクチャを１枚挿入する。前述のように無効ピクチャは特別なピクチャであり、実際の復元された画像信号を持たないけれどusedとマークされ、参照ピクチャの割り当ての際は参照ピクチャとして扱われるが、実際に参照はしてはいけないので、さらに、”存在しない（non-exist）“とマークされる。 FIG. 11C shows the state of the memory before decoding picture B24 when an invalid picture is inserted. If a discontinuity in the frame number FN is detected, invalid pictures are inserted by the number of discontinuities. In this example, when the picture P22 with FN = 3 is decoded, the picture P20 decoded immediately before is FN = 1, so that the number of places where the picture P20 does not increase by one or more is increased by two, so it can be seen that one picture has been lost. . Therefore, before decoding the picture P22, one invalid picture is inserted. As described above, an invalid picture is a special picture, and has no actual restored image signal, but is marked as used, and is treated as a reference picture when a reference picture is assigned. Otherwise, it is marked as "non-exist".

図２８は従来の動画像復号化装置の構成を示すブロック図である。図２３で説明した従来の動画像符号化装置との違いは、FNギャップ検出部２１１があることと、メモリ管理部４１２の動作が異なることである。FNギャップ検出部２１１は可変長復号化部４１１からフレームナンバー FNを取得し、ギャップがある場合には、必要な枚数だけ無効ピクチャを挿入するように、メモリ管理部４１２へ指示する。メモリ管理部４１２は指示された枚数分だけ無効ピクチャをメモリ２０６へ格納する。 FIG. 28 is a block diagram showing a configuration of a conventional video decoding device. The difference from the conventional moving picture coding apparatus described with reference to FIG. 23 is that the FN gap detecting section 211 is provided and the operation of the memory managing section 412 is different. The FN gap detection unit 211 acquires the frame number FN from the variable length decoding unit 411, and when there is a gap, instructs the memory management unit 412 to insert a necessary number of invalid pictures. The memory management unit 412 stores invalid pictures in the memory 206 by the designated number.

図２９は従来の動画像復号化装置の無効ピクチャ処理の動作のフロー図である。図２４で説明した従来の動画像復号化装置のメモリ関連の動作と異なる点は、不使用マーキング処理（ステップS13）の前に、フレームナンバー FNのギャップを調べ（ステップS11）、ギャップがある場合には、ギャップの枚数分だけ無効ピクチャをメモリ２０６へ格納（ステップS12）した後、不使用マーキング処理（ステップS13）へ進み、ギャップが無い場合には、不使用マーキング処理（ステップS13）へ進む。ステップS12では、ギャップの枚数分だけ無効ピクチャを格納するが、１枚を挿入しようとする度に、図２４で示す通常のピクチャを格納するのと同様の処理を行う。 FIG. 29 is a flowchart showing the operation of the invalid picture processing of the conventional video decoding device. The difference from the memory-related operation of the conventional video decoding device described with reference to FIG. 24 is that the gap of the frame number FN is checked before the non-use marking process (step S13) (step S11). , The invalid pictures are stored in the memory 206 by the number of gaps in the memory 206 (step S12), and then the process proceeds to the unused marking process (step S13). If there is no gap, the process proceeds to the unused marking process (step S13). . At step S12, invalid pictures are stored by the number of gaps. Every time one picture is to be inserted, the same processing as that of storing a normal picture shown in FIG. 24 is performed.

図３１は従来のMPEG-2ストリームの構造を説明する概念図である。図に示すようにMPEG2のストリームは以下のような階層構造を有している。ストリーム（Stream）は複数のグループ・オブ・ピクチャ（Group Of Picture）から構成されており、これを符号化処理の基本単位とすることで動画像の編集やランダムアクセスが可能になっている。グループ・オブ・ピクチャは、複数のピクチャから構成され、各ピクチャは、Ｉピクチャ、Ｐピクチャ又はＢピクチャがある。ストリーム、GOPおよびピクチャはさらにそれぞれの単位の区切りを示す同期信号（sync）と当該単位に共通のデータであるヘッダ（header）から構成されている。MPEG-2では、Pピクチャは表示時刻が直前1枚のIピクチャもしくはPピクチャのみを参照した予測符号化が可能である。また、Bピクチャは表示時刻が直前1枚と直後１枚のIピクチャもしくはPピクチャを参照した予測符号化が可能である。更に、ストリームに配置される順序も決まっており、IピクチャもしくはPピクチャの直後に配置される。従って、ランダムアクセスの際、Iピクチャから復号を開始すれば、Iピクチャ以降に配置されるピクチャは全て復号・表示が可能であった。また、参照ピクチャは最大でも２枚までしかメモリに格納できないので、参照構造の自由度は限られていた。 FIG. 31 is a conceptual diagram illustrating the structure of a conventional MPEG-2 stream. As shown in the figure, the MPEG2 stream has the following hierarchical structure. A stream is composed of a plurality of groups of pictures, and by using this as a basic unit of encoding processing, editing of moving images and random access are possible. The group of pictures is composed of a plurality of pictures, and each picture includes an I picture, a P picture, or a B picture. The stream, GOP, and picture further include a synchronization signal (sync) indicating a break of each unit and a header (header) which is data common to the unit. In MPEG-2, a P picture can be predictively coded with reference to only one I picture or P picture whose display time is immediately before. The B picture can be subjected to predictive encoding with reference to one I picture or one P picture immediately before and after one display time. Furthermore, the order in which they are arranged in the stream is also determined, and they are arranged immediately after an I picture or a P picture. Therefore, at the time of random access, if decoding is started from an I picture, all pictures arranged after the I picture can be decoded and displayed. Further, since at most two reference pictures can be stored in the memory, the degree of freedom of the reference structure is limited.

図３２は従来のJVTの動画像符号化方法を説明する概念図である。JVTでは特別なイントラピクチャであるIDRピクチャをまたがない限り、任意に離れたピクチャを参照することも可能である。従って、例えば、符号化効率を大きくするために、多数のピクチャの符号化順を並び替えて符号化することも可能である。図ではピクチャ19、20、21、25、26、27の画像間の相関が非常に強く、また、ピクチャ22、23、24、28、29、30の画像間の相関が非常に強いとする。この場合には、ピクチャ19、20、21、25、26、27をまず画面間符号化し（GOP1）、ピクチャ22、23、24、28、29、30を画面間符号化（GOP2）することで、符号化効率を高くすることが期待できる。 FIG. 32 is a conceptual diagram illustrating a conventional JVT video coding method. In JVT, it is also possible to refer to pictures arbitrarily distant as long as they do not cross an IDR picture which is a special intra picture. Therefore, for example, in order to increase the coding efficiency, it is also possible to rearrange the coding order of a large number of pictures and perform coding. In the figure, it is assumed that the correlation between the pictures 19, 20, 21, 25, 26, and 27 is very strong, and the correlation between the pictures 22, 23, 24, 28, 29, and 30 is very strong. In this case, pictures 19, 20, 21, 25, 26, and 27 are first inter-coded (GOP1), and pictures 22, 23, 24, 28, 29, and 30 are inter-coded (GOP2). It can be expected that the coding efficiency will be increased.

図３３は従来のJVTの動画像符号化方法の動作フロー図である。JVTの動画像符号化方法では、全ての未符号化ピクチャを符号化候補とすることができる（ステップS55）。そして、符号化候補から、何らかの観点でピクチャを選択して符号化する（ステップS56）。例えば、未符号化のピクチャが10枚あるとき、この10枚を全て符号化候補とし、表示順で10枚目のピクチャを選択して符号化してもよい。符号化した後、未符号化のピクチャがあれば、またステップS55へ戻る。ステップS56では、符号化せず、さらに未符号化のピクチャが入力されるのを待ってもよい。
ISO/IEC 14496-10 Editor's Proposed Changes Relative to JVT-E146d37ncm, revision 4, 2002-12 FIG. 33 is an operation flow diagram of a conventional JVT video coding method. In the video coding method of JVT, all uncoded pictures can be set as coding candidates (step S55). Then, a picture is selected and encoded from some viewpoint from the encoding candidates (step S56). For example, when there are 10 uncoded pictures, all 10 pictures may be set as coding candidates, and the 10th picture may be selected and coded in the display order. After the encoding, if there is an uncoded picture, the process returns to step S55. In step S56, it is possible to wait for an uncoded picture to be input without coding.
ISO / IEC 14496-10 Editor's Proposed Changes Relative to JVT-E146d37ncm, revision 4, 2002-12

さて、このような従来の動画像復号化装置、及び、従来の動画像復号装置では、前述のように特別なイントラピクチャであるIDRピクチャの場所以外では、符号化されたストリームを編集が出来なかった。この問題を以下に説明する。
図２６はシーケンスの不連続が表示順情報POCの不連続をひきおこし、未表示のピクチャを消去してしまう問題を説明する概念図である。あるシーケンスの二つ部分Clip1とClip2をつなぎ合わせてデコードする場合を示している。このように編集等によって発生したシーケンスの不連続が発生している場所を編集ポイントと呼ぶことにする。この例では表示順情報POCの巡回は考えなくてもよいような表示順情報POCの最大値が設定されているとする。図中（A）はClip1をデコードした後のメモリの状態を示していて、ピクチャI19、P22、B20、B21が格納されている。それぞれの表示順情報POCは図に示すようにそれぞれ“４”、“７”、“５”、“６”であり、ピクチャI19、B20、B21がunusedとマークされているとする。次にClip2の最初のピクチャI85をデコードし、2枚目のピクチャP86をデコードする前の状態を図中（B）に示す。ここでは、ピクチャI85はピクチャB20のあった位置へ格納されたとする。次に、不使用マーキング処理を行うが、このClip2の場合、ピクチャI85がunusedとマークされたとする（図中（B））。次に、空き領域確保処理を行うが、前述のように、unusedのピクチャがあるので、unusedのピクチャの中で、最初の表示順を持つピクチャを削除するので、ピクチャI85を削除する。ここで、デコードしてから表示するまでの遅延が平均して３枚だとすると、ピクチャB21、P22、I85はまだ表示されていない。しかし、ピクチャI85はまだ表示していないにも関わらず、メモリから消去されてしまう。 By the way, in such a conventional video decoding device, and in the conventional video decoding device, it is not possible to edit an encoded stream except for the location of an IDR picture which is a special intra picture as described above. Was. This problem will be described below.
FIG. 26 is a conceptual diagram illustrating the problem that the discontinuity of the sequence causes the discontinuity of the display order information POC and the undisplayed picture is deleted. A case where two parts Clip1 and Clip2 of a certain sequence are connected and decoded is shown. The place where the discontinuity of the sequence caused by editing or the like occurs as described above is referred to as an edit point. In this example, it is assumed that the maximum value of the display order information POC is set so that it is not necessary to consider the circulation of the display order information POC. (A) in the figure shows the state of the memory after decoding of Clip1, in which pictures I19, P22, B20 and B21 are stored. It is assumed that the display order information POC is “4”, “7”, “5”, and “6”, respectively, as shown in the figure, and that the pictures I19, B20, and B21 are marked as unused. Next, the state before decoding the first picture I85 of Clip2 and before decoding the second picture P86 is shown in FIG. Here, it is assumed that picture I85 is stored at the position where picture B20 was. Next, an unused marking process is performed. In the case of this Clip2, it is assumed that the picture I85 is marked as unused ((B) in the figure). Next, a free area securing process is performed. As described above, since there is an unused picture, a picture having the first display order among the unused pictures is deleted, so that the picture I85 is deleted. Here, assuming that the delay from decoding to display is three on average, pictures B21, P22, and I85 have not been displayed yet. However, the picture I85 is deleted from the memory even though it is not displayed yet.

図３０はシーケンスの不連続がフレームナンバー FNの不連続をひきおこし、無効ピクチャが未表示のピクチャを消去してしまう問題を説明する概念図である。この例では、あるシーケンスの不連続な別の部分Clip1とClip2をつなぎあわせてデコードした様子を示している。同図（A）はピクチャP25をデコードした後のメモリの様子を示していて、ピクチャP21からピクチャP25まで5枚のピクチャが格納されている。次にClip2の最初のピクチャI60をデコードする際に、無効ピクチャを挿入した後の状態を、同図（B）が示している。ピクチャI60はFN=12であり、直前にデコードしたピクチャP25はFN=5であるので、6枚のピクチャがロストしたと判定され6枚の無効ピクチャが挿入される。この場合、メモリのピクチャは全て消去されるので、例えば同図（A）の状態で、ピクチャP23、P24、P25をまだ表示していなかったとしても、消去されてしまうという問題がある。 FIG. 30 is a conceptual diagram illustrating the problem that the discontinuity of the sequence causes the discontinuity of the frame number FN, and the invalid picture deletes the undisplayed picture. This example shows a state where another discontinuous portion Clip1 and Clip2 of a certain sequence are connected and decoded. FIG. 11A shows the state of the memory after decoding the picture P25, in which five pictures P21 to P25 are stored. Next, FIG. 12B shows a state after inserting an invalid picture when decoding the first picture I60 of Clip2. Since the picture I60 has FN = 12 and the picture P25 decoded immediately before has FN = 5, it is determined that six pictures have been lost, and six invalid pictures are inserted. In this case, since all the pictures in the memory are erased, there is a problem that the pictures P23, P24, and P25 are erased even if the pictures P23, P24, and P25 have not been displayed in the state of FIG.

図３４はJVTの符号化の自由度が、編集時やランダムアクセス時に引き起こす問題を説明する概念図である。同図（B）はオリジナルのストリームであり、図３２のストリームと同じである。同図（A）は、GOP1が無くGOP2のみをデコードする様子を示している。この場合、ピクチャ25、26、27が得られないので、ピクチャ22からピクチャ24まで再生した後、ピクチャ25からピクチャ27を再生することができないという、再生の不連続が発生する。これは編集によってGOP1を削除した場合、GOP2からランダムアクセスする場合などに問題となる。同図（C）は、GOP2が無く、GOP1までをデコードする様子を示している。この場合、やはり、ピクチャ22、23、24が得られないので、再生の不連続が発生する。これは編集によりGOP2を削除した場合に問題となる。 FIG. 34 is a conceptual diagram illustrating a problem caused by the degree of freedom of JVT coding at the time of editing or random access. FIG. 29B shows an original stream, which is the same as the stream shown in FIG. FIG. 7A shows a state in which only GOP2 is decoded without GOP1. In this case, since the pictures 25, 26, and 27 cannot be obtained, a playback discontinuity occurs in that the pictures 25 to 27 cannot be played after the pictures 22 to 24 have been played. This poses a problem when GOP1 is deleted by editing, or when GOP2 is randomly accessed. FIG. 7C shows a state in which there is no GOP2 and decoding is performed up to GOP1. In this case, the pictures 22, 23, and 24 cannot be obtained, so that discontinuity in reproduction occurs. This is a problem when GOP2 is deleted by editing.

そこで、本発明は上記の事情に鑑みてなされたものであり、特別のイントラピクチャであるIDRピクチャ以外のピクチャの場所でも編集を行うことができる動画像符号化方法および動画像復号化方法等を提供することを目的とする。 Therefore, the present invention has been made in view of the above circumstances, and a moving picture coding method and a moving picture decoding method that can perform editing at a picture location other than an IDR picture which is a special intra picture. The purpose is to provide.

上記目的を達成するために、本発明に係る動画像符号化方法は、動画像信号をピクチャ単位で符号化して符号化ストリームを生成する動画像符号化方法であって、前記ピクチャの順が不連続であることを示すフラグ情報を作成するフラグ情報作成ステップと、前記フラグ情報を前記符号化ストリームに付加する情報付加ステップとを含むことを特徴とする。 In order to achieve the above object, a moving picture coding method according to the present invention is a moving picture coding method for coding a moving picture signal on a picture basis to generate a coded stream, wherein the order of the pictures is out of order. It is characterized by including a flag information creating step of creating flag information indicating continuity, and an information adding step of adding the flag information to the encoded stream.

これによって、ピクチャの順が不連続であることを示す情報を符号化ストリームに付加することができる。
また、本発明に係る動画像符号化方法は、動画像信号をピクチャ単位で符号化して符号化ストリームを生成する動画像符号化方法であって、複数のピクチャにより構成される所定の符号化単位における最初の画面内符号化ピクチャより表示順が後となるピクチャは、当該符号化単位以降の符号化単位に含むように符号化する符号化ステップを含むことを特徴とする。 Thereby, information indicating that the order of pictures is discontinuous can be added to the encoded stream.
Also, the moving picture coding method according to the present invention is a moving picture coding method for coding a moving picture signal on a picture basis to generate a coded stream, wherein the predetermined coding unit is composed of a plurality of pictures. Is characterized in that it includes a coding step of coding a picture whose display order is later than the first intra-coded picture in, so as to be included in a coding unit subsequent to the coding unit.

これによって、ある符号化単位以降を復号化しても、再生の不連続が発生することなく、再生を行うことができる。
また、本発明に係る動画像復号化方法は、符号化ストリームをピクチャ単位で復号化する動画像復号化方法であって、前記ピクチャの順が不連続であることを示すフラグ情報を抽出する情報抽出ステップと、前記フラグ情報に基づいて復号化済みピクチャを格納する領域を管理する管理ステップとを含むことを特徴とする。 As a result, even if decoding is performed after a certain coding unit, reproduction can be performed without causing discontinuity in reproduction.
Further, the moving picture decoding method according to the present invention is a moving picture decoding method for decoding an encoded stream in units of pictures, wherein information for extracting flag information indicating that the order of the pictures is discontinuous is provided. It is characterized by including an extracting step and a managing step of managing an area for storing a decoded picture based on the flag information.

また、前記フラグ情報は、ピクチャの表示順情報が不連続であることを示す情報であり、前記管理ステップでは、前記表示順情報および前記フラグ情報に基づいて、前記領域に格納されている復号化済みピクチャの中で表示順が最も前であるピクチャを決定し、決定されたピクチャを削除対象ピクチャとしてもよい。
これによって、ピクチャの表示順情報が不連続であることに起因して非表示ピクチャを消去してしまうことを防止することができる。 Further, the flag information is information indicating that display order information of pictures is discontinuous. In the managing step, the decoding stored in the area is performed based on the display order information and the flag information. The picture with the earliest display order among the completed pictures may be determined, and the determined picture may be set as the deletion target picture.
This makes it possible to prevent a non-display picture from being erased due to discontinuous display order information of pictures.

また、前記動画像復号化方法は、さらに、ピクチャの符号化順情報が不連続である場合に、前記領域に無効ピクチャを格納する無効ピクチャ格納ステップを含み、前記フラグ情報は、前記符号化順情報が不連続であることを示す情報であり、前記管理ステップでは、前記フラグ情報および前記符号化順情報に基づいて、前記領域に無効ピクチャを格納するか否かを判定し、前記無効ピクチャ格納ステップでは、前記管理ステップでの判定結果に基づいて前記領域に無効ピクチャを格納してもよい。 Further, the video decoding method further includes an invalid picture storing step of storing an invalid picture in the area when the encoding order information of the picture is discontinuous, wherein the flag information includes the encoding order. Information indicating that the information is discontinuous. In the managing step, it is determined whether or not an invalid picture is stored in the area based on the flag information and the coding order information. In the step, an invalid picture may be stored in the area based on a determination result in the management step.

これによって、ピクチャの符号化順情報が不連続であることに起因して未表示ピクチャを消去してしまうことを防止することができる。
さらに、本発明は、このような動画像符号化方法および動画像復号化方法として実現することができるだけでなく、このような動画像符号化方法および動画像復号化方法が含む特徴的なステップを手段として備える動画像符号化装置および動画像復号化装置として実現したり、それらのステップをコンピュータに実行させるプログラムとして実現したりすることもできる。そして、そのようなプログラムは、ＣＤ−ＲＯＭ等の記録媒体やインターネット等の伝送媒体を介して配信することができるのは言うまでもない。 This makes it possible to prevent a non-display picture from being deleted due to discontinuity in the coding order information of the pictures.
Further, the present invention can be realized not only as such a moving picture coding method and a moving picture decoding method, but also includes the characteristic steps included in such a moving picture coding method and a moving picture decoding method. It can also be realized as a moving image encoding device and a moving image decoding device provided as means, or as a program for causing a computer to execute those steps. Needless to say, such a program can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.

以上の説明から明らかなように、本発明に係る動画像符号化方法によれば、特別のイントラピクチャであるIDRピクチャ以外のピクチャの場所でも編集が可能になる。 As is clear from the above description, according to the moving picture coding method according to the present invention, editing can be performed at a place of a picture other than an IDR picture which is a special intra picture.

以下、本発明の実施の形態について、図面を参照して説明する。
（実施の形態１）
図１は、本実施の形態の動画像符号化装置の構成を示すブロック図である。
動画像符号化装置は、動き検出部１０３、減算演算部１０４、符号化部１０５、動き補償部１０６、可変長符号化部１１３、復号化部１０８、加算演算部１０９、メモリ１１０、１１１、およびフラグ情報生成部１１２を備えている。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
(Embodiment 1)
FIG. 1 is a block diagram showing a configuration of the moving picture coding apparatus according to the present embodiment.
The moving picture coding apparatus includes a motion detection unit 103, a subtraction calculation unit 104, a coding unit 105, a motion compensation unit 106, a variable length coding unit 113, a decoding unit 108, an addition calculation unit 109, memories 110 and 111, A flag information generator 112 is provided.

従来の動画像符号化装置（図１９）との違いは、フラグ情報生成部１１２と、可変長符号化部１０７とは動作が相違する可変長符号化部１１３とを備えている点である。
フラグ情報生成部１１２は、編集等によって、表示順情報POCが不連続になる場合、表示順情報POCが不連続であることを示すフラグを生成する。
可変長符号化部１１３は、入力された符号化信号に対して可変長符号化等を行い、さらにフラグ情報生成部１１２によって生成されたフラグ、および動き補償部１０６から入力された動きベクトルMV等の情報を付加することにより符号化ストリームStrを生成し、動画像符号化装置の外部へ出力する。 The difference from the conventional moving picture coding apparatus (FIG. 19) is that a flag information generating unit 112 and a variable length coding unit 113 that operates differently from the variable length coding unit 107 are provided.
When the display order information POC is discontinuous due to editing or the like, the flag information generation unit 112 generates a flag indicating that the display order information POC is discontinuous.
The variable-length coding unit 113 performs variable-length coding and the like on the input coded signal, and further generates a flag generated by the flag information generation unit 112, a motion vector MV input from the motion compensation unit 106, and the like. To generate an encoded stream Str, and output the encoded stream Str to the outside of the video encoding apparatus.

図２は本発明の動画像符号化方法および動画像復号化方法の概念を説明する図である。この図は従来の課題を説明した図２６の場合を解決する様子を説明している。まず編集等によって、表示順情報POCが不連続になっていることを符号化時に付加されたフラグによって検出する。このフラグをフラグAと呼ぶ。フラグAとは、編集等によって表示順情報POCが不連続であることを示すフラグである。 FIG. 2 is a diagram for explaining the concept of the moving picture coding method and the moving picture decoding method of the present invention. This figure illustrates how to solve the case of FIG. 26 which describes the conventional problem. First, the discontinuity of the display order information POC is detected by a flag added at the time of encoding by editing or the like. This flag is called flag A. The flag A is a flag indicating that the display order information POC is discontinuous due to editing or the like.

このフラグAは図中に示すように、Clipの直前に置く特別な情報とする。JVTではSupplemental enhancement information (以下、SEIと略す)と呼ばれるビデオデコードの付加的な情報を格納する単位が定義されているのでこれに格納することとする。ユーザが独自に定義可能な User data registered SEIに格納してもよいし、あるいは、ランダムアクセスのための情報を格納する Random access point SEI (以下、RAP SEIと略す)に格納してもよい。RAP SEIには編集等によりデコードした動画像が本来の動画像とは異なるかもしれないことを示すbroken_link_flag、RAP SEIのある位置から表示順で n 枚目以降のピクチャをデコードした動画像が本来の動画像と同等もしくはほぼ同等となる場合、その n枚を示す recovery_frame_cnt等の情報が格納されている。本発明では、RAP SEIのbroken_link_flagが“１”のときは、編集が行われていると検出し、RAP SEI以降の最初のピクチャの直前を編集ポイントとする。あるいは、recovery_frame_cntの指すピクチャの直前を編集ポイントする。あるいは、RAP_SEI以降の最初の独立にデコード可能なピクチャ（例えばIntra Picture）を編集ポイントとする。なお編集ポイントとはピクチャの境界のみを指していて、SEIの境界を定めているわけではない。他にも、シーケンスを格納しているファイルフォーマットには、各ピクチャのランダムアクセス情報を格納していることがあり、それら情報には編集されていることを示す情報、さらには、編集ポイントの情報が格納されている場合もある。その場合には、ファイルフォーマットの情報にしたがって、編集の検出と編集ポイントの特定を行えばよい。これらの格納形式をフラグ形式Aの格納形式と呼ぶ。 This flag A is special information to be placed immediately before Clip as shown in the figure. JVT defines a unit called Supplemental enhancement information (hereinafter abbreviated as SEI) for storing additional information for video decoding, and will store it in this unit. The information may be stored in User data registered SEI that can be uniquely defined by the user, or may be stored in Random access point SEI (hereinafter abbreviated as RAP SEI) that stores information for random access. The RAP SEI contains a broken_link_flag indicating that the video decoded by editing etc. may be different from the original video, and the video decoded from the nth picture in the display order from the position of the RAP SEI contains the original video. When the moving image is equal to or almost equal to the moving image, information such as recovery_frame_cnt indicating the n images is stored. In the present invention, when the broken_link_flag of the RAP SEI is “1”, it is detected that editing has been performed, and the edit point is set immediately before the first picture after the RAP SEI. Or, edit point immediately before the picture pointed to by recovery_frame_cnt. Alternatively, the first independently decodable picture after RAP_SEI (for example, Intra Picture) is set as the edit point. Note that the edit point indicates only the boundary of a picture, and does not define the boundary of SEI. In addition, the file format storing the sequence may store random access information of each picture, which information indicates that the image is being edited, and further includes information on the editing point. May be stored. In that case, the detection of the editing and the specification of the editing point may be performed according to the information of the file format. These storage formats are referred to as storage formats of the flag format A.

図３は本実施の形態の動画像復号化装置の構成を示すブロック図である。従来の動画像復号化装置（図２３）との違いは、編集検出部２０３が追加されたこと、メモリ管理部４０１とは動作が相違するメモリ管理部２０５を備えていることである。
編集検出部２０３は可変長復号化部２０１から表示順情報POCが不連続になっていることを示すフラグ、あるいは、編集ポイント情報を格納する情報を取得し、解析し、メモリ管理部２０５へ制御信号mctrlcを出力する。メモリ管理部２０５は編集を意味する制御信号mctrlcが入力されたならば、編集ポイントより前のピクチャが、編集ポイントより後のピクチャよりも表示順が前になるように管理する。つまり、unusedピクチャの中から削除するピクチャを選択するときに、編集ポイント前のピクチャの表示順は、編集ポイント後のピクチャよりも前であるとする。 FIG. 3 is a block diagram showing a configuration of the moving picture decoding apparatus according to the present embodiment. The difference from the conventional video decoding device (FIG. 23) is that an edit detection unit 203 has been added and that a memory management unit 205 having an operation different from that of the memory management unit 401 is provided.
The edit detection unit 203 acquires a flag indicating that the display order information POC is discontinuous or information storing edit point information from the variable length decoding unit 201, analyzes the information, and controls the memory management unit 205. Outputs signal mctrlc. When the control signal mctrlc indicating the editing is input, the memory management unit 205 manages the picture before the editing point so that the display order is before the picture after the editing point. That is, when a picture to be deleted is selected from unused pictures, the display order of the picture before the edit point is assumed to be before the picture after the edit point.

メモリ管理部２０５は、表示順を管理するために、編集ポイントを超えるたびに１ずつ増加するClipカウンタを各ピクチャに保持させる。図中（B）に示すように、ピクチャB20、P22、B21はClip=1と記録され、編集ポイントより後のピクチャI85はClip=2と記録される。この状態での不使用マーキング処理の処理では、unusedピクチャの最初にデコードしたクリップのピクチャ（Clip=1とマークされているピクチャB20、P22、B21）から最も表示順の前のピクチャB20を削除する。これによって、未表示のピクチャ（従来の課題ではピクチャI85）を削除してしまうという問題を解決できる。 In order to manage the display order, the memory management unit 205 causes each picture to hold a Clip counter that increases by one each time the edit point is exceeded. As shown in (B) in the figure, pictures B20, P22, and B21 are recorded as Clip = 1, and picture I85 after the edit point is recorded as Clip = 2. In the unused marking processing in this state, the picture B20 in the display order that is the earliest in the display order is deleted from the pictures of the first decoded clip of the unused picture (pictures B20, P22, and B21 marked Clip = 1). . This solves the problem of deleting an undisplayed picture (picture I85 in the conventional problem).

図４は本実施の形態の動画像復号化方法の動作フロー図である。従来の動画像復号化方法（図２４）との違いは、ステップS31とステップS32を追加したこと、ステップS14を修正しステップS14Bとしたことである。ピクチャ単位の処理開始（ステップS1）後、編集されているかを調べ（ステップS31）、編集されているならば編集ポイントでの処理を行う（ステップS32）。編集を検出しなければ、従来と同じように不使用マーキング処理（ステップS13）を行い、編集ポイント前後のデコード順を考慮した空き領域確保処理（ステップS14B）を行う。編集ポイントでの処理とは、編集ポイントの前後を区別できるようにすることであり、メモリ管理部２０５は、編集ポイントを超えるごとにClipカウンタを一つ増加させる。 FIG. 4 is an operation flowchart of the moving picture decoding method according to the present embodiment. The difference from the conventional moving picture decoding method (FIG. 24) is that step S31 and step S32 are added, and step S14 is modified to step S14B. After the start of processing in units of pictures (step S1), it is checked whether the image has been edited (step S31). If the image has been edited, processing is performed at the edit point (step S32). If the editing is not detected, the unused marking processing (step S13) is performed as in the conventional case, and the free area securing processing (step S14B) is performed in consideration of the decoding order before and after the editing point. The processing at the edit point is to enable the discrimination between before and after the edit point, and the memory management unit 205 increases the Clip counter by one each time the edit point is exceeded.

図５は本実施の形態の動画像復号化方法の空き領域確保処理の動作フロー図である。従来の空き領域確保の方法（図２４）との違いは、ステップS43を修正したステップS43Bである。unusedのピクチャがメモリに格納されている場合（ステップS41）は、デコード順で前のクリップを優先してunusedピクチャを含むクリップを探し、そのクリップのunused ピクチャの中で、最初の表示順のピクチャを削除する。言い換えると、デコード順で最初にunusedのピクチャを含むClipにあるunusedピクチャであって、それらピクチャの中で最初の表示順のピクチャを削除する。あるいは、言い換えると、デコード順で最初のunusedのピクチャの直前、直後の編集ポイント間に含まれる unusedピクチャの中で、最初の表示順のピクチャを削除する。 FIG. 5 is an operation flowchart of a free area securing process of the moving picture decoding method according to the present embodiment. The difference from the conventional method of securing an empty area (FIG. 24) is that step S43B is a modification of step S43. If the unused picture is stored in the memory (step S41), the clip including the unused picture is searched for in priority to the previous clip in the decoding order, and the picture in the first display order among the unused pictures of the clip is searched. Remove. In other words, the unused pictures in the clip including the unused picture first in the decoding order, and the picture in the first display order among those pictures is deleted. Alternatively, in other words, the picture in the first display order is deleted from among the unused pictures included immediately before and immediately after the first unused picture in the decoding order and between the edit points.

上記のように、編集等によって表示順情報POCが不連続になっていることを符号化時に付加されたフラグにより検出した上で、削除するピクチャ決定しているので、未表示のピクチャ（従来の図２６に示す例ではピクチャI85）を削除してしまうという問題を解決できる。
このような本発明の動画像復号化方法（図２、図３、図４、図５）で編集の不連続が解決できるためには、編集ポイントを示す情報が必要である。従って、編集ポイント情報が含まれていることを示す情報が復号化装置にとって入手しやすい場所にあることが望ましい。 As described above, since the discontinuation of the display order information POC due to editing or the like is detected by the flag added at the time of encoding, and the picture to be deleted is determined, the undisplayed picture (conventional In the example shown in FIG. 26, the problem of deleting picture I85) can be solved.
In order to solve the discontinuity of editing by the moving picture decoding method of the present invention (FIGS. 2, 3, 4, and 5), information indicating an editing point is required. Therefore, it is desirable that the information indicating that the edit point information is included be located in a place easily available to the decoding apparatus.

本実施の形態では、表示順情報POCが不連続になっていることを示す符号化時に付加されたフラグを、表示順情報POCが不連続になっているピクチャの間に付加しているが、これに限られるものではない。例えば、フラグ情報生成部１１２が、表示順情報POCが不連続になっていることを示すフラグと、表示順情報POCが不連続になっている位置（編集ポイント）を特定する情報とを生成しても構わない。そして、これらの情報をシーケンスパラメータセットへ格納する、あるいは、ユーザが独自に定義可能な User data registered SEIへ格納し、シーケンスの入手しやすい場所、例えば、先頭に配置する、あるいは、シーケンスを記録する媒体に格納する、あるいは、シーケンスを管理するファイルフォーマットに格納する。これらの格納形式をフラグ形式A2と呼ぶ。 In the present embodiment, a flag added during encoding indicating that the display order information POC is discontinuous is added between pictures in which the display order information POC is discontinuous. It is not limited to this. For example, the flag information generation unit 112 generates a flag indicating that the display order information POC is discontinuous, and information identifying a position (edit point) where the display order information POC is discontinuous. It does not matter. Then, store this information in the sequence parameter set, or store it in User data registered SEI, which can be uniquely defined by the user, and place it at a location where the sequence is easily available, for example, at the beginning, or record the sequence It is stored in a medium or stored in a file format for managing the sequence. These storage formats are called flag format A2.

この場合、復号化時には、編集検出部２０３はこれらの場所からフラグA2を取得し、編集ポイント情報が得られるのであれば、本発明の動画像復号化方法（図２、図３、図４、図５）を行う。
（実施の形態２）
本実施の形態における動画像符号化装置の構成は、図１に示す実施の形態１のブロック図と同様である。 In this case, at the time of decoding, the editing detection unit 203 acquires the flag A2 from these locations, and if the editing point information can be obtained, the moving picture decoding method of the present invention (FIGS. 2, 3, 4, Perform FIG. 5).
(Embodiment 2)
The configuration of the moving picture coding apparatus according to the present embodiment is the same as the block diagram of the first embodiment shown in FIG.

本実施の形態では、フラグ情報生成部１１２は、編集等によってフレームナンバー FNが不連続になる場合、フレームナンバー FNが不連続になっていることを示すフラグBを生成する。
なお、フラグ情報生成部１１２が生成するフラグBは、無効ピクチャの挿入を行わないように指示するフラグであってもよい。フラグBの付与の形式は実施の形態1に示すフラグAの形式と同等である。 In the present embodiment, when the frame number FN is discontinuous due to editing or the like, the flag information generating unit 112 generates a flag B indicating that the frame number FN is discontinuous.
Note that the flag B generated by the flag information generation unit 112 may be a flag that instructs not to insert an invalid picture. The format of the flag B is the same as the format of the flag A shown in the first embodiment.

図６は本実施の形態の動画像復号化装置の構成を示すブロック図である。この図は図２８で説明した従来の動画像復号化装置に対して、編集検出部２１４を追加し、メモリ管理部２１２を変更したものである。
編集検出部２１４は、可変長復号化部２０１からフレームナンバー FNが不連続になっていることを示すフラグを取得し、メモリ管理部２１２へ制御信号ctrl_cを出力する。メモリ管理部２１２は、FNギャップ検出部２１１から制御信号ctrl_cによって無効ピクチャの挿入の要求が入力された場合でも、編集検出部２１４から編集されていることが通知された場合は、無効ピクチャの挿入を行わないこととする。 FIG. 6 is a block diagram showing a configuration of the moving picture decoding apparatus according to the present embodiment. This figure is obtained by adding an edit detection unit 214 and changing the memory management unit 212 to the conventional video decoding device described with reference to FIG.
The edit detection unit 214 acquires a flag indicating that the frame number FN is discontinuous from the variable length decoding unit 201, and outputs a control signal ctrl_c to the memory management unit 212. Even when a request to insert an invalid picture is input by the control signal ctrl_c from the FN gap detection unit 211, the memory management unit 212 inserts an invalid picture when notified by the edit detection unit 214 that it is being edited. Shall not be performed.

図７は本発明の動画像復号化方法の動作フロー図である。従来の動画像復号化方法（図２９）との違いは、ステップS31を追加したことと、ステップS14を修正しステップS14Bとしたことである。これ以外のステップは図２９の同じ符号をもつステップと同じ動作をするので、説明を省略する。また、ステップS31とステップS14Bは実施の形態1で説明した本発明の復号化装置のステップS31、及び、ステップS14Bと同じであるので、説明を省略する。 FIG. 7 is an operation flowchart of the moving picture decoding method of the present invention. The difference from the conventional moving picture decoding method (FIG. 29) is that step S31 is added and step S14 is modified to step S14B. The other steps perform the same operation as the steps having the same reference numerals in FIG. 29, and thus the description will be omitted. Further, Step S31 and Step S14B are the same as Step S31 and Step S14B of the decoding apparatus of the present invention described in Embodiment 1, and thus description thereof is omitted.

上記のように、編集等によってフレームナンバー FNが不連続になっていることを符号化時に付加されたフラグにより検出した上で、無効ピクチャの挿入を決定しているので、未表示のピクチャ（従来の図３０に示す例ではピクチャP23、P24、P25）を削除してしまうという問題を解決できる。
なお、本実施の形態では、実施の形態1と同様にフレームナンバー FNが不連続になっていることを示す符号化時に付加されたフラグを、フレームナンバー FNが不連続になっているピクチャの間に付加しているが、これに限られるものではない。例えば、実施の形態1と同様に、フラグ情報生成部１１２が、フレームナンバー FNが不連続になっていることを示すフラグと、フレームナンバー FNが不連続になっている位置（編集ポイント）を特定する情報とを生成しても構わない。そして、これらの情報を、実施の形態1と同様に格納する。 As described above, the insertion of an invalid picture is determined based on the flag added at the time of encoding that the frame number FN is discontinuous due to editing or the like. In the example shown in FIG. 30, the problem of deleting the pictures P23, P24, and P25) can be solved.
Note that, in the present embodiment, a flag added at the time of encoding indicating that the frame number FN is discontinuous, as in Embodiment 1, is used between pictures in which the frame number FN is discontinuous. , But is not limited to this. For example, as in the first embodiment, the flag information generation unit 112 specifies a flag indicating that the frame number FN is discontinuous and a position (edit point) where the frame number FN is discontinuous. May be generated. Then, these pieces of information are stored as in the first embodiment.

この場合、復号化時には、編集検出部２０３はこれらの場所から実施の形態1と同様に、フラグB2を取得し、編集ポイント情報が得られるならば、本発明の動画像復号化方法（図６、図７）の処理を行う。
図８は、実施の形態1と実施の形態2における、本発明の動画像符号化方法の出力するデータの構造、本発明の動画像復号化方法の入力するデータの構造を示す図である。符号化された動画像信号であるシーケンスは、同図（A）に示すように、RAP、MMCO、PICTUREのデータを含む。RAPはRandom access point SEIのことであり、その中の broken_link_fieldが、実施の形態1のフラグAであり、また、実施の形態2におけるフラグBである。PICTUREはピクチャ単位に符号化された動画像信号であり、PICTUREの前にはMMCOがある場合もある（ない場合もある）。MMCOとはMemory management control operationの指示情報である。また、同図（B）に示すように、シーケンスの内部、あるいは、シーケンスに関連付けられたファイルフォーマットの所定の位置、あるいは、シーケンスを記録する記録媒体等に、実施の形態1のフラグA2であり、また、実施の形態2におけるフラグB2が格納されている。 In this case, at the time of decoding, the editing detection unit 203 acquires the flag B2 from these locations as in Embodiment 1, and if the editing point information is obtained, the moving picture decoding method of the present invention (FIG. 6). , FIG. 7).
FIG. 8 is a diagram showing the structure of data output by the moving picture coding method of the present invention and the structure of data input by the moving picture decoding method of the present invention in Embodiments 1 and 2. The sequence as an encoded moving image signal includes RAP, MMCO, and PICTURE data as shown in FIG. RAP stands for Random access point SEI, in which broken_link_field is the flag A in the first embodiment and the flag B in the second embodiment. PICTURE is a moving picture signal encoded in picture units, and PICTURE may be preceded by MMCO (or may not exist). MMCO is instruction information of Memory management control operation. Also, as shown in FIG. 3B, the flag A2 of the first embodiment is provided inside the sequence, at a predetermined position in a file format associated with the sequence, or on a recording medium for recording the sequence. Further, the flag B2 in the second embodiment is stored.

（実施の形態３）
図９は、本実施の形態に係る動画像符号化装置の構成を示すブロック図である。
動画像符号化装置は、並び替えメモリ１０１、符号化スケジューリング部１０２、動き検出部１０３、減算演算部１０４、符号化部１０５、動き補償部１０６、可変長符号化部１０７、復号化部１０８、加算演算部１０９、およびメモリ１１０、１１１を備えている。 (Embodiment 3)
FIG. 9 is a block diagram showing a configuration of the video encoding device according to the present embodiment.
The moving picture coding apparatus includes a rearrangement memory 101, a coding scheduling unit 102, a motion detection unit 103, a subtraction calculation unit 104, a coding unit 105, a motion compensation unit 106, a variable length coding unit 107, a decoding unit 108, An addition operation unit 109 and memories 110 and 111 are provided.

並べ替え用メモリ１０１は、表示時間順にピクチャ単位で入力された動画像を格納する。符号化スケジューリング部１０２は、並べ替え用メモリ１０１に格納された各ピクチャを符号化が行われる順に並び替えを行う。
図１０は本実施の形態の動画像符号化方法の概念を説明する図である。図３４に示す問題を解決する、本実施の形態の動画像符号化方法は、あるGOPには表示順の連続するピクチャしか格納しないこと、そして、あるGOPの任意のピクチャの表示順は、次にデコードするGOPの任意のピクチャの表示順よりも前となるように符号化することである。このように符号化することで、GOP1、GOP2ともに、図３４に示すケースにおいて、再生の不連続は発生しない。 The reordering memory 101 stores moving images input in picture units in display time order. The encoding scheduling unit 102 rearranges the pictures stored in the rearrangement memory 101 in the order in which the encoding is performed.
FIG. 10 is a diagram for explaining the concept of the moving picture coding method according to the present embodiment. The moving picture coding method according to the present embodiment that solves the problem shown in FIG. 34 stores only consecutive pictures in a certain GOP in a display order, and the display order of an arbitrary picture in a certain GOP is as follows. Is to be coded so as to be earlier than the display order of an arbitrary picture of the GOP to be decoded. By performing encoding in this manner, in GOP1 and GOP2, reproduction discontinuity does not occur in the case shown in FIG.

図１１は本実施の形態の動画像符号化方法の動作フロー図である。同図（a）を用いて動作を説明する。未符号化のピクチャの中から、表示順で最も前の方から連続するピクチャを表示基本単位とする（ステップS61）。つまり、表示順で不連続にならないような1枚以上のピクチャを表示基本単位とし、この表示基本単位よりも、表示が前である未符号化ピクチャは無いように、表示基本単位を定める。次に、表示基本単位の中に未符号化のピクチャがあるかを調べ（ステップS62）、あれば（ステップS62でYes）、表示基本単位の中の未符号化ピクチャを符号化候補とし、この符号化候補から選択して符号化する（ステップS63）。未符号化のピクチャがあるか調べ（ステップS64）、あれば（ステップS64でYes）、ステップS62へ進む。無ければ（ステップS64でNo）終了する。なお、表示基本単位は、条件「未符号化のピクチャの中で、表示順が最も前のピクチャから、符号化済みピクチャの中で表示順が最も後のピクチャまでは、少なくとも表示基本単位に含まれる」を満たす限りにおいて、任意のタイミングで変更可能である。 FIG. 11 is an operation flowchart of the moving picture coding method according to the present embodiment. The operation will be described with reference to FIG. Among the uncoded pictures, a picture continuous from the earliest in the display order is set as a display basic unit (step S61). That is, one or more pictures that do not become discontinuous in the display order are set as the display basic units, and the display basic units are determined so that there is no uncoded picture preceding the display basic unit. Next, it is checked whether or not there is an uncoded picture in the display basic unit (step S62). If there is (Yes in step S62), the uncoded picture in the display basic unit is set as a coding candidate. The encoding is selected from the encoding candidates (step S63). It is checked whether there is an uncoded picture (step S64), and if there is one (Yes in step S64), the process proceeds to step S62. If not (No in step S64), the process ends. Note that the display basic unit includes at least the condition “from the earliest picture in display order among uncoded pictures to the last picture in display order among encoded pictures, Can be changed at any time as long as the condition is satisfied.

また、同図（B）も、本実施の形態の動画像符号化方法の動作フロー図であるが、この方法では、未符号化のピクチャの中から、表示順で最も前のピクチャから、符号化済みピクチャの中で表示順が最も後のピクチャまでを、必須の符号化候補とし（ステップS71）、必須の符号化候補を含めて、未符号化のピクチャから選択して符号化する（ステップS72）。次にIピクチャ以外の未符号化のピクチャがあるかを調べ（ステップS73）、あれば（ステップS73でYes）、ステップS71へ進み、無ければ（ステップS73でNo）終了する。なお、ここでは次のIピクチャまでをGOPの候補としているが、これに限られるものではなく、例えばファイルフォーマットにおけるGOPの記載によって、GOPの最後を決定してもよい。 FIG. 2B is also an operation flow diagram of the moving picture coding method according to the present embodiment. In this method, the coding is performed from the uncoded pictures in the display order from the earliest picture in display order. Among the coded pictures, up to the picture in the display order which is the last one is set as an essential coding candidate (step S71), and is selected and coded from uncoded pictures, including the essential coding candidate (step S71). S72). Next, it is checked whether there is any uncoded picture other than the I picture (step S73), and if there is (Yes in step S73), the process proceeds to step S71, and if not (No in step S73), the process ends. Here, the GOP candidates up to the next I picture are used, but the present invention is not limited to this. For example, the end of the GOP may be determined by describing the GOP in a file format.

上記のようにGOP を決定しているので、例えば図３４に示す場合のようにGOP1、GOP2それぞれにおいて再生の不連続が発生するのを防止することができる。
なお、本実施の形態で説明したように符号化されたことを示すフラグを符号化ストリームに付加しても構わない。 Since the GOP is determined as described above, discontinuity in reproduction can be prevented from occurring in each of GOP1 and GOP2 as shown in FIG. 34, for example.
Note that a flag indicating that encoding has been performed as described in the present embodiment may be added to the encoded stream.

（実施の形態４）
図１２は本実施の形態の動画像符号化方法の概念を説明する図である。実施の形態３では、編集の問題解決とランダムアクセス時の問題解決を同時に行う方法であったが、本実施の形態における本動画像符号化方法は、ランダムアクセス時の問題を解決する方法である。実施の形態３の方法よりも、制約が緩いので、符号化効率などを向上させることができる。 (Embodiment 4)
FIG. 12 is a diagram for explaining the concept of the moving picture coding method according to the present embodiment. In the third embodiment, the problem of editing and the problem of random access are simultaneously solved. However, the main moving picture coding method in the present embodiment is a method for solving the problem of random access. . Since restrictions are less strict than in the method of the third embodiment, coding efficiency and the like can be improved.

図１２のGOP２を例として説明すると、この動画像符号化方法では、１）あるGOPのイントラピクチャ（I25）よりも表示時間が後のピクチャ（B26、B27、P28）は、そのイントラピクチャを含むGOP（GOP2）の直前のGOP（GOP1）で符号化しない。このように制御することで、図中CaseAで示しているように、GOP2の最初のピクチャ（I25）からデコードを開始しても、最初のピクチャ以降のピクチャを全て正しく表示することが可能となる。 Taking GOP2 in FIG. 12 as an example, in this video coding method, 1) pictures (B26, B27, P28) whose display time is later than the intra-picture (I25) of a certain GOP include the intra-picture It is not coded in the GOP (GOP1) immediately before the GOP (GOP2). By performing such control, as shown by Case A in the figure, even if decoding is started from the first picture (I25) of GOP2, it is possible to correctly display all pictures after the first picture. .

２）さらに、あるGOPのイントラピクチャ（I25）よりも表示時間が前で、かつ、そのGOPの直前GOPのイントラピクチャ（I19）よりも表示時間が後のピクチャ（I19、B20、B21、B22、B23、P24）は、そのGOP（GOP2）または直前GOP（GOP1）内で符号化する。このように制御することで、GOP1の最初のピクチャ（I19）からデコードを開始しても、GOP1の最初のピクチャ（I19）以降のピクチャを全て正しく表示することが可能となる。 2) Further, pictures (I19, B20, B21, B22, and B22) whose display time is earlier than the intra picture (I25) of a certain GOP and whose display time is later than the intra picture (I19) of the GOP immediately before the GOP. B23, P24) are encoded within the GOP (GOP2) or the immediately preceding GOP (GOP1). By controlling in this way, even if decoding is started from the first picture (I19) of GOP1, all pictures after the first picture (I19) of GOP1 can be correctly displayed.

あるいは、いいかえると、１）GOP1を例にとって、あるGOPの最後に表示するピクチャは、次のGOPのIピクチャ（I25）よりも前に表示されるようなピクチャを選んで符号化する（つまり、ピクチャP24以前を選択しなければならない）。２）GOP2を例にとって、あるGOP内の最初に表示するピクチャは、直前GOPのIピクチャ（I19）よりも後に表示されるようなピクチャから選択して符号化する（つまり、ピクチャB20以降を選択しなければならない）。 Or, in other words, 1) Taking GOP1 as an example, the picture to be displayed at the end of a certain GOP is selected and coded by a picture that is displayed before the I picture (I25) of the next GOP (that is, Picture P24 or earlier must be selected). 2) Taking GOP2 as an example, the first picture to be displayed in a certain GOP is selected from the pictures displayed after the I picture (I19) of the immediately preceding GOP and encoded (that is, picture B20 and subsequent pictures are selected). Must).

あるいは、いいかえると、あるGOPの最初に表示するピクチャの表示順は直前GOPのIピクチャよりも表示順が後で、そのGOPの最後に表示するピクチャの表示順は直後のGOPのIピクチャよりも表示順が前であるように、符号化する動画像符号化方法である。なおここで、Iピクチャとして説明しているが、独立にデコード可能なピクチャに対しても同等に適用可能である。 Or, in other words, the display order of the picture displayed first in a GOP is later than the I picture of the immediately preceding GOP, and the display order of the picture displayed last in the GOP is later than the I picture of the immediately following GOP. This is a moving picture coding method for coding so that the display order is first. Here, the description is made as an I picture, but the present invention is equally applicable to a picture that can be decoded independently.

図１３は本実施の形態の動画像符号化方法の動作フロー図である。まず、未符号化のピクチャを選択して、エントリピクチャとして符号化する（ステップS81）。エントリピクチャとは独立にデコード可能なピクチャのことである。次に、未符号化のピクチャの中から、最後に符号化したエントリピクチャよりも表示順が前のピクチャを、必須の符号化候補とし、次に符号化予定のエントリピクチャよりも表示順が前の未符号化のピクチャを省略可能な符号化候補とする（ステップS82）。次に、必須の符号化候補に未符号化のピクチャがあるかを調べ（ステップS83）、あれば（ステップS83でYes）、必須の符号化候補、及び、省略可能な符号化候補から選択して符号化する（ステップS85）。次に、未符号化のピクチャがあるかを調べ（ステップS86）、あれば、ステップS83へ進み、なければ、処理を終了する。ステップS83において、必須の符号化候補に未符号化のピクチャがない場合（ステップS83でNo）は、ステップS84において、次のエントリピクチャの符号化をするかを判定する（ステップS84）。エントリピクチャの符号化をする場合（ステップS84でYes）には、ステップS81へ進み、符号化をしない場合（ステップS84でNo）には、ステップS85へ進む。 FIG. 13 is an operation flowchart of the moving picture coding method according to the present embodiment. First, an uncoded picture is selected and coded as an entry picture (step S81). An entry picture is a picture that can be decoded independently. Next, among the uncoded pictures, a picture whose display order is earlier than the last encoded entry picture is set as an essential encoding candidate, and a display order which is earlier than the next entry picture to be encoded is set as a required encoding candidate. The uncoded picture is set as a coding candidate that can be omitted (step S82). Next, it is checked whether there is an uncoded picture in the essential coding candidate (step S83), and if there is (Yes in step S83), it is selected from the essential coding candidate and the optional coding candidate. (Step S85). Next, it is checked whether there is an uncoded picture (step S86). If there is, the process proceeds to step S83; otherwise, the process ends. In step S83, if there is no uncoded picture in the essential coding candidates (No in step S83), it is determined in step S84 whether to encode the next entry picture (step S84). If the entry picture is to be coded (Yes in step S84), the process proceeds to step S81. If not to be coded (No in step S84), the process proceeds to step S85.

上記のようにGOPにおける最初のイントラピクチャより表示順が後となるピクチャは、このGOP以降のGOPに含むように符号化しているので、あるGOP 以降を復号化しても、再生の不連続が発生することなく、再生を行うことができる。
なお、本実施の形態で説明したように符号化されたことを示すフラグを符号化ストリームに付加しても構わない。 As described above, pictures that are displayed after the first intra picture in the GOP are encoded so as to be included in the GOPs after this GOP, so even if a certain GOP or later is decoded, playback discontinuity occurs. The reproduction can be performed without performing.
Note that a flag indicating that encoding has been performed as described in the present embodiment may be added to the encoded stream.

（実施の形態５）
さらに、上記各実施の形態で示した動画像符号化方法または動画像復号化方法の構成を実現するためのプログラムを、フレキシブルディスク等の記憶媒体に記録するようにすることにより、上記各実施の形態で示した処理を、独立したコンピュータシステムにおいて簡単に実施することが可能となる。 (Embodiment 5)
Furthermore, by recording a program for realizing the configuration of the moving picture encoding method or the moving picture decoding method described in each of the above embodiments on a storage medium such as a flexible disk, The processing described in the embodiment can be easily performed by an independent computer system.

図１４は、上記各実施の形態の動画像符号化方法または動画像復号化方法を格納したフレキシブルディスクを用いて、コンピュータシステムにより実施する場合の説明図である。
図１４(b) は、フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示し、図１４(a) は、記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示している。フレキシブルディスクＦＤはケースＦ内に内蔵され、該ディスクの表面には、同心円状に外周からは内周に向かって複数のトラックＴｒが形成され、各トラックは角度方向に１６のセクタＳｅに分割されている。従って、上記プログラムを格納したフレキシブルディスクでは、上記フレキシブルディスクＦＤ上に割り当てられた領域に、上記プログラムとしての動画像符号化方法および動画像復号化方法が記録されている。 FIG. 14 is an explanatory diagram of a case where the present invention is implemented by a computer system using a flexible disk storing the moving picture coding method or the moving picture decoding method of each of the above embodiments.
FIG. 14B shows the appearance, cross-sectional structure, and flexible disk of the flexible disk viewed from the front, and FIG. 14A shows an example of the physical format of the flexible disk which is a recording medium body. The flexible disk FD is built in the case F, and a plurality of tracks Tr are formed concentrically from the outer circumference toward the inner circumference on the surface of the disk, and each track is divided into 16 sectors Se in an angular direction. ing. Therefore, in the flexible disk storing the program, the moving image encoding method and the moving image decoding method as the program are recorded in the area allocated on the flexible disk FD.

また、図１４(c) は、フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示す。上記プログラムをフレキシブルディスクＦＤに記録する場合は、コンピュータシステムＣｓから上記プログラムとしての動画像符号化方法または動画像復号化方法をフレキシブルディスクドライブを介して書き込む。また、フレキシブルディスク内のプログラムにより上記動画像符号化方法および動画像復号化方法をコンピュータシステム中に構築する場合は、フレキシブルディスクドライブによりプログラムをフレキシブルディスクから読み出し、コンピュータシステムに転送する。 FIG. 14C shows a configuration for recording and reproducing the program on the flexible disk FD. When the above program is recorded on the flexible disk FD, the moving picture coding method or the moving picture decoding method as the above program is written from the computer system Cs via the flexible disk drive. When the moving picture encoding method and the moving picture decoding method are to be constructed in a computer system using a program in a flexible disk, the program is read from the flexible disk by a flexible disk drive and transferred to the computer system.

なお、上記説明では、記録媒体としてフレキシブルディスクを用いて説明を行ったが、光ディスクを用いても同様に行うことができる。また、記録媒体はこれに限らず、ＩＣカード、ＲＯＭカセット等、プログラムを記録できるものであれば同様に実施することができる。
さらにここで、上記実施の形態で示した動画像符号化方法や動画像復号化方法の応用例とそれを用いたシステムを説明する。 In the above description, the description has been made using a flexible disk as a recording medium. However, the same description can be made using an optical disk. Further, the recording medium is not limited to this, and the present invention can be similarly implemented as long as the program can be recorded, such as an IC card or a ROM cassette.
Further, here, application examples of the moving picture coding method and the moving picture decoding method described in the above embodiment and a system using the same will be described.

図１５は、コンテンツ配信サービスを実現するコンテンツ供給システムex１００の全体構成を示すブロック図である。通信サービスの提供エリアを所望の大きさに分割し、各セル内にそれぞれ固定無線局である基地局ex１０７〜ex１１０が設置されている。
このコンテンツ供給システムex１００は、例えば、インターネットex１０１にインターネットサービスプロバイダex１０２および電話網ex１０４、および基地局ex１０７〜ex１１０を介して、コンピュータex１１１、ＰＤＡ（personal digital assistant）ex１１２、カメラex１１３、携帯電話ex１１４、カメラ付きの携帯電話ｅｘ１１５などの各機器が接続される。 FIG. 15 is a block diagram illustrating an overall configuration of a content supply system ex100 that realizes a content distribution service. A communication service providing area is divided into desired sizes, and base stations ex107 to ex110, which are fixed wireless stations, are installed in each cell.
The content supply system ex100 includes, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, and a camera on the Internet ex101 via the Internet service provider ex102 and the telephone network ex104, and the base stations ex107 to ex110. Each device such as a mobile phone ex115 with a tag is connected.

しかし、コンテンツ供給システムex１００は図１５のような組合せに限定されず、いずれかを組み合わせて接続するようにしてもよい。また、固定無線局である基地局ex１０７〜ex１１０を介さずに、各機器が電話網ex１０４に直接接続されてもよい。
カメラex１１３はデジタルビデオカメラ等の動画撮影が可能な機器である。また、携帯電話は、ＰＤＣ（Personal Digital Communications）方式、ＣＤＭＡ（Code Division Multiple Access）方式、Ｗ−ＣＤＭＡ（Wideband-Code Division Multiple Access）方式、若しくはＧＳＭ（Global System for Mobile Communications）方式の携帯電話機、またはＰＨＳ（Personal Handyphone System）等であり、いずれでも構わない。 However, the content supply system ex100 is not limited to the combination as shown in FIG. 15, and may be connected in any combination. Further, each device may be directly connected to the telephone network ex104 without going through the base stations ex107 to ex110 which are fixed wireless stations.
The camera ex113 is a device capable of shooting moving images, such as a digital video camera. In addition, a mobile phone is a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system, or a GSM (Global System for Mobile Communications) system. Or PHS (Personal Handyphone System) or the like, and either may be used.

また、ストリーミングサーバex１０３は、カメラex１１３から基地局ex１０９、電話網ex１０４を通じて接続されており、カメラex１１３を用いてユーザが送信する符号化処理されたデータに基づいたライブ配信等が可能になる。撮影したデータの符号化処理はカメラex１１３で行っても、データの送信処理をするサーバ等で行ってもよい。また、カメラex１１６で撮影した動画データはコンピュータex１１１を介してストリーミングサーバex１０３に送信されてもよい。カメラex１１６はデジタルカメラ等の静止画、動画が撮影可能な機器である。この場合、動画データの符号化はカメラex１１６で行ってもコンピュータex１１１で行ってもどちらでもよい。また、符号化処理はコンピュータex１１１やカメラex１１６が有するＬＳＩex１１７において処理することになる。なお、動画像符号化・復号化用のソフトウェアをコンピュータex１１１等で読み取り可能な記録媒体である何らかの蓄積メディア（ＣＤ−ＲＯＭ、フレキシブルディスク、ハードディスクなど）に組み込んでもよい。さらに、カメラ付きの携帯電話ex１１５で動画データを送信してもよい。このときの動画データは携帯電話ex１１５が有するＬＳＩで符号化処理されたデータである。 The streaming server ex103 is connected from the camera ex113 to the base station ex109 and the telephone network ex104, and enables live distribution and the like based on the encoded data transmitted by the user using the camera ex113. The encoding process of the captured data may be performed by the camera ex113, or may be performed by a server or the like that performs the data transmission process. Also, moving image data captured by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111. The camera ex116 is a device such as a digital camera capable of shooting still images and moving images. In this case, encoding of the moving image data may be performed by the camera ex116 or the computer ex111. The encoding process is performed by the LSI ex117 included in the computer ex111 and the camera ex116. The moving image encoding / decoding software may be incorporated in any storage medium (a CD-ROM, a flexible disk, a hard disk, or the like) that is a recording medium readable by the computer ex111 or the like. Further, the moving image data may be transmitted by a mobile phone with camera ex115. The moving image data at this time is data that has been encoded by the LSI included in the mobile phone ex115.

このコンテンツ供給システムex１００では、ユーザがカメラex１１３、カメラex１１６等で撮影しているコンテンツ（例えば、音楽ライブを撮影した映像等）を上記実施の形態同様に符号化処理してストリーミングサーバex１０３に送信する一方で、ストリーミングサーバex１０３は要求のあったクライアントに対して上記コンテンツデータをストリーム配信する。クライアントとしては、上記符号化処理されたデータを復号化することが可能な、コンピュータex１１１、ＰＤＡex１１２、カメラex１１３、携帯電話ex１１４等がある。このようにすることでコンテンツ供給システムex１００は、符号化されたデータをクライアントにおいて受信して再生することができ、さらにクライアントにおいてリアルタイムで受信して復号化し、再生することにより、個人放送をも実現可能になるシステムである。 In the content supply system ex100, the content (for example, a video of a live music shot) captured by the user with the camera ex113, the camera ex116, or the like is encoded and transmitted to the streaming server ex103 in the same manner as in the above embodiment. On the other hand, the streaming server ex103 stream-distributes the content data to the requesting client. Examples of the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, and the like, which are capable of decoding the encoded data. In this way, the content providing system ex100 can receive and reproduce the encoded data on the client, and further, can receive, decode, and reproduce the encoded data on the client in real time, thereby realizing personal broadcasting. It is a system that becomes possible.

このシステムを構成する各機器の符号化、復号化には上記各実施の形態で示した動画像符号化装置あるいは動画像復号化装置を用いるようにすればよい。
その一例として携帯電話について説明する。
図１６は、上記実施の形態で説明した動画像符号化方法と動画像復号化方法を用いた携帯電話ex１１５を示す図である。携帯電話ex１１５は、基地局ex１１０との間で電波を送受信するためのアンテナex２０１、ＣＣＤカメラ等の映像、静止画を撮ることが可能なカメラ部ex２０３、カメラ部ex２０３で撮影した映像、アンテナex２０１で受信した映像等が復号化されたデータを表示する液晶ディスプレイ等の表示部ex２０２、操作キーｅｘ２０４群から構成される本体部、音声出力をするためのスピーカ等の音声出力部ex２０８、音声入力をするためのマイク等の音声入力部ex２０５、撮影した動画もしくは静止画のデータ、受信したメールのデータ、動画のデータもしくは静止画のデータ等、符号化されたデータまたは復号化されたデータを保存するための記録メディアex２０７、携帯電話ex１１５に記録メディアex２０７を装着可能とするためのスロット部ex２０６を有している。記録メディアex２０７はＳＤカード等のプラスチックケース内に電気的に書換えや消去が可能な不揮発性メモリであるＥＥＰＲＯＭ（Electrically Erasable and Programmable Read Only Memory）の一種であるフラッシュメモリ素子を格納したものである。 The encoding and decoding of each device constituting this system may be performed using the video encoding device or the video decoding device described in each of the above embodiments.
A mobile phone will be described as an example.
FIG. 16 is a diagram illustrating the mobile phone ex115 using the moving picture coding method and the moving picture decoding method described in the above embodiment. The mobile phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a video image of a CCD camera or the like, a camera unit ex203 capable of taking a still image, a video image captured by the camera unit ex203, and an antenna ex201. A display unit ex202 such as a liquid crystal display for displaying data obtained by decoding a received video or the like, a main unit including a group of operation keys ex204, an audio output unit ex208 such as a speaker for outputting audio, and audio input. Input unit ex205 such as a microphone for storing encoded or decoded data, such as data of captured moving images or still images, received mail data, moving image data or still image data, etc. Recording medium ex207, and a slot ex20 for allowing the recording medium ex207 to be attached to the mobile phone ex115. The it has. The recording medium ex207 stores a flash memory element which is a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory) which is a nonvolatile memory which can be electrically rewritten and erased, in a plastic case such as an SD card.

さらに、携帯電話ex１１５について図１７を用いて説明する。携帯電話ex１１５は表示部ex２０２及び操作キーｅｘ２０４を備えた本体部の各部を統括的に制御するようになされた主制御部ex３１１に対して、電源回路部ex３１０、操作入力制御部ex３０４、画像符号化部ex３１２、カメラインターフェース部ex３０３、ＬＣＤ（Liquid Crystal Display）制御部ex３０２、画像復号化部ex３０９、多重分離部ex３０８、記録再生部ex３０７、変復調回路部ex３０６及び音声処理部ex３０５が同期バスex３１３を介して互いに接続されている。 Further, the mobile phone ex115 will be described with reference to FIG. The mobile phone ex115 controls a power supply circuit unit ex310, an operation input control unit ex304, an image encoding unit, and a main control unit ex311 that integrally controls each unit of a main unit including a display unit ex202 and operation keys ex204. Unit ex312, camera interface unit ex303, LCD (Liquid Crystal Display) control unit ex302, image decoding unit ex309, demultiplexing unit ex308, recording / reproducing unit ex307, modulation / demodulation circuit unit ex306, and audio processing unit ex305 via a synchronous bus ex313. Connected to each other.

電源回路部ex３１０は、ユーザの操作により終話及び電源キーがオン状態にされると、バッテリパックから各部に対して電力を供給することによりカメラ付ディジタル携帯電話ex１１５を動作可能な状態に起動する。
携帯電話ex１１５は、ＣＰＵ、ＲＯＭ及びＲＡＭ等でなる主制御部ex３１１の制御に基づいて、音声通話モード時に音声入力部ex２０５で集音した音声信号を音声処理部ex３０５によってディジタル音声データに変換し、これを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。また携帯電話機ex１１５は、音声通話モード時にアンテナex２０１で受信した受信データを増幅して周波数変換処理及びアナログディジタル変換処理を施し、変復調回路部ex３０６でスペクトラム逆拡散処理し、音声処理部ex３０５によってアナログ音声データに変換した後、これを音声出力部ex２０８を介して出力する。 The power supply circuit unit ex310 activates the camera-equipped digital mobile phone ex115 in an operable state by supplying power to each unit from the battery pack when the call end and the power key are turned on by a user operation. .
The mobile phone ex115 converts an audio signal collected by the audio input unit ex205 into digital audio data by the audio processing unit ex305 in the audio communication mode based on the control of the main control unit ex311 including a CPU, a ROM, a RAM, and the like. This is spread-spectrum processed by a modulation / demodulation circuit unit ex306, subjected to digital-analog conversion processing and frequency conversion processing by a transmission / reception circuit unit ex301, and then transmitted via an antenna ex201. The mobile phone ex115 amplifies received data received by the antenna ex201 in the voice call mode, performs frequency conversion processing and analog-to-digital conversion processing, performs spectrum despreading processing in the modulation / demodulation circuit unit ex306, and performs analog voice decoding in the voice processing unit ex305. After the data is converted, the data is output via the audio output unit ex208.

さらに、データ通信モード時に電子メールを送信する場合、本体部の操作キーｅｘ２０４の操作によって入力された電子メールのテキストデータは操作入力制御部ex３０４を介して主制御部ex３１１に送出される。主制御部ex３１１は、テキストデータを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して基地局ex１１０へ送信する。 Further, when an e-mail is transmitted in the data communication mode, text data of the e-mail input by operating the operation key ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex306, performs digital / analog conversion processing and frequency conversion processing on the transmission / reception circuit unit ex301, and transmits the text data to the base station ex110 via the antenna ex201.

データ通信モード時に画像データを送信する場合、カメラ部ex２０３で撮像された画像データをカメラインターフェース部ex３０３を介して画像符号化部ex３１２に供給する。また、画像データを送信しない場合には、カメラ部ex２０３で撮像した画像データをカメラインターフェース部ex３０３及びＬＣＤ制御部ex３０２を介して表示部ex２０２に直接表示することも可能である。 When transmitting image data in the data communication mode, the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303. When the image data is not transmitted, the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.

画像符号化部ex３１２は、本願発明で説明した動画像符号化装置を備えた構成であり、カメラ部ex２０３から供給された画像データを上記実施の形態で示した動画像符号化装置に用いた符号化方法によって圧縮符号化することにより符号化画像データに変換し、これを多重分離部ex３０８に送出する。また、このとき同時に携帯電話機ex１１５は、カメラ部ex２０３で撮像中に音声入力部ex２０５で集音した音声を音声処理部ex３０５を介してディジタルの音声データとして多重分離部ex３０８に送出する。 The image encoding unit ex312 includes the moving image encoding device described in the present invention, and encodes the image data supplied from the camera unit ex203 using the moving image encoding device described in the above embodiment. The image data is converted into encoded image data by performing compression encoding according to a demultiplexing method, and is transmitted to the demultiplexing unit ex308. Further, at this time, the mobile phone ex115 simultaneously transmits the voice collected by the voice input unit ex205 during imaging by the camera unit ex203 to the demultiplexing unit ex308 as digital voice data via the voice processing unit ex305.

多重分離部ex３０８は、画像符号化部ex３１２から供給された符号化画像データと音声処理部ex３０５から供給された音声データとを所定の方式で多重化し、その結果得られる多重化データを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。 The demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method, and modulates and outputs the resulting multiplexed data. The signal is subjected to spread spectrum processing in ex306 and subjected to digital / analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex301, and then transmitted via the antenna ex201.

データ通信モード時にホームページ等にリンクされた動画像ファイルのデータを受信する場合、アンテナex２０１を介して基地局ex１１０から受信した受信データを変復調回路部ex３０６でスペクトラム逆拡散処理し、その結果得られる多重化データを多重分離部ex３０８に送出する。
また、アンテナex２０１を介して受信された多重化データを復号化するには、多重分離部ex３０８は、多重化データを分離することにより画像データのビットストリームと音声データのビットストリームとに分け、同期バスex３１３を介して当該符号化画像データを画像復号化部ex３０９に供給すると共に当該音声データを音声処理部ex３０５に供給する。 When data of a moving image file linked to a homepage or the like is received in the data communication mode, the data received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex306, and the resulting multiplexed data is obtained. The demultiplexed data is sent to the demultiplexing unit ex308.
To decode the multiplexed data received via the antenna ex201, the demultiplexing unit ex308 separates the multiplexed data into a bit stream of image data and a bit stream of audio data, and performs synchronization. The coded image data is supplied to the image decoding unit ex309 via the bus ex313, and the audio data is supplied to the audio processing unit ex305.

次に、画像復号化部ex３０９は、本願発明で説明した動画像復号化装置を備えた構成であり、画像データのビットストリームを上記実施の形態で示した符号化方法に対応した復号化方法で復号化することにより再生動画像データを生成し、これをＬＣＤ制御部ex３０２を介して表示部ex２０２に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まれる動画データが表示される。このとき同時に音声処理部ex３０５は、音声データをアナログ音声データに変換した後、これを音声出力部ex２０８に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まる音声データが再生される。 Next, the image decoding unit ex309 has a configuration including the moving image decoding device described in the present invention, and converts a bit stream of image data into a decoding method corresponding to the encoding method described in the above embodiment. By decoding, the reproduced moving image data is generated and supplied to the display unit ex202 via the LCD control unit ex302, whereby, for example, the moving image data included in the moving image file linked to the homepage is displayed. . At this time, at the same time, the audio processing unit ex305 converts the audio data into analog audio data, and then supplies the analog audio data to the audio output unit ex208, whereby, for example, the audio data included in the moving image file linked to the homepage is reproduced. You.

なお、上記システムの例に限られず、最近は衛星、地上波によるディジタル放送が話題となっており、図１８に示すようにディジタル放送用システムにも上記実施の形態の少なくとも動画像符号化装置または動画像復号化装置のいずれかを組み込むことができる。具体的には、放送局ex４０９では映像情報のビットストリームが電波を介して通信または放送衛星ex４１０に伝送される。これを受けた放送衛星ex４１０は、放送用の電波を発信し、この電波を衛星放送受信設備をもつ家庭のアンテナex４０６で受信し、テレビ（受信機）ex４０１またはセットトップボックス（ＳＴＢ）ex４０７などの装置によりビットストリームを復号化してこれを再生する。また、記録媒体であるCDやDVD等の蓄積メディアex４０２に記録したビットストリームを読み取り、復号化する再生装置ex４０３にも上記実施の形態で示した動画像復号化装置を実装することが可能である。この場合、再生された映像信号はモニタex４０４に表示される。また、ケーブルテレビ用のケーブルex４０５または衛星／地上波放送のアンテナex４０６に接続されたセットトップボックスex４０７内に動画像復号化装置を実装し、これをテレビのモニタex４０８で再生する構成も考えられる。このときセットトップボックスではなく、テレビ内に動画像復号化装置を組み込んでも良い。また、アンテナex４１１を有する車ex４１２で衛星ex４１０からまたは基地局ex１０７等から信号を受信し、車ex４１２が有するカーナビゲーションex４１３等の表示装置に動画を再生することも可能である。 It should be noted that the present invention is not limited to the example of the system described above, and digital broadcasting using satellites and terrestrial waves has recently become a topic. As shown in FIG. Any of the video decoding devices can be incorporated. Specifically, at the broadcasting station ex409, the bit stream of the video information is transmitted to the communication or the broadcasting satellite ex410 via radio waves. The broadcast satellite ex410 receiving this transmits a broadcast radio wave, receives this radio wave with a home antenna ex406 having a satellite broadcast reception facility, and outputs the radio wave to a television (receiver) ex401 or a set-top box (STB) ex407. The device decodes the bit stream and reproduces it. In addition, the moving picture decoding apparatus described in the above embodiment can be mounted on a reproducing apparatus ex403 that reads and decodes a bit stream recorded on a storage medium ex402 such as a CD or DVD that is a recording medium. . In this case, the reproduced video signal is displayed on the monitor ex404. Further, a configuration is also conceivable in which a moving image decoding apparatus is mounted in a set-top box ex407 connected to a cable ex405 for cable television or an antenna ex406 for satellite / terrestrial broadcasting, and this is reproduced on a monitor ex408 of the television. At this time, the moving picture decoding device may be incorporated in the television instead of the set-top box. In addition, a car ex412 having an antenna ex411 can receive a signal from the satellite ex410 or a base station ex107 or the like, and can reproduce a moving image on a display device such as a car navigation ex413 included in the car ex412.

更に、画像信号を上記実施の形態で示した動画像符号化装置で符号化し、記録媒体に記録することもできる。具体例としては、DVDディスクｅｘ４２１に画像信号を記録するDVDレコーダや、ハードディスクに記録するディスクレコーダなどのレコーダｅx４２０がある。更にSDカードｅｘ４２２に記録することもできる。レコーダｅｘ４２０が上記実施の形態で示した動画像復号化装置を備えていれば、DVDディスクｅｘ４２１やSDカードｅｘ４２２に記録した画像信号を再生し、モニタｅｘ４０８で表示することができる。 Furthermore, an image signal can be encoded by the moving image encoding device described in the above embodiment and recorded on a recording medium. As a specific example, there is a recorder ex420 such as a DVD recorder for recording an image signal on a DVD disc ex421 or a disc recorder for recording on a hard disk. Furthermore, it can be recorded on the SD card ex422. If the recorder ex420 includes the moving picture decoding device described in the above embodiment, the video signal recorded on the DVD disc ex421 or the SD card ex422 can be reproduced and displayed on the monitor ex408.

なお、カーナビゲーションex４１３の構成は例えば図１７に示す構成のうち、カメラ部ex２０３とカメラインターフェース部ex３０３、画像符号化部ｅｘ３１２を除いた構成が考えられ、同様なことがコンピュータex１１１やテレビ（受信機）ex４０１等でも考えられる。
また、上記携帯電話ex１１４等の端末は、符号化器・復号化器を両方持つ送受信型の端末の他に、符号化器のみの送信端末、復号化器のみの受信端末の３通りの実装形式が考えられる。 The configuration of the car navigation system ex413 may be, for example, a configuration excluding the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312 from the configuration illustrated in FIG. 17, and the same applies to the computer ex111 and the television (receiver). ) Ex401 and the like are also conceivable.
In addition, the terminal such as the above-mentioned mobile phone ex114 has three mounting formats, in addition to a transmitting / receiving terminal having both an encoder and a decoder, a transmitting terminal having only an encoder and a receiving terminal having only a decoder. Can be considered.

このように、上記実施の形態で示した動画像符号化方法あるいは動画像復号化方法を上述したいずれの機器・システムに用いることは可能であり、そうすることで、上記実施の形態で説明した効果を得ることができる。
また、本発明はかかる上記実施形態に限定されるものではなく、本発明の範囲を逸脱することなく種々の変形または修正が可能である。 As described above, the moving picture coding method or the moving picture decoding method described in the above embodiment can be used for any of the devices and systems described above. The effect can be obtained.
Further, the present invention is not limited to the above embodiment, and various changes or modifications can be made without departing from the scope of the present invention.

以上のように、本発明に係る動画像符号化方法および動画像復号化方法は、例えば携帯電話、ＤＶＤ装置、およびパーソナルコンピュータ等で、動画像を構成する各ピクチャを符号化して符号化ストリームを生成したり、生成された符号化ストリームを復号化したりするための方法として有用である。 As described above, the moving picture encoding method and the moving picture decoding method according to the present invention encode each picture constituting a moving picture by using, for example, a mobile phone, a DVD device, and a personal computer to form an encoded stream. It is useful as a method for generating or decoding the generated coded stream.

本発明の動画像符号化装置の構成を示すブロック図（実施の形態1）である。FIG. 1 is a block diagram (Embodiment 1) showing a configuration of a video encoding device of the present invention. 本発明の動画像復号化方法の概念を説明する図（実施の形態１）である。FIG. 3 is a diagram (first embodiment) for explaining the concept of the moving picture decoding method of the present invention. 本発明の動画像復号化装置の構成を示すブロック図（実施の形態１）である。[Fig. 3] Fig. 3 is a block diagram (first embodiment) illustrating a configuration of a video decoding device of the present invention. 本発明の動画像復号化方法の動作フロー図（実施の形態1）である。FIG. 4 is an operation flowchart (first embodiment) of the moving picture decoding method of the present invention. 本発明の動画像復号化方法の空き領域確保処理の動作フロー図（実施の形態１）である。FIG. 9 is an operation flowchart (Embodiment 1) of the free area securing process of the moving picture decoding method of the present invention. 本発明の動画像復号化装置の構成を示すブロック図（実施の形態２）である。FIG. 3 is a block diagram (Embodiment 2) showing the configuration of the video decoding device of the present invention. 本発明の動画像復号化方法の動作フロー図（実施の形態２）である。FIG. 10 is an operation flowchart (second embodiment) of the moving picture decoding method of the present invention. 本発明の動画像符号化方法の出力するデータの構造、本発明の動画像復号化方法の入力するデータの構造を示す図である。It is a figure which shows the structure of the data which the moving image encoding method of this invention outputs, and the structure of the data which the moving image decoding method of this invention inputs. 本発明の動画像符号化装置の構成を示すブロック図（実施の形態３）である。FIG. 9 is a block diagram (Embodiment 3) showing a configuration of a video encoding device of the present invention. 本発明の動画像符号化方法の概念を説明する図（実施の形態３）である。FIG. 10 is a diagram (third embodiment) for explaining the concept of the moving picture encoding method of the present invention. 本発明の動画像符号化方法の動作フロー図（実施の形態３）である。FIG. 9 is an operation flowchart (third embodiment) of the moving picture encoding method of the present invention. 本発明の動画像符号化方法の概念を説明する図（実施の形態４）である。FIG. 14 is a diagram (fourth embodiment) for explaining the concept of the moving picture coding method of the present invention. 本発明の動画像符号化方法の動作フロー図（実施の形態４）である。FIG. 9 is an operation flowchart (Embodiment 4) of the moving picture encoding method of the present invention. 各実施の形態の動画像符号化方法および動画像復号化方法をコンピュータシステムにより実現するためのプログラムを格納するための記録媒体についての説明図であり、(a) 記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示した説明図、(b) フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示した説明図、(c) フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示した説明図である。It is an explanatory diagram of a recording medium for storing a program for realizing the moving image encoding method and the moving image decoding method of each embodiment by a computer system, (a) of a flexible disk which is a recording medium body Explanatory diagram showing an example of a physical format, (b) Appearance, sectional structure, and explanatory diagram showing a flexible disk as viewed from the front of a flexible disk, (c) Configuration for recording and reproducing the above program on the flexible disk FD FIG. コンテンツ配信サービスを実現するコンテンツ供給システムの全体構成を示すブロック図である。FIG. 1 is a block diagram illustrating an overall configuration of a content supply system that realizes a content distribution service. 携帯電話の一例を示す図である。It is a figure showing an example of a mobile phone. 携帯電話の内部構成を示すブロック図である。FIG. 2 is a block diagram illustrating an internal configuration of the mobile phone. ディジタル放送用システムの全体構成を示すブロック図である。FIG. 1 is a block diagram illustrating an overall configuration of a digital broadcasting system. 従来の動画像符号化装置の構成を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration of a conventional moving image encoding device. 表示順（POC）と被参照ピクチャ番号の概念を説明する図である。It is a figure explaining the concept of a display order (POC) and a referenced picture number. 不使用とマークされたピクチャがある場合、メモリに空き領域を確保するために、ピクチャを消去する動作を説明する図である。FIG. 11 is a diagram illustrating an operation of deleting a picture in order to secure a free area in a memory when there is a picture marked as unused. 不使用とマークされたピクチャがない場合、メモリに空き領域を確保するために、ピクチャを消去する動作を説明する図である。FIG. 11 is a diagram illustrating an operation of deleting a picture in order to secure a free area in a memory when there is no picture marked as unused. 従来の動画像復号化装置の構成を示すブロック図である。FIG. 15 is a block diagram illustrating a configuration of a conventional video decoding device. 従来の動画像復号化装置のメモリ関連の動作のフロー図である。FIG. 10 is a flowchart of a memory-related operation of the conventional video decoding device. 従来の動画像復号化装置の空き領域確保処理の動作のフロー図である。FIG. 10 is a flowchart of an operation of a free space securing process of the conventional video decoding device. シーケンスの不連続が表示順情報POCの不連続をひきおこし、未表示のピクチャを消去してしまう問題を説明する概念図である。FIG. 9 is a conceptual diagram illustrating a problem that a discontinuity in a sequence causes a discontinuity in display order information POC and an undisplayed picture is deleted. 無効ピクチャの動作を説明する概念図である。It is a conceptual diagram explaining operation | movement of an invalid picture. 従来の動画像復号化装置の構成を示すブロック図である。FIG. 15 is a block diagram illustrating a configuration of a conventional video decoding device. 従来の動画像復号化装置の無効ピクチャの動作のフロー図である。FIG. 10 is a flowchart of an operation of an invalid picture of the conventional video decoding device. シーケンスの不連続がフレームナンバー FNの不連続をひきおこし、無効ピクチャが未表示のピクチャを消去してしまう問題を説明する概念図である。FIG. 7 is a conceptual diagram illustrating a problem that a discontinuity in a sequence causes a discontinuity in a frame number FN, and an invalid picture deletes an undisplayed picture. 従来のMPEG-2ストリームの構造を説明する概念図である。FIG. 3 is a conceptual diagram illustrating the structure of a conventional MPEG-2 stream. 従来のJVTの動画像符号化方法を説明する概念図である。FIG. 11 is a conceptual diagram illustrating a conventional JVT video encoding method. 従来のJVTの動画像符号化方法の動作フロー図である。FIG. 11 is an operation flowchart of a conventional JVT video encoding method. JVTの符号化の自由度が、編集時やランダムアクセス時に引き起こす問題を説明する概念図である。FIG. 7 is a conceptual diagram illustrating a problem caused by the degree of freedom of JVT coding at the time of editing or random access.

Explanation of reference numerals

１０１並び替えメモリ
１０２符号化スケジューリング部
１０３動き検出部
１０４減算演算部
１０５符号化部
１０６動き補償部
１０７可変長符号化部
１０８復号化部
１０９加算演算部
１１０、１１１メモリ
１１２フラグ情報生成部
１１３可変長符号化部
２０１、２１３、４０２、４１１可変長復号化部
２０２画像復号化部
２０３、２１４編集検出部
２０４ MMCO復号化部
２０５、２１２、４０１、４１２メモリ管理部
２０６メモリ
２１１ FNギャップ検出部 Reference Signs List 101 rearrangement memory 102 encoding scheduling unit 103 motion estimation unit 104 subtraction operation unit 105 encoding unit 106 motion compensation unit 107 variable length encoding unit 108 decoding unit 109 addition operation unit 110, 111 memory 112 flag information generation unit 113 variable Long encoding section 201, 213, 402, 411 Variable length decoding section 202 Image decoding section 203, 214 Edit detection section 204 MMCO decoding section 205, 212, 401, 412 Memory management section 206 Memory 211 FN gap detection section

Claims

A moving picture coding method for coding a moving picture signal on a picture basis to generate a coded stream,
A flag information creating step of creating flag information indicating that the order of the pictures is discontinuous;
An information adding step of adding the flag information to the encoded stream.

The moving picture coding method according to claim 1, wherein, in the flag information creating step, when the display order information of the pictures is discontinuous, the order of the pictures is discontinuous.

The moving picture encoding method according to claim 1, wherein, in the flag information creating step, if the encoding order information of the pictures is discontinuous, the order of the pictures is discontinuous.

The moving picture coding method according to claim 1, wherein, in the information adding step, the flag information is added between two pictures in which the order of the pictures in the coded stream is discontinuous.

The video encoding method further includes:
Including a position information creating step of creating position information specifying a position where the order of the pictures is discontinuous,
The moving image encoding method according to claim 1, wherein, in the information adding step, the position information is added together with the flag information.

A moving picture coding method for coding a moving picture signal on a picture basis to generate a coded stream,
An encoding step of encoding a picture whose display order is later than the first intra-coded picture in a predetermined coding unit composed of a plurality of pictures so as to be included in a coding unit subsequent to the coding unit. A moving picture coding method comprising:

In the encoding step, the display order of the pictures in the predetermined coding unit is continuous, and the display of the pictures in the predetermined coding unit immediately after the predetermined coding unit in the coding order is performed. The moving image encoding method according to claim 6, wherein encoding is performed before the order.

A moving image decoding method for decoding an encoded stream in units of pictures,
An information extracting step of extracting flag information indicating that the order of the pictures is discontinuous;
A management step of managing an area for storing a decoded picture based on the flag information.

The flag information is information indicating that display order information of pictures is discontinuous,
In the managing step, a picture whose display order is the earliest among decoded pictures stored in the area is determined based on the display order information and the flag information, and the determined picture is deleted. The moving picture decoding method according to claim 8, wherein the moving picture is a picture.

In the managing step, clip information to be updated when the flag information is extracted is provided to the decoded picture stored in the area, and the area is assigned to the area based on the display order information and the clip information. The moving picture decoding method according to claim 9, wherein a picture whose display order is the earliest among the stored decoded pictures is determined, and the determined picture is set as a deletion target picture.

The video decoding method may further include:
When the coding order information of the picture is discontinuous, the method includes an invalid picture storing step of storing an invalid picture in the area,
The flag information is information indicating that the encoding order information is discontinuous,
In the managing step, based on the flag information and the encoding order information, determine whether to store an invalid picture in the area,
The moving picture decoding method according to claim 8, wherein in the invalid picture storing step, an invalid picture is stored in the area based on a determination result in the management step.

A moving image encoding apparatus that encodes a moving image signal on a picture basis to generate an encoded stream,
Flag information creating means for creating flag information indicating that the order of the pictures is discontinuous,
A video encoding device, comprising: information adding means for adding the flag information to the encoded stream.

A moving picture decoding apparatus for decoding an encoded stream in units of pictures,
Information extraction means for extracting flag information indicating that the order of the pictures is discontinuous,
Management means for managing an area for storing a decoded picture based on the flag information.

A program for encoding a moving image signal on a picture basis to generate an encoded stream,
A flag information creating step of creating flag information indicating that the order of the pictures is discontinuous;
An information adding step of adding the flag information to the encoded stream.

A program for decoding an encoded stream in units of pictures,
An information extracting step of extracting flag information indicating that the order of the pictures is discontinuous;
And a management step of managing an area for storing a decoded picture based on the flag information.