JP6268715B2

JP6268715B2 - Moving picture encoding apparatus and program, moving picture decoding apparatus and program, and moving picture distribution system

Info

Publication number: JP6268715B2
Application number: JP2013037576A
Authority: JP
Inventors: 和仁迫水
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2013-02-27
Filing date: 2013-02-27
Publication date: 2018-01-31
Anticipated expiration: 2033-02-27
Also published as: JP2014165849A

Description

この発明は、動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムに関し、例えば、動画像をＤＶＣ（ＤｉｓｔｒｉｂｕｔｅｄＶｉｄｅｏＣｏｄｉｎｇ）方式で符号化して配信する動画像配信システムに適用し得る。 The present invention relates to a moving image encoding device and program, a moving image decoding device and program, and a moving image distribution system. For example, the present invention relates to a moving image distribution system that encodes and distributes a moving image by a DVC (Distributed Video Coding) method. Applicable.

従来のＤＶＣ（ＤｉｓｔｒｉｂｕｔｅｄＶｉｄｅｏＣｏｄｉｎｇ）方式を採用したシステムは、Ｓｌｅｐｉａｎ‐Ｗｏｌｆ理論およびＷｙｎｅｒ−Ｚｉｖ理論に基づき動画像の符号化および復号をおこなうシステムである。 A system that adopts a conventional DVC (Distributed Video Coding) system is a system that encodes and decodes a moving picture based on the Slepian-Wolf theory and the Wyner-Ziv theory.

ＤＶＣ方式は、デコーダで生成される符号化対象画像の予測画像（以下、「デコーダ予測画像」と呼ぶ）から符号化対象画像を再構成する符号（以下、「Ｗｙｎｅｒ−Ｚｉｖ符号」と呼ぶ）を、デコーダ予測画像を直接参照することなく生成することを特徴とする。この特徴より、ＤＶＣ方式を採用した動画像符号化装置は複雑な予測画像生成部を備える必要がなく、符号化にかかる演算量の削減が可能になる。ＤＶＣ方式の間題は、デコーダ予測画像自体を保持して、復号画像を生成するのに必要十分なＷｙｎｅｒ−Ｚｉｖ符号のレートを求めることが困難であることである。例えば、ＤＶＣ方式を採用した動画像符号化装置では、Ｗｙｎｅｒ−Ｚｉｖ符号が多すぎると圧縮効果が得られず、Ｗｙｎｅｒ−Ｚｉｖ符号が少なすぎるとデコーダ予測画像から復号画像を生成できない。 In the DVC method, a code (hereinafter referred to as “Wyner-Ziv code”) that reconstructs an encoding target image from a prediction image (hereinafter referred to as “decoder predicted image”) of the encoding target image generated by the decoder. The decoder prediction image is generated without referring directly. Due to this feature, a moving picture encoding apparatus adopting the DVC method does not need to include a complicated predicted image generation unit, and the amount of calculation required for encoding can be reduced. The problem with the DVC scheme is that it is difficult to obtain a Wyner-Ziv code rate that is necessary and sufficient to generate a decoded image while retaining the decoder predicted image itself. For example, in a video encoding apparatus adopting the DVC method, if there are too many Wyner-Ziv codes, a compression effect cannot be obtained, and if there are too few Wyner-Ziv codes, a decoded picture cannot be generated from the predicted decoder picture.

このようなＤＶＣ方式の課題を解決する従来技術として、非特許文献１の記載技術がある。非特許文献１では、デコーダ予測画像から復号画像が生成されるまで追加のＷｙｎｅｒ−Ｚｉｖ符号を要求し続ける（フィードバックし続ける）ことで必要十分なレートを探る方式（以下、「デコーダレート制御方式」と呼ぶ）を実現している。 As a conventional technique for solving such a problem of the DVC method, there is a technique described in Non-Patent Document 1. In Non-Patent Document 1, a method for searching for a necessary and sufficient rate by continuously requesting (feeding back) an additional Wyner-Ziv code until a decoded image is generated from a decoder predicted image (hereinafter referred to as “decoder rate control method”). Called).

しかし、従来のデコーダレート制御方式には、ネットワークを介した復号処理の存在なしに符号化することができないため、用途が限定されるという短所がある。そして、この短所を改善するための従来技術として非特許文献２、３の記載技術がある。 However, the conventional decoder rate control method has a disadvantage in that the application is limited because it cannot be encoded without the presence of a decoding process via a network. And as a prior art for improving this shortcoming, there are technologies described in Non-Patent Documents 2 and 3.

この短所を解消するため、非特許文献２や非特許文献３では、エンコーダ予測画像（デコーダ予測画像ほど複雑な演算を必要とせずに生成が可能な予測画像）を動画像符号化装置側で生成し、符号化対象画像とエンコーダ予測画像を比較することで、レートを制御する方式（以下、「エンコーダレート制御方式」と呼ぶ）が提案されている。 In order to eliminate this shortcoming, in Non-Patent Document 2 and Non-Patent Document 3, an encoder predicted image (a predicted image that can be generated without requiring a calculation as complicated as a decoder predicted image) is generated on the moving image encoding device side. A method of controlling the rate by comparing the encoding target image and the predicted encoder image (hereinafter referred to as “encoder rate control method”) has been proposed.

非特許文献２、３のエンコーダレート制御方式を採用した動画像符号化装置では、レート制御を行うことで、初回に送られるＷｙｅｒ−Ｚｉｖ符号のレートを、必要十分なレートに近づけ、フィードバック回数を少なくしている。 In the moving picture coding apparatus adopting the encoder rate control method of Non-Patent Documents 2 and 3, by performing rate control, the rate of the Wyer-Ziv code sent for the first time is brought close to a necessary and sufficient rate, and the number of feedbacks is set. Less.

Ｂ．Ｇｉｒｏｄ，ａＭ．Ａａｒｏｎ，Ｓ．Ｒａｎｅ，ａｎｄＤ．Ｒｅｂｏｌｌｏ−Ｍｏｎｅｄｅｒｏ，“ＤｉｓｔｒｉｂｕｔｅｄＶｉｄｅｏＣｏｄｉｎｇ，”ＰｒｏｃｅｅｄｉｎｇｓｏｆｔｈｅＩＥＥE，ｖｏｌ．９３，Ｊａｎ．２００５，ｐｐ．７１−８３．B. Girod, a M.M. Aaron, S.A. Rane, andD. Rebolo-Monedero, “Distributed Video Coding,” Proceedings of the IEEE, vol. 93, Jan. 2005, pp. 71-83. Ｃ．ＢｒｉｔｅｓａｎｄＦ．Ｐｅｒｅｉｒａ，“ＥｎｃｏｄｅｒｒａｔｅｃｏｎｔｒｏｌｆｏｒｔｒａｎｓｆｏｒｍｄｏｍａｉｎＷｙｎｅｒ−Ｚｉｖｖｉｄｅｏｃｏｄｉｎｇ，”１ｍａｇｅＰｒｏｃｅｓｓｉｎｇ２００７．ＩＣＩＰ２００７．ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎ，ＩＥＥＥ，２００７，ｐｐ．４−７．C. Brites and F.M. Pereira, “Encoder rate control for domain domain Wyner-Ziv video coding,” 1 image Processing 2007. ICIP 2007. IEEE International Conference on, IEEE, 2007, pp. 4-7. Ｍ．Ｔａｇｌｉａｓａｃｃｈｉ，Ａ．Ｍａｊｕｍｄａｒ，ａｎｄＲ．Ｒａｍｃｈａｎｄｒａｎ，“Ａｄｉｓｔｒｉｂｕｔｅｄ−ｓｏｕｒｃｅ−ｃｏｄｉｎｇｂａｓｅｄｒｏｂｕｓｔｓｐａｔｉｏ−ｔｅｍｐｏｒａｌｓｃａｌａｂｌｅｖｉｄｅｏｃｏｄｅｃ，”Ｐｒｏｃ．ＰｉｃｔｕｒｅＣｏｄｉｎｇＳｙｍｐｏｓｉｕｍ，Ｃｉｔｅｓｅｅｒ，２００４．M.M. Tagliasacchi, A .; Majmardar, and R.M. Ramchandran, “A distributed-source-coding based robust-spatial-temporal scalable video codec,” Proc. Picture Coding Symposium, Citeseeer, 2004. Ｊ．Ａｓｃｅｎｓｏ，Ｃ．Ｂｒｉｔｅｓ，ａｎｄＦ．Ｐｅｒｅｉｒａ，‘Ｍｏｔｉｏｎｃｏｍｐｅｎｓａｔｅｄｒｅｆｉｎｅｍｅｎｔｆｏｒｌｏｗｃｏｍｐｌｅｘｉｔｙｐｉｘｅｌｂａｓｅｄｄｉｓｔｒｉｂｕｔｅｄｖｉｄｅｏｃｏｄｉｎｇ，”Ｐｒｏｃｅｅｄｉｎｇｓ．ＩＥＥＥＣｏｎｆｅｒｅｎｃｅｏｎＡｄｖａｎｃｅｄＶｉｄｅｏａｎｄＳｉｇｎａｌＢａｓｅｄＳｕｒｖｅｉｌｌａｎｃｅ，２００５．，ｐｐ．５９３−５９８．J. et al. Ascenso, C.I. Brites, and F.B. Pereira, 'Motion compensated refining for low complexity pixel based distributed video coding, "Proceedings. IEEE Conference on Advanced.

従来の動画像符号化装置でエンコーダ予測画像（サイドインフォメーション）を生成する場合、生成したエンコーダ予測画像の量子化値と、Ｗｙｎｅｒ−Ｚｉｖ画像の量子化値との間の誤りの量に応じて、動画像復号装置側に供給する符号量を増減させる。すなわち、従来の動画像符号化装置では、上述の量子化値の誤りの量が多いほど多くの符号量のパリティビットを、動画像復号装置側に供給することになる。 When an encoder predicted image (side information) is generated by a conventional moving image encoding device, according to the amount of error between the quantized value of the generated encoder predicted image and the quantized value of the Wyner-Ziv image, Increase or decrease the amount of code supplied to the video decoding device side. That is, in the conventional video encoding device, the larger the amount of quantization value errors described above, the larger the amount of code bits supplied to the video decoding device side.

一方、動画像符号化装置において、上述の符号量はより少ない方が、伝送効率等の面から好ましいが、符号量を少なくすると動画像復号装置側で生成する復号画像の品質が低下する。 On the other hand, in the video encoding device, it is preferable from the viewpoint of transmission efficiency and the like that the code amount is smaller, but if the code amount is reduced, the quality of the decoded image generated on the video decoding device side is lowered.

そのため、復号画像の品質を低下させずに符号量を少なくすることができる動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムが望まれている。 Therefore, a moving image encoding device and program, a moving image decoding device and program, and a moving image distribution system that can reduce the amount of code without degrading the quality of the decoded image are desired.

第１の本発明は、フレーム列を有する動画像を符号化する動画像符号化装置において、（１）上記フレーム列のうちキーフレームを利用して、非キーフレームの予測画像を生成する予測画像生成手段と、（２）非キーフレームの複雑度合を計算する複雑度計算部と、（３）上記複雑度計算部の計算結果に基づいて、画像を構成する各パラメータの量子化に係るビットデプスを決定するビットデプス決定手段と、（４）上記ビットデプス決定手段で決定されたビットデプスで、非キーフレームの各パラメータ、及び、上記予測画像の各パラメータを量子化して、量子化後非キーフレーム及び量子化後予測画像を生成するものであって、最多のビットデプスよりも少ないビットデプスが決定されたパラメータについては、当該パラメータに対して決定されたビットデプス分の量子化データの最上位ビット側又は最下位ビット側のいずれかに、最多のビットデプスとの差分に相当する所定パターンのパディングビットを追加したデータを、当該パラメータの量子化データとして生成する量子化手段と、（５）量子化後非キーフレームに対する量子化後予測画像の誤りを訂正するための誤り訂正符号を生成する誤り訂正符号生成手段とを有し、（６）上記ビットデプス決定手段及び上記量子化手段で処理される各画像のパラメータは、当該画像を構成する各画素の画素値、又は、当該画像を所定の変換方式で変換することで得られる変換係数領域ごとの量子化値であることを特徴とする。 According to a first aspect of the present invention, there is provided a moving image encoding apparatus for encoding a moving image having a frame sequence. (1) A predicted image that generates a predicted image of a non-key frame using a key frame in the frame sequence. A generating unit; (2) a complexity calculating unit that calculates the degree of complexity of a non-key frame; and (3) a bit depth related to quantization of each parameter constituting an image based on a calculation result of the complexity calculating unit. And (4) quantizing each parameter of the non-key frame and each parameter of the predicted image with the bit depth determined by the bit depth determining means, For generating a frame and a predicted image after quantization, for a parameter for which a bit depth smaller than the maximum bit depth is determined, Data obtained by adding padding bits of a predetermined pattern corresponding to the difference from the most bit depth to either the most significant bit side or the least significant bit side of the quantized data corresponding to the specified bit depth And (5) error correction code generation means for generating an error correction code for correcting an error in the predicted image after quantization with respect to the non-key frame after quantization, (6) ) parameters of each image processed by the bit depth determination means and the quantizing means, the pixel values of the pixels constituting the image, or transform coefficient obtained by converting the image by a predetermined conversion method It is characterized by a quantized value for each of several regions.

第２の本発明は、フレーム列を有する動画像がフレーム単位に符号化された動画像データを復号する動画像復号装置において、（１）キーフレームのデータを保持するキーフレーム保持手段と、（２）上記キーフレーム保持手段が保持したキーフレームを利用して、上記フレーム列のうち非キーフレームの予測画像を生成する予測画像生成手段と、（３）上記動画像データを構成する非キーフレームの符号化データの生成過程で、画像を構成する各パラメータの量子化に適用されたビットデプスに係るビットデプス情報を保持するビットデプス保持手段と、（４）上記ビットデプス保持手段で保持されたビットデプスで、上記予測画像の各パラメータを量子化して、量子化後予測画像を生成するものであって、最多のビットデプスよりも少ないビットデプスが決定されたパラメータについては、当該パラメータに対して設定されたビットデプス分の量子化データの最上位ビット側又は最下位ビット側のいずれかに、最多のビットデプスとの差分に相当する所定パターンのパディングビットを追加したデータを、当該パラメータの量子化データとして生成する量子化手段と、（５）上記動画像データを構成する非キーフレームの符号化データが、量子化後予測画像を訂正するための誤り訂正符号だった場合、その誤り訂正符号を用いて量子化後予測画像を訂正した訂正後画像を生成する誤り訂正手段と、（６）訂正後画像と、上記予測画像とに基づいて、非キーフレームの画像を再構成した再構成画像を生成する再構成手段とを有し、（７）上記ビットデプス決定手段及び上記量子化手段で処理される各画像のパラメータは、当該画像を構成する各画素の画素値、又は、当該画像を所定の変換方式で変換することで得られる変換係数領域ごとの量子化値であることを特徴とする。 According to a second aspect of the present invention, in a moving image decoding apparatus for decoding moving image data in which a moving image having a frame sequence is encoded in units of frames, (1) key frame holding means for holding key frame data; 2) predicted image generation means for generating a predicted image of a non-key frame in the frame sequence using the key frame held by the key frame holding means; and (3) a non-key frame constituting the moving image data. Bit depth holding means for holding bit depth information related to the bit depth applied to quantization of each parameter constituting the image in the process of generating the encoded data of (4), and (4) held by the bit depth holding means The bit depth is used to quantize each parameter of the predicted image to generate a quantized predicted image, which is smaller than the maximum bit depth. The parameter for which the bit depth is determined corresponds to the difference from the most bit depth on either the most significant bit side or the least significant bit side of the quantized data for the bit depth set for the parameter. Quantization means for generating data to which padding bits of a predetermined pattern are added as quantized data of the parameter; and (5) encoded data of non-keyframes constituting the moving image data includes a predicted image after quantization. In the case of an error correction code for correction, error correction means for generating a corrected image obtained by correcting the quantized predicted image using the error correction code, (6) the corrected image, and the predicted image And (7) the bit depth determination unit and the quantization unit, the reconstruction unit configured to generate a reconstructed image based on the reconstruction of the non-keyframe image. Parameters of each image processed by the step, the pixel values of the pixels constituting the image, or it is quantized value for each conversion coefficient region obtained by converting the image by a predetermined conversion method It is characterized by.

第３の本発明の動画像符号化プログラムは、フレーム列を有する動画像を符号化する動画像符号化装置に搭載されたコンピュータを、（１）上記フレーム列のうちキーフレームを利用して、非キーフレームの予測画像を生成する予測画像生成手段と、（２）非キーフレームの複雑度合を計算する複雑度計算部と、（３）上記複雑度計算部の計算結果に基づいて、画像を構成する各パラメータの量子化に係るビットデプスを決定するビットデプス決定手段と、（４）上記ビットデプス決定手段で決定されたビットデプスで、非キーフレームの各パラメータ、及び、上記予測画像の各パラメータを量子化して、量子化後非キーフレーム及び量子化後予測画像を生成するものであって、最多のビットデプスよりも少ないビットデプスが決定されたパラメータについては、当該パラメータに対して決定されたビットデプス分の量子化データの最上位ビット側又は最下位ビット側のいずれかに、最多のビットデプスとの差分に相当する所定パターンのパディングビットを追加したデータを、当該パラメータの量子化データとして生成する量子化手段と、（５）量子化後非キーフレームに対する量子化後予測画像の誤りを訂正するための誤り訂正符号を生成する誤り訂正符号生成手段として機能させ、（６）上記ビットデプス決定手段及び上記量子化手段で処理される各画像のパラメータは、当該画像を構成する各画素の画素値、又は、当該画像を所定の変換方式で変換することで得られる変換係数領域ごとの量子化値であることを特徴とする。 A moving image encoding program according to a third aspect of the present invention provides a computer mounted on a moving image encoding apparatus that encodes a moving image having a frame sequence, using (1) a key frame in the frame sequence, A predicted image generating means for generating a predicted image of a non-key frame; (2) a complexity calculating unit for calculating a complexity level of the non-key frame; and (3) an image based on a calculation result of the complexity calculating unit. Bit depth determining means for determining a bit depth relating to quantization of each parameter constituting; (4) each bit parameter determined by the bit depth determining means, each parameter of the non-key frame, and each prediction image The parameter is quantized to generate a non-key frame after quantization and a predicted image after quantization, and a bit depth that is less than the maximum bit depth is determined. For the meter, padding bits of a predetermined pattern corresponding to the difference from the most bit depth are placed on either the most significant bit side or the least significant bit side of the quantized data for the bit depth determined for the parameter. Quantization means for generating the added data as quantized data of the parameter; and (5) an error correction code for generating an error correction code for correcting an error in the predicted image after quantization for the non-key frame after quantization. (6) The parameter of each image processed by the bit depth determination unit and the quantization unit is the pixel value of each pixel constituting the image, or the image is converted by a predetermined conversion method. characterized in that it is a quantized value for each conversion coefficient region obtained by converting.

第４の本発明の動画像復号プログラムは、フレーム列を有する動画像がフレーム単位に符号化された動画像データを復号する動画像復号装置に搭載されたコンピュータを、（１）キーフレームのデータを保持するキーフレーム保持手段と、（２）上記キーフレーム保持手段が保持したキーフレームを利用して、上記フレーム列のうち非キーフレームの予測画像を生成する予測画像生成手段と、（３）上記動画像データを構成する非キーフレームの符号化データの生成過程で、画像を構成する各パラメータの量子化に適用されたビットデプスに係るビットデプス情報を保持するビットデプス保持手段と、（４）上記ビットデプス保持手段で保持されたビットデプスで、上記予測画像の各パラメータを量子化して、量子化後予測画像を生成するものであって、最多のビットデプスよりも少ないビットデプスが決定されたパラメータについては、当該パラメータに対して設定されたビットデプス分の量子化データの最上位ビット側又は最下位ビット側のいずれかに、最多のビットデプスとの差分に相当する所定パターンのパディングビットを追加したデータを、当該パラメータの量子化データとして生成する量子化手段と、（５）上記動画像データを構成する非キーフレームの符号化データが、量子化後予測画像を訂正するための誤り訂正符号だった場合、その誤り訂正符号を用いて量子化後予測画像を訂正した訂正後画像を生成する誤り訂正手段と、（６）訂正後画像と、上記予測画像とに基づいて、非キーフレームの画像を再構成した再構成画像を生成する再構成手段として機能させ、（７）上記ビットデプス決定手段及び上記量子化手段で処理される各画像のパラメータは、当該画像を構成する各画素の画素値、又は、当該画像を所定の変換方式で変換することで得られる変換係数領域ごとの量子化値であることを特徴とする。 A moving image decoding program according to a fourth aspect of the present invention provides a computer mounted on a moving image decoding apparatus that decodes moving image data in which a moving image having a frame sequence is encoded in units of frames. (2) a predicted image generating unit that generates a predicted image of a non-key frame in the frame sequence using the key frame held by the key frame holding unit; and (3) Bit depth holding means for holding bit depth information related to bit depth applied to quantization of each parameter constituting an image in the process of generating encoded data of non-key frames constituting the moving image data; (4 ) The quantized prediction image is generated by quantizing each parameter of the prediction image with the bit depth held by the bit depth holding means. For a parameter for which a bit depth less than the maximum bit depth is determined, either the most significant bit side or the least significant bit side of the quantized data corresponding to the bit depth set for the parameter is set. , A quantization means for generating, as quantized data of the parameter, data added with a predetermined pattern of padding bits corresponding to the difference from the most bit depth, and (5) a non-key frame of the moving image data If the encoded data is an error correcting code for correcting the quantized predicted image, an error correcting means for generating a corrected image obtained by correcting the quantized predicted image using the error correcting code; (6 ) Based on the corrected image and the predicted image, it functions as a reconstruction unit that generates a reconstructed image obtained by reconstructing a non-keyframe image. (7) The parameters of each image processed by the bit depth determining means and the quantizing means are obtained by converting the pixel value of each pixel constituting the image or converting the image by a predetermined conversion method. characterized in that it is a quantized value for each conversion coefficient region.

第５の本発明は、フレーム列を有する動画像をフレーム単位に符号化した動画像データを生成する動画像符号化装置と、上記動画像符号化装置で生成された動画像データを復号する動画像復号装置とを備える動画像配信システムにおいて、（１）上記動画像符号化装置として第１の本発明の動画像符号化装置を適用し、（２）上記動画像復号装置として第２の本発明の動画像復号装置を適用したことを特徴とする。 According to a fifth aspect of the present invention, there is provided a moving image encoding device for generating moving image data obtained by encoding a moving image having a frame sequence in units of frames, and a moving image for decoding the moving image data generated by the moving image encoding device. In a moving image distribution system including an image decoding device, (1) the moving image encoding device according to the first aspect of the present invention is applied as the moving image encoding device, and (2) the second image as the moving image decoding device. The moving picture decoding apparatus of the invention is applied.

本発明によれば、復号画像の品質を低下させずに符号量を少なくすることができる動画像配信システムを提供することができる。 ADVANTAGE OF THE INVENTION According to this invention, the moving image delivery system which can reduce code amount, without reducing the quality of a decoded image can be provided.

第１の実施形態に係る動画像配信システムの全体構成について示したブロック図である。It is the block diagram shown about the whole structure of the moving image delivery system which concerns on 1st Embodiment. 第１の実施形態に係る動画像符号化装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image encoder which concerns on 1st Embodiment. 第１の実施形態に係る動画像復号装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image decoding apparatus which concerns on 1st Embodiment. 第１の実施形態に係る動画像配信システムの動作（動画像符号化装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image encoder side) of the moving image delivery system which concerns on 1st Embodiment. 第１の実施形態に係る動画像配信システムの動作（動画像復号装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image decoding apparatus side) of the moving image delivery system which concerns on 1st Embodiment. 第１の実施形態に係る動画像符号化装置で生成される複雑度及びビットデプス制御信号の内容について示した説明図である。It is explanatory drawing shown about the complexity produced | generated with the moving image encoder which concerns on 1st Embodiment, and the content of the bit depth control signal. 第１の実施形態に係る動画像符号化装置で行われる画像の量子化処理の例について示した説明図である。It is explanatory drawing shown about the example of the quantization process of the image performed with the moving image encoder which concerns on 1st Embodiment. 第２の実施形態に係る動画像符号化装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image encoder which concerns on 2nd Embodiment. 第２の実施形態に係る動画像配信システムの動作（動画像符号化装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image encoder side) of the moving image delivery system which concerns on 2nd Embodiment. 第３の実施形態に係る動画像符号化装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image encoder which concerns on 3rd Embodiment. 第３の実施形態に係る動画像復号装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image decoding apparatus which concerns on 3rd Embodiment. 第３の実施形態に係る動画像配信システムの動作（動画像符号化装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image encoder side) of the moving image delivery system which concerns on 3rd Embodiment. 第３の実施形態に係る動画像配信システムの動作（動画像復号装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image decoding apparatus side) of the moving image delivery system which concerns on 3rd Embodiment. 第４の実施形態に係る動画像符号化装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image encoder which concerns on 4th Embodiment. 第４の実施形態に係る動画像復号装置の機能的構成について示したブロック図である。It is the block diagram shown about the functional structure of the moving image decoding apparatus which concerns on 4th Embodiment. 第４の実施形態に係る動画像配信システムの動作（動画像符号化装置側の動作）について示したフローチャートである。It is the flowchart shown about operation | movement (operation | movement by the moving image encoder side) of the moving image delivery system which concerns on 4th Embodiment. 第４の実施形態に係る動画像配信システムの動作（動画像復号装置側の動作）について示したフローチャートである。It is the flowchart shown about the operation | movement (operation | movement by the moving image decoding apparatus side) of the moving image delivery system which concerns on 4th Embodiment.

（Ａ）第１の実施形態
以下、本発明による動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムの第１の実施形態を、図面を参照しながら詳述する。 (A) First Embodiment Hereinafter, a first embodiment of a moving image encoding apparatus and program, a moving image decoding apparatus and program, and a moving image distribution system according to the present invention will be described in detail with reference to the drawings. .

（Ａ−１）第１の実施形態の構成
動画像配信システム１は、動画像符号化装置１０及び動画像復号装置２０を有している。 (A-1) Configuration of First Embodiment The moving image distribution system 1 includes a moving image encoding device 10 and a moving image decoding device 20.

図１は、この実施形態の動画像配信システム１の全体構成を示すブロック図である。 FIG. 1 is a block diagram showing the overall configuration of a moving image distribution system 1 of this embodiment.

動画像配信システム１は、動画像符号化装置１０及び動画像復号装置２０を有している。 The moving image distribution system 1 includes a moving image encoding device 10 and a moving image decoding device 20.

動画像符号化装置１０は、フレーム単位（画像単位）で入力される入力動画像信号（入力映像信号）を符号化して、符号化したデータをストリーム化（ビットストリーム化）して出力する。図１に示すように、動画像符号化装置１０では、入力動画像信号を構成する各画像が、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１とＫｅｙ画像Ｆ１２とに分けて符号化される。そして、動画像符号化装置１０は、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１を符号化した符号化データのストリーム（以下、「Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１」と呼ぶ）と、Ｋｅｙ画像Ｆ１２を符号化したデータのストリーム（以下、「ＫｅｙストリームＳＴ２」と呼ぶ）と、後述するビットデプス伝達符号Ｃ１（詳細については後述する）を出力する。 The moving image encoding device 10 encodes an input moving image signal (input video signal) input in frame units (image units), and streams the encoded data (bit stream) to output. As shown in FIG. 1, in the moving image encoding device 10, each image constituting the input moving image signal is encoded by being divided into a Wyner-Ziv image F11 and a Key image F12. Then, the moving image encoding apparatus 10 encodes a stream of encoded data obtained by encoding the Wyner-Ziv image F11 (hereinafter referred to as “Slepian-Wolf stream ST1”), and a stream of data obtained by encoding the Key image F12 ( Hereinafter, it is referred to as “Key stream ST2”) and a bit depth transfer code C1 (described later in detail) is output.

動画像復号装置２０は、動画像符号化装置１０から出力された符号化データ（Ｓｌｅｐｉａｎ−Ｗｏｌｆストリーム、及びＫｅｙストリーム）を復号して、復号画像（復号フレーム）を生成し、フレーム単位（画像単位）で復号動画像信号（復号動画像データ）として出力するものである。動画像復号装置２０は、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を復号して得たＷｙｎｅｒ−Ｚｉｖ復号画像Ｆ４１と、ＫｅｙストリームＳＴ２を復号して得た復号画像（Ｋｅｙ画像Ｆ１２）とを合わせた復号動画像信号を出力する。 The video decoding device 20 decodes the encoded data (Slepian-Wolf stream and Key stream) output from the video encoding device 10, generates a decoded image (decoded frame), and generates a frame unit (image unit). ) Is output as a decoded moving image signal (decoded moving image data). The moving picture decoding apparatus 20 combines a decoded moving picture signal obtained by decoding a Wyner-Ziv decoded picture F41 obtained by decoding the Slepian-Wolf stream ST1 and a decoded picture (Key picture F12) obtained by decoding the Key stream ST2. Is output.

なお、動画像符号化装置１０が出力するデータ形式、及び動画像復号装置２０に入力されるデータ形式については限定されないものである。例えば、動画像符号化装置１０から、リアルタイムデータ伝送に適したストリーム形式ではなく、同様のデータを１つのデータファイルとして出力して、オフライン（例えば、ハードディスク等のデータ記録媒体）で動画像復号装置２０に供給するようにしてもよい。 Note that the data format output by the video encoding device 10 and the data format input by the video decoding device 20 are not limited. For example, instead of the stream format suitable for real-time data transmission from the moving image encoding device 10, similar data is output as one data file, and the moving image decoding device is offline (for example, a data recording medium such as a hard disk). 20 may be supplied.

次に、動画像符号化装置１０の内部構成について図２を用いて説明する。 Next, the internal configuration of the moving picture coding apparatus 10 will be described with reference to FIG.

動画像符号化装置１０は、ビットデプス制御部１０１、符号化対象画像量子化部１０４、エンコーダ予測画像生成部１０５、エンコーダ予測画像量子化部１０６、レート制御部１０７、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８、ビットデプス通知部１０９、及びＫｅｙ画像符号化部１２０を有している。また、ビットデプス制御部１０１は、複雑度計算部１０２、及び制御部１０３を有している。 The moving image encoding apparatus 10 includes a bit depth control unit 101, an encoding target image quantization unit 104, an encoder predicted image generation unit 105, an encoder predicted image quantization unit 106, a rate control unit 107, and a Slepian-Wolf encoding unit 108. , A bit depth notification unit 109 and a key image encoding unit 120. The bit depth control unit 101 includes a complexity calculation unit 102 and a control unit 103.

Ｋｅｙ画像符号化部１２０は、画面内符号化処理（例えば、Ｈ．２６４／ＡＶＣやＪＰＥＧ等の符号化技術による符号化処理）を行って、Ｋｅｙ画像Ｆ１２を符号化し、その符号化データをストリーム化して、ＫｅｙストリームＳＴ２として出力するものである。 The Key image encoding unit 120 performs an intra-screen encoding process (for example, an encoding process using an encoding technique such as H.264 / AVC or JPEG), encodes the Key image F12, and streams the encoded data. And output as a Key stream ST2.

エンコーダ予測画像生成部１０５は、Ｋｅｙ画像Ｆ１２からエンコーダ予測画像Ｆ２１を生成するものである。エンコーダ予測画像生成部１０５がエンコーダ予測画像Ｆ２１を生成する方式については限定されないものであり、ＤＶＣシステムを採用したシステムにおける種々の生成方式を適用することができる。 The encoder predicted image generation unit 105 generates an encoder predicted image F21 from the Key image F12. The method by which the encoder predicted image generation unit 105 generates the encoder predicted image F21 is not limited, and various generation methods in a system employing the DVC system can be applied.

エンコーダ予測画像量子化部１０６は、エンコーダ予測画像Ｆ２１を量子化（例えば、画素ごとにビット列のデータに変換）し、量子化後エンコーダ予測画像Ｆ２２を生成するものである。 The encoder predicted image quantization unit 106 quantizes the encoder predicted image F21 (for example, converts it into bit string data for each pixel), and generates a quantized encoder predicted image F22.

レート制御部１０７は、量子化後エンコーダ予測画像Ｆ２２と量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３からＳｌｅｐｉａｎ−Ｗｏｌｆ符号に必要な符号量を推定し、その推定結果をレートＲとして出力するものである。レート制御部１０７が符号量を推定する具体的処理については、限定されないものであり、ＤＶＣを採用したシステムにおける種々の推定処理を適用することができる。 The rate control unit 107 estimates a code amount necessary for the Slepian-Wolf code from the quantized encoder predicted image F22 and the quantized Wyner-Ziv image F23, and outputs the estimation result as a rate R. The specific process in which the rate control unit 107 estimates the code amount is not limited, and various estimation processes in a system employing DVC can be applied.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８は、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３をＳｌｅｐｉａｎ−Ｗｏｌｆ符号化し、レートＲに基づく符号量で、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を生成するものである。Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８がＳｌｅｐｉａｎ−Ｗｏｌｆ符号化処理を行う具体的方式については、限定されないものであり、ＤＶＣを採用したシステムにおける種々の符号化処理を適用することができる。 The Slepian-Wolf encoding unit 108 performs Slepian-Wolf encoding on the quantized Wyner-Ziv image F23, and generates a Slepian-Wolf stream ST1 with a code amount based on the rate R. The specific method in which the Slepian-Wolf encoding unit 108 performs the Slepian-Wolf encoding process is not limited, and various encoding processes in a system employing DVC can be applied.

符号化対象画像量子化部１０４は、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１とエンコーダ予測画像Ｆ２１と、ビットデプス通知部１０９の制御（後述するビットデプス制御信号Ｓ１２）とに基づき、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３を生成するものである。 The encoding target image quantizing unit 104 is a quantized Wyner-Ziv image F23 based on the Wyner-Ziv image F11, the encoder predicted image F21, and the control of the bit depth notifying unit 109 (a bit depth control signal S12 described later). Is generated.

ビットデプス制御部１０１は、フレームを任意の大きさのブロックに分割し、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１を構成するブロックごとに、画像の量子化に用いるパラメータ（後述するビットデプス）を計算するものであり、複雑度計算部１０２及び制御部１０３を有している。 The bit depth control unit 101 divides a frame into blocks of an arbitrary size, and calculates a parameter (bit depth described later) used for image quantization for each block constituting the Wyner-Ziv image F11. And a complexity calculation unit 102 and a control unit 103.

複雑度計算部１０２は、フレーム（Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１）を所定の大きさのブロックに分割して（この実施形態では、例として４×４のブロックに分割するものとする）、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１を構成するブロックごとに、複雑度を計算する。そして、複雑度計算部１０２は、ブロックごとに求めた複雑度を複雑度Ｓ１１として出力する。 The complexity calculation unit 102 divides the frame (Wyner-Ziv image F11) into blocks of a predetermined size (in this embodiment, it is assumed to be divided into 4 × 4 blocks as an example), and Wyner-Ziv. The complexity is calculated for each block constituting the image F11. The complexity calculator 102 outputs the complexity obtained for each block as the complexity S11.

複雑度Ｓ１１が複雑度を計算する方法については限定されないものであるが、例えば、ブロックごとの交流成分の大きさや、エッジの強さ等に基づいて複雑度を計算するようにしてもよい。 The method of calculating the complexity by the complexity S11 is not limited. For example, the complexity may be calculated based on the size of the AC component for each block, the strength of the edge, or the like.

まず、複雑度Ｓ１１がブロックごとの交流成分に基づいて、各ブロックの複雑度を計算する例について説明する。複雑度計算部１０２は、例えば、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の各ブロック内の画素値の平均値を、全てのブロックの画素値（画像全体の画素値）の平均値と比較し、ブロックごとの差分が大きいほど当該ブロックの交流成分が大きい（すなわち、複雑度が高い）と判定するようにしてもよい。また、例えば、動画像符号化装置１０が、非特許文献４の記載技術のように、スケーラブル符号化機構を備える場合、複雑度計算部１０２は、スケーラブル符号化機構が生成したＷｙｎｅｒ−Ｚｉｖ画像Ｆ１１の縮小画像を拡大処理した拡大画像を生成し、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１と該拡大画像との差分画像の信号の大きさを評価することで、複雑度Ｓ１１を計算するようにしてもよい。また、複雑度計算部１０２は、該縮小画像を直流成分の情報とみなし、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１から差し引くことで、交流成分を求め、交流成分の大きさを評価することで複雑度Ｓ１１を計算するようにしてもよい。 First, an example in which the complexity S11 calculates the complexity of each block based on the AC component for each block will be described. For example, the complexity calculation unit 102 compares the average value of the pixel values in each block of the Wyner-Ziv image F11 with the average value of the pixel values of all the blocks (the pixel values of the entire image), and calculates the difference for each block. It may be determined that the AC component of the block is larger (that is, the complexity is higher) as is larger. Further, for example, when the moving image encoding apparatus 10 includes a scalable encoding mechanism as in the technique described in Non-Patent Document 4, the complexity calculation unit 102 generates the Wyner-Ziv image F11 generated by the scalable encoding mechanism. An enlarged image obtained by enlarging the reduced image may be generated, and the complexity S11 may be calculated by evaluating the magnitude of the signal of the difference image between the Wyner-Ziv image F11 and the enlarged image. Further, the complexity calculation unit 102 regards the reduced image as DC component information, calculates the AC component by subtracting it from the Wyner-Ziv image F11, and calculates the complexity S11 by evaluating the size of the AC component. You may make it do.

次に、複雑度Ｓ１１がブロックごとのエッジの強さに基づいて、各ブロックの複雑度を計算する例について説明する。複雑度計算部１０２は、例えば、まず、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の全体にエッジを検出するためのフィルタ（例えば、Ｓｏｂｅｌフィルタ等のフィルタ）を用いて、画素単位でエッジ部分に属するか否かを検出する。そして、複雑度計算部１０２は、ブロックごとに、エッジ部分に属する画素（以下、「エッジ画素」と呼ぶ）の数を数え、エッジ画素数が多いブロックほど、エッジが強い（すなわち、複雑度が高い）ブロックと判定するようにしてもよい。 Next, an example in which the complexity S11 calculates the complexity of each block based on the edge strength for each block will be described. For example, the complexity calculation unit 102 first uses a filter (for example, a filter such as a Sobel filter) for detecting an edge in the entire Wyner-Ziv image F11 to determine whether or not the pixel belongs to the edge portion. To detect. The complexity calculation unit 102 counts the number of pixels belonging to the edge portion (hereinafter referred to as “edge pixels”) for each block, and the block having a larger number of edge pixels has a stronger edge (that is, the complexity is higher). It may be determined that the block is high.

この実施形態では、説明を簡易とするために、複雑度Ｓ１１がブロックごとに算出する複雑度は０又は１のいずれかの数値により示されるものとする。言い換えると、この実施形態では、複雑度Ｓ１１がブロックごとに算出する複雑度は２段階の度合いでしめされるものとする。 In this embodiment, in order to simplify the description, the complexity calculated by the complexity S11 for each block is indicated by a numerical value of 0 or 1. In other words, in this embodiment, the complexity calculated by the complexity S11 for each block is assumed to be expressed in two steps.

例えば、複雑度計算部１０２は、ブロックごとに複雑度の基礎となる１又は複数のパラメータ（例えば、交流成分及び又はエッジの強さ）を算出し、算出したパラメータに基づいて、当該ブロックについて複雑度（０又は１）を決定するようにしてもよい。例えば、複雑度計算部１０２は、所定の閾値ｋと各ブロックの交流成分ＡＣを比較し、ＡＣ≧ｋの場合に当該ブロックの複雑度を１とし、そうでない場合に当該ブロックの複雑度を０とするようにしてもよい。 For example, the complexity calculation unit 102 calculates one or a plurality of parameters (for example, an AC component and / or edge strength) as a basis of complexity for each block, and the complexity of the block is calculated based on the calculated parameters. The degree (0 or 1) may be determined. For example, the complexity calculation unit 102 compares a predetermined threshold value k with the AC component AC of each block, sets the complexity of the block to 1 when AC ≧ k, and sets the complexity of the block to 0 otherwise. You may make it.

制御部１０３は、複雑度計算部１０２から供給される複雑度Ｓ１１に基づいて、ブロックごとに、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の量子化に用いる画素ごとのビット数（以下、「ビットデプス」と呼ぶものとする）を計算する。そして、制御部１０３は、ブロックごとに求めたビットデプスを、ビットデプス制御信号Ｓ１１として出力する。 Based on the complexity S11 supplied from the complexity calculation unit 102, the control unit 103 uses, for each block, the number of bits for each pixel used for quantization of the Wyner-Ziv image F11 (hereinafter referred to as “bit depth”). )). And the control part 103 outputs the bit depth calculated | required for every block as the bit depth control signal S11.

ビットデプス制御信号Ｓ１２は、たとえば、複雑度Ｓ１１が高い場合に、予測画像の予測困難性が高いとみなして、高いビットデプスで符号化するようにビットデプス制御信号Ｓ１２を生成し、複雑度Ｓ１１が低い場合に、予測画像の予測困難性が低いとみなして、低いビットデプスで符号化するようにビットデプス制御信号Ｓ１２を生成する。 For example, when the complexity S11 is high, the bit depth control signal S12 is considered to have high prediction difficulty in the predicted image, and the bit depth control signal S12 is generated so as to be encoded with a high bit depth. Is low, the bit depth control signal S12 is generated so that the prediction difficulty of the predicted image is low and the coding is performed with a low bit depth.

この実施形態の複雑度Ｓ１１は、上述のように、０又は１で表される。また、この実施形態において、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の各画素値は０〜２５５（１０進数）のいずれかの整数で表されるものとする。すなわち、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の各画素値は、８桁の２進数（８ビットのデータ）で表されることになる。 The complexity S11 of this embodiment is represented by 0 or 1 as described above. In this embodiment, each pixel value of the Wyner-Ziv image F11 is represented by an integer from 0 to 255 (decimal number). That is, each pixel value of the Wyner-Ziv image F11 is represented by an 8-digit binary number (8-bit data).

そこで、この実施形態では、制御部１０３は、複雑度Ｓ１１において複雑度が０と設定されたブロックのビットデプスを３、複雑度Ｓ１１において複雑度が１と設定されたブロックのビットデプスを５と決定するものとして説明する。 Therefore, in this embodiment, the control unit 103 sets the bit depth of the block in which the complexity is set to 0 in the complexity S11 to 3, and the bit depth of the block in which the complexity is set to 1 in the complexity S11 to 5. It will be described as being determined.

次に、動画像復号装置２０の内部構成について図３を用いて説明する。 Next, the internal configuration of the video decoding device 20 will be described with reference to FIG.

動画像復号装置２０は、ビットデプス受取部２００、デコーダ予測画像生成部２０１、デコーダ予測画像量子化部２０２、ＬＬＲ推定部２０３、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４、再構成部２０５及びＫｅｙ画像復号部２２０を有している。 The video decoding device 20 includes a bit depth receiving unit 200, a decoder predicted image generation unit 201, a decoder predicted image quantization unit 202, an LLR estimation unit 203, a Lepian-Wolf decoding unit 204, a reconstruction unit 205, and a Key image decoding unit 220. have.

Ｋｅｙ画像復号部２２０は、ＫｅｙストリームＳＴ２を復号処理（動画像符号化装置１０側に対応する復号処理）して、Ｋｅｙ画像Ｆ１２を得るものである。なお、この実施形態では、説明を簡易とするため、Ｋｅｙ画像復号部２２０において、ＫｅｙストリームＳＴ２を復元して、符号化側に入力されるＫｅｙ画像Ｆ１２と同じ内容の復元画像が得られるものとする。Ｋｅｙ画像復号部２２０で得られる復号画像は、できるだけ符号化側に入力されるＫｅｙ画像に近い内容であることが望ましいが、完全に同一でなくてもよい。 The key image decoding unit 220 decodes the key stream ST2 (decoding process corresponding to the moving image encoding device 10 side) to obtain a key image F12. In this embodiment, in order to simplify the description, the key image decoding unit 220 restores the key stream ST2, and a restored image having the same content as the key image F12 input to the encoding side is obtained. To do. The decoded image obtained by the key image decoding unit 220 is preferably as close as possible to the key image input to the encoding side, but may not be completely the same.

ビットデプス受取部２００は、動画像符号化装置１０から供給されたビットデプス伝達符号Ｃ１から、ビットデプス制御信号Ｓ１１を復元して取得するものである。 The bit depth receiving unit 200 restores and acquires the bit depth control signal S11 from the bit depth transmission code C1 supplied from the moving image encoding device 10.

デコーダ予測画像生成部２０１は、Ｋｅｙ画像復号部２２０で取得されたＫｅｙ画像Ｆ１２から、デコーダ予測画像Ｆ３１を生成するものである。デコーダ予測画像生成部２０１がデコーダ予測画像Ｆ３１を生成する方式については限定されないものであり、ＤＶＣ方式を採用したシステムにおける種々の生成方式を適用することができる。 The predicted decoder image generation unit 201 generates a predicted decoder image F31 from the Key image F12 acquired by the Key image decoding unit 220. The method by which the decoder predicted image generation unit 201 generates the decoder predicted image F31 is not limited, and various generation methods in a system employing the DVC method can be applied.

デコーダ予測画像量子化部２０２は、デコーダ予測画像Ｆ３１を量子化（例えば、画素ごとにビット列のデータに変換）し、量子化後デコーダ予測画像Ｆ３２を生成するものである。 The decoder predicted image quantization unit 202 quantizes the decoder predicted image F31 (for example, converts it into bit string data for each pixel), and generates a quantized decoder predicted image F32.

ＬＬＲ推定部２０３は、量子化後デコーダ予測画像Ｆ３２から量子化されたビット列の各ビットについてＬＬＲ（ＬｏｇＬｉｋｅｌｉｈｏｏｄＲａｔｉｏ）を推定し、ＬＬＲ通知信号Ｓ２２として出力するものである。 The LLR estimation unit 203 estimates an LLR (Log Likelihood Ratio) for each bit of the bit string quantized from the quantized decoder predicted image F32, and outputs the LLR notification signal S22.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４は、ＬＬＲ通知信号Ｓ２２とＳｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１から、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３を生成するものである。 The Slepian-Wol decoding unit 204 generates a corrected Wyner-Ziv image F33 from the LLR notification signal S22 and the Slepian-Wolf stream ST1.

動画像符号化装置１０では、ＬＬＲ推定部２０３及びＳｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４により、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号を行う復号手段が形成されている。そして、ＬＬＲ推定部２０３及びＳｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４が行う具体的処理としては、種々のＤＶＣシステムにおける復号手段を適用することができる。 In the moving image encoding device 10, the LLR estimation unit 203 and the Slepian-Wol decoding unit 204 form a decoding unit that performs Slepian-Wolf decoding. And as a concrete process which the LLR estimation part 203 and the Slepian-Wol decoding part 204 perform, the decoding means in various DVC systems is applicable.

再構成部２０５は、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３と、デコーダ予測画像Ｆ３１とを用いて、画像の再構成処理（逆量子化しながらＷｙｎｅｒ−Ｚｉｖ画像Ｆ１１を推定する処理）を行い、Ｗｙｎｅｒ−Ｚｉｖ復号画像Ｆ４１を生成するものである。なお、再構成部２０５の詳細処理については後述する。 The reconstruction unit 205 uses the corrected Wyner-Ziv image F33 and the decoder predicted image F31 to perform image reconstruction processing (processing for estimating the Wyner-Ziv image F11 while performing inverse quantization), and Wyner-Ziv. The decoded image F41 is generated. Detailed processing of the reconstruction unit 205 will be described later.

（Ａ−２）第１の実施形態の動作
次に、以上のような構成を有する第１の実施形態の動画像配信システム１の動作を説明する。 (A-2) Operation of the First Embodiment Next, the operation of the moving image distribution system 1 of the first embodiment having the above configuration will be described.

（Ａ−２−１）動画像符号化装置１０の動作
まず、動画像配信システム１の動画像符号化装置１０側の動作について、図４のフローチャートを用いて説明する。 (A-2-1) Operation of Moving Image Encoding Device 10 First, the operation on the moving image encoding device 10 side of the moving image distribution system 1 will be described with reference to the flowchart of FIG.

まず、複雑度計算部１０２において、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１の各ブロックについて複雑度Ｓ１１が計算され、その計算結果が複雑度Ｓ１１として出力されたものとする（Ｓ１０１）。 First, it is assumed that the complexity calculation unit 102 calculates the complexity S11 for each block of the Wyner-Ziv image F11 and outputs the calculation result as the complexity S11 (S101).

制御部１０３において、複雑度Ｓ１１に基づき、各ブロックのビットデプスが計算され、その計算結果が、ビットデプス制御信号Ｓ１２として出力される（Ｓ１０２）。 The control unit 103 calculates the bit depth of each block based on the complexity S11, and outputs the calculation result as the bit depth control signal S12 (S102).

図６は、複雑度Ｓ１１及びビットデプス制御信号Ｓ１２の内容について概念的に図示した例である。図６では、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１について、縦４ブロック×横４ブロック（１６のブロック）に分割した構成となっている。図６（ａ）、図６（ｂ）は、それぞれ複雑度Ｓ１１、ビットデプス制御信号Ｓ１２の内容を概念的に表している。図６では、四角形の枠のそれぞれを１つのブロックとし、当該四角形の中に当該ブロックに対応するパラメータ（複雑度又はビットデプス）となっている。 FIG. 6 is an example conceptually illustrating the contents of the complexity S11 and the bit depth control signal S12. In FIG. 6, the Wyner-Ziv image F11 is divided into 4 vertical blocks × 4 horizontal blocks (16 blocks). FIGS. 6A and 6B conceptually show the contents of the complexity S11 and the bit depth control signal S12, respectively. In FIG. 6, each square frame is a block, and the parameters (complexity or bit depth) corresponding to the block are included in the square.

図６では、一番左側の列について、上方向のブロックから順に、ブロックＢ１、Ｂ２、Ｂ３、Ｂ４という符号を付している。また、図６（ａ）では、ブロックＢ１〜Ｂ４の複雑度が、それぞれ１、０、０、１となっている。さらに、図６（ｂ）では、ブロックＢ１〜Ｂ４のビットデプスがそれぞれ図６（ａ）の複雑度に対応したビットデプスとなっている。具体的には、図６（ｂ）では、ブロックＢ１〜Ｂ４のビットデプスがそれぞれ５、３、３、５となっている。 In FIG. 6, for the leftmost column, blocks B1, B2, B3, and B4 are assigned in order from the upper block. In FIG. 6A, the complexity of the blocks B1 to B4 is 1, 0, 0, and 1, respectively. Further, in FIG. 6B, the bit depths of the blocks B1 to B4 are bit depths corresponding to the complexity of FIG. 6A, respectively. Specifically, in FIG. 6B, the bit depths of the blocks B1 to B4 are 5, 3, 3, and 5, respectively.

符号化対象画像量子化部１０４において、ビットデプス制御信号Ｓ１２に設定された各ブロックのビットデプスを用いて、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１が量子化され、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３が生成される（Ｓ１０３）。 In the encoding target image quantization unit 104, the Wyner-Ziv image F11 is quantized using the bit depth of each block set in the bit depth control signal S12, and a post-quantization Wyner-Ziv image F23 is generated. (S103).

図７は、符号化対象画像量子化部１０４により、ビットデプス制御信号Ｓ１２に基づく量子化処理の例について示している。 FIG. 7 shows an example of quantization processing based on the bit depth control signal S12 by the encoding target image quantization unit 104.

図７（ａ）〜（ｄ）は、それぞれ、図６に示すブロックＢ１〜Ｂ４を構成する１つの画素を量子化する例について示している。 FIGS. 7A to 7D show examples of quantizing one pixel constituting each of the blocks B1 to B4 shown in FIG.

符号化対象画像量子化部１０４では、まず、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１を構成する各ブロックの画素値（０〜２５５の１０進数）を８ビットの２進数に変換し、ＭＳＢ（最上位ビット）からビットデプス分のビットデータを、量子化値として取得する。例えば、図７（ａ）では、ブロックＢ１を構成する任意の画素の画素値を２進数に変換すると「１０１１１０１１」であり、ブロックＢ１のビットデプスは５となっている。したがって、符号化対象画像量子化部１０４は、当該画素の量子化値として、ＭＳＢから５ビット分のデータである「１０１１１」を取得する。 The encoding target image quantization unit 104 first converts the pixel value (decimal number from 0 to 255) of each block constituting the Wyner-Ziv image F11 into an 8-bit binary number, and starts from the MSB (most significant bit). Bit data for bit depth is acquired as a quantized value. For example, in FIG. 7A, when the pixel value of an arbitrary pixel constituting the block B1 is converted to a binary number, it is “10111011”, and the bit depth of the block B1 is 5. Therefore, the encoding target image quantization unit 104 acquires “10111”, which is data of 5 bits, from the MSB as the quantization value of the pixel.

ただし、この実施形態の符号化対象画像量子化部１０４は、各ブロックでビットデプスが異なる場合には、全てのブロックについて、最も長いビットデプスに合わせた桁数のビットデータを出力するものとする。例えば、図６の例では、ビットデプス制御信号Ｓ１２において、ビットデプスが３と５のブロックが混在している。したがって、符号化対象画像量子化部１０４は、最も長いビットデプスである５に正規化したビットデータを量子化値として出力するものとする。この場合、最も長いビットデプスのブロックよりも短いビットデプスのブロックでは、正規化に不足するビットデータが発生するが、符号化対象画像量子化部１０４は、この不足する分のビットデータとして、任意のパターン（以下、このパターンを「シンボル」とも呼ぶ）のダミーデータ（以下、「パディングビット」とも呼ぶ）を埋め込むものとする。この実施形態では、符号化対象画像量子化部１０４は、パディングビットに用いるシンボルとして、全てが「１」となるビットパターンを適用するものとして説明する。 However, when the bit depth is different in each block, the encoding target image quantization unit 104 of this embodiment outputs bit data having the number of digits corresponding to the longest bit depth for all blocks. . For example, in the example of FIG. 6, in the bit depth control signal S12, blocks having a bit depth of 3 and 5 are mixed. Therefore, the encoding target image quantization unit 104 outputs bit data normalized to 5 which is the longest bit depth as a quantized value. In this case, bit data that is insufficient for normalization is generated in a bit depth block that is shorter than the longest bit depth block. However, the encoding target image quantizing unit 104 arbitrarily selects bit data that is insufficient as bit data. The dummy data (hereinafter also referred to as “padding bits”) of the above pattern (hereinafter also referred to as “symbol”) is embedded. In this embodiment, the encoding target image quantization unit 104 will be described as applying a bit pattern that is all “1” as symbols used for padding bits.

例えば、図７（ｂ）では、ブロックＢ２を構成する任意の画素の画素値を２進数に変換すると「００１０１１０１」であり、ブロックＢ２のビットデプスは３となっている。しかし、画像全体で最大のビットデプスは５であるため、符号化対象画像量子化部１０４は、ＭＳＢから順に、ビットデプス分のビットデータ（３ビット分のデータ）を取得し、不足する分のパディングビット「１１」を追加したビットデータを量子化値として取得する。 For example, in FIG. 7B, when the pixel value of an arbitrary pixel constituting the block B2 is converted into a binary number, it is “00101101”, and the bit depth of the block B2 is 3. However, since the maximum bit depth of the entire image is 5, the encoding target image quantization unit 104 acquires bit data corresponding to bit depth (data corresponding to 3 bits) in order from the MSB. Bit data to which padding bit “11” is added is acquired as a quantized value.

シンボルとして用いるビットパターンは任意であるが、動画像配信システム１内の他の量子化部でも共通のシンボルを用いる必要がある。また、図７では、パディングビットをＬＳＢ側に埋め込んでいるが、ＭＳＢ側に埋め込んでも良い。ただし、パディングビットの埋め込み場所についても、動画像配信システム１内の他の量子化部や再構成部と一致している必要がある。 Although the bit pattern used as a symbol is arbitrary, it is necessary to use a common symbol also in other quantization units in the moving image distribution system 1. In FIG. 7, padding bits are embedded on the LSB side, but may be embedded on the MSB side. However, the padding bit embedding location also needs to coincide with other quantization units and reconstruction units in the moving image distribution system 1.

そして、エンコーダ予測画像生成部１０５において、Ｋｅｙ画像Ｆ１２から、動き推定などを用いて、エンコーダ予測画像Ｆ２１が生成される（Ｓ１０４）。 Then, the encoder predicted image generation unit 105 generates an encoder predicted image F21 from the key image F12 using motion estimation or the like (S104).

エンコーダ予測画像量子化部１０６において、ビットデプス制御信号Ｓ１２を用いて、エンコーダ予測画像Ｆ２１が量子化され、量子化後エンコーダ予測画像Ｆ２２が生成される（Ｓ１０５）。なお、エンコーダ予測画像Ｆ２１における量子化処理の方法は、符号化対象画像量子化部１０４と同様であるため、詳しい説明は省略する。 In the encoder predicted image quantization unit 106, the encoder predicted image F21 is quantized using the bit depth control signal S12, and a quantized encoder predicted image F22 is generated (S105). Note that the method of quantization processing in the encoder predicted image F21 is the same as that of the encoding target image quantization unit 104, and thus detailed description thereof is omitted.

レート制御部１０７において、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３と量子化後エンコーダ予測画像Ｆ２２とを比較して、動画像復号装置２０（デコーダ）で復号に必要とするＳｌｅｐｉａｎ−Ｗｏｌｆ符号化のレートＲが生成される（Ｓ１０６）。 In the rate control unit 107, the quantized Wyner-Ziv image F23 and the quantized encoder predicted image F22 are compared, and the rate R of the Slepian-Wolf encoding required for decoding by the video decoding device 20 (decoder). Is generated (S106).

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８において、レートＲに基づき、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３がＳｌｅｐｉａｎ−Ｗｏｌｆ符号化され、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１が生成される（Ｓ１０７）。 In the Slepian-Wolf encoding unit 108, the quantized Wyner-Ziv image F23 is subjected to Slepian-Wolf encoding based on the rate R, and a Slepian-Wolf stream ST1 is generated (S107).

ビットデプス通知部１０９において、ビットデプス制御信号Ｓ１２がビットデプス伝達符号Ｃ１に変換され出力される（Ｓ１０８）。なお、ビットデプス制御信号Ｓ１２は、ブロック単位の情報であるため、画像データに比べると、ビットデプス伝達符号Ｃ１の符号量は僅かである。 In the bit depth notification unit 109, the bit depth control signal S12 is converted into a bit depth transfer code C1 and output (S108). Since the bit depth control signal S12 is information in units of blocks, the bit depth transmission code C1 has a small amount of code compared to the image data.

（Ａ−２−２）動画像復号装置２０の動作
次に、動画像復号装置２０の復号処理の動作について、図５のフローチャートを用いて説明する。 (A-2-2) Operation of Video Decoding Device 20 Next, the operation of the decoding process of the video decoding device 20 will be described using the flowchart of FIG.

まず、ビットデプス受取部２００において、入力されたビットデプス伝達符号Ｃ１に基づいて、ビットデプス制御信号Ｓ１１が復元される（Ｓ１０９）。 First, the bit depth receiving unit 200 restores the bit depth control signal S11 based on the input bit depth transmission code C1 (S109).

そして、デコーダ予測画像生成部２０１において、動き推定などを用いて、Ｋｅｙ画像Ｆ１２からデコーダ予測画像Ｆ３１が生成される（Ｓ１１０）。 Then, the decoder predicted image generation unit 201 generates a decoder predicted image F31 from the Key image F12 using motion estimation or the like (S110).

デコーダ予測画像量子化部２０２において、ビットデプス制御信号Ｓ１２を用いて、デコーダ予測画像Ｆ３１が量子化され、量子化後デコーダ予測画像Ｆ３２が生成される（Ｓ１１１）。なお、デコーダ予測画像量子化部２０２における量子化処理の方法は、符号化対象画像量子化部１０４と同様であるため、詳しい説明は省略する。 The decoder predicted image quantization unit 202 quantizes the decoder predicted image F31 using the bit depth control signal S12, and generates a quantized decoder predicted image F32 (S111). Note that the method of quantization processing in the decoder predicted image quantization unit 202 is the same as that of the encoding target image quantization unit 104, and thus detailed description thereof is omitted.

ＬＬＲ推定部２０３において、量子化後デコーダ予測画像Ｆ３２から、ＬＬＲが推定され、推定されたＬＬＲに基づいたＬＬＲ通知信号Ｓ２２が出力される（Ｓ１１２）。 The LLR estimation unit 203 estimates an LLR from the quantized decoder predicted image F32, and outputs an LLR notification signal S22 based on the estimated LLR (S112).

Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４において、ＬＬＲ通知信号Ｓ２２とＳｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を用いてＳｌｅｐｉａｎ−Ｗｏｌｆ復号が行われ、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３が生成される（Ｓ１１３）。 In the Slepian-Wol decoding unit 204, Slepian-Wolf decoding is performed using the LLR notification signal S22 and the Slepian-Wolf stream ST1, and a corrected Wyner-Ziv image F33 is generated (S113).

最後に、再構成部２０５において、ビットデプス制御信号Ｓ１２と、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３と、デコーダ予測画像Ｆ３１とに基づいて、Ｗｙｎｅｒ−Ｚｉｖ復号画像Ｆ４１が生成される（Ｓ１１４）。 Finally, the reconstruction unit 205 generates a Wyner-Ziv decoded image F41 based on the bit depth control signal S12, the corrected Wyner-Ziv image F33, and the predicted decoder image F31 (S114).

なお、上述の通り、動画像配信システム１を構成する各量子化部において、ビットデプス制御信号Ｓ１２に基づくパディングビットの埋め込み位置は共通である。したがって、再構成部２０５では、ビットデプス制御信号Ｓ１２に基づいて、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３に含まれるパディングビットの位置（すなわち、パディングビットが含まれるブロックに所属する画素の量子化値）を認識することができる。したがって、再構成部２０５では、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３のうち、パディングビットを無視（捨象）した再構成処理を行うことができる。 As described above, the padding bit embedding position based on the bit depth control signal S12 is common in each quantization unit constituting the moving image distribution system 1. Therefore, the reconstruction unit 205 determines the position of the padding bit included in the corrected Wyner-Ziv image F33 (that is, the quantized value of the pixel belonging to the block including the padding bit) based on the bit depth control signal S12. Can be recognized. Accordingly, the reconstruction unit 205 can perform reconstruction processing in which padding bits are ignored (discarded) in the corrected Wyner-Ziv image F33.

（Ａ−３）第１の実施形態の効果
第１の実施形態によれば、以下のような効果を奏することができる。 (A-3) Effects of First Embodiment According to the first embodiment, the following effects can be achieved.

動画像符号化装置１０では、複雑度の高い領域について、予測画像（エンコーダ予測画像Ｆ２１、デコーダ予測画像Ｆ３１）の品質が低いとみなし、効果的に多くの誤り訂正符号を割り当てる（多いビットデプスを割当てる）ことができる。これにより、動画像符号化装置１０における符号化効率及び動画像復号装置２０での復号品質を向上させることができる。 The moving image coding apparatus 10 regards a region having high complexity as low quality of the prediction image (encoder prediction image F21, decoder prediction image F31), and effectively allocates many error correction codes (a large bit depth is set). Can be assigned). Thereby, the encoding efficiency in the moving image encoder 10 and the decoding quality in the moving image decoder 20 can be improved.

また、動画像符号化装置１０及び動画像復号装置２０では、パディングビットを利用して、異なるビットデプスの量子化値のギャップを埋めることで、ブロックサイズとＳｌｅｐｉａｎ−Ｗｏｌｆ符号の処理単位（何ビットごとにＳｌｅｐｉａｎ−Ｗｏｌｆ符号化又は復号するか）の間に依存関係がなく、コンテンツに応じて、ブロックサイズを容易に変更できる。例えば、複雑度が少ない画素間隔で変化するコンテンツの場合、たとえ、ビットデプス伝達信号の符号量が増えたとしても、動画像配信システム１において、ブロックサイズを小さくし手処理した方が効率的となる場合がある。 In addition, the moving image encoding device 10 and the moving image decoding device 20 use padding bits to fill in gaps between quantized values of different bit depths, so that the block size and the processing unit of the Slepian-Wolf code (how many bits There is no dependency between each of them (Slipian-Wolf encoding or decoding), and the block size can be easily changed according to the content. For example, in the case of content that changes at low pixel intervals with low complexity, even if the code amount of the bit depth transmission signal increases, it is more efficient to manually process by reducing the block size in the moving image distribution system 1. There is a case.

（Ｂ）第２の実施形態
以下、本発明による動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムの第２の実施形態を、図面を参照しながら詳述する。 (B) Second Embodiment Hereinafter, a second embodiment of a moving image encoding apparatus and program, a moving image decoding apparatus and program, and a moving image distribution system according to the present invention will be described in detail with reference to the drawings. .

（Ｂ−１）第２の実施形態の構成
第２の実施形態の動画像配信システム１Ａの全体構成についても上述の図１を用いて示すことができる。以下、第２の実施形態について第１の実施形態との差異について説明する。 (B-1) Configuration of Second Embodiment The overall configuration of the moving image distribution system 1A of the second embodiment can also be shown using FIG. Hereinafter, the difference between the second embodiment and the first embodiment will be described.

第２の実施形態の動画像配信システム１Ａでは、動画像符号化装置１０が、動画像符号化装置１０Ａに置き換わっている。 In the moving image distribution system 1A of the second embodiment, the moving image encoding device 10 is replaced with the moving image encoding device 10A.

図８は、第２の実施形態の動画像符号化装置１０Ａの機能的構成について示したブロック図である。 FIG. 8 is a block diagram illustrating a functional configuration of the moving image encoding device 10A according to the second embodiment.

そして、第２の実施形態の動画像符号化装置１０Ａでは、ビットデプス制御部１０１がビットデプス制御部１０１Ａに置き換わっている。また、ビットデプス制御部１０１Ａでは、制御部１０３が制御部１０３Ａに置き換わり、さらに動き検出部１０９が追加されている。 In the moving image encoding device 10A of the second embodiment, the bit depth control unit 101 is replaced with the bit depth control unit 101A. In the bit depth control unit 101A, the control unit 103 is replaced with the control unit 103A, and a motion detection unit 109 is further added.

動き検出部１０９は、ブロックごと（複雑度計算部１０２と同じ方式で分割したブロックごと）に、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１とＫｅｙ画像Ｆ１２の間で動きの有無を判定し、その判定結果に基づいて動き情報Ｓ１３を生成する。動き情報Ｓ１３は、ブロックごとの動きの有無の情報で構成されている。 The motion detection unit 109 determines the presence or absence of motion between the Wyner-Ziv image F11 and the Key image F12 for each block (for each block divided in the same manner as the complexity calculation unit 102), and based on the determination result The motion information S13 is generated. The motion information S13 includes information on the presence / absence of motion for each block.

動き情報Ｓ１３は、たとえば、動きベクトルをゼロとしたときの、ＳＡＤ（差分絶対値和、ＳｕｍｏｆＡｂｓｏｌｕｔｅＤｉｆｆｅｒｅｎｃｅ)やＭＡＤ（差分絶対値平均、ＭｅａｎＡｂｓｏｌｕｔｅＤｉｆｆｅｒｅｎｃｅ)などのコスト関数の大きさが、予め定められた閾値以上の場合に、動きが有りと判定し、それ以外の場合は動き無しと判定するようにしてもよい。また、動画像符号化装置１０が、非特許文献４の記載技術のように、スケーラブル符号化機構を備える場合、スケーラブル符号化機構が生成した、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１と該Ｋｅｙ画像Ｆ１２にそれぞれ対応する縮小画像間を比較することによって、ブロックごとの動きの有無を計算しても良い。 The motion information S13 has a size of a cost function such as SAD (sum of absolute difference, sum of absolute difference) or MAD (mean absolute difference) when the motion vector is set to zero. It may be determined that there is a motion when the threshold is equal to or greater than a predetermined threshold, and it is determined that there is no motion in other cases. Further, when the moving image encoding apparatus 10 includes a scalable encoding mechanism as described in Non-Patent Document 4, it corresponds to the Wyner-Ziv image F11 and the Key image F12 generated by the scalable encoding mechanism, respectively. The presence / absence of motion for each block may be calculated by comparing the reduced images.

制御部１０３Ａは、複雑度Ｓ１１に加えて、動き情報Ｓ１３も考慮してブロックごとのビットデプスを決定する点で、第１の実施形態異なっている。 The control unit 103A is different from the first embodiment in that the bit depth for each block is determined in consideration of the motion information S13 in addition to the complexity S11.

（Ｂ−２）第２の実施形態の動作
次に、以上のような構成を有する第２の実施形態の動画像配信システム１Ａの動作を説明する。 (B-2) Operation | movement of 2nd Embodiment Next, operation | movement of 1 A of moving image delivery systems of 2nd Embodiment which has the above structures is demonstrated.

以下では、第２の実施形態の動画像配信システム１Ａの動作のうち、第１の実施形態との差異について図９を用いて説明する。 Hereinafter, differences from the first embodiment among the operations of the moving image distribution system 1A according to the second embodiment will be described with reference to FIG.

第２の実施形態の動画像配信システム１Ａでは、上述のステップＳ１０２がＳ２０２に置き換わっている点で異なっている。 The moving image distribution system 1A according to the second embodiment is different in that the above-described step S102 is replaced with S202.

図９（ａ）は、上述のステップＳ２０２における動画像符号化装置１０Ａの動作について示したフローチャートである。 FIG. 9A is a flowchart showing the operation of the moving image encoding apparatus 10A in step S202 described above.

ステップＳ２０２−１では、動き検出部１０９において、Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ１１とＫｅｙ画像Ｆ１２からブロックごとの動きの有無が判定され、その判定結果に基づいて動き情報Ｓ１３が生成される。 In step S202-1, the motion detection unit 109 determines the presence / absence of motion for each block from the Wyner-Ziv image F11 and the Key image F12, and generates motion information S13 based on the determination result.

ステップＳ２０２−２では、制御部２０３において、複雑度Ｓ１１と動き情報Ｓ１３とに基づいて、ブロックごとのビットデプスが決定され、その決定に基づいてビットデプス制御信号Ｓ１２が生成される。 In step S202-2, the control unit 203 determines a bit depth for each block based on the complexity S11 and the motion information S13, and generates a bit depth control signal S12 based on the determination.

たとえば、動き情報Ｓ１３が、動きが少ないことを示唆している場合に、複雑度Ｓ１１の高さに寄らず、予測画像の予測困難性が低いとみなし、低いビットデプスで符号化するようにビットデプス制御信号２０３を生成し、動き情報Ｓ１３が、動きの大きいことを示唆している場合に、複雑度Ｓ１１の高さによってビットデプス制御信号２０３を生成する。 For example, when the motion information S13 suggests that there is little motion, it is considered that the prediction image is difficult to predict regardless of the complexity S11, and the bit is encoded so as to be encoded with a low bit depth. When the depth control signal 203 is generated and the motion information S13 indicates that the motion is large, the bit depth control signal 203 is generated according to the height of the complexity S11.

図９（ｂ）は、制御部１０３Ａで各ブロックのビットデプスを決定するルールについて示した説明図である。 FIG. 9B is an explanatory diagram showing rules for determining the bit depth of each block by the control unit 103A.

この実施形態では、制御部１０３Ａは、動き情報Ｓ１３において動き無し（動きが少ないこと）が示されているブロックについては、図９（ｂ）に示すように、複雑度Ｓ１１における複雑度の高さによらず、予測画像の予測困難性が低いとみなし、少ないビットデプス（３）を決定するものとする。 In this embodiment, the control unit 103A has a high degree of complexity in the complexity S11, as shown in FIG. 9B, for a block in which no motion (less motion) is indicated in the motion information S13. Regardless of this, it is assumed that the prediction difficulty of the predicted image is low, and a small bit depth (3) is determined.

また、制御部１０３Ａは、動き情報Ｓ１３において動き有り（動きが大きいこと）が示されているブロックについては、図９（ｂ）に示すように、複雑度Ｓ１１の高さに応じたビットデプスを決定（第１の実施形態と同じ方式でビットデプスを決定）するものとする。 Further, the control unit 103A applies a bit depth corresponding to the height of the complexity S11, as shown in FIG. 9B, for the block in which motion is present (the motion is large) in the motion information S13. Assume that the bit depth is determined (by the same method as in the first embodiment).

（Ｂ−３）第２の実施形態の効果
第２の実施形態によれば、第１の実施形態と比較して以下のような効果を奏することができる。 (B-3) Effects of Second Embodiment According to the second embodiment, the following effects can be achieved as compared with the first embodiment.

動画像符号化装置１０Ａでは、動きの少ない領域（動き無しの領域）については、例え、複雑度の高い領域であったとしても、予測画像（エンコーダ予測画像Ｆ２１、デコーダ予測画像Ｆ３１）の品質が高いという仮定に基づき、動きの少ない領域に関しては、少ないビットデプスで符号化している。上述の仮説は、多くの動画像で適用できるものであるため、動画像符号化装置１０Ａでのビットデプスの制御が更に妥当（適切）となり、結果として符号化効率及び復号品質を更に向上させることができる。 In the moving image coding apparatus 10A, the quality of the predicted image (encoder predicted image F21, decoder predicted image F31) is low even if the region with little motion (the region without motion) is a highly complex region. Based on the assumption that it is high, an area with little motion is encoded with a small bit depth. Since the above hypothesis can be applied to many moving pictures, the control of the bit depth in the moving picture coding apparatus 10A becomes more appropriate (appropriate), and as a result, the coding efficiency and the decoding quality are further improved. Can do.

（Ｃ）第３の実施形態
以下、本発明による動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムの第３の実施形態を、図面を参照しながら詳述する。 (C) Third Embodiment Hereinafter, a third embodiment of a moving image encoding device and program, a moving image decoding device and program, and a moving image distribution system according to the present invention will be described in detail with reference to the drawings. .

（Ｃ−１）第３の実施形態の構成
第３の実施形態の動画像配信システムＢの全体構成についても上述の図１を用いて示すことができる。以下、第３の実施形態について第２の実施形態との差異について説明する。 (C-1) Configuration of Third Embodiment The overall configuration of the moving image distribution system B of the third embodiment can also be shown using FIG. Hereinafter, the difference between the third embodiment and the second embodiment will be described.

第３の実施形態の動画像配信システムＢでは、動画像符号化装置１０Ａ及び動画像復号装置２０が、動画像符号化装置１０Ｂ及び動画像復号装置２０Ｂに置き換わっている点で第２の実施形態と異なっている。 In the moving image distribution system B of the third embodiment, the second embodiment is that the moving image encoding device 10A and the moving image decoding device 20 are replaced with the moving image encoding device 10B and the moving image decoding device 20B. Is different.

図１０は、第３の実施形態の動画像符号化装置１０Ｂの機能的構成について示したブロック図である。 FIG. 10 is a block diagram illustrating a functional configuration of a moving image encoding device 10B according to the third embodiment.

そして、第３の実施形態の動画像符号化装置１０Ｂでは、レート制御部１０７がレート制御部１０７Ｂに置き換わっている。 In the video encoding device 10B of the third embodiment, the rate control unit 107 is replaced with the rate control unit 107B.

レート制御部１０７Ｂは、ビットデプス制御信号Ｓ１２と量子化後エンコーダ予測画像Ｆ２２と量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３とに基づいて、パディングビットの位置を特定し、パディングビット以外のビットのみを対象に、ビット反転確率を求め、求めたビット反転確率からレートＲを生成する処理を行う。 The rate control unit 107B specifies the position of padding bits based on the bit depth control signal S12, the quantized encoder predicted image F22, and the quantized Wyner-Ziv image F23, and targets only bits other than the padding bits. Then, a bit inversion probability is obtained, and a process for generating a rate R from the obtained bit inversion probability is performed.

図１１は、第３の実施形態の動画像復号装置２０Ｂの機能的構成について示したブロック図である。 FIG. 11 is a block diagram illustrating a functional configuration of a video decoding device 20B according to the third embodiment.

第３の実施形態の動画像復号装置２０Ｂでは、ＬＬＲ補正部２０９が追加されている点で第１の実施形態と異なっている。 The moving picture decoding apparatus 20B according to the third embodiment is different from the first embodiment in that an LLR correction unit 209 is added.

ＬＬＲ補正部２０９は、ビットデプス制御信号Ｓ１１より、パディングビットの位置を明らかにし、パディングビットに対応するＬＬＲの大きさが、その他のビットのＬＬＲに比べて十分大きくなるように補正する処理（例えば、１以上の係数を乗じて補正する処理や、定数を加算する処理）を行って、補正後ＬＬＲを求める。そして、ＬＬＲ補正部２０９は、各ビットの補正後ＬＬＲを表す補正後ＬＬＲ通知信号Ｓ２３を生成して、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４に供給する。 The LLR correction unit 209 clarifies the position of the padding bit from the bit depth control signal S11 and corrects the size of the LLR corresponding to the padding bit so that it is sufficiently larger than the LLRs of the other bits (for example, The corrected LLR is obtained by performing a process of correcting by multiplying one or more coefficients or a process of adding a constant. Then, the LLR correction unit 209 generates a post-correction LLR notification signal S23 representing the post-correction LLR of each bit, and supplies the generated LLR notification signal S23 to the Slepian-Wolf decoding unit 204.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４は、補正後ＬＬＲ通知信号Ｓ２３とＳｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を用いて、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号をおこない、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３を生成する。 The Slepian-Wol decoding unit 204 performs Slepian-Wolf decoding using the corrected LLR notification signal S23 and the Slepian-Wolf stream ST1, and generates a corrected Wyner-Ziv image F33.

（Ｃ−２）第３の実施形態の動作
次に、以上のような構成を有する第３の実施形態の動画像配信システムＢの動作を説明する。 (C-2) Operation of the Third Embodiment Next, the operation of the moving image distribution system B of the third embodiment having the above configuration will be described.

以下では、第３の実施形態の動画像配信システムＢの動作のうち、第１の実施形態との差異について図１２、図１３を用いて説明する。 Hereinafter, of the operation of the moving image distribution system B according to the third embodiment, differences from the first embodiment will be described with reference to FIGS. 12 and 13.

第３の実施形態の動画像配信システムＢでは、上述のステップＳ１０６、Ｓ１１２、Ｓ１１３が、それぞれステップＳ３０６、Ｓ３１２、Ｓ３１３に置き換わっている点で異なっている。 The moving image distribution system B according to the third embodiment is different in that the above-described steps S106, S112, and S113 are replaced with steps S306, S312 and S313, respectively.

まず、動画像符号化装置１０Ｂ側の処理（ステップＳ３０６）について説明する。 First, the process (step S306) on the moving image encoding device 10B side will be described.

図１２に示すようにステップＳ３０６は、３つのステップＳ３０６−１〜Ｓ３０６−３に分かれている。 As shown in FIG. 12, step S306 is divided into three steps S306-1 to S306-3.

ステップＳ３０６−１では、レート制御部１０７Ｂにより、ビットデプス制御信号Ｓ１２に基づき、パディングビット以外の位置が特定される。 In step S306-1, the rate control unit 107B specifies a position other than the padding bits based on the bit depth control signal S12.

ステップＳ３０６−２では、レート制御部１０７Ｂにより、パディングビット以外のビットのみを対象にビット反転確率が求められる。 In step S306-2, the rate control unit 107B obtains a bit inversion probability for only bits other than the padding bits.

ステップＳ３０６−３では、レート制御部１０７Ｂにより、求めたビット反転確率からレートＲが生成される。 In step S306-3, the rate R is generated from the obtained bit inversion probability by the rate control unit 107B.

第１の実施形態のＳｌｅｐｉａｎ−Ｗｏｌｆ符号化処理においては、ビット反転確率が定常であることを前提に符号化するので、パディングビットの位置でもビット反転が発生する可能性を配慮したＳｌｅｐｉａｍ−Ｗｏｌｆ符号を生成する。しかし、パディングビットの位置は、符号化側と復号側とで情報（ビットデプス制御信号Ｓ１２）が共有されている領域であり、新たな情報を必要としない領域である。そこで、第３の実施形態の動画像復号装置２０Ｂでは、パディングビットの位置を特定し、パディングビット以外のビットのみで、ビット反転確率およびレートＲを計算することで、符号量の増加を防止する。 In the Slepian-Wolf encoding process of the first embodiment, encoding is performed on the assumption that the bit inversion probability is steady, so that the Sleiam-Wolf code in consideration of the possibility of bit inversion occurring at the position of the padding bit. Is generated. However, the position of the padding bit is an area where information (bit depth control signal S12) is shared between the encoding side and the decoding side, and no new information is required. Therefore, in the video decoding device 20B of the third embodiment, the position of the padding bit is specified, and the bit inversion probability and the rate R are calculated using only the bits other than the padding bit, thereby preventing an increase in the code amount. .

次に、動画像復号装置２０Ｂ側の処理（ステップＳ３１２、Ｓ３１３）について説明する。 Next, the process (steps S312 and S313) on the video decoding device 20B side will be described.

図１３に示すようにステップＳ３１２は、２つのステップＳ３１２−１、Ｓ３１２−２に分かれている。 As shown in FIG. 13, step S312 is divided into two steps S312-1 and S312-2.

ステップＳ３１２−１では、ＬＬＲ推定部２０３により、量子化後デコーダ予測画像Ｆ３２から、ＬＬＲが推定され、ＬＬＲ通知信号Ｓ２２として出力される。 In step S312-1, the LLR estimation unit 203 estimates the LLR from the quantized decoder predicted image F32 and outputs the LLR notification signal S22.

ステップＳ３１２−２では、ＬＬＲ補正部２０９により、ビットデプス制御信号Ｓ１１に基づき、ＬＬＲ通知信号Ｓ２２が補正され、補正後ＬＬＲ通知信号Ｓ２３として生成される。 In step S312-2, the LLR correction unit 209 corrects the LLR notification signal S22 based on the bit depth control signal S11, and generates the corrected LLR notification signal S23.

具体的には、ＬＬＲ補正部２０９は、ビットデプス制御信号Ｓ１１より、パディングビットの位置を認識し、パディングビット以外のビットに対応するＬＬＲの大きさが、パディングビットのＬＬＲに比べて十分大きくなるように補正する。 Specifically, the LLR correction unit 209 recognizes the position of the padding bit from the bit depth control signal S11, and the size of the LLR corresponding to the bit other than the padding bit is sufficiently larger than the LLR of the padding bit. Correct as follows.

そして、ステップＳ３１３では、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４により、補正後ＬＬＲ通知信号Ｓ２３とＳｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を用いて、復号処理が行われ、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３が生成される。 In step S313, the Slepian-Wol decoding unit 204 performs decoding using the corrected LLR notification signal S23 and the Slepian-Wolf stream ST1, and generates a corrected Wyner-Ziv image F33.

第２の実施形態のＳｌｅｐｉａｎ−Ｗｏｌｆ復号処理では、ビット反転確率が定常であることを前提に処理されるため、パディングビットの位置でもビット反転が発生する可能性を配慮したＳｌｅｐｉａｍ−Ｗｏｌｆ復号処理を行う。しかし、パディングビットの位置は、符号化側と復号側とで情報（ビットデプス制御信号Ｓ１２）が共有されている領域であり、新たな情報を必要としない領域である。そこで、第３の実施形態の動画像復号装置２０Ｂでは、パディングビットの位置を特定し、ＬＬＲを補正することで、復号成功確率（復号品質）を向上させることができる。特に、第３の実施形態の動画像符号化装置１０Ｂのように、パディングビットの位置を特定し、パディングビット以外のビットのみで、ビット反転確率およびレートＲを計算し、符号量を削減している場合、ＬＬＲの補正による復号品質向上効果は大きくなる。 In the Slepian-Wolf decoding process of the second embodiment, since the process is performed on the assumption that the bit inversion probability is steady, the Sleiam-Wolf decoding process considering the possibility of bit inversion occurring even at the padding bit position is performed. Do. However, the position of the padding bit is an area where information (bit depth control signal S12) is shared between the encoding side and the decoding side, and no new information is required. Therefore, in the video decoding device 20B of the third embodiment, it is possible to improve the decoding success probability (decoding quality) by specifying the position of the padding bit and correcting the LLR. In particular, as in the moving image encoding apparatus 10B of the third embodiment, the position of padding bits is specified, the bit inversion probability and the rate R are calculated only with bits other than the padding bits, and the code amount is reduced. If so, the effect of improving the decoding quality by correcting the LLR is increased.

（Ｃ−３）第３の実施形態の効果
第３の実施形態によれば、第２の実施形態と比較して以下のような効果を奏することができる。 (C-3) Effects of the Third Embodiment According to the third embodiment, the following effects can be achieved as compared with the second embodiment.

動画像符号化装置１０Ｂにおいて、動画像復号装置２０Ｂとの間で、情報共有がされていないパディングビット以外のビットに限定してレートを推定することで、符号量を削減し、効率的な符号化を行うことができる。 In the video encoding device 10B, the rate is estimated only for bits other than the padding bits for which information is not shared with the video decoding device 20B, thereby reducing the amount of codes and efficient coding. Can be made.

また、動画像復号装置２０Ｂにおいても、動画像符号化装置１０Ｂと情報共有がなされている、パディングビットのＬＬＲを、パディングビット以外のビットよりも大きくすることで、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号を助けることができ、復号品質を向上させることができる。 Also, in the video decoding device 20B, the information sharing with the video encoding device 10B is facilitated by increasing the LLR of the padding bits larger than the bits other than the padding bits. And decoding quality can be improved.

（Ｄ）第４の実施形態
以下、本発明による動画像符号化装置及びプログラム、動画像復号装置及びプログラム、並びに、動画像配信システムの第４の実施形態を、図面を参照しながら詳述する。 (D) Fourth Embodiment Hereinafter, a fourth embodiment of a moving image encoding device and program, a moving image decoding device and program, and a moving image distribution system according to the present invention will be described in detail with reference to the drawings. .

（Ｄ−１）第４の実施形態の構成
第４の実施形態の動画像配信システム１Ｃの全体構成についても上述の図１を用いて示すことができる。以下、第４の実施形態について第２の実施形態との差異について説明する。 (D-1) Configuration of Fourth Embodiment The overall configuration of a moving image distribution system 1C of the fourth embodiment can also be shown using FIG. Hereinafter, the difference between the fourth embodiment and the second embodiment will be described.

第４の実施形態の動画像配信システム１Ｃでは、動画像符号化装置１０Ａ及び動画像復号装置２０が、動画像符号化装置１０Ｃ及びに置き換わっていうる点で第２の実施形態と異なっている。 The moving image distribution system 1C of the fourth embodiment is different from the second embodiment in that the moving image encoding device 10A and the moving image decoding device 20 can be replaced with the moving image encoding device 10C.

図１４は、第４の実施形態の動画像符号化装置１０Ｃの機能的構成について示したブロック図である。 FIG. 14 is a block diagram illustrating a functional configuration of a moving image encoding device 10C according to the fourth embodiment.

第４の実施形態の動画像符号化装置１０Ｃでは、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８がＳｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８Ｃに置き換わり、さらに、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０が追加されている点で第２の実施形態と異なっている。 In the moving picture encoding apparatus 10C of the fourth embodiment, the second is that the Slepian-Wolf encoding unit 108 is replaced by the Lepian-Wolf encoding unit 108C, and a Slepian-Wolf code selection unit 110 is added. This is different from the embodiment.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０は、ビットデプス制御信号Ｓ１２に基づき、パディングビット以外のビットを特定して、特定したビットを対象としてＳｌｅｐｉａｎ−Ｗｏｌｆ符号化処理を行うための情報（以下、「専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３」と呼ぶ）を生成する。 The Slepian-Wolf code selection unit 110 identifies bits other than padding bits based on the bit depth control signal S12, and information for performing the Slepian-Wolf encoding process on the identified bits (hereinafter referred to as “dedicated Slepian”). -Wolf encoded information S13 ").

この実施形態では、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３は、上述の特定されたビットのビット数（符号化対象となるビット数）で符号化するのに最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号（Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化に用いる符号の行列）を含むものとする。 In this embodiment, the dedicated Slepian-Wolf encoding information S13 is an optimal Slepian-Wolf code (Slepian-Wolf code) that is optimal for encoding with the number of bits of the above specified bits (the number of bits to be encoded). Code matrix used for conversion).

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０は、例えば、符号化対象となるビット数（特定ビット数）ごとに最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号を予め保持しておき、符号化対象となるビット数（特定ビット数）に応じたＳｌｅｐｉａｎ−Ｗｏｌｆ符号を選択して、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３の生成に用いるようにしてもよい。 For example, the Slepian-Wolf code selection unit 110 stores in advance an optimum Lepian-Wolf code for each number of bits to be encoded (specific number of bits), and the number of bits to be encoded (specific number of bits). It is also possible to select a Slepian-Wolf code according to the above and use it for generating the dedicated Slepian-Wolf encoded information S13.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０で、任意のビット数に応じたＳｌｅｐｉａｎ−Ｗｏｌｆ符号を保持する具体的な処理については限定されないものである。例えば、参考文献１（Xiao-Yu Hu,Regular and irregular progressive edge-growth tanner graphs, Information Theory, IEEE Transactions on, Volume 51, Issue 1, Pages 386 - 398.）の記載技術を用いて生成したＬＤＰＣ符号（Ｌｏｗ−ＤｅｎｓｉｔｙＰａｒｉｔｙ−ｃｈｅｃｋＣｏｄｅ）に対して、参考文献２（D. Varodayan, A. Aaron and B. Girod,Rate-adaptive codes for distributed source coding, EURASIP Signal Processing Journal, Special Section on Distributed Source Coding, vol. 86, no. 11, pp. 3123-3130, November 2006.）の記載技術に基づく処理を加えて、レートを容易に変更可能なＬＤＰＣ符号が得られる。そこで、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０は、例えば、上述の参考文献１、２の記載技術を用いて、任意のビット数に応じて調整された上述のＬＤＰＣ符号を、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号として保持するようにしてもよい。 The specific process of holding the Slepian-Wolf code corresponding to an arbitrary number of bits in the Slepian-Wolf code selection unit 110 is not limited. For example, an LDPC code generated using the technique described in Reference 1 (Xiao-Yu Hu, Regular and irregular progressive edge-growth tanner graphs, Information Theory, IEEE Transactions on, Volume 51, Issue 1, Pages 386-398.) (Low-Density Parity-check Code), Reference 2 (D. Varodayan, A. Aaron and B. Girod, Rate-adaptive codes for distributed source coding, EURASIP Signal Processing Journal, Special Section on Distributed Source Coding, vol. 86, no. 11, pp. 3123-3130, November 2006.), an LDPC code whose rate can be easily changed is obtained. Therefore, the Slepian-Wolf code selection unit 110 holds, for example, the above-described LDPC code adjusted according to an arbitrary number of bits as the Slepian-Wolf code using the techniques described in the above-mentioned References 1 and 2. You may do it.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８Ｃは、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３を用いてＳｌｅｐｉａｎ−Ｗｏｌｆ符号化し、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を生成する。 The Slepian-Wolf encoding unit 108C performs Slepian-Wolf encoding using the dedicated Slepian-Wolf encoding information S13, and generates a Slepian-Wolf stream ST1.

図１５は、第４の実施形態の動画像復号装置２０Ｃの機能的構成について示したブロック図である。 FIG. 15 is a block diagram illustrating a functional configuration of a video decoding device 20C according to the fourth embodiment.

第４の実施形態の動画像復号装置２０Ｃでは、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４がＳｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４Ｃに置き換わり、さらに、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８が追加されている点で第２の実施形態と異なっている。 In the moving picture decoding apparatus 20C according to the fourth embodiment, the second embodiment is that a Slepian-Wol decoding unit 204 is replaced with a Lepian-Wol decoding unit 204C, and a Slepian-Wolf code selection unit 208 is further added. Is different.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８は、ビットデプス制御信号Ｓ１１に基づき、パディングビット以外のビットを特定して、特定したビットを対象としてＳｌｅｐｉａｎ−Ｗｏｌｆ復号処理を行うための情報（以下、「専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４」と呼ぶ）を生成する。 The Slepian-Wolf code selection unit 208 specifies bits other than the padding bits based on the bit depth control signal S11, and performs information on the specified bit for the Slepian-Wolf decoding process (hereinafter referred to as “dedicated Slepian- Wolf decoding information S24 ").

専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４は、上述の特定されたビットのビット数で復号するのに最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号（Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号に用いる符号の行列）を含むものとする。 The dedicated Slepian-Wolf decoding information S24 includes a Slepian-Wolf code (a matrix of codes used for Slepian-Wolf decoding) that is optimal for decoding with the number of bits specified above.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８は、例えば、復号対象となるビット数（特定ビット数）ごとに最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号を予め保持しておき、復号対象となるビット数（特定ビット数）に応じたＳｌｅｐｉａｎ−Ｗｏｌｆ符号を選択して、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４の生成に用いるようにしてもよい。Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８は、例えば、上述のＳｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０と同様に、上述の参考文献１、２の記載技術を用いて生成された専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３を保持するようにしてもよい。 For example, the Slepian-Wolf code selection unit 208 holds in advance an optimal Lepian-Wolf code for each bit number (specific bit number) to be decoded, and according to the bit number (specific bit number) to be decoded. Alternatively, the selected Lepian-Wolf code may be selected and used to generate the dedicated Lepian-Wolf decoding information S24. The Slepian-Wolf code selection unit 208 holds, for example, the dedicated Slepian-Wolf encoded information S13 generated by using the techniques described in the above-mentioned References 1 and 2 in the same manner as the above-mentioned Lepian-Wolf code selection unit 110. You may make it do.

Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８Ｃは、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３を用いてＳｌｅｐｉａｎ−Ｗｏｌｆ復号し、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３を生成する点で第２の実施形態と異なっている。 The Slepian-Wolf encoding unit 108C is different from the second embodiment in that it performs Slepian-Wolf decoding using the dedicated Slepian-Wolf encoding information S13 and generates a corrected Wyner-Ziv image F33.

（Ｃ−２）第４の実施形態の動作
次に、以上のような構成を有する第４の実施形態の動画像配信システム１Ｃの動作を説明する。 (C-2) Operation of the Fourth Embodiment Next, the operation of the moving image distribution system 1C of the fourth embodiment having the above configuration will be described.

以下では、第４の実施形態の動画像配信システム１Ｃの動作のうち、第２の実施形態との差異について図１６、図１７を用いて説明する。 Hereinafter, of the operation of the moving image distribution system 1C according to the fourth embodiment, differences from the second embodiment will be described with reference to FIGS.

第４の実施形態の動画像配信システム１Ａでは、上述のステップＳ１０７、Ｓ１１３が、それぞれステップＳ４０７、Ｓ４１３に置き換わっている点で第２の実施形態と異なっている。 The moving image distribution system 1A according to the fourth embodiment is different from the second embodiment in that steps S107 and S113 described above are replaced with steps S407 and S413, respectively.

まず、動画像符号化装置１０Ｃ側の処理（ステップＳ４０７）について説明する。 First, the process (step S407) on the moving image encoding apparatus 10C side will be described.

図１６に示すようにステップＳ４０７は、２つのステップＳ４０７−１、Ｓ４０７−２に分かれている。 As shown in FIG. 16, step S407 is divided into two steps S407-1 and S407-2.

ステップＳ４０７−１では、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０により、ビットデプス制御信号Ｓ１２に基づき、パディングビット以外のビットが特定され、特定されたビットに基づいて専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３が生成される。 In step S407-1, the Slepian-Wolf code selection unit 110 specifies bits other than padding bits based on the bit depth control signal S12, and dedicated Slepian-Wolf encoded information S13 is generated based on the specified bits. The

ステップＳ４０７−２では、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８Ｃにより、レートＲに基づき、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３を用いて、量子化後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ２３が符号化され、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１が生成される。具体的には、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部１０８Ｃは、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化情報Ｓ１３の情報に基づき、符号化するビットを集め、集めたビットを上述の最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号で符号化する。 In step S407-2, the quantized Wyner-Ziv image F23 is encoded by the Slepian-Wolf encoding unit 108C based on the rate R, using the dedicated Slepian-Wolf encoding information S13, and the Slepian-Wolf stream ST1. Is generated. Specifically, the Slepian-Wolf encoding unit 108C collects bits to be encoded based on the information of the dedicated Slepian-Wolf encoding information S13, and encodes the collected bits with the above-described optimal Sleian-Wolf code. .

次に、動画像復号装置２０Ａ側の処理（ステップＳ４１３）について説明する。 Next, the process (step S413) on the moving image decoding apparatus 20A side will be described.

図１７に示すようにステップＳ４１３は、２つのステップＳ４１３−１、Ｓ４１３−２に分かれている。 As shown in FIG. 17, step S413 is divided into two steps S413-1 and S413-2.

ステップＳ４１３−１では、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８により、ビットデプス制御信号Ｓ１１に基づき、パディングビット以外のビットが特定され、特定されたビットに基づいて専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４が生成される。 In step S413-1, the Slepian-Wolf code selection unit 208 specifies bits other than padding bits based on the bit depth control signal S11, and generates dedicated Slepian-Wolf decoding information S24 based on the specified bits. .

ステップＳ４１３−２では、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４Ｃにより、ＬＬＲ通知信号Ｓ２２、及び専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４を用いて、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１が復号され、訂正後Ｗｙｎｅｒ−Ｚｉｖ画像Ｆ３３が生成される。 In step S413-2, the Slepian-Wol decoding unit 204C uses the LLR notification signal S22 and the dedicated Slepian-Wolf decoding information S24 to decode the Slepian-Wolf stream ST1, and generates a corrected Wyner-Ziv image F33. The

具体的には、Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部２０４Ｃは、専用Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号情報Ｓ２４の情報に基づき、復号するビットに対応するＬＬＲを集め、集めたＬＬＲを上述の最適なＳｌｅｐｉａｎ−Ｗｏｌｆ符号で、Ｓｌｅｐｉａｎ−ＷｏｌｆストリームＳＴ１を復号する。 Specifically, the Slepian-Wol decoding unit 204C collects LLRs corresponding to the bits to be decoded based on the information of the dedicated Slepian-Wolf decoding information S24, and the collected LLRs are converted into the above-mentioned optimal Sleian-Wolf codes using the above-described Slepian-Wolf codes. -Decodes the Wolf stream ST1.

（Ｄ−３）第４の実施形態の効果
第４の実施形態によれば、第２の実施形態の効果に加えて以下のような効果を奏することができる。 (D-3) Effects of Fourth Embodiment According to the fourth embodiment, the following effects can be obtained in addition to the effects of the second embodiment.

動画像符号化装置１０Ｃでは、動画像復号装置２０Ｃとの間でパディングビットの位置が共有されていることを利用し、パディングビット以外のビットを符号化するのに最適なＳｌｅｐｉａn−Ｗｏｌｆ符号を用いて符号化することで、符号量の増加を防止することができる。 The moving image encoding apparatus 10C uses the Slipian-Wolf code that is optimal for encoding bits other than the padding bits by using the fact that the position of padding bits is shared with the moving image decoding apparatus 20C. Thus, an increase in code amount can be prevented.

また、動画像復号装置２０Ｃでは、動画像符号化装置１０Ｃから供給されたビットデプス制御信号Ｓ１２に基づいて、パディングビットの位置を特定し、パディングビット以外のビットの誤り訂正に最適なＳｌｅｐｉａn−Ｗｏｌｆ符号を用いて復号することで、復号品質を向上させることができる。 Also, the moving picture decoding apparatus 20C specifies the position of padding bits based on the bit depth control signal S12 supplied from the moving picture coding apparatus 10C, and is optimal for correcting errors of bits other than the padding bits. Decoding quality can be improved by decoding using a code.

（Ｅ）他の実施形態
本発明は、上記の各実施形態に限定されるものではなく、以下に例示するような変形実施形態も挙げることができる。 (E) Other Embodiments The present invention is not limited to the above-described embodiments, and may include modified embodiments as exemplified below.

（Ｆ−１）上記の各実施形態では、説明を簡易とするために、Ｗｙｎｅｒ−Ｚｉｖ画像の符号化及び復号に用いる各画像（Ｗｙｎｅｒ−Ｚｉｖ画像、エンコーダ予測画像、デコーダ予測画像等）について、非特許文献１等のようにＤＣＴ変換等による変換処理を行わずに、各画素の画素値又は量子化値で表わしたパラメータ形式（ピクセルドメイン形式）のまま処理するものとして説明している。ただし、本発明では、各画像を表現するためのパラメータ形式は、ピクセルドメイン形式に限定されず、ＤＣＴ変換等により変換係数領域ごとの量子化値で表わしたパラメータ形式（トランスフォームドメイン形式）としても良い。トランスフォームドメイン形式の場合、上記の各実施形態の画素の画素値（量子化値）が、変換係数領域（複数のブロックに分割して処理する場合には、各ブロック内での変換係数領域）ごとの量子化値に置き換えられることになる。上記の各実施形態で、符号化及び復号に用いる各画像について、トランスフォームドメイン形式で処理する場合には、量子化処理の前段に、トランスフォームドメイン形式のパラメータ列に変換する変換処理部を追加すれば良い。 (F-1) In each of the above embodiments, for simplification of description, for each image (Wyner-Ziv image, encoder predicted image, decoder predicted image, etc.) used for encoding and decoding a Wyner-Ziv image, As described in Non-Patent Document 1, etc., it is assumed that processing is performed in the parameter format (pixel domain format) represented by the pixel value or quantized value of each pixel without performing conversion processing by DCT conversion or the like. However, in the present invention, the parameter format for representing each image is not limited to the pixel domain format, and may be a parameter format (transform domain format) represented by a quantized value for each transform coefficient region by DCT transformation or the like. good. In the case of the transform domain format, the pixel value (quantized value) of the pixel in each of the above embodiments is a transform coefficient region (in the case of processing divided into a plurality of blocks, the transform coefficient region in each block). It will be replaced with the quantized value of each. In each of the above embodiments, when each image used for encoding and decoding is processed in the transform domain format, a conversion processing unit that converts the parameter sequence in the transform domain format is added before the quantization process. Just do it.

上記の各実施形態では、各画素の画素値をパラメータとして並べたパラメータ列としてみることができる。そして、Ｓｌｅｐｉａｎ−Ｗｏｌｆ理論およびＷｙｎｅｒ−Ｚｉｖ理論に基づいたＤＶＣ方式の符号化処理及び復号処理では、符号化及び復号する対象となるパラメータ列の数や各パラメータの持つ意味は限定されない。したがって、各画像を表現するためのパラメータ列の形式（パラメータの数等）を変更したとしても、上記の各実施形態と同様の効果を奏することができる。 In each of the above embodiments, the pixel value of each pixel can be viewed as a parameter string arranged as a parameter. In the DVC method encoding process and decoding process based on the Slepian-Wolf theory and the Wyner-Ziv theory, the number of parameter sequences to be encoded and decoded and the meaning of each parameter are not limited. Therefore, even if the format of the parameter string for expressing each image (number of parameters, etc.) is changed, the same effects as those of the above embodiments can be obtained.

（Ｆ−２）上記の各実施形態において、動画像復号装置では、Ｋｅｙ画像からエンコーダ予測画像やデコーダ予測画像を生成しているが、他時刻のＷｙｎｅｒ−Ｚｉｖ復号画像から生成しても良い。また、動画像復号装置では、非特許文献３のようにスケーラブル構造を有する場合は、ベースレイヤの情報を用いて、デコーダ予測画像を生成しても良い。 (F-2) In each of the above embodiments, the moving image decoding apparatus generates an encoder predicted image and a decoder predicted image from a Key image. However, the moving image decoding device may generate a Wyner-Ziv decoded image at another time. Moreover, in a moving image decoding apparatus, when it has a scalable structure like the nonpatent literature 3, you may produce | generate a decoder prediction image using the information of a base layer.

（Ｆ−３）上記の各実施形態の動画像符号化装置では、レート制御部による推定結果により、レートＲを決定する構成となっているが、非特許文献１のように、動画像復号装置側からのフィードバックアプローチによってレート制御する構成としても良い。フィードバックアプローチによってレート制御する場合、エンコーダ予測画像量子化部１０６を省略するようにしてもよい。 (F-3) In the video encoding device of each of the embodiments described above, the rate R is determined based on the estimation result of the rate control unit. The rate control may be performed by a feedback approach from the side. When rate control is performed by a feedback approach, the encoder predicted image quantization unit 106 may be omitted.

（Ｆ−４）上記の各実施形態では、動画像符号化装置と動画像復号装置はネットワーク等の通信路で接続されているものとして説明しているが、動画像符号化装置と動画像復号装置は直接通信可能な構成としなくてもよい。例えば、動画像符号化装置が生成した動画像データ（Ｓｌｅｐｉａｎ−Ｗｏｌｆストリーム及びＫｅｙストリームのデータ）を、ＤＶＤやハードディスク等の媒体に記録し、オフラインで動画像復号装置に供給するようにしてもよい。 (F-4) In each of the above embodiments, the video encoding device and the video decoding device are described as being connected via a communication path such as a network. However, the video encoding device and the video decoding are described. The apparatus may not be configured to be able to communicate directly. For example, the moving image data (Slepian-Wolf stream and Key stream data) generated by the moving image encoding device may be recorded on a medium such as a DVD or a hard disk and supplied to the moving image decoding device offline. .

（Ｆ−５）第４の実施形態において、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号選択部１１０とＳｌｅｐｉａｎ−Ｗｏｌｆ符号選択部２０８の形態として、エンコーダ予測画像Ｆ２１が選択されている画素に属するビットが符号化処理および復号処理に関わらないＳｌｅｐｉａｎ−Ｗｏｌｆ符号を生成しても良い。該Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号は、符号化および復号するビットの位置情報を、Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号に内在したものとなる。この場合、動画像配信システム１Ｃでは、ステップＳ４０７−２で符号化するビットを集めたり、ステップＳ３１３−２で復号するビットに対応するＬＬＲを集めたりする必要はなくなる。 (F-5) In the fourth embodiment, as a form of the Slepian-Wolf code selection unit 110 and the Slepian-Wolf code selection unit 208, the bits belonging to the pixel for which the encoder predicted image F21 is selected are encoded and decoded. A Slepian-Wolf code that is not related to processing may be generated. In the Slepian-Wolf code, the position information of the bits to be encoded and decoded is inherent in the Slepian-Wolf code. In this case, the moving image distribution system 1C does not need to collect the bits to be encoded in step S407-2 or the LLRs corresponding to the bits to be decoded in step S313-2.

（Ｆ−６）上記の各実施形態において、ビットデプス制御部は、ブロック単位でビットデプスを決定しているが、画素ごと（パラメータごと）にビットデプスを決定し、ビットデプス制御信号Ｓ１１に反映するようにしてもよい。 (F-6) In each of the above embodiments, the bit depth control unit determines the bit depth for each block, but determines the bit depth for each pixel (for each parameter) and reflects it in the bit depth control signal S11. You may make it do.

１…動画像配信システム、１０…動画像符号化装置、１０１…ビットデプス制御部、１０２…複雑度計算部、１０３…制御部１０３、１０４…符号化対象画像量子化部、１０５…エンコーダ予測画像生成部、１０６…エンコーダ予測画像量子化部、１０７…レート制御部、１０８…Ｓｌｅｐｉａｎ−Ｗｏｌｆ符号化部、１０９…ビットデプス通知部、１２０…Ｋｅｙ画像符号化部、２０…動画像復号装置、２００…ビットデプス受取部、２０１…デコーダ予測画像生成部、２０２…デコーダ予測画像量子化部、２０３…ＬＬＲ推定部、２０４…Ｓｌｅｐｉａｎ−Ｗｏｌｆ復号部、２０５…再構成部、２２０…Ｋｅｙ画像復号部。 DESCRIPTION OF SYMBOLS 1 ... Moving image delivery system, 10 ... Moving image encoding apparatus, 101 ... Bit depth control part, 102 ... Complexity calculation part, 103 ... Control part 103, 104 ... Encoding object image quantization part, 105 ... Encoder prediction image Generation unit 106 ... Encoder prediction image quantization unit 107 ... Rate control unit 108 ... Slepian-Wolf encoding unit 109 ... Bit depth notification unit 120 ... Key image encoding unit 20 ... Video decoding device 200 DESCRIPTION OF SYMBOLS ... Bit depth receiving part, 201 ... Decoder prediction image generation part, 202 ... Decoder prediction image quantization part, 203 ... LLR estimation part, 204 ... Slepian-Wol decoding part, 205 ... Reconstruction part, 220 ... Key image decoding part.

Claims

In a video encoding device that encodes a video having a frame sequence,
Predicted image generation means for generating a predicted image of a non-key frame using a key frame of the frame sequence;
A complexity calculator for calculating the complexity of non-keyframes;
Based on the calculation result of the complexity calculation unit, bit depth determination means for determining a bit depth related to quantization of each parameter constituting the image;
Quantize each parameter of the non-key frame and each parameter of the predicted image with the bit depth determined by the bit depth determining means to generate a non-key frame after quantization and a predicted image after quantization. For a parameter for which a bit depth less than the maximum bit depth is determined, either the most significant bit side or the least significant bit side of the quantized data corresponding to the bit depth determined for the parameter, Quantization means for generating, as quantized data of the parameter, data obtained by adding padding bits of a predetermined pattern corresponding to the difference from the most bit depth,
An error correction code generation means for generating an error correction code for correcting an error of the predicted image after quantization for the non-key frame after quantization;
Parameters of each image processed by the bit depth determination means and the quantizing means, the pixel values of the pixels constituting the image, or, conversion coefficients obtained by converting the image by a predetermined conversion method A moving picture coding apparatus characterized by having a quantized value for each region.

The apparatus further comprises output means for outputting data including the error correction code generated by the error correction code generation means and the bit depth information related to the bit depth determined by the bit depth determination means as data relating to a non-key frame. The moving picture encoding apparatus according to claim 1.

The complexity calculation unit calculates the complexity for each block constituting the predicted image of the non-key frame,
The moving picture encoding apparatus according to claim 1 or 2, wherein the bit depth determining means determines a common bit depth for each parameter in the block for each block constituting the image.

The output rate of the error correction code output by the error correction code generation means is determined, and an output rate necessary for error correction processing only for bits other than padding bits in the non-key frame after quantization is determined. 4. The moving picture encoding apparatus according to claim 3, further comprising rate control means for determining.

The moving image according to any one of claims 1 to 4, wherein the error correction code generation means generates an error correction code only for bits other than padding bits in a non-key frame after quantization. Encoding device.

In a moving image decoding apparatus for decoding moving image data in which a moving image having a frame sequence is encoded in units of frames,
Key frame holding means for holding key frame data;
A predicted image generating means for generating a predicted image of a non-key frame in the frame sequence using the key frame held by the key frame holding means;
Bit depth holding means for holding bit depth information related to bit depth applied to quantization of each parameter constituting the image in the process of generating encoded data of non-key frames constituting the moving image data;
The bit depth held by the bit depth holding unit quantizes each parameter of the predicted image to generate a quantized predicted image, and a bit depth smaller than the maximum bit depth is determined. For a parameter, padding bits of a predetermined pattern corresponding to the difference from the most bit depth are placed on either the most significant bit side or the least significant bit side of the quantized data for the bit depth set for the parameter. Quantization means for generating the added data as quantized data of the parameter;
When the encoded data of the non-key frame that constitutes the moving image data is an error correction code for correcting the predicted image after quantization, the corrected image obtained by correcting the predicted image after quantization using the error correction code Error correction means for generating an image;
Reconstructing means for generating a reconstructed image obtained by reconstructing a non-keyframe image based on the corrected image and the predicted image;
Parameters of each image processed by the bit depth holding means and the quantizing means, the pixel values of the pixels constituting the image, or, conversion coefficients obtained by converting the image by a predetermined conversion method A moving picture decoding apparatus characterized by having a quantized value for each region.

The error correction means is
An LLR determining unit that determines an LLR for each bit constituting the quantized predicted image;
An error correction process is performed based on the LLR for each bit determined by the LLR determination unit,
The moving picture decoding apparatus according to claim 6, wherein the LLR determination unit sets an LLR having a larger value for padding bits than a bit excluding padding bits.

8. The moving picture decoding apparatus according to claim 6 or 7, wherein the error correction means performs error correction only for bits excluding padding bits in the predicted image after quantization.

A computer mounted on a video encoding device that encodes a video having a frame sequence,
Predicted image generation means for generating a predicted image of a non-key frame using a key frame of the frame sequence;
A complexity calculator for calculating the complexity of non-keyframes;
Based on the calculation result of the complexity calculation unit, bit depth determination means for determining a bit depth related to quantization of each parameter constituting the image;
Quantize each parameter of the non-key frame and each parameter of the predicted image with the bit depth determined by the bit depth determining means to generate a non-key frame after quantization and a predicted image after quantization. For a parameter for which a bit depth less than the maximum bit depth is determined, either the most significant bit side or the least significant bit side of the quantized data corresponding to the bit depth determined for the parameter, Quantization means for generating, as quantized data of the parameter, data obtained by adding padding bits of a predetermined pattern corresponding to the difference from the most bit depth,
Function as an error correction code generation means for generating an error correction code for correcting an error of a predicted image after quantization for a non-key frame after quantization;
Parameters of each image processed by the bit depth determination means and the quantizing means, the pixel values of the pixels constituting the image, or, conversion coefficients obtained by converting the image by a predetermined conversion method A moving picture coding program characterized by being a quantized value for each region.

A computer mounted on a moving image decoding apparatus for decoding moving image data in which a moving image having a frame sequence is encoded in units of frames;
Key frame holding means for holding key frame data;
A predicted image generating means for generating a predicted image of a non-key frame in the frame sequence using the key frame held by the key frame holding means;
Bit depth holding means for holding bit depth information related to bit depth applied to quantization of each parameter constituting the image in the process of generating encoded data of non-key frames constituting the moving image data;
The bit depth held by the bit depth holding unit quantizes each parameter of the predicted image to generate a quantized predicted image, and a bit depth smaller than the maximum bit depth is determined. For a parameter, padding bits of a predetermined pattern corresponding to the difference from the most bit depth are placed on either the most significant bit side or the least significant bit side of the quantized data for the bit depth set for the parameter. Quantization means for generating the added data as quantized data of the parameter;
When the encoded data of the non-key frame that constitutes the moving image data is an error correction code for correcting the predicted image after quantization, the corrected image obtained by correcting the predicted image after quantization using the error correction code Error correction means for generating an image;
Based on the corrected image and the predicted image, function as a reconstruction unit that generates a reconstructed image obtained by reconstructing a non-keyframe image;
Parameters of each image processed by the bit depth holding means and the quantizing means, the pixel values of the pixels constituting the image, or, conversion coefficients obtained by converting the image by a predetermined conversion method A moving picture decoding program characterized by being a quantized value for each area.

A moving image including a moving image encoding device that generates moving image data obtained by encoding a moving image having a frame sequence in units of frames, and a moving image decoding device that decodes the moving image data generated by the moving image encoding device. In the image distribution system,
Applying the video encoding device according to claim 1 as the video encoding device,
A moving image distribution system, wherein the moving image decoding device according to claim 6 is applied as the moving image decoding device.