JP2018117239A - Video data compression apparatus and video data compression method

Info

Publication number: JP2018117239A
Application number: JP2017006670A
Authority: JP
Inventors: Kazuki Kakuno (客野 一樹)
Original assignee: Axell Corp
Current assignee: Axell Corp
Priority date: 2017-01-18
Filing date: 2017-01-18
Publication date: 2018-07-26
Anticipated expiration: 2037-01-18
Also published as: JP6738091B2

Abstract

[Problem] To provide a moving image data compression apparatus and a moving image data compression method capable of improving the compression rate of moving images that include transparency information. [Solution] A moving image compression apparatus 10, which compresses moving image data composed of frame image data in which transparency information is set at least in part, includes a motion vector detection unit 17 that detects a motion vector between specific frame image data and other frame image data located before and/or after the specific frame image data, and a code generation unit 13 that compresses the moving image data using the detected motion vector. The motion vector detection unit 17 detects the motion vector between the specific frame image data and the other frame image data using the transparency information of the specific frame image data and/or the other frame image data. [Selected drawing] FIG. 1

Description

The present invention relates to a moving image data compression apparatus and a moving image data compression method for compressing frame image data of moving image data in which transparency information is set.

Inter-frame prediction is a conventionally known technique for compressing moving image data (see, for example, Patent Document 1). It reduces the amount of moving image data by encoding the difference image between an encoding target frame (for example, the latest of the frames arranged in chronological order) and a predicted image generated from a reference frame at a different time from the encoding target frame (for example, the frame immediately preceding the latest frame).

Patent Document 1: JP 2010-200357 A

In inter-frame prediction, one conceivable way to encode the difference between frames is to quantify the image information contained in the encoding target frame and in the reference frame, obtain through a predetermined calculation a motion vector representing the direction and magnitude of the motion of the image information, and encode that motion vector.

In recent years, however, image information may include, in addition to color information (for example, the R, G, and B values of the RGB color space, or the Y, U, and V values obtained by converting them into luminance (Y) and color difference (U, V) information), transparency information that specifies transparency (the degree to which another image shows through when multiple images are superimposed). When image information carries transparency information in addition to color information, its characteristics may differ from those of color-only image information, so applying the same calculations and processing as for color-only information is not necessarily appropriate. Specifically, when the encoding target frame and the reference frame contain transparent or highly transparent portions, the color information in those portions is often less important to the motion vector calculation than the color information of opaque image content.

Patent Document 1, however, does not consider the case where image information has transparency, or the case where the degree of transparency differs between pieces of image information. As a result, it obtains motion vectors for transparent images with the same calculations and processing as for opaque images. Even when the color information of moving image data with transparent portions is of low importance, the motion vector is obtained by calculations centered on that color information. This needlessly increases both the amount of computation and the amount of code, and hinders improvement of the compression rate of the moving image.

The present invention has been made in view of this problem, and its object is to provide a moving image data compression apparatus and a moving image data compression method capable of improving the compression rate of moving images that include transparency information.

To solve this problem, the invention according to claim 1 is a moving image data compression apparatus that compresses moving image data composed of frame image data in which transparency information is set at least in part, comprising: motion vector detection means for detecting a motion vector between specific frame image data and other frame image data located before and/or after the specific frame image data; and compression means for compressing the frame image data using the detected motion vector, wherein the motion vector detection means detects the motion vector between the specific frame image data and the other frame image data using the transparency information of the specific frame image data and/or the other frame image data.

In the invention according to claim 2, in addition to the configuration of claim 1, the motion vector detection means detects the motion vector by correcting the contribution of image quality to the determination of the motion vector depending on the level of transparency indicated by the transparency information of the specific frame image data and/or the other frame image data.

In the invention according to claim 3, in addition to the configuration of claim 2, when detecting the motion vector, the motion vector detection means makes the influence of the color information of the specific frame image data and/or the other frame image data on the determination of the motion vector smaller as the transparency specified by the transparency information of the specific frame image data and/or the other frame image data becomes higher.

In the invention according to claim 4, in addition to the configuration of any one of claims 1 to 3, the motion vector detection means detects the motion vector using a value obtained by multiplying the color information of the specific frame image data by the transparency information of the specific frame image data, and/or a value obtained by multiplying the color information of the other frame image data by the transparency information of the other frame image data.

In the invention according to claim 5, in addition to the configuration of claim 4, the motion vector detection means detects the motion vector using the absolute error or the squared error of the value obtained by multiplying the color information of the specific frame image data by the transparency information of the specific frame image data, and/or the value obtained by multiplying the color information of the other frame image data by the transparency information of the other frame image data.
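The multiplication and error measures named in claims 4 and 5 can be illustrated with a minimal Python sketch. It assumes pixels are (R, G, B, A) tuples with A in 0 to 255, where 0 is fully transparent and 255 is fully opaque, as in the embodiment described later. The function names `weighted` and `block_error`, and the normalization of A to the range 0 to 1, are assumptions made for illustration and are not taken from the specification.

```python
# Hypothetical helpers (names are illustrative, not from the specification).
# Pixels are (R, G, B, A) tuples; A = 0 is fully transparent, A = 255 is opaque.

def weighted(pixel):
    """Scale each color channel by the pixel's A value, normalized to 0..1."""
    r, g, b, a = pixel
    w = a / 255.0
    return (r * w, g * w, b * w)


def block_error(dst_block, src_block, squared=True):
    """Sum of squared (or absolute) differences of transparency-weighted colors.

    dst_block and src_block are equal-length sequences of (R, G, B, A) pixels.
    """
    total = 0.0
    for dst, src in zip(dst_block, src_block):
        for d, s in zip(weighted(dst), weighted(src)):
            diff = d - s
            total += diff * diff if squared else abs(diff)
    return total
```

With this weighting, two fully transparent pixels contribute no error regardless of their stored colors, which is one way to make color information count less as transparency rises.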

In the invention according to claim 6, in addition to the configuration of any one of claims 1 to 5, the motion vector detection means detects the motion vector using the transparency information of the specific frame image data and/or the transparency information of the other frame image data.

The invention according to claim 7 is a moving image data compression method for compressing moving image data composed of frame image data in which transparency information is set at least in part, comprising: a motion vector detection step in which a motion vector is detected between specific frame image data and other frame image data located before and/or after the specific frame image data; and a compression step in which the frame image data is compressed using the detected motion vector, wherein, in the motion vector detection step, the motion vector between the specific frame image data and the other frame image data is detected using the transparency information of the specific frame image data and/or the other frame image data.

According to the inventions of claims 1 and 7, the motion vector between the specific frame image data and the other frame image data is detected using the transparency information of the specific frame image data and/or the other frame image data. For frame images of moving image data in which transparency information is set, the motion vector can therefore be detected in a manner that depends on that transparency information, and the frame image data can be compressed accordingly. Because the motion vector is detected depending on the transparency information, the degree to which the color information of transparent portions, which is of low importance for detection, is reflected in the detection and encoding of the motion vector can be reduced. This improves the compression rate of moving images that include transparency information.

According to the invention of claim 2, the motion vector is detected by correcting the contribution of image quality to the determination of the motion vector depending on the level of transparency. The level of transparency is thus reflected in the detection and encoding of the motion vector, and the compression rate of moving images that include transparency information can be improved.

According to the invention of claim 3, the higher the transparency specified by the transparency information of the frame image data, the smaller the influence of the color information of the frame image data on the magnitude of the motion vector. Thus, as the transparency of the frame image data rises and the color information becomes less important for motion vector detection, the degree to which the color information is reflected in the detection and encoding of the motion vector can be lowered. In compressing moving images that include transparency information, this reduces low-importance information while improving the compression rate.

According to the invention of claim 4, the motion vector can be detected by correcting the value obtained from the color information of the frame image data on the basis of the transparency information. In compressing moving images that include transparency information, the degree of transparency can therefore be reflected in the concrete calculations used for motion vector detection and encoding. Reducing low-importance data according to the degree of transparency while improving the compression rate can thereby be realized in concrete calculation processing.

According to the invention of claim 5, the motion vector is detected from the absolute error or squared error of the value obtained by multiplying the color information by the transparency information, so that accurate calculation is performed on values that reflect the transparency information and an appropriate motion vector can be detected.

According to the invention of claim 6, the motion vector is detected using the transparency information, so that accurate calculation is performed on values that reflect the transparency information and an appropriate motion vector can be detected.

FIG. 1 is a functional block diagram illustrating the overall configuration of the moving image compression apparatus according to this embodiment.
FIG. 2 is a functional block diagram showing details of the prediction processing unit of the moving image compression apparatus.
FIG. 3 is a diagram schematically showing the principle of motion vector calculation in the moving image compression apparatus.
FIG. 4 is a diagram schematically showing the relationship between color information and transparency information in the calculation performed by the motion vector detection unit of the moving image compression apparatus.
FIG. 5 is a flowchart showing an example of the processing procedure in the moving image compression apparatus.

Embodiments of the present invention are described in detail below with reference to the drawings. Although this embodiment is described using an example applied to the MPEG encoding scheme, the present invention may be applied to any image encoding scheme other than MPEG.

[Moving image data used in this embodiment]
The moving image data used in this embodiment has transparency (α value) information. For example, the image information of each pixel of a frame (the "frame image data" constituting the moving image data) carries, in addition to the "R", "G", and "B" information of the RGB color space, "A" information that specifies the transparency (α value) as a value from 0 to 255, where 0 means maximum transparency (100% transparent) and 255 means minimum transparency (0% transparent).
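The mapping between the stored "A" value and the transparency percentage can be sketched as follows. This is a minimal illustration only; the function name `transparency_percent` and the linear mapping between the two endpoints given in the text are assumptions.

```python
def transparency_percent(a):
    """Map an 8-bit A value to a transparency percentage.

    Follows the convention in the text: A = 0 is 100% transparent and
    A = 255 is 0% transparent. The linear interpolation in between is assumed.
    """
    if not 0 <= a <= 255:
        raise ValueError("A must be in the range 0..255")
    return (255 - a) / 255 * 100
```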

In this embodiment, the moving image compression apparatus 10 (see FIG. 1) compresses frames that include this transparency information. The moving image compression apparatus 10 may compress all of the frames that include transparency (α value) information, or only some of them. In the moving image data to be compressed, all frames may have transparency (α value) information, or only some frames may have it. The moving image compression apparatus 10 may also compress the moving image data by combining compression that uses the transparency (α value) information (described later) with other compression methods that do not.

[Basic configuration of the moving image compression apparatus]
FIG. 1 is a functional block diagram of the moving image compression apparatus 10 of this embodiment, and FIG. 2 is a functional block diagram of its prediction processing unit. The moving image compression apparatus 10, which serves as the "moving image data compression apparatus" of this embodiment, functions as an encoder and compression-encodes moving image data having transparency information. For example, it is used to compression-encode moving images for gaming machines, in which multiple separate moving images are superimposed and alpha-blended according to their transparency, so that part of the moving image displayed behind shows through on the display. It may, however, be used for compression encoding of any superimposed moving images having transparency information, not only those for gaming machines.

In the moving image compression apparatus 10, moving image data is processed in units of a predetermined rectangular area, that is, in units of a matrix made up of a predetermined number of pixels (described in detail later).
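Processing in rectangular units can be pictured as tiling each frame into fixed-size blocks. The sketch below is an illustration under stated assumptions: a frame is a list of rows of pixel values, the frame dimensions are exact multiples of the block size, and the name `split_into_blocks` is hypothetical.

```python
def split_into_blocks(frame, block_h, block_w):
    """Yield ((top, left), block) for every block_h x block_w tile of a frame.

    frame is a list of rows, each row a list of pixel values.
    Assumes the frame height and width are multiples of the block size.
    """
    height, width = len(frame), len(frame[0])
    for top in range(0, height, block_h):
        for left in range(0, width, block_w):
            block = [row[left:left + block_w]
                     for row in frame[top:top + block_h]]
            yield (top, left), block
```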

When frame image data in which one image is superimposed on another is compressed according to transparency, the frame image data of the one image, of the other image, or of both images may be compressed according to the transparency of the one image.

This embodiment is described using an example of an apparatus that compresses the frame image data of one image according to the transparency of that image.

The moving image compression apparatus 10 of this embodiment includes, as components for compression encoding as an encoder, a prediction processing unit 11, a DCT unit 12, a code generation unit 13, and a frame memory 14. It also includes, as components for decoding compressed data as a decoder, an inverse quantization unit 15 and an inverse DCT unit 16.

The prediction processing unit 11 performs processing for inter-frame prediction using multiple frames of the "moving image data". Specifically, it generates and outputs a predicted image (a frame generated by applying predetermined prediction processing to a past frame) from the moving image data supplied as the input image. The difference between the input image and this predicted image is supplied to the DCT unit 12 as a prediction error. The predicted image is also added to the data output by the inverse DCT unit 16 to generate a reference image, which is stored in the frame memory 14.
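The relationship between input image, predicted image, prediction error, and reference image can be sketched as below. This is a simplified illustration: blocks are lists of rows of single-channel values, the function names are hypothetical, and the error is passed back unchanged, whereas in the apparatus it would first go through the transform, quantization, and their inverses, so the reconstruction would generally be approximate.

```python
def prediction_error(input_block, predicted_block):
    """Element-wise difference: input minus prediction."""
    return [[i - p for i, p in zip(in_row, pred_row)]
            for in_row, pred_row in zip(input_block, predicted_block)]


def reconstruct(predicted_block, decoded_error):
    """Add the decoded error back onto the prediction to form a reference block."""
    return [[p + e for p, e in zip(pred_row, err_row)]
            for pred_row, err_row in zip(predicted_block, decoded_error)]
```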

The DCT unit 12 performs a DCT (discrete cosine transform) on image data. Specifically, it applies a frequency transform such as the discrete cosine transform to the image data block by block, decomposing it into frequency components and generating DCT data in the form of coefficients. In this embodiment, the DCT unit 12 generates DCT data by frequency-transforming the prediction error, that is, the difference between the input image and the predicted image resulting from the inter-frame prediction in the prediction processing unit 11. The DCT unit 12 is only one example configuration: a wavelet transform or Hadamard transform may be used instead of the DCT, and DPCM (Differential Pulse Code Modulation) may also be used.
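As a rough illustration of the block-wise frequency transform, the following sketch computes a direct (unoptimized) two-dimensional DCT-II of a square block. The orthonormal scaling and the function name `dct_2d` are assumptions; the specification does not state which normalization or block size the DCT unit 12 uses.

```python
import math


def dct_2d(block):
    """Naive orthonormal 2-D DCT-II of an n x n block (list of rows)."""
    n = len(block)

    def scale(k):
        # Orthonormal scaling factor for frequency index k.
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):          # vertical frequency index
        for v in range(n):      # horizontal frequency index
            acc = 0.0
            for y in range(n):
                for x in range(n):
                    acc += (block[y][x]
                            * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                            * math.cos((2 * x + 1) * v * math.pi / (2 * n)))
            out[u][v] = scale(u) * scale(v) * acc
    return out
```

For a block of constant values, all of the energy ends up in the single coefficient at index (0, 0), which is why smooth prediction errors compress well after the transform.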

The code generation unit 13 quantizes image data processed by a predetermined encoding scheme on the basis of a quantization coefficient to generate quantized data, and encodes it. The "predetermined encoding scheme" here refers to the transform coding scheme based on a frequency transform, such as the discrete cosine transform, performed by the immediately preceding DCT unit 12.

As shown in FIG. 1, the code generation unit 13 includes a quantization unit 19, which quantizes the DCT data using a quantization coefficient or the like to generate quantized data, and a variable-length coding unit 20, which further compresses the quantized data using entropy coding such as Huffman coding or arithmetic coding to generate encoded data.
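The quantization step can be sketched as uniform scalar quantization with a single step size. This is an assumption made for illustration: the specification only says that a quantization coefficient is used, and the names `quantize` and `dequantize` are hypothetical. The entropy coding stage is omitted.

```python
def quantize(coeffs, step):
    """Divide each coefficient by the step size and round to an integer level."""
    return [[int(round(c / step)) for c in row] for row in coeffs]


def dequantize(levels, step):
    """Approximate the original coefficients by scaling the levels back up."""
    return [[q * step for q in row] for row in levels]
```

Quantization is where information is discarded: `dequantize(quantize(c, step), step)` returns values on a grid of spacing `step`, so small coefficients collapse to zero and become cheap to entropy-code.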

The frame memory 14 is a storage medium such as RAM or a cache and stores various moving image data. Here it stores the moving image data generated by the inter-frame prediction in the prediction processing unit 11, as well as the moving image data generated by the inverse DCT processing in the inverse DCT unit 16.

The inverse quantization unit 15 recovers the DCT data by inversely quantizing the input quantized data using a predetermined quantization coefficient. In this embodiment, the quantized data input to the inverse quantization unit 15 is the output of the quantization unit 19 (see FIG. 1). The inverse DCT unit 16 performs inverse DCT processing on the DCT data output from the inverse quantization unit 15. The moving image data output from the inverse DCT unit 16 is combined with the predicted image (described later) output from the prediction processing unit 11 to form a reference image (described later) serving as a past frame, which is stored in the frame memory 14.

The prediction processing unit 11 includes a motion vector detection unit 17, which serves as the "motion vector detection means", and a motion compensation unit 18.

The motion vector detection unit 17 detects motion vectors between multiple frames. Specifically, it detects a motion vector between specific frame image data and other frame image data located before and/or after the specific frame image data. In this embodiment, the frame image at a specific time (for example, the current frame image) and the frame image immediately preceding it are each divided into a plurality of substantially rectangular blocks. The unit searches for the block of the other frame image from which the image information (color information and transparency information) of a reference-position block of the frame image at the specific time has moved, and detects the motion vector as the direction and amount of movement between those blocks.

The motion compensation unit 18 performs motion compensation, in which moving image content in the reference image (a past frame) is shifted by the motion vector.
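Motion compensation for one block can be sketched as reading the reference frame at a position offset by the motion vector. The sketch assumes that the vector `(dy, dx)` points from the block's position in the current frame to the matching position in the reference frame and that the shifted block lies entirely inside the reference frame; the name `motion_compensate` is hypothetical.

```python
def motion_compensate(reference, top, left, block_h, block_w, mv):
    """Return the predicted block for the block at (top, left).

    reference is a list of rows; mv = (dy, dx) is the assumed displacement
    from the current block position to its match in the reference frame.
    No bounds checking is done: the shifted block must lie inside the frame.
    """
    dy, dx = mv
    return [row[left + dx:left + dx + block_w]
            for row in reference[top + dy:top + dy + block_h]]
```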

FIG. 2 is a functional block diagram showing details of the prediction processing unit 11. As shown in the figure, the motion vector detection unit 17 of the prediction processing unit 11 includes a motion search control unit 21, a motion vector search unit 22, and an encoding cost calculation unit 23. The motion vector search unit 22 in turn includes a squared error calculation unit 221 and a motion vector determination unit 222.

The motion search control unit 21 controls the processing of the motion vector search unit 22 and the encoding cost calculation unit 23, and selects the most appropriate of the motion vectors found by the motion vector search unit 22.

The motion vector search unit 22 performs processing to search for motion vectors. Specifically, using the cost calculated by the encoding cost calculation unit 23, it searches for the block of the current frame image (encoding target image) to which the image shown in the reference-position block of the immediately preceding frame image (reference image) has moved.

When a motion vector is calculated, the squared error calculation unit 221 computes the squared error between the reference-position block of the immediately preceding frame image and a block of the current frame image.

Based on the calculation result of the squared error calculation unit 221, the motion vector determination unit 222 determines the motion vector from the position and direction between the reference position in the immediately preceding frame image and the moved position in a block of the current frame image, choosing it so that the cost calculated by the encoding cost calculation unit 23 becomes small.

The encoding cost calculation unit 23 calculates an encoding cost value, which serves as information for determining the distance between blocks, based on the information held by the reference-position block of the immediately preceding frame image and the information held by a block of the current frame image.
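How a search unit, an error calculation, and a minimum-cost decision fit together can be sketched with an exhaustive block-matching search. This is an illustration under stated assumptions, not the apparatus's actual procedure: frames hold single-channel values, a plain sum of squared differences stands in for the encoding cost, every displacement within a fixed radius is tried, the vector is expressed from the current block to its match in the reference frame, and all function names are hypothetical.

```python
def ssd(block_a, block_b):
    """Sum of squared differences between two equally sized blocks."""
    return sum((a - b) ** 2
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def crop(frame, top, left, h, w):
    """Extract the h x w block whose top-left corner is (top, left)."""
    return [row[left:left + w] for row in frame[top:top + h]]


def search_motion_vector(current, reference, top, left, h, w, radius):
    """Try every displacement within +/-radius and keep the cheapest one.

    Returns ((dy, dx), cost) for the block of `current` at (top, left).
    """
    target = crop(current, top, left, h, w)
    best_mv, best_cost = (0, 0), None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            if (y < 0 or x < 0
                    or y + h > len(reference) or x + w > len(reference[0])):
                continue  # candidate block falls outside the reference frame
            cost = ssd(target, crop(reference, y, x, h, w))
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (dy, dx), cost
    return best_mv, best_cost
```

Practical encoders usually replace the exhaustive loop with a faster search pattern and add a term for the bits needed to code the vector; both are left out here to keep the sketch short.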

[Calculation of motion vectors]
The principle of motion vector calculation in this embodiment is described below with reference to the schematic diagram of FIG. 3. The moving image compression apparatus 10 of this embodiment calculates motion vectors on the basis of this principle.

For example, as shown in FIG. 3, consider the case where the frame image immediately preceding the latest frame image, which serves as the reference image (hereinafter the "immediately preceding frame 31"), and the latest frame image, which is the encoding target image (hereinafter the "latest frame 32"), are each divided into a matrix of a predetermined number of rectangular blocks (for example, m rows × n columns, where m > 1 and n > 1), and a motion vector is calculated by comparing how closely the blocks of the two frame images resemble each other. In this case, the block of the current frame image whose image information most closely resembles a specific block of the immediately preceding frame 31, for example block 311, is selected, and the motion vector is determined based on the distance and direction between the two blocks.
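The block partitioning described above can be sketched as follows. This is a minimal illustration, not the apparatus's implementation: the function name `split_into_blocks`, the representation of a frame as a list of pixel rows, and the choice of block size as the parameter (rather than m and n directly) are all assumptions made for the example.

```python
# Minimal sketch: divide a frame into a matrix of rectangular blocks.
# The frame layout (list of rows) and the function name are assumptions.

def split_into_blocks(frame, block_h, block_w):
    """Return a dict mapping (block_row, block_col) to a flat list of pixels."""
    height, width = len(frame), len(frame[0])
    blocks = {}
    for top in range(0, height, block_h):
        for left in range(0, width, block_w):
            blocks[(top // block_h, left // block_w)] = [
                frame[y][x]
                for y in range(top, min(top + block_h, height))
                for x in range(left, min(left + block_w, width))
            ]
    return blocks

# A 4 x 4 frame split into 2 x 2 blocks gives a 2-row x 2-column matrix.
frame = [[row * 4 + col for col in range(4)] for row in range(4)]
blocks = split_into_blocks(frame, 2, 2)
print(sorted(blocks))  # block coordinates (row, column)
```

Keying the blocks by (row, column) keeps the later neighbour lookups (above, below, left, right) a matter of simple index arithmetic.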

When a motion vector is calculated in this way, the block difference has generally been computed with a squared-error expression such as equation (1) below (the equation shows the case where the color information is "R", "G", and "B" of the RGB color space).

SSD = Σ{(Rdst − Rsrc)^2 + (Gdst − Gsrc)^2 + (Bdst − Bsrc)^2} ... (1)

where
SSD: Sum of Squared Difference (squared error)
Σ: sum of the values over all pixels in the block
Rdst: R information as color information of a specific pixel of the encoding target frame
Rsrc: R information as color information of a specific pixel of the reference frame
Gdst: G information as color information of a specific pixel of the encoding target frame
Gsrc: G information as color information of a specific pixel of the reference frame
Bdst: B information as color information of a specific pixel of the encoding target frame
Bsrc: B information as color information of a specific pixel of the reference frame
^: exponentiation symbol. For example, "XX^2" denotes "XX squared".

When the pixel information includes transparency information, one conceivable approach is simply to use the per-pixel transparency information as one more element of the squared error, which gives an expression such as equation (2) below.

SSD = Σ{(Adst − Asrc)^2 + (Rdst − Rsrc)^2 + (Gdst − Gsrc)^2 + (Bdst − Bsrc)^2} ... (2)

where
SSD: Sum of Squared Difference (squared error)
Σ: sum of the values over all pixels in the block
Adst: transparency information (α value) of a specific pixel of the encoding target frame
Asrc: transparency information (α value) of a specific pixel of the reference frame
Rdst: R information as color information of a specific pixel of the encoding target frame
Rsrc: R information as color information of a specific pixel of the reference frame
Gdst: G information as color information of a specific pixel of the encoding target frame
Gsrc: G information as color information of a specific pixel of the reference frame
Bdst: B information as color information of a specific pixel of the encoding target frame
Bsrc: B information as color information of a specific pixel of the reference frame
^: exponentiation symbol. For example, "XX^2" denotes "XX squared".
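As an illustration of equations (1) and (2), the following minimal Python sketch computes both squared-error sums for a pair of blocks. The pixel layout (tuples of `(A, R, G, B)` integers in the range 0 to 255) and the function names `ssd_rgb` and `ssd_argb` are assumptions made for this example and do not come from the specification.

```python
# Pixels are (A, R, G, B) tuples; a block is a flat list of pixels.
# This layout and the function names are illustrative assumptions.

def ssd_rgb(dst_block, src_block):
    """Equation (1): squared error over the R, G, B channels only."""
    total = 0
    for (_, r_d, g_d, b_d), (_, r_s, g_s, b_s) in zip(dst_block, src_block):
        total += (r_d - r_s) ** 2 + (g_d - g_s) ** 2 + (b_d - b_s) ** 2
    return total

def ssd_argb(dst_block, src_block):
    """Equation (2): alpha is simply treated as a fourth channel."""
    total = 0
    for (a_d, r_d, g_d, b_d), (a_s, r_s, g_s, b_s) in zip(dst_block, src_block):
        total += (a_d - a_s) ** 2
        total += (r_d - r_s) ** 2 + (g_d - g_s) ** 2 + (b_d - b_s) ** 2
    return total

# Second pixel is fully transparent in both frames (A = 0) but has
# very different R values, so it still dominates both sums.
dst = [(255, 10, 20, 30), (0, 200, 0, 0)]
src = [(255, 12, 20, 27), (0, 0, 0, 0)]
print(ssd_rgb(dst, src), ssd_argb(dst, src))  # 40013 40013
```

In this example the invisible pixel contributes 40000 of the 40013 total under either equation, which is the kind of case the alpha-aware measure in the following paragraphs is meant to address.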

When searching for the most similar block in the encoding target frame, a cost value, obtained by adding a value related to the magnitude of the motion vector to the SSD value calculated by equation (2), is often used. This cost value is obtained, for example, by equation (3) below. The blocks for which the cost value is minimal are then detected as the start point and end point of the motion vector.
Cost value = SSD + λ × (bits consumed by the motion vector) ... (3)
where
λ: a predetermined coefficient for converting the number of bits to the unit of SSD
Equation (2) also uses the pixel transparency information in calculating the squared error, but to make fuller use of the transparency information, in this embodiment the motion vector determination unit 222 of the motion vector search unit 22 obtains the similarity value between blocks by equation (4) below.

SSD' = Σ{(Adst − Asrc)^2 + (Rdst − Rsrc)^2 × Asrc / 255 + (Gdst − Gsrc)^2 × Asrc / 255 + (Bdst − Bsrc)^2 × Asrc / 255} ... (4)

where
SSD': Sum of Squared Difference (squared error)
Σ: sum of the values over all pixels in the block
Adst: transparency information (α value) of a specific pixel of the encoding target frame
Asrc: transparency information (α value) of a specific pixel of the reference frame, where 0 ≤ Asrc ≤ 255
Rdst: R information as color information of a specific pixel of the encoding target frame
Rsrc: R information as color information of a specific pixel of the reference frame
Gdst: G information as color information of a specific pixel of the encoding target frame
Gsrc: G information as color information of a specific pixel of the reference frame
Bdst: B information as color information of a specific pixel of the encoding target frame
Bsrc: B information as color information of a specific pixel of the reference frame
^: exponentiation symbol. For example, "XX^2" denotes "XX squared".

The encoding cost calculation unit 23 then uses "SSD'" calculated by equation (4) in place of "SSD" in equation (3), and the blocks for which the cost value of equation (5) below is minimal are detected as the start point and end point of the motion vector.
Cost value = SSD' + λ × (bits consumed by the motion vector) ... (5)
where
λ: a predetermined coefficient for converting the number of bits to the unit of SSD'
Equation (4) differs from equation (2) in that each of the "R", "G", and "B" terms on the right-hand side is multiplied by "Asrc / 255" (where 0 ≤ Asrc ≤ 255).

Equation (4) applies to the case where transparency is set for each pixel in the range 0 ≤ Asrc ≤ 255 (0: completely transparent, 255: completely opaque). For example, if a specific pixel is completely transparent (Asrc = 0), the values of the "R", "G", and "B" terms are all 0. If a specific pixel is semi-transparent (0 < Asrc < 255), the values of the "R", "G", and "B" terms are smaller than when the pixel is completely opaque (Asrc = 255), and the higher the transparency of a semi-transparent pixel, the smaller those terms become. In other words, when a motion vector is determined, the transparency error carries more weight than the color-information error between the pixels or blocks being compared. Ignoring the color-information error, or lowering the share it takes in the total value, often reduces the amount of code produced by encoding.
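The alpha-weighted measure and the cost of equation (5) can be sketched as below. This is a hedged illustration based only on the description above (each color term scaled by Asrc / 255); the `(A, R, G, B)` tuple layout, the function names `ssd_alpha_weighted` and `cost`, and the sample values of `mv_bits` and `lam` are assumptions for the example.

```python
# Sketch of the alpha-weighted squared error (SSD') and the cost value.
# Pixel layout (A, R, G, B) and all names are illustrative assumptions.

def ssd_alpha_weighted(dst_block, src_block):
    """SSD' with each colour term scaled by Asrc / 255."""
    total = 0.0
    for (a_d, r_d, g_d, b_d), (a_s, r_s, g_s, b_s) in zip(dst_block, src_block):
        weight = a_s / 255  # 0.0 = fully transparent, 1.0 = fully opaque
        colour = (r_d - r_s) ** 2 + (g_d - g_s) ** 2 + (b_d - b_s) ** 2
        total += (a_d - a_s) ** 2 + weight * colour
    return total

def cost(ssd_value, mv_bits, lam):
    """Cost value = SSD' + lambda * bits consumed by the motion vector."""
    return ssd_value + lam * mv_bits

# The second pixel is fully transparent in the reference frame, so its
# large R difference no longer contributes to the result.
dst = [(255, 10, 20, 30), (0, 200, 0, 0)]
src = [(255, 12, 20, 27), (0, 0, 0, 0)]
ssd_prime = ssd_alpha_weighted(dst, src)
print(ssd_prime)                          # 13.0
print(cost(ssd_prime, mv_bits=6, lam=4.0))  # 37.0
```

With the unweighted squared error the same two blocks would score 40013, so the weighting changes which candidate block looks "closest" whenever transparent regions carry arbitrary color values.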

The higher the transparency of an image, the less important color information is for identifying a position in the image. In the conventional calculation of equation (2), however, the "R", "G", and "B" values are always computed even when the pixels in a block are completely transparent or highly transparent, so they account for a large share of the value obtained from equation (2). Because this low-importance color information can neither be ignored nor given a smaller share of the total value, reduction of the amount of code at encoding time is hindered.

In the calculation of equation (4), by contrast, the higher the transparency, the smaller the share of the "R", "G", and "B" terms in the total value of "SSD'". That is, the higher the transparency of the image, the more the value obtained from equation (4) reflects the greater importance of transparency relative to color information when position information is identified, and the contribution of the image quality expressed by R, G, and B is corrected accordingly. Because equation (4) allows the color-information error to be ignored or to take a smaller share of the total value, the amount of code at encoding time decreases and the compression rate can be raised.

[Correlation between color information and transparency]
FIG. 4 schematically shows examples of the correlation between color information and transparency in the moving image compression apparatus 10 of this embodiment. The horizontal axis shows transparency (α value, where 0 ≤ α value ≤ 255), and the vertical axis shows the contribution of color information or of image quality (RGB contribution or YUV contribution, where 0% ≤ contribution ≤ 100%).

As shown in the figure, in the square error calculation unit 221 of the moving image compression apparatus 10 of this embodiment, the motion vector is detected based on equation (4), so the correlation between color information and transparency when searching for a motion vector follows the first function 101 shown in FIG. 4: when the α value is 0/255 (fully transparent, transparency 100%), the contribution of color information is 0%; when the α value is 127/255 (transparency of about 50%), the contribution is about 50%; and when the α value is 255/255 (fully opaque, transparency 0%), the contribution is 100%. That is, the α value and the contribution of color information are proportional.

In this embodiment, however, the calculation in the square error calculation unit 221 of the moving image compression apparatus 10 need not be strictly proportional. For example, as with the second function 102, the third function 103, and the eighth function 108 shown in FIG. 4, the contribution of color information may be 0% when the α value is 0/255 and 100% when the α value is 255/255, with transparency and color information varying in rough correlation in between (0/255 < α value < 255/255, 0% < contribution of color information < 100%). Alternatively, as with the fourth function 104, the contribution of color information may be 0% when the α value is at least 0/255 and less than 127/255, and 100% when the α value is at least 127/255 and at most 255/255.

In this embodiment, the calculation in the square error calculation unit 221 of the moving image compression apparatus 10 may also use a minimum value of transparency or color information other than 0/255 or 0%, and a maximum value other than 255/255 or 100%. For example, as with the fifth function 105 shown in FIG. 4, the maximum contribution of color information may be set below 100% while transparency and color information remain roughly correlated. As with the sixth function 106, the minimum contribution of color information may be set above 0% while transparency and color information remain roughly correlated. As with the seventh function 107, the minimum contribution may be set above 0% and the maximum below 100% while transparency and color information remain roughly correlated. Any function other than the first function 101 through the seventh function 107 may also be used, as long as it is set in the same manner as these functions.
In other words, if the relationship between the α value and the contribution of color information tends to rise toward the right over the range of α values used in the squared-error calculation (0 to 255 in this embodiment), motion vector detection takes the α value into account and the amount of code can be reduced. Here, "rising toward the right" covers functions that rise monotonically, such as functions 101, 103, 105, 106, and 107; functions that rise overall while increasing and decreasing along the way, such as functions 102 and 108; and functions whose color-information contribution changes abruptly at a predetermined α value, such as function 104.
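The family of contribution functions described above can be sketched by making the alpha-to-weight mapping a pluggable parameter. The three mappings below loosely correspond to the proportional case (function 101), the threshold case (function 104), and a case with a raised minimum and lowered maximum (function 107). All function names, the threshold of 127, and the bounds 0.25 and 0.75 are assumptions chosen for illustration, not values taken from FIG. 4.

```python
# Sketch: pluggable mappings from the reference alpha value (0-255)
# to the colour contribution (0.0-1.0). Names and constants are assumed.

def linear_weight(alpha):
    """Proportional mapping, in the spirit of the first function 101."""
    return alpha / 255

def step_weight(alpha, threshold=127):
    """0% below the threshold, 100% at or above it (cf. function 104)."""
    return 1.0 if alpha >= threshold else 0.0

def clamped_weight(alpha, low=0.25, high=0.75):
    """Minimum above 0% and maximum below 100% (cf. function 107)."""
    return low + (high - low) * alpha / 255

def weighted_ssd(dst_block, src_block, weight_fn):
    """Squared error with the colour terms scaled by weight_fn(Asrc)."""
    total = 0.0
    for (a_d, r_d, g_d, b_d), (a_s, r_s, g_s, b_s) in zip(dst_block, src_block):
        colour = (r_d - r_s) ** 2 + (g_d - g_s) ** 2 + (b_d - b_s) ** 2
        total += (a_d - a_s) ** 2 + weight_fn(a_s) * colour
    return total

# A fully transparent pixel with a large colour difference.
dst, src = [(0, 200, 0, 0)], [(0, 0, 0, 0)]
for fn in (linear_weight, step_weight, clamped_weight):
    print(fn.__name__, weighted_ssd(dst, src, fn))
```

Passing the mapping as an argument keeps the squared-error loop unchanged while the contribution curve is varied, which mirrors how the text treats functions 101 to 108 as interchangeable alternatives.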

[Processing procedure]
FIG. 5 is a flowchart showing an example of the processing procedure of the moving image compression apparatus 10 of this embodiment. An example of the processing procedure of this embodiment, as a "moving image data compression method", is described below with reference to FIG. 5 and FIGS. 1 to 4. The procedure of FIG. 5 describes a motion vector search based on the so-called "diamond search", but the search is not limited to this and may be performed by any other method, such as a "full search" that exhaustively examines every block of the frame being processed.

The motion search control unit 21 of the motion vector detection unit 17 controls the motion vector search unit 22 to start the motion vector search. The motion vector determination unit 222 of the motion vector search unit 22 divides the immediately preceding frame 31, which serves as the reference image, into a matrix of a predetermined number of rectangular blocks (for example, m rows × n columns, where m > 1 and n > 1), finds the block at a predetermined position (for example, the bottom-right block 311), and sets it as the reference position.

Next, the motion vector determination unit 222 divides the latest frame 32, which is the encoding target image, into the same matrix of rectangular blocks as the immediately preceding frame 31 (for example, m rows × n columns) and finds the block of the latest frame 32 at the same position as in the immediately preceding frame 31 (for example, the bottom-right block 321). The motion vector determination unit 222 then sets, as the initial motion vector, the vector whose start point and end point are the blocks 311 and 321 found in the immediately preceding frame 31 and the latest frame 32.

The motion vector determination unit 222 then extracts image information (the color information (R, G, and B values) and the transparency information (A value) of each of the pixels that make up each block) from block 311 of the immediately preceding frame 31 and block 321 of the latest frame 32, which are the start point and end point of the initial motion vector. The extracted information is sent to the square error calculation unit 221, which substitutes it into equation (4) and performs the calculation. This processing by the motion vector determination unit 222 and the square error calculation unit 221 is repeated for each pixel of block 311 of the immediately preceding frame 31 and block 321 of the latest frame 32, and the sum is calculated as the value of "SSD'" in equation (4).

The motion vector search unit 22 then sends the calculated "SSD'" value to the encoding cost calculation unit 23. The encoding cost calculation unit 23 substitutes the received value into equation (5) and calculates the "cost" value (step S1, motion vector detection procedure).

The encoding cost calculation unit 23 sends the calculated "cost" value to the motion vector search unit 22, which temporarily stores it.

Next, the motion vector search unit 22 calculates the costs obtained when the motion vector is moved up, down, left, and right (step S2, motion vector detection procedure). Specifically, with block 311 at the reference position of the immediately preceding frame 31 held fixed, the motion vector determination unit 222 looks up the block 322 immediately above block 321 selected in the latest frame 32, the block immediately below it (not present in FIG. 5), the block 323 immediately to its left, and the block immediately to its right (not present in FIG. 5), and determines the motion vectors that have block 311 at the reference position of the immediately preceding frame 31 at one end and each of the blocks 322 and 323 at the other. The square error calculation unit 221 then calculates the "SSD'" value for the blocks 311, 322, and 323 at the ends of each determined motion vector using equation (4). The encoding cost calculation unit 23 substitutes those "SSD'" values into equation (5) to calculate the "cost" values, and the motion vector determination unit 222 stores each calculated "cost" value.

The motion vector determination unit 222 then compares the "cost" values obtained for blocks 322 and 323, that is, for block 321 of the latest frame 32 moved up, down, left, or right, finds the smallest of them, and adopts the direction with the smallest "cost" as the direction in which block 321 of the latest frame 32 is to be moved (step S3, motion vector detection procedure). For example, if, among the moved blocks 322 and 323 of the latest frame 32, the block 322 immediately above the original block 321 has the smallest "cost", the motion vector determination unit 222 adopts the direction of block 322, that is, upward, as the movement direction of block 321 in the latest frame 32.

The motion vector determination unit 222 then compares the "cost" of the initial motion vector with the minimum "cost", that of the moved block 322, which was used in step S3 to select the movement direction. If the minimum "cost" after the move is smaller than the "cost" of the initial motion vector ("NO" in step S4), block 322 in that direction is set as the new reference block of the latest frame 32, and the "cost" of block 322 and the "costs" of the motion vectors to the blocks above, below, left, and right of it (block 324 above, block 325 to the left, and block 321 below) are calculated (steps S2 and S3) and compared. This is repeated as long as the "cost" of the block after the move is smaller than the "cost" of the block before the move ("NO" in step S4, motion vector detection procedure). If, on the other hand, the "cost" of the initial motion vector is smaller than the minimum "cost" of the moved block 322 ("YES" in step S4), the motion vector determination unit 222 does not move to block 322, whose "cost" was selected in step S3, but detects block 321, the block before the move, as the end point (or start point) of the motion vector, and the processing ends.
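The loop over steps S1 to S4 can be sketched as a greedy neighbour search. This is a simplified illustration of the flow described above, not a full diamond search implementation: the function name `search_block`, the use of a callable `cost_at` that returns the cost for a candidate block position, and the choice to stop when the best neighbour merely ties the current cost (a case the text does not specify) are all assumptions.

```python
# Sketch of the S1-S4 loop: start at a block, evaluate the four
# neighbours, move to the cheapest one while it lowers the cost.
# All names here are illustrative assumptions.

def search_block(cost_at, start, rows, cols):
    """Return (position, cost) of the block where the search stops."""
    best_pos = start
    best_cost = cost_at(start)                       # step S1
    while True:
        r, c = best_pos
        neighbours = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
        candidates = [
            (cost_at(p), p)
            for p in neighbours
            if 0 <= p[0] < rows and 0 <= p[1] < cols  # skip blocks outside the frame
        ]                                            # step S2
        if not candidates:
            return best_pos, best_cost
        cand_cost, cand_pos = min(candidates)        # step S3
        if cand_cost >= best_cost:                   # step S4 "YES": stop here
            return best_pos, best_cost               # (ties also stop; assumed)
        best_pos, best_cost = cand_pos, cand_cost    # step S4 "NO": move, repeat

# Hypothetical cost values for a 3 x 3 block matrix; the search starts
# at the bottom-right block, as in the example in the text.
costs = [[9, 7, 8],
         [6, 2, 5],
         [7, 5, 9]]
print(search_block(lambda p: costs[p[0]][p[1]], (2, 2), 3, 3))  # ((1, 1), 2)
```

Because each iteration only moves when the cost strictly decreases, the loop always terminates, but like any local search it can stop at a local minimum that a full search would avoid.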

The motion vector detection unit 17 performs steps S1 to S4 while setting each block of the immediately preceding frame 31 as the reference position in turn. When this is complete, the prediction processing unit 11 outputs the generated predicted image after motion compensation by the motion compensation unit 18. The moving image compression apparatus 10 generates a prediction error from the predicted image and the input image, and the DCT unit 12 applies DCT processing to it. The DCT data generated by the DCT unit 12 is supplied to the code generation unit 13, where the quantization unit 19 generates quantized data and the variable length coding unit 20 generates encoded data, completing the compression encoding of the data (compression procedure).

As described above, in this embodiment the motion vector between specific moving image data and other moving image data is detected using the transparency information of the specific moving image data and/or the other moving image data. For moving image data in which transparency information is set, the motion vector can therefore be detected in a manner that depends on that transparency information, and the moving image data can be compressed. Because the motion vector is detected depending on the transparency information, the degree to which color information of low importance for detecting transparent portions is reflected in motion vector detection and encoding can be reduced for moving image data that has transparent portions. This improves the compression rate of moving images that include transparency information.

In this embodiment, the motion vector is detected by correcting, depending on the value of the transparency level, the magnitude of the vector formed by the position information of the specific moving image data and the position information of the other moving image data. The transparency level is thereby reflected in motion vector detection and encoding, and the compression rate of moving images that include transparency information can be improved.

In this embodiment, the higher the transparency defined by the transparency information of the moving image data, the smaller the influence of the color information of the moving image data on the magnitude of the motion vector. As the transparency of the moving image data rises and color information becomes less important for motion vector detection, the degree to which color information is reflected in motion vector detection and encoding can be lowered. In compressing moving images that include transparency information, this reduces low-importance information while improving the compression rate.

In this embodiment, the motion vector can be detected by correcting a value obtained from the color information of the moving image data based on the transparency information, so in compressing moving images that include transparency information, the degree of transparency can be reflected in the concrete arithmetic of motion vector detection and encoding. Reducing low-importance data according to the degree of transparency while improving the compression rate can thus be realized in concrete arithmetic processing.

In this embodiment, the motion vector is detected from the absolute error or squared error of values obtained by multiplying color information by transparency information, so accurate arithmetic processing is performed on values that reflect the transparency information and an appropriate motion vector can be detected.

In this embodiment, the motion vector is detected using the transparency information, so accurate arithmetic processing is performed on values that reflect the transparency information and an appropriate motion vector can be detected.

In this embodiment, the latest frame is the encoding target image and the immediately preceding frame is the reference image, but the invention is not limited to this. A specific past frame may be used as the encoding target image and the frame immediately after it (later in time) as the reference image. The reference image need not be the frame immediately before or after the encoding target frame: a frame several frames before or after it (for example, two frames before or two frames after a specific frame) may be used, or both a frame before and a frame after the frame at a given time may be used as reference images.

The above embodiment is an example of the present invention, and it goes without saying that the present invention is not limited to that embodiment.

10 ... Moving image compression apparatus (moving image data compression apparatus)
13 ... Code generation unit (compression means)
17 ... Motion vector detection unit (motion vector detection means)

Claims

A moving image data compression apparatus that compresses moving image data composed of frame image data in which transparency information is set at least in part, comprising:
motion vector detection means for detecting a motion vector between specific frame image data and other frame image data existing in the forward and/or backward direction of the specific frame image data; and
compression means for compressing the frame image data using the detected motion vector,
wherein the motion vector detection means detects the motion vector between the specific frame image data and the other frame image data using the transparency information of the specific frame image data and/or the other frame image data.

2. The moving image data compression apparatus according to claim 1, wherein the motion vector detection means detects the motion vector by correcting, in accordance with the transparency value given by the transparency information of the specific frame image data and/or the other frame image data, the degree of contribution to image quality used in determining the motion vector.

3. The moving image data compression apparatus according to claim 2, wherein, when detecting the motion vector, the motion vector detection means makes the influence of the color information of the specific frame image data and/or the color information of the other frame image data on the determination of the motion vector smaller as the transparency specified by the transparency information of the specific frame image data and/or the transparency information of the other frame image data is higher.

4. The moving image data compression apparatus according to claim 1, wherein the motion vector detection means detects the motion vector using a value obtained by multiplying the color information of the specific frame image data by the transparency information of the specific frame image data and/or a value obtained by multiplying the color information of the other frame image data by the transparency information of the other frame image data.

5. The moving image data compression apparatus according to claim 4, wherein the motion vector detection means detects the motion vector using an absolute error or a squared error of a value obtained by multiplying the color information of the specific frame image data by the transparency information of the specific frame image data and/or a value obtained by multiplying the color information of the other frame image data by the transparency information of the other frame image data.

6. The moving image data compression apparatus according to any one of the above claims, wherein the motion vector detection means detects the motion vector using the transparency information of the specific frame image data and/or the transparency information of the other frame image data.

7. A moving image data compression method for compressing moving image data configured using frame image data in which transparency information is set at least in part, comprising:
a motion vector detection procedure of detecting a motion vector between specific frame image data and other frame image data existing in the forward and/or backward direction of the specific frame image data; and
a compression procedure of compressing the frame image data using the detected motion vector,
wherein, in the motion vector detection procedure, the motion vector between the specific frame image data and the other frame image data is detected using the transparency information of the specific frame image data and/or the other frame image data.