JP2586715B2

JP2586715B2 - Video signal coding method

Info

Publication number: JP2586715B2
Application number: JP25290790A
Authority: JP
Inventors: 淳一大木
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1990-09-20
Filing date: 1990-09-20
Publication date: 1997-03-05
Anticipated expiration: 2012-03-05
Also published as: JPH04129491A

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、帯域圧縮技術を用いた動画像信号の符号化
方式に関する。Description: TECHNICAL FIELD The present invention relates to a video signal encoding system using a band compression technique.

（従来の技術）従来の帯域圧縮技術を用いた動画像信号の符号化方式
としては、たとえば1989年電予情報通信学会春季全国大
会、資料番号Ｄ−233に記載の「ISDN対応カラー動画像
テレビ電話装置」などが知られている。この動画像信号
の符号化方式では、画面における顔領域を抽出してマッ
プを作成する。そして、画像符号化部ではフレーム間フ
レーム内適応予測を行い、この時もし顔の領域であれば
最終段まで符号化をし、それ以外の領域であれば１つ前
の段階で符号化を止めることにより符号化量を減らして
いる。(Prior Art) As a coding method of a moving image signal using a conventional band compression technique, for example, an "ISDN-compatible color moving image television" described in Document No. D-233, 1989, National Institute of Information and Communication Technology Spring Meeting. Telephone devices "are known. In this moving image signal encoding method, a map is created by extracting a face area on a screen. Then, the image encoding unit performs inter-frame intra-frame adaptive prediction. At this time, if the region is a face region, the encoding is performed up to the last stage, and if the region is other than the region, the encoding is stopped at the previous stage. This reduces the amount of coding.

（発明が解決しようとする課題）しかしながら上述した従来の動画像信号の符号化方式
では、顔以外の背景の部分も粗く符号化するから背景部
分の雑音により無駄な情報が発生してしまう。また、連
続する画面間で背景部分から顔部分に変化したとする
と、粗い符号化から細かい符号化に変るから、予測画素
信号がここでもかなり発生してしまい、無駄な情報を符
号化することになってしまう。その結果符号化効率が低
下してしまう。(Problems to be Solved by the Invention) However, in the above-described conventional moving image signal encoding method, unnecessary portions are generated due to noise in the background portion because the background portion other than the face is roughly encoded. Also, if the background changes from a background portion to a face portion between successive screens, the coding changes from coarse coding to fine coding, so that a considerable amount of predicted pixel signals are generated here, resulting in coding of useless information. turn into. As a result, the coding efficiency decreases.

（課題を解決するための手段）本発明の動画像信号の符号化方式は、画面間の相関を
利用した動画像信号の符号化方式であって、入力する動
画像信号の１画面を複数画素からなるブロックに分割
し、ブロック毎に前画面との差分を検出し、該差分値が
予め定められた第１の閾値以上のときには有効ブロック
とし、前記差分値が予め定められた第１の閾値未満のと
きには無効ブロックとしてフレーム毎に第１の有効ブロ
ックマップを作成する手段と、該第１の有効ブロックマ
ップに対して第１の重みづけを行う手段と、前画面にお
ける第６の有効ブロックマップに対して第２の重みづけ
を行う手段と、前記第１の重みづけを行った第１の有効
ブロックマップと、前記第２の重みづけを行った第６の
有効ブロックマップとを加算合成して重みづけが成され
た第２の有効ブロックマップを得る手段と、該第２の有
効ブロックマップに対して、各ブロックの近傍のブロッ
クを参照し、近傍のブロックおよび対象ブロックの値の
合計値が予め定められた第２の閾値以上のときには当該
対象ブロックを有効ブロックとし、第２の閾値未満のと
きには当該対象ブロックを無効ブロックとするセグメン
テーションを行って第３の有効ブロックマップを得る手
段と、該第３の有効ブロックマップ内の無効ブロックに
ついて、近傍のブロックを参照し、近傍のブロックの値
の合計値が予め定められた第３の閾値以上のときには当
該無効ブロックを有効ブロックに置き替え、第３の閾値
未満のときには当該無効ブロックを無効ブロックのまま
として第４の有効ブロックマップを得る手段と、該４の
有効ブロックマップの有効ブロック数が予め定められた
第４の閾値以上の場合は前記第４の有効ブロックマップ
の有効ブロックを全て無効ブロックに置き換えて第５の
有効ブロックマップとし、前記第４の有効ブロックマッ
プの有効ブロック数が予め定められた第４の閾値未満の
場合は前記第４の有効ブロックマップをそのままで第５
の有効ブロックマップとする手段と、該第５の有効ブロ
ックマップを１フレーム時間遅延させて第６の有効ブロ
ックマップを得る手段と、前記動画像信号の入力時から
前記第４の有効ブロックマップの生成時までの時間の遅
延を前記動画像信号に与える手段と、遅延を与えられた
前記動画像信号について、前記第４の有効ブロックマッ
プで有効ブロックとされた領域を、画面間の相関、画面
内の相関またはその両方を用いて符号化を行う手段とを
有する。(Means for Solving the Problems) The moving picture signal encoding method of the present invention is a moving picture signal encoding method using correlation between screens, and one screen of an input moving image signal is composed of a plurality of pixels. , And a difference from the previous screen is detected for each block. When the difference value is equal to or greater than a predetermined first threshold value, the block is regarded as an effective block, and the difference value is set to a predetermined first threshold value. Means for creating a first effective block map for each frame as an invalid block when it is less than, means for performing a first weighting on the first effective block map, and a sixth effective block map on the previous screen. A second weighting means, a first effective block map having the first weighting, and a sixth effective block map having the second weighting are added and synthesized. Weight Means for obtaining a second effective block map in which is performed, and referring to blocks near each block with respect to the second effective block map, the total value of the values of the neighboring blocks and the target block is determined in advance. Means for obtaining a third valid block map by performing segmentation by setting the target block as an effective block when the target block is equal to or more than the second threshold and setting the target block as an invalid block when the target block is less than the second threshold; In the invalid block in the effective block map, the neighboring block is referred to, and when the total value of the neighboring blocks is equal to or more than a predetermined third threshold value, the invalid block is replaced with the valid block, Means for obtaining a fourth valid block map by keeping the invalid block as an invalid block when the value is less than the threshold value; If the number of valid blocks in the map is equal to or greater than a predetermined fourth threshold value, all valid blocks in the fourth valid block map are replaced with invalid blocks to form a fifth valid block map, and the fourth valid block map is used. If the number of effective blocks is less than a predetermined fourth threshold value, the fourth effective block map is used as it is in the fifth case.
Means for delaying the fifth effective block map by one frame time to obtain a sixth effective block map, and means for obtaining the sixth effective block map from the input of the moving image signal. Means for giving a delay of time until generation to the moving image signal, and, for the moving image signal with the delay, a region defined as an effective block in the fourth effective block map, Means for performing coding by using the correlation within or both.

（作用）テレビ電話などにおいては、背景部分は固定でおもに
話者が動くことから、話者の部分を切出して符号化を行
えば、背景などからの雑音によって発生する無駄な符号
化情報量を除去でき符号化能率を上げることができる。(Operation) In a videophone or the like, since the background portion is fixed and the speaker mainly moves, if the speaker portion is cut out and encoded, the amount of useless encoded information generated due to noise from the background or the like is reduced. The coding efficiency can be improved.

本発明においては、話者の部分を切出して話者部分の
みを符号化することにより、符号化効率を高める。話者
の切出し方について図面を参照しながら詳細に説明す
る。第１図の時刻t0,t1,t2に示すように話者が動いたと
仮定する。そして、時刻t0および時刻t1の画面間での差
分を求めると第２図の斜線で示される領域が求められ、
背景部分の孤立した斜線部分は背景の雑音により発生し
た差分信号であるとする。次に、画面を水平方向ｎ画素
×垂直方向ｎ画素の複数の画素からなるブロックに分割
し、各ブロック内の差分信号の絶対値和が予め定められ
た第１の閾値以上のときには、そのブロックを有効ブロ
ックとし、差分信号の絶対値和が第１の閾値未満のとき
にはそのブロックを無効ブロックとする。以上の処理に
よって得られた時刻t1における有効ブロックマップを第
３図（Ｂ）に示す。第３図（Ｂ）の黒く塗られた部分が
有効ブロックである。第３図（Ａ）は、時刻t0と時刻t0
よりも１画面前の時刻t0−１との画面間で求められた第
６の有効ブロックマップであるとする。そして、現画面
の有効ブロックマップ（第３図（Ｂ））すなわち第１の
有効ブロックマップに第一の重みづけを行い、前画面の
有効ブロックマップ（第３図（Ａ））である第６の有効
ブロックマップに対しては、第２の重みづけを行う。以
下に重みづけの一例を示す。例えば、前フレームの有効
ブロックを１とし、無効ブロックを０とする。現フレー
ムの有効ブロックは２とし、現フレームの無効ブロック
は前フレームの無効ブロックと同様に０とする。この様
にして重みづけを行った前フレームの有効ブロックマッ
プと、現フレームの有効ブロックマップとを加算合成
し、第２の有効ブロックマップを得る。第２の有効ブロ
ックマップは、第３図（Ｃ）の様になる。次に第３図
（Ｃ）の加算合成された第２の有効ブロックマップに対
して、セグメンテーンションを行う。セグメンテーショ
ンの一例を第３図および第４図を参照しながら説明す
る。例えば、第４図のｋをセグメンテーションの対象ブ
ロックとすると、ブロックｋの近傍のブロックa,b,c,d,
e,f,g,hの値を参照する。すなわち第３図（Ｃ）の第２
の有効ブロックマップの値を参照する。近傍のブロック
a,b,c,d,e,f,g,hおよびブロックｋの値の合計値が予め
定められた第２の閾値以上のときには、対象ブロックｋ
を有効ブロックとし、近傍のブロックa,b,c,d,e,f,g,h
およびブロックｋの値の合計値が予め定められた第２の
閾値未満のときには、対象ブロックｋを無効ブロックと
する。In the present invention, the coding efficiency is increased by cutting out the speaker portion and coding only the speaker portion. The method of extracting a speaker will be described in detail with reference to the drawings. Assume that the speaker has moved as shown at times t0, t1, and t2 in FIG. Then, when the difference between the screens at the time t0 and the time t1 is obtained, an area shown by oblique lines in FIG. 2 is obtained,
It is assumed that an isolated hatched portion in the background is a difference signal generated by background noise. Next, the screen is divided into blocks each including a plurality of pixels of n pixels in the horizontal direction × n pixels in the vertical direction, and when the sum of absolute values of the difference signals in each block is equal to or larger than a predetermined first threshold value, Is an effective block, and when the sum of absolute values of the difference signal is less than the first threshold value, the block is regarded as an invalid block. FIG. 3B shows an effective block map at time t1 obtained by the above processing. The portion painted black in FIG. 3 (B) is an effective block. FIG. 3A shows time t0 and time t0.
It is assumed that it is the sixth effective block map obtained between the screens at time t0-1 one screen before. Then, first weighting is performed on the effective block map of the current screen (FIG. 3 (B)), that is, the first effective block map, and the sixth block which is the effective block map of the previous screen (FIG. 3 (A)) is obtained. The second weighting is performed for the effective block map. An example of weighting is shown below. For example, the valid block of the previous frame is set to 1 and the invalid block is set to 0. The number of valid blocks in the current frame is 2, and the number of invalid blocks in the current frame is 0, like the invalid blocks in the previous frame. The weighted effective block map of the previous frame and the weighted effective block map of the current frame are added and synthesized to obtain a second effective block map. The second effective block map is as shown in FIG. Next, segmentation is performed on the second effective block map obtained by addition and synthesis in FIG. 3 (C). An example of the segmentation will be described with reference to FIGS. 3 and 4. For example, if k in FIG. 4 is a target block for segmentation, blocks a, b, c, d,
Refer to the values of e, f, g, h. That is, the second of FIG.
Refer to the value of the effective block map. Nearby blocks
When the sum of the values of a, b, c, d, e, f, g, h and the block k is equal to or greater than a predetermined second threshold, the target block k
Is an effective block, and neighboring blocks a, b, c, d, e, f, g, h
When the sum of the values of the block k and the block k is smaller than a second predetermined threshold value, the target block k is regarded as an invalid block.

新たにセグメンテーションによって得られた第３の有
効ブロックマップを第３図（Ｄ）に示す。第３の有効ブ
ロックマップには場合によって、動き部分に孤立無効ブ
ロック領域が発生することがある。これは、第１の有効
ブロックマップを得る際、動き部分において画面間での
差分が第１の閾値よりも少し低かったブロックは、無効
ブロックとなるから、動き部分に孤立した無効ブロック
領域が発生する。孤立無効ブロック領域の一例を第５図
に示す。第５図の様に孤立無効ブロック領域を含む第３
の有効ブロックマップ内の有効ブロック領域のみ符号化
を実行させると、有効ブロック領域内の孤立した無効ブ
ロック領域は、符号化が行われないから無効ブロックの
部分と周囲の部分とで符号化画像の連続性がなくなり、
符号化歪が発生してしまう。その結果、非常に見苦しい
符号化画像となってしまうことがある。そこで、孤立無
効ブロック領域の除去を行う。孤立無効ブロック領域の
除去方法としては、セグメンテーションと同様な処理を
無効ブロックを対象に行う。すなわち無効ブロックの近
傍のブロックを参照し、近傍のブロックの合計値が予め
定められた第３の閾値以上のときに、その対象となる無
効ブロックを有効ブロックを示す値に置き替える。以上
の処理により第５図で孤立無効ブロックであった領域を
除去し、第４の有効ブロックマップを得る。第４の有効
ブロックマップを第３図（Ｄ）に示す。FIG. 3D shows a third effective block map newly obtained by segmentation. In some cases, an isolated invalid block area may occur in a moving portion in the third valid block map. This is because when a first effective block map is obtained, a block in which a difference between screens in a moving portion is slightly lower than a first threshold value becomes an invalid block, and an isolated invalid block region occurs in the moving portion. I do. FIG. 5 shows an example of the isolated invalid block area. As shown in FIG. 5, the third area including the isolated invalid block area
When only the effective block area in the effective block map is encoded, the isolated invalid block area in the effective block area is not encoded. Loss of continuity,
Coding distortion occurs. As a result, the encoded image may be very unsightly. Therefore, the isolated invalid block area is removed. As a method for removing an isolated invalid block area, a process similar to the segmentation is performed on an invalid block. That is, a block near the invalid block is referred to, and when the total value of the blocks in the vicinity is equal to or larger than a predetermined third threshold, the target invalid block is replaced with a value indicating the valid block. By the above processing, the area which was the isolated invalid block in FIG. 5 is removed, and the fourth valid block map is obtained. FIG. 3D shows a fourth effective block map.

次に時刻t2における処理について説明する。時刻t1と
時刻t2の画面間での差分を求め、前記第１の閾値にした
がって有効無効判定を行うと、第６図（Ａ）に示す第１
の有効ブロックマップが得られる。この第１の有効ブロ
ックマップに対して第１の重みづけを行う。そして前画
面である時刻t1の有効ブロックマップが第３図（Ｄ）で
あるから、第３図（Ｄ）の有効ブロックマップに対して
第２の重みづけを行って第１の重みづけを行った第１の
有効ブロックマップと加算合成すると、第６図（Ｂ）に
示す第２の有効ブロックマップが得られる。第６図
（Ｂ）の第２の有効ブロックマップに対して、第２の閾
値を用いて前記セグメンテーションを行うと、第６図
（Ｃ）に示す第３の有効ブロックマップが得られる。次
に、第３の有効ブロックマップに対して、孤立無効ブロ
ック領域の除去を行う。第６図（Ｃ）の第３の有効ブロ
ックマップには、孤立無効ブロック領域が存在していな
かったので、第３の有効ブロックマップがそのまま第４
の有効ブロックマップとされ、セグメンテーションによ
って得られた話者領域となる。時刻t2における実際の話
者領域は、画面のほぼ左半分であるのに対し、セグメン
テーションによって得られた話者領域は、画面の右半分
の背景部分にだいぶはみだしているから、第６図（Ｃ）
の第４の有効ブロックマップをこのまま用いると、背景
の雑音も符号化してしまう可能性があり、あまり好まし
くない。時刻t1,t2の場合の様に動きが大きく、セグメ
ンテーションで得られた有効ブロックの数が多い場合に
は、前画面における有効ブロックマップの影響を受け
て、前画面の話者領域にふくらんでしまうからである。
従って画面間での動きが大きい場合、すなわち第４の有
効ブロックマップの有効ブロック数が予め定められた第
４の閾値以上の場合には、第４の有効ブロックマップに
対してリセットを行い、第４の有効ブロックマップ内の
有効ブロックを全て無効ブロックに置き換えて第５の有
効ブロックマップとする。第５の有効ブロックマップ
は、１フレーム時間遅延されて第６の有効ブロックマッ
プとなり、次の時刻においてセグメンテーションに用い
られる。たとえば、第３図（Ａ）を前フレームの第４の
有効ブロックマップとし、第３図（Ｂ）を現フレームの
有効ブロックマップすなわち第１の有効ブロックマップ
とする。そして、時刻t1において得られた第４の有効ブ
ロックマップの有効ブロック数が、前記第４の閾値以上
であったとすると、第４の有効ブロックマップ内の有効
ブロックを、全て無効ブロックに置き換えて第５の有効
ブロックマップとするから、第５の有効ブロックマップ
が１フレーム時間遅延されて得られる時刻t2における第
６の有効ブロックマップも全て無効ブロックとなる。そ
の結果、時刻t2における第１の有効ブロックマップが、
第６図（Ａ）であったとすると、重みづけが行われた第
２の有効ブロックマップは第６図（Ｄ）の様になり、こ
の第２の有効ブロックマップに対して第２の閾値を用い
て前記セグメンテーションを行うと、第６図（Ａ）に示
す様な第３の有効ブロックマップが得られる。この第３
の有効ブロックマップには孤立無効ブロック領域が含ま
れていなかったから、第３の有効ブロックマップがその
まま第４の有効ブロックマップとなり、背景部分を削除
することができる。Next, the processing at time t2 will be described. When the difference between the screens at the time t1 and the time t2 is obtained, and the validity / invalidity determination is performed according to the first threshold, the first time shown in FIG.
Is obtained. The first weighting is performed on the first effective block map. Then, since the effective block map at the time t1, which is the previous screen, is that shown in FIG. 3D, the effective block map shown in FIG. 3D is subjected to the second weighting and the first weighting. The second effective block map shown in FIG. 6 (B) is obtained by addition and synthesis with the first effective block map. When the above-described segmentation is performed on the second effective block map of FIG. 6 (B) using the second threshold value, a third effective block map shown in FIG. 6 (C) is obtained. Next, an isolated invalid block area is removed from the third valid block map. Since the isolated invalid block area did not exist in the third valid block map of FIG. 6C, the third valid block map was used as it is in the fourth valid block map.
And a speaker area obtained by the segmentation. The actual speaker area at the time t2 is almost in the left half of the screen, whereas the speaker area obtained by the segmentation protrudes considerably into the background part of the right half of the screen. )
If the fourth effective block map is used as it is, the background noise may be coded, which is not preferable. When the motion is large and the number of effective blocks obtained by the segmentation is large as in the case of the times t1 and t2, the speaker area on the previous screen expands due to the effect of the effective block map on the previous screen. Because.
Therefore, when the movement between the screens is large, that is, when the number of effective blocks of the fourth effective block map is equal to or more than a predetermined fourth threshold value, the fourth effective block map is reset and the fourth effective block map is reset. All the valid blocks in the fourth effective block map are replaced with invalid blocks to form a fifth effective block map. The fifth effective block map is delayed by one frame time to become the sixth effective block map, and is used for segmentation at the next time. For example, FIG. 3A is a fourth effective block map of the previous frame, and FIG. 3B is an effective block map of the current frame, that is, a first effective block map. If the number of valid blocks in the fourth valid block map obtained at time t1 is equal to or greater than the fourth threshold, all valid blocks in the fourth valid block map are replaced with invalid blocks, and Since the fifth effective block map is the fifth effective block map, all the sixth effective block maps at time t2 obtained by delaying the fifth effective block map by one frame time are also invalid blocks. As a result, the first valid block map at time t2 is
6 (A), the weighted second effective block map is as shown in FIG. 6 (D), and a second threshold value is set for this second effective block map. When the above-described segmentation is performed, a third effective block map as shown in FIG. 6A is obtained. This third
Does not include an isolated invalid block area, the third valid block map becomes the fourth valid block map as it is, and the background portion can be deleted.

以上の様にして得た第４の有効ブロックマップの有効
ブロック領域内すなわち話者領域を、画面間の相関、画
面内の相関またはその両方を用いて符号化することによ
り、背景などの雑音により発生する無駄な情報を容易に
削除でき、符号化効率を高めることができる。By coding the effective block area of the fourth effective block map obtained as described above, that is, the speaker area using the correlation between the screens, the correlation within the screen, or both, the noise caused by the background or the like is reduced. The generated unnecessary information can be easily deleted, and the encoding efficiency can be improved.

上記各閾値および重みづけの値については、予め統計
的に調べた最適値を用いる。また、セグメンテーション
および孤立無効ブロック除去における参照ブロックの配
置は、上記以外の配置およびブロック数でもかまわな
い。As the above thresholds and weights, optimal values statistically checked in advance are used. The arrangement of the reference blocks in the segmentation and the removal of the isolated invalid block may be an arrangement other than the above and the number of blocks.

（実施例）次に、図面を参照しながら本発明の一実施例について
詳細に説明する。第７図に本発明の一実施例を示す。入
力する動画像信号は、線100を介して有効無効判定部１
および遅延部11に供給される。有効無効判定部１は、前
画面の動画像信号を蓄えておき、この前画面における動
画像信号と新たに線100を介して入力された動画像信号
とのフレーム差分信号を求め、このフレーム差分信号を
水平方向ｎ画素×垂直方向ｎ画素の複数画素からなるブ
ロックに分割し、それぞれのブロック毎に、ブロック内
のフレーム差分値の絶対値和を求める。求められたフレ
ーム差分値の絶対値和が予め定められた第１の閾値以上
であればそのブロックを有効ブロックとし、フレーム差
分値の絶対値和が第１の閾値未満のときはそのブロック
を無効ブロックとして、第１の有効ブロックマップを得
る。有効無効判定部１で得られた第１の有効ブロックマ
ップは、重みづけ部２に与えられる。重みづけ部２は、
有効無効判定部１から与えられた第１の有効ブロックマ
ップに対して、予め定められた第１の重みづけを行う。
重みづけ部２で重みづけが成された第１の有効ブロック
マップは、加算器４に与えられる。加算器４は、重みづ
け部２から与えられた第１の有効ブロックマップと、重
みづけ部３から与えられる前画面における有効ブロック
マップである第６の有効ブロックマップとを加算し、重
みづけが成された第２の有効ブロックマップを得る。加
算器４で得られた第２の有効ブロックマップは、セグメ
ンテーション部５に与えられる。セグメンテーション部
５は、加算器４から与えられた第２の有効ブロックマッ
プ内の全てのブロックに対して、セグメンテーション処
理を行う。例えば、第４図に示す様にセグメンテーショ
ンの対象となるブロックをｋとすると、ｋおよびｋの近
傍のa,b,c,d,e,f,g,hのブロックの値を参照し、それら
の値の合計値が予め定められた第２の閾値以上であれば
そのブロックｋを有効ブロックとし、近傍のブロックお
よびｋの値が第２の閾値未満の場合にはそのブロックｋ
を無効ブロックとして第３の有効ブロックマップを得
る。セグメンテーション部５で得られた第３の有効ブロ
ックマップは、孤立無効ブロック除去部６に与えられ
る。孤立無効ブロック除去部６は、セグメンテーション
部５から与えられた第３の有効ブロックマップに含まれ
ている無効ブロックに対して孤立無効ブロック除去の処
理を用い、有効ブロックの連結を行う。孤立無効ブロッ
クの処理は、セグメンテーションと同様に対象となる無
効ブロックの近傍のブロックを参照し、その近傍のブロ
ックの値の合計値が予め定められた第３の閾値以上の場
合は、その孤立無効ブロックを有効ブロックとする。近
傍のブロックの値の合計値が予め定められた第３の閾値
未満の場合は、その無効ブロックは無効ブロックのまま
とし、以上の処理によって孤立無効ブロックの除去を行
った第４の有効ブロックマップを得る。孤立無効ブロッ
ク除去部６で得られた第４の有効ブロックマップは、有
効ブロック数判定部８、有効ブロックリセット部９およ
び符号化部７に与えられる。有効ブロック数判定部８
は、孤立無効ブロック除去部６から与えられた第４の有
効ブロックマップの有効ブロック数が予め定められた第
４の閾値以上の場合には、有効ブロックリセット部９に
リセット実行の指示を与える。また、有効ブロック数判
定部８は、孤立無効ブロック除去部６から与えられた第
４の有効ブロックマップの有効ブロック数が予め定めら
れた第４の閾値未満の場合には、有効ブロックリセット
部９にリセット停止の指示を与える。有効ブロックリセ
ット部９は、有効ブロック数判定部８からリセット実行
の指示が与えられた場合には、孤立無効ブロック除去部
６から与えられた第４の有効ブロックマップの有効ブロ
ックを、全て無効ブロックに置き換えて第５の有効ブロ
ックマップとする。また、有効ブロックリセット部９
は、有効ブロック数判定部８からリセット停止の指示が
与えられた場合には、孤立無効ブロック除去部６から与
えられた第４の有効ブロックマップに何の処理も行わず
にそのままで第５の有効ブロックマップとする。有効ブ
ロックリセット部９で得られた第５の有効ブロックマッ
プは、フレーム遅延部10に与えられる。フレーム遅延部
10は、有効ブロックリセット部９から与えられた第５の
有効ブロックマップを１フレーム時間遅延し、第６の有
効ブロックマップを得る。フレーム遅延部10で得られた
第６の有効ブロックマップは、重みづけ部３に与えられ
る。重みづけ部３は、フレーム遅延部10から与えられた
第６の有効ブロックマップに対して、第２の重みづけを
行って加算器４に重みづけが成された第４の有効ブロッ
クマップを与える。遅延部11は、入力した動画像信号に
対して入力動画像信号が供給されてから第４の有効ブロ
ックマップが符号化部７に与えられるまでの遅延時間補
償を行い、第４の有効ブロックマップと入力動画像信号
の時間合せを行う。遅延部11の出力の時間補償された動
画像信号は、符号化部７に与えられる。符号化部７は、
孤立無効ブロック除去部６から与えられた第４の有効ブ
ロックマップ内の有効ブロック領域すなわち話者領域で
あると示されている部分についてのみ、遅延11から与え
られた動画像信号の符号化を行い、無効ブロックで示さ
れる背景部分は符号化を行わない。(Example) Next, an example of the present invention will be described in detail with reference to the drawings. FIG. 7 shows an embodiment of the present invention. The input moving image signal is sent to a valid / invalid determination unit 1 via a line 100.
And to the delay unit 11. The valid / invalid determination unit 1 stores a moving image signal of the previous screen, obtains a frame difference signal between the moving image signal on the previous screen and the moving image signal newly input via the line 100, and calculates the frame difference. The signal is divided into blocks each including a plurality of pixels of n pixels in the horizontal direction × n pixels in the vertical direction, and for each block, the sum of absolute values of frame difference values in the block is obtained. If the sum of absolute values of the obtained frame difference values is equal to or greater than a predetermined first threshold, the block is regarded as a valid block, and if the sum of absolute values of the frame difference values is less than the first threshold, the block is invalidated. As a block, a first valid block map is obtained. The first valid block map obtained by the valid / invalid determination unit 1 is provided to the weighting unit 2. The weighting unit 2
The first valid block map given by the valid / invalid determination unit 1 is subjected to a predetermined first weighting.
The first effective block map weighted by the weighting unit 2 is provided to the adder 4. The adder 4 adds the first effective block map provided from the weighting unit 2 and the sixth effective block map, which is an effective block map in the previous screen, provided from the weighting unit 3 and the weighting is performed. Obtain the generated second effective block map. The second effective block map obtained by the adder 4 is provided to the segmentation unit 5. The segmentation unit 5 performs a segmentation process on all blocks in the second effective block map provided from the adder 4. For example, if a block to be segmented is k as shown in FIG. 4, the values of blocks a, b, c, d, e, f, g, and h near k and k are referred to, and Is equal to or greater than a second predetermined threshold, the block k is regarded as an effective block. If the value of the neighboring blocks and the value of k are smaller than the second threshold, the block k is regarded as an effective block.
Is used as an invalid block to obtain a third valid block map. The third valid block map obtained by the segmentation unit 5 is provided to the isolated invalid block removal unit 6. The isolated and invalid block removing unit 6 connects the effective blocks to the invalid blocks included in the third valid block map provided from the segmentation unit 5 by using the isolated and invalid block removing process. The processing of an isolated invalid block refers to a block near the target invalid block in the same manner as the segmentation, and if the total value of the blocks in the vicinity is equal to or larger than a third threshold value, the isolated invalid block is determined. Let the block be a valid block. If the total value of the neighboring blocks is less than the third threshold value, the invalid block remains an invalid block, and the fourth valid block map from which the isolated invalid block has been removed by the above processing. Get. The fourth valid block map obtained by the isolated invalid block removing unit 6 is provided to the valid block number determining unit 8, the valid block reset unit 9, and the encoding unit 7. Effective block number determination unit 8
When the number of valid blocks in the fourth valid block map provided from the isolated invalid block removing unit 6 is equal to or larger than a predetermined fourth threshold value, the reset instruction is issued to the valid block reset unit 9. When the number of valid blocks in the fourth valid block map provided from the isolated / invalid block removing unit 6 is less than a predetermined fourth threshold, the valid block number determining unit 8 determines whether the valid block is a valid block reset unit 9. To stop resetting. When receiving an instruction to execute reset from the number of valid blocks determination unit 8, the valid block reset unit 9 replaces all valid blocks in the fourth valid block map supplied from the isolated invalid block removal unit 6 with invalid blocks. To obtain a fifth effective block map. Also, an effective block reset unit 9
When the reset stop instruction is given from the valid block number determining unit 8, the fifth valid block map supplied from the isolated invalid block removing unit 6 is directly processed in the fifth valid block map without performing any processing. This is an effective block map. The fifth effective block map obtained by the effective block reset unit 9 is provided to the frame delay unit 10. Frame delay section
The reference numeral 10 delays the fifth effective block map provided from the effective block reset unit 9 by one frame time to obtain a sixth effective block map. The sixth effective block map obtained by the frame delay unit 10 is provided to the weighting unit 3. The weighting unit 3 performs the second weighting on the sixth effective block map provided from the frame delay unit 10 and provides the weighted fourth effective block map to the adder 4. . The delay unit 11 performs delay time compensation for the input moving image signal from when the input moving image signal is supplied to when the fourth effective block map is supplied to the encoding unit 7, and outputs the fourth effective block map. And the input moving image signal. The time-compensated video signal output from the delay unit 11 is provided to the encoding unit 7. The encoding unit 7
The moving image signal given from the delay 11 is coded only for the effective block area in the fourth effective block map provided from the isolated invalid block removing unit 6, that is, for the portion indicated as the speaker area. , The background portion indicated by the invalid block is not coded.

符号化の方法としては、動き補償などの画面間の相関
を利用した方法、または直交交換などの画面内の相関を
利用した方法、あるいは画面間及画面内の両方の相関を
利用した符号化方法を用いる。As a coding method, a method using correlation between screens such as motion compensation, a method using correlation within a screen such as orthogonal exchange, or an encoding method using correlation between both screens and within a screen Is used.

上記の各閾値および参照ブロック配置などについて
は、予め統計的に調べた最適値を用いる。一例として、
第１の重み付けで現フレームの有効ブロックを２、無効
ブロックを０とし、第２の重み付けで前フレームの有効
ブロックを１、無効ブロックを０とした場合には、第２
の閾値を８、第３の閾値を５とすることで実現できる。For each of the threshold values and the reference block arrangement, an optimal value statistically checked in advance is used. As an example,
When the effective weight of the current frame is set to 2 and the invalid block is set to 0 by the first weight, and the valid block of the previous frame is set to 1 and the invalid block is set to 0 by the second weight, the second block is set.
Is set to 8 and the third threshold is set to 5.

（発明の効果）以上に詳しく説明したように、本発明の動画像信号の
符号化方式は、セグメンテーションによって得た話者領
域内のみ符号化をすることにより、背景部分の雑音によ
り発生する無駄な情報を削除でき、符号化の効率を高め
ることができる。(Effects of the Invention) As described in detail above, the moving picture signal encoding method according to the present invention encodes only the speaker region obtained by the segmentation, so that wasteful noise caused by background noise is generated. Information can be deleted, and coding efficiency can be improved.

[Brief description of the drawings]

第１図、第２図、第３図、第４図、第５図および第６図
は本発明の作用を説明する図、第７図は本発明の一実施
例を示す図である。１……有効無効判定部、2,3……重みづけ部、４……加
算器、５……セグメンテーション部、６……孤立無効ブ
ロック除去部、７……符号化部、８……有効ブロック数
判定部、９……有効ブロックリセット部、10……フレー
ム遅延部、11……遅延部。FIG. 1, FIG. 2, FIG. 3, FIG. 4, FIG. 5, and FIG. 6 are diagrams for explaining the operation of the present invention, and FIG. 1 valid / invalid determination unit 2,3 weighting unit 4, adder 5, segmentation unit 6, isolated invalid block removal unit 7, coding unit 8, effective block Number judging section, 9 effective block reset section, 10 frame delay section, 11 delay section.

Claims

(57) [Claims]

In a moving picture signal encoding method utilizing correlation between screens, one screen of an input moving picture signal is divided into blocks composed of a plurality of pixels, and a difference from a previous screen is detected for each block. When the difference value is equal to or greater than a predetermined first threshold, a first effective block map is created for each frame as an invalid block when the difference value is less than a first threshold. Means, means for performing a first weighting on the first effective block map, means for performing a second weighting on a sixth effective block map in the previous screen, and the first weighting Means for obtaining a weighted second effective block map by adding and combining the first effective block map having been subjected to the above and the sixth effective block map having been subjected to the second weighting; 2 for the effective block map, refer to the blocks near each block, and when the total value of the values of the neighboring blocks and the target block is equal to or greater than a predetermined second threshold, set the target block as an effective block; When the value is smaller than the second threshold value, the target block is segmented as an invalid block,
Means for obtaining an effective block map, and referring to a nearby block for an invalid block in the third effective block map, and when the total value of the neighboring blocks is equal to or more than a predetermined third threshold value, Means for obtaining a fourth valid block map by replacing the block with a valid block and leaving the invalid block as an invalid block when the value is less than a third threshold value, and the number of valid blocks in the fourth valid block map is predetermined. If it is equal to or greater than the fourth threshold, all effective blocks of the fourth effective block map are replaced with invalid blocks to form a fifth effective block map,
When the number of effective blocks in the fourth effective block map is less than a predetermined fourth threshold value, the fourth effective block map is used as it is as a fifth effective block map; Means for delaying the block map by one frame time to obtain a sixth effective block map;
Means for giving a delay of the time from the input of the video signal to the generation of the fourth effective block map to the video signal, and for the video signal given a delay,
Means for coding a region determined as an effective block in the fourth effective block map using a correlation between screens, a correlation within a screen or both of them. method.