JPH10108012A

JPH10108012A - Image area separating device

Info

Publication number: JPH10108012A
Application number: JP8261918A
Authority: JP
Inventors: Sadao Takahashi; 禎郎高橋
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1996-10-02
Filing date: 1996-10-02
Publication date: 1998-04-24

Abstract

PROBLEM TO BE SOLVED: To provided the image area separate device where a character area and a pattern area are separated with high precision from an original of lots of kinds including a pseudo medium tone processing original and a copy original. SOLUTION: At least outputs of two sections among three sections as an edge area extract section 101, a dot area extract section 102 and a white background area extract section 103 are selected by multiplexers 104-106 depending on the kind of an original and given to an AND section 107, where a character area and a pattern area are separated and discriminated. In the case of an original subject to pseudo-intermediate tone processing, outputs of all the area detection sections 101, 102, 103 are selected and ANDed, and areas of edge, non-dot and white background areas are discriminated to be character areas, and areas other than these are discriminated to be pattern areas.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、画像処理の分野に
係り、特に、スキャナ等で読み取られた画像中の文字領
域と絵柄領域の分離技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to the field of image processing, and more particularly to a technique for separating a character area and a picture area in an image read by a scanner or the like.

【０００２】[0002]

【従来の技術】例えばディジタル複写機において、文字
と絵柄が混在した原稿の画像を読み取って再生する場
合、高画質の再生画像を得るためには、絵柄領域に対し
ては高階調な処理を施し、文字領域に対しては解像度を
重視した処理を施すことが望ましい。このような処理を
実現するには、原稿画像中の文字領域と絵柄領域とを高
精度に分離する必要がある。2. Description of the Related Art For example, in a digital copying machine, when reading and reproducing an image of an original in which characters and patterns are mixed, a high-gradation process is applied to a pattern area in order to obtain a high-quality reproduced image. It is desirable to perform processing with emphasis on resolution for the character area. In order to realize such processing, it is necessary to separate a character area and a picture area in a document image with high accuracy.

【０００３】このような像域分離に関しては、特許第２
５０７９４８号（特開平２−１９３２７２号）のよう
に、背景領域とエッジ領域を文字領域とし、それ以外の
領域を中間調領域候補とすることにより、２値領域と中
間調領域との識別の確実化を図る方式がある。しかし、
この方式は網点のハイライト部分において文字領域と誤
判定しやすいという問題があった。別の従来技術とし
て、網点領域判定とエッジ領域判定を利用して文字領域
と絵柄領域を分離する特開平３−１５３１６７号の方式
がある。この方式によれば、網点のハイライト領域にお
ける領域判定の精度が向上する。Regarding such image area separation, Japanese Patent No.
As described in Japanese Patent Application Laid-Open No. 507948 (JP-A-2-193272), the background area and the edge area are used as character areas, and the other areas are used as halftone area candidates. There is a method that aims to But,
This method has a problem that it is easy to erroneously determine a character area in a highlighted portion of a halftone dot. As another conventional technique, there is a method disclosed in Japanese Patent Application Laid-Open No. 3-153167 in which a character area and a picture area are separated by using a halftone area determination and an edge area determination. According to this method, the accuracy of region determination in a highlight region of a halftone dot is improved.

【０００４】さて従来技術では、特開平３−１５３１６
７号公報に述べられているように、原稿の種類を文字／
写真／網点に限定していた。ところが、ここ数年来の低
価格カラープリンタの普及により、低線数のディザ、誤
差拡散等の２値擬似中間調処理されたカラー原稿をコピ
ーする機会が増えてきた。また、カラー複写機で再生し
た複写原稿を、再びカラー複写機で再生するといったジ
ェネレーションコピーの機会も増えてきた。このように
取り扱う原稿の種類が増えてくると、従来技術のよう
に、限られた２つの特徴量を組み合わせる像域分離方式
では、文字領域／絵柄領域の分離における誤判定が増大
し、高画質な再生画像を得られなくなってきた。In the prior art, Japanese Patent Application Laid-Open No. 3-15316 is disclosed.
As described in Japanese Patent Publication No. 7, the type of manuscript is
Limited to photos / dots. However, with the spread of low-cost color printers in recent years, the number of opportunities to copy a color original that has been subjected to binary pseudo halftone processing such as low line ruling dither and error diffusion has increased. In addition, there has been an increasing number of opportunities for generation copying in which a copy original reproduced by a color copying machine is reproduced again by a color copying machine. As the types of originals handled in this way increase, in the image area separation method that combines two limited feature amounts as in the related art, erroneous determination in the separation of the character area / picture area increases, resulting in high image quality. It is no longer possible to obtain a proper playback image.

【０００５】[0005]

【発明が解決しようとする課題】よって、本発明の目的
は、誤差拡散原稿、ディザ原稿、複写原稿といった最近
複写する機会が増えた原稿も含め多様な原稿に対し、文
字領域／絵柄領域の高精度な分離が可能な像域分離装置
を提供することにある。本発明のもう一つの目的は、扱
う原稿の種類に応じて文字領域／絵柄領域の分離判定方
法を最適化できる像域分離装置を提供することにある。
本発明の目的は、擬似中間調処理された原稿の文字領域
／絵柄領域を高精度に分離する像域分離装置を提供する
ことにある。SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to increase the height of the character area / picture area for a variety of originals such as an error diffusion original, a dither original, and a copy original, which have recently increased the number of copies. An object of the present invention is to provide an image area separating apparatus capable of performing accurate separation. It is another object of the present invention to provide an image area separating apparatus that can optimize a method for determining the separation of a character area / picture area according to the type of a document to be handled.
SUMMARY OF THE INVENTION It is an object of the present invention to provide an image area separating apparatus for separating a character area / picture area of a document subjected to pseudo halftone processing with high accuracy.

【０００６】[0006]

【課題を解決するための手段】本発明による像域分離装
置は、原稿の画像信号より少なくとも３種類の特徴量を
抽出する特徴量抽出手段と、この特徴量抽出手段により
抽出された少なくとも３種類の特徴量の中から、原稿の
種類に応じて少なくとも２種類の特徴量を選択する特徴
量選択手段と、この特徴量選択手段により選択された特
徴量を利用して該原稿の画像上の文字領域と絵柄領域の
分離判定を行う手段とを具備する。特徴量抽出手段は少
なくとも、エッジ領域を抽出する手段、網点領域を抽出
する手段及び白背景領域を抽出する手段を含む（請求項
２）。原稿種類に応じた特徴量選択に関しては、原稿の
種類が網点領域を含む原稿の場合には、特徴量選択手段
によりエッジ領域抽出手段の抽出結果及び網点領域抽出
手段の抽出結果が選択される（請求項３）。原稿の種類
が擬似中間調処理された領域を含む原稿の場合には、特
徴量選択手段によりエッジ領域抽出手段の抽出結果、網
点領域抽出手段の抽出結果及び白背景領域抽出手段の抽
出結果が選択される（請求項４）。原稿の種類が印画紙
又はベタ領域を含む原稿の場合には、特徴量選択手段に
よりエッジ領域抽出手段の抽出結果及び白背景領域抽出
手段の抽出結果が選択される（請求項５）。原稿の種類
が複写原稿の場合には、特徴量選択手段によりエッジ領
域抽出手段の抽出結果及び白背景領域抽出手段の抽出結
果が選択される（請求項６）。SUMMARY OF THE INVENTION An image area separating apparatus according to the present invention includes a feature amount extracting means for extracting at least three kinds of feature amounts from an image signal of a document, and at least three kinds of feature amounts extracted by the feature amount extracting means. A feature amount selecting means for selecting at least two types of feature amounts from the feature amounts according to the type of the document, and a character on an image of the document using the feature amount selected by the feature amount selecting means. Means for making a separation determination between the area and the picture area. The feature amount extracting means includes at least means for extracting an edge area, means for extracting a halftone dot area, and means for extracting a white background area. Regarding the feature amount selection according to the manuscript type, when the manuscript type is a manuscript including a halftone area, the feature amount selecting means selects the extraction result of the edge area extracting means and the halftone area extracting means. (Claim 3). If the type of the original is an original including a region subjected to pseudo halftone processing, the extraction result of the edge region extraction unit, the extraction result of the halftone dot region extraction unit, and the extraction result of the white background region extraction unit are output by the feature amount selection unit. Selected (Claim 4). When the type of the original is a photographic paper or an original including a solid area, the extraction result of the edge area extracting means and the extraction result of the white background area extracting means are selected by the feature amount selecting means (claim 5). When the type of the original is a copy original, the feature amount selecting means selects the extraction result of the edge area extracting means and the extraction result of the white background area extracting means (claim 6).

【０００７】また、本発明によれば、擬似中間調処理さ
れた原稿の像域分離に好適な像域分離装置が提供され、
同装置は、原稿の画像信号よりエッジ領域、網点領域及
び白背景領域をそれぞれ抽出する手段と、該手段により
抽出されたエッジ領域、白背景領域、及び網点領域を除
く領域が全て重なった領域を文字領域と判定し、それ以
外の領域を絵柄領域と判定する手段とを具備する（請求
項７）。Further, according to the present invention, there is provided an image area separation apparatus suitable for image area separation of a document subjected to pseudo halftone processing,
In the apparatus, an edge area, a halftone area, and a white background area are respectively extracted from an image signal of a document, and an area excluding the edge area, the white background area, and the halftone area extracted by the means are all overlapped. Means for determining the area as a character area and determining the other area as a picture area.

【０００８】[0008]

【発明の実施の形態】図１は本発明の一実施例を示すブ
ロック図である。図１において、図示しないスキャナ等
の画像入力装置を用いて原稿を読み取って量子化した画
像信号１００が、エッジ領域抽出部１０１、網点領域抽
出部１０２及び白背景領域抽出部１０３に入力される。
エッジ領域抽出部１０１は、文字領域のエッジ部分に現
れる特徴量の抽出、もしくは、そのような特徴量による
領域判定を行う部分である。網点領域抽出部１０２は、
網点領域に現れる特徴量の抽出、もしくは、そのような
特徴量による領域判定を行う部分である。白背景領域抽
出部１０３は、白背景領域に現れる特徴量の抽出、もし
くは、そのような特徴量による領域判定を行う部分であ
る。FIG. 1 is a block diagram showing an embodiment of the present invention. In FIG. 1, an image signal 100 obtained by reading and quantizing an original using an image input device such as a scanner (not shown) is input to an edge region extraction unit 101, a halftone dot region extraction unit 102, and a white background region extraction unit 103. .
The edge area extraction unit 101 is a part that extracts a feature amount appearing at an edge portion of a character area or performs area determination based on such a feature amount. The dot area extraction unit 102
This is a part for extracting a feature amount appearing in a halftone dot region or determining an area based on such a feature amount. The white background region extraction unit 103 is a part that extracts a feature amount appearing in a white background region or performs a region determination based on such a feature amount.

【０００９】各領域抽出部１０１，１０２，１０３の出
力は、それぞれに対応した２入力１出力マルチプレクサ
（ＭＰＸ）１０４，１０５，１０６の一方の入力に与え
られる。各マルチプレクサ１０４，１０５，１０６の他
方の入力には常に”１”が与えられる。各マルチプレク
サ１０４，１０５，１０６には、入力選択信号として、
処理すべき原稿の種類に応じた信号が原稿種選択部１０
８より与えられる。この原稿種選択部１０８とマルチプ
レクサ１０４，１０５，１０６により、文字／絵柄領域
判定のために利用する特徴量を、原稿の種類に応じて選
択する特徴量選択手段が構成される。原稿種選択部１０
８は、例えば、図示しない操作部により手動で指定され
た原稿種類に応じた信号を与えるものであり、あるい
は、原稿のプレスキャンによって自動的に原稿種類を認
識し、その認識結果に応じた信号を与えるものである。
マルチプレクサ１０４，１０５，１０６の出力は論理積
部１０７に入力される。この論理積部１０７の出力は、
最終的な文字／絵柄領域分離判定結果を表すもので、文
字領域の画素に対しては”１”となり、絵柄領域の画素
に対しては”０”となる。The output of each of the area extraction units 101, 102, 103 is given to one input of a corresponding two-input, one-output multiplexer (MPX) 104, 105, 106. The other input of each of the multiplexers 104, 105, 106 is always supplied with "1". Each of the multiplexers 104, 105, and 106 has an input selection signal
A signal corresponding to the type of the document to be processed is sent to a document type selection unit 10.
8 The document type selection unit 108 and the multiplexers 104, 105, and 106 constitute a feature value selection unit that selects a feature value used for character / picture area determination according to the type of document. Document type selection unit 10
Reference numeral 8 denotes, for example, a signal for giving a signal corresponding to the document type manually designated by an operation unit (not shown), or a signal for automatically recognizing the document type by pre-scanning the document and outputting a signal corresponding to the recognition result. Is to give.
The outputs of the multiplexers 104, 105, and 106 are input to the AND unit 107. The output of the AND unit 107 is
It represents the final result of character / picture area separation determination, and is "1" for pixels in the character area and "0" for pixels in the picture area.

【００１０】エッジ領域抽出部１０１及び網点領域抽出
部１０２における領域判定手法としては、例えば「大
内，今尾，山田，”文字／絵柄（網点，写真）混在画像
の像域分離方式”，電子情報通信学会論文誌 Vol.Ｊ７
５_Ｄ_２，No.1，１９９２_０１」に述べられている方
法を用いることができる。As the area determination method in the edge area extraction unit 101 and the halftone area extraction unit 102, for example, “Ouchi, Imao, Yamada,“ Image area separation method for character / picture (halftone, photograph) mixed image) ”, IEICE Transactions Vol.J7
5_D_2, No. 1, 1992_01 ".

【００１１】文字領域は、高レベル濃度の画素と低レベ
ル濃度の画素（以下、黒画素、白画素と呼ぶ）が多く、
かつ、エッジ部分では、これらの黒画素及び白画素が連
続している。エッジ領域抽出部１０１は、このような黒
画素及び白画素それぞれの連続性に基づいて文字エッジ
を検出する。文字領域抽出部１０１の具体例を図２に示
し説明する。The character area has many high-level density pixels and low-level density pixels (hereinafter referred to as black pixels and white pixels).
At the edge portion, these black pixels and white pixels are continuous. The edge area extraction unit 101 detects a character edge based on the continuity of each of the black pixels and the white pixels. A specific example of the character area extraction unit 101 is shown in FIG. 2 and described.

【００１２】図２において、３値化処理部２０１は、２
種の閾値ＴＨ１，ＴＨ２を用い入力画像信号１００に対
する３値化（白画素＜ＴＨ１、ＴＨ１≦中間調画素＜Ｔ
Ｈ２、ＴＨ２≦黒画素）を行う。閾値ＴＨ１，ＴＨ２
は、例えば、入力画像信号が０から２５５までの２５６
階調（０＝白）で表される場合にＴＨ１＝２０、ＴＨ２
＝８０に選ぶことができる。In FIG. 2, the ternarization processing section 201
Binarization of the input image signal 100 using the threshold values TH1 and TH2 (white pixels <TH1, TH1 ≦ halftone pixel <T
H2, TH2 ≦ black pixel). Threshold values TH1, TH2
Is, for example, 256 when the input image signal is from 0 to 255.
TH1 = 20, TH2 when expressed by gradation (0 = white)
= 80.

【００１３】３値化後の画像信号に対し、黒画素連続性
検出部２０２は黒画素が連続する箇所を、白画素連続性
検出部２０３は白画素が連続する箇所を、それぞれパタ
ーンマッチングにより検出する。このパターンマッチン
グには、本実施例では、図３に示す３×３画素のパター
ンが用いられる。黒画素連続性検出部２０２は図３の上
段に示したいずれかのパターンにマッチングした画素
（この例では３×３画素の中央画素）を黒連続画素と
し、同様に、白画素連続性検出部２０３は図３の下段に
示したいずれかのパターンにマッチングした画素（３×
３画素の中央画素）を白連続画素とする。For the image signal after the ternarization, the black pixel continuity detecting unit 202 detects a portion where black pixels are continuous and the white pixel continuity detecting unit 203 detects a portion where white pixels are continuous by pattern matching. I do. In this embodiment, a pattern of 3 × 3 pixels shown in FIG. 3 is used for this pattern matching. The black pixel continuity detecting unit 202 sets a pixel (the center pixel of 3 × 3 pixels in this example) matching any one of the patterns shown in the upper part of FIG. Reference numeral 203 denotes a pixel (3 × 3) that matches one of the patterns shown in the lower part of FIG.
The central pixel of the three pixels) is a white continuous pixel.

【００１４】近傍検出部２０４では、黒画素連続性検出
部２０２及び白画素連続性検出部２０３の検出結果につ
いて、黒連続画素と白連続画素が近傍にあるか否かを調
べることにより、エッジ領域と非エッジ領域を判定す
る。より具体的に述べれば、本実施例にあっては、５×
５画素単位のサイズのブロック毎に、その内部に黒連続
画素と白連続画素がそれぞれ１つ以上存在するときに、
そのブロックをエッジ領域と判定し、そうでないとき
に、そのブロックを非エッジ領域と判定する。そして、
エッジ領域と判定したブロック内の画素に対応して”
１”を出力し、非エッジ領域と判定したブロック内の画
素に対応して”０”を出力する。The neighborhood detection unit 204 checks the detection results of the black pixel continuity detection unit 202 and the white pixel continuity detection unit 203 to determine whether the black continuity pixel and the white continuity pixel are in the vicinity. And a non-edge area are determined. More specifically, in this embodiment, 5 ×
When one or more black continuous pixels and one or more white continuous pixels exist inside each block of a size of 5 pixels,
The block is determined as an edge area, and if not, the block is determined as a non-edge area. And
Corresponding to the pixel in the block determined as the edge area "
1 is output, and "0" is output corresponding to the pixel in the block determined to be the non-edge area.

【００１５】網点領域では、高い濃度値を持つ画素と低
い濃度値を持つ画素が交互に周期的に現れる。網点領域
抽出部１０２は、この高い濃度値又は低い濃度値を持つ
極値画素を検出することによって網点領域を識別する。
網点領域抽出部１０２の具体例を図４に示す説明する。In the halftone dot region, pixels having a high density value and pixels having a low density value appear alternately and periodically. The halftone dot area extraction unit 102 identifies a halftone dot area by detecting the extreme pixel having the high density value or the low density value.
A specific example of the halftone dot region extraction unit 102 will be described with reference to FIG.

【００１６】図４において、極値画素検出部３０１は、
演算により極値画素を検出する。本実施例では、図５に
示すように、３×３画素単位のブロックにおいて、次の
条件Ａ，Ｂを同時に満たすときに、中心画素を極値画素
として検出する。条件Ａ：中心画素の濃度レベル（Ｌ）が周囲のどの画素
の濃度レベルより高い、又は低い。条件Ｂ：中心画素の濃度レベル（Ｌ）と、中心画素を挟
んで対角線上にあるペア画素の濃度レベル（ａ，ｂ）
が、４ペアすべてについて、｜２×Ｌ−ａ−ｂ｜＞ＴＨの関係にある。ただし、ＴＨは固定の閾値である。In FIG. 4, an extreme pixel detection unit 301 includes:
The extreme pixel is detected by the calculation. In this embodiment, as shown in FIG. 5, when the following conditions A and B are simultaneously satisfied in a block of 3 × 3 pixels, the center pixel is detected as an extreme pixel. Condition A: The density level (L) of the center pixel is higher or lower than that of any surrounding pixels. Condition B: the density level (L) of the center pixel and the density levels (a, b) of the paired pixels diagonally across the center pixel
Are in the relationship of | 2 × Lab−> TH for all four pairs. Here, TH is a fixed threshold.

【００１７】網点領域検出部３０２は、４×４画素単位
のブロック内に、極値画素検出部３０１で検出された極
値画素が１つ以上存在するならば同ブロックを網点候補
領域と判定し、極値画素が１つも存在しなければ同ブロ
ックを非網点候補領域と判定する。この判定結果に対し
て、網点領域補正部３０３は最終的な網点／非網点の判
定を行う。本実施例では、図６に示すように、注目ブロ
ックを中心とした３×３ブロック（１ブロックは３×３
画素）において、４ブロック以上が網点候補領域であれ
ば注目ブロックを網点領域とし、そうでなけれぱ注目ブ
ロックを非網点領域とする。そして、網点領域とされた
フロック内の画素に対応して”０”を出力し、非網点領
域とされたブロック内の画素に対応した”１”を出力す
る。If one or more extremal pixels detected by the extremal pixel detecting unit 301 are present in a block of 4 × 4 pixels, the halftone dot area detecting unit 302 regards the block as a dot candidate area. If it is determined that there is no extremal pixel, the block is determined as a non-dot halftone dot candidate area. In response to this determination result, the halftone area correction unit 303 makes a final halftone / non-halftone determination. In the present embodiment, as shown in FIG. 6, 3 × 3 blocks centered on the block of interest (one block is 3 × 3
Pixel), if four or more blocks are halftone dot candidate areas, the block of interest is set to a halftone dot area; otherwise, the block of interest is set to a non-halftone dot area. Then, “0” is output corresponding to the pixel in the block which is regarded as the halftone area, and “1” is output corresponding to the pixel in the block which is regarded as the non-halftone area.

【００１８】白背景領域抽出部１０３は、背景が白であ
るか否かを判定する。この白背景領域抽出部１０３の具
体例を図７に示し説明する。図７において、２値化処理
部４０４は入力画像信号１００を閾値ＴＨＷを用いて２
値化する。すなわち、ＴＨＷ以下の値を持つ画素を白画
素、ＴＨＷを超える値を持つ画素を黒画素とする。The white background area extraction unit 103 determines whether the background is white. A specific example of the white background area extraction unit 103 will be described with reference to FIG. In FIG. 7, the binarization processing unit 404 converts the input image signal 100 into a binary signal using a threshold value THW.
Value. That is, a pixel having a value equal to or less than THW is defined as a white pixel, and a pixel having a value exceeding THW is defined as a black pixel.

【００１９】パターンマッチング部４０２は、２値化後
の画像信号に対し、４×４画素単位のブロック毎に、４
×１画素又は１×４画素単位の白画素塊（縦又は横方向
に連続する４個の白画素の塊）の検出を行う。そして、
白画素塊が検出されたブロックを白候補ブロックとす
る。The pattern matching unit 402 converts the image signal after binarization into blocks of 4 × 4 pixels.
A white pixel block (a block of four continuous white pixels in the vertical or horizontal direction) in units of × 1 pixels or 1 × 4 pixels is detected. And
A block in which a white pixel block is detected is defined as a white candidate block.

【００２０】白補正部４０３は、図８に示すように、注
目した白候補ブロックを中心とした９×９ブロック（１
ブロックは４×４画素サイズ）において、網掛けして示
す４つの４×４ブロック領域それぞれに１つ以上の白候
補ブロックが存在するときに、注目した白候補ブロック
（図８の中心ブロック）を白背景領域と判定する。そう
でなければ注目白候補ブロックを非白背景領域と判定す
る。そして、白背景領域の画素に対応して”１”を、非
白背景領域の画素に対応して”０”をそれぞれ出力す
る。As shown in FIG. 8, the white correction unit 403 includes a 9 × 9 block (1
When one or more white candidate blocks exist in each of the four 4 × 4 block areas shaded in a 4 × 4 pixel size block, the focused white candidate block (the center block in FIG. 8) is used. It is determined as a white background area. Otherwise, the target white candidate block is determined to be a non-white background area. Then, “1” is output corresponding to the pixels in the white background area, and “0” is output corresponding to the pixels in the non-white background area.

【００２１】処理対象の原稿種と、原稿種選択部１０８
からの信号によるマルチプレクサ１０４，１０５，１０
６の入力選択（特徴量選択）との関係は、図９に示す通
りである。まず、対象原稿が網点原稿の場合、網点領域
判定とエッジ領域判定に基づいて十分に高精度の文字／
絵柄の像域分離が可能である。よって、マルチプレクサ
１０４，１０５はそれぞれエッジ領域抽出部１０１及び
網点領域抽出部１０２の出力を入力として選択するよう
に制御され、マルチプレクサ１０６は”１”を入力とし
て選択するように制御される。したがって、この原稿の
場合には、エッジ領域抽出部１０１の出力が”１”（エ
ッジ領域）であり、かつ網点領域抽出部１０２の出力
が”１”（非網点領域）のときにのみ、論理積部１０７
の出力は”１”（文字領域）となり、これ以外の条件で
は”０”（絵柄領域）となる。A document type to be processed and a document type selection unit 108
Multiplexers 104, 105, 10 based on signals from
The relationship with the input selection (feature amount selection) of No. 6 is as shown in FIG. First, when the target document is a dot document, a sufficiently accurate character / text is determined based on the dot region determination and the edge region determination.
Image area separation of the picture is possible. Therefore, the multiplexers 104 and 105 are controlled so as to select the outputs of the edge area extracting unit 101 and the halftone dot area extracting unit 102 as inputs, and the multiplexer 106 is controlled so as to select "1" as an input. Therefore, in the case of this original, only when the output of the edge area extracting unit 101 is “1” (edge area) and the output of the halftone area extracting unit 102 is “1” (non-dot area). , AND unit 107
Is "1" (character area) and "0" (picture area) under other conditions.

【００２２】対象原稿が誤差拡散やディザ等の擬似中間
調処理された原稿の場合、網点領域、エッジ領域、白背
景領域の単体の判定のみでは十分な像域分離精度を得る
ことが難しい。そこで、マルチプレクサ１０４，１０
５，１０６はそれぞれエッジ領域抽出部１０１、網点領
域抽出部１０２、白背景領域抽出部１０３の出力をすべ
て入力として選択するように制御される。したがって、
エッジ領域抽出部１０１の出力が”１”（エッジ領
域）、かつ、網点領域抽出部１０２の出力が”１”（非
網点領域）、かつ、白背景領域抽出部１０３の出力が”
１”（白背景領域）のときにのみ、論理積部１０７の出
力は”１”（文字領域）となり、これ以外の条件では”
０”（絵柄領域）となる。つまり、エッジ領域抽出部１
０１により抽出されたエッジ領域、網点領域抽出部１０
２により抽出された網点領域を除く領域（非網点領
域）、白背景領域抽出部１０３により抽出された白背景
領域が全て重なる領域を文字領域として、それ以外の領
域を絵柄領域として、それぞれ像域分離する。If the target document is a document subjected to pseudo-halftone processing such as error diffusion or dither, it is difficult to obtain sufficient image area separation accuracy only by judging a single dot area, edge area, and white background area. Therefore, the multiplexers 104 and 10
5 and 106 are controlled so that the outputs of the edge region extraction unit 101, the halftone dot region extraction unit 102, and the white background region extraction unit 103 are all selected as inputs. Therefore,
The output of the edge area extraction unit 101 is “1” (edge area), the output of the halftone dot area extraction unit 102 is “1” (non-dot area), and the output of the white background area extraction unit 103 is “1”.
Only in the case of “1” (white background area), the output of the AND unit 107 becomes “1” (character area), and under other conditions, the output becomes “1” (character area).
0 "(pattern area). That is, the edge area extracting unit 1
01 and the halftone dot area extraction unit 10
2, a region excluding the halftone dot region (non-dot region), a region where all the white background regions extracted by the white background region extraction unit 103 overlap each other are defined as a character region, and the other regions are defined as a picture region. Image area separation.

【００２３】対象原稿が、印画紙やベタ領域を含む原稿
の場合、網点領域判定は像域分離には効果がない。よっ
て、マルチプレクサ１０４，１０６はエッジ領域抽出部
１０１、白背景領域抽出部１０３の出力を入力として選
択するように制御され、マルチプレクサ１０５は”１”
入力を選択するように制御される。したがって、論理積
部１０７の出力が”１”（文字領域）となるのは、エッ
ジ領域抽出部１０１の出力が”１”（エッジ領域）であ
り、かつ、白背景領域抽出部１０３の出力が”１”（白
背景領域）であるときのみである。When the target document is a document containing photographic paper or a solid area, the halftone dot area determination has no effect on image area separation. Therefore, the multiplexers 104 and 106 are controlled so as to select the outputs of the edge region extraction unit 101 and the white background region extraction unit 103 as inputs, and the multiplexer 105 is set to “1”.
Controlled to select input. Therefore, the reason why the output of the logical product unit 107 is “1” (character area) is that the output of the edge area extraction unit 101 is “1” (edge area) and the output of the white background area extraction unit 103 is Only when it is "1" (white background area).

【００２４】対象原稿が複写原稿の場合、網点領域判定
の精度は一般に不十分であるので、その判定結果は利用
しないほうが確実な領域分離が可能である。よって、こ
の場合には、エッジ領域抽出部１０１及び白背景領域抽
出部１０３の出力をそれぞれ選択するようにマルチプレ
クサ１０４，１０６が制御され、マルチプレクサ１０５
は”１”入力を選択するように制御される。したがっ
て、エッジ領域抽出部１０１の出力が”１”（エッジ領
域）、かつ、白背景領域抽出部１０３の出力が”１”
（白背景領域）である条件でのみ論理積部１０７の出力
が”１”（文字領域）となる。When the target document is a copy document, the accuracy of the halftone dot region determination is generally insufficient, so that it is possible to perform more reliable region separation without using the result of the determination. Therefore, in this case, the multiplexers 104 and 106 are controlled to select the outputs of the edge area extraction unit 101 and the white background area extraction unit 103, respectively, and the multiplexer 105
Is controlled to select the "1" input. Therefore, the output of the edge area extraction unit 101 is “1” (edge area), and the output of the white background area extraction unit 103 is “1”.
Only under the condition of (white background area), the output of the AND unit 107 becomes “1” (character area).

【００２５】このように図９に示した４種類の原稿だけ
を対象とする場合には、エッジ領域判定は常に文字／絵
柄領域判定に利用されるので、マルチプレクサ１０４を
省き、エッジ領域抽出部１０１の出力を直接的に論理積
部１０７に入力してもよい。ただし、さらに多様な原稿
種類を扱う場合を考慮するならば、マルチプレクサ１０
４を介在させるほうがよい。When only the four types of originals shown in FIG. 9 are to be used, the edge area determination is always used for character / picture area determination. Therefore, the multiplexer 104 is omitted and the edge area extraction unit 101 May be directly input to the logical product unit 107. However, if the case of dealing with more various types of originals is considered, the multiplexer 10
It is better to interpose 4.

【００２６】以上に述べた実施例の像域分離装置は、デ
ィジタル複写機、ファクシミリ、スキャナ、その他各種
の画像処理機器の像域分離装置として広く利用できるも
のである。なお、本実施例では、エッジ領域判定、網点
領域判定、白背景領域判定を同時並列的に行う構成であ
ったが、それら領域判定を１つずつ順次に、ある単位毎
に行ってその結果を保存し、ある単位について全ての領
域判定結果が得られた段階で最終的な文字／絵柄領域判
定を行うようにしてもよい。また、像域分離のためにエ
ッジ、網点、白背景という３種類の特徴量を利用した
が、さらに別の特徴量を追加することも可能である。ま
た、像域分離装置の各部をハードウエアで実現すれば処
理速度の面で有利であるが、速度が問題にならなければ
各部をコンピュータシステム上でソフトウエアにより実
現してもよい。また、そのような本発明のソフトウエア
を格納したコンピュータ記憶媒体を用意し、このコンピ
ュータ記憶媒体を用いて本発明のソフトウエアを汎用の
コンピュータシステムにロードすることにより、汎用の
コンピュータシステム上で本発明による像域分離を実行
させることも可能である。The image area separating apparatus of the embodiment described above can be widely used as an image area separating apparatus of a digital copying machine, a facsimile, a scanner, and various other image processing apparatuses. In this embodiment, the edge area determination, the halftone area determination, and the white background area determination are performed simultaneously and in parallel. However, these area determinations are sequentially performed one by one for each unit. May be stored, and the final character / picture area determination may be performed when all the area determination results are obtained for a certain unit. In addition, although three types of feature amounts, such as an edge, a halftone dot, and a white background, are used for image area separation, another feature amount can be added. It is advantageous in terms of processing speed if each section of the image area separating device is realized by hardware, but if speed does not matter, each section may be realized by software on a computer system. Also, a computer storage medium storing such software of the present invention is prepared, and the software of the present invention is loaded into a general-purpose computer system using the computer storage medium, so that the program is stored on a general-purpose computer system. It is also possible to carry out the image area separation according to the invention.

【００２７】[0027]

【発明の効果】以上に説明したように、本発明によれば
次のような効果を得られる。請求項１記載の発明によれ
ば、限定された２種類の特徴量を利用する方式に比べ、
様々な種類の原稿に対し文字領域／絵柄領域の高精度な
分離が可能な像域分離装置を実現できる。請求項２記載
の発明によれば、網点画像を含む原稿、印画紙やベタ領
域を含む原稿、擬似中間調処理された原稿、複写原稿の
いずれに対しても文字領域／絵柄領域の分離を高精度に
行うことができる像域分離装置を提供できる。請求項３
記載の発明によれば網点画像を含む原稿に対し、請求項
４記載の発明によれば誤差拡散やディザ等の擬似中間調
処理が施された原稿に対し、請求項５記載の発明によれ
ば印画紙やベタ領域を含む原稿に対し、また請求項６記
載の発明によれば複写原稿に対し、それぞれ文字領域／
絵柄領域の高精度分離が可能である。請求項７記載の発
明によれば、擬似中間調処理された原稿の文字領域／絵
柄領域を高精度に分離可能な像域分離装置を実現でき
る。As described above, according to the present invention, the following effects can be obtained. According to the first aspect of the present invention, compared to a method using two limited types of feature values,
An image area separating apparatus capable of separating character areas / picture areas with high accuracy from various types of originals can be realized. According to the second aspect of the present invention, the separation of the character area / picture area is performed for any of an original including a halftone image, an original including photographic paper or a solid area, an original subjected to pseudo halftone processing, and a copied original. An image area separation device that can be performed with high accuracy can be provided. Claim 3
According to the invention according to the invention described in claim 5, the invention including the halftone image is subjected to pseudo halftone processing such as error diffusion or dither according to the invention according to claim 4, For example, for a document including a photographic paper or a solid area, and according to the invention of claim 6, for a copy document, a character area /
High precision separation of the picture area is possible. According to the seventh aspect of the present invention, it is possible to realize an image area separating apparatus capable of separating a character area / a picture area of a document subjected to pseudo halftone processing with high accuracy.

[Brief description of the drawings]

【図１】本発明の一実施例による像域分離装置のブロッ
ク図である。FIG. 1 is a block diagram of an image area separating apparatus according to an embodiment of the present invention.

【図２】エッジ領域抽出部の一例を示すブロック図であ
る。FIG. 2 is a block diagram illustrating an example of an edge area extraction unit.

【図３】黒画素又は白画素が連続する部分を検出するた
めのパターンマッチングに用いるパターンの例を示す図
である。FIG. 3 is a diagram illustrating an example of a pattern used for pattern matching for detecting a portion where black pixels or white pixels continue.

【図４】網点領域抽出部の一例を示すブロック図であ
る。FIG. 4 is a block diagram illustrating an example of a halftone dot area extraction unit.

【図５】極値画素検出の説明のための図である。FIG. 5 is a diagram for explaining extreme value pixel detection.

【図６】網点領域補正の説明のための図である。FIG. 6 is a diagram for explaining halftone dot area correction;

【図７】白背景領域抽出部の一例を示すブロック図であ
る。FIG. 7 is a block diagram illustrating an example of a white background region extraction unit.

【図８】白補正の説明のための図である。FIG. 8 is a diagram for explaining white correction.

【図９】原稿種類と選択される特徴量の関係を示す図で
ある。FIG. 9 is a diagram illustrating a relationship between a document type and a selected feature amount.

[Explanation of symbols]

１００入力画像信号１０１エッジ領域抽出部１０２網点領域抽出部１０３白背景領域抽出部１０４，１０５，１０６マルチプレクサ１０７論理積部１０８原稿種選択部２０１３値化処理部２０２黒画素連続性検出部２０３白画素連続性検出部２０４近傍検出部３０１極値画素検出部３０２網点領域検出部３０３網点領域補正部４０１２値化処理部４０２パターンマッチング部４０３白補正部 REFERENCE SIGNS LIST 100 input image signal 101 edge region extracting unit 102 halftone region extracting unit 103 white background region extracting unit 104, 105, 106 multiplexer 107 logical product unit 108 document type selecting unit 201 ternarization processing unit 202 black pixel continuity detecting unit 203 White pixel continuity detection unit 204 Neighborhood detection unit 301 Extreme pixel detection unit 302 Halftone dot detection unit 303 Halftone dot correction unit 401 Binarization processing unit 402 Pattern matching unit 403 White correction unit

Claims

[Claims]

1. A feature amount extracting unit for extracting at least three types of feature amounts from an image signal of a document, and at least three types of feature amounts extracted by the feature amount extracting unit.
A feature amount selecting unit for selecting at least two types of feature amounts in accordance with the type of the document; and separating a character region and a picture region on the image of the document using the feature amounts selected by the feature amount selecting unit. An image area separating apparatus comprising: means for performing determination.

2. The image area separating apparatus according to claim 1, wherein
An image area separating apparatus characterized in that the feature amount extracting means includes at least means for extracting an edge area, means for extracting a halftone dot area, and means for extracting a white background area.

3. The image area separating apparatus according to claim 2, wherein
An image area separating apparatus, wherein, when the type of a document is a document including a halftone area, an extraction result of the edge area extraction means and an extraction result of the halftone area extraction means are selected by the feature amount selection means.

4. The image area separating apparatus according to claim 2, wherein
When the type of the original includes a pseudo-halftone-processed area, the feature amount selection unit selects the extraction result of the edge region extraction unit, the extraction result of the halftone dot region extraction unit, and the extraction result of the white background region extraction unit. An image area separation device.

5. The image area separating apparatus according to claim 2, wherein
When the type of original is photographic paper or an original that includes a solid area,
An image area separating apparatus, wherein an extraction result of an edge area extracting means and an extraction result of a white background area extracting means are selected by a feature amount selecting means.

6. The image area separating apparatus according to claim 2, wherein
An image area separating apparatus, wherein when a type of a document is a copy document, an extraction result of an edge region extraction unit and an extraction result of a white background region extraction unit are selected by a feature amount selection unit.

7. A means for extracting an edge area, a halftone area, and a white background area from an image signal of a document, respectively, and an area excluding the edge area, the white background area, and the halftone area extracted by the means overlap each other. An image area separating device, comprising: means for determining an area that has been rendered as a character area and determining the other area as a picture area.