JPH02191086A

JPH02191086A - Optimum binarizing method

Info

Publication number: JPH02191086A
Application number: JP1011480A
Authority: JP
Inventors: Hideaki Yamagata; 秀明山形
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1989-01-20
Filing date: 1989-01-20
Publication date: 1990-07-26
Anticipated expiration: 2014-06-23
Also published as: JP2910926B2

Abstract

PURPOSE:To improve the rate of recognition by holding plural threshold values into a memory, counting the number of black picture elements in the respective threshold value and the number of the black picture elements which are macro-visualized, obtaining the optimum threshold value from the threshold values, in which the parameter of the fractal dimension of a picture is made minimum, based on the number of these black picture elements and outputting a binary picture. CONSTITUTION:A multilevel picture is read by a multilevel picture read part 1 and held in a multilevel image memory 4. Next, the multilevel picture is read from the multilevel image memory 4 and the black and white binary picture is generated and held in a binary image memory 6. After that, to the whole picture held in the binary image memory 6, the number of the black picture elements and the number of the black picture elements, which are macro-visualized, are counted by a black picture element number count part 7 and a counted result is held in a black picture element number memory 8. Then, in a parameter calculation part 9, the number of the black picture elements and the number of the black picture elements, which are macro-visualized, are respectively read from the black picture element number memory 8 and the parameter is calculated in the fractal dimension of the picture. Based on the threshold value, in which this parameter value is made minimum, the optimum threshold value is obtained and the binary picture is outputted.

Description

【発明の詳細な説明】産業上の利用分野本発明は、文字認識などのパターン認識装置における最
適２値化方法に関する。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to an optimal binarization method in a pattern recognition device such as character recognition.

従来の技術一般に、文字認識などのパターン認識装置において処理
される画像は、スキャナのＣＣＤセンサ出力などの値を
閾値（スレッシュレベル）によって白黒２値化したもの
である。この際、印字状態の良くない原稿であっても最
適なる２値化を可能とするためには、原稿の濃度の相違
に対応して各々最適な２値画像を生成する必要がある。2. Description of the Related Art In general, images processed in pattern recognition devices such as character recognition are obtained by converting values such as the output of a CCD sensor of a scanner into black and white binary values using a threshold value (threshold level). At this time, in order to enable optimal binarization even for originals with poor printing conditions, it is necessary to generate optimal binary images corresponding to the differences in density of the originals.

ここに、このような２値化方法に関しては、種々の方法
が提案されている。例えば、「田利秀行著、総研出版、
１９８５［ｉ’コンピュータ画像処理入門」中、第６７
頁」なる文献に示されるモード法や微分ヒストグラム法
がある。モード法は、与えられた画像の濃度値のヒスト
グラムを求め、２つのピークを持つ分布となる場合に、
２つのピークの間の谷のところに閾値を決めるものであ
る。Here, various methods have been proposed regarding such a binarization method. For example, “Written by Hideyuki Tari, published by Souken Publishing,
1985 [i' Introduction to Computer Image Processing], No. 67
There are the modal method and the differential histogram method, which are shown in the literature "P. The mode method calculates the histogram of the density values of a given image, and when the distribution has two peaks,
The threshold value is determined at the valley between two peaks.

微分ヒストグラム法は、画像中の対象物と背景の境界は
、濃度値が急に変化する部分に位置すると考えられるた
め、画像の濃度値を直接利用するのではなく、微分値（
濃度の変化率）を利用して閾値を決めるというものであ
る。The differential histogram method uses differential values (
The threshold value is determined using the rate of change in concentration.

また、［昭和５２年度電子通信学会情報部門全国大会、
大津展之、「濃度分布からの閾値決定法Ｊ中、１４５］
なる文献に示される濃度分布からの閾値決定法がある。In addition, [1972 National Conference of the Institute of Electronics and Communication Engineers, Information Section,
Nobuyuki Otsu, “Threshold Determination Method from Concentration Distribution J, 145”
There is a method for determining the threshold value from the concentration distribution, which is shown in the literature.

これは、濃度分布の０次、１次モーメントのみを利用し
、積分に基づいて最適なる閾値を決定するものである。This method uses only the 0th and 1st moments of the concentration distribution and determines the optimal threshold based on integration.

さらに、特公昭６０−３７９５２号公報に示される「最
適二値化方式」がある。これは、多値ビデオ信号をビデ
オ・バッファに格納し、ビデオ・バッファから読出され
たビデオ信号を可変スライスレベルのスライス回路によ
り２値化し、多値ビデオ情報を異なるスライスレベルで
スライスして２値化ビデオ信号に変換し、異なるスライ
スレベルでスライスして作成した複数の２値化ビデオ信
号の各々について（黒点数）／（周囲数）なる線幅増幅
率を求め、複数の線幅増幅率と基準の線幅増幅率とに基
づきスライス回路のスライスレベルを設定するものであ
る。Furthermore, there is an "optimal binarization method" disclosed in Japanese Patent Publication No. 60-37952. This involves storing a multilevel video signal in a video buffer, converting the video signal read from the video buffer into a binary signal using a slicing circuit with variable slice levels, and slicing the multilevel video information at different slice levels to create a binary signal. The line width amplification factor (number of black points)/(number of surroundings) for each of the plurality of binarized video signals created by converting it to a digital video signal and slicing it at different slice levels is calculated, and the line width amplification factor and the number of line width amplification factors are calculated. The slice level of the slice circuit is set based on the reference line width amplification factor.

発明が解決しようとする課題ところが、モード法にあっては、印字状態の悪い原稿で
は、ヒストグラムに明確な谷を生じないので、適用でき
ない方法である。However, the problem to be solved by the present invention is that the mode method cannot be applied to documents with poor printing conditions because the histogram does not have clear valleys.

また、微分ヒストグラム法にあっては、対象物と背景の
境界付近の濃度値が複雑に変化するものに対しては、有
効に働かない方法である。Further, the differential histogram method does not work effectively when the density value near the boundary between the object and the background changes in a complex manner.

また、濃度分布からの閾値決定法による場合、文字認識
などのパターン認識において扱われる画像としてのｒ線
」のつぶれやかすれに対する処理としては、効果的な方
法ではない。Furthermore, the method of determining a threshold value from the density distribution is not an effective method for processing blurring or blurring of "r-line" as an image used in pattern recognition such as character recognition.

さらに、上記公報の最適二値化法では、実験の結果、原
稿の濃淡によっては適正な閾値決定が不安定なる結果が
得られたものである。Furthermore, in the optimal binarization method disclosed in the above-mentioned publication, as a result of experiments, it was found that appropriate threshold value determination was unstable depending on the shading of the document.

課題を解決するための手段請求項１記載の発明では、多値量子化された画像に対し
て白黒２値の画像に変換するパターン認識装置における
最適２値化方法において、複数の閾値で画像を２値化し
、各閾値での２値画像を常にメモリに保有し、各閾値で
の黒画素数と粗視化した時の黒画素数とを計数し、これ
らの黒画素数に基づき画像のフラクタル次元なるパラメ
ータを計算し、当該パラメータ値が極小となる閾値から
最適閾値を求め、この最適閾値による２値画像を出力さ
せるようにした。Means for Solving the Problems The invention according to claim 1 provides an optimal binarization method for a pattern recognition device that converts a multi-level quantized image into a black and white binary image. Binarize the image, keep the binary image at each threshold in memory, count the number of black pixels at each threshold and the number of black pixels when coarse-grained, and calculate the fractal image of the image based on these numbers of black pixels. A parameter called a dimension is calculated, an optimal threshold value is determined from the threshold value at which the parameter value becomes minimum, and a binary image based on this optimal threshold value is output.

請求項２記載の発明では、複数の閾値で画像を２値化し
た後、各閾値での黒画素数と粗視化した時の黒画素数と
を計数する時のみ各閾値での２値画像をメモリに保有し
て、各閾値での黒画素数と粗視化した時の黒画素数との
引数値に基づき画像のフラクタル次元なるパラメータを
計算し、当該パラメータ値が極小となる閾値から最適閾
値を求め、当該最適閾値で画像を再び２値化して２値画
像を出力させるようにした。In the invention according to claim 2, after an image is binarized using a plurality of threshold values, only when counting the number of black pixels at each threshold value and the number of black pixels when coarse-grained, a binary image is generated at each threshold value. is stored in memory, and a parameter called the fractal dimension of the image is calculated based on the argument value of the number of black pixels at each threshold value and the number of black pixels when coarse-grained, and the optimal value is calculated from the threshold value at which the parameter value becomes the minimum. A threshold value is determined, and the image is binarized again using the optimum threshold value to output a binary image.

さらに、請求項３記載の発明では、請求項２記載の発明
と同じく、複数の閾値で画像を２値化し、各閾値での黒
画素数と粗視化した時の黒画素数とを計数する時のみ各
閾値での２値画像をメモリに保有した後、閾値を中心値
から変化させてその閾値での黒画素数と粗視化した時の
黒画素数との計数値に基づき画像のフラクタル次元なる
パラメータを計算し、当該パラメータ値が常に小さい方
の閾値とそのパラメータ値とを保有し、パラメータ値が
極小となる閾値を最適閾値として、画像を再び２値化し
て２値画像を出ノＪさせるようにした。Furthermore, in the invention described in claim 3, as in the invention described in claim 2, the image is binarized using a plurality of threshold values, and the number of black pixels at each threshold value and the number of black pixels when coarse-grained are counted. After storing the binary image at each threshold value in memory, the fractal image of the image is calculated based on the count value of the number of black pixels at that threshold value and the number of black pixels when coarse-grained by changing the threshold value from the center value. Calculate the parameter called dimension, always keep the threshold value with the smaller parameter value and the parameter value, and use the threshold value where the parameter value is the minimum as the optimal threshold value, and binarize the image again to output a binary image. I made it J.

作用請求項１記載の発明によれば多値量子化された画像は複
数の閾値で２値化され、メモリに保有される。そして、
各閾値での黒画素数を計数するとともに粗視化した時の
黒画素数も計数し、計数結果に基づき、画像のフラクタ
ル次元なるパラメータを計算する。このパラメータ値が
極小となる閾値に基づき最適な閾値を求めて、２値画像
を出力させるため、原稿の濃度に応じた最適なる２値化
の閾値が自動的に設定され、認識率が安定する。According to the invention described in claim 1, a multi-level quantized image is binarized using a plurality of threshold values and stored in a memory. and,
The number of black pixels at each threshold value is counted, and the number of black pixels when coarse-grained is also counted, and based on the counting results, a parameter called the fractal dimension of the image is calculated. The optimal threshold value is determined based on the threshold value at which this parameter value becomes minimum, and a binary image is output. Therefore, the optimal threshold value for binary conversion according to the density of the document is automatically set, and the recognition rate is stabilized. .

この時、請求項２記載の発明のように、２値画像を保有
するメモリを１つに節約しても、黒画素数及び粗視化し
た時の黒画素数を計数する時のみ利用すれば、何んら支
障ないものとなる。In this case, even if the memory storing the binary image is saved to one as in the invention described in claim 2, it is only used when counting the number of black pixels and the number of black pixels when coarse-grained. , there will be no problem.

一方、フラクタル次元なるパラメータ値が最大となる閾
値は複数の閾値中の中心値伺近となる性質を持つ。そこ
で、請求項３記載の発明のように、閾値を中心値から変
化させて画像のフラクタル次元なるパラメータを計算し
、当該パラメータ値か常に小さい方の閾値とそのパラメ
ータ値とを保有し、パラメータ値が最小となる当該閾値
に注目することにより、最適閾値を求める処理の高速化
が図られる。On the other hand, the threshold value at which the parameter value of the fractal dimension is maximum has the property of being close to the center value among the plurality of threshold values. Therefore, according to the invention as claimed in claim 3, a parameter called the fractal dimension of the image is calculated by changing the threshold value from the central value, and the parameter value and the threshold value, whichever is smaller, are always stored, and the parameter value is By focusing on the threshold value with the minimum value, it is possible to speed up the process of determining the optimal threshold value.

実施例請求項１記載の発明の一実施例を第１図ないし第３図を
参照して説明する。第１図に本実施例を実施するブロッ
ク構成図を示す。本実施例は、多値画像読取り部１から
２値画像出力部２までに関するものである。多値画像読
取り部１にてスキャナ３から多値画像を読取り、例えば
１６値に量子化し、多値イメージメモリ４に保有する。Embodiment An embodiment of the invention set forth in claim 1 will be described with reference to FIGS. 1 to 3. FIG. 1 shows a block diagram for implementing this embodiment. This embodiment relates to a multivalued image reading section 1 to a binary image outputting section 2. A multi-value image reading section 1 reads a multi-value image from a scanner 3, quantizes it into, for example, 16 values, and stores it in a multi-value image memory 4.

次に、多値イメージメモリ４から１６階調の多値画像（
濃度レベルＯから１５）を２値化部５により読込み、閾
値ｔ＝１５以上を黒、それ以外を白とする白黒２値画像
を生成し、１５個の２値イメージメモリ６中のＮｏ、　
（１）で示すものに保有する。このＮｏ、　（１）なる
２値イメージメモリ６に保有された画像全体に対し、黒
画素数カウント部７においてその２値画像の黒画素の総
数（黒画素数）及び粗視化した時の黒画素数を計数し、
その計数結果を黒画素数メモリ８に保有する。Next, a 16-gradation multi-value image (
The density levels 0 to 15) are read by the binarization unit 5, and a black and white binary image in which the threshold value t=15 or more is black and the rest is white is generated, and the numbers in the 15 binary image memories 6,
It is held in the items shown in (1). For the entire image stored in the binary image memory 6, which is No. Count the number of pixels,
The counting result is stored in the black pixel number memory 8.

但し、粗視化された時の黒画素数とは、第３図（ａ）に
示すような通常の画素に対し、隣接する４画素（同図（
ｂ）の場合）又は１６６画素同図（ｃ）の場合）を１つ
の画素とみなしく粗視化された画素）、その粗視化され
た画素を粗視化された際の黒画素とみなし、粗視化され
た際の黒画素数を計数したものである。However, the number of black pixels when coarse-grained refers to the number of adjacent pixels (see Figure 3(a)) for a normal pixel as shown in Figure 3(a).
b)) or 166 pixels (in the case of (c) in the same figure) is treated as one pixel and the coarse-grained pixel is treated as a black pixel when coarse-grained. , the number of black pixels when coarse-grained.

次に、パラメータ計算部９において、これらの黒画素数
メモリ８から各々黒画素数と粗視化された際の黒画素数
とを読込み、画像のフラクタル次元なるパラメータを計
算し、パラメータメモリＩＯ中の対応するＮｏ、　（１
）なるメモリに保有する。Next, the parameter calculation unit 9 reads the number of black pixels and the number of coarse-grained black pixels from the black pixel number memory 8, calculates a parameter called the fractal dimension of the image, and stores it in the parameter memory IO. The corresponding No. of (1
) is stored in memory.

これが、閾値ｔ−１５に対する処理であり、次に、閾値
ｔ−１４とし、２値化部５によりこの閾値ｔ＝１４によ
る２個画像を生成し、その結果を２値イメージメモリ６
中のＮｏ、　（２）のものに保有する。このＮｏ、　（
２）なるイメージメモリ６に保有された２個画像に対し
ても、上記と同様に、黒画素数と粗視化された際の黒画
素数との計数、フラクタル次元なるパラメータの計算を
し、パラメータメモリ１０中の対応するＮｏ、　（２）
なるメモリに保有する。他の閾値ｔ＝１３，１２．〜，
２，１についても各々同様の処理を繰返す。This is the processing for the threshold value t-15.Next, the threshold value t-14 is set, two images are generated by the threshold value t=14 by the binarization unit 5, and the results are stored in the binary image memory 6.
No. (2) is retained. This No, (
2) Similarly to the above, for the two images stored in the image memory 6, count the number of black pixels and the number of black pixels when coarse-grained, calculate the parameter called fractal dimension, Corresponding No. in parameter memory 10, (2)
stored in memory. Other threshold values t=13, 12. ~,
The same process is repeated for 2 and 1 as well.

これらの１５種類の閾値ｔの各々について処理が終了し
たら、パラメータ比較部１１において、パラメータメモ
リ１０のＮｏ、　（１）〜Ｎｏ、（１５）の各々より各
閾値毎のパラメータ値を取出して比較し、その内、パラ
メータ値（フラクタル次元）が最大となる閾値Ｔ　ｍａ
ｘを求める。このようにパラメータ値が最大となる閾値
から、順に閾値を減らしていき、パラメータ値（フラグ
タル次元）が極小となる閾値Ｔ　ｉｎｆを求める。この
ようにして最適閾値決定部１２にて最適なる閾値Ｔ　＝
　Ｔ　ｉｎｆを決定する。この最適閾値下による２個画
像を２値イメージメモリ６中から選択し、２値画像出力
部２に出力し、さらに文字認識部１３などに送出して認
識処理等に供される。When the processing for each of these 15 types of thresholds t is completed, the parameter comparison unit 11 extracts the parameter values for each threshold from each of No. (1) to No. (15) in the parameter memory 10 and compares them. , among which the parameter value (fractal dimension) is the maximum threshold T ma
Find x. In this way, the threshold value is decreased in order from the threshold value at which the parameter value becomes the maximum, and the threshold value T inf at which the parameter value (fragmental dimension) becomes the minimum value is determined. In this way, the optimal threshold value determination unit 12 determines the optimal threshold value T =
Determine T inf. Two images under this optimal threshold are selected from the binary image memory 6, outputted to the binary image output section 2, and further sent to the character recognition section 13 etc. for recognition processing.

ここに、本実施例における特徴の一つであるフラクタル
次元の計算方法について第３図を参照して説明する。ま
ず、図に示すように、ｒ＝ｌ、ｒ＝２．ｒ＝４の各々の
場合の黒画素数Ｎ１１．を計数する。そして、ｒ＝１の
時の黒画素数をＮ。Here, a method of calculating the fractal dimension, which is one of the features of this embodiment, will be explained with reference to FIG. First, as shown in the figure, r=l, r=2. Number of black pixels N11 for each case of r=4. Count. Then, the number of black pixels when r=1 is N.

ｒ＝２　（４画素単位）に粗視化された時の黒画素数を
Ｎ（２１、ｒ＝４　（１６画素単位）に粗視化された時
の黒画素数をＮ１４１　　とする。そして、フラクタル
次元をＤとすると、ＱＯｇＮｔｒ＋　　＝−ＤＱｏｇ　ｒ　　十Ｃ（Ｃは定
数）なる関係が成り立つので、ｒ＝１．２．４の３点を
用いた最小２乗法により、フラクタル次元りが求められ
る。The number of black pixels when coarse-grained to r=2 (in units of 4 pixels) is N(21, and the number of black pixels when coarse-grained to r=4 (in units of 16 pixels) is N141.And, If the fractal dimension is D, then the relationship QOgNtr+ = -DQog r 10C (C is a constant) holds, so the fractal dimension is determined by the least squares method using the three points of r=1.2.4.

つづいて、請求項２記載の発明の実施例を第４図及び第
５図により説明する。前記実施例で示した部分と同一部
分は同一符号を用いて示す。本実施例は、前記実施例中
の１５個の２値イメージメモリ６を制約し、１個のみの
２値イメージメモリ１４とし、黒画素数及び粗視化され
た際の黒画素数を計数する場合のみ、この１個の２値イ
メージメモリ１４を用いるようにしたものである。よっ
て、最適閾値Ｔ　＝　Ｔ　ｉｎｆが決定された後、その
閾値を用いて再び２値化処理することにはなる。Next, an embodiment of the invention according to claim 2 will be described with reference to FIGS. 4 and 5. The same parts as those shown in the previous embodiment are indicated using the same reference numerals. In this example, the 15 binary image memories 6 in the previous example are limited to only one binary image memory 14, and the number of black pixels and the number of black pixels when coarse-grained are counted. In this case, this one binary image memory 14 is used. Therefore, after the optimal threshold value T = T inf is determined, the binarization process is performed again using that threshold value.

まず、前記実施例と同様に、多値画像読取り部１にてス
キャナ３から多値画像を読取り、例えば１６値に量子化
し、多値イメージメモリ４に保有する。次に、多値イメ
ージメモリ４から１６階調の多値画像を２値化部５によ
り読込み、閾値し１５以上を黒、それ以外を白とする白
黒２値画像を生成し、２値イメージメモリ１４に保有す
る。First, similarly to the embodiment described above, a multi-value image reading section 1 reads a multi-value image from a scanner 3, quantizes it into, for example, 16 values, and stores it in a multi-value image memory 4. Next, the 16-gradation multivalued image is read from the multivalued image memory 4 by the binarization unit 5, thresholded, and a black and white binary image in which 15 or more is black and the rest is white is generated, and the binary image memory 14 held.

この２値イメージメモリ１４に保有された画像全体に対
し、黒画素数カウント部７においてその２個画像の黒画
素の総数（黒画素数）及び粗視化された際の黒画素数を
計数し、その計数結果を黒画素数メモリ８に保有する。For the entire image held in the binary image memory 14, the black pixel number counting unit 7 counts the total number of black pixels (black pixel number) of the two images and the number of black pixels when coarse-grained. , the counting results are held in the black pixel number memory 8.

次に、パラメータ計算部９において、この黒画素数メモ
リ８から、各々黒画素数と粗視化された際の黒画素数と
を読込み、フラクタル次元なるパラメータを計算し、パ
ラメータメモリ１０中のＮｏ、　（１）なるメモリに保
有する。Next, the parameter calculation unit 9 reads the number of black pixels and the number of coarse-grained black pixels from the black pixel number memory 8, calculates a parameter called fractal dimension, and calculates the number of black pixels in the parameter memory 10. , (1) Retained in memory.

これが、閾値し−１５に対する処理であり、次に、閾値
ｔ＝１４とし、２値化部５によりこの閾値ｔ−１４によ
る２個画像を生成し、その結果を２値イメージメモリ１
４に保有する。このイメージメモリ１４に保有された２
個画像に対しても、上記と同様に、黒画素数と粗視化さ
れた際の黒画素数との計数、フラクタル次元なるパラメ
ータの計算をし、パラメータメモリ１０中のＮｏ、　（
２）なるメモリに保有する。他の閾値ｔ＝１３，１２．
〜２，１についても各々同様の処理を繰返す。This is the process for the threshold value t-15.Next, the threshold value t=14 is set, two images are generated by the threshold value t-14 by the binarization unit 5, and the results are stored in the binary image memory 1.
4. 2 held in this image memory 14
Similarly to the above, for individual images, the number of black pixels and the number of coarse-grained black pixels are counted, the parameter called fractal dimension is calculated, and the No. in the parameter memory 10, (
2) Retained in memory. Other threshold values t=13, 12.
The same process is repeated for ~2 and 1 as well.

これらの１５種類の閾値しの各々について処理が終了し
たら、パラメータ比較部１１において、パラメータメモ
リ１０のＮα（１）〜Ｎｏ、（１５）の各々より各閾値
毎のパラメータ値を取出して比較し、その内、パラメー
タ値（フラクタル次元）が最大となる閾値Ｔ　ｍａｘを
求める。このように求められたパラメータ値が最大なる
閾値Ｔ　ｍａｘから、順に閾値を減らしていき、パラメ
ータ値（フラクタル次元）が極小となる閾値Ｔｉｎｆを
求める。このようにして最適閾値決定部１２にて最適な
る閾値ＴＴ　ｉｎｆを決定する。この最適閾値Ｔを用い
て、多値イメージメモリ４から読込んだ画像を２値化部
５により２値化して２値イメージメモリ１４に保有する
。この２値イメージメモリ１４に保有させた２値画像を
２値画像出力部２に出力し、さらに文字認識部１３など
に送出して認識処理等に供される。When the processing for each of these 15 types of thresholds is completed, the parameter comparison unit 11 extracts the parameter value for each threshold from each of Nα (1) to No. (15) in the parameter memory 10 and compares it. Among them, a threshold value T max at which the parameter value (fractal dimension) becomes maximum is determined. Starting from the threshold value T max where the parameter value obtained in this way is the maximum, the threshold value is decreased in order, and the threshold value Tinf where the parameter value (fractal dimension) is the minimum value is determined. In this way, the optimal threshold value determining section 12 determines the optimal threshold value TT inf. Using this optimum threshold T, the image read from the multi-valued image memory 4 is binarized by the binarization section 5 and stored in the binary image memory 14. The binary image held in the binary image memory 14 is output to the binary image output section 2, and further sent to the character recognition section 13 etc. for recognition processing.

さらに、請求項３記載の発明の実施例を第６図ないし第
８図により説明する。本実施例は、閾値（１〜１５＝１
＋−ｆ＋＋）に対するフラクタル次元の変化が第６図に
示すような特性を示し、フラクタル次元が最大となる閾
値Ｔとしては、中心値付近になるという性質を持つ点を
利用し、最適閾値Ｔを求める際の高速化を図るようにし
たものである。このため、構成的には、前記実施例の第
４図に比し、１５個のパラメータメモリ１０がＮｏ、（
１）（２）のみのパラメータメモリ１５に置き換えられ
ている。Further, an embodiment of the invention according to claim 3 will be explained with reference to FIGS. 6 to 8. In this example, the threshold value (1 to 15=1
The change in fractal dimension with respect to This is intended to speed up the calculation. Therefore, in terms of configuration, compared to FIG. 4 of the above embodiment, 15 parameter memories 10 are
1) It is replaced with the parameter memory 15 only for (2).

まず、前述の場合と同様にスキャナ３から多値画像を読
取り、多値イメージメモリ４に保有する。First, a multivalued image is read from the scanner 3 and stored in the multivalued image memory 4 as in the case described above.

そして、２値化部５により多値イメージメモリ４から１
６階調の多値画像を読込み、閾値ｔ＝８（中心値）とし
、第８図（ｂ）に示すように、この閾値ｔ＝８以上を黒
、それ以外を白とする２値画像を生成し、２値イメージ
メモリ１４に保有する。Then, the binarization unit 5 converts the multivalued image memory 4 into 1
A 6-gradation multivalued image is read in, the threshold value t = 8 (center value), and a binary image is created in which the threshold value t = 8 or higher is black and the rest is white, as shown in Figure 8 (b). The image data is generated and stored in the binary image memory 14.

そして、この２値イメージメモリ１４に保有された画像
全体に対し、黒画素数カウント部７において黒画素数及
び粗視化された際の黒画素数を計数し、それらの計数結
果を黒画素数メモリ８に保有する。次に、パラメータ計
算部９において、閾値ｔ＝８の時のフラクタル次元なる
パラメータｐｔＰ、を計算する。このパラメータｐｔ＝
ｐ、　をパラメータメモリ１５中のＮｏ、　（１）の方
に保有する。Then, for the entire image held in the binary image memory 14, the black pixel number and the coarse-grained black pixel number are counted in the black pixel number counting section 7, and the counting results are used as the black pixel number. It is held in memory 8. Next, the parameter calculation unit 9 calculates a parameter ptP, which is a fractal dimension when the threshold value t=8. This parameter pt=
p, is held in No. (1) in the parameter memory 15.

次に、今度は閾値ｔ＝７とし、同様の処理を繰返し、閾
値し＝７に対するパラメータｐｔ＝ｐ。Next, this time, set the threshold value t=7, repeat the same process, and set the parameter pt=p for the threshold value 7.

を計算し、このパラメータＰｔ＝Ｐ、　　をパラメータ
メモリ１５中の他方のＮα（２）の方に保有する。This parameter Pt=P, is stored in the other Nα(2) in the parameter memory 15.

そして、パラメータメモリ１５中のＮｏ、　（１）のも
のとＮｏ、　（２）のものとの大小を比較し、小さい方
の閾値を求める。もし、新しい閾値の方が小さければ、
その閾値でのパラメータ値ｐｔを保存し、閾値ｔを１つ
減らして、前述の場合と同様に、閾値対応のパラメータ
値ｐｔを求め、前の閾値の方が小さくなるまで（Ｐｔ（
Ｐｍｉｎでなくなるまで）、これを繰返す。また、前の
閾値ｔの方が小さければ、閾値ｔを９にし、同様の処理
を繰返す。この場合も、パラメータ値Ｐｔを保存するの
は、常に小さい方とする。このような９以上の閾値しに
対しては、もし、新しい閾値の方が小さければ、閾値り
を１つ増やし、同様に閾値対応のパラメータ値Ｐｔを求
める処理を行い、前の閾値の方が小さくなるまで、これ
を繰返す。Then, the magnitude of No. (1) and No. (2) in the parameter memory 15 is compared, and the smaller threshold value is determined. If the new threshold is smaller,
Save the parameter value pt at that threshold, reduce the threshold t by one, and calculate the parameter value pt corresponding to the threshold in the same way as in the previous case until the previous threshold becomes smaller (Pt(
Repeat this until Pmin is no longer present. Furthermore, if the previous threshold t is smaller, the threshold t is set to 9 and the same process is repeated. In this case as well, the smaller parameter value Pt is always saved. For such a threshold value of 9 or more, if the new threshold value is smaller, the threshold value is increased by 1, and the process of calculating the parameter value Pt corresponding to the threshold value is performed in the same way, and the previous threshold value is Repeat this until it becomes smaller.

このような処理後、最後に残った閾値ｔがパラメータ値
Ｐｔの最小となるもの（Ｐ＋＋＋ｉｎ　）であるので、
これを閾値Ｔ　ｍｉｎとし、この閾値Ｔ　ｍｉｎより最
適閾値Ｔを求める。最適閾値Ｔが決定されたら、この閾
値Ｔを用いて、多値イメージメモリ４から読込んだ画像
を２値化し、２値イメージメモリ１４に保有し、２値画
像出力部２に出力し、さらに文字認識部１３などに送出
する。After such processing, the last remaining threshold t is the minimum value (P+++in) of the parameter value Pt, so
This is set as a threshold value T min, and an optimal threshold value T is determined from this threshold value T min. Once the optimal threshold value T is determined, the image read from the multivalued image memory 4 is binarized using this threshold value T, stored in the binary image memory 14, outputted to the binary image output unit 2, and further It is sent to the character recognition unit 13 or the like.

また、本発明の他の実施例を第９図により説明する。本
実施例は、前記実施例をさらに改良したものである。即
ち、最適閾値を決定した後で再び多値画像を２値化する
という手間を省き、がっ、メモリを最小限とするため、
（］）（２）で示す２個のメモリ構成による２値イメー
ジメモリ■６を′用い、かつ、前記実施例のようにフラ
クタル次元が最大となる閾値は中央値付近になるという
性質を用いるようにしたものである。即ち、前記実施例
のように閾値を変化させ、小さい方のパラメータ値とそ
の閾値での２値画像とを常に保存しておくものである。Further, another embodiment of the present invention will be explained with reference to FIG. This example is a further improvement of the previous example. That is, in order to avoid the trouble of binarizing the multivalued image again after determining the optimal threshold value, and to minimize the memory,
(]) The binary image memory ■6 with the two memory configuration shown in (2) is used, and the property that the threshold value at which the fractal dimension is maximum is near the median value as in the above embodiment is used. This is what I did. That is, as in the embodiment described above, the threshold value is changed and the smaller parameter value and the binary image at that threshold value are always saved.

まず、前述の場合と同様にスキャナ３がら多値画像を読
取り、多値イメージメモリ４に保有する。First, as in the case described above, a multivalued image is read using the scanner 3 and stored in the multivalued image memory 4.

そして、２値化部５により多値イメージメモリ４から１
６階調の多値画像を読込み、閾値ｔ＝８（中心値）とし
、この閾値し一８以上を黒、それ以外を白とする２値画
像を生成し、２値イメージメモリ１６中の（１）の方に
保有する。そして、この２値イメージメモリ１６中の（
１）に保有された画像全体に対し、黒画素数カウント部
７において黒画素数及び粗視化された際の黒画素数を計
数し、それらの計数結果を黒画素数メモリ８に保有する
。Then, the binarization unit 5 converts the multivalued image memory 4 into 1
A 6-gradation multivalued image is read in, the threshold value t is set to 8 (center value), a binary image is generated in which the threshold value 18 or above is black, and the rest is white, and ( 1). Then, in this binary image memory 16 (
For the entire image held in step 1), the black pixel number counting unit 7 counts the number of black pixels and the number of coarse-grained black pixels, and the counting results are stored in the black pixel number memory 8.

次に、パラメータ計算部９において、閾値１．−８の時
のフラクタル次元なるパラメータＰｔ＝Ｐ。Next, in the parameter calculation section 9, the threshold value 1. The fractal dimension parameter Pt=P when -8.

を計算する。このパラメータＰｔ＝Ｐ、　　をパラメー
タメモリ１５中のＮｏ、（１）の方に保有する。Calculate. This parameter Pt=P, is held in No. (1) in the parameter memory 15.

次に、今度は閾値ｔ＝７とし、同様の処理を繰返しく但
し、２値イメージメモリ１６中の（２）の方が用いられ
る）、閾値ｔ＝７に対するパラメータＰｔ＝Ｐ、を計算
し、このパラメータＰｔ＝Ｐ。Next, this time, the threshold value t=7 is set, and the same process is repeated (however, (2) in the binary image memory 16 is used), and the parameter Pt=P for the threshold value t=7 is calculated, This parameter Pt=P.

をパラメータメモリ１５中の他方のＮｏ、　（２）の方
に保有する。そして、パラメータメモリ】５中のＮｏ、
　（１）のものとＮｏ、　（２）のものとの大小を比較
し、小さい方の閾値を求める。もし、新しい閾値の方が
小さければ、その閾値でのパラメータ値ＰＬを保存し、
閾値しを１つ減らして、前述の場合と同様に、閾値対応
のパラメータ値ＰＬを求め、前の閾値の方が小さくなる
まで（Ｐｔ（Ｐｍｉｎでなくなるまで）、これを繰返す
。この時、２値画像はパラメータ値の大きかった方の閾
値による２値イメージメモリ１６中の（１）又は（２）
に上書きされる。また、前の閾値ｔの方が小さければ、
閾値しを９にし、同様の処理を繰返す。この場合も、パ
ラメータ値Ｐｔを保存するのは、常に小さい方とする。is held in the other No. (2) in the parameter memory 15. And parameter memory] No. of 5,
Compare the size of (1) with No. (2) and find the smaller threshold. If the new threshold is smaller, save the parameter value PL at that threshold,
Decrease the threshold value by one, find the parameter value PL corresponding to the threshold value in the same way as in the previous case, and repeat this until the previous threshold value becomes smaller (Pt (until it is no longer Pmin). At this time, 2 The value image is (1) or (2) in the binary image memory 16 according to the threshold value of the larger parameter value.
will be overwritten. Also, if the previous threshold t is smaller,
Set the threshold value to 9 and repeat the same process. In this case as well, the smaller parameter value Pt is always saved.

こ′のような９以上の閾値しに対しては、もし、新しい
閾値の方か小さければ、閾値しを１つ増やし、同様に閾
値対応のパラメータ値Ｐｔを求める処理を行い、前の閾
値の方が小さくなるまで、これを繰返す。For a threshold value of 9 or more like this, if the new threshold value is smaller, increase the threshold value by one, perform the same process to obtain the parameter value Pt corresponding to the threshold value, and calculate the value of the previous threshold value. Repeat this until it becomes smaller.

このような処理後、最後に残った閾値ｔがパラメータ値
Ｐｔの最小となるもの（Ｐｍｉｎ　）でるので、これを
閾値Ｔ　ｍｉｎとし、この閾値Ｔｍ１ｎを最適閾値Ｔと
する。この時、この最適閾値Ｔによる２値画像は２つの
２値イメージメモリ１６中の（１）又は（２）の何れか
に保存されている筈であるので、この２値画像を２値画
像出ツク部２に出力し、さらに文字認識部１３などに送
出する。After such processing, the last remaining threshold t is the minimum value (Pmin) of the parameter value Pt, so this is set as the threshold T min, and this threshold Tm1n is set as the optimal threshold T. At this time, since the binary image based on this optimal threshold value T should be stored in either (1) or (2) of the two binary image memories 16, this binary image is output as a binary image. The data is output to the checking section 2, and further sent to the character recognition section 13, etc.

発明の効果本発明は、上述したように構成したので、印字状態の良
くない原稿に対してもその原稿の濃度に応じた最適な２
値化の閾値を自動的に設定することができ、認識率を向
上・安定させることができ、特に、請求項２又は３記載
の発明によれば、黒画素数や粗視化された際の黒画素数
の計数時にのみ２値画像をメモリに保有させることによ
り、２値画像用のメモリを節約することができ、また、
フラクタル次元なるパラメータが閾値変化特性において
、パラメータ値が最大となる閾値は中心値付近となる性
質を持つ点に着目した、請求項３記載の発明によれば、
閾値を中心値から変化させてフラクタル次元なるパラメ
ータを計算し、当該パラメータ値が常に小さい方の閾値
とそのパラメータ値とを保有し、パラメータ値が最小と
なる当該閾値に注目するようにしたので、最適閾値を求
める処理の高速化を図ることもできる。Effects of the Invention Since the present invention is configured as described above, even for a document with poor printing condition, the optimum two-dimensional image can be printed according to the density of the document.
It is possible to automatically set the threshold value for conversion, and it is possible to improve and stabilize the recognition rate. In particular, according to the invention described in claim 2 or 3, the number of black pixels and the By storing the binary image in the memory only when counting the number of black pixels, the memory for the binary image can be saved, and
According to the invention according to claim 3, which focuses on the fact that the parameter called fractal dimension has a property that the threshold value at which the parameter value is maximum is near the center value in the threshold value change characteristic.
We calculated a parameter called fractal dimension by changing the threshold value from the center value, always kept the threshold value with the smaller parameter value, and focused on the threshold value where the parameter value was the minimum. It is also possible to speed up the process of determining the optimal threshold value.

[Brief explanation of the drawing]

第１図ないし第３図は請求項ｌ記載の発明の実施例を示
すもので、第１図はブロック図、第２図はフローチャー
ト、第３図は粗視化についての説明図、第４図及び第５
図は請求項２記載の発明の一実施例を示すもので、第４
図はブロック図、第５図はフローチャート、第６図ない
し第８図は請求項３記載の発明の一実施例を示すもので
、第６図は閾値に対するフラクタル次元変化の特性図、
第７図はブロック図、第８図はフローチャート、第９図
は本発明の他の実施例を示すブロック図である。1 to 3 show an embodiment of the invention as claimed in claim 1, in which FIG. 1 is a block diagram, FIG. 2 is a flowchart, FIG. 3 is an explanatory diagram of coarse-graining, and FIG. and fifth
The figure shows one embodiment of the invention according to claim 2, and
FIG. 5 is a block diagram, FIG. 5 is a flowchart, FIGS. 6 to 8 show an embodiment of the invention according to claim 3, and FIG. 6 is a characteristic diagram of fractal dimension change with respect to a threshold value.
FIG. 7 is a block diagram, FIG. 8 is a flow chart, and FIG. 9 is a block diagram showing another embodiment of the present invention.

Claims

[Claims] 1. In an optimal binarization method in a pattern recognition device that converts a multilevel quantized image into a black and white binary image, the image is binarized using a plurality of threshold values, and each threshold value The binary image of is always held in memory, the number of black pixels at each threshold value and the number of black pixels when coarse-grained are counted, and a parameter called the fractal dimension of the image is calculated based on these numbers of black pixels. An optimal binarization method characterized in that an optimal threshold value is determined from threshold values at which the parameter value becomes minimum, and a binary image based on this optimal threshold value is output. 2. In the optimal binarization method in a pattern recognition device that converts a multilevel quantized image into a black and white binary image, the image is binarized using multiple thresholds, and the number of black pixels and roughness at each threshold are calculated. Only when counting the number of black pixels when visualized, the binary image at each threshold is stored in memory, and the count value of the number of black pixels at each threshold and the number of black pixels when coarse-grained is calculated. The method is characterized in that a parameter called a fractal dimension of the image is calculated based on the image, an optimal threshold value is determined from the threshold value at which the parameter value becomes minimum, and the image is binarized again using the optimal threshold value to output a binary image. Optimal binarization method. 3. In the optimal binarization method in a pattern recognition device that converts a multilevel quantized image into a black and white binary image, the image is binarized using multiple thresholds, and the number of black pixels and roughness at each threshold are calculated. Only when counting the number of black pixels when visualized, store the binary image at each threshold value in memory, change the threshold from the center value, and calculate the number of black pixels at that threshold and the black when coarse-grained. A parameter called the fractal dimension of the image is calculated based on the count value with the number of pixels, the threshold value with the smaller parameter value and the parameter value are always retained, and the threshold value with the minimum parameter value is set as the optimal threshold value. An optimal binarization method characterized by binarizing again and outputting a binary image.