JPH0795336B2

JPH0795336B2 - Character recognition method

Info

Publication number: JPH0795336B2
Application number: JP61256379A
Authority: JP
Inventors: 道義立川; 公一江尻
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-10-28
Filing date: 1986-10-28
Publication date: 1995-10-11
Anticipated expiration: 2010-10-11
Also published as: JPS63109592A

Description

【発明の詳細な説明】〔技術分野〕本発明は手書き漢字などの文字の認識方式に関し、さら
に詳細には、多層方向ヒストグラム法による特徴ベクト
ルを用いる文字認識方式に関する。Description: TECHNICAL FIELD The present invention relates to a recognition method for characters such as handwritten Chinese characters, and more particularly to a character recognition method using a feature vector based on a multi-direction histogram method.

[Prior art]

本出願人は、多層方向ヒストグラム法による文字認識方
式を既に提案している。本発明は、このような文字認識
方式の改良に関するものである。The applicant has already proposed a character recognition method based on the multilayer directional histogram method. The present invention relates to the improvement of such a character recognition method.

この多層方向ヒストグラム法による文字認識方式におい
ては、文字パターンの文字線輪郭画素に方向コードを付
け、その文字の各辺から対向する辺に向って文字パター
ンを走査して白画素（背景）の次に出現する方向コード
を検出し、その方向コードをそれが走査線上で何番目に
検出されたかによって複数の層に層別する。そして、文
字の枠内の分割領域毎に、ある層までの層別に方向コー
ド別ヒストグラムを求め、それぞれのヒストグラム（特
徴量）を成分としたベクトルを、文字の特徴ベクトルと
して用いる。In the character recognition method using the multi-layered directional histogram method, a direction code is attached to the character line contour pixel of the character pattern, and the character pattern is scanned from each side of the character toward the opposite side and next to the white pixel (background). The directional code appearing in is detected, and the directional code is classified into a plurality of layers according to the number of the detected directional code on the scan line. Then, for each divided area in the character frame, a histogram for each direction code is obtained for each layer up to a certain layer, and a vector having each histogram (feature amount) as a component is used as a feature vector of the character.

一般に、特徴ベクトルの各成分（特徴量）を次元で表わ
す。例えば、方向コードとして８種類のコードを付け、
文字枠内を４×４のメッシュ領域に分割し、第１層およ
び第２層の方向コードまでを対象とすると、特徴ベクト
ルの次元数は256（＝４×４×２×８）となる。In general, each component (feature amount) of the feature vector is represented by a dimension. For example, attach 8 kinds of codes as direction codes,
When the character frame is divided into 4 × 4 mesh regions and the direction codes of the first layer and the second layer are targeted, the dimension number of the feature vector is 256 (= 4 × 4 × 2 × 8).

辞書に関しては、文字種毎の複数のパターンから同様の
特徴ベクトルを抽出し、その平均を辞書パターン（標準
パターン）の特徴ベクトルとして登録する。Regarding the dictionary, similar feature vectors are extracted from a plurality of patterns for each character type, and the average thereof is registered as the feature vector of the dictionary pattern (standard pattern).

また、本出願人は、層別のための走査方向も加味してさ
らに詳細に方向コードを層別する同様の文字認識方式も
既に提案している。本発明は、この文字認識方式にも同
様に適用できるものである。Further, the applicant has already proposed a similar character recognition method in which the direction code is stratified in more detail in consideration of the scanning direction for stratification. The present invention can be similarly applied to this character recognition system.

さて、このような多層方向ヒストグラム法による文字認
識方式においては、入力文字から抽出された特徴ベクト
ルと、辞書の特徴ベクトルとの距離演算を行って、入力
文字の文字種を同定する。In the character recognition method based on the multi-layered histogram method, the distance between the feature vector extracted from the input character and the feature vector of the dictionary is calculated to identify the character type of the input character.

しかし、前記のように特徴ベクトルの次元数が大きくな
ると、距離演算が多く、マッチング処理時間が長くなる
という問題があった。However, as described above, when the dimension number of the feature vector is large, there is a problem that the distance calculation is large and the matching processing time is long.

〔Purpose〕

したがって、本発明の目的は、多層方向ヒストグラム法
による文字認識方式において、マッチング処理の効率化
を図ることにある。Therefore, an object of the present invention is to improve the efficiency of the matching process in the character recognition method based on the multi-direction histogram method.

本発明のもう一つの目的は、多層方向ヒストグラム法に
よる文字認識方式において、マッチング処理の高速化と
ゝもに「つぶれ」文字の認識率の向上を図ることにあ
る。Another object of the present invention is to speed up the matching process and improve the recognition rate of "blurred" characters in the character recognition method based on the multi-layered histogram method.

〔Constitution〕

多層方向ヒストグラム法による特徴ベクトルは、文字識
別効果の大きい次元の成分と、その効果がそれほと顕著
でない次元の成分とがある。これに関し、説明を簡単に
するために、２次元の特徴ベクトルを考える。The feature vector based on the multi-direction histogram method has a component of a dimension having a large character recognition effect and a component of a dimension in which the effect is not so remarkable. In this regard, consider a two-dimensional feature vector for ease of explanation.

次元数を２として多層方向ヒストグラム法による辞書を
作成した場合、“文",“字",“認",“識”のそれぞれの
特徴ベクトルは、それぞれ第６図のg₁，g₂，g₃，g₄のよ
うになる。この例では、図から明らかなように、各特徴
ベクトルの成分（特徴量）Ａのほうが、成分（特徴量）
Ｂよりも分散（または標準偏差）が大きい。換言すれ
ば、成分Ａのほうが、未知文字に対する識別能力が高
い。When a dictionary is created by the multi-directional histogram method with the number of dimensions being 2, the feature vectors of “sentence”, “letter”, “recognition”, and “knowledge” are respectively g ₁ , g ₂ , g in FIG. It becomes like ₃ and g ₄ . In this example, as is clear from the figure, the component (feature amount) A of each feature vector is the component (feature amount)
The variance (or standard deviation) is larger than B. In other words, the component A has a higher identification ability for unknown characters.

このような特徴ベクトルの性質に着目すれば、文字識別
能力の高い成分から優先的に距離を演算することによ
り、識別能力の高い一部の次元まで距離演算を実行した
段階で、候補となり得ない文字種を排除し、候補となり
得る文字種を早い段階で絞り込むことができるであろ
う。Focusing on the property of such a feature vector, by preferentially calculating the distance from a component having a high character recognition ability, it cannot be a candidate when the distance calculation is executed up to a part of the dimensions having a high recognition ability. It will be possible to eliminate character types and narrow down possible character types at an early stage.

また、多層方向ヒストグラム法により特徴ベクトルは、
その各次元成分の順番を入れ替えてもパターンの特徴と
保存されるという性質がある。In addition, the feature vector by the multi-direction histogram method is
Even if the order of the respective dimensional components is exchanged, the characteristic of the pattern is preserved.

以上のような点に着目し、本発明にあっては、多層方向
ヒストグラム法によって全文字種の仮辞書パターンの特
徴ベクトルを得て、その特徴ベクトルの成分を標準偏差
（もしくは偏差）または分散の大きい次元順に並べ替え
たベクトルを辞書パターンの特徴ベクトルとして辞書に
登録する。Focusing on the above points, in the present invention, the feature vector of the temporary dictionary pattern of all the character types is obtained by the multilayer directional histogram method, and the component of the feature vector has the standard deviation (or deviation) or large variance. A vector rearranged in the order of dimensions is registered in the dictionary as a feature vector of the dictionary pattern.

そして、文字認識は次のようにして行う。入力文字から
多層方向ヒストグラム法によって特徴ベクトルを抽出
し、その成分を前記標準偏差または分散の大きい次元順
に並べ替えたベクトルと、各パターン種の辞書パターン
の特徴ベクトルとの距離演算を上位のある次元まで行
い、その距離と当該文字種に対応した判定閾値とを比較
し、その比較結果によって当該文字種に関する距離演算
を中断するか、さらに下位次元までの詳細マッチングを
行うか否かの判定を行う。Then, character recognition is performed as follows. A feature vector is extracted from the input character by the multi-direction histogram method, and the distance calculation between the vector in which the components are rearranged in order of the standard deviation or the order of the variance and the feature vector of the dictionary pattern of each pattern type has a higher rank. The distance is compared with the determination threshold value corresponding to the character type, and it is determined whether the distance calculation for the character type is interrupted or whether detailed matching is performed up to the lower dimension according to the comparison result.

また、その判定閾値は例えば次のようにして文字種毎に
決定する。文字種毎に複数のパターンから多層方向ヒス
トグラム法によって特徴ベクトルをそれぞれ抽出して、
その成分を前記標準偏差（偏差）または分散の大きい次
元順に並べかえたベクトルと、同一文字種の前記辞書パ
ターンの特徴ベクトルとの上位のある次元までの距離の
標準偏差（偏差）または分散を求め、この値に基づいて
前記判定閾値を決定する。Further, the determination threshold value is determined for each character type as follows, for example. Feature vectors are extracted from multiple patterns for each character type using the multi-direction histogram method,
The standard deviation (deviation) or the variance of the distance to a certain upper dimension of the vector in which the components are arranged in the order of the larger dimension of the standard deviation (deviation) or the variance and the feature vector of the dictionary pattern of the same character type, The determination threshold is determined based on the value.

このような文字認識方式によれば、入力文字の候補とな
り得ないような文字種に関するマッチング処理が早期に
中断するので、マッチング処理の効率が向上して認識速
度が上がる。また、マッチング処理の中断の判定閾値を
前記のように文字種別に決定するので、候補から排除す
べきでない文字種のマッチング処理が誤って中断するエ
ラーも確実に防止できるため、多層方向ヒストグラム法
による文字認識方式の本質的な利点は損なわれず、高い
認識率を達成できる。According to such a character recognition method, the matching process for a character type that cannot be a candidate for an input character is interrupted early, so that the efficiency of the matching process is improved and the recognition speed is increased. In addition, since the threshold for determining the interruption of the matching process is determined for the character type as described above, it is possible to reliably prevent an error in which the matching process of the character type that should not be excluded from the candidates is interrupted by mistake. A high recognition rate can be achieved without losing the essential advantages of the recognition method.

〔Example〕

以下、本発明の実施例について図面を参照し説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

第１図は本発明の一実施例の機能的構成を簡略化して示
す概略ブロック図である。FIG. 1 is a schematic block diagram showing a simplified functional configuration of an embodiment of the present invention.

図において、10は原稿から文字パターンを読み取り、文
字パターン情報を前処理部12に入力する読み取り部であ
る。前処理部12は、入力文字パターンの切り出し、正規
化などの前処理を行い、処理後の文字パターンを１字毎
に特徴抽出部14に入力する部分である。In the figure, 10 is a reading unit for reading a character pattern from a document and inputting the character pattern information to the preprocessing unit 12. The preprocessing unit 12 is a unit that performs preprocessing such as clipping and normalization of an input character pattern, and inputs the processed character pattern to the feature extraction unit 14 for each character.

特徴抽出部14は、入力文字パターンから多層方向ヒスト
グラム法により特徴ベクトルを抽出する部分である。The feature extraction unit 14 is a unit that extracts a feature vector from the input character pattern by the multilayer directional histogram method.

こゝで、この実施例においては、動作モードとして辞書
作成モードと文字認識モードとがある。まず、辞書作成
モードの場合について以下に説明する。また、このモー
ドにおける辞書作成処理の概略フローチャートを第２図
に示し、以下の説明において対応するステップ番号を
（）内に示す。Here, in this embodiment, there are a dictionary creation mode and a character recognition mode as operation modes. First, the case of the dictionary creation mode will be described below. A schematic flow chart of the dictionary creating process in this mode is shown in FIG. 2, and corresponding step numbers are shown in parentheses in the following description.

一つの文字種について、Ｍ個の文字パターンが読み取り
部10より順次入力される（ステップ50）。その入力文字
パターンは前処理部12で前処理を受け（ステップ52）、
特徴抽出部14に入力されて多層方向ヒストグラム法によ
る特徴ベクトル（こゝでは256次元のベクトルとする）
を抽出される（ステップ54）。抽出された特徴ベクトル
は並べ替え部16を介して辞書作成部20に入力される。こ
の辞書作成部20により、Ｍ個の入力文字パターンから抽
出された特徴ベクトルを平均したベクトルが求められ
（ステップ56）、その平均ベクトルは、その文字種の仮
辞書パターンの特徴ベクトルとして辞書22に仮登録され
る（ステップ58）。For one character type, M character patterns are sequentially input from the reading unit 10 (step 50). The input character pattern is preprocessed by the preprocessing unit 12 (step 52),
The feature vector is input to the feature extraction unit 14 and is a feature vector obtained by the multi-directional histogram method (in this case, a 256-dimensional vector)
Are extracted (step 54). The extracted feature vector is input to the dictionary creating unit 20 via the rearrangement unit 16. The dictionary creation unit 20 obtains an average vector of the feature vectors extracted from the M input character patterns (step 56), and the average vector is temporarily stored in the dictionary 22 as the feature vector of the temporary dictionary pattern of the character type. It is registered (step 58).

辞書作成部20により、最後の文字種まで処理が終了した
か調べられ（ステップ60）、未処理の文字種が残ってい
れば、ステップ51に戻り、次の文字種に対する同様の処
理が行われる。The dictionary creating unit 20 checks whether the processing has been completed up to the last character type (step 60), and if there is an unprocessed character type, the process returns to step 51 and the same processing is performed for the next character type.

全ての文字種について処理が終ると、並べ替えテーブル
作成部26において、辞書22に仮登録されている全仮辞書
パターンの特徴ベクトルの、次元毎の標準偏差（もしく
は偏差）または分散が計算される（ステップ62）。そし
て、もとの特徴ベクトルを標準偏差（偏差）または分散
の大きい次元順に並べ替えるための情報のテーブル（並
べ替えテーブル）が作成され、並べ替えテーブル部18に
登録される（ステップ64）。When the processing is completed for all the character types, the rearrangement table creation unit 26 calculates the standard deviation (or deviation) or variance for each dimension of the feature vectors of all the temporary dictionary patterns temporarily registered in the dictionary 22 ( Step 62). Then, a table of information (sorting table) for sorting the original feature vectors in order of the standard deviation (deviation) or the dimension having the largest variance is created and registered in the sorting table unit 18 (step 64).

こゝまでは、並べ替えテーブルを作成するための処理段
階であり、この後に実際の辞書作成処理が始まる。Up to this point, this is the processing stage for creating the rearrangement table, after which the actual dictionary creation processing begins.

一つの文字種ｋについて、Ｍ個の文字パターンが読み取
り部10より順次入力される（ステップ66）。その入力文
字パターンは前処理部12で前処理を受け（ステップ6
8）、特徴抽出部14に入力されて多層方向ヒストグラム
法による特徴ベクトルを抽出される（ステップ70）。For one character type k, M character patterns are sequentially input from the reading unit 10 (step 66). The input character pattern is preprocessed by the preprocessing unit 12 (step 6
8) Then, the feature vector is input to the feature extraction unit 14 and the feature vector is extracted by the multi-direction histogram method (step 70).

抽出された特徴ベクトルは、並べ替え部16によって、並
べ替えテーブル部18に登録されている並べ替えテーブル
の情報に従い、前記標準偏差（偏差）または分散の大き
い次元順に成分が並べ替えられる（ステップ72）。この
ように成分の並べ替え後のＭ個の特徴ベクトルは辞書作
成部20に送られ、それらの平均ベクトルが、その文字種
ｋの辞書パターンの特徴ベクトルとして辞書20に本登録
される（ステップ74）。The extracted feature vector is rearranged by the rearrangement unit 16 according to the information of the rearrangement table registered in the rearrangement table unit 18, in which the components are rearranged in the order of the standard deviation (deviation) or the larger variance (step 72). ). In this way, the M feature vectors after the rearrangement of the components are sent to the dictionary creating unit 20, and their average vector is main-registered in the dictionary 20 as the feature vector of the dictionary pattern of the character type k (step 74). .

辞書作成部20により、全部の文字種（Ｋ種）について辞
書作成処理が終了したか調べられ（ステップ76）、未処
理の文字種が残っているならば、未処理の文字種につい
てステップ66から同様の処理が行われる。The dictionary creation unit 20 checks whether or not the dictionary creation processing has been completed for all the character types (K types) (step 76), and if there are unprocessed character types, the same processing from step 66 is performed for the unprocessed character types. Is done.

全文字種について処理が完了すると、辞書そのものゝ作
成は終わりであるが、次に判定閾値決定の処理が行われ
る。When the processing is completed for all the character types, the dictionary itself is finished, but the processing for determining the judgment threshold value is performed next.

この処理においては、文字種毎に辞書作成に用いられた
と同様なＭ個の文字パターンが入力され（ステップ7
8）、前処理（ステップ80）の後、それぞれ多層方向ヒ
ストグラム法による特徴ベクトルを抽出される（ステッ
プ82）、このＭ個の文字パターンの特徴ベクトルはそれ
ぞれ、並べ替え部16によって並べ替えテーブルの情報に
従い成分が並べ替えられ（ステップ84）、マッチング部
24に入力される。In this process, M character patterns similar to those used to create the dictionary for each character type are input (step 7).
8) After pre-processing (step 80), feature vectors by the multi-direction histogram method are extracted (step 82). The feature vectors of the M character patterns are respectively sorted by the sorting unit 16 in the sorting table. The components are rearranged according to the information (step 84), and the matching unit
Entered in 24.

次にマッチング部24および閾値決定部28の起動により、
閾値決定が行われる。すなわち、マッチング部24によ
り、その各特徴ベクトルと文字種ｋの辞書パターンの特
徴ベクトルとの距離が上位Ｎ次元まで演算され、閾値決
定部28において、その演算結果から、上位Ｎ次元まで距
離の標準偏差が求められる（ステップ86）。そして、そ
の標準値が文字値ｋの判定閾値として閾値テーブル部30
に登録される（ステップ88）。Next, by starting the matching unit 24 and the threshold value determining unit 28,
A threshold decision is made. That is, the matching unit 24 calculates the distance between each feature vector and the feature vector of the dictionary pattern of the character type k up to the upper N dimensions, and the threshold value determination unit 28 calculates the standard deviation of the distance from the calculation result to the upper N dimensions. Is required (step 86). Then, the standard value is used as the determination threshold of the character value k in the threshold table unit
Is registered (step 88).

同様にして、他の文字種の判定閾値が決定され、閾値テ
ーブル部30に登録される。Similarly, determination thresholds for other character types are determined and registered in the threshold table unit 30.

全文字種について閾値の決定登録が完了したと判定され
ると（ステップ90）、辞書作成モードの処理全体が終了
する。When it is determined that the thresholds have been determined and registered for all character types (step 90), the entire processing in the dictionary creation mode ends.

つぎに、文字認識モードの場合について以下に説明す
る。また、このモードにおける処理の概略フローチャー
トを第３図に示し、以下の説明において対応するステッ
プ番号を（）内に示す。Next, the case of the character recognition mode will be described below. Further, a schematic flowchart of the processing in this mode is shown in FIG. 3, and the corresponding step numbers are shown in parentheses in the following description.

認識対象の未知の文字パターンは、読み取り部10より入
力され（ステップ100）、前処理部12により前処理（ス
テップ102）の後、特徴抽出部14で多層方向ヒストグラ
ム法によって特徴ベクトル（こゝでは256次元のベクト
ル）を抽出される（ステップ104）。The unknown character pattern to be recognized is input from the reading unit 10 (step 100), pre-processed by the pre-processing unit 12 (step 102), and then the feature vector ( A 256-dimensional vector) is extracted (step 104).

この特徴ベクトルは、並べ替え部16によって並べ替えテ
ーブルの情報に従い成分が並べ替えられ、その並べ替え
後の特徴ベクトルYnがマッチング部24に入力される（ス
テップ106）。The rearrangement unit 16 rearranges the components of the feature vector according to the information in the rearrangement table, and the rearranged feature vector Yn is input to the matching unit 24 (step 106).

マッチング部24において、文字種ｋの辞書パターンの特
徴ベクトルFknと、入力文字の特徴ベクトルYnとの上位
Ｎ次元までのマッチング距離ｄが演算される（ステップ
108）。そして、その距離ｄと、閾値テーブル部30に登
録されている対応文字種ｋの判定閾値Thkとの比較判定
が行われる（ステップ110）。In the matching unit 24, the matching distance d between the feature vector Fkn of the dictionary pattern of the character type k and the feature vector Yn of the input character to the upper N dimensions is calculated (step
108). Then, the distance d is compared with the determination threshold Thk of the corresponding character type k registered in the threshold table unit 30 (step 110).

ｄ＞Thkであれば、その入力文字の文字種は現在の文字
種ｋである可能性はないので、その辞書パターンとのマ
ッチング処理を中断し、終了判定を行う（ステップ11
6）。If d> Thk, there is no possibility that the character type of the input character is the current character type k, so the matching process with the dictionary pattern is interrupted and an end determination is made (step 11).
6).

ｄ≦Thkであれば、現在の文字種は入力文字の文字種で
ある可能性が高いので、詳細マッチングが行われる。す
なわち、全次元についてマッチング距離Ｄが演算される
（ステップ112）。そして、その距離Ｄと、それまでの
候補の距離とが比較され、距離が小さいほうが文字種が
新しい候補として残され（ステップ114）。終了判定の
ステップ116に進む。If d ≦ Thk, the current character type is highly likely to be the character type of the input character, and therefore detailed matching is performed. That is, the matching distance D is calculated for all dimensions (step 112). Then, the distance D is compared with the distances of the candidates so far, and the smaller the distance, the character type is left as a new candidate (step 114). The process proceeds to step 116 of end determination.

ステップ116において、マッチングすべき文字種が残っ
ていると判定されると、次の文字種についてステップ10
8以降のマッチング処理が行われる。When it is determined in step 116 that the character type to be matched remains, the step 10 for the next character type is performed.
Matching processing after 8 is performed.

すべての文字種についてマッチング処理が行われると、
ステップ116で終了と判定され、最終的に残った候補の
文字種の文字コードが認識結果として出力される。これ
で、一つの入力文字の認識処理を完了する。When matching processing is performed for all character types,
In step 116, it is determined that the process has ended, and the character code of the finally remaining candidate character type is output as the recognition result. This completes the recognition process for one input character.

さて、手書き漢字などの文字の変形や傾向や程度は、記
入者（認識システムの使用者）によって様々である。こ
れに対処するために、様々な文字変形を考慮した文字パ
ターンを用意し、辞書作成に用いる。Now, the deformation, tendency, and degree of characters such as handwritten Chinese characters vary depending on the writer (user of the recognition system). In order to deal with this, character patterns considering various character transformations are prepared and used for dictionary creation.

しかし、そのようにしても、使用者によって、その手書
き文字パターンと辞書パターンとのマッチングの良し悪
しがある。また、同一の使用者でも、認識システムに対
し慣れるに従い、辞書とのマッチング性が向上し、認識
率が高く成る傾向が認められる。However, even in such a case, the matching between the handwritten character pattern and the dictionary pattern is good or bad depending on the user. Also, even the same user tends to improve the matching property with the dictionary and increase the recognition rate as the user gets used to the recognition system.

このような点に着目すると、詳細マッチングを行うか否
かの判定のための閾値を変更できると好都合である。す
なわち、辞書とのマッチングの良い使用者の場合または
慣れた使用者の場合、判定閾値を下げて詳細マッチング
の頻度を下げることにより、認識率を下げることなく、
認識速度を上げることができる。Focusing on such a point, it is convenient that the threshold for determining whether or not to perform the detailed matching can be changed. That is, in the case of a user who has a good matching with the dictionary or a user who is accustomed to it, by lowering the determination threshold value and reducing the frequency of detailed matching, without lowering the recognition rate,
The recognition speed can be increased.

次に、このような閾値の調整を可能とした本発明の第２
実施例について説明する。なお、この実施例の機能的構
成は前記実施例と同様であるので、そのブロック図は省
略し、以下の説明において必要に応じ第１図に参照す
る。Next, the second aspect of the present invention that enables such a threshold adjustment
Examples will be described. Since the functional configuration of this embodiment is the same as that of the previous embodiment, its block diagram is omitted, and reference will be made to FIG. 1 as necessary in the following description.

この実施例においては、辞書作成モードにおいて、前記
実施例と同様に、並べ替えテーブルと辞書が作成され
る。たゞし、詳細マッチングを行うか否かの判定のため
の閾値が、文字種毎の複数種類決定される。In this embodiment, in the dictionary creation mode, the sorting table and the dictionary are created as in the previous embodiments. However, a plurality of thresholds for determining whether or not to perform detailed matching are determined for each character type.

この点について、前記第１実施例に関連した第２図のフ
ローチャートを参照して説明すれば、ステップ88におい
て決定される閾値をThkとすれば、この実施例にあって
は、Thk/1,Thk/2〜Thk/L（Ｌは正の整数）の値が、それ
ぞれ文字種ｋの閾値Thk（１）,Thk（２）〜Thk（Ｌ）と
して閾値決定部28により決定され、閾値テーブル部30に
登録される。This point will be described with reference to the flowchart of FIG. 2 related to the first embodiment. If the threshold value determined in step 88 is Thk, in this embodiment, Thk / 1, The values of Thk / 2 to Thk / L (L is a positive integer) are determined by the threshold determination unit 28 as the thresholds Thk (1) and Thk (2) to Thk (L) of the character type k, respectively, and the threshold table unit 30. Be registered with.

第４図は、この実施例におけるパターン認識モードの概
略フローチャートである。この図を参照し、この実施例
におけるパターン認識モードについて説明する。FIG. 4 is a schematic flow chart of the pattern recognition mode in this embodiment. The pattern recognition mode in this embodiment will be described with reference to this drawing.

この実施例においては、閾値の選択のためのパラメータ
であるユーザレベルUL（＝1,2,3,…,L）が最初のステッ
プ200で設定される。このユーザレベルULは、この実施
例に係るパターン認識システムの使用者側で指定できる
ものである。In this embodiment, the user level UL (= 1, 2, 3, ..., L), which is a parameter for selecting the threshold value, is set in the first step 200. The user level UL can be specified by the user of the pattern recognition system according to this embodiment.

ステップ202からステップ210までは、第３図のステップ
100からステップ108までと同一内容である。ステップ21
4からステップ220までは、第３図のステップ112からス
テップ118までと同一内容である。Steps 202 to 210 are the steps shown in FIG.
The contents are the same as 100 to step 108. Step 21
Steps 4 to 220 have the same contents as steps 112 to 118 in FIG.

ステップ212は、上位Ｎ次元までのマッチング距離から
詳細マッチングを行うか否かを判定するためのステップ
であり、第３図のステップ110と対応する。この実施例
の場合、文字種ｋの複数の閾値Th（１）〜Th（Ｌ）の中
から、ユーザレベルULに対応する一つの閾値Th（UL）
が、その判定の閾値として用いられる。Step 212 is a step for determining whether or not to perform detailed matching from the matching distance to the upper N dimensions, and corresponds to step 110 in FIG. In the case of this embodiment, one threshold value Th (UL) corresponding to the user level UL is selected from a plurality of threshold values Th (1) to Th (L) of the character type k.
Is used as a threshold for the determination.

つまり、この実施例に係わるパターン認識システムに慣
れた使用者の場合、ユーザレベルULを高いレベル（最高
はＬ）に指定すれば、より小さい閾値（最低はTh
（Ｌ））が選択されて判定に用いられるから、詳細マッ
チング（ステップ214）に進む頻度が少なくなり、それ
だけマッチング効率が向上して文字認識速度が上がる。That is, in the case of a user who is familiar with the pattern recognition system according to this embodiment, if the user level UL is designated as a high level (the highest is L), a smaller threshold value (the lowest is Th.
(L)) is selected and used for the determination, the frequency of proceeding to the detailed matching (step 214) is reduced, the matching efficiency is improved and the character recognition speed is increased accordingly.

逆に、慣れていない使用者または辞書とのマッチングの
良くない使用者の場合、低いユーザレベルUL（最低は
１）を指定すれば、大きい閾値（最高はThk（１））が
選択されて判定に用いられ、詳細マッチングの頻度が上
がるから、認識速度は落ちるが認識率を上げることがで
きる。On the other hand, if the user is unfamiliar or has poor matching with the dictionary, if a low user level UL (lowest is 1) is specified, a large threshold value (highest is Thk (1)) is selected and judged. Since the frequency of detailed matching increases, the recognition speed decreases but the recognition rate can be increased.

さて、前述のように、認識率は使用者によって変動する
が、さらに文字種によっても変動する。これは、辞書作
成用のパターンとして、ある癖を持ったパターンが用い
られた場合などに顕著であるが、このような点は漢字の
ように文字種が多い場合、辞書作成段階ではチェックが
困難である。As described above, the recognition rate varies depending on the user, but also varies depending on the character type. This is remarkable when a pattern with a certain habit is used as a pattern for dictionary creation, but such a point is difficult to check at the dictionary creation stage when there are many character types such as Kanji. is there.

そのような文字種ごとに認識率の変動を吸収することが
できる。本発明の第３実施例について以下に説明する。
なお、この実施例に係るパターン認識システムの機能的
構成は前記第１実施例と同様である。また、辞書作成モ
ードの処理は前記第２実施例と同様であり、文字種毎に
複数種類の判定用閾値Th（１）〜Th（Ｌ）が決定され
る。It is possible to absorb the variation in the recognition rate for each such character type. The third embodiment of the present invention will be described below.
The functional configuration of the pattern recognition system according to this embodiment is the same as that of the first embodiment. The processing in the dictionary creation mode is similar to that in the second embodiment, and a plurality of types of determination thresholds Th (1) to Th (L) are determined for each character type.

第５図は、この実施例におけるパターン認識モードの処
理の概略フローチャートである。この図において、ステ
ップ300〜308は第４図のステップ202〜210と同一内容で
あり、ステップ314〜320も第４図のステップ214〜220と
同一内容である。FIG. 5 is a schematic flowchart of the processing in the pattern recognition mode in this embodiment. In this figure, steps 300 to 308 have the same contents as steps 202 to 210 in FIG. 4, and steps 314 to 320 have the same contents as steps 214 to 220 in FIG.

ステップ310では、文字種毎のレベルLV（ｋ）が、閾値
選択用のパラメータＶとして設定される。このレベルLV
（ｋ）は、例えばパターン認識システムの運用中に文字
種ｋのリジェクト・認識率Rkを蓄えておき、そのPk値を
ある定数で割り算した値として与えられる。In step 310, the level LV (k) for each character type is set as the threshold selection parameter V. This level LV
(K) is given as a value obtained by storing the reject / recognition rate Rk of the character type k during operation of the pattern recognition system and dividing the Pk value by a certain constant.

そして、ステップ312は第４図のステップ212に対応する
ステップであり、文字種ｋの複数の閾値Thk（１）〜Th
（Ｌ）の中から、パラメータＶに対応する一つの閾値Th
k（Ｖ）を選択し、距離ｄと比較することにより詳細マ
ッチング（ステップ314）に進むか否かを判定する。Then, step 312 is a step corresponding to step 212 in FIG. 4, and is a plurality of thresholds Thk (1) to Thk of the character type k.
One threshold Th corresponding to the parameter V from (L)
It is determined whether or not to proceed to the detailed matching (step 314) by selecting k (V) and comparing it with the distance d.

なお、この実施例と、前記第２実施例とを組み合わせた
第４の実施例について、便宜上、第５図のフローチャー
トを参照して説明する。A fourth embodiment, which is a combination of this embodiment and the second embodiment, will be described with reference to the flowchart of FIG. 5 for convenience.

この実施例にあっては、ステップ300の前にユーザレベ
ルULの設定ステップ（第４図のステップ200に相当）が
追加される。また、ステップ310において、文字種ｋの
レベルLV（ｋ）とユーザレベルULとの和がパラメータＶ
として設定される。そして、ステップ312において、パ
ラメータＶに対応する閾値Thk（Ｖ）が判定に用いられ
る。In this embodiment, a user level UL setting step (corresponding to step 200 in FIG. 4) is added before step 300. In step 310, the sum of the level LV (k) of the character type k and the user level UL is the parameter V.
Is set as. Then, in step 312, the threshold Thk (V) corresponding to the parameter V is used for the determination.

これ以外は、前記各実施例と同様である。Other than this, it is the same as each of the above-mentioned embodiments.

この実施例によれば、使用者および文字種による変動要
因を吸収し、最適な閾値を用いて詳細マッチングを行う
か否かの判定を行うことができるので、前記第２実施例
と第３実施例の両方の利点が得られる。According to this embodiment, since it is possible to absorb the variation factors depending on the user and the character type and determine whether or not to perform the detailed matching using the optimum threshold value, the second and third embodiments described above. Both advantages are obtained.

さて、漢字のような複雑な文字は、活字印刷であっても
手書きであっても、「つぶれ」が起こりやすい。発明者
の研究によれば、第１層の画素は文字の「つぶれ」の影
響を受けにくゝ、第１層の特徴量は「つぶれ」文字の場
合でも安定しており、「つぶれ」文字の認識に有効であ
ることが分かった。By the way, complicated characters such as kanji are apt to be "crushed" regardless of whether they are printed by handprint or handwritten. According to the inventor's research, the pixels of the first layer are not affected by the "blurred" character, and the feature amount of the first layer is stable even in the case of the "blurred" character, It was found to be effective in recognizing.

このような点に基づき、「つぶれ」文字に対しても高い
認識率を達成でき、また、効率的な認識処理の可能な他
の実施例について、以下に説明する。Based on this point, another embodiment in which a high recognition rate can be achieved even for "blurred" characters and efficient recognition processing is possible will be described below.

なお、この実施例の機能的構成は前記各実施例と殆ど同
様であるので、第１図のブロック図を参照して説明す
る。Since the functional structure of this embodiment is almost the same as that of each of the above-mentioned embodiments, it will be described with reference to the block diagram of FIG.

この実施例においては、第１図に破線で示すようにつぶ
れ判定部32が追加される。このつぶれ判定部32は、入力
文字の周囲長Scと黒画素数Tbとの比Rcを測定し、その比
Rc（つぶれ度合い）が所定の閾値以上の場合には、入力
文字を正常文字と判定し、その閾値を下回る場合には、
入力文字を「つぶれ」文字と判定する。この判定結果は
並べ替え部16およびマッチング部24に与えられる。In this embodiment, a collapse determination unit 32 is added as shown by the broken line in FIG. The crush determination unit 32 measures the ratio Rc between the perimeter length Sc of the input character and the number of black pixels Tb, and calculates the ratio.
If the Rc (degradation degree) is greater than or equal to a predetermined threshold value, the input character is judged to be a normal character, and if it falls below that threshold value,
Determine the input character as a "blank" character. The determination result is given to the rearrangement unit 16 and the matching unit 24.

なお、つぶれ判定のアルゴリズムは適宜変更してもよい
ものである。It should be noted that the algorithm for determining the collapse may be changed as appropriate.

また、この実施例においては、正常文字用の第１の並べ
替えテーブルと、つぶれ文字用の第２の並べ替えテーブ
ルとが辞書作成時に作成されて、それぞれ並べ替えテー
ブル部18に登録される。Further, in this embodiment, the first rearrangement table for normal characters and the second rearrangement table for collapsed characters are created at the time of dictionary creation and are registered in the rearrangement table unit 18, respectively.

正常文字用の第１の並べ替えテーブルには、辞書の特徴
ベクトルの並べ替え後の成分順に合わせて、入力文字か
ら抽出された特徴ベクトルを、第１層および第２層の特
徴量に関して標準偏差もしくは偏差または分散の大きい
次元順に並べ替えるための情報が格納される。つぶれ文
字用の第２の並べ替えテーブルには、入力文字から抽出
された特徴ベクトルを、第１層の特徴量に関して標準偏
差もしくは偏差または分散の大きい次元順に特徴ベクト
ルの成分を並べ替えるための情報が格納される。In the first rearrangement table for normal characters, the feature vectors extracted from the input characters are matched with the component order after rearrangement of the feature vectors of the dictionary, and the standard deviations of the feature amounts of the first layer and the second layer are set. Alternatively, information for sorting in the order of larger deviation or variance is stored. Information for rearranging the feature vectors extracted from the input characters in the second rearrangement table for the collapsed characters with respect to the feature amount of the first layer Is stored.

文字認識モードにおいては、入力文字が正常文字と判定
された場合、入力文字から抽出された特徴ベクトルは、
並べ替え部16により第１の並べ替えテーブルの情報に従
って第１層および第２層の特徴量に関して成分の並べ替
えが行われてからマッチング部24へ送られる。他方、入
力文字が「つぶれ」文字と判定された場合、その抽出特
徴ベクトルは、第２の並べ替えテーブルの情報に従っ
て、第１層の特徴量に関してだけ成分量の並べ替えが行
われる。In the character recognition mode, when the input character is determined to be a normal character, the feature vector extracted from the input character is
The rearrangement unit 16 rearranges the components with respect to the characteristic amounts of the first layer and the second layer according to the information in the first rearrangement table, and then sends the components to the matching unit 24. On the other hand, when the input character is determined to be a “blurred” character, the extracted feature vector is rearranged in the component amount only with respect to the feature amount of the first layer according to the information of the second rearrangement table.

マッチング部24においては、その並べ替え後の入力文字
の特徴ベクトルと辞書の特徴ベクトルとの距離演算を行
うが、正常文字の場合には第１層および第２層の特徴量
に関して最高256次元まで距離の演算を行い、他方、
「つぶれ」文字の場合には第１層の特徴量に関して最高
128次元まで距離の演算を行う。つまり、「つぶれ」文
字に関しては、「つぶれ」の影響を受けにくい第１層の
特徴量に限って距離演算が行われ、しかも有効次元を限
定している。The matching unit 24 calculates the distance between the rearranged input character feature vector and the dictionary feature vector, but in the case of normal characters, up to a maximum of 256 dimensions regarding the feature amounts of the first and second layers. Calculates the distance, while
In the case of "blank" characters, it is the highest regarding the feature amount of the first layer
Calculates distance up to 128 dimensions. That is, for the “blurred” character, the distance calculation is performed only for the feature amount of the first layer that is not easily affected by the “blurred”, and the effective dimension is limited.

いずれの場合にも、前記実施例と同様に、上位のある次
元までの距離を判定閾値と比較し、距離演算を中止する
か継続するかの判定を行う。In any case, as in the above-described embodiment, the distance to a certain higher-order dimension is compared with the determination threshold, and it is determined whether the distance calculation should be stopped or continued.

この実施例によれば、「つぶれ」文字に対しても高い認
識率を達成でき、しかも距離演算量が削減されるので、
マッチング処理の一層の効率化が可能である。According to this embodiment, a high recognition rate can be achieved even for "blurred" characters, and the distance calculation amount can be reduced.
It is possible to further improve the efficiency of the matching process.

以上、本発明を実施例について説明したが、本発明はそ
れだけに限定されるものではなく、様々な変形が許され
るものである。Although the present invention has been described above with reference to the embodiments, the present invention is not limited thereto and various modifications are allowed.

〔effect〕

以上の説明から明らかなように、本発明によれば、多層
方向ヒストグラム法の利点を損なうことなく、マッチン
グ処理の効率を大幅に向上し、高速の文字認識が可能で
あり、また、つぶれ文字に対しても高い認識率を達成で
きる。As is clear from the above description, according to the present invention, the efficiency of the matching process is significantly improved, high-speed character recognition is possible, and the collapsed characters are A high recognition rate can also be achieved.

【図面の簡単な説明】第１図は本発明の第１の実施例ならびに他の実施例に拘
わるブロック図、第２図は第１の実施例における辞書作
成処理の概略フローチャート、第３図は第１実施例にお
ける文字認識処理の概略フローチャート、第４図は本発
明の第２実施例における文字認識処理の概略フローチャ
ート、第５図は本発明の第３実施例における文字認識処
理の概略フローチャート、第６図は多層方向ヒストグラ
ム法の特徴ベクトルの性質を説明するためのベクトル図
である。 10…読み取り部、14…特徴抽出部、16…並べ替え部、18
…並べ替えテーブル部、22…辞書、24…マッチング部、
30…閾値テーブル部。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram relating to the first embodiment and other embodiments of the present invention, FIG. 2 is a schematic flowchart of a dictionary creating process in the first embodiment, and FIG. FIG. 4 is a schematic flowchart of character recognition processing in the first embodiment, FIG. 4 is a schematic flowchart of character recognition processing in the second embodiment of the present invention, and FIG. 5 is a schematic flowchart of character recognition processing in the third embodiment of the present invention. FIG. 6 is a vector diagram for explaining the property of the feature vector of the multi-layered histogram method. 10 ... reading unit, 14 ... feature extracting unit, 16 ... sorting unit, 18
… Sort table section, 22… Dictionary, 24… Matching section,
30 ... Threshold table section.

Claims

[Claims]

1. A character recognition method using a feature vector according to a multi-directional histogram method, comprising a dictionary for storing feature vectors of various characters in which components are rearranged in order of large standard deviation or deviation or variance, and the rearrangement. Table for storing information for a character type, a threshold value table for storing a judgment threshold value corresponding to a character type, a means for extracting a feature vector from an input character, and a rearrangement table for the components of the extracted feature vector. A means for rearranging the standard deviation or the deviation or the variance in descending order according to the stored information, and the rearranged feature vector and the feature vector stored in the dictionary. Distance calculation is performed only on the characteristic component, and the distance is stored in the threshold table. Only when it is less than or equal to the character type determination threshold value, a means for recognizing an input character by performing a distance calculation only on a part of characteristic components added with a characteristic component having a smaller standard deviation or deviation or variance is provided. Character recognition method.

2. A character recognition method using a feature vector by a multi-layered directional histogram method, which stores a feature vector of various characters in which components are rearranged in order of large standard deviation or deviation or variance, and a first layer. And the second
A first rearrangement table that stores information for rearranging the components of the feature vector in descending order of standard deviation or deviation or variance with respect to the layer feature amount, and large standard deviation, deviation or variance with respect to the first layer feature amount. A second rearrangement table that stores information for rearranging the components of the feature vector in order, a threshold table that stores the determination threshold corresponding to the character type, a feature extraction unit that extracts the feature vector from the input character, and an input. Determination means for determining whether the character is a normal character or a collapsed character, and the components of the feature vector extracted from the input character determined to be a normal character by the determination are arranged according to the storage information of the first rearrangement table. In other words, the components of the feature vector extracted from the input character determined to be a collapsed character by the determination are subjected to the second rearrangement. Means for rearranging the input characters after the rearrangement, and the feature vector stored in the dictionary and the feature vector of the input characters after the sort, the first layer and the first layer With respect to the feature amounts of the two layers, and in the case of the input character determined to be a collapsed character, the distance calculation is performed only for the feature amount of the standard deviation or the deviation or the large variance for the feature amount of the first layer, Only when the distance is equal to or less than the judgment threshold value of the corresponding character type stored in the threshold value data, the distance calculation is performed only on a part of the characteristic components to which the characteristic components having the standard deviation or the smaller deviation or variance are added, and the input character is input. And a means for recognizing characters.