JPH03214283A

JPH03214283A - Conversion table creation circuit using systolic array

Info

Publication number: JPH03214283A
Application number: JP2008421A
Authority: JP
Inventors: Masayuki Kimura; 木村　正行; Hirotomo Aso; 阿曽　弘具; Shinichiro Omachi; 真一郎大町; Yutaka Katsuyama; 裕勝山
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-01-19
Filing date: 1990-01-19
Publication date: 1991-09-19

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔概　　　要〕切り出した１文字のパターンをシストリックアレイ構造
によって正規化のための変換表を作成するシストリック
アレイにおける変換表作成回路に関し、切り出した文字データを正規化するためのヒストグラム
情報をパイプライン処理でかつ並列に行いパターン認識
を高速化するシストリノルアレイによる変換表作成回路
を提供することを目的とし、Ｎ個のセルを直列接続した
直列回路をＮ列設け、第ｍ列目の入力側からｍ番目のセ
ルは、入力データをシフトして出力するとともに、前記
入力データと第ｍ−１列目のｍ−１列目のセルよりプ川
わる幅データとカウントデータとから幅データトカウン
トデータとを更新してｍ＋１列目の入力側からｍ＋１番
目のセルに出力するヒストグラム計算セルであり、該ヒ
ストグラム計算セル以外のセルはシフトレシスタである
ように構成する。[Detailed Description of the Invention] [Summary] This invention relates to a conversion table creation circuit in a systolic array that creates a conversion table for normalizing an extracted single character pattern using a systolic array structure, and normalizes extracted character data. The purpose is to provide a conversion table creation circuit using a systrinol array that speeds up pattern recognition by processing histogram information in pipeline processing in parallel to speed up pattern recognition. The m-th cell from the input side of the m-th column shifts and outputs the input data, and the input data and the width data of the m-1th column are different from the cell of the m-1th column. This is a histogram calculation cell that updates the width data and count data from the input side of the m+1 column and outputs it to the m+1 cell from the input side of the m+1 column, and cells other than the histogram calculation cell are configured to be shift registers. .

[Industrial application field]

本発明はパターン認識装置に係り、更に詳しくは切り出
した１文字のパターンをシストリツクアレイ構造によっ
て正規化のための変換表を作成するシストリックアレイ
における変換表作成回路に関する。The present invention relates to a pattern recognition device, and more particularly to a conversion table creation circuit in a systolic array that creates a conversion table for normalizing a single character pattern cut out using a systolic array structure.

（従　来　の　技　術〕コンピュータシステムの発展により、画像データを取り
込むとともに、取り込んだ画像データから文字を切り出
し、読み取った書類の文章のそれぞれの文字を認識する
読み取り装置が実用化している。この読み取り装置はた
とえばイメージスキャナ等によって読み取ったドットデ
ータをあらかじめ定められた領域単位で分割し、その分
割内での文字とあらかじめ定められた文字とを比較し、
１番似にかよった文字を結果として出力してい“る。(Conventional technology) With the development of computer systems, reading devices have been put into practical use that capture image data, cut out characters from the captured image data, and recognize each character in the text of the read document. The device divides the dot data read by an image scanner, etc. into predetermined area units, compares the characters within the division with the predetermined characters,
The most similar character is output as the result.

このあらかじめ定められたデータは一般的には辞書メモ
リに格納されており、たとえば各規定の文字を特徴化し
たデータとして記憶されている。そして認識すべき文字
が入力した時、同様にその入力した文字を特徴化し、前
述の辞書メモリに格納されているあらかじめ定められた
特徴データとの距離を求めている。この求めた距離から
最も小さい文字を認識結果として出力している。This predetermined data is generally stored in a dictionary memory, and is stored, for example, as data characterizing each prescribed character. When a character to be recognized is input, the input character is similarly characterized and the distance from the predetermined characteristic data stored in the dictionary memory is determined. The smallest character from this determined distance is output as the recognition result.

前述のようなシステムにおいでは、切り出した１個の文
字を認識する場合、それらの文字があらかじめ決められ
た大きさとなっていると認識率が向上する。このため従
来においては切り出した文字単位で樅横方向にあらかじ
め決められた高さと、幅にしている。すなわち正規化し
ている。In the above-mentioned system, when recognizing a single extracted character, the recognition rate improves if the characters have a predetermined size. For this reason, conventionally, each character cut out has a predetermined height and width in the lateral direction of the fir tree. In other words, it is normalized.

この正規化の方式は種々あるが、その１方式として切り
出した文字の縦方向と横方向に対してヒストグラムを求
め、得られたヒストグラムから文字を拡大や縮小して正
規化する方式がある。There are various methods for this normalization, one of which is to obtain histograms in the vertical and horizontal directions of cut out characters, and to normalize the characters by enlarging or reducing them based on the obtained histograms.

このような方式によって認識すべき文字が一定の大きさ
になるので、認識率が向上する。With this method, the characters to be recognized have a constant size, so the recognition rate improves.

［発明が解決しようとする課題］前述し，たヒストグラムを求める場合、従来においては
、切り出した領域をドット単位で読み出し、例えばそれ
以前に読み出した左右や上下のドノトが黒であるか白で
あるか等のフラグから次のドッＩ・への情報を求めて順
次処理している。このような従来の方式はドット単位で
処理してヒストグラムを求めているので、その処理に多
くの時間を必要とする問題を有していた。すなわち、認
識処理を行う時間が大となる問題を有していた。[Problems to be Solved by the Invention] When obtaining the above-mentioned histogram, conventionally, the cut out area is read out dot by dot, and for example, the left and right, top and bottom dots read previously are either black or white. The information for the next dot I is obtained from the flags of these flags and processed sequentially. Since such a conventional method calculates a histogram by processing dot by dot, it has the problem that the processing requires a lot of time. That is, there was a problem in that it took a long time to perform the recognition process.

本発明は切り出した文字データを正規化するためのヒス
トグラム情報をパイプライン処理でかつ並列に行いパタ
ーン認識を高速化するシストリノルアレイによる変換表
作成回路を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to provide a conversion table creation circuit using a systrinol array that speeds up pattern recognition by processing histogram information for normalizing extracted character data in parallel in a pipeline process.

[Means to solve the problem]

第１図は本発明の原理ブロック図である。 FIG. 1 is a block diagram of the principle of the present invention.

Ｎ個のセルＨ（１．１）〜Ｈ（Ｎ，１）・・・Ｈ（１，
Ｎ）〜Ｈ（Ｎ，Ｎ）を直列接続した直列回路をＮ列設け
ている。N cells H(1.1) to H(N,1)...H(1,
N series circuits in which N) to H(N, N) are connected in series are provided.

そのセルのうち第ｍ列目の入力側からｍ番目のセルはヒ
ストグラム計算セルであり、残りのセルはシフトレジス
タである。Among the cells, the m-th cell from the input side of the m-th column is a histogram calculation cell, and the remaining cells are shift registers.

ヒストグラム計算セルは入力データをシフトして次のセ
ルへ出力するとともに、このデータと隣の列の１つ前の
セルから加わる幅データとカウントデータとから幅デー
タとカウントデータとを更新して反対隣の列の１つ後の
セルに加える。The histogram calculation cell shifts the input data and outputs it to the next cell, and also updates the width data and count data from this data and the width data and count data added from the previous cell in the adjacent column. Add to the next cell in the next column.

また、例えばヒストグラム計算セルは横方向のフラグレ
ジスタ、縦方向の幅レジスタやカウンタを有する。Further, for example, the histogram calculation cell has a flag register in the horizontal direction, a width register in the vertical direction, and a counter.

［作　　用］Ｎ個のドット（１ドット行）がＮ個の列に対応するドノ
ト位置単位で加わり順次シフトする。[Operation] N dots (one dot row) are added in donot position units corresponding to N columns and shifted sequentially.

ヒストグラム計算セルは１ドット行に対して斜め方向に
設けられているので、例えばシストリックアレイの下方
向（列の下方向）から１ドット行が加わった時には、第
１列目の入力側から１番目のヒストグラム計算セルが縦
方向と横方向のヒストグラム計算を行う。そして順次シ
ストリックアレイ中をデータがシフトするたびに次の行
と次の列（斜め右上方向にヒストグラム計算セルが設け
られている）のヒストグラム計算を行う。これを順次繰
り返すことにより、右上最終のヒストグラム計算セルか
ら順次カウント値や幅の結果が出力される。また各ヒス
トグラム計算セル内のレジスタやカウンタにしても対向
する方向のカウント値や幅データが残る。Since the histogram calculation cells are provided diagonally with respect to one dot row, for example, when one dot row is added from the bottom of the systolic array (bottom of the column), one dot row is added from the input side of the first column. The th histogram calculation cell performs vertical and horizontal histogram calculations. Then, each time the data is sequentially shifted in the systolic array, the histogram calculation for the next row and the next column (histogram calculation cells are provided diagonally in the upper right direction) is performed. By repeating this process sequentially, the results of count values and widths are sequentially output from the last histogram calculation cell in the upper right corner. Further, count values and width data in opposing directions remain in the registers and counters in each histogram calculation cell.

シストリックアレイ中を１文字のデータが通過する時に
同時に縦方向や横方向のヒストグラムを高速に求めるこ
とができる。When one character of data passes through the systolic array, vertical and horizontal histograms can be obtained simultaneously at high speed.

［実　　施　　例］以下図面を用いて本発明を詳細に説明する。[Example] The present invention will be explained in detail below using the drawings.

第２図は本発明の実施例のシステム構成図である。FIG. 2 is a system configuration diagram of an embodiment of the present invention.

イメージスキャナ等によって読み取られた情報は画像デ
ータとして画像メモリ１０に格納される。Information read by an image scanner or the like is stored in the image memory 10 as image data.

この画像メモリ１０はイメージスキャナで読み取る１頁
分の記憶容量を有しており、読み取った情報のそれぞれ
各ドットを白あるいは黒の２｛Ｉ！すなわち０．１のデ
ータとして記憶する。This image memory 10 has a storage capacity for one page read by an image scanner, and each dot of the read information is divided into white or black 2{I! That is, it is stored as data of 0.1.

画像メモリ１０に格納された画像データはノイズ除去モ
ジュール１１に加わり、読み取り時に発生した雑音を除
去する。例えば、このノイズ除去モジュール１１によっ
て除去されるノイズは文字情報等に無関係な雑音例えば
３×３のマスクで中心を黒、その中心のドットを囲む８
ドットが白等の雑音であり、その中心のドットをノイズ
除去モジュール１１は白とする。このノイズ除去モジュ
ールは文字認識前処理部１２内に設けているがこれに限
るわけでなく、例えば後述する正規化モジュール１６内
に文字単位で格納する時に行ってもよく、またさらには
細線化、線素化の時に行ってもよい。The image data stored in the image memory 10 is applied to a noise removal module 11 to remove noise generated during reading. For example, the noise removed by this noise removal module 11 is noise unrelated to character information, etc. For example, a 3 x 3 mask with a black center and 8 pixels surrounding the center dot.
The dots are noise such as white, and the noise removal module 11 makes the dot in the center white. Although this noise removal module is provided in the character recognition pre-processing section 12, it is not limited thereto, and may be performed when storing each character in the normalization module 16, which will be described later. It may be performed at the time of line element formation.

ノイズ除去モジュール１１によってノイズ除去された画
像情報は行ヒストグラムモジュール１３、列ヒストグラ
ムモジュール１４、さらには読み出し制御モジュール１
５に加わる。行ヒストグラムモジュール１３は読み取っ
た情報、例えば前述したイメージスキャナによって読み
取った用紙の内容を各ドット単位で列方向に投影し、各
ドット単位の行のドット数を求めるモジュールである。The image information from which noise has been removed by the noise removal module 11 is sent to the row histogram module 13, the column histogram module 14, and further to the readout control module 1.
Join 5. The row histogram module 13 is a module that projects the read information, for example, the content of the paper read by the above-mentioned image scanner, in the column direction in units of dots, and calculates the number of dots in the row in each dot unit.

すなわち、１ドットの行（横方向）に対し、その１ドッ
ト行にい《つの黒ドソトが存在するかを各１ドット行単
位で求める処理である。また列ヒストグラム１４は前述
した行ヒストグラムと同様に列方向に対し投影し、その
投影した黒ドノトの数を求める処理である。That is, this is a process of determining for each dot row (horizontal direction) whether or not there are three black dots in that one dot row. Also, the column histogram 14 is a process of projecting in the column direction in the same way as the row histogram described above, and calculating the number of projected black dots.

画像メモリ１０から行方向に順次１ドット単位で読み出
し、ノイズ除去モジュール１１を介して加わったデータ
（ラスタースキャンと同様のドットの読み出し）を、行
ヒストグラムモジュール１３は順次黒のドットをカウン
トする（１ドノト行分）。そして、順次行単位で黒のド
ット数を求める。この黒のドット数が各行に対応する行
ヒストグラムとなる。また列ヒストグラム１４は１ドッ
ト行内のドット数に対応してそれぞれカウンタを有し１
行のドットが順次加わる度に黒ドノトに対応するカウン
タをインクリメントする。前述した動作を１頁分行うこ
とにより行ヒストグラムモジュール１６ならびに列ヒス
トグラムモジュール１４からは、それぞれ行位置ならび
に列位置に対するドット数を表したいわゆる行ヒストグ
ラム，列ヒストグラムが求められる。そしてその結果は
読み出し制御モジュール１５に加わる。Data is sequentially read out dot by dot in the row direction from the image memory 10 and added via the noise removal module 11 (reading of dots similar to raster scanning), and the row histogram module 13 sequentially counts black dots (1 line). Then, the number of black dots is sequentially calculated for each row. This number of black dots becomes the row histogram corresponding to each row. The column histogram 14 also has counters corresponding to the number of dots in one dot row.
Each time a row of dots is added in sequence, the counter corresponding to the black dot is incremented. By performing the above-described operations for one page, the row histogram module 16 and the column histogram module 14 obtain so-called row histograms and column histograms representing the number of dots for row positions and column positions, respectively. The result is then applied to the read control module 15.

読み出し制御モジュール１５はそれらの行ヒストグラム
，列ヒストグラムから行の位置ならびに列の位置を順次
求める。例えばこの位置は行ヒストグラムの周期や列ヒ
ストグラムの周期によって得ることができる。The readout control module 15 sequentially obtains row positions and column positions from these row histograms and column histograms. For example, this position can be obtained by the period of the row histogram or the period of the column histogram.

読み出し制御モジュール１５は行ならびに列の位置を求
めるが、この他に以下の処理を行う。画像データ例えば
イメージスキャナから読みとった情報は紙の位置等によ
り傾きを有することがある。The read control module 15 determines the row and column positions, but also performs the following processing. Image data, for example, information read from an image scanner, may have a tilt depending on the position of the paper and the like.

このため、読み出し制御モジュール１５は列ヒストグラ
ムならびに行ヒストグラムが最大値をとるよう、ヒスト
グラムを求める角度を順次変更し、補正角度を求める。For this reason, the readout control module 15 sequentially changes the angle at which the histogram is obtained so that the column histogram and the row histogram take the maximum value, and obtains a correction angle.

そして前述したノイズ除去モジュール１１から加わる画
像情報を再度入力して、最終的なヒストグラムを求め、
その補正した傾きにより得られた行ヒストグラム（ヒス
トグラムが最大値をとる）がＯから正に変化する点（正
から０でも可）より１周期分その傾きに対応した１行の
データを読み出し、読み出し制御モジュール１５内に設
けられた行バッファに格納する。Then, input the image information added from the above-mentioned noise removal module 11 again to obtain the final histogram,
From the point where the row histogram obtained by the corrected slope (the histogram takes the maximum value) changes from 0 to positive (possible to 0), read out one row of data corresponding to the slope for one period. The data is stored in a row buffer provided within the control module 15.

読み出し制御モジュールｌ５はさらにその行バッファに
格納した１行のデータの内、行内における列ヒストグラ
ムを再度求め、列ヒストグラムが０から正に変化する位
置からそのデータを切り出し正規化モジュールｌ６に出
力する。また変換表作成モジュールｌ７にも出力する。The read control module l5 further obtains the column histogram within the row of one row of data stored in the row buffer, cuts out the data from the position where the column histogram changes from 0 to positive, and outputs it to the normalization module l6. It is also output to the conversion table creation module l7.

この切り出したデータは１文字領域のデータである。This extracted data is data for a single character area.

変換表作成モジュールＩ７は正規化モジュール１６によ
って１文字を正規化するための変換デー夕を求めるモジ
ュールであり、たとえば読み出し制御モジュール１５に
よって切り出した１文字領域に対し、列方向並びに行方
向に投影し、黒ドノトが存在する列並びに行からドット
単位（行や列単位）で、列並びに行方向のカウンタをイ
ンクリメントし、１文字の領域内の最終値までの値を求
める。このモジュールの回路並びに動作については後述
する正規化の原理に基づき後に詳細に説明する。The conversion table creation module I7 is a module that obtains conversion data for normalizing one character by the normalization module 16. For example, it projects data in the column direction and row direction onto the one character area cut out by the readout control module 15. , the counters in the column and row directions are incremented dot by dot (row and column) from the column and row where the black dot exists, and the value up to the final value in the region of one character is calculated. The circuit and operation of this module will be explained in detail later based on the normalization principle described later.

正規化モジュール１６では、この１文字で切り出したド
ットの行方向並びに列方向の最終値並びに切り出した１
文字の大きさから、その文字が切り出し領域内の全域に
わたって存在する文字に拡大する。例えば６４Ｘ６４ド
ットの領域を１文字領域とする拡大処理を行う。文字の
列方向並びに行方向の値が変換表作成モジュール］７に
おいて４８（列並びに行とも）ドノトであったならば、
４８ドノトの文字を６４ドットに変換する処理を行う。In the normalization module 16, the final value of the dot cut out by this one character in the row direction and column direction and the cut out 1
Based on the size of the character, the character is expanded to cover the entire area within the extraction area. For example, an enlargement process is performed to make an area of 64×64 dots into one character area. If the column and row values of the character are 48 (both column and row) in conversion table creation module]7, then
Performs processing to convert 48 dot characters to 64 dots.

この処理では特定位置の行や列のデータを繰り返して同
じデータとし文字を拡大する。また、縮小の場合には特
定位置の行や列を繰り返し読み出してＯＲ加算し同一行
や同一列として縮小する。In this process, data in a row or column at a specific position is repeated to make the same data and enlarge the characters. Furthermore, in the case of reduction, rows and columns at specific positions are repeatedly read out and ORed together to reduce them as the same row or column.

正規化モジュール１６によって１文字領域例えば６４Ｘ
６４ドット内に１文字が拡大された後は、細線化モジュ
ールｌ８がその文字を細線化する処理を行う。この細線
化モジュール１８では中心ドットの上下左右１ドット（
３Ｘ３）とさらにその左１ドットと中心からの上２ドッ
ト目の合計１１ドットのマスクで細線化処理を行う。ま
たこのマスクは３×３の９ドットで行うこともできる。The normalization module 16 allows one character area, for example 64X
After one character is enlarged within 64 dots, the thinning module l8 performs a process of thinning the character. In this thinning module 18, one dot above, below, left and right of the center dot (
Thinning processing is performed using a mask of a total of 11 dots: 3×3), one dot to the left, and two dots above from the center. Moreover, this mask can also be performed using 9 dots of 3×3.

前述のマスクによってあらかじめ決められたパターンで
あるときに中心ドットをＯとする制御により１回の処理
によって文字を構成するドットの１ドット分の回りの細
線化が図れる。このマスクの細線化を順次繰り返すこと
により１ドットの線による文字とすることができる。By controlling the central dot to be O when the pattern is predetermined by the mask described above, thinning of lines around one dot forming a character can be achieved in one process. By sequentially repeating this thinning of the mask, a character can be formed by a one-dot line.

細線化モジュール１８によって得られた例えば６４Ｘ６
４ドットの細線化文字は線素化モジュール１９に加わり
線素化される。この線素化モジュールでは目的のドット
すなわち中心ドットから上下方向の黒ドットが存在する
場合、ならびに左右方向に存在する場合、右上、左下に
存在する場合、さらには左上、右下に存在する場合の合
計４種類の線素によって各ドットを表す。なお上述の４
種類の内、複数に属する場合には例えば、上下方向、続
いて左右方向等の順に優先化を行い、各ドット単位でそ
の線素がどちらの方向の存在するかを求める。なお中心
がＯドットすなわち白であった場合には線は存在しない
とする。For example, 64×6 obtained by the thinning module 18
The 4-dot thin line character is added to the line element generation module 19 and converted into line elements. In this line elementization module, when black dots exist in the vertical direction from the target dot, that is, the center dot, when they exist in the horizontal direction, when they exist in the upper right and lower left, and when they exist in the upper left and lower right, Each dot is represented by a total of four types of line elements. In addition, the above 4
If the line element belongs to more than one of the types, priority is given in the order of, for example, the vertical direction, then the horizontal direction, etc., and in which direction the line element exists is determined for each dot unit. Note that if the center is an O dot, that is, white, it is assumed that no line exists.

線素化モジュール１９においては、上下、左右、右上が
り斜め、左上がり斜めの４方向さらには線素が存在しな
い場合の５種類があるので、その状態を各ドット単位で
３ビットの値で表し、合計３Ｘ６　４　Ｘ６　４の情報
とし、特徴ベクトルモジュール２０に加える。In the line segmentation module 19, there are four directions: up and down, left and right, diagonal upwards to the right, diagonally upwards to the left, and five types, including the case where there is no line element, so the state is expressed in a 3-bit value for each dot. , a total of 3×6 4×6 4 information and added to the feature vector module 20.

特徴ベクトルモジュール２０においては前述した線素化
モジュール１９で得られた線素化情報を、左右上下にそ
れぞれ８ドット単位で分割し、その分割した領域を下と
右方向に１領域づつ（２×２領域）の合計１６ドットの
領域を１ベクトルモジュール領域とし、その１ベクトル
モジュール’ｐＭ域内にいくつの上下方向、左右方向、
右上方向、左上方向の４方向の線素が存在するかをカウ
ントする。１　６Ｘ１　６ドットの領域を１ベクトルモ
ジュール領域として特徴ベクトルを求めるが、この１ベ
クトルモジュール領域は８ドット単位で移動させるので
行方向ならびに列方向に対しそれぞれ７領域であり合計
７×７の特徴ベクトルの領域となる。The feature vector module 20 divides the line segmentation information obtained by the line segmentation module 19 described above into units of 8 dots in the left, right, top, and bottom, respectively, and divides the divided regions into 8 dot units each in the downward and right directions (2× 2 areas) with a total of 16 dots is defined as one vector module area, and how many vertical, horizontal, horizontal,
It is counted whether there are line elements in four directions: upper right direction and upper left direction. A feature vector is calculated using an area of 1 6 x 1 6 dots as one vector module area, but since this one vector module area is moved in units of 8 dots, there are 7 areas in each of the row and column directions, resulting in a total of 7 x 7 feature vectors. This is the area of

特徴ベクトル化モジュール２０においては前述したＩＮ
域単位でその方向の数を求めているが、この数の求める
場合にはそれぞれ重み付けをし、中心部を高く周り部を
外にいくにしたがって低くしている。例えばその重み付
けを中心の４×４の領域の各ドットを重み４、その周り
の２ドット分の各ドットを３、さらにその周りの２ドッ
ト分の各ドットを２、さらにその回りの２ドット分の各
ドットを１とし、重み付けを行って特徴ベクトルを求め
る。In the feature vectorization module 20, the above-mentioned IN
The number of directions is calculated for each area, and when calculating this number, weighting is applied to each area, with the center being higher and the surrounding areas becoming lower as they move outward. For example, each dot in the center 4x4 area has a weight of 4, each dot in the 2 dots around it has a weight of 3, each dot in the 2 dots around it has a weight of 2, and then the 2 dots around it have a weight of 4. Each dot is set to 1, weighting is performed, and a feature vector is determined.

この特徴ベクトルは特定の認識すべき文字を正規化モジ
ュールｌ６によってすべて同じ大きさにしているので、
同一文字であるならばほぼ同一の特徴ベクトルを有し、
文字単位でその特徴ベクトルが異なってくる。しかしな
がら非常によく僚たモジュールも存在するので、本発明
の実施例においては演算の処理の高速化さらには認識率
の向上をはかるため、特徴ベクトルの標準パターンを用
いてそれぞれの特徴ベクトル化領域すなわちマス内でク
ラス分けを行い、各マス内で２０クラスの標準パターン
と、加わる未知入力との距離を求める。すなわち標準パ
ターンの各マス内の特徴ベクトルと特報ベクトルモジュ
ール２０によって得られたマス内の特徴ベクトルとの距
離をマス単位で求める。その各マスはクラス分け（クラ
ス１〜クラス２０）されており、各マス内クラスの距離
の順位を距離の小さい順に第５番目までのクラスを求め
る。This feature vector uses the normalization module l6 to make all the characters to be recognized the same size, so
If they are the same character, they have almost the same feature vector,
The feature vectors differ for each character. However, since there are some very well-developed modules, in order to speed up the calculation process and improve the recognition rate, in the embodiment of the present invention, a standard pattern of feature vectors is used for each feature vectorization region, i.e. Classification is performed within each square, and the distance between the 20 class standard patterns and the added unknown input within each square is determined. That is, the distance between the feature vector in each square of the standard pattern and the feature vector in the square obtained by the special notice vector module 20 is determined for each square. Each of the squares is divided into classes (classes 1 to 20), and the distance ranking of the classes within each square is determined in descending order of distance to the fifth class.

距離計算モジュール２１はこの距離をクラス辞書２３−
１　（標準パターンをクラス単位で記憶）を用いて演算
する。尚、個別でもその個々の候補文字に対して求める
場合には候補辞書２３−２を用いる（この時にはスイッ
チＳＷは候補辞書２３２を選択する）。The distance calculation module 21 stores this distance in the class dictionary 23-
1 (standard patterns are stored in class units). Note that when searching for individual candidate characters, the candidate dictionary 23-2 is used (at this time, the switch SW selects the candidate dictionary 232).

上位選出＆得点割当モジュール２２では前述の上位５ク
ラスを求めるとともに、各クラスに対応した得点を各マ
ス単位で決定する。すなわち上位選出＆得点割当モジュ
ール２２は距離計算モジュール２１より得られた距離か
らクラス単位で第１〜第５番目の順位の各クラスに対し
与える得点を決定し、各文字の得点を求める。例えば第
１番目の距離（短い距離）であったときには５点、その
次に４点、３，２，Ｉとクラスに対し得点を与える。こ
れはマスｌからマス４９に対応してそれぞれ設けられる
。上位選出得点モジュール２２の処理結果は総合評価モ
ジュール２４に加わる。　総合評価モジュール２４は入
力対象すなわち入力文字とその候補とが整合する度合い
を計算するモジュールであり、連想整合モード、全数整
合モード、個別整合モードの３種類の動作がある。The top selection and score allocation module 22 determines the top five classes mentioned above and determines the score corresponding to each class for each square. That is, the top selection and score assignment module 22 determines the score to be given to each of the first to fifth ranking classes in class units based on the distance obtained from the distance calculation module 21, and calculates the score of each character. For example, for the first distance (short distance), 5 points are given, then 4 points, 3, 2, I, and so on for the classes. These are provided corresponding to cells 1 to 49, respectively. The processing results of the top selection score module 22 are added to the comprehensive evaluation module 24. The comprehensive evaluation module 24 is a module that calculates the degree of matching between an input object, that is, an input character and its candidate, and has three types of operation: an associative matching mode, an exhaustive matching mode, and an individual matching mode.

連想整合モードは、連想辞書２３−３に格納されている
候補に対応したマスクとその属するクラスからその候補
の得点を計算するモードである。The associative matching mode is a mode in which the score of a candidate is calculated from the mask corresponding to the candidate stored in the associative dictionary 23-3 and the class to which the candidate belongs.

連想辞書は第２図０））の如く、各マスク毎に候補■Ｄ
をアドレスとして、その候補がそのマスクにおいて属す
るクラスのクラスＩＤを格納している。As shown in Figure 2 0)), the associative dictionary has candidates ■D for each mask.
The class ID of the class to which the candidate belongs in the mask is stored, using the address as the address.

このデータは、各候補のマスクＩＤに対応するＣｄｉｍ
次元の部分ベクトルの集合をその（重み付き）距離によ
ってクラスタリングして得られるものであり、結果だけ
が連想辞書に格納される。This data is the Cdim corresponding to each candidate's mask ID.
It is obtained by clustering a set of dimensional subvectors according to their (weighted) distances, and only the results are stored in an associative dictionary.

同時に距離計算モジュールにおけるクラス辞書２３−１
も対応して作成される。Class dictionary 23-1 in the distance calculation module at the same time
is also created correspondingly.

尚、連想辞書２３−３とクラス辞書２３−１は対応して
おり、そのＩＩ類は同じになる．２種類以上の辞書を１
つのメモリに格納する場合、使用辞書指定は辞書参照開
始位置となる。（この辞書を候補ＩＤについて分割して
、それぞれについて並列に総合評価を行うことができ、
より高速なものが要求される場合容易に実現できる）。Note that the associative dictionary 23-3 and the class dictionary 23-1 correspond, and their class II is the same. 2 or more types of dictionaries in 1
When storing in one memory, the specification of the dictionary to be used becomes the dictionary reference start position. (This dictionary can be divided into candidate IDs and a comprehensive evaluation can be performed on each in parallel.
If higher speed is required, it can be easily realized).

連想辞書２３−３は、候補ａがマスクｍで属するクラス
のクラスＩＤ：Ｋを記した表であり、これをＣ　（ｍ，
ａ）＝Ｋと表すと、候補ａ（＝１〜ｃ　　ｃａｎｄ）に
対して、で得られる。尚、ここでＰ　（ｍ，ｋ）は得点を表して
いる。この式により候補ａに対する総合評価値Ｖ　（ａ
）を得る。The associative dictionary 23-3 is a table in which the class ID: K of the class to which candidate a belongs with mask m is written, and this is written as C (m,
When expressed as a)=K, for candidate a (=1 to c cand), it is obtained as follows. Note that P (m, k) represents the score here. Based on this formula, the overall evaluation value V (a
).

総合評価モジュールの全数整合モード、個別整合モード
は各候補に対し、計算するモードであり。The total matching mode and individual matching mode of the comprehensive evaluation module are modes in which calculations are made for each candidate.

全数整合モードはａ＝１〜ｃ　　ｃａｎｄ、個別整合モ
ードはＪ−１　〜ｃ　　ｋｉｎｄ，ａ＝ｂ（ｊ）とし、
距離をｄ　（ｍ，ａ）で表しを求める。この値Ｖ　（ａ）は候補ａと入力対象との特
徴ベクトル間の（重み付き）距離である。The total matching mode is a=1 to c can, the individual matching mode is J-1 to c kind, a=b(j),
Express the distance by d (m, a). This value V (a) is the (weighted) distance between the feature vectors of candidate a and the input object.

上位候補選出モジュール２５は各文字対応での上位から
決められた複数の文字例えば５文字を選出し出力する。The top candidate selection module 25 selects and outputs a plurality of characters, for example, five characters determined from the top in each character correspondence.

この上位５文字が読みとった画像データにおける認識結
果となる。The top five characters become the recognition result in the read image data.

前述した動作は全てパイプライン処理で成されるもので
ある。すなわち画像データを記憶する画像メモリ１０内
の例えば１頁分のデータをバイブライン処理のよって読
み出し、制御モジュール１５で行単位に分割するととも
に、正規化モジュール１６に１文字単位で出力する。そ
の文字単で前述の細線化，線素化，特徴ベクトル化さら
には認識処理を行う上位選出モジュール２５は総合評価値に基づいて、候補
に順位をつけ、上位５個を選出するモジュールであり、
入力は連想全数整合モードであるならば｛　（ａ’，　
Ｖ（ａ）ｌａ’，　ａ　＝　１　〜ｃ　　ｃａｎｄを修
正したもの｝個別整数合モードであるならば（　（ｊ，　ｖ（ａＮｊ　＝　　１〜ｃ　　ｋｉｎｄ，
　ａ　＝　ｂ　（ｊ））（個別整合の総合評価出力）降／昇順＝　（文字連想：大きい順、その他：小さい１
１（資））である。また出力は入力のソート結果の順に
並んだ候補ＩＤ（または入力順序）とその総合評価値で
ある。All of the operations described above are performed by pipeline processing. That is, data for one page, for example, in the image memory 10 that stores image data is read out by vibrating processing, divided into lines by the control module 15, and outputted to the normalization module 16 in units of characters. The top selection module 25, which performs the above-mentioned thinning, line elementization, feature vectorization, and recognition processing on the single character, is a module that ranks candidates based on the comprehensive evaluation value and selects the top five.
If the input is in associative exhaustive matching mode, then { (a',
V(a)la', a = 1~c cand} If it is in the discrete integer sum mode ((j, v(aNj = 1~c kind,
a = b (j)) (Comprehensive evaluation output of individual matching) Descending/ascending order = (Character association: Largest order, Others: Smallest 1
1 (fund)). Further, the output is the candidate IDs (or input order) arranged in the order of the input sort results and their comprehensive evaluation values.

前述した第２図における本発明の実施例においては全体
のシステムから変換表作成モジュールについて説明した
。以下ではさらに詳細に変換表作成モジュールにおける
原理と詳細な回路について説明する。In the embodiment of the present invention shown in FIG. 2 described above, the conversion table creation module was explained from the perspective of the entire system. Below, the principles and detailed circuits of the conversion table creation module will be explained in more detail.

第２図における正規化モジュールｌ６が拡大や縮小処理
を実行する場合には、読み出し制御モジュール１５によ
って切り出した一文字Ｍ域内の文字の大きさを求めなく
てはならない。何故ならば本発明の実施例における認識
処理においては認識率を高めるため文字を同じ大きさに
しなくてはならないからである。このため第３図におけ
る拡大の原理説明図からも明確なようにＸ軸上の０≦Ｘ
≦Ｗの領域をＹ軸上のＯ≦Ｙ≦Ｄの領域に変更する処理
を行う。When the normalization module l6 in FIG. 2 executes enlargement or reduction processing, the size of the character within the region M of one character cut out by the readout control module 15 must be determined. This is because in the recognition process in the embodiment of the present invention, characters must be made the same size in order to increase the recognition rate. Therefore, as is clear from the diagram explaining the principle of enlargement in Figure 3, 0≦X on the X axis.
Processing is performed to change the area where ≦W to an area where O≦Y≦D on the Y axis.

以下では、先ず拡大原理から説明し、続いて変換表作成
モジュール１７の詳細な回路動作について説明する。In the following, the enlargement principle will be explained first, and then the detailed circuit operation of the conversion table creation module 17 will be explained.

Ｘ，Ｙが任意の実数値をとるものとすればＹ軸上の座標
Ｙに対応するＸ軸上のＸはＸ＝ＷｘＹ／Ｄ　　　　　　　　　　・−　−　１）で
表わされる。これを用いて拡大された図形の座標Ｉ（Ｉ
は整数、工≦１≦Ｄ）に対応する元の図形上の座標Ｘ（
Ｘは整数）Ｘ−１＜ＷＸＩ／Ｄ≦Ｘ　　　　　・・・２）を満たす
点であるものとする。この２）式を変形するとＤ（Ｘ−１）＜ＩＸＷ≦ＤＸ　　　　−−−３）が得ら
れる。従って各Ｉ（１≦■≦Ｄ）について座標Ｉの要素
を３）式を満たす元の図形の座標Ｘの要素とすることに
より、幅Ｄに変形拡大された図形が得られる。Assuming that X and Y take arbitrary real values, the X on the X axis corresponding to the coordinate Y on the Y axis is expressed as X=WxY/D·−−1). Using this, the coordinates I (I
is an integer, the coordinate on the original figure corresponding to work≦1≦D)
(X is an integer) X-1<WXI/D≦X...2). When this equation 2) is modified, D(X-1)<IXW≦DX ---3) is obtained. Therefore, by setting the element at the coordinate I for each I (1≦■≦D) to the element at the coordinate X of the original figure that satisfies equation 3), a figure deformed and enlarged to the width D can be obtained.

この考え方で入力画像データを正規化するため、先ず入
力図形に対して文字の幅Ｗを調べるとともに横方向や縦
方向のヒストグラムを作成する。すなわち、変換表を作
成する。In order to normalize input image data based on this idea, first, the character width W of the input figure is checked, and horizontal and vertical histograms are created. That is, create a conversion table.

ヒストグラムは線形であるならば画像データ上の文字の
領域に含まれる最も左の点が属数列の値を１とし、それ
より右の列は１づつ増やすことによって得られる。また
行に対しても同様に、文字の領域に加わる最も上の点が
属する行の値を１としそれより下方の行を１づつ増やし
ていくことによって得られる。この文字の幅Ｗと列の先
端並びに行の先端を変換表作成モジュール１７は求める
。If the histogram is linear, the leftmost point included in the character area on the image data is obtained by setting the value of the genus sequence to 1, and increasing the value by 1 for the columns to the right. Similarly, for lines, the value is obtained by setting the value of the line to which the uppermost point added to the character area belongs to 1 and increasing the value by 1 for the lines below it. The conversion table creation module 17 determines the width W of this character, the leading edge of the column, and the leading edge of the row.

第４図は本発明の実施例の構成図である。頌域ＲＸが読
み出し制御部１５（第２図参照）より切り出されて行単
位でヒストグラム生成回路網（ＮｘＮ）３１に入力する
。ヒストグラム生成回路網（ＮＸＮ）３　１は第５図に
示した縦方向、横方向のヒストグラムとその入力した文
字の幅を求める回路であり、縦方向のヒストグラムはバ
ッファ（ＭＸＩ）３２に行単位の値として格納される。FIG. 4 is a block diagram of an embodiment of the present invention. The numeral area RX is extracted by the readout control unit 15 (see FIG. 2) and input to the histogram generation circuit network (NxN) 31 in units of rows. Histogram generation circuit network (NXN) 3 1 is a circuit for calculating the vertical and horizontal histograms shown in FIG. Stored as a value.

また横方向のヒストグラムはヒストグラム生成回路３１
から正規化回路網３４に図示しないが直接加わる。Also, the horizontal histogram is generated by the histogram generation circuit 31.
The normalization circuitry 34 is directly connected to the normalization circuitry 34 (not shown).

ヒストグラム生成回路網３１はＮＸＮのシストリックア
レイ構造を有しており、ヒストグラム生成回路綱３１を
通過した画像データ（画像データに変化はない）はバッ
ファ（ＮｘＮ）３３に格納される。すなわち画像ＲＸが
最終的にはバッファ３３に格納される。尚第２図におい
ては読み出し制御モジュール１５の出力は正規化回路網
１６にも加わるので、この場合にはこのバッファ３３は
必要ではなく正規化モジュール１６内に設けても良い。The histogram generation circuit 31 has an NXN systolic array structure, and the image data (the image data remains unchanged) that has passed through the histogram generation circuit 31 is stored in a buffer (NxN) 33. That is, the image RX is finally stored in the buffer 33. In FIG. 2, the output of the read control module 15 is also applied to the normalization circuit network 16, so in this case, the buffer 33 is not necessary and may be provided within the normalization module 16.

このヒストグラム生成回路網３１によって得られた縦方
向並びに横方向のヒストグラムは正規化回路モジュール
１６に加わり、このヒストグラムをもとに正規化モジュ
ール１６は動作する。尚正規化回路網３４は横方向（列
単位）の正規化を行う回路である。縦方向（行単位）の
正規化はバッファ３３　（ＭｘＮ）からのデータ読み込
み用セル構造回路３５によるドット行単位の呼び出しに
よって正規化している。すなわち縦方向の正規化を行い
ながら横方向の正規化に必要な計算をし、この値と入力
データを正規化回路綱３４に出力する。The vertical and horizontal histograms obtained by this histogram generation circuitry 31 are applied to the normalization circuit module 16, and the normalization module 16 operates based on this histogram. Note that the normalization circuit network 34 is a circuit that performs horizontal direction (column unit) normalization. The normalization in the vertical direction (in units of rows) is performed by calling the data in units of dot rows by the cell structure circuit 35 for reading data from the buffer 33 (MxN). That is, calculations necessary for horizontal normalization are performed while performing vertical normalization, and this value and input data are output to the normalization circuit 34.

データ読み込み用セル構造回路綱３５は縦方向を正規化
する回路であり、データ読み込み用セル回路網３５と正
規化回路綱３４の動作が始まる時刻をｔ＝１とするなら
ば、縦方向の正規化を行うため、時刻Ｌで正規化後の図
形のｔ行目に対応する行を読ｂ込む。The data reading cell structure circuitry 35 is a circuit that normalizes the vertical direction.If the time when the data reading cell circuitry 35 and the normalization circuitry 34 start operating is t=1, then the vertical normalization In order to perform the normalization, at time L, a line corresponding to the t-th line of the figure after normalization is read.

Ｄ−ｈ２（ｉ’−１）＜ｔ　Ｌ　　≦　Ｄ　−　ｈ２（
ｉ’　）　　　・　・　・４〕が成り立つよう、入力画
像の行ｉ″を時刻ｔで読み取る。換言するならば、ヒス
トグラムの値が４）式を満足するまで縦方向（行の読み
出し順方向）のヒストグラムと入力画像を読み込めば、
縦方向の正規化が行える。これはｗｈｉｌｅ　（　ｔ　Ｌ　＞　Ｄ−ｈ２（ｉ’）　＆＆
　ｉ’＜Ｍ　）ｒｅａｄ　ｄａｔａ　＆＆　ｈｉｓｔｏ
ｇｒａｍ；なる処理をセルが行うことで実現できる。ま
た、横方向の正規化を行う時も３）式を満足する処理を
行えば良い。横方向のヒストグラムの値および横方向の
文字幅Ｗ呼び込み、１列のセルでは’Ｄ　−　ｈＨｊ−
１）’　，　　“ｊＷ”，　　“Ｄ　−　ｈｌ（ｊ）”
を計算し、さらにＤ−ｈｌ（ｊ”）＜ｊ　Ｗ　　　ならば　ｊ→ｊ−１，
ｊり≦Ｄ−ｈｌ（ｊ’　−１）　　ならば　ｊ−＋ｊ＋
１と変換していくことで行う。尚ｊは正規化の図形にお
ける列、ｊ′は入力図形のおける列を表している。D-h2(i'-1)<t L ≦ D-h2(
i' ) ・・・・4] is read at time t so that row i'' of the input image is satisfied.In other words, read the row i'' of the input image at time t so that the histogram value satisfies equation 4) in the vertical direction (line reading forward direction). If you load the histogram and input image,
Vertical normalization can be performed. This is while (t L >D-h2(i')&&
i'<M) read data && histo
This can be achieved by the cell performing the following process. Also, when normalizing in the horizontal direction, processing that satisfies equation 3) may be performed. The value of the horizontal histogram and the horizontal character width W are called in. 'D − hHj−
1)', "jW", "D - hl(j)"
If D−hl(j”)<j W, then j→j−1,
If jri≦D−hl(j' −1), then j−+j+
This is done by converting to 1. Note that j represents a column in the normalization figure, and j' represents a column in the input figure.

このような正規化回路網３４の動作、ならびにデータ読
み込み用セル回路網３５の動作により、横方向と縦方向
の正規化がなされ、得られる図形はＤＸＤの正規化され
た図形になる。By the operation of the normalization circuit network 34 and the data reading cell circuit network 35, horizontal and vertical normalization is performed, and the resulting figure becomes a DXD normalized figure.

前述の正規化を行うためには入力文字のヒストグラムを
必要とする。以下ではこのヒストグラム生成についてさ
らに詳細に説明する。In order to perform the above-mentioned normalization, a histogram of input characters is required. This histogram generation will be explained in more detail below.

第６図は本発明の実施例のヒストグラム生成回路網図で
ある。各セルＨ（１．１）〜Ｈ　（Ｎ，Ｎ）はヒストグ
ラム計算セルあるいはシフトレジスタより成る。第４図
に示した如＜ＭＸＭの入カデータＲＸがヒストグラム計
算セルＨ（Ｎ，１）とシフトレジスタＨ（Ｎ，２）〜Ｈ
　（Ｎ，Ｎ）に１行ドット単位で加わる。そしてそのヒ
ストグラム計算セルＨ（Ｎ，１）の出力はシフトレジス
タＨ（Ｎ−１．１）にまたシフトレジスタＨ（Ｎ，２）
の出力はヒストグラム計算セルＨ　（Ｎ−１．２）に加
わる。さらにシフトレジスタＨ（Ｎ．３）〜Ｈ　（Ｎ，
Ｎ）の出力はシフトレジスタ（Ｎ−１、３）〜Ｈ　（Ｎ
−１，Ｎ）に加わる。すなわち、第１番目においては行
の左端にヒストグラム計算セルを、次の段においては二
番目のドット位置、また３段目においては３ドット目の
位置に順次ヒストグラム計算セルを設け、それぞれ１番
目から２番目、３番目へとそのデータを出力する構造と
している。さらに換言するならばシフトレジスタをそれ
ぞれド７｝対応でセルＨ（Ｎ，１）から順次セルＨ（１
．１）までをシフトレジスタで構成し、その時セルＨ（
Ｎ，１）をヒストグラム計算セルとし、２番目において
は２番目のセルＨ　（Ｎ−１．２）をヒストグラム計算
セルとし、３番目は同様に３個目の位置に、そして順々
に最終的にはセルＨ　（１，Ｎ）にヒストグラム計算セ
ルを設けそれぞれのドット位置単位で左から次のヒスト
グラム計算セルの位置に結果を出力する構造としている
。FIG. 6 is a histogram generation circuit diagram according to an embodiment of the present invention. Each cell H(1.1) to H(N,N) consists of a histogram calculation cell or a shift register. As shown in FIG.
Add dots to (N, N) in one line. The output of the histogram calculation cell H (N, 1) is sent to the shift register H (N-1.1) and also to the shift register H (N, 2).
The output of is applied to the histogram calculation cell H (N-1.2). Furthermore, shift registers H (N.3) to H (N,
The output of N) is the shift register (N-1, 3) to H (N
-1, N). That is, in the first row, the histogram calculation cell is placed at the left end of the row, in the next row, the histogram calculation cell is placed at the second dot position, and in the third row, the histogram calculation cell is placed at the third dot position, and the cells are placed sequentially from the first to the third dot position. The structure is such that the data is output to the second and third nodes. In other words, the shift registers are sequentially moved from cell H(N, 1) to cell H(1) corresponding to
．． Up to 1) are configured with shift registers, and at that time cell H (
N, 1) is the histogram calculation cell, in the second the second cell H (N-1.2) is the histogram calculation cell, in the third the same way, in the third position, and in turn the final A histogram calculation cell is provided in cell H (1, N), and the result is output to the next histogram calculation cell position from the left for each dot position.

このそれぞれのヒストグラム計算セル並びにシフトレジ
スタは１クロノクに対応してデータを次の段のシフトレ
ジスタやヒストグラム計算セルに出力する。尚シフトレ
ジスタは送られてきたデータを１クロックを遅延するも
のである。Each histogram calculation cell and shift register outputs data to the next stage shift register or histogram calculation cell in correspondence with one clock. Note that the shift register delays the sent data by one clock.

以下ではさらに詳細に動作を説明する。ヒストダラム生
成回路網の動作の始まる時刻ｔをｔ＝１とし、エクロッ
クごとにｔが１づつ増えていくものとするならば、時刻
ｔでの各セルでの動作は次のようになる。The operation will be explained in more detail below. Assuming that the time t at which the operation of the histodaram generation circuit network begins is t=1, and t is incremented by 1 for each ex-clock, the operation of each cell at time t is as follows.

■　ｔ≦Ｍのとき、セル（Ｎ，１）〜セル（Ｎ．Ｎ）は
入力データのｔ行目を読み込む。Ｍ＜ｔのとき，セル（
Ｎ，１）〜セル（Ｎ，Ｎ）は０を読み込む。(2) When t≦M, cells (N, 1) to (N.N) read the t-th row of input data. When M<t, the cell (
N, 1) to cell (N, N) read 0.

■　ヒストグラム計算セル（ｉ，Ｎ−ｉ＋１）はセル（
ｉ＋１，Ｎ−ｉ）から送られてきた演算結果とセル（ｉ
＋１，Ｎ−ｉ＋１）から送られてきたデータを用い、後
述のセルの動作に従って処理を行う。■ Histogram calculation cell (i, N-i+1) is cell (
The calculation results sent from cell (i+1,N-i) and cell (i
+1, N-i+1), processing is performed according to the operation of the cell, which will be described later.

■　ヒストグラム計算セル（ｉ，Ｎ−ｉ＋１）は、演算
結果をセル（ｉ−１，Ｎ−ｉ＋２）に送る。(2) Histogram calculation cell (i, N-i+1) sends the calculation result to cell (i-1, N-i+2).

また、ｌ〜Ｎ−１行のすべてのセル（ｉ．ｊ）は、セル
（ｉ＋１，ｊ）から送られてきたデータをそのままセル
（ｉ−１，ｊ）に送る。Furthermore, all cells (i.j) in rows 1 to N-1 send the data sent from cell (i+1, j) to cell (i-1, j) as they are.

但し、Ｎ≦ｔのときセル（１，Ｎ）の演算結果は入力デ
ータの横方向のヒストグラム（（Ｎ−ｔ＋１）行目）の
値（ｘｓｖｉｄｔｈ　，　ｘｃｏｕｎυとなるから、（
ＭＸＩ）のバンファ３２に格納していく。また、セル（
１，ｊ）（０≦ｊ≦１）のデータ（ｙｗｉｄｔｈ，ｘｃ
ｏｕｎｔ）は（ＭｘＮ）のバッファに送られる。However, when N≦t, the calculation result of cell (1, N) is the value (xsvidth, xcounυ) of the horizontal histogram ((N-t+1)th row) of the input data, so (
MXI) is stored in the buffer 32. Also, the cell (
1, j) (0≦j≦1) data (ywidth, xc
ount) is sent to (MxN) buffers.

以上の動作はＭ＋Ｎ−１クロックで全て完了し、Ｊ行目
のヒストグラム計算セルに入力データのＪ行目（横方向
）のヒストグラムの値が格納され、（ＭＸＩ）のバッフ
ァの■行目に入力データの■行目（縦方向）のヒストグ
ラムの値が格納されることになる。All the above operations are completed in M+N-1 clocks, and the histogram value of the J-th row (horizontal direction) of the input data is stored in the histogram calculation cell of the J-th row, and is input to the ■ row of the (MXI) buffer. The histogram value of the ■th row (vertical direction) of the data will be stored.

セルを第６図のように配置すれば、たとえば入力図形の
１行目のデータがセル（Ｎ，１）で処理され、時刻（Ｉ
＋１）で処理結果とその２行目のデータがセル（Ｎ−１
．２）で処理されるという具合に同じ行のデータが順に
出会っていくので縦方向と横方向のヒストグラムが同時
に作れる。尚セルを第６図のように配置するかわりに１
次限に配置されたヒストグラム計算セルを用い、図７に
示すようにデータを１クロックづつ別単位で遅延させて
入力してもよい。前述では横方向、縦方向の動作につい
て説明したが以下では前述のヒストグラム計算セルの動
作についてさらに詳細に説明する。If the cells are arranged as shown in Figure 6, for example, the data in the first row of the input figure will be processed in cell (N, 1), and the time (I
+1), the processing result and its second row data are stored in cell (N-1)
．． As data from the same row is encountered in sequence in step 2), vertical and horizontal histograms can be created at the same time. Note that instead of arranging the cells as shown in Figure 6,
Using a histogram calculation cell placed at the next limit, data may be input after being delayed in units of one clock at a time, as shown in FIG. Although the operations in the horizontal and vertical directions have been described above, the operations of the histogram calculation cell described above will be explained in more detail below.

先ず線形の場合のヒストグラム計算セルの横方向のヒス
トグラムについて説明する。横方向のヒストグラムは、
前述したように、画像データ上で文字の領域に含まれる
最も左の点が属する列の値を１とし、それより右の列は
１づつ値を増やしていくことで得られる。従って、入力
画像のある列を走査したとき、もしその列に黒画像が含
まれていて、しかもその列より左の列には黒画像が含ま
れていないとき、その列のヒストグラムの値を１とし、
その列より右の列は、ヒストグラムの値を１ずつ増やす
。ｊ列目のセルはまず（ｊ−１）列目のセルの値を判別
し、それが１以上であればその値に１を加えたものを自
分のセルの値とする。First, a horizontal histogram of a histogram calculation cell in a linear case will be explained. The horizontal histogram is
As described above, the value is obtained by setting the value of the column to which the leftmost point included in the character area belongs to 1 on the image data, and increasing the value by 1 in the columns to the right. Therefore, when scanning a certain column of the input image, if that column contains a black image, and the column to the left of that column does not contain any black images, the value of the histogram of that column is set to 1. year,
In the columns to the right of that column, the histogram values are increased by 1. The cell in the j-th column first determines the value of the cell in the (j-1)th column, and if it is greater than or equal to 1, the value obtained by adding 1 to that value is set as the value of its own cell.

もし（ｊ−１）列目のセルの値がＯであれば、ｊ列目に
黒画素があるか否か調べ、あればセルの値を１にし、な
ければセルの値を０にする。各セルに前述の処理を設け
、入力画像に対し１行目から順に処理すれば、最終的に
横方向のヒストグラムが得られる。If the value of the cell in the (j-1)th column is O, it is checked whether there is a black pixel in the jth column, and if so, the cell value is set to 1, and if not, the cell value is set to 0. By applying the above-described processing to each cell and sequentially processing the input image from the first row, a horizontal histogram is finally obtained.

また、文字の幅は、入力画像データ上の最も右の黒画素
の存在する列のヒストグラムの値に等しく、これを求め
ればよい。Further, the width of the character is equal to the value of the histogram of the column in which the rightmost black pixel exists on the input image data, and this can be calculated.

なお、縦方向のヒストグラムも、全く同様の考え方で得
ることができる。但し、横方向のときに空間的に分布さ
せていた機能を時間的に配置することになる。Note that a vertical histogram can also be obtained using exactly the same concept. However, the functions that were distributed spatially in the horizontal direction will now be distributed temporally.

第８図は線形におけるヒストグラム計算セルの詳細な構
成図である。ｆｌａｇは黒画素があったかどうかを判定
するフラグであり、ｃｏｕｎｔはヒストグラムの値を格
納し、Ｗｉｄｔｈは文字幅の値を格納する。　１ｘ１　
　１ｙ１　は縦方向か横方向かを表している。また、ｙ
ｆｌａｇ　ｘｉｕｉｄｔｈ，ｘｃｏｕｎｔは、左の列の
セルの値を用いて処理して結果を右のセルに送り、ｙｗ
ｉ６ｔｈ，ｙｃｏｕｎｔ，ｘｆａｌｇは自分のセルの値
を用いて処理して結果を自分のセルに格納する。そして
ｘｆｌａｇ，ｘｃｏｕｎｔ，ｘｗｉｄｔｈ並びにｙｆｌ
ａｇ，ｙｃｏｕｎｔ，ｙｗｉ６ｔｈはそれぞれ以下の式
によって決定される。FIG. 8 is a detailed configuration diagram of a linear histogram calculation cell. flag is a flag for determining whether there is a black pixel, count stores a histogram value, and Width stores a character width value. 1x1
1y1 indicates whether the direction is vertical or horizontal. Also, y
flag xiuidth, xcount is processed using the value of the cell in the left column and sends the result to the right cell, yw
i6th, ycount, and xfalg process using the values of their own cells and store the results in their own cells. and xflag, xcount, xwidth and yfl
ag, ycount, and ywi6th are each determined by the following formulas.

横方向ｘｆｌａｇ　＝　ｉｆ　ｄａｔａ　＝＝１　ｔｈｅｎ　
１ｅｌｓｅ　ｘｆｌａｇｘｃｏｕｎｔ　　＝　　ｉｆ　　ｘｃｏｕｎｔ＞Ｏ　　
ｔｈｅｎ　　ｘｃｏｕｎｔ＋１ｅｌｓｅ　ｉｆ　ｘｆｌ
ａｇ−＝１　ｏｒ　ｄａｔａ＝＝Ｉ　ｔｈｅｎ　１ｅｌ
ｓｅ　ｘｃｏｕｎｔｘｗｉｄｔｈ　＝　ｉｆ　ｘｆｌａｇ＝＝１　ｏｒ　ｄ
ａｔａ＝＝ｌ　ｔｈｅｎ　ｘｃｏｕｎｔ＋１ｅｌｓｅ　
ｘｗｉｄｔｈ縦方向ｙｆｌａｇ　＝　ｉｆ　ｄａｔａ＝＝１　ｔｈｅｎ　１
ｅｌｓｅ　ｙｆｌａｇｙｃｏｕｎｔ　＝　ｉｆ　ｙｃｏｕｎｔ＞Ｏ　ｔｈｅｎ
　ｙｃｏｕｎｔ＋１ｅｌｓｅ　ｉｆ　ｙｆｌａｇ＝＝Ｉ
　ｏｒ　ｄａｔａ＝＝ｌ　ｔｈｅｎ　１ｅｌｓｅ　ｙｃ
ｏｕｎｔｙｗｉｄｔｈ　＝　ｉｆ　ｙｆｌａｇ＝＝１　ｏｒ　ｄ
ａｔａ＝＝１　ｔｈｅｎ　ｙｃｏｕｎｔ＋１ｅｌｓｅ　
ｙｗｉｄｔｈこのようなセルを用いて処理を行うと、最終的に、横力
同のヒストグラムは各セル内のｘｃｏｕｎｅｆ．，横方
向の文字幅はセル（１，Ｎ）のｘｗｉｄｔｈに、縦方向
の文字幅はセル（１，Ｎ）のｙｗｉｄｔｈに格納され、
縦方向のヒストグラムは、時刻Ｎ以降セル（１，Ｎ）の
ｙｃｏｕｎｔとして順に出力されることになる。Horizontal xflag = if data ==1 then
1else xflag xcount = if xcount>O
then xcount+1else if xfl
ag-=1 or data==I then 1el
se xcount xwidth = if xflag==1 or d
ata==l then xcount+1else
xwidth vertical direction yflag = if data==1 then 1
else yflag ycount = if ycount>O then
ycount+1 else if yflag==I
or data==l then 1else yc
ount ywidth = if yflag==1 or d
ata==1 then ycount+1else
ywidth When processing is performed using such cells, the histogram of the lateral force is finally created by xcounef. , the horizontal character width is stored in xwidth of cell (1, N), the vertical character width is stored in ywidth of cell (1, N),
The vertical histogram is sequentially output as the ycount of the cell (1, N) after time N.

次に非線形の場合のヒストグラム計算セルの横方向のヒ
ストグラムについて説明する。非線形の場合は、各列ご
とに、黒画素を横切る回数を数え、それを積算していく
。従って、ｊ列目のセルは、ｊ列内で黒画素を横切った
回数を計算し、その値を積算する。Next, the horizontal histogram of the histogram calculation cell in the nonlinear case will be explained. In the case of non-linearity, the number of times black pixels are crossed for each column is counted and the numbers are integrated. Therefore, the cell in the j-th column calculates the number of times a black pixel is crossed within the j-th column, and integrates the values.

第９図は非線形におけるヒストグラム計算セルの詳細な
構成図である。ｆｌａｇは黒画素を横切っている途中で
あるかどうかを判定するフラグであり、ｃｏｕｎｔはヒ
ストグラムの値を格納する。ｓｔａｃｋはその行でヒス
トグラムに加算された値の積算値を格納する。　゛Ｘ゛
ｙ′　は縦方向か横方向かを表している。また、ｙｆａ
ｌｇ，ｙｃｏｕｎｔ，ｘｓｔａｃｋは、左の列のセルの
値を用いて処理して結果を右のセルに送り、ｙｓｔａｃ
ｋ，ｘｃｏｕｎｔ，ｘｆｌａｇは自分のセルの値を用い
て処理して結果を自分のセルに格納する。FIG. 9 is a detailed configuration diagram of a nonlinear histogram calculation cell. flag is a flag that determines whether or not a black pixel is being crossed, and count stores the value of the histogram. The stack stores the integrated value of the values added to the histogram in that row.゛X゛y' represents the vertical direction or the horizontal direction. Also, yfa
lg, ycount, xstack process using the value of the cell in the left column and send the result to the right cell, and ystack
k, xcount, and xflag are processed using the values of their own cells, and the results are stored in their own cells.

そして、ｘｆｌａｇ，ｘｃｏｕｎｔ，ｘｓｔａｃｋ，並
びにｙｆｌａｇ，ｙｃｏｕｎｔ　，　ｙｓｔａｃｋはそ
れぞれ以下の弐によって決定される。Then, xflag, xcount, xstack, and yflag, ycount, ystack are each determined by the following two.

横方向ｘｆｌａｇ　＝　ｉｆ　ｄａｔａ＝＝ｏ　ｔｈｅｎ　Ｏ
ｅｌｓｅ　１ｉｆ　ｄａｔａ＝＝１　ａｎｄ　ｘｆｌａｇ＝＝Ｏｔｈ
ｅｎ　　ｘｃｏｕｎｔ＋ｘｓｔａｃｋ＋１ｅｌｓｅ　ｘ
ｃｏｕｎｔ＋ｘｓｔａｃｋｘｓｔａｃｋ　＝　ｉｆ　ｄ
ａｔａ−＝１　ａｎｄ　ｘｆｌａｇ＝＝Ｏ　ｔｈｅｎ　
ｘｓｔａｃｋ＋１ｅｌｓｅ　ｘｓｔａｃｋｘｃｏｕｎｔ縦方向ｙｆｌａｇ　＝　ｉｆ　ｄａｔａ＝＝Ｏ　ｔｈｅｎ　Ｏ
ｅｌｓｅ　１ｙｃｏｕｎｔ　＝　ｉｆ　ｄａｔａ＝＝Ｉ　ａｎｄ　ｙ
ｆｌａｇ＝司ｔｈｅｎ　　ｙｃｏｕｎｔ＋ｙｓｔａｃｋ
＋１ｅｌｓｅ　ｙｃｏｕｎｔ＋ｙｓｔａｃｋ＋１ｙｓｔ
ａｃｋ　＝　ｉｆ　ｄａｔａ＝＝１　ａｎｄ　ｙｆｌａ
ｇ＝＝ｏ　ｔｈｅｎ　ｙｓｔａｃｋ＋１ｅｌｓｅ　ｙｓ
ｔａｃｋこのようなセルを用いて処理を行うと、最終的に、横方
向のヒストグラムは第１０図の例に示すごとくに、横方
向の文字幅はセル（１，Ｎ）のＸＣｏｕｎｔに、縦方向
の文字幅はセル（１．Ｎ）のｙｃｏｕｎｔに格納され、
縦方向のヒストグラムは、時刻Ｎ以降セル（１，Ｎ）の
ｙｃｏｕｎｔとして順に出力されることになる。Horizontal xflag = if data = = o then O
else 1 if data==1 and xflag==Oth
en xcount+xstack+1else x
count+xstackxstack = if d
ata-=1 and xflag==O then
xstack+1else xstack xcount Vertical direction yflag = if data==O then O
else 1 ycount = if data==I and y
flag=ycount+ystack
+1else ycount+ystack+1yst
ack=if data==1 and yfla
g==o then ystack+1 else ys
tack When processing is performed using such a cell, the horizontal histogram will eventually look like the example in Figure 10, where the character width in the horizontal direction is equal to the XCount of cell (1, N), and the character width in the vertical direction is The character width of is stored in ycount of cell (1.N),
The vertical histogram is sequentially output as the ycount of the cell (1, N) after time N.

以上のようにヒストグラム計算セルを構成することによ
り、線形、非線形のヒストグラムである変換表を得るこ
とができる。By configuring the histogram calculation cell as described above, a conversion table that is a linear or nonlinear histogram can be obtained.

〔Effect of the invention〕

以上述べたように本発明によればヒストグラムヲハイブ
ライン構造で並列に処理するのでその変換処理を高速化
することができる。このヒストグラムを高速で求めるの
で、正規化処理を同時に高速化することができる。また
その結果、高速の文字等の認識装置を行うことができる
。As described above, according to the present invention, since the histograms are processed in parallel in a high line structure, the conversion process can be speeded up. Since this histogram is obtained at high speed, the normalization process can be speeded up at the same time. As a result, a high-speed character recognition device can be realized.

[Brief explanation of drawings]

第１図は本発明の原理ブロソク図、第２図は本発明の実施例の構成図、第３図は拡大の原理説明図、第４図は本発明の実施例の全体の構成図、第５図は縦方
向と横方向のヒストグラム図、第６図は本発明のヒスト
グラム計算回路網図、第７図はデータの流れ図、第８図はヒストグラム計算セル（ｔｉＩ形）図、第９図
は横方向のヒストグラム（非線形）図、第１０図はヒス
トグラム計算セル（非線形）図である。Ｈ（１．１）〜Ｈ（Ｎ　］）Ｈ（Ｎ、１）〜Ｈ（Ｎ，Ｎ）・・・・・セル、Ｈ（Ｎ，
１）．　Ｈ（Ｎ〜１，２）・・・Ｈ（１，Ｎ）・・・・
・ヒストグラム計算セル．Fig. 1 is a diagram of the principle of the present invention, Fig. 2 is a block diagram of an embodiment of the present invention, Fig. 3 is a diagram explaining the principle of enlargement, Fig. 4 is a block diagram of the entire embodiment of the present invention, Figure 5 is a vertical and horizontal histogram diagram, Figure 6 is a histogram calculation circuit diagram of the present invention, Figure 7 is a data flow diagram, Figure 8 is a histogram calculation cell (tiI type) diagram, and Figure 9 is a diagram of a histogram calculation cell (tiI type). A horizontal histogram (nonlinear) diagram, FIG. 10 is a histogram calculation cell (nonlinear) diagram. H(1.1)~H(N]) H(N,1)~H(N,N)...Cell, H(N,
1). H(N~1,2)...H(1,N)...
・Histogram calculation cell.

Claims

[Claims] 1) N cells (H(1,1) to H(N,1)...H
(1, N) to H(N, N)) are connected in series, and the m-th cell from the input side of the m-th column shifts and outputs the input data, and A histogram that updates the width data and count data from the data and the width data and count data added from the cell in the m-1th column and outputs it from the input side in the m+1st column to the m+1th cell. A conversion table creation circuit using a systolic array, characterized in that the cells are calculation cells, and the cells other than the histogram calculation cells are shift registers. 2) The histogram calculation cell has a flag register, which sets the flag to 1 when the added input data is 1, holds the current flag value when it is 0, and increments the input count data when the count value is greater than 0. Then, when the flag is 1 or the input data is 1, the count data is set to 1, and at other times, the count value is updated and output as is, and when the flag is 1 or the input data is 1, the count data is set to 1. 2. The conversion table creation circuit using a systolic array according to claim 1, wherein the circuit adds 1 to the width data and outputs the width data which is input as the same value at other times. 3) The input data is character data, and the dots in one character area are added to the N series-connected inputs in units of one dot row, and the Nth cell from the input side of the Nth column is a horizontal histogram. 2. A conversion table creation circuit using a systolic array according to claim 1, wherein the circuit outputs the following. 4) N cells (H(1,1) to H(N,1)...
N columns of series circuits in which H(1,N) to H(N,N)) are connected in series are provided, and the m-th cell from the input side of the m-th column is a shift register that shifts input data and outputs it. It has a width register and a counter, and when the input data is 1, the flag data is set to 1, and when it is 0, it is set to m-1 from the input side of the m-1th column.
The flag data added from the cell in the row is outputted as output flag data to the m+1 cell from the input side in the m+1 column, and when the counter value is greater than 0, the counter is incremented by one.
When the input flag data is 1 or the input data is 1, the counter is set to 1, otherwise the counter is not changed, and the flag data is added from the m-1st cell from the input side of the m-1th column. It is a histogram calculation cell that adds 1 to the counter value and stores it in the width register when is 1 or the input data is 1, and holds the value of the width register at other times, and otherwise shifts the input data and outputs it. A conversion table creation circuit using a systolic array characterized by being a shift register. 5) The input data is character data, and the dots in one character area are added to the N series-connected inputs in units of one dot row, and the histogram calculation cell calculates a vertical histogram using the width register and counter, respectively. 5. A conversion table creation circuit using a systolyre array according to claim 4.