JPH0782522B2

JPH0782522B2 - Document reader

Info

Publication number: JPH0782522B2
Application number: JP60047678A
Authority: JP
Inventors: 啓二小林; 元南部
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1985-03-11
Filing date: 1985-03-11
Publication date: 1995-09-06
Anticipated expiration: 2010-09-06
Also published as: JPS61206087A

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は文字を含む文書から文字を認識して読取る文
書読取装置に係り、特に文書の読取りフォーマットを登
録する機構に関するものである。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document reading device for recognizing and reading characters from a document containing characters, and more particularly to a mechanism for registering a document reading format.

[Conventional technology]

従来この種の文書読取装置としては、いわゆる光学的文
字読取装置といわれるものに、第６図に示すものがあ
る。第６図は従来の文書読取装置を示すブロック構成図
である。同図に示すようにこの装置は、文書，すなわち
用紙１上に記入された文字を走査手段２で走査して光電
変換し、１行分の画像，すなわちイメージデータを行記
憶手段３に格納し、さらに１行分のイメージデータから
文字切出し手段４で１文字分の文字パターンを切出し、
切出された文字パターンから特徴を抽出し、上記抽出さ
れた特徴値と文字認識手段５内に格納されている認識基
準パターンの特徴値とを比較して認識し、認識結果とし
て文字コードを出力していた。また、上記処理を行なう
場合、事前に切出すべき位置や認識対象字種などを示す
読取りに必要な情報すなわち、読取フォーマットを入力
手段６から入力し、制御手段７の読取フォーマットテー
ブルに格納し、これにもとづき処理を行なっていた。す
なわち、用紙１上に記入される文字の位置や認識対象字
種を示す認識に必要な読取りフォーマットを事前に文書
に対応させて設定しておき、この読取フォーマットに従
って文字を切出して認識を行なっていた。As a conventional document reading device of this type, a so-called optical character reading device is shown in FIG. FIG. 6 is a block diagram showing a conventional document reading device. As shown in the figure, this device scans a document, that is, characters written on a sheet 1 by a scanning means 2 and photoelectrically converts them, and stores an image for one line, that is, image data in a row storage means 3. Further, a character pattern for one character is cut out from the image data for one line by the character cutting means 4,
A feature is extracted from the cut-out character pattern, the extracted feature value is compared with the feature value of the recognition reference pattern stored in the character recognizing means 5 to be recognized, and a character code is output as a recognition result. Was. In the case of performing the above processing, the information necessary for reading indicating the position to be cut out and the character type to be recognized, that is, the reading format is input from the input unit 6 and stored in the reading format table of the control unit 7. Processing was performed based on this. That is, the reading format required for recognition indicating the position of the characters to be written on the paper 1 and the character type to be recognized is set in advance in correspondence with the document, and the characters are cut out according to this reading format for recognition. It was

[Problems to be solved by the invention]

ところで、この種装置が普及するに従って、文書作成時
に即座に文書を読取る用途が増え、また文書の種類が多
様化してきたため、極端な場合文書１枚１枚の読取りフ
ォーマットが異なる場合が増えてきた。しかしながら、
事前に読取り領域や認識対象字種を指示する上記従来装
置によると、多様化する文書に対応してそれぞれの読取
りフォーマットを入力するには膨大な時間が掛かり、文
書の多様化に対応することができないという問題点を有
していた。By the way, with the spread of this type of device, the applications for reading a document immediately at the time of creating a document have increased, and the types of documents have become diversified. Therefore, in extreme cases, the reading format of each document has become different. . However,
According to the above-mentioned conventional apparatus that specifies the reading area and the character type to be recognized in advance, it takes a huge amount of time to input each reading format corresponding to diversifying documents, and it is possible to cope with diversifying documents. It had a problem that it could not be done.

この発明は、上記のような問題点を解消するためになさ
れたもので、その場で容易に読取フォーマットが入力で
き、多様化した文書に対応できる文書読取装置を得るこ
とを目的とする。The present invention has been made to solve the above problems, and an object of the present invention is to provide a document reading device that can easily input a reading format on the spot and can cope with diversified documents.

[Means for solving problems]

この発明に係る文書読取装置は、入力された文書の画像
を記憶する記憶手段と、この記憶手段に記憶された画像
を表示する表示手段と、この表示手段で表示された画像
の認識すべき文字領域とこの文字領域に表示されている
文字の字種等の属性とを選択する入力手段と、この入力
手段により選択された上記文字領域と上記属性とを格納
する格納手段と、この格納手段に格納されている上記選
択された文字領域の文字を切出す文字切出し手段と、こ
の文字切出し手段により切出された文字を上記格納手段
に格納されている上記選択された属性に基づき認識する
認識手段とを備えたものである。A document reading apparatus according to the present invention includes storage means for storing an image of an input document, display means for displaying the image stored in the storage means, and characters to be recognized in the image displayed by the display means. Input means for selecting an area and an attribute such as a character type of a character displayed in the character area, storage means for storing the character area and the attribute selected by the input means, and the storage means Character cutout means for cutting out the stored character in the selected character area, and recognition means for recognizing the character cut out by the character cutout means based on the selected attribute stored in the storage means. It is equipped with and.

[Action]

この発明においては、記憶手段に文書全体の画像が一旦
記憶され、この内容が表示手段に表示される。表示手段
で表示された画像の認識すべき文字領域とこの文字領域
に表示されている文字の字種等の属性とを入力手段によ
り選択し、格納手段は選択された文字領域と属性とを格
納し、文字切出し手段は格納されている選択された文字
領域の文字を切出し、認識手段は格納されている選択さ
れた属性に基づき文字を認識する。In the present invention, the image of the entire document is temporarily stored in the storage means, and the content is displayed on the display means. The character area to be recognized in the image displayed by the display means and the attribute such as the character type of the character displayed in this character area are selected by the input means, and the storage means stores the selected character area and attribute. Then, the character cut-out means cuts out the character in the stored selected character area, and the recognition means recognizes the character based on the stored selected attribute.

〔Example〕

以下、この発明の実施例を図示して説明する。 Hereinafter, embodiments of the present invention will be illustrated and described.

第１図はこの発明の文書読取装置の一実施例を示すブロ
ック構成図である。なお、第６図従来例と同一符号のも
のは同一構成要素を示しており、その説明は省略する。
同図において、８は文書全体の画像，すなわち１ページ
分のイメージデータを記憶するページ記憶手段、９は上
記ページ記憶手段８に記憶された全イメージデータを表
示する表示手段、10は上記表示手段の表示画面を見なが
ら、読取り領域の位置及びこの読取り領域内の文字を認
識するに必要な情報，すなわち読取りフォーマットを指
示入力する入力手段であり、ここではキーボードを用い
ている。一方、11は上記入力手段で入力された情報を読
取りフォーマットテーブルに格納し、文書の読取りフォ
ーマットを制御する制御手段、12は上記ページ記憶手段
に記憶されたイメージデータから上記制御手段の制御に
よりまず行を切出し、その行から文字パターンを切出す
文字切出し手段である。FIG. 1 is a block diagram showing an embodiment of the document reading apparatus of the present invention. The same reference numerals as those in the conventional example of FIG. 6 indicate the same components, and the description thereof will be omitted.
In the figure, 8 is an image of the entire document, that is, page storage means for storing image data for one page, 9 is display means for displaying all image data stored in the page storage means 8, and 10 is the display means. This is an input means for instructing and inputting the information necessary for recognizing the position of the reading area and the characters in the reading area, that is, the reading format while looking at the display screen of 1. On the other hand, 11 is control means for storing the information input by the input means in the read format table and controlling the read format of the document, and 12 is first controlled by the control means from the image data stored in the page storage means. It is a character cutting means for cutting out a line and cutting out a character pattern from the line.

次に同図を用いて本実施例の概略動作を説明し、更に第
２図ないし第５図を用いてその要部を詳細に説明する。
第１図に示すように、用紙１に記入された文字を走査手
段２で走査して光電変換し、１ページ分のイメージデー
タをページ記憶手段８に格納する。次いで上記１ページ
分のイメージを表示手段９に表示する。ここでオペレー
タは表示された文書イメージの表示画面を観測し、キー
ボードでカーソル走査を行なれる入力手段10から記入文
字領域を表示画面上で指示入力する。更に上記入力手段
10のキーボードを走査して認識対象字種に対応する番号
を先端入力してこれらを制御手段11に送る。入力され記
入文字領域及び認識対象字種セットは制御手段11内の読
取りフォーマットテーブル内に格納される。文字切出し
手段12では上記記入文字領域内において、まず文字の記
入されている行を自動的に切出し、さらに行内から１字
ずつ文字パターンを切出して文字認識手段５を送る。文
字認識手段５では切出された文字パターンを用いて特徴
を抽出し、抽出された特徴値と文字認識手段５内に格納
されている認識基準パターンの特徴値とを比較して認識
し、その結果として文字コードを出力する。Next, the schematic operation of the present embodiment will be described with reference to the same drawing, and the main parts thereof will be described in detail with reference to FIGS.
As shown in FIG. 1, the characters written on the sheet 1 are scanned by the scanning means 2 and photoelectrically converted, and the image data for one page is stored in the page storage means 8. Then, the image for one page is displayed on the display means 9. Here, the operator observes the display screen of the displayed document image, and inputs an input character area on the display screen from the input means 10 which can perform cursor scanning with the keyboard. Furthermore, the input means
The keyboard 10 is scanned to input the numbers corresponding to the character types to be recognized, and these are sent to the control means 11. The entered character area and the character set to be recognized are stored in the reading format table in the control means 11. The character cutting means 12 automatically cuts out a line in which characters are written in the written character area, and further cuts out a character pattern one by one from the line and sends the character recognition means 5. The character recognition means 5 extracts a feature using the cut-out character pattern, compares the extracted feature value with the feature value of the recognition reference pattern stored in the character recognition means 5, and recognizes the feature value. As a result, the character code is output.

第２図は文書のイメージを表示手段９の画面上に表示し
た一例である。また第３図は領域指定により切出された
矩形領域を示した一例である。まず上記文書例は、漢
字，ひらがな，カタカナ，記号，数字の字種を含む第１
の領域13と、図を含む第２の領域14と、英数字，記号の
字種を含む第３の領域15の３つの領域に分けることがで
きる。入力手段10のカーソル移動キーを用いて、表示さ
れるカーソル位置に対応する切出し指示点で文書のイメ
ージの領域を指示する。すなわち、第１の領域13は第１
の切出し指示点16と第２の切出し指示点17とで指示する
ことにより第３図に示す第１の矩形領域18を指定する。
また第２の領域14は第３の切出し指示点19と第４の切出
し指示点20とで指示することにより第２の矩形領域21を
指定する。同様に第３の領域15は第５の切出し指示点22
と第６の切出し指示点23とで指示することにより第３の
矩形領域24を指定する。FIG. 2 is an example in which an image of a document is displayed on the screen of the display means 9. Further, FIG. 3 is an example showing a rectangular area cut out by specifying an area. First, the above document example includes the first character type including kanji, hiragana, katakana, symbols, and numbers.
Can be divided into three areas, that is, a second area 14 including the figure, and a third area 15 including the character types of alphanumeric characters and symbols. The cursor movement key of the input means 10 is used to indicate the area of the image of the document at the cutout indication point corresponding to the displayed cursor position. That is, the first region 13 is the first
The first rectangular area 18 shown in FIG. 3 is designated by instructing with the cutout instruction point 16 and the second cutout instruction point 17.
In addition, the second rectangular area 21 is designated in the second area 14 by instructing with the third cutout instruction point 19 and the fourth cutout instruction point 20. Similarly, the third region 15 has the fifth cutout point 22.
And the sixth cut-out instruction point 23 to specify the third rectangular area 24.

第４図は制御手段11内に格納されている読取りフォーマ
ットテーブルの内容を示す構成図である。第１の矩形領
域18の認識対象字種が漢字，ひらがな，カタカナ，記
号，数字であることを示す値“1"が字種の欄25に、また
第２の矩形領域21は図領域であり認識対象字種がないこ
とを示す値“0"が同じく字種の欄26に、また第３の矩形
領域24の認識対象字種が英数字，記号であることを示す
値“2"が同様に字種の欄27に格納されている。また、上
記第１の矩形領域18の文書イメージの位置に対応して、
第１及び第２の切出し指示点16及び17のX,Y方向の座標
値1,1及び20,16が切出し領域の欄28及び29に格納されて
いる。同様に第３の切出し指示点から第６の切出し指示
点19,20,22,23のX,Y方向の座標値がそれぞれ切出し領域
の欄30,31,32,33に格納されている。FIG. 4 is a block diagram showing the contents of the read format table stored in the control means 11. The value "1" indicating that the recognition target character type of the first rectangular area 18 is kanji, hiragana, katakana, symbol, or numeral is in the character type column 25, and the second rectangular area 21 is a figure area. The value “0” indicating that there is no recognition target character type is also in the character type column 26, and the value “2” indicating that the recognition target character type of the third rectangular area 24 is alphanumeric or symbol is the same. The character type is stored in the column 27. In addition, in correspondence with the position of the document image in the first rectangular area 18,
Coordinate values 1, 1 and 20, 16 in the X and Y directions of the first and second cutout instruction points 16 and 17 are stored in the cutout area columns 28 and 29. Similarly, the coordinate values in the X and Y directions of the third cut-out designating point to the sixth cut-out designating point 19, 20, 22, 23 are stored in the cut-out area columns 30, 31, 32, 33, respectively.

第５図は第１の矩形領域18内の文書イメージデータから
行を切出し、さらに行内の文字パターンを切出す方法の
一例を示した図である。これは、すでに広く用いられて
いる文字部の黒パターンの周辺分布を用いて行及び文字
パターンを切出す方法であり、まず、同図（ａ）に示す
ように第１の矩形領域18を行方向に走査して黒パターン
のヒストグラム33を求め、このヒストグラムの値が１以
上で連続する第１の部分34を第１の行35として切出し、
第２の部分36を第２の行37として切出す。更に、同図
（ｂ）に示すように第１の行35内を列方向に走査して黒
パターンのヒストグラム38を求め、このヒストグラムの
値が１以上で連続する部分を切出すことにより、第１の
文字部分から第11の文字部分39〜49が切出される。FIG. 5 is a diagram showing an example of a method of cutting out a line from the document image data in the first rectangular area 18 and further cutting out a character pattern in the line. This is a method of cutting out lines and character patterns by using the marginal distribution of the black pattern of the character part which is already widely used. First, as shown in FIG. Direction to obtain a black pattern histogram 33, and a first portion 34 in which the histogram value is 1 or more and which is continuous is cut out as a first row 35,
The second portion 36 is cut out as the second row 37. Further, as shown in FIG. 6B, the first row 35 is scanned in the column direction to obtain a histogram 38 of a black pattern, and a continuous portion where the value of this histogram is 1 or more is cut out, The eleventh character portions 39 to 49 are cut out from the first character portion.

以上のようにして切出された文字パターンは第４図で示
す読取りフォーマットで指示される字種を認識対象とし
て文字認識手段５で認識される。なお、上記第２の矩形
領域21は図領域であり、文字切出し及び文字認識は行な
われない。The character pattern cut out as described above is recognized by the character recognition means 5 with the character type designated by the reading format shown in FIG. 4 as the recognition target. The second rectangular area 21 is a drawing area, and character cutting and character recognition are not performed.

従って本実施例によれば、読取られた文書のイメージを
表示しながら切出し領域や認識対象字種をオペレータが
即時に指示入力し、かつ指示された領域内の文字を自動
的に切出して認識することにより、読取りフォーマット
を作成する作業時間が大幅に短縮できかつ容易に入力で
き、読取りフォーマットの異なる文書でも効率よく入力
できる利点がある。また読取られた文書イメージを画面
上で観測してフォーマットを入力するため、切出し領域
が正確に設定できる利点がある。Therefore, according to the present embodiment, the operator immediately inputs and inputs the cutout area and the recognition target character type while displaying the image of the read document, and the characters in the specified area are automatically cut out and recognized. As a result, there is an advantage that the working time for creating the reading format can be significantly shortened and can be easily input, and even a document having a different reading format can be efficiently input. Further, since the read document image is observed on the screen and the format is input, there is an advantage that the cutout area can be accurately set.

なお、上記説明では読取りフォーマットテーブルに登録
する認識情報として、認識対象字種と切出し領域の位置
とについて説明したが、この発明はこれに限らず、認識
対象の文字が手書き文字か印刷文字かを示す情報を読取
りフォーマットテーブルに登録して使用してよい。ま
た、入力手段はキーボードについて説明したが、これに
限らず点座標入力と数値入力ができる入力手段ならばい
かなる入力手段を使用してもよい。In the above description, as the recognition information registered in the reading format table, the recognition target character type and the position of the cutout area have been described, but the present invention is not limited to this, and whether the recognition target character is a handwritten character or a printed character is recognized. The information shown may be registered in the read format table for use. Although the keyboard has been described as the input means, the input means is not limited to this, and any input means capable of inputting point coordinates and numerical values may be used.

〔The invention's effect〕

以上説明したように、この発明による文書読取装置は、
表示手段で表示された画像の文字領域を選択し、文字切
出し手段が選択された文字領域から文字を切出すので、
あらかじめ認識領域の取捨選択を行うことで、高速に認
識処理を行うことができる。また、文字領域に表示され
ている文字の字種等の属性を選択することにより、精度
の高い認識処理を行うことができる。As described above, the document reading device according to the present invention is
Since the character area of the image displayed by the display means is selected and the character cutting means cuts out the character from the selected character area,
The recognition process can be performed at high speed by selecting the recognition area in advance. Further, by selecting an attribute such as a character type of a character displayed in the character area, highly accurate recognition processing can be performed.

[Brief description of drawings]

第１図はこの発明による文書読取装置の一実施例を示す
ブロック構成図、第２図は文書イメージの表示画面の一
例を示す図、第３図は上記実施例により切出された矩形
領域を示す図、第４図は読取りフォーマットテーブルの
構成図、第５図は文字切出し方法の一例を示す図、第６
図は従来例を示すブロック構成図である。１……文書、８……記憶手段、９……表示手段、10……
入力手段。なお、図中同一又は相当部分には同一符号を用いてい
る。FIG. 1 is a block diagram showing an embodiment of a document reading apparatus according to the present invention, FIG. 2 is a view showing an example of a document image display screen, and FIG. 3 is a rectangular area cut out by the above embodiment. FIG. 4, FIG. 4 is a configuration diagram of a reading format table, FIG. 5 is a diagram showing an example of a character cutting method, and FIG.
FIG. 1 is a block diagram showing a conventional example. 1 ... document, 8 ... storage means, 9 ... display means, 10 ...
Input means. The same reference numerals are used for the same or corresponding parts in the drawings.

Claims

[Claims]

1. A storage means for storing an image of an input document, a display means for displaying the image stored in the storage means, a character area to be recognized by the image displayed by the display means, and this character. Input means for selecting an attribute such as a character type of a character displayed in the area, storage means for storing the character area and the attribute selected by the input means, and storage means for storing the character area And a recognition unit for recognizing the character cut out by the character cutting unit based on the selected attribute stored in the storage unit. A document reading device characterized by the above.