JP2003323586A

JP2003323586A - Document form registering method and document recognizing method

Info

Publication number: JP2003323586A
Application number: JP2002127021A
Authority: JP
Inventors: Takuma Akagi; 琢磨赤木; Bunpei Irie; 文平入江; Hideo Horiuchi; 秀雄堀内; Naoki Natori; 直毅名取; Akihiko Nakao; 昭彦中尾; Yasuhiro Aoki; 泰浩青木; Tomoyuki Hamamura; 倫行浜村
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2002-04-26
Filing date: 2002-04-26
Publication date: 2003-11-14

Abstract

<P>PROBLEM TO BE SOLVED: To provide a document form registering method extensively reducing labor of a user in form registration, and allowing even an inexperienced user to easily carry out form registration. <P>SOLUTION: In preregistering a form of a document and recognizing characters written on the document on the basis of the registered form, an image of a blank document with nothing written is captured by using an image inputting means, positions, thicknesses, line types or the like of ruled lines are extracted from the captured image, and information of the extracted ruled lines is registered as form information. <P>COPYRIGHT: (C)2004,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、あらかじめ帳票上
のフォームを登録し、この登録したフォームに基づき帳
票上に記載された文字を認識する帳票認識装置等におい
て、帳票上のフォームを登録する帳票フォーム登録方
法、および、帳票上に記載された文字を認識する帳票認
識方法に関する。TECHNICAL FIELD The present invention relates to a form for registering a form on a form in advance and recognizing characters written on the form based on the registered form, such as a form recognizing form on the form. The present invention relates to a form registration method and a form recognition method for recognizing characters written on a form.

【０００２】[0002]

【従来の技術】一般的な帳票認識は、大きく分けて、帳
票上のフォームを登録するフォーム登録処理と、登録さ
れたフォームを基に帳票上に記載された文字を認識する
文字認識処理とに分かれる。基本的な帳票の例を図１６
に示す。図１６（ａ）に示すブランク帳票とは、帳票に
まだ何も書込まれていない、プレ印刷だけのものをい
う。これの各項目にその項目にあった文字記入を行なっ
たものが、図１６（ｂ）に示す記入済の帳票である。帳
票認識技術は、この各記入欄に書込まれた文字を認識
し、各項目の認識結果を格納する記憶部に、その認識結
果を格納する技術である。その際に、帳票のフォーム情
報を必要とする。フォーム情報は、図１６（ｃ）に示す
ように、帳票のどの位置にどのような情報が書かれてい
るかをあらかじめ指定している情報である。フォーム情
報には、その他にも、どの位置に罫線が引かれている
か、どの位置を基準に文字認識の位置合わせをすればよ
いかなどの情報が格納されている。なお、位置合わせの
基準をターゲットマークと呼ぶ。また、罫線情報は、文
字認識の際、文字が罫線に掛かっている場合などに効率
良く文字を切り出すために用いられる。2. Description of the Related Art Generally, general form recognition is roughly divided into a form registration process for registering a form on a form and a character recognition process for recognizing characters written on the form based on the registered form. Divide. 16 shows an example of a basic form.
Shown in. The blank form shown in FIG. 16 (a) means a pre-printed form in which nothing has been written on the form. 16B is a completed form in which characters corresponding to the respective items are entered. The form recognition technique is a technique for recognizing the characters written in each entry field and storing the recognition result in a storage unit that stores the recognition result of each item. At that time, the form information of the form is required. As shown in FIG. 16C, the form information is information that pre-designates what kind of information is written at which position on the form. In addition to this, the form information also stores information such as which position the ruled line is drawn, and which position should be used as the reference for character recognition alignment. The reference for alignment is called a target mark. In addition, the ruled line information is used to efficiently cut out a character when the character is hung on the ruled line during character recognition.

【０００３】さらに、フォーム情報には、記入欄に書か
れている文字の認識結果をデータベース上のどこに記憶
すればよいのかを表わす属性情報も登録されている。た
とえば、「氏名」という属性を持った記入欄に書かれて
いる文字を認識した結果は、データベースの「氏名」を
表わす部位に記憶される。Further, in the form information, attribute information indicating where in the database the recognition result of the character written in the entry field should be stored is also registered. For example, the result of recognizing the character written in the entry field having the attribute of "name" is stored in the portion of the database representing "name".

【０００４】[0004]

【発明が解決しようとする課題】ところが、従来のフォ
ーム登録処理は、全て人間が手作業で行なっていた。初
期のものでは、あらかじめ帳票の中の記入欄などの位置
を手作業で計り、その位置を帳票認識装置に手作業で入
力していた。また、ＧＵＩ（グラフィカル・ユーザ・イ
ンタフェイス）を用いる手法では、取込んだブランク帳
票の画像を画面に表示し、マウスポインタなどで記入欄
の内側を丁寧に指し示すことにより、登録を行なってい
た。しかし、フォーム情報は、１ドット単位の精度で認
識率が大きく変化することがあり、登録にはコストと時
間がかかる。また、罫線などが特殊な形をしている場
合、罫線情報として登録するだけでは、その複雑さを表
現できず、文字認識の際に罫線と記入された文字を分離
するのに支障をきたすことがある。However, all of the conventional form registration processing has been manually performed by humans. In the early version, positions such as entry fields in the form were manually measured in advance, and the positions were manually input to the form recognition device. In addition, in the method of using a GUI (graphical user interface), registration is performed by displaying the captured blank form image on the screen and carefully pointing the inside of the entry field with a mouse pointer or the like. However, in the form information, the recognition rate may change greatly with the accuracy of one dot, and the registration takes cost and time. Also, if the ruled lines have a special shape, the complexity cannot be expressed simply by registering them as ruled line information, and it may be difficult to separate the ruled lines from the entered characters during character recognition. There is.

【０００５】そこで、本発明は、ユーザのフォーム登録
にかける手間が大幅に軽減され、かつ、不慣れなユーザ
でも簡易にフォーム登録を行なうことができる帳票フォ
ーム登録方法を提供することを目的とする。また、本発
明は、文字認識精度の向上が図れる帳票認識方法を提供
することを目的とする。Therefore, an object of the present invention is to provide a form form registration method in which the user's time and effort for registering a form can be greatly reduced, and even an inexperienced user can easily perform form registration. Another object of the present invention is to provide a form recognition method capable of improving character recognition accuracy.

【０００６】[0006]

【課題を解決するための手段】本発明の帳票フォーム登
録方法は、あらかじめ帳票上のフォームを登録し、この
登録したフォームに基づき帳票上に記載された文字を認
識するものにおいて、何も記載されていないブランク帳
票の画像を画像入力手段を用いて取込み、この取込んだ
画像から罫線の位置、太さ、線種等を抽出し、この抽出
した罫線の情報をフォーム情報として登録することを特
徴とする。According to the form form registration method of the present invention, when a form on a form is registered in advance and the characters written on the form are recognized based on the registered form, nothing is written. The feature is that an image of a blank form that has not been captured is captured using the image input means, the position of the ruled line, the thickness, the line type, etc. are extracted from the captured image, and the information of the extracted ruled line is registered as form information. And

【０００７】また、本発明の帳票フォーム登録方法は、
あらかじめ帳票上のフォームを登録し、この登録したフ
ォームに基づき帳票上に記載された文字を認識するもの
において、何も記載されていないブランク帳票の画像を
画像入力手段を用いて取込み、この取込んだ画像から実
線または破線に囲まれた領域を記入欄として抽出し、こ
の抽出した記入欄の情報をフォーム情報として登録する
ことを特徴とする。The form form registration method of the present invention is
In a form in which a form on a form is registered in advance and characters recognized on the form are recognized based on the registered form, an image of a blank form on which nothing is described is captured using the image input means, and this capture is performed. It is characterized in that an area surrounded by a solid line or a broken line is extracted from the image as an entry field, and the information in the extracted entry field is registered as form information.

【０００８】また、本発明の帳票認識方法は、あらかじ
め帳票上のフォームを登録し、この登録したフォームに
基づき帳票上に記載された文字を認識するものにおい
て、フォーム登録時、何も記載されていないブランク帳
票の画像を画像入力手段を用いて取込み、この取込んだ
画像の少なくとも一部をフォーム情報とともに登録して
おき、文字認識時、入力された認識対象帳票の画像から
前記先に登録されているブランク帳票の画像を用いて当
該帳票にあらかじめ印刷されている罫線や文字等を除去
し、その後当該帳票に対する文字認識を行なうことを特
徴とする。Further, in the form recognition method of the present invention, a form on a form is registered in advance and the characters written on the form are recognized based on the registered form. When the form is registered, nothing is written. An image of an empty blank form is captured using the image input means, at least a part of the captured image is registered together with the form information, and at the time of character recognition, the image of the input recognition target form is registered first in the above. It is characterized in that ruled lines and characters printed in advance on the form are removed by using the image of the blank form, and then character recognition is performed on the form.

【０００９】さらに、本発明の帳票認識方法は、帳票上
の画像内に記載された文字を認識する帳票認識方法にお
いて、あらかじめ文字が記載されていない帳票上の画像
を登録しておき、文字認識時、入力された認識対象帳票
の画像から前記先に登録されている画像を用いて当該帳
票上の画像を除去し、その後当該帳票に対する文字認識
を行なうことを特徴とする。Further, in the form recognition method of the present invention, in the form recognition method for recognizing the characters described in the image on the form, the image on the form in which no character is described is registered in advance, and the character recognition is performed. At this time, the image on the form is removed from the input image of the form to be recognized by using the previously registered image, and then character recognition is performed on the form.

【００１０】[0010]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を参照して説明する。まず、第１の実施の形態に
ついて説明する。第１の実施の形態は、ブランク帳票の
罫線を自動検知することにより、罫線のフォーム登録作
業を軽減するもので、罫線抽出装置の概略構成を図１に
示し、以下、それについて説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings. First, the first embodiment will be described. The first embodiment reduces the form registration work of ruled lines by automatically detecting the ruled lines of the blank form. A schematic configuration of the ruled line extraction device is shown in FIG. 1, which will be described below.

【００１１】まず、ブランク帳票を読取部にセットし、
画像入力手段としてのスキャナ１１によって当該ブラン
ク帳票の濃淡画像を取込み、濃淡画像記憶部１２に格納
する。次に、濃淡画像記憶部１２に格納されたブランク
帳票の画像を２値化部１３により２値化し、その結果を
２値化画像記憶部１４に格納する。First, set a blank form on the reading unit,
The grayscale image of the blank form is captured by the scanner 11 as an image input unit and stored in the grayscale image storage unit 12. Next, the image of the blank form stored in the grayscale image storage unit 12 is binarized by the binarization unit 13, and the result is stored in the binarized image storage unit 14.

【００１２】次に、２値化画像記憶部１４に格納された
２値化画像に対し、縦方向射影部１５および横方向射影
部１６により、各方向の黒画素の射影を取り、その結果
をそれぞれ縦方向射影結果記憶部１７、横方向射影結果
記憶部１８に格納する。次に、罫線抽出部１９は、縦方
向射影結果記憶部１７、横方向射影結果記憶部１８に格
納された縦方向射影、横方向射影の情報を基に、２値化
画像記憶部１４に記憶された２値化画像の黒画素を追跡
して行き、罫線の始点と終点を求める。また、罫線抽出
部１９では、その罫線の黒画素のつながり方も検知し、
それが実線であるのか破線であるのかなども識別する。Next, with respect to the binarized image stored in the binarized image storage unit 14, the vertical projection unit 15 and the horizontal projection unit 16 project black pixels in each direction, and the result is obtained. The results are stored in the vertical projection result storage unit 17 and the horizontal projection result storage unit 18, respectively. Next, the ruled line extraction unit 19 stores in the binarized image storage unit 14 based on the information of the vertical projection and the horizontal projection stored in the vertical projection result storage unit 17 and the horizontal projection result storage unit 18. The black pixels of the binarized image are traced to find the start and end points of the ruled line. The ruled line extraction unit 19 also detects how the black pixels of the ruled line are connected,
It also identifies whether it is a solid line or a broken line.

【００１３】罫線抽出部１９によって抽出された罫線の
情報は、そのままフォーム情報として登録してもよい
が、誤抽出を省き、精度を上げるために、ＧＵＩ上での
登録作業を入れるのが望ましい。ＧＵＩ上では、たとえ
ば、図２に示すように、抽出された罫線が色を変えて表
示される。また、罫線の線種（実線か破線かなど）によ
っても色を変えて表示される。ユーザは、マウスなどに
より、この色の変わった部分をクリックすることによ
り、罫線を選択し、フォーム情報に罫線情報を登録す
る。The information on the ruled lines extracted by the ruled line extraction unit 19 may be registered as the form information as it is, but it is desirable to perform registration work on the GUI in order to avoid erroneous extraction and improve accuracy. On the GUI, for example, as shown in FIG. 2, the extracted ruled lines are displayed in different colors. Also, the color is displayed in different colors depending on the line type of the ruled line (solid line, broken line, etc.). The user selects a ruled line by clicking this color-changed portion with a mouse or the like, and registers the ruled line information in the form information.

【００１４】次に、第２の実施の形態について説明す
る。第２の実施の形態は、ブランク帳票の記入欄を自動
検知することにより、記入欄の位置やその属性などを識
別し、フォーム登録作業を軽減するもので、記入欄抽出
装置の概略構成を図３に示し、以下、それについて説明
する。Next, a second embodiment will be described. In the second embodiment, the position of the entry column and its attributes are identified by automatically detecting the entry column of the blank form, and the form registration work is reduced. 3 and will be described below.

【００１５】まず、ブランク帳票を読取部にセットし、
画像入力手段としてのスキャナ２１によって当該ブラン
ク帳票の濃淡画像を取込み、濃淡画像記憶部２２に格納
する。次に、濃淡画像記憶部２２に格納されたブランク
帳票の画像を２値化部２３により２値化し、その結果を
２値化画像記憶部２４に格納する。２値化画像記憶部２
４に格納された画像の例を図４（ａ）に示す。First, set a blank form on the reading unit,
The grayscale image of the blank form is captured by the scanner 21 as image input means and stored in the grayscale image storage unit 22. Next, the image of the blank form stored in the grayscale image storage unit 22 is binarized by the binarization unit 23, and the result is stored in the binarized image storage unit 24. Binary image storage unit 2
An example of the image stored in No. 4 is shown in FIG.

【００１６】次に、２値化画像記憶部２４に格納されて
いる罫線の情報は、掠れなどによって途切れていること
があり、また、破線のように直線的に繋がっていない場
合もある。そこで、罫線延長部２５によって、これらの
途切れをなくす。すなわち、罫線延長部２５では、黒画
素を縦横両方に調べ、まず、その罫線がどの方向に向い
ているかの情報を得る。その後、その線分の両端の途切
れまで検索する。線分が他の線分と交わって終端を迎え
ていれば、検索はそこで終了する。線分が途中で切れた
形で検索されれば、その方向にしばらく検索を続け、そ
こに他の線分の端点または他の垂直方向の線分があれ
ば、その方向に他の線分と接触するまで黒画素を伸ば
す。こうして得られた結果を、２値化画像記憶部２６に
格納する。２値化画像記憶部２６に格納された画像の例
を図４（ｂ）に示す。Next, the ruled line information stored in the binarized image storage unit 24 may be interrupted due to blurring or the like, and may not be linearly connected like a broken line. Therefore, these interruptions are eliminated by the ruled line extension portion 25. That is, the ruled line extension unit 25 examines the black pixels in both the vertical and horizontal directions, and first obtains information on which direction the ruled line is facing. After that, the line segment is searched up to the break. If the line segment intersects another line segment and reaches the end, the search ends there. If a line segment is searched for in the middle, search continues in that direction for a while, and if there is an endpoint of another line segment or another line segment in the vertical direction, it is regarded as another line segment in that direction. Stretch the black pixels until they touch. The result thus obtained is stored in the binarized image storage unit 26. An example of the image stored in the binarized image storage unit 26 is shown in FIG.

【００１７】次に、反転画像作成部２７によって、２値
化画像記憶部２６に記憶されている画像を反転する。反
転した画像は、反転画像記憶部２８に格納する。反転画
像記憶部２８に格納された画像の例を図４（ｃ）に示
す。次に、ラベリング部２９は、反転画像記憶部２８内
の反転画像を用いてラベリング処理を行なう。この場
合、反転画像をわざわざ生成しなくても、２値化画像の
黒画素を背景とみなして、直接ラベリングを行なっても
よい。ラベリング部２９によって抽出されたラベルの位
置の例を図４（ｄ）に示す。Next, the inverted image creating unit 27 inverts the image stored in the binarized image storage unit 26. The inverted image is stored in the inverted image storage unit 28. An example of the image stored in the reverse image storage unit 28 is shown in FIG. Next, the labeling unit 29 uses the inverted image in the inverted image storage unit 28 to perform the labeling process. In this case, the black pixels of the binarized image may be regarded as the background and the labeling may be performed directly without the need to generate the inverted image. An example of the position of the label extracted by the labeling unit 29 is shown in FIG.

【００１８】次に、記入欄抽出部３１は、ラベリング部
２９によって得られたラベル情報（大きさや位置など）
を基に記入欄を検出する処理を行なう。すなわち、記入
欄抽出部３１は、あらかじめ設定されている記入欄のサ
イズの基準を基にラベリング結果を検索し、適当なラベ
ルがみつかると、反転画像記憶部２８に格納されている
反転画像を調べ、そのラベルのほとんどの位置が黒画素
で満たされていそかどうかを調べる。条件を満たせば、
そのラベルの位置を記入欄候補の位置と決定する。Next, the entry section extraction section 31 has label information (size, position, etc.) obtained by the labeling section 29.
Based on, the process of detecting the entry column is performed. That is, the entry column extraction unit 31 searches the labeling result based on the preset entry column size standard, and when an appropriate label is found, checks the inverted image stored in the inverted image storage unit 28. Find out if most of the label's position is filled with black pixels. If the conditions are met,
The position of the label is determined as the position of the entry field candidate.

【００１９】こうして抽出された記入欄の候補は、その
ままフォーム情報として登録してもよいが、精度を上げ
るため、ＧＵＩ上で人手により登録することが望まし
い。ＧＵＩ上では、前記罫線検出後の表示と同じよう
に、抽出された記入欄の外接矩形を色や太さ明度を変え
て表示したり、記入欄自身を色や明度を変えて表示し、
ユーザがマウスでクリックするなどして登録する。The entry field candidates thus extracted may be directly registered as form information, but it is desirable to manually register them on the GUI in order to improve accuracy. On the GUI, in the same way as the display after detecting the ruled line, the circumscribed rectangle of the extracted entry field is displayed in different colors and thicknesses, or the entry field itself is displayed in different colors and lightness.
The user registers it by clicking it with the mouse.

【００２０】次に、第３の実施の形態について説明す
る。第３の実施の形態は、前記記入欄抽出処理において
抽出された記入欄の属性を検出して、フォーム情報に自
動登録するもので、記入欄の属性検出方法には以下の３
種類が考えられる。Next, a third embodiment will be described. In the third embodiment, the attributes of the entry fields extracted in the entry field extraction process are detected and automatically registered in the form information.
Types can be considered.

【００２１】(1) 記入欄の近辺に印刷されている項目名
を認識する手法 (2) ユーザが記入欄に明記した項目名を認識する手法 (3) ユーザが属性ごとにあらかじめ決められた数字やマ
ークを記入欄に明記し、その数字やマークを認識する手
法まず、記入欄の近辺に印刷されている項目名を認識する
手法について説明する。この手法は、前記第２の実施の
形態において抽出された記入欄の属性を、その記入欄の
近辺に印刷されている項目名を認識することによって取
得し、フォーム情報に登録するものである。(1) A method for recognizing the item name printed near the entry field (2) A method for recognizing the item name specified by the user in the entry field (3) A number predetermined by the user for each attribute A method of recognizing a number or a mark in the entry field and recognizing the number or the mark First, a method of recognizing an item name printed near the entry field will be described. In this method, the attribute of the entry field extracted in the second embodiment is acquired by recognizing the item name printed near the entry field and registered in the form information.

【００２２】図５（ａ）に、帳票上の特定の記入欄、た
とえば、氏名の記入欄の例を示す。本実施の形態では、
氏名の記入欄が抽出されると、自動的にその近辺に印刷
されている文字を文字認識処理により認識する。図５
（ａ）の例では、記入欄の左隣りに「氏名」の記述がさ
れているので、「氏名」と認識される。これを、その記
入欄の属性としてフォ一ム情報に登録する。FIG. 5A shows an example of a specific entry field on the form, for example, a name entry field. In this embodiment,
When the name entry field is extracted, the characters printed in the vicinity are automatically recognized by the character recognition processing. Figure 5
In the example of (a), since "name" is written on the left side of the entry field, "name" is recognized. This is registered in the form information as an attribute of the entry field.

【００２３】この場合、記入欄が横に長い場合は、左部
に書かれている文字を優先的に属性として認識し、記入
欄が縦に長い場合は、上部に書かれている文字を優先的
に属性として認識する。記入欄の近辺に複数の文字が印
刷されている場合は、記入欄に最も近い印刷文字の認識
結果を属性として登録する。印刷されている文字が、デ
ータベースにあらかじめ登録されていない属性であった
場合は、ＧＵＩを用いてユーザに登録しなおしてもら
う。In this case, when the entry field is long horizontally, the characters written on the left side are preferentially recognized as attributes, and when the entry field is vertically long, the characters written on the upper side are given priority. Recognized as an attribute. When a plurality of characters are printed near the entry field, the recognition result of the print character closest to the entry field is registered as an attribute. If the printed characters have an attribute that is not registered in the database in advance, the user is requested to register again using the GUI.

【００２４】次に、ユーザが記入欄に明記した項目名を
認識する手法について説明する。この手法は、図５
（ｂ）に示すように、記入欄の近辺または記入欄の中に
ユーザがあらかじめ記入した文字が検出された場合に
は、その文字を認識し、その認識結果をその記入欄の属
性として登録するものである。この場合、ユーザが記述
した文字が、データベースにあらかじめ登録されていな
い属性であった場合は、ＧＵＩを用いてユ一ザに登録し
なおしてもらう。Next, a method for the user to recognize the item name specified in the entry field will be described. This method is shown in FIG.
As shown in (b), when a character previously entered by the user is detected near the entry field or in the entry field, the character is recognized and the recognition result is registered as an attribute of the entry field. It is a thing. In this case, if the character described by the user has an attribute that is not registered in the database in advance, the user is requested to register it again using the GUI.

【００２５】なお、指定された色のペンなどを用いて、
属性文字を記述することにより、文字抽出の精度を上
げ、属性自動登録を円滑に進めることができる。この場
合、スキャナで取込まれたカラー画像に特定の色を抽出
するフィルタをかけ、その画像情報を用いて文字切出
し、文字認識を行なう。In addition, using a pen of a specified color,
By describing the attribute character, the accuracy of character extraction can be improved and the automatic attribute registration can be smoothly performed. In this case, a filter for extracting a specific color is applied to the color image captured by the scanner, and character extraction and character recognition are performed using the image information.

【００２６】次に、ユーザが属性ごとにあらかじめ決め
られた数字やマークを記入欄に明記し、その数字やマー
クを認識する手法について説明する。この手法は、図５
（ｃ）に示すように、属性の種類によってあらかじめ決
められたマークまたは数字を、ユーザが記入欄の近辺ま
たは記入欄の中に書き、それを認識することによって、
記入欄の属性を自動登録するものである。Next, a method will be described in which the user specifies a predetermined number or mark for each attribute in the entry field and recognizes the number or mark. This method is shown in FIG.
As shown in (c), the user writes a mark or a number predetermined according to the type of attribute in the vicinity of the entry field or in the entry field and recognizes it.
The attributes of the entry fields are automatically registered.

【００２７】ユーザには、たとえば、図６に示すような
属性と１対１に対応した数字やマークの表をあらかじめ
渡しておき、ユーザはその表にしたがって、記入欄の内
部または近辺に、該当する数字や文字を記入する。シス
テムは、その数字やマークを認識し、あらかじめ登録さ
れているマークや文字と属性との関連テーブルを参照し
て、記入欄の属性を決定し、フォーム情報として登録す
る。この場合も、指定された色のペンなどを用いて、属
性文字を記述することにより、数字やマークの抽出の精
度を上げ、属性自動登録を円滑に進めることができる。To the user, for example, a table of numbers and marks corresponding to the attributes as shown in FIG. 6 is handed in advance, and the user can apply to the inside or the vicinity of the entry field according to the table. Enter the numbers and letters you want. The system recognizes the numeral or mark, refers to the previously registered mark or character and attribute relation table, determines the attribute of the entry field, and registers it as form information. Also in this case, by writing the attribute character using a pen or the like of a designated color, it is possible to improve the accuracy of extraction of numbers and marks and to smoothly perform automatic attribute registration.

【００２８】次に、第４の実施の形態について説明す
る。前述したフォーム登録の自動化は、ブランク帳票が
ないと行なえない。また、ブランク帳票の画像入力の際
に入るノイズなどの影響を受けやすい。そこで、第４の
実施の形態は、複数のブランク帳票あるいは記入済の帳
票を用いて、安定したフォーム登録を行なうもので、そ
の処理装置の概略構成を図７に示し、以下、それについ
て説明する。Next, a fourth embodiment will be described. The above-mentioned automation of form registration cannot be performed without a blank form. In addition, it is easily affected by noise and the like that occur when the blank form image is input. Therefore, in the fourth embodiment, stable form registration is performed using a plurality of blank forms or completed forms. A schematic configuration of the processing device is shown in FIG. 7, which will be described below. .

【００２９】まず、複数枚のブランク帳票あるいは記入
済の帳票を供給部にセットし、読取部に１枚ずつ順次供
給することにより、画像入力手段としてのスキャナ４１
によって当該帳票の濃淡画像を順次取込み、この取込ん
だ各画像をそれぞれフォーム画像記憶部４２ａ，４２
ｂ，４２ｃ，４２ｄ，…に格納する。First, a plurality of blank forms or completed forms are set in the supply unit and are sequentially supplied to the reading unit one by one, so that the scanner 41 as an image input unit.
The grayscale images of the form are sequentially fetched by the, and the fetched images are respectively stored in the form image storage sections 42a and 42a.
b, 42c, 42d, ...

【００３０】次に、画像比較部４３は、画像記憶部４２
ａ，４２ｂ，４２ｃ，４２ｄ，…内の各画像を基に、あ
らかじめ印刷された罫線や文字、背景模様等だけを抽出
した比較結果画像を作成し、これを比較結果画像記憶部
４４に格納する。システムは、この比較結果画像を用い
て、記入欄や罫線の抽出を行ない、その結果をフォーム
情報として登録する。Next, the image comparison unit 43 is connected to the image storage unit 42.
Based on each image in a, 42b, 42c, 42d, ..., a comparison result image is created in which only preprinted ruled lines, characters, background patterns, etc. are created and stored in the comparison result image storage unit 44. . The system uses this comparison result image to extract entry fields and ruled lines and register the results as form information.

【００３１】画像比較部４３の処理をさらに詳細に説明
すると、入力された複数の帳票画像の同じ位置にある画
素の濃度値を全ての帳票分加算する。その結果、あらか
じめ印刷された罫線や文字、背景模様などの部位は濃度
値が高くなり、逆に記入欄に記載されている文字等は毎
回書かれる場所がずれるため、濃度値はさほど高くはな
らない。そこで、濃度値の高い部位だけを黒画素として
比較結果画像記憶部４４に格納する。これにより、あら
かじめ印刷された部位（罫線や文字、背景模様など）を
安定して抽出することができる。各帳票の濃度の加算結
果のうち、最も高い値を持った画素の周辺を位置合わせ
のターゲットマークとして抽出することにより、従来、
ユーザが経験で指定していたターゲットマークを、より
効率良く登録することができる。The processing of the image comparison unit 43 will be described in more detail. The density values of pixels at the same position in a plurality of input form images are added for all forms. As a result, the density value is high in the parts such as ruled lines, characters, and background patterns that have been printed in advance, and on the other hand, the characters written in the entry field are not written in the same place each time, so the density value is not so high. . Therefore, only the portion having a high density value is stored in the comparison result image storage unit 44 as a black pixel. This makes it possible to stably extract preprinted parts (ruled lines, characters, background patterns, etc.). By extracting the periphery of the pixel with the highest value among the addition results of the densities of each form as the alignment target mark,
The target mark specified by the user through experience can be registered more efficiently.

【００３２】次に、第５の実施の形態について説明す
る。第５の実施の形態は、フォーム情報の登録の際に、
複数の記入済の帳票の画像を入力し、その文字を認識す
ることによって、その帳票に書かれている記入欄の属性
を得るもので、その処理装置の概略構成を図８に示し、
以下、それについて説明する。Next, a fifth embodiment will be described. In the fifth embodiment, when registering form information,
By inputting the images of a plurality of completed forms and recognizing the characters, the attributes of the entry fields written in the form are obtained, and the schematic configuration of the processing device is shown in FIG.
This will be described below.

【００３３】まず、複数枚の記入済の帳票を供給部にセ
ットし、読取部に１枚ずつ順次供給することにより、画
像入力手段としてのスキャナ５１によって当該帳票の濃
淡画像を順次取込み、この取込んだ各画像をそれぞれフ
ォーム画像記憶部５２ａ，５２ｂ，５２ｃ，５２ｄ，…
に格納する。First, a plurality of completed forms are set in the supply unit and sequentially supplied to the reading unit one by one, so that the grayscale images of the form are sequentially acquired by the scanner 51 as an image input means. Each of the incorporated images is stored in the form image storage units 52a, 52b, 52c, 52d, ...
To store.

【００３４】次に、文字認識部５３は、記載位置情報格
納部５４に格納された記載位置情報に基づき、フォーム
画像記憶部５２ａ，５２ｂ，５２ｃ，５２ｄ，…内の各
画像の文字記載部の中に対し文字認識を行ない、その文
字認識結果を各帳票ごとに文字認識結果格納部５５に格
納する。次に、属性決定部５６は、記載位置情報格納部
５４に格納された記載位置情報と、文字認識結果格納部
５５に格納された文字認識結果とに基づき各記入欄の属
性を決定する。Next, the character recognition unit 53, based on the written position information stored in the written position information storage unit 54, the character writing unit of each image in the form image storage units 52a, 52b, 52c, 52d ,. Character recognition is performed on the inside, and the character recognition result is stored in the character recognition result storage unit 55 for each form. Next, the attribute determination unit 56 determines the attribute of each entry field based on the written position information stored in the written position information storage unit 54 and the character recognition result stored in the character recognition result storage unit 55.

【００３５】属性決定部５６では、複数の帳票の同じ記
入欄の文字認識結果を基に属性を決定する。たとえば、
その記入欄の文字認識結果が「東京都」や「川崎市」な
ど、住所名を表わす単語が多く含まれていれば、その記
入欄の属性は「住所」と決定する。また、「０」から始
まる１０桁以上の数字が多く見られる場合は、その記入
欄の属性を「電話番号」とする。また、氏名によく用い
られている「男」、「子」、「夫」、「木村」、「郎」
などの文字が頻繁に現われている場合には、その記入欄
の属性を「氏名」とする。The attribute determining unit 56 determines the attribute based on the character recognition result of the same entry field of a plurality of forms. For example,
If the character recognition result of the entry field includes many words representing the address name such as "Tokyo" or "Kawasaki City", the attribute of the entry field is determined to be "address". If there are many numbers of 10 digits or more starting from "0", the attribute of the entry field is "telephone number". In addition, "man", "child", "husband", "Kimura", "ro" often used for names
When a character such as “” appears frequently, the attribute of the entry field is “name”.

【００３６】その他にも、あらかじめデータベースに項
目の要素別に現われやすい文字や文字並びを登録してお
き、登録された文字や文字並びが書かれていることが多
ければ、その記入欄の属性をデータベースに登録されて
いる要素とすることも可能である。たとえば、「男女
欄」の項目のデータベースには、「男」や「女」の文字
が登録されており、記入欄の大体の大きさなども登録さ
れている。「個数欄」データベースには、数字や
「個」、「つ」、「コ」などが登録されている。これに
より、ユーザがいちいち指定しなくても、帳票の各記入
欄の属性が自動的に取得される。In addition, if a character or character sequence that is likely to appear for each element of the item is registered in advance in the database and the registered character or character sequence is written in many cases, the attribute of the entry field is stored in the database. It is also possible to use the elements registered in. For example, the characters "male" and "female" are registered in the database of the item "gender column", and the approximate size of the entry column is also registered. Numbers, "pieces", "tsu", "ko", etc. are registered in the "quantity column" database. As a result, the attributes of each entry field of the form are automatically acquired even if the user does not specify one by one.

【００３７】次に、第６の実施の形態について説明す
る。第６の実施の形態は、フォーム登録の際に、ブラン
ク帳票の画像を画像のままで登録しておき、その画像を
帳票認識時に使用するもので、その処理装置の概略構成
を図９に示し、以下、それについて説明する。Next, a sixth embodiment will be described. In the sixth embodiment, an image of a blank form is registered as an image at the time of form registration, and the image is used at the time of form recognition. A schematic configuration of the processing device is shown in FIG. This will be explained below.

【００３８】まず、認識対象帳票を読取部にセットし、
画像入力手段としてのスキャナ６１によって当該帳票の
濃淡画像を取込み、認識対象帳票画像記憶部６２に格納
する。次に、認識対象帳票画像記憶部６２に格納された
認識対象帳票の画像を２値化部６３により２値化し、そ
の結果を認識対象帳票２値化画像記憶部６４に格納す
る。First, the form to be recognized is set in the reading unit,
The scanner 61 as an image input means captures a grayscale image of the form and stores it in the recognition target form image storage unit 62. Next, the image of the recognition target form stored in the recognition target form image storage unit 62 is binarized by the binarization unit 63, and the result is stored in the recognition target form binary image storage unit 64.

【００３９】登録ブランク帳票画像記憶部６５には、フ
ォーム登録の際にフォーム情報と共に取込んだブランク
帳票の画像（２値化済）が格納されている。そこで、位
置合わせ処理部６６は、この２つの画像記憶部６４，６
５に格納された各画像のターゲットマークを基準とし
て、当該２つの画像の位置合わせを行ない、その位置ず
れ情報を位置合わせ結果格納部６７に格納する。The registered blank form image storage section 65 stores an image (binarized) of the blank form taken together with the form information at the time of form registration. Therefore, the alignment processing unit 66 uses the two image storage units 64 and 6
Based on the target mark of each image stored in No. 5, the two images are aligned, and the positional deviation information is stored in the alignment result storage unit 67.

【００４０】ここで、ターゲットマークとは、認識対象
の帳票には必ず印刷されている特徴的なマークのこと
で、このマークの位置をあらかじめフォーム情報に登録
しておき、認識対象帳票２値化画像で最もターゲットマ
ークらしい部位を抽出し、２つの画像（登録ブランク帳
票画像と認識対象帳票画像）の縦横方向のずれを推定す
るものである。Here, the target mark is a characteristic mark that is always printed on the form to be recognized. The position of this mark is registered in the form information in advance, and the form to be recognized is binarized. The most likely target mark part is extracted from the image, and the vertical and horizontal shifts of the two images (registered blank form image and recognition target form image) are estimated.

【００４１】次に、プレ印刷消去部６８は、図１０に示
すように、位置合わせ結果格納部６７に格納された位置
ずれ情報を基準として、登録ブランク帳票画像記憶部６
５に格納された登録ブランク帳票画像を用いて、認識対
象帳票２値化画像記憶部６４に格納された認識対象帳票
画像から、帳票にあらかじめ印刷されている罫線や文字
など（プレ印刷部分）を除去し、認識対象となる文字画
像のみ（プレ印刷消去画像）を抽出する。これは、登録
ブランク帳票画像の黒画素と同じ位置にあたる認識対象
帳票２値化画像上の画素を位置合わせ結果情報（位置ず
れ情報）から計算し、その画素を白画素にすることによ
って行なう。こうして得られたプレ印刷消去画像は、プ
レ印刷消去画像記憶部６９に格納され、このプレ印刷消
去画像に対して文字認識が行なわれる。Next, as shown in FIG. 10, the pre-print erasure section 68 uses the registration blank form image storage section 6 as a reference, based on the positional deviation information stored in the registration result storage section 67.
Using the registered blank form image stored in 5, the recognition target form image stored in the recognition target form binarized image storage unit 64 is used to create ruled lines and characters (pre-printed portion) that are printed in advance on the form. Only the character image to be recognized (pre-print erased image) is extracted. This is performed by calculating the pixel in the binarized image of the recognition target form, which corresponds to the same position as the black pixel of the registered blank form image, from the alignment result information (positional shift information), and making the pixel a white pixel. The pre-print erased image thus obtained is stored in the pre-print erased image storage unit 69, and character recognition is performed on the pre-print erased image.

【００４２】このように、認識対象帳票の画像からプレ
印刷部分を除去することにより、従来、記載された文字
の切出しや認識に悪影響の多かった罫線や印刷文字など
のノイズを除去することができる。As described above, by removing the pre-printed portion from the image of the form to be recognized, it is possible to remove noises such as ruled lines and printed characters which have a bad influence on the cut-out and recognition of the described characters. .

【００４３】次に、第７の実施の形態について説明す
る。第７の実施の形態は、あらかじめブランク帳票の背
景画像を登録しておくことによって、人力された認識対
象帳票の画像から背景画像を消去し、認識対象文字の切
出し、文字認識の精度をあげるもので、その処理装置の
概略構成を図１１に示し、以下、それについて説明す
る。Next, a seventh embodiment will be described. In the seventh embodiment, a background image of a blank form is registered in advance, so that the background image is erased from the image of the manually recognized recognition target form, and the recognition target character is cut out and the accuracy of character recognition is improved. Then, the schematic configuration of the processing apparatus is shown in FIG. 11, which will be described below.

【００４４】まず、認識対象帳票を読取部にセットし、
画像入力手段としてのスキャナ７１によって当該帳票の
濃淡画像を取込み、認識対象画像記憶部７２に格納す
る。登録背景画像記憶部７３には、あらかじめブランク
帳票の背景画像が格納されている。そこで、位置合わせ
処理部７４は、この２つの画像記憶部７２，７３に格納
された各画像のターゲットマークを基準として、当該２
つの画像の位置合わせを行ない、その位置ずれ情報を位
置合わせ結果格納部７５に格納する。First, the form to be recognized is set in the reading unit,
A scanner 71 serving as an image input unit captures a grayscale image of the form and stores it in the recognition target image storage unit 72. The background image of the blank form is stored in advance in the registered background image storage unit 73. Therefore, the registration processing unit 74 uses the target mark of each image stored in the two image storage units 72 and 73 as a reference to determine the target mark.
The two images are aligned, and the positional deviation information is stored in the alignment result storage unit 75.

【００４５】背景画像消去部７６は、図１２に示すよう
に、位置合わせ結果格納部７５に格納された位置ずれ情
報を基準として、登録背景画像記憶部７３に格納された
登録背景画像を用いて、認識対象画像記憶部７２に格納
された認識対象帳票画像から、帳票にあらかじめ印刷さ
れている背景画像を除去し、認識対象となる文字画像の
み（背景画像消去画像）を抽出する。As shown in FIG. 12, the background image erasing unit 76 uses the registered background image stored in the registered background image storage unit 73 with reference to the positional deviation information stored in the alignment result storage unit 75. The background image previously printed on the form is removed from the recognition target form image stored in the recognition target image storage unit 72, and only the character image to be recognized (background image erased image) is extracted.

【００４６】背景画像消去部７６の処理をさらに詳細に
説明すると、位置合わせ結果情報（位置ずれ情報）を用
いて、登録背景画像のある画素の濃度と認識対象画像の
位置的に一致する画素の濃度とを比較し、その濃度差が
あまりない場合は背景画像のテクスチャが一致したとし
て、認識対象画像の画素を白画素にし、濃度差が大きい
場合は、ここに文字が書かれている可能性が高いので、
その画素は認識対象画像の濃度と同じにする。こうする
ことにより、認識対象文字が背景画像と密に接している
場合などでも、背景画像のみを確実に取除くことができ
る。こうして得られた背景画像消去画像は、背景画像消
去画像記憶部７７に格納され、この背景画像消去画像に
対して文字認識が行なわれる。The processing of the background image erasing unit 76 will be described in more detail. By using the alignment result information (positional deviation information), the density of a pixel in the registered background image and the pixel in which the position of the recognition target image coincides with that of the pixel. If the density difference is not significant and the background image textures match if there is not much difference in density, the pixels in the recognition target image are set to white pixels, and if the density difference is large, there is a possibility that text is written here. Is high,
The pixel has the same density as the recognition target image. By doing so, even if the recognition target character is in close contact with the background image, it is possible to reliably remove only the background image. The background image erased image thus obtained is stored in the background image erased image storage unit 77, and character recognition is performed on the background image erased image.

【００４７】次に、第８の実施の形態について説明す
る。運送業界の帳票（配送伝票等）において、重量・容
量の単位として「才」の文字が用いられることが多い。
これを認識することにより、重量認識の精度を向上させ
ることができる。「才」は重量や容積の単位を表わし、
通常、１才＝１０Ｋｇである。Next, an eighth embodiment will be described. In the form of shipping industry (delivery slips, etc.), the character "sai" is often used as a unit of weight and capacity.
By recognizing this, the accuracy of weight recognition can be improved. "Sai" represents a unit of weight or volume,
Usually, 1 year old = 10 kg.

【００４８】図１３に「才」の文字を含んだ文字列の例
を示す。また、図１４は、「才」の文字を伴った重量欄
の認識手順を示し、以下、それについて説明する。入力
された帳票画像は、２値化処理した後、文字切出し処理
を経て、重量認識処理にくる。まず、切出された文字に
対する認識を行なう（ステップＳ１）。次に、認識結果
を基に重量の数字の部分の認識結果を求める（ステップ
Ｓ２）。その後、重量欄の単位の部分の認識結果を求め
る（ステップＳ３）。単位を認識した結果、単位が無記
入かまたは「Ｋｇ」の場合は、数字認識の結果をそのま
ま出力する（ステップＳ４）。単位を認識した結果が
「才」であった場合には、数字認識の結果を１０倍し
（ステップＳ５）、単位を揃えることによって、結果を
出力する。FIG. 13 shows an example of a character string including the character "sai". In addition, FIG. 14 shows a procedure for recognizing the weight column accompanied by the character "sai", which will be described below. The input form image undergoes binarization processing, character cutting processing, and weight recognition processing. First, the cut-out character is recognized (step S1). Next, the recognition result of the number portion of the weight is obtained based on the recognition result (step S2). After that, the recognition result of the unit of the weight column is obtained (step S3). As a result of recognizing the unit, if the unit is blank or "Kg", the result of the numeral recognition is output as it is (step S4). When the result of unit recognition is "age", the result of numeral recognition is multiplied by 10 (step S5), and the result is output by aligning the units.

【００４９】次に、第９の実施の形態について説明す
る。図１３に「才」の文字を含んだ文字列の例を示した
が、図１３（ａ）は「１才」を表わし、図１３（ｂ）は
「３才」を表わしている。図１３を見てもわかるよう
に、「才」の書かれ方には非常に多くのバリユーション
がある。そこで、従来の文字認識技術で「才」を認識し
ようとしても、その認識率は低くなる。Next, a ninth embodiment will be described. FIG. 13 shows an example of a character string containing the characters "age". Fig. 13 (a) represents "1 year old" and Fig. 13 (b) represents "3 year old". As can be seen from FIG. 13, there are many variations in the writing of “age”. Therefore, even if an attempt is made to recognize "age" by the conventional character recognition technology, the recognition rate becomes low.

【００５０】帳票の重量・容量欄には、通常、数字
「０」〜「９」と「才」の文字種しか現われない。そこ
で、文字認識処理で「才」が認識できなかった場合で
も、「０」〜「９」までの全ての文字候補でないことが
わかれば、その文字を「才」としてもよいことになる。In the weight / capacity column of the form, only the character types of the numbers "0" to "9" and "age" usually appear. Therefore, even if "character" cannot be recognized in the character recognition process, if it is found that all the character candidates "0" to "9" are not recognized, the character may be regarded as "character".

【００５１】図１５に、「才」の文字の認識決定の流れ
を示す。図１５によれば、文字認識部によって「才」と
は言い切れなかった場合、「０」判別部から「９」判別
部までの１０個の判別部によって、その文字かどうか、
その文字じゃないかどうかを判別する。この判別の結
果、「０」〜「９」まで全てのカテゴリでないことが分
かった場合は、その文字の認識結果を「才」とする。こ
のように、記入される文字のカテゴリが決まっていて、
ある１カテゴリの書かれる変形のバラエティが多く、認
識率が低い場合、他の文字ではないことを判別すること
によって、その文字の認識率を高めることができる。FIG. 15 shows a flow for determining the recognition of the character "sai". According to FIG. 15, when the character recognizing unit cannot say “senior”, whether the character is the character is judged by the 10 discriminating units from the “0” discriminating unit to the “9” discriminating unit.
Determine if it is not the letter. As a result of this determination, if it is found that all the categories from “0” to “9” are not found, the recognition result of the character is set to “age”. In this way, the category of characters to be entered is decided,
When there is a large variety of written transformations in a certain category and the recognition rate is low, the recognition rate of that character can be increased by determining that it is not another character.

【００５２】以上説明したように、上記実施の形態によ
れば、帳票上の罫線や記入欄の位置を自動的に抽出し、
この抽出した罫線や記入欄の情報をそのまま、あるい
は、ユーザがそれを指し示すだけで登録を行なうことが
できる。また、記入欄の属性も、記入欄の近傍に印刷さ
れている文字や、ユーザが帳票にあらかじめ手書きで書
いた属性を文字認識技術を用いて認識し、その結果を用
いることによって、自動的に登録できる。これにより、
ユーザのフォーム登録にかける手間が大幅に軽減され、
また、不慣れなユーザでも簡易にフォーム登録を行なう
ことができる。As described above, according to the above embodiment, the positions of ruled lines and entry fields on the form are automatically extracted,
The information on the extracted ruled lines and entry fields can be registered as it is, or the user can simply register the information. In addition, the attributes of the entry fields are automatically recognized by using the character recognition technology to recognize the characters printed near the entry fields and the attributes that the user has previously handwritten on the form, and using the results. You can register. This allows
The time and effort required for the user to register the form is greatly reduced,
Further, even an inexperienced user can easily perform form registration.

【００５３】また、ブランク帳票の画像を画像のままで
登録しておき、その画像を帳票認識時に使用して、認識
対象帳票の画像からプレ印刷部分を除去することによ
り、従来、記載された文字の切出しや認識に悪影響の多
かった罫線や印刷文字などのノイズを除去することがで
き、文字認識精度の向上が図れる。さらに、あらかじめ
ブランク帳票の背景画像を登録しておくことによって、
人力された認識対象帳票の画像から背景画像を消去し、
認識対象文字の切出し、文字認識の精度をあげることが
できる。Further, the blank form image is registered as it is, and the image is used at the time of form recognition to remove the preprinted portion from the image of the form to be recognized. It is possible to remove noises such as ruled lines and printed characters, which have a bad influence on clipping and recognition of characters, and improve the character recognition accuracy. Furthermore, by registering the background image of the blank form in advance,
Erase the background image from the image of the recognition target form that was manually operated,
It is possible to improve the accuracy of character segmentation and character recognition.

【００５４】[0054]

【発明の効果】以上詳述したように本発明によれば、ユ
ーザのフォーム登録にかける手間が大幅に軽減され、か
つ、不慣れなユーザでも簡易にフォーム登録を行なうこ
とができる帳票フォーム登録方法を提供できる。また、
本発明によれば、文字認識精度の向上が図れる帳票認識
方法を提供できる。As described above in detail, according to the present invention, a form form registration method is possible in which the user's time and effort for registering a form is significantly reduced, and even an inexperienced user can easily perform form registration. Can be provided. Also,
According to the present invention, it is possible to provide a form recognition method capable of improving character recognition accuracy.

[Brief description of drawings]

【図１】第１の実施の形態に係る罫線抽出装置の概略構
成を示すブロック図。FIG. 1 is a block diagram showing a schematic configuration of a ruled line extraction device according to a first embodiment.

【図２】罫線抽出結果の表示例を示す図。FIG. 2 is a diagram showing a display example of a ruled line extraction result.

【図３】第２の実施の形態に係る記入欄抽出装置の概略
構成を示すブロック図。FIG. 3 is a block diagram showing a schematic configuration of an entry field extraction device according to a second embodiment.

【図４】記入欄抽出処理の説明に用いる各種画像例を示
す図。FIG. 4 is a diagram showing various image examples used for explaining an entry column extraction process.

【図５】第３の実施の形態に係る記入欄属性検出例を説
明する図。FIG. 5 is a diagram illustrating an example of entry field attribute detection according to the third embodiment.

【図６】第３の実施の形態において、ユーザが帳票に記
入する数字と属性情報との対応表の一例を示す図。FIG. 6 is a diagram showing an example of a correspondence table of numbers and attribute information that a user fills in a form in the third embodiment.

【図７】第４の実施の形態に係る処理装置の概略構成を
示すブロック図。FIG. 7 is a block diagram showing a schematic configuration of a processing device according to a fourth embodiment.

【図８】第５の実施の形態に係る処理装置の概略構成を
示すブロック図。FIG. 8 is a block diagram showing a schematic configuration of a processing device according to a fifth embodiment.

【図９】第６の実施の形態に係る処理装置の概略構成を
示すブロック図。FIG. 9 is a block diagram showing a schematic configuration of a processing device according to a sixth embodiment.

【図１０】第６の実施の形態において、ブランク帳票画
像を用いた帳票認識処理を説明する図。FIG. 10 is a diagram illustrating a form recognition process using a blank form image according to the sixth embodiment.

【図１１】第７の実施の形態に係る処理装置の概略構成
を示すブロック図。FIG. 11 is a block diagram showing a schematic configuration of a processing device according to a seventh embodiment.

【図１２】第７の実施の形態において、テクスチャ画像
を用いた帳票認識処理を説明する図。FIG. 12 is a diagram illustrating a form recognition process using a texture image in the seventh embodiment.

【図１３】第８の実施の形態において処理する「才」の
文字を含んだ文字列の例を示す図。FIG. 13 is a diagram showing an example of a character string including a character of “age” to be processed in the eighth embodiment.

【図１４】第８の実施の形態において、「才」の文字を
伴った重量欄の認識手順を示すフローチャート。FIG. 14 is a flowchart showing a procedure for recognizing a weight column accompanied by the characters “sai” in the eighth embodiment.

【図１５】第８の実施の形態において、「才」の文字の
認識決定の流れを示す図。FIG. 15 is a diagram showing a flow of recognition determination of a character “sai” in the eighth embodiment.

【図１６】認識対象帳票の一例を示す図。FIG. 16 is a diagram showing an example of a recognition target form.

[Explanation of symbols]

１１…スキャナ（画像入力手段）、１２…濃淡画像記憶
部、１３…２値化部、１４…２値化画像記憶部、１５…
縦方向射影部、１６…横方向射影部、１７…縦方向射影
結果記憶部、１８…横方向射影結果記憶部、１９…罫線
抽出部。11 ... Scanner (image input means), 12 ... Gray image storage unit, 13 ... Binarization unit, 14 ... Binary image storage unit, 15 ...
Vertical projection unit, 16 ... Horizontal projection unit, 17 ... Vertical projection result storage unit, 18 ... Horizontal projection result storage unit, 19 ... Ruled line extraction unit.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ０６Ｔ 7/60 １８０Ｇ０６Ｔ 7/60 １８０Ａ２００２００Ｋ (72)発明者堀内秀雄神奈川県川崎市幸区柳町70番地株式会社東芝柳町事業所内 (72)発明者名取直毅神奈川県川崎市幸区柳町70番地株式会社東芝柳町事業所内 (72)発明者中尾昭彦神奈川県川崎市幸区柳町70番地株式会社東芝柳町事業所内 (72)発明者青木泰浩神奈川県川崎市幸区柳町70番地株式会社東芝柳町事業所内 (72)発明者浜村倫行神奈川県川崎市幸区柳町70番地株式会社東芝柳町事業所内Ｆターム(参考） 5B029 BB02 CC26 CC29 EE08 5B057 AA11 BA29 CA02 CA08 CA12 CB02 CB06 CB12 CE12 CH01 CH11 DA08 DB02 DB05 DB09 DC16 5B064 AA01 BA01 EA20 5L096 BA20 EA37 FA03 FA16 FA46 FA73 LA05 ─────────────────────────────────────────────────── ─── Continuation of front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) G06T 7/60 180 G06T 7/60 180A 200 200K (72) Inventor Hideo Horiuchi 70 Yanagicho, Yuki-ku, Kawasaki-shi, Kanagawa Address: Toshiba Yanagicho Office (72) Inventor Naoki Natori 70 Yanagicho, Saiwai-ku, Kawasaki City, Kanagawa Prefecture In-house Toshiba Yanagimachi Office (72): Akihiko Nakao 70, Yanagicho, Saiwai-ku, Kawasaki City, Kanagawa Toshiba Yanagimachi Business In-house (72) Yasuhiro Aoki 70 Yanagi-cho, Saiwai-ku, Kawasaki-shi, Kanagawa Prefecture, Yanagi-cho, Toshiba Office (72) Inventor Noriyuki Hamamura 70, Yanagi-cho, Sai-ku, Kawasaki-shi, Kanagawa F-Term, Toshiba Yanagi-cho, Ltd. (reference) 5B029 BB02 CC26 CC29 EE08 5B057 AA11 BA29 CA02 CA08 CA12 CB02 CB06 CB12 CE12 CH01 CH11 DA08 DB02 DB05 DB 09 DC16 5B064 AA01 BA01 EA20 5L096 BA20 EA37 FA03 FA16 FA46 FA73 LA05

Claims

[Claims]

1. A form on a form is registered in advance,
In recognizing the characters written on the form based on this registered form, an image of a blank form on which nothing is written is captured using the image input means, and the position of the ruled line from the captured image,
A form form registration method comprising extracting thickness, line type, etc., and registering the extracted ruled line information as form information.

2. The form on the form is registered in advance,
In recognizing the characters written on the form based on this registered form, an image of a blank form with nothing written is captured by using the image input means, and the captured image is surrounded by a solid line or a broken line. A form form registration method, characterized in that the extracted area is extracted as an entry field, and the information in the extracted entry field is registered as form information.

3. The image captured by the image input means is
3. The form form registration method according to claim 2, wherein the binarized image is binarized, the black pixels of the binarized image are regarded as the background to extract the connected component, and the entry field is extracted using the extracted information.

4. The form registration according to claim 2, wherein the item name printed near the extracted entry field is recognized, and the recognition result is registered in the form information as an attribute of the entry field. Method.

5. The character or mark written by the user in the extracted entry field or in the vicinity thereof is recognized, and the recognition result is registered in the form information as an attribute of the entry field. Report form registration method.

6. A form on a form is registered in advance,
In the one that recognizes the characters written on the form based on this registered form, each image of a plurality of forms is captured by using the image input means, and the plurality of captured images are preprinted on all the forms. The form form registration method is characterized in that the extracted information is extracted and the extracted information is registered as form information.

7. The total value of pixels obtained by adding the density values of pixels corresponding to the same position in a plurality of captured images and dividing the pixel at each position into a pixel having a high sum and an average value and a pixel having a low value. 7. The form form registration method according to claim 6, wherein ruled lines, characters, background patterns, and the like that are printed in advance on the sheet are extracted.

8. A form on a form is registered in advance,
In the case of recognizing characters written on a form based on this registered form, each image of a plurality of completed forms is captured using image input means, and character recognition is performed on the captured multiple images. The form form registration method characterized in that the attribute of the entry field is determined based on the character recognition result of the same entry field of the plurality of forms, and the determined attribute is registered in the form information.

9. The form on the form is registered in advance,
When recognizing the characters written on the form based on this registered form, at the time of form registration, an image of a blank form that has no description is captured using the image input means, and at least one of the captured images is captured. The part is registered together with the form information, and at the time of character recognition, the ruled lines and characters etc. printed in advance on the form are input from the image of the input form to be recognized using the blank form image previously registered. A form recognition method, which comprises removing and then performing character recognition on the form.

10. A form recognition method for recognizing a character described in an image on a form, wherein an image on a form in which no character is described is registered in advance, and a recognition target form input at the time of character recognition. A form recognition method, characterized in that the image on the form is removed from the image using the previously registered image, and then character recognition is performed on the form.

11. A form recognition method for recognizing a character string written in a weight column of a form used in the transportation industry, wherein a character string in the weight column is recognized, and a weight unit portion of the recognition result indicates a weight unit. In the case of the character "zai" shown, the recognition result of the numeral which is the weight part of the recognition result is multiplied by a predetermined numerical value and output, and the weight unit part of the recognition result is other than the character "zai". In this case, the form recognition method is characterized in that the recognition result of the numeral which is the weight portion of the recognition result is output as it is.

12. A form recognition method for recognizing a specific character written in a specific entry field of a form when a limited number of character types appear in the particular entry field, When recognizing a specified specific character, if it can be determined that the specified character is not all the character types except one of the possible character types, the recognition result of the specific character is regarded as the remaining one character. A form recognition method characterized by the above.