JP3720078B2

JP3720078B2 - Multicolor character image reader

Info

Publication number: JP3720078B2
Application number: JP15949095A
Authority: JP
Inventors: 康裕岡田; 美由紀林; 敬平野
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1995-06-26
Filing date: 1995-06-26
Publication date: 2005-11-24
Anticipated expiration: 2020-11-24
Also published as: JPH0916714A

Description

【０００１】
【産業上の利用分野】
本発明は標準として設定した帳票と比較して、多色の文字画像の入力から画像の抽出、文字の認識を行う多色の文字画像読取装置に関するものである。
【０００２】
【従来の技術】
従来の帳票読取装置は、帳票作成時に使用した色指定情報を用いてドロップアウトさせるデータと、認識処理の対象とするデータに変換して認識対象のみに文字認識処理を行っていた。
図３４は、特開平２−１２３４８８号公報に開示されている従来の帳票読取装置を示すブロック図である。図において、５１１はカラーセンサ、５１２は帳票、５１３はレンズ、５１４は量子化部、５１５、５１６、５１７はイメージメモリ、５１８は制御部、５１９は色判定制御部、５２０は文字認識部である。
図３５は、フィールド枠、文字枠と記入文字が存在する画像に対する量子化部の出力例を示す図である。図において、５３１はフィールド枠、５３２は文字枠、５３３は記入文字である。図中、実線、斜線、網かけは色の違いを示す。
図３６は、文字切り出しの動作説明図である。図において、５４１は文字切り出し位置である。
図３７は、文字切り出し結果を示す図である。図において、５５１、５５２、５５３、５５４は個々の文字切り出し結果である。
図３８は、フィールド枠、プレプリントと記入文字が存在する画像に対する量子化部の出力例を示す図である。図において、５６１はフィールド枠、５６２はプレプリント、５６３は記入文字である。図中、実線、斜線、網かけは色の違いを示す。
図３９は、文字列切り出し結果を示す図である。図において、５７１はプレプリント”年”に対する文字列、５７２はプレプリント”月”に対する文字列である。
【０００３】
次に上記構成の装置の動作について説明する。
まず読取領域を示す枠が任意の色で印刷され、文字が黒色で記録された帳票５１２と、読取領域ごとに記入された色を指定するコード（色指定コード）と指定された色をドロップアウトさせるか、または認識処理の対象とするかを指定するコード（認識指定コード）を格納したフォーマット情報を用意する。
カラーセンサ５１１で、帳票５１２を読み取り赤、青、緑の各成分に分解して光電変換を行い、量子化部５１４で量子化したデータをイメージメモリ５１５〜５１７に格納する。制御部５１８は、外部から与えられるフォーマット情報に設定された色指定コードから各色成分毎の色成分データに変換し、色判定制御部５１９に出力する。色判定制御部５１９は、色成分データとイメージメモリ５１５〜５１７に格納された読取領域内画素位置の濃度情報との比較を行い、判定結果に基づいて２値の帳票イメージを生成する。文字認識部５２０は、２値イメージについて文字認識処理を行う。
例えば、フォーマット情報に文字枠の印刷色を示す色指定コードとそれをドロップアウトさせることを示すコードを設定した場合、色判定制御部５１９では色指定コードに属する色は消去され白データに、それ以外の色は黒データに変換する。文字認識部５２０では、そのイメージを対象に文字を認識する。
【０００４】
上記により動作する従来装置に対して図３５に示す画像が量子化部５１４から得られ、フォーマット情報に文字枠の印刷色、フィールド枠の印刷色を示す色指定コードとそれをドロップアウトさせることを示すコードを設定した場合、色判定制御部５１９では文字枠５３２とフィールド枠５３１を白データに、記入文字５３３を黒データに変換する。
さらに、記入文字５３３が図３５のように文字が文字枠の中央に記入されず文字同士が近接して記入された場合や帳票画像がずれて入力された場合には、文字切り出しが正常に動作せず、図３６に示す誤った文字切り出し位置５４１が得られる。元の枠はドロップアウトされて区分が不明になり、この結果、図３７の５５１と５５２に示す誤った文字切り出し結果が得られる。従って、文字認識部５２０では誤った認識結果を出力する。
【０００５】
また、上記により動作する従来装置に対して図３８に示す画像が量子化部５１４から得られ、フォーマット情報にプレプリントの印刷色、フィールド枠の印刷色を示す色指定コードとそれをドロップアウトさせることを示すコードを設定した場合、色判定制御部５１９ではフィールド枠５６１とプレプリント５６２を白データに、記入文字５６３を黒データに変換する。
色判定制御部５１９から出力された図３９に示す２値イメージについて、文字認識処理を行う。この時既に、図３８に示すプレプリント５６２は画像から消滅してしまうため、プレプリントの区切りがわからない。そのため、図３９に示すように”年”に対応して”Ｓ６１”５７１の画像が、”月”に対応して”２”５７２の画像が抽出され、誤った文字列切り出し結果が得られる場合がある。
【０００６】
【発明が解決しようとする課題】
従来の帳票読取装置は以上のように構成されていて区分用の枠はドロップアウトで除去され、記入文字と記入文字以外の帳票構成要素との相対関係が不明であるため、正確に画像抽出や文字認識ができないという課題があった。
また更に、帳票の読取を行う前に、あらかじめ読み込む帳票に対応したフォーマット情報を特定しておかねばならず、登録外の複数種類の帳票を混在して読み込むことができない。また読取帳票の新規設計や変更を行う場合、フォーマット情報に色指定コードを逐一設定する必要があり、フォーマット情報の設定作業に手間がかかるという課題もあった。
【０００７】
本発明は上記のような課題を解決するためになされたものであり、複数種類のフィールドが混在した多色の文字画像の入力に対して正確に画像抽出や文字認識が行え、また、フォーマットの設定フィールドに対して色情報を容易に設定できる多色文字画像読取装置を得ることを目的とする。
【０００８】
【課題を解決するための手段】
この発明に係る多色文字画像読取装置は、文字画像入力の指定された色毎に範囲を設定して範囲位置と色とをフィールド毎にまとめた標準帳票を作成する多色帳票読取フォーマット作成変更手段と、この多色帳票読取フォーマット作成変更手段で作成された標準帳票と文字画像入力とを比較して入力の帳票の種別を判定する帳票種別判定手段と、この帳票種別判定手段で判定した種別に基づいて上記文字画像入力から対象領域を抽出する画像抽出手段と、この画像抽出手段で抽出した読取領域と帳票種別判定手段で判定した種別の標準帳票の対応フィールドの設定値とから読取対象文字を切り出す文字切り出し手段と、この文字切り出し手段で切り出された領域毎の読取対象文字に文字認識処理を行い文字を認識する文字認識手段を備えた。
【０００９】
また基本構成に更に、文字画像入力に対して類似した色成分を持つ画素を統合して指定の色とする色統合手段を付加した。
【００１０】
また基本構成に更に、画像抽出手段で抽出した画像と、上記文字認識手段出力の認識結果を出力する結果出力手段を付加した。
【００１１】
また更に、多色帳票読取フォーマット作成変更手段は、特定の色に対応する画像のみを抽出して出力し、範囲位置と色の設定を変更できるようにした。
【００１２】
また更に、帳票種別判定手段は、標準帳票と、文字画像入力の各々の色情報で帳票種別を判定するようにした。
【００１３】
また更に、帳票種別判定手段は、標準帳票と文字画像入力の各々の色情報に加え、各色毎のフィールド毎にデータを対応付けて帳票種別を判定するようにした。
【００１４】
また更に、文字切り出し手段は、フィールドの色情報と、文字画像入力との相対位置情報を用いて文字画像入力の切り出しを行うようにした。
【００１５】
また更に、画像抽出手段は、対象領域から複数の色情報を抽出して重畳出力するようにした。
【００１６】
【作用】
この発明による多色文字画像読取装置は、入力された文字画像情報からフィールド毎に範囲位置と色とのデータがまとめられて標準帳票が記憶され、新しい文字画像入力はこれら標準帳票と比較されて帳票の種別が判定され、判定された標準帳票のデータに基づいて対象領域が抽出され、更に標準帳票の領域とから文字が切り出され、切り出された文字に対して文字認識がされる。
【００１７】
また更に、入力の文字画像情報で類似の色成分を持つ情報はまとめて１つの色であるとして記憶または比較される。
【００１８】
また更に、文字画像入力から抽出した読取領域の画像と、文字認識結果が重畳して出力される。
【００１９】
また更に、入力の文字画像または標準帳票の特定範囲の画像が抽出され、範囲位置と色の設定が自由に変更できる。
【００２０】
また更に、標準帳票と、文字画像入力の各々の色情報が比較されてそれに基づいて帳票種別が判定される。
【００２１】
また更に、標準帳票と文字画像入力の各々の色情報に加え、各色毎のフィールド毎のデータが比較されて帳票種別が判定される。
【００２２】
また更に、フィールドの色情報と文字画像入力との相対位置情報により文字画像入力の切り出しが行われる。
【００２３】
また更に、文字画像情報の対象領域から複数の色情報が抽出され、重畳出力される。
【００２４】
【実施例】
実施例１．
図１は本発明の一実施例である多色文字画像読取装置の全体構成を示す構成ブロック図である。本実施例では、全体構成のうち、読み込んだ帳票のフィールドと称する部分領域毎の領域範囲とその色を確定していく多色帳票読取フォーマット作成変更手段の構成動作を説明する。
図１において、１は多色帳票を画像入力装置に入力することにより得られる多色帳票画像を示す。２は上記多色帳票画像から画像抽出して後に述べるように多くのフィールドに区分して属性をまとめて標準帳票として登録された多色帳票読取フォーマット群を記憶したメモリである。３は上記の多色帳票読取フォーマットに含まれるデータの作成・変更を行う多色帳票読取フォーマット作成・変更手段、４は上記多色の文字画像の入力に対して類似した色成分を持つ画素を統合し、上記多色帳票画像の色成分値を代表値に分類する色統合手段、５は上記多色帳票読取フォーマット群に登録された個々の標準帳票に含まれる読取領域及び色の情報と、上記色統合手段によって統合された画像情報とを比較し、対応する多色帳票読取フォーマット（標準帳票）を特定して帳票の種別を判定する帳票種別判定手段、６は上記色統合手段によって作成された画像情報から多色帳票読取フォーマットに規定される特定領域の画像情報を抽出する画像抽出手段、７は上記画像抽出手段によって抽出された画像情報から文字の切り出しを行い、文字領域の画像を出力する文字切り出し手段、８は上記文字切り出し手段によって得られた画像を認識し、対応する文字コード及び色情報を出力する文字認識手段である。９は上記画像抽出手段と上記文字切り出し手段から出力される画像および上記文字認識手段から出力される文字コード、色情報を編集出力する結果出力手段である。
【００２５】
図２は、多色帳票読取フォーマット作成・変更手段の詳細構成図である。即ち、帳票サイズや帳票を構成するフィールド枠・文字枠・記入文字などの帳票構成要素の帳票内位置・サイズ・名称・色などのデータを得て、帳票内の文字の認識や特定領域内の画像を抽出するために参照する帳票読取フォーマット（標準帳票）を作成したり変更したりする。
図において、多色帳票画像１は、画像入力装置で入力された帳票画像である。多色画像表示制御手段２２は、多色帳票画像１の入力範囲として帳票画像内の表示範囲を決定する。通常は、帳票画像全体を表示するような決定を行う。多色画像表示手段２３は、帳票画像１のうち多色画像表示制御手段２２によって決定された範囲を表示する。画像範囲指定手段２４は、たとえばマウスのような座標入力装置を用いて、多色画像表示手段２３によって表示された多色画像のうち、特定領域または帳票構成要素を指定・選択する。
指定範囲への色情報設定／変更手段２５は、画像範囲指定手段２４によって指定・選択された多色帳票画像１内の特定領域または帳票構成要素に対して、色情報を設定／変更する。読取フォーマット構造設定手段２６は、多色帳票画像１内の文字を認識したり特定領域を抽出したりする際に参照するフィールド数・フィールド位置・フィールド内文字数などのフォーマットデータを設定する手段である。この手段によって、読取フォーマットの具体的なデータ（フィールド数が４であるなど）のうち、色に関するデータが空または既定値（任意の値）の、読取フォーマットが作成される。読取フォーマットへの色情報設定手段２７は、指定範囲への色情報設定／変更手段２５によって設定／変更された色情報を、読取フォーマット構造設定手段２６が作成した読取フォーマットに設定する。
【００２６】
図３は多色帳票読取フォーマットのデータの例を示す説明図である。図において、３１はフィールドを特定する項目、３２はフィールド枠色の属性要素を指示する項目、３３は３２の属性要素に対する指定値、３４はフィールドを特定する項目、３５はフィールド内の文字枠色の属性要素を指示する項目、３６は３５の属性要素に対する指定値である。３７はフィールド１の文字切り出し条件の指定値、３８はフィールド１の文字認識対象、３９は背景色の指定値、３０１はフィールド２の文字切り出し条件（１）の指定値、３０２はフィールド２の文字切り出し条件（２）の指定値、３０３はフィールド２のプレプリント（１）の位置指定値、３０４はフィールド２のプレプリント（２）の位置指定値、３０５はフィールド２のプレプリント色の指定値、３０６はフィールド２の文字認識対象の指定値、３０７は記入文字色の指定値、３０８はフィールド３の画像切り出し条件の指定値、３０９はフィールド３のフィールド枠色の指定値、３１０はフィールド３の画像切り出し対象色（１）の指定値、３１１は画像切り出し対象色（２）の指定値、３１２は多色帳票読取フォーマット、３１３は色情報である。
【００２７】
図４は多色帳票画像の例を示す図である。図において、４０はプレプリントされた”社員番号”の文字、４１は同じくプレプリント”入社年月”、４２はプレプリント”氏名（印）”、４３はプレプリント”自宅周辺図”、４４は社員番号に対応する記入フィールド枠、４５は入社年月に対応する記入フィールド枠、４６は氏名（印）に対する記入フィールド枠、、４７は自宅周辺図に対するフィールド枠、４８は社員番号に対するフィールド内文字枠、４９は入社年月に対するフィールド内プレプリントである。
【００２８】
次に、図２にしたがって、多色帳票読取フォーマット２８を作成する手順を説明する。ここで、多色画像表示制御手段２２は通常動作を行い、多色画像の全体を表示することを決定するものとする。
読取フォーマット構造設定手段２６が作成する読取フォーマットの例を図３に示す。読取フォーマットは、１項目につき１行で設定され、たとえば項目３１にはフィールドを特定するデータ、「フィールド１」と記述されている。項目３２は項目３２に関する属性を表す詳細項目であり、「フィールド１」の「フィールド枠色」に関してのデータが設定されていることを記述する。項目３３は項目３１・項目３２に対する具体的なデータを表し、「黒」「赤」などの色名もしくは色の成分値が記述されるが、図３においては項目３３はまだ設定されていない。
【００２９】
多色帳票画像１を読み込み、多色画像表示手段２３によって表示している例を図４に示す。操作者は、図４に示される多色画像を見ながら、画像範囲指定手段２４を用いてその一部を選択または範囲指定する。たとえば、図４中のフィールド枠４５に示される帳票構成要素を選択したものとする。
画像からの色情報獲得手段２９は、画像範囲指定手段２４により指示された指定範囲内画像の色情報を取得する。操作者が図４のフィールド枠４５を選択した状態で、図３のフィールド枠色の設定値３３部分を選択すると、読取フォーマットへの色情報設定手段２７は画像からの色情報獲得手段２９により取得した色情報を図３のフィールド枠色の設定値３３に設定する。
【００３０】
本実施例では、色の指定が１種類の例を示したが、これに限らず複数の色を指定することもできる。
以上のように本実施例によれば、操作者は多色帳票画像中の帳票構成要素または特定領域を選択／範囲指定するだけで、多色帳票読取フォーマットに色情報を容易に設定することができ、多色帳票読取フォーマット（標準帳票）を設定することができる。
【００３１】
実施例２．
本実施例では、全体構成の中の色統合手段と、帳票種別判定手段の動作を説明する。
図５は多色帳票画像の例であり、図４の多色帳票画像にデータが記入されたものである。図において、点線で囲まれた領域５０は社員番号記入欄の領域である。
図６は色統合手段４の動作を説明するための図５の枠５０を拡大して色の区別を表した図である。即ち、社員番号記入欄の領域５０に対する処理例である。
図７は、帳票種別判定手段５の動作を説明するための図であり、図５とは異なった文字画像入力に対し、標準帳票に記述した色情報に従い、要素データを分類した色分類要素データである。図において、７１は色Ａに対応した色分類要素データ、７２は色Ｂに対応した色分類要素データ、７３は多色帳票読取フォーマットに未登録の色に対応した色分類要素データである。
図８は、帳票種別判定手段５の動作を説明するための図であり、多色帳票読取フォーマットと色分類要素データとの照合動作を説明するものである。図において、８１は多色帳票読取フォーマットである。
図９は、帳票種別判定手段５の動作を説明するための、図８とは異なった文字画像の図である。図において、９０は色分類要素データ上の文字と対応がつかない多色帳票読取フォーマットに記述された文字、９１は色分類要素データ上の表領域と対応つかない多色帳票読取フォーマットに記述された表領域、９２は多色帳票読取フォーマットに記述された表領域と対応付かない色分類要素データ上の表領域、９３は多色帳票読取フォーマットに記述された文字と対応は付くが色が異なっている色分類要素データ上の文字、９４は８１とは異なる多色帳票読取フォーマットである。
【００３２】
次に動作について説明する。
図５に示した多色帳票画像を参照しながら、色統合手段４と帳票種別判別手段５の動作を説明する。
図５に示した多色帳票画像の部分領域５０に対して色統合手段４を適用した場合、色統合手段４では多色帳票画像上の各画素毎に色成分値を調べ、類似した色成分値を有する画素を幾つかの代表値に分類する。この処理は例えば、色成分値によるクラスタリングを行うことにより実現できる。
次に、同じ代表値に分類される画素について領域の連続性を求めてグループ化する。これにより、同じ代表値に分類されかつ画像上で連接する画素群が抽出できる。この処理は例えば、各画素の代表値を用いたラベリング処理を行うことにより実現できる。これにより、図６に示した６１で示す画素群を抽出する。
最後に、６１で示す画素群について要素データ（各要素毎の位置と形状、代表値の色成分値）を算出する。これを図３の対応するフィールドのデータとして記憶する。
実際の処理は多色帳票画像全体に対して行う。即ち、入力の帳票の全部のフィールドについて処理を行い、データを記憶する。
【００３３】
色統合手段４の終了後、帳票種別判定手段５を実行する。
帳票種別判定手段５では、多色帳票読取フォーマット群２と色統合手段４で抽出したフィールド毎の要素データをもとにして、入力した多色帳票の文字画像１の帳票種別を判定する。
まず、多色帳票読取フォーマット群２から１つの標準帳票を選ぶ。
次に、選んだ標準帳票に記述した色情報に従ってフィールド毎の要素データを分類し、色分類要素データを作成する。
ここで、選んだ標準帳票にＮ種類の色が記述されている場合、要素データはＮ＋１個（Ｎ種類の色＋それ以外の色）に分類する。例えば、入力の文字画像の要素データが要素Ａ（色のＲＧＢ成分値がＲ１００，Ｇ５０，Ｂ７０）、要素Ｂ（Ｒ１００，Ｇ６０，Ｂ７０）、要素Ｃ（Ｒ５０，Ｇ５０，Ｂ５０）、要素Ｄ（Ｒ２０，Ｇ２０，Ｂ１０）であり、一方、標準帳票には色Ａ（Ｒ１００±２０，Ｇ５０±１０，Ｂ７０±１０）、色Ｂ（Ｒ５０±５，Ｇ５０±１０，Ｂ５０±１０）が記述された場合、要素Ａ、Ｂが色Ａに、要素Ｃが色Ｂに分類され、要素Ｄは標準帳票に未登録の色として分類される。
具体的には、図５の入力の文字画像の例に基づいて、図７に示す色分類要素データが作成できる。
【００３４】
次に、作成した色分類要素データに基づいて文字と表領域、罫線を抽出する。
即ち、図８に示すように、色分類要素データとして抽出した文字、表領域を標準帳票８１に記述した文字、表領域と対応付ける。ここで対応付けは、帳票上に描かれた文字や表領域の位置、大きさ、色、文字コード、表の構造等の情報を用いて最適となるように行う。
更に、対応付けの結果を基にして、標準帳票と入力された多色帳票画像１の類似度を判定する。
この類似性は、文字、表領域を対応付けた際の、位置、大きさ、色、文字コード、表の構造等の違いや、対応が付かずに残った文字、表領域の個数等から求める。ここで、表の構造の違いは、多色帳票読取フォーマットに記述した表を構成する罫線と、色分類要素データとして抽出した罫線とを対応付ける手法により求める。これは例えば、出願平６−７６７９５、「表を含む帳票処理装置」により実現できる。
例えば、図９は、図８に示したものとは異なる多色帳票読取フォーマット９４で各フィールド毎の色分類要素データの文字、表領域対応付け結果を示している。ここで多色帳票読取フォーマット上の文字９０と表領域９１は、色分類要素データの既知の文字、表領域と対応付かない。また、色分類要素データとして抽出した表領域９２は標準帳票上の表領域と対応が付かない。さらに、色分類要素データとして抽出した文字９３は、標準帳票上の文字とは対応が付くが、その色が異なる。
そのため、例えば図９に示した他の標準帳票９４と、入力の図５の多色帳票画像１との類似度は、図８の標準帳票８１との類似度に比べて低くなる。
【００３５】
以降、上記処理を多色帳票読取フォーマット群２に格納された複数の標準帳票に対して行ない、各標準帳票毎に類似度を算出する。そして、類似度が最も高い標準帳票が、入力した多色帳票画像（文字画像）１の種別であるとする。
ただし、最も高い類似度であっても、類似の度合いが所定値より低い場合や、（最も高い類似度） −（２番目に高い類似度）が小さい場合は、入力した多色帳票画像１の種別判定が不能として、入力の多色帳票画像１を棄却する。
ここで棄却されない場合は、作成した色分類要素データと、類似度が最も高い標準帳票の番号と、文字、表領域の対応付け結果を出力する。
【００３６】
本実施例では各フィールドでの画素単位に帳票種別判定を行なう例を示したが、複数画素からなる小領域を単位として帳票種別判定を行なうことも可能である。
本実施例では、帳票種別を判定するため文字と表領域を対応付ける例を示したが、文字、表領域以外の写真領域等も同時に対応付けることが可能である。
以上のように本実施例によれば、複数種類の帳票を混在した読取においても帳票の種別が判定でき、帳票の印刷ムラなどが発生した画像品質の悪い多色帳票画像に対しても正確な帳票種別判定が行える。
【００３７】
実施例３．
本実施例では、多色帳票読取フォーマット作成変更手段に付加機構を加えて特定の色の画像のみを抽出したり、その色情報を変更する動作を説明する。
図１０は、多色帳票読取フォーマット作成・変更のための操作ボタンを示した図である。図において、１１は色範囲縮小ボタン、１２は色範囲拡大ボタンを示す。
【００３８】
色情報を標準帳票に設定した後、設定した色情報が正確でないために画像抽出／文字認識に不具合が発生する場合に、読取フォーマットの色情報を変更する手順について説明する。
例えば、多色帳票読取フォーマットが図３のように定まっているものとし、色情報３３は「色Ｂ」に、色情報３６は「色Ｃ」に既に決まっているものとする。
次に、読取対象の帳票を画像入力装置に入力し多色帳票画像１を得る。図３に示す情報を格納した標準帳票のみを多色帳票読取フォーマット群２に格納し、この画像データを色統合手段４に入力した上で、帳票種別判定手段５を動作させる。こうして、多色帳票画像１と標準帳票との対応をとったフィールド毎の色分類要素データが得られる。この色分類要素データを多色画像表示制御手段２２に入力する。
多色画像表示制御手段２２では、画像全体を表示させるような決定を行う通常モードではなく色情報修正モードがあり、色情報の値が「色Ｃ」である色分類要素データのみを表示させるような制御を行う。
多色画像表示制御手段２２の制御により、多色画像表示手段２３によって色情報の値が「色Ｃ」であるような色分類要素データのみが表示される。図３に示す読み取りフォーマットによれば、色情報の値が「色Ｃ」であるような帳票構成要素は「フィールド１」の「文字枠色」であるから、多色帳票画像のうち、文字枠４８のみが表示されるはずである。
【００３９】
操作者は図４の文字枠４８のみが表示されているかどうかを画面上で確認し、もし文字枠４８が表示されなかったり、文字枠４８以外にも帳票構成要素または特定領域が表示されている場合には、多色画像表示制御手段２２のモードを多色帳票画像全体を表示させるように変更する。
たとえば「色Ｃ」と「色Ｂ」が似ているために、文字枠４８のみならずフィールド枠４４まで表示されてしまったとする。操作者は、指定範囲への色情報設定／変更手段２５により、色Ｃの色範囲を変更する。この場合は、色範囲縮小ボタン１１を押下し色Ｃの色の範囲を縮小することにより、色Ｂとの混同を防止することができる。
また文字枠４９が表示されなかった場合は、色範囲拡大ボタン１２を押下し色Ｃの色の範囲を拡大することにより、文字枠４９を抽出できるように色の調整をする。
【００４０】
本実施例では、色範囲の設定を縮小・拡大で設定する例を示したが、数値の直接入力、赤、青、緑などの色属性単独の調整、最大値のみ・最小値のみを対象とした変更などによって行うことも可能である。
以上にように本実施例によれば、多色帳票読取フォーマットに設定された色情報が実際に有効に作用するかどうかを簡単に調べ、不都合があっても簡単に修正することができる。
【００４１】
実施例４．
本実施例では、文字切り出し手段の動作を説明する。
図１１は、文字枠がある場合の文字切り出し手段７の動作を説明するフローチャートである。
図１２は、フィールド枠色に対応した画像情報を示した図である。図において１３１は図３５に示した画像５３１と同じものである。
図１３は、文字枠色に対応した画像情報を示した図である。図において１３２は図３５に示した画像５３２と同一である。
図１４は、標準帳票１３３に未登録の色に対応した画像情報を示した図である。図において１３３は図３５に示した画像５３３と同一である。
図１５は、文字枠色に対応した画像情報に対する文字枠切り出しの動作を説明する図である。図において、１５１は文字枠切り出し位置である。
図１６は、文字認識対象画像情報と文字枠切り出し位置との関係を示した図である。図において１５１は図１５に示した同番の画像と同じものである。
図１７は、文字切り出し結果を示した図である。
【００４２】
次に、文字枠がある場合の文字切り出し手段の動作を説明する。
図１１のフローチャートをもとに、図３５の画像領域に対する動作を図１２ないし、図１７の画像情報例を参照しながら説明する。また、標準帳票は図３のように定まっているものとし、色情報３３は「色Ｂ」に、色情報３６は「色Ｃ」に既に決まっているものとする。
また、入力の文字画像の認識に際して、各領域の位置は各々相対的に認識されるものとする。
【００４３】
帳票種別判定手段５から得た領域に対応する画像情報を対象として文字切り出しを行う。帳票種別判定手段５から得た領域に対応する画像情報は、標準帳票に登録された色で分類されている。色Ｂに対しては図１２に示す画像、色Ｃに対しては図１３に示す画像、標準帳票に未登録の色に対しては図１４に示す画像が色統合手段４および帳票種別判定手段５により得られている。
まずステップＳ１１１で、文字切り出し対象画像の抽出を行う。この場合、図３に示すように標準帳票には文字切り出し条件として、文字枠準拠３７が選択されている。さらに、文字枠色は標準帳票により色Ｃ３６に設定されている。そこで文字切り出し対象画像として色Ｃを選択し、図１３に示す色Ｃに対する画像情報を抽出する。
【００４４】
ステップＳ１１２では、図１３の画像を対象に文字枠切り出し処理を行う。文字枠切り出しは、垂直方向に画像ヒストグラムを抽出し、多色帳票読取フォーマットに格納された文字枠ピッチを元に等ピッチで切り出し候補点を設定する。さらにこの切り出し候補点を左右にずらして、切り出し候補点上のヒストグラム値の和が最小となる候補点列を文字枠切り出し位置とする。これにより図１３に対して、図１５に示した文字枠切り出し位置１５１が得られる。
【００４５】
ステップＳ１１３では、文字認識対象画像の抽出を行う。この場合、多色帳票読取フォーマットには文字認識対象として、（フィールド枠色以外かつ背景色以外かつ文字枠色以外）３８が選択されている。また、背景色には色Ｆ３９、フィールド枠色には色Ｂ３３、文字枠色には色Ｃ３６が設定されている。そこで文字認識対象として色Ｆ以外かつ色Ｂ以外かつ色Ｃ以外に該当する色を選択する。これにより、文字認識対象画像として図１４の画像を得る。
【００４６】
ステップＳ１１４では、ステップＳ１１３で得た文字認識対象画像に、各々の画像の相対位置は先に認識されているので、ステップＳ１１２で得た文字枠切り出し位置を重ねて、文字を切り出す。図１６に文字認識対象画像と文字切り出し位置の関係、図１７に文字切り出し結果を示す。
文字認識は文字認識手段８により、図１７の文字切り出し結果の個々の切り出し結果について認識を行う。これは極く通常に行われている手段なので構成と動作の詳細説明は省略する。
【００４７】
本実施例では、文字枠、フィールド枠の形状は実線の長方形で構成されているが、円やひし形などの他の形状、また破線、波線などを用いてもよい。
また本実施例では、文字枠色、フィールド枠色、文字認識対象の色指定が１種類の例を示したが、これに限らず複数種類の色を指定することができる。
また本実施例では、最終的に文字を認識する例を示したが、これに限らず文字切り出しの結果画像を出力することもできる。
以上のように本実施例によれば、フィールド毎の色情報と位置情報を得て相対位置が定まるので、多色帳票に記入した文字が文字枠の中央に記入されず文字同士が近接して記入された場合や帳票画像がずれて入力された場合でも、正確に文字を切り出すことができ、ひいては文字を正確に認識することができる。
【００４８】
実施例５．
本実施例では、文字切り出し手段の動作を説明する。
図１８は、プレプリントがある場合の文字切り出し手段７の動作を説明するフローチャートである。
図１９は、帳票種別判定手段５から得た領域に対応する帳票画像の例を示す図である。図において、１９１はフィールド枠、１９２はプレプリント画像（１）、１９３はプレプリント画像（２）であり、１９４は記入文字である。
図２０は、図１９に示した領域内でのフィールド枠色に対応した画像情報を示した図である。図において１９１は図１９のそれと同一画像である。
図２１は、図１９に示した領域内でのプレプリント色に対応した画像情報を示した図である。図において１９２、１９３は図１９のそれと同一でそれぞれ年、月のプレプリントである。
図２２は、図１９に示した領域内での記入文字色に対応した画像情報を示した図である。図において１９４は図１９のそれと同一である。
【００４９】
図２３は、文字記入領域抽出の処理内容を説明するための枠等の画像図である。図において、２３１はプレプリント（１）に対する文字記入領域（１）、２３２はプレプリント（２）に対する文字記入領域（２）である。
図２４は、文字列切り出しの動作を説明するための画像図である。図において、１９４は記入文字、２３１、２３２は図２３で示した枠である。
図２５は、文字列切り出しの結果を示す図である。図において、２５１はプレプリント（１）に対応する文字列切り出し結果、２５２はプレプリント（２）に対応する文字列切り出し結果であり、１９４は図１９に示す記入文字である。
図２６は、文字切り出しの結果を示す図である。図において、２６１はプレプリント（１）に対応する第１文字目の文字切り出し結果、２６２はプレプリント（１）に対応する第２文字目の文字切り出し結果、２６３はプレプリント（２）に対応する第１文字目の文字切り出し結果、２６４はプレプリント（２）に対応する第２文字目の文字切り出し結果である。
【００５０】
次に、プレプリントがある場合の文字切り出し手段の動作を説明する。
図１８のフローチャートをもとに、図１９の画像入力に対する動作を、図２０ないし図２６の画像情報例を参照しながら説明する。
帳票種別判定手段５から得た領域に対応する画像情報を対象として文字切り出しを行う。帳票種別判定手段５から得た領域に対応する画像情報は、標準帳票に登録された色で分類されている。色Ｂに対しては図２０に示す画像、色Ｄに対しては図２１に示す画像、色Ｅに対しては図２２に示す画像が色統合手段４により得られている。
まずステップＳ１８１で、プレプリント切り出し対象画像の抽出を行う。この場合、図３に示す多色帳票読取フォーマットには、文字切り出し条件（１）としてプレプリント文字（１）準拠３０１、文字切り出し条件（２）としてプレプリント文字（２）準拠３０２が選択されている。さらに、プレプリント文字（１）の記入座標３０３、プレプリント文字（２）の記入座標３０４とともに、プレプリント色が色Ｄ３０５が設定されている。そこでプレプリント切り出し対象画像として色Ｄを選択し、図２１に示す色Ｄに対する画像情報を抽出する。
【００５１】
ステップＳ１８２では、図２１に示す画像情報を対象にプレプリント切り出し処理を行う。プレプリント切り出しは垂直方向に画像ヒストグラムを抽出し、多色帳票読取フォーマットに格納されたプレプリントの位置を用いて切り出す。
この場合プレプリント文字は２つあり、プレプリント（１）の記入座標３０３がプレプリント（２）の記入座標３０４より左にあることを利用して、図２１のプレプリント画像（１）１９２がプレプリント（１）に、プレプリント画像（２）１９３がプレプリント（２）と対応が付く。
次に、図２０のフィールド枠色に対する画像情報と、図２１のプレプリント色に対する画像を重畳した図２３の画像情報を用いて文字記入領域の抽出を行う。プレプリント（１）に対応付いたプレプリント画像（１）１９２より左方にある領域を文字記入領域（１）１９１として抽出し、プレプリント（２）に対応付いたプレプリント画像（２）１９３より左方、プレプリント（１）に対応付いたプレプリント画像（１）１９２より右方にある領域を文字記入領域（２）２３２として抽出する。文字記入領域の上端・下端は、フィールド枠１９１の上下端位置を基準に求める。
【００５２】
ステップＳ１８３では、文字認識対象画像の抽出を行う。この場合、多色帳票読取フォーマットには文字認識対象として、記入文字色３０６が選択されている。また、記入文字色には色Ｅ３０７が設定されている。そこで文字認識対象画像として色Ｅに該当する色を選択する。これにより、文字認識対象画像として図２２の画像を得る。
ステップＳ１８４では、図２４に示すように、ステップＳ１８３で得た文字認識対象画像に、ステップＳ１８２で得た文字記入領域を重ね、文字列を切り出す。文字列は図２５に示すプレプリント（１）に対応する文字列切り出し結果２５１と、プレプリント（２）に対応する文字列切り出し結果２５２に分割される。
【００５３】
ステップＳ１８５では、ステップＳ１８４で得られた文字列切り出し結果から各文字を切り出す。文字切り出しは垂直方向に画像ヒストグラムを抽出し、ヒストグラムの塊を切り出すことより水平方向の切り出し位置を求める。次に左右の切り出し座標に挾まれた塊画像を、上から下に走査し最初に黒画素にぶつかる点を上端、下から上に走査し最初に黒画素にぶつかる点を下端とする。文字切り出しの結果、図２６に示すようにプレプリント（１）に対応する第１文字目の文字切り出し結果２６１、プレプリント（１）に対応する第２文字目の文字切り出し結果２６２、プレプリント（２）に対応する第１文字目の文字切り出し結果２６３、プレプリント（２）に対応する第２文字目の文字切り出し結果２６４に分割される。
【００５４】
文字認識は文字認識手段８により、図２６の文字切り出し結果の個々の切り出し結果について認識を行う。この時、切り出し結果２６１、２６２の認識結果は、プレプリント（１）つまり”年”に対応する認識結果となる。同様に、切り出し結果２６３、２６４の認識結果は、プレプリント（２）つまり”月”に対応する認識結果となる。
【００５５】
本実施例では、プレプリントは文字で記入された例を示したが、これに限らず記号や絵などを用いてもよい。
また本実施例では、プレプリント色の色指定が１種類の例を示したが、これに限らず、プレプリント色を各々のプレプリントに対して指定することにより複数種類の色を指定することができる。
また本実施例では、帳票読取フォーマットに格納されたプレプリント情報とプレプリント画像とを位置情報で対応付けたが、これに限らず色情報、形状情報などを用いて対応をとることもできる。
また本実施例では、文字記入領域の抽出をプレプリント画像の隣接位置から抽出したが、これに限らずプレプリント画像から一定量離れた位置など自由に設定できる。
以上のように本実施例によれば、プレプリントがある多色帳票に記入した文字をプレプリントに対応付けて正確に切り出すことができ、ひいては文字を正確に認識することができる。
【００５６】
実施例６．
本実施例では、画像抽出手段の動作を説明する。
図２７は、画像抽出手段６の動作を説明するフローチャートである。
図２８は、帳票種別判定手段５から得た領域に対応する帳票画像の例を示す図である。図において、２８１はフィールド枠、２８２は記入文字、２８３は記入文字色、フィールド枠色、背景色のいずれにも該当しない色で記載された印影である。
図２９は図２８に示した領域内での、フィールド枠色に対応した画像情報を示した図である。図において２８１は図２８中の枠と同じ画像である。
図３０は図２８に示した領域内での、記入文字色に対応した画像情報を示した図である。図において２８２は図２８中の記入文字と同じ画像である。
図３１は図２８に示した領域内での、多色帳票読取フォーマットに未登録の色に対応した画像情報を示した図である。図において２８３は図２８中の印影と同じものである。
図３２は、画像情報の重畳を説明するための説明画像図である。図において２８２、２８３は図２８中の対応する画像と同じである。
【００５７】
図２７のフローチャートをもとに、図２８ないし図３２の具体的な画像例を参照しながら画像抽出手段６の動作を説明する。多色帳票読取フォーマットには、図３に示したものを適用する。
帳票種別判定手段５から得た領域に対応する画像情報を対象として画像抽出を行う。帳票種別判定手段５から得た領域に対応する画像情報は、多色帳票読取フォーマットに登録された色で分類されている。例えば、色Ｂに対しては図２９に示す画像、色Ｅに対しては図３０に示す画像、多色帳票読取フォーマットに未登録の色に対しては図３１に示す画像が色統合手段４および帳票種別判定手段５により得られている。
【００５８】
まずステップＳ２７１で、多色帳票読取フォーマットに格納された画像切り出し条件で指定された構成要素の色で構成される画像を抽出する。この場合、図３の多色帳票読取フォーマットには画像切り出し条件としてフィールド枠準拠３０８、フィールド枠色として色Ｂ３０９が設定されているので、図２９に示す色Ｂに対する画像情報を抽出する。
ステップＳ２７２では、図２９の画像情報に内包される画像情報をフィールド枠自身を含んで抽出する。
ステップＳ２７３では、記入文字画像の抽出を行う。この場合、多色帳票読取フォーマットには画像切り出し対象色（１）として、記入文字色３１０が選択されている。さらに、記入文字色には色Ｅ３０７が設定されている。そこで色Ｅに該当する色を内包画像情報から抽出する。抽出した画像情報は図３０と同一の画像情報となる。
ステップＳ２７３では、記入文字以外の画像抽出を行う。この場合、多色帳票読取フォーマットには画像切り出し対象色（２）として、（記入文字色以外かつ背景色以外かつフィールド枠色以外）３１１が選択されている。さらに、記入文字色には色Ｅ３０７、背景色には色Ｆ３９、フィールド枠色には色Ｂ３０９が設定されている。そこで（色Ｅ以外かつ色Ｆ以外かつ色Ｂ以外）に該当する色を内包画像から抽出する。抽出した画像情報は図３１と同一の画像情報となる。
【００５９】
ステップＳ２７５では、ステップＳ２７３、ステップＳ２７４で得た抽出画像の重畳を行う。具体的には、ステップＳ２７３で得た抽出画像とステップＳ２７４で得た抽出画像の重畳した画像の抽出を多色帳票読取フォーマットの指示に従い抽出する。
多色帳票読取フォーマットには、３１０、３１２に示した２つの画像切り出し対象色が選択されているので図３０と図３１を重畳した図３２の画像が出力される。
本実施例では、画像切り出し対象色を２つに設定した場合の例を示したが、これに限らず、３以上の種類及び全色などを設定してもよい。
【００６０】
また本実施例では、フィールド枠は長方形の例を示したが、これに限らず円やひし形などの他の形状を設定することもできる。
また本実施例では、記入文字色と多色帳票読取フォーマットに未登録の色についての例を示したが、文字枠色やフィールド枠色などを指定することにより、他の要素も抽出することができる。
また本実施例では、１つの領域に対して１回だけ画像抽出手段を適用したが、画像切り出し対象色のみが異なり他の条件が同一のフィールドを複数個定義して、同一領域に対して複数回画像抽出手段を適用することにより、印影のみの画像、記入文字色のみの画像の各々を抽出することも可能である。
以上のように本実施例によれば多色帳票で枠で領域設定された画像から、色によって区別された構成要素を自由に抽出することができ、必要とする画像を効率よく抽出することができる。
【００６１】
実施例７．
本実施例では、結果出力手段の動作を説明する。
図３３は、結果出力手段９の動作を説明するフローチャートである。
次に、結果出力手段９の動作を図１および図３３を用いて説明する。
まずステップＳ３３１において、画像抽出手段６および文字切り出し手段７で取得した画像や、文字認識手段８で得た文字認識結果を結果出力手段９に入力する。
ステップＳ３３２では、文字認識結果および取得画像の表示の実行の有無を判定する。表示の有無の指定は多色帳票読取フォーマットにより指定する。
Ｓ３３３では、多色帳票読取フォーマットに基づいた表示と、画像色に基づいた表示のどちらかを選択する。表示色の選択指定は多色帳票読取フォーマットにより指定する。
ここで多色帳票読取フォーマットに基づいた表示が選択されている場合には、Ｓ３３４で、文字認識結果と取得画像を多色帳票読取フォーマットに記述した文字色と取得画像の色で表示装置に表示する。
また、画像色に基づいた表示が選択されている場合には、Ｓ３３５で、文字認識結果と取得画像を多色帳票画像１から求めた文字の色および取得画像の色を用いて表示する。例えば、画像色に基づいた表示では、多色帳票画像１に記入された文字列の一部が赤、残りの文字が黒で書かれていた場合、赤い文字部分に対応する文字認識結果を赤色で、黒い文字部分に対応する文字認識結果を黒色で表示する。
その後Ｓ３３６では、表示した文字認識結果や画像に対して、オペレータが誤読文字の修正作業等を加工行ない、その加工した結果をデータファイル化する。
Ｓ３３７では、作成したデータファイルを記憶装置に格納し、処理を終了する。
これとは逆に、文字認識結果と取得画像を表示しない場合は、取得した画像と文字認識結果を直接データファイル化して、記憶装置に格納後、処理を終了する。
【００６２】
本実施例では、表示の有無、表示色の選択は多色帳票読取フォーマットにより指定した例を示したが、操作者が対話的に指定してもよい。
また本実施例では、結果出力手段９は画像色もしくは多色帳票読取フォーマットに指定した色で出力するように構成したが、画像色と色帳票読取フォーマットに記述した色が異なる場合に色の異なる領域を画面に表示する構成としてもよい。
以上のように本実施例によれば、入力画像や多色帳票読取フォーマットと同一色で表示でき修正作業を誤りなく円滑に進めることができる。
【００６３】
【発明の効果】
以上のようにこの発明によれば、標準帳票を設定または変更する多色帳票読取フォーマット作成変更手段と、文字画像入力の帳票の種別を判定する帳票種別判定手段と、判定した種別に基づいて文字画像入力から対象領域を抽出する画像抽出手段と、この抽出した読取領域と判定した種別の標準帳票の対応フィールドの設定値とで対象文字を切り出す文字切り出し手段と、この読取対象文字に対する文字認識手段を備えたので、色の異なる複数種類のフィールドを持つ文字画像入力に対しても、切り出し位置を正確にして文字画像認識ができる効果がある。
【００６４】
また更に、類似した色成分を統合して指定の色とする色統合手段を付加したので、認識する色の種類を減らして比較時間を短縮できる効果が加わる。
【００６５】
また更に、抽出した画像と、文字認識結果を出力する結果出力手段を付加したので、確認と訂正が容易になる効果が加わる。
【００６６】
また更に、特定の色の画像のみを抽出して出力し、範囲位置と色の設定を変更できるようにしたので、装置の動作が確認でき、また動作の補正も容易にできる効果もある。
【００６７】
また更に、標準多色帳票と文字画像入力の色情報で帳票種別を判定するようにしたので、詳細な帳票の区別ができる効果もある。
【００６８】
また更に、各色毎のフィールド毎にデータを対応付けて帳票種別を判定するようにしたので、色が類似した帳票でも区別でき、また他の任意の標準帳票のフィールドの情報を利用できる効果もある。
【００６９】
また更に、フィールドの色情報と文字画像入力との相対位置情報を用いて文字画像入力の切り出しを行うようにしたので、文字を記入枠の位置に併せて読み取って誤認識を防ぐ効果もある。
【００７０】
また更に、対象領域から複数の色情報を抽出して重畳出力するようにしたので、必要とする画像を効率よく抽出でき、また文字画像情報を正確に認識できる効果もある。
【図面の簡単な説明】
【図１】本発明の一実施例である多色文字画像読取装置の構成図である。
【図２】実施例１および実施例３で示す多色帳票読取フォーマット作成・変更手段の構成ブロック図である。
【図３】多色帳票読取フォーマット群メモリに格納される標準帳票のデータの例を示す図である。
【図４】入力の文字画像を読み込んで得た多色帳票画像の例を示す図である。
【図５】実施例２における多色帳票画像の一例を示す図である。
【図６】実施例２における色統合手段４の動作を説明するための拡大した枠の図である。
【図７】実施例２における帳票種別判定手段５の動作を説明するための図である。
【図８】実施例２のおける帳票種別判定手段５の照合動作を説明するための図である。
【図９】実施例２における帳票種別判定手段５の動作を説明するための別の画像図である。
【図１０】実施例３における色情報の変更用の操作ボタンを示す図である。
【図１１】実施例４における文字切り出し手段７の処理動作を示すフローチャート図である。
【図１２】実施例４におけるフィールド枠色に分類される色分類要素データを示す図である。
【図１３】実施例４における文字枠色に分類される色分類要素データを示す図である。
【図１４】実施例４における標準帳票に未登録の色に分類される色分類要素データを示す図である。
【図１５】実施例４における文字枠切り出し結果を示す図である。
【図１６】実施例４における文字切り出し対象と文字切り出し位置との関係を示す図である。
【図１７】実施例４における文字切り出し結果の例を示す図である。
【図１８】実施例５における文字切り出し手段７の処理動作を示すフローチャート図である。
【図１９】実施例５における多色帳票画像の一例を示す図である。
【図２０】実施例５におけるフィールド枠色に分類される色分類要素データを示す図である。
【図２１】実施例５におけるプレプリント色に分類される色分類要素データを示す図である。
【図２２】実施例５における記入文字色に分類される色分類要素データを示す図である。
【図２３】実施例５における文字列領域抽出の動作を説明するための枠等の画像図である。
【図２４】実施例５における文字列切り出しの動作を説明するための画像図である。
【図２５】実施例５における文字列切り出し結果を示す図である。
【図２６】実施例５における文字切り出し結果を示す図である。
【図２７】実施例６における画像抽出手段の処理動作を示すフローチャート図である。
【図２８】実施例６における多色帳票画像の一例を示す図である。
【図２９】実施例６におけるフィールド枠色に分類される色分類要素データを示す図である。
【図３０】実施例６における記入文字色に分類される色分類要素データを示す図である。
【図３１】実施例６における標準帳票に未登録の色に分類される色分類要素データを示す図である。
【図３２】実施例６における記入文字色と標準帳票に未登録の色を重畳した画像を示す図である。
【図３３】実施例７における結果出力手段の処理動作を示すフローチャート図である。
【図３４】従来の多色帳票処理装置の構成ブロック図である。
【図３５】文字枠のある多色帳票画像の一例を示す図である。
【図３６】従来の読取装置による文字切り出し動作を説明するための図である。
【図３７】図３５記載の入力に対する従来の文字切り出し結果を示す図である。
【図３８】プレプリントのある多色帳票画像の一例を示す図である。
【図３９】図３８記載の入力に対する従来の文字切り出し結果を示す図である。
【符号の説明】
１多色帳票画像、２多色帳票読取フォーマット（標準帳票）群メモリ、３多色帳票読取フォーマット作成・変更手段、４色統合手段、５帳票種別判定手段、６画像抽出手段、７文字切り出し手段、８文字認識手段、９結果出力手段、２４画像範囲指定手段、２５色情報設定／変更手段、２６読取フォーマット構造設定手段、２７読取フォーマットへの色情報設定手段。[0001]
[Industrial application fields]
The present invention relates to a multi-color character image reading apparatus that performs image extraction and character recognition from input of multi-color character images as compared with a form set as a standard.
[0002]
[Prior art]
A conventional form reading apparatus performs character recognition processing only on a recognition target by converting it into data to be dropped out using color designation information used at the time of form creation and data to be recognized.
FIG. 34 is a block diagram showing a conventional form reading apparatus disclosed in Japanese Patent Laid-Open No. 2-123488. In the figure, 511 is a color sensor, 512 is a form, 513 is a lens, 514 is a quantization unit, 515, 516 and 517 are image memories, 518 is a control unit, 519 is a color determination control unit, and 520 is a character recognition unit. .
FIG. 35 is a diagram illustrating an output example of the quantization unit for an image including a field frame, a character frame, and an entry character. In the figure, 531 is a field frame, 532 is a character frame, and 533 is an entry character. In the figure, solid lines, diagonal lines, and shading indicate color differences.
FIG. 36 is an explanatory diagram of the character segmentation operation. In the figure, 541 is a character cutout position.
FIG. 37 is a diagram showing a result of character segmentation. In the figure, 551, 552, 553, and 554 are individual character cutout results.
FIG. 38 is a diagram illustrating an output example of the quantization unit for an image including field frames, preprints, and entry characters. In the figure, 561 is a field frame, 562 is a preprint, and 563 is an entry character. In the figure, solid lines, diagonal lines, and shading indicate color differences.
FIG. 39 is a diagram illustrating a character string cutout result. In the figure, 571 is a character string for the preprint “year”, and 572 is a character string for the preprint “month”.
[0003]
Next, the operation of the apparatus having the above configuration will be described.
First, a form 512 in which a frame indicating a reading area is printed in an arbitrary color and characters are recorded in black, a code (color designation code) for designating a color written for each reading area, and a designated color are dropped out. Format information that stores a code (recognition designation code) that designates whether or not to perform recognition processing is prepared.
The color sensor 511 reads the form 512, decomposes it into red, blue, and green components, performs photoelectric conversion, and stores the data quantized by the quantization unit 514 in the image memories 515 to 517. The control unit 518 converts the color designation code set in the format information given from the outside into color component data for each color component, and outputs it to the color determination control unit 519. The color determination control unit 519 compares the color component data with the density information of the pixel positions in the reading area stored in the image memories 515 to 517, and generates a binary form image based on the determination result. The character recognition unit 520 performs character recognition processing on the binary image.
For example, when a color designation code indicating a print color of a character frame and a code indicating that the character frame is to be dropped out are set in the format information, the color determination control unit 519 deletes the color belonging to the color designation code into white data. Colors other than are converted to black data. The character recognition unit 520 recognizes characters for the image.
[0004]
The image shown in FIG. 35 is obtained from the quantizing unit 514 for the conventional apparatus operating as described above, and the color designation code indicating the print color of the character frame and the print color of the field frame is dropped in the format information. When the code shown is set, the color determination control unit 519 converts the character frame 532 and the field frame 531 into white data, and the entry character 533 into black data.
Further, when the entered character 533 is not entered in the center of the character frame as shown in FIG. 35 and the characters are entered close to each other or when the form image is shifted and input, the character segmentation operates normally. Instead, an incorrect character cutout position 541 shown in FIG. 36 is obtained. The original frame is dropped out and the section becomes unclear. As a result, erroneous character cutout results shown in 551 and 552 in FIG. 37 are obtained. Therefore, the character recognition unit 520 outputs an incorrect recognition result.
[0005]
Further, the image shown in FIG. 38 is obtained from the quantizing unit 514 for the conventional apparatus operating as described above, and the color designation code indicating the preprint print color and the field frame print color is dropped in the format information. When the code indicating this is set, the color determination control unit 519 converts the field frame 561 and the preprint 562 into white data, and the entry character 563 into black data.
Character recognition processing is performed on the binary image shown in FIG. 39 output from the color determination control unit 519. At this time, since the preprint 562 shown in FIG. 38 has already disappeared from the image, the separation of the preprint is not known. Therefore, as shown in FIG. 39, an image of “S61” 571 corresponding to “year” and an image of “2” 572 corresponding to “month” are extracted, and an erroneous character string cutout result is obtained. There is.
[0006]
[Problems to be solved by the invention]
The conventional form reading device is configured as described above, and the frame for classification is removed by dropout, and the relative relationship between the entered characters and the form components other than the entered characters is unknown, so image extraction and There was a problem that character recognition was not possible.
Furthermore, before reading the form, format information corresponding to the form to be read must be specified in advance, and a plurality of types of unregistered forms cannot be read together. Further, when a new reading form is designed or changed, it is necessary to set color designation codes in the format information one by one, and there is a problem that it takes time to set the format information.
[0007]
The present invention has been made in order to solve the above-described problems, and can accurately perform image extraction and character recognition with respect to input of a multicolor character image in which a plurality of types of fields are mixed. It is an object of the present invention to obtain a multicolor character image reading apparatus capable of easily setting color information for a setting field.
[0008]
[Means for Solving the Problems]
The multicolor character image reading device according to the present invention is a multicolor form reading format creation change in which a range is set for each designated color of character image input and a standard form is created in which the range position and color are grouped for each field. A form type determining means for comparing the standard form created by the multicolor form reading format creation / changing means and the character image input to determine the type of the input form, and the type determined by the form type determining means The character to be read from the image extraction means for extracting the target area from the character image input based on the above, the read area extracted by the image extraction means and the setting value of the corresponding field of the standard form of the type determined by the form type determination means And a character recognition means for performing character recognition processing on the character to be read for each area cut out by the character cutout means and recognizing the character.
[0009]
In addition to the basic configuration, color integration means is added to integrate pixels having similar color components with respect to character image input into a designated color.
[0010]
Further, a result output means for outputting the image extracted by the image extraction means and the recognition result of the character recognition means output is added to the basic configuration.
[0011]
Furthermore, the multi-color form reading format creation changing means extracts only the image corresponding to a specific color and outputs it, so that the range position and color setting can be changed.
[0012]
Furthermore, the form type determination means determines the form type based on the color information of each of the standard form and the character image input.
[0013]
Furthermore, the form type determining means determines the form type by associating data for each field for each color in addition to the color information of each of the standard form and the character image input.
[0014]
Further, the character cutout means cuts out the character image input using the relative position information between the field color information and the character image input.
[0015]
Furthermore, the image extraction means extracts a plurality of color information from the target area and outputs the extracted information.
[0016]
[Action]
In the multicolor character image reading device according to the present invention, the data of the range position and the color is collected for each field from the input character image information, and the standard form is stored, and the new character image input is compared with these standard forms. The type of the form is determined, a target area is extracted based on the determined standard form data, characters are further cut out from the standard form area, and character recognition is performed on the cut out characters.
[0017]
Furthermore, information having similar color components in the input character image information is collectively stored or compared as one color.
[0018]
Furthermore, the image of the reading area extracted from the character image input and the character recognition result are superimposed and output.
[0019]
Furthermore, an input character image or an image of a specific range of a standard form is extracted, and the setting of the range position and color can be freely changed.
[0020]
Furthermore, the color information of the standard form and each character image input is compared, and the form type is determined based on the comparison.
[0021]
Furthermore, in addition to the color information of the standard form and character image input, the data for each field for each color is compared to determine the form type.
[0022]
Furthermore, the character image input is cut out based on the relative position information between the field color information and the character image input.
[0023]
Furthermore, a plurality of pieces of color information are extracted from the target area of the character image information and are superimposed and output.
[0024]
【Example】
Example 1.
FIG. 1 is a block diagram showing the overall configuration of a multicolor character image reading apparatus according to an embodiment of the present invention. In the present embodiment, the configuration operation of the multi-color form reading format creation / changing means for determining the area range and its color for each partial area, which is referred to as a read form field, will be described.
In FIG. 1, reference numeral 1 denotes a multicolor form image obtained by inputting a multicolor form into an image input device. Reference numeral 2 denotes a memory that stores a multi-color form reading format group registered as a standard form by extracting an image from the multi-color form image and dividing the attributes into a number of fields as described later. 3 is a multi-color form reading format creation / change means for creating / changing data included in the multi-color form reading format, and 4 is a pixel having a color component similar to the input of the multi-color character image. Color integrating means for integrating and classifying the color component values of the multicolor form image into representative values, 5 includes information on reading areas and colors included in individual standard forms registered in the multicolor form reading format group, A form type determining means for comparing the image information integrated by the color integrating means and specifying the corresponding multi-color form reading format (standard form) to determine the form type, 6 is created by the color integrating means Image extracting means for extracting image information of a specific area defined in the multi-color form reading format from the obtained image information, and 7 cuts out characters from the image information extracted by the image extracting means. , Character segmentation means for outputting the image of the character region, 8 is a character recognition means for outputting character codes and color information recognizing an image obtained by said character segmentation means, corresponding. Reference numeral 9 denotes a result output means for editing and outputting the image output from the image extraction means and the character cutout means, the character code output from the character recognition means, and the color information.
[0025]
FIG. 2 is a detailed block diagram of the multicolor form reading format creation / change means. In other words, data such as form size, field frame, character frame, entry character, etc., such as the position, size, name, color, etc. in the form are obtained to recognize the character in the form or within a specific area. A form reading format (standard form) to be referred to extract an image is created or changed.
In the figure, a multi-color form image 1 is a form image input by an image input device. The multicolor image display control means 22 determines the display range in the form image as the input range of the multicolor form image 1. Usually, a decision is made to display the entire form image. The multicolor image display means 23 displays the range determined by the multicolor image display control means 22 in the form image 1. The image range designating unit 24 designates / selects a specific area or form component among the multicolor images displayed by the multicolor image display unit 23 using a coordinate input device such as a mouse.
The color information setting / changing means 25 for the designated range sets / changes color information for a specific area or form component in the multicolor form image 1 designated / selected by the image range designation means 24. The reading format structure setting means 26 is means for setting format data such as the number of fields, the field position, and the number of characters in the field to be referred to when recognizing characters in the multicolor form image 1 or extracting a specific area. . By this means, a reading format in which the color-related data is empty or has a predetermined value (arbitrary value) among specific data of the reading format (for example, the number of fields is 4) is created. The color information setting means 27 for the reading format sets the color information set / changed by the color information setting / changing means 25 for the designated range to the reading format created by the reading format structure setting means 26.
[0026]
FIG. 3 is an explanatory diagram showing an example of data in the multicolor form reading format. In the figure, 31 is an item for specifying a field, 32 is an item for designating an attribute element of a field frame color, 33 is a specified value for 32 attribute elements, 34 is an item for specifying a field, 35 is a character frame color in the field , 36 is a specified value for 35 attribute elements. 37 is the specified value of the field 1 character extraction condition, 38 is the character recognition target of the field 1, 39 is the background color specification value, 301 is the specified value of the field 2 character extraction condition (1), and 302 is the field 2 character The specified value of the clipping condition (2), 303 is the position specified value of the preprint (1) of field 2, 304 is the position specified value of the preprint (2) of field 2, and 305 is the specified value of the preprint color of field 2 , 306 is the designated value of the character recognition target in field 2, 307 is the designated value of the entry character color, 308 is the designated value of the image cropping condition of field 3, 309 is the designated value of the field frame color of field 3, and 310 is the field 3 311 is the specified value of the image cropping target color (1), 311 is the specified value of the image cropping target color (2), 312 is the multicolor form reading format, 31 Is the color information.
[0027]
FIG. 4 shows an example of a multicolor form image. In the figure, 40 is a preprinted character of “employee number”, 41 is also a preprint “date of employment”, 42 is a preprint “name (mark)”, 43 is a preprint “periphery of home”, 44 is An entry field frame corresponding to the employee number, 45 an entry field frame corresponding to the entry date, 46 an entry field frame for the name (mark), 47 a field frame for the home surroundings diagram, and 48 an in-field character for the employee number A frame 49 is an in-field preprint for the hire date.
[0028]
Next, a procedure for creating the multi-color form reading format 28 will be described with reference to FIG. Here, it is assumed that the multicolor image display control means 22 performs a normal operation and determines to display the entire multicolor image.
An example of a reading format created by the reading format structure setting means 26 is shown in FIG. The reading format is set in one line per item. For example, item 31 describes data specifying a field, “field 1”. The item 32 is a detailed item representing an attribute relating to the item 32, and describes that data relating to “field frame color” of “field 1” is set. The item 33 represents specific data for the item 31 and the item 32, and color names or color component values such as “black” and “red” are described, but the item 33 is not yet set in FIG.
[0029]
An example in which the multicolor form image 1 is read and displayed by the multicolor image display means 23 is shown in FIG. The operator selects or designates a part of the image using the image range designating unit 24 while viewing the multicolor image shown in FIG. For example, it is assumed that the form component shown in the field frame 45 in FIG. 4 is selected.
The color information acquisition unit 29 from the image acquires the color information of the image in the designated range designated by the image range designation unit 24. When the operator selects the field frame color setting value 33 portion of FIG. 3 while the field frame 45 of FIG. 4 is selected, the color information setting means 27 for the reading format is acquired by the color information acquisition means 29 from the image. The set color information is set to the field frame color setting value 33 in FIG.
[0030]
In the present embodiment, an example in which the color designation is one type has been described.
As described above, according to this embodiment, the operator can easily set the color information in the multicolor form reading format simply by selecting / specifying a specific area or a form component in the multicolor form image. It is possible to set a multi-color form reading format (standard form).
[0031]
Example 2
In the present embodiment, operations of the color integration unit and the form type determination unit in the overall configuration will be described.
FIG. 5 is an example of a multicolor form image, in which data is entered in the multicolor form image of FIG. In the figure, an area 50 surrounded by a dotted line is an area of an employee number entry column.
FIG. 6 is an enlarged view of the frame 50 shown in FIG. 5 for explaining the operation of the color integration means 4 and shows the distinction between colors. That is, this is an example of processing for the area 50 in the employee number entry column.
FIG. 7 is a diagram for explaining the operation of the form type determination unit 5. Color classification element data obtained by classifying element data according to color information described in a standard form for character image input different from that in FIG. It is. In the figure, 71 is color classification element data corresponding to the color A, 72 is color classification element data corresponding to the color B, and 73 is color classification element data corresponding to a color not registered in the multicolor form reading format.
FIG. 8 is a diagram for explaining the operation of the form type determination means 5 and explains the collation operation between the multicolor form reading format and the color classification element data. In the figure, reference numeral 81 denotes a multicolor form reading format.
FIG. 9 is a diagram of a character image different from FIG. 8 for explaining the operation of the form type determination means 5. In the figure, 90 is a character described in a multi-color form reading format that does not correspond to characters on the color classification element data, and 91 is described in a multi-color form reading format that does not correspond to a table area on the color classification element data. Table area 92 is a table area on the color classification element data that does not correspond to the table area described in the multi-color form reading format, and 93 corresponds to the character described in the multi-color form reading format but has a different color. The character on the color classification element data 94 is a multi-color form reading format different from 81.
[0032]
Next, the operation will be described.
The operations of the color integration unit 4 and the form type determination unit 5 will be described with reference to the multicolor form image shown in FIG.
When the color integration unit 4 is applied to the partial area 50 of the multicolor form image shown in FIG. 5, the color integration unit 4 examines the color component value for each pixel on the multicolor form image, and displays similar color components. A pixel having a value is classified into several representative values. This process can be realized, for example, by performing clustering based on color component values.
Next, the pixels classified into the same representative value are grouped by obtaining the continuity of the region. Thereby, pixel groups classified into the same representative value and connected on the image can be extracted. This process can be realized, for example, by performing a labeling process using the representative value of each pixel. Thereby, the pixel group indicated by 61 shown in FIG. 6 is extracted.
Finally, element data (position and shape for each element, color value of representative value) is calculated for the pixel group indicated by 61. This is stored as data of the corresponding field in FIG.
The actual processing is performed on the entire multicolor form image. That is, all fields of the input form are processed and data is stored.
[0033]
After the color integration unit 4 ends, the form type determination unit 5 is executed.
The form type determination means 5 determines the form type of the character image 1 of the input multicolor form based on the element data for each field extracted by the multicolor form reading format group 2 and the color integration means 4.
First, one standard form is selected from the multicolor form reading format group 2.
Next, element data for each field is classified according to the color information described in the selected standard form, and color classification element data is created.
Here, when N types of colors are described in the selected standard form, the element data is classified into N + 1 pieces (N types of colors + other colors). For example, the element data of the input character image is element A (RGB color component values are R100, G50, B70), element B (R100, G60, B70), element C (R50, G50, B50), element D (R20). , G20, B10), while color A (R100 ± 20, G50 ± 10, B70 ± 10) and color B (R50 ± 5, G50 ± 10, B50 ± 10) are described in the standard form , Elements A and B are classified as color A, element C is classified as color B, and element D is classified as an unregistered color in the standard form.
Specifically, the color classification element data shown in FIG. 7 can be created based on the input character image example shown in FIG.
[0034]
Next, characters, table regions, and ruled lines are extracted based on the created color classification element data.
That is, as shown in FIG. 8, the characters and table areas extracted as color classification element data are associated with the characters and table areas described in the standard form 81. Here, the correspondence is performed so as to be optimized by using information such as the position, size, color, character code, and table structure of characters and table areas drawn on the form.
Furthermore, the similarity between the standard form and the input multicolor form image 1 is determined based on the result of the association.
This similarity is obtained from the difference in position, size, color, character code, table structure, etc. when characters and table regions are associated, the remaining characters without correspondence, the number of table regions, etc. . Here, the difference in the structure of the table is obtained by a method of associating the ruled lines constituting the table described in the multicolor form reading format with the ruled lines extracted as the color classification element data. This can be realized by, for example, application No. 6-76795, “form processing apparatus including table”.
For example, FIG. 9 shows a character / table area association result of color classification element data for each field in a multi-color form reading format 94 different from that shown in FIG. Here, the character 90 and the table area 91 on the multi-color form reading format do not correspond to the known character and table area of the color classification element data. Further, the table area 92 extracted as the color classification element data cannot be associated with the table area on the standard form. Further, the character 93 extracted as the color classification element data corresponds to the character on the standard form, but the color is different.
Therefore, for example, the similarity between the other standard form 94 shown in FIG. 9 and the input multicolor form image 1 in FIG. 5 is lower than the similarity with the standard form 81 in FIG.
[0035]
Thereafter, the above process is performed for a plurality of standard forms stored in the multicolor form reading format group 2, and the similarity is calculated for each standard form. Assume that the standard form having the highest similarity is the type of the input multicolor form image (character image) 1.
However, even if the degree of similarity is the highest, if the degree of similarity is lower than a predetermined value, or if (the highest degree of similarity) − (second highest degree of similarity) is small, the input multicolor form image 1 Since the type determination is impossible, the input multicolor form image 1 is rejected.
If not rejected here, the created color classification element data, the number of the standard form with the highest degree of similarity, and the result of associating characters and table areas are output.
[0036]
In this embodiment, an example in which the form type determination is performed in units of pixels in each field has been described. However, it is also possible to perform form type determination in units of small areas composed of a plurality of pixels.
In this embodiment, an example is shown in which characters and table areas are associated with each other in order to determine the form type. However, it is possible to associate characters, photo areas other than the table areas, and the like at the same time.
As described above, according to the present embodiment, the type of a form can be determined even in reading with a mixture of a plurality of types of forms, and accurate even for a multicolor form image having poor image quality in which uneven printing of the form has occurred. The form type can be determined.
[0037]
Example 3 FIG.
In the present embodiment, an operation of adding an additional mechanism to the multi-color form reading format creation changing means to extract only a specific color image or changing the color information will be described.
FIG. 10 is a diagram showing operation buttons for creating / changing the multi-color form reading format. In the figure, 11 is a color range reduction button, and 12 is a color range enlargement button.
[0038]
A procedure for changing the color information of the reading format when the color information is set in the standard form and a problem occurs in image extraction / character recognition because the set color information is not accurate will be described.
For example, it is assumed that the multi-color form reading format is determined as shown in FIG. 3, the color information 33 is already determined as “color B”, and the color information 36 is already determined as “color C”.
Next, the form to be read is input to the image input device to obtain a multicolor form image 1. Only the standard form storing the information shown in FIG. 3 is stored in the multi-color form reading format group 2 and this image data is input to the color integrating means 4 and then the form type determining means 5 is operated. In this way, color classification element data for each field in which the correspondence between the multicolor form image 1 and the standard form is obtained. This color classification element data is input to the multicolor image display control means 22.
The multi-color image display control means 22 has a color information correction mode instead of a normal mode for making a decision to display the entire image, and displays only color classification element data whose color information value is “color C”. Control.
Under the control of the multicolor image display control means 22, only the color classification element data whose color information value is “color C” is displayed by the multicolor image display means 23. According to the reading format shown in FIG. 3, the form component whose color information value is “color C” is the “character frame color” of “field 1”. Only 48 should be displayed.
[0039]
The operator checks on the screen whether or not only the character frame 48 of FIG. 4 is displayed. If the character frame 48 is not displayed, a form component or a specific area other than the character frame 48 is displayed. In this case, the mode of the multicolor image display control means 22 is changed so that the entire multicolor form image is displayed.
For example, it is assumed that not only the character frame 48 but also the field frame 44 is displayed because “color C” and “color B” are similar. The operator changes the color range of the color C by the color information setting / changing means 25 for the designated range. In this case, pressing the color range reduction button 11 to reduce the color range of the color C can prevent confusion with the color B.
If the character frame 49 is not displayed, the color is adjusted so that the character frame 49 can be extracted by depressing the color range expansion button 12 and expanding the color C color range.
[0040]
In this example, the example of setting the color range by reduction / enlargement was shown, but direct input of numerical values, adjustment of individual color attributes such as red, blue, green, etc., only for the maximum value and only the minimum value It is also possible to make this change.
As described above, according to this embodiment, it is possible to easily check whether the color information set in the multicolor form reading format actually works effectively, and to easily correct even if there is a problem.
[0041]
Example 4
In this embodiment, the operation of the character cutout means will be described.
FIG. 11 is a flowchart for explaining the operation of the character cutout means 7 when there is a character frame.
FIG. 12 is a diagram showing image information corresponding to the field frame color. In the figure, 131 is the same as the image 531 shown in FIG.
FIG. 13 is a diagram showing image information corresponding to the character frame color. In the figure, 132 is the same as the image 532 shown in FIG.
FIG. 14 is a diagram showing image information corresponding to colors that are not registered in the standard form 133. In the drawing, reference numeral 133 is the same as the image 533 shown in FIG.
FIG. 15 is a diagram for explaining the operation of character frame extraction for image information corresponding to the character frame color. In the figure, reference numeral 151 denotes a character frame cutout position.
FIG. 16 is a diagram showing the relationship between the character recognition target image information and the character frame cutout position. In the figure, reference numeral 151 denotes the same image as that shown in FIG.
FIG. 17 is a diagram showing a result of character cutout.
[0042]
Next, the operation of the character cutout means when there is a character frame will be described.
Based on the flowchart of FIG. 11, the operation on the image area of FIG. 35 will be described with reference to the image information examples of FIGS. Further, it is assumed that the standard form is determined as shown in FIG. 3, the color information 33 is already determined as “color B”, and the color information 36 is already determined as “color C”.
In addition, when recognizing an input character image, the position of each region is relatively recognized.
[0043]
Character segmentation is performed on the image information corresponding to the area obtained from the form type determination means 5. The image information corresponding to the area obtained from the form type determination means 5 is classified by the color registered in the standard form. The image shown in FIG. 12 for color B, the image shown in FIG. 13 for color C, and the image shown in FIG. 14 for colors not registered in the standard form are color integration means 4 and form type determination means. 5 is obtained.
First, in step S111, a character cut-out target image is extracted. In this case, as shown in FIG. 3, the character frame conformity 37 is selected as the character cutout condition for the standard form. Furthermore, the character frame color is set to color C36 by the standard form. Therefore, the color C is selected as the character cutout target image, and image information for the color C shown in FIG. 13 is extracted.
[0044]
In step S112, a character frame cutout process is performed on the image of FIG. In character frame cutout, an image histogram is extracted in the vertical direction, and candidate points are set at equal pitches based on the character frame pitch stored in the multicolor form reading format. Further, this candidate cutout point is shifted to the left and right, and a candidate point sequence that minimizes the sum of the histogram values on the candidate cutout point is set as a character frame cutout position. Thereby, the character frame cutout position 151 shown in FIG. 15 is obtained with respect to FIG.
[0045]
In step S113, a character recognition target image is extracted. In this case, 38 (other than the field frame color, other than the background color, and other than the character frame color) 38 is selected as the character recognition target in the multicolor form reading format. The background color is set to color F39, the field frame color is set to color B33, and the character frame color is set to color C36. Therefore, a color other than color F, color B, and color C is selected as a character recognition target. Thereby, the image of FIG. 14 is obtained as a character recognition target image.
[0046]
In step S114, since the relative position of each image is previously recognized on the character recognition target image obtained in step S113, the character frame cutout position obtained in step S112 is overlapped to cut out the character. FIG. 16 shows the relationship between the character recognition target image and the character cutout position, and FIG. 17 shows the character cutout result.
In character recognition, the character recognition means 8 recognizes individual cutout results of the character cutout results of FIG. Since this is an extremely common means, detailed description of the configuration and operation will be omitted.
[0047]
In this embodiment, the shape of the character frame and the field frame is a solid rectangle, but other shapes such as a circle and a rhombus, a broken line, a wavy line, and the like may be used.
Further, in the present embodiment, an example in which the character frame color, the field frame color, and the character recognition target color designation are one type is shown, but the present invention is not limited to this, and a plurality of types of colors can be designated.
In the present embodiment, an example of finally recognizing a character has been described. However, the present invention is not limited to this, and a character cutout image can also be output.
As described above, according to this embodiment, since the relative position is determined by obtaining color information and position information for each field, the characters entered in the multicolor form are not entered in the center of the character frame, and the characters are close to each other. Even when it is entered or when the form image is shifted and input, it is possible to cut out the character accurately, and thus the character can be recognized accurately.
[0048]
Example 5 FIG.
In this embodiment, the operation of the character cutout means will be described.
FIG. 18 is a flowchart for explaining the operation of the character cutout means 7 when there is a preprint.
FIG. 19 is a diagram illustrating an example of a form image corresponding to the area obtained from the form type determination unit 5. In the figure, 191 is a field frame, 192 is a preprint image (1), 193 is a preprint image (2), and 194 is an entry character.
FIG. 20 is a diagram showing image information corresponding to the field frame color in the region shown in FIG. In the figure, 191 is the same image as that of FIG.
FIG. 21 is a diagram showing image information corresponding to the preprint colors in the area shown in FIG. In the figure, reference numerals 192 and 193 denote the same year and month preprints as those in FIG.
FIG. 22 is a diagram showing image information corresponding to the entered character color in the area shown in FIG. In the figure, 194 is the same as that of FIG.
[0049]
FIG. 23 is an image diagram of a frame and the like for explaining the processing contents of character entry area extraction. In the figure, reference numeral 231 denotes a character entry area (1) for the preprint (1), and 232 denotes a character entry area (2) for the preprint (2).
FIG. 24 is an image diagram for explaining the operation of character string segmentation. In the figure, reference numeral 194 denotes entry characters, and reference numerals 231 and 232 denote frames shown in FIG.
FIG. 25 is a diagram illustrating a result of character string segmentation. In the figure, 251 is the character string cutout result corresponding to preprint (1), 252 is the character string cutout result corresponding to preprint (2), and 194 is the entry characters shown in FIG.
FIG. 26 shows the result of character segmentation. In the figure, 261 is the result of character cutout for the first character corresponding to preprint (1), 262 is the result of character cutout for the second character corresponding to preprint (1), and 263 is corresponding to preprint (2). The character cutout result 264 for the first character is a character cutout result for the second character corresponding to preprint (2).
[0050]
Next, the operation of the character cutout means when there is a preprint will be described.
The operation corresponding to the image input in FIG. 19 will be described based on the flowchart in FIG. 18 with reference to the image information examples in FIGS.
Character segmentation is performed on the image information corresponding to the area obtained from the form type determination means 5. The image information corresponding to the area obtained from the form type determination means 5 is classified by the color registered in the standard form. The image shown in FIG. 20 is obtained for the color B, the image shown in FIG. 21 is obtained for the color D, and the image shown in FIG.
First, in step S181, a preprint cutout target image is extracted. In this case, in the multi-color form reading format shown in FIG. 3, preprint character (1) compliant 301 is selected as the character cutout condition (1), and preprint character (2) compliant 302 is selected as the character cutout condition (2). Yes. Further, the color D305 is set as the preprint color together with the entry coordinates 303 of the preprint character (1) and the entry coordinates 304 of the preprint character (2). Therefore, the color D is selected as the preprint cutout target image, and image information for the color D shown in FIG. 21 is extracted.
[0051]
In step S182, preprint cutout processing is performed on the image information shown in FIG. In the preprint cutout, an image histogram is extracted in the vertical direction, and cut out using the preprint position stored in the multicolor form reading format.
In this case, there are two preprint characters, and the fact that the preprint (1) entry coordinates 303 are to the left of the preprint (2) entry coordinates 304, the preprint image (1) 192 in FIG. Preprint image (2) 193 corresponds to preprint (2) to preprint (1).
Next, the character entry area is extracted using the image information for the field frame color of FIG. 20 and the image information of FIG. 23 in which the image for the preprint color of FIG. 21 is superimposed. The area to the left of the preprint image (1) 192 associated with the preprint (1) is extracted as the character entry area (1) 191 and the preprint image (2) 193 associated with the preprint (2) is extracted. The area on the left side of the preprint image (1) 192 associated with the preprint (1) is extracted as the character entry area (2) 232. The upper and lower ends of the character entry area are obtained based on the upper and lower end positions of the field frame 191.
[0052]
In step S183, a character recognition target image is extracted. In this case, the entry character color 306 is selected as the character recognition target in the multicolor form reading format. Further, the color E307 is set as the entry character color. Therefore, a color corresponding to the color E is selected as the character recognition target image. Thereby, the image of FIG. 22 is obtained as the character recognition target image.
In step S184, as shown in FIG. 24, the character entry area obtained in step S182 is superimposed on the character recognition target image obtained in step S183, and the character string is cut out. The character string is divided into a character string cutout result 251 corresponding to the preprint (1) shown in FIG. 25 and a character string cutout result 252 corresponding to the preprint (2).
[0053]
In step S185, each character is cut out from the character string cutout result obtained in step S184. In character cutout, an image histogram is extracted in the vertical direction, and a cutout position in the horizontal direction is obtained by cutting out a block of the histogram. Next, a block image encircled by left and right cutout coordinates is scanned from the top to the bottom and the point that first hits the black pixel is the upper end, and the point that scans from the bottom to the top and hits the black pixel first is the bottom. As a result of character cutout, as shown in FIG. 26, the character cutout result 261 of the first character corresponding to preprint (1), the character cutout result 262 of the second character corresponding to preprint (1), preprint ( The character cutout result 263 of the first character corresponding to 2) and the character cutout result 264 of the second character corresponding to preprint (2) are divided.
[0054]
In character recognition, the character recognition means 8 recognizes individual cutout results shown in FIG. At this time, the recognition results of the cutout results 261 and 262 are recognition results corresponding to preprint (1), that is, “year”. Similarly, the recognition results of the cutout results 263 and 264 are recognition results corresponding to preprint (2), that is, “month”.
[0055]
In this embodiment, an example in which preprints are written in characters is shown, but the present invention is not limited to this, and symbols, pictures, and the like may be used.
In the present embodiment, an example of one type of preprint color designation is shown. However, the present invention is not limited to this, and a plurality of types of colors can be designated by designating a preprint color for each preprint. Can do.
In this embodiment, the preprint information and the preprint image stored in the form reading format are associated with each other by the position information. However, the present invention is not limited to this, and correspondence can be made by using color information, shape information, and the like.
In this embodiment, extraction of the character entry area is extracted from the adjacent position of the preprint image. However, the present invention is not limited to this, and a position away from the preprint image by a certain amount can be freely set.
As described above, according to the present embodiment, characters entered in a multi-color form with preprints can be accurately cut out in association with the preprints, so that the characters can be accurately recognized.
[0056]
Example 6
In this embodiment, the operation of the image extraction means will be described.
FIG. 27 is a flowchart for explaining the operation of the image extracting means 6.
FIG. 28 is a diagram illustrating an example of a form image corresponding to the area obtained from the form type determination unit 5. In the figure, 281 is a field frame, 282 is an entry character, 283 is an imprint written in a color that does not correspond to any of the entry character color, field frame color, and background color.
FIG. 29 shows image information corresponding to the field frame color in the area shown in FIG. In the figure, 281 is the same image as the frame in FIG.
FIG. 30 is a diagram showing image information corresponding to the entered character color in the area shown in FIG. In the figure, 282 is the same image as the entered characters in FIG.
FIG. 31 is a diagram showing image information corresponding to colors not registered in the multicolor form reading format in the area shown in FIG. In the figure, 283 is the same as the seal in FIG.
FIG. 32 is an explanatory image diagram for explaining the superimposition of image information. In the figure, 282 and 283 are the same as the corresponding images in FIG.
[0057]
Based on the flowchart of FIG. 27, the operation of the image extracting means 6 will be described with reference to specific image examples of FIGS. The multi-color form reading format shown in FIG. 3 is applied.
Image extraction is performed on the image information corresponding to the area obtained from the form type determination means 5. The image information corresponding to the area obtained from the form type determination means 5 is classified by the color registered in the multicolor form reading format. For example, the image shown in FIG. 29 for the color B, the image shown in FIG. 30 for the color E, and the image shown in FIG. 31 for the color not registered in the multicolor form reading format are the color integration means 4. And the form type determination means 5.
[0058]
First, in step S271, an image composed of the colors of the constituent elements specified by the image clipping conditions stored in the multicolor form reading format is extracted. In this case, in the multi-color form reading format of FIG. 3, field frame compliant 308 is set as the image cut-out condition, and color B 309 is set as the field frame color, so image information for color B shown in FIG. 29 is extracted.
In step S272, the image information included in the image information of FIG. 29 is extracted including the field frame itself.
In step S273, a written character image is extracted. In this case, the entry character color 310 is selected as the image cutout target color (1) in the multicolor form reading format. Further, a color E307 is set as the entry character color. Therefore, the color corresponding to the color E is extracted from the included image information. The extracted image information is the same image information as FIG.
In step S273, an image other than the entered characters is extracted. In this case, 311 (other than the entry character color, other than the background color, and other than the field frame color) is selected as the image clipping target color (2) in the multicolor form reading format. Further, color E307 is set as the entry character color, color F39 is set as the background color, and color B309 is set as the field frame color. Therefore, colors corresponding to (other than color E, color F, and color B) are extracted from the included image. The extracted image information is the same image information as FIG.
[0059]
In step S275, the extracted images obtained in steps S273 and S274 are superimposed. Specifically, the extraction of the image obtained by superimposing the extracted image obtained in step S273 and the extracted image obtained in step S274 is extracted according to the instruction of the multicolor form reading format.
In the multi-color form reading format, since two image cutout target colors shown in 310 and 312 are selected, the image of FIG. 32 in which FIGS. 30 and 31 are superimposed is output.
In the present embodiment, an example in which two image cutout target colors are set is shown, but the present invention is not limited to this, and three or more types and all colors may be set.
[0060]
In the present embodiment, the field frame is an example of a rectangle, but the present invention is not limited to this, and other shapes such as a circle and a rhombus can be set.
In the present embodiment, an example has been shown for the entry character color and the color that is not registered in the multicolor form reading format, but other elements can also be extracted by specifying the character frame color, field frame color, etc. it can.
In this embodiment, the image extraction means is applied only once to one area. However, a plurality of fields having the same conditions except for the color to be cut out of the image are defined, and a plurality of fields are defined for the same area. By applying the time image extracting means, it is also possible to extract each of the image of only the seal imprint and the image of only the written character color.
As described above, according to the present embodiment, it is possible to freely extract components distinguished by color from an image set with a frame in a multicolor form, and to efficiently extract a necessary image. it can.
[0061]
Example 7
In this embodiment, the operation of the result output means will be described.
FIG. 33 is a flowchart for explaining the operation of the result output means 9.
Next, the operation of the result output means 9 will be described with reference to FIGS.
First, in step S 331, an image acquired by the image extraction unit 6 and the character cutout unit 7 and a character recognition result obtained by the character recognition unit 8 are input to the result output unit 9.
In step S332, it is determined whether or not to display the character recognition result and the acquired image. Whether to display or not is specified by the multi-color form reading format.
In step S333, one of display based on the multicolor form reading format and display based on the image color is selected. The display color selection is specified by the multi-color form reading format.
If display based on the multicolor form reading format is selected here, the character recognition result and the acquired image are displayed on the display device in the character color described in the multicolor form reading format and the color of the acquired image in S334. To do.
If display based on the image color is selected, the character recognition result and the acquired image are displayed using the color of the character obtained from the multicolor form image 1 and the color of the acquired image in S335. For example, in the display based on the image color, when a part of the character string written in the multicolor form image 1 is written in red and the remaining characters are written in black, the character recognition result corresponding to the red character part is displayed in red. Thus, the character recognition result corresponding to the black character portion is displayed in black.
Thereafter, in S336, the operator performs a correction operation for misread characters on the displayed character recognition result or image, and the processed result is converted into a data file.
In S337, the created data file is stored in the storage device, and the process ends.
On the contrary, when the character recognition result and the acquired image are not displayed, the acquired image and the character recognition result are directly converted into a data file, stored in the storage device, and the process is terminated.
[0062]
In this embodiment, an example in which the presence / absence of display and the selection of display color are specified by the multi-color form reading format is shown, but the operator may specify interactively.
In the present embodiment, the result output means 9 is configured to output the image color or the color specified in the multi-color form reading format. However, when the image color and the color described in the color form reading format are different, the color is different. It is good also as a structure which displays an area | region on a screen.
As described above, according to the present embodiment, it is possible to display the input image and the same color as the multi-color form reading format, and it is possible to smoothly perform the correction work without error.
[0063]
【The invention's effect】
As described above, according to the present invention, the multi-color form reading format creation / changing means for setting or changing the standard form, the form type determining means for determining the form type of the character image input, and the character based on the determined type Image extracting means for extracting the target area from the image input, character cutout means for cutting out the target character with the setting value of the corresponding field of the standard form of the type determined as the extracted reading area, and character recognition means for the read target character Therefore, there is an effect that character images can be recognized with accurate cutout positions even for character image inputs having a plurality of types of fields with different colors.
[0064]
Furthermore, since a color integration unit that integrates similar color components into a designated color is added, an effect of reducing the types of colors to be recognized and reducing the comparison time is added.
[0065]
Furthermore, since the extracted image and the result output means for outputting the character recognition result are added, the effect of facilitating confirmation and correction is added.
[0066]
Furthermore, since only a specific color image is extracted and output, and the setting of the range position and color can be changed, the operation of the apparatus can be confirmed, and the operation can be easily corrected.
[0067]
Furthermore, since the form type is determined based on the standard multi-color form and the color information of the character image input, there is an effect that the detailed form can be distinguished.
[0068]
Furthermore, since the form type is determined by associating the data for each field for each color, it is possible to distinguish even forms with similar colors, and there is an effect that information on fields of other arbitrary standard forms can be used. .
[0069]
Further, since the character image input is cut out using the relative position information between the color information of the field and the character image input, there is an effect of preventing erroneous recognition by reading the characters together with the position of the entry frame.
[0070]
Furthermore, since a plurality of pieces of color information are extracted from the target area and superimposed and output, necessary images can be extracted efficiently, and character image information can be recognized accurately.
[Brief description of the drawings]
FIG. 1 is a configuration diagram of a multicolor character image reading apparatus according to an embodiment of the present invention.
FIG. 2 is a configuration block diagram of a multi-color form reading format creation / change unit shown in the first and third embodiments.
FIG. 3 is a diagram illustrating an example of standard form data stored in a multicolor form reading format group memory;
FIG. 4 is a diagram illustrating an example of a multicolor form image obtained by reading an input character image.
FIG. 5 is a diagram illustrating an example of a multi-color form image in the second embodiment.
FIG. 6 is an enlarged frame diagram for explaining the operation of the color integration unit 4 in the second embodiment.
7 is a diagram for explaining an operation of a form type determination unit 5 in Embodiment 2. FIG.
FIG. 8 is a diagram for explaining a collation operation of the form type determination unit 5 in the second embodiment.
FIG. 9 is another image diagram for explaining the operation of the form type determination unit 5 in the second embodiment.
FIG. 10 is a diagram illustrating operation buttons for changing color information according to the third embodiment.
FIG. 11 is a flowchart showing the processing operation of the character cutout means 7 according to the fourth embodiment.
FIG. 12 is a diagram illustrating color classification element data classified into field frame colors according to the fourth embodiment.
FIG. 13 is a diagram illustrating color classification element data classified into character frame colors according to the fourth embodiment.
14 is a diagram showing color classification element data classified into colors not registered in a standard form in Example 4. FIG.
FIG. 15 is a diagram illustrating a character frame cutout result according to the fourth embodiment.
FIG. 16 is a diagram illustrating a relationship between a character cutout target and a character cutout position according to the fourth embodiment.
FIG. 17 is a diagram illustrating an example of character cutout results according to the fourth embodiment.
FIG. 18 is a flowchart showing the processing operation of the character cutout means 7 according to the fifth embodiment.
FIG. 19 is a diagram illustrating an example of a multicolor form image according to the fifth embodiment.
FIG. 20 is a diagram illustrating color classification element data classified into field frame colors according to the fifth embodiment.
FIG. 21 is a diagram illustrating color classification element data classified into preprint colors according to the fifth embodiment.
22 is a diagram showing color classification element data classified into entry character colors in Example 5. FIG.
FIG. 23 is an image diagram of a frame and the like for explaining the operation of extracting a character string area in the fifth embodiment.
FIG. 24 is an image diagram for explaining a character string cut-out operation according to the fifth embodiment.
FIG. 25 is a diagram illustrating a character string cutout result according to the fifth embodiment.
FIG. 26 is a diagram showing a result of character segmentation in Example 5.
27 is a flowchart showing the processing operation of the image extracting means in Embodiment 6. FIG.
FIG. 28 is a diagram illustrating an example of a multicolor form image according to the sixth embodiment.
FIG. 29 is a diagram illustrating color classification element data classified into field frame colors according to the sixth embodiment.
30 is a diagram showing color classification element data classified into entry character colors in Example 6. FIG.
FIG. 31 is a diagram showing color classification element data classified into colors not registered in a standard form in Example 6.
FIG. 32 is a diagram illustrating an image in which an entry character color and a non-registered color are superimposed on a standard form in Example 6.
FIG. 33 is a flowchart showing the processing operation of the result output means in the seventh embodiment.
FIG. 34 is a configuration block diagram of a conventional multicolor form processing apparatus.
FIG. 35 is a diagram illustrating an example of a multicolor form image having a character frame.
FIG. 36 is a diagram for explaining a character cut-out operation by a conventional reading device.
FIG. 37 is a diagram showing a conventional character cutout result for the input shown in FIG. 35.
FIG. 38 is a diagram illustrating an example of a multicolor form image having a preprint.
FIG. 39 is a diagram showing a conventional character cutout result for the input shown in FIG. 38;
[Explanation of symbols]
1 Multicolor form image 2 Multicolor form reading format (standard form) group memory 3 Multicolor form reading format creation / change means 4 Color integration means 5 Form type determination means 6 Image extraction means 7 Character extraction means 8 character recognition means 9 result output means 24 image range designation means 25 color information setting / changing means 26 reading format structure setting means 27 color information setting means for reading format

Claims

A multi-color form reading format creation change means for creating a standard form in which a range is set for each designated color of character image input and the range position and color are grouped for each field;
A standard form created by the multicolor form reading format creation change means, and a form type determination means for comparing the character image input to determine the type of the input form;
Image extracting means for extracting a target area from the character image input based on the type determined by the form type determining means;
A character cutout means for cutting out a read target character from the reading area extracted by the image extraction means and the setting value of the corresponding field of the standard form of the type determined by the form type determination means;
A multicolor character image reading apparatus comprising character recognition means for recognizing a character to be read for each region cut out by the character cutout means.

2. The multicolor character image reading apparatus according to claim 1, further comprising color integration means for integrating pixels having similar color components with respect to character image input into a designated color.

2. The multicolor character image reading apparatus according to claim 1, further comprising a result output means for outputting the image extracted by the image extraction means and the recognition result of the character recognition means output to the basic configuration.

Furthermore, the multi-color form reading format creation changing means extracts and outputs only an image corresponding to a specific color so that the range position and color setting can be changed. Multicolor character image reader.

2. The multicolor character image reading apparatus according to claim 1, wherein the form type determining means determines the form type based on the color information of each of the standard form and the character image input.

Further, the form type determining means determines the form type by associating data for each color field in addition to the color information of each of the standard form and character image input. Color character image reader.

2. The multi-color character image reading apparatus according to claim 1, wherein the character cutout means cuts out the character image input by using the relative color information of the field color information and the character image input.

2. The multicolor character image reading apparatus according to claim 1, wherein the image extracting unit extracts a plurality of pieces of color information from the target region and outputs the extracted information.