JP4718699B2

JP4718699B2 - Character recognition device, character recognition method, program, and computer-readable recording medium

Info

Publication number: JP4718699B2
Application number: JP2001075106A
Authority: JP
Inventors: 利夫宮澤
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2001-03-15
Filing date: 2001-03-15
Publication date: 2011-07-06
Anticipated expiration: 2021-03-15
Also published as: JP2002279353A

Description

【０００１】
【発明の属する技術分野】
本発明は、文字認識装置、文字認識方法、プログラム、およびコンピュータ読み取り可能な記録媒体に関し、特に、文字認識処理結果後の認識結果の後処理に関する。
【０００２】
【従来の技術】
スキャナ等から計算機に取り込んだ文書画像データ中の文字画像を識別し、文字コードとして出力する光学的文字読取（ＯＣＲ）ソフトウェアは、近年非常に広範囲に用いられている。これは、文字コードとして電子データ化した情報は紙ベースのものや画像データと比較して、再利用が容易であって、保管や交換も効率的に行えるというメリットを持つためである。
しかしながら、文字認識においては、文字画像からの１文字単位の認識では誤認識を完全に排除することはきわめて困難である。このため後処理として、文字認識された結果の文字列に対して、言語的制約を与えて候補の選択を行ったり、単語照合や形態素解析等により自動的に修正することが行われることが多い。
この方法は多くの誤りを除去できるが、適用の仕方によっては誤訂正のために新たな誤りを作り出す場合もある。
また、この修正によって１００％修正されるわけではないため、他の文字切り出し方法による認識結果と比較して、確信度の高い認識結果を採用したり（例えば、特開平９−２７４６４５号公報参照）、各文字に確信度を付与してユーザへ通知して、ユーザに修正を任せたりしている。
例えば、文字認識の後処理方法として、特開平５−４０８５３号公報の技術は、文字認識結果の各単語に対して候補単語の作成と確信度の算出を行い、その確信度が大きな単語との文法的関係を用いて確信度の小さな単語の認識結果を修正するものである。
確信度の計算としては、例えば、特開平９−１３４４１０号公報の技術のように、確信度計算に先立つ言語処理や単語の表記長、品詞、出現頻度、前接する語との接続の強度等のパラメータを合成して算出している。
【０００３】
【発明が解決しようとする課題】
しかしながら、文字認識の後処理として単語辞書、文法知識を用いて言語処理を行っても、もともと文法的誤りや、認識不能の領域、文法的に特殊な領域等では、結果を改善することは期待できない。
結局、ユーザに文字認識結果を提示し、これを修正することに落ち着くことになる。
ユーザに提示された認識結果の修正を手助けする技術としては、例えば、特開平５−４６８０３号公報では、認識結果の確からしさが低い認識結果に網掛けし、文字種に応じてさらに区別が可能な識別子を同時に表示することによって、ユーザがカタカナの「リ」と平仮名の「り」のように類似する文字に対する認識結果を修正し易くしている。
また、誤認識結果の修正を容易に行うために、特開平６−１７６１８９号公報では、認識結果が怪しいと判断された文字と前後１文字ずつの合計３文字の候補を表示したり、特公平７−１１３９５１号公報や特許第２９１５４１７号公報では、認識結果と原画像を同時に表示したりしている。
しかしながら、上記従来の技術では、主に視覚情報に頼った表示方法であるため、どの文字が不確かな認識結果であるかを視覚障害者等には判断できなかった。
本発明は、上記の問題点を解決するために、認識結果を視覚情報、聴覚情報や触覚情報等を用いて認識結果を修正し、後処理の効率を向上させる文字認識装置、その方法および記録媒体を提供することを目的とする。
特に、視覚障害者であっても、文字認識後の後処理の作業効率が向上する文字認識装置、その方法および記録媒体を提供することを目的とする。
【０００４】
【課題を解決するための手段】
この発明は上記の目的を達成するため、文書画像を入力する画像入力部と、前記画像入力部で入力された画像の文字領域に対して文字認識する文字認識部と、前記文字認識部から得た各文字に対する文字認識結果の確信度を算出する確信度算出部と、上記文字認識部により認識された文字から単語を取り出す単語抽出部と、上記確信度算出部で計算された各文字に対する認識結果から所定の確信度を得られない文字に対し、その文字の前後に２字以上の単語となっていない隣接した単字があった場合に、上記所定の確信度を得られない文字とその文字に隣接した単字とを１つの文字列として連結して抽出する低確信度抽出部と、上記文字認識部による認識結果について、上記低確信度抽出部によって抽出された上記文字列を他の単語と区別して出力する結果出力部を備えた文字認識装置を提供する。
また、上記のような文字認識装置において、上記単語抽出部は、所定の記憶部が保持する単語辞書を用いて、上記文字認識部により認識された文字を含む単語を検索し、その検索された単語および上記検索された単語の品詞情報を取り出し、上記品詞情報に基づいて上記検索された単語の評価値を計算し、上記検索された単語の中から最も評価値の高い単語を抽出するようにするとよい。
さらに、上記のような文字認識装置において、前記結果出力部は、上記所定の確信度が得られない文字を含む単語が修正されたとき、上記所定の確信度が得られない文字を含む単語を修正後の単語によって置き換え、次の上記所定の確信度が得られない文字を含む単語の修正へ移るようにするとよい。
【０００５】
また、文字認識装置で実行される文字認識方法であって、文書画像を入力する画像入力工程と、上記画像入力工程で入力された画像の文字領域に対して文字認識する文字認識工程と、上記文字認識工程から得た各文字に対する文字認識結果の確信度を算出する確信度算出工程と、上記文字認識工程により認識された文字から単語を取り出す単語抽出工程と、上記確信度算出工程で計算された各文字に対する認識結果から所定の確信度を得られない文字に対し、その文字の前後に２字以上の単語となっていない隣接した単字があった場合に、上記所定の確信度を得られない文字とその文字に隣接した単字とを１つの文字列として連結して抽出する低確信度抽出工程と、上記文字認識工程による認識結果について、上記低確信度抽出工程によって抽出された上記文字列を他の単語と区別して出力する結果出力工程とからなる文字認識方法を提供する。
さらに、上記のような文字認識方法において、上記単語抽出工程は、所定の記憶部が保持する単語辞書を用いて、上記文字認識工程により認識された文字を含む単語を検索し、その検索された単語および上記検索された単語の品詞情報を取り出し、上記品詞情報に基づいて上記検索された単語の評価値を計算し、上記検索された単語の中から最も評価値の高い単語を抽出する工程を含むようにするとよい。
また、コンピュータに、上記のような文字認識方法の各工程を実行させるためのプログラムも提供する。
さらに、上記のようなプログラムを記録したコンピュータ読み取り可能な記録媒体も提供する。
【０００６】
【発明の実施の形態】
以下に、図面を用いて本発明の実施の形態の構成および動作を詳細に述べる。
＜実施例＞
（１）実施例の構成
図１は、本発明の実施例の文字認識装置の構成を示すブロック図である。
本発明の実施例の文字認識装置１００は、制御部１０、画像入力部２０、文字認識部３０、確信度算出部４０、結果出力部５０、原画像記憶部２５、認識辞書３５、認識結果記憶部４５、言語辞書５５から構成されている。
さらに、結果出力部５０は低確信度抽出部６０を含んでいる。
制御部１０は、スキャナやファイルからの画像を読取り、画像情報から文字認識し、最終結果である認識結果を出力するまでの全体を制御する。
画像入力部２０は、スキャナやファイルからの画像を読取り、その画像データを原画像記憶部２５へ格納する。
文字認識部３０は、原画像記憶部２５に記憶された画像情報から文字画像領域を判別し、その文字領域から行を切り出し、切り出された行から文字を切り出し、その文字部分を囲む矩形の対角座標値を抽出し、その文字部分の大きさの正規化やノイズ（汚れ等）除去し、特徴量を計算し、その特徴量と標準パターンを保持する認識辞書３５とでパターンマッチングを行い、１文字あたり単数または複数の認識候補文字とその順位、およびそれらに対応する標準パターンとの距離値を認識結果記憶部４５へ記憶する。
認識辞書３５は、文字ごとに文字コード、その文字の標準パターンの特徴量等の情報を保持する。
【０００７】
次に、文字認識部３０は、各文字位置を開始点とする単語候補を生成して言語辞書５５の単語辞書を検索し、マッチした単語とその品詞情報を取り出し、処理対象領域の先頭から候補単語を接続して単語パスを生成すると同時に言語辞書５５の品詞間接続コストテーブルを用いてその単語パスのコストを計算する。この生成された単語パスが一定数以下になるように、その単語パスのコストの高い順に選択し、もっともコストの小さい単語パスに基づいて認識候補文字を修正し、認識結果記憶部４５を更新する。
言語辞書５５は、単語の表記、よみ、品詞等の情報を保持する単語辞書と、連接する単語の品詞が接続可能かどうかを示す重みを保持する品詞間接続コストテーブルとからなっている。例えば、日本語の場合は、単語の表記に、漢字のみの単語の他に、「漢字＋かな」、「漢字＋英字」等も登録できる。
確信度算出部４０は、確信度を算出するためのパラメータに、以下のような条件に適合する数値（重み）を割り当て、これらのパラメータの一次結合や平均値として確信度を算出する（例えば、特開平９−１３４４１０号公報）。
（Ａ）その文字が画像中の同一文字に対する認識結果の中で高い順位にあるほど確信度は高い。
（Ｂ）その文字の文字認識における類似度が高いほど、確信度は高い。
（Ｃ）その文字が属すると判定された単語の表記長が長いほど確信度は大きい。
（Ｄ）その文字またはその文字のカテゴリーと前後の文字のそれとの間の連接可能性が大きいほど、確信度は高い。
（Ｅ）その単語と前後の単語との接続可能性が大きいほど確信度は大きい。
この規則は、予め多くの文書の統計的性質やヒューリスティックなルールとして、例えば、ルールテーブルや品詞間接続コストテーブル等に保持しておく。
結果出力部５０は、後処理を行わせるために認識結果記憶部４５に格納されている認識結果とそれに対応する原画像データと対応させて出力する。
この出力された結果をユーザは、参照することによって認識結果に誤りがあれば修正して、最終結果をディスプレイ、プリンタやファイル等の出力装置に出力したり、ネットワークを介して他のコンピュータへ送信したりする。
【０００８】
この後処理において、従来の方法では、誤認識した文字のみを修正することを念頭においているために、正解文字を入力する際に、１文字の「単字」を入力している。
しかし、ユーザは、漢字の場合、同音の単漢字が多いのでなかなか正解文字を選択することができないが、単語単位で考えると、同音の単語は単漢字に比べるとその数は少なく、誤認識した文字のみを修正するのではなく、その前後の正解している文字も含め、単語単位で修正を行ったほうが効率が良い場合が多い。
一方、確信度算出部４０で算出した認識結果の確信度が低い文字を含んでいる単語があるということは、言語辞書５５とのマッチングがとれなかった場合が多い。
このような場合には、低確信度抽出部６０は、まず、単語パス中に確信度の低い認識結果の文字を探し、この検出された認識文字の前後に２字以上の単語として登録されていない隣接した単字を単語パスの中に探し、それらを１つの文字列として連結し、その文字列を１つの単語（以下、本発明では、この文字列も単語と呼ぶ。）として抽出する。
例えば、図２を参照すると、「周辺イメージ表示処理」という画像データを文字認識した結果、「周辺イメージ麦示処理」という文字列が得られ、単語パスは「周辺」、「イメージ」、「麦」、「示」、「処理」が得られている場合を考える。また、それぞれの文字に対する確信度は、「90」、「89」、「80」、「75」、「70」、「80」、「50」、「80」、「99」、「92」と算出されたとする。
低い確信度「50」の「麦」が検出されるので、「麦」から前後に探索をすると、前方向には「イメージ」という単語があり、後ろ方向には、単字の「示」と単語の「処理」があるので、確からしさの低い単語として「麦示」が抽出される。
【０００９】
次に、結果出力部５０は、認識された結果の後処理を行うときに、低確信度抽出部６０で上記のように抽出した確信度が低い単語と、確信度の高い単語とを区別して表示する。
例えば、表示の方法として、色を変えたり、フォントの大きさを変えたり、下線を引いたり、ウインクさせたりして視覚的に区別できるようにする。
この後処理では、１つの単語を修正すると、この修正された単語によって認識結果記憶部４５を更新した後、次の確信度の低い単語に移るようにする。
これにより、ユーザはキーボードから手を離すことなく、単語単位で効率的に誤認識の修正ができる。
このように実施例を構成することにより、従来のように文字単位で修正せずに、単語単位で確信度の低い単語を表示させて、単語単位で修正するので、ユーザの入力の手間も減り、正しい単語へ修正しやすくなり、編集作業の効率も向上する。
【００１０】
（２）処理手順
図３は、本実施例の処理手順を示すフローチャートである。
スキャナやファイルからの画像を読取り、その画像データを原画像記憶部２５へ格納する（ステップＳ１００）。これにより画像入力部２０を構成する。
原画像記憶部２５に記憶された画像情報から文字画像領域を判別し、その文字領域から行を切り出す（ステップＳ１１０）。
切り出された行から文字を切り出し、その文字部分を囲む矩形の対角座標値を抽出し、その文字部分の大きさの正規化やノイズ（汚れ等）除去し、特徴量を計算する（ステップＳ１２０）。
この特徴量と標準パターンを保持する認識辞書３５とからパターンマッチングを行い、１文字あたり単数または複数の認識候補文字とその順位、およびそれらに対応する標準パターンとの距離値を認識結果記憶部４５へ記憶する（ステップＳ１３０）。
認識結果記憶部４５に登録した認識候補を基に、各文字位置を開始点とする単語候補を生成して言語辞書５５の単語辞書を検索し、マッチした単語とその品詞情報を取り出し、処理対象領域の先頭から候補単語を接続して単語パスを生成すると同時に言語辞書５５の品詞間接続コストテーブルを用いてその単語パスのコストを計算する。この生成された単語パスが一定数以下になるように、その単語パスのコストの高い順に選択し、もっともコストの小さい単語パスに基づいて認識候補文字を修正し、認識結果記憶部４５を更新する（ステップＳ１４０）。
ステップＳ１１０からステップ１４０で文字認識部３０を構成する。
確信度を算出するためのパラメータに対して、例えば、ルールテーブルや品詞間接続コストテーブル等に保持してある重みを取り出して、その一次結合や平均値として確信度を算出する（ステップＳ１５０）。
これにより確信度算出部４０を構成する。
ステップＳ１４０で生成された単語パス中に確信度の低い認識結果の文字を探し、この検出された認識文字の前後に２字以上の単語として登録されていない隣接した単字を単語パスの中に探し、それらを１つの文字列として連結し、その文字列を１つの単語として抽出する（ステップＳ１６０）。
これにより低確信度抽出部６０を構成する。
認識結果の後処理を行わせるために認識結果記憶部４５に格納されている認識結果とそれに対応する原画像データと対応させ、ステップＳ１６０で抽出した確信度が低い単語と、確信度の高い単語とを色を変えたり、フォントの大きさを変えたり、下線を引いたり、ウインクさせることによって、視覚的に区別して出力する。
この出力された結果をユーザは、参照することによって認識結果に誤りがあれば修正する（ステップＳ１７０）。
この後処理のとき、１つの単語を修正すると、この修正された単語によって認識結果記憶部４５を更新した後、次の確信度の低い単語に移るようにする。
すべての認識結果の後処理が終了した後、認識結果記憶部４５に記憶された最終結果をディスプレイ、プリンタやファイル等の出力装置に出力したり、ネットワークを介して他のコンピュータへ送信したりする（ステップＳ１８０）。
ステップＳ１６０からステップＳ１８０により結果出力部５０を構成する。
【００１１】
＜変形例１＞
一方、視覚障害者の場合、認識結果を読み上げることによって、情報を得ていることが多い。しかし、上記実施例の方法では、確からしさによって、視覚的に判断することは可能であるが、視覚障害者には利用できるものではない。
そこで、本変形例１では、結果出力部５０の後処理のときに、低い確信度の単語を抽出し、この単語を視覚的に表示する代わりに、読み上げる音声情報を確信度によって、音質を変えたり、音の大きさを変えたりすることにより区別できるようにする。この低確信度の単語を読むときには、単語としての意味をなしていないので、単語としてではなく１文字ごとに複数候補を読み上げるようにする。
例えば、ステレオ方式で音声を出力するときには、確信度が高いときは、右と左信号を同時に出力するが、低確信度のときは、右のみまたは左のみの信号に出力するなど、音声の出力先を変更するようにしても構わない。
また、認識結果を読み上げる前後に、区別するための信号音やメッセージを入れてもよい。
また、視覚障害者には、画像を見ることができないので、完全に正しい文字に修正することは不可能かもしれないが、前後の流れ等から、推測し修正することや、想像することは可能であろう。
従って、後処理で、確信度の低い単語を修正した後、次の低確信度の単語を直接読み上げるのではなく、その単語の近辺も合わせて読むようにすることによって、視覚障害者が何を修正すべきかをわかるようにする。
【００１２】
または、聴覚障害者の場合には、次の低確信度の単語へジャンプするのではなく、順番にすべて読んでいくようにしてもよい。
健常者では、このような読み方をせず、次の低確信度の単語へジャンプする方が作業効率が上がるので、健常者と視覚障害者の場合とで、ジャンプするかどうかを指定して、使い分けるようにしてもよい。
本変形例１のように構成することにより、視覚障害者でも単語単位で確信度の低い単語を読み上げるので、ユーザが正しい単語へ修正しやすくなり、編集作業の効率も向上する。
また、この後処理で認識結果を出力するときには、視覚的と聴覚的を合わせて出力するようにしておけば、視覚障害者とその介助者とが共同して作業できるので、より編集作業の効率が向上する。
また、視覚的にチェックしたとき、似た形の文字では修正漏れを起こすことがあるので、視覚的と聴覚的を合わせて出力するようにしておけば、似た形の文字を音声で読み上げることによって、健常者であっても誤りを発見しやすくなり、より編集作業の効率が向上する。
【００１３】
＜変形例２＞
また、視覚障害者の場合は、認識結果を点字によって情報を得ていることが多い。
しかし、上記実施例の方法では、確からしさによって、視覚的に判断することは可能であるが、視覚障害者には利用できるものではない。
そこで、本変形例２では、結果出力部５０の後処理のときに、低い確信度の単語を抽出し、この単語を視覚的に表示する代わりに、点字出力装置によって触覚的に判断可能な方式、例えば、点字の凹凸の高さを変更したり、低確信度の文字や単語の前後に区別するための、マークなどを入れるようにする。
また、低確信度の単語を後処理で修正するとき、この低確信度の単語へのジャンプは、低確信度の単語を含む行またはその周辺を点字のカーソル行（点字のピンが出る）で示すようにする。
本変形例２のように構成することにより、視覚障害者でも単語単位で確信度の低い単語を点字として出力するので、ユーザが正しい単語へ修正しやすくなり、編集作業の効率も向上する。
また、この後処理で認識結果を出力するときには、触覚的と聴覚的とを合わせて出力するようにしておけば、視覚障害者が聴覚的に聞き漏らしたときも、触覚的に確かめられるので、より編集作業の効率が向上する。
尚、本実施の形態では、文字認識部３０で認識された候補の中から言語辞書５５によって、単語パスを作成し、この単語パスの中から確信度の低い文字を含む文字列（単語）を抽出しているが、結果出力部５０の認識結果の後処理において、認識結果に対して言語辞書５５に単語として登録されていない未知語を探すように構成しても、同一の効果をもたらすことができる。
【００１４】
＜コンピュータによる実施例＞
さらに、本発明は上記の実施の形態のみに限定されたものではない。例えば、図１に示した文字認識装置１００は、図４のようなハードウェア構成を持つコンピュータ装置２００によっても実現が可能である。
即ち、コンピュータ装置２００は、キーボード、マウス、タッチパネル、スキャナ、点字入力装置等により構成され、情報の入力に使用される入力装置１と、種々の出力情報や入力装置１からの入力された情報などを表示したり、プリンタや点字出力装置等へ出力させる出力装置２と、種々のプログラムを動作させるＣＰＵ（Central Processing Unit；中央処理ユニット）３と、プログラム自身を保持し、またそのプログラムがＣＰＵ３によって実行されるときに一時的に作成される情報等を保持するメモリ４と、本発明の文字認識装置の原画像記憶部２５、認識辞書３５、認識結果記憶部４５、言語辞書５５およびプログラムやプログラム実行時の一時的な情報等を保持する記憶装置５と、プログラムやデータ等を記憶した記録媒体を装着してそれらを読み込み、メモリ４または記憶装置５へ格納するのに用いられる媒体駆動装置６と、ネットワーク９へ接続するためのインタフェースであるネットワーク接続装置７とから構成され、それらはバス８で接続されている。
また、ネットワーク９は、コンピュータ装置２００と他のコンピュータ装置２００とを結合するための伝送路であって、一般には、ケーブルで実現され、通信プロトコルにはＴＣＰ／ＩＰが使われる。但し、伝送路としてはケーブルだけではなく、それらの間の通信プロトコルが一致するものであれば無線、有線または放送波のいずれでもよく、例えば、ＬＡＮ(Local Area Network)、ＷＡＮ（Wide Area Network）、インターネット、アナログ電話網、デジタル電話網（ＩＳＤＮ：Integral Service Digital Network）、ＰＨＳ（パーソナルハンディホンシステム）、携帯電話網、衛星通信網などを用いることができる。
【００１５】
このようなコンピュータ装置２００の構成において、図１に示した文字認識装置を構成する各機能をそれぞれプログラム化し、予めＣＤ−ＲＯＭ等の記録媒体に書き込んでおき、このＣＤ−ＲＯＭを各サイトのＣＤ−ＲＯＭドライブのような媒体駆動装置６を搭載したコンピュータ装置２００に装着して、これらのプログラムをそれぞれのコンピュータ装置２００のメモリ４あるいは記憶装置５に格納し、それを実行することによって、上記の実施の形態と同様な機能を実現することができる。
尚、記録媒体としては半導体媒体（例えば、ＲＯＭ、ＩＣメモリカード等）、光媒体（例えば、ＤＶＤ、ＭＯ、ＭＤ、ＣＤ−Ｒ等）、磁気媒体（例えば、磁気テープ、フレキシブルディスク等）等のいずれであってもよい。
また、コンピュータ装置２００のメモリ４へロードしたプログラムを実行することにより上記した実施の形態の機能が実現されるだけでなく、そのプログラムの指示に基づき、オペレーティングシステム等が実際の処理の一部または全部を行い、その処理によって上記した実施の形態の機能が実現される場合も含まれる。
また、上記した実施の形態を実現するプログラムがＲＯＭ等のような半導体の記録媒体である場合には、媒体駆動装置６からではなく、直接、メモリ４へロードして実行される。
【００１６】
＜本発明のネットワーク環境での運用＞
図５は、本発明を有線または無線の通信ネットワークに接続して運用する形態の構成を示している。
例えば、文字認識プログラムを保持するサーバー２１０と複数のユーザが利用する端末２２０とをネットワーク９で接続する。
この場合、サーバー２１０およびユーザの端末２２０は、図４に示した汎用のコンピュータ装置２００で構成される。
ユーザは、端末２２０からサーバー２１０に対してログインしたり、文字認識のための画像データを入力し、サーバー２１０の文字認識プログラムへ文字認識の実行を依頼する。
サーバー２１０の文字認識プログラムは、送信された画像データの文字領域に対する文字認識結果を要求元の端末２２０へ戻す。
ユーザの端末２２０は、この認識結果やもとの画像データとを対比させながら出力したり、後処理を行ったりする。
このようにすることで、常に最新の文字認識プログラムを使えるという利点がある。
また、図５のようにサーバー２１０と端末２２０とを有線または無線の通信ネットワークで接続した場合、サーバー２１０の磁気ディスク等の記憶装置に本発明の機能を実現する文字認識プログラムを格納しておき、端末２２０に対してダウンロード等の形式で頒布することも可能である。
さらに、本発明の機能を実現する文字認識プログラムを媒体や放送波による配布で提供するようにしてもよい。
【００１７】
【発明の効果】
以上説明したように、本発明によれば、視覚・聴覚などの障害の有無によらず、認識結果の確からしさを確認することが可能となる。
また、単語単位で確からしくない単語を抽出するので、その単語の修正が容易となり、後処理の作業効率が向上する。
【図面の簡単な説明】
【図１】実施例の機能構成を示すブロック図である。
【図２】従来例と本発明の文字認識結果を説明する図である。
【図３】実施例の処理手順を説明するフローチャートである。
【図４】本発明の文字認識装置が稼動するためのコンピュータ装置を示す図である。
【図５】本発明のネットワーク環境での運用例を説明するための図である。
【符号の説明】
１入力装置、２出力装置、３ＣＰＵ、４メモリ、５記憶装置、６媒体駆動装置、７ネットワーク接続装置、８バス、９ネットワーク、１０制御部、２０画像入力部、２５原画像記憶部、３０文字認識部、３５認識辞書、４０確信度算出部、４５認識結果記憶部、５０結果出力部、５５言語辞書、６０低確信度抽出部、１００文字認識装置、２００コンピュータ装置、２１０サーバー、２２０端末[0001]
BACKGROUND OF THE INVENTION
  The present invention relates to a character recognition device,Character recognitionMethod,program,andComputer readableThe present invention relates to a recording medium, and more particularly to post-processing of recognition results after character recognition processing results.
[0002]
[Prior art]
In recent years, optical character reading (OCR) software for identifying a character image in document image data taken into a computer from a scanner or the like and outputting it as a character code has been very widely used. This is because information converted into electronic data as a character code is easier to reuse and can be stored and exchanged more efficiently than paper-based information or image data.
However, in character recognition, it is extremely difficult to completely eliminate misrecognition by recognizing one character unit from a character image. For this reason, as a post-processing, a character string obtained as a result of character recognition is often subjected to linguistic restrictions to select a candidate or automatically corrected by word matching, morphological analysis, or the like. .
This method can eliminate many errors, but depending on the application method, a new error may be created for error correction.
In addition, since the correction is not 100% correction, a recognition result having a high certainty factor is adopted as compared with a recognition result obtained by another character segmentation method (for example, see Japanese Patent Laid-Open No. 9-274645). , A certainty factor is given to each character, the user is notified, and the correction is left to the user.
For example, as a post-processing method for character recognition, the technique of Japanese Patent Laid-Open No. Hei 5-40853 creates candidate words for each word of the character recognition result and calculates a certainty factor. This corrects the recognition result of words with low confidence using grammatical relations.
As the calculation of the certainty factor, for example, as in the technique of JP-A-9-134410, language processing prior to the certainty factor calculation, the notation length of the word, the part of speech, the appearance frequency, the strength of the connection with the preceding word, etc. It is calculated by combining parameters.
[0003]
[Problems to be solved by the invention]
However, even if language processing is performed using word dictionary and grammatical knowledge as post-processing of character recognition, it is expected to improve the results in grammatical errors, unrecognizable areas, grammatical special areas, etc. Can not.
Eventually, the user is presented with a character recognition result and settled on correcting it.
As a technique for helping to correct the recognition result presented to the user, for example, in Japanese Patent Laid-Open No. 5-46803, the recognition result with low probability of the recognition result is shaded, and can be further distinguished according to the character type. By displaying the identifiers at the same time, it is easy for the user to correct the recognition result for similar characters such as “ri” in katakana and “ri” in hiragana.
In order to easily correct erroneous recognition results, Japanese Patent Application Laid-Open No. 6-176189 displays a total of three candidate characters, one character before and after the character judged to be suspicious. In Japanese Patent Laid-Open No. 7-113951 and Japanese Patent No. 2915417, the recognition result and the original image are displayed simultaneously.
However, since the above conventional technique is a display method mainly relying on visual information, a visually impaired person or the like cannot determine which character is an uncertain recognition result.
In order to solve the above-described problems, the present invention corrects a recognition result using visual information, auditory information, tactile information, or the like to improve the efficiency of post-processing, a method thereof, and a recording The purpose is to provide a medium.
In particular, it is an object of the present invention to provide a character recognition device, a method thereof, and a recording medium that improve the work efficiency of post-processing after character recognition even for visually impaired persons.
[0004]
[Means for Solving the Problems]
  In order to achieve the above object, the present inventionAn image input unit for inputting a document image, a character recognition unit for character recognition for a character area of the image input by the image input unit, and a character recognition result for each character obtained from the character recognition unit.ConfidenceA certainty factor calculation unit for calculatingA word extraction unit that extracts a word from characters recognized by the character recognition unit, and a character that cannot obtain a predetermined certainty factor from the recognition result for each character calculated by the certainty factor calculation unit, before and after the character When there is an adjacent single character that is not two or more words, the character that cannot obtain the predetermined certainty and the single character adjacent to the character are combined and extracted as one character string. There is provided a character recognition device including a certainty factor extraction unit and a result output unit that outputs the character string extracted by the low certainty factor extraction unit in distinction from other words with respect to the recognition result by the character recognition unit.
  Also,As aboveIn a character recognition device,The word extraction unit searches for a word including a character recognized by the character recognition unit using a word dictionary held in a predetermined storage unit, and obtains the searched word and the part of speech information of the searched word. The evaluation value of the searched word is calculated based on the part-of-speech information, and the word having the highest evaluation value may be extracted from the searched words.
  Furthermore, as aboveIn the character recognition device, the result output unit includes:When a word including a character for which the predetermined certainty factor cannot be obtained is corrected, the word including the character for which the predetermined certainty factor cannot be obtained is replaced with a corrected word, and the next predetermined certainty factor is obtained. You may want to move on to correcting words that contain no letters.
[0005]
  Also,A character recognition method executed by a character recognition device, comprising: an image input step for inputting a document image; a character recognition step for recognizing a character region of an image input in the image input step; and the character recognition A certainty factor calculating step for calculating the certainty factor of the character recognition result for each character obtained from the steps, a word extracting step for extracting a word from the characters recognized by the character recognition step, and the respective confidence factor calculating steps. If there is an adjacent single character that is not a word of two or more characters before and after the character for which the predetermined certainty cannot be obtained from the recognition result for the character, the predetermined certainty cannot be obtained. A low certainty factor extraction step for extracting a character and a single character adjacent to the character as a single character string, and a recognition result obtained by the character recognition step are extracted by the low certainty factor extraction step. And to provide a character recognition method comprising the result output step of outputting the character string to distinguish from other words.
Further, in the character recognition method as described above, the word extraction step searches for a word including the character recognized by the character recognition step using a word dictionary held in a predetermined storage unit, and the search is performed. Extracting a word and part of speech information of the searched word, calculating an evaluation value of the searched word based on the part of speech information, and extracting a word having the highest evaluation value from the searched word It should be included.
Also provided is a program for causing a computer to execute each step of the character recognition method as described above.
Furthermore, a computer-readable recording medium recording the program as described above is also provided.
[0006]
DETAILED DESCRIPTION OF THE INVENTION
The configuration and operation of the embodiment of the present invention will be described below in detail with reference to the drawings.
<Example>
(1) Configuration of the embodiment
FIG. 1 is a block diagram showing a configuration of a character recognition apparatus according to an embodiment of the present invention.
A character recognition device 100 according to an embodiment of the present invention includes a control unit 10, an image input unit 20, a character recognition unit 30, a certainty factor calculation unit 40, a result output unit 50, an original image storage unit 25, a recognition dictionary 35, and a recognition result storage. It consists of a unit 45 and a language dictionary 55.
Further, the result output unit 50 includes a low confidence extraction unit 60.
The control unit 10 reads the image from the scanner or file, recognizes characters from the image information, and controls the entire process from outputting the recognition result as the final result.
The image input unit 20 reads an image from a scanner or a file and stores the image data in the original image storage unit 25.
The character recognition unit 30 discriminates a character image area from the image information stored in the original image storage unit 25, cuts out a line from the character area, cuts out a character from the cut out line, and pairs of rectangles surrounding the character part. Extract the angular coordinate value, normalize the size of the character part, remove noise (dirt, etc.), calculate the feature value, perform pattern matching with the recognition dictionary 35 that holds the feature value and the standard pattern, One or a plurality of recognition candidate characters per character, their ranks, and distance values between the corresponding standard patterns are stored in the recognition result storage unit 45.
The recognition dictionary 35 holds information such as a character code for each character and a feature amount of a standard pattern of the character.
[0007]
Next, the character recognizing unit 30 generates word candidates starting from each character position, searches the word dictionary of the language dictionary 55, extracts the matched word and its part of speech information, and selects candidates from the top of the processing target area. The word path is generated by connecting the words, and at the same time, the cost of the word path is calculated using the part-of-speech connection cost table of the language dictionary 55. The word paths are selected in descending order of cost so that the number of generated word paths is below a certain number, the recognition candidate characters are corrected based on the word path with the lowest cost, and the recognition result storage unit 45 is updated. .
The language dictionary 55 includes a word dictionary that holds information such as word notation, reading, part of speech, and a part-of-speech connection cost table that holds weights indicating whether or not the parts of speech of connected words can be connected. For example, in the case of Japanese, “Kanji + Kana”, “Kanji + English”, etc. can be registered in the word notation, in addition to words of only Kanji.
The certainty factor calculation unit 40 assigns numerical values (weights) that meet the following conditions to the parameters for calculating the certainty factor, and calculates the certainty factor as a linear combination or an average value of these parameters (for example, JP-A-9-134410).
(A) The certainty level is higher as the character ranks higher in the recognition result for the same character in the image.
(B) The higher the degree of similarity in character recognition of the character, the higher the certainty level.
(C) The certainty factor increases as the notation length of the word determined to belong to the character is longer.
(D) The greater the likelihood of connection between the character or category of the character and that of the preceding and succeeding characters, the higher the certainty.
(E) The greater the possibility of connection between the word and the preceding and following words, the greater the certainty.
This rule is stored in advance in, for example, a rule table or a part-of-speech connection cost table as statistical properties or heuristic rules of many documents.
The result output unit 50 outputs the recognition result stored in the recognition result storage unit 45 in correspondence with the original image data corresponding to the recognition result for post-processing.
The user corrects the recognition result if there is an error by referring to the output result, and outputs the final result to an output device such as a display, a printer or a file, or transmits it to another computer via a network. To do.
[0008]
In this post-processing, the conventional method is intended to correct only misrecognized characters, and therefore, when inputting correct characters, one “single character” is input.
However, in the case of kanji, there are many single kanji characters with the same sound, so it is difficult to select the correct answer character. However, when considered in terms of words, the number of homophones is less than that of single kanji characters and misrecognized. In many cases, it is more efficient not to correct only the characters, but to correct them in units of words, including correct characters before and after.
On the other hand, the fact that there is a word including a character with low confidence in the recognition result calculated by the certainty calculation unit 40 often fails to match the language dictionary 55.
In such a case, the low confidence extraction unit 60 first searches for a character with a low recognition result in the word path, and is registered as a word of two or more characters before and after the detected recognized character. Searching for non-adjacent single characters in the word path, concatenating them as one character string, and extracting the character string as one word (hereinafter, this character string is also referred to as a word in the present invention).
For example, referring to FIG. 2, as a result of character recognition of image data “peripheral image display processing”, a character string “peripheral image display processing” is obtained, and word paths are “peripheral”, “image”, “wheat ”,“ Indication ”, and“ Process ”are considered. In addition, the certainty for each character is "90", "89", "80", "75", "70", "80", "50", "80", "99", "92" Suppose that it is calculated.
Since “wheat” with a low certainty factor of “50” is detected, when searching forward and backward from “wheat”, there is the word “image” in the forward direction and the single letter “show” in the backward direction. Since there is a word “processing”, “barley” is extracted as a word with low probability.
[0009]
Next, when performing the post-processing of the recognized result, the result output unit 50 distinguishes a word having a low certainty extracted from the low certainty extraction unit 60 as described above from a word having a high certainty. indicate.
For example, as a display method, it is possible to distinguish visually by changing a color, changing a font size, underlining or winking.
In this post-processing, when one word is corrected, the recognition result storage unit 45 is updated with the corrected word, and then the next word with a low certainty level is transferred.
Thus, the user can correct erroneous recognition efficiently in units of words without taking his hands off the keyboard.
By configuring the embodiment in this way, it is possible to display words with low confidence level in units of words and correct them in units of words without correcting them in units of characters as in the past. This makes it easier to correct words and improves the efficiency of editing.
[0010]
(2) Processing procedure
FIG. 3 is a flowchart showing the processing procedure of this embodiment.
An image from a scanner or a file is read, and the image data is stored in the original image storage unit 25 (step S100). Thus, the image input unit 20 is configured.
A character image region is determined from the image information stored in the original image storage unit 25, and a line is cut out from the character region (step S110).
A character is cut out from the cut out line, a diagonal coordinate value of a rectangle surrounding the character portion is extracted, the size of the character portion is normalized, noise (dirt, etc.) is removed, and a feature amount is calculated (step S120). ).
Pattern matching is performed from the feature dictionary and the recognition dictionary 35 that holds the standard pattern, and one or more recognition candidate characters per character, their ranks, and distance values between the corresponding standard patterns are recognized. (Step S130).
Based on the recognition candidates registered in the recognition result storage unit 45, word candidates starting from each character position are generated, the word dictionary of the language dictionary 55 is searched, the matched word and its part of speech information are extracted, and the processing target Candidate words are connected from the top of the region to generate a word path, and at the same time, the cost of the word path is calculated using the part-of-speech connection cost table of the language dictionary 55. The word paths are selected in descending order of cost so that the number of generated word paths is below a certain number, the recognition candidate characters are corrected based on the word path with the lowest cost, and the recognition result storage unit 45 is updated. (Step S140).
The character recognition unit 30 is configured from step S110 to step 140.
For the parameters for calculating the certainty factor, for example, the weights held in the rule table, the part-of-speech connection cost table, etc. are extracted, and the certainty factor is calculated as a linear combination or an average value (step S150).
Thereby, the certainty factor calculation unit 40 is configured.
In the word path generated in step S140, a character with a low recognition result is searched for, and adjacent single characters not registered as two or more words before and after the detected recognized character are entered in the word path. They are searched and connected as one character string, and the character string is extracted as one word (step S160).
Thereby, the low certainty degree extraction unit 60 is configured.
In order to perform post-processing of the recognition result, the recognition result stored in the recognition result storage unit 45 is associated with the corresponding original image data, and the word with low confidence and the word with high confidence extracted in step S160. By changing the color, changing the font size, underlining, or winking, you can visually distinguish them.
By referring to the output result, the user corrects the recognition result if there is an error (step S170).
In this post-processing, when one word is corrected, the recognition result storage unit 45 is updated with the corrected word, and then moved to the next word with a low certainty factor.
After the post-processing of all the recognition results is completed, the final result stored in the recognition result storage unit 45 is output to an output device such as a display, a printer or a file, or transmitted to another computer via a network. (Step S180).
The result output unit 50 is configured by steps S160 to S180.
[0011]
<Modification 1>
On the other hand, visually impaired people often obtain information by reading the recognition result. However, in the method of the above embodiment, it is possible to make a visual judgment depending on the certainty, but it cannot be used for a visually impaired person.
Therefore, in the first modification, in the post-processing of the result output unit 50, instead of extracting a word with a low certainty level and visually displaying this word, the sound quality of the voice information to be read is changed according to the certainty level. Or by changing the volume of the sound. When reading a word with low confidence, it does not make sense as a word, so a plurality of candidates are read out for each character, not as a word.
For example, when outputting sound in stereo format, when the confidence level is high, the right and left signals are output at the same time. The destination may be changed.
In addition, before and after reading out the recognition result, a signal sound or a message for distinguishing may be inserted.
Also, visually impaired people can not see the image, so it may not be possible to correct it to a completely correct character, but it can be guessed and corrected from the flow of front and back, etc. Will.
Therefore, after correcting a low-confidence word in post-processing, instead of directly reading the next low-confidence word, what the visually impaired person does is Make sure you know what to fix.
[0012]
Or in the case of a hearing-impaired person, you may make it read in order instead of jumping to the word of the next low certainty degree.
For healthy people, it is more efficient to jump to the next low confidence word without reading this way, so specify whether to jump between healthy and visually impaired people. You may make it use properly.
With the configuration as in the first modification, a visually impaired person reads a word with a low certainty factor in units of words, so that the user can easily correct it to a correct word, and the efficiency of editing work is improved.
Also, when outputting the recognition result in this post-processing, if visual and auditory are output together, the visually impaired and their caregiver can work together, so the editing efficiency is higher. Will improve.
Also, when visually checked, similar characters may cause omissions, so if you output both visual and auditory, you can read similar characters aloud. Thus, even a healthy person can easily find an error, and the efficiency of editing work is further improved.
[0013]
<Modification 2>
In the case of a visually impaired person, the recognition result is often obtained by Braille.
However, in the method of the above embodiment, it is possible to make a visual judgment depending on the certainty, but it cannot be used for a visually impaired person.
Therefore, in the second modification, in the post-processing of the result output unit 50, instead of extracting a word with a low certainty factor and visually displaying the word, a method that can be tactilely determined by the braille output device For example, the height of the unevenness of the Braille is changed, or a mark or the like for distinguishing before and after a character or word with low confidence is inserted.
Also, when correcting low confidence words in post-processing, the jump to this low confidence word is a braille cursor line (a braille pin appears) around or around the line containing the low confidence word. As shown.
By configuring as in the second modification, a visually impaired person outputs a word with low confidence in word units as braille, so that the user can easily correct the word to a correct word, and the efficiency of editing work is improved.
In addition, when outputting the recognition result in this post-processing, if tactile and auditory are output together, it can be confirmed tactilely when a visually handicapped person hears it audibly. The efficiency of editing work is improved.
In the present embodiment, a word path is created from the candidates recognized by the character recognition unit 30 by using the language dictionary 55, and a character string (word) including a character with a low certainty factor is selected from the word path. Even if it is configured to search for an unknown word that is not registered as a word in the language dictionary 55 for the recognition result in the post-processing of the recognition result by the result output unit 50, the same effect can be obtained. Can do.
[0014]
<Example by computer>
Furthermore, the present invention is not limited only to the above-described embodiment. For example, the character recognition device 100 shown in FIG. 1 can also be realized by a computer device 200 having a hardware configuration as shown in FIG.
That is, the computer device 200 includes a keyboard, a mouse, a touch panel, a scanner, a Braille input device, and the like. The input device 1 used for inputting information, various output information, information input from the input device 1, and the like. Is displayed, or is output to a printer, a Braille output device, or the like, a CPU (Central Processing Unit) 3 that operates various programs, and the program itself is held. A memory 4 that stores information temporarily created when executed, an original image storage unit 25, a recognition dictionary 35, a recognition result storage unit 45, a language dictionary 55, and a program or program of the character recognition device of the present invention A storage device 5 that holds temporary information and the like at the time of execution and a recording medium that stores programs, data, etc. Are connected to each other by a bus 8 and a medium connection device 6 used for connecting to the network 9 and a network connection device 7 which is an interface for connecting to the network 9. Yes.
The network 9 is a transmission path for connecting the computer apparatus 200 and another computer apparatus 200, and is generally realized by a cable, and TCP / IP is used as a communication protocol. However, the transmission path is not limited to cables, and may be any of wireless, wired, and broadcast waves as long as the communication protocol between them is the same. For example, LAN (Local Area Network), WAN (Wide Area Network) Internet, analog telephone network, digital telephone network (ISDN: Integral Service Digital Network), PHS (Personal Handyphone System), mobile phone network, satellite communication network, and the like can be used.
[0015]
In such a configuration of the computer apparatus 200, each function constituting the character recognition apparatus shown in FIG. 1 is programmed and written in a recording medium such as a CD-ROM in advance, and this CD-ROM is stored in the CD of each site. The above-mentioned program is mounted on a computer device 200 equipped with a medium driving device 6 such as a ROM drive, stored in the memory 4 or the storage device 5 of each computer device 200, and executed as described above. Functions similar to those in the embodiment can be realized.
As the recording medium, a semiconductor medium (eg, ROM, IC memory card, etc.), an optical medium (eg, DVD, MO, MD, CD-R, etc.), a magnetic medium (eg, magnetic tape, flexible disk, etc.), etc. Either may be sufficient.
Further, not only the functions of the above-described embodiments are realized by executing a program loaded into the memory 4 of the computer device 200, but an operating system or the like can execute a part of actual processing or The case where all the functions are performed and the functions of the above-described embodiments are realized by the processing is also included.
When the program for realizing the above-described embodiment is a semiconductor recording medium such as a ROM, the program is loaded directly into the memory 4 and executed instead of the medium driving device 6.
[0016]
<Operation in Network Environment of the Present Invention>
FIG. 5 shows a configuration of an embodiment in which the present invention is operated by connecting to a wired or wireless communication network.
For example, a server 210 that holds a character recognition program and a terminal 220 that is used by a plurality of users are connected via the network 9.
In this case, the server 210 and the user terminal 220 are constituted by the general-purpose computer apparatus 200 shown in FIG.
The user logs in to the server 210 from the terminal 220 or inputs image data for character recognition, and requests the character recognition program of the server 210 to execute character recognition.
The character recognition program of the server 210 returns the character recognition result for the character area of the transmitted image data to the requesting terminal 220.
The user's terminal 220 outputs the recognition result or the original image data while comparing them, or performs post-processing.
This has the advantage that the latest character recognition program can always be used.
When the server 210 and the terminal 220 are connected via a wired or wireless communication network as shown in FIG. 5, a character recognition program for realizing the functions of the present invention is stored in a storage device such as a magnetic disk of the server 210. The terminal 220 can be distributed in the form of download or the like.
Furthermore, a character recognition program that implements the functions of the present invention may be provided by distribution through a medium or broadcast wave.
[0017]
【The invention's effect】
As described above, according to the present invention, it is possible to confirm the certainty of the recognition result regardless of the presence or absence of obstacles such as vision and hearing.
In addition, since a word that is not certain is extracted in units of words, it is easy to correct the word, and the work efficiency of post-processing is improved.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a functional configuration of an embodiment.
FIG. 2 is a diagram illustrating a character recognition result of a conventional example and the present invention.
FIG. 3 is a flowchart illustrating a processing procedure according to the embodiment.
FIG. 4 is a diagram showing a computer device for operating the character recognition device of the present invention.
FIG. 5 is a diagram for explaining an operation example in a network environment according to the present invention.
[Explanation of symbols]
1 input device, 2 output device, 3 CPU, 4 memory, 5 storage device, 6 medium drive device, 7 network connection device, 8 bus, 9 network, 10 control unit, 20 image input unit, 25 original image storage unit, 30 Character recognition unit, 35 recognition dictionary, 40 confidence calculation unit, 45 recognition result storage unit, 50 result output unit, 55 language dictionary, 60 low confidence extraction unit, 100 character recognition device, 200 computer device, 210 server, 220 terminal

Claims

An image input unit for inputting a document image, and a character recognizing character recognition unit for the character region of the image input by the image input unit,
A confidence factor calculation unit that calculates a confidence of the character recognition result for each character obtained from the character recognition unit,
A word extraction unit for extracting words from the characters recognized by the character recognition unit;
When there is an adjacent single character that is not two or more words before and after the character for a character for which a predetermined certainty factor cannot be obtained from the recognition result for each character calculated by the certainty factor calculation unit A low certainty factor extraction unit that extracts a character that cannot obtain the predetermined certainty factor and a single character adjacent to the character as a single character string;
A character recognition apparatus, comprising: a result output unit that outputs the character string extracted by the low certainty factor extraction unit as distinguished from other words with respect to a recognition result by the character recognition unit.

The word extraction unit searches for a word including a character recognized by the character recognition unit using a word dictionary held in a predetermined storage unit, and stores the searched word and the part of speech information of the searched word. 2. The character recognition apparatus according to claim 1, wherein the character recognition device extracts the word having the highest evaluation value from the searched word by taking out the calculated word and calculating the evaluation value of the searched word based on the part of speech information. .

The character recognition device according to claim 1 or 2,
The result output unit, when the word containing the predetermined confidence is not obtained character is modified, replaced by the word after correction words with the predetermined confidence is not obtained character, following the predetermined A character recognition apparatus characterized in that it moves to correction of a word including a character for which the certainty cannot be obtained .

A character recognition method executed by a character recognition device, comprising: an image input step for inputting a document image; a character recognition step for character recognition with respect to a character region of an image input in the image input step; and the character recognition A certainty factor calculating step for calculating the certainty factor of the character recognition result for each character obtained from the step, a word extracting step for extracting a word from the characters recognized by the character recognition step, and the certainty factor calculating step If there is an adjacent single character that is not a word of two or more characters before and after the character for which the predetermined certainty factor cannot be obtained from the recognition result for the character, the predetermined certainty factor cannot be obtained. About the recognition result by the low reliability extraction process which extracts a character and the single character adjacent to the character connected as one character string, and the recognition result by the character recognition process, it was extracted by the low reliability extraction process Character recognition method characterized by comprising the serial strings from a result outputting step of outputting separately from other words.

The word extraction step searches for a word including the character recognized by the character recognition step using a word dictionary held in a predetermined storage unit, and obtains the searched word and the part of speech information of the searched word. 5. The method according to claim 4, further comprising: extracting, calculating an evaluation value of the searched word based on the part of speech information, and extracting a word having the highest evaluation value from the searched words. Character recognition method.

The program for making a computer perform each process of the character recognition method of Claim 4 or 5.

A computer-readable recording medium on which the program according to claim 6 is recorded.