JPH04163681A

JPH04163681A - Information processor and character recognizing device

Info

Publication number: JPH04163681A
Application number: JP2291059A
Authority: JP
Inventors: Hiroshi Yoshida; 浩史吉田; Toru Ishikawa; 石河　融; Koichi Higuchi; 浩一樋口; Yoshiyuki Yamashita; 山下　義征
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-10-29
Filing date: 1990-10-29
Publication date: 1992-06-09
Anticipated expiration: 2014-06-07
Also published as: JP2902097B2

Abstract

PURPOSE:To output each character block always in correct connection order by deciding connection relation between the character blocks, and outputting each character block from a document picture constituted of plural character blocks in this decided order. CONSTITUTION:An information processor 200 is provided with a character block extracting part 210, a character segmenting part 220, a character recognizing part 230, a word segmenting part 240, a feature word selecting part 250, a feature word storage part 260, a feature word collating part 270, and a character block selecting part 280. Then, the feature word of every character block extracted from the document picture is selected, and further, the feature word of each character block is compared and collated with each other, and the connection relation between the respective character blocks is decided on the basis of this result, and the character block is outputted in the connection order based on this. Namely, each character block is outputted in the order of the connection relation added with the contents of a sentence. Thus, each block is outputted always in the correct connection order.

Description

【発明の詳細な説明】（産業上の利用分野）この発明は、複数の文字ブロックで構成された文書画像
から各文字ブロックを、これら間の接続間係を判定して
この順に出力することか出来る情報処理装置と、この情
報処理袋Ｍを具える文字認識装置とに関するものである
。[Detailed Description of the Invention] (Field of Industrial Application) This invention is a method of outputting each character block from a document image composed of a plurality of character blocks in this order by determining the connections between them. The present invention relates to an information processing device that can be used as an information processing device, and a character recognition device that includes this information processing bag M.

（従来の技ｍ）機械か媒体上の文書画像から原文書か持つ情報を正確に
得ることか出来れば、種々の機能を持つ情報処理装置、
例えば文字認識装置、認識した文字を音声により出力す
るような装置、認識した文字を翻訳する装置等の構築か
可能になる。(Conventional technique m) If it is possible to accurately obtain the information contained in the original document from the document image on a machine or medium, an information processing device with various functions,
For example, it becomes possible to construct a character recognition device, a device that outputs recognized characters as voice, a device that translates recognized characters, etc.

しかし、文書画像には、例えば第２図に示すように、罫
線或いは空白（図示例は空白）と、文字列か茫に配列さ
れ上述の空白により周囲と隔れた複数の文字ブロックｌ
ｌａ〜１１ｄとで構成された文書画像１１かあり、この
ような文書画像１１の認識処理に当っては、書式既知の
文字画像の処理とは異なり、文字ブロックを抽出し各文
字ブロック周の接続関係を判定するという特別の処置か
必要になる。However, as shown in FIG. 2, for example, a document image contains ruled lines or blank spaces (the illustrated example is blank), and character strings or multiple character blocks arranged in grids and separated from the surroundings by the above-mentioned blank spaces.
There is a document image 11 composed of letters 1a to 11d, and in recognition processing of such document image 11, unlike processing of character images of known formats, character blocks are extracted and connections between each character block are determined. Special measures are required to determine the relationship.

そこで、これを行い得る従来装置として、例えば文献■
（電子情報通信学会技術研究報告ＰＲＵ８６−３３）に
開示された書式未知文書の自動読取装置かあった。Therefore, as a conventional device that can perform this, for example, the literature
(The Institute of Electronics, Information and Communication Engineers Technical Research Report PRU86-33) disclosed an automatic reading device for unknown format documents.

この文献に開示されている装置ては、書式未知の文書よ
り、周辺分布特徴、線密度特徴、外接矩形特徴を用いて
、見出し文字ブロックや本文文字ブロック等の文書構成
要素か抽出され、ざらにこれら各文字ブロックから各文
字か抽出され、さらに各文字に対して特徴抽出か行なわ
れで各文字が認識される。The device disclosed in this document uses marginal distribution features, line density features, and circumscribed rectangle features to extract document constituent elements such as heading character blocks and body character blocks from a document with an unknown format. Each character is extracted from each of these character blocks, and feature extraction is performed on each character to recognize each character.

また、この際、各文字ブロックの接続順序は、例えば文
献■（電子情報通信学会論文誌、（１９８６，３）、Ｖ
Ｏｌ、Ｊ６９−Ｄ、Ｎｏ、３゜ｐ、４０４）に開示され
ているように、下記（１）式に従い各文字ブロックの文
書中における順序評価値りが算出され、このｈが小さい
順とされるのが一般的であった。In addition, at this time, the connection order of each character block can be determined, for example, in the literature (IEICE Transactions, (1986, 3),
As disclosed in J69-D, No. 3゜p, 404), the order evaluation value of each character block in the document is calculated according to the following formula (1), and the h is ranked in descending order. It was common to

ｈ＝ｘ、＋ε・Ｙｃ　　　＝（１）ここで、ＸＣは各文字ブロックの左上隅点の×座標、Ｙ
ｃは各文字ブロックの左上隅点の’１標、εは０．１程
度の定数をそれぞれ示す。h=x, +ε・Yc = (1) Here, XC is the x coordinate of the upper left corner point of each character block, Y
c represents the '1 mark of the upper left corner point of each character block, and ε represents a constant of approximately 0.1.

（発明が解決しようとする課題）しかしながら、上述した文献に開示された自動読取装置
では、第２図に示した文書画像においで各ブロックｌｌ
ａ〜ｌｉｄの正規な接続関係か１１ａ→１１Ｃ→１１ｂ
→１１ｄというような場合であっても、上記位置評価値
りに従うため１１ａ→１１ｂ→１１ｃ→１１ｄという接
続間係であると判断されてしまい、各文字ブロックの正
確な順序判定をすることか出来ないという問題点かあっ
た。(Problem to be Solved by the Invention) However, in the automatic reading device disclosed in the above-mentioned document, each block in the document image shown in FIG.
Is the connection relationship between a and lid normal? 11a → 11C → 11b
→11d, it is determined that the connection is 11a → 11b → 11c → 11d because it follows the position evaluation value above, and it is not possible to accurately determine the order of each character block. There was a problem that it wasn't there.

従って、この自動読取装Ｍを文字認識装置に適用した場
合、正しい順序で文字認識を行うことが出来ないため正
確な認識結果を得ることが出来ないことになる。また、
これを防止するためには、オペレータによる修正作業か
必要となるか、この作業は頻雑であるため、文字認識装
置の処理′ｉ能の著しい低下、揉作牲の悪化につながる
ので好ましいことではない。Therefore, when this automatic reading device M is applied to a character recognition device, accurate recognition results cannot be obtained because character recognition cannot be performed in the correct order. Also,
In order to prevent this, it is necessary for the operator to perform correction work, but this work is not desirable because it is frequent and leads to a significant decrease in the processing performance of the character recognition device and worsening of errors. do not have.

この発明はこのような点に鑑みなされたものであり、従
ってこの発明の目的は、文字ブロック間の接続関係を正
確に判定することか出来る情報処理装置及びこれを具え
る文字認識装置を提供することにある。The present invention has been made in view of the above points, and therefore, an object of the present invention is to provide an information processing device that can accurately determine the connection relationship between character blocks, and a character recognition device equipped with the same. There is a particular thing.

（課題を解決するための手段）この目的の達成を図るため、この出願の第一発明によれ
ば、文書画像より文字ブロックを抽出するための文字ブ
ロック抽出部を具える情報処理装置において、抽出された文字ブロックより文字パタンを切り出す文字
切り出し部と、前述の切り出された文字パタンを認識する文字認識部と
、前述の認識された文字パタン列より単語を切り出す単語
切り出し部と、文字ブロック毎に、前述の切り出された単語の中から特
徴となる単語を選択する特徴単語選択部と、前述の選択された特徴単語を文字ブロック毎に記憶する
ための特徴単語記憶部と、各ブロック別の特徴単語を各文字ブロック問相互で比較
・照合する特徴単語照合部と、前記特徴単語の比較・照
合結果に基づいて各文字ブロック間の接続関係を判定し
、該判定結果に基づいて文字ブロックを出力する文字ブ
ロック選択部とを具えたことを特徴とする。(Means for Solving the Problem) In order to achieve this object, according to the first invention of this application, in an information processing apparatus comprising a character block extracting section for extracting character blocks from a document image, A character extraction unit that extracts a character pattern from the character block that has been identified, a character recognition unit that recognizes the aforementioned character pattern, and a word extraction unit that extracts a word from the aforementioned recognized character pattern string, for each character block. , a feature word selection section that selects a feature word from the aforementioned cut out words, a feature word storage section that stores the aforementioned selected feature words for each character block, and features for each block. A feature word matching unit that compares and matches words between each character block, and determines the connection relationship between each character block based on the comparison and matching results of the feature words, and outputs a character block based on the determination result. The present invention is characterized in that it includes a character block selection section.

なお、この第一発明の実施に当り、前述の特徴単語選択
部は、単語を構成する文字数に基づいて特徴単語を選択
する構成とするのか好適である。In carrying out the first invention, it is preferable that the feature word selection section described above selects feature words based on the number of characters that make up the word.

または、文字ブロック中の単語の中で、予め登録しでお
いた一般単語と一致しない単語を、特徴単語として選択
する構成とするのか好適である。Alternatively, it is preferable to select a word among words in a character block that does not match a general word registered in advance as a characteristic word.

ざらにこの第一発明の実施に当り、前述の特徴単語照合
部は、前述の特徴単語の比較・照合において着目文字ブ
ロックと他の文字ブロックとに共通に現われる単語数を
計数する構成とするのか好適である。Roughly speaking, in implementing the first invention, is the feature word matching unit configured to count the number of words that appear in common between the character block of interest and other character blocks in the comparison and matching of the feature words? suitable.

さらにこの発明の実施に当り、前述の文字ブロック選択
部は、前述の共通単語数か予め定めた値以上でありかつ
最も大きい値を示した文字ブロックを、着目文字ブロッ
クと接続される他の文字ブロックと判定する構成とする
のか好適である。Further, in carrying out the present invention, the character block selection section described above selects a character block having the maximum number of common words or more than a predetermined value, and selects a character block that is the same as the number of common words or other characters connected to the character block of interest. It is preferable to adopt a configuration in which it is determined that the block is a block.

また、この出願の死二発明の文字認識装置によれば、第
一発明の情報処理装置と、該情報処理装置から出力され
る文字ブロックの文字を認識する文字認識段とを具えた
ことを特徴とする。Further, the character recognition device of the second invention of this application is characterized by comprising the information processing device of the first invention and a character recognition stage that recognizes the characters of the character block output from the information processing device. shall be.

（作用）この出願の第一発明の情報処理装置によれば、文書画像
から抽出された各文字ブロック毎の特徴単語を選択し、
ざらに各文字ブロックの特徴単語同士を比較・照合して
この結果に基づいて各文字ブロックの接続関係を判定し
、これに基づいた接続順て文字ブロックか出力される。(Operation) According to the information processing device of the first invention of this application, feature words are selected for each character block extracted from a document image,
The characteristic words of each character block are roughly compared and collated, and based on this result, the connection relationship of each character block is determined, and the character blocks are output in the order of connection based on this.

つまり、各文字ブロックは文章の内容が加味された接続
関係で出力される。従って、各文字ブロックのレイアウ
トに規則性の無い文書から文字ブロックを抽出し出力す
る場合でも、単に各文字ブロックの文書画像中の位曹座
標に基づいて各文字ブロックを出力していた従来製画に
比べ、各ブロックは正しい順序で出力される。In other words, each character block is output in a connection relationship that takes into account the content of the text. Therefore, even when extracting and outputting character blocks from a document with no regularity in the layout of each character block, conventional methods that output each character block simply based on the coordinates of each character block in the document image Compared to , each block is output in the correct order.

また、この第二発明の文字認識装置によれば、上述のよ
うに正しい順序で出力される文字ブロックについて文字
認識かすされる。このため、レイアウトに規則性の無い
文書の文字認識を文書に即しで行うことが出来る。Further, according to the character recognition device of the second invention, character blocks that are output in the correct order as described above are recognized. Therefore, character recognition of a document with irregular layout can be performed based on the document.

（実施例）以下、この出願の第一発明の情報処理装置及び第二発明
の文字認識装置の実施例について、第一発明の情報処理
袋Ｍ％適用した文字認識装置の例により説明する。(Example) Examples of the information processing device of the first invention and the character recognition device of the second invention of this application will be described below using an example of a character recognition device to which the information processing bag M% of the first invention is applied.

一＋＠Ｊ７：　　　の　　の８日第１図は、実施例の情報処理装置２００を組み込んた実
施例の文字認識装置１００の構成を概略的に示したブロ
ック図である。なあ、この文字認識部Ｍ１０ｏの説明を
容易とするために、以下の説明においては認識対象の文
書画像例を挙げで説明する。ここでは、空白及び文字ブ
ロックｌｌａ〜１１ｄて構成される第２図に示すような
文書画像１１を、認識対象の文書画像例とする。1+@J7: No. 8th FIG. 1 is a block diagram schematically showing the configuration of a character recognition device 100 of an embodiment incorporating an information processing device 200 of the embodiment. In order to facilitate the explanation of this character recognition unit M10o, the following explanation will be given using an example of a document image to be recognized. Here, a document image 11 as shown in FIG. 2, which is composed of blank spaces and character blocks lla to 11d, is taken as an example of a document image to be recognized.

この文字認識装置１００は、光電変換部１１０、実施例
の情報処理装置２００、文字認識段１２０及び出力端子
１３０を具える。また、この情報処理装置１１！２００
は、文字ブロック抽出部２１０、文字切り出し部２２０
、文字認識部２３０、単語切り出し部２４ｏ、特徴単語
選択部２５０、特徴単語記憶部２６０、特徴単語照合部
２７０及び文字ブロック選択部２８０！具え、また、文
字認識段１２０は、文字切り出し部１２２及び文字認識
部１２４を具える。This character recognition device 100 includes a photoelectric conversion section 110, an information processing device 200 of the embodiment, a character recognition stage 120, and an output terminal 130. In addition, this information processing device 11!200
are a character block extraction unit 210 and a character extraction unit 220.
, character recognition unit 230, word extraction unit 24o, feature word selection unit 250, feature word storage unit 260, feature word matching unit 270, and character block selection unit 280! The character recognition stage 120 also includes a character segmentation section 122 and a character recognition section 124.

次に、各構成成分の詳細について説明する。Next, details of each component will be explained.

光電変換部１１０は、例えば従来公知のイメージセンサ
等で構成出来、文字・図形等（以下、文字と称する。）
か記！２された例えば第２図に示すような文書画像１１
からの光信号Ｓ％、例えば文字線部を黒ヒツト及び背景
部を白ビットで表現した２値のティジタル信号（これを
入力画像テークという。）に変換し、この入力画像デー
タを文字ブロック抽出部２１０に出力するものである。The photoelectric conversion unit 110 can be configured with, for example, a conventionally known image sensor, etc., and can be configured with characters, figures, etc. (hereinafter referred to as characters).
A record! For example, a document image 11 as shown in FIG.
For example, the optical signal S% from 210.

文字ブロック抽出部２］０は、光電変換部７１０より入
力された入力画像データより文書画像の文字ブロックに
対応するデータ（これを文字ブロックテークと称する。The character block extraction unit 2 ] 0 extracts data corresponding to character blocks of a document image from the input image data input from the photoelectric conversion unit 710 (this is referred to as character block extraction).

）を抽出しこれを文字ブロック選択部２８０及び文字切
り出し部２２０に夫々出力するものである。例えば第２
図に示す文書画像１１の例で説明すれば、文字列が規則
正しく完に配列し空白等により周囲と隔れでいる領域１
１ａ〜］１ｄ（これらの領域各々が文字ブロックになる
。）に対応する入力画像データ部分を抽出し、文字ブロ
ック選択部２８０及び文字切り出し部２２０に夫々出力
する。) is extracted and outputted to the character block selection section 280 and character cutting section 220, respectively. For example, the second
To explain using the example of the document image 11 shown in the figure, an area 1 in which character strings are regularly and completely arranged and separated from the surroundings by spaces etc.
1a to ]1d (each of these areas becomes a character block) is extracted and output to the character block selection unit 280 and character cutting unit 220, respectively.

ここで、文字ブロックデータの抽出は従来公知の種々の
方法で行うことか出来るか、この実施例では以下に説明
するような方法で行う。Here, character block data can be extracted by various conventionally known methods, but in this embodiment, it is performed by the method described below.

先ず、入力画像データｔｍ方向（第２図中ｙて示す方向
に相当する方向）に走査し周辺分布を作成する。次に、
この周辺分布において、黒ヒツト数かＯから値以上に変
化する走査線より値以上からＯに変化する直前の走査線
までで規定される領域を列方向の１ブロツクとして検出
する。次に、この列方向の１ブロツクに対応する入力画
像データ部分を、今度は横方向（第２図中×で示す方向
に相当する方向）に走査し横方向の周辺分布を作成する
。そして、この横方向の周辺分布において、黒ビット数
かＯから値以上に変化する走査線よつ値以上からＯに変
化する直前の走査線までで規定される領域の幅か閾値Ｔ
Ｈより大きかったならばこの領域と前述の列方向のブロ
ックとの重複部分のデータを文字ブロックデータとして
抽出する。なお、閾値ＴＨは、例えば、文書中の汚れ等
により文字ブロックか誤って決定されないようにするた
めの値であり、この実施例の場合は１４０としてある。First, input image data is scanned in the tm direction (corresponding to the direction indicated by y in FIG. 2) to create a peripheral distribution. next,
In this peripheral distribution, an area defined by the number of black hits from the scanning line where the number changes from O to the value or more to the scanning line immediately before the number changes from the value or more to O is detected as one block in the column direction. Next, the input image data portion corresponding to one block in the column direction is scanned in the horizontal direction (corresponding to the direction indicated by the x in FIG. 2) to create a peripheral distribution in the horizontal direction. In this horizontal peripheral distribution, the width of the area defined by the number of black bits or the scanning line that changes from O to the value or more and the scanning line immediately before changing from the value or more to O or the threshold T
If it is larger than H, the data of the overlapping portion between this area and the aforementioned block in the column direction is extracted as character block data. Note that the threshold value TH is a value to prevent erroneous determination of a character block due to, for example, dirt in the document, and is set to 140 in this embodiment.

文字切り出し部２２０は、文字ブロック抽出部２１０よ
り入力された文字ブロックデータより文字行を切り出し
、さらにこの文字行より１文字ずつの文字パタンを切り
出し、さらにこれら文字パタンを順次文字認識部２３０
に出力するものである。ここで、文字行の切り出し及び
この文字行からの文字パタンの切り出しは、例えば、入
力画像データから文字ブロックデータを周辺分布に基づ
いて抽出しを上述の手順と同様な手順で行うことか出来
る。The character cutting unit 220 cuts out a character line from the character block data input from the character block extracting unit 210, further cuts out character patterns of each character from this character line, and then sequentially passes these character patterns to the character recognition unit 230.
This is what is output to. Here, cutting out a character line and cutting out a character pattern from this character line can be performed by, for example, extracting character block data from input image data based on the peripheral distribution and performing the same procedure as described above.

文字認識部２３０は、文字切り出し部２２０より入力さ
れた文字パタンの特徴抽出を行いざらに特徴マトリクス
を作成し、さらに、この特徴マトリクスと、予め用意さ
れた図示しない辞書マド１ノクスとの類似度を算出し、
類似度の最も高い辞書マド１ノクスの文字名を当該文字
パタンの認識結果として単語切り出し部２４０に出力す
るものである。ここで、上述の特徴抽出は従来公知の＋
！々の方法で行うことか出来るか、この実施例の場合（
よ以下に説明するような方法で行う。The character recognition unit 230 extracts the features of the character pattern input from the character cutting unit 220, roughly creates a feature matrix, and further calculates the similarity between this feature matrix and a dictionary Mad1 Nox (not shown) prepared in advance. Calculate,
The character name of the dictionary Mad 1 Nox with the highest degree of similarity is output to the word extraction unit 240 as the recognition result of the character pattern. Here, the above-mentioned feature extraction is performed using the conventionally known +
! In the case of this example (
This is done in the manner described below.

先ず、文字パタンについてその文字線部に外接する例え
は方形の枠ｔＳ出し、これを文字枠とする。さらに当該
文字パタンの線幅Ｗを例えば下記（２）式で示すような
周知の近似式を用いて算出する。First, for a character pattern, a rectangular frame tS, for example, circumscribing the character line portion is drawn, and this is used as a character frame. Furthermore, the line width W of the character pattern is calculated using, for example, a well-known approximation formula as shown in equation (2) below.

Ｗ＝１／　（１−（Ｑ／Ａ））・・・（２）但し、（２
）式において、Ｑは文字パタンを構成する各点をこれら
の点か（２Ｘ２）個づつの節回て見られる窓で走査した
ときの、この窓内の点全てか黒ビットとなる窓の数であ
り、またへは文字枠内の黒ビットの総数である。W=1/ (1-(Q/A))...(2) However, (2
) In the formula, Q is the number of windows in which all the points within this window are black bits when each point constituting the character pattern is scanned with a window that can be seen in (2 x 2) nodes at a time. and is the total number of black bits in the character frame.

さらに、この文字パタンを複数の方向に走査を行って各
走査列毎の黒ビットの連続個数を検出し、この黒ビット
の連続個数と、上述の線幅Ｗとに基づいて上述の複数の
方向毎に対応したサブパターンをそれぞれ抽出する。そ
して、この文字パタンの上述の文字枠内を各サブパタン
についで（ＮＸＭ）個の領域（Ｍ、Ｎは定数）に分割し
、さらに各分割領域内の文字線長を表わす特徴量を各分
割領域毎に計算し、この特徴量を文字枠の大きさで正規
化して特徴マトリクスを得る。この実施例では、特徴量
を（ｄＸ＋ｄＹ）／２なる値で除することによって正規
化する。ここで、ｄ×は文字枠の水平方向の長さ、ｄＹ
は文字枠の垂直方向の長さである。Furthermore, this character pattern is scanned in a plurality of directions to detect the number of consecutive black bits in each scanning line, and based on this number of consecutive black bits and the line width W described above, the character pattern is scanned in a plurality of directions. The corresponding sub-patterns are extracted for each. Then, the above-mentioned character frame of this character pattern is divided into (NXM) regions (M and N are constants) for each sub-pattern, and the feature amount representing the character line length in each divided region is calculated for each divided region. This feature quantity is normalized by the size of the character frame to obtain a feature matrix. In this embodiment, the feature amount is normalized by dividing it by a value of (dX+dY)/2. Here, d× is the horizontal length of the character frame, dY
is the vertical length of the character frame.

また、上述の類似度は、この実施例の場合以下に示す（
３）式に基いて求める。In addition, the above-mentioned similarity is shown below in this example (
3) Calculate based on the formula.

但し、（３）式において、日は類似度、ｆ、は被認識文
字の文字パタンの特徴マトリクスの要素値、９Ｉは辞書
マトリクスの要素値、ＮＸＭは被認識文字の特徴マトリ
クス及び辞書マトリクスの次元数をそれぞれ示す。However, in equation (3), day is the similarity, f is the element value of the feature matrix of the character pattern of the character to be recognized, 9I is the element value of the dictionary matrix, and NXM is the dimension of the feature matrix of the character to be recognized and the dictionary matrix. Indicate the number of each.

単語切り出し部２４０は、文字切り出し部より入力され
た文字パタン列より単語（単語文字列と称する場合もあ
る。）を切り出しこれを特徴単語選択部に出力するもの
である。ここで、単語の切り出しは、従来公知の種々の
方法で行うことか出来るか、この実施例の場合は以下に
説明するような方法で行う。The word cutting section 240 cuts out a word (sometimes referred to as a word character string) from the character pattern string inputted from the character cutting section and outputs it to the characteristic word selection section. Here, words can be cut out using various conventionally known methods, and in the case of this embodiment, the following method is used.

先ず、各文字パタンの行方向（第２図中のＸで示す方向
に相当する方向）にあける始点座標及び終点座標より単
語領域を抽出する。具体的には、当該行の１番目の文字
に対応する文字パタンか下記の（４）式を満足する場合
には当該文字を単語の開始文字であると判断し、この文
字から次の単語の開始文字直前の文字までを単語領域と
する。First, a word area is extracted from the start point coordinates and end point coordinates of each character pattern in the row direction (direction corresponding to the direction indicated by X in FIG. 2). Specifically, if the character pattern corresponding to the first character of the line satisfies the following formula (4), that character is determined to be the start character of a word, and the next word starts from this character. The word area extends up to the character immediately before the start character.

Ｘ　−ｉ　　Ｘ　−＋　−＋　＞■　　　・・・（４）
Ｘ　！Ｉｎ　Ｘ　ｓＪ　：文字パタン始点座標Ｘ＠ＩＴ
　Ｘ＠Ｊ・文字パタン終点座標１、ｊ：当該行中の文字
番号ｎ：当該行中の文字数次に、単語切り出し部２４０は、この単語領域内の各文
字パタンの、文字認識部２３０より入力された認識結果
文字名で構成される列を単語文字列（単語と称すること
もある。）として特徴単語選択部２５０に出力する。X −i X −+ −+ ＞■ ...(4)
X! In X sJ: Character pattern starting point coordinates X@IT
X@J Character pattern end point coordinates 1, j: Character number in the line n: Number of characters in the line Next, the word cutting unit 240 receives input from the character recognition unit 230 of each character pattern in this word area. A string made up of the recognized character names is output to the feature word selection unit 250 as a word character string (sometimes referred to as a word).

単語選択部２５０は、単語切り出し部２４０より入力さ
れた単語文字列より各文字ブロックの特徴となるような
単語文字列を選択しくこの選択された単語文字列を特徴
単語と称する。）、これを特徴単語記憶部２６０に出力
するものである。The word selection section 250 selects a word string that is characteristic of each character block from the word string inputted from the word cutting section 240, and the selected word string is referred to as a characteristic word. ), which is output to the feature word storage section 260.

特徴単語を選択する方法としては種々の方法か考えられ
るか、この実施例では以下に説明するように、選択対象
となっている単語文字列の文字列長に着目して特徴単語
を選択するものとする。Various methods can be considered for selecting characteristic words. In this example, as explained below, characteristic words are selected by focusing on the string length of the word string to be selected. shall be.

具体的には、単語文字列の文字列長さをβとすると、下
記（５）式の条件を満たす文字列長の単語文字列即ち文
字列長か予め定めた閾値に以上の単語文字列を当該文字
ブロックの特徴単語として選択する。但し、ｋはこの実
施例の場合６としてある。Specifically, if the character string length of a word character string is β, then a word character string with a character string length that satisfies the condition of equation (5) below, that is, a word character string whose character string length is equal to or larger than a predetermined threshold value. Select as the characteristic word of the character block. However, k is set to 6 in this embodiment.

β≧ｋ　　　・・・（５）特徴単語記憶部２６０は、特徴単語選択部２５０より入
力された特徴単語をこれか所属しでいた文字ブロック番
号と対応づけで記憶するものである。特徴単語記憶部２
６０は、従来公知のメモリにより構成出来る。なお、こ
の場合、説明の都合上、文字ブロック番号は、第２図中
に示した１１ａ〜１１ｄとする。β≧k (5) The feature word storage unit 260 stores the feature word input from the feature word selection unit 250 in association with the character block number to which it belongs. Feature word storage unit 2
60 can be constructed from a conventionally known memory. In this case, for convenience of explanation, the character block numbers are assumed to be 11a to 11d shown in FIG.

第２図に示した文書画像１１の各文字ブロック１７ａ〜
１１ｄ＠々からは、この実施例の場合、文字列長さβか
６以上の単語か特徴単語として特徴単語選択部２５０に
より夫々選択される。従って特徴単語記憶部２６０には
、文字ブロック別に第１表のように特徴単語か記憶され
る。なお、第１表においては特徴単語の一部のみ示しで
あるこ特徴単語照合部２７０は、特徴単語記憶部より各
文字ブロック別の特徴単語を読み込み各ブロック別特徴
単語を相互に比較・照合するものである。特にこの実施
例の特徴単語照合部２７０は、異なる文字ブロック中に
共通の特徴単語が存在するか否かを検定すると共に着目
文字ブロックと他の各文字ブロックとの間においで共通
の特徴単語数を各地の文字ブロック別に計数する構成と
してある。Each character block 17a~ of the document image 11 shown in FIG.
In this embodiment, from 11d@, words with character string length β or 6 or more are selected by the characteristic word selection unit 250 as characteristic words. Therefore, characteristic words are stored in the characteristic word storage section 260 for each character block as shown in Table 1. In Table 1, only some of the characteristic words are shown.The characteristic word matching unit 270 reads the characteristic words of each character block from the characteristic word storage unit and compares and collates the characteristic words of each block with each other. It is. In particular, the feature word matching unit 270 of this embodiment tests whether or not a common feature word exists in different character blocks, and also tests the number of common feature words between the character block of interest and each other character block. The structure is such that the numbers are counted for each character block in each location.

この特徴単語照合部２７０の機能について、第１表に示
した各文字ブロック別の特徴単語に対する動作により説
明すれば次のようになる。The function of the feature word collation unit 270 will be explained below using operations for feature words for each character block shown in Table 1.

文字ブロックｌｌａ＠着目文字ブロックとした場合、特
徴単語照合部２７０は、文字ブロック１１ａと１１ｃと
の間においでｒｌａｓｅｒｓ４、ｒｗｅａｐｏｎｓ　Ｊ
及びｒ　１ｎｔｅｌｌｉｑｅｎｃｅ」という３つの単語
を共通の特徴単語として見出すと共に共通特徴単語数３
を計数する。同様に、文字ブロックｌｌａとｌｌｄとの
間においてｒ　ｐｒｅｓｉｄｅｎｔ　Ｊを共通の特徴単
語として見出すと共に共通特徴単語数１を計数する。ま
た、文字ブロック１１ａと１１ｂとの間においては、共
通な特徴単語は見出せないため共通特徴単語数は０とす
る。特徴単語照合部２７０は、文字ブロックｌｌｂ、ｌ
ｌｃについても、文字ブロック１１ａの場合と同様な手
順で共通な特徴単語数を計数する。しかし、第２図の文
書画像例の場合、文字プロツウ１１ｂと文字ブロック１
１ｃ、１１ｄとの間に共通特徴単語はなく、文字ブロッ
ク１１ｃと文字ブロックｌｌｄとの間にも共通特徴単語
はない、従って、第２図に示した文書画像の各文字ブロ
ックの共通特徴単語の計数結果は、以下の第２表のよう
になる。この計数結果は文字ブロック選択部２８０に出
力される。When character block lla@character block of interest is selected, the feature word matching unit 270 selects rlasers4, rweapons J between character blocks 11a and 11c.
and r 1ntelliqence" are found as common characteristic words, and the number of common characteristic words is 3.
Count. Similarly, r president J is found as a common feature word between character blocks lla and lld, and the number of common feature words is counted. Furthermore, since no common feature words can be found between character blocks 11a and 11b, the number of common feature words is set to 0. The feature word matching unit 270 compares character blocks llb, l
For lc, the number of common feature words is counted in the same manner as for the character block 11a. However, in the case of the document image example in FIG. 2, character block 11b and character block 1
1c and 11d, and there is no common feature word between character block 11c and character block lld. Therefore, the common feature words of each character block in the document image shown in FIG. The counting results are shown in Table 2 below. This counting result is output to character block selection section 280.

第２表文字ブロック選択部２８０は、特徴単語照合部２７０か
ら入力される比較・照合結果に基づいて各文字ブロック
間の接続関係を判定し、該判定結果に基づいて文字ブロ
ックを出力するものである。特にこの実施例の文字ブロ
ック選択部２８０は、特徴単語照合部２７０から入力さ
れた第２表に示すような共通特徴単語数と、文字ブロッ
ク抽出部２１０より入力されている各文字ブロックデー
タとに基づいて各文字ブロックの接続順序を決定しこの
順序で各文字ブロックデータを文字認識段１２０の文字
切り出し部１２２に出力する構成としてある。ここでの
接続順序決定手順は以下の通りである。The second table character block selection unit 280 determines the connection relationship between each character block based on the comparison/matching results input from the feature word matching unit 270, and outputs a character block based on the determination result. be. In particular, the character block selection unit 280 of this embodiment uses the number of common characteristic words as shown in Table 2 inputted from the characteristic word matching unit 270 and each character block data inputted from the character block extraction unit 210. Based on this, the connection order of each character block is determined, and each character block data is output to the character cutting section 122 of the character recognition stage 120 in this order. The connection order determination procedure here is as follows.

先ず、各文字ブロックの位置座標に基づき各文字ブロッ
クの仮の接続順序を決定する。この仮の接続順序の決定
は、この実施例の場合、下記（６）式に従い各文字ブロ
ックの文書画像における位置評価値Ｅ、（但し、１は文
字ブロック番号であり、この例では第２図中のｌｌａ〜
１１ｄとする。）ヲ求め、このＥｌか小さな順により行
うものとする。First, a temporary connection order of each character block is determined based on the position coordinates of each character block. In this embodiment, this provisional connection order is determined by the position evaluation value E of each character block in the document image according to equation (6) below (where 1 is the character block number, and in this example, Inside lla~
11d. ), and perform the calculation in descending order of El.

Ｅ＋　＝ｙ□＋Ｆ−ｘ、、　　　・・・（６）但し、（
６）式にあいで、Ｘ、ｌは当該文字ブロックに外接する
矩形の左上隅点のＸ座標であり、Ｙｅｔは前記左上隅点
のｙ座標であり、ｆは予め定めた定数でありこの実施例
の場合１０である。なお、ｘ、ｙ座標は茎２図中に示す
ｘ、ｙ座標系による。E+ =y□+F−x, ・・・(6) However, (
6) In the formula, X and l are the X coordinates of the upper left corner of the rectangle circumscribing the character block, Yet is the y coordinate of the upper left corner, and f is a predetermined constant. In the example case, it is 10. Note that the x and y coordinates are based on the x and y coordinate system shown in Figure 2 of the stem.

従って、第２図に示した文書画像の各文字ブロックの仮
の接続順序は、ｌｌａ→１１ｂ→１１Ｃ→１１ｄとなる
。Therefore, the tentative connection order of each character block in the document image shown in FIG. 2 is lla→11b→11C→11d.

次に、文字ブロック選択部２８０は、特徴単語照合部２
７０より出力されでいる共通特徴単語数に基づいて、上
記仮の接続順序を以下に説明するように修正する。Next, the character block selection unit 280 selects the feature word matching unit 2.
Based on the number of common feature words outputted from 70, the tentative connection order is modified as described below.

先ず、着目文字ブロックの他の文字ブロックに対する各
共通特徴単語数か予め定めた特定の＠Ｐ以上であるか否
かを検定し、値以上の値を示した他の文字ブロックは着
目文字ブロックに対し接続関係かあるものと判定する。First, it is tested whether the number of common feature words of the character block of interest with respect to other character blocks is greater than a predetermined specific @P, and other character blocks that have a value greater than the value are added to the character block of interest. However, it is determined that there is a connection relationship.

但し、共通特徴単語数が値以上となる他の文字ブロック
が複数存在した場合は、これらの中で共通特徴単語数が
最も多い文字ブロックを着目文字ブロックに対し最も接
続関係かあると判定するものとする。However, if there are multiple other character blocks with the number of common feature words equal to or greater than the value, the character block with the largest number of common feature words among these is determined to have the most connected relationship with the character block of interest. shall be.

次に、このように着目文字ブロックに対し接続関係かあ
ると判定された他の文字ブロックを、当該着目文字ブロ
ックの次の順序に位置する文字ブロックとして、仮の接
続順序の修正を行う。Next, the provisional connection order is corrected by setting the other character blocks that have been determined to have a connection relation to the character block of interest as the character blocks positioned next to the character block of interest.

なお、この実施例では、Ｐ＝２とする。また、上記仮の
接続順序において上位にある側の文字ブロックを着目文
字ブロックと順次に定め、上記判定及び修正を順に行う
ものとする。Note that in this embodiment, P=2. Further, the upper character block in the tentative connection order is sequentially determined as the character block of interest, and the above determination and correction are performed in order.

第２図に示した文書画像の例の場合は、第２表に示した
ように、文字ブロック１１ａと文字ブロック１１Ｃとの
間で共通特徴単語数か３であり値以上の値であるので、
文字ブロック選択部２８０は、文字ブロックｌｌａと文
字ブロック１１Ｃとは接続関係があるものと判定し、ざ
らに、１１ａ→１１ｂ→１１ｃ→１１ｄであった仮の接
続順序を、１１ａ→１１Ｃ→１１ｂ→１１ｄという接続
順序に修正する。そして、この文字ブロック選択部２８
０は、この修正された接続順序に従い各文字ブロックデ
ータを文字認識段１２０の文字切り出し部１２２に順次
に出力する。In the case of the document image example shown in FIG. 2, as shown in Table 2, the number of common characteristic words between the character block 11a and the character block 11C is 3, which is greater than the value.
The character block selection unit 280 determines that there is a connection relationship between the character block lla and the character block 11C, and roughly changes the tentative connection order of 11a → 11b → 11c → 11d to 11a → 11C → 11b → Correct the connection order to 11d. Then, this character block selection section 28
0 sequentially outputs each character block data to the character cutting section 122 of the character recognition stage 120 according to this modified connection order.

文字認識段１２０の文字切り出し部１２２は、文字ブロ
ック選択部２８０より入力された文字ブロックデータよ
り、文字行を切り出しさらにこの文字行より１文字づつ
の文字パタンを切り出しこのデータを文字認識部１２４
に順次に出力するものである。なお、この文字切り出し
部１２２は、例えば、情報処理装置１２００内の文字切
り出し部２２０と同様な構成とすれば良い。The character cutting section 122 of the character recognition stage 120 cuts out a character line from the character block data inputted from the character block selection section 280, cuts out a character pattern of each character from this character line, and sends this data to the character recognition section 124.
The data is output sequentially. Note that this character cutting section 122 may have a configuration similar to that of the character cutting section 220 in the information processing device 1200, for example.

文字認識段１２０の文字認識部１２４は、文字切り出し
部１２２より入力された文字パタンを認識しこれにより
得られた認識結果文字名を出力端子１３０より稜段の装
置Ｉ（例えばデイスプレィ、コンピュータ等）に出力す
るものである。なお、この文字認識部１２４は、例えば
、情報処理装置２００内の文学誌ｌ１部２３０と同様な
構成とすれば良い。The character recognition unit 124 of the character recognition stage 120 recognizes the character pattern input from the character cutting unit 122, and outputs the recognition result character name from the output terminal 130 to the device I (for example, a display, computer, etc.) at the edge stage. This is what is output to. Note that this character recognition section 124 may have a configuration similar to that of the literary magazine I1 section 230 in the information processing device 200, for example.

この実施例の文字認識装置１００によれば、情報処理装
置２００か文書画像から各文字ブロックを切り出しこれ
らを文字認識段１２０１こ出力する際に、各文字ブロッ
ク内に開運性のある単語か存在するか否かを判定し関連
性のある単語を含む文字ブロック同士が接続されるよう
な順序で各ブロックか順に出力される。このため、文字
認識段１２０は、文書の流れに即した認識が行えるので
、例えば、認識結果を音声合成に用いる場合や、翻訳す
る場合に、文字ブロック間て突然意味が通しること等か
起こらす、正確な音声合成や翻訳が行える。According to the character recognition device 100 of this embodiment, when the information processing device 200 cuts out each character block from a document image and outputs them to the character recognition stage 1201, it is determined that a word with good luck is present in each character block. Each block is output in order so that character blocks containing related words are connected to each other. For this reason, the character recognition stage 120 can perform recognition in accordance with the flow of the document, so that, for example, when using recognition results for speech synthesis or translation, meanings will not suddenly be understood between character blocks. This enables accurate speech synthesis and translation.

上述においては、この出願の情報処理装置及び文字認識
装置の実施例について説明したか、この発明は上述の実
施例のみに限られるものではなく以下に説明するような
種々の変更を加えることか出来る。In the above, embodiments of the information processing device and character recognition device of this application have been described, but the present invention is not limited to the above-mentioned embodiments and can be modified in various ways as described below. .

例えば、上述の実施例では、特徴単語選択部２５０は、
単語切り出し部２４０から切り出された単語文字列の文
字列長βに着目しβ≧になる条件式を満足する単語を特
徴単語として選択する構成としていたか、例えば、一般
文書中に搬出する一般単語を登録した一般単語辞書を予
め用意しでおき、対象文字ブロック内の単語のうち該一
般単語辞書に未登録の単語を選び出しこれを当該対象文
字ブロックの特徴単語とする構成としても良い。For example, in the embodiment described above, the feature word selection unit 250
Was there a configuration in which a word that satisfies the conditional expression β≧ by focusing on the character string length β of the word string extracted from the word extraction unit 240 is selected as a characteristic word?For example, a general word to be exported into a general document It is also possible to prepare in advance a general word dictionary in which are registered, and select words that are not registered in the general word dictionary from among the words in the target character block and use them as characteristic words of the target character block.

また、上述の実施例では、情報処理袋！２００に文字切
り出し部２２０及び文字認識部２３０を設け、文字認識
段１２０に文字切り出し部１２２及び文字認識部１２４
を設けた構成としていたか、文字切り出し部２２０及び
１２４を１つの文字切り出し部で共用し文字認識部２３
０及び１２４を１つの文字認識部で共用する構成として
も勿論良い。In addition, in the above embodiment, the information processing bag! 200 is provided with a character extraction section 220 and a character recognition section 230, and a character recognition stage 120 is provided with a character extraction section 122 and a character recognition section 124.
The character recognition unit 23 may be configured so that the character extraction units 220 and 124 are shared by one character extraction unit.
Of course, a configuration in which 0 and 124 are shared by one character recognition unit is also possible.

また、文字ブロックの抽出、文字行の切り出し、文字パ
タンの切り出し、また文字認識に当っての特徴マトリク
スの作成や類似度計算、単語切り出し等は、実施例で説
明した方法に限られるものではなく他の好適な方法を用
い得ることは明らかである。Furthermore, extraction of character blocks, extraction of character lines, extraction of character patterns, creation of feature matrices for character recognition, similarity calculation, word extraction, etc. are not limited to the methods explained in the examples. Obviously, other suitable methods may be used.

また、上述の実施例では第一発明の情報処理袋Ｍを文字
認識装置に適用していたが、この情報処理装置は、文字
ブロックの接続順序を決定しこの順序に従い文字ブロッ
クを順次に出力する画像処理装置、さらに、文字領域の
接続関係を抽出しで文書のレイアウト構造を抽出する装
置等にも適用出来る。Furthermore, in the above embodiment, the information processing bag M of the first invention was applied to a character recognition device, but this information processing device determines the connection order of character blocks and sequentially outputs the character blocks according to this order. The present invention can be applied to image processing devices, and also to devices that extract the layout structure of a document by extracting connection relationships between character regions.

（発明の効果）上述した説明からも明らかなように、この出願の第一発
明の情報処理装置によれば、文書画像から抽出された各
文字プロ・ｙり毎の特徴単語を選択し、ざらに各文字ブ
ロックの特徴単語同士を比較・照合してこの結果に基づ
いて各文字ブロックの接続関係を判定し、これに基づい
た接続順で文字ブロックを出力することが出来る。従っ
て、各文字ブロックのレイアウトに規則性の無い文書か
ら文字ブロックを抽出し出力する場合でも、単に各文字
ブロックの文書画像中の位Ｍ座標に基づいて各文字ブロ
ックを出力していた従来装置に比べ、各ブロックを正し
い接続順序で出力出来る。(Effects of the Invention) As is clear from the above description, the information processing device of the first invention of this application selects characteristic words for each character profile extracted from a document image, and The feature words of each character block are then compared and collated, and based on this result, the connection relationship of each character block is determined, and the character blocks can be output in the connection order based on this. Therefore, even when extracting and outputting character blocks from a document with no regularity in the layout of each character block, conventional devices that output each character block simply based on the position M coordinate of each character block in the document image can be used. By comparing, each block can be output in the correct connection order.

また、この第二発明の文字認識装置によれば、上述のよ
うに正しい順序で出力される文字ブロックについて文字
認識を行うことが出来る。このため、例えば、レイアウ
トに規則性の無い文書の文字認識を文書の流れに即して
行うことが出来るので、高性能な文字認識部Ｍを寅現出
来る。Further, according to the character recognition device of the second aspect of the invention, character recognition can be performed on character blocks output in the correct order as described above. Therefore, for example, character recognition of a document with irregular layout can be performed in accordance with the flow of the document, so that a high-performance character recognition section M can be realized.

[Brief explanation of the drawing]

第１図は、実施例の説明に供するブロック図であり、実
施例の情報処理袋Ｍを組み込んた実施例の文字認識装置
の構成を概略的に示したブロック図、第２図は、処理対象の文書画像例を示した図である。Ｓ・・・文書画像からの光信号１１・・・文書画像１１ａ〜ｌｌｄ・・・文字ブロック１００・・・文字認識装置、１１０・・・光電変換部１
２０・・・文字認識段、　　１２２・・・文字切り出し
部１２４・・・文字認識部、　　１３０・・・出力端子
２００・・・情報処理装置２１０・・・文字ブロック抽出部２２０・・・文字切り出し部２３０・・・文字認識部、　２４０・・・単語切り出し
部２５０・・・特徴単語選択部２６０・・・特徴単語記憶部２７０・・・特徴単語照合部２８０・・・文字ブロック選択部。特許出願人　　沖電気工業株式会社１１　　文書画像　　１１ａ〜ｌｉｄ　　文字ブロック
処理対象の文書画像例を示した図第２図FIG. 1 is a block diagram for explaining the embodiment, and is a block diagram schematically showing the configuration of the character recognition device of the embodiment incorporating the information processing bag M of the embodiment. FIG. 2 is a diagram showing an example of a document image. S... Optical signal from document image 11... Document images 11a to lld... Character block 100... Character recognition device, 110... Photoelectric conversion unit 1
20...Character recognition stage, 122...Character cutting unit 124...Character recognition unit, 130...Output terminal 200...Information processing device 210...Character block extraction unit 220...Character cutting out Units 230...Character recognition unit, 240...Word extraction unit 250...Feature word selection unit 260...Feature word storage unit 270...Feature word collation unit 280...Character block selection unit. Patent applicant: Oki Electric Industry Co., Ltd. 11 Document images 11a-lid Figure 2 shows an example of a document image subject to character block processing

Claims

[Claims]

(1) An information processing device comprising a character block extraction section for extracting character blocks from a document image, comprising: a character extraction section for cutting out a character pattern from the extracted character block; and a character for recognizing the cut out character pattern. a recognition unit; a word extraction unit that extracts a word from the recognized character pattern string; a feature word selection unit that selects a characteristic word from among the extracted words for each character block; a feature word storage section for storing feature words for each character block; a feature word matching section for comparing and matching the feature words for each block between each character block; 1. An information processing device comprising: a character block selection unit that determines the connection relationship between each character block by using the above method, and outputs a character block based on the determination result.

(2) The information processing apparatus according to claim 1, wherein the characteristic word selection section selects the characteristic word based on the number of characters composing the word.

(3) The information processing device according to claim 1, wherein the characteristic word matching unit counts the number of words that appear in common between the character block of interest and other character blocks in comparing and collating the characteristic words. Information processing equipment.

(4) In the information processing device according to claim 3, the character block selection unit connects the character block whose number of common words is greater than or equal to a predetermined value and exhibits the largest value to the character block of interest. An information processing device characterized in that the information processing device determines that the character block is another character block.

(5) In the information processing device according to claim 1, the feature word selection unit selects, as a feature word, a word that does not match a pre-registered general word among the words in the character block. An information processing device characterized by:

(6) Character recognition comprising the information processing device according to any one of claims 1 to 5 and a character recognition stage that recognizes characters in a character block output from the information processing device. Device.