JP2010003182A

JP2010003182A - Device and method for generating character string, character string generating program, and computer-readable recording medium with the character string generating program recorded thereon

Info

Publication number: JP2010003182A
Application number: JP2008162463A
Authority: JP
Inventors: Hitoshi Shimizu; 清水　　仁; Chikayuki Koyama; 至幸小山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2008-06-20
Filing date: 2008-06-20
Publication date: 2010-01-07
Anticipated expiration: 2028-06-20
Also published as: JP5095518B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a character string generating device which enhances accuracy in character recognition with less computational complexity. <P>SOLUTION: The character string generating device for character recognition includes: a computation part 4 which computes evaluation values, which represent the likelihood of characters to be recognized as a result of the recognition, for each candidate character on each prescribed conception, by using the similarity of the respective candidate characters, and the frequencies in which the respective candidate characters are used when one of the prescribed concepts is expressed or the degree of the relations between the candidate characters and one of the prescribed concepts; a character string connecting part 6 which connects the candidate characters which are selected one by one from a candidate character group corresponding to the characters to be recognized by using the evaluation values for each prescribed concept to generate candidate character strings, which are the candidates as a result of recognizing the character strings to be recognized, for each prescribed concept; and a determination part 7 which determines the recognition result character strings, as a result of recognizing the character strings to be recognized, from the candidate character strings generated by the character string-connecting part 6. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、文字列を含む画像から文字を認識する文字列生成装置及び方法、並びに、文字列生成プログラム及びその文字列生成プログラムを記録したコンピュータ読み取り可能な記録媒体に関する。 The present invention relates to a character string generation apparatus and method for recognizing characters from an image including a character string, a character string generation program, and a computer-readable recording medium on which the character string generation program is recorded.

現在、ＰＣ、ＰＤＡ、携帯電話等の機器の多くが、光学文字読取技術（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）（以下、「ＯＣＲ」という。）を搭載している。これらの機器に搭載されるＯＣＲには、読み取った画像からキーワードを取り出す機能が必要となる場合がある。例えば、認識対象の画像からキーワードを取り出して画像に付与し、そのキーワードを画像の検索に利用する場合や、認識結果として得られた文字列を分類する判定の根拠としてキーワードの有無を利用する場合等である。 Currently, many devices such as a PC, a PDA, and a mobile phone are equipped with an optical character reader (hereinafter referred to as “OCR”). OCR installed in these devices may require a function of extracting a keyword from a read image. For example, when a keyword is extracted from an image to be recognized and assigned to the image, and the keyword is used for image search, or when the presence / absence of a keyword is used as a basis for classification of character strings obtained as recognition results Etc.

画像からキーワードを取り出す技術として、例えば、特許文献１〜３に開示されたものがある。特許文献１〜３に開示されたキーワード抽出・検索装置においては、あらかじめ用意されたキーワードリストと、画像を認識して得られた文字列から取り出したキーワード候補とを比較し、その比較結果に基づいて画像に含まれるキーワードをキーワード候補の中から特定するものである。このキーワード抽出・検索装置は、文字認識結果から多数のキーワード候補を取り出し、キーワード候補の各々の信頼度（類似度）に基づいて、それらの中から使用すべきキーワードを決定する。
特開平７−３１９８８０号公報（平成７年１２月８日公開）特開２００４−５７６１号公報（平成１６年１月８日公開）特開２００４−３０６９５号公報（平成１６年１月２９日公開） As a technique for extracting a keyword from an image, for example, there are those disclosed in Patent Documents 1 to 3. In the keyword extraction / search apparatus disclosed in Patent Literatures 1 to 3, a keyword list prepared in advance is compared with keyword candidates extracted from a character string obtained by recognizing an image, and based on the comparison result Thus, the keyword included in the image is specified from the keyword candidates. This keyword extraction / retrieval apparatus extracts a large number of keyword candidates from the character recognition result, and determines a keyword to be used from them based on the reliability (similarity) of each keyword candidate.
Japanese Patent Laid-Open No. 7-31880 (published on December 8, 1995) JP 2004-5761 A (published January 8, 2004) JP 2004-30695 A (published January 29, 2004)

一般に、画像情報から認識される文字認識結果を構成する文字数Ｎは以下の式で表わされる。 In general, the number N of characters constituting the character recognition result recognized from the image information is expressed by the following equation.

Ｎ＝認識対象文字列の文字数×認識候補順位数
例えば、紙文書から読み出された画像情報中の認識対象文字列の文字数が３個、認識文字列の認識候補順位数が第１位〜第４位までの４個である場合、文字認識結果を構成する文字数Ｎは３×４＝１２個である。 N = number of characters in the recognition target character string × number of recognition candidate ranks For example, the number of recognition target character strings in the image information read from the paper document is three, and the recognition character string recognition rank numbers are from first to first. In the case of four characters up to the fourth place, the number N of characters constituting the character recognition result is 3 × 4 = 12.

しかしながら、画像情報から認識される文字列の中からキーワードの有無を調べる際、単に文字認識結果の認識候補順位毎に調べるだけでは十分とは言えない。認識対象文字列の中で隣接する文字同士においては、互いに異なる認識候補順位の文字列に含まれる文字間においてもキーワードが構成されないかを調べることが必要である。 However, when examining the presence or absence of a keyword from a character string recognized from image information, it is not sufficient to simply check for each recognition candidate rank of a character recognition result. It is necessary to check whether a keyword is formed between characters included in character strings having different recognition candidate ranks between adjacent characters in the recognition target character string.

したがって、上記の文字認識結果の例では、単純計算では、４^３＝６４個の文字の組み合わせについてキーワードの有無を調べなければならない。つまり、認識候補順位数をｍとし、認識対象文字列の文字数をｎとすれば、ｍ^ｎの組み合わせのすべてが調査対象となる。さらに、認識対象文字列の文字数未満の字数を持つキーワードも調査対象とする場合では、その組み合わせ数Ｓは以下の式で表わされることになる。 Therefore, in the example of the character recognition result described above, it is necessary to check whether or not there is a keyword for a combination of 4 ³ = 64 characters in a simple calculation. In other words, if the number of recognition candidate ranks is m and the number of characters in the recognition target character string is n, all combinations of m ⁿ are to be investigated. Further, when a keyword having a number of characters less than the number of characters in the recognition target character string is also to be investigated, the number of combinations S is expressed by the following expression.

例えば、認識対象文字列の文字数が３個、認識候補順位数が第１位〜第５位までの５個である場合、Ｓ＝２１５通りの組み合わせがあることになる。また、認識対象文字列の文字数が３個、認識候補順位数が第１位〜第１０位までの１０個である場合、Ｓ＝１３３０通りとなる。 For example, when the number of characters in the recognition target character string is three and the number of recognition candidate ranks is five from first to fifth, there are S = 215 combinations. When the number of characters in the recognition target character string is three and the number of recognition candidate ranks is ten from the first to the tenth, S = 1330.

このため、特許文献１〜３に開示されたキーワード抽出・検索装置において、これら組み合わせのすべてからキーワード候補を抽出し、あらかじめ用意されたキーワード辞書との照合を行うとした場合には、上記のように文字の組み合わせが非常に多くなると、その照合に要する計算量は増大し、その結果、文字認識結果中におけるキーワードの有無の判定に要する時間が増大する問題点がある。 Therefore, in the keyword extraction / search apparatus disclosed in Patent Documents 1 to 3, when keyword candidates are extracted from all of these combinations and collated with a keyword dictionary prepared in advance, the above is performed. When the number of character combinations is extremely large, the amount of calculation required for the collation increases, and as a result, the time required for determining the presence or absence of a keyword in the character recognition result increases.

一方、文字認識結果中の文字について各分野の頻度情報を保持しておき、単純に候補文字中で最も頻度が高い文字を取り出し、それらを結合して文字列を作成した後に、キーワード辞書中のキーワードの有無を調べる方法も考えられる。 On the other hand, the frequency information of each field is retained for the characters in the character recognition result, and the character with the highest frequency is simply extracted from the candidate characters and combined to create a character string. A method for checking the presence or absence of keywords is also conceivable.

しかしながら、この方法の場合、認識対象文字列との類似度、すなわち、文字の形状としての近さが考慮されていないため、キーワードと一致する可能性が低下する問題点がある。 However, in the case of this method, since the similarity with the recognition target character string, that is, the proximity as the character shape is not considered, there is a problem that the possibility of matching with the keyword is lowered.

上記問題点に鑑み、本発明の目的は、少ない計算量で文字認識の精度を高めることができる文字列生成装置及び方法、並びに、文字列生成プログラム及びその文字列生成プログラムを記録したコンピュータ読み取り可能な記録媒体を提供することにある。 In view of the above problems, an object of the present invention is to provide a character string generation device and method capable of increasing the accuracy of character recognition with a small amount of calculation, a character string generation program, and a computer readable recording of the character string generation program Is to provide a simple recording medium.

上記目的を達成するために、本発明にかかる文字列生成装置は、認識対象となる認識対象文字から構成される認識対象文字列に含まれる前記認識対象文字と、当該認識対象文字の認識結果の候補である複数の候補文字との間の類似度を用いて文字認識を行う文字列生成装置であって、１つ以上の候補文字からなる候補文字群を、各認識対象文字について取得し、前記類似度と、複数設定された所定の概念のうちの１つを表現する場合にその候補文字が使用される頻度または前記候補文字と前記所定の概念のうちの１つとの関連性の程度を表す関連度とを用いて、前記認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、前記所定の概念ごとに算出する算出手段と、前記算出手段により算出された、前記所定の概念ごとの評価値を用いて、前記各認識対象文字に対応する候補文字群からそれぞれ１つずつ選ばれた候補文字を結合することによって、前記認識対象文字列の認識結果の候補である候補文字列を、前記所定の概念ごとに生成する文字列結合手段と、前記文字列結合手段が生成した候補文字列の中から、前記認識対象文字列の認識結果となる認識結果文字列を決定する決定手段とを備えることを特徴とする。 In order to achieve the above object, a character string generation device according to the present invention includes a recognition target character included in a recognition target character string including recognition target characters to be recognized, and a recognition result of the recognition target character. A character string generation device that performs character recognition using similarity between a plurality of candidate characters that are candidates, and acquires a candidate character group including one or more candidate characters for each recognition target character, Represents the degree of similarity and the frequency with which a candidate character is used when expressing one of a plurality of predetermined concepts or the degree of relevance between the candidate character and one of the predetermined concepts An evaluation value representing the certainty as a recognition result of the recognition target character using the relevance level, for each candidate character, a calculation unit for calculating the predetermined concept, and the predetermined unit calculated by the calculation unit Concept The candidate character string that is a candidate for the recognition result of the recognition target character string by combining candidate characters selected one by one from the candidate character group corresponding to each recognition target character using the evaluation value A character string combining unit that generates a recognition result character string as a recognition result of the recognition target character string from among candidate character strings generated by the character string combining unit. It is characterized by providing.

上記の文字列生成装置では、算出手段が、類似度と、所定の概念の１つを表現する場合に候補文字が使用される頻度、候補文字と所定の概念のうちの１つとの関連性の程度等を表す関連度とを用いて、認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、所定の概念ごとに算出するので、各候補文字の認識結果としての確からしさを、類似度のみを用いた場合と比べて、向上させることができる。 In the above character string generation device, the calculation means determines the similarity, the frequency with which the candidate character is used when expressing one of the predetermined concepts, and the relationship between the candidate character and one of the predetermined concepts. Since the evaluation value indicating the certainty as the recognition result of the recognition target character is calculated for each predetermined concept using the degree of association representing the degree, etc., the certainty as the recognition result of each candidate character Can be improved as compared with the case where only the similarity is used.

このため、文字列結合手段が、その算出された評価値を用いて候補文字列を所定の概念ごとに生成し、決定手段が、その生成された候補文字列の中から認識結果文字列を決定することにより、認識結果文字列の文字認識の精度を高めることができる。 For this reason, the character string combining unit generates candidate character strings for each predetermined concept using the calculated evaluation value, and the determining unit determines the recognition result character string from the generated candidate character strings. By doing so, the accuracy of character recognition of the recognition result character string can be improved.

さらに、各候補文字の認識結果としての確からしさが向上することにより、従来では必要であった、文字認識の精度向上のためのキーワードとの照合処理を低減することができる。したがって、文字認識の精度を少ない計算量で高めることが可能となる。 Furthermore, by improving the certainty as the recognition result of each candidate character, it is possible to reduce collation processing with a keyword for improving the accuracy of character recognition, which has been conventionally required. Therefore, the accuracy of character recognition can be increased with a small amount of calculation.

前記文字列結合手段は、前記認識対象文字列に含まれる各認識対象文字について、対応する候補文字の群から最高の評価値を有する最高評価候補文字を選択し、選択した最高評価候補文字を結合することにより、前記候補文字列を生成することが好ましい。 The character string combining means selects, for each recognition target character included in the recognition target character string, a highest evaluation candidate character having the highest evaluation value from a group of corresponding candidate characters, and combines the selected highest evaluation candidate characters Thus, it is preferable to generate the candidate character string.

この場合、文字列結合手段が生成する候補文字列に含まれる各候補文字は、各認識対象文字に対応する候補文字の群に含まれる候補文字のうちの最高の評価値を有するものとなる。それゆえ、文字列結合手段が生成する候補文字列の精度を高めることができる。 In this case, each candidate character included in the candidate character string generated by the character string combining unit has the highest evaluation value among the candidate characters included in the group of candidate characters corresponding to each recognition target character. Therefore, the accuracy of the candidate character string generated by the character string combining means can be improved.

前記決定手段は、前記文字列結合手段が生成した候補文字列に含まれる各候補文字が有する前記評価値を当該候補文字列ごとに加算し、当該評価値の合計を用いて、前記認識結果文字列を決定することが好ましい。 The determination unit adds the evaluation value of each candidate character included in the candidate character string generated by the character string combination unit for each candidate character string, and uses the total of the evaluation values to determine the recognition result character Preferably the column is determined.

この場合、決定手段は各候補文字の評価値を候補文字列ごとに加算するだけで認識結果文字列を決定できるので、決定手段による認識結果文字列の決定を効率よく行うことができる。 In this case, since the determining means can determine the recognition result character string only by adding the evaluation value of each candidate character for each candidate character string, the determining means can efficiently determine the recognition result character string.

前記算出手段は、前記関連度または前記類似度に対して重み付けを行った後に、当該関連度と当該類似度とを加算することにより前記評価値を算出することが好ましい。 It is preferable that the calculation means calculates the evaluation value by weighting the relevance or the similarity and then adding the relevance and the similarity.

この場合、関連度及び類似度のいずれも評価値の算出に寄与させることができるので、各候補文字の評価値の精度を高めることができる。 In this case, since both the relevance level and the similarity level can contribute to the calculation of the evaluation value, the accuracy of the evaluation value of each candidate character can be increased.

前記算出手段は、前記所定の概念のうち、どの所定の概念についての関連度を、前記評価値の算出のために用いればよいかを示すユーザからの指示を取得し、当該取得された関連度を用いて前記評価値を算出することが好ましい。 The calculation means acquires an instruction from a user indicating which predetermined concept of the predetermined concepts should be used for calculating the evaluation value, and the acquired related degree It is preferable to calculate the evaluation value using.

この場合、評価値の算出のために用いる関連度をユーザからの指示に基づいて取得することができるので、その関連度を用いて評価値を算出することにより、算出される評価値の精度を向上させることができる。 In this case, since the degree of association used for calculating the evaluation value can be acquired based on an instruction from the user, the accuracy of the calculated evaluation value can be increased by calculating the evaluation value using the degree of association. Can be improved.

前記所定の概念のいずれかと関連性を有するキーワードを格納するキーワード辞書と、前記文字列結合手段が生成した候補文字列と、前記キーワード辞書に格納されているキーワードとを照合することにより、両者が一致するかどうかを判定する照合手段とをさらに備え、前記決定手段は、前記照合手段の判定結果を用いて前記認識結果文字列を決定することが好ましい。 By collating a keyword dictionary that stores keywords having relevance with any of the predetermined concepts, a candidate character string generated by the character string combining unit, and a keyword stored in the keyword dictionary, It is preferable that the information processing apparatus further includes a collating unit that determines whether or not they match, and the determining unit determines the recognition result character string using a determination result of the collating unit.

この場合、照合手段が、候補文字列とキーワード辞書に格納されているキーワードとを照合することにより、両者が一致するかどうかを判定し、決定手段が、その判定結果を用いて認識結果文字列を決定する。このため、例えば、両者が一致する場合のみ、決定手段が、認識結果文字列を決定することにより、キーワードとして存在しない認識結果文字列を決定することがなくなり、認識結果文字列の精度を向上させることができる。 In this case, the collating unit collates the candidate character string with the keyword stored in the keyword dictionary to determine whether or not they match, and the determining unit uses the determination result to determine the recognition result character string. To decide. For this reason, for example, only when the two match, the determining means determines the recognition result character string, so that the recognition result character string that does not exist as a keyword is not determined, and the accuracy of the recognition result character string is improved. be able to.

前記キーワードは、前記所定の概念ごとに分類されており、前記照合手段は、所定の規則に従って、前記候補文字列と照合するキーワードが属する所定の概念を選択し、選択した所定の概念に属するキーワードと前記候補文字列とを照合することが好ましい。 The keywords are classified for each of the predetermined concepts, and the collating unit selects a predetermined concept to which the keyword to be collated with the candidate character string belongs according to a predetermined rule, and the keywords belonging to the selected predetermined concept And the candidate character string are preferably collated.

この場合、候補文字列と照合するキーワードが属する所定の概念の選択を効率よく行うことができるので、キーワードと候補文字列との照合回数を減らすことができる。 In this case, since the predetermined concept to which the keyword to be matched with the candidate character string belongs can be selected efficiently, the number of matching times between the keyword and the candidate character string can be reduced.

前記候補文字列と前記キーワードとが所定の条件の範囲内で部分的に異なっていると前記照合手段が判定した場合に、前記キーワードと一致しない候補文字を、当該候補文字を含む候補文字群が有する他の候補文字に置換する置換手段をさらに備え、前記決定手段は、前記置換手段が生成した候補文字列を、前記認識結果文字列の候補に加えることが好ましい。 When the collating unit determines that the candidate character string and the keyword are partially different within a predetermined condition range, a candidate character group including the candidate character is selected as a candidate character that does not match the keyword. It is preferable that the image forming apparatus further includes a replacement unit that replaces the candidate character with another candidate character, and the determination unit adds the candidate character string generated by the replacement unit to the recognition result character string candidate.

この場合、候補文字列とキーワードとが部分的に異なっている場合でも、置換手段が、キーワードと一致しない候補文字を他の候補文字に置換し、決定手段が、置換手段が生成した候補文字列を認識結果文字列の候補に加えるので、認識結果文字列の候補の生成を効率よく行うことができる。 In this case, even when the candidate character string and the keyword are partially different, the replacement unit replaces the candidate character that does not match the keyword with another candidate character, and the determination unit generates the candidate character string generated by the replacement unit. Is added to the recognition result character string candidate, so that the recognition result character string candidate can be efficiently generated.

前記照合手段が、前記候補文字列と前記キーワードとが一致しないと判定した場合に、当該候補文字列が複数のキーワードの組み合わせによって構成されているかどうかを判定するキーワード解析手段をさらに備え、前記照合手段は、前記候補文字列が複数のキーワードの組み合わせによって構成されていると前記キーワード解析手段が判定した場合に、当該キーワードの組み合わせと前記候補文字列とを照合することが好ましい。 When the collation unit determines that the candidate character string and the keyword do not match, the collation unit further includes a keyword analysis unit that determines whether the candidate character string includes a combination of a plurality of keywords. Preferably, the means collates the keyword combination with the candidate character string when the keyword analyzing means determines that the candidate character string is composed of a plurality of keyword combinations.

この場合、候補文字列とキーワードとが一致しない場合でも、キーワード解析手段が、候補文字列が複数のキーワードの組み合わせによって構成されているかどうかを判定し、照合手段が、そのキーワードの組み合わせと前記候補文字列とを照合する。このため、候補文字列が複数のキーワードの組み合わせによって構成されている場合でも、その候補文字列とキーワードとの照合を行うことができる。 In this case, even when the candidate character string and the keyword do not match, the keyword analysis means determines whether the candidate character string is composed of a combination of a plurality of keywords, and the matching means determines that the keyword combination and the candidate Matches against a string. For this reason, even when a candidate character string is composed of a combination of a plurality of keywords, the candidate character string and the keyword can be collated.

前記キーワード解析手段は、前記照合手段が、前記候補文字列と前記キーワードとが一致しないと判定した場合に、当該候補文字列が属する所定の概念と関連性の高いキーワードと、当該候補文字列とを照合することにより、当該候補文字列が複数のキーワードの組み合わせによって構成されているかどうかを判定することが好ましい。 The keyword analysis means, when the matching means determines that the candidate character string and the keyword do not match, the keyword highly relevant to the predetermined concept to which the candidate character string belongs, and the candidate character string It is preferable to determine whether or not the candidate character string is composed of a combination of a plurality of keywords.

前記キーワード解析手段は、前記照合手段が、当該候補文字列が属する所定の概念と関連性の高いキーワードと、当該候補文字列とが一致しないと判定した場合に、当該候補文字列が属する所定の概念とは異なる所定の概念と関連性の高いキーワードと、当該候補文字列とを照合することにより、当該候補文字列が複数のキーワードの組み合わせによって構成されているかどうかを判定することが好ましい。 The keyword analysis unit determines that the candidate character string to which the candidate character string belongs when the matching unit determines that the keyword highly relevant to the predetermined concept to which the candidate character string belongs does not match the candidate character string. It is preferable to determine whether or not the candidate character string is composed of a combination of a plurality of keywords by comparing the candidate character string with a keyword highly relevant to a predetermined concept different from the concept.

この場合、候補文字列が属する所定の概念と関連性の高いキーワードと、候補文字列とが一致しないと判定された場合でも、候補文字列が属する所定の概念とは異なる所定の概念と関連性の高いキーワードと照合することにより、候補文字列が複数のキーワードの組み合わせによって構成されているかどうかを判定することができる。 In this case, even when it is determined that the keyword that is highly relevant to the predetermined concept to which the candidate character string belongs does not match the candidate character string, the predetermined concept and the relevance different from the predetermined concept to which the candidate character string belongs. It is possible to determine whether or not the candidate character string is composed of a combination of a plurality of keywords by collating with a high keyword.

前記キーワード解析手段は、前記候補文字列が複数のキーワードの組み合わせによって構成されていると判定した場合、前記照合手段が前記キーワードの組み合わせと前記候補文字列とを照合した照合結果に基づいて、前記候補文字列の確からしさを表わす確度情報を当該候補文字列に付与し、前記決定手段は、前記候補文字列に付加された確度情報を用いて前記認識結果文字列を決定することが好ましい。 When the keyword analyzing unit determines that the candidate character string is configured by a combination of a plurality of keywords, the matching unit is based on a matching result obtained by matching the keyword combination with the candidate character string. It is preferable that accuracy information indicating the probability of the candidate character string is given to the candidate character string, and the determination unit determines the recognition result character string using the accuracy information added to the candidate character string.

この場合、キーワード解析手段が、候補文字列が複数のキーワードの組み合わせによって構成されていると判定した場合に、照合手段がキーワードの組み合わせと候補文字列とを照合した照合結果に基づいて、候補文字列の確からしさを表わす確度情報を候補文字列に付与する。このため、決定手段が、候補文字列に付加された確度情報を用いて認識結果文字列を決定することができるので、決定手段が決定する認識結果文字列の精度を向上させることができる。 In this case, when the keyword analysis unit determines that the candidate character string is configured by a combination of a plurality of keywords, the matching unit compares the candidate character string with the candidate character string based on the matching result obtained by matching the keyword combination with the candidate character string. The accuracy information indicating the likelihood of the column is given to the candidate character string. For this reason, since the determination means can determine the recognition result character string using the accuracy information added to the candidate character string, the accuracy of the recognition result character string determined by the determination means can be improved.

本発明にかかる文字列生成プログラムは、上記の文字列生成装置の上記各手段としてコンピュータを動作させるための文字列生成プログラムである。 A character string generation program according to the present invention is a character string generation program for causing a computer to operate as each means of the character string generation device.

上記の文字列生成プログラムでは、算出手段が、類似度と、所定の概念の１つを表現する場合に候補文字が使用される頻度、候補文字と所定の概念のうちの１つとの関連性の程度等を表す関連度とを用いて、認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、所定の概念ごとに算出するので、各候補文字の認識結果としての確からしさを、類似度のみを用いた場合と比べて、向上させることができる。 In the above-described character string generation program, the calculation means determines the similarity, the frequency with which the candidate character is used when expressing one of the predetermined concepts, and the relationship between the candidate character and one of the predetermined concepts. Since the evaluation value indicating the certainty as the recognition result of the recognition target character is calculated for each predetermined concept using the degree of association representing the degree, etc., the certainty as the recognition result of each candidate character Can be improved as compared with the case where only the similarity is used.

本発明にかかるコンピュータ読み取り可能な記録媒体に記録された文字列生成プログラムは、上記の文字列生成装置の上記各手段としてコンピュータを動作させるための文字列生成プログラムである。 A character string generation program recorded on a computer-readable recording medium according to the present invention is a character string generation program for causing a computer to operate as each means of the character string generation device.

本発明にかかる文字列生成方法は、認識対象となる認識対象文字から構成される認識対象文字列に含まれる前記認識対象文字と、当該認識対象文字の認識結果の候補である複数の候補文字との間の類似度を用いて文字認識を行う文字列生成方法であって、１つ以上の候補文字からなる候補文字群を、各認識対象文字について取得し、前記類似度と、複数設定された所定の概念のうちの１つを表現する場合にその候補文字が使用される頻度または前記候補文字と前記所定の概念のうちの１つとの関連性の程度を表す関連度とを用いて、前記認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、前記所定の概念ごとに算出する算出ステップと、前記算出ステップにおいて算出された、前記所定の概念ごとの評価値を用いて、前記各認識対象文字に対応する候補文字群からそれぞれ１つずつ選ばれた候補文字を結合することによって、前記認識対象文字列の認識結果の候補である候補文字列を、前記所定の概念ごとに生成する文字列結合ステップと、前記文字列結合ステップにおいて生成された候補文字列の中から、前記認識対象文字列の認識結果となる認識結果文字列を決定する決定ステップとを備えることを特徴とする。 The character string generation method according to the present invention includes a recognition target character string included in a recognition target character string including recognition target characters to be recognized, and a plurality of candidate characters that are candidates for recognition results of the recognition target character. A character string generation method that performs character recognition using a similarity between two or more candidate characters composed of one or more candidate characters, acquired for each recognition target character, and a plurality of the similarities are set Using the frequency with which the candidate character is used when expressing one of the predetermined concepts or the degree of association representing the degree of relevance between the candidate character and one of the predetermined concepts, For each candidate character, an evaluation value representing the probability as a recognition result of the recognition target character is calculated for each predetermined concept, and the evaluation value for each predetermined concept calculated in the calculation step is used. By combining candidate characters selected one by one from the candidate character group corresponding to each recognition target character, a candidate character string that is a recognition result candidate of the recognition target character string is converted into the predetermined concept. A character string combining step that is generated every time, and a determination step that determines a recognition result character string that is a recognition result of the recognition target character string from among the candidate character strings generated in the character string combining step. Features.

上記の文字列生成方法では、類似度と、所定の概念の１つを表現する場合に候補文字が使用される頻度、候補文字と所定の概念のうちの１つとの関連性の程度等を表す関連度とを用いて、認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、所定の概念ごとに算出するので、各候補文字の認識結果としての確からしさを、類似度のみを用いた場合と比べて、向上させることができる。 In the above character string generation method, the degree of similarity, the frequency of use of candidate characters when expressing one of the predetermined concepts, the degree of relevance between the candidate character and one of the predetermined concepts, and the like are represented. Since the evaluation value representing the certainty as the recognition result of the recognition target character is calculated for each candidate character for each predetermined concept using the relevance, the certainty as the recognition result of each candidate character Compared with the case of using only, it can be improved.

このため、その算出された評価値を用いて候補文字列を所定の概念ごとに生成し、その生成された候補文字列の中から認識結果文字列を決定することにより、認識結果文字列の文字認識の精度を高めることができる。 For this reason, the character string of the recognition result character string is generated by generating a candidate character string for each predetermined concept using the calculated evaluation value and determining the recognition result character string from the generated candidate character string. Recognition accuracy can be increased.

なお、前記文字列生成装置は、コンピュータによって実現してもよく、この場合には、コンピュータを前記各部として動作させることにより前記文字列生成装置をコンピュータにて実現させる文字列生成プログラム、及びそれを記録したコンピュータ読み取り可能な記録媒体も、本発明の範疇に入る。 The character string generation device may be realized by a computer. In this case, a character string generation program for causing the character string generation device to be realized by a computer by causing the computer to operate as each unit, and A recorded computer-readable recording medium also falls within the scope of the present invention.

本発明にかかる文字列生成装置は、以上のように、１つ以上の候補文字からなる候補文字群を、各認識対象文字について取得し、前記類似度と、複数設定された所定の概念のうちの１つを表現する場合にその候補文字が使用される頻度または前記候補文字と前記所定の概念のうちの１つとの関連性の程度を表す関連度とを用いて、前記認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について、前記所定の概念ごとに算出する算出手段と、前記算出手段により算出された、前記所定の概念ごとの評価値を用いて、前記各認識対象文字に対応する候補文字群からそれぞれ１つずつ選ばれた候補文字を結合することによって、前記認識対象文字列の認識結果の候補である候補文字列を、前記所定の概念ごとに生成する文字列結合手段と、前記文字列結合手段が生成した候補文字列の中から、前記認識対象文字列の認識結果となる認識結果文字列を決定する決定手段とを備えている。 As described above, the character string generation device according to the present invention acquires a candidate character group composed of one or more candidate characters for each recognition target character, and among the similarity and a plurality of predetermined concepts set. Recognition of the recognition target character using the frequency with which the candidate character is used when expressing one of the characters or the degree of relevance representing the degree of relevance between the candidate character and one of the predetermined concepts For each candidate character, a calculation unit that calculates an evaluation value representing the certainty as a result for each predetermined concept, and the evaluation value for each predetermined concept calculated by the calculation unit is used for each recognition. Characters that generate candidate character strings that are candidates for recognition results of the recognition target character strings for each predetermined concept by combining candidate characters selected one by one from the candidate character group corresponding to the target characters. Column And covering means, from among the candidate character strings in which the character string coupling means has generated and a determination means for determining a recognition result string of the recognition result of the recognition target character string.

それゆえ、少ない計算量で文字認識の精度を高めることができるという効果を奏する。 Therefore, it is possible to improve the accuracy of character recognition with a small amount of calculation.

以下、本発明の実施の形態について、図面を参照しながら説明する。なお、同一部分には同一符号を付し、図面で同一の符号が付いたものは、説明を省略する場合もある。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. In addition, the same code | symbol is attached | subjected to the same part, and what attached the same code | symbol in drawing may abbreviate | omit description.

（実施の形態１）
（文字列生成装置３０の構成）
図１は、本発明の実施の形態１にかかる文字列生成装置３０の概略構成を示すブロック図である。図１において、本実施の形態にかかる文字列生成装置３０は、制御部１と、文字認識部２と、候補蓄積部３と、算出部（算出手段）４と、関連情報蓄積部５と、文字列結合部（文字列結合手段）６と、決定部（決定手段）７と、入力部８と、出力部９と、を備えている。 (Embodiment 1)
(Configuration of character string generation device 30)
FIG. 1 is a block diagram showing a schematic configuration of a character string generation device 30 according to the first embodiment of the present invention. In FIG. 1, a character string generation device 30 according to the present embodiment includes a control unit 1, a character recognition unit 2, a candidate storage unit 3, a calculation unit (calculation unit) 4, a related information storage unit 5, A character string coupling unit (character string coupling unit) 6, a determination unit (determination unit) 7, an input unit 8, and an output unit 9 are provided.

文字列生成装置３０において、制御部１は入力部８から入力される画像から認識対象文字列の各文字を文字認識し、最終結果である認識結果文字列を出力部９から出力するまでの全体の処理を制御する。制御部１は文字認識部２、候補蓄積部３、算出部４、関連情報蓄積部５、文字列結合部６及び決定部７の各々と接続されており、例えば、各部間における各種のデータのやり取りを制御し、それら各種のデータを一時的に記憶する。また、制御部１は、各部の処理に必要となる基準、ルール等も適宜記憶する。制御部１は、例えば、ＣＰＵ及び、ＲＡＭ、ＲＯＭ等のメモリから構成すればよい。 In the character string generation device 30, the control unit 1 recognizes each character of the recognition target character string from the image input from the input unit 8, and outputs the recognition result character string that is the final result from the output unit 9. Control the processing. The control unit 1 is connected to each of the character recognition unit 2, candidate storage unit 3, calculation unit 4, related information storage unit 5, character string combination unit 6, and determination unit 7. The exchange is controlled and these various data are temporarily stored. In addition, the control unit 1 appropriately stores criteria, rules, and the like necessary for processing of each unit. The control unit 1 may be composed of, for example, a CPU and a memory such as a RAM and a ROM.

入力部８は、例えば、スキャナであるが、画像を取得できるものであれば、どのようなものであってもよい。 The input unit 8 is, for example, a scanner, but may be any device that can acquire an image.

文字認識部２は、入力部８を用いて入力された画像（入力画像）に含まれる認識対象文字列を構成するの各認識対象文字を認識する。この入力画像には、少なくとも１つの文字列が含まれており、入力画像に含まれる文字列のうちの、認識対象となる文字列が、認識対象文字列である。 The character recognition unit 2 recognizes each recognition target character constituting the recognition target character string included in the image (input image) input using the input unit 8. The input image includes at least one character string, and the character string to be recognized among the character strings included in the input image is the recognition target character string.

この文字認識処理では、まず、入力画像から、認識対象文字列を構成する各認識対象文字列を構成する各認識対象文字を１つずつ含む文字画像が切り出され、その切り出された文字画像に含まれる認識対象文字ごとに特徴量が算出される。 In this character recognition process, first, a character image including each recognition target character constituting each recognition target character string constituting the recognition target character string is cut out from the input image, and included in the cut character image. A feature amount is calculated for each recognition target character.

次に、その算出された特徴量に基づいて、あらかじめ用意された標準パターンの認識辞書から１文字当たり単数または複数の認識候補文字が選出される。 Next, based on the calculated feature amount, one or a plurality of recognition candidate characters per character is selected from a standard pattern recognition dictionary prepared in advance.

換言すれば、文字認識部２は、認識対象文字に対応する１つ以上の候補文字からなる候補文字群を、各認識対象文字について生成する。 In other words, the character recognition unit 2 generates a candidate character group including one or more candidate characters corresponding to the recognition target character for each recognition target character.

各認識候補文字には、各々の候補順位及び類似度が付与される。この類似度とは、切り出された認識対象文字と選出された認識候補文字とがどれだけ近いかを表わす割合を示すものであり、例えば、最も近いものが第１候補とされる。文字認識部２は認識対象文字列の１文字ごとに認識候補文字とその候補順位、及び、各々に対応する類似度を互いに関連付けて候補蓄積部３に蓄積する。 Each candidate ranking and similarity are given to each recognition candidate character. This similarity indicates a ratio indicating how close the extracted recognition target character and the selected recognition candidate character are. For example, the closest one is the first candidate. The character recognition unit 2 stores the recognition candidate characters, their candidate ranks, and the similarities corresponding to each character in the recognition target character string in the candidate storage unit 3 in association with each other.

候補蓄積部３は、例えばフラッシュメモリ、ＵＳＢ、ＲＯＭ等の半導体メモリ、ハードディスク、ＣＤ、ＤＶＤ等のＤＶＤ磁気メモリから構成されており、文字認識部２の認識結果を順次蓄積する。 The candidate accumulating unit 3 includes, for example, a flash memory, a semiconductor memory such as a USB and a ROM, and a DVD magnetic memory such as a hard disk, a CD, and a DVD, and sequentially accumulates the recognition results of the character recognition unit 2.

算出部４は、認識対象文字の認識結果としての確からしさを表す評価値を各候補文字について算出する。具体的には、算出部４は、候補蓄積部３に蓄積された各認識候補文字の類似度に加えて、関連情報蓄積部５にあらかじめ蓄積された分野別の関連情報（関連度）も利用し、認識対象文字列に含まれる各認識候補文字に対して上記評価値を算出する。 The calculation unit 4 calculates, for each candidate character, an evaluation value that represents the likelihood as the recognition result of the recognition target character. Specifically, the calculation unit 4 uses the related information (relevance) for each field stored in advance in the related information storage unit 5 in addition to the similarity of each recognition candidate character stored in the candidate storage unit 3. Then, the evaluation value is calculated for each recognition candidate character included in the recognition target character string.

関連情報蓄積部５は、例えば、あらかじめ選定された分野において出現する頻度の高い順に並べられた多数の文字のリストを上記の関連情報として蓄積している。この関連情報とは、選定された分野において、当該分野と各文字との関連性の強さを表わすものである。換言すれば、関連情報とは、分野という所定の概念を表現する場合にその文字が使用される頻度を示す指標である。 The related information accumulation unit 5 accumulates, for example, a list of a large number of characters arranged in the descending order of appearance in a preselected field as the related information. This related information represents the strength of the relationship between the field and each character in the selected field. In other words, the related information is an index indicating how often the character is used when a predetermined concept of a field is expressed.

上記のリストにおいては、より高い順位の文字ほど、当該分野との関連性が強い、つまり、当該分野に関する文書等において出現する頻度が高いことを意味する。あらかじめ選定される分野（所定の概念）は単数であっても複数であってもよく、例えば、「住所」、「姓」、「名」、「技術」、「科学」、「文学」、「一般」等が挙げられる。もちろん、これらは一例に過ぎず、他の分野であっても構わない。 In the above list, it means that the higher the character in the ranking, the stronger the relation with the field, that is, the higher the frequency of appearance in a document or the like related to the field. The field (predetermined concept) selected in advance may be singular or plural. For example, “address”, “surname”, “first name”, “technology”, “science”, “literature”, “ General "etc. are mentioned. Of course, these are merely examples, and other fields may be used.

文字列結合部６は、算出部４による認識候補文字列の妥当性の評価結果に基づいて認識結果文字列を生成する。具体的には、文字列結合部６は認識対象文字列の各文字に対する認識候補文字の妥当性の評価結果（すなわち、各認識候補文字に対して付与された評価値）を算出部４から取得する。そして、文字列結合部６は認識対象文字の１文字ごとに、候補蓄積部３に蓄積された、各認識対象文字の認識候補文字の群のうちから認識結果文字として出力すべき文字を１つずつ決定し、それらを結合することにより認識候補文字列を生成する。 The character string combination unit 6 generates a recognition result character string based on the validity evaluation result of the recognition candidate character string by the calculation unit 4. Specifically, the character string combination unit 6 acquires from the calculation unit 4 the evaluation result of the validity of the recognition candidate character for each character of the recognition target character string (that is, the evaluation value given to each recognition candidate character). To do. Then, for each character of the recognition target character, the character string combining unit 6 outputs one character to be output as a recognition result character from the recognition candidate character group of each recognition target character stored in the candidate storage unit 3. By determining them and combining them, a recognition candidate character string is generated.

決定部７は、文字列結合部６が生成した認識候補文字列の中から、認識対象文字列の認識結果となる認識結果文字列を決定する。決定部７によって決定された認識結果文字列は出力部９に出力される。 The determination unit 7 determines a recognition result character string that is a recognition result of the recognition target character string from among the recognition candidate character strings generated by the character string combining unit 6. The recognition result character string determined by the determination unit 7 is output to the output unit 9.

出力部９は、液晶、有機ＥＬ等のディスプレイを有しており、そのディスプレイ上に決定部７から出力された認識結果文字列を表示させる。もちろん、出力部９はこの構成に限られるものではなく、例えば、識結果文字列を紙面上に印刷する印刷機能を有していても構わない。要は、出力部９が、文字列生成装置３０の利用者が認識結果文字列を視認可能とするディスプレイ装置、プリンタ装置等を有していればよい。 The output unit 9 has a display such as a liquid crystal or an organic EL, and displays the recognition result character string output from the determination unit 7 on the display. Of course, the output unit 9 is not limited to this configuration. For example, the output unit 9 may have a printing function for printing the character string as a result. In short, the output unit 9 only needs to have a display device, a printer device, or the like that allows the user of the character string generation device 30 to visually recognize the recognition result character string.

なお、文字認識部２、算出部４、文字列結合部６及び決定部７は、例えば、個別の汎用の論理素子、機能素子等を組み合わせて実現すればよい。あるいは、ＡＳＩＣ等の専用素子で実現しても構わない。また、本実施の形態においては、図１にも示したように、制御部１、文字認識部２、算出部４、文字列結合部６及び決定部７は、それぞれ、別体としているが、例えば、制御部１のＣＰＵが所定の実行ファイルを実行することにより文字認識部２、算出部４、文字列結合部６及び決定部７の各々の機能が実施されるようにしても構わない。もちろん、文字認識部２、算出部４、文字列結合部６及び決定部７の各々の機能を実施するＣＰＵは、制御部１のＣＰＵと同一であっても、異なるものであっても構わない。 In addition, what is necessary is just to implement | achieve combining the character recognition part 2, the calculation part 4, the character string coupling | bond part 6, and the determination part 7, for example, an individual general purpose logic element, a functional element, etc. Or you may implement | achieve with exclusive elements, such as ASIC. Further, in the present embodiment, as shown in FIG. 1, the control unit 1, the character recognition unit 2, the calculation unit 4, the character string combination unit 6, and the determination unit 7 are each separated. For example, the functions of the character recognition unit 2, the calculation unit 4, the character string combination unit 6, and the determination unit 7 may be implemented by the CPU of the control unit 1 executing a predetermined execution file. Of course, the CPU that performs the functions of the character recognition unit 2, the calculation unit 4, the character string combination unit 6, and the determination unit 7 may be the same as or different from the CPU of the control unit 1. .

（文字列生成装置３０の動作）
次に、文字列生成装置３０の動作の一例について説明する。図２は、文字列生成装置３０における処理手順の一例を示すフローチャートである。以下では、上記分野（所定の概念）として、「住所」、「姓」、「名」、「一般」を用いる場合について説明する。もちろん、本発明はこの４種類の分野に限定されるものではなく、文字列生成装置３０の利用形態に応じて増やすことも減らすことも可能である。 (Operation of the character string generation device 30)
Next, an example of the operation of the character string generation device 30 will be described. FIG. 2 is a flowchart illustrating an example of a processing procedure in the character string generation device 30. Hereinafter, a case where “address”, “last name”, “first name”, and “general” are used as the field (predetermined concept) will be described. Of course, the present invention is not limited to these four types of fields, and can be increased or decreased according to the usage form of the character string generation device 30.

図２に示すように、まず、制御部１は、利用者の指示に基づいて文字列生成装置３０に対する初期設定を実行する（ステップＳ１０１）。具体的には、利用者による各種のパラメータの設定が行われる。このパラメータとしては、例えば、この文字列生成装置３０を用いて利用者が実施する「業務」の種類が挙げられる。 As shown in FIG. 2, first, the control unit 1 performs initial setting for the character string generation device 30 based on a user instruction (step S <b> 101). Specifically, various parameters are set by the user. As this parameter, for example, the type of “work” performed by the user using the character string generation device 30 can be cited.

複数設定された「分野（所定の概念）」は、互いに関連する「分野」を含む集合（分野集合（概念集合））のいずれかに分類されている。上記「業務」の種類とは、上記分野集合のひとつであり、上記「業務」の種類を設定することは、複数設定された分野集合のうちのひとつを選択することを意味する。選択された分野集合に含まれる「分野」を「対象分野」と称する。 A plurality of “fields (predetermined concepts)” set are classified into any of a set (field set (concept set)) including mutually related “fields”. The type of “business” is one of the field sets. Setting the type of “business” means selecting one of a plurality of set field sets. A “field” included in the selected field set is referred to as a “target field”.

この「業務」の種類が設定されると、この種類に応じた設定が算出部４及び関連情報蓄積部５に対して実行される。例えば、制御部１は、関連情報蓄積部５に対して、あらかじめ蓄積されている各分野の関連情報のうち、利用者が指定した「業務」に含まれる分野（対象分野）の関連情報を利用するよう指示する。また、制御部１は、算出部４に対して、利用者が指定した「業務」に含まれる対象分野ごとに、認識候補文字の評価値を算出するよう指示する。 When the type of “business” is set, the setting corresponding to the type is executed for the calculation unit 4 and the related information storage unit 5. For example, the control unit 1 uses the related information of the field (target field) included in the “business” designated by the user among the related information of each field accumulated in advance to the related information storage unit 5. Instruct them to do so. In addition, the control unit 1 instructs the calculation unit 4 to calculate the evaluation value of the recognition candidate character for each target field included in the “business” designated by the user.

換言すれば、算出部４は、どの分野についての関連度を、評価値の算出のために用いればよいかを示すユーザの指示を、制御部１を介して取得する。これにより、算出部４は、取得した指示が示す関連度を用いて評価値を算出することになる。 In other words, the calculation unit 4 acquires, via the control unit 1, a user instruction indicating which field the degree of association should be used for calculating the evaluation value. As a result, the calculation unit 4 calculates the evaluation value using the degree of association indicated by the acquired instruction.

次に、文字認識部２が、文字の認識を実行する（ステップＳ１０２）。この文字認識ステップＳ１０２では、入力部８を介して入力された入力画像に含まれる認識対象文字列を構成する各認識対象文字の認識が実行される。具体的には、入力画像から切り出された各文字画像について、当該各文字画像に含まれる認識対象文字の特徴を数十次元の数値列に変換した特徴ベクトルが生成される。この特徴ベクトルの生成には種々の公知手法があるが、例えば、文字画像を６４×６４個の要素に分割し、各要素で白が多ければ０、黒が多ければ１として６４×６４個の要素を持つ特徴ベクトルを作成すればよい。 Next, the character recognition unit 2 executes character recognition (step S102). In this character recognition step S102, recognition of each recognition target character constituting the recognition target character string included in the input image input via the input unit 8 is executed. Specifically, for each character image cut out from the input image, a feature vector is generated by converting the features of the recognition target character included in each character image into a numerical sequence of tens of dimensions. There are various known methods for generating this feature vector. For example, a character image is divided into 64 × 64 elements. Each element has 0 × if there are many whites and 1 if there are many blacks. A feature vector having elements may be created.

そして、文字画像から生成された特徴ベクトルと、辞書に登録されている文字ごとにあらかじめ生成されている辞書特徴ベクトルの各々との間で内積をとる。この辞書特徴ベクトルは、文字認識部２が利用可能な記憶部に格納されている。文字の形状が互いに似通っていれば、特徴ベクトル間の対応する要素が同じになるため、内積の値が高くなる。この内積の値が各認識候補文字の認識対象文字に対する類似度を表わしており、各認識対象文字において類似度の高い認識候補文字ほどその候補順位が上位となる。 Then, an inner product is calculated between the feature vector generated from the character image and each dictionary feature vector generated in advance for each character registered in the dictionary. The dictionary feature vector is stored in a storage unit that can be used by the character recognition unit 2. If the character shapes are similar to each other, the corresponding elements between the feature vectors are the same, so the inner product value is high. The value of the inner product represents the similarity of each recognition candidate character to the recognition target character. The recognition candidate character having the higher similarity in each recognition target character has a higher candidate rank.

次に、文字認識部２は、文字認識結果の蓄積を実行する（ステップＳ１０３）。この文字認識結果蓄積ステップＳ１０３では、文字認識ステップＳ１０２において認識された各認識候補文字と、その候補順位及び類似度とを対応付け、候補順位及び類似度と対応付けられた認識候補文字を含む候補文字群を候補蓄積部３に蓄積する。 Next, the character recognition unit 2 accumulates character recognition results (step S103). In this character recognition result accumulation step S103, each recognition candidate character recognized in the character recognition step S102 is associated with its candidate rank and similarity, and the candidate includes a recognition candidate character associated with the candidate rank and similarity. The character group is stored in the candidate storage unit 3.

次に、制御部１は、初期設定ステップＳ１０１において設定された「業務」が示す複数の対象分野のひとつを選択する（ステップＳ１０４）。 Next, the control unit 1 selects one of a plurality of target fields indicated by the “business” set in the initial setting step S101 (step S104).

次に、制御部１は、関連情報の蓄積を実行する（ステップＳ１０５）。この関連情報蓄積ステップＳ１０５では、文字認識結果蓄積ステップＳ１０３において蓄積された認識候補文字の各々に対し、対象分野決定ステップＳ１０４において選択された対象分野の関連情報を付与し、再び、候補蓄積部３に蓄積する。 Next, the control unit 1 performs accumulation of related information (step S105). In this related information accumulation step S105, the relevant information of the target field selected in the target field determination step S104 is assigned to each recognition candidate character stored in the character recognition result storage step S103, and again the candidate storage unit 3 To accumulate.

上記の対象分野決定ステップＳ１０４及び関連情報蓄積ステップＳ１０５が、関連情報蓄積部５にあらかじめ蓄積されている分野別関連情報の分野のうち、対象分野のすべてについて実行されるまで（Ｓ１０６にてＮＯ）、上記の対象分野決定ステップＳ１０４及び関連情報蓄積ステップＳ１０５が繰り返される。すなわち、制御部１は、候補蓄積部３に蓄積された認識候補文字のそれぞれに対して、当該認識候補文字に関する、すべての対象分野の関連情報を対応付ける。 Until the target field determination step S104 and the related information storage step S105 are executed for all the target fields among the fields of the field-related related information stored in the related information storage unit 5 in advance (NO in S106) The target field determining step S104 and the related information accumulating step S105 are repeated. That is, the control unit 1 associates each piece of recognition candidate character stored in the candidate storage unit 3 with related information of all target fields related to the recognition candidate character.

対象分野のすべてについて上記の対象分野決定ステップＳ１０４及び関連情報蓄積ステップＳ１０５が実行されると（Ｓ１０６にてＹＥＳ）、次に、算出部４は、認識候補文字の妥当性評価を実行する（ステップＳ１０７）。この認識候補文字妥当性評価ステップＳ１０７では、認識対象文字列の各認識候補文字のうち、いずれの認識候補文字を認識結果文字として選択すべきかを判断するための評価値を算出する。 When the target field determination step S104 and the related information accumulation step S105 are executed for all the target fields (YES in S106), the calculation unit 4 next executes validity evaluation of the recognition candidate characters (steps). S107). In this recognition candidate character validity evaluation step S107, an evaluation value for determining which of the recognition candidate characters of the recognition target character string should be selected as a recognition result character is calculated.

具体的には、算出部４は、上記の文字認識結果蓄積ステップＳ１０３及び関連情報蓄積ステップＳ１０５において候補蓄積部３に蓄積された各認識候補文字の類似度及び関連情報を用いて、あらかじめ設定された基準に基づき、各認識候補文字の妥当性を表わす評価値を算出する。この基準は、各認識候補文字の類似度及び関連情報から評価値を決めるための基準である。その内容としては、例えば、後述するように、各認識候補文字の類似度と関連情報とを単純に加算して評価値を算出する、あるいは、単純には加算せずに、類似度、関連情報に対して重み付けを行った後に加算し、評価値を算出する等である。 Specifically, the calculation unit 4 is set in advance using the similarity and related information of each recognition candidate character stored in the candidate storage unit 3 in the character recognition result storage step S103 and the related information storage step S105. Based on the above criteria, an evaluation value representing the validity of each recognition candidate character is calculated. This criterion is a criterion for determining an evaluation value from the similarity of each recognition candidate character and related information. As the contents, for example, as described later, the evaluation value is calculated by simply adding the similarity of each recognition candidate character and the related information, or the similarity and the related information are not simply added. Are added after weighting, and an evaluation value is calculated.

より詳細には、算出部４は、認識対象文字に対応する１つ以上の候補文字からなる候補文字群を、各認識対象文字について取得し、上記関連情報および類似度を用いて、分野という所定の概念を表現する場合の、認識対象文字の認識結果としての確からしさを表す評価値を各認識候補文字について、対象分野ごとに算出する。算出部４は、算出した評価値を文字列結合部６へ出力する。 More specifically, the calculation unit 4 acquires a candidate character group including one or more candidate characters corresponding to the recognition target character for each recognition target character, and uses the related information and the similarity to determine a predetermined field as a field. An evaluation value representing the certainty as a recognition result of the recognition target character when expressing the concept is calculated for each recognition candidate character for each target field. The calculation unit 4 outputs the calculated evaluation value to the character string combination unit 6.

次に、文字列結合部６は、算出部４から出力された評価値を用いて、各認識対象文字に対応する候補文字群からそれぞれ１つずつ選ばれた候補文字を、認識対象文字列における認識対象文字の配列と同じ配列で結合することによって、認識候補文字列を、対象分野ごとに生成する（ステップＳ１０８）。文字列結合部６は、生成した認識候補文字列を決定部７へ出力する。 Next, the character string combination unit 6 uses the evaluation value output from the calculation unit 4 to select one candidate character selected from each candidate character group corresponding to each recognition target character in the recognition target character string. A recognition candidate character string is generated for each target field by combining in the same array as the recognition target character array (step S108). The character string combination unit 6 outputs the generated recognition candidate character string to the determination unit 7.

次に、決定部７は、文字列結合部６から出力された複数の認識候補文字列の中から、認識対象文字列の認識結果となる認識結果文字列を決定する（ステップＳ１０９）。決定部７は、決定した認識結果文字列を出力部９へ出力する。この決定部７における処理の詳細については後述する。 Next, the determination unit 7 determines a recognition result character string that is a recognition result of the recognition target character string from among the plurality of recognition candidate character strings output from the character string combining unit 6 (step S109). The determination unit 7 outputs the determined recognition result character string to the output unit 9. Details of the processing in the determination unit 7 will be described later.

最後に、出力部９は、決定部７によって決定された認識結果文字列の出力（例えば、表示）を実行する（ステップＳ１１０）。 Finally, the output unit 9 outputs (for example, displays) the recognition result character string determined by the determination unit 7 (step S110).

このようにして、本実施の形態にかかる文字列生成装置３０の動作が終了する。 In this way, the operation of the character string generation device 30 according to the present embodiment is completed.

（文字認識結果の具体例）
次に、図２の関連情報蓄積ステップＳ１０５における処理内容について、さらに詳しく説明する。図３に、図２の文字認識ステップＳ１０２における処理結果の一例を示す。図３の処理結果例は、認識対象文字列として「奈良県」を用いた場合である。図３に示すように、認識対象文字列「奈良県」を構成する文字「奈」、「良」、「県」のそれぞれに対して、第１〜第５位までの認識候補文字１０、１１、１２が挙げられている。そして、各認識候補文字には類似度が付与されている。例えば、文字「奈」の第１位の認識候補文字は「茶」であり、その類似度は「１１０」である。文字認識ステップＳ１０２において、各認識対象文字の認識候補文字には、その類似度の高い順に従って候補順位が設定されている。 (Specific example of character recognition result)
Next, the processing content in the related information accumulation step S105 of FIG. 2 will be described in more detail. FIG. 3 shows an example of the processing result in the character recognition step S102 of FIG. The processing result example of FIG. 3 is a case where “Nara Prefecture” is used as the recognition target character string. As shown in FIG. 3, the recognition candidate characters 10 and 11 in the first to fifth ranks for the characters “N”, “Ryo”, and “Prefecture” constituting the recognition target character string “Nara Prefecture”. 12 are listed. Each recognition candidate character is given a similarity. For example, the first recognition candidate character of the character “NA” is “Brown”, and its similarity is “110”. In the character recognition step S102, candidate ranks are set for the recognition candidate characters of each recognition target character in descending order of similarity.

なお、図３では、認識対象文字列の文字数は３個、候補順位は第１〜第５位までを示しているが、認識対象文字数は、１〜２個でもよいし、４個以上でもよく、候補順位として出力される順位の数も、１つ以上４つ以下でもよく、６つ以上でもよい。また、「奈良県」の各認識候補文字に設定された類似度も単なる一例であり、文字認識によって必ずしもこの順序や、数値になるものではない。 In FIG. 3, the number of characters in the recognition target character string is three and the candidate ranks are first to fifth. However, the number of recognition target characters may be one to two, or four or more. The number of ranks output as candidate ranks may be 1 or more and 4 or less, or 6 or more. Also, the similarity set for each recognition candidate character of “Nara Prefecture” is merely an example, and this order or numerical value is not necessarily obtained by character recognition.

（関連情報の具体例）
図４に、図２の関連情報蓄積ステップＳ１０５における処理結果の一例を示す。図４の処理結果例では、図３の処理結果例における各認識候補文字に対して、関連情報蓄積部５にあらかじめ蓄積されている関連情報が分野ごとに付与されている。図４の処理結果例では、上述したように、対象分野として「住所」、「姓」、「名」、「一般」の４分野が選択されている。 (Specific examples of related information)
FIG. 4 shows an example of the processing result in the related information accumulation step S105 of FIG. In the processing result example of FIG. 4, related information stored in advance in the related information storage unit 5 is assigned to each recognition candidate character in the processing result example of FIG. In the processing result example of FIG. 4, as described above, four fields of “address”, “surname”, “first name”, and “general” are selected as target fields.

対象分野ごとに、認識対象文字「奈」の認識候補文字１３、認識対象文字「良」の認識候補文字１４、及び、認識対象文字「県」の認識候補文字１５の各々に対して、各文字と各分野との関連性の強さを表わす関連情報が付与されている。ここでは、各分野において、その分野との関連性が強い文字ほど、関連情報の数値が大きくなるようになっている。 For each target field, for each of the recognition candidate character 13 of the recognition target character “NA”, the recognition candidate character 14 of the recognition target character “good”, and the recognition candidate character 15 of the recognition target character “prefecture”, each character And related information indicating the strength of the relationship between each and each field. Here, in each field, the numerical value of the related information is increased as the character is more related to the field.

なお、図１の関連情報蓄積部５にあらかじめ蓄積されている分野別の関連情報は、分野ごとに、文字認識され得る文字候補の分だけ関連情報が存在する。したがって、関連情報蓄積部５には、仮にＪＩＳ第一、第二水準の漢字を全て認識対象文字とした場合、３０００〜４０００文字分程度の関連情報が分野ごとに蓄積されていることになる。 The related information for each field that is stored in advance in the related information storage unit 5 in FIG. 1 includes as many related information as possible character candidates that can be recognized for each field. Therefore, if all the JIS first and second level kanji characters are recognized as characters to be recognized, the related information storage unit 5 stores about 3000 to 4000 characters of related information for each field.

また、図４の例では、関連情報の一例として、各認識対象文字と各分野との間の関連性の強さを数値で示している。各数値は、例えば、各認識対象文字が各分野との関連性として、各分野という概念を表現する場合にその文字を用いる頻度で表わす。数値自体は、類似度との間で桁数の違いが大きいと文字認識の結果とほとんど同じになる場合、あるいは逆に、全く異なる場合が想定されるため、類似度に合わせて関連情報に重み付けを行う（正規化する）場合も想定される。 In the example of FIG. 4, as an example of the related information, the strength of the relationship between each recognition target character and each field is indicated by a numerical value. Each numerical value is represented, for example, by the frequency with which each recognition target character is used when the concept of each field is expressed as the relationship with each field. As for the numerical value itself, if the difference in the number of digits from the similarity is large, it may be almost the same as the result of character recognition, or conversely, it may be completely different, so the related information is weighted according to the similarity. It is also possible to perform (normalize).

例えば、図５の例１で示すように、文字認識の類似度の範囲が０〜２００００であり、関連情報の範囲が０〜５０である場合、類似度に対して、関連情報の値がほとんど影響を与えず、図３の処理結果例がほとんど変わらないことが予想される。 For example, as shown in Example 1 in FIG. 5, when the similarity range of character recognition is 0 to 20000 and the related information range is 0 to 50, the value of the related information is almost the same as the similarity. It is expected that the processing result example of FIG.

逆に、図５の例２で示すように、文字認識の類似度の範囲が０〜２００００であり、関連情報の範囲が０〜５０００である場合、図３の処理結果が関連情報の影響を受けすぎて、図３の処理結果例が意味をなさなくなる可能性がある。 Conversely, as shown in Example 2 in FIG. 5, when the similarity range of character recognition is 0 to 20000 and the related information range is 0 to 5000, the processing result of FIG. There is a possibility that the processing result example of FIG.

本発明では、関連情報は文字認識の誤りを訂正し、認識結果の順位を入れ替えるためのものである。このため、類似度と関連情報とのバランスを考慮し、例えば「関連情報の最大値は類似度の最大値の５％にする」というルールを設定しておけばよい。そうすることにより、図５の例１の場合であれば関連情報の値を一律２０倍し、図５の例２の場合であれば関連情報の値を一律５分の１に減少させる（正規化する）ことで、類似度と関連情報を適正な範囲にすることが可能である。 In the present invention, the related information is for correcting an error in character recognition and changing the order of recognition results. For this reason, considering the balance between the similarity and the related information, for example, a rule that “the maximum value of the related information is 5% of the maximum value of the similarity” may be set. By doing so, the value of the related information is uniformly multiplied by 20 in the case of Example 1 in FIG. 5, and the value of the related information is uniformly reduced to 1/5 in the case of Example 2 in FIG. It is possible to set the similarity and the related information within an appropriate range.

なお、本発明は上記の「５％」に限定されるものではない。要は、本発明は、文字認識で得られた類似度に対して補正的な意味合いで各分野との関連性を用いており、このため、関連情報の割合を大きくしすぎると文字認識の結果が意味をなさなくなってしまう。したがって、関連情報の影響を受けすぎないような値であればよい。 The present invention is not limited to the above “5%”. In short, the present invention uses relevance to each field in a corrective sense with respect to the similarity obtained by character recognition. For this reason, if the proportion of related information is too large, the result of character recognition No longer makes sense. Therefore, the value may be any value that is not excessively affected by the related information.

また、本実施の形態では、上述したように、関連情報として、各分野における、各認識候補文字の出現頻度を主に想定している。例えば「住所」の場合、住所を表すテキストを大量に収集し、そのテキストの中で各認識候補文字が現れる頻度を数え、関連情報とするものである。ただし、本実施の形態の関連情報は上記の出現頻度に限定されるものではなく、例えばシソーラス上における、「住所」と各認識候補文字の距離など、「住所」と各認識候補文字間の意味的な距離を関連情報として用いることも可能である。すなわち、上記関連情報は、認識候補文字と所定の概念との関連性の度合いを表す関連度であってもよい。 Moreover, in this Embodiment, as mentioned above, the appearance frequency of each recognition candidate character in each field is mainly assumed as related information. For example, in the case of “address”, a large amount of text representing an address is collected, and the frequency at which each recognition candidate character appears in the text is counted as related information. However, the related information of the present embodiment is not limited to the appearance frequency described above. For example, on the thesaurus, the meaning between “address” and each recognition candidate character, such as the distance between “address” and each recognition candidate character. It is also possible to use a specific distance as related information. In other words, the related information may be a degree of relevance representing a degree of relevance between the recognition candidate character and a predetermined concept.

（認識候補文字列妥当性評価ステップの詳細）
次に、図２の認識候補文字妥当性評価ステップＳ１０７についてさらに詳しく説明する。図６に、図２の認識候補文字妥当性評価ステップＳ１０７における処理結果の一例を示す。図６の処理結果例では、図３の処理結果例における各認識候補文字に、図４の処理結果例における分野「住所」の各認識候補文字に付与された関連情報が加算され、各評価値が算出されている。 (Details of recognition character string validation step)
Next, the recognition candidate character validity evaluation step S107 in FIG. 2 will be described in more detail. FIG. 6 shows an example of the processing result in the recognition candidate character validity evaluation step S107 of FIG. In the processing result example of FIG. 6, the related information given to each recognition candidate character of the field “address” in the processing result example of FIG. 4 is added to each recognition candidate character in the processing result example of FIG. Is calculated.

例えば、認識候補文字「奈」の場合、図３の類似度「１１０」に図４の関連情報「７」が加算され、評価値「１１２」が算出されている。同様にして、認識対象文字列「奈良県」の各文字「奈」、「良」、「県」に対する認識候補文字２０、２１、２２の各々に対して、各評価値が算出されている。そして、図３及び図６から明らかなように、この算出された評価値の大きさに従って各認識候補文字２０、２２、２１の候補順位の入れ替えが行われている。 For example, in the case of the recognition candidate character “NA”, the related information “7” in FIG. 4 is added to the similarity “110” in FIG. 3, and the evaluation value “112” is calculated. Similarly, each evaluation value is calculated for each of the recognition candidate characters 20, 21, and 22 for the characters “Nara”, “Ryo”, and “Prefecture” in the recognition target character string “Nara Prefecture”. As is clear from FIGS. 3 and 6, the candidate ranks of the recognition candidate characters 20, 22, and 21 are switched according to the calculated evaluation value.

図６の処理結果例では、単純に認識候補文字の類似度と分野別関連情報とを加算して評価値を算出しているが、他にも類似度、関連情報の各々の数値に重みを付けて、評価値を算出する等、評価値の算出方法については様々な方法が考えられる。 In the example of the processing result in FIG. 6, the evaluation value is calculated by simply adding the similarity of the recognition candidate character and the related information according to the field. In addition, various methods can be considered for calculating the evaluation value, such as calculating the evaluation value.

すなわち、算出部４は、関連情報の値または類似度に対して重み付けを行った後に、関連情報の値と類似度とを加算することにより評価値を算出してもよい。 That is, the calculation unit 4 may calculate the evaluation value by adding the value of the related information and the similarity after weighting the value or the similarity of the related information.

例えば、類似度の範囲が０〜１５０の時に、ある認識対象文字についての認識候補文字列のうち、最大の類似度が８０であった場合、その文字に対する文字認識処理の信頼性には問題があると考えられる。その場合、関連情報を重み付けして２倍した値と類似度を加算し、類似度に対する関連情報の影響の度合いを大きくすることにより、各分野との関連性の高い文字を上位に上げてもよい。そうすることにより、文字認識処理の低信頼性の影響を低減することができる。 For example, if the maximum similarity is 80 among the recognition candidate character strings for a certain recognition target character when the similarity is in the range of 0 to 150, there is a problem in the reliability of character recognition processing for that character. It is believed that there is. In that case, by adding the weighted value of the related information and doubling the similarity and increasing the degree of the influence of the related information on the similarity, even if the character highly related to each field is raised to the top Good. By doing so, the influence of the low reliability of a character recognition process can be reduced.

図６の処理結果例により、認識対象文字「奈」の認識候補文字２０のうち、「奈」の評価値が最高の１１２となり、同様に、認識対象文字「良」の認識候補文字２１のうち、「良」の評価値が最高の１０２、認識対象文字「県」の認識候補文字２２のうち、「県」の評価値が最高の１３０となる。ここでは、この結果を便宜的に、住所Ｍａｘ(１１２(奈)、１０２（良）、１３０(県))と表わす。 According to the processing result example of FIG. 6, among the recognition candidate characters 20 of the recognition target character “NA”, the evaluation value of “NA” becomes the highest 112, and similarly, among the recognition candidate characters 21 of the recognition target character “good”. , Among the recognition candidate characters 22 of the recognition target character “prefecture”, the evaluation value of “prefecture” is 130, which is the highest. Here, for the sake of convenience, this result is expressed as an address Max (112 (na), 102 (good), 130 (prefecture)).

図２の認識候補文字妥当性評価ステップＳ１０７においては、「住所」以外の「姓」、「名」、「一般」の各分野に対しても、図６の処理結果例と同様な処理結果を得る。すなわち、分野「姓」の場合であれば、認識対象文字「奈」の認識候補文字のうち、「奈」の評価値が最高の１１２となり、認識対象文字「良」の認識候補文字のうち、「食」の評価値が最高の１０１、認識対象文字「県」の認識候補文字のうち、「県」の評価値が最高の１２０となる。したがって、この結果は、姓Ｍａｘ(１１２(奈)、１０１（食）、１２０(県))となる。 In the recognition candidate character validity evaluation step S107 in FIG. 2, the same processing result as the processing result example in FIG. 6 is also obtained for each field of “last name”, “first name”, and “general” other than “address”. obtain. That is, in the case of the field “last name”, among the recognition candidate characters of the recognition target character “N”, the evaluation value of “NA” is the highest 112, and among the recognition candidate characters of the recognition target character “Good”, The highest evaluation value of “food” is 101, and among the recognition candidate characters of the recognition target character “prefecture”, the evaluation value of “prefecture” is 120, the highest. Therefore, this result is the surname Max (112 (na), 101 (food), 120 (prefecture)).

次に、分野「名」の場合であれば、認識対象文字「奈」の認識候補文字のうち、「茶」及び「奈」の評価値が最高の１１１となり、認識対象文字「良」の認識候補文字のうち、「食」の評価値が最高の１０１、認識対象文字「県」の認識候補文字のうち、「県」の評価値が最高の１２０となる。したがって、この結果は、名Ｍａｘ(１１１(茶、奈)、１０１（食）、１２０(県))となる。なお、認識対象文字「奈」の認識候補文字のうち、「茶」及び「奈」の評価値が同一となってしまうが、この場合、例えば、算出部４は類似度の高いほうを優先するようにすればよい。 Next, in the case of the field “name”, among the recognition candidate characters of the recognition target character “na”, the evaluation value of “brown” and “na” is the highest 111, and the recognition target character “good” is recognized. Among the candidate characters, the evaluation value of “food” is the highest 101, and among the recognition candidate characters of the recognition target character “prefecture”, the evaluation value of “prefecture” is 120, the highest. Therefore, the result is the name Max (111 (tea, na), 101 (food), 120 (prefecture)). Note that among the recognition candidate characters of the recognition target character “NA”, the evaluation values of “Brown” and “NA” are the same. In this case, for example, the calculation unit 4 gives priority to the one with the higher similarity. What should I do?

次に、分野「一般」の場合であれば、認識対象文字「奈」の認識候補文字のうち、「茶」の評価値が最高の１１５となり、認識対象文字「良」の認識候補文字のうち、「食」の評価値が最高の１０７、認識対象文字「県」の認識候補文字のうち、「県」の評価値が最高の１２３となる。したがって、この結果は一般Ｍａｘ(１１５(茶)、１０７（食）、１２３(県))となる。 Next, in the case of the field “general”, among the recognition candidate characters of the recognition target character “na”, the evaluation value of “brown” is the highest 115, and among the recognition candidate characters of the recognition target character “good” The evaluation value of “prefecture” among the recognition candidate characters “107” having the highest evaluation value of “food” and the recognition target character “prefecture” is 123. Therefore, this result is general Max (115 (tea), 107 (food), 123 (prefecture)).

このようにして、図２の認識候補文字妥当性評価ステップＳ１０７において、算出部４は、分野ごとに、各認識候補文字の妥当性を表わす評価値を算出する。 In this way, in the recognition candidate character validity evaluation step S107 of FIG. 2, the calculation unit 4 calculates an evaluation value representing the validity of each recognition candidate character for each field.

（認識候補文字列生成ステップ及び認識結果文字列決定ステップの詳細）
図２の認識候補文字列生成ステップＳ１０８においては、分野ごとに、上記の最高の評価値を持つ認識候補文字からなる認識候補文字列が生成される。具体的には、文字列結合部６は、認識対象文字列に含まれる各認識対象文字について、対応する認識候補文字の群から最高の評価値を有する認識候補文字（最高評価文字）を選択し、選択した最高評価文字を、認識対象文字列における認識対象文字の配列と対応する配列で結合することにより、認識候補文字列を生成する。文字列結合部６は、この処理を分野ごとに行う。それゆえ、分野の数と同数の認識候補文字列が生成される。 (Details of recognition candidate character string generation step and recognition result character string determination step)
In the recognition candidate character string generation step S108 in FIG. 2, a recognition candidate character string including the recognition candidate character having the highest evaluation value is generated for each field. Specifically, the character string combining unit 6 selects a recognition candidate character (highest evaluation character) having the highest evaluation value from the corresponding recognition candidate character group for each recognition target character included in the recognition target character string. Then, a recognition candidate character string is generated by combining the selected highest evaluated character with an array corresponding to the array of recognition target characters in the recognition target character string. The character string combining unit 6 performs this process for each field. Therefore, the same number of recognition candidate character strings as the number of fields are generated.

認識結果文字列決定ステップＳ１０９では、決定部７が、認識候補文字列生成ステップＳ１０８において生成された各分野の認識候補文字列のうちから、出力すべき認識候補文字列を選択し、認識結果文字列を決定する。例えば、分野ごとに、認識候補文字列に含まれる認識候補文字の評価値を加算し、その加算値が最大の認識候補文字列を認識結果文字列として決定すればよい。この場合、上述の例では、各分野の最高評価値の加算値は、「住所」、「姓」、「名」、「一般」の順に、３４４、３３３、３３３、３４５となる。したがって、単純に類似度と分野別関連情報を加算して求めた評価値の場合、分野「一般」の認識候補文字列である「茶良県」が認識結果文字列として決定される。 In the recognition result character string determination step S109, the determination unit 7 selects a recognition candidate character string to be output from the recognition candidate character strings of each field generated in the recognition candidate character string generation step S108, and the recognition result character string Determine the column. For example, the evaluation value of the recognition candidate character included in the recognition candidate character string may be added for each field, and the recognition candidate character string having the maximum addition value may be determined as the recognition result character string. In this case, in the above-described example, the added value of the highest evaluation value in each field is 344, 333, 333, and 345 in the order of “address”, “surname”, “first name”, and “general”. Therefore, in the case of the evaluation value obtained by simply adding the similarity and the field-related information, “Chara Prefecture”, which is the recognition candidate character string for the field “general”, is determined as the recognition result character string.

すなわち、決定部７は、分野ごとに生成した認識候補文字列に含まれる認識候補文字が有する評価値を当該認識候補文字列ごとに加算し、最高の評価値の合計を有する認識候補文字列を、認識対象文字列の認識結果として決定する。 That is, the determination unit 7 adds the evaluation value of the recognition candidate character included in the recognition candidate character string generated for each field for each recognition candidate character string, and determines the recognition candidate character string having the highest total evaluation value. Then, the recognition target character string is determined as a recognition result.

以上説明したように、本発明の実施の形態１によれば、図６に示した認識候補文字の評価値を用いることにより、図３に示した認識候補文字の類似度のみから認識結果文字列を決定した場合と比べて、認識対象文字列「奈良県」により近い認識結果文字列を出力することができる。 As described above, according to the first embodiment of the present invention, by using the recognition candidate character evaluation value shown in FIG. 6, the recognition result character string is obtained from only the similarity of the recognition candidate character shown in FIG. 3. The recognition result character string closer to the recognition target character string “Nara Prefecture” can be output as compared to the case where the determination is made.

なお、上述の例では、認識対象文字列と認識結果文字列とは完全には一致していないが、両者が完全に一致する場合もある。それゆえ、文字列生成装置３０が発明として未完成なわけではない。 In the above-described example, the recognition target character string and the recognition result character string do not completely match, but there are cases in which both match completely. Therefore, the character string generation device 30 is not incomplete as an invention.

（実施の形態２）
次に、本発明の実施の形態２について説明する。上記の実施の形態１では、認識候補文字の類似度に加えて分野ごとの関連情報を用いることにより、類似度のみを用いた場合と比べて認識対象文字列により近い認識結果文字列を出力するものであった。 (Embodiment 2)
Next, a second embodiment of the present invention will be described. In the first embodiment, by using the related information for each field in addition to the similarity of the recognition candidate character, the recognition result character string closer to the recognition target character string is output compared to the case where only the similarity is used. It was a thing.

しかしながら、図６から明らかなように、必ずしも認識対象文字列と完全に一致するとは限らない。すなわち、上記の実施の形態１においては、認識候補文字列と各分野との関連性の強さを表わす関連情報の評価値への寄与度や、類似度と関連情報との間における正規化処理の有無により、認識対象文字列とは完全に一致しない文字列が生成される場合がある。 However, as is apparent from FIG. 6, it does not necessarily match the recognition target character string completely. That is, in the above-described first embodiment, the degree of contribution to the evaluation value of the related information representing the strength of the relevance between the recognition candidate character string and each field, and the normalization process between the similarity and the related information Depending on the presence or absence of a character string, a character string that does not completely match the recognition target character string may be generated.

そこで、本実施の形態は、算出部４が、図２の認識候補文字妥当性評価ステップＳ１０７における評価値算出の際に、初期設定ステップＳ１０１において設定されたパラメータ（ここでは、「業務」の種類）に基づいて、各認識候補文字に付与された関連情報の評価値への寄与度を分野毎に変化させる形態である。以下、上記の実施の形態１と異なる点について、図２を用いて説明する。 Therefore, in the present embodiment, the calculation unit 4 uses the parameters (in this case, the type of “business”) set in the initial setting step S101 when calculating the evaluation value in the recognition candidate character validity evaluation step S107 in FIG. ), The degree of contribution to the evaluation value of the related information given to each recognition candidate character is changed for each field. Hereinafter, differences from the first embodiment will be described with reference to FIG.

例えば、利用者が図１の文字列生成装置３０を用いて実施する「業務」の種類が「住所録」であるとする。この場合、初期設定ステップＳ１０１において制御部１が初期設定を実行する際、利用者がこの文字列生成装置３０を用いて実施する「業務」の種類として「住所録」が設定される。そして、制御部１は、対象とすべき分野のすべてについて対象分野決定ステップＳ１０４及び関連情報蓄積ステップＳ１０５を実行する。 For example, it is assumed that the type of “business” performed by the user using the character string generation device 30 in FIG. 1 is “address book”. In this case, when the control unit 1 executes the initial setting in the initial setting step S101, “address book” is set as the type of “business” performed by the user using the character string generation device 30. And the control part 1 performs object field determination step S104 and related information accumulation | storage step S105 about all the fields which should be made into object.

算出部４は、「住所録」に対応する対象分野「住所」、「姓」、「名」、「一般」の各々について、各認識候補文字の評価値を算出するときの関連情報の寄与度を用いて、認識候補文字妥当性評価ステップＳ１０７における評価値算出を実行する。 The calculation unit 4 contributes the related information when calculating the evaluation value of each recognition candidate character for each of the target fields “address”, “last name”, “first name”, and “general” corresponding to the “address book”. Is used to execute the evaluation value calculation in the recognition candidate character validity evaluation step S107.

具体的には、算出部４は、上記の「業務」の種類である「住所録」に対応する対象分野「住所」、「姓」、「名」、「一般」ごとに各認識候補文字列の評価値を算出するときに、「住所」、「姓」、「名」についての評価値を算出するときには関連情報の寄与を相対的に大きくし、「一般」についての評価値を算出するときには関連情報の寄与を相対的に小さくする。ここでは、対象分野「住所」、「姓」、「名」、「一般」のうち、上記の「住所録」との関連性が相対的に強い分野「住所」、「姓」、「名」の関連情報を例えば２倍とし、上記の「住所録」との関連性が相対的に弱い分野「一般」の関連情報を１倍としている。算出部４は、例えば、制御部１のメモリに、「業務」に含まれる分野（対象分野）のうち、いずれの分野が当該「業務」との関連性が強く、いずれの分野が比較的弱いかを表わす情報があらかじめ蓄積しておき、算出部４がその情報に基づいて、関連情報を変更すればよい。 Specifically, the calculation unit 4 sets each recognition candidate character string for each of the target fields “address”, “last name”, “first name”, and “general” corresponding to the “address book” that is the type of “business”. When calculating the evaluation value of, when calculating the evaluation value for "address", "last name", and "first name", relatively increase the contribution of related information, and when calculating the evaluation value for "general" Make the contribution of related information relatively small. Here, among the subject fields “address”, “last name”, “first name”, “general”, the fields “address”, “last name”, “first name” that are relatively strongly related to the above “address book” The related information in the field “general”, which is relatively weak in relation to the “address book”, is doubled, for example. For example, the calculation unit 4 stores, in the memory of the control unit 1, any field (target field) included in the “business” is strongly related to the “business”, and any field is relatively weak. Information indicating such information may be stored in advance, and the calculation unit 4 may change the related information based on the information.

そうすることにより、各分野の最高評価値の加算値は、「住所」、「姓」、「名」、「一般」の順に、３６８、３３４、３４７、３４５となる。したがって、この場合であれば、分野「住所」の認識候補文字列である「奈良県」が認識結果文字列として出力されることになる。 By doing so, the added value of the highest evaluation value in each field becomes 368, 334, 347, 345 in the order of “address”, “last name”, “first name”, “general”. Therefore, in this case, “Nara Prefecture”, which is a recognition candidate character string of the field “address”, is output as a recognition result character string.

上記の場合、分野関連情報の寄与度のみを設定したが、類似度の寄与度についても、評価値に対する寄与度を設定してもよい。文字認識部２の精度にも依存するが、例えば、図３に示した類似度のみに従って認識結果文字列を生成した場合に、その認識結果文字列と認識対象文字列とが一致する割合が６０〜８０％程度であれば、類似度及び関連情報の各々の寄与度を、関連情報が類似度に対して１０〜３５％程度評価値に寄与するように設定すればよい。そうすることにより、類似度のみに従って認識結果文字列を生成する場合と比べて、類似度に要求される精度を低下させることができる。このため、文字認識部２が文字認識する際に処理すべきデータ量が低減され、文字認識部２が必要とするメモリ容量が小さくなる。 In the above case, only the contribution degree of the field related information is set, but the contribution degree to the evaluation value may also be set for the contribution degree of the similarity degree. Although depending on the accuracy of the character recognition unit 2, for example, when a recognition result character string is generated according to only the similarity shown in FIG. If it is about ˜80%, the degree of contribution of each of the similarity and the related information may be set so that the related information contributes to the evaluation value of about 10 to 35% with respect to the similarity. By doing so, compared with the case where a recognition result character string is generated according to only the similarity, the accuracy required for the similarity can be reduced. For this reason, the amount of data to be processed when the character recognition unit 2 recognizes characters is reduced, and the memory capacity required by the character recognition unit 2 is reduced.

（実施の形態３）
（文字列生成装置３１の構成）
次に、本発明の実施の形態３について説明する。図７は、本発明の実施の形態３にかかる文字列生成装置３１の概略構成を示すブロック図である。図７において、本実施の形態にかかる文字列生成装置３１は、上記の実施の形態１の文字列生成装置３０と同様に、制御部１と、文字認識部２と、候補蓄積部３と、算出部４と、関連情報蓄積部５と、文字列結合部６と、決定部７と、入力部８と、出力部９と、を備えている。 (Embodiment 3)
(Configuration of the character string generation device 31)
Next, a third embodiment of the present invention will be described. FIG. 7 is a block diagram showing a schematic configuration of the character string generation device 31 according to the third embodiment of the present invention. In FIG. 7, the character string generation device 31 according to the present embodiment is similar to the character string generation device 30 of the first embodiment described above, the control unit 1, the character recognition unit 2, the candidate accumulation unit 3, A calculation unit 4, a related information storage unit 5, a character string combination unit 6, a determination unit 7, an input unit 8, and an output unit 9 are provided.

本実施の形態にかかる文字列生成装置はさらに、照合部６０と、キーワード辞書６１と、備えている。照合部６０は、制御部１と接続されており、文字列結合部６が生成する認識候補文字列をキーワード辞書６１にあらかじめ蓄積されているキーワードと照合を行うものである。以下においては、この照合部６０及びキーワード辞書６１について主として説明するものとし、その他については上記の実施の形態１と同様であるので説明は繰り返さない。 The character string generation device according to the present embodiment further includes a collation unit 60 and a keyword dictionary 61. The collation unit 60 is connected to the control unit 1 and collates the recognition candidate character strings generated by the character string combination unit 6 with keywords stored in the keyword dictionary 61 in advance. In the following description, the collation unit 60 and the keyword dictionary 61 will be mainly described, and the other parts are the same as those in the first embodiment, and thus description thereof will not be repeated.

照合部６０は文字列結合部６が生成する認識候補文字列をキーワード辞書６１と照合する。キーワード辞書６１は分野別のキーワードをあらかじめ蓄積している。キーワード辞書６１として、選定された対象分野の各々に対応する辞書が用意されており、各辞書には、それぞれに対応する分野において利用される頻度の高いキーワードがあらかじめ選定され、記載されている。ここでは、上記の実施の形態１と同様、分野として「住所」、「姓」、「名」、「一般」が設定され、キーワード辞書６１としては、「住所キーワード辞書」、「姓キーワード辞書」、「名キーワード辞書」、「一般キーワード辞書」の４種類があるものとする。 The collation unit 60 collates the recognition candidate character string generated by the character string coupling unit 6 with the keyword dictionary 61. The keyword dictionary 61 stores keywords for each field in advance. As the keyword dictionary 61, a dictionary corresponding to each of the selected target fields is prepared. In each dictionary, keywords that are frequently used in the corresponding fields are selected and described in advance. Here, as in the first embodiment, “address”, “last name”, “first name”, and “general” are set as fields, and the keyword dictionary 61 includes “address keyword dictionary” and “last name keyword dictionary”. Assume that there are four types: “name keyword dictionary” and “general keyword dictionary”.

次に、本実施の形態にかかる文字列生成装置３１の動作について説明する。図８は、本実施の形態にかかる文字列生成装置３１の文字列生成方法の処理手順の一例を示すフローチャートである。図２に示した上記実施の形態１のフローチャートと同一のステップには、同一の番号が付されている。ここでは、上述したように、実施の形態１とは異なる点のみ説明する。 Next, the operation of the character string generation device 31 according to the present embodiment will be described. FIG. 8 is a flowchart illustrating an example of a processing procedure of the character string generation method of the character string generation device 31 according to the present embodiment. The same number is attached | subjected to the step same as the flowchart of the said Embodiment 1 shown in FIG. Here, as described above, only differences from the first embodiment will be described.

文字列生成装置３１の文字列生成方法の処理手順においては、図２の認識候補文字列生成ステップＳ１０８と認識結果文字列決定ステップＳ１０９との間にキーワード辞書との照合ステップＳ２０１が追加されている。 In the processing procedure of the character string generation method of the character string generation device 31, a matching step S201 with a keyword dictionary is added between the recognition candidate character string generation step S108 and the recognition result character string determination step S109 in FIG. .

また、文字列結合部６は、生成した認識候補文字列を、当該認識候補文字列の分野および当該認識候補文字列が含む各認識候補文字列の最高評価値の加算値と対応付けて、制御部１を介して決定部７へ出力する。 Further, the character string combining unit 6 associates the generated recognition candidate character string with the field of the recognition candidate character string and the addition value of the highest evaluation value of each recognition candidate character string included in the recognition candidate character string. The data is output to the determination unit 7 via the unit 1.

例えば、上記の実施の形態１においては、認識候補文字列生成ステップＳ１０８において分野ごとに生成された認識候補文字列「奈良県」、「奈食県」、「茶食県、奈食県」、「茶食県」のうち、各認識候補文字の最高評価値を加算した結果が最大であるのは分野「一般」の認識候補文字列「茶良県」であった。 For example, in the above-described first embodiment, the recognition candidate character strings “Nara Prefecture”, “Nanagi Prefecture”, “Tea Food Prefecture, Natsuku Prefecture”, “ Among the “tea-food prefectures”, the largest result obtained by adding the highest evaluation values of the respective recognition candidate characters is the recognition candidate character string “Chara Prefecture” in the field “general”.

しかし、本実施の形態のキーワード辞書照合ステップＳ２０１においては、照合部６０が、文字列結合部６が対象分野「一般」について生成した認識候補文字列がその対象分野と対応付けられて入力されると、まず、当該認識候補文字列と対応付けられた対象分野「一般」に対応する、キーワード辞書６１の一般キーワード辞書を用いて「茶良県」をキーワード照合する。 However, in the keyword dictionary collation step S201 of the present embodiment, the recognition candidate character string generated by the collation unit 60 for the target field “general” by the collation unit 60 is input in association with the target field. First, “Chara Prefecture” is subjected to keyword matching using the general keyword dictionary of the keyword dictionary 61 corresponding to the target field “general” associated with the recognition candidate character string.

次に、一般キーワード辞書に「茶良県」は無いので、最高評価値を加算した結果が次に高い分野「住所」についての認識候補文字列「奈良県」が、その分野についての住所キーワード辞書と照合され、照合に成功する。 Next, because there is no “Chara Prefecture” in the general keyword dictionary, the recognition candidate character string “Nara Prefecture” for the field “address” with the highest result of adding the highest evaluation value is the address keyword dictionary for that field. Is verified and the verification is successful.

決定部７は、この照合成功の結果に基づき、認識結果文字列決定ステップＳ１０９において、分野「住所」の認識候補文字列である「奈良県」を認識結果文字列として決定し、出力部９が、認識結果文字列出力ステップＳ１１０において、決定部７によって決定された認識結果文字列の出力を実行する。 Based on the result of this collation success, the determination unit 7 determines “Nara Prefecture”, which is a recognition candidate character string of the field “address”, as a recognition result character string in the recognition result character string determination step S109, and the output unit 9 In the recognition result character string output step S110, the recognition result character string determined by the determination unit 7 is output.

すなわち、照合部６０は、文字列結合部６が生成した認識候補文字列と、キーワード辞書６１が有する、上記認識候補文字列の分野についてのキーワード辞書に含まれるキーワードとを照合する処理を、分野ごとに生成された認識候補文字列のそれぞれについて、当該認識候補文字列の最高評価値の加算値が大きい順に行い、決定部７は、照合部６０が最初に照合に成功した認識候補文字列を認識結果文字列として決定する。 That is, the collation unit 60 performs a process of collating the recognition candidate character string generated by the character string combination unit 6 with the keyword included in the keyword dictionary for the field of the recognition candidate character string that the keyword dictionary 61 has. Each of the recognition candidate character strings generated for each of the recognition candidate character strings is performed in descending order of the highest evaluation value of the recognition candidate character strings. It is determined as a recognition result character string.

（実施の形態４）
次に、本発明の実施の形態４について説明する。上記の実施の形態３においては、評価値の加算値が一番高い認識候補文字列から順に（換言すれば、評価値の加算値が一番高い分野から順に）と、その認識候補文字列の分野についてのキーワード辞書との照合によって認識結果文字列を決定している。このとき処理時間が長くかかるのは、キーワード辞書６１のオープン、クローズ及び、認識候補文字列とキーワード辞書との照合との照合である。上記の実施の形態３では、まず評価値の加算値が一番高い一般分野で照合し、次に住所分野で照合した結果、正解文字列を得ている。もし、住所分野から照合処理に入ったら正しい文字列生成に要する時間は大幅に縮小される。 (Embodiment 4)
Next, a fourth embodiment of the present invention will be described. In Embodiment 3 described above, the recognition candidate character strings are evaluated in order from the recognition candidate character string having the highest evaluation value addition value (in other words, from the field having the highest evaluation value addition value). The recognition result character string is determined by collation with the keyword dictionary for the field. At this time, it takes a long time to open and close the keyword dictionary 61 and to collate the recognition candidate character string with the keyword dictionary. In the third embodiment, the correct character string is obtained as a result of collation first in the general field where the added value of the evaluation value is the highest and then collation in the address field. If collation processing is entered from the address field, the time required to generate a correct character string is greatly reduced.

この問題を解決するため、本実施の形態では、制御部１が分野特徴文字を用いて照合部６０の照合順を決定する。この分野特徴文字とは、対応する分野に特徴的な文字のことであり、例えば、住所分野であれば、「都」、「道」、「府」、「県」、「市」、「区」、「群」、「町」、「村」、「丁」、「目」、「番」、「地」、「棟」、「号」、「室」等、姓分野であれば「佐」、「斎」、「藤」等の文字が該当する。これら分野特徴文字は、制御部１を構成するメモリにあらかじめ記憶しておけばよい。 In order to solve this problem, in the present embodiment, the control unit 1 determines the collation order of the collation unit 60 using field characteristic characters. The field characteristic characters are characters characteristic of the corresponding field. For example, in the case of an address field, “city”, “road”, “fu”, “prefecture”, “city”, “city” ”,“ Group ”,“ Town ”,“ Village ”,“ Ding ”,“ Eye ”,“ Number ”,“ Land ”,“ Building ”,“ No. ”,“ Room ”, etc. "," Sai "," wisteria ", etc. These field characteristic characters may be stored in advance in a memory constituting the control unit 1.

図９は、本実施の形態にかかる文字列生成装置３１の文字列生成方法の処理手順を示すフローチャートである。図８に示した上記実施の形態３のフローチャートと同一のステップには、同一の番号が付されている。ここでは、実施の形態３とは異なる点のみ説明する。 FIG. 9 is a flowchart showing the processing procedure of the character string generation method of the character string generation device 31 according to the present embodiment. The same steps as those in the flowchart of the third embodiment shown in FIG. Here, only differences from the third embodiment will be described.

図９に示すように、キーワード辞書照合ステップＳ２０１の前に、制御部１は、キーワード辞書の照合順決定ステップＳ３０１を実行する。このキーワード辞書照合順決定ステップＳ３０１では、制御部１が、照合部６０がどの分野からキーワード辞書６１との照合を行うべきかを照合順序ルールを参照することによって決定する。この照合の順序は、例えば、制御部１のメモリに蓄積された照合順序ルールによって決定される。 As shown in FIG. 9, before the keyword dictionary collation step S201, the control unit 1 executes a keyword dictionary collation order determination step S301. In this keyword dictionary collation order determination step S301, the control unit 1 determines which field the collation unit 60 should collate with the keyword dictionary 61 by referring to the collation order rule. This collation order is determined by, for example, collation order rules stored in the memory of the control unit 1.

この照合順序ルールとしては、例えば、上記の実施の形態１のように各認識候補文字の類似度及び関連情報の各寄与度に差をつける方法の場合であれば、「各認識候補文字としての分野特徴文字の評価値を、分野特徴文字が属する認識候補文字列の文字数で割った値が４３以上である同一の分野の分野特徴文字の数が、認識候補文字列の文字数の３０％を超える場合は、当該分野特徴文字に対応する分野から照合する」ルールが挙げられる。図６に示した上記の実施の形態１の場合、「県」の評価値が１３０となっており、認識候補文字列が３文字なので、1文字当たりでは４３．３となる。このため、「県」が文字列全体に占める割合は３文字中１文字のため、３０％以上となり、このルールが適用される。つまり、住所分野の上記分野特徴文字として「県」を選定した場合に、評価値４３．３／文字となる「県」の分野である「住所」が照合の第１候補に変わる。 As this collation order rule, for example, in the case of a method of making a difference between the similarity of each recognition candidate character and each contribution degree of related information as in the first embodiment, “as each recognition candidate character. The number of field characteristic characters in the same field in which the value obtained by dividing the evaluation value of the field characteristic character by the number of characters of the recognition candidate character string to which the field characteristic character belongs is equal to or greater than 43% exceeds the number of characters of the recognition candidate character string. In this case, a rule “match from the field corresponding to the field characteristic character” is included. In the case of the first embodiment shown in FIG. 6, since the evaluation value of “prefecture” is 130 and the recognition candidate character string is 3 characters, 43.3 per character. For this reason, the ratio of “prefecture” to the whole character string is 30% or more because it is 1 character out of 3 characters, and this rule is applied. That is, when “prefecture” is selected as the above-mentioned field characteristic character in the address field, “address”, which is the field of “prefecture” having an evaluation value of 43.3 / character, is changed to the first candidate for collation.

すなわち、制御部１は、ある分野について生成された認識候補文字列を、どの分野のキーワード辞書と最初に照合するかを、当該認識候補文字列に含まれる分野特徴文字の割合に基づいて決定する。 That is, the control unit 1 determines which field's keyword dictionary is first checked against the recognition candidate character string generated for a certain field based on the ratio of the field characteristic characters included in the recognition candidate character string. .

その結果、キーワード辞書照合ステップＳ２０１においては、照合部６０が、最高評価値の加算値で最高となった一般分野で得られた「茶良県」を、最初に、住所キーワード辞書と照合することになる。この例では、照合の結果は不成功に終わり（ステップＳ３０２にてＮＯ）、照合部（置換部）６０は、一般分野の認識候補文字列「茶良県」の先頭の認識候補文字を類似度の高い順に置き換え（ステップＳ３０３）、照合部（置換部）６０はは、認識候補文字がなくなるまでキーワード辞書と照合する（ステップＳ３０３及び３０４にてＮＯ）。この例では、一般分野における認識候補文字列の先頭の認識候補文字は「奈」が第２候補であるので、「奈良県」と住所キーワード辞書が照合され、照合に成功する（ステップＳ３０２にてＹＥＳ）。 As a result, in the keyword dictionary collation step S201, the collation unit 60 first collates “Chara Prefecture” obtained in the general field having the highest value of the highest evaluation value with the address keyword dictionary. become. In this example, the result of the collation is unsuccessful (NO in step S302), and the collation unit (replacement unit) 60 determines the similarity of the first recognition candidate character of the recognition candidate character string “Chara Prefecture” in the general field. Are replaced in descending order (step S303), and the collation unit (replacement unit) 60 collates with the keyword dictionary until there is no recognition candidate character (NO in steps S303 and 304). In this example, since “N” is the second candidate for the first recognition candidate character in the recognition candidate character string in the general field, “Nara Prefecture” and the address keyword dictionary are collated, and the collation is successful (in step S302). YES).

決定部７は、この照合成功の結果に基づき、認識結果文字列決定ステップＳ１０９において、「奈良県」を認識結果文字列として決定し、出力部９が、認識結果文字列出力ステップＳ１１０において、決定部７によって決定された認識結果文字列の出力を実行する。 The determination unit 7 determines “Nara Prefecture” as the recognition result character string in the recognition result character string determination step S109 based on the result of the collation success, and the output unit 9 determines in the recognition result character string output step S110. The recognition result character string determined by the unit 7 is output.

本実施の形態によれば、キーワード辞書６１のキーワード辞書のオープン、クローズの回数を減らすことができるので、照合部６０による照合時間を削減し、照合部６０による照合処理に必要なメモリ容量を削減することができる。 According to the present embodiment, since the number of times of opening and closing the keyword dictionary of the keyword dictionary 61 can be reduced, the matching time by the matching unit 60 is reduced, and the memory capacity required for the matching process by the matching unit 60 is reduced. can do.

なお、上記の実施の形態３及び４において各分野のキーワード辞書を用いて認識候補文字列との照合を行うのは、上記の実施の形態１及び２において生成される認識候補文字列が現実にありえる文字列か否かを判定するためである。現実にありえない文字列を生成しても意味がないため、各分野のキーワードを含むか否かのチェックを行っている。 Note that the recognition candidate character strings generated in the above-described first and second embodiments are actually collated with the recognition candidate character strings using the keyword dictionary in each field in the third and fourth embodiments. This is for determining whether the character string is possible. Since it is meaningless to generate a character string that is impossible in reality, it is checked whether or not it includes keywords in each field.

（実施の形態５）
次に、本発明の実施の形態５について説明する。上記の実施の形態３及び４では、キーワード辞書との照合は完全一致の場合を想定して説明されている。これに対し、本実施の形態は、完全一致はしないが部分一致する場合にも適合するように、図８及び図９のキーワード辞書照合ステップＳ２０１の処理内容を拡張した形態である。 (Embodiment 5)
Next, a fifth embodiment of the present invention will be described. In Embodiments 3 and 4 described above, collation with the keyword dictionary has been described assuming a perfect match. On the other hand, in the present embodiment, the processing contents of the keyword dictionary collation step S201 of FIGS. 8 and 9 are expanded so as to be adapted even in the case of partial matching although not complete matching.

本実施の形態においては、照合部６０が、キーワード辞書からキーワードを取り出し、認識候補文字列と照合し、認識候補文字列にキーワードが含まれるか否かを調べる。キーワードが認識候補文字列に含まれていれば、キーワードは認識候補文字列に含まれるという判定結果が得られる。認識候補文字列よりキーワードの文字列長の方が長い場合は認識候補文字列にワイルドカードを付けてキーワードの文字数と合うようにして照合を行う。ワイルドカードはどんな文字でも一致することを示す。照合部６０による照合結果には、追加したワイルドカードの文字数を記録しておき、結果を使用する際に用いられるようにする。照合部６０による上記の照合結果は、適宜、制御部１のメモリに記憶される。 In the present embodiment, the collation unit 60 takes out a keyword from the keyword dictionary, collates it with a recognition candidate character string, and checks whether or not the keyword is included in the recognition candidate character string. If the keyword is included in the recognition candidate character string, a determination result that the keyword is included in the recognition candidate character string is obtained. When the character string length of the keyword is longer than the recognition candidate character string, a wild card is attached to the recognition candidate character string so as to match the number of characters of the keyword. Wildcards indicate that any character matches. In the collation result by the collation unit 60, the number of characters of the added wild card is recorded and used when the result is used. The above collation result by the collation unit 60 is appropriately stored in the memory of the control unit 1.

図１０は、分野「住所」のキーワード辞書のキーワード「奈良県」に対して、認識候補文字列が「奈食県天理市」だった場合の例である。この状態で認識候補文字列の先頭から照合を行うと、キーワードの１文字目、３文字目は一致するが、２文字目は一致しないという結果になる。上記の実施の形態３及び４では、判定結果はキーワードなしという判定結果になる。しかし、認識候補文字列に間違いがあり、間違いを直せば一致するという場合は、掘り下げて調べる必要がある。 FIG. 10 shows an example of the case where the recognition candidate character string is “Tenri City, Nana Prefecture” for the keyword “Nara Prefecture” in the keyword dictionary of the field “Address”. When collation is performed from the beginning of the recognition candidate character string in this state, the first character and the third character of the keyword are matched, but the second character is not matched. In Embodiments 3 and 4 described above, the determination result is a determination result indicating that there is no keyword. However, if there is a mistake in the recognition candidate character string and it matches if the mistake is corrected, it is necessary to delve into it.

そこで、本実施の形態では、最初の単純照合を行った際の結果に関して設定条件を決めておく。この条件は、例えば「キーワード長の８０％以上が一致している」、「不一致文字がキーワード長２文字、３文字の場合で１文字以内、６文字以下で２文字以内、７文字以上は３文字以内」といった形式であらかじめ設定しておく。最初の単純照合の際に完全一致でなくても、上記の設定条件を満たす範囲で一致している場合に、照合部６０は、認識候補文字列の間違いの可能性を調査する。 Therefore, in the present embodiment, setting conditions are determined with respect to the result when the first simple collation is performed. This condition is, for example, “80% or more of the keyword length is matched”, “If the mismatched character has a keyword length of 2 characters or 3 characters, 1 character or less, 6 characters or less and 2 characters or less, 7 characters or more are 3 characters. Set in advance in a format such as “within characters”. Even if the first simple matching is not complete matching, the matching unit 60 investigates the possibility of an error in the recognition candidate character string when matching is performed within the range satisfying the above setting condition.

図１０の例では、２文字目の認識候補文字は、文字認識処理の際、正しい文字「良」が第１候補にならなかった例である。上記の設定条件を満たした場合、照合部６０は、第２候補以降の認識候補文字を参照し、不一致位置にあるべき正しい認識候補文字があるかどうかを調べる。図１０の例では、２文字目の認識対象文字についての候補文字群に含まれる認識候補文字と対応するキーワードの文字「良」との一致を順に調べていくと、第３位に「良」があり、一致する。このため、キーワード「奈良県」は認識対象文字列内にあると判定される。上記の類似度または評価値について閾値を設定し、探索する範囲を閾値以上のものに限定することも可能である。 In the example of FIG. 10, the second recognition candidate character is an example in which the correct character “good” is not the first candidate during the character recognition process. When the above setting conditions are satisfied, the collation unit 60 refers to the recognition candidate characters after the second candidate and checks whether there is a correct recognition candidate character that should be at the mismatch position. In the example of FIG. 10, when the match between the recognition candidate character included in the candidate character group for the second character to be recognized and the character “good” of the corresponding keyword is sequentially examined, “good” is ranked third. There is a match. Therefore, it is determined that the keyword “Nara Prefecture” is in the recognition target character string. It is also possible to set a threshold value for the above similarity or evaluation value and limit the search range to a threshold value or more.

図１１は、図８及び図９のキーワード辞書照合ステップＳ２０１における処理手順を示すフローチャートである。キーワード辞書照合ステップＳ２０１の処理が部分一致まで拡張されている。ここでは、あるキーワードとある認識候補文字列のある位置でマッチングした際の動作を述べる。実際には、認識候補文字列のマッチング位置ごと、及び、キーワードごとに図１１の処理を呼び出す。 FIG. 11 is a flowchart showing a processing procedure in the keyword dictionary collation step S201 of FIGS. The processing of the keyword dictionary collation step S201 is extended to partial matching. Here, an operation when matching a certain keyword with a certain recognition candidate character string at a certain position will be described. Actually, the processing of FIG. 11 is called for each matching position of the recognition candidate character string and for each keyword.

図１１に示すように、まず、照合部６０がマッチングステップを実行する（ステップＳ４０１）。このマッチングステップＳ４０１では、キーワードと認識候補文字列のある位置で通常のマッチングを行う。 As shown in FIG. 11, the collation part 60 performs a matching step first (step S401). In this matching step S401, normal matching is performed at a position where the keyword and the recognition candidate character string are located.

次に、照合部６０が文字数の集計ステップを実行する（ステップＳ４０２）。この文字数集計ステップ４０２では、マッチングステップＳ４０１におけるマッチング結果から、キーワードと認識候補文字列との間で一致した文字数（一致文字数）を集計する。 Next, the collation part 60 performs the totaling step of the number of characters (step S402). In this character count tabulation step 402, the number of matching characters (number of matching characters) between the keyword and the recognition candidate character string is tabulated from the matching result in the matching step S401.

そして、キーワードの文字数と一致文字数とが一致した場合には（ステップＳ４０３ＹＥＳ）、照合部６０はキーワードありと判定する（ステップＳ４０４）。このキーワードあり判定ステップＳ４０４では、認識候補文字列中に現在マッチング対象にしているキーワードがあると判定する。 If the number of characters in the keyword matches the number of matching characters (YES in step S403), the matching unit 60 determines that there is a keyword (step S404). In this keyword determination step S404, it is determined that there is a keyword that is currently a matching target in the recognition candidate character string.

一方、キーワードと認識候補文字列とが完全に一致しない場合には（ステップＳ４０３ＮＯ）、照合部６０はあらかじめ設定された上記の設定条件を満足するか否かを判断する（ステップＳ４０５）。そして、満足しない場合には（ステップＳ４０５ＮＯ）、照合部６０は、キーワードなしと判定する（ステップＳ４０６）。このキーワードなし判定ステップ４０６では、認識候補文字列中に現在マッチング対象にしているキーワードはないと判定する。 On the other hand, when the keyword and the recognition candidate character string do not completely match (NO in step S403), the collation unit 60 determines whether or not the preset setting condition is satisfied (step S405). And when not satisfied (step S405 NO), the collation part 60 determines with no keyword (step S406). In the no keyword determination step 406, it is determined that there is no keyword currently targeted for matching in the recognition candidate character string.

また、上記の設定条件を満足する場合には（ステップＳ４０５ＹＥＳ）、照合部６０は不一致文字の位置を特定する（ステップＳ４０７）。この不一致文字位置特定ステップＳ４０７では、キーワードと認識候補文字列とで一致しない文字の位置を特定する。 If the above setting condition is satisfied (YES in step S405), the collation unit 60 identifies the position of the mismatched character (step S407). In this mismatched character position specifying step S407, the position of the character that does not match between the keyword and the recognition candidate character string is specified.

次に、照合部６０は、不一致文字が属する候補文字群に含まれる他の認識候補文字を参照する（ステップＳ４０８）。この認識候補文字参照ステップＳ４０８では、不一致文字位置特定ステップＳ４０７において特定された不一致文字位置のうちの１つにおける上記他の認識候補文字の１つを選択する。 Next, the collation unit 60 refers to another recognition candidate character included in the candidate character group to which the unmatched character belongs (step S408). In this recognition candidate character reference step S408, one of the other recognition candidate characters at one of the mismatched character positions specified in the mismatched character position specifying step S407 is selected.

そして、認識候補文字参照ステップＳ４０８において選択された認識候補文字と上記不一致文字とを入れ替えた認識候補文字列中におけるキーワードの有無が判断される（ステップＳ４０９）。そして、キーワードが無ければ（ステップＳ４０９ＮＯ）、照合部６０はキーワードなしと判定する（ステップＳ４０６）。 Then, it is determined whether or not there is a keyword in the recognition candidate character string in which the recognition candidate character selected in the recognition candidate character reference step S408 is replaced with the mismatched character (step S409). If there is no keyword (NO in step S409), the collation unit 60 determines that there is no keyword (step S406).

一方、キーワードが有れば（ステップＳ４０９ＹＥＳ）、照合部６０は、マッチング時の全ての不一致文字をステップＳ４０７〜ステップＳ４０９の手順で処理したかどうかを判断する（ステップＳ４１０）。認識結果によっては、複数の不一致文字が存在する場合もあるためである。不一致文字がまだある場合には（ステップＳ４１０ＮＯ）、ステップＳ４０７に戻って不一致文字の検出を続ける。不一致文字がない場合には（ステップＳ４１０ＹＥＳ）、キーワードありと判定する（ステップＳ４０４）。 On the other hand, if there is a keyword (YES in step S409), the collation unit 60 determines whether all unmatched characters at the time of matching have been processed in the procedure of steps S407 to S409 (step S410). This is because there may be a plurality of mismatched characters depending on the recognition result. If there is still a mismatch character (NO in step S410), the process returns to step S407 to continue detection of the mismatch character. If there is no mismatched character (YES in step S410), it is determined that there is a keyword (step S404).

図１２に、本実施の形態のキーワード辞書照合ステップＳ２０１の処理結果例を示す。図１０に示した認識候補文字列「奈食県天理市」に対し、「住所」、「姓」、「名」、「一般」の各辞書中のキーワードとマッチングした際の結果を示している。分野ごとに、上記の認識候補文字列とキーワードをマッチングさせた際に、一致したキーワードとその一致した認識候補文字列中の位置を記録する。認識候補文字列中の位置は、認識候補文字列の先頭を基準としてキーワードの開始位置を表示している。また、出力文字数よりキーワードの方が長く、出力文字列にワイルドカードを付けてキーワードとマッチした場合は追加したワイルドカードの数も記録する。図１２の例では、住所辞書にキーワード「奈良」、「奈良県」、「天理市」があり、姓辞書にキーワード「奈良」がある場合で、認識候補文字列とのマッチングでこれらのキーワードが発見された場合である。内容はマッチしたキーワード、キーワードの出力文字列中での位置、マッチング時に追加したワイルドカードの個数である。 FIG. 12 shows an example of the processing result of the keyword dictionary collation step S201 of the present embodiment. For the recognition candidate character string “Tenri City, Nago Prefecture” shown in FIG. 10, the result of matching with keywords in the “address”, “last name”, “first name”, and “general” dictionaries is shown. . When the recognition candidate character string and the keyword are matched for each field, the matched keyword and the position in the matching recognition candidate character string are recorded. The position in the recognition candidate character string indicates the start position of the keyword with reference to the beginning of the recognition candidate character string. In addition, if the keyword is longer than the number of output characters and a wild card is added to the output character string to match the keyword, the number of added wild cards is also recorded. In the example of FIG. 12, the keywords “Nara”, “Nara Prefecture”, and “Tenri City” are in the address dictionary and the keyword “Nara” is in the surname dictionary, and these keywords are matched with the recognition candidate character strings. This is the case. The contents are the matched keyword, the position of the keyword in the output string, and the number of wildcards added during matching.

なお、これらの情報は、例えば、出力部９から出力された認識結果文字列に含まれるキーワードに関する情報として、利用者に提供してもよい。 In addition, you may provide such information to a user as information regarding the keyword contained in the recognition result character string output from the output part 9, for example.

（実施の形態６）
（文字列生成装置３２の構成）
次に、本発明の実施の形態６について説明する。図１３は、本実施の形態にかかる文字列生成装置３２の概略構成を示すブロック図である。図１３において、本実施の形態にかかる文字列生成装置３２は、上記の実施の形態５の文字列生成層と同様、制御部１と、文字認識部２と、候補蓄積部３と、算出部４と、関連情報蓄積部５と、文字列結合部６と、決定部７と、入力部８と、出力部９と、照合部６０と、キーワード辞書６１と、を備えている。 (Embodiment 6)
(Configuration of the character string generation device 32)
Next, a sixth embodiment of the present invention will be described. FIG. 13 is a block diagram showing a schematic configuration of the character string generation device 32 according to the present embodiment. In FIG. 13, the character string generation device 32 according to the present embodiment is similar to the character string generation layer of the fifth embodiment described above, in the control unit 1, the character recognition unit 2, the candidate storage unit 3, and the calculation unit. 4, a related information storage unit 5, a character string combination unit 6, a determination unit 7, an input unit 8, an output unit 9, a collation unit 60, and a keyword dictionary 61.

本実施の形態にかかる文字列生成装置３２はさらに、分野解析部（キーワード解析手段）１３０と、書き換え規則記憶部１３１と、備えている。分野解析部１３０は、制御部１と接続されており、文字列結合部６が生成する認識候補文字列に含まれるキーワードの分野を解析し、その解析結果に基づいて、その認識候補文字列の確度（確実さの程度）を付加する。書き換え規則記憶部１３１は、分野解析部１３０がキーワード解析の際に利用するキーワード分野の書き換え規則をあらかじめ記憶している。なお、分野解析部１３０は、算出部４や、照合部６０と同じ技術で実現可能であり、書き換え規則記憶部１３１はキーワード辞書と同様の蓄積部で実現可能である。 The character string generation device 32 according to the present embodiment further includes a field analysis unit (keyword analysis unit) 130 and a rewrite rule storage unit 131. The field analysis unit 130 is connected to the control unit 1, analyzes a keyword field included in the recognition candidate character string generated by the character string combining unit 6, and determines the recognition candidate character string based on the analysis result. Add accuracy (degree of certainty). The rewrite rule storage unit 131 stores keyword field rewrite rules used in advance by the field analysis unit 130 when performing keyword analysis. The field analysis unit 130 can be realized by the same technique as the calculation unit 4 and the collation unit 60, and the rewrite rule storage unit 131 can be realized by an accumulation unit similar to the keyword dictionary.

（分野解析部１３０及び書き換え規則記憶部１３１の詳細）
次に、分野解析部１３０及び書き換え規則記憶部１３１について説明する。図１４は、上記の実施の形態５のキーワード辞書照合ステップＳ２０１の処理結果例を示す。図１４の処理結果例は、認識候補文字列「奈良県警」をキーワード辞書で照合した場合の結果である。図１４に示すように、「奈良県警」の分野が住所であるとすると、住所キーワード辞書との照合により「奈良県」がマッチし、「警」はどの辞書ともマッチしない状況になる。一方、「奈良」が住所あるいは「姓」とすると、一般分野のキーワード辞書にエントリされている「県警」との組み合わせになる。 (Details of the field analysis unit 130 and the rewrite rule storage unit 131)
Next, the field analysis unit 130 and the rewrite rule storage unit 131 will be described. FIG. 14 shows an example of the processing result of the keyword dictionary matching step S201 of the fifth embodiment. The processing result example of FIG. 14 is a result when the recognition candidate character string “Nara Prefectural Police” is collated with a keyword dictionary. As shown in FIG. 14, if the field of “Nara Prefectural Police” is an address, “Nara Prefecture” matches by matching with the address keyword dictionary, and “Police” does not match any dictionary. On the other hand, if “Nara” is an address or “last name”, it is combined with “prefectural police” entered in the keyword dictionary in the general field.

生成された認識候補文字列が同じ分野の部分文字列の組み合わせで生成されていれば認識対象文字列の分野依存が確認でき、且つ、生成された文字列も正しい確率が高くなる。しかし、上記の場合のように、どの辞書ともマッチしない文字列を含む場合や、分野の違う文字列の組み合わせの場合には、生成された文字列は正しい確率が低くなってしまう。 If the generated recognition candidate character string is generated by a combination of partial character strings in the same field, the field dependence of the recognition target character string can be confirmed, and the generated character string has a high probability of being correct. However, as described above, when a character string that does not match any dictionary is included, or in the case of a combination of character strings in different fields, the generated character string has a low probability of being correct.

本実施の形態は、上記のような場合でも、生成される認識候補文字列に、その認識候補文字列の確からしさを示す精度情報を、評価値とは別に付加することにより、正しい認識結果文字列が生成される確率を高くするものである。 In this embodiment, even in the above case, correct recognition result characters are added to the generated recognition candidate character string by adding accuracy information indicating the likelihood of the recognition candidate character string separately from the evaluation value. This increases the probability that a sequence is generated.

図１５に、図１３に示した書き換え規則記憶部１３１に記憶された書き換え規則の内容の一例を示す。図１５において、左側の要素は文字または分野である。「＋」記号は、文字または分野どうしを結合可能であることを示している。〔〕は分野を示している。〔〕の付かない文字は単なる文字で、〔〕の中の文字は分野を表わしている。したがって、〔住所〕は分野「住所」という意味である。図１５では、複数の文字列の組み合わせからなる認識候補文字列の各文字列の分野からその認識候補文字列全体としての分野を解析するための、分野「住所」についての書き換え規則を中心に示している。ここで、〔数〕は分野「数字」で、算用数字、漢数字、ローマ数字などが該当する。〔未〕はいずれのキーワード辞書ともマッチしない場合に付けられる未定義の分野を持つ文字あるいは文字列という意味であり、文字列全体の結果が〔未〕の場合であれば分野解析不能ということである。〔＊〕はキーワード辞書のいずれかとマッチすることを意味し、〔姓名〕は〔姓〕と〔名〕をつないだ姓名という分野である。また、「→」は書き換えが可能であることを示している。 FIG. 15 shows an example of the contents of the rewrite rule stored in the rewrite rule storage unit 131 shown in FIG. In FIG. 15, the left element is a character or a field. The “+” sign indicates that characters or fields can be combined. [] Indicates a field. Characters without [] are simply characters, and characters in [] represent fields. Therefore, [address] means the field "address". FIG. 15 mainly shows rewrite rules for the field “address” for analyzing the field of each recognition candidate character string as a whole from the field of each character string of a recognition candidate character string formed by a combination of a plurality of character strings. ing. Here, [number] is the field “number”, which includes arithmetic numbers, Chinese numerals, Roman numerals, and the like. [Un] means a character or character string with an undefined field that is attached when it does not match any keyword dictionary. If the result of the entire string is [Un], field analysis is impossible. is there. [*] Means matching with any of the keyword dictionaries, and [last name] is the field of surname given by connecting [last name] and [first name]. Further, “→” indicates that rewriting is possible.

図１６は、本発明の実施の形態６にかかる文字列生成装置の文字列生成方法の処理手順を示すフローチャートである。図９及び図１１に示した上記実施の形態５のフローチャートと同一のステップには、同一の番号が付されている。ここでは、上記の実施の形態５とは異なる点のみ説明する。 FIG. 16 is a flowchart showing the processing procedure of the character string generation method of the character string generation device according to the sixth embodiment of the present invention. The same steps as those in the flowchart of the fifth embodiment shown in FIGS. 9 and 11 are denoted by the same reference numerals. Here, only differences from the fifth embodiment will be described.

図１６において、分野解析部１３０は、図１４に示した処理結果例に対して分野解析を行う（ステップＳ５０１）。例えば、認識候補文字列が「奈良県警」の場合、この「奈良県警」は、〔住所〕の「奈良県」と〔未〕との組み合わせであると、分野解析部１３０が判定した場合、図１５の書き換え規則１２０により解析不能という結果になる。 In FIG. 16, the field analysis unit 130 performs field analysis on the processing result example shown in FIG. 14 (step S501). For example, when the recognition candidate character string is “Nara Prefectural Police”, the field analysis unit 130 determines that this “Nara Prefectural Police” is a combination of “Nara Prefecture” and [Not] in [Address]. The 15 rewrite rules 120 result in inability to analyze.

解析不能の場合には（ステップＳ５０２ＮＯ）、分野解析部１３０は、照合部６０の照合結果（ここでは、図１４の処理結果例）に基づいて、〔住所〕を含めて他の分野のキーワードの組み合わせが無いかを調べ（ステップＳ５０４）、あれば（ステップＳ５０４ＹＥＳ）、異なる分野候補に対し再度「奈良県警」についての分野解析を行う（ステップＳ５０１）。図１４の処理結果例であれば、「奈良県警」について（「奈良県」〔住所〕）（「警」〔未〕）→〔未〕という書き換え規則が適用されるのであれば、上記と同じ結果となるが、（「奈良」〔住所〕）（「県警」（一般））→〔一般〕という書き換え規則が適用されれば、解析成功という結果となる（Ｓ５０２にてＹＥＳ）。 When the analysis is impossible (NO in step S502), the field analysis unit 130 determines the keywords of other fields including [address] based on the collation result of the collation unit 60 (here, the processing result example of FIG. 14). If there is no combination (step S504), if it is present (step S504 YES), the field analysis for “Nara Prefectural Police” is performed again for different field candidates (step S501). In the case of the processing result example of FIG. 14, if the rewrite rule “(Nara Prefecture” [address]) (“guard” [not]) → [not]] is applied to “Nara Prefectural Police”, the same as above. As a result, if the rewrite rule (“Nara” [address]) (“prefectural police” (general)) → [general] is applied, the result of analysis is successful (YES in S502).

すなわち、分野解析部１３０は、認識候補文字列が、所定のキーワードの組み合わせから構成されているかどうかを判定するキーワード判定のための複数の判定基準（書き換え規則）のうちの１つである第１の判定基準（第１の書き換え規則）を用いてキーワード判定を行い、当該認識候補文字列が所定のキーワードの組み合わせから構成されていないと第１の判定基準に基づいて判定した場合に、第１の判定基準とは異なる第２の判定基準（第２の書き換え規則）を用いてキーワード判定を行う。 That is, the field analysis unit 130 is one of a plurality of determination criteria (rewrite rules) for keyword determination that determines whether or not the recognition candidate character string is composed of a combination of predetermined keywords. When the keyword is determined using the determination criterion (first rewrite rule) and it is determined based on the first determination criterion that the recognition candidate character string is not composed of a predetermined keyword combination, the first Keyword determination is performed using a second determination criterion (second rewriting rule) different from the determination criterion.

分野解析部１３０は、解析成功との結果に基づき、確度が高いという情報（＋確度情報）を認識候補文字列である「奈良県警」に付加する（ステップＳ５０３）。一方、他の分野候補の組み合わせが無く（ステップＳ５０４にてＮＯ）、分野解析不成功で終わる場合は、確度が低いという情報（−確度情報）を、その認識候補文字列に付加する（ステップＳ５０６）。 The field analysis unit 130 adds information that accuracy is high (+ accuracy information) to the recognition candidate character string “Nara Prefectural Police” based on the result of successful analysis (step S503). On the other hand, when there is no combination of other field candidates (NO in step S504) and the field analysis is unsuccessful, information indicating that the accuracy is low (-accuracy information) is added to the recognition candidate character string (step S506). ).

決定部７は、この確度情報に基づき、認識結果文字列決定ステップＳ１０９において、＋確度情報が付加された認識候補文字列である「奈良県警」を認識結果文字列として決定し、出力部９が、認識結果文字列出力ステップＳ１１０において、決定部７によって決定された認識結果文字列の出力を実行する。 Based on this accuracy information, the determination unit 7 determines “Nara Prefectural Police”, which is a recognition candidate character string to which + accuracy information is added, as a recognition result character string in the recognition result character string determination step S109, and the output unit 9 In the recognition result character string output step S110, the recognition result character string determined by the determination unit 7 is output.

このように、分野解析を行うことによって、生成された文字列の確度を判定し、＋確度情報が付加された認識候補文字列を認識結果文字列として決定し、出力することにより、認識結果文字列の認識精度を高めることができる。 In this way, by performing field analysis, the accuracy of the generated character string is determined, the recognition candidate character string to which the + accuracy information is added is determined as a recognition result character string, and the recognition result character string is output. Column recognition accuracy can be increased.

なお、本発明は、上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 The present invention is not limited to the above-described embodiments, and various modifications can be made within the scope of the claims, and the technical means disclosed in different embodiments can be appropriately combined. Such embodiments are also included in the technical scope of the present invention.

最後に、各実施の形態にかかる文字列生成装置の各ブロック、特に制御部１は、ハードウェアロジックによって構成してもよいし、次のようにＣＰＵを用いてソフトウェアによって実現してもよい。 Finally, each block of the character string generation device according to each embodiment, in particular, the control unit 1 may be configured by hardware logic, or may be realized by software using a CPU as follows.

すなわち、各実施の形態にかかる文字列生成装置は、各機能を実現する制御プログラムの命令を実行するＣＰＵ（ｃｅｎｔｒａｌｐｒｏｃｅｓｓｉｎｇｕｎｉｔ）、前記プログラムを格納したＲＯＭ（ｒｅａｄｏｎｌｙｍｅｍｏｒｙ）、前記プログラムを展開するＲＡＭ（ｒａｎｄｏｍａｃｃｅｓｓｍｅｍｏｒｙ）、前記プログラム及び各種データを格納するメモリ等の記憶装置（記録媒体）などを備えている。そして、本発明の目的は、上述した機能を実現するソフトウェアである光ディスク装置の制御プログラムのプログラムコード（実行形式プログラム、中間コードプログラム、ソースプログラム）をコンピュータで読み取り可能に記録した記録媒体を、前記光ディスク装置に供給し、そのコンピュータ（又はＣＰＵやＭＰＵ）が記録媒体に記録されているプログラムコードを読み出し実行することによっても、達成可能である。 That is, the character string generation device according to each embodiment expands a CPU (central processing unit) that executes instructions of a control program that implements each function, a ROM (read only memory) that stores the program, and the program. A random access memory (RAM), a storage device (recording medium) such as a memory for storing the program and various data, and the like are provided. An object of the present invention is to provide a recording medium in which a program code (execution format program, intermediate code program, source program) of a control program for an optical disc apparatus, which is software that realizes the functions described above, is recorded in a computer-readable manner, This can also be achieved by supplying the optical disc apparatus and reading and executing the program code recorded on the recording medium by the computer (or CPU or MPU).

前記記録媒体としては、例えば、磁気テープやカセットテープ等のテープ系、フロッピー（登録商標）ディスク／ハードディスク等の磁気ディスクやコンパクトディスク−ＲＯＭ／ＭＯ／ＭＤ／デジタルビデオデイスク／コンパクトディスク−Ｒ等の光ディスクを含むディスク系、ＩＣカード（メモリカードを含む）／光カード等のカード系、あるいはマスクＲＯＭ／ＥＰＲＯＭ／ＥＥＰＲＯＭ／フラッシュＲＯＭ等の半導体メモリ系などを用いることができる。 Examples of the recording medium include a tape system such as a magnetic tape and a cassette tape, a magnetic disk such as a floppy (registered trademark) disk / hard disk, and a compact disk-ROM / MO / MD / digital video disk / compact disk-R. A disk system including an optical disk, a card system such as an IC card (including a memory card) / optical card, or a semiconductor memory system such as a mask ROM / EPROM / EEPROM / flash ROM can be used.

また、各実施の形態にかかる文字列生成装置を通信ネットワークと接続可能に構成し、前記プログラムコードを、通信ネットワークを介して、供給してもよい。この通信ネットワークとしては、特に限定されず、例えば、インターネット、イントラネット、エキストラネット、ＬＡＮ、ＩＳＤＮ、ＶＡＮ、ＣＡＴＶ通信網、仮想専用網（virtual private network）、電話回線網、移動体通信網、衛星通信網等が利用可能である。また、通信ネットワークを構成する伝送媒体としては、特に限定されず、例えば、ＩＥＥＥ１３９４、ＵＳＢ、電力線搬送、ケーブルＴＶ回線、電話線、ＡＤＳＬ回線等の有線でも、ＩｒＤＡやリモコンのような赤外線、Ｂｌｕｅｔｏｏｔｈ（登録商標）、８０２．１１無線、ＨＤＲ、携帯電話網、衛星回線、地上波デジタル網等の無線でも利用可能である。なお、本発明は、前記プログラムコードが電子的な伝送で具現化された、搬送波に埋め込まれたコンピュータデータ信号の形態でも実現され得る。 The character string generation device according to each embodiment may be configured to be connectable to a communication network, and the program code may be supplied via the communication network. The communication network is not particularly limited. For example, the Internet, intranet, extranet, LAN, ISDN, VAN, CATV communication network, virtual private network, telephone line network, mobile communication network, satellite communication. A net or the like is available. Also, the transmission medium constituting the communication network is not particularly limited. For example, even in the case of wired such as IEEE 1394, USB, power line carrier, cable TV line, telephone line, ADSL line, etc., infrared rays such as IrDA and remote control, Bluetooth ( (Registered trademark), 802.11 wireless, HDR, mobile phone network, satellite line, terrestrial digital network, and the like can also be used. The present invention can also be realized in the form of a computer data signal embedded in a carrier wave in which the program code is embodied by electronic transmission.

なお、本発明は、以下のようにも表現することができる。すなわち、本発明にかかる文字列生成装置は、文字認識の候補の列から認識結果としての文字列を生成する文字列生成装置であって、文字認識候補を類似度とともに蓄積する候補蓄積手段と、各文字と予め定められた分野間の関連性を示す情報を蓄積する分野別関連情報蓄積手段と、生成する文字列の信頼度を表す評価値を算出する算出手段と、上記評価値に基づいて文字列を結合する文字列結合手段を備え、上記算出手段によって類似度と分野別関連情報に基づいて認識対象の各文字の分野毎の評価値を求め、最大評価値分野の文字列を結合した文字列を生成する。 The present invention can also be expressed as follows. That is, the character string generation device according to the present invention is a character string generation device that generates a character string as a recognition result from a character recognition candidate sequence, and a candidate storage unit that stores the character recognition candidates together with the similarity, Based on the evaluation value, a field-specific related information storage unit that stores information indicating the relationship between each character and a predetermined field, a calculation unit that calculates an evaluation value representing the reliability of the generated character string, and Character string combining means for combining character strings is provided, and the above calculation means obtains an evaluation value for each field of each character to be recognized based on similarity and field-related information, and combines the character strings of the maximum evaluation value fields. Generate a string.

上記算出手段は評価値の算出に際して文字認識候補の類似度又は分野別関連情報を正規化した数値を用いることが好ましい。 The calculation means preferably uses a numerical value obtained by normalizing the similarity of character recognition candidates or the field-related information when calculating the evaluation value.

上記算出手段は評価値の算出に際して適用する業務の種類に応じて文字認識候補の類似度又は分野別関連情報の分野寄与度に基づいた数値を用いることが好ましい。 The calculation means preferably uses a numerical value based on the similarity of the character recognition candidates or the field contribution of the field related information according to the type of work applied when calculating the evaluation value.

上記文字列生成装置はさらに、上記各分野に対応したキーワードを蓄積したキーワード辞書蓄積手段と、上記生成された文字列とキーワード辞書のキーワードとのマッチングをとる照合手段を備え、上記評価値が最大となる文字列が上記照合手段により照合された上記キーワードと一致し、当該キーワード辞書の分野が前記分野と同一の場合に、上記分野別に結合した文字列を認識結果として生成することが好ましい。 The character string generation device further includes a keyword dictionary storage unit that stores keywords corresponding to the respective fields, and a matching unit that matches the generated character string with a keyword in the keyword dictionary, and the evaluation value is the maximum. When the character string to be matched with the keyword collated by the collating means and the field of the keyword dictionary is the same as the field, it is preferable to generate a character string combined by the field as a recognition result.

上記評価値が最大となる文字列が上記照合手段により照合されたいずれのキーワードともその一部が一致せず、当該不一致文字の認識の他の候補の中に当該不意一文字と置き換えることによって上記文字列がキーワード一致する場合は、当該文字部分を他の候補で置き換えた文字列を認識結果として生成することが好ましい。 The character string with the maximum evaluation value does not partially match any of the keywords collated by the collating means, and the character is replaced by the unexpected character in another candidate for recognition of the mismatched character. When the columns match with each other, it is preferable to generate a character string in which the character portion is replaced with another candidate as a recognition result.

上記文字列生成装置はさらに、上記各分野に対応したキーワードを蓄積したキーワード辞書蓄積手段と、上記生成された文字列とキーワード辞書のキーワードとのマッチングをとる照合手段と、分野項目間の結合による分野項目書き換え規則を蓄積する書き換え規則蓄積手段と、上記分野項目書き換え規則を適用して分野項目を決定する分野解析手段を備え、上記評価値が最大となる文字列の分野が上記書き換え規則を適用することにより、文字列全体が分野解析可能な否かに基づいて生成した文字列の確度情報を付加することが好ましい。 The character string generation device further includes a keyword dictionary storage means for storing keywords corresponding to the respective fields, a matching means for matching the generated character strings with keywords of the keyword dictionary, and a combination between the field items. Rewriting rule storage means for storing field item rewriting rules and field analysis means for determining field items by applying the field item rewriting rules, and applying the rewriting rules to the field of the character string having the maximum evaluation value Thus, it is preferable to add the accuracy information of the character string generated based on whether the entire character string can be analyzed in the field.

対象とする文字列に対し上記書き換え規則を再帰的に適用し分野解析手段で解析した結果解析不能の場合、他の分野属性の候補に対し分野解析を行うことが好ましい。 When the rewrite rule is recursively applied to the target character string and analysis is not possible as a result of analysis by the field analysis means, it is preferable to perform field analysis on other field attribute candidates.

上記分野解析の結果、分野解析可能な場合は生成された文字列の確度を上げ、解析不能の場合生成された文字列の確度を下げることが好ましい。 As a result of the field analysis, it is preferable to increase the accuracy of the generated character string when the field analysis is possible and to decrease the accuracy of the generated character string when the analysis is impossible.

本発明にかかる文字列生成プログラムは、上記の文字列分類装置の上記各手段としてコンピュータを動作させるための文字列生成プログラムである。 A character string generation program according to the present invention is a character string generation program for causing a computer to operate as each of the means of the character string classification device.

本発明にかかるコンピュータ読み取り可能な記録媒体は、上記の文字列生成プログラムを記録した、コンピュータ読み取り可能な記録媒体である。 The computer-readable recording medium concerning this invention is a computer-readable recording medium which recorded said character string production | generation program.

本発明にかかる文字列生成方法は、文字認識の候補の列から認識結果としての文字列を生成する文字列生成装置を用いた文字列生成方法であって、文字認識候補を類似度とともに蓄積する候補蓄積ステップと、各文字と予め定められた分野間の関連性を示す情報を蓄積する分野別関連情報蓄積ステップと、生成する文字列の信頼度を表す評価値を算出する算出ステップと、上記評価値に基づいて文字列を結合する文字列結合ステップを備え、上記算出ステップによって求めた類似度と分野別関連情報に基づいて認識対象の各文字の分野毎の評価値を求めるステップと、最大評価値分野の文字列を結合した文字列を生成するステップを備える。 A character string generation method according to the present invention is a character string generation method using a character string generation device that generates a character string as a recognition result from a character recognition candidate string, and accumulates character recognition candidates together with a similarity. A candidate accumulation step, a field-specific related information accumulation step for accumulating information indicating the relationship between each character and a predetermined field, a calculation step for calculating an evaluation value representing the reliability of the character string to be generated, and A character string combining step for combining character strings based on the evaluation value, and a step of determining an evaluation value for each field of each character to be recognized based on the similarity obtained by the calculation step and the field-related information; And a step of generating a character string obtained by combining the character strings in the evaluation value field.

本発明は、文書原稿を光学的に読み取って得られる画像データより当該文書原稿の文字列を生成する文字列生成装置に適用できる。文字認識結果から少ない計算量でより精度良くキーワードの有無が判定できるため、全体の計算量が削減でき、ＯＣＲの動作の高速化につながる。 The present invention can be applied to a character string generation device that generates a character string of a document original from image data obtained by optically reading the document original. Since the presence / absence of a keyword can be determined more accurately with a small amount of calculation from the character recognition result, the total amount of calculation can be reduced, leading to faster operation of the OCR.

本発明の実施の形態１にかかる文字列生成装置の概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the character string production | generation apparatus concerning Embodiment 1 of this invention. 本発明の実施の形態１にかかる文字列生成装置の文字列生成方法の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the character string production | generation method of the character string production | generation apparatus concerning Embodiment 1 of this invention. 図２の文字認識ステップＳ１０２における処理結果の一例を示す図である。It is a figure which shows an example of the process result in the character recognition step S102 of FIG. 図２の関連情報蓄積ステップＳ１０５における処理結果の一例を示す図である。It is a figure which shows an example of the process result in the relevant information storage step S105 of FIG. 類似度と関連情報との間の相対的な重み付けの例を説明するための図である。It is a figure for demonstrating the example of the relative weighting between a similarity and related information. 図２の認識候補文字妥当性評価ステップＳ１０７における処理結果の一例を示す図である。It is a figure which shows an example of the processing result in recognition candidate character validity evaluation step S107 of FIG. 本発明の実施の形態３にかかる文字列生成装置の概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the character string production | generation apparatus concerning Embodiment 3 of this invention. 本発明の実施の形態３にかかる文字列生成装置の文字列生成方法の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the character string production | generation method of the character string production | generation apparatus concerning Embodiment 3 of this invention. 本発明の実施の形態４にかかる文字列生成装置の文字列生成方法の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the character string production | generation method of the character string production | generation apparatus concerning Embodiment 4 of this invention. 認識候補文字とキーワードとのマッチング結果例を示す図である。It is a figure which shows the matching result example of a recognition candidate character and a keyword. 本発明の実施の形態５のキーワード辞書照合ステップＳ２０１における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in keyword dictionary collation step S201 of Embodiment 5 of this invention. 本発明の実施の形態５のキーワード辞書照合ステップＳ２０１における処理結果例を示す図である。It is a figure which shows the example of a process result in keyword dictionary collation step S201 of Embodiment 5 of this invention. 本発明の実施の形態６にかかる文字列生成装置の概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the character string production | generation apparatus concerning Embodiment 6 of this invention. 本発明の実施の形態５のキーワード辞書照合ステップＳ２０１における処理結果例を示す図である。It is a figure which shows the example of a process result in keyword dictionary collation step S201 of Embodiment 5 of this invention. 図１３の書き換え規則記憶部の内容例を示す図である。It is a figure which shows the example of the content of the rewrite rule memory | storage part of FIG. 本発明の実施の形態６にかかる文字列生成装置の文字列生成方法の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the character string production | generation method of the character string production | generation apparatus concerning Embodiment 6 of this invention.

Explanation of symbols

１制御部
２文字認識部
３候補蓄積部
４算出部（算出手段）
５関連情報蓄積部
６文字列結合部（文字列結合手段）
７決定部（決定手段）
８入力部
９出力部
３０、３１、３２文字列生成装置
６０照合部（照合手段、置換手段）
６１キーワード辞書
１３０分野解析部（キーワード解析手段）
１３１書き換え規則記憶部 DESCRIPTION OF SYMBOLS 1 Control part 2 Character recognition part 3 Candidate storage part 4 Calculation part (calculation means)
5. Related information storage unit 6. Character string coupling unit (character string coupling means)
7 decision part (decision means)
8 Input unit 9 Output unit 30, 31, 32 Character string generation device 60 Verification unit (verification unit, replacement unit)
61 keyword dictionary 130 field analysis unit (keyword analysis means)
131 Rewrite rule storage unit

Claims

Character recognition using similarity between the recognition target character included in the recognition target character string composed of the recognition target characters to be recognized and a plurality of candidate characters that are candidates for recognition results of the recognition target character A character string generator for performing
A candidate character group consisting of one or more candidate characters is acquired for each recognition target character, and the candidate character is used when expressing the similarity and one of a plurality of predetermined concepts. For each candidate character, an evaluation value representing the probability as the recognition result of the recognition target character is used using the frequency or the degree of association representing the degree of relevance between the candidate character and one of the predetermined concepts. Calculating means for calculating for each of the predetermined concepts;
The recognition target is obtained by combining candidate characters selected one by one from the candidate character group corresponding to each recognition target character using the evaluation value for each predetermined concept calculated by the calculation unit. A character string combining means for generating a candidate character string that is a candidate of a character string recognition result for each of the predetermined concepts;
A character string generation apparatus comprising: a determination unit that determines a recognition result character string that is a recognition result of the recognition target character string from candidate character strings generated by the character string combination unit.

The character string combining means selects, for each recognition target character included in the recognition target character string, a highest evaluation candidate character having the highest evaluation value from a group of corresponding candidate characters, and combines the selected highest evaluation candidate characters The character string generation device according to claim 1, wherein the candidate character string is generated.

The determination unit adds the evaluation value of each candidate character included in the candidate character string generated by the character string combination unit for each candidate character string, and uses the total of the evaluation values to determine the recognition result character The character string generation device according to claim 1, wherein a character string is determined.

The said calculation means calculates the said evaluation value by adding the said related degree and the said similarity after weighting with respect to the said related degree or the said similarity. The character string generation device according to any one of the above.

The calculation means acquires an instruction from a user indicating which predetermined concept of the predetermined concepts should be used for calculating the evaluation value, and the acquired related degree The character string generation device according to claim 1, wherein the evaluation value is calculated using a character string.

A keyword dictionary for storing keywords having relevance to any of the predetermined concepts;
Collating means for determining whether or not they match by collating the candidate character strings generated by the character string combining means with the keywords stored in the keyword dictionary;
The character string generation device according to claim 1, wherein the determination unit determines the recognition result character string using a determination result of the collation unit.

The keywords are classified according to the predetermined concept,
The collation means selects a predetermined concept to which a keyword to be collated with the candidate character string belongs according to a predetermined rule, and collates the keyword belonging to the selected predetermined concept with the candidate character string. Item 7. The character string generation device according to Item 6.

When the collating unit determines that the candidate character string and the keyword are partially different within a predetermined condition range, a candidate character group including the candidate character is selected as a candidate character that does not match the keyword. A replacement means for replacing with another candidate character having
The character string generation device according to claim 6, wherein the determination unit adds the candidate character string generated by the replacement unit to the recognition result character string candidate.

When the collating unit determines that the candidate character string and the keyword do not match, the collating unit further includes a keyword analyzing unit that determines whether or not the candidate character string is configured by a combination of a plurality of keywords,
The collation means, when the keyword analysis means determines that the candidate character string is composed of a combination of a plurality of keywords, collates the keyword combination with the candidate character string. Item 8. The character string generation device according to Item 6 or 7.

The keyword analysis means, when the matching means determines that the candidate character string and the keyword do not match, the keyword highly relevant to the predetermined concept to which the candidate character string belongs, and the candidate character string The character string generation device according to claim 9, wherein it is determined whether or not the candidate character string is configured by a combination of a plurality of keywords.

The keyword analysis unit determines that the candidate character string to which the candidate character string belongs when the matching unit determines that the keyword highly relevant to the predetermined concept to which the candidate character string belongs does not match the candidate character string. It is characterized by determining whether or not the candidate character string is composed of a combination of a plurality of keywords by comparing the candidate character string with a keyword highly relevant to a predetermined concept different from the concept The character string generation device according to claim 9.

When the keyword analyzing unit determines that the candidate character string is configured by a combination of a plurality of keywords, the matching unit is based on a matching result obtained by matching the keyword combination with the candidate character string. Give the candidate character string accuracy information indicating the probability of the candidate character string,
The character string generation device according to any one of claims 9 to 11, wherein the determination unit determines the recognition result character string using accuracy information added to the candidate character string.

The character string generation program for operating a computer as said each means of the character string generation apparatus of any one of Claims 1-12.

A computer-readable recording medium on which the character string generation program according to claim 13 is recorded.

Character recognition using similarity between the recognition target character included in the recognition target character string composed of the recognition target characters to be recognized and a plurality of candidate characters that are candidates for recognition results of the recognition target character A string generation method for performing
A candidate character group consisting of one or more candidate characters is acquired for each recognition target character, and the candidate character is used when expressing the similarity and one of a plurality of predetermined concepts. For each candidate character, an evaluation value representing the probability as the recognition result of the recognition target character is used using the frequency or the degree of association representing the degree of relevance between the candidate character and one of the predetermined concepts. A calculation step of calculating for each predetermined concept;
Using the evaluation value for each predetermined concept calculated in the calculation step, combining the candidate characters selected one by one from the candidate character group corresponding to each of the recognition target characters, the recognition target A character string combining step for generating a candidate character string that is a candidate of a character string recognition result for each of the predetermined concepts;
A character string generation method comprising: a determination step of determining a recognition result character string that is a recognition result of the recognition target character string from among the candidate character strings generated in the character string combining step.