JPH0258664B2

JPH0258664B2 -

Info

Publication number: JPH0258664B2
Application number: JP56180363A
Authority: JP
Inventors: Shuzo Oowaku; Akio Nagano; Katsuhide Tanoshima; Masayoshi Yurugi
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1981-11-12
Filing date: 1981-11-12
Publication date: 1990-12-10
Also published as: JPS5882373A

Description

【発明の詳細な説明】本発明は、情報処理機器の入力装置として用い
られるオンライン手書文字認識装置に関わる認識
方法に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a recognition method related to an online handwritten character recognition device used as an input device for information processing equipment.

従来の手書文字入力ワードプロセツサは手書文
字を認識するためにオンライン手書文字認識技術
を用いていたが、例えば数字、アルフアベツト、
ひらがな、漢字を認識する場合、数字10ケ、アル
フアベツト26ケ、ひらがな46文字及び濁点、半濁
点をはじめとする記号類を含むと約200字あり、
又、漢字をJIS C6226第１水準漢字集合に限つて
も2965字存在する。漢字を常用漢字にしぼつても
1945字あり計2000文字を越えることとなる。 Conventional handwritten character input word processors use online handwritten character recognition technology to recognize handwritten characters, such as numbers, alphabets,
When recognizing hiragana and kanji, there are approximately 200 characters, including 10 numbers, 26 alphanumeric characters, 46 hiragana characters, and symbols such as voiced and half-voiced marks.
Furthermore, even if we limit kanji to the JIS C6226 first level kanji set, there are 2965 characters. Even if kanji are reduced to common kanji
There are 1945 characters, totaling over 2000 characters.

さて、漢字を当用漢字とし合計2000字強の文字
を認識する手書文字入力日本語ワードプロセツサ
も発表されているがその認識処理は、例えばＫ―
Ｌ展開法を用いて、漢字を直接認識せんとする
等、ハードウエア量が大となりすぎるという欠点
があつた。又、前記の例に限らず手書漢字を数
字、アルフアベツト、ひらがな等と同じアルゴリ
ズムで直接認識せんとするため、認識のための計
算量が膨大となり処理時間がかかることとなり、
この処理時間を短縮するためハードウエア量の増
加を招くという、実用化という観点からみた場合
重大な欠点があつた。 Now, a handwritten character input Japanese word processor that recognizes over 2,000 characters in total using kanji as regular kanji has been announced, but its recognition processing is difficult, for example, with K--
The disadvantage was that the amount of hardware required was too large, as the L-expansion method was used to directly recognize kanji. In addition, not only the above example but also handwritten kanji are not directly recognized using the same algorithm as numbers, alphabets, hiragana, etc., so the amount of calculation for recognition is enormous and processing time is required.
In order to shorten this processing time, the amount of hardware increases, which is a serious drawback from the viewpoint of practical use.

本発明は前記の欠点を除去することを目的と
し、漢字を細分化して細分化された文字の部分集
合を認識し前記認識された部分集合の集まりとし
て漢字を識別するものであつて、等価的に認識す
る漢字の字数を減づると共に辞書に要する記憶容
量を減づるもので認識対象の文字辞書は前記文字
の部分集合の系列の形で登録できるため認識文字
系数の増加にともなう記憶容量の増加を低くおさ
えることができるという特徴を有する。 The present invention aims to eliminate the above-mentioned drawbacks, and is aimed at subdividing kanji, recognizing subsets of the subdivided characters, identifying kanji as a collection of the recognized subsets, and identifying the kanji as a collection of the recognized subsets. This reduces the number of kanji characters to be recognized and also reduces the storage capacity required for the dictionary.Since the character dictionary to be recognized can be registered in the form of a series of subsets of the characters, the storage capacity increases as the number of recognized character systems increases. It has the characteristic of being able to keep the amount low.

以下図面により実施例を説明する。 Examples will be described below with reference to the drawings.

第１図は本発明の一実施例を示すブロツク図で
あつて１はタブレツト、２は認識部、３は漢字を
細分化した文字の部分集合（以下擬似部首とい
う）の特徴と擬似部首コードを格納した擬似部首
辞書、４は認識部２で認識された擬似部首コード
を格納する入力レジスタ、５は入力レジスタ４内
に格納されている１ケ又は複数の擬似部首コード
より文字を選択する選択回路、６は擬似部首コー
ドと文字コードを格納した文字辞書の如く構成さ
れて居る。タブレツト１より入力されたストロー
ク情報は、認識部２へ送られる。認識部２はオン
ライン手書に好適な周知のストロークアナリシス
法やＫ―Ｌ展開法等により各擬似部首を認識する
ように働く。認識部２で認識する際は、擬似部首
の特徴と擬似部首コードを格納した擬似部首辞書
３を用いる。認識部２の出力は、擬似部首コード
の形で入力レジスタ４に認識される都度出力され
格納される。入力レジスタ４より格納されている
擬似部首コードが順次出力され、選択回路５に入
力される。選択回路５では、文字辞書６の内容と
入力レジスタ４の出力の擬似部首コードとにより
文字を選択し結果の文字コードを出力する。 FIG. 1 is a block diagram showing one embodiment of the present invention, in which 1 is a tablet, 2 is a recognition unit, and 3 is a feature of a subset of characters (hereinafter referred to as pseudo-radicals) obtained by subdividing kanji characters and pseudo-radicals. 4 is an input register that stores the pseudo radical code recognized by the recognition unit 2; 5 is a character from one or more pseudo radical codes stored in the input register 4; A selection circuit 6 for selecting is configured like a character dictionary storing pseudo radical codes and character codes. Stroke information input from the tablet 1 is sent to the recognition section 2. The recognition unit 2 works to recognize each pseudo radical using the well-known stroke analysis method, KL expansion method, etc. suitable for online handwriting. When the recognition unit 2 performs recognition, a pseudo-radical dictionary 3 that stores pseudo-radical features and pseudo-radical codes is used. The output of the recognition unit 2 is outputted and stored in the input register 4 in the form of a pseudo-radical code each time it is recognized. The pseudo radical codes stored in the input register 4 are sequentially output and input to the selection circuit 5. The selection circuit 5 selects a character based on the contents of the character dictionary 6 and the pseudo radical code output from the input register 4, and outputs the resulting character code.

第２図は擬似部首辞書３の一例を示す。000よ
り続く16進の数字は擬似部首コードを示し、擬似
部首コードの右側には各擬似部首が記載されてい
る。実際の辞書では、各擬似部首の位置には認識
部２の認識アルゴリズムに基く特徴データが記載
されるがここでは説明の便のため各擬似部首その
ものを示す。なお擬似部首辞書３には、漢字以外
のひらがな、数字、アルフアベツト等は細分化さ
れず直接格納されている。 FIG. 2 shows an example of the pseudo radical dictionary 3. The hexadecimal numbers following 000 indicate pseudo-radical codes, and each pseudo-radical is written on the right side of the pseudo-radical code. In an actual dictionary, feature data based on the recognition algorithm of the recognition unit 2 is written at the position of each pseudo-radical, but each pseudo-radical itself is shown here for convenience of explanation. Note that in the pseudo radical dictionary 3, hiragana, numbers, alphanumeric characters, etc. other than kanji are directly stored without being subdivided.

第３図は、入力レジスタ４の詳細を示す。９は
認識部２よりの出力、１０〜１７は入力レジスタ
４内のI₀レジスタ〜I₇レジスタ、１８は切換回
路、１９は入力レジスタ４の出力を示す。 FIG. 3 shows details of the input register 4. Reference numeral 9 indicates the output from the recognition unit 2, 10 to 17 indicate the _I0 register to _I7 register in the input register 4, 18 indicates the switching circuit, and 19 indicates the output of the input register 4.

第４図は文字辞書６の一部分を示す。第４図第
３行目は、擬似部首コード176で示される「立」
という擬似部首と擬似部首コード0FBで示される
「日」という擬似部首により「音」という文字で
あることを示し、「音」という文字の文字コード
は、JIS C6226コードで323Bというコードである
ことを示す。尚、第４図の（）内文字は説明の
便のため記載したもので実際の辞書は擬似部首コ
ードと文字コードで構成される。 FIG. 4 shows a portion of the character dictionary 6. The third line in Figure 4 is “Tachi” indicated by the pseudo radical code 176.
The pseudo radical ``日'', which is indicated by the pseudo radical ``日'' and the pseudo radical code 0FB, indicates that it is the character ``音'', and the character code for the character ``音'' is the code 323B in the JIS C6226 code. Show that something is true. Note that the characters in parentheses in FIG. 4 are shown for convenience of explanation, and the actual dictionary is composed of pseudo radical codes and character codes.

第５図は、「彰」という文字を入した時の本発
明による処理を示すため入力レジスタ４のI₀レジ
スタ１０〜I₇レジスタ１７へ入力される擬似部首
コードを示したものである。 FIG. 5 shows the pseudo radical codes input to the _I0 register 10 to _I7 register 17 of the input register 4 to show the processing according to the present invention when the character "Akira" is entered.

以下、第５図を中心として本発明によるオンラ
イン手書文字認識の方法について「彰」という文
字を例にとり詳細に説明する。なお、各ストロー
クの開始・終了は例えばタブレツトを入力ペンな
どで押圧・押圧解除する場合に得られる電圧など
の発生・消滅に対応するものとする。 Hereinafter, the online handwritten character recognition method according to the present invention will be described in detail with reference to FIG. 5, taking the character "Akira" as an example. Note that the start and end of each stroke correspond to the generation and disappearance of a voltage, etc., obtained when pressing and releasing pressure on a tablet with an input pen or the like, for example.

先づ第１ストロークでタブレツト１より操作者
が「〓」を入力すると「〓」は認識部２へ出力さ
れ、認識部２に於いて擬似部首辞書（以下辞書と
いう。）３を用いて擬似部首として登録されてい
るか否かを検定するが「〓」は辞書３に登録され
ていないため、未定義コード＊をI₀レジスタ１０
に登録する。ついで第２ストロークで「〓」がタ
ブレツト１より入力されると、タイミングT₁で
未定義の「〓」と合せて「〓」という擬似部首が
辞書３にあるかどうかを認識部２において検定す
ると、第２図で示される様に擬似部首コード050
として「〓」という擬似部首が登録されているた
め、I₀レジスタ１０に050というコードが格納さ
れる。 First, in the first stroke, when the operator inputs "〓" from the tablet 1, "〓" is output to the recognition unit 2, and the recognition unit 2 uses a pseudo radical dictionary (hereinafter referred to as dictionary) 3 to generate a pseudo radical. It tests whether it is registered as a radical, but since "〓" is not registered in the dictionary 3, the undefined code * is sent to I ₀ register 10.
Register. Then, when "〓" is input from the tablet 1 in the second stroke, the recognition unit 2 tests whether the pseudo radical "〓" exists in the dictionary 3 along with the undefined "〓" at timing _T1 . Then, as shown in Figure 2, the pseudo radical code 050
Since the pseudo-radical "〓" is registered, the code 050 is stored in the _I0 register 10.

第３ストロークで入力される「〓」は擬似部首
が辞書３に登録されていないため、I₀レジスタ１
０はそのままにしてI₁レジスタ１１に未定義コー
ド＊を登録する。 "〓" input in the third stroke is a pseudo radical that is not registered in the dictionary 3, so it is entered in the I ₀ register 1.
0 is left as is and an undefined code * is registered in the _I1 register 11.

第４ストロークで入力された「〓」は、辞書３
に登録されていないが第３ストロークで未定義の
「〓」と合せて「〓」という擬似部首が辞書３に
014という擬似部首コードで登録されているため、
I₁レジスタ１１の未定義コードを消去して新たに
014というコードがセツトされる。なお、擬似部
首コード050と014で新たな擬似部首となるかどう
かについて「〓」という文字で辞書３を用いて検
定するが「〓」という文字は独立の擬似部首とし
て辞書３に存在しないため、I₀レジスタ１０、I₁
レジスタ１１の内容は変らないで保持される。 “〓” entered in the fourth stroke is in the dictionary 3
The pseudo radical "〓" is added to Dictionary 3 in combination with the undefined "〓" which is not registered in the third stroke.
Because it is registered with the pseudo radical code 014,
I ₁ Erase the undefined code of register 11 and create a new one.
The code 014 is set. In addition, the character "〓" is used to test whether pseudo radical codes 050 and 014 are new pseudo radicals using Dictionary 3, but the character "〓" exists in Dictionary 3 as an independent pseudo radical. I ₀ register 10, I ₁
The contents of register 11 are held unchanged.

第５ストロークで「〓」が入力されると「〓」
という文字は辞書３より擬似部首コード004であ
ることが判明するためI₂レジスタ１２に004がセ
ツトされ、その後「〓」及び「立」について擬似
部首コードが辞書３に登録されているか否かを検
定する。即ち、その文字内の全ストロークについ
て最小個数の擬似部首コードで表現するために検
定を行なうこととなる。この場合「〓」は擬似部
首コードとして登録されてなく「立」は、擬似部
首コード176として辞書に登録されている。従つ
てI₀レジスタ１０、I₀レジスタ１１、I₂レジスタ
１２をリセツトし、I₀レジスタ１０に176を登録
する。かくして「立」という文字は、第２図で示
す擬似部首コード176で示される１ケの擬似部首
であることを示すこととなる。 When “〓” is input in the fifth stroke, “〓”
It is determined from the dictionary 3 that the character ``is'' pseudo-radical code 004, so 004 is set in the _I2 register 12, and then it is checked whether pseudo-radical codes for ``〓'' and ``tate'' are registered in the dictionary 3. to test whether In other words, a test is performed to express all strokes within the character using the minimum number of pseudo-radical codes. In this case, "〓" is not registered as a pseudo-radical code, and "stand" is registered as pseudo-radical code 176 in the dictionary. Therefore, _I0 register 10, _I0 register 11, and _I2 register 12 are reset, and 176 is registered in _I0 register 10. Thus, the character ``tate'' indicates that it is a single pseudo-radical as shown in the pseudo-radical code 176 shown in FIG.

同様にして第５図に示すように「彰」という文
字に関して、結果的に擬似部首コード176，0FB，
045，065で表わされる文字であることが認識され
る。なお、第５ストロークで３ケの擬似部首コー
ドよりなる入力文字を再検定して１ケの擬似部首
コード176を識別したのと同様に、第９ストロー
クでは擬似部首コード021よりなる文字と未定義
の「〓」及び「〓」より擬似部首コード0FBなる
文字「日」を識別し、さらに又第11，13，14スト
ロークにおいてもそれぞれ２ケの擬似部首コード
と識別された入力文字から１ケの擬似部首コード
を識別している。この様にして一種の最長一致法
により擬似部首コードの検定を行なつている。一
般的に最長一致法の場合、その入力すべてが入力
され終つてから一致を見ることが一般的であ。即
ち、入力された全ストロークに対して判定し、擬
似部首と認められない最後の１ストロークを除い
て判定するという手法をとるが擬似部首コードと
して辞書３に登録されている擬似部首の数が数
字、アルフアベツト、ひらがな、記号等を加えて
も600ケ強であり多きな数にならないこと、更に
は、人が文字を入力する速度が遅いことを勘案し
て、入力順に検定を行なつているものである。ま
た１つの文字の入力の終了は最終ストロークを検
知することによつて行なわれるが具体的には各ス
トロークの間の時間を監視して所定の時間を越え
ると次の文字の入力操作に移行したこととし前ス
トロークが最終ストロークであつたということで
検知する方法や１つの文字入力の領域を予め定め
ておく方法やキー操作で指示する方法などがあ
る。 Similarly, as shown in Figure 5, for the character "Akira", the resulting pseudo radical code is 176, 0FB,
It is recognized that the characters are 045, 065. In addition, in the same way that in the 5th stroke, the input character consisting of 3 pseudo-radical codes was re-examined and 1 pseudo-radical code 176 was identified, in the 9th stroke, the character consisting of the pseudo-radical code 021 was identified. and undefined "〓" and "〓", the character "day" with pseudo radical code 0FB is identified, and furthermore, the 11th, 13th, and 14th strokes are also identified as 2 pseudo radical codes each. One pseudo radical code is identified from a character. In this way, pseudo-radical codes are tested using a type of longest match method. Generally, in the case of the longest match method, a match is generally checked after all of the inputs have been input. In other words, a method is used in which all input strokes are judged, excluding the last stroke that is not recognized as a pseudo-radical. Even if you add numbers, alphanumeric characters, hiragana, symbols, etc., the number is just over 600, which is not a large number.Furthermore, taking into consideration the fact that people input characters at a slow speed, we performed the test in the order in which they were entered. It is something that In addition, the end of inputting one character is done by detecting the final stroke, but specifically, the time between each stroke is monitored, and when a predetermined time is exceeded, the input operation for the next character is started. There are a method of detecting that the previous stroke was the last stroke, a method of predetermining an area for inputting one character, and a method of instructing by key operation.

以上説明したように、タブレツト１より「彰」
を入力することにより入力レジスタ４内のI₀レジ
スタ１０〜I₃レジスタ１３内に擬似部首コード
176，0FB，045，065が格納される。これら入力
レジスタ４内のI₀レジスタ１０〜I₇レジスタ１７
の内容は切換回路１８により出力１９に順次導出
され、選択回路５に入力される。選択回路５で
は、入力された擬似部首コードにより、第４図に
示す文字辞書６を用いてJIS C6226による文字コ
ードを選択する。すなわち、入力レジスタ４内の
I₀レジスタ１０〜I₇レジスタ１７に格納されてい
る擬似部首コードが176，0FB，045，065である
ことにより選択回路５により、文字辞書６を調べ
ると第４図に示す如く擬似部首コードが176、
0FB，045，065である文字は3E34となる文字コ
ードで示される漢字「彰」であることが判明す
る。 As explained above, "Akira" is better than Tablet 1.
By inputting the pseudo radical code into _I0 register 10 to _I3 register 13 in input register 4,
176, 0FB, 045, 065 are stored. _I0 register 10 to _I7 register 17 in these input registers 4
The contents of are sequentially led out to the output 19 by the switching circuit 18 and input to the selection circuit 5. The selection circuit 5 selects a character code according to JIS C6226 using the character dictionary 6 shown in FIG. 4 based on the input pseudo radical code. That is, in input register 4
Since the pseudo-radical codes stored in the _I0 register 10 to _I7 register 17 are 176, 0FB, 045, 065, the selection circuit 5 searches the character dictionary 6 and finds the pseudo-radical as shown in FIG. The code is 176,
It turns out that the characters 0FB, 045, and 065 are the kanji ``Akira'', which is indicated by the character code 3E34.

以上の様にして3E34なるJIS C6226文字コード
が選択回路５より出力されることによりタブレツ
ト１より入力された手書文字が漢字「彰」である
ことが認識される。 As described above, the JIS C6226 character code 3E34 is output from the selection circuit 5, thereby recognizing that the handwritten character input from the tablet 1 is the kanji character "Ang".

以上詳細に説明したように前記実施例に於いて
は、漢字を細分化して、細分化された文字の擬似
部首となづけた部分集合を認識し前記認識された
擬似部首の集まりとして漢字を識別する方法を示
す。ここで擬似部首を認識するためには、簡単な
アルゴリズムの認識部２と、漢字の数に比して極
めて少数の擬似部首からなる擬似部首辞書３でよ
いこととなる。例えば数字、アルフアベツト、ひ
らがな、記号類及びJIS C6626第１水準漢字集合
2965字のための擬似部首の数は600強であり、こ
の内400強が漢字2965字のための擬似部首である。
このように漢字2965字の字数を本発明によれば認
識時のみ等価的に減少させる効果を有することと
なる。又、第２図に示す擬似部首辞書３の内容を
認識するための認識部２の機能は、周知のストロ
ークアナリシス法等の簡単なアルゴリズムでよい
ことはその道の専門家であれば容易に理解できる
ものと考える。更にこれら認識された擬似部首の
集まりとして漢字を含む文字を識別するための文
字辞書６は、第４図に示す様に単に擬似部首コー
ドとJIS C6226による漢字コードだけで構成でき
るため漢字を含めた文字の字数が多くなつた場合
でも極めて少量のメモリしか増加しないことも本
発明の利点であり、擬似部首辞書３及び文字辞書
６を合せても、直接漢字を含む文字の特徴を記録
した従来の辞書の容量が大であつたのと相違して
極めて少量となるため、オンライン手書文字認識
を、その対象を漢字2965字より構成されるJIS
C6226第１水準漢字集合にまで広げたとしても、
手軽に提供することができ、もつて、情報処理装
置全般にとつて好適な入力装置を安価に提供する
ことができる。 As explained in detail above, in the above embodiment, a kanji is subdivided, a subset of the subdivided characters is recognized, and a kanji is created as a collection of the recognized pseudo-radicals. Show how to identify. In order to recognize pseudo-radicals here, it is sufficient to use a recognition unit 2 with a simple algorithm and a pseudo-radical dictionary 3 consisting of a very small number of pseudo-radicals compared to the number of Chinese characters. For example, numbers, alphabets, hiragana, symbols, and JIS C6626 level 1 kanji set
The number of pseudoradicals for the 2965 characters is over 600, of which over 400 are pseudoradicals for the 2965 kanji characters.
In this way, the present invention has the effect of equivalently reducing the number of 2965 Chinese characters only during recognition. Furthermore, it is easily understood by experts in the field that the function of the recognition unit 2 for recognizing the contents of the pseudo-radical dictionary 3 shown in FIG. 2 can be performed using a simple algorithm such as the well-known stroke analysis method. Think of it as possible. Furthermore, the character dictionary 6 for identifying characters including kanji as a collection of these recognized pseudo-radicals can be composed of only pseudo-radical codes and kanji codes according to JIS C6226, as shown in Figure 4, so it is difficult to identify kanji. Another advantage of the present invention is that even when the number of characters included increases, only a very small amount of memory is added, and even if the pseudo radical dictionary 3 and the character dictionary 6 are combined, the characteristics of characters including kanji can be directly recorded. Unlike the large capacity of conventional dictionaries, the capacity of online handwritten character recognition is extremely small.
Even if expanded to the C6226 first level kanji set,
It is possible to provide an input device that is easy to provide and is suitable for information processing devices in general at a low cost.

前記実施例では基本的な要素について説明した
が、以下に示す様に各種の改良を実施することに
よりよりよいオンライン手書文字認識方法を提供
することができるので以下に説明する。 Although the basic elements have been explained in the above embodiment, a better online handwritten character recognition method can be provided by implementing various improvements as shown below, which will be explained below.

第１に、前記実施例では標準的な擬似部首辞書
３を１ケだけ設けたが、標準的な辞書の他に使用
者各個人用の擬似部首個人辞書を１ケ又は複数個
設け、認識しづらい擬似部首を含んだ文字の擬似
部首について後から追加登録ができる構成とすれ
ば、より一層認識率の向上が図れるとともに認識
アルゴリズム自体も簡易なものでなくなるという
効果がある。 First, in the above embodiment, only one standard pseudo-radical dictionary 3 was provided, but in addition to the standard dictionary, one or more pseudo-radical personal dictionaries for each user were provided. If a configuration is adopted in which pseudo radicals of characters including pseudo radicals that are difficult to recognize can be additionally registered later, the recognition rate can be further improved and the recognition algorithm itself is not simple.

第２に、前記実施例では文字辞書６の内容とし
て擬似部首コードと文字コードだけの組合せとし
たが第４図の例でも明らな様に同じ「立」という
文字でも「立」単独で１つの文字を構成するも
の、「妾」、「音」、「章」、「意」、「童」の様に文
字
の上部に「〓」が位置するもの、「彰」、「韻」の
様に文字の左上部に「〓」が位置するもの等があ
る。これらの「立」は前記実施例では全く同一に
扱つて居り、字数が実施例程度の場合問題はない
が、字数を更に増加させたい場合においてはこれ
らの「立」という文字の位置情報を文字辞書６に
含ませれば更に認識率を向上させることができ
る。位置情報としてはJIS C6226字形索引第４項
の字形構成を用いれば充分である。 Second, in the above embodiment, the content of the character dictionary 6 is a combination of only pseudo-radical codes and character codes, but as is clear from the example in FIG. Those that make up a single character, those that have a 〓 at the top of the character such as ``concubine'', ``on'', ``chapter'', ``in'', and ``dong'', ``sho'', and ``rhyme''. For example, there are cases where "〓" is located at the upper left of the character. These ``tachi'' are treated exactly the same in the above example, and there is no problem if the number of characters is about the same as in the example, but if you want to further increase the number of characters, you can change the position information of these characters ``tachi'' to a character. If it is included in the dictionary 6, the recognition rate can be further improved. It is sufficient to use the glyph structure in item 4 of the JIS C6226 Glyph Index as position information.

第３に前記実施例では、第５図を用いて説明し
た如く新しいストロークがタブレツト１より入力
される毎に第１ストロークまでさかのぼつて今書
かれた前記新しいストローク迄が１つの擬似部首
を構成しているかどうかについて、入力されたス
トロークについて検定すべく説明した。例えば擬
似部首「〓」と「〓」と「〓」は擬似部首コード
では050，014，004であるが、この３ケの擬似部
首が集つた「立」も擬似部首でありそのコードは
176であることを「立」のストロークにより擬似
部首辞書３を参照して求めた。 Third, in the above embodiment, as explained using FIG. 5, each time a new stroke is input from the tablet 1, one pseudo radical is traced back to the first stroke up to the new stroke just written. The explanation was given to verify whether the input strokes are correct or not. For example, the pseudo-radicals "〓", "〓", and "〓" are pseudo-radical codes 050, 014, 004, but "tachi", which is a collection of these three pseudo-radicals, is also a pseudo-radical and its The code is
176 was determined by referring to the Pseudo Radical Dictionary 3 using the stroke of ``tate''.

しかしながら、認識部２より参照する辞書とし
て擬似部首辞書３の他に、（擬似部首050）＋（擬似
部首014）＋（擬似部首004）＝（擬似部首176）の如
き擬似部首間の相関々係を示す擬似部首相関辞書
を設ければ、認識処理時間が極めて早くなるとい
う効果がある。 However, in addition to the pseudo-radical dictionary 3 as a dictionary referred to by the recognition unit 2, pseudo-radicals such as (pseudo-radical 050) + (pseudo-radical 014) + (pseudo-radical 004) = (pseudo-radical 176) Providing a pseudo-radical correlation dictionary that indicates correlations between necks has the effect of extremely shortening the recognition processing time.

以上詳細に説明した様に、本発明は認識が困難
な漢字を細分化しこの細分化した擬似部首を認識
し擬似部首の集まりとして漢字を識別するため、
先づ認識辞書に要する記憶容量が極小でよく更に
認識文字数の増加にともなう記憶容量の増加を極
めて低くおさえられ更に、前記説明の通り認識す
るのは擬似部首というカナレベルのものであるた
め認識率そのものも高く保てるという好適なオン
ライン手書文字認識方法を安価に提供することが
できる。 As explained in detail above, the present invention subdivides kanji that are difficult to recognize, recognizes the subdivided pseudo radicals, and identifies kanji as a collection of pseudo radicals.
First, the memory capacity required for the recognition dictionary is extremely small, and the increase in memory capacity as the number of recognized characters increases is kept extremely low.Furthermore, as explained above, what is recognized is pseudo-radicals, which are at the kana level, so the recognition rate is low. It is possible to provide a suitable online handwritten character recognition method at a low cost, and the cost itself can be kept high.

[Brief explanation of the drawing]

第１図は本発明の一実施例のブロツク図、第２
図は擬似部首辞書の１例を示す図、第３図は入力
レジスタの詳細図、第４図は文字辞書の部分図、
第５図は本発明による認識方法を示す図である。１…タブレツト、２…認識部、３…擬似部首辞
書、４…入力レジスタ、５…選択回路、６…文字
辞書、９…認識部２よりの出力、１０〜１７…入
力レジスタ４内のI₀レジスタ〜I₇レジスタ、１８
…切換回路、１９…入力レジスタ４の出力。 FIG. 1 is a block diagram of one embodiment of the present invention, and FIG.
The figure shows an example of a pseudo-radical dictionary, Figure 3 is a detailed diagram of an input register, Figure 4 is a partial diagram of a character dictionary,
FIG. 5 is a diagram showing a recognition method according to the present invention. DESCRIPTION OF SYMBOLS 1... Tablet, 2... Recognition unit, 3... Pseudo-radical dictionary, 4... Input register, 5... Selection circuit, 6... Character dictionary, 9... Output from recognition unit 2, 10-17... I in input register 4 ₀ register to I ₇ register, 18
...Switching circuit, 19...Output of input register 4.

Claims

[Claims] 1. An online character recognition method for recognizing handwritten characters, comprising: a tablet for extracting stroke information of handwritten characters; and storing characteristic information of pseudo radicals including subset patterns obtained by subdividing kanji characters. A first dictionary, a second dictionary that stores character codes for pseudo-radical codes that indicate a collection of subsets of kanji, and stroke information from the tablet are compared with the contents of the first dictionary to determine pseudo-radicals. The recognition unit includes a recognition unit that performs recognition processing, an input register that stores a code indicating the recognition result of the recognition unit, and a selection circuit that compares the contents of the input register with a second dictionary to identify characters. , each time stroke information is input, it is verified whether the characteristic information of the information is registered in the first dictionary, and the pseudo radical code and/or undefined code indicating the verification result is stored in the input register. At the same time, each time the input register is stored, the content of the input register is re-verified based on the longest match method to determine whether or not the content is registered in the first dictionary, and the content of the input register is rewritten based on the verification result. An online character recognition method characterized by performing the following.