
JPS6084684A - Character recognizing system - Google Patents

Character recognizing system

Info

Publication number
JPS6084684A
Authority
JP
Japan
Prior art keywords
stroke
character
strokes
input
basic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58193140A
Other languages
Japanese (ja)
Inventor
Mitsumasa Sugiyama
杉山 光正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Priority to JP58193140A
Publication of JPS6084684A
Legal status: Pending

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE: To keep the classification error rate for basic strokes low with a small amount of processing, and to obtain a high recognition rate, by limiting the group of basic strokes to be collated according to the number of strokes of the input character. CONSTITUTION: A stroke number counting part 5 segments characters by a character-separation designation, made by touching a certain area on a tablet 1 with the tip of an input pen 2, or by a pen-touch off state exceeding a prescribed time, etc. The number of strokes of the input character and the order of each stroke are derived from the pen-touch on/off states of the input pen 2 and sent to a stroke recognizing part 6 and a character recognizing part 9. A pre-processing part 4 sends the stroke information for each stroke to the stroke recognizing part 6. The stroke recognizing part 6 collates the stroke information obtained from the pre-processing part 4 with the stroke patterns of the basic strokes in accordance with the number of strokes of the input character obtained from the stroke number counting part 5.

Description

DETAILED DESCRIPTION OF THE INVENTION (Technical Field) The present invention relates to a character recognition system that recognizes characters based on information about the strokes that constitute them.

(Prior Art) Conventionally, input strokes are classified into basic strokes prepared in advance, using information about the strokes that make up a character, and the input character is recognized from the resulting set of basic strokes. In Japanese, however, many kinds of characters are used, including kanji, hiragana, katakana, Latin letters, and numerals, so many basic strokes are needed to classify their strokes. Many of the strokes that make up hiragana are not used in kanji, katakana, Latin letters, or numerals, and some strokes that make up Latin letters and numerals are not used in other characters. When many kinds of characters must be recognized, the number of basic strokes that must be provided inevitably grows, and the basic stroke classification errors that arise when input strokes are assigned to basic strokes have lowered the recognition rate.

(Purpose) An object of the present invention is to provide a character recognition system that limits the group of basic strokes to be collated according to the number of strokes of the input character, thereby keeping the basic stroke classification error rate low with a small amount of processing and achieving a high recognition rate.

(Embodiment) An embodiment of the present invention is described in detail below with reference to the drawings.

FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the present invention. In the figure, 3 is a character information input device for entering the character information to be recognized; it consists of a tablet 1 and an input pen 2, and input is performed by drawing the character on the tablet 1 with the input pen 2. 4 is a preprocessing unit that applies noise removal, normalization, and the like to the character information entered from the character information input device 3. 5 is a stroke number counting unit that determines the number of strokes of the input character from the coordinate information of the input pen 2 on the tablet 1 and the on/off information indicating whether the input pen 2 is touching the tablet 1. 6 is a stroke recognition unit that recognizes the input strokes of the input character using information from the preprocessing unit 4 and the stroke number counting unit 5. 7 is a basic stroke dictionary unit in which the representative stroke patterns of the basic strokes used for stroke recognition are registered. 8 is a character information processing unit that processes the length, positional relationships, and the like of the input strokes. 9 is a character recognition unit that recognizes the input character from the result obtained from the stroke recognition unit 6 and the character information obtained from the character information processing unit 8. 10 is a character dictionary unit in which multiple kinds of character patterns are stored. 11 is an output unit that outputs the result recognized by the character recognition unit 9.

FIG. 2 shows examples of the basic strokes stored in the basic stroke dictionary unit 7 of FIG. 1. For each basic stroke it lists the stroke id number, the representative stroke, and a mark indicating whether the basic stroke is used only in characters of one to six strokes. The arrow on each representative stroke indicates the direction in which the input pen 2 moves. Strokes marked "0" in the third column are basic strokes used only in hiragana, katakana, Latin letters, and numerals, and only in characters with between one and six strokes.
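The three columns of FIG. 2 described above map naturally onto a small record type. The following is a minimal sketch under stated assumptions: the class name `BasicStroke`, the direction-code encoding of the representative stroke, and every entry in `DICTIONARY` are hypothetical and are not taken from the patent's figure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BasicStroke:
    """One row of a FIG. 2 style table (field names are hypothetical)."""
    stroke_id: int      # column 1: stroke id number
    directions: tuple   # column 2: representative stroke as pen directions (assumed encoding)
    short_only: bool    # column 3: True if used only in characters of 1 to 6 strokes


# Illustrative entries only; the real dictionary contents are in FIG. 2.
DICTIONARY = [
    BasicStroke(1, ("E",), False),
    BasicStroke(2, ("S",), False),
    BasicStroke(3, ("SW",), False),
    BasicStroke(4, ("S", "E", "N", "W"), True),
]


def short_only_ids(dictionary):
    """Ids of basic strokes that occur only in characters of 1 to 6 strokes."""
    return {s.stroke_id for s in dictionary if s.short_only}


def general_ids(dictionary):
    """Ids of basic strokes that may occur in characters of any stroke count."""
    return {s.stroke_id for s in dictionary if not s.short_only}
```

Keeping the third column as a boolean flag on each entry means the restricted candidate group can be derived by filtering, without maintaining a second dictionary.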

Next, the operation of this embodiment is described with reference to FIGS. 1 and 2.

When the operator writes a character on the tablet 1 with the input pen 2, the coordinates of the input pen 2 on the tablet 1 and information on whether the pen tip is touching the tablet 1 are sent to the preprocessing unit 4 and the stroke number counting unit 5 at fixed time intervals. The stroke number counting unit 5 segments characters by means such as an explicit character-separation designation, given by touching a certain area of the tablet 1 with the pen tip, or a pen-touch off state lasting longer than a fixed time. From the pen-touch on/off states of the input pen 2 it determines the number of strokes of the input character and the ordinal position of each stroke, and sends these to the stroke recognition unit 6 and the character recognition unit 9. The preprocessing unit 4 applies noise removal, smoothing, and similar processing to the input character information, segments it into strokes using the pen-touch on/off information of the input pen 2, and sends the stroke information for each stroke to the stroke recognition unit 6. It also sends to the character information processing unit 8 the length of each stroke of the input character, the start and end points of each stroke, the coordinates of the points at which the input pen 2 changes direction, whether strokes intersect, and the like.
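The segmentation described above, cutting characters on a long pen-off interval and cutting strokes on each pen-touch on/off transition, can be sketched as follows. This is an illustration under assumptions, not the patent's implementation: samples are taken to be `(x, y, pen_down)` tuples at a fixed sampling interval, the function names are hypothetical, and the explicit character-separation area on the tablet is not modeled.

```python
def split_characters(samples, max_pen_up_samples):
    """Cut the sample stream into characters at long pen-off runs.

    A character ends once the pen has been off the tablet for more than
    max_pen_up_samples consecutive samples (threshold is an assumed parameter).
    """
    characters, current, pen_up_run = [], [], 0
    for sample in samples:
        current.append(sample)
        if sample[2]:
            pen_up_run = 0
            continue
        pen_up_run += 1
        # Only cut if the current run actually contains some ink.
        if pen_up_run > max_pen_up_samples and any(s[2] for s in current):
            characters.append(current)
            current, pen_up_run = [], 0
    if any(s[2] for s in current):
        characters.append(current)
    return characters


def split_strokes(samples):
    """Cut one character's samples into strokes at pen-touch off transitions."""
    strokes, current = [], []
    for x, y, pen_down in samples:
        if pen_down:
            current.append((x, y))
        elif current:
            strokes.append(current)
            current = []
    if current:
        strokes.append(current)
    return strokes


# Two strokes, a long pause, then a one-stroke character.
SAMPLES = [
    (0, 0, True), (1, 0, True), (1, 0, False),
    (0, 1, True), (0, 2, True),
    (0, 2, False), (0, 2, False), (0, 2, False),
    (5, 5, True), (5, 5, False),
]
CHARACTERS = split_characters(SAMPLES, max_pen_up_samples=2)
STROKE_COUNTS = [len(split_strokes(c)) for c in CHARACTERS]
```

The stroke count of a character is the length of the list returned by `split_strokes`, and the ordinal position of a stroke is its index in that list plus one.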

The stroke recognition unit 6 collates the stroke information obtained from the preprocessing unit 4 against the stroke patterns of the basic strokes, selected according to the number of strokes of the input character obtained from the stroke number counting unit 5, and classifies each stroke as one of the basic strokes.

Suppose now that the kanji 「辞」 is input from the tablet 1 as a 13-stroke character, one stroke at a time, beginning with 「/」. At the point when the first through sixth strokes have been input, it is not yet known whether the character has six strokes or fewer, or seven or more, so each stroke is collated against all basic stroke patterns according to a predetermined algorithm and classified as a basic stroke. When the seventh stroke, 「−」, is input, however, it becomes clear that the character has seven or more strokes, so subsequent strokes are collated, according to the predetermined algorithm, only against the basic stroke patterns that remain after excluding the basic strokes that appear only in characters of one to six strokes. If any of the first six strokes has been classified as a basic stroke that appears only in characters of one to six strokes, that stroke is either collated again against the reduced set of basic stroke patterns and reclassified, or rejected as an unrecognizable stroke and handled by predetermined processing.

When all strokes of a character have been processed by the stroke recognition unit 6 in this way, the character recognition unit 9 obtains the id number of each input stroke from the stroke recognition unit 6; the stroke position information, stroke intersection information, and stroke length from the character information processing unit 8; and the number of strokes of the input character from the stroke number counting unit 5. It collates these against the character patterns registered in the character dictionary unit 10 and outputs the recognition result through the output unit 11.
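The stroke-by-stroke behavior in the 「辞」 example can be sketched as below. The patent only refers to "a predetermined algorithm" for collation, so the `distance` function here is a toy stand-in, and the direction-string encoding, the function names, and `TOY_DICTIONARY` are all assumptions made for illustration. Of the two options the text gives for earlier strokes that received a short-only id, this sketch reclassifies them rather than rejecting them.

```python
SHORT_LIMIT = 6  # characters with at most this many strokes may use short-only strokes


def distance(stroke, pattern):
    """Toy dissimilarity between two direction strings (assumption, not from the source)."""
    mismatches = sum(a != b for a, b in zip(stroke, pattern))
    return mismatches + abs(len(stroke) - len(pattern))


def best_match(stroke, candidates):
    """Return the id of the candidate whose pattern is closest to the stroke."""
    return min(candidates, key=lambda entry: distance(stroke, entry[1]))[0]


def classify_strokes(strokes, dictionary):
    """Classify strokes in input order; entries are (id, pattern, short_only)."""
    general = [e for e in dictionary if not e[2]]
    short_only_ids = {e[0] for e in dictionary if e[2]}
    results = []
    for order, stroke in enumerate(strokes, start=1):
        if order == SHORT_LIMIT + 1:
            # The character is now known to have 7 or more strokes, so
            # re-collate earlier strokes that were given a short-only id.
            for i, stroke_id in enumerate(results):
                if stroke_id in short_only_ids:
                    results[i] = best_match(strokes[i], general)
        # Up to the 6th stroke the full dictionary is used; afterwards
        # only the basic strokes that are not short-only.
        candidates = dictionary if order <= SHORT_LIMIT else general
        results.append(best_match(stroke, candidates))
    return results


# Illustrative dictionary: id 3 stands for a stroke used only in short characters.
TOY_DICTIONARY = [
    (1, "E", False),
    (2, "S", False),
    (3, "SEN", True),
    (4, "SE", False),
]
```

With a three-stroke input the third stroke keeps its short-only id, whereas the same stroke inside a seven-stroke input is reassigned to the nearest remaining basic stroke once the seventh stroke arrives. Restricting the candidate list both reduces the number of comparisons per stroke and removes look-alike patterns that cannot occur in the character being written.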

In this embodiment, the group of basic strokes to be collated is changed according to whether the character has six strokes or fewer, or seven or more. However, as FIG. 2 shows, some strokes appear only at a particular stroke position: for example, the strokes with id numbers 7, 10, 18, 34, 35, and 36 appear only as the first stroke, and the strokes with id numbers 16 and 17 appear only as the second stroke. The group of basic strokes to be collated may therefore be limited not only by the number of strokes of the character but also by the ordinal position of the input stroke.
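The position-based variation mentioned above can be sketched as a further filter on the candidate group. This is only an illustration: the dictionary layout and the `only_at` field are hypothetical, id 1 is an invented general-purpose entry, and ids 7 and 16 are used simply because the text names them as first-stroke-only and second-stroke-only strokes.

```python
# Each entry: stroke id plus the only stroke position where it may appear
# (None means the stroke may appear at any position). Layout is assumed.
POSITION_DICTIONARY = [
    {"stroke_id": 1, "only_at": None},   # hypothetical unrestricted stroke
    {"stroke_id": 7, "only_at": 1},      # appears only as the first stroke
    {"stroke_id": 16, "only_at": 2},     # appears only as the second stroke
]


def candidates_for_order(dictionary, order):
    """Ids of basic strokes that may occur as the order-th stroke of a character."""
    return [
        entry["stroke_id"]
        for entry in dictionary
        if entry["only_at"] is None or entry["only_at"] == order
    ]
```

This filter could be combined with the stroke-count restriction by applying both conditions to the same candidate list before collation.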

(Effect) As is clear from the above description, according to the present invention the group of basic strokes to be collated is limited by the number of strokes of the input character, so a high stroke recognition rate is obtained with a small amount of processing and the character recognition rate can be improved.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the present invention, and FIG. 2 is a diagram showing the basic strokes stored in the basic stroke dictionary unit shown in FIG. 1. In the figures, 3 is a character information input device, 5 is a stroke number counting unit, 6 is a stroke recognition unit, 7 is a basic stroke dictionary unit, 9 is a character recognition unit, and 10 is a character dictionary unit. Applicant: Canon Inc.

Claims (1)

[Claims] A character recognition system for a character recognition device that recognizes characters based on information about the strokes constituting them, characterized in that the group of basic strokes to be collated is changed according to the number of strokes of the input character.
JP58193140A 1983-10-14 1983-10-14 Character recognizing system Pending JPS6084684A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58193140A JPS6084684A (en) 1983-10-14 1983-10-14 Character recognizing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58193140A JPS6084684A (en) 1983-10-14 1983-10-14 Character recognizing system

Publications (1)

Publication Number Publication Date
JPS6084684A true JPS6084684A (en) 1985-05-14

Family

ID=16302944

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58193140A Pending JPS6084684A (en) 1983-10-14 1983-10-14 Character recognizing system

Country Status (1)

Country Link
JP (1) JPS6084684A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63303488A (en) * 1987-06-04 1988-12-12 Canon Inc Information recognizing device
JPH04336692A (en) * 1991-05-14 1992-11-24 Sharp Corp Character recognizing device
US6035063A (en) * 1996-06-03 2000-03-07 Nec Corporation Online character recognition system with improved standard strokes processing efficiency
KR100484128B1 (en) * 1997-11-19 2005-09-30 삼성전자주식회사 Apparatus for inputting unistroke korean language
JP2007092442A (en) * 2005-09-29 2007-04-12 Tostem Corp Opening equipment equipped with round sash, and its assembling method

Similar Documents

Publication Publication Date Title
US5854855A (en) Method and system using meta-classes and polynomial discriminant functions for handwriting recognition
CN101290659B (en) Hand-written recognition method based on assembled classifier
TWI435276B (en) A method and apparatus for recognition of handwritten symbols
US5768417A (en) Method and system for velocity-based handwriting recognition
US5802205A (en) Method and system for lexical processing
CN105260751B (en) A kind of character recognition method and its system
JPH0562391B2 (en)
Bhattacharya et al. An end-to-end system for Bangla online handwriting recognition
US5659633A (en) Character recognition method utilizing compass directions and torsion points as features
JPS6084684A (en) Character recognizing system
JPS5929246Y2 (en) Online recognition processing device for handwritten characters
JPH09319828A (en) On-line character recognition device
JPS59188787A (en) Input device of on-line manuscript character
JPS6224382A (en) Method for recognizing handwritten character
JPS613287A (en) Graphic form input system
KR100313993B1 (en) Method for recognizing probable stroke by area partition
JPH01213772A (en) On-line character recognizing system
JPH0766418B2 (en) Handwriting recognition device
JP2865286B2 (en) Online handwritten character recognition device
JPS6084683A (en) Character recognizing system
KR100367580B1 (en) Device for recognizing on-line character of stroke order independence
JPS63301383A (en) Handwritten character recognition device
JPH0737097A (en) Method for recognizing graphic provided with plural strokes
JPS642186A (en) Hand written character recognizing system
JPH042990B2 (en)