
CN102981753B - Improved handwritten Chinese character input recognition method - Google Patents


Publication number
CN102981753B
Authority
CN
China
Prior art keywords
word
candidate
district
chinese character
area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210249413.4A
Other languages
Chinese (zh)
Other versions
CN102981753A (en)
Inventor
市川文子
贝卡匹拉亚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Technologies Oy
Original Assignee
Nokia Oyj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Oyj
Publication of CN102981753A
Application granted
Publication of CN102981753B
Anticipated expiration
Legal status: Expired - Fee Related (current)


Landscapes

  • Character Discrimination (AREA)
  • Document Processing Apparatus (AREA)

Abstract

An improved device and method for handwritten Chinese character input recognition for an electronic device. The device includes means for recognizing a Chinese character written by a user in a writing area of the electronic device, wherein the recognition is completed only when the following conditions are met: the starting point or middle point of the previous stroke is not in a predetermined start area, the preceding strokes form a complete character, and the starting point or middle point of the current stroke is in the start area.

Description

Improved Handwritten Chinese Character Input Recognition Method

This application is a divisional application of application No. 200310115725.7, filed on November 28, 2003 and titled "Improved Handwritten Chinese Character Input Recognition Method".

Technical Field

The present invention relates to handwritten Chinese character input recognition, and in particular to an improved handwritten Chinese character input recognition method for electronic devices that tolerates timeouts, or is even unaffected by them, and that allows a character that has already been entered to be partially modified.

Background Art

Handwritten Chinese character input is a common input method for computers and other electronic devices that have a touch screen or a touch pad (writing tablet). With the widespread use of notebook computers, mobile phones, personal digital assistants (PDAs) and the like, fast, efficient and accurate recognition of handwritten Chinese character input has become something many people want.

Different people have different styles and habits when writing Chinese characters. One difficulty in handwritten Chinese character input recognition is determining whether the user has finished writing a character. An obstacle to fast and accurate recognition is therefore the so-called timeout, that is, the waiting time after the last stroke, because a timeout is the most effective way to detect whether the user has finished a character. For handwriting recognition to be usable in practice, the timeout is kept quite short, typically 0.2 to 0.8 seconds, and can generally be adjusted by the system or the user within a range of 0.2 to 2.0 seconds.

However, using a timeout to decide whether input is complete is sometimes problematic. First, it is hard to set an optimal timeout because it varies greatly from user to user: writing speed and the pauses between strokes and between characters depend heavily on the user. Second, the required timeout varies from character to character. For some characters the user has to move a long distance between two strokes, and when the distance is too long the move takes more time than an ordinary stroke-to-stroke transition. If the timeout then expires before the user has finished the character, the strokes written before and after the timeout are recognized as two characters.

The usual way to address these timeout problems is to give the user visual feedback in the form of ink traces so that the user can confirm the input at recognition time. In addition, most handwritten Chinese character input recognition software lets the user set the timeout length manually. Some solutions provide two or more handwriting input areas so as to force the user to write only one character per input area.

In practice, the ink feedback must be displayed directly beneath the handwriting; otherwise the user will not notice that recognition is in progress. This approach cannot be used on devices without a touch screen, such as those that have only a writing tablet. Letting users set the timeout themselves provides a more flexible waiting time, but it does not solve the problem, because a single timeout still applies to all characters. Expecting a truly flexible timeout that suits every character and every stroke within a character is unrealistic. Portable electronic devices with a relatively small screen have only a limited writing area and therefore cannot offer multiple writing areas.

Furthermore, because handwriting recognition is not 100% accurate, users sometimes have to rewrite a character to make sure it is recognized correctly. Since writing Chinese characters is a complex process, writing the same character repeatedly is a physical and psychological burden on the user, and it also hinders fast and efficient recognition. Moreover, if the top candidate is only partly correct (for example, one component is right) and the system provides a candidate list, the user may have to browse all the candidates (usually 8 to 10 characters), which makes selection harder. And if the user considers the recognition wrong, the entire character must be rewritten.

To reduce rewriting and to make input and recognition easier, various solutions have been proposed, such as classifying or decomposing characters by radicals and components (for example, the Wubi and Cangjie input methods). US Patent 5,923,778 discloses a method that uses radical information for recognition and proposes a database in which character information is organized hierarchically on the basis of radical information. US Patent 6,272,243 discloses a method and apparatus for entering input into a handwritten notepad that has a predefined sub-input area; however, it is suitable only for Japanese input.

Summary of the Invention

An object of the present invention is to provide an improved handwritten Chinese character input recognition method that is largely unaffected by timeouts and that can tolerate a timeout or even be completely independent of it.

Another object of the present invention is to provide an improved handwritten Chinese character input recognition method that reduces rewriting, increases input speed, and allows part of a character to be modified.

According to one aspect of the present invention, there is provided an improved handwritten Chinese character input recognition method for an electronic device, comprising the steps of: recognizing, according to predetermined rules, a Chinese character written by a user in a writing area of the electronic device; generating a candidate list of the characters recognized in the recognizing step; and displaying the candidate list in a display area of the electronic device; wherein the candidate list includes at least the current character that has just been recognized and a combination of that current character with a candidate character recognized and displayed before it.

According to another aspect of the present invention, there is provided an improved handwritten Chinese character input recognition method for an electronic device, comprising the step of recognizing a Chinese character written by a user in a writing area of the electronic device, wherein the recognition is completed only when the following conditions are met: the starting point or middle point of the previous stroke is not in a predetermined start area, the preceding strokes form a complete character, and the starting point or middle point of the current stroke is in the start area.

According to yet another aspect of the present invention, there is provided an improved handwritten Chinese character input recognition method for an electronic device, comprising the steps of recognizing a Chinese character written by a user in a writing area of the electronic device and displaying, in the writing area, the list of primary candidates recognized in the recognizing step; wherein the displaying includes indicating, in the background, the different parts of the structural pattern of the top candidate, and each indicated part constitutes an independent writing area in which recognition can be performed independently.

Brief Description of the Drawings

Embodiments of the present invention are described below by way of example with reference to the accompanying drawings.

Figures 1A and 1B schematically illustrate misrecognitions that can occur with prior-art handwritten Chinese character input recognition methods that rely on a timeout;

Figure 2 shows an embodiment of the method according to the invention;

Figure 3 is a flowchart of the embodiment of Figure 2;

Figure 4 shows another embodiment of the method according to the invention;

Figure 5 is a flowchart of the embodiment of Figure 4;

Figures 6A and 6B show yet another embodiment of the method according to the invention;

Figure 7 is a flowchart of the embodiment of Figure 6.

Detailed Description

Because handwritten Chinese character input recognition can be implemented in various ways well known in the art, and conventional systems or software can be used for the recognition, the recognition process itself is not described in detail here for the sake of simplicity.

Refer first to Figure 1. Figure 1A schematically illustrates a misrecognition that can occur with a prior-art handwritten Chinese character input recognition method that relies on a timeout. As shown in Figure 1A, the user intends to write the left-right character "明", which consists of two parts. The user normally writes "日" on the left first and then "月" on the right. Because "明" is made up of the component "日", which can stand alone as a character, and "月", the pen has to travel a relatively long distance from the last stroke of the left part "日" (in the lower middle of the input/writing area) to the first stroke of the right part "月" (in the upper middle of the input/writing area). This takes comparatively long, and a brief pause may occur between the two parts. If the pause is longer than the preset timeout, the system recognizes the left part as the character "日" or "月" and displays "日" and "月" in the candidate list. The system then recognizes the right part as "月" or "日" and displays the recognized "月" and "日" in the candidate list. In other words, because of the timeout, the system recognizes the two parts as two separate characters.

The same problem arises with top-bottom characters made up of two parts. As shown in Figure 1B, the character "幸" consists of the radical "土" and a lower part that is not a complete character on its own. Because of the timeout, the system may recognize "幸" as two characters, namely "土" and "羊", which resembles the lower part.

In practice, adjusting the waiting time (timeout) helps little, because the appropriate timeout varies with the character, the stroke, writing habits and the user. Even for a single user, one fixed timeout does not suit all characters.

Figure 2 shows an embodiment of the improved method according to the invention. This embodiment makes use of the candidate list, which is common and essential in handwritten Chinese character input recognition, so that in addition to the current character it also contains the combination of the current character with a character recognized and displayed before it. The candidate list can be shown on the touch screen or on a separate display (with a writing tablet, a separate display is usually needed to show the characters written on the tablet). The embodiment merely adds to the candidate list the most probable combination of the current character with the previously recognized and displayed character; no additional display element taking up screen space is required.

As shown in Figure 2, suppose the same user wants to write the same two-part character "明". Following the stroke-order rules, the user first writes the left part "日" at a speed that suits his or her writing habits. If, because of writing habits or for some other reason, there is a pause before the right part is started and that pause is longer than the preset timeout, the system assumes the input is complete and recognizes the left part as the character "日" in a known manner. "日" is shown as the top candidate somewhere on a display; in this embodiment the candidates are displayed above the writing area.

The user then continues with the remaining part, the right half "月". After the timeout, the system generates a new candidate list containing the "月" just written and the combination of "月" with the previously recognized and displayed "日", that is, the intended character "明". "明" can then be shown above the writing area as the top candidate or as one of the candidates. The system can be configured so that "明", as the top candidate, enters the text area automatically, or the user can pick it by selecting it, for example by tapping it or in some other way. Once selected, the character is moved to the text area.

In this embodiment, what is recognized need not be a complete Chinese character; it can be a component of a character, such as a radical. For example, suppose the user writes "伴" and a timeout occurs after the left part is finished. The left part is a radical rather than a complete character, so before the user goes on to write the right part "半", the system recognizes the radical and displays it in the candidate list. After the user has written the right part, the system generates a new candidate list containing the current character "半" just recognized and the character "伴" formed by combining "半" with the previously recognized and displayed radical. The user can select that character, and the system moves it to the text area.

In addition, the combination must form a complete character. Depending on the size and layout of the display, the candidate list can contain up to 10 characters for easy selection.

Thus, by adding to the candidate list only the most probable combination of the current character with the character recognized and displayed before it, which is usually a single character, recognition efficiency can be greatly improved without adding display elements that take up screen space.
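The candidate-list idea of this embodiment can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patent's implementation: the `COMPOSITIONS` table, the function name `build_candidates`, and the choice to rank the combined character first are all hypothetical; the text only requires that the list contain both the current character and its combination with the previously recognized one.

```python
# Hypothetical lookup table: (previously recognized part, current part) -> combined character.
# A real system would take this from its recognizer's character database.
COMPOSITIONS = {
    ("日", "月"): "明",
    ("亻", "半"): "伴",
    ("扌", "半"): "拌",
}


def build_candidates(current, previous=None, alternatives=(), limit=10):
    """Build a candidate list for the part that was just recognized.

    The list contains the current character and, if one exists, the complete
    character formed by combining it with the previously recognized part.
    """
    candidates = []
    combined = COMPOSITIONS.get((previous, current)) if previous else None
    if combined:
        # Assumption: the combination is ranked first; the text allows it to be
        # either the top candidate or simply one of the candidates.
        candidates.append(combined)
    candidates.append(current)
    for alt in alternatives:  # other recognizer guesses, if any
        if alt not in candidates:
            candidates.append(alt)
    return candidates[:limit]  # the text mentions up to 10 candidates
```

With this sketch, a timeout between "日" and "月" no longer loses the intended character: the second call still offers "明" because the previous result is taken into account.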

Figure 3 is a flowchart of the embodiment of Figure 2. Detection begins when the user starts writing on the touch screen or writing tablet. The system continuously checks whether there is a break between strokes longer than a predetermined time (a timeout) (step 102). If a timeout is detected, the system recognizes the character formed by the strokes written before the timeout (step 201) and then displays the recognized character in the candidate list (step 202). Meanwhile, the system continues detecting (step 103). If another timeout is detected (step 104), recognition (step 105) and display (step 106) follow. In step 106 the system displays the character recognized in step 105 together with the combination of that character and the character recognized and displayed in steps 201 and 202. If no timeout is detected, detection continues. After the display in step 106, the system places the character chosen by the user in the text editing area.
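The timeout check in this flow amounts to splitting the stroke stream wherever the gap between one stroke's pen-up and the next stroke's pen-down exceeds the timeout. The following Python sketch shows that splitting step only; the `TimedStroke` type, the function names and the timing values in `demo_strokes` are illustrative assumptions, and the recognizer itself is left out.

```python
from dataclasses import dataclass


@dataclass
class TimedStroke:
    label: str    # illustrative tag only
    start: float  # pen-down time in seconds
    end: float    # pen-up time in seconds


def split_on_timeout(strokes, timeout=0.5):
    """Group strokes so that a pause longer than `timeout` starts a new group."""
    groups, current = [], []
    for stroke in strokes:
        if current and stroke.start - current[-1].end > timeout:
            groups.append(current)  # timeout detected: hand this group to the recognizer
            current = []
        current.append(stroke)
    if current:
        groups.append(current)
    return groups


def demo_strokes():
    """Eight strokes of "明" with an assumed 0.9 s pause between "日" and "月"."""
    strokes, t = [], 0.0
    for i in range(8):
        strokes.append(TimedStroke(f"s{i + 1}", t, t + 0.2))
        t += 0.2 + (0.9 if i == 3 else 0.1)  # long pause after the 4th stroke
    return strokes
```

Under these assumed timings, a 0.5 s timeout splits the eight strokes into two groups of four, which is the situation in which the combined candidate from steps 105 and 106 is needed; a 1.0 s timeout keeps them together.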

Figure 4 shows another embodiment of the method according to the invention. In addition to using the candidate list described above, the influence of the timeout on recognition can be almost entirely eliminated by detecting where the strokes of a character are located on the touch screen or writing tablet.

Chinese characters are written from left to right and from top to bottom; on a touch screen or writing tablet, for example, writing usually starts in the upper-left corner of the writing area and ends in the lower-right corner. The place where a character begins, that is, the upper-left part of the writing area, can therefore be defined as a "start area". The start area is preferably slightly larger than a quarter of the writing area or writing box so that most users can begin writing inside it. If a character has two or more parts, for example left and right parts or upper and lower parts, the further part can be started outside the start area. This embodiment uses this property of Chinese character input to detect whether one character has been finished and another has been started.

According to this embodiment, a stroke is regarded as the start of a new character when: 1) the starting point or middle point of the previous stroke, or of the preceding strokes, is not in the start area; 2) the preceding strokes form a complete character; and 3) the starting point or middle point of the current stroke is in the start area. That is, only when all three conditions are satisfied at the same time is the character formed by the preceding strokes considered finished and the current stroke considered the start of a new character. The system therefore does not need to wait for a timeout; it decides whether a character is finished on the basis of these conditions rather than a timeout.
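The three conditions can be written as a single predicate. The Python sketch below is a minimal illustration, assuming screen coordinates with the origin at the upper-left corner; `Rect`, `Stroke`, `touches` and `starts_new_character` are hypothetical names, the completeness check is passed in as a callback because the text leaves it to the recognizer, and treating a stroke as "in" the start area when either its starting point or its middle point falls inside is one possible reading of the conditions.

```python
from dataclasses import dataclass


@dataclass
class Rect:
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, point):
        x, y = point
        return self.left <= x <= self.right and self.top <= y <= self.bottom


@dataclass
class Stroke:
    start: tuple   # (x, y) of the pen-down point
    middle: tuple  # (x, y) of the stroke's middle point


def touches(area, stroke):
    # Assumption: either the starting point or the middle point counts.
    return area.contains(stroke.start) or area.contains(stroke.middle)


def starts_new_character(previous, current, start_area, forms_complete_character):
    """Return True only if all three conditions from the text hold."""
    if not previous:
        return False  # nothing written yet, so there is no character to finish
    cond1 = not touches(start_area, previous[-1])      # previous stroke not in start area
    cond2 = bool(forms_complete_character(previous))   # preceding strokes form a character
    cond3 = touches(start_area, current)               # current stroke is in start area
    return cond1 and cond2 and cond3
```

For a 100 x 100 writing box, a start area such as `Rect(0, 0, 55, 55)` would be slightly larger than a quarter of the box, in line with the sizing suggested above; the exact figures are an assumption.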

As shown in Figure 4, the start area is in the upper-left corner of the writing area or writing box and can be marked with lines or graphics to prompt the user to begin writing there.

Take "明" again as an example. It consists of left and right parts and should be written starting with the "日" component on the left. The starting and middle points of the first three strokes of "日" are all inside the start area. The starting point of the fourth stroke may lie outside the start area, but the first three strokes do not form a complete character, so the fourth stroke is not treated as the start of a new character. The right part "月" is generally started outside the start area. Although the left part "日", used here as a component, can also be a complete character, the first stroke of the right part "月" is not treated as the start of a new character because it begins outside the start area and so fails the third condition above. The system checks the starting point of each stroke to determine, according to this rule, whether a character has been finished. The system can also be set to check the middle point of each stroke, that is, whether the middle portion of the stroke falls in the start area. A finished character is recognized and displayed in the candidate area.

Because the user may, through inattention or for other reasons, start the right part "月" inside the start area, which could cause the system to misrecognize it as a complete character "月", the candidate list should contain, in addition to the current character, the combination of the current character with the character immediately before it, as described with reference to Figure 2. In this example both "月" and "明" are shown in the candidate list, so an error caused by the user failing to start the right part "月" outside the start area does not prevent correct recognition.

Figure 5 shows the flow of the embodiment of Figure 4. Detection begins when the user starts writing on the touch screen or writing tablet. The system first checks whether the starting/middle point of the previous stroke is in the predetermined start area (step 301). If it is not, the system checks whether the preceding strokes form a complete Chinese character (step 302). If it is, detection continues until a stroke is found whose starting/middle point is not in the start area (step 401). If the preceding strokes form a complete character, the system checks whether the starting/middle point of the current stroke is in the start area (step 303). If so, the character formed by the preceding strokes is recognized (step 304), and the character recognized in step 304 is displayed (step 305) together with the character recognized and displayed in the previous round of detection. If the result of step 303 is negative, the flow returns to step 401. Finally, the system places the character chosen by the user in the text editing area.

Figures 6A and 6B show yet another embodiment of the method according to the invention, which lets the user modify part of a character.

According to the character pattern indexing system (SKIP), Chinese characters can be divided into four structural patterns: left-right (for example "明"), top-bottom (for example "吴"), enclosure (for example "国") and solid (for example "大"). Therefore, besides presetting a start area in the writing area of the display, the structural pattern of the recognized character can be indicated in some way, for example by color or, on a monochrome display, by gray level.

Each part indicated by color, for example the left part and the right part of a left-right character, forms an independent recognition region in which the system can recognize a complete character, a radical, or the like; however, the top candidate recognized for the writing area as a whole should be a complete character.

As shown in Figure 6A, to enter the character "拌" in the writing area of the touch screen, the user starts from the upper-left corner (the start area). The system recognizes the user's input. Recognition can be performed with the method of Figure 2, the method of Figure 4, or any known method. Suppose that, for example because of sloppy handwriting, the top candidate shown in the writing area is "伴" rather than the intended "拌", and the background of the writing area indicates, by different colors or gray levels, that the character is of the left-right pattern. The user does not have to rewrite the whole character, because the top candidate is "almost correct": it looks like the intended character except that the left part is wrong. The user can rewrite just the left part. While the user rewrites it, the background color (gray level) of the left region changes to confirm that only the left part is being modified and the right part stays unchanged. Based on the modification, the system produces a new top candidate, "拌", which is the character the user intended.

The candidates can also be shown at the same time in a candidate list, for example the candidate list above the writing area shown in Figure 4.

Alternatively, the candidate list can include all characters (as many as can be displayed) that contain the component or radical selected by the user. In this case the user selects the correct part in the writing area instead of modifying the wrong part. As shown in Figure 6B, when the user notices that the displayed top candidate is not the intended character, the user can select the correct part, for example by tapping the region containing it (the right region). All characters containing the correct part "半" are then displayed in the candidate list, in this example "伴", "拌", "绊" and so on. The user can pick the intended character ("拌") from the candidate list, for example by tapping it.
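Both correction paths of Figures 6A and 6B (rewriting the wrong part, or confirming the correct part and choosing from a list) can be illustrated with a small decomposition table. The Python sketch below is an assumption-laden illustration: `DECOMPOSITIONS`, `candidates_sharing_part` and `replace_part` are hypothetical names, the table holds only a handful of left-right characters, and a real system would draw on its recognizer's character database instead.

```python
# Hypothetical decomposition table: character -> (structural pattern, (part, part)).
DECOMPOSITIONS = {
    "伴": ("left-right", ("亻", "半")),
    "拌": ("left-right", ("扌", "半")),
    "绊": ("left-right", ("纟", "半")),
    "明": ("left-right", ("日", "月")),
}


def candidates_sharing_part(char, position):
    """Figure 6B path: list characters that keep the confirmed part of `char`.

    `position` is 0 for the first region (left) and 1 for the second (right).
    """
    layout, parts = DECOMPOSITIONS[char]
    confirmed = parts[position]
    return [
        c for c, (other_layout, other_parts) in DECOMPOSITIONS.items()
        if other_layout == layout and other_parts[position] == confirmed
    ]


def replace_part(char, position, new_part):
    """Figure 6A path: swap one rewritten part and look up the resulting character."""
    layout, parts = DECOMPOSITIONS[char]
    wanted = list(parts)
    wanted[position] = new_part
    for c, (other_layout, other_parts) in DECOMPOSITIONS.items():
        if other_layout == layout and list(other_parts) == wanted:
            return c
    return None  # the combination does not form a known complete character
```

In the example from the text, the top candidate is "伴" and the user wants "拌": calling `candidates_sharing_part("伴", 1)` corresponds to tapping the right region, and `replace_part("伴", 0, "扌")` corresponds to rewriting only the left region.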

This embodiment is not applicable to a writing tablet with a separate display.

Figure 7 is a flowchart of the embodiment of Figure 6. After the user has written a character, the system recognizes it using the method of Figure 2 or Figure 4 or a known method, then displays the recognized top candidate (step 502) and indicates its structural pattern by color/gray level (step 503), with different colors/gray levels marking the different regions of the pattern. If the displayed character is the one the user wrote (step 504), the system, following the user's selection, produces the selected character and places it in the text editing area (step 505). If the character displayed in step 503 is partly correct, the user can either confirm the correct part or directly modify the wrong part. If the user confirms the correct part (step 601), the system generates a candidate list (step 602) containing all characters that include that part. If the user directly modifies the wrong part (step 603), the system signals this by changing the color/gray level of the part being modified, then recognizes and displays the modified part.

Although color/grayscale has been used as the example of how to prompt the user, other means well known to those skilled in the art can obviously be used as well.

The present invention has been described above by way of embodiments, but it is not limited to those embodiments; those skilled in the art can make various modifications to the invention within the scope of the appended claims.

Claims (12)

1. An improved device for handwritten Chinese character input recognition for an electronic apparatus, comprising means for recognizing a Chinese character written by a user in a writing area of the electronic apparatus, wherein the recognition is completed only when the following conditions are met: the start point or middle point of the previous stroke is not in a predetermined start area, the preceding strokes form a complete character, and the start point or middle point of the current stroke is in the start area.
2. The device according to claim 1, wherein the start area is located in the upper-left part of the writing area.
3. The device according to claim 1 or 2, wherein the area of the start area is greater than one quarter of the writing area.
4. The device according to claim 1, further comprising means for generating a candidate list of the characters recognized by the means for recognizing, and means for displaying the candidate list in a display area of the electronic apparatus.
5. The device according to claim 4, wherein the candidate list includes at least the current character that has just been recognized, and candidate combinations of the current character with the characters recognized and displayed before it.
6. The device according to claim 4, further comprising means for selecting a character from the candidate list displayed in the candidate display area.
7. An improved handwritten Chinese character input recognition method for an electronic apparatus, comprising a step of recognizing a Chinese character written by a user in a writing area of the electronic apparatus, wherein the recognition is completed only when the following conditions are met: the start point or middle point of the previous stroke is not in a predetermined start area, the preceding strokes form a complete character, and the start point or middle point of the current stroke is in the start area.
8. The method according to claim 7, wherein the start area is located in the upper-left part of the writing area.
9. The method according to claim 7 or 8, wherein the area of the start area is greater than one quarter of the writing area.
10. The method according to claim 7, further comprising a step of generating a candidate list of the characters recognized in the recognizing step, and a step of displaying the candidate list in a display area of the electronic apparatus.
11. The method according to claim 10, wherein the candidate list includes at least the current character that has just been recognized, and candidate combinations of the current character with the characters recognized and displayed before it.
12. The method according to claim 10, further comprising a step of selecting a character from the candidate list displayed in the candidate display area.
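The trigger condition recited in claims 1 and 7 can be illustrated with a minimal sketch. Several details are assumptions made for the example rather than statements of the patent: strokes are lists of sampled (x, y) points, the "middle point" is taken as the middle sample, "not in the start area" is read as neither point lying inside it, the completeness of the preceding strokes is passed in as a flag, and the names `Rect`, `touches_area` and `should_recognize` as well as the coordinates are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, p: Point) -> bool:
        x, y = p
        return self.left <= x <= self.right and self.top <= y <= self.bottom

    def area(self) -> float:
        return (self.right - self.left) * (self.bottom - self.top)


def touches_area(stroke: List[Point], area: Rect) -> bool:
    """True if the stroke's start point or middle sample lies in `area`."""
    start = stroke[0]
    middle = stroke[len(stroke) // 2]
    return area.contains(start) or area.contains(middle)


def should_recognize(prev_stroke, cur_stroke, start_area, prev_strokes_complete):
    """All three conditions of claims 1 and 7 must hold."""
    return (
        not touches_area(prev_stroke, start_area)
        and prev_strokes_complete
        and touches_area(cur_stroke, start_area)
    )


# Screen coordinates with y growing downward, so (0, 0) is the upper-left corner.
writing_area = Rect(0, 0, 100, 100)
start_area = Rect(0, 0, 60, 60)  # upper-left, larger than a quarter (claims 2-3)
last_stroke = [(70, 70), (80, 80), (90, 90)]  # ends the previous character
new_stroke = [(10, 10), (20, 15), (30, 20)]   # begins in the start area
trigger = should_recognize(last_stroke, new_stroke, start_area, True)
```

In this reading, a new stroke that begins in the start area while the previous stroke lay outside it signals that the user has moved on to the next character, so the strokes collected so far can be recognized.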
CN201210249413.4A 2003-11-28 2003-11-28 The hand-written Chinese character input recognition method improved Expired - Fee Related CN102981753B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200310115725A CN1622121B (en) 2003-11-28 2003-11-28 Modified handwritten Chinese character input recognition method

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN200310115725A Division CN1622121B (en) 2003-11-28 2003-11-28 Modified handwritten Chinese character input recognition method

Publications (2)

Publication Number Publication Date
CN102981753A CN102981753A (en) 2013-03-20
CN102981753B true CN102981753B (en) 2015-10-07

Family

ID=34760531

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201210249413.4A Expired - Fee Related CN102981753B (en) 2003-11-28 2003-11-28 The hand-written Chinese character input recognition method improved
CN200310115725A Expired - Fee Related CN1622121B (en) 2003-11-28 2003-11-28 Modified handwritten Chinese character input recognition method

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN200310115725A Expired - Fee Related CN1622121B (en) 2003-11-28 2003-11-28 Modified handwritten Chinese character input recognition method

Country Status (1)

Country Link
CN (2) CN102981753B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104656938B (en) * 2013-11-19 2018-07-06 阿尔派株式会社 Input device and character input method
US10095673B2 (en) * 2014-11-17 2018-10-09 Lenovo (Singapore) Pte. Ltd. Generating candidate logograms
CN104598148B (en) * 2014-12-16 2017-09-19 任文 A method for handwriting Chinese characters on a computer touch screen
CN107870678A (en) * 2016-09-26 2018-04-03 中兴通讯股份有限公司 A kind of hand-written inputting method and device
TWI664604B (en) * 2018-03-20 2019-07-01 王佑鈞 System for construction and application of handwriting chinese character glyph sets to be used in a computer system
US12175888B2 (en) * 2020-10-20 2024-12-24 Holistic Language Solutions LLC Computerized method and apparatus for determining accuracy of written characters and stroke order and compliance with rules and providing visual and audio feedback

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1232204A (en) * 1998-04-16 1999-10-20 国际商业机器公司 Handwritten Chinese character recognition method and device
CN1279443A (en) * 1999-06-28 2001-01-10 夏普公司 Method and device for correcting faults in symbols identification

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1040340A (en) * 1996-05-01 1998-02-13 Nippon Steel Corp Handwritten character recognition device and recording medium
JPH10162101A (en) * 1996-11-28 1998-06-19 Hitachi Ltd Handwritten character recognition device and handwritten character recognition method
JPH11191138A (en) * 1997-12-25 1999-07-13 Toshiba Corp Kanji input method utilizing handwritten character recognizing function and character input device
CN1197333A (en) * 1998-03-09 1998-10-28 柏凯 Computer hand writing input machine codes
US6681044B1 (en) * 2000-03-29 2004-01-20 Matsushita Electric Industrial Co., Ltd. Retrieval of cursive Chinese handwritten annotations based on radical model

Also Published As

Publication number Publication date
CN1622121B (en) 2012-08-29
HK1077900A1 (en) 2006-02-24
CN1622121A (en) 2005-06-01
CN102981753A (en) 2013-03-20

Similar Documents

Publication Publication Date Title
US5267327A (en) Apparatus and method for registering the handwriting of a user so it can be translated into block characters
US7137076B2 (en) Correcting recognition results associated with user input
JP3471919B2 (en) Machine operating method and method used to operate a processor controlled machine
US20140170611A1 (en) System and method for teaching pictographic languages
US8381119B2 (en) Input device for pictographic languages
US7164367B2 (en) Component-based, adaptive stroke-order system
CN101211243A (en) handwriting input device
US20030099398A1 (en) Character recognition apparatus and character recognition method
US6535204B2 (en) Manuscript input data processing device
JP5037491B2 (en) Omitted handwriting of ideographic language phrases by supplying less than the total number of given (one or more) characters, or fewer than the total number of strokes of the character, or both input
US20040243389A1 (en) Method and apparatus for selecting symbols in ideographic languages
EP1513053A2 (en) Apparatus and method for character recognition
CN110059636B (en) Method and system for checking and correcting learning homework of students
CN102981753B (en) The hand-written Chinese character input recognition method improved
US20140245212A1 (en) Method and Apparatus for Improved Text Entry
KR100651396B1 (en) Alphabet recognition apparatus and method
KR20040043454A (en) Pen input method and apparatus in pen computing system
CN100410966C (en) Improved Handwritten Chinese Character Input Recognition Method
US20040210853A1 (en) Method for dynamically distinguishing a character string
US20020085772A1 (en) Intelligent correction key
KR100506231B1 (en) Apparatus and method for inputting character in terminal having touch screen
JP2015125561A (en) Information display device and information display program
CA2039115C (en) Information processing apparatus and method
CN100388266C (en) Portable electronic device with writing and teaching function
JP2006155622A (en) Method and device for performing ideographic character input

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20160118

Address after: Espoo, Finland

Patentee after: Nokia Technologies Oy

Address before: Espoo, Finland

Patentee before: Nokia Oyj

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20151007

Termination date: 20191128
