[go: up one dir, main page]

CN1641635B - Computer executable word/phrase matching and querying system and method thereof - Google Patents

Computer executable word/phrase matching and querying system and method thereof Download PDF

Info

Publication number
CN1641635B
CN1641635B CN 200410001649 CN200410001649A CN1641635B CN 1641635 B CN1641635 B CN 1641635B CN 200410001649 CN200410001649 CN 200410001649 CN 200410001649 A CN200410001649 A CN 200410001649A CN 1641635 B CN1641635 B CN 1641635B
Authority
CN
China
Prior art keywords
word
character
database
translation
phrase
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 200410001649
Other languages
Chinese (zh)
Other versions
CN1641635A (en
Inventor
邱全成
周韬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Corp
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Corp filed Critical Inventec Corp
Priority to CN 200410001649 priority Critical patent/CN1641635B/en
Publication of CN1641635A publication Critical patent/CN1641635A/en
Application granted granted Critical
Publication of CN1641635B publication Critical patent/CN1641635B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a system and method for matching and inquiring words/phrases which can be executed by a computer, wherein the system comprises a database, a database and a query module, wherein the database provides the words/phrases and corresponding explanations for the words/phrases; a character recognition module, which generates a character/word to be queried after recognition according to the characters on the picture; a judging module, which determines a corresponding rule according to the letter case of the word/phrase to be inquired; a query module for querying whether there is a completely matched word or phrase in the database according to the corresponding rule; a spelling correction and fuzzy query module, which finds out a most similar word in the database when there is no completely matched word; and an output module. The method is to judge whether the word or phrase to be inquired belongs to the situation of full capitalization of the whole sentence and whether the word or phrase is in capitalization of the first letter, and to match a proper word or phrase in the database according to a preset corresponding rule so as to correctly match the word or phrase and correctly display the translation result.

Description

计算机可执行的字/词匹配与查询的系统及其方法 Computer-executable word/word matching and query system and method

技术领域technical field

本发明涉及一种关于数据处理的系统及其方法,可应用于电子字典上,特别是一种藉由判断字/词大小写的情况,以进行字/词匹配与查询的系统及方法。The present invention relates to a system and method related to data processing, which can be applied to electronic dictionaries, especially a system and method for word/word matching and query by judging the case of words/words.

背景技术Background technique

一般而言,计算机所显示的画面中会有许多不认识的外语生字,这些生字有的是单字,有的是词组。对于这些生字可藉由查询电子字典(翻译软件)的方式,以便能够了解其含义,弥补因为语言能力不足所产生的障碍。电子字典除了提供字/词的翻译查询之外,更提供了许多的附加功能,例如全文翻译、网页翻译或实时翻译等,这些附加的功能使得翻译更快速且更方便。以全文翻译为例,其特色是一次可翻译大量的字汇,并且加以组织成为有意义的文句,免除掉一个字一个字慢慢查询的缺点。以实时翻译为例,其特色是操作上的方便,只要通过光标接触画面上待查询的字/词,随即会显示出一窗口并显示此待查询的字/词的翻译,使用者不需逐字输入便可获得字/词的翻译。Generally speaking, there will be many unfamiliar foreign language new characters in the picture displayed by the computer, some of these new words are single characters, and some are phrases. For these new words, you can consult the electronic dictionary (translation software) so that you can understand their meanings and make up for the obstacles caused by insufficient language skills. In addition to providing word/phrase translation queries, electronic dictionaries also provide many additional functions, such as full-text translation, web page translation or real-time translation, etc. These additional functions make translation faster and more convenient. Taking full-text translation as an example, its feature is that it can translate a large number of words at one time, and organize them into meaningful sentences, avoiding the disadvantage of slowly searching word by word. Taking real-time translation as an example, its feature is the convenience of operation, as long as the cursor touches the word/phrase to be queried on the screen, a window will be displayed and the translation of the word/word to be queried will be displayed, and the user does not need to You can get the translation of the word/phrase by typing the word.

举例来说,当使用者开启一外语的电子文件阅读的时候,以往遇到不认识的外语生字时,必须翻阅外语字(辞)典以获得此一外语生字的含义,但是,翻阅字(辞)典相当不方便且查询生字所花费的时间也较长,因此较快的查询方式是利用计算机中的电子字典作为辅助。利用电子字典查询外语生字,只要输入待查询字/词便能获得有关该字/词的解释,相当方便且快速。虽然电子字典查询字/词的处理速度相当快,但是在字/词的输入上还是要花费一些时间,此为其美中不足的地方,因此更快且更方便的查询方法为利用电子字典附加的功能,例如全文翻译、网页翻译或实时翻译等。For example, when a user opens an electronic file in a foreign language to read, in the past, when encountering an unfamiliar foreign language word, he must look through a foreign language dictionary (dictionary) to obtain the meaning of this foreign language word. ) dictionary is quite inconvenient and takes a long time to look up new words, so the faster way to look up is to use the electronic dictionary in the computer as an aid. It is very convenient and fast to use the electronic dictionary to look up new words in a foreign language. Just input the word/phrase to be queried to get the explanation of the word/phrase. Although the processing speed of the electronic dictionary query word/word is quite fast, it still takes some time to input the word/word. This is a fly in the ointment, so the faster and more convenient query method is to use the additional functions of the electronic dictionary , such as full-text translation, web page translation, or real-time translation.

以下我们以实时翻译为例,说明其运作方式以及存在的问题。如图1所示,为实时翻译运作的流程图。当电子字典的实时翻译功能启动时,会根据光标所在位置撷取一个待查询的字/词(步骤100);并根据所撷取的字/词与数据库进行匹配,查询待查询的字/词在数据库中是否存在一个可完全匹配的字/词(步骤110);如果在数据库中可以找到一个完全匹配的字/词,则显示此待查询的字/词的解释(步骤130);如果在数据库中找不到可以完全匹配的字/词,则执行拼写校正与模糊查询(步骤120),于找数据库中找出一个与待查询的字/词最近似的字/词,然后显示此最近似的字/词的解释(步骤130)。虽然利用实时翻译功能不需要键盘输入的操作,只需利用鼠标便可完成查询生字的目的,但是在某些情况下,不一定能正确地显示出待查询的字/词的正确解释。例如待查询的字/词出现在句首或标题行的时候,可能会以第一个字母大写或是整句全大写的型态出现,而某些字/词偏偏又存在大写形式与小写形式不同意义的解释。在这种情形下查询这类字/词时,会导致待查询的字/词与数据库匹配出不正确的翻译结果,因而所显示的并不是使用者实际想要的查询结果。Let's take real-time translation as an example to illustrate how it works and what problems exist. As shown in Figure 1, it is a flow chart of real-time translation operation. When the real-time translation function of the electronic dictionary is activated, a word/word to be queried will be retrieved according to the position of the cursor (step 100); Whether there is a fully matching word/phrase in the database (step 110); If a fully matching word/phrase can be found in the database, then display the explanation (step 130) of this word/phrase to be queried; Can not find the word/word that can fully match in the database, then carry out spell correction and fuzzy query (step 120), find out a word/word most similar with the word/word to be queried in the database, then display this recent Interpretation (step 130) of similar word/phrase. Although the use of the real-time translation function does not require the operation of keyboard input, and only needs to use the mouse to complete the purpose of querying new words, but in some cases, the correct explanation of the word/phrase to be queried may not be displayed correctly. For example, when the word/phrase to be queried appears at the beginning of a sentence or in the title line, it may appear in the form of capitalizing the first letter or the entire sentence, and some words/phrases have both uppercase and lowercase forms. different interpretations. When querying such words/phrases in this situation, it will cause the word/phrase to be queried to be matched with the database to get an incorrect translation result, and thus what is displayed is not the query result that the user actually wants.

以下以具体的实例作为说明,我们以经常使用的单字as为例,其大写的AS有两种解释,分别为“Anglo-Saxon”,即“盎格鲁撒克逊人”与“antisubmarine”,即“反潜艇的”;其小写的as有多种解释,例如,像...一样、作为、如同...等,其大小写翻译的解释截然不同.因此,当整句文字全大写或首字母大写时,例如,AS IT IS...或As it is...等的情况下,应该是以小写形式的as为其翻译结果,然而字/词匹配的结果,却会以大写的AS为其匹配的字/词,故产生翻译不正确的情形。以AS IT IS为例,因为在数据库中可找到一个与AS完全匹配的字/词,亦即大写的AS。以As it is为例,虽然无法完全匹配一个As的字/词,但是,经过拼写校正与模糊查询的结果,其匹配的字/词为AS,而不是as。上述的两种情形,会造成输出不正确的翻译结果,这显然不是使用者希望看到的查询结果。同样地,这一类的情形也可能发生在电子字典的其它附加功能上,例如全文翻译功能、网页翻译等。The following is a specific example as an illustration. We take the frequently used word as as an example. The uppercase AS has two interpretations, namely "Anglo-Saxon", that is, "Anglo-Saxon" and "antisubmarine", that is, "anti-submarine". of"; its lowercase as has multiple interpretations, for example, like..., as, as...etc., its uppercase and lowercase translations have completely different interpretations. Therefore, when the entire sentence is all capitalized or the first letter is capitalized , for example, in the case of AS IT IS... or As it is..., etc., the lowercase as should be used as the translation result, but the word/word matching result will be matched with the uppercase AS words/phrases, so the translation is incorrect. Take AS IT IS as an example, because a word/phrase that exactly matches AS can be found in the database, that is, uppercase AS. Taking As it is as an example, although it is impossible to completely match a word/phrase of As, after spelling correction and fuzzy query results, the matched word/word is AS instead of as. The above two situations will result in an incorrect translation result, which is obviously not the query result that the user wants to see. Similarly, this type of situation may also occur in other additional functions of the electronic dictionary, such as full-text translation function, web page translation, etc.

发明内容Contents of the invention

鉴于前述有关于电子字典的附加功能所存在的问题,本发明提出一种计算机可执行的字/词匹配与查询的系统及其方法,主要的目的是通过判断待查询的字/词的是否属于整句文字全大写与是否为首字母大写的状态,然后采取适当的对应规则,以解决过去待查询的字/词与数据库匹配不正确的情形,达到正确匹配字/词与正确显示其翻译结果的目的。例如,当待查询的字/词属于整句文字全大写时,对于待查询的字/词所采取的对应规则应该先以其小写形式查询,当小写形式无法与数据库完全匹配时,再采取其它对应规则。同样的,当待查询的字/词为首字母大写的情形时,对于待查询的字/词所采取的对应规则也应该先以其小写形式查询。In view of the aforementioned problems related to the additional functions of electronic dictionaries, the present invention proposes a computer-executable word/word matching and query system and method thereof, the main purpose of which is to determine whether the word/word to be queried belongs to The entire sentence is all capitalized and whether the first letter is capitalized, and then appropriate corresponding rules are adopted to solve the situation where the word/phrase to be queried is incorrectly matched with the database in the past, so as to achieve the correct matching of the word/phrase and the correct display of its translation results Purpose. For example, when the word/phrase to be queried belongs to the entire sentence in uppercase, the corresponding rules for the word/word to be queried should first be queried in its lowercase form, and when the lowercase form cannot completely match the database, other rules should be adopted. corresponding rules. Similarly, when the word/phrase to be queried is capitalized, the corresponding rules adopted for the word/phrase to be queried should also be queried in its lowercase form first.

为了达到上述目的,本发明提供一种计算机可执行的字/词匹配与查询的系统,其中该系统包含有:一数据库,该数据库是建立于一计算机可存取的储存介质,储存字/词与其对应的解释;一文字辨识模块,取得自一输入装置输入的文字,并对该文字进行辨识以产生一待查询的字/词;一判断模块,根据该待查询字/词的字母大小写,决定一对应规则;一查询模块,该查询模块根据该对应规则自该数据库中查找出一完全匹配的字/词,及相应该完全匹配的字/词的翻译;一拼写校正与模糊查询模块,当该查询模块未查找到该完全匹配的字/词时,于该数据库中找出一最近似的字/词,及该最近似的字/词的翻译;及一视讯输出装置,用以输出该完全匹配的字/词的翻译与该最近似的字/词的翻译。In order to achieve the above object, the present invention provides a computer-executable word/word matching and query system, wherein the system includes: a database, which is built on a computer-accessible storage medium, storing words/words An explanation corresponding thereto; a text recognition module, which obtains the text input from an input device, and identifies the text to generate a word/word to be queried; a judgment module, according to the letter case of the word/word to be queried, Determine a corresponding rule; a query module, which searches out a completely matched word/word from the database according to the corresponding rule, and a corresponding translation of the completely matched word/word; a spelling correction and fuzzy query module, When the query module does not find the exact matching word/phrase, find a most similar word/phrase and the translation of the most similar word/phrase in the database; and a video output device for outputting The translation of the exact word/phrase and the translation of the closest word/phrase.

本发明还提供一种计算机可执行的字/词匹配与查询的方法,适用于以一计算机执行下列步骤:取得自一输入装置所输入的一待查询的字/词;根据该待查询的字/词字母大小写的判断,决定一对应规则;根据该对应规则判断一数据库中查找出一完全匹配的字/词,及相应该完全匹配的字/词的翻译;根据该对应规则于该数据库中未查找到完全匹配的字/词时,则以一拼写校正与模糊查询模块执行一拼写校正与模糊查询功能,并于该数据库中找出一最近似的字/词,及该最近似的字/词的翻译;及以一视讯输出装置输出该完全匹配的字/词的翻译或该最近似的字/词的翻译。The present invention also provides a computer-executable word/word matching and query method, which is suitable for performing the following steps with a computer: obtaining a word/word to be queried inputted from an input device; The judgment of the upper and lower case of the letter of a word determines a corresponding rule; according to the corresponding rule, a completely matching word/word is found in a database, and the corresponding translation of the completely matching word/word is found; according to the corresponding rule, the database When no fully matching word/word is found in the database, a spelling correction and fuzzy query module is used to perform a spelling correction and fuzzy query function, and a most similar word/word is found in the database, and the most similar the translation of the word/phrase; and outputting the translation of the exact matching word/phrase or the translation of the most similar word/phrase with a video output device.

也就是说,本发明所提供的计算机可执行的字/词匹配与查询的系统及其方法,必须先行判断待查询的字/词是否属于整句文字全大写与首字母大写的状态,然后根据一预定的查询规则,于数据库中匹配一个合适的字/词,并以其相对应的解释为输出的翻译结果。有关本发明具体可行的实施方式,兹就配合附图说明如下:That is to say, the computer-executable word/word matching and query system and method thereof provided by the present invention must first judge whether the word/word to be queried belongs to the state of full capitalization and initial capitalization of the entire sentence, and then according to A predetermined query rule matches a suitable word/phrase in the database, and uses its corresponding interpretation as the output translation result. Concrete and feasible implementation modes of the present invention are described as follows with regard to coordinating accompanying drawings:

附图说明Description of drawings

图1为实时翻译运作的流程图;Figure 1 is a flowchart of real-time translation operation;

图2为本发明所提的计算机可执行的字词匹配与查询的系统及其方法的硬件架构图;Fig. 2 is the hardware architecture diagram of the system and method thereof of computer-executable word matching and query proposed by the present invention;

图3为本发明所提的计算机可执行的字/词匹配与查询的系统及其方法的系统架构图;Fig. 3 is the system framework diagram of the system and method thereof of computer-executable word/word matching and query proposed by the present invention;

图4为本发明所提的计算机可执行的字/词匹配与查询的系统及其方法的详细流程图。FIG. 4 is a detailed flow chart of the computer-executable word/word matching and query system and method thereof proposed by the present invention.

其中,附图标记说明如下:Wherein, the reference signs are explained as follows:

21 输入装置                  22 中央处理单元21 Input device 22 Central processing unit

23 视讯输出装置23 Video output device

31 数据库                    32 文字辨识模块31 Database 32 Text Recognition Module

33 判断模块                  34 查询模块33 Judgment module 34 Query module

35 拼写校正与模糊查询模块    36 输出模块35 spelling correction and fuzzy query module 36 output module

步骤100撷取待查询的字/词Step 100 extracts the word/phrase to be queried

步骤110查询待查询的字/词与数据库是否完全匹配Step 110 inquires whether the word/phrase to be queried matches completely with the database

步骤120执行拼写校正与模糊查询功能Step 120 performs spelling correction and fuzzy query functions

步骤130输出翻译结果Step 130 outputs translation result

步骤400撷取待查询的字/词Step 400 extracts the word/phrase to be queried

步骤410判断是否整句文字全大写Step 410 judges whether the whole sentence text is all capitalized

步骤420以小写形式查询Step 420 query in lower case

步骤430判断小写形式是否与数据库完全匹配Step 430 judges whether the lowercase form matches the database completely

步骤440输出结果Step 440 output result

步骤450执行拼写校正与模糊查询功能Step 450 performs spelling correction and fuzzy query functions

步骤460以原字/词形式查询Step 460 queries with the original word/word form

步骤470判断原字/词形式是否与数据库完全匹配Step 470 judges whether the original word/word form matches completely with the database

步骤480判断是否首字母大写Step 480 judges whether the first letter is capitalized

具体实施方式Detailed ways

如图2所示,为计算机可执行的字/词匹配与查询的系统及其方法的硬件架构图。为具有输入装置21、中央处理单元22和视讯输出装置23所组成的计算机系统。其输入装置21可以是键盘、鼠标、数字板、光笔等可以操作画面上所产生光标的装置,其视讯输出装置23亦可以是屏幕、液晶显示屏幕、触控屏幕等显示接口,用以提供使用者进行视觉接收的装置。As shown in FIG. 2 , it is a hardware architecture diagram of a computer-executable word/word matching and query system and its method. It is a computer system composed of an input device 21 , a central processing unit 22 and a video output device 23 . Its input device 21 can be the device that can operate the cursor that produces on the screen such as keyboard, mouse, digital board, light pen, and its video signal output device 23 also can be the display interface such as screen, liquid crystal display screen, touch screen, in order to provide use device for visual reception.

如图3所示,为计算机可执行的字/词匹配与查询的系统及其方法的系统架构图。其包含维持系统运作所需的数据库31、文字辨识模块32、判断模块33、查询模块34、拼写校正与模糊查询模块35及输出模块36。如上所述的各模块,兹分别说明如下:As shown in FIG. 3 , it is a system architecture diagram of a computer-executable word/word matching and query system and its method. It includes a database 31 , a character recognition module 32 , a judgment module 33 , a query module 34 , a spelling correction and fuzzy query module 35 and an output module 36 required to maintain system operation. Each module mentioned above is explained as follows:

数据库31,即为字(辞)典数据库,其包含:字/词字段,提供丰富的单字与单词以供查询;解释字段,提供单字与单词的相对应的解释。The database 31 is a character (dictionary) database, which includes: a word/word field, which provides abundant words and words for query; an explanation field, which provides corresponding explanations of words and words.

文字辨识模块32,藉以撷取画面上的字/词产生一可供查询的单字或单词,亦即经过辨识之后产生一个待查询的字/词。The character recognition module 32 is used to capture the character/phrase on the screen to generate a single character or words that can be queried, that is, to generate a character/word to be queried after recognition.

判断模块33,判断待查询的字/词是否属于整句文字全大写,然后决定相对应的对应规则。如果待查询的字/词属于整句文字全大写,则先以待查询的字/词的小写形式查询,否则先以待查询的字/词的原始形式查询。当待查询的字/词的原始形式无法与数据库完全匹配时,再来判断待查询的字/词是否为首字母大写,然后采取相对应的对应规则,如果待查询的字/词为首字母大写,则以待查询的字/词的小写形式查询,否则执行拼写校正与模糊查询功能。The judging module 33 judges whether the word/phrase to be queried belongs to the whole sentence in all capital letters, and then determines the corresponding corresponding rules. If the word/phrase to be queried belongs to the whole sentence in all uppercase, then the lowercase form of the word/phrase to be queried is queried first; otherwise, the original form of the word/phrase to be queried is queried first. When the original form of the word/phrase to be queried cannot be completely matched with the database, then judge whether the word/phrase to be queried is capitalized, and then adopt the corresponding corresponding rules. If the word/phrase to be queried is capitalized, then Query in lowercase form of the word/phrase to be queried, otherwise perform spelling correction and fuzzy query functions.

查询模块34,根据判断模块33的对应规则与数据库31进行匹配,根据匹配的结果产生相对应的处理策略。当待查询的字/词的对应规则与数据库完全匹配时,其处理策略为输出待查询的字/词的解释;当待查询的字/词的对应规则与数据库无法匹配时,其处理策略有二,其一为判断待查询的字/词是否为首字母大写,再根据其对应规则查询;另一为当已经判断过待查询的字/词是否为首字母大写,则执行拼写校正与模糊查询功能。The query module 34 performs matching with the database 31 according to the corresponding rules of the judging module 33, and generates a corresponding processing strategy according to the matching result. When the corresponding rules of the word/word to be queried completely match the database, its processing strategy is to output the explanation of the word/word to be queried; when the corresponding rules of the word/word to be queried cannot match the database, its processing strategy is Two, one is to judge whether the word/phrase to be queried is capitalized, and then query according to its corresponding rules; the other is to perform spelling correction and fuzzy query functions when it has been judged whether the word/phrase to be queried is capitalized .

拼写校正与模糊查询模块35,此模块的功能是为于数据库31中找出一个与待查询的字/词最近似的字/词,并输出其相对应的解释作为查询结果。而执行拼写校正与模糊查询模块35有两个时机,其一为当待查询的字/词的对应规则与数据库31无法匹配时,另一为判断模块33判断待查询的字/词不是首字母大写时。Spelling correction and fuzzy query module 35, the function of this module is to find a word/word most similar to the word/word to be queried in the database 31, and output its corresponding explanation as the query result. And carry out spelling correction and fuzzy query module 35 have two opportunities, and one is when the corresponding rule of word/word to be queried and database 31 can't match, and another is that judging module 33 judges that word/word to be queried is not first letter when capitalized.

输出模块36,用以输出翻译结果,其为产生一个窗口,并利用此窗口显示翻译结果。其输出的信息包含所匹配的字词与其相对应的解释,亦即所显示的解释可为完全匹配的结果或模糊查询的结果。The output module 36 is used to output the translation result, which is to generate a window and use this window to display the translation result. The output information includes the matched words and their corresponding explanations, that is, the displayed explanations can be exact matching results or fuzzy query results.

如图4所示,为本发明的计算机可执行的字/词匹配与查询的系统及其方法的详细流程图,亦为本发明较佳实施例。其流程为撷取一个待查询的字/词(步骤400);判断待查询的字/词是否为整句文字全大写(步骤410),如果待查询的字/词属于整句文字全大写,则以待查询的字/词的小写形式查询(步骤420);然后判断数据库中是否有完全匹配的字/词(步骤430),如果有完全匹配的字/词,则输出其翻译结果(步骤440);如果没有完全匹配的字/词,则执行拼写校正与模糊查询功能,匹配一个和待查询的字/词最近似的字/词(步骤450),然后输出其翻译结果(步骤440);如果待查询的字/词不是整句文字全大写,则以待查询的字/词的原字/词查询(步骤460),判断数据库中是否有完全匹配的字/词(步骤470),如果有完全匹配的字/词,则输出其翻译结果(步骤440);如果没有完全匹配的字/词,则判断待查询的字/词是否为首字母大写(步骤480),如果待查询的字/词为首字母大写,则以待查询的字/词的小写形式查询(步骤420),否则执行拼写校正与模糊查询功能(步骤450)。As shown in FIG. 4 , it is a detailed flowchart of the computer-executable word/word matching and query system and its method of the present invention, which is also a preferred embodiment of the present invention. Its process is to extract a word/word to be queried (step 400); judge whether the word/word to be queried is the whole sentence text all capitals (step 410), if the word/word to be queried belongs to the whole sentence text all capitals, Then query (step 420) with the lowercase form of the word/word to be inquired about; Then judge whether there is the word/word (step 430) of complete match in the database, if there is the word/word of complete match, then output its translation result (step 430) 440); If there is no word/word of complete match, then carry out spell correction and fuzzy query function, match one and the word/word (step 450) that is most similar to the word/word to be queried, then output its translation result (step 440) ; If the word/word to be inquired is not the whole sentence text full capitalization, then with the original word/word query (step 460) of the word/word to be inquired about, judge whether there is the word/word (step 470) of complete match in the database, If there is a complete matching word/word, then output its translation result (step 440); if there is no complete matching word/word, then judge whether the word/word to be queried is capitalized (step 480), if the word to be queried If the /word is capitalized, then query with the lowercase form of the word/word to be queried (step 420), otherwise perform spelling correction and fuzzy query functions (step 450).

以上所述,仅为本发明其中的较佳实施例而已,并非用来限定本发明的实施范围;即凡依本发明专利范围所作的均等变化与修饰,皆为本发明专利范围所涵盖。The above description is only one of the preferred embodiments of the present invention, and is not used to limit the implementation scope of the present invention; that is, all equivalent changes and modifications made according to the patent scope of the present invention are covered by the patent scope of the present invention.

Claims (8)

1. the executable character/word of computing machine mates and the system of inquiring about, and wherein this system includes:
One database, this database are to build on the accessible storage medium of a computing machine, store the character/word explanation corresponding with it;
One text-recognition module obtains the literal from input media input, and this literal is carried out identification to produce a character/word to be checked;
One judge module is write according to the alphabet size of this character/word to be checked, determines a corresponding rule;
One enquiry module, this enquiry module finds out a character/word that mates fully according to this rule of correspondence in this database, and the translation of the character/word that should mate fully mutually;
One spelling correction and fuzzy query module, when this enquiry module does not find this character/word that mates fully, in this database, find out one recently like character/word, and this recently like the translation of character/word; And
One video signal output unit, in order to the translation of exporting this character/word that mates fully with should be recently like the translation of character/word.
2. executable character/word coupling of computing machine as claimed in claim 1 and the system of inquiring about, wherein this judge module more comprises one whole literal and capitalizes judge module entirely, in order to judge whether this character/word to be checked belongs to whole sentence literal and capitalize entirely.
3. executable character/word coupling of computing machine as claimed in claim 1 and the system of inquiring about, whether wherein this judge module more comprises an initial caps judge module, be initial caps in order to judge this character/word to be checked.
4. the system of executable character/word coupling of computing machine as claimed in claim 1 and inquiry, wherein this rule of correspondence reaches group's combination of inquiring about with spelling correction and fuzzy query mode for the inquiry of small letter form, primitive form inquiry with this character/word to be checked.
5. the executable character/word of computing machine mates and the method for inquiring about, and is applicable to a computing machine and carries out the following step:
Obtain a character/word to be checked of importing from an input media;
According to the judgement that this character/word alphabet size to be checked is write, determine a corresponding rule;
In a database, find out a character/word that mates fully according to this rule of correspondence judgement, and the translation of the character/word that should mate fully mutually;
When in this database, not finding the character/word that mates fully according to this rule of correspondence, then carry out a spelling correction and fuzzy query function with a spelling correction and fuzzy query module, and in this database, find out one recently like character/word, and this recently like the translation of character/word; And
With a video signal output unit export this character/word that mates fully translation or should be recently like the translation of character/word.
6. executable character/word coupling of computing machine as claimed in claim 5 and the method for inquiring about, wherein this alphabet size is write judgement, for judging whether this character/word to be checked belongs to whole sentence literal and capitalize entirely.
7. executable character/word coupling of computing machine as claimed in claim 5 and the method for inquiring about, wherein this alphabet size is write judgement, whether belongs to initial caps for judging this character/word to be checked.
8. the method for executable character/word coupling of computing machine as claimed in claim 5 and inquiry, wherein this rule of correspondence is for reaching group's combination of inquiring about with spelling correction and fuzzy query mode with the small letter form inquiry of this character/word to be checked, the inquiry of former character/word.
CN 200410001649 2004-01-09 2004-01-09 Computer executable word/phrase matching and querying system and method thereof Expired - Fee Related CN1641635B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200410001649 CN1641635B (en) 2004-01-09 2004-01-09 Computer executable word/phrase matching and querying system and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200410001649 CN1641635B (en) 2004-01-09 2004-01-09 Computer executable word/phrase matching and querying system and method thereof

Publications (2)

Publication Number Publication Date
CN1641635A CN1641635A (en) 2005-07-20
CN1641635B true CN1641635B (en) 2010-05-05

Family

ID=34867170

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200410001649 Expired - Fee Related CN1641635B (en) 2004-01-09 2004-01-09 Computer executable word/phrase matching and querying system and method thereof

Country Status (1)

Country Link
CN (1) CN1641635B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100449542C (en) * 2005-09-14 2009-01-07 腾讯科技(深圳)有限公司 Method and apparatus for matching associated person information

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1250915A (en) * 1998-10-09 2000-04-19 英业达股份有限公司 instant translation system
GB2356074A (en) * 1999-11-02 2001-05-09 Seiko Instr Inc Electronic dictionary

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1250915A (en) * 1998-10-09 2000-04-19 英业达股份有限公司 instant translation system
GB2356074A (en) * 1999-11-02 2001-05-09 Seiko Instr Inc Electronic dictionary

Also Published As

Publication number Publication date
CN1641635A (en) 2005-07-20

Similar Documents

Publication Publication Date Title
US6393389B1 (en) Using ranked translation choices to obtain sequences indicating meaning of multi-token expressions
US8726148B1 (en) Method and apparatus for processing text and character data
US9734197B2 (en) Determining corresponding terms written in different formats
CN102567509B (en) Method and system for instant messaging with visual messaging assistance
CN102016836A (en) Modular system and method for managing Chinese, Japanese and Korean language data in electronic form
JP2004516527A (en) System and method for computer assisted writing with cross language writing wizard
JP2000200291A (en) Method for automatically detecting selected character string in text
JP2003514304A (en) A linguistic input architecture that converts from one text format to another and is resistant to spelling, typing, and conversion errors
JP2006004427A (en) System and method of searching content of complicated languages such as japanese
JP2003527676A (en) A linguistic input architecture that converts one text format to the other text format with modeless input
WO2022135474A1 (en) Information recommendation method and apparatus, and electronic device
CN101211245A (en) text input device
WO2006115825A2 (en) Abbreviated handwritten ideographic entry phrase by partial entry
WO2024164976A1 (en) Sample construction method and apparatus, and electronic device and readable storage medium
CN1641635B (en) Computer executable word/phrase matching and querying system and method thereof
Randhawa et al. Study of spell checking techniques and available spell checkers in regional languages: a survey
JPH11238051A (en) Chinese input conversion processor, chinese input conversion processing method and recording medium stored with chinese input conversion processing program
Saharia et al. LuitPad: a fully unicode compatible Assamese writing software
JP2008059169A (en) Chinese example sentence search device and Chinese example sentence search processing program
TWI259379B (en) Computer executable word/phrase matching and inquiring system and method thereof
JP2621999B2 (en) Document processing device
US20250094828A1 (en) Knowledge graph for semantic searching of handwritten documents
JP2008140074A (en) Example sentence search device and example sentence search processing program
CN1421804A (en) Lexical processing systems and methods for different languages
CN102110129A (en) Electronic dictionary search device and search method using specific functional language

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100505

Termination date: 20120109