CN106933382A - Method and device for updating an input method lexicon
- Publication number
- CN106933382A (CN201710161216.XA)
- Authority
- CN
- China
- Prior art keywords
- word
- field
- chinese character
- input
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
- G06F3/0237—Character input methods using prediction or retrieval techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Document Processing Apparatus (AREA)
Abstract
The disclosure relates to a method and a device for updating an input method lexicon, and belongs to the technical field of input methods. The method includes: deleting at least one Chinese character in an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field, wherein the target word includes at least two Chinese characters and the reserved field includes at least one Chinese character; splicing the reserved field and a target field to form a compound word, wherein the target field includes a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; and adding the compound word to the input method lexicon. The embodiments of the disclosure make the automatic word learning function of the input method application more complete and the vocabulary of the input method lexicon richer, which helps improve input efficiency.
Description
Technical Field
The disclosure relates to the technical field of input methods, and in particular relates to a method and a device for updating an input method lexicon.
Background
Input method applications (simply "input methods") are application programs used to enter various symbols (including numbers, letters, words, etc.) into a computer or other device (e.g., a cell phone). The vocabulary in the input method lexicon determines the number of candidate words that the input method application can provide. In practical application, the input method lexicon needs to be updated to increase the vocabulary volume.
When a user needs to input a word that exists in the input method lexicon, the input method application can display the word directly among the candidate words according to the coding sequence input by the user. When a user needs to input a word that is not in the input method lexicon, the user has to select it character by character to obtain the complete word. For example, assume that the input method lexicon stores the word "party" (coding sequence "juhui") but does not store the word "Juxuan". When the user inputs the coding sequence "juhui", the candidate words that the input method application finds in the input method lexicon include "party". When the user needs to input "Juxuan" and inputs the coding sequence "juxuan", the user has to select "Ju" and "Xuan" from the candidate words in two separate selections, because the word "Juxuan" does not exist in the input method lexicon.
In the related art, after detecting that a user has input a word by character-by-character selection, the input method application adds the word to the input method lexicon if it is not already stored there. This increases the vocabulary of the lexicon, so the user no longer needs to select character by character when inputting the word again. For example, the user selects character by character when inputting "Juxuan" for the first time; after the input method application adds "Juxuan" to the input method lexicon, the user can input "Juxuan" directly the next time.
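The related-art behavior described above can be sketched as follows. This is a minimal illustration, not the implementation of any real input method: the `Lexicon` class, the `commit_char_by_char` function, and the sample Chinese strings are assumptions introduced here for the example.

```python
from collections import defaultdict


class Lexicon:
    """Toy input method lexicon (hypothetical): coding sequence -> candidate words."""

    def __init__(self):
        self._entries = defaultdict(list)

    def add(self, code, word):
        # Store each word once per coding sequence.
        if word not in self._entries[code]:
            self._entries[code].append(word)

    def candidates(self, code):
        # .get() avoids creating an empty entry for unknown coding sequences.
        return list(self._entries.get(code, []))


def commit_char_by_char(lexicon, code, chars):
    """Related-art learning: a word selected character by character is added
    to the lexicon if it is not stored yet."""
    word = "".join(chars)
    if word not in lexicon.candidates(code):
        lexicon.add(code, word)
    return word


lexicon = Lexicon()
lexicon.add("juhui", "聚会")  # illustrative stored word ("party")
```

After `commit_char_by_char(lexicon, "juxuan", ["聚", "绚"])`, a later lookup of "juxuan" returns the learned word directly, which is exactly the case the related art already covers.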
Disclosure of Invention
Embodiments of the disclosure provide a method and a device for updating an input method lexicon. The technical solution is as follows:
According to a first aspect of the embodiments of the present disclosure, there is provided a method for updating an input method lexicon, the method including:
deleting at least one Chinese character in an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
splicing the reserved field and a target field to form a compound word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and adding the compound word to the input method lexicon.
Optionally, the target field includes the first field, and the compound word is a first compound word;
the splicing the reserved field and the target field to form a compound word comprises the following steps:
splicing the first field and the reserved field to form the first compound word; and the last Chinese character of the first field in the first compound word is connected with the first Chinese character of the reserved field.
Optionally, the target field includes the second field, and the compound word is a second compound word;
the splicing the reserved field and the target field to form a compound word comprises the following steps:
splicing the reserved field and the second field to form the second compound word; and the last Chinese character of the reserved field in the second compound word is connected with the first Chinese character of the second field.
Optionally, the target field includes the first field and the second field, and the compound word is a third compound word;
the splicing the reserved field and the target field to form a compound word comprises the following steps:
splicing the first field, the reserved field and the second field in sequence to form the third synthesized word; and the last Chinese character of the first field in the third compound word is connected with the first Chinese character of the reserved field, and the last Chinese character of the reserved field is connected with the first Chinese character of the second field.
Optionally, the method further comprises:
inputting a first term, wherein the first term is the first field;
or,
inputting a first word; deleting at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain the first field;
or,
and inputting a first Chinese character, wherein the first Chinese character is the first field.
Optionally, the method further comprises:
inputting a second term, wherein the second term is the second field;
or,
inputting a second word; deleting at least one Chinese character in the second word according to a deletion operation signal corresponding to the second word to obtain the second field;
or,
and inputting a second Chinese character, wherein the second Chinese character is the second field.
Optionally, the method further comprises:
updating the number of inputs corresponding to the compound word;
and when the number of inputs corresponding to the compound word reaches a preset number, executing the step of adding the compound word to the input method lexicon.
Optionally, the method further comprises:
detecting whether an input word sequence comprises the compound word when the number of inputs corresponding to the compound word is greater than or equal to 1 and less than the preset number;
and if the word sequence comprises the compound word, updating the number of inputs corresponding to the compound word.
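The counting scheme in the two optional steps above can be sketched as follows. The class name `CompoundWordLearner`, its method names, the default preset number, and the sample strings are assumptions made for this example; the source does not specify how counts are stored.

```python
from collections import Counter


class CompoundWordLearner:
    """Hypothetical helper: counts inputs of compound words and adds a word
    to the lexicon once its count reaches a preset number."""

    def __init__(self, preset_times=3):
        self.preset_times = preset_times
        self.input_counts = Counter()
        self.lexicon = set()

    def record(self, compound_word):
        """Update the count; return True if the word is in the lexicon afterwards."""
        if compound_word in self.lexicon:
            return True
        self.input_counts[compound_word] += 1
        if self.input_counts[compound_word] >= self.preset_times:
            self.lexicon.add(compound_word)
            return True
        return False

    def observe_sequence(self, word_sequence):
        """For compound words with 1 <= count < preset number, update the
        count when the input word sequence contains the compound word."""
        for word, count in list(self.input_counts.items()):
            if 1 <= count < self.preset_times and word in word_sequence:
                self.record(word)
```

With `preset_times=2`, one `record` call followed by one later sequence that contains the compound word is enough to move it into the lexicon.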
According to a second aspect of the embodiments of the present disclosure, there is provided an apparatus for updating an input method lexicon, the apparatus including:
the Chinese character deleting module is configured to delete at least one Chinese character in an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
the composite word forming module is configured to splice the reserved field and a target field to form a compound word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and the lexicon updating module is configured to add the compound word to the input method lexicon.
Optionally, the target field includes the first field, and the compound word is a first compound word;
the composite word forming module is configured to splice the first field and the reserved field to form the first composite word; and the last Chinese character of the first field in the first compound word is connected with the first Chinese character of the reserved field.
Optionally, the target field includes the second field, and the compound word is a second compound word;
the composite word forming module is configured to splice the reserved field and the second field to form the second composite word; and the last Chinese character of the reserved field in the second compound word is connected with the first Chinese character of the second field.
Optionally, the target field includes the first field and the second field, and the compound word is a third compound word;
the composite word forming module is configured to sequentially splice the first field, the reserved field and the second field to form the third composite word; and the last Chinese character of the first field in the third compound word is connected with the first Chinese character of the reserved field, and the last Chinese character of the reserved field is connected with the first Chinese character of the second field.
Optionally, the apparatus further comprises:
a first acquisition module configured to: inputting a first term, wherein the first term is the first field; or, inputting a first word; deleting at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain the first field; or inputting a first Chinese character, wherein the first Chinese character is the first field.
Optionally, the apparatus further comprises:
a second acquisition module configured to: inputting a second term, wherein the second term is the second field; or, inputting a second word; deleting at least one Chinese character in the second word according to a deletion operation signal corresponding to the second word to obtain the second field; or inputting a second Chinese character, wherein the second Chinese character is the second field.
Optionally, the apparatus further comprises:
the number updating module is configured to update the number of inputs corresponding to the compound word;
the lexicon updating module is further configured to execute the step of adding the compound word to the input method lexicon when the number of inputs corresponding to the compound word reaches a preset number.
Optionally, the apparatus further comprises:
the compound word detection module is configured to detect whether an input word sequence comprises the compound word when the number of inputs corresponding to the compound word is greater than or equal to 1 and less than the preset number;
the number updating module is further configured to update the number of inputs corresponding to the compound word when the word sequence comprises the compound word.
According to a third aspect of the embodiments of the present disclosure, there is provided an apparatus for updating an input method lexicon, the apparatus including:
a processor;
a memory for storing executable instructions of the processor;
wherein the processor is configured to:
deleting at least one Chinese character in an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
splicing the reserved field and a target field to form a compound word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and adding the compound word to the input method lexicon.
The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects:
A reserved field, obtained after some Chinese characters of a target word are deleted, is spliced with a target field to form a compound word, and the compound word is added to the input method lexicon. This addresses the technical problem in the related art that the automatic word learning function provided by the input method application is incomplete, which leaves the vocabulary of the input method lexicon incomplete and reduces the user's input efficiency. Automatic word learning is achieved for word sequences that are input by forming words and then deleting Chinese characters, so the automatic word learning function is more complete, the vocabulary of the input method lexicon is richer, and input efficiency is improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flow diagram illustrating a method of updating an input method lexicon in accordance with an exemplary embodiment;
FIG. 2 is a flow diagram illustrating a method of updating an input method lexicon in accordance with another exemplary embodiment;
FIG. 3 is a block diagram illustrating an apparatus for updating an input method lexicon in accordance with an exemplary embodiment;
FIG. 4 is a block diagram illustrating an apparatus for updating an input method lexicon in accordance with another exemplary embodiment;
FIG. 5 is a block diagram illustrating an apparatus in accordance with an example embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
In the related art, although the input method application automatically learns words that a user inputs by character-by-character selection, this automatic word learning function is still incomplete. When a user needs to input a word sequence that does not exist in the input method lexicon, such as a person's name, a self-made word, or a new word, the user often avoids character-by-character selection as follows: for each Chinese character in the word sequence, the user first inputs a common word that contains the Chinese character (the common word exists in the input method lexicon) and then deletes the other Chinese characters of the common word, leaving only the Chinese character that is needed. For example, to input "Juxuan", the user may input "party" and delete its other Chinese character, leaving "Ju", and then input "gorgeous" and delete its other Chinese character, leaving "Xuan", so that "Juxuan" finally remains. In this situation, because the user did not select the Chinese characters one by one, the automatic word learning function of the input method application does not add "Juxuan" to the input method lexicon. The user therefore still has to select character by character, or form words and then delete Chinese characters, when inputting "Juxuan" again, which is cumbersome and inefficient.
Based on the above, the embodiments of the present disclosure provide a method for updating a lexicon of an input method, and an apparatus based on the method, so as to solve the problems in the related art. The technical scheme provided by the embodiment of the disclosure can realize automatic word learning of the word sequence input by adopting the mode of deleting Chinese characters after word grouping, so that the automatic word learning function is more complete, the vocabulary of the input method word stock is richer, and the input efficiency is improved.
In the method provided by the embodiments of the disclosure, the execution subject of each step may be a terminal, for example a mobile phone, a tablet computer, an e-book reader, a multimedia player, a PDA (Personal Digital Assistant), a wearable device, a computer, or another terminal. Optionally, an input method application with a Chinese input function is installed and runs in the terminal, and the execution subject of each step may be the input method application. In the following method embodiments, unless otherwise specified, the execution subject of each step is described as the input method application only as an example, to simplify the description; the disclosure is not limited thereto.
FIG. 1 is a flow diagram illustrating a method of updating an input method lexicon according to an exemplary embodiment. The method may include the steps of:
in step 101, at least one Chinese character in the target word is deleted according to a deletion operation signal corresponding to the input target word, and a reserved field is obtained.
The target word includes at least two Chinese characters and the reserved field includes at least one Chinese character.
The target words are words already stored in the input method lexicon. The input method word bank is used for storing each candidate word provided by the input method application. The words stored in the input method word library can be words with high use frequency, words conforming to the input habit of the user, words conforming to the Chinese use rule, and the like.
When a user inputs a target word, the input method application acquires the coding sequence corresponding to the target word and provides the target word directly as a candidate word, so the user can select the target word from the candidate words provided by the input method application. Taking the target word "party" as an example, after the user inputs the coding sequence "juhui", the candidate words provided by the input method application include "party", "moment", and the like.
Optionally, the delete operation signal is generated by the terminal device after receiving a trigger signal corresponding to a control implementing the delete function, where the control may be an entity control or a virtual control. Illustratively, the control implementing the delete function is an enter key or a delete key in the keyboard.
There may be one deletion operation signal or several.
When there is one deletion operation signal, the input method application first acquires a selection signal corresponding to the Chinese characters to be deleted in the target word and then acquires the deletion operation signal, which deletes the selected Chinese characters. Illustratively, the target word is "smooth"; the input method application acquires a selection signal covering two Chinese characters of "smooth" and then, on acquiring one deletion operation signal, deletes both characters at once.
When there are several deletion operation signals, the input method application deletes the Chinese characters of the target word one after another according to those signals. Illustratively, the target word is "smooth"; the input method application deletes one Chinese character of "smooth" according to the first deletion operation signal and another according to the second deletion operation signal.
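The two deletion modes can be sketched as follows. The function names and the sample four-character string are assumptions for illustration. The sketch also assumes that each of several deletion signals removes the last remaining character, as a backspace would; the source does not state which character each signal removes.

```python
def delete_selected(word, start, end):
    """One deletion signal after a selection: remove word[start:end] at once."""
    return word[:start] + word[end:]


def delete_one_by_one(word, signal_count):
    """Several deletion signals: each one removes the last remaining
    character (assumed backspace-like behavior)."""
    for _ in range(signal_count):
        word = word[:-1]
    return word
```

For a four-character word, `delete_selected(word, 0, 2)` keeps the last two characters, while `delete_one_by_one(word, 2)` keeps the first two.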
The reserved field consists of the Chinese characters that remain after some of the Chinese characters of the target word are deleted. Illustratively, when the target word is "combination", the reserved field obtained after the input method application deletes part of its Chinese characters is "knot". Likewise, when the target word is "smooth", the reserved field is the part of "smooth" that remains after the deletion.
Optionally, the input method application detects whether the reserved field contains only one Chinese character; if so, the following step 102 is executed; if not, step 102 is not executed and the flow ends. In this way, the input method application triggers the automatic word learning function provided by this embodiment only when a single Chinese character remains after the user's deletion, and does not trigger it when a word of two or more Chinese characters remains.
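Step 101 and the optional single-character check can be sketched as follows. The function names are hypothetical, and the sketch assumes that characters are deleted from the end of the target word; the constraints on lengths follow the text (a target word of at least two Chinese characters, a reserved field of at least one).

```python
def reserved_field(target_word, deleted_count):
    """Return the characters left after deleting `deleted_count` characters
    from the end of the target word (deletion position is an assumption)."""
    if len(target_word) < 2:
        raise ValueError("target word needs at least two Chinese characters")
    if not 1 <= deleted_count < len(target_word):
        raise ValueError("delete at least one character and keep at least one")
    return target_word[: len(target_word) - deleted_count]


def should_splice(reserved):
    """Optional gate: continue to step 102 only for a single remaining character."""
    return len(reserved) == 1
```

A two-character word with one deletion passes the gate; a four-character word with two deletions leaves a two-character remainder and ends the flow.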
In step 102, the reserved field and the target field are concatenated to form a compound word.
The target field includes: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field. The first field comprises at least one Chinese character, and the first field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word. The second field comprises at least one Chinese character, and the second field is one Chinese character, one word, or the Chinese characters remaining after at least one Chinese character is deleted from a word.
The manner in which the compound word is formed is described below in connection with several possible implementations of the target field.
In a first possible implementation, the target field includes a first field, and the compound word is a first compound word. The input method application concatenates the first field and the reserved field to form a first compound word. Wherein the last Chinese character of the first field in the first compound word is connected with the first Chinese character of the reserved field.
For example, the user inputs the two words "Mucun" and "exploitation" in sequence, and the input method application deletes one Chinese character of the target word "exploitation" according to the deletion operation signal to obtain the reserved field "Tu". The word "Mucun", which is located before and adjacent to the reserved field, is the first field. The input method application splices the first field "Mucun" and the reserved field "Tu" to form the first compound word "Mucun Tu".
Optionally, the first field is formed in one of the following ways.
1. The input method application inputs a first word, and the first word is the first field. The first word is located before and adjacent to the reserved field. Illustratively, the user inputs the two words "party" and "gorgeous" in sequence. The reserved field "Xuan" is obtained by deleting one Chinese character of the target word "gorgeous", and the first word "party", located before and adjacent to the reserved field, is determined as the first field. The input method application splices the first field "party" and the reserved field "Xuan" to form a first compound word, "party" followed by "Xuan".
2. The input method application inputs a first word and deletes at least one Chinese character in the first word according to the deletion operation signal corresponding to the first word, to obtain the first field. That is, the Chinese characters remaining after the input method application deletes part of the Chinese characters of the first word are determined as the first field, where the first word is located before and adjacent to the reserved field. Illustratively, the user inputs "party" and "gorgeous" in sequence. The reserved field "Xuan" is obtained by deleting one Chinese character of the target word "gorgeous", and the remaining Chinese character "Ju", obtained by deleting one Chinese character of the first word "party", is determined as the first field. The input method application splices the first field "Ju" and the reserved field "Xuan" to form the first compound word "Juxuan".
Optionally, the input method application detects whether only one Chinese character remains in the first field after deletion; if so, the reserved field and the first field are spliced to form a compound word; if not, the splicing step is not executed and the flow ends.
3. The input method application inputs a first Chinese character, and the first Chinese character is the first field. The first Chinese character is located before and adjacent to the reserved field. Illustratively, the user inputs the Chinese character "Ju" and the word "gorgeous" in sequence. The reserved field "Xuan" is obtained by deleting one Chinese character of the target word "gorgeous", and the first Chinese character "Ju", located before and adjacent to the reserved field, is determined as the first field. The input method application splices the first field "Ju" and the reserved field "Xuan" to form the first compound word "Juxuan".
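The three ways of obtaining the first field can be covered by one small data model, sketched below. `CommittedUnit` and `first_field_for` are hypothetical names, the sample strings are illustrative, and trailing deletion is an assumption of this sketch.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CommittedUnit:
    """A word or single Chinese character the user committed, plus how many
    trailing characters were deleted afterwards (0 if none)."""
    text: str
    deleted: int = 0

    @property
    def remaining(self) -> str:
        # len(...) - deleted keeps the whole text when nothing was deleted.
        return self.text[: len(self.text) - self.deleted]


def first_field_for(units: List[CommittedUnit], index: int) -> Optional[str]:
    """Return the field immediately before units[index], or None if absent.

    Covers all three cases: an intact word (deleted == 0), the remainder of
    a word after deletion, or a single committed Chinese character."""
    if index == 0:
        return None
    return units[index - 1].remaining or None
```

For instance, with `units = [CommittedUnit("聚会", 1), CommittedUnit("绚丽", 1)]`, the first field for the second unit is the remainder of the first word, and joining it with `units[1].remaining` gives the first compound word.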
In a second possible implementation, the target field includes a second field, and the compound word is a second compound word. The input method application concatenates the reserved field and the second field to form a second compound word. And the last Chinese character of the reserved field in the second compound word is connected with the first Chinese character of the second field.
For example, the user inputs the two words "pool water" and "field" in sequence. The input method application deletes "water" from the target word "pool water" according to the deletion operation signal to obtain the reserved field "pool", and the word "field", located after and adjacent to the reserved field, is the second field. The input method application splices the reserved field "pool" and the second field "field" to form the second compound word "pool field".
Optionally, the second field is formed in one of the following ways.
1. The input method application inputs a second word, and the second word is the second field. The second word is located after and adjacent to the reserved field. Illustratively, the user inputs the two words "party" and "gorgeous" in sequence. The reserved field "Ju" is obtained by deleting one Chinese character of the target word "party", and the second word "gorgeous", located after and adjacent to the reserved field, is determined as the second field. The input method application splices the reserved field "Ju" and the second field "gorgeous" to form a second compound word, "Ju" followed by "gorgeous".
2. The input method application inputs a second word and deletes at least one Chinese character in the second word according to the deletion operation signal corresponding to the second word, to obtain the second field. That is, the Chinese characters remaining after the input method application deletes part of the Chinese characters of the second word are determined as the second field, where the second word is located after and adjacent to the reserved field. Illustratively, the user inputs "party" and "gorgeous" in sequence. The reserved field "Ju" is obtained by deleting one Chinese character of the target word "party", and the remaining Chinese character "Xuan", obtained by deleting one Chinese character of the second word "gorgeous", is determined as the second field. The input method application splices the reserved field "Ju" and the second field "Xuan" to form the second compound word "Juxuan".
Optionally, the input method application detects whether only one Chinese character remains in the second field after deletion; if so, the reserved field and the second field are spliced to form a compound word; if not, the splicing step is not executed and the flow ends.
3. The input method application inputs a second Chinese character, and the second Chinese character is the second field. The second Chinese character is located after and adjacent to the reserved field. Illustratively, the user inputs the word "party" and the Chinese character "Xuan" in sequence. The reserved field "Ju" is obtained by deleting one Chinese character of the target word "party", and the second Chinese character "Xuan", located after and adjacent to the reserved field, is determined as the second field. The input method application splices the reserved field "Ju" and the second field "Xuan" to form the second compound word "Juxuan".
In a third possible implementation, the target field includes a first field and a second field, and the compound word is a third compound word. The input method application splices the first field, the reserved field and the second field in sequence to form the third compound word. In the third compound word, the last Chinese character of the first field is immediately followed by the first Chinese character of the reserved field, and the last Chinese character of the reserved field is immediately followed by the first Chinese character of the second field.
For example, the user sequentially inputs "clear water", "beauty" and "noble", the input method application deletes "beauty" in the target word "beauty" according to the deletion operation signal, resulting in a reserved field "beauty", the "clear water" adjacent to and before the reserved field is determined as a first field, and the "noble" adjacent to and after the reserved field is determined as a second field. The input method application sequentially splices the first field 'clear water', the reserved field 'beauty' and the second field 'noble' to form a third synthetic word 'clear water beauty'.
For how the first field and the second field are formed, refer to the description above; details are not repeated here.
In practical applications, the input method application may form at least one of the first compound word, the second compound word, and the third compound word.
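The three splicing forms described above can be sketched as follows. This is only a minimal illustration under stated assumptions, not the disclosed implementation: the function name `splice_compound` and its parameters are hypothetical, and the Latin letters are placeholders that each stand in for one Chinese character.

```python
from typing import Optional


def splice_compound(reserved: str,
                    first_field: Optional[str] = None,
                    second_field: Optional[str] = None) -> str:
    """Splice the reserved field with the adjacent target field(s).

    Hypothetical helper: first_field precedes the reserved field and
    second_field follows it; at least one of the two must be present.
    """
    if not reserved:
        raise ValueError("the reserved field needs at least one character")
    if not first_field and not second_field:
        raise ValueError("the target field needs a first and/or a second field")
    # The order is always first field, reserved field, second field, so the
    # last character of one piece is directly followed by the first character
    # of the next piece.
    return (first_field or "") + reserved + (second_field or "")


# Placeholder letters stand in for Chinese characters.
first_compound = splice_compound("C", first_field="AB")
second_compound = splice_compound("C", second_field="D")
third_compound = splice_compound("C", first_field="AB", second_field="D")
```

With these placeholder inputs the first, second and third compound words are "ABC", "CD" and "ABCD" respectively.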
In step 103, the compound word is added to the input method lexicon.
The input method application adds the compound word to the input method lexicon to update the input method lexicon.
Optionally, the input method application adds the compound word to either or both of the local input method lexicon and the input method lexicon of the server. The server is a background server corresponding to the input method application.
When the input method application adds the compound word to the input method lexicon of the server, the input method lexicon can also provide the compound word as a candidate word to other users who input it, thereby improving the input efficiency of other users.
Optionally, after step 103, the following steps are further included: acquiring a coding sequence corresponding to the synthetic word; searching and acquiring at least one candidate word corresponding to the coding sequence from an input method word bank, wherein the at least one candidate word comprises a synthetic word; and displaying the acquired candidate words.
Taking the pinyin input method as an example, the coding sequence may be a complete pinyin corresponding to the synthesized word, a complete pinyin of the first Chinese character in the synthesized word and pinyin first letters of other Chinese characters in the synthesized word, or a pinyin first letter of each Chinese character in the synthesized word, and so on. Illustratively, for the compound word "polyaromatic", its corresponding coding sequence may be "jufang", may be "juf", may also be "jf", etc.
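The three kinds of coding sequence named above, and the candidate lookup of the preceding optional step, can be sketched as follows. This is an illustrative sketch only: the function names, the dictionary-based lexicon and the `"<compound word>"` placeholder are assumptions, and the pinyin syllables are supplied by the caller rather than derived from Chinese characters.

```python
from typing import Dict, List


def coding_sequences(syllables: List[str]) -> List[str]:
    """Return the full pinyin, first-syllable-plus-initials, and all-initials codes."""
    if not syllables or any(not s for s in syllables):
        raise ValueError("one non-empty pinyin syllable per character is required")
    full = "".join(syllables)
    first_full = syllables[0] + "".join(s[0] for s in syllables[1:])
    initials = "".join(s[0] for s in syllables)
    codes: List[str] = []
    for code in (full, first_full, initials):
        if code not in codes:  # a one-character word yields duplicate codes
            codes.append(code)
    return codes


def index_word(lexicon: Dict[str, List[str]], word: str, syllables: List[str]) -> None:
    """Register a word under each of its coding sequences (hypothetical layout)."""
    for code in coding_sequences(syllables):
        candidates = lexicon.setdefault(code, [])
        if word not in candidates:
            candidates.append(word)


lexicon: Dict[str, List[str]] = {}
index_word(lexicon, "<compound word>", ["ju", "fang"])
codes = coding_sequences(["ju", "fang"])   # ['jufang', 'juf', 'jf']
candidates = lexicon.get("juf", [])        # ['<compound word>']
```

Indexing the word under every coding sequence at insertion time keeps the later candidate lookup a single dictionary access; a real lexicon would probably also rank candidates, which the text does not describe.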
In summary, in the method provided by the embodiment of the present disclosure, the reserved field obtained after deleting some of the Chinese characters in the target word is spliced with the target field to form a compound word, and the compound word is added to the input method lexicon. This solves the technical problem in the related art that the automatic word learning function provided by the input method application is not complete enough, which leaves the vocabulary of the input method lexicon insufficiently comprehensive and reduces the input efficiency of the user. Automatic word learning is achieved for word sequences entered by inputting words and then deleting Chinese characters, so that the automatic word learning function is more complete, the vocabulary of the input method lexicon is richer, and input efficiency is improved.
In an alternative embodiment based on the embodiment shown in FIG. 1, and referring to FIG. 2, the following steps may further be included before step 103.
In step 104, the input times corresponding to the compound word are updated.
For any compound word, the input method application increases the input times corresponding to the compound word by 1 each time the compound word is formed. Taking the compound word "gather gorgeous" as an example, after the input method application forms the compound word "gather gorgeous" for the first time, the corresponding input times are updated from 0 to 1; after the input method application forms it for the second time, the input times are updated from 1 to 2, and so on. For a given compound word, the way in which it is formed may be the same or different each time.
In step 105, it is detected whether the input times corresponding to the compound word exceed a preset number of times. If yes, step 103 is performed; if not, the flow ends.
The preset number of times is set according to how frequently a word must be used to qualify for the input method lexicon. The input method lexicon may be the local input method lexicon of the terminal or the input method lexicon of the server.
The local input method lexicon of the terminal provides candidate words for only one terminal, so its requirement on word usage frequency can be lower. For example, the preset number of times is set to 3 for updating the local input method lexicon of the terminal. The input method lexicon of the server provides candidate words for all terminals, so its requirement on word usage frequency can be higher. For example, the preset number of times is set to 100 for updating the input method lexicon of the server.
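Steps 104 and 105 can be sketched as a counter with a threshold. The class and method names below are hypothetical. The text describes the condition both as the input times exceeding the preset number and, in the apparatus embodiment, as reaching it; this sketch assumes "reaching" (greater than or equal to), which is an assumption rather than something the text settles.

```python
from collections import Counter
from typing import Set


class ThresholdedLexicon:
    """Hypothetical lexicon that admits a compound word after enough inputs."""

    def __init__(self, preset_times: int) -> None:
        if preset_times < 1:
            raise ValueError("preset_times must be at least 1")
        self.preset_times = preset_times
        self.input_times: Counter = Counter()
        self.words: Set[str] = set()

    def record_compound(self, compound: str) -> bool:
        """Step 104 (count) then step 105 (check); True once the word is added."""
        self.input_times[compound] += 1
        # Assumption: the word is added when the count reaches the preset number.
        if self.input_times[compound] >= self.preset_times:
            self.words.add(compound)  # step 103
            return True
        return False


# Thresholds taken from the examples in the text: 3 locally, 100 on the server.
local = ThresholdedLexicon(preset_times=3)
server = ThresholdedLexicon(preset_times=100)
results = [local.record_compound("AB") for _ in range(3)]
```

Here "AB" is a placeholder compound word; it enters the local lexicon on the third formation, while a server-side lexicon configured with 100 would still be waiting.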
Optionally, for any compound word whose input times are greater than or equal to 1 and less than the preset number of times, the input method application detects whether the compound word is included in the input word sequence; if the word sequence includes the compound word, the input times corresponding to the compound word are updated.
While the input times corresponding to the compound word are greater than or equal to 1 and less than the preset number of times, the input times are updated whenever the input method application detects that the compound word is included in the word sequence input by the user, regardless of how the compound word was input or formed. For example, the input method application may form the compound word according to the method provided in the embodiment shown in FIG. 1, obtain the compound word input in a word-by-word manner, or obtain the compound word input by copying, and so on.
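The detection described above can be sketched as a scan of the input word sequence for pending compound words. The function name and the dictionary of counts are hypothetical, and the sketch assumes that one input sequence increments a count at most once, which the text does not specify.

```python
from typing import Dict


def update_pending_counts(input_text: str,
                          input_times: Dict[str, int],
                          preset_times: int) -> None:
    """Increment the count of each pending compound word found in input_text."""
    for compound, times in list(input_times.items()):
        pending = 1 <= times < preset_times
        # How the compound word got into the text (splicing, typing, pasting)
        # does not matter; only its presence in the sequence is checked.
        if pending and compound in input_text:
            input_times[compound] = times + 1


# Placeholder letters stand in for Chinese characters.
input_times = {"AB": 1, "CD": 3, "EF": 2}
update_pending_counts("xxABxxCDxx", input_times, preset_times=3)
# "AB" is pending and present, so it goes from 1 to 2.
# "CD" has already reached the preset number, so it is unchanged.
# "EF" is pending but absent, so it is unchanged.
```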
In summary, the method provided in the embodiment of the present disclosure further checks the input times corresponding to the compound word and adds the compound word to the input method lexicon only when the input times exceed the preset number of times, thereby screening out compound words with a low frequency of use and keeping them out of the input method lexicon.
The technical solution provided by the present disclosure is described below by several examples.
In a first aspect, in a case where the target field includes the first field, the input method application generating the compound word includes several examples as follows.
In a first example, the method comprises the following steps:
1. inputting a first word;
2. deleting at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain a first field;
3. inputting a target word;
4. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
5. splicing the first field and the reserved field to form a synthesized word;
6. and adding the synthesized word to the input method word bank.
The first word is located before the target word and adjacent to the target word.
For example, the user inputs the first word "build a country" and then deletes the "country" word to obtain the first field "build", the user inputs the target word "fragrant" and then deletes the "fragrant" word to obtain the reserved field "fen", the input method application splices the first field "build" and the reserved field "fen" to form the synthetic word "build fen", and adds the synthetic word "build fen" to the input method lexicon.
In addition, the execution sequence of steps 1 to 4 is not limited in this embodiment. For example, the execution sequence of steps 1 to 4 may be: 1, 2, 3, 4; or 1, 3, 2, 4; or 1, 3, 4, 2; or 3, 4, 1, 2; or 3, 1, 2, 4; or 3, 1, 4, 2.
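The six orders listed above coincide with the orderings in which each word is input before its own deletion (step 1 before step 2, and step 3 before step 4). That reading is an inference from the list rather than a rule stated in the text; the short sketch below, with a hypothetical function name, enumerates the orderings under that assumption.

```python
from itertools import permutations
from typing import List, Tuple


def valid_orders() -> List[Tuple[int, ...]]:
    """Enumerate orders of steps 1-4 where each input precedes its deletion."""
    orders = []
    for order in permutations((1, 2, 3, 4)):
        position = {step: index for index, step in enumerate(order)}
        # Assumed constraints: a word must be input before it can be edited.
        if position[1] < position[2] and position[3] < position[4]:
            orders.append(order)
    return orders


orders = valid_orders()
```

The enumeration yields exactly six orderings, the same set as the six given in the text.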
In a second example, the following steps are included:
1. inputting a first Chinese character;
2. inputting a target word;
3. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
4. splicing the first Chinese character and the reserved field to form a synthesized word;
5. and adding the synthesized word to the input method word bank.
The first Chinese character is positioned in front of the target word and is adjacent to the target word.
For example, the user inputs the first Chinese character "Jian", the user inputs the target word "fragrant" and then deletes the "fragrant" word to obtain the reserved field "fen", the input method application splices the first Chinese character "Jian" and the reserved field "fen" to form the synthesized word "Jianfen", and adds the synthesized word "Jianfen" to the input method lexicon.
In addition, the execution sequence of steps 1 to 3 is not limited in this embodiment. For example, the execution sequence of steps 1 to 3 may be: 1, 2, 3; or 2, 1, 3; or 2, 3, 1.
In a third example, the following steps are included:
1. inputting a first word;
2. inputting a target word;
3. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
4. splicing the first word and the reserved field to form a synthesized word;
5. and adding the synthesized word to the input method word bank.
The first word is located before the target word and adjacent to the target word.
For example, the user inputs the first word "honda", the user deletes the "best" word after inputting the target word "best" to obtain the reserved field "best", the input method application splices the first word "honda" and the reserved field "best" to form the compound word "honda best", and adds the compound word "honda best" to the input method lexicon.
In addition, the execution sequence of steps 1 to 3 is not limited in this embodiment. For example, the execution sequence of steps 1 to 3 may be: 1, 2, 3; or 2, 1, 3; or 2, 3, 1.
Optionally, in the above examples, the input method application may detect the time interval between the operation times of the two input operations. If the time interval is smaller than a preset duration, the step of splicing to form the compound word is performed; if the time interval is greater than the preset duration, that step is not performed. The operation time of each input operation may be the start time or the end time of that input operation. In this way, two input operations separated by a long interval are not combined into a word even if a character deletion operation is performed, which helps reduce mistaken word formation.
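The interval check can be sketched as a single comparison. The function and parameter names are hypothetical, and times are assumed to be in seconds. The text does not say what happens when the interval equals the preset duration; this sketch assumes that no splicing is performed in that case.

```python
def within_preset_interval(first_op_time: float,
                           second_op_time: float,
                           preset_duration: float) -> bool:
    """Return True if the two input operations are close enough to be spliced."""
    interval = abs(second_op_time - first_op_time)
    # Assumption: an interval exactly equal to the preset duration does not qualify.
    return interval < preset_duration


# Operation times may be start times or end times, as long as both
# operations use the same convention.
close_enough = within_preset_interval(10.0, 12.5, preset_duration=5.0)
too_far_apart = within_preset_interval(10.0, 40.0, preset_duration=5.0)
```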
In a second aspect, in a case where the target field includes the second field, the input method application generating the compound word includes the following several examples.
In a first example, the method comprises the following steps:
1. inputting a target word;
2. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
3. inputting a second word;
4. deleting at least one Chinese character in the second word according to the deletion operation signal corresponding to the second word to obtain a second field;
5. splicing the reserved field and the second field to form a synthesized word;
6. and adding the synthesized word to the input method word bank.
The second word is located behind the target word and adjacent to the target word.
For example, the user inputs the target word "dan yang" and deletes the "yang" character to obtain the reserved field "dan", the user inputs the second word "pavilion jade stand" and deletes the "pavilion jade stand" to obtain the second field "pavilion", the input method application splices the reserved field "dan" and the second field "pavilion" to form the compound word "dan pavilion", and adds the compound word "dan pavilion" to the input method lexicon.
In addition, the execution sequence of steps 1 to 4 is not limited in this embodiment. For example, the execution sequence of steps 1 to 4 may be: 1, 2, 3, 4; or 1, 3, 2, 4; or 1, 3, 4, 2; or 3, 4, 1, 2; or 3, 1, 2, 4; or 3, 1, 4, 2.
In a second example, the following steps are included:
1. inputting a target word;
2. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
3. inputting a second Chinese character;
4. splicing the reserved field and the second Chinese character to form a synthesized word;
5. and adding the synthesized word to the input method word bank.
The second Chinese character is positioned behind the target word and is adjacent to the target word.
For example, the user inputs the target word "Changping" and then deletes the "Pingping" to obtain the reserved field "Changping", the user inputs the second Chinese character "Jian", the input method uses the reserved field "Changping" and the second Chinese character "Jiangping" to form the composite word "Changping", and the composite word "Changping" is added to the input method lexicon.
In addition, the execution sequence of steps 1 to 3 is not limited in this embodiment. For example, the execution sequence of steps 1 to 3 may be: 1, 2, 3; or 3, 1, 2; or 1, 3, 2.
In a third example, the following steps are included:
1. inputting a target word;
2. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
3. inputting a second word;
4. splicing the reserved field and the second word to form a synthesized word;
5. and adding the synthesized word to the input method word bank.
The second word is located behind the target word and adjacent to the target word.
For example, after the user inputs the target word "handkerchief", the word "handedness" is deleted to obtain the reserved field "handkerchief", the user inputs the second word "stan", the input method application splices the reserved field "handkerchief" and the second word "stan" to form the compound word "palstein", and adds the compound word "palstein" to the input method lexicon.
In addition, the execution sequence of steps 1 to 3 is not limited in this embodiment. For example, the execution sequence of steps 1 to 3 may be: 1, 2, 3; or 1, 3, 2; or 3, 1, 2.
Optionally, in the above examples, the input method application may detect the time interval between the operation times of the two input operations. If the time interval is smaller than a preset duration, the step of splicing to form the compound word is performed; if the time interval is greater than the preset duration, that step is not performed. The operation time of each input operation may be the start time or the end time of that input operation. In this way, two input operations separated by a long interval are not combined into a word even if a character deletion operation is performed, which helps reduce mistaken word formation.
In a third aspect, where the target field includes a first field and a second field, the input method application generating the compound word includes a combination of any of the examples provided in the first aspect and any of the examples provided in the second aspect. In the following, only the combination of the first example provided by the first aspect and the second example provided by the second aspect is taken as an example for exemplary description, and other combination situations are not repeated. The method comprises the following steps:
1. inputting a first word;
2. deleting at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain a first field;
3. inputting a target word;
4. deleting at least one Chinese character in the target word according to the deletion operation signal corresponding to the target word to obtain a reserved field;
5. inputting a second word;
6. deleting at least one Chinese character in the second word according to the deletion operation signal corresponding to the second word to obtain a second field;
7. splicing the first field, the reserved field and the second field in sequence to form a compound word;
8. and adding the synthesized word to the input method word bank.
The first word is positioned in front of the target word and adjacent to the target word; the second word is located behind the target word and adjacent to the target word.
For example, the user inputs the first word "mountain peak" and then deletes the "peak" word to obtain the first field "mountain", the user inputs the target word "original" and then deletes the "original" word to obtain the reserved field "original", the user inputs the second word "groom" and then deletes the "new" word to obtain the second field "groom", the input method application sequentially splices the first field "mountain", the reserved field "this" and the second field "groom" to form the composite word "shanbenlang", and adds the composite word "shanbenlang" to the input method lexicon.
In addition, the execution sequence of the steps 1 to 6 is not limited in this embodiment.
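The eight steps of the third aspect can be sketched end to end as follows. The function names are hypothetical, the two-letter strings are placeholders in which each letter stands in for one Chinese character, and deletions are modelled as character positions, which is an assumption about how a deletion operation signal might be represented.

```python
from typing import Iterable, List, Set, Tuple


def remaining_characters(word: str, deleted_positions: Iterable[int]) -> str:
    """Return the characters left in word after the given positions are deleted."""
    deleted = set(deleted_positions)
    kept = "".join(ch for i, ch in enumerate(word) if i not in deleted)
    if not kept:
        raise ValueError("at least one character must remain")
    return kept


def compound_from_inputs(inputs: List[Tuple[str, List[int]]],
                         lexicon: Set[str]) -> str:
    """Steps 1-8: derive each field, splice in on-screen order, add to lexicon."""
    fields = [remaining_characters(word, deleted) for word, deleted in inputs]
    compound = "".join(fields)  # first field + reserved field + second field
    lexicon.add(compound)
    return compound


lexicon: Set[str] = set()
# First word "AB" loses its last character, the target word "CD" and the
# second word "EF" each lose their first character.
compound = compound_from_inputs([("AB", [1]), ("CD", [0]), ("EF", [0])], lexicon)
```

With these placeholders the first field is "A", the reserved field is "D", the second field is "F", and the compound word added to the lexicon is "ADF".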
The following are embodiments of the disclosed apparatus that may be used to perform embodiments of the disclosed methods. For details not disclosed in the embodiments of the apparatus of the present disclosure, refer to the embodiments of the method of the present disclosure.
FIG. 3 is a block diagram illustrating an apparatus for updating an input method lexicon according to an example embodiment. The apparatus has the functions of implementing the above method examples, and the functions may be implemented by hardware or by hardware executing corresponding software. The apparatus may include: a Chinese character deleting module 301, a compound word forming module 302 and a lexicon updating module 303.
A Chinese character deleting module 301 configured to delete at least one Chinese character in the target word according to a deleting operation signal corresponding to the input target word, resulting in a reserved field. Wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character.
A compound word forming module 302 configured to splice the reserved field and the target field to form a compound word. The target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field. The first field comprises at least one Chinese character, and the first field is one Chinese character, or the Chinese characters remaining after at least one Chinese character is deleted from a word. The second field comprises at least one Chinese character, and the second field is one Chinese character, or the Chinese characters remaining after at least one Chinese character is deleted from a word.
A lexicon updating module 303 configured to add the compound word to the input method lexicon.
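The division into the three modules can be sketched as three small classes wired together. All class and method names are hypothetical, the modules could equally be realized in hardware as the text notes, and the letters are placeholders for Chinese characters.

```python
from typing import Iterable, Optional, Set


class CharacterDeletingModule:
    """Stands in for module 301: derives the reserved field."""

    def delete(self, target_word: str, positions: Iterable[int]) -> str:
        if len(target_word) < 2:
            raise ValueError("the target word needs at least two characters")
        deleted = set(positions)
        reserved = "".join(ch for i, ch in enumerate(target_word) if i not in deleted)
        if not deleted or not reserved:
            raise ValueError("delete at least one character and keep at least one")
        return reserved


class CompoundWordFormingModule:
    """Stands in for module 302: splices the reserved and target fields."""

    def form(self, reserved: str, first_field: Optional[str] = None,
             second_field: Optional[str] = None) -> str:
        if not first_field and not second_field:
            raise ValueError("a first and/or second field is required")
        return (first_field or "") + reserved + (second_field or "")


class LexiconUpdatingModule:
    """Stands in for module 303: adds compound words to the lexicon."""

    def __init__(self) -> None:
        self.lexicon: Set[str] = set()

    def add(self, compound: str) -> None:
        self.lexicon.add(compound)


class LexiconUpdateApparatus:
    """Hypothetical composition of the three modules."""

    def __init__(self) -> None:
        self.deleting = CharacterDeletingModule()
        self.forming = CompoundWordFormingModule()
        self.updating = LexiconUpdatingModule()

    def handle(self, target_word: str, positions: Iterable[int],
               first_field: Optional[str] = None,
               second_field: Optional[str] = None) -> str:
        reserved = self.deleting.delete(target_word, positions)
        compound = self.forming.form(reserved, first_field, second_field)
        self.updating.add(compound)
        return compound


apparatus = LexiconUpdateApparatus()
word = apparatus.handle("CD", [0], first_field="A", second_field="F")
```

In this run the reserved field is "D" and the compound word "ADF" ends up in the lexicon held by the updating module.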
In summary, the apparatus provided in the embodiment of the present disclosure splices the reserved field obtained after deleting some of the Chinese characters in the target word with the target field to form a compound word, and adds the compound word to the input method lexicon. This solves the technical problem in the related art that the automatic word learning function provided by the input method application is not complete enough, which leaves the vocabulary of the input method lexicon insufficiently comprehensive and reduces the input efficiency of the user. Automatic word learning is achieved for word sequences entered by inputting words and then deleting Chinese characters, so that the automatic word learning function is more complete, the vocabulary of the input method lexicon is richer, and input efficiency is improved.
In an alternative embodiment provided based on the embodiment of fig. 3, the target field includes the first field, and the compound word is a first compound word. The compound word forming module 302 is configured to concatenate the first field and the reserved field to form the first compound word. And the last Chinese character of the first field in the first compound word is connected with the first Chinese character of the reserved field.
In another alternative embodiment provided based on the embodiment of fig. 3, the target field includes the second field, and the compound word is a second compound word. The compound word forming module 302 is configured to concatenate the reserved field and the second field to form the second compound word. And the last Chinese character of the reserved field in the second compound word is connected with the first Chinese character of the second field.
In another alternative embodiment provided based on the embodiment of fig. 3, the target field includes the first field and the second field, and the compound word is a third compound word. The compound word forming module 302 is configured to sequentially concatenate the first field, the reserved field, and the second field to form the third compound word. And the last Chinese character of the first field in the third compound word is connected with the first Chinese character of the reserved field, and the last Chinese character of the reserved field is connected with the first Chinese character of the second field.
In another alternative embodiment based on the embodiment of FIG. 3, and referring to FIG. 4, the apparatus further includes: a first acquisition module 304.
A first acquisition module 304 configured to: input a first word, wherein the first word is the first field; or input a first word and delete at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain the first field; or input a first Chinese character, wherein the first Chinese character is the first field.
In another alternative embodiment based on the embodiment of FIG. 3, and referring to FIG. 4, the apparatus further includes: a second acquisition module 305.
A second acquisition module 305 configured to: input a second word, wherein the second word is the second field; or input a second word and delete at least one Chinese character in the second word according to a deletion operation signal corresponding to the second word to obtain the second field; or input a second Chinese character, wherein the second Chinese character is the second field.
In another alternative embodiment based on the embodiment of FIG. 3, and referring to FIG. 4, the apparatus further includes: a number updating module 306.
A number updating module 306 configured to update the input times corresponding to the compound word.
The lexicon updating module 303 is further configured to perform the step of adding the compound word to the input method lexicon when the input times corresponding to the compound word reach the preset number of times.
In another alternative embodiment based on the embodiment of FIG. 3, and referring to FIG. 4, the apparatus further includes: a compound word detection module 307.
A compound word detection module 307 configured to detect whether the compound word is included in the input word sequence when the input times corresponding to the compound word are greater than or equal to 1 and less than the preset number of times.
The number updating module 306 is further configured to update the input times corresponding to the compound word when the compound word is included in the word sequence.
It should be noted that the apparatus provided in the foregoing embodiment is illustrated only by the division into the above functional modules. In practical applications, the above functions may be allocated to different functional modules as needed, that is, the internal structure of the device may be divided into different functional modules to perform all or part of the functions described above.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
An exemplary embodiment of the present disclosure further provides a device for updating an input method lexicon, which can implement the method provided by the embodiment of the present disclosure. The device includes: a processor, and a memory for storing executable instructions for the processor. Wherein the processor is configured to:
deleting at least one Chinese character in the target words according to a deleting operation signal corresponding to the input target words to obtain reserved fields; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
splicing the reserved field and the target field to form a synthesized word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is one Chinese character, or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is one Chinese character, or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and adding the synthesized word into an input method word bank.
Optionally, the target field includes the first field, and the compound word is a first compound word; the processor configured to:
splicing the first field and the reserved field to form the first compound word; and the last Chinese character of the first field in the first compound word is connected with the first Chinese character of the reserved field.
Optionally, the target field includes the second field, and the compound word is a second compound word; the processor configured to:
splicing the reserved field and the second field to form the second compound word; and the last Chinese character of the reserved field in the second compound word is connected with the first Chinese character of the second field.
Optionally, the target field includes the first field and the second field, and the compound word is a third compound word; the processor configured to:
splicing the first field, the reserved field and the second field in sequence to form the third synthesized word; and the last Chinese character of the first field in the third compound word is connected with the first Chinese character of the reserved field, and the last Chinese character of the reserved field is connected with the first Chinese character of the second field.
Optionally, the processor is further configured to:
inputting a first word, wherein the first word is the first field;
or,
inputting a first word; deleting at least one Chinese character in the first word according to a deletion operation signal corresponding to the first word to obtain the first field;
or,
and inputting a first Chinese character, wherein the first Chinese character is the first field.
Optionally, the processor is further configured to:
inputting a second word, wherein the second word is the second field;
or,
inputting a second word; deleting at least one Chinese character in the second word according to a deletion operation signal corresponding to the second word to obtain the second field;
or,
and inputting a second Chinese character, wherein the second Chinese character is the second field.
Optionally, the processor is further configured to:
updating the input times corresponding to the synthesized words;
and when the input times corresponding to the synthetic words reach preset times, executing the step of adding the synthetic words into an input method word bank.
Optionally, the processor is further configured to:
detecting whether the input word sequence comprises the synthetic word or not under the condition that the input times corresponding to the synthetic word are more than or equal to 1 and are less than the preset times;
and updating the input times corresponding to the synthesized words under the condition that the synthesized words are included in the word sequence.
FIG. 5 is a block diagram illustrating an apparatus 500 according to an example embodiment. For example, the apparatus 500 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and so forth.
Referring to fig. 5, the apparatus 500 may include one or more of the following components: processing component 502, memory 504, power component 506, multimedia component 508, audio component 510, input/output (I/O) interface 512, sensor component 514, and communication component 516.
The processing component 502 generally controls the overall operation of the apparatus 500, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing component 502 may include one or more processors 520 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 502 can include one or more modules that facilitate interaction between the processing component 502 and other components. For example, the processing component 502 can include a multimedia module to facilitate interaction between the multimedia component 508 and the processing component 502.
The memory 504 is configured to store various types of data to support operations at the apparatus 500. Examples of such data include instructions for any application or method operating on device 500, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 504 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
The power supply component 506 provides power to the various components of the device 500. The power components 506 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the apparatus 500.
The multimedia component 508 includes a screen that provides an output interface between the device 500 and the user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 508 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the device 500 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 510 is configured to output and/or input audio signals. For example, audio component 510 includes a Microphone (MIC) configured to receive external audio signals when apparatus 500 is in an operating mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 504 or transmitted via the communication component 516. In some embodiments, audio component 510 further includes a speaker for outputting audio signals.
The I/O interface 512 provides an interface between the processing component 502 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
The sensor component 514 includes one or more sensors for providing status assessments of various aspects of the apparatus 500. For example, the sensor component 514 may detect an open/closed state of the apparatus 500 and the relative positioning of components, such as the display and keypad of the apparatus 500. The sensor component 514 may also detect a change in the position of the apparatus 500 or of a component of the apparatus 500, the presence or absence of user contact with the apparatus 500, the orientation or acceleration/deceleration of the apparatus 500, and a change in the temperature of the apparatus 500. The sensor component 514 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor component 514 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 514 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 516 is configured to facilitate communication between the apparatus 500 and other devices in a wired or wireless manner. The apparatus 500 may access a wireless network based on a communication standard, such as Wi-Fi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 516 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 516 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 500 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.
In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions is also provided, such as the memory 504 comprising instructions, where the instructions are executable by the processor 520 of the apparatus 500 to perform the above-described method. For example, the non-transitory computer-readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
Also provided is a non-transitory computer-readable storage medium having instructions stored therein which, when executed by a processor of the apparatus 500, enable the apparatus 500 to perform the above-described method.
It should be understood that "a plurality" herein means two or more. "And/or" describes an association between associated objects and indicates that three relationships are possible; for example, "A and/or B" may mean: A exists alone, A and B exist simultaneously, or B exists alone. The character "/" generally indicates that the associated objects before and after it are in an "or" relationship.
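Applied to the target field, which comprises a first field and/or a second field, this reading of "and/or" yields three splicing cases: first field only, second field only, or both. The sketch below enumerates them. The function name `splice`, the use of `None` to mark an absent field, and the sample characters are assumptions made purely for illustration.

```python
from typing import Optional


def splice(reserved: str, first: Optional[str] = None, second: Optional[str] = None) -> str:
    """Join first field + reserved field + second field, skipping absent fields."""
    if not reserved:
        raise ValueError("reserved field needs at least one character")
    if not first and not second:
        raise ValueError("target field needs a first field, a second field, or both")
    # Fields are concatenated directly, so the last character of one field is
    # immediately followed by the first character of the next field.
    return (first or "") + reserved + (second or "")


first_only = splice("京", first="南")          # first field + reserved field
second_only = splice("京", second="剧")        # reserved field + second field
both = splice("京", first="南", second="市")   # first + reserved + second
```

Passing neither field is rejected, since "and/or" requires at least one of the two to be present.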
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.
Claims (13)
1. A method for updating an input method lexicon, the method comprising:
deleting at least one Chinese character from an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
splicing the reserved field and a target field to form a synthesized word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and adding the synthesized word to the input method lexicon.
2. The method of claim 1, wherein the target field comprises the first field, and the synthesized word is a first synthesized word;
the splicing the reserved field and the target field to form a synthesized word comprises:
splicing the first field and the reserved field to form the first synthesized word;
wherein, in the first synthesized word, the last Chinese character of the first field adjoins the first Chinese character of the reserved field.
3. The method of claim 1, wherein the target field comprises the second field, and the synthesized word is a second synthesized word;
the splicing the reserved field and the target field to form a synthesized word comprises:
splicing the reserved field and the second field to form the second synthesized word;
wherein, in the second synthesized word, the last Chinese character of the reserved field adjoins the first Chinese character of the second field.
4. The method of claim 1, wherein the target field comprises the first field and the second field, and the synthesized word is a third synthesized word;
the splicing the reserved field and the target field to form a synthesized word comprises:
splicing the first field, the reserved field and the second field in sequence to form the third synthesized word;
wherein, in the third synthesized word, the last Chinese character of the first field adjoins the first Chinese character of the reserved field, and the last Chinese character of the reserved field adjoins the first Chinese character of the second field.
5. The method according to any one of claims 1 to 4, further comprising:
updating the input times corresponding to the synthesized word;
and when the input times corresponding to the synthesized word reach a preset number of times, executing the step of adding the synthesized word to the input method lexicon.
6. The method of claim 5, further comprising:
detecting whether an input word sequence comprises the synthesized word when the input times corresponding to the synthesized word are greater than or equal to 1 and less than the preset number of times;
and if the word sequence comprises the synthesized word, updating the input times corresponding to the synthesized word.
7. An apparatus for updating an input method lexicon, the apparatus comprising:
a Chinese character deleting module configured to delete at least one Chinese character from an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
a synthesized word forming module configured to splice the reserved field and a target field to form a synthesized word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and a lexicon updating module configured to add the synthesized word to the input method lexicon.
8. The apparatus of claim 7, wherein the target field comprises the first field, and the synthesized word is a first synthesized word;
the synthesized word forming module is configured to splice the first field and the reserved field to form the first synthesized word;
wherein, in the first synthesized word, the last Chinese character of the first field adjoins the first Chinese character of the reserved field.
9. The apparatus of claim 7, wherein the target field comprises the second field, and the synthesized word is a second synthesized word;
the synthesized word forming module is configured to splice the reserved field and the second field to form the second synthesized word;
wherein, in the second synthesized word, the last Chinese character of the reserved field adjoins the first Chinese character of the second field.
10. The apparatus of claim 7, wherein the target field comprises the first field and the second field, and the synthesized word is a third synthesized word;
the synthesized word forming module is configured to splice the first field, the reserved field and the second field in sequence to form the third synthesized word;
wherein, in the third synthesized word, the last Chinese character of the first field adjoins the first Chinese character of the reserved field, and the last Chinese character of the reserved field adjoins the first Chinese character of the second field.
11. The apparatus of any one of claims 7 to 10, further comprising:
a times updating module configured to update the input times corresponding to the synthesized word;
wherein the lexicon updating module is further configured to add the synthesized word to the input method lexicon when the input times corresponding to the synthesized word reach a preset number of times.
12. The apparatus of claim 11, further comprising:
a synthesized word detection module configured to detect whether an input word sequence comprises the synthesized word when the input times corresponding to the synthesized word are greater than or equal to 1 and less than the preset number of times;
wherein the times updating module is further configured to update the input times corresponding to the synthesized word when the word sequence comprises the synthesized word.
13. An apparatus for updating an input method lexicon, the apparatus comprising:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to:
delete at least one Chinese character from an input target word according to a deletion operation signal corresponding to the target word, to obtain a reserved field; wherein the target word comprises at least two Chinese characters, and the reserved field comprises at least one Chinese character;
splice the reserved field and a target field to form a synthesized word; wherein the target field comprises: a first field located before and adjacent to the reserved field, and/or a second field located after and adjacent to the reserved field; the first field comprises at least one Chinese character, and the first field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word; the second field comprises at least one Chinese character, and the second field is a single Chinese character or the Chinese characters remaining after at least one Chinese character is deleted from a word;
and add the synthesized word to the input method lexicon.
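As an informal illustration only (it forms no part of the claims), the deletion step recited in claims 1, 7 and 13 can be sketched as below. The function name `reserved_field`, its `delete_start` and `delete_count` parameters, and the assumption that the deleted characters form one contiguous run are hypothetical; the claims do not specify how the deletion operation signal encodes the deleted range.

```python
def reserved_field(target_word: str, delete_start: int, delete_count: int) -> str:
    """Return the characters of target_word left after deleting a contiguous run."""
    if len(target_word) < 2:
        raise ValueError("target word needs at least two characters")
    if delete_count < 1:
        raise ValueError("at least one character must be deleted")
    if delete_start < 0 or delete_start + delete_count > len(target_word):
        raise ValueError("deletion range falls outside the target word")
    # Python strings index by code point, so each Chinese character counts as one.
    kept = target_word[:delete_start] + target_word[delete_start + delete_count:]
    if not kept:
        raise ValueError("reserved field needs at least one character")
    return kept


kept_tail = reserved_field("北京", 0, 1)    # delete the first character
kept_head = reserved_field("天安门", 2, 1)  # delete the last character
```

The checks mirror the stated constraints: a target word of at least two characters, at least one deleted character, and a reserved field of at least one character.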
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710161216.XA CN106933382A (en) | 2017-03-17 | 2017-03-17 | Update the method and device of input method dictionary |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106933382A (en) | 2017-07-07 |
Family
Family ID: 59432496
Family Applications (1)
Application Number | Status | Publication Number | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201710161216.XA | Pending | CN106933382A (en) | 2017-03-17 | 2017-03-17 | Update the method and device of input method dictionary |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106933382A (en) |
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1490701A (en) * | 2002-10-15 | 2004-04-21 | 英业达股份有限公司 | Input method system capable of dynamically adjusting word stock and method thereof |
CN101154134A (en) * | 2006-09-26 | 2008-04-02 | 腾讯科技(深圳)有限公司 | Method and apparatus for adjusting order of candidate words in Chinese input method |
CN101067766A (en) * | 2006-12-22 | 2007-11-07 | 腾讯科技(深圳)有限公司 | Method for cancelling character string in inputting method and word inputting system |
CN101615204A (en) * | 2009-07-24 | 2009-12-30 | 中兴通讯股份有限公司 | Add method, device and the Chinese character input device of custom words in the input method dictionary |
CN102236422A (en) * | 2010-04-27 | 2011-11-09 | 北京搜狗科技发展有限公司 | Input method through stroke code and input method system |
CN103365905A (en) * | 2012-04-06 | 2013-10-23 | 腾讯科技(深圳)有限公司 | Method and device for correcting word bank data of input method |
CN103389800A (en) * | 2012-05-11 | 2013-11-13 | 北京百度网讯科技有限公司 | Entry generating method and device |
CN103455163A (en) * | 2012-06-04 | 2013-12-18 | 联想(北京)有限公司 | Method and electronic equipment for inputting characters |
CN102750007A (en) * | 2012-06-12 | 2012-10-24 | 上海量明科技发展有限公司 | Method and client for establishing personality character repertoire |
CN104731766A (en) * | 2013-12-20 | 2015-06-24 | 淘宝(中国)软件有限公司 | Alphabetic writing lexicon establishing method, alphabetic writing lexicon establishing device, inputting method and inputting system |
CN104461056A (en) * | 2014-12-22 | 2015-03-25 | 联想(北京)有限公司 | Information processing method and electronic equipment |
CN106445177A (en) * | 2015-08-06 | 2017-02-22 | 阿尔派株式会社 | Character input device and character input method |
CN105373236A (en) * | 2015-10-09 | 2016-03-02 | 北京搜狗科技发展有限公司 | Entry learning method and apparatus |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112558784A (en) * | 2020-11-20 | 2021-03-26 | 维沃移动通信有限公司 | Method and device for inputting characters and electronic equipment |
WO2022105754A1 (en) * | 2020-11-20 | 2022-05-27 | 维沃移动通信有限公司 | Character input method and apparatus, and electronic device |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170707 |