[go: up one dir, main page]

CN1256446A - Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board - Google Patents

Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board Download PDF

Info

Publication number
CN1256446A
CN1256446A CN 00100002 CN00100002A CN1256446A CN 1256446 A CN1256446 A CN 1256446A CN 00100002 CN00100002 CN 00100002 CN 00100002 A CN00100002 A CN 00100002A CN 1256446 A CN1256446 A CN 1256446A
Authority
CN
China
Prior art keywords
stroke
chinese character
key
stem
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 00100002
Other languages
Chinese (zh)
Other versions
CN1123813C (en
Inventor
王永民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WANG YONGMIN
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 00100002 priority Critical patent/CN1123813C/en
Publication of CN1256446A publication Critical patent/CN1256446A/en
Application granted granted Critical
Publication of CN1123813C publication Critical patent/CN1123813C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The said keyboard is a number keyboard with at least five number keys of 1,2,3,4 and 5 to represent the five kinds of strokes including horizontal, vertical, right slant, left slant and bend. The coding and inputting method features that each complex Chinese character is divided into two parts of the first part and the residual part and five to seven number codes of one Chinese character including the first stroke of the first part and the first and the last strokes of the residual part constitute the number codes of the Chinese character. Chinese characters and words are input through number keys. The present invention may be widely used in computer, communication system, telephone hand set and network technology.

Description

" stem " " surplus portion " digital stroke coding input method of Chinese character and keyboard thereof
The invention belongs to shape code Chinese character input method and keyboard thereof, particularly " stem " " surplus portion " Chinese-character stroke digital code inputting method and numeric keypad.
Society has entered digital society now, and the numeric keypad range of application is very extensive.Keypad of push-button telephone, various microcomputer, financial commercial field, international network inputting word information and password or the like for example.The digitizing of Chinese character input has become the active demand of communication, network and financial commercial field.After on QWERTY keyboard, realizing that with 26 letters efficient font code inputs obtain historical breakthrough and obtain widespread use, use numeric keypad to realize the numeral input of Chinese character, become Chinese information processing field utmost point key subjects to be broken through.For example, use phone, particularly use roam-a-phone, and online transmission information, put on the agenda.
In the prior art, solve the numeric keypad input of Chinese character, following several types arranged:
1. full stroke input method
Some kinds of strokes of definition on 5 or 10 numerical keys, according to order of writing strokes, each stroke of button input Chinese character successively.For example (the time) with 5 kinds of strokes, in: 2520, key: 3111551111254 etc., the conspicuous advantage of this method is " need not learn ", as long as just can write and can import.But its outstanding shortcoming is to key in number of times (code length) too much, as runs into " rash " word, just need to import 25 times.Thereby, though feasible on this theoretical method, though the repeated code word is fewer, there is not practical value.
2. five-stroke character input method
Five-stroke character input method is owing to be divided into 5 districts totally 25 keys with alphabetic keypad, each radical all has a region-position code, convenient with the input code digitizing, so, since nineteen eighty-three, inventor's Wang Yongmin is just expressed the input code of a Chinese character (speech) with 2 kinds of modes, a kind of is alphabetic mode, and another kind is a digital form.For example: the character code YUKQ of " saying ", its numerical code is: 41 42 23 35.Though, on numeric keypad, can import Chinese character and vocabulary with this numerical code fully, and have only the repeated code about 2%.But, but be difficult to make in this way at once for the people who does not learn the Five-stroke Method.
3. five key 5-stroke input methods
By five key 5-stroke input methods of Mr. Wang Yongmin invention in 1985, be 5 numerical keys of a kind of use, cast aside anyhow with 12345 representatives and press down five kinds of strokes of folding, very easy individual character word stroke input method.This method and keyboard thereof regulation, any one Chinese character is got its 4 singles in front according to sequential write and is drawn and add last single in addition and draw, 5 strokes at the most, 5 of less thaies are mended 0 key to show end.This method is easy to learn and use, and is apparent.Yet very easily " cost " but is that repeated code is many.Because the more radicals by which characters are arranged in traditional Chinese dictionaries of some numbers of words are as " Jin, worm, Rolling, Si, day " etc., only " radicals by which characters are arranged in traditional Chinese dictionaries " have just taken preceding 3 even preceding 4 sign indicating numbers, and the discrete ability of its coding only leans on last 1 yard (5 kinds of possibilities) to contribute, so the repeated code word just can't reduce.To such an extent as in 6763 Chinese characters of national standard, 30 Chinese character repeated codes just have 34 groups, wherein, 50 Chinese character repeated codes have 10 groups, maximum one groups is 88 word repeated codes.Like this, just make the application efficiency of this method not improve, it applies also restricted greatly.
4. pinyin digital input mode
This mode is with reference to the digital input mode of English alphabet, represents the letter of the Chinese phonetic alphabet by 1 to 3 key.The advantage of this method is the sound of can directly fighting on numerical key, convenient easily; Shortcoming is that rhotacism or unacquainted word can't be imported, and a large amount of phonetically similar words need be selected, and stroke is many, and efficient is not high.
5. other is with the stroke input method of 10 numerical keys
In view of with the input method coding radix (bond number) of 5 kinds of strokes very little, repeated code easily is so the someone designs the digital inputting method with 9 kinds or 10 kinds strokes.These class methods are subdivided into the 9-10 kind with the stroke of Chinese character, put the similar stroke of a stack features on each numerical key, formulate code taking rule then.Such as a kind of code taking rule is arranged is exactly according to order of writing strokes, from the first sum of, 3 strokes has been got earlier in a word, last, from the end pen, gets 3 strokes again, adds together totally 6 strokes.Its advantage is that the discreteness of coding is more far better than 5 kinds of strokes because bond number doubles, and repeated code is wanted much less naturally, and efficient also can improve much.But it is 10 groups tens kinds that these class methods will be segmented stroke, and people are difficult to remember and recognize.Moreover the stroke code fetch will stipulate that also " counting down " gets 3 strokes, obviously run counter to people's cognitive law from the end pen.Generally be, this equal to hold order of strokes observed in calligraphy knowledge that millions upon millions of people have now grasped need not, allow people rebulid the order of " falling number ", this is that people are difficult to accept.
6. other is considered the stroke code input method of feature of Chinese characters structure
For example, allow behind the stroke sum of the clear word of importer's number, carry out " code taking rule " again, stipulate 5 strokes with interior word code fetch how, another kind of regular code fetch pressed again in addition in the word of 6 above strokes.
For another example, the Hanzi font that the certain methods regulation is different is with different code taking rules; Other methods then regulation Chinese character are divided into the binary word, the ternary word, and the quaternary word, different unit distributes different codings, or the like.
These loaded down with trivial details rules probably all are difficult to possible in application practice.Because theoretic feasibility with the practicality in the reality, sometimes is a two entirely different things fully.
It is considered herein that this class methods code fetch difficulty, uniqueness is poor, and inconvenience is promoted.
In sum, a kind of use numeric keypad, meet the Chinese character calligraph custom, meet the spoken and written languages standard, repeated code is few, efficient is high, not influenced by dialect pronunciation, the digital keyboard Chinese character input method that is easy to learn and use again is still the technical barrier that Chinese character input field demands urgently breaking through.
The objective of the invention is to propose a kind of brand-new " header, residue digital stroke coding input method of Chinese character and keyboard thereof " technical scheme, do not meet standard to overcome exist when prior art is carried out the Chinese character input on numeric keypad above-mentioned, find it difficult to learn difficult with, efficient is low, repeated code is many, the not high obstacle of efficient.
A kind of " stem " " surplus portion " the digital stroke coding input method of Chinese character and the keyboard thereof of the present invention's initiative, described keyboard is a numeric keypad, have at least 1,2,3,4,5 five numerical key in order to represent the horizontal, vertical of Chinese character, cast aside (point), press down, roll over five kinds of strokes; It is characterized in that to account for Chinese character sum each combinde rqdical character more than 90%, be divided into " stem " and " surplus portion " two parts from structure, constitute the numerical coding of " stem " by the code of 2 or 3 strokes of the first stroke that comprises " stem ", by the first stroke that comprises " surplus portion " and the last 3, the digital code of 4 or 5 strokes constitutes the numerical coding of " surplus portion ", " stem " coding adds " surplus portion " coding, constitute the digital stroke coding of fit Chinese character, use numeric keypad to computer or communication apparatus input Chinese character and or Chinese-character words.
The present invention, has disclosed whole fit Chinese characters first and can be divided into two on structure without exception in recent years in the achievement in research aspect the coding theory according to the author, promptly can be divided into " stem " and " surplus portion " two these structure laws of part.In fact, the overwhelming majority in the Chinese character all is a combinde rqdical character, no matter be phonogram or associative compounds, is section start with the first sum of, approximately can be divided into two parts on structure.
From the angle of coding theory, any one or one group of stroke structure, the stroke that it is peripheral, the stroke that also promptly exposes all has bigger entropy than the stroke in the structure, thereby is more convenient for being recognized, and is identified, and has best discrete ability.When putting into practice when this theoretical result being used for encode, the present invention has just initiated with the first stroke of getting " stem " and has added last or second, and the first stroke and the most last pen of getting " surplus portion " are the code fetch mode of core technology to form a brand-new coding scheme.
" stem " and traditional " radicals by which characters are arranged in traditional Chinese dictionaries " that the present invention proposes, for example more than 200 used radicals by which characters are arranged in traditional Chinese dictionaries of " Xinhua dictionary " are not notions." stem " though majority all is " radicals by which characters are arranged in traditional Chinese dictionaries " that the first stroke is write as, " stem " not all is " radicals by which characters are arranged in traditional Chinese dictionaries " also, and " radicals by which characters are arranged in traditional Chinese dictionaries " are not congruent to " stem " yet.
When " stem " is meant by correct order of strokes observed in calligraphy writing Chinese characters, intersects with the first stroke or closely link together, become an independent body, or become a stroke structure that contains the more traditional radicals by which characters are arranged in traditional Chinese dictionaries of number of words with the first stroke place structure.It is characterized in that " comprising the first sum of ", so claim " stem " in interior stroke structure part.Because " radicals by which characters are arranged in traditional Chinese dictionaries " of most Chinese characters all include the first sum of of Chinese character, so most " radicals by which characters are arranged in traditional Chinese dictionaries " also all are simultaneously that " stem " is just not strange.
For example: wood, mouth, Ren, standing grain, Rolling, Si, wide, Yan, rice, worm, mountain, boat etc., when first orientation that they appear at Chinese character image when (promptly comprising that the first sum of structure), they are " radicals by which characters are arranged in traditional Chinese dictionaries ", are again " stems ";
For another example: just, bundle, lose, heavy, I, song etc., though they are not radicals by which characters are arranged in traditional Chinese dictionaries, when the first stroke of a Chinese character " fell " thereon, they were " stems ";
In addition, though, " stem " is meant that stroke intersects or the stroke structure at the first sum of place that closely links to each other with the first stroke place structure, to contain Chinese character quantity many or as radicals by which characters are arranged in traditional Chinese dictionaries but when some, himself stroke more for a long time, for example: horse, gas, stone, , walk, foot, food, Cannibals Door, door, leather, bone, ghost, fish Fish, tooth etc., when appearing at first locations of structures (being the first sum of place structure) of Chinese character, although their back are the discrete combination of several stroke structures, when dividing the header, residue of Chinese character, they can not be divided into two parts again, their integral body is regarded as " stem ", and can not " be divided into two " again, they are divided into " stem " and " surplus portion ".
In the middle of " stem ", sometimes have only a stroke, for example " picture ", its " stem " just has only a stroke, compiles 1 sign indicating number, needn't supply the code length of " stem " with other sign indicating number.
Isolated point in the Hanzi structure, be commonly considered as with its near stroke adhesion together, thereby can not be separately as stem.For example the first stroke of " suffering " point is stem with horizontal sticking together.
Fit Chinese character generally is divided into left right model, goes up mo(u)ld bottom half and encirclement type.In the Chinese character of encirclement type, " stem " remains the stroke structure at the first sum of place, for example: " stem " of " state " is " mouth ", " stem " of " salty " is " penta ", " stem " of " together " is " Jiong ", and " stem " of " sentence " is " Bao " etc., and the stroke on " stem " sometimes is not strict combination according to stroke order, for example " state ", its finishing touch but is the 3rd stroke of stem " mouth ".
Following routine word can illustrate " stem " and be not equal to " radicals by which characters are arranged in traditional Chinese dictionaries ":
" stem " " advanced " is " well ", and " radicals by which characters are arranged in traditional Chinese dictionaries " of " advancing " are " Chuo ";
" stem " " thought " is " wood ", and " radicals by which characters are arranged in traditional Chinese dictionaries " of " thinking " are " hearts ".
Outstanding substantive technical characterictic of the present invention is, in code taking rule of the present invention, the first sum of and finishing touch of " stem " or second are must code fetch.
If " stem " code fetch length is 3, then add and get preceding 2 according to the order of strokes observed in calligraphy, together with the end stroke of " stem ", totally 3 sign indicating numbers are drawn if " stem " is single, then only get 1 and get final product.
" stem " of Chinese character is different with word collection size." stem " though have hundreds of more than, needn't memorize mechanically one by one, this be can writing of Chinese characters the people, can find out on the structure at a glance.
Fit Chinese character " is divided into two " afterwards, not intersect to be close to " stem ", can leave surplus " the surplus portion " that leave distance and divide, no matter be that word is not a word, no matter be traditional radicals by which characters are arranged in traditional Chinese dictionaries, no matter be what shape and structure, no matter remaining is several, several parts, several radicals, no matter surplus " surplus portion " divides is zoarium or independent body, be called " surplus portion " without exception.
The substantive distinguishing features that the present invention gives prominence to is that for " the surplus portion " of fit Chinese character, its code length of getting is 4 no matter be 3, is 5, and during code fetch, the first sum of and the last of " surplus portion " must be included.When first, end pen are not enough to reach the maximum code length that " surplus portion " should get, mend successively the first sum of after according to sequential write and to get the 2nd, the 3rd, after adding end, reach till the desired maximum code length.Sometimes, be that the stroke of " surplus portion " is got the still not enough maximum code length that is over, should add the end mark sign indicating number, for example " 0 ".
The strictness of " stem " is divided, and can have for a short time flexibly according to the design of coding, and for example, " cave " both can be used as an integral body and be considered as " stem ", can think that also " stem " is " Http ", and " eight " was " surplus portion ".
As embodiments of the invention, in the code fetch, the code length of " stem " can be 2, be 3 that the code length of " surplus portion " can be 3,4,5, combines like this, when " stem " gets 2 sign indicating numbers, 2+3=5 and 2+4=6 can be arranged, three kinds of maximum code length of 2+5=7.In general, maximum code length is definite relevant with the word collection.When only handling 3755 GB first-level Chinese characters, can use the 2+3=5 mode, promptly " stem " gets 2 yards, and " surplus portion " gets 3 yards, and maximum code length is 5; When handling 6763 Chinese characters of GB two-stage Chinese character, can use 2+4=6, promptly " stem " gets 2 yards, and " surplus portion " gets 3 yards, and maximum code length is 6; Or 3+3=6, promptly " stem " gets 3 yards, and " surplus portion " gets 3 yards, and maximum code length is 6.When handling GBK21003 Chinese character, can be 2+5=7 code fetch method with " surplus in the of first 25 ", also can use " surplus the head 34 " 3+4=7 code fetch method, maximum code length all is 7.
In aforesaid code fetch process, the stroke number of " stem " or " surplus portion " no matter when maximum code length that not enough institute should get, is only got existing stroke without exception, promptly has and how much what is got, and does not do any special processing, also without other key " polishing ".As long as the first sum of first and last pen that adds end pen or second, " surplus portion " that guarantees " stem " is necessarily got, this be coding scheme of the present invention information extraction amount from Chinese character image maximum and very easily debate know study stroke as coded message, have creationary substantive distinguishing features.
When with code fetch of the present invention, if after " stem " of a combinde rqdical character and " surplus portion " code fetch, when its total code length does not reach maximum code length, add stroke code code in addition in the back of coding, expression finishes.This code can be 6,7,8,9,0, also can be any other kind of the code that can key on the numeric keypad, as: *, # etc.
On the numeric keypad of the present invention, can be set to space bar and page turning key by " 0 " key, when the described code length of Chinese-character stroke deficiency, hit " 0 " bond bundle.When the input code of Chinese character reaches maximum code length or not enough maximum code length and had hit end key, hit " 0 " key and can make the page turning of the repeated code left and right sides, in the page turning process, numerical key can be used for selecting the repeated code word.
The header, residue code fetch method of using method of the present invention to form, the division of stroke kind can be to cast aside anyhow to press down 5 kinds of foldings, also can be in 6 kinds or 10 kinds, can be with a kind of stroke of 1 number keyboard representation.
The numeric keypad that the present invention uses can be the keyboard that has 1,2,3,4,5,6,7,8,9,0 10 numeric keys at least, wherein numerical key 6,7,8,9,0 is set to function key, function key is set to " universal key " respectively, forward and backward page-turning function key, the word function key, the association function key, wherein the function of " universal key " is to substitute the stroke input that is difficult to determine; The words and phrases function key is used to import the Chinese words and phrases; The association function key is used for the association of Chinese character statement; When repeated code occurring, each numerical key all can have the function of options button, selects the word of confirming from the repeat code Chinese character that screen shows.The numeric keypad that the present invention uses also can be 12 key bit keyboards, and function setting is: 6 keys are " universal key ", and 7 keys are the word key, page turning key before and after 8,9 liang of keys are respectively, and 0 key is a space bar, two keys are association's key and shift key in addition.
The function of each key and retrieval are realized by software.The important difference of the present invention and prior art is that with outstanding substantive distinguishing features fit Chinese character is divided into " stem " and " surplus portion " respectively gets Head-Til stroke Chinese code, generally speaking, only use popular 5 kinds of strokes having known to encode Chinese characters for computer, only use numeral 12345 during coding, only use 5 numerical keys on the numeric keypad, generally define stroke and stroke key no longer in addition, thereby greatly reduce the difficulty of study, and be convenient to the large scale community application.
Below to concentrate " wood " portion Chinese character with international standard characters be the discrete repeated code of example explanation the present invention, improve the coding uniqueness, improve the outstanding substantive distinguishing features of input efficiency.
Get under the situation of 5 sign indicating numbers limiting each Chinese character, when adopting preceding 4 ends 1 five digital stroke codings of prior art:
Structure: 12344, tree: 12344, piece: 12344, stalk: 12344, coffin with a corpse in it: 12344
Chinese juniper: 12344, the school: 12344, root: 12344, plate: 12344, cane: 12344
Its numerical coding is identical, and these words all are same input codes;
According to the present invention program, promptly " stem " gets " first and last " 2 strokes, and when " surplus portion " got " end first " 3 strokes, the numerical coding of above-mentioned Chinese character was (under 5 kinds of stroke situations):
Structure: 14354, tree: 14544, piece: 14314, stalk: 14124, coffin with a corpse in it: 14154
Chinese juniper: 14344, the school: 14414, root: 14524, stalk: 14125, cane: 14134
Obviously, more than use when of the present invention, its numerical coding is all inequality, and originally above each word of repeated code is heavy, in other cases, also makes repeated code on average reduce to 1/5th.
Above embodiment can show the present invention and compared with prior art have outstanding substantive distinguishing features and obvious improvement.
The present invention realizes that to numeric keypad the theoretical property contribution of encode Chinese characters for computer is, for fit Chinese character, adopts earlier Chinese character to be split as " stem stroke structure " and " surplus stroke structure " two parts, gets its head and the tail stroke respectively by rule of the present invention again and constitutes coding; So just can realize reasonable distribution, greatly reduce the repetition rate of coding the encode Chinese characters for computer space.Chinese character with radicals by which characters are arranged in traditional Chinese dictionaries " wood " portion of front embodiment is an example, and " wood " portion is totally 281 words, because " wood " itself has accounted for 4 yards, in the prior art these radicals by which characters are arranged in traditional Chinese dictionaries just used 4 numerical codes.When each Chinese character code length of qualification is 5, only surplus code fetch position.In the prior art, according to the distribution of five kinds of strokes, the average repetition rate of coding is: 182/5=56.
Yet according to the present invention program, preceding two yards are all (horizontal stroke) 1, (right-falling stroke) 4 mutually, in Chinese character preceding two usefulness, five stroke code fetches be 14 except that " wood ", also have 40, be 281+40=321 altogether; This programme is with discrete all the other stroke structures of back three sign indicating numbers, and all there is possibility of five kinds of strokes each digital position, and it is possible to have 5 125 kinds of 3 powers; Its average repetition rate of coding is 321/125=2.568.The repetition rate of coding of the present invention only is 1/22 of a prior art.
As a kind of embodiment of the present invention, the present invention can adopt 5 kinds of stroke 5 coding modes input Chinese characters, promptly each fit Chinese character is got 5 yards at most, wherein earlier get first stroke of " stem " and end stroke as preceding 2 yards according to sequential write, get first, second and the end stroke of " surplus portion " again according to sequential write, when the not enough code fetch of the stroke of " stem " or " surplus portion ", an enchashment has stroke, during total code length less than 5, add benefit once " 0 " key to show end.
For example: in prior art " 5-stroke input method ", the other end of moon word pen is a word of pressing down (point), be that its coding is 35114 word, have 25: clothes, glue, arteries and veins, film, leg, expand, the side of body, abdomen, skin, knee, greasy, arm, gland, elbow, dried meat, the cheek, subjectively, internal organs, armpit, purulence, sumptuous, fat, pancreas.
Yet the repeated code situation is as follows in the present invention:
10 words such as thigh, arteries and veins, leg, the side of body, gland, the cheek, purulence, sumptuous, pancreas, clothes are repeated code no longer all, and the word of repeated code is arranged, and discretely is following 4 groups:
31114: skin is greasy
31314: abdomen expands
31414: glue is the internal organs armpit subjectively
31124: film limb knee dried meat elbow fat
As seen, equally all be with 5 kinds of strokes, be with 5 keys equally, almost be easy to learn and use equally, maximum code length is 5 equally, can situation of the present invention but than prior art, realized the breakthrough of matter, the repeated code number generally can drop to 1/5th of prior art.
Under the situation of 5 yards inputs, the total volume of coding is 55 powers, promptly amounts to 3125 codings.When handling 6763 words, not only there is not the redundance of space encoder, number of words is 2 times of codifiability, repeated code is many naturally.
For this reason, as the embodiment of the invention, except that 5 coding modes, the present invention can also adopt 6 coding modes.Under 6 coding modes, space encoder is 56 powers, i.e. 15625 codifiabilities, and to 6763 encodes Chinese characters for computer, its repeated code promptly can descend significantly again.
For example: in 5 yards the routine word, last repeated code word one has 4 groups in the front, 15 words.As with 6 yards codings, original repeated code word:
Skin: 311134, greasy: 311114, not heavy;
Expand: 313154, abdomen: 313124, not heavy.
Though increased a sign indicating number, after Chinese character " is divided into two ", second branch is still intuitively easily debated, get " 123 end " very easy, meet order of strokes observed in calligraphy custom fully, so, do not increase learning difficulty.
As an alternative embodiment of the invention, the present invention can adopt 6 coding modes input Chinese character, promptly each Chinese character is got 6 yards at most, wherein first stroke of earlier getting " stem " according to sequential write adds end stroke or second, as preceding 2 yards, and back 4 yards first, second, third and end strokes of getting " surplus portion " according to the stroke writing order, when the not enough code fetch of the stroke of " stem " or " surplus portion ", enchashment has stroke, during total code length less than 6, adds benefit " 0 " key once to show end.
The present invention can also import with another kind of 6 coding modes, promptly each Chinese character is got 6 yards at most, wherein first stroke, second stroke and the end stroke of earlier getting " stem " according to sequential write is as preceding 3 yards, get first, second and the end stroke of " surplus portion " again according to sequential write, when the not enough code fetch of the stroke of " stem " or " surplus portion ", enchashment has stroke, during total code length less than 6, add benefit once " 0 " key to show end.
As embodiments of the invention, the present invention can also be according to the needs of character set size, such as when encoding for the GBK Chinese Character Set, adopt the input of 7 coding modes, promptly each Chinese character is got 7 yards at most, preceding 2 yards is to add end stroke or second according to first stroke that sequential write is got " stem ", 5 yards first, second, third, fourth stroke and the end strokes of getting " surplus portion " according to the stroke writing order in back, during the stroke of " stem " or " surplus portion " is not enough code fetch, enchashment has stroke, during total code length less than 7, mend " 0 " key once to show end.
The present invention can also adopt another 7 coding modes input Chinese character, promptly each Chinese character is got 7 yards at most, preceding 3 yards first, second and end strokes of getting " stem " according to sequential write, 4 yards first, second, third stroke and the end strokes of getting " surplus portion " according to sequential write in back, when the stroke of " stem " " surplus portion ", not enough code fetch, only get existing stroke, during total code length less than 7, add benefit " 0 " key once.
Under 7 coding modes, the total volume of space encoder is 78125, is GBK21003 encode Chinese characters for computer in this space, and coincident code problem can greatly alleviate.
Because above embodiment, the present invention can constitute the different coding scheme of several maximum code length and use same Chinese characters for keyboard inputting.
For the monomer word that accounts for Chinese character sum about 10%, or " radicals by which characters are arranged in traditional Chinese dictionaries " Chinese character, for example: the third, the name for ancient tribes in the east, thing, string, leather, stone, fish, door etc., under various code length situations of the present invention, method of its coding input all is the sequential write by standard, gets the stroke number that must comprise first stroke and end stroke and reach the regulation code length, and the stroke number of getting according to the order of strokes observed in calligraphy after the first stroke equals the maximum code long number and subtracts 2, during the deficiency maximum code length, add " 0 " key as end." regulation code length " can be maximum code length, also can be to lack 1, few 2 code length, the maximum code length of looking the other regulation monomer of the big I of word collection Chinese character than maximum code length.
For example:
I: under 5 yards situations, be encoded to 31214;
Under 6 yards situations, be encoded to 312154 or 31214;
Under 7 yards situations, be encoded to 3121534 or 312154.
In: under 3 kinds of code length situations, coding all is 25120.
" stem " of the present invention, " surplus portion " stroke input method and keyboard thereof, its feature also is after the vocabulary tag mark code, only import " stem " of each individual character in the Chinese character word or the part coding of all-key and can import word, the number of Chinese character can be 2 in the word, 3, more than 4 to tens.
With 5 kinds of strokes is example, the method that the present invention imports vocabulary is, earlier with 12345 in addition numeral or symbolic key as vocabulary " guiding key ", after the guiding key, all-key 2-4 in front sign indicating number got in the every word of 2 words, and all-key 2-3 in front sign indicating number of every word, the speech that 4 words are above got in 3 words, get all-key 2 sign indicating numbers foremost of each word, form the vocabulary code input with this; For the multi-character words more than 5 words, preceding 2 yards preceding 2 yards inputs that add the last character of 3-4 word before can also only getting.During the vocabulary input, the repeated code speech shows to be selected according to the frequency arrangement.
When with header, residue compiling method of the present invention during to fit Chinese character code fetch, in order to look after the user who has grasped five-stroke character input method and to obtain well discrete repeated code ability, the stem coding of fit Chinese character, can directly continue to use the region-position code form of the Five-stroke Method coding of this Chinese character, i.e. radical digital code on 25 the Five-stroke Method keyboards in 5 districts.For example, the king 11, wood 14, mountain 25, woman 53 etc.At this moment, do not limit employed the Five-stroke Method version.
" header, residue stroke input method and keyboard thereof " of the present invention, it is characterized in that numeric keypad has 10 numerical keys such as 1,2,3,4,5,6,7,8,9,0 at least, wherein numerical key 6,7,8,9,0 is set to function key, be set to " universal key " respectively, forward and backward page-turning function key, word sign key, wherein the function of " universal key " is to substitute the stroke that is difficult to determine; When repeated code occurring, 0 key can be used as page turning key, and the numerical key that is right after after 0 key is and selects the repeated code key, selects desired word from the repeat code Chinese character that screen shows.
When the word collection enlarges, when using 5 yards or 6 coding modes to import, for example in " mouth " or " Rolling " conduct " stem " afterwards, still have many repeated codes as encode Chinese characters for computer.At this moment, the present invention can also respectively or be grouped on the keys such as 0,6,7,8,9, settles a few the highest Chinese character group word parts of occurrence frequency.Mouth, day, Rolling, soil, Rui etc.In the process by the code taking rule of " stem " " surplus portion " and code length requirement code fetch, when running into these several parts, they are only got 1 sign indicating number, draw and no longer split into single, these high-frequency units can be monopolized 1 key, when they are " stem ", or when in " surplus portion ", taking turns to code fetch, all only get a sign indicating number.With this coding scheme that still forms by " stem " " surplus portion " code fetch, both can be used as a kind of independent coding input mode of using, form new embodiment of the present invention, also can have both general with the various coding schemes of " stem " " surplus portion ".
For example: in 5 yards " 2+3 " of the present invention (stem is got 2 yards, and surplus portion gets 3 yards) input method, can singly be placed on certain key to " mouth ", on 6 keys, form 6 such key inputs of " 12345+ mouth ", at this moment, the word tangerine of original repeated code, dwell, tell, meet etc., just repeated code no longer.
Learn to use for the ease of the operator, the used single of the present invention is drawn, for example under 5 kinds of stroke situations, can be the representative horizontal stroke one of stroke, perpendicular Shu, cast aside Pie, press down , folding second, and radical is printed or is engraved on the corresponding numerical key.
The present invention can also design the brevity code input of some Chinese character.As the all-key of a Chinese character, for example " I "---312154, in whole 6 yards space encoder, all-key needn't be imported and finish, and when this word can show uniquely, this was short sign indicating number than all-key, was the brevity code of this word.Because in the whole coding scheme, there are suitable coding redundancy degree and discreteness, so many Chinese characters all can have brevity code, can also import, to improve input speed with brevity code.
In 3755 GB primary words, substantive distinguishing features that the present invention gives prominence to and great technical progress, can from the following comparison of the present invention and prior art repeated code situation, find out:
Prior art The present invention's (5 yards) The present invention's (6 yards)
Single codeword and proportion 462 words 722 words 1564 words
????12.3% ????19.2% ????41.7%
2 repeated code word and proportions 386 words 652 words 926 words
????10.3% ????17.4% ????24.7%
9 repeated codes are with interior number of words accumulative total and proportion 2186 words 3058 words 3620 words
????58.2% ????81.4% ????96.4%
As seen, under 6 yards situations of the present invention, 9 repeated codes account for 96.4% of GB primary word with interior number of words, and this ratio of prior art has only 58.2%, and the uniqueness of coding has improved 65.6%; Wherein, the Chinese character number of no repeated code is 3.39 times of prior art.
The present invention can be implemented in computer and various data typing, communication system, telephone bandset and network technology, the definition of each key and subsidiary function, can be realized by software, character and communication symbol beyond the Chinese character, can arrange separately as required, with this application that forms various product of the present invention, can be common in the information society of using Chinese character.

Claims (10)

1, a kind of header, residue digital stroke coding input method of Chinese character and keyboard thereof, described keyboard is a numeric keypad, has at least 1,2,3,4,5 five numerical key in order to represent the horizontal, vertical of Chinese character, casts aside (point), presses down, rolls over five kinds of strokes; It is characterized in that to account for Chinese character sum each combinde rqdical character more than 90%, be divided into stem and surplus two parts from structure, constitute the numerical coding of stem by the code of 2 or 3 strokes of the first stroke that comprises stem, by the first stroke that comprises surplus portion and the last 3, the numerical coding of 4 or 5 strokes constitutes the numerical coding of surplus portion, the stem coding adds the digital stroke coding that surplus coding constitutes fit Chinese character, add the stroke coding of independent body Chinese character and form header, residue compiling method and coding scheme thereof, use numeric keypad to computer or communication apparatus input Chinese character and or Chinese-character words.
2, header, residue stroke input method as claimed in claim 1 and keyboard thereof, it is characterized in that each fit Chinese character is got 5 yards at most, wherein first stroke of earlier getting stem according to sequential write adds that the end stroke or second stroke are as preceding 2 yards, get first, second and the coding of end stroke of surplus portion again as surplus portion according to sequential write, when the not enough code fetch of the stroke of stem or surplus portion, enchashment has stroke, during total code length less than 5, add benefit once " 0 " key to show end.
3, header, residue stroke input method as claimed in claim 1 and keyboard thereof, it is characterized in that each Chinese character is got 6 yards at most, wherein first stroke of earlier getting stem according to sequential write adds that the end stroke or second stroke are as preceding 2 yards, 4 yards first, second, third and the end strokes of getting surplus portion according to the stroke writing order in back, when the not enough code fetch of the stroke of stem or surplus portion, enchashment has stroke, during total code length less than 6, adds benefit " 0 " key once to show end.
4, header, residue stroke input method as claimed in claim 1 and keyboard thereof, it is characterized in that each Chinese character is got 6 yards at most, wherein first stroke, second stroke of earlier getting stem according to sequential write adds that end stroke or the 3rd stroke are as preceding 3 yards, get first, second and end stroke of surplus portion again according to sequential write, when the not enough code fetch of the stroke of stem or surplus portion, enchashment has stroke, during total code length less than 6, add benefit once " 0 " key to show end.
5, header, residue stroke input method as claimed in claim 1 and keyboard thereof is characterized in that each Chinese character is got 7 yards at most.2 yards of stems, surplus 5 yards, wherein the first and end stroke of the first stroke of stem and surplus portion must be got; Perhaps stem is 3 yards, and surplus 4 yards, wherein the first and end stroke of stem and surplus portion all must be got.
6, header, residue stroke input method as claimed in claim 1 and keyboard thereof it is characterized in that stem only gets 2 sign indicating numbers, and these 2 sign indicating numbers are first radical region-position codes on 5 districts, 25 bit keyboards in this Chinese character five stroke word pattern input method.
7, as claim 1 or 2 or 3 or 4 or 5 or 6 or 7 described header, residue stroke input method and keyboards thereof, it is 4 or 5 or 6 or 7 that its maximum code length can otherwise provide, it is characterized in that for monomer word that accounts for Chinese character sum about 10% or radicals by which characters are arranged in traditional Chinese dictionaries Chinese character, its coding and input method is the sequential write by standard; get the stroke number that must comprise first stroke and end stroke and reach the regulation code length; during not enough regulation code length, add " 0 " key is as end.
8, header, residue stroke input method as claimed in claim 1 and keyboard thereof, it is characterized in that after a vocabulary identity code, only import former codings of each individual character in the Chinese character word and can import word, the number of Chinese character can be 2 in the word, 3, more than 4 to tens.
9, as claim 1 or 2 or 3 or 4 or 5 or 6 described header, residue stroke input method and keyboards thereof, it is characterized in that numeric keypad has 10 numerical keys such as 1,2,3,4,5,6,7,8,9,0 at least, wherein numerical key 6,7,8,9,0 is set to function key, be set to " universal key " respectively, forward and backward page-turning function key, word sign key, wherein the function of " universal key " is to substitute the stroke that is difficult to determine; When repeated code occurring, 0 key can be used as page turning key, and the numerical key that is right after after 0 key is and selects the repeated code key, can select desired word from the repeat code Chinese character that screen shows.
10, as claim 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 described header, residue stroke input method and keyboards thereof, it is characterized in that on the numerical key 6,7,8,9,0, can also distinguish or divide into groups to settle a few the highest Hanzi component of occurrence frequency.Mouth, day, Rolling, soil, Rui etc.In the process by the code taking rule of header, residue and code length requirement code fetch, when running into these several parts, they are only got 1 sign indicating number and no longer split into the single picture, so still press the coding scheme that header, residue code fetch method of the present invention forms, both can be used as a kind of independent coding input mode of using, also can mix, compatibility or have both general with the various coding schemes of header, residue.
CN 00100002 2000-01-03 2000-01-03 Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board Expired - Fee Related CN1123813C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 00100002 CN1123813C (en) 2000-01-03 2000-01-03 Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 00100002 CN1123813C (en) 2000-01-03 2000-01-03 Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board

Publications (2)

Publication Number Publication Date
CN1256446A true CN1256446A (en) 2000-06-14
CN1123813C CN1123813C (en) 2003-10-08

Family

ID=4575162

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 00100002 Expired - Fee Related CN1123813C (en) 2000-01-03 2000-01-03 Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board

Country Status (1)

Country Link
CN (1) CN1123813C (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102314226A (en) * 2010-06-30 2012-01-11 汉王科技股份有限公司 Method for quickly inputting Chinese characters and keyboard
CN102830810A (en) * 2011-06-17 2012-12-19 汉王科技股份有限公司 Chinese character input method and keyboard realizing same

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102314226A (en) * 2010-06-30 2012-01-11 汉王科技股份有限公司 Method for quickly inputting Chinese characters and keyboard
CN102830810A (en) * 2011-06-17 2012-12-19 汉王科技股份有限公司 Chinese character input method and keyboard realizing same
CN102830810B (en) * 2011-06-17 2016-05-25 汉王科技股份有限公司 Chinese character input method and realize the keyboard of the method

Also Published As

Publication number Publication date
CN1123813C (en) 2003-10-08

Similar Documents

Publication Publication Date Title
CN1123813C (en) Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board
CN1171137C (en) Improved HLV Chinese character phonetic input method
CN1034245C (en) Burmese characters four-code intelligent coding method and keyboard thereof
CN1073722C (en) Pinyin input method
CN100347645C (en) Chinese phonetic input method for digital keyboard
CN1020386C (en) Structure strokes four-figure number coding method and keyboard
CN1112629C (en) Chinese-character and English input method by numeral keypad
CN1257445C (en) Chinese-character 'Pronunciation-meaning code' input method
CN1117163A (en) Chinese character pictographic code input method and its keyboard
CN1420422A (en) Stroke set digit representation method for code element and use
CN100339808C (en) U Code Chinese character inputting method
CN1068203A (en) Pronunciation-form-meaning words compatible encoding system and keyboard
CN1070493A (en) Sound-shape word combined coding
CN1103497A (en) Chinese character coding method with double strokes and phonetic letter, and keyboard thereof
CN1140867C (en) Chinese character three-code input method
CN1109284C (en) Multi-information code Chinese character input system for computer
CN1089918C (en) Chinese three-dimensional coding method
CN1167994C (en) Input method for Chinese character
CN1049989C (en) Two-stage numeral transmission and numeral keyboard for Chinese characters
CN1162766C (en) Chinese-character 'pronunciation-shape code' input method and its keyboard profile
CN1405660A (en) Chinese character input method
CN1237730A (en) Chiense character code keyboard input method
CN1779624A (en) Chinese coding and input method on syllable compression platform and keyboard
CN1255667A (en) Chinese-character optimizing input method with basic stroke code and its digital key pad
CN1704878A (en) Chinese characters coding method

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee
CP03 Change of name, title or address

Address after: Room 10, building 1201, new Jiayuan, No. 5, Changchun Bridge Road, Beijing, Haidian District

Patentee after: Wang Yongmin

Address before: Room 1905, building C (building 12), new garden Jiayuan, No. 5, Changchun Bridge Road, Beijing, Haidian District

Patentee before: Wang Yongmin

C56 Change in the name or address of the patentee
CP02 Change in the address of a patent holder

Address after: 100089, Beijing, Changchun, Haidian District Bridge Road 5, new starting point Jiayuan 6 building, Room 903

Patentee after: Wang Yongmin

Address before: 100089, Beijing, Changchun, Haidian District Bridge Road 5, new starting point Jiayuan 10 building, room 1201

Patentee before: Wang Yongmin

C56 Change in the name or address of the patentee
CP02 Change in the address of a patent holder

Address after: 100089 Beijing City, Haidian District Changchun Road No. 11 willow city building C1 block 807

Patentee after: Wang Yongmin

Address before: 100089, Beijing, Changchun, Haidian District Bridge Road 5, new starting point Jiayuan 6 building, Room 903

Patentee before: Wang Yongmin

DD01 Delivery of document by public notice

Addressee: Wang Yongmin

Document name: Notification to Pay the Fees

DD01 Delivery of document by public notice
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20031008

Termination date: 20140103

DD01 Delivery of document by public notice

Addressee: Wang Yongmin

Document name: Notification of Termination of Patent Right