CN1420422A - Stroke set digit representation method for code element and use - Google Patents

Stroke set digit representation method for code element and use

Info

Publication number
CN1420422A
Authority
CN
China
Prior art keywords
stroke
sign indicating
code element
indicating number
word
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 01139523
Other languages
Chinese (zh)
Other versions
CN1231830C (en)
Inventor
侯朋太
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Application filed by Individual
Priority to CN 01139523 (CN1231830C)
Publication of CN1420422A
Application granted
Publication of CN1231830C
Expired - Fee Related

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A method for representing code elements by stroke-set digits for Chinese character input, characterized in that a Chinese character is decomposed into single strokes, and either the number of strokes of each kind is used as a code element, or values are assigned to the different strokes, adjacent strokes are grouped in writing order, and the values of the strokes in each group are added to obtain a code element. The method can be used to encode Chinese characters or to form distinguishing codes. Its advantages are a simple method, short codes, and easy application.

Description

Stroke set digit representation method for code element and use
The invention is a method of representing by digits the code elements used in computer encoding of Chinese characters, together with uses of this method. The title of the invention is "Stroke set digit representation method for code element and use", hereinafter called the stroke-set method.
In this document, words joined by an underline are to be read as a single term; the same applies below. The invention belongs to the field of computer application software design and, more precisely, to the technical field of Chinese character encoding methods and their applications.
The encoding of Chinese characters (including words and phrases made up of Chinese characters) means arranging letters, digits, and other symbols into strings that form different sequences, each given sequence representing a corresponding Chinese character. This document calls such a sequence the outer code of the character, and calls each letter, digit, or symbol that makes up the outer code (one per position) a code element of the outer code (code element for short).
Current computer encodings of Chinese characters fall essentially into two classes. The first class uses letters and non-numeric symbols as code elements, with no digits among them, for example the five-stroke input method and the Intelligent ABC input method (reference consulted: "The Carry-Along Computer Handbook", written by Shu Yue, Zhuhai: Zhuhai Publishing House, 2000.10; Chinese depository library CIP data record (2000) No. 49640). The second class uses digits as code elements, for example the region-position code input method (same reference) and the stroke input method often used on mobile phones, in which a single character is divided into horizontal (including rising), vertical, left-falling, right-falling (including dot), and turning strokes (any single stroke with a turn or hook), represented by 1, 2, 3, 4, 5 respectively. The first-class methods each have their own advantages. However, the key symbols of a computer keyboard include the digit keys, and these methods do not use digits as code elements; digits serve only as sequence numbers for choosing among duplicate-code characters. The keyboard symbols are therefore not fully used: when an encoding scheme is drawn up, relatively few key symbols are available as code elements, and this shortage of symbols produces many duplicate-code characters. To reduce duplicates, more rules must be set or the outer code lengthened (for some characters the outer code is long: a five-stroke code can reach 4 elements and an Intelligent ABC code can reach 5), which slows input. In the second class, the region-position input method uses digits and has no duplicate codes, but it assigns thousands of characters their own region-position codes one by one, which is hard to memorize and hard to learn. The stroke input method has only five digits as code elements. With so few symbols, the outer codes of some characters are too long (up to six elements or so) and duplicate-code characters are numerous; input is especially slow when a character's radical has many strokes (for example, entering the character "dew" is slow). Moreover, current Chinese character indexing, such as looking up a character by its shape, usually relies on the radical method, which is hard to use quickly and needs a better replacement.
The purpose of the invention is as follows:
If the above coding methods are improved so that encoding uses as many digits as possible as code elements (the Arabic numerals 1, 2, 3, 4, 5, 6, 7, 8, 9, 0 and the decimal point ".", 11 symbols in all; the same applies below), different code elements can each be represented by a different single digit, and characters can then be encoded with digit codes. The number of code elements is then larger than in the stroke input method, so more code sequences can be formed and duplicate-code characters reduced. Purely numeric outer codes, once sorted, can also be made into a Chinese character index table.
Going further, code elements may be chosen from digits together with letters and other symbols. The number of code elements is then larger still, so that more sequences can be formed during encoding, which helps reduce duplicate-code characters. Combining these methods yields a comprehensive computer Chinese input scheme. One overall design thus gives rise to different applications: a single general principle with an inventive scheme of many applications, uses, and effects. This is a purpose of the invention.
The invention is realized as follows.
Computer encodings of Chinese characters are generally of two kinds: by shape and by sound. Encoding by sound can use double spelling (see Sections 2 and 3 of "The Carry-Along Computer Handbook"): the initial and the final of the character are each represented by one letter, giving a two-element outer code. This leaves too many homophones with duplicate codes that cannot be told apart. To distinguish homophones, a further indicator symbol can be appended after the two double-spelling codes. The outer code of a single character is then: initial code + final code + distinguishing code. This document calls the indicator symbol appended after the double-spelling codes to distinguish homophones the distinguishing code.
In shape-based encoding, a single character is split into single strokes of five kinds: horizontal, vertical, left-falling, right-falling, and turning. One of two approaches is then used. Approach a does not use the procedure of approach b below. Approach b uses the five digits 1, 2, 3, 4, 5 to represent horizontal, vertical, left-falling, right-falling, and turning strokes respectively, and combines the single strokes in writing order into groups of two; a leftover single stroke that cannot be combined with another stroke also forms a group (this includes the case where the stroke is followed by the "twenty" component (廿) or the "mouth" component (口), which by special rule are treated as wholes and not split, and the case where the stroke is the last stroke of the character). On this basis, the invention is characterized in that the single strokes obtained by splitting are gathered and represented by digits in one of the following two ways. a: In case a (the character is only split into the five kinds of single stroke), the strokes of the same kind in the character are gathered together, and the resulting digit serves as the code element for that kind of stroke. "The resulting digit" means the following:
1. If a kind of stroke is absent, it is represented by either "0" or "." (for example, the character "wood" (木) has no turning stroke, so that code element is "0" or ".").
2. If the number of strokes of one kind exceeds a set number (more than 4, or 5, or 6, or 7, or 8, or 9, or 10), the set number is used.
3. If the set number is not exceeded, the actual number of strokes of that kind is used.
In short, with the above method any Chinese character yields five code elements. This document calls this way of forming code elements, in which the strokes of the same kind in a character are gathered and represented by their count, the "same-set method".
The invention can further be characterized, following approach a, in that the five code elements formed by the same-set method are arranged in the order horizontal, vertical, left-falling, right-falling, turning, and the resulting sequence is the outer code of the character. The order of the five kinds may of course be permuted, and every such choice is equivalent, but a given encoding scheme can use only one. In this way any Chinese character can be given a 5-element outer code.
Such outer codes can be used for Chinese input on computers and mobile phones and, after sorting, for Chinese character indexes.
b: In case b (the single strokes horizontal, vertical, left-falling, right-falling, and turning are represented by 1, 2, 3, 4, 5, and every two strokes form a group, with a leftover single stroke also counted as a group), the digits of the strokes in each group are added, and the sum is the code element for the group. In detail: the character is divided into groups, the representative digits of the single strokes in each group are added, and the sum (which never exceeds 10) represents the code element. A sum below 10 is used as is. A sum of 10 (as for the first two strokes of "children" (幼), turning and turning, 5+5=10) is represented by either "0" or "."; the code element for the first stroke group of 幼 is thus "0" or ".", but only one of the two may be chosen. If a group contains only one stroke, the code element may simply be the representative digit of that stroke. Alternatively it may be specified that the digit is added to itself, or multiplied by 2, or increased by 5 (or 4, 3, 2, or 1), with the result representing the code element. Within a given encoding scheme, a lone stroke must be handled by only one of these representations, applied uniformly; whichever is chosen, the choices are equivalent.
For example, in the character "big" (大), the first stroke group is horizontal and left-falling; their digits are 1 and 3, and 1+3=4, so 4 is the code element of the first group. The second group has only a right-falling stroke, whose digit is 4. The code element of the second group can be 4, or 4+4=8, or 4×2=8, or 4+5=9, and so on, but only one may be chosen. The first option is preferred, so the code element of the second stroke group of 大 is best taken as 4.
The invention can further be as follows. All single characters fall into two cases: first, the character's structure contains neither the "twenty" component (廿) nor the "mouth" component (口) (for example "jade", "Chinese", "unit"); second, the character contains one or both of them (for example "reed", "leaf"). Accordingly, the invention can further be characterized in that, when a character is split by approach b and its structure contains 廿 or 口, the component is not split but treated as a whole and as one stroke group. Either "1" represents 廿 and "." represents 口, or "." represents 廿 and "1" represents 口; a given encoding scheme can choose only one of these two representations and must not mix them. Note also that 廿 here means a 廿-shaped member of the character (such as the grass head); it may only adjoin other strokes and must not cross them, so the 廿 shape inside "Gan" (甘) does not count. The "mouth" component means a small 口 structure in the character; it must not be crossed by other strokes and must be empty in the middle, so "prisoner" (囚), "middle" (中), and the like do not count as containing 口. When a single stroke precedes 廿 or 口, that stroke forms a group by itself, because the component after it is a whole that is not split. For example, when "or" (或) is split, the first stroke is horizontal; because 口 follows, the horizontal stroke forms a group alone.
In short, this method is applicable to any single Chinese character.
This document calls the method of grouping a character's strokes in writing order and then adding up the strokes within each group to form code elements the "union method".
The invention can further be: forming code elements by the union method and encoding Chinese characters directly (in other words, the method directly yields a product: an encoding scheme). The scheme is as follows. First, the coding of single characters. Single characters are divided into two classes, which together cover all Chinese characters. The first class consists of characters that cannot be split into two halves, called here single-body characters. The second class consists of characters that can be split into two halves, called here two-part characters. Two-part characters comprise all compound characters and characters with a radical; but if the radical has only one stroke, the character does not count as two-part. All characters other than two-part characters are single-body characters, including those whose radical has only one stroke (for example "nine", "birth", "with"). The boundary between the two classes is thus clear. Single-body characters (those that cannot be divided into two halves) are encoded as follows.
First, form each code element of the character by the union method.
Then arrange the code elements in writing order to form a sequence; this sequence is the outer code of the character. For example, to encode "or" (或), split its strokes into the groups: horizontal; mouth (口); horizontal and turning; left-falling and dot. The corresponding code elements are 1, ".", 6, 7. Joined in order they give 1.67, which is the outer code of 或.
Likewise, "sky" (天) has the stroke groups horizontal and horizontal, then left-falling and right-falling, so its outer code is 27.
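The grouping and summing steps can be sketched in Python as follows. This is a minimal illustration rather than the patent's own implementation: the stroke names, the function names, and the hand-transcribed stroke lists are assumptions, the lone-stroke rule is fixed to "use the stroke's own digit", and a sum of 10 is written as "0".

```python
# Digit assigned to each kind of single stroke (values taken from the text).
STROKE_VALUE = {"horizontal": 1, "vertical": 2, "left_falling": 3,
                "right_falling": 4, "turning": 5}
# Components kept whole; the text also allows the opposite assignment.
COMPONENT_CODE = {"twenty": "1", "mouth": "."}


def union_code_elements(units, ten_symbol="0"):
    """Group units in writing order and return one code element per group."""
    elements = []
    i = 0
    while i < len(units):
        unit = units[i]
        if unit in COMPONENT_CODE:
            # A whole component is a group of its own.
            elements.append(COMPONENT_CODE[unit])
            i += 1
            continue
        total = STROKE_VALUE[unit]
        i += 1
        # Pair with the next unit only if it is a plain stroke; a stroke that
        # precedes a component, or ends the character, stays alone.
        if i < len(units) and units[i] in STROKE_VALUE:
            total += STROKE_VALUE[units[i]]
            i += 1
        elements.append(ten_symbol if total == 10 else str(total))
    return elements


def union_outer_code(units):
    """Join the code elements in writing order to form the outer code."""
    return "".join(union_code_elements(units))


# Stroke lists transcribed by hand from the worked examples (dot counts as right_falling).
EXAMPLE_1_67 = ["horizontal", "mouth", "horizontal", "turning",
                "left_falling", "right_falling"]
EXAMPLE_27 = ["horizontal", "horizontal", "left_falling", "right_falling"]

print(union_outer_code(EXAMPLE_1_67))  # 1.67
print(union_outer_code(EXAMPLE_27))    # 27
```

A scheme that prefers "." for a sum of 10 would pass `ten_symbol="."` to `union_code_elements`, but it would then need a different symbol for the mouth component to keep the two apart.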
Two-part characters (for example "beat", "swim", "send", "room") are encoded as follows. First divide the character into two halves: a left-right character into left and right halves, a top-bottom character into upper and lower halves. For a character with a radical (the radical must have two or more strokes), the radical counts as one half and the rest as the other. The half written first is called the first half, and the half written afterwards the later half. Take the code element of the first stroke group of the first half as the first code of the character, then take the code elements of each stroke group of the later half in turn until all are taken. Joined in order after the first code, they form the outer code of the character. For example, "strings of cash" (镪) is divided into "gold" (钅) and "strong" (强). The first code is taken from "gold": left-falling and horizontal give 4. The stroke groups of "strong" are then taken in turn: turning and horizontal; turning; mouth; vertical and turning; horizontal and vertical; horizontal and dot. The outer code of 镪 is therefore 465.735. Likewise, the outer code of "rob" (抢) is 670. A further rule may be added: if the outer code of a single character has more than four elements, only the first 4 are kept and the rest dropped; codes with fewer than 4 elements are left unchanged.
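As a rough sketch of this rule, the function below assembles the outer code from code elements that have already been formed for each half. The function name, the `max_len` parameter, and the idea of passing each half as a list of one-character code elements are assumptions made for illustration.

```python
def two_part_outer_code(first_half_elements, later_half_elements, max_len=None):
    """Join the first group of the first half with every group of the later half."""
    if not first_half_elements:
        raise ValueError("the first half needs at least one stroke group")
    code = first_half_elements[0] + "".join(later_half_elements)
    # Optional rule from the text: keep only the first max_len code elements.
    return code if max_len is None else code[:max_len]


# Code elements copied from the worked examples (codes 465.735 and 670).
print(two_part_outer_code(["4"], ["6", "5", ".", "7", "3", "5"]))             # 465.735
print(two_part_outer_code(["4"], ["6", "5", ".", "7", "3", "5"], max_len=4))  # 465.
print(two_part_outer_code(["6"], ["7", "0"], max_len=4))                      # 670
```

Only the first element of the first half is used, so the remaining groups of the radical never need to be worked out.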
If a beginner is unsure how to split a character (for example, cannot tell whether it is a single-body or a two-part character, or misjudges the radical), the character can be given two codes that both map to it, like a polyphonic character in pinyin. For the sake of uniform coding, this treatment may of course also be omitted. In this way every single character can be given a numeric outer code of at most 4 elements. This document calls this way of forming numeric outer codes from union-method code elements the union digital encoding method. Outer codes formed this way are very convenient for Chinese input on computers and mobile phones. After all characters in a dictionary are uniformly encoded by this method and arranged in order, the result can be used to make a character lookup table, and can also provide dictionary functions in computer software.
The invention can further be: using the union encoding method to form short codes for Chinese two-character words, multi-character words, and phrases.
First, the short code of a two-character word. Take the first two codes of the first character and the first two codes of the second character, and arrange them in order to form a 4-element short code. For example, in "greatness" (伟大), the first and second codes of 伟 are 52 and the first and second codes of 大 are 44; joined, 5244 is the short code of 伟大.
Likewise, a 4-element short code can be formed for any two-character word.
Second, the short code of a three-character word. Take the first and second codes of the first character, then the first code of the second character, then the first code of the third character; joined, these 4 codes are the short code.
Third, the short code of a word or phrase of four or more characters. Take the first code of the first, second, and third characters in turn, then the first code of the last character; joined in order, these 4 codes are the short code.
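The three cases can be sketched as one small function that takes the outer codes of the characters in a word. The function name is an assumption, and apart from the 5244 example the inputs below are arbitrary combinations of outer codes quoted in the text, chosen only to exercise each branch; they do not stand for real words.

```python
def word_short_code(char_codes):
    """Build a 4-element short code from the outer codes of a word's characters."""
    n = len(char_codes)
    if n < 2:
        raise ValueError("a word needs at least two characters")
    if n == 2:
        # Two characters: first two codes of each.
        parts = [char_codes[0][:2], char_codes[1][:2]]
    elif n == 3:
        # Three characters: two codes, then one, then one.
        parts = [char_codes[0][:2], char_codes[1][:1], char_codes[2][:1]]
    else:
        # Four or more: first code of characters 1 to 3, then of the last one.
        parts = [char_codes[0][:1], char_codes[1][:1],
                 char_codes[2][:1], char_codes[-1][:1]]
    return "".join(parts)


# The text's example: first two codes 52 and 44 give 5244.
print(word_short_code(["52", "44"]))                  # 5244
# Arbitrary illustrative inputs built from outer codes quoted in the text.
print(word_short_code(["27", "44", "1.67"]))          # 2741
print(word_short_code(["27", "44", "1.67", "670"]))   # 2416
```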
The invention can further be as follows. After a single character is encoded by double spelling (the initial code plus the final code of the character, each represented by one letter), a distinguishing code can be appended after the two codes. The indicator symbol used for the distinguishing code can be chosen freely: it may be a letter or a non-numeric symbol, and the invention is characterized in that the indicator symbols also include the numeric code elements formed by the union method. That is, the distinguishing code may be not only a letter (such as a, b, c) or a non-numeric symbol (such as ; : , ?) but also a single digit symbol (such as ., 0, 1, 2, 3). This distinguishing code can be used for coding single characters and two-character words, which is also the main use of approach b. The main content of the concrete encoding scheme is as follows:
First, single-character coding. All single characters are divided into two classes. The first class is left-right characters (for example "marquis", "bright", "sing"). The second class is non-left-right characters, that is, all characters other than the first class (for example "room", "district", "dew", "outstanding"). The boundary between the two is clear.
Second, every left-right character is divided into two halves, left and right. The leftmost radical counts as the left half and the rest as the right half, but the radical must have two or more strokes. The initial letter of the name of the left half is then taken, expressed as a double-spelling letter; the name of the left half, that is, the pronunciation of the component or the name of the left radical, follows the dictionary standard. If the left half has no name, a symbol replaces it (for example, one of V or ? by convention). This letter or symbol serves as the distinguishing code of the whole character (both halves together). For example, the left half of "swim" is the three-dot water radical, so its initial s is the distinguishing code; likewise the left half of "push" is the hand radical, whose double-spelling initial is t, so the distinguishing code of "push" is t.
Third, the distinguishing code of every non-left-right character is formed as follows: take the first stroke group of the character in writing order and use the code element produced by the union method as the distinguishing code. (Note: when 廿 or 口 is met, by the rule of the union method 廿 is represented by 1 and 口 by "."; and when a stroke is followed by 廿 or 口, the representative digit of that stroke is the code element.) For example, the first stroke group of "too" (太) is horizontal and left-falling, giving 1+3=4, so 4 is the distinguishing code of 太. Likewise the distinguishing code of "a kind of reed mentioned in ancient books" is 1, that of "cry" is k, and that of "can" is 6. Every single character can form a distinguishing code in this way, and every Chinese character can form a 3-element outer code: initial code + final code + distinguishing code. This document calls this method of encoding Chinese characters the double-spelling three-code method.
Fourth, with distinguishing codes formed for single characters as above, short codes can be formed for Chinese two-character words as follows: take the initial of the first character, then the distinguishing code of the first character, then the distinguishing code of the second character, and arrange the three in order to form the 3-element short code of the word. For example, the short code of "years" (岁月) is formed by taking the initial s of 岁, then the distinguishing code 7 of 岁 (not its final), then the distinguishing code 8 of 月; s78 is the short code of 岁月.
Likewise, any two-character word can be given a 3-element short code by this method. This document also calls this way of forming short codes for Chinese two-character words the double-spelling three-code method.
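The choice of distinguishing code and the 3-element short code can be sketched as below. The function and parameter names are assumptions, the fallback symbol "v" is just one of the options the text mentions, and the double-spelling letter for the final is not computed because it depends on the double-spelling table in use.

```python
def distinguishing_code(left_right, left_half_initial=None,
                        first_group_element=None, unnamed_symbol="v"):
    """Pick the distinguishing code for one character."""
    if left_right:
        # Left-right character: initial letter of the left half's name,
        # or a fixed symbol when the left half has no name.
        return left_half_initial or unnamed_symbol
    if first_group_element is None:
        raise ValueError("a non-left-right character needs its first group element")
    # Non-left-right character: union-method element of the first stroke group.
    return first_group_element


def two_char_short_code(first_initial, first_dist, second_dist):
    """Initial of the first character plus both distinguishing codes."""
    return first_initial + first_dist + second_dist


# Left half named with initial "s" (three-dot water radical in the text).
print(distinguishing_code(True, left_half_initial="s"))     # s
# Non-left-right character whose first group sums to 1 + 3 = 4.
print(distinguishing_code(False, first_group_element="4"))  # 4
# The text's two-character example: initial s, distinguishing codes 7 and 8.
print(two_char_short_code("s", "7", "8"))                   # s78
```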
The invention can further form short codes, by the above method, for Chinese words of three or more characters. The steps are: take the initial of the first character of the word (using double spelling), then the distinguishing code of the first character, then the distinguishing code of the second character, then the distinguishing code of the last character, 4 codes in all (note that the last three codes are distinguishing codes, not final codes). For example, the short code of the word "host king" is d6z2.
From the above it can be seen that, under one general principle and guiding idea, the invention forms numeric code elements and has produced different applications: the same-set encoding method, the union encoding method, and the method of using union code elements as distinguishing codes. Whichever encoding is used, it is to be applied to computer Chinese input, and a workable computer Chinese input scheme often combines several encoding methods. The several input methods formed from the invention can therefore be designed into one input scheme: familiar characters are entered with the double-spelling three-code method (3-code input); unfamiliar characters are entered with the union-method numeric outer code (4-code input); and rare characters whose stroke order is uncertain are entered with the same-set encoding method (5-code input). The three divide the work and complement each other, so input is fast and without difficulty: any character that can be seen can be entered. There is no need to set up a "z key", fuzzy input, or other help functions such as a "universal key", nor to page through candidates.
It is not hard to see that the invention has many advantages. First, although the method sometimes splits characters, the splitting rules are all explicit, avoid wording such as "generally ...", and are logically clear-cut. Moreover, splitting takes adjacent strokes (unlike some radical-splitting methods that take crossing strokes apart), so it is easy. Even when the 廿 and 口 components are separated out, they do not cross other strokes; and when a character is divided into two halves, the split is made at a radical of two or more strokes. All of this is easy, unlike coding rules that take crossing strokes apart, which is difficult and prone to inconsistency. Encoding with the union method uses only simple addition, so the result comes to mind quickly and operation is fast. In particular, with the double-spelling three-code method of the invention, all single-character codes have at most three elements (certainly faster than 4-element input), and its distinguishing code is set rather ingeniously. In encoding, characters are first divided into left-right characters as one class and non-left-right characters as the other. For a left-right character one looks only at its left component or left radical (which is not a single stroke); this rule is explicit, intuitive, simple, easy to remember, and easy to learn and apply. The second class, non-left-right characters, comprises all characters other than left-right ones; these are not divided into two halves, and only the first stroke group is taken, which is likewise explicit, simple, and easy to learn and use. With the double-spelling three-code method, the initial, the final, and the distinguishing code come to mind very quickly; for the distinguishing code one considers only the head of the character and need not examine its other strokes, so input is fast. Combining the two kinds of distinguishing code gives more indicator symbols and fewer duplicate-code characters (almost none among familiar characters). If instead only the representative letter of the radical were used as the distinguishing code, the method would be hard to operate, because dictionaries contain many radicals that are hard to find or judge, and radicals of only one stroke are hard to recognize, leaving the learner at a loss.
Furthermore, if several applications of the invention are combined in one input system when the software is designed, the methods divide the work and complement each other, and can undoubtedly form a good Chinese input scheme.
Embodiment 1:
Coding a single Chinese character with code elements formed by the same-set method of the stroke-set method
(referred to in this text as the Foolish Old Man's method)
1. First decompose the single Chinese character into five kinds of single strokes: horizontal (including the horizontal stroke and the rising stroke), vertical (without a hook), left-falling, right-falling (including the right-falling stroke and the dot), and turning (every single stroke that has a hook or a turn). After the strokes are sorted into these five classes, in the order horizontal, vertical, left-falling, right-falling, turning, count the total for each class.
2. If a total is less than 6, the actual count is the code element of that stroke class; if the total is 6 or more, the code element is 6; if the character has no stroke of that class, the code element is 0.
3. Arrange the code elements in the order horizontal, vertical, left-falling, right-falling, turning; the result is the outer code of the character.
As " marquis " word, be split as one stroke and be: cast aside, perpendicular, folding, horizontal, cast aside, horizontal, horizontal, cast aside, press down, gather and have 3 horizontal strokes, 3 is exactly horizontal correspondence Sign indicating numberUnit; Have 1 and erect, perpendicular corresponding code element is 1; Have 3 left-falling strokes, the correspondence of left-falling stroke Code elementBe 3; Have 1 right-falling stroke, the corresponding code element of right-falling stroke is 1; Have 1 folding, folding Code elementBe 1.Be 31311 by cast aside pressing down anyhow that the folding platoon gets up then, it " marquis " Outer sign indicating number
For another example: " Chinese " word, split the back single and divide point, point into, carry, roll over, press down, gather, have only a horizontal stroke (promptly carrying), then Heng corresponding code element is 1.The not perpendicular pen of " Chinese " word, perpendicular correspondence Code elementBe 0: do not cast aside pen, the correspondence of left-falling stroke Code elementAlso be 0; 2 points and 1 right-falling stroke are arranged, the correspondence of right-falling stroke Sign indicating number UnitBe 3; 1 folding pen is arranged, the correspondence of folding Code elementBe 1, platoon gets up, the correspondence of " Chinese " Code elementBe 10031.
For another example: " Qu " word, its horizontal pen be 10 and surpass 6, and 6 to be representative, then the code element of horizontal pen is 6, and 4 perpendicular pens are arranged, perpendicular code element is 4,1 left-falling strokes, and the code element of left-falling stroke is 1, one point, and the code element of right-falling stroke is 1, two foldings, the code element of folding is 2, platoon gets up, then " Qu " word Outer sign indicating numberBe 64112.
Also can be with such method to any encode Chinese characters for computer.
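The three steps of this embodiment can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `same_set_code` and the pinyin labels for the five stroke classes are hypothetical, and each stroke is assumed to have been classified by hand already (dots as right-falling, rising strokes as horizontal).

```python
from collections import Counter

# The five stroke classes in the fixed output order: horizontal, vertical,
# left-falling, right-falling, turning. The labels are hypothetical names.
ORDER = ("heng", "shu", "pie", "na", "zhe")

def same_set_code(strokes, cap=6):
    """Count each stroke class and emit one digit per class, capped at `cap`.

    A class that does not occur yields 0, as in step 2 of the embodiment.
    """
    counts = Counter(strokes)
    return "".join(str(min(counts[name], cap)) for name in ORDER)

# 侯 ("marquis"): left-falling, vertical, turning, horizontal, left-falling,
# horizontal, horizontal, left-falling, right-falling
hou = ["pie", "shu", "zhe", "heng", "pie", "heng", "heng", "pie", "na"]
# 汉 ("Chinese"): dot, dot, rising, turning, right-falling
han = ["na", "na", "heng", "zhe", "na"]

print(same_set_code(hou))  # 31311
print(same_set_code(han))  # 10031
```

Because the code depends only on per-class totals, stroke order does not matter for this method; only the classification of each stroke does.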
Example 2: Coding Chinese characters with the same-set method
Stipulate that if a character lacks a stroke class, the corresponding code element is represented by " "; and that if the total of a stroke class in a character exceeds 9, the corresponding code element is "9". The other rules are the same as in Example 1. (Unlike Example 1, "0" is not used for a missing stroke class, and 6 is not used for totals above 6; a stroke class with no more than 9 strokes is still represented by its actual count.) For example, the outer code of 侯 is still 31311, the outer code of 汉 instead becomes 111, and the outer code of "Qu" instead becomes 94112.
Any Chinese character can be coded in this way.
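The variant in this example differs from Example 1 only in the cap (9 instead of 6) and in the symbol used for a missing stroke class. A minimal sketch, assuming the five per-class totals are already known: the function name `same_set_code_v2` is hypothetical, and the placeholder `"_"` is an assumption, because the symbol for a missing class is not legible in the text.

```python
def same_set_code_v2(counts, cap=9, missing="_"):
    """Build an outer code from five per-class totals.

    counts: totals in the order horizontal, vertical, left-falling,
    right-falling, turning. `missing` is an assumed placeholder symbol.
    """
    return "".join(missing if n == 0 else str(min(n, cap)) for n in counts)

print(same_set_code_v2([3, 1, 3, 1, 1]))   # 31311
print(same_set_code_v2([10, 4, 1, 1, 2]))  # 94112
```

Raising the cap keeps more characters apart when one stroke class is very frequent, at the cost of one extra symbol in the code alphabet.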
Example 3: Coding Chinese characters with the union method of the stroke-set method (referred to in this text as the union coding method)
First define the union method. The general rules are as follows:
To code a single Chinese character, first split it into single strokes of five kinds: horizontal (including the rising stroke), vertical, left-falling, right-falling (including the dot), and turning (every stroke with a turn or a hook), represented by 1, 2, 3, 4, 5 respectively. Then, in writing order, gather the strokes in groups of two. Where 廿 or 口 is encountered it is not split but counted whole as one group; a single stroke left over before 廿 or 口 once the preceding strokes are grouped also counts as one group, as does a single stroke left over after the other strokes are paired. The sum of the representative numerals of the two strokes in a group is the code element of that group (except that the code element of a turning stroke combined with a turning stroke is defined as the digit 0).
廿 as a whole is one group; its code element is 1.
口 as a whole is one group; its code element is ".".
A single stroke that forms a group by itself takes its own representative numeral as the code element.
Coding by these rules proceeds as follows:
One, the coding of single Chinese characters
Single Chinese characters are divided into two classes (the two classes together make up all Chinese characters).
The first class is characters that are not split into two halves, called monomer characters. Such a character contains no radical of two or more strokes, still less is it a compound character; characters that contain only a single-stroke radical are included here. Examples: sheet, first, one-tenth, fourth, just, or not fragrant-flowered garlic, also, book, interior, adopted, ball, pellet, fly.
The second class is characters that are split into two halves, called binary characters. Examples: marquis, pond, give, roc, room, disease, ticket.
Binary characters comprise every character outside the first class: all compound characters and all characters with a radical (the radical must have two or more strokes).
Three, coding of monomer characters proceeds as follows
1. First form the code elements of the character with the union method, that is, by the rules above.
2. Arrange these code elements in writing order to form a sequence; this sequence is the outer code of the character.
For example, to compile the outer code of 大 ("greatly"), group its strokes: the first group is horizontal plus left-falling, 1+3=4, so 4 is the code element of the first group;
the second group has only a right-falling stroke, whose representative numeral is 4, so 4 is the code element of the second group. The outer code of 大 is therefore 44.
As another example, to code 或 ("or"), split it into stroke groups.
The first group has only a horizontal stroke (口 follows it, so it cannot be added to another single stroke); the single stroke forms a group by itself and the code element is 1;
the second group is 口 (counted whole as one group), and the code element is ".";
the third group is rising plus turning, 1+5=6, and the code element is 6;
the fourth group is left-falling plus dot, 3+4=7, and the code element is 7.
Arranged in order, the outer code of 或 is 1.67. Likewise, the outer code of 天 ("day") is 27.
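The grouping rule and the worked examples above can be sketched as follows. This is an illustration under stated assumptions rather than the patent's own procedure: the name `union_code` is hypothetical, strokes are assumed to be supplied already classified as the numerals 1 to 5 in writing order, and the whole components 廿 and 口 are passed as the strings "廿" and "口" with the code elements 1 and "." described in the rules.

```python
# Components kept whole, with the code elements given in the rules
# (廿 -> 1, 口 -> ".").
UNIT_CODES = {"廿": "1", "口": "."}

def union_code(strokes):
    """Outer code of a monomer character by the union method.

    strokes: writing-order list of ints 1..5 (horizontal, vertical,
    left-falling, right-falling, turning) or the whole units "廿" / "口".
    """
    code, pending = [], None
    for s in strokes:
        if s in UNIT_CODES:
            if pending is not None:      # lone stroke left before a unit
                code.append(str(pending))
                pending = None
            code.append(UNIT_CODES[s])
        elif pending is None:
            pending = s                  # first stroke of a new pair
        else:
            total = pending + s
            # turning + turning (5 + 5) is defined as the digit 0
            code.append("0" if total == 10 else str(total))
            pending = None
    if pending is not None:              # lone stroke left at the end
        code.append(str(pending))
    return "".join(code)

print(union_code([1, 3, 4]))               # 大 ("greatly") -> 44
print(union_code([1, "口", 1, 5, 3, 4]))   # 或 ("or") -> 1.67
```

With 口 written as ".", the four groups of 或 come out as 1, ".", 6, 7 in writing order.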
3. Coding of binary characters
① First divide the binary character into two halves: a left-right character is divided into a left half and a right half, an upper-lower character into an upper half and a lower half, and for a character with a radical of two or more strokes the radical is one half and the remainder is the other half. The half written first is the first half; the half written later is the second half.
② By the union method, take the code element of the first stroke group of the first half, then take the code elements of every stroke group of the second half, and arrange them in writing order to form the numeric outer code of the character.
③ For a character whose code would exceed 4 codes, only the first 4 codes are taken; that is, coding stops as soon as 4 codes have been compiled. In other words, however many stroke groups a character has, at most 4 groups (in stroke order) are taken when coding.
For example, to code 雇 ("employing"), first divide it into the two halves 户 and 隹. Take the first stroke group of 户, dot plus turning: its code element is 4+5=9. Then take three codes from 隹: left-falling plus vertical, 5; dot plus horizontal, 5; horizontal plus horizontal, 2. This gives the outer code of 雇, 9552.
With the above method, any single Chinese character can be given an outer code of no more than 4 codes.
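Steps ① to ③ can be sketched as below, assuming the character has already been divided into its two halves by hand and each half is supplied as a writing-order list of stroke numerals. The names `group_codes` and `binary_code` are hypothetical; the helper restates the union-method grouping (pairs summed, 廿 and 口 kept whole, turning plus turning giving 0) so that the block runs on its own.

```python
UNITS = {"廿": "1", "口": "."}  # components counted whole as one group

def group_codes(strokes):
    """Union-method code elements for a writing-order stroke list."""
    out, i = [], 0
    while i < len(strokes):
        s = strokes[i]
        if s in UNITS:
            out.append(UNITS[s])
            i += 1
        elif i + 1 < len(strokes) and strokes[i + 1] not in UNITS:
            # Pair of single strokes; the only sum reaching 10 is 5 + 5 -> 0.
            out.append(str((s + strokes[i + 1]) % 10))
            i += 2
        else:
            out.append(str(s))  # lone stroke forms its own group
            i += 1
    return out

def binary_code(first_half, second_half, max_len=4):
    """First group of the first half, then the groups of the second half,
    cut off after max_len code elements."""
    codes = group_codes(first_half)[:1] + group_codes(second_half)
    return "".join(codes[:max_len])

hu = [4, 5, 1, 3]                 # 户: dot, turning, horizontal, left-falling
zhui = [3, 2, 4, 1, 1, 1, 2, 1]   # 隹: eight strokes in writing order
print(binary_code(hu, zhui))      # 雇 ("employing") -> 9552
```

The fourth group of 隹 would give a fifth code element, which the 4-code limit discards.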
Once such numeric outer codes are formed, they can be arranged in ascending order so that each number corresponds to its character and each character to its page number in the dictionary. With this arrangement a Chinese character index can be made for the design of a computer dictionary; an index can also be made for a printed dictionary, marking the page on which each character appears, and can thus replace the radical method and the stroke-count method for looking up characters and words in dictionaries.
For example, an excerpt from the sorted Chinese character index:
Code      Character           Page number
……        ……                  ……
1721                          351
1723      Grass               44
1726      Transplant          135
1727      Chang               51
1728      Bright              337
1729      Luxuriant growth    203
1731      Seedling            342
……        ……                  ……
Note: the page numbers on the right in this example refer to the "Xinhua Dictionary" (1998 revised edition). To search another dictionary, the page number is that of the character in that dictionary.
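A sorted index of this kind can be looked up by binary search. The sketch below reuses the rows of the excerpt, with the character column given as the English glosses that appear there; the names `entries` and `lookup` are hypothetical, and comparing codes as strings is an assumption that holds here only because all codes have the same length.

```python
import bisect

# (outer code, character gloss as printed in the excerpt, dictionary page)
entries = [
    ("1728", "Bright", 337),
    ("1723", "Grass", 44),
    ("1731", "Seedling", 342),
    ("1726", "Transplant", 135),
    ("1727", "Chang", 51),
    ("1729", "Luxuriant growth", 203),
]

index = sorted(entries)                 # ascending by outer code
codes = [code for code, _, _ in index]  # parallel list used for searching

def lookup(code):
    """Return (character, page) for an outer code, or None if absent."""
    i = bisect.bisect_left(codes, code)
    if i < len(codes) and codes[i] == code:
        return index[i][1], index[i][2]
    return None

print(lookup("1727"))  # ('Chang', 51)
```

Characters that share an outer code would occupy adjacent rows, so a fuller index would return every row with the matching code rather than a single one.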
Example 4. Compiling brevity codes for two-character and multi-character words and phrases with the union Chinese character coding method.
One, the brevity code of a two-character word.
By the method of Example 3, form the first two codes of the first character and the first two codes of the second character, and arrange them in order; the 4 codes make up the brevity code of the two-character word. For example, in 英明 ("wise") the first 2 codes of 英 are 17 and the first 2 codes of 明 are 77; joined, 1777 is the brevity code of 英明. In the same way, any two-character word can be given a 4-code (purely numeric) brevity code.
Two, the brevity code of a three-character word.
Form the code elements by the method of Example 3, then take the first 2 codes of the first character, the first code of the second character, and the first code of the third character, 4 codes in all, to make up the brevity code. For example, in 莫须有 ("unwarranted") the first 2 codes of the first character 莫 are 17, the first code of the second character 须 is 6, and the first code of the third character 有 is 4, so 1764 is the brevity code of 莫须有.
Three, the brevity code of words of four or more characters.
Form the code elements by the method of Example 3, then take the first code of each of the first 3 characters and the first code of the last character and arrange them in order, 4 codes in all, to make up the brevity code. For example, for 一日千里 ("at a tremendous pace") the first codes of the four characters give 1747; for 世界贸易组织 ("World Trade Organization") the first codes of the first, second, and third characters, 378, followed by the first code 0 of the last character 织, give 3780.
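The three brevity-code rules differ only in how many leading code elements are taken from which characters. A minimal sketch, assuming the outer code of each character (or at least its leading code elements) has already been produced by the method of Example 3; the name `phrase_brevity_code` is hypothetical.

```python
def phrase_brevity_code(char_codes):
    """4-code brevity code of a word or phrase.

    char_codes: the outer codes (or their leading code elements) of the
    characters of the phrase, in order.
    """
    n = len(char_codes)
    if n == 2:
        # first two codes of each character
        parts = [char_codes[0][:2], char_codes[1][:2]]
    elif n == 3:
        # two codes from the first character, one from each of the others
        parts = [char_codes[0][:2], char_codes[1][:1], char_codes[2][:1]]
    elif n >= 4:
        # first code of the first three characters and of the last one
        parts = [c[:1] for c in char_codes[:3]] + [char_codes[-1][:1]]
    else:
        raise ValueError("a phrase needs at least two characters")
    return "".join(parts)

print(phrase_brevity_code(["17", "77"]))          # two characters  -> 1777
print(phrase_brevity_code(["17", "6", "4"]))      # three characters -> 1764
print(phrase_brevity_code(["1", "7", "4", "7"]))  # four characters  -> 1747
```

For phrases longer than four characters the middle characters contribute nothing, which keeps every phrase code at 4 code elements.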
Example 5. Coding Chinese characters with a difference code (referred to in this text as the double-pinyin three-code method).
This scheme codes every Chinese character (single character) as three codes without exception. However, characters that have separately been assigned brevity codes, and keyboard characters, need not use this method. The three codes are, in order: initial code + final code + difference code. The initial code and the final code are obtained by double pinyin, each represented by one letter; the 71st definition in the book "the carry-on treasured book of computer" may be adopted.
The difference code is set as follows:
1. First divide Chinese characters into two classes, the left-right type and the non-left-right type; the two classes together make up all Chinese characters. The left-right type comprises characters whose form is a left-right structure or a left-middle-right structure, and characters containing the 辶 radical (zǒuzhīdǐ) or the 廴 radical (jiànzìdǐ); any character with a left-side radical (which must be a radical of two or more strokes) belongs to this class. All other characters are of the non-left-right type (including monomer characters, all enclosure-type characters other than those with the 辶 or 廴 radical, upper-lower characters, crossing-type characters, and so on). The boundary between the two classes is clear.
2. For every left-right character, always take its left half (the left half may be a radical, or one part of a binary character), then say the name of the left half in double pinyin; the initial letter of the pinyin is the difference code. For example, the left half of 侯 ("marquis") is 单立人 (dānlìrén), whose initial letter is d, so d is the difference code of 侯. Likewise, the left half of 送 ("sending") is 走之底 (zǒuzhīdǐ), so z is the code element, and the left half of 鹏 ("roc") is 月字旁 (yuèzìpáng), so y is the code element. In the same way the left half of 游 ("swimming") is 三点水 (sāndiǎnshuǐ), so its difference code is the double-pinyin initial s of 三点水. The name of the left half must be determined according to standard Chinese usage. A left half that cannot be named is represented by "?". In this way every left-right character has its own corresponding difference code.
3. For a non-left-right character, whatever its structure, always take the first group of strokes in writing order and use the code element formed by the union method as the difference code (for how to take the code by the union method, see the rules in Example 3, which are not repeated here). For example, the first stroke group of 大 ("greatly") is horizontal plus left-falling, so the difference code of 大 is 4; likewise the difference code of 芹 ("celery") is 1 (because its first group is 廿), and the difference code of 兄 ("brother") is "." (because 口 is its first group). In this way every non-left-right character has its own difference code. Since left-right and non-left-right characters together make up all Chinese characters, every Chinese character has its own corresponding difference code.
4. On this basis, every Chinese character can form a three-code outer code: initial code + final code + difference code.
5. Once every single character has a three-code outer code containing a difference code, brevity codes can also be compiled for words made up of two characters.
The method of compiling the brevity code of a two-character word is as follows:
1. Take the initial of the first character of the two-character word (the initial must be the single letter produced by the double-pinyin method) as the first code of the brevity code.
2. Then take the difference code of the first character (the same character) as the second code of the brevity code.
3. Then take the difference code of the second character as the third code. Thus the initial code of the first character + the difference code of the first character + the difference code of the second character, three codes in all, make up the brevity code of the word. For example, for the two-character word 汉字 ("Chinese character"), first take the initial h of the first character 汉, then the difference code s of 汉 (the initial s of 三点水, the three-dot water radical), then the difference code of the second character 字 (its first and second strokes are dot and dot, whose representative numerals sum to 8); the brevity code of 汉字 is hs8. Likewise, for the two-character word 音标 ("phonetic symbol"), take the initial y of the first character 音, the difference code 5 of 音, and the difference code m of the second character 标 (the initial of 木, the radical of 标); y5m is the brevity code of 音标. Likewise the brevity code of 影响 ("influence") is yjk. Any two-character word can be given a brevity code in this way.
In this text, this method of representing the brevity code of a two-character word by one initial and two difference codes is also called the double-pinyin three-code method.
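The choice of difference code and the two-character brevity code can be sketched as follows. The function names and parameters are hypothetical, and two inputs are assumed to be determined by hand beforehand: the double-pinyin initial letter of the name of the left half (for a left-right character) and the first stroke group (for any other character). No particular double-pinyin key table is assumed.

```python
def difference_code(left_initial=None, first_group=()):
    """Difference code of a single character.

    left_initial: for a left-right character, the double-pinyin initial
    letter of the name of its left half ("?" if it cannot be named).
    first_group: for any other character, the numerals of its first stroke
    group, or the code of a whole component ("1" for 廿, "." for 口).
    """
    if left_initial is not None:        # left-right type
        return left_initial
    if isinstance(first_group, str):    # 廿 or 口 counted whole
        return first_group
    total = sum(first_group)
    return "0" if total == 10 else str(total)  # turning + turning -> 0

def two_char_brevity(first_initial, first_diff, second_diff):
    """Initial of the first character plus both difference codes."""
    return first_initial + first_diff + second_diff

han = difference_code(left_initial="s")   # 汉: left half 三点水
zi = difference_code(first_group=(4, 4))  # 字: dot + dot
print(two_char_brevity("h", han, zi))     # 汉字 -> hs8

yin = difference_code(first_group=(4, 1))  # 音: dot + horizontal
biao = difference_code(left_initial="m")   # 标: left half 木
print(two_char_brevity("y", yin, biao))    # 音标 -> y5m
```

Mixing letters (from left halves) with digits and "." (from stroke groups) is what enlarges the set of difference-code symbols compared with a letters-only scheme.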

Claims (9)

1. A numerical method for the code elements of Chinese character coding. When coding, the initial and final codes of a single character are formed by double pinyin, and a difference code is added; the single character may also be split into the single strokes horizontal, vertical, left-falling, right-falling, and turning, and then either a: method b is not used, or b: the following method is used, namely the numerals 1, 2, 3, 4, 5 represent horizontal, vertical, left-falling, right-falling, and turning respectively, and in writing order every two strokes form one group, a remaining single stroke that cannot be added to another single stroke also forming one group. The invention is characterized in that the single strokes obtained by splitting are gathered to form code elements by the following two methods:
a. In case a, strokes of the same kind in a single character are gathered to form a code element, the total being represented by a numeral according to a certain rule; b. in case b, the representative numerals of the strokes in each group are added together as the code element of that group, except that when both strokes in a group are turning strokes, the corresponding code element is represented by one of the digit "0" or ".".
2. The method according to claim 1, b, characterized in that if a group contains only one single stroke, the code element of the group is represented by the representative numeral of that single stroke, or by that numeral added to itself, or by that numeral multiplied by 2, or by that numeral plus 5, 4, 3, 2, or 1; but any one coding scheme may use only one of these alternatives.
3. In the preceding case, if the structure of a Chinese character contains either or both of 廿 and 口, the method according to claim 1, b, is further characterized in that 廿 and 口 may each be left unsplit and treated as a whole as one group, with the digit 1 representing 廿 as its code element and the point "." representing 口 as its code element, or with the digit 1 representing 口 and "." representing 廿; but any one coding scheme may choose only one of the two.
4. The method according to claim 1, a, further characterized in that when a stroke class of a Chinese character exceeds 6 strokes, the total of that stroke class is represented by "6" as the corresponding code element; a stroke class that the character lacks is represented by the digit 0; and all other totals are represented by the actual count of that stroke class.
5. The method according to claim 1, a, further characterized in that the code elements of the stroke classes formed by the same-set method after the single character is split are arranged in the order horizontal, vertical, left-falling, right-falling, turning to form the outer code of the character.
6. The method according to claim 1, b, further characterized in that the code elements formed by grouping a single Chinese character with the union method, apart from those omitted, are arranged in order to form the numeric outer code of the character.
7. The method according to claim 1, b, further characterized in that the Chinese character code elements produced by the union method are combined with non-numeric symbols, the two together serving as the symbols of the difference code, thereby forming difference codes for homophones.
8. The method according to claims 1, b, and 7, further characterized in that one of its uses is: the difference code formed by the methods of claims 1, b, and 7 can be used to compile any single Chinese character into a three-code outer code, and can also be used to compile a two-character word into a three-code brevity code.
9. The method according to claims 5 and 6, further characterized in that one of its uses is: the two kinds of numeric outer code so formed can each be used separately for Chinese input on computers and mobile phones, and can each also be sorted and used for retrieval of Chinese characters (including words), to compile retrieval keys for computer character (word) dictionaries and printed dictionaries.
CN 01139523 2001-11-20 2001-11-20 Stroke set digit representation method for code element and use Expired - Fee Related CN1231830C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 01139523 CN1231830C (en) 2001-11-20 2001-11-20 Stroke set digit representation method for code element and use

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 01139523 CN1231830C (en) 2001-11-20 2001-11-20 Stroke set digit representation method for code element and use

Publications (2)

Publication Number Publication Date
CN1420422A true CN1420422A (en) 2003-05-28
CN1231830C CN1231830C (en) 2005-12-14

Family

ID=4675255

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 01139523 Expired - Fee Related CN1231830C (en) 2001-11-20 2001-11-20 Stroke set digit representation method for code element and use

Country Status (1)

Country Link
CN (1) CN1231830C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103164466A (en) * 2011-12-16 2013-06-19 李瑞民 Stroke order sub-word retrieval method for uncommon Chinese character
CN103543841A (en) * 2013-11-13 2014-01-29 罗嗣孝 Chinese character unique splitting input method
CN109271610A (en) * 2018-07-27 2019-01-25 昆明理工大学 A kind of vector expression of Chinese character

Also Published As

Publication number Publication date
CN1231830C (en) 2005-12-14

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee