CN1420422A - Stroke set digit representation method for code element and use - Google Patents
Stroke set digit representation method for code element and use Download PDFInfo
- Publication number
- CN1420422A CN1420422A CN 01139523 CN01139523A CN1420422A CN 1420422 A CN1420422 A CN 1420422A CN 01139523 CN01139523 CN 01139523 CN 01139523 A CN01139523 A CN 01139523A CN 1420422 A CN1420422 A CN 1420422A
- Authority
- CN
- China
- Prior art keywords
- stroke
- sign indicating
- code element
- indicating number
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 123
- 230000015572 biosynthetic process Effects 0.000 claims description 9
- 230000033764 rhythmic process Effects 0.000 claims 1
- 230000008901 benefit Effects 0.000 abstract description 3
- 238000007185 Stork enamine alkylation reaction Methods 0.000 abstract 1
- 150000001875 compounds Chemical class 0.000 description 10
- 239000000178 monomer Substances 0.000 description 8
- 229940074869 marquis Drugs 0.000 description 6
- VBUNOIXRZNJNAD-UHFFFAOYSA-N ponazuril Chemical compound CC1=CC(N2C(N(C)C(=O)NC2=O)=O)=CC=C1OC1=CC=C(S(=O)(=O)C(F)(F)F)C=C1 VBUNOIXRZNJNAD-UHFFFAOYSA-N 0.000 description 6
- 239000000203 mixture Substances 0.000 description 4
- 238000003825 pressing Methods 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 238000005266 casting Methods 0.000 description 3
- 238000005194 fractionation Methods 0.000 description 3
- 230000007306 turnover Effects 0.000 description 3
- 206010011469 Crying Diseases 0.000 description 2
- 235000014676 Phragmites communis Nutrition 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 230000009182 swimming Effects 0.000 description 2
- 239000002023 wood Substances 0.000 description 2
- 244000000383 Allium odorum Species 0.000 description 1
- 235000018645 Allium odorum Nutrition 0.000 description 1
- 240000007087 Apium graveolens Species 0.000 description 1
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 description 1
- 235000010591 Appio Nutrition 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- 239000008141 laxative Substances 0.000 description 1
- 230000002475 laxative effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
Landscapes
- Document Processing Apparatus (AREA)
Abstract
A method for representing code cells by the number of stroke in the Chinese-character input is characterized by that a Chinese character is decomposed into single strokes, the number of same strokes is used as code cell, assigning values to different storkes, grouping the adjacent strokes according to their coriting order, and adding the values of the strokes in a group to obtain code cell. Said method can be used for coding Chinese characters or as the diacritical codes. Its advantages are simple method, short code length and easy application.
Description
A kind of encode Chinese characters for computer
Code elementThe purposes of umerical method and this method, title of the present invention is
Code elementUse stroke
SetNumber representation and purposes, hereinafter to be referred as:
The long-pending technique of writing
All herein bottoms add the word table that horizontal line connects together and show that these several words are that a speech is as follows altogether.The invention belongs to
Robot calculatorThe design field of application software more precisely, belongs to the technical field of the method for Chinese character coding with its application.
The coding of Chinese character (containing the speech and phrase be made up of Chinese character) is meant with letter or number and other symbol arrangement and becomes a string, forms different
Sequence, each set sequence is represented corresponding Chinese character respectively.This string sequence, this paper are called the Chinese character correspondence
Outer sign indicating number, constitute
Outer sign indicating numberLetter, numeral, symbol (having only on each position) this paper be called
Outer sign indicating number Code element(this paper is called for short
Sign indicating number Unit).
Present encode Chinese characters for computer is essentially two classes.The first kind is made code element with letter and nonnumeric symbol, does not promptly contain digit in the code element, as:
Five-stroke character input method,
Intelligent ABC is defeated Go into method, (data of consulting is that No. 49640 " the carry-on treasured book of computer "/Shu Yue of Chinese depository library CIP data core word 2000 writes--Zhuhai: Zhuhai publishing house 2000.10).The second class coding method is to do with numeral
Code elementAs:
The region-position code input method(data of consulting is the same), and often adopt on the mobile phone
Stroke input method, promptly single Chinese character be divided into horizontal stroke (contain and carry), perpendicular, cast aside, press down (containing a little), folding (one stroke of turning with band of all band turnovers), represent with 1,2,3,4,5 successively again.In first kind coding method, though each have their own advantage, still, with regard to computer
KeyboardInput,
The key symbolComprise numerical key, and do not use numeral to do in the said method
Sign indicating number Unit, only do with numeral
Repeated codeThe sequence number of word.Like this, do not make full use of the keyboard symbol, the result is used for representing that the key symbol of code element is less relatively when formulating certain encoding scheme, and symbol is counted relative deficiency.When this just causes coding
Repeated codeWord is many.In order to reduce as far as possible
Repeated codeWord has only set more
RuleReached at the length of sign indicating number outside perhaps increasing (the outer sign indicating number of partial words is longer when promptly encoding, for example:
The Five-stroke Method codingMany reaches 4, and Intelligent ABC can reach 5).Input speed so just slows down when coding is applied to the computer input.And in second class coding, as
Zone-bit input methodThough use numeral and do not have repeated code, be specified to thousands of Chinese characters different respectively one by one
Region-position code, difficult note finds it difficult to learn; And
Stroke input methodUse
Code elementHave only five numerals, symbol very little, outer sign indicating number of some Chinese character oversize (nearly six etc.) and repeated code word are more when forming outer sign indicating number, particularly input speed is slower for a long time for the radical stroke of certain word, (as: to the coding input of " dews " just slowly).Moreover present Chinese character index, is used when looking into word by font usually as looking into word etc.
The radical methodBe difficult to fast, need with method replacement preferably.
Purpose of the present invention is as follows:
If above-mentioned coding method is improved, when coding, can adopt numeral as much as possible, (be 1,2,3,4,5,6,7,8,9,0 of arabic numeral, and radix point. totally 11 altogether, as follows) make code element, design is represented different with different individual digits
Code element, and then utilize the numeric code coding.Do with this
Code element,
Code elementNumber just than
Stroke Input methodIn
Code elementNumber is many, when coding, just can form more sequential coding like this, just can reduce
Repeated codeWord.And pure digi-tal
Outer sign indicating numberCan make the Chinese character index table after the arrangement.
And further, code element selects for use numeral to add letter and other symbol.Make code element with this, the number of code element is just more, when encoding like this, is beneficial in the time of just can forming more sequence reduce
Repeated codeWord.These method synthesis are got up, can formulate a comprehensive computer
ChineseInput scheme.Just can form different application in view of the above, visualize a total principle with a kind of total design, the invention scheme of many application, multi-usage, many effects, this is a purpose of the present invention.
The present invention realizes like this.
Generally encode Chinese characters for computer is had by font with by two kinds of word sounds.Can adopt double spelling by phonological encoding the time, (seeing " the carry-on treasured book of computer " second, 3 joints) promptly respectively gets two yards of letter back compositions with the initial consonant and the simple or compound vowel of a Chinese syllable of this word
Outer sign indicating numberBut it is right like this
Phonetically similar wordToo much
Repeated codeCan't distinguish.In order to distinguish
Phonetically similar word, further can add in two yards back that Two bors d's oeuveres forms one different
The table symbolRight
Phonetically similar wordDistinguish.Like this, the outer code table of individual character is shown: initial consonant code+simple or compound vowel of a Chinese syllable sign indicating number+difference sign indicating number, this paper is being added in Two bors d's oeuveres back difference phonetically similar word
The table symbolCry
The difference sign indicating number
And in pressing character shape coding, single Chinese character is pressed horizontal, perpendicular, cast aside, press down, totally five kinds of one stroke of folding splits, then, a, method that need not following b, b, use following method, promptly use 1,2,3,4,5 totally 5 numerals represent respectively horizontal stroke successively, perpendicular, cast aside, press down, folding, and per two strokes of one stroke after splitting are combined into one group by sequential write, this one stroke also is one group if surplus next one stroke can't combine (comprising that this one stroke back is that particular provisions integral body is not rolled over branch " Nian " and special components such as " mouths " or this one stroke in this word finishing touch) with other stroke.On this basis, feature of the present invention is: the one stroke after will splitting gathers by following two kinds of methods and uses numeral.A: under a kind situation (promptly only after certain single-character splitting is five kinds of one stroke, and then) together the stroke set of the same race of this individual character, result's numeral that the set back forms, as the corresponding code element of this stroke, but the implication of " result's numeral that the set back forms " comprises following content:
1, there is not certain stroke to represent (as not rolling over stroke in " wood " word, just with one of " 0 " or ". " expression) after the set with one of " 0 " or ". "
(above 4 or above 5 or above 6 or 7 or 8 or 9 or 10) all press this set numeral if the sum of 2 certain strokes surpasses a set numeral.
3, be no more than set numeral, still represent with the actual number of stroke of the same race.
In a word,, any Chinese character can be formed five kinds of code elements in order to last method, this individual character
Stroke set of the same raceUse altogether
NumeralExpression
Code elementMethod this paper be called for short "
With the collection method".
The present invention can also be according to the described method of a, to it is characterized in that individual character usefulness
With the collection methodGet up by horizontal, vertical, left-falling stroke, right-falling stroke, folding series arrangement after forming 5 kinds of code elements, the sequence of formation is exactly the outer sign indicating number of this individual character.Certainly horizontal, vertical, cast aside, press down, folding, the front and back order can transposition, but no matter which kind of setting is equivalent, a kind of encoding scheme can only have a kind of setting.Can all weave into 5 yards to any Chinese character with such method
Outer sign indicating number
Outer sign indicating number like this can be used for Chinese character index after computer, mobile phone input in Chinese and arrangement.
B, under b kind situation (to be one stroke horizontal, vertical with 1,2,3,4,5 representatives, left-falling stroke, right-falling stroke, folding and per two strokes divide one group of remaining one stroke also to calculate one group), then one stroke in the group that is divided into group is represented digital addition, the corresponding code element of this group of numeral of the result of usefulness addition.Details are as follows for the meaning of the words: being divided into group in the individual character, in the group
One strokeWith they
Represent numeralAdd up, number with the operation result of adding up (is no more than 10, be at most 10) the expression code element, but 10 of less thaies are still used former number, if add up is 10, (as first order stroke folding and folding in " children ", 5+5=10) one of available " 0 " or ". " expression, promptly the corresponding code element of first group of stroke of children is " 0 " or ". ".But can only select for use the two one of.If but have only one in the group, and then can be just with the representative numeral code element of this one stroke, perhaps regulation is as follows in addition: the representative numeral of this one stroke is added this numeral, represent code element with this result; Perhaps this number be multiply by 2, represent code element or add that all 5 (or 4 or 3 or 2 or 1) represent code element with the result of this number with multiply by 2 result, what must propose is that one stroke can only be selected a kind of with said method in a certain encoding scheme
RepresentationUnified running, it is that no matter to select that a kind of method be equivalent.
As: individual character " greatly ", first group of stroke is horizontal and casts aside, representative numeral 1 that can be horizontal and the representative numeral 3 of casting aside, 1+3 is 4,4 is exactly the code element of first group of stroke, second group of stroke has only a right-falling stroke, the representative numeral of pressing down is 4, can be 4 code elements as second group of stroke, also can be shown second group code element table 4+4 is 8, also can be shown 4 * 2=8 to second group code element table, also can be shown 4+5=9 etc. to second group code element table, but can only select one, be good with the method for selecting the former, and promptly the corresponding code element of second group of stroke of " greatly " word is 4 for good.
The present invention can also be: all single Chinese characters all can be divided into following two kinds of situations: Han “ Nian not in first, this word structure " with any or all (as: jade, the Chinese, the unit) of " mouth "; Second, this word contain “ Nian " with " mouth " one of them or whole.(as: a kind of reed mentioned in ancient books, leaf) tool this, feature of the present invention can also be: according to the described method of b when splitting individual character, run into when containing " Nian " or " mouth " in this word structure, " Nian " and " mouth " do not roll over branch, looks as a whole, also as one group of stroke set, make corresponding code element with " 1 " representative " Nian ", as code element, or show Nian with ". " generation with ". " representative " mouth "; use " 1 " representative " mouth ", the former or the latter's is a kind of but a certain encoding scheme can only be selected
Representation, can not obscure.But also be noted that “ Nian " implication be shape Shi Nian in the individual character (as
Grass-character-head) member, can only be adjacent and can not intersect with other stroke, not very contain " Lv " as Gan Zhong Nian; The implication of mouth is little " mouth " structure in the Chinese character, must not intersect with other stroke and middle sky." prisoner " with " in " wait and just not very to contain " mouth ".And " Lv " or " mouth " is preceding when certain one stroke is arranged, and this one stroke is the whole member “ Nian that does not tear open because of the back has " or " mouth ", so this one stroke is also calculated one group.As: individual character " or " at folding timesharing the first stroke horizontal stroke, because of the back has " mouth ", horizontal stroke is calculated one group alone.
In a word, be suitable for any Chinese character individual character with such method.
This paper presses sequential write grouping to single Chinese character stroke and will organize interior stroke again this
SetGet up to form
Code elementMethod, be called for short
" union method "
The present invention can also be: use
The union methodForm
Code element, (special-purpose in other words the method directly produces product: a kind of encoding scheme directly to carry out encode Chinese characters for computer.) method of this encoding scheme is as follows: it is the individual character that does not split two halves that the coding of first, single Chinese character, single Chinese character are divided into two classes (two class sums the are whole Chinese characters) first kind, and this paper cries
The monomer word,, second class is the individual character of detachable two halves, this paper cries
The binary word The binary wordComprise all
Combinde rqdical characterAnd band
RadicalChinese character, if but
RadicalHave only stroke not very
The binary word, the Chinese character beyond the binary word is all named
The monomer wordRadical has only the individual character of stroke also to cry
The monomer word(as: nine, give birth to, with) obviously, both boundary clear of classifying like this.It is right to stipulate
Monomer(coding of Chinese character that promptly can not dimidiation is undertaken by laxative remedy word.
Press earlier
The union methodForm each code element of this word.
The code element that forms is lined up by sequential write again and form a sequence, this sequence is exactly this word
Outer sign indicating numberAs: to " or " word code.Earlier will " or " to be split as group promptly horizontal for the word stroke; Mouthful, horizontal and folding are cast aside and point, its corresponding code element 1., 6,7.Link up 1.67 in order again, it be exactly " or " the outer sign indicating number of word.
In like manner " my god " horizontal and horizontal after the stroke group, cast aside and press down, then " day " outer yard be 27.
Right
The binary word(as: " beat, swim, send, room) coding, as follows, earlier will
The binary wordDimidiation, left right model divide left and right sides two halves, and last mo(u)ld bottom half divides two halves up and down.The individual character (but radical must be more than two strokes or two strokes) of band radical, radical is calculated half, remaining calculation second half.But crying of half that the first first stroke of a Chinese character is write
First half, after write half cry
Later halfCoding method is to get
First halfFirst group of stroke
Code elementBe first yard of this word, then get successively
Later halfEach organizes stroke
Code element, till having got.Then with preceding half first yard link up in order, be exactly this word
Place's sign indicating numberAs: " strings of cash " word dimidiation i.e. " gold " and " by force ", first yard left-falling stroke of getting gold with horizontal be 4, get " by force " stroke groupings more successively, for folding and horizontal, folding, mouthful, perpendicular with folding, horizontal stroke and erect, horizontal stroke and point, so yard be 465.735 outside " strings of cash " word.In like manner: the outer sign indicating number of " robbing " is 670, can also add following provisions certainly, if promptly the outer sign indicating number of a word individual character is then only got preceding 4 yards more than four yards, and 4 yards later leaving out, 4 yards of less thaies motionless.
If (as: being hard to tell certain word is to the inaccurate situation of single-character splitting in case run into the beginner
The monomer word, still
The binary wordThe time, or radical tear open inaccurate etc.) can when coding, make the processing of two yards of words, two yards is a word, resembles in the phonetic
PolyphoneThe same.Certainly, unified in order to encode, also can be not so good as this processing.Like this, the individual character of all Chinese characters can be weaved into and be no more than 4 yards numeral
Outer sign indicating numberThis usefulness
Union method code elementThe numeral that forms
Outer sign indicating numberWeave, this paper is called for short
The union digital encode methodThis usefulness
The union digital encode methodThe outer sign indicating number that forms can be applied to computer and mobile phone input in Chinese very easily.And with this method in the dictionary all after the Chinese character Unified coding, arrangement in sequence makes and can be used for making Chinese character word and search table again, also can give computer design dictionary, dictionary function.
The present invention can also be: use
The union Chinese character coding methodRight
Chinese Two wordsWith
Multi-character wordsBrevity code write in phrase.
The brevity code of first, two-character word.Get preceding two yards and second word preceding two yards of first word, by the front and back series arrangement totally 4 yards brevity codes of forming two-character words that get up.As: " greatness ", first and second sign indicating number of " big " is 52; First and second sign indicating number of " greatly " is that 44 to link up 5244 be exactly the brevity code of " greatness ".
In like manner, can form 4 yards brevity codes to any two-character word.
The brevity code of second, three words.Get first and second sign indicating number of first word in three words; Get first yard of second word again; Get again triliteral first yard totally 4 yards link up and be.
The third, the brevity code of 4 words and above speech of 4 words or phrase is got the first sign indicating number of first, second and third word successively, gets the first sign indicating number of the last character again, links up totally 4 yards in proper order by front and back to be.
The present invention can also be, (i.e. the initial consonant code of this individual character+simple or compound vowel of a Chinese syllable sign indicating number, each with a letter representation) can add one after two yards after certain encode Chinese characters for computer is with double spelling two unified representation to be weaved in single individual character earlier
The difference sign indicating numberExpression,
The difference sign indicating number The table symbolAlternative (promptly in order to represent certain symbol of some difference sign indicating numbers), promptly
The table symbolCan not only comprise letter and nonnumeric symbol, and feature of the present invention is that Biao Fu also comprises usefulness
The union methodThe numeral that forms
Code element, that is to say
The difference sign indicating numberIn not only contain letter (as a, b, c) and nonnumeric symbol, as: (; :,?) do
The table symbol, and containing individual digit tabulation symbol (as:., 0,1,2,3), this difference yard can be used for single Chinese character and two word coding methods, and this also is the main purposes of the described method of b kind.The concrete encoding scheme main contents of using are as follows:
First, single character code.All single Chinese characters are divided into two classes.The first kind is left and right sides type-word (as: marquis, bright, sing).The non-left right model of second class, i.e. all Chinese characters except that last person, (as: room, district, dew, outstanding person) both regulations have clear and definite boundary.
Second, then, the same dimidiation of all left and right sides shape (left half-sum is right half, and a left side half calculated in the most left radical in the shape of the left and right sides, and all the other calculate right half, but radical must be more than two strokes or two strokes).The title initial of then getting a left side half again is (with double spelling letter representation, as for the title on a left side half, promptly
Split-type wordPronunciation or the other title of left avertence press the dictionary standard.If its left side half do not have title is arranged with a symbol replace (as stipulate V or? one of the two).Follow again, with this letter or symbol as the difference sign indicating number of whole word (left and right sides two halves altogether) as: the left side of " swimming " partly is
Three WaterSo flat s represents with lead-in
The difference sign indicating number, in like manner a left side that " pushes away " partly is
The handle limit,
Handle The limitThe Two bors d's oeuveres initial be t, so the difference sign indicating number of " pushing away " is t;
The third, its difference sign indicating number is to represent like this in the type-word of the non-left and right sides of all except the type-word of the left and right sides.Use after promptly getting first group of stroke of this word by sequential write
The union methodThe code element that produces do the difference sign indicating number (note: when meeting " twenty " with " mouth ", by and the regulation " Nian " of method represent that with 1 " mouth " usefulness ". " represents, and a certain stroke back is " Nian " or " mouth ", makes code element with the representative number of this stroke.) as: the horizontal and left-falling stroke of first group of stroke of " too ", code element is 1+3=4,4 is exactly too
The difference sign indicating number, in like manner in " a kind of reed mentioned in ancient books " word
The difference sign indicating numberBe 1, " crying "
The difference sign indicating numberBe k, " can " difference of word is 6, all like this individual characters can form by this
The difference sign indicating number, and each Chinese character all can form the outer sign indicating number of trigram of initial consonant code+simple or compound vowel of a Chinese syllable sign indicating number+difference sign indicating number.This method of Chinese character coding this paper is called for short
Two bors d's oeuveres trigram method
Fourth, the difference sign indicating number that as above in individual character, forms, then can write brevity code after forming to two words of Chinese, step is as follows: the initial consonant of getting two first words of words, then get first difference sign indicating number, then get the difference district of second word again, trigram gets up to be total to the brevity code that trigram is formed this two words by the front and back series arrangement altogether.As: the brevity code in " years " is as follows.Get " year " initial consonant s, get again " year " difference sign indicating number 7 (not being simple or compound vowel of a Chinese syllable), month difference sign indicating number 8, s78 is the brevity code in " years ".
In like manner can weave into the trigram brevity code to any two words methods of using, this
ChineseIn
Two words SpeechThe method of the brevity code of writing, this paper also cries
Two bors d's oeuveres trigram method
The present invention can also be according to above-mentioned method handle
Chinese three wordsWeave into brevity code with the speech more than three words.Step is: the initial consonant (using double-spelling method) of getting first word in this speech is then got first word
The difference sign indicating number, then get second word
The difference sign indicating numberTail word not
The difference sign indicating numberForm coding (what notice that the back trigram gets is that the difference sign indicating number is not the simple or compound vowel of a Chinese syllable sign indicating number for totally 4 yards.) be d6z2 as the brevity code of " host king " speech.From above narration, can see that the present invention under a kind of total principle and guiding theory, has formed numeral
Code element, and, different application is arranged respectively, produced
Compile with long-pending The sign indicating number method,
The union compiling methodWith
The union code elementDo
The difference sign indicating numberCompiling method.But no matter which kind of is encoded, and all must be applied to the computer input in Chinese.And, a feasible computer
ChineseInput scheme, several coding methods interosculates and cooperates often.Therefore, several input methods that the present invention uses back formation can design in an input scheme system, just can use when running into word already learned
Two bors d's oeuveres trigram methodInput (3 yards inputs) is used when running into new word
The outer sign indicating number of union numeral methodInput (4 yards inputs) runs into deserted word when doing the inaccurate order of strokes observed in calligraphy with and collection compiling method (5 yards inputs) three share out the work and help one another, work in coordination, just fast do not have difficulty during input again, really be: see that word can import.And unlike establishing " z key " or fuzzy input or other help function such as " universal keies " again, also needn't page turning select.
Be not difficult to find out that the present invention has many advantages.At first the method for its employing also will split Chinese character sometimes, but the rule of this fractionation is all very clear and definite, does not use word languages such as " general ... ", and logic is clearly demarcated.Moreover, get adjacent strokes during fractionation, (unlike the fractionation radical that has time take apart crossing stroke) just splits easily.Promptly be that " twenty " branched away with the parts folding of " mouth ", also not intersecting in other stroke, promptly is to divide two halves individual character again, also is to turn up from the radical more than two strokes, all be easy to, the stroke that intersects taken apart with regard to difficult and easy diversification unlike the coding rule that has.With
The union methodBe coded in the operating process, only use simple addition, brain reflects the result soon, and is swift to operate.Particularly of the present invention
Two bors d's oeuveres trigram method, all single character codes all be no more than trigram (certainly than nearly 4 yards input is fast) and,
Two bors d's oeuveres The trigram method The difference sign indicating numberSet more ingenious.When at first encoding, it is a class that word is divided into left right model, and non-left right model is another kind of, as for
Left right modelOnly see its left side
Split-type wordOr left avertence other (and not being a stroke), very clear and definite, directly perceived, simple, the easy note of such regulation is learnt easily and is operated, and the second class word is promptly
Non-left right modelIt has comprised word and all individual characters except that left right model has not been divided into two halves during code fetch, only gets its first group of stroke, and this is also very clear and definite, simple, be easy to learn and use.So use
Two bors d's oeuveres trigram method, brain can reflect very soon initial consonant, simple or compound vowel of a Chinese syllable,
The difference sign indicating number, think
The difference sign indicating numberThe time only think the prefix of this word, needn't scrutinize other stroke of word, so input is just fast.And both combine the difference sign indicating number
The table symbolJust many, repeated code word just few (word already learned does not almost have).If but without this method, and only
RadicalThe conduct of representative letter
The difference sign indicating number, so just be difficult to operation.Because many difficult searchings are just arranged on the dictionary to be done inaccurate
Radical, moreover
Radical Radicals by which characters are arranged in traditional Chinese dictionariesHave only the word of stroke also to be difficult to identification or the like, it is at a loss as to what to do to be directed at the learner, is deeply aware of one's own helplessness when faced with a great task.
If moreover when design software, several application of the present invention are combined are used for an input system, the several method cooperation of dividing the work mutually, complement each other can form an input in Chinese scheme preferably undoubtedly.
Embodiment 1:
With
The collection technique of writingIn
With the collection methodForm
Code elementSingle Chinese character is encoded
(this paper is called for short
The Foolish Old Man's method)
1, at first single Chinese character is resolved into 5 kinds of one stroke, promptly horizontal (contain horizontal with carry), perpendicular (not with structure), cast aside, press down (contain and press down and point), folding (all bands are turned or strokes of band turnover in the one stroke), and by after left-falling stroke right-falling stroke folding is classified as five classes anyhow, five class strokes are gathered respectively, see total separately how much total.
If 2 sums that gather are represented the code element of this stroke less than 6 with its substantial amt; If the sum that gathers equals 6 or more than 6, all use 6 code elements of representing this stroke; If this individual character really lacks certain stroke, then represent with 0.
3, press down the folding order each by casting aside anyhow
Code elementPlatoon gets up, and is exactly this word
Outer sign indicating number
As " marquis " word, be split as one stroke and be: cast aside, perpendicular, folding, horizontal, cast aside, horizontal, horizontal, cast aside, press down, gather and have 3 horizontal strokes, 3 is exactly horizontal correspondence
Sign indicating numberUnit; Have 1 and erect, perpendicular corresponding code element is 1; Have 3 left-falling strokes, the correspondence of left-falling stroke
Code elementBe 3; Have 1 right-falling stroke, the corresponding code element of right-falling stroke is 1; Have 1 folding, folding
Code elementBe 1.Be 31311 by cast aside pressing down anyhow that the folding platoon gets up then, it " marquis "
Outer sign indicating number
For another example: " Chinese " word, split the back single and divide point, point into, carry, roll over, press down, gather, have only a horizontal stroke (promptly carrying), then Heng corresponding code element is 1.The not perpendicular pen of " Chinese " word, perpendicular correspondence
Code elementBe 0: do not cast aside pen, the correspondence of left-falling stroke
Code elementAlso be 0; 2 points and 1 right-falling stroke are arranged, the correspondence of right-falling stroke
Sign indicating number UnitBe 3; 1 folding pen is arranged, the correspondence of folding
Code elementBe 1, platoon gets up, the correspondence of " Chinese "
Code elementBe 10031.
For another example: " Qu " word, its horizontal pen be 10 and surpass 6, and 6 to be representative, then the code element of horizontal pen is 6, and 4 perpendicular pens are arranged, perpendicular code element is 4,1 left-falling strokes, and the code element of left-falling stroke is 1, one point, and the code element of right-falling stroke is 1, two foldings, the code element of folding is 2, platoon gets up, then " Qu " word
Outer sign indicating numberBe 64112.
Also can be with such method to any encode Chinese characters for computer.
Example 2: use
With the collection methodTo encode Chinese characters for computer
Lack certain stroke if stipulate certain individual character, then the corresponding code element of certain stroke is represented with " "; If certain stroke sum of individual character kind surpasses 9, represent corresponding code element with " 9 ".Other regulation is with example 1.(different with example 1 certain stroke code elements that need not " 0 " expression lack also need not surpass 6 corresponding code element by 6 expression strokes sums.If certain stroke is no more than 9 in the individual character, still represent code element with real figure.) for example, the outer sign indicating number of " time " word is still 31311, the outer sign indicating number of Chinese character but is 111, the outer sign indicating number of " Qu " word but is 94112.
Also can be with such method to any encode Chinese characters for computer.
Example 3 usefulness
The collection technique of writingIt
The union method(this paper is called for short to encode Chinese characters for computer
The union compiling method)
Determine earlier
The union methodContent, promptly total is regular as follows:
To single encode Chinese characters for computer the time, be single-character splitting one stroke earlier, totally 5 kinds.Promptly horizontal (contain and carry), perpendicular, cast aside, press down (containing a little), folding (all band turnovers and the stroke of turning), represent with 1,2,3,4,5 successively respectively again; Press sequential write again, per two are one group and gather, as run into " Lv " and " mouth " do not roll over branch, wholely calculate one group, " Lv " and " mouth " preceding grouping back remainder
One strokeAlso calculate one group, also calculate one group for one of other two combination back remainders, the result that the representative numeral of two strokes in the group is added up mutually represents this group again
Code element(but the code element that the regulation folding combines with the folding stroke is a numeral 0)
" Lv " integral body is one group,
Code elementRepresent with 1;
" mouth " integral body is one group,
Code elementWith representing;
Single divides the representative numeral code element of one group usefulness one stroke into.
Following by this rule encoding:
One, the coding of single Chinese character
Single Chinese character is divided into two classes (two class sums are whole Chinese characters).
The first kind is the individual character that is not split as two halves, cries
The monomer wordBe that this individual character does not contain more than two strokes
Radical, more be not
Combinde rqdical character, but the word that only contains the radical of stroke is listed as at this.As: sheet, first, one-tenth, fourth, just, or not fragrant-flowered garlic, also, book, interior, adopted, ball, pellet, fly.
Second class is the word that is split as two halves, cries
The binary word, as: marquis, pond, give, roc, room, disease, ticket.
The binary word comprises all individual characters except that first kind word, contains all
Combinde rqdical characterAnd have
RadicalWord (but
RadicalMust be more than two strokes or two strokes).
Three, right
The monomer wordCoding is pressed and is carried out
1, uses earlier
The union methodForm the code element of this word, promptly use above rule.
2, above code element being got up by the front and back series arrangement, form a sequence, is exactly this word
Outward Sign indicating number
As " greatly " compiled
Outer sign indicating number, can be first group of horizontal left-falling stroke with the stroke groupings of " greatly ", 1+3=4 then, 4 is first group
Code element:
Second group of stroke has only a right-falling stroke, and the representative numeral of right-falling stroke is 4, then 4 second groups
Code elementSo, big
Outer sign indicating numberBe 44.
For another example to " or " word code, will " or " folding is divided into group of strokes.
First group have only a horizontal stroke (there is mouth the back, can't with other one stroke addition, so for single is one group, code element is 1;
Second group is mouthful (one group of a whole calculation), and code element is;
The 3rd group is to carry folding, 1+5=6, and code element is 6;
The 4th group is apostrophe, 3+4=7, and code element is 7.
It is 167 that platoon gets up, " or "
Outer sign indicating number, in like manner day
Outer sign indicating numberBe 27.
3, right
The binary wordCoding
1. earlier will
The binary wordDimidiation, left right model are divided into left and right sides two halves, and last mo(u)ld bottom half is divided into the mo(u)ld bottom half two halves, are with more than two strokes or two strokes
Radical, radical is half, remaining is second half.But headed by writing earlier half, after write for later half.
2. press
The union methodGet the code element of first first group of stroke of half, then use
The union methodGet the later half code element of respectively organizing stroke,, form the outer sign indicating number of numeral of this word again by the order platoon before and after writing.
3. the Chinese character that surpasses 4 yards only can be got preceding 4 yards, promptly in coding, just stop as long as compiling enough 4 yards.That is to say no matter a word has how much organize stroke, when coding, get at most and be no more than 4 groups (according to stroke order).
As coding to " employing ", divide earlier two halves " family " and " Cui ", get that first group of stroke point at " family " roll over
Code elementBe that 4+5 is 9, it is perpendicular for casting aside again " Cui " even to be got trigram---and 5, put horizontal stroke---5, horizontal---2, so far, formation is employed
Outer sign indicating number9552.
With above method, can weave into any single Chinese character and be no more than 4 yards
Outer sign indicating number
Such numeral
Outer sign indicating numberAfter the formation, can queue by order from small to large and, numeral all corresponding individual character, and the page number of individual character on dictionary all arranged, can be made into index of Chinese Characters after this arrangement of tool, the design computerized dictionary, also can make index of Chinese Characters, and indicate the page number of this individual character on dictionary, thereby can replace
The radicals by which characters are arranged in traditional Chinese dictionaries method,
The stroke method, be used to look up the dictionary, dictionary.
As: the index of Chinese Characters after the arrangement is taken passages as follows:
Annotate: this example the right
Page numberBe meant the page number of " Xinhua dictionary " (revised edition in 1998).As be used to retrieve other words allusion quotation, then page number is determined by the page number at the individual character place in this words allusion quotation.
Coding | Individual character | Page number |
??…… | ??…… | ????…… |
??1721 | ????351 | |
??1723 | Grass | ????44 |
??1726 | Transplant | ????135 |
??1727 | Chang | ????51 |
??1728 | Bright | ????337 |
??1729 | Luxuriant growth | ????203 |
??1731 | Seedling | ????342 |
??…… | ??…… | ????…… |
Example 4. usefulness
The union Chinese character coding methodTo two words and multi-character words and phrase historical records sign indicating number.
One, the brevity code of two-character word.
Form preceding two yards in preceding two yards and second individual character of first individual character in the two-character word by the method in the example 3, by the front and back order platoon brevity codes of totally 4 yards composition two-character words that get up.As: preceding 2 yards of " wisdom " " English " is 17, and bright preceding 2 yards is 77, and linking up 1777 is exactly wise brevity code.In like manner, can weave into 4 yards brevity codes (pure digi-tal) to any two-character word.
Two, the brevity code of three words.
The method that connects example 3 forms code element, preceding 2 yards of getting first word, first yard of getting second word, get the triliteral first yard totally 4 yards form brevity codes.As the 1st yard of: preceding 2 yards 17, the second words " palpus " of " unwarranted " first word " not " is that the 1st yard of " having " of 6, the three words is 4, and then 1764 is exactly the brevity code of " unwarranted ".
Three, the brevity code of the speech more than four words and four words.
The method that connects example 3 forms code element, get first yard of preceding 3 words and the last character first yard in order platoon get up totally 4 yards to form brevity codes.As: " at a tremendous pace " gets that separately first yard platoon get up is 1747, and it is 3780 to be that first sign indicating number 0 platoon that " World Trade Organization " first, second and third prefix coee 378 and the last character " are knitted " gets up.
Example 5, usefulness difference sign indicating number are to encode Chinese characters for computer (this paper abbreviation
Two bors d's oeuveres trigram method).
This scheme can be compiled all Chinese characters (individual character) without exception and be trigram.But if brevity code word and keyboard word were not arranged surely, these words could be without this method.The trigram order is as follows: initial consonant code+simple or compound vowel of a Chinese syllable sign indicating number+difference sign indicating number; Wherein initial consonant code and simple or compound vowel of a Chinese syllable sign indicating number are used
Double spellingHandle each with a letter representation, can adopt the 71st definition in " the carry-on treasured book of computer " book.
As for distinguishing being set as follows of sign indicating number:
1, at first Chinese character is divided into left right model or non-left right model two classes, two class sums are all Chinese characters.Left right model is that the font of Chinese character is left and right sides structure, left, center, right structure and contains
The end of walking, built by the wordIndividual character, individual character is if having left avertence other (must be the radical more than two strokes or two strokes) all than row, all Chinese characters in addition all are that non-left right model (comprises the monomer word and except band
Word is built at the end of walkingAll of other radical are surrounded type-word and last mo(u)ld bottom half, chiasma types etc.) the two boundary is clear and definite.
2, every left and right sides type-word, (left side half can be a radical, also can be to get its left side half without exception
The binary wordThe branch word, each that see a left side half then claims to use double spelling phonetic, the initial of phonetic is exactly
The districtOther sign indicating number.) as " marquis " word left side partly be
Single upright people, initial is d, d is exactly marquis's a difference sign indicating number.For another example: the left side of " sending " is half of
The end of walking, z is exactly a code element, and " roc " left side partly is a month word limit, and Y is exactly a code element.In like manner the left side of " swimming " partly is
3 water,
The difference sign indicating numberBe the initial s of 3 water Two bors d's oeuveres, still the title on a left side half must be determined, returns model by Chinese.But the usefulness that a left side half can't be named? expression.Any like this left and right sides type-word has all had the correspondence of oneself
The difference sign indicating number
3, in the type-word of the non-left and right sides, no matter this word is any structure, get this wordbook without exception and write first group of stroke of order, the code element of using the union method to form is again made the difference sign indicating number (as for how pressing union method code fetch by the rule in the example 3, no longer citation), first group of stroke as " greatly " word is horizontal and casts aside, big difference sign indicating number is 3, in like manner the difference sign indicating number of " celery " be 1 (because of first group be twenty) in like manner the difference sign indicating number of " brother " be that all so non-left right model individual characters of ". " (because of " mouth " is first group) have all had oneself
The difference sign indicating numberBecause left and right sides type-word and non-left and right sides type-word are exactly all Chinese characters altogether.Therefore, any Chinese character all has own correspondence
The difference sign indicating number
4, on this basis, all Chinese characters all can form: the outer sign indicating number of trigram of initial consonant code+simple or compound vowel of a Chinese syllable sign indicating number+difference sign indicating number.
5, after single Chinese character has all had the outer sign indicating number of the trigram that contains difference sign indicating number, we can also write brevity code to the speech of two individual characters compositions.
Compile
Two-character word Outer sign indicating numberThe method of brevity code as follows:
1, gets
Two-character wordIn the initial consonant (but initial consonant must be a letter that produces with double-spelling method) of first individual character as first yard of brevity code.
2, the difference sign indicating number of then getting lead-in (the same individual character) is made second yard of brevity code.
3, then getting afterwards again, the difference sign indicating number of an individual character is a trigram.Like this, the lead-in initial consonant code+the trigram composition should altogether for the other sign indicating number in lead-in difference block, sign indicating number+back
PhraseBrevity code.As: " Chinese character " this two-character word is write the initial consonant h that brevity code is got lead-in " Chinese " earlier, and the difference sign indicating number s that gets lead-in " Chinese " again is (promptly
3 waterInitial s) get the outer yard brevity code of difference sign indicating number (first and second stroke point adds up to 8 with the representative numeral of point) " Chinese character " of back word " word " again with regard to hs8.The brevity code of two-character word " phonetic symbol " is in like manner got the initial consonant y of lead-in " sound ", the difference sign indicating number 5 of lead-in " sound ", and the difference sign indicating number m of back word " mark " (is a target
RadicalThe initial of wood), y5m is the brevity code of " phonetic symbol ".In like manner the brevity code of " influence " is yjk.Can write brevity code to any two words with this quadrat method.
This paper is this initial consonant and two difference representation used
Two wordsThe method of brevity code is also named
Two bors d's oeuveresThe trigram method.
Claims (9)
1, a kind of encode Chinese characters for computer
Code elementUmerical method when coding, is used individual character
Double spellingFormation sound, rhythm add for two yards again
The difference sign indicating numberAlso can individual character folding be divided into horizontal, vertical, cast aside, press down, the folding one stroke, a then: without the b method, b: with following method promptly with 1,2,3,4,5 represent successively respectively horizontal, vertical, cast aside, press down, folding also is one group by per two strokes of the sequential write one stroke that is one group of remainder can't be with other one stroke addition the time again; Feature of the present invention is that the one stroke after splitting gathers formation by following two kinds of methods
Code element:
A, under a kind situation, stroke set mutually of the same race in the individual character is closed formation
Code element, the sum that gathers with numeral by certain rule; B, under b kind situation, the representative numeral of dividing stroke in groups the group is added up mutually as the code element of this group, but two strokes all are foldings in the group specified, then with the corresponding code element of one of digital " 0 " or ". " expression.
2, according to 1, b: described method, it is characterized in that if having only an one stroke in organizing, then the corresponding code element of this group represents that numeral is that the representative numeral of this single or the representative numeral of single are added this number, or the representative numeral of single be multiply by 2 or the numeral of single added 5 or 4 or 3 or 2 or 1, but that a kind of encoding scheme can only be with each " perhaps " is a kind of.
3, under last kind of situation, if contain “ Nian in the structure of certain Chinese character " " mouth " the two one of or all; according to claim 1, b: described method, its feature also are can be Ba “ Nian " and " mouth " do not split as one group as an integral body, represent “ Nian with " digital 1 " " work
Code element, point ". " representative " mouth " is done decimally
Code element, perhaps use " numeral 1 " expression mouthful: show to show " Nian " with ". ", but a kind of encoding scheme, can only select or one of front or rear.
4, also be according to claim 1, its feature of the described method of a, when certain stroke of certain Chinese character surpasses 6 strokes, all use sum, the work correspondence of this stroke of " 6 " expression
Code element, certain stroke that certain word lacks lacks stroke and represents that with " numeral 0 " remaining is all still represented with the substantial amt of this kind stroke.
5, also being to split in the individual character back according to claim 1, its feature of the described method of a uses
With the collection methodThe code element of the various strokes that form gets up to form individual character by horizontal, vertical, left-falling stroke, right-falling stroke, folding, series arrangement
Outer sign indicating number
6, according to claim 1, b: described method, its feature also are can be special-purpose
The union methodThe code element that the grouping of single Chinese character is formed in conjunction with the back except that leave out do not get, lining up in order forms the numeral of this word
Outer sign indicating number
7, according to claim 1, b: its feature of described method also is to use the Chinese character of union method generation
Code element, in conjunction with non-numeric
The table symbol, shared conduct
The difference sign indicating number The table symbolThereby, the formation phonetically similar word
The difference sign indicating number
8, be also that according to claim 1, b and 7 its features of described method its purposes one has been: can be with 1, b and 7 methods form
The difference sign indicating numberTrigram weaved in any single Chinese character
Outer sign indicating numberAlso can use
The difference sign indicating numberTwo-character word is weaved into the trigram brevity code.
9, according to claim 5,6, its feature of described method is that also one of its purposes is: two kinds of numerals of formation
Outer sign indicating numberAfter all can being used for computer, mobile phone input in Chinese separately separately and also all can putting in order respectively, be respectively applied for Chinese character (containing speech) and retrieve, weave into compuword (speech) allusion quotation key and dictionary dictionary key.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01139523 CN1231830C (en) | 2001-11-20 | 2001-11-20 | Stroke set digit representation method for code element and use |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01139523 CN1231830C (en) | 2001-11-20 | 2001-11-20 | Stroke set digit representation method for code element and use |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1420422A true CN1420422A (en) | 2003-05-28 |
CN1231830C CN1231830C (en) | 2005-12-14 |
Family
ID=4675255
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 01139523 Expired - Fee Related CN1231830C (en) | 2001-11-20 | 2001-11-20 | Stroke set digit representation method for code element and use |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1231830C (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103164466A (en) * | 2011-12-16 | 2013-06-19 | 李瑞民 | Stroke order sub-word retrieval method for uncommon Chinese character |
CN103543841A (en) * | 2013-11-13 | 2014-01-29 | 罗嗣孝 | Chinese character unique splitting input method |
CN109271610A (en) * | 2018-07-27 | 2019-01-25 | 昆明理工大学 | A kind of vector expression of Chinese character |
-
2001
- 2001-11-20 CN CN 01139523 patent/CN1231830C/en not_active Expired - Fee Related
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103164466A (en) * | 2011-12-16 | 2013-06-19 | 李瑞民 | Stroke order sub-word retrieval method for uncommon Chinese character |
CN103543841A (en) * | 2013-11-13 | 2014-01-29 | 罗嗣孝 | Chinese character unique splitting input method |
CN109271610A (en) * | 2018-07-27 | 2019-01-25 | 昆明理工大学 | A kind of vector expression of Chinese character |
Also Published As
Publication number | Publication date |
---|---|
CN1231830C (en) | 2005-12-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1043210A (en) | Radical code input method and its equipment | |
CN1141633C (en) | 24-radical sorting encode method for Chinese characters and its keyboard | |
CN1420422A (en) | Stroke set digit representation method for code element and use | |
CN102511021A (en) | Number-order-code-element keyboard and information input method thereof | |
CN1435749A (en) | Chinese character stroke and phonetic code input method and keyboard thereof | |
CN101038517A (en) | Shape-pronunciation encoding input method of Chinese characters | |
CN1184554C (en) | Chinese character Hanyi code input method and keyboard for computer | |
CN1034245C (en) | Burmese characters four-code intelligent coding method and keyboard thereof | |
CN1062361C (en) | Method for inputting chinese characters by key shape code derived from sound and shape | |
CN1167994C (en) | Input method for Chinese character | |
CN1196057C (en) | One-code two-form quick Chinese digital coding input method | |
CN1028457C (en) | Chinese character computer input system of stroke digital code and sound code | |
CN1052200A (en) | Pronunciation-form-meaning words encode series with compatibility and keyboard | |
CN1088211C (en) | Chinese character positive and negative singular radicals periodic table and radicals digital code input method | |
CN1159642C (en) | Simplified Chinese-character 'Sound-shape code' input method | |
CN1056007C (en) | Codes for inputting Chinese characters | |
CN103412656A (en) | Chinese character syllable rime stroke shape composite phonetic and morphological code | |
CN1052314C (en) | Computer keyboard and input method of Chinese character two-dimensional numerals | |
CN1109284C (en) | Multi-information code Chinese character input system for computer | |
CN1146572A (en) | Chinese character orthography coding method | |
CN1558310A (en) | Consonant and vowel font code Chinese characters input method | |
CN1060277C (en) | Chinese characters coding and input method for computer using sentences as input unit | |
CN1160883A (en) | Phonetic double code of Chinese characters for computer input | |
CN1256446A (en) | Chinese character coding and inputting method using the first radical, residual radical and stroke number and the key board | |
CN86105505A (en) | Chinese character input method and applied keyboard thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |