CN101702101A - Input method of 6-code number oracle - Google Patents

Info

Publication number
CN101702101A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN200910218978A
Other languages
Chinese (zh)
Other versions
CN101702101B (en)
Inventor
刘志祥
尹奎英
刘晓戎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN2009102189784A priority Critical patent/CN101702101B/en
Publication of CN101702101A publication Critical patent/CN101702101A/en
Application granted granted Critical
Publication of CN101702101B publication Critical patent/CN101702101B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Machine Translation (AREA)
  • Adornments (AREA)

Abstract

The invention discloses a 6-code numeric input method for oracle bone script, comprising the steps of: coding each structural part of an oracle bone character into six code elements, namely an eye code, an eyelash code, a tiller code, a branch code, a flutter code and a structure code, which form a code element sequence and each correspond to a digit from 0 to 9 on a computer keyboard, wherein the eye code is the number of strokes formed by closed curves in the character; the eyelash code is the number of non-forking strokes connected to an eye; the tiller code is the number of intersection points of intersecting strokes that do not form an eye; the branch code is the minimum number of strokes with which the tiller strokes can be drawn; the flutter code is the number of floating strokes that form no eye, tiller or branch; and the structure code is the number of mutually non-adjoining structure blocks that make up the character. By coding and inputting oracle bone characters on a computer according to the six code elements, office automation for ancient scripts can be realized, putting an end to the hand transcription of ancient characters in the publishing industry.

Description

Input method of 6-code number oracle
Technical field
The present invention relates to an oracle bone script input method for electronic computers.
Background art
Oracle bone script is one of the most ancient writing systems in the world, and the only one whose tradition continues to the present day. Its study is of great significance to many disciplines such as philology, history, archaeology and linguistics, as well as to calligraphy research and creation. It is therefore necessary to create a computer-recognizable information processing system for oracle bone script, but research in this area is very limited and lags far behind the recognition and processing technology for modern text.
Information processing technology for modern Chinese characters is very mature; input methods alone number in the hundreds. Oracle bone script and other ancient scripts, however, still lack a computer input method that is simple and easy to remember. Consequently, office automation is hard to achieve when publishing books that quote ancient Chinese characters in large quantities. For example, books such as the "Oracle Bone Script Dictionary" and "Annotations on Western Zhou Oracle Bone Inscriptions", and many journals that study ancient scripts, are typeset only after being wholly or partly handwritten. Handwriting is hard to make neat and attractive, and it also reduces work efficiency.
Some existing input methods for ancient scripts such as oracle bone script and bronze inscriptions do not exploit the characteristics of the ancient scripts themselves, but borrow the coding schemes of modern Chinese characters, such as pinyin or radical composition. These methods apply only to "standardized" oracle bone and bronze characters and cannot express their original written forms. They can only retrieve the oracle bone character corresponding to a given modern Chinese character (forward retrieval); they cannot be used to look up, from an oracle bone character one sees, the corresponding modern Chinese character, or to establish that none exists (reverse retrieval). Moreover, many oracle bone characters are known only by their shape, with pronunciation and meaning unknown. Looking up the meaning of an oracle bone character is therefore difficult: one must first guess which character it is and then consult a Chinese character index, and if the guess is wrong one must try again repeatedly. In addition, a large number of oracle bone radicals do not exist among the radicals of traditional Chinese dictionaries, and input methods such as pinyin cannot cover all oracle bone characters. Finally, some coding methods have only four code positions; even if they could express all oracle bone and bronze characters, there would be many duplicate codes.
Summary of the invention
The purpose of this invention is to provide an oracle bone script input method whose coding is simple and easy to use.
To achieve this purpose, the invention adopts the following technical solution:
A 6-code numeric input method for oracle bone script, comprising the following steps:
(1) Selecting code elements by comparing the structure of the oracle bone character against the code element definitions:
Each structural part of an oracle bone character is coded into six code elements: eye code, eyelash code, tiller code, branch code, flutter code and structure code. Each of the six code elements corresponds to a digit 0-9 on the computer keyboard, forming a code element sequence for numeric input. The sequence is arranged from left to right in the form <eye code><eyelash code><tiller code><branch code><flutter code><structure code>, and each code element is defined as follows:
Eye code: the eye code is the total number of blank regions enclosed by eyes, where an eye is a stroke formed by a closed curve in the oracle bone character;
The rules are as follows:
a. A thick stroke is counted as an eye; b. a round dot on a stroke is counted as an eye; c. an eye inside an eye is counted separately;
Eyelash code: the eyelash code is the total number of eyelashes, where an eyelash is a stroke that is connected to an eye and does not intersect any other stroke;
The rules are as follows:
a. When an eyelash stroke passes through an eye, each part is counted separately; b. an eyelash stroke inside an eye is counted as an eyelash; c. an eyelash stroke between eyes is counted as an eyelash;
Tiller code: the tiller code is the total number of intersection points in the tiller strokes, where tiller strokes are intersecting strokes in the oracle bone character that do not form an eye;
Branch code: the minimum number of single strokes with which the tiller strokes can be drawn;
Flutter code: the flutter code is the total number of floating strokes, where a floating stroke is a detached stroke in the oracle bone character that forms neither an eye nor a tiller;
Structure code: the structure code is the total number of structure blocks, where structure blocks are the mutually non-adjoining parts of the oracle bone character;
The rules are as follows:
a. When the eye of a standalone character contains an eye, eyelash, tiller or floating stroke, the structure code is 0;
b. When an eye is connected to an eye it contains, whether by a single eyelash or by touching at a point, the inner eye is an intra-eye eye and the structure code is 0; when an eye is surrounded by several eyes and adjoins them, the structure code is 1;
c. When an eye contains eye, eyelash, tiller or floating strokes and is only one component of a compound character, the structure code is 1, and it is counted toward the structure code together with the other structure blocks;
The eye code, eyelash code, tiller code, branch code, flutter code and structure code each have a maximum of 9; a count above 9 is still recorded as 9;
(2) Input step: after the code elements are selected as above, key in the 6-digit code element sequence in order on the computer keyboard;
(3) Selection step: from the oracle bone characters listed in the candidate box on screen for the 6-digit code, select the required character with a number key.
As the above scheme shows, the invention codes according to the graphical features of oracle bone script itself, using a six-digit numeric code with a low duplicate-code rate. Even a complex character whose pronunciation and meaning are unknown can be input and printed on a computer system once its code elements are selected from an analysis of its structure. This achieves office automation for oracle bone script and, more importantly, facilitates its decipherment.
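The coding step above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `CodeElements` and `encode` are hypothetical and do not come from the patent, and the six counts are assumed to have already been obtained by a person analyzing the glyph.

```python
from typing import NamedTuple

MAX_DIGIT = 9  # a count above 9 is still recorded as 9


class CodeElements(NamedTuple):
    """Six stroke counts in the left-to-right order of the code sequence."""
    eye: int
    eyelash: int
    tiller: int
    branch: int
    flutter: int
    structure: int


def encode(elements: CodeElements) -> str:
    """Cap each count at 9 and join the six digits into one input code."""
    for name, value in elements._asdict().items():
        if value < 0:
            raise ValueError(f"{name} count cannot be negative: {value}")
    return "".join(str(min(value, MAX_DIGIT)) for value in elements)


if __name__ == "__main__":
    # One eye, two eyelashes, one structure block.
    print(encode(CodeElements(1, 2, 0, 0, 0, 1)))   # 120001
    # A branch count of 10 is capped at 9.
    print(encode(CodeElements(2, 0, 6, 10, 0, 3)))  # 206903
```

The two sample inputs reproduce codes that the description gives later for the characters glossed "mouth" (120001) and "foot of a hill or mountain" (206903).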
Description of drawings:
Fig. 1 is a schematic diagram of the input state of the invention.
The specific embodiments of the invention are described in further detail below with reference to the drawing.
Embodiment
The oracle bone characters used in this description are taken from the "Oracle Bone Script Dictionary" edited by Xu Zhongshu and published by Sichuan Dictionary Publishing House. An annotation such as (859-1-2) denotes the 2nd character in the 1st row on page 859 of that dictionary.
Analysis of oracle bone characters shows that they are composed of three kinds of structure: closed curves, intersecting line segments, and detached curves or dots. The invention defines a closed curve as an eye stroke, a non-forking stroke connected to an eye as an eyelash stroke, and intersecting strokes that do not form an eye as tiller strokes. The total number of intersection points in the tiller strokes is the tiller code, and the minimum number of single strokes with which the tiller strokes can be drawn is the branch code. Detached strokes are coded as the flutter code, and the structure blocks of the character are coded as the structure code. These six code elements, arranged from left to right, constitute the code element sequence of the input method. The invention thus forms six code elements and maps each structural part of an oracle bone character, via its code element, to a digit 0-9 on the keyboard, forming a code element sequence for numeric input.
The format of the numeric code sequence of the invention is as follows:
<eye code><eyelash code><tiller code><branch code><flutter code><structure code>
Each code element is described in further detail below:
1. Eye code: a closed curve in an oracle bone character constitutes an eye (named after the Go term "eye"). The eye code is the total number of blank regions enclosed by eye strokes and is denoted y. If a character has several blank regions enclosed by closed curves, the eye code is that number. For example:
(mouth) y=1 (certainly) y=2
(specially) y=5
Figure G2009102189784D0000044
(fore-telling) y=0 (no eye)
Note: in the character
Figure G2009102189784D0000045
(mouth), only the closed-curve part is the stroke that forms the eye.
In the character
Figure G2009102189784D0000047
(certainly), only
Figure G2009102189784D0000048
is the stroke that forms the eye.
In the character (Designed), only
Figure G2009102189784D00000410
is the stroke that forms the eye.
Every stroke must be counted exactly once when coding: once it has been counted under one code, it is not counted again under any later code.
Eyes take many forms. Some are regular, symmetric figures, for example:
Figure G2009102189784D00000411
Some are irregular figures, for example:
Figure G2009102189784D00000412
Others are very complex figures formed by entangled and interwoven strokes, for example:
Figure G2009102189784D00000413
Y=7 Y=9 Y=9 Y=4 Y=3
The following rules apply when calculating the eye code:
(1) The eye code has a maximum of 9; if the number of eyes exceeds 9, the eye code is still recorded as 9 (the same applies to every other code). Example:
Figure G2009102189784D00000414
Y=9 Y=9 Y=9
(2) A thick stroke (corresponding to a gouged-out area on the oracle bone piece) is counted as an eye. Example:
Y=1 Y=5
Figure G2009102189784D0000053
Y=3
(3) A round dot on a stroke is counted as an eye. Example:
Y=6 Y=2 Y=2
(4) An eye inside an eye is counted separately. Example:
Figure G2009102189784D0000055
Y=7 Y=5 Y=6
2. Eyelash code: an eyelash is a stroke that is connected to an eye and does not intersect any other stroke. The eyelash code is the total number of eyelashes and is denoted J. If an eye has several eyelashes, the eyelash code is that number. In the invention, a T-shaped junction or a cross-shaped crossing of two lines is defined as an intersection; a bend in a line is not. Eyelash strokes depend on eye strokes: a character with no eye has no eyelash. In the examples given above for the eye code,
Figure G2009102189784D0000056
(mouth) and
Figure G2009102189784D0000057
(certainly), the strokes not listed as eye strokes are the eyelash strokes. For example:
(certainly) J=5 (ancestor) J=3
Figure G2009102189784D00000510
(mouth) J=2 (fore-telling) J=0
By contrast, in
Figure G2009102189784D00000512
(Designed), the strokes extending from the eye fork, i.e., they have an intersecting structure, so they are not eyelash strokes.
The following rules apply when calculating the eyelash code:
(1) A stroke that passes through an eye is counted separately on each side: even if it could be written in one stroke, it is counted as 2, not 1.
Figure G2009102189784D00000513
J=2 J=5 J=2
(2) An eyelash stroke inside an eye is also counted as an eyelash. Example:
Figure G2009102189784D00000514
J=2 J=2 J=3
(3) An eyelash stroke between eyes is also counted as an eyelash. Example:
Figure G2009102189784D00000515
J=3 J=2
3. Tiller code: intersecting strokes in an oracle bone character that do not form an eye are tiller strokes, like the forking of a branch or the tillering of a grain stalk. The number of intersection points of the intersecting strokes is the tiller code, denoted N.
4. Branch code: the minimum number of single strokes with which the tiller strokes can be drawn (without retracing; stroke order is free) is the branch code, denoted Z.
For example:
Figure G2009102189784D0000061
(slowly) N=2 Z=3
Figure G2009102189784D0000062
(Song) N=3 Z=5
(mulberry) N=7 Z=8
Figure G2009102189784D0000064
(mouth) N=0 Z=0
In particular, for tiller strokes formed by T-shaped or X-shaped intersections, the tiller code and branch code satisfy the following formula:
Z = N + number of tiller clusters
The number of tiller clusters is the number of groups of tiller strokes separated from one another by blank space or by an eye. For example, for the character "slowly" above the cluster count is 1, and for "Song" it is 2. Likewise, for
Figure G2009102189784D0000065
(foot of a hill or mountain), the cluster count is 4 and N=6, so Z = N + 4 = 10, which is recorded as 9.
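The relation between the tiller code and the branch code can be checked with a short Python sketch. The function name `branch_code` and its parameters are hypothetical, chosen only for illustration. As stated above, the relation is assumed to hold only for tiller strokes formed by T-shaped or X-shaped intersections, and the cluster (agglomerate) count is supplied by the person analyzing the glyph.

```python
def branch_code(tiller_code: int, cluster_count: int, cap: int = 9) -> int:
    """Z = N + number of tiller clusters, recorded as at most `cap`."""
    if tiller_code < 0 or cluster_count < 0:
        raise ValueError("counts must be non-negative")
    return min(tiller_code + cluster_count, cap)


if __name__ == "__main__":
    # (N, cluster count) pairs taken from the three examples in the text.
    examples = {
        "slowly": (2, 1),
        "Song": (3, 2),
        "foot of a hill or mountain": (6, 4),
    }
    for gloss, (n, clusters) in examples.items():
        print(gloss, branch_code(n, clusters))
```

For the three examples this yields 3, 5 and 9, matching the Z values given in the text.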
For tiller strokes formed by three or four strokes intersecting at a single point, for example
Figure G2009102189784D0000066
(?) and (small house), such characters are few, so the strokes are simply counted directly.
The following rules apply when calculating the tiller code and branch code:
(1) When tiller strokes pass through an eye, they are still regarded as two tiller clusters even if some strokes could be written in one stroke, and the tiller code and branch code are calculated separately for each cluster. Example:
Figure G2009102189784D0000068
N=4 Z=6 N=3 Z=6
(2) For strokes on an eye: if three lines extend from one point on the eye to the same side, they are tiller strokes; if two lines extend from one point on the eye to the same side, they are eyelashes.
Example:
Figure G2009102189784D0000069
J=2 N=1 Z=2 J=0 N=2 Z=4
In addition, some characters were deformed when carved, forming new eyes, so that what were originally tiller strokes may become eyes:
Figure G2009102189784D00000610
Y=3 J=5 N=1 Z=2 Y=4 J=7 N=0 Z=0
The handling of this situation is explained later.
5. Flutter code: a floating stroke is a detached stroke in an oracle bone character that forms neither an eye nor a tiller. The flutter code is the total number of floating strokes and is denoted P. If there are several floating strokes, the flutter code is that number. These strokes take many shapes: curves, broken lines, short dashes or dots. When calculating the flutter code, only the number of strokes is counted, regardless of shape. For example:
Figure G2009102189784D0000071
(needlework) P=3
Figure G2009102189784D0000072
(little) P=3
Figure G2009102189784D0000073
(?)P=1 (?)P=3
Figure G2009102189784D0000075
(bright) P=2
When a floating stroke touches an eye, it may form an eyelash or a new eye. The coding method in this case is as follows:
A. Analyze by outline and compare with the other, symmetric floating strokes; if the stroke is symmetric with a floating stroke, it is counted as a floating stroke.
Example:
B. If a new eye is formed, both glyph forms are listed in the character library, so the character can be input whichever way it is coded.
Example:
Figure G2009102189784D0000077
P=0,
Figure G2009102189784D0000078
P=1
6. Structure code: structure blocks are the mutually non-adjoining parts of an oracle bone character. The structure code is the total number of structure blocks (not including blocks inside an eye) and is denoted G. For example:
Figure G2009102189784D0000079
G=2
Figure G2009102189784D00000710
(defending) G=8
The following rules apply when calculating the structure code:
(1) When the eye of a standalone character contains an eye, eyelash, tiller or floating stroke, G=0. For example:
Figure G2009102189784D00000711
(bright) G=0
Figure G2009102189784D00000712
(misfortune) G=0
Figure G2009102189784D00000713
(?)G=0
(2) When an eye is connected to an eye it contains, whether by a single eyelash or by touching at a point, the inner eye is an intra-eye eye and G=0; when an eye is enclosed by several other eyes and adjoins them, G=1. Example:
Figure G2009102189784D00000714
G=0 G=1 G=1
The criterion for deciding whether the structure code is 1 or 0 in this case is whether the eye is surrounded by one eye or by several: if it is surrounded by one eye, the structure code is 0; if by several, the structure code is 1.
(3) When an eye contains eye, eyelash, tiller or floating strokes and is only one part of a compound character, G=1, and it is counted toward the structure code together with the other structure blocks.
Example:
Figure G2009102189784D0000081
As a standalone character: G=0. As part of a compound character: G=3
Following the code element rules above, the six digits are combined in order, and an oracle bone character can then be input in full.
Figure G2009102189784D0000082
(people) 001201 (mouth) 120001
Figure G2009102189784D0000084
(mulberry) 247803
Figure G2009102189784D0000085
(foot of a hill or mountain) 206903
Figure G2009102189784D0000086
(bright) 220000
The invention is explained below through a specific embodiment.
For example, to input the character
Figure G2009102189784D0000087
first select each code element of <eye code><eyelash code><tiller code><branch code><flutter code><structure code> by comparing the structure of this oracle bone character against the code element descriptions;
Figure G2009102189784D0000088
(happiness): analysis of its structure shows 3 blank regions formed by closed curves, so the eye code is 3;
there are 4 non-forking eyelash strokes connected to eyes, so the eyelash code is 4;
the tiller strokes, which do not form an eye, have 2 intersection points, so the tiller code is 2;
the branch code is the minimum number of single strokes with which the tiller strokes can be drawn, so the branch code is 3;
there is 1 floating stroke, so the flutter code is 1;
there are 3 mutually non-adjoining structure blocks, so the structure code is 3;
with the code elements determined as above, key in 342313 in order on the computer keyboard;
a selection box containing 2 oracle bone characters appears on the screen (Fig. 1); entering the digit 1, which stands for the required character, yields
Figure G2009102189784D0000089
(happiness), completing the input.
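The input and selection steps of this embodiment amount to a table lookup followed by a choice by number key. The Python sketch below illustrates that flow only. `CODE_TABLE`, `candidates` and `select` are hypothetical names, English glosses stand in for glyphs, and the second candidate under 342313 is a placeholder because the text does not say which other character shares that code.

```python
from typing import Dict, List

# Hypothetical code table: 6-digit code -> candidate characters.
# A real table would be built from a glyph library; these entries are
# illustrative only.
CODE_TABLE: Dict[str, List[str]] = {
    "342313": ["happiness", "second-candidate"],  # placeholder second entry
    "120001": ["mouth"],
}


def candidates(code: str) -> List[str]:
    """Return the characters listed for a 6-digit code (empty if none)."""
    if len(code) != 6 or not (code.isascii() and code.isdigit()):
        raise ValueError(f"expected six digits 0-9, got {code!r}")
    return CODE_TABLE.get(code, [])


def select(code: str, number_key: int) -> str:
    """Pick a candidate with a 1-based number key, as in the selection box."""
    options = candidates(code)
    if not 1 <= number_key <= len(options):
        raise IndexError(f"no candidate {number_key} for code {code}")
    return options[number_key - 1]


if __name__ == "__main__":
    print(candidates("342313"))
    print(select("342313", 1))
```

Keeping a list per code reflects the fact that a six-digit code may still be shared by more than one character, which is why a selection step is needed at all.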
Like modern Chinese, oracle bone script has commonly used characters, for example dates expressed in the Heavenly Stems and Earthly Branches, names of diviners, names of ancestors, names of states, and divinations about weather, farming, good and ill fortune, illness, childbirth, and victory or defeat in war; vocabulary for the many kinds of sacrifice in particular occurs in large numbers. Commonly used characters and clearly written characters can be input very easily with the above coding.
Complete Shang dynasty oracle inscriptions are used as examples below:
Example one: (first 806, " inscriptions on bones or tortoise shells dictionary " 1538-15) second last of the twelve Earthly Branches chastity is spoon year in the Zu Yi prison one N again.
The input codes are, in order: 000011 002301 520001 001201 001212 002301 002301 320012 002301 002312 000011 002301.
Example two:
Figure G2009102189784D0000092
(capital 4409, " inscriptions on bones or tortoise shells dictionary " 1541-16) Xin Maozhen third relates to from hunting
The input codes are, in order: 121201 240002 520001 002402 204602 220001 260013.
Example three:
Figure G2009102189784D0000093
Is (third 302, " inscriptions on bones or tortoise shells dictionary " 1544-6) foretold Que chastity the third of the twelve Earthly Branches in the ninth of the ten Heavenly Stems and says the son merchant? the last of the ten Heavenly stems is honest
The input codes are: 002301 121201 001201 323502 500001 120012 101201 462401 430001 005601 322302.
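A run of input codes like those in the three examples above is simply a stream of digits read six at a time. The Python sketch below splits such a stream into individual codes and names the six fields of each. The helper names `split_codes` and `decode` and the field names are hypothetical; any non-digit characters in the input are treated as separators and ignored.

```python
import re
from typing import Dict, List

FIELDS = ("eye", "eyelash", "tiller", "branch", "flutter", "structure")


def split_codes(text: str) -> List[str]:
    """Drop every non-digit character, then cut the digits into 6-digit codes."""
    digits = re.sub(r"\D", "", text)
    if len(digits) % 6 != 0:
        raise ValueError("digit count is not a multiple of six")
    return [digits[i:i + 6] for i in range(0, len(digits), 6)]


def decode(code: str) -> Dict[str, int]:
    """Map one 6-digit code onto its six named code elements."""
    if len(code) != 6 or not (code.isascii() and code.isdigit()):
        raise ValueError(f"expected six digits 0-9, got {code!r}")
    return dict(zip(FIELDS, (int(ch) for ch in code)))


if __name__ == "__main__":
    for code in split_codes("204602 220001260013"):
        print(code, decode(code))
```

Because every code has a fixed length of six digits, a sequence stays unambiguous even when written without spaces.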
In addition, oracle bone script contains many structurally complex characters, and strokes were carved quite freely. Moreover, oracle bone characters are excavated on bone pieces, where blurred and damaged writing is very common. Such blurred, complexly written or incomplete characters make computer input difficult. The selection of code elements is therefore described below for special cases: characters with radicals, complexly written characters, partially incomplete characters and partially blurred characters.
In these cases, besides coding by the method described above, the following method can be combined with it to code more conveniently and accurately:
1) Radical standardization: oracle bone script has no standard printed form, and no standard written form can be laid down for each character. It does have radicals, however, and a standard written form can be laid down for them. The number of standard radicals is limited, and all are related to bronze inscriptions, small seal script and even modern Chinese characters, so remembering them in use is not difficult. When coding oracle bone characters that have radicals (especially complex ones), the standard radicals can be combined with the code element descriptions above to code these characters.
Oracle bone script is a fairly mature script and many characters have radicals, but the same character or radical is written in many ways, i.e., there are many variant forms. Different written forms yield different codes, so a character composed from them has multiple codes, which hinders accurate use. For example, when the character "wood" is written as
Figure G2009102189784D0000101
its code is 002301; when written as
Figure G2009102189784D0000102
its code is 003401; when written as
Figure G2009102189784D0000103
its code is 002401, and so on; at least four or five different codes can be produced. And in a character composed with "wood",
Figure G2009102189784D0000104
(elm), the side component
Figure G2009102189784D0000105
also has different written forms, so the character "elm" would need at least 8 codes. This makes the duplicate-code rate too high, and when the writing on the bone piece is not very clear, coding may even be impossible. With the radicals standardized, once one can see which radical a character contains, coding by the standard written form of that radical is much faster and more accurate.
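The effect of radical standardization can be sketched as mapping every recognized variant of a radical onto one standard code. The Python sketch below is an illustration under stated assumptions: the `Radical` class and `code_for` function are hypothetical, the three codes for "wood" are the ones quoted above, and treating 002301 as the standard form is an assumption made here for the example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Radical:
    """A standard radical plus the codes of its known variant forms."""
    name: str
    standard_code: str
    variant_codes: Tuple[str, ...]


# Codes quoted in the text for three written forms of "wood".
# Choosing 002301 as the standard form is an assumption for this sketch.
WOOD = Radical("wood", "002301", ("003401", "002401"))


def code_for(radical: Radical, written_code: str) -> str:
    """Return the standard code for any recognized written form of a radical."""
    if written_code == radical.standard_code or written_code in radical.variant_codes:
        return radical.standard_code
    raise ValueError(f"{written_code} is not a known form of {radical.name}")


if __name__ == "__main__":
    for form in ("002301", "003401", "002401"):
        print(form, "->", code_for(WOOD, form))
```

The lookup is keyed on the radical the reader has recognized rather than on the code alone, because the same six-digit code can belong to unrelated characters.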
When selecting a standard radical, the most frequently occurring written form may be chosen; the same principle applies to every other standard radical. The table of oracle bone standard radicals and glyphs is attached. It contains 179 standard radicals and standard glyphs in total, or 265 when their variant forms are included (counting repeated entries such as worm (it), (), month (sunset), Cui (only) and (ancestral), broom (woman), its (dustpan), mountain (fire): 10 characters in all, with 18 variant forms).
The attached table of oracle bone standard radicals is explained as follows:
First column: the pinyin of the modern Chinese character corresponding to the radical. For example, the pinyin for the tiger-head radical is hu. The table is arranged in pinyin alphabetical order; the pinyin is placed in the first column to make lookup by sound order easy.
Second column: the radical code. Some radicals have only one code; others are written in several ways and have several codes.
Third column: the oracle bone standard radical or standard glyph.
Fourth column: the modern Chinese character corresponding to the standard radical (glyph). Where the correspondence is complex, only one representative modern character is chosen; for example,
Figure G2009102189784D0000106
corresponds to modern characters including left, right, again and inch, and only "again" is used here.
Fifth column: the other variant written forms subsumed under the standard radical. They resemble the standard radical in appearance but code differently; when one of these variants occurs in a compound character, it is normalized to the standard radical.
Sixth column: example characters composed with the standard radical (glyph); only a few are listed for illustration.
Example of coding with standard radicals:
"Oracle Bone Script Dictionary" 905-8:
Figure G2009102189784D0000111
a character whose meaning has not been deciphered. It can be seen to consist of four parts: person, again, use, and mountain (or a form resembling the mountain character, which is coded as mountain). All of these are in the standard radical table; only "use" differs from its standard radical and is normalized to
Figure G2009102189784D0000112
The character is then written as
Figure G2009102189784D0000113
and coded 492404 for input.
Principles for choosing radicals:
1. Where a radical has several variant forms, the form occurring most often in the "Oracle Bone Script Dictionary" is chosen as the standard radical, for example
Figure G2009102189784D0000114
(wood),
Figure G2009102189784D0000115
(),
Figure G2009102189784D0000116
(the third), (woman),
Figure G2009102189784D0000118
(minister), and so on, each being the most frequent of its variant forms.
2. If no variant form predominates, the form whose code position holds the fewest characters is chosen, for example
Figure G2009102189784D0000119
(bird), code 391200; only the bird character falls under this code, which helps reduce the code collision rate.
3. In the standardized radical table, some standard radicals take only one written form, for example:
Figure G2009102189784D00001110
Wood,
Figure G2009102189784D00001111
The woman,
Figure G2009102189784D00001112
Row,
Figure G2009102189784D00001113
Again,
Figure G2009102189784D00001114
Vow,
Figure G2009102189784D00001115
Cui, From,
Figure G2009102189784D00001117
Ji,
Figure G2009102189784D00001118
Jie,
Figure G2009102189784D00001119
On-Cheng,
Figure G2009102189784D00001120
And,
Figure G2009102189784D00001121
The third,
Figure G2009102189784D00001122
Minister,
Figure G2009102189784D00001123
Occasion,
Figure G2009102189784D00001124
Figure G2009102189784D00001125
Figure G2009102189784D00001126
The woman, Fish,
Figure G2009102189784D00001128
The people,
Figure G2009102189784D00001129
The family, End,
Figure G2009102189784D00001131
Net, Volume, Factory,
Figure G2009102189784D00001134
,
Figure G2009102189784D00001135
Dustpan, Five,
Figure G2009102189784D00001137
Zhuang (sheet),
Figure G2009102189784D00001138
Sheep, Ox etc.
Using these forms in place of the other variant forms causes no misidentification at all.
Some characters have several written forms, for example "shell", "high", geng (the seventh Heavenly Stem), "horn", "separate", "fortunate", you (the tenth Earthly Branch), "rain", and hui (see the radical table). These variant forms differ greatly, so an oracle bone character written with one form substituted for the others would differ greatly from the original and would no longer be the original oracle bone character. In principle a single form could still be used, with an annotation where this is done, but the code collision rate would be too high: after a code is entered, a large number of characters sharing that code would be listed, which is very inconvenient, and for many characters the glyph and the code would be inconsistent, which is also inconvenient. Weighing the two considerations, several written forms are given for the radicals of these characters:
For example:
Figure G2009102189784D00001140
4. Characters whose eyelash code or tiller code is 0 or 9 are chosen, which saves calculation when working out the code of a combined character. For example
Figure G2009102189784D00001141
(fish) 900001,
Figure G2009102189784D00001142
(bird) 391200,
Figure G2009102189784D00001143
(wind, phoenix) 239901,
Figure G2009102189784D00001144
(chicken) 329901, and so on;
5. If an existing radical (glyph form) in the dictionary does not meet the need, it is suitably modified so that its image is easy to remember. This applies to the standard glyph forms of beasts such as horse, deer, elk, elephant, and tiger. When the pictographic torso has an eye, no tiller is counted for it. When the torso of a tiger, elk, rhinoceros, or other beast has no eye, the torso is always coded as tiller 4, branch 5. The deer character is coded 306901, of which the torso alone likewise contributes tiller 4, branch 5; the "ji" character is coded 004501, again tiller 4, branch 5. The radicals of all pictographic animals thus follow one coding rule, which makes them easy to remember;
6. For some characters, radicals resembling small seal script, bronze inscriptions, or modern characters are chosen, for example
Figure G2009102189784D0000121
(wood),
Figure G2009102189784D0000122
(water),
Figure G2009102189784D0000123
(Contraband), (towel),
Figure G2009102189784D0000125
(slit bamboo or chopped wood),
Figure G2009102189784D0000126
("writing"), and similar characters, which are roughly the same as in small seal script;
Figure G2009102189784D0000127
(capital),
Figure G2009102189784D0000128
("shell"), and similar characters, which are the same as in bronze inscriptions;
Figure G2009102189784D0000129
("cart"), which is like the modern character laid on its side,
Figure G2009102189784D00001210
("come"), which resembles the modern character; defining them this way aids memorization;
In addition, although many radicals are related to small seal characters, a large number of radicals are peculiar to oracle bone script. More than 1,000 years passed between the formation of oracle bone script and that of small seal script, and glyph forms changed greatly. Some characters that the Shuowen Jiezi lists under a given section header do not derive from that section header (radical) in oracle bone script, for example:
Figure G2009102189784D00001211
"Spoon": in small seal script it derives from the "show" radical; the oracle bone form does not, and is the original written form of the "spoon" character. The form with the "show" radical first appears in bronze inscriptions. "Trap": in small seal script it derives from "mound" with "well" as its phonetic, making it a phono-semantic character; the oracle bone form
Figure G2009102189784D00001212
(460012) is an associative compound that neither derives from "mound" nor has "well" as a phonetic. Similar cases are numerous. There are also oracle bone radicals that do not exist in small seal script, for example
Figure G2009102189784D00001213
(named On-Cheng here; it is in fact the lower half of the On-Cheng character
Figure G2009102189784D00001214
but is not the character glossed "slowly"). It is not a section header in small seal script, but dozens of oracle bone characters contain this radical, so it is included in the radical table on the strength of oracle bone script's own characteristics. Conversely, some radicals are not selected as standard radicals, for example
Figure G2009102189784D00001215
(Nian), because characters containing it can already be input using
Figure G2009102189784D00001216
as the radical, so it is not selected. As another example, the "Yichang" character is also an oracle bone radical, but it forms few combined characters, so it is not selected either.
Rules for using the standard radicals:
1. A standard radical replaces the other variant forms only when a combined character is being input; when used as an independent character, it does not replace them. For example, "wood",
Figure G2009102189784D00001219
within the "elm" character replaces
Figure G2009102189784D00001220
and similar forms; but when the "wood" character is used on its own, each form of "wood" is coded according to the rules given earlier.
2. When an oracle bone character is input using a standard radical in place of a variant form, the character that is input keeps its original shape, so the code and the glyph may be inconsistent. This is permitted. For example:
Figure G2009102189784D00001221
Coded by its original shape this would be 582403; by the standard code it is 592303. On input, the original written form is still displayed:
Figure G2009102189784D00001222
From this example alone, the standard radical may seem superfluous. In practice, however, characters on oracle bone fragments are often unclear, and the standard radical makes input more reliable.
For each combined oracle bone character, the character library stores both the code of its original written form and its standard radical (or standardized glyph form) code, so the user can input the character either way.
2) Glyph standardization:
Some oracle bone characters, for example "cart", "deer", "chicken", and "phoenix", have especially complex structures. The Shang-dynasty diviners wielded the knife quite freely when carving, and after more than 3,000 years of burial the strokes have become even less clear. Having to take a magnifying glass at input time to find how many eyes, eyelashes, and tillers a character has would be an exercise in futility. These characters, however, share one notable feature: they are highly pictographic, so even when the strokes are blurred the character can still be recognized. A standard glyph form can therefore be defined to stand for all of its variants. In practice, once such a character is recognized, it is coded by its standard glyph form; after input, the variants are displayed and the user selects the one needed. Like a standard radical, a standard glyph form replaces the other variant forms within compound characters. The difference is that, when used as an independent character, a standard radical cannot replace the other variant forms, whereas a standard glyph form can.
There are 18 standard glyph forms in all: Nao, "page", "phoenix" ("wind"), "bird", "chicken", "tortoise", "Chinese alligator", "autumn", "fish", "worm" ("it"), "cart", "tiger", "rhinoceros", "horse", "elephant", "deer", "elk", and a standard glyph form representing other beasts (named "beasts"). Counting the variant written forms, there are 28 in total.
The coding of the standard glyph forms is illustrated below, group by group:
A. Nao and "page": in modern script Nao and "page" are clearly different, but in oracle bone script they look very much alike. In the Oracle Bone Script Dictionary, Nao is on page 622 and "page" is on page 991. In the standard radical table, Nao and "page" share the same glyph form
Figure G2009102189784D0000131
coded 342400. When a character resembling a monkey shape is encountered, entering this glyph code suffices.
This standard glyph form has the compound characters "foam" and "cut down" (page 992).
"Foam", code: 484703 (equivalent to
Figure G2009102189784D0000133
)
Figure G2009102189784D0000134
"Cut down", code: 482402 (equivalent to )
B. Standard glyph forms of "chicken", "bird", and "phoenix" ("wind"):
These three characters are all bird-shaped pictographs, but the differences between them are clear. Apart from the one listed as the standard glyph form, all forms of the chicken character (page 394) contain the component
Figure G2009102189784D0000136
(see page 395 of the Oracle Bone Script Dictionary). The standard glyph form of the chicken character is coded 329901. The chicken character forms no compound characters.
"Wind" (page 1429) and "phoenix" (page 427) are interchangeable characters. Their distinguishing feature is a xin ("hot") character head on top, sometimes with three additional eyelashes on the xin head, for example
Figure G2009102189784D0000138
or with the "all" component attached, for example
Figure G2009102189784D0000139
which distinguishes it from the chicken character. On this basis the standard glyph form of the phoenix (wind) character is defined as
Figure G2009102189784D00001310
coded 239901.
The phoenix character has the compound character
Figure G2009102189784D00001311
coded 239924.
Bird-shaped characters have no xin head and are thus easily distinguished from the phoenix character. The key difference from the chicken character is that they have few tiller strokes representing the body and tail and do not contain the component
Figure G2009102189784D00001312
The standard glyph form
Figure G2009102189784D00001313
coded 391200, inputs all 15 bird characters from page 426 onward.
There are many pictographs resembling the bird character. When a bird-shaped character is encountered, it is hard to tell whether it is "bird", "swallow", or some other character, so all such characters use the standard glyph form 391200 for convenient input. If a compound character is structurally simple and direct coding is more convenient, the standard glyph form need not be used. Compound character example:
Figure G2009102189784D0000141
"Collect" (426-4): 393502.
C. Standardization of the "tortoise", "Chinese alligator", and "autumn" characters:
These three characters are all pictographs, rather similar to one another and also very complex. They differ as follows: the autumn character
Figure G2009102189784D0000142
has eyelashes on the head; the tortoise character
Figure G2009102189784D0000143
does not; and the Chinese alligator character
Figure G2009102189784D0000144
additionally has eyes on the eyelashes of the head. Different standard glyph forms are defined accordingly:
Tortoise character (page 1434): oracle bone script has both frontal-view and side-view tortoise characters. We select
Figure G2009102189784D0000145
902401 as the standard glyph form of the side-view tortoise, and 504801 as the standard glyph form of the frontal-view tortoise. The eye codes of these two standard glyph forms differ because there is also a "Strider" character (1441-4)
Figure G2009102189784D0000147
whose glyph differs little from the frontal-view tortoise. When such a character is found on an actual object, it is hard to tell "Strider" from "tortoise", so the same code may be chosen for both, avoiding the trouble of deciding.
Compound character example (1438-13): from side-view tortoise, from "vessel", from "spoon"; standard code 943603.
Compound character:
Figure G2009102189784D0000149
(1437-3), from frontal-view tortoise, from
Figure G2009102189784D00001410
coded 506902.
Autumn character (page 1435): standard glyph form
Figure G2009102189784D00001411
coded 932401. Compound character
Figure G2009102189784D00001412
(1441-1-2), from "autumn", from "again", from "eight"; code: 933624.
Chinese alligator character (page 1441): standard glyph form coded 942401. It forms no compound characters.
D. Standard glyph form of the fish character:
Fish (page 1255): pictograph. Standard glyph form:
Figure G2009102189784D00001414
Coding 900001.
Combined characters
Figure G2009102189784D00001415
"Fishing", 900056.
E. Standard glyph form of the worm character:
Worm character (page 1430): some independent worm characters are written very simply
Figure G2009102189784D00001416
and need no standard glyph form to represent them; they are listed only as a standard radical. Other worm characters are complex. The standard glyph form of the worm character represents worm characters of complex form and characters that contain a worm-shaped figure.
Standard glyph form:
Figure G2009102189784D00001417
taken from the Oracle Bone Script Dictionary, 1430-5; code: 910001.
Synthetic word example: (593-1), the standard radical of three high words is got on top, and the standard font of worm word is got in the bottom, coding 960002.
F. Standardization of the cart character: the modern cart character written sideways is taken as the standardized cart character and represents the cart character in all its written forms.
Figure G2009102189784D0000151
Standard code: 402401.
G. Standardization of "tiger", "rhinoceros", "horse", and "elephant":
Rhinoceros (page 1061): pictograph of a rhinoceros. Standard glyph form coded 704501. The rhinoceros has many written forms in oracle bone script; this standard glyph form inputs all the rhinoceros characters on page 1061.
Elephant (page 1065): a pictograph whose main feature is the long proboscis. Standard glyph form:
Figure G2009102189784D0000153
Coding 360001.
Horse (page 1067): the main feature of the horse character is the eyelash strokes (the mane) on the back. In some forms the torso has an eye and in others it does not (the head always has an eye; the distinction concerns only the torso), and the two cases are coded separately. Torso with eye:
Figure G2009102189784D0000154
Code: 470001, which inputs all 10 horse characters with a torso eye on page 1067. Compound character example (1077-10-2): from "horse", from "spoon", using the standard glyph codes of the two characters: 471202 (equivalent to
Figure G2009102189784D0000156
). Torso without eye:
Figure G2009102189784D0000157
Code 304501, which inputs the two eyeless-torso horse characters 1067-6 and 1067-7, as well as eyeless-torso horse characters appearing elsewhere.
Compound character example:
Figure G2009102189784D0000158
(1073-13), from
Figure G2009102189784D0000159
"horse", from "upright", from
Figure G2009102189784D00001511
"dog"; code: 429903.
Tiger (page 527): the main feature is the shape of the head. Standard glyph form with an eye in the torso: code 960001.
Compound character example: (529-2), from "dagger-axe", from "tiger"; code 962302.
Standard glyph form with no eye in the torso:
Figure G2009102189784D00001514
Coding 224501.
Synthetic word example:
Figure G2009102189784D00001515
(532-8-5), from "tiger", from "wood", from "woman"; code 457903.
H. Standardization of characters representing deer-type animals: the deer character is easy to identify in oracle bone script, its antlers being the outstanding feature. When the torso has an eye:
Figure G2009102189784D00001516
coded 442401; when the torso has no eye: coded 306901.
The main feature of the elk character is three eyelashes on the head. When the torso has an eye:
Figure G2009102189784D00001518
coded 470001; when the torso has no eye:
Figure G2009102189784D00001519
coded 334501.
I. Characters representing other animals: these characters come in all shapes and are the hardest to tell apart, so they are all assigned to the "beasts" standard glyph form. The glyph is taken from the original "fawn" character, with its scope extended to represent other beast characters.
Torso with eye, head with ears:
Figure G2009102189784D00001520
Coding 460001.
Torso with eye, head without ears:
Figure G2009102189784D00001521
Coding 440001.
Torso without eye, head with ears:
Figure G2009102189784D00001522
Coding 324501.
Torso without eye, head without ears: coded 304501.
Compound character example 1:
Figure G2009102189784D0000162
(1093-5), from "mouth", from an unknown beast; coded with the "beasts" glyph form (torso with eye, no ears): 560002.
Compound character example 2:
Figure G2009102189784D0000163
(1076-6), from "fish" (900001) and from an unknown animal, which is treated as a beast with an eye in the torso and no ears (440001). Compound character code: 940002. This character is unclearly written, and neither its meaning nor its pronunciation is known. It would be hard to code by imitating input methods for modern Chinese characters, but the standard glyph form method solves the input problem well.
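The arithmetic in this example can be sketched in code. The text does not state the combination rule explicitly; the sketch below assumes, from the worked example (900001 and 440001 giving 940002), that the codes of components are added position by position and that each position is capped at 9. The function name `combine_codes` is hypothetical, and the rule is assumed to hold only for components that do not touch one another.

```python
def combine_codes(*codes):
    """Add six-digit component codes position by position, capping at 9.

    Assumption: components are separate blocks, so no strokes merge or
    cross when the components are put together.
    """
    for code in codes:
        if len(code) != 6 or not code.isdigit():
            raise ValueError(f"not a six-digit code: {code!r}")
    return "".join(
        str(min(9, sum(int(code[i]) for code in codes)))
        for i in range(6)
    )


fish = "900001"   # standard glyph form of "fish"
beast = "440001"  # "beasts" glyph form: torso with eye, no ears
print(combine_codes(fish, beast))  # 940002
```

The cap at 9 is what makes components with a 0 or 9 in a position convenient: their contribution to that position needs no real calculation.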
The principle of the standard glyph form method resembles that of pinyin input for modern Chinese characters. It might seem that these 18 standard glyph forms could simply be input by pinyin instead of by code, but that would raise two problems: first, it would break the uniformity of numeric coding; second, within a compound character, pinyin cannot be combined with the codes of the other components.
Parts of an oracle bone character that are neither standard radicals nor standard glyph forms may also have unclear strokes. This can be handled by the following methods:
3) Character symmetrization. A large number of oracle bone characters are symmetric top-to-bottom or left-to-right.
Symmetry in oracle bone script takes many forms, such as central symmetry:
Figure G2009102189784D0000164
Axisymmetric shape:
Figure G2009102189784D0000165
Figure G2009102189784D0000166
The conjugation symmetric figure:
Figure G2009102189784D0000167
Similar equal symmetric figure:
Figure G2009102189784D0000168
Broken symmetry, local symmetry, and so on.
Local symmetry:
Figure G2009102189784D00001610
The two arms of the "ji" character are symmetric.
All three structural parts are axially symmetric characters.
Figure G2009102189784D00001612
The bottom left section symmetry.
Figure G2009102189784D00001613
The upper and lower parts on the left are each an asymmetric herringbone shape.
The symmetry of oracle bone script adds to the beauty of the characters. Using this property, oracle bone characters can be normalized and damaged strokes repaired, so that the code elements of a character can be determined more reliably.
Example one:
Figure G2009102189784D00001614
symmetrized to
Figure G2009102189784D00001615
592303
Example two:
Figure G2009102189784D00001616
--- 442404
Example three:
Figure G2009102189784D00001618
---
Figure G2009102189784D00001619
692402,
Figure G2009102189784D00001620
592402.
Example four:
Figure G2009102189784D00001621
---
Figure G2009102189784D00001622
772324,
Figure G2009102189784D00001623
792324.
Example five:
Figure G2009102189784D0000171
(857-7-1): this is a structurally messy character that can be coded by the symmetry method. First, the upper part is treated as left-right symmetric, giving 1 eye and 2 eyelashes. Next, the lower part is symmetrized so that it mirrors the upper part; the upper and lower parts together then have 2 eyes and 4 eyelashes. Finally, the middle is treated: the middle strokes can be seen to contain no tiller or flutter strokes, the eye count exceeds 9, and the eyelashes there may number 0, 1, 2, 3, 4, 5, and so on. Altogether there are 6 possible codes: 940003, 950003, 960003, 970003, 980003, 990003. Whichever of these codes the user derives under the coding principles, the character can be input.
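The six candidate codes of this example can be enumerated mechanically. The following is a minimal sketch, assuming that every code position is capped at 9 and that the uncertain middle eyelashes are added to the 4 eyelashes already counted. The function name and the eye count of 12 are hypothetical; the text says only that the eye count exceeds 9.

```python
def codes_for_uncertain_eyelash(eye, known_eyelash, tiller, branch,
                                flutter, structure, max_extra=9):
    """List the distinct codes obtained as the extra eyelash count varies."""
    codes = []
    for extra in range(max_extra + 1):
        counts = (eye, known_eyelash + extra, tiller, branch, flutter, structure)
        # Each position holds a single digit, so counts above 9 become 9.
        code = "".join(str(min(9, count)) for count in counts)
        if code not in codes:
            codes.append(code)
    return codes


print(codes_for_uncertain_eyelash(eye=12, known_eyelash=4, tiller=0,
                                  branch=0, flutter=0, structure=3))
# ['940003', '950003', '960003', '970003', '980003', '990003']
```

All six results would be stored in the character library for this one character, so that any reading of the blurred middle strokes leads to it.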
4) Block structure separation. Oracle bone characters are carved, and during carving some strokes may be deformed, causing adhesion between structural blocks. Unreasonable adhesions are therefore separated first and the character is coded afterwards. Example four in the preceding section also used block structure separation. Further examples:
Example one:
Figure G2009102189784D0000172
(204-1-1) is separated into
The character under this entry in the Oracle Bone Script Dictionary is drawn as it stands, with the upper part
Figure G2009102189784D0000174
and
Figure G2009102189784D0000175
("volume") joined together. Separating them and then standardizing the "volume" character gives the code 591225.
Example two:
Figure G2009102189784D0000176
(1341-11) is divided into four structural blocks and the "woman" character is standardized; code: 473604.
Example three:
Figure G2009102189784D0000178
(1490-6): the adhering internal structural blocks are separated,
Figure G2009102189784D0000179
and the character is symmetrized,
Figure G2009102189784D00001710
Coding:
Example four:
Figure G2009102189784D00001712
(1516-6) is separated into ; code: 642404.
Example five: (523-5) is symmetrized and separated into
Figure G2009102189784D00001715
226903 or
Figure G2009102189784D00001716
227903.
In some characters the scribe deliberately joined structural blocks to express an associative meaning, but at input time we may take this for unreasonable adhesion. To prevent errors arising from differences in interpretation, we separate the structural blocks and list both glyph forms in the character library, so that the character can be input whichever way it is coded.
Example six:
Figure G2009102189784D00001717
(531-11)--- Coding: 244502.
Example seven:
Figure G2009102189784D00001719
(1161-12), with the standard radical " " separated out:
Figure G2009102189784D00001720
693504.
Example eight:
Figure G2009102189784D00001721
(893-5-2), separating out
Figure G2009102189784D00001722
and standardizing it:
Figure G2009102189784D00001723
464702.
The principle of separation is as follows: unless specially stipulated otherwise, every standard radical or standard glyph form in a combined character, and every component that is not among the radicals but is itself an independent oracle bone character, must be separated from the other parts. See the examples above.
Some special cases in which a radical (or character) is not separated from other strokes:
A: When a radical represents part of an animal or human body and is attached to the human or animal, it is not separated. Examples are the components representing headdress
Figure G2009102189784D0000181
(suffering),
Figure G2009102189784D0000182
the tiger head, and so on.
Example nine:
Figure G2009102189784D0000183
The "ear" radical is not separated from the person.
B: Where the meaning or structure of the character does not permit separation, the parts are not separated.
Example ten: judging from the associative meaning of the glyph,
Figure G2009102189784D0000185
With
Figure G2009102189784D0000186
Should not separate.
Example 11:
Figure G2009102189784D0000187
Judging from the associative meaning of the glyph,
Figure G2009102189784D0000188
With
Figure G2009102189784D0000189
Should not separate.
Example 12: (pig); the associative meaning is an arrow shot into the pig's body, so the parts should not be separated.
Some of these characters have not been deciphered, but their associative meaning can be seen from the glyph.
5) Larger-value rule for code values. When a glyph is so blurred that one cannot tell how many eyes, tillers, or flutter strokes it has, the larger value is used for coding. The comparison is between values at the same code position, not between different code positions: when it is unclear whether there are more or fewer eyes, the larger eye count is taken; when it is unclear whether there are more or fewer eyelashes, the larger eyelash count is taken; and so on.
Example one: (Yin) takes
Figure G2009102189784D00001812
260001 as the standard. Compound character Qi:
Figure G2009102189784D00001813
Example two:
Figure G2009102189784D00001814
(rabbit): the eye count is unclear, so 3 is taken; the tillers look like either three or two, so 3 is taken. Code 343501.
Example three:
Figure G2009102189784D00001815
(mulberry): 247801 and 248901 are both entered in the character library; 248901, which follows the larger tiller count, must not be omitted.
Note that applying the larger-value rule must not depart from the code element definitions given earlier.
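As an illustration of the larger-value rule, the following sketch takes several plausible readings of a blurred character and keeps the larger digit at each code position. Treating the rule as a position-wise maximum over candidate readings is an interpretation of the text, not a procedure the text spells out, and the function name is hypothetical.

```python
def larger_value_code(readings):
    """Return the code formed by the largest digit at each of the six positions."""
    if not readings:
        raise ValueError("at least one reading is required")
    for code in readings:
        if len(code) != 6 or not code.isdigit():
            raise ValueError(f"not a six-digit code: {code!r}")
    # Digits compare correctly as single characters ("8" > "7").
    return "".join(max(code[i] for code in readings) for i in range(6))


# The two readings of the "mulberry" character from example three.
print(larger_value_code(["247801", "248901"]))  # 248901
```

Values are compared only within one position, never across positions, which matches the restriction stated in the rule.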
6), many yards methods of a word.In the routine word of above-mentioned each method, talked about many yards methods, promptly to the possible literary style of certain word all in character library, get any one sign indicating number, can import this word.Being defined as of many yards methods of one word, the radical in the synthetic word only uses the standard root coding, the allosome literary style is not encoded.If the word of conformance with standard font, only by the standard character shape coding.
Example one:
Figure G2009102189784D00001816
Original character code: 006701.
Symmetrical treatment:
Coding: 161200.
Another possible reading by the user:
Figure G2009102189784D00001818
Coding: 180000.
Example two:
Figure G2009102189784D0000191
Separation, symmetrical treatment:
Figure G2009102189784D0000192
152402; a second written form after symmetrization:
Figure G2009102189784D0000193
114802.
The drawback of giving one character multiple codes is that a single character occupies several codes, which in turn leads to codes shared by several characters and may be inconvenient in use. In practice, however, the problem is not serious. In the low-value region of each code position there are many duplicate codes, for example 001201, 002301, 003502, and 101201, but these arise mainly because different characters happen to have the same code. Multiple codes per character occur mainly with characters of complex written form, which mostly occupy the high-value region of each code position, where the collision rate is very low and there are even many unused codes. Examples four and five of the symmetry method all lie in the high-value region of the eye code: of their codes, 772324, 792324, 940003, 950003, and 970003 are each held by only one character, 960003 and 990003 are each shared by three characters, and 980003 by two.
The character library of the input method of the present invention is an oracle bone script font in Windows TrueType format using Unicode encoding. The oracle bone characters are assigned to the code points that Unicode reserves for user-defined characters (the Private Use Area), which runs from 0xE000 to 0xF8FF.
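The range stated above fixes how many characters the font can hold. The short check below computes that capacity and tests whether a character lies in the range; the constant and function names are chosen here for illustration only.

```python
PUA_START = 0xE000  # first code point of the range given in the text
PUA_END = 0xF8FF    # last code point of the range given in the text


def is_private_use(ch):
    """Return True if the single character ch lies in 0xE000..0xF8FF."""
    return PUA_START <= ord(ch) <= PUA_END


print(PUA_END - PUA_START + 1)                        # 6400
print(is_private_use("\ue000"), is_private_use("A"))  # True False
```

A capacity of 6400 code points is therefore the upper bound on the number of distinct glyphs such a font can expose through this area.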
The code table of the oracle bone input method consists of two main parts: a header and a code dictionary. The header stores descriptive information about the input method and the code table, such as the name of the input method, the maximum code length, the offset of the code dictionary, and the size of the code dictionary. The code dictionary stores the input codes and the positions of the corresponding oracle bone characters in the character library. Because the set of oracle bone characters is relatively fixed, the dictionary is built by pairing each character with a code, one entry per pair. Starting at the dictionary's offset within the code table, every 12 bytes store one dictionary entry. The first 6 bytes of each entry store the six-digit input code as ASCII characters; for example, 000011 is stored as 0x30, 0x30, 0x30, 0x30, 0x31, 0x31. These six bytes are followed by two all-zero bytes as a separator, then by two bytes that store the position of the oracle bone character in the character library (for example AA, A1), and finally by two all-zero bytes that mark the end of the entry. Entries with identical codes are stored contiguously in the dictionary. The code table is an indispensable data file for the input method: it specifies the correspondence between the oracle bone codes and the characters in the character library, and inputting a character is in essence a dynamic lookup in the code table file using the character's code.
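The 12-byte entry layout described above can be sketched with Python's `struct` module. This is an illustration under stated assumptions, not the actual file format code: the header layout is not given in the text and is omitted, the two position bytes are kept as raw bytes because their byte order is not stated, and the names `ENTRY`, `pack_entry`, and `parse_dictionary` are hypothetical.

```python
import struct

# 6 ASCII digits, 2 zero bytes, 2 position bytes, 2 zero bytes = 12 bytes.
ENTRY = struct.Struct("<6s2x2s2x")


def pack_entry(code, position):
    """Build one 12-byte dictionary entry from a code and a 2-byte position."""
    if len(code) != 6 or not (code.isascii() and code.isdigit()):
        raise ValueError(f"not a six-digit code: {code!r}")
    if len(position) != 2:
        raise ValueError("position must be exactly 2 bytes")
    return ENTRY.pack(code.encode("ascii"), position)


def parse_dictionary(data, offset=0):
    """Read consecutive entries from data, starting at the dictionary offset."""
    entries = []
    for start in range(offset, len(data) - ENTRY.size + 1, ENTRY.size):
        code, position = ENTRY.unpack_from(data, start)
        entries.append((code.decode("ascii"), position))
    return entries


raw = pack_entry("000011", b"\xaa\xa1")
print(raw.hex(" "))           # 30 30 30 30 31 31 00 00 aa a1 00 00
print(parse_dictionary(raw))  # [('000011', b'\xaa\xa1')]
```

The `x` format code writes zero bytes when packing and skips them when unpacking, which matches the separator and terminator bytes of each entry.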
Because each dictionary entry pairs one oracle bone character with one numeric code, the input method software can obtain the oracle bone characters corresponding to a six-digit code by querying the code table, regardless of whether one code maps to several characters or one character has several codes.
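One way to picture such a lookup is an in-memory index from each code to the list of characters stored under it. The sketch below is an assumption about how the query could be organized, not the implementation of the invention. The codes 006701, 161200, and 180000 are the three codes given for a single character in example one of method 6, and 001201 is one of the duplicate codes mentioned earlier; the glyph labels are placeholders.

```python
from collections import defaultdict


def build_index(entries):
    """Map each six-digit code to the list of glyphs stored under it."""
    index = defaultdict(list)
    for code, glyph in entries:
        if glyph not in index[code]:
            index[code].append(glyph)
    return index


entries = [
    ("006701", "glyph-A"),  # one character reachable through three codes
    ("161200", "glyph-A"),
    ("180000", "glyph-A"),
    ("001201", "glyph-B"),  # one code shared by two characters
    ("001201", "glyph-C"),
]
index = build_index(entries)
print(index["161200"])           # ['glyph-A']
print(index["001201"])           # ['glyph-B', 'glyph-C']
print(index.get("999999", []))   # []
```

When a code returns more than one glyph, the input method would present the list as candidates for the user to choose from.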
The coding method of the present invention resembles the four-corner method for indexing Chinese characters in that both code by glyph features. The difference is that the four-corner method codes according to the features of modern square-block Chinese characters, whereas oracle bone script is not a square-block script and is coded by its own structural features. No additional keyboard is needed for input; the existing computer keyboard suffices, and only the numeric keys 1, 2, 3, 4, 5, 6, 7, 8, 9, 0 are used. Each group of six digits inputs one oracle bone character. Note that the radical standardization described in the present invention is intended only to make counting codes accurate and convenient. Unlike the Wubi (five-stroke) input method for modern Chinese characters, the standard radicals need not be assigned to keys and have nothing to do with the keyboard.
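The grouping of keystrokes into six-digit codes can be sketched as a small buffer. This is a hypothetical illustration of the behaviour described above and does not reflect the Windows IMM-IME interfaces; the class name and the choice to ignore non-digit keys are assumptions.

```python
class SixDigitBuffer:
    """Collect numeric key presses and emit a code after every sixth digit."""

    def __init__(self):
        self._digits = []

    def press(self, key):
        """Return a complete six-digit code, or None while still collecting."""
        if len(key) != 1 or key not in "0123456789":
            return None  # non-digit keys are ignored in this sketch
        self._digits.append(key)
        if len(self._digits) == 6:
            code = "".join(self._digits)
            self._digits.clear()
            return code
        return None


buffer = SixDigitBuffer()
codes = [code for code in map(buffer.press, "391200x32") if code is not None]
print(codes)  # ['391200']
```

Each emitted code would then be looked up in the code table to obtain the candidate characters.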
The present invention encodes according to the font characteristics of the inscriptions on bones or tortoise shells self, uses six digit numeric code, and the repetition rate of coding is lower.According to the IMM-IME structure that Windows operating system provides, designed and developed input method.Realize input in computer system, printed inscriptions on bones or tortoise shells literal.The principle of coding also is applicable to inscription on ancient bronze objects.After using the present invention to finish the inscriptions on bones or tortoise shells, inscription on ancient bronze objects character library, can realize the ancient writing office automation.The more important thing is, provide convenience for decoding ancient writing.
Oracle standard radical/glyph table
Figure G2009102189784D0000201
Figure G2009102189784D0000221
Figure G2009102189784D0000231
Figure G2009102189784D0000261
Figure G2009102189784D0000271
Figure G2009102189784D0000281

Claims (7)

1. An input method of 6-code number oracle, characterized by comprising the following steps:
(1) a step of selecting code elements by correctly comparing the structure of the oracle character against the code element definitions:
Each structural part of the oracle character is coded into six code elements: eye code, eyelash code, tiller code, branch code, flutter code, and structure code. The six code elements each correspond to the digits 0-9 on the computer keyboard and together form a code element sequence for numeric input. The sequence is arranged from left to right in the form <eye code><eyelash code><tiller code><branch code><flutter code><structure code>. The code elements are defined as follows:
Eye code: the eye code is the total number of blanks enclosed by eyes, where an eye is a stroke formed by a closed curve in the oracle character;
The rules are as follows:
a. a thick (filled) stroke counts as an eye; b. a round dot in a stroke counts as an eye; c. an eye inside an eye is counted separately;
Eyelash code: the eyelash code is the total number of eyelashes, where an eyelash is a stroke in the oracle character that is connected to an eye and does not intersect any other stroke;
The rules are as follows:
a. when an eyelash stroke passes through an eye, its parts are counted separately; b. an eyelash stroke inside an eye counts as an eyelash; c. an eyelash stroke between eyes counts as an eyelash;
Tiller code: the tiller code is the total number of intersection points in the tiller strokes, where tiller strokes are intersecting strokes in the oracle character that do not form an eye;
Branch code: the branch code is the minimum number of strokes with which the tiller strokes can be described;
Flutter code: the flutter code is the total number of flutters, where a flutter is a floating stroke in the oracle character that forms neither an eye nor a tiller;
Structure code: the structure code is the total number of block structures, where block structures are the parts of the oracle character that do not adhere to one another;
The rules are as follows:
a. when an eye in an independent character contains an eye, an eyelash, a tiller, or a flutter, the structure code is 0;
b. when an eye is connected to an eye it contains, whether by a single eyelash or by contact at one point, this is an eye within an eye and the structure code is 0; when an eye is surrounded by several eyes, these are adjacent eyes and the structure code is 1;
c. when an eye contains an eye, an eyelash, a tiller, or a flutter stroke and is only one part of a character composed of several components, its structure code is 1, and it is counted toward the structure code together with the other structural parts;
The eye code, eyelash code, tiller code, branch code, flutter code, and structure code each have a maximum value of 9; a count exceeding 9 is still recorded as 9;
(2) input step: after the code elements have been selected in the above step, the six digit code elements are entered in order by keystroke on the computer keyboard;
(3) selection step: from the oracle characters listed in the oracle character box on the screen for the entered six code elements, the required oracle character is selected with a number key.
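The three steps of claim 1 can be sketched as follows. This is a minimal sketch under stated assumptions: the names `ElementCounts`, `to_code`, `lookup`, and `select` are hypothetical, the candidate table holds placeholder entries rather than real oracle characters, and treating the selection key as a 1-based index into the candidate list is an assumption, since the claim only says that a number key is used.

```python
from dataclasses import astuple, dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ElementCounts:
    """Counts in the claimed left-to-right order of the code element sequence."""
    eye: int
    eyelash: int
    tiller: int
    branch: int
    flutter: int
    structure: int


def to_code(counts: ElementCounts) -> str:
    """Step (1): build the six-digit code; a count above 9 is recorded as 9."""
    values = astuple(counts)
    if any(v < 0 for v in values):
        raise ValueError("counts cannot be negative")
    return "".join(str(min(v, 9)) for v in values)


def lookup(code: str, table: Dict[str, List[str]]) -> List[str]:
    """Step (2): return the candidates listed for an entered six-digit code."""
    if len(code) != 6 or not (code.isascii() and code.isdigit()):
        raise ValueError("expected exactly six digits")
    return list(table.get(code, []))


def select(candidates: List[str], number_key: int) -> str:
    """Step (3): pick a candidate by number key (1-based index is an assumption)."""
    if not 1 <= number_key <= len(candidates):
        raise IndexError("no candidate under that number key")
    return candidates[number_key - 1]


# Placeholder table for illustration only; not real oracle data.
HYPOTHETICAL_TABLE = {"102391": ["glyph-A", "glyph-B"], "010011": ["glyph-C"]}

code = to_code(ElementCounts(eye=1, eyelash=0, tiller=2, branch=3, flutter=12, structure=1))
print(code, select(lookup(code, HYPOTHETICAL_TABLE), 2))
```

In the example the flutter count of 12 is recorded as 9, so the code becomes 102391 and the second listed candidate is chosen.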
2. The input method of 6-code number oracle according to claim 1, characterized in that: for tiller strokes formed by T-shaped or X-shaped intersections, the tiller code and the branch code can be expressed by the following formula:
branch code = tiller code + number of tiller clusters;
The rules are as follows:
a. the number of tiller clusters is the total count of tiller clusters, where a tiller cluster is a piece that contains tiller strokes and is separated from the others by blank space or by an eye;
b. a tiller stroke that passes through an eye forms two tiller clusters, and the tiller code and branch code are calculated for each separately;
c. for strokes on an eye, if three lines extend from one point on the eye to the same side of the eye, they are tiller strokes; if two lines extend from one point on the eye to the same side of the eye, they are eyelashes.
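The formula of claim 2 can be illustrated with a short sketch. The names `TillerCluster`, `tiller_code`, and `branch_code` are hypothetical. Applying the limit of 9 from claim 1 after summing is an assumption about how the two rules combine.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TillerCluster:
    """One tiller cluster, separated from the others by blank space or by an eye."""
    intersections: int  # number of T- or X-shaped intersection points in the cluster


def tiller_code(clusters: List[TillerCluster]) -> int:
    """Total number of intersection points, recorded as 9 when it exceeds 9."""
    if any(c.intersections < 0 for c in clusters):
        raise ValueError("intersection counts cannot be negative")
    return min(sum(c.intersections for c in clusters), 9)


def branch_code(clusters: List[TillerCluster]) -> int:
    """branch code = tiller code + number of tiller clusters, recorded as 9 when larger."""
    if any(c.intersections < 0 for c in clusters):
        raise ValueError("intersection counts cannot be negative")
    return min(sum(c.intersections for c in clusters) + len(clusters), 9)


# One X-shaped crossing: 1 intersection, 1 cluster, so 2 strokes.
single_cross = [TillerCluster(intersections=1)]
# A second, separate cluster: one vertical crossed by three horizontals (3 intersections).
two_clusters = [TillerCluster(1), TillerCluster(3)]

print(tiller_code(single_cross), branch_code(single_cross))
print(tiller_code(two_clusters), branch_code(two_clusters))
```

A tiller stroke that passes through an eye would be represented here as two separate `TillerCluster` entries, in line with rule b.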
3. The input method of 6-code number oracle according to claim 1, characterized in that: for tiller strokes formed by three or four strokes intersecting at one point, the branch code is the number of strokes obtained by counting them directly.
4. The input method of 6-code number oracle according to claim 1, characterized in that: when a flutter stroke touches an eye and forms an eyelash or a new eye, it is coded according to the following rule:
The character is analyzed by its overall outline and the stroke is compared with the other, symmetrical flutter stroke; if the symmetrical stroke is a flutter stroke, the stroke is counted as a flutter; otherwise it is counted according to the new element formed.
5. The input method of 6-code number oracle according to claim 1, characterized in that: when inputting an oracle character that contains a radical and is blurred or has missing strokes, step (1) is preceded by a radical normalization step, namely standardizing the oracle character against the oracle standard radical/glyph table.
6. The input method of 6-code number oracle according to claim 1, characterized in that: when inputting an oracle character that has a symmetrical structure and is blurred or has missing strokes, step (1) is preceded by a character symmetrization step, namely symmetrizing the oracle character to repair the damaged strokes.
7. The input method of 6-code number oracle according to claim 1, characterized in that: when inputting an oracle character that is blurred or has missing strokes, step (1) is preceded by a block structure separation step, with the separation rules as follows: for a character containing radicals, or an oracle character containing independent characters, these radicals or independent characters are separated from the other parts, except in the following cases: a. when the radical represents a part of an animal or human body and is connected to the human or animal, it is not separated; b. when the meaning or structure of the character requires it, it is not separated.
CN2009102189784A 2009-11-16 2009-11-16 Input method of 6-code number oracle Expired - Fee Related CN101702101B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009102189784A CN101702101B (en) 2009-11-16 2009-11-16 Input method of 6-code number oracle

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009102189784A CN101702101B (en) 2009-11-16 2009-11-16 Input method of 6-code number oracle

Publications (2)

Publication Number Publication Date
CN101702101A true CN101702101A (en) 2010-05-05
CN101702101B CN101702101B (en) 2011-04-20

Family

ID=42157019

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009102189784A Expired - Fee Related CN101702101B (en) 2009-11-16 2009-11-16 Input method of 6-code number oracle

Country Status (1)

Country Link
CN (1) CN101702101B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488798A (en) * 2013-10-14 2014-01-01 大连民族学院 Automatic oracle identification method
CN103488798B (en) * 2013-10-14 2016-06-15 大连民族学院 A kind of Automatic oracle identification method

Also Published As

Publication number Publication date
CN101702101B (en) 2011-04-20


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110420

Termination date: 20171116