CN85105023B - Chinese character stroke indexing coding method and processing method thereof - Google Patents
Chinese character stroke indexing coding method and processing method thereof Download PDFInfo
- Publication number
- CN85105023B CN85105023B CN85105023A CN85105023A CN85105023B CN 85105023 B CN85105023 B CN 85105023B CN 85105023 A CN85105023 A CN 85105023A CN 85105023 A CN85105023 A CN 85105023A CN 85105023 B CN85105023 B CN 85105023B
- Authority
- CN
- China
- Prior art keywords
- stroke
- chinese
- chinese character
- character
- strokes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The stroke indexing coding method is used to index the Chinese characters according to the stroke types of the first or the first and the second strokes of the initial stroke of the Chinese character, and five strokes and their digital codes are used to reclassify the Chinese characters to reduce the indexing range. The stroke indexing coding method is related to Chinese and English computers by the utility program of the operating system, adopts C language and UNIX operating system, and the Chinese character memory code is arranged according to the number of strokes of Chinese characters.
Description
Involved in the present invention is Chinese-character stroke searching coding method and disposal route thereof.More specifically say, related to a kind of method of Chinese character coding easy to learn and the method for computer input of Chinese characters thereof, and be to realize with the utility routine of operating system.
At present, the domestic method of Chinese character coding and disposal route thereof have many kinds.In general, these methods of Chinese character coding and disposal route thereof are often:
1. adopt numerous and diverse indexing system of Chinese Characters input Chinese character, need special training during use;
2. used input keyboard, same key button is represented multiple symbol, is difficult to memory; And used key button is more, is awkward;
3. adopt the computer program of assembly language, can only be used for specific computing machine;
4. Chinese word library is made on the hardware of terminating machine, lacks flexibility, also uneconomical.
The objective of the invention is to adopt Chinese-character stroke searching coding method, realize that coding rule is easy to learn easy-to-use, used keyboard does not seldom have the trained non-full-time personnel can very fast grasp yet.In the cooperation of computer hardware, have big elasticity in addition, be not limited to a certain specific computing machine.
The output of Chinese language computer, main is prints Chinese character, all Chinese language computers all show Chinese character with dot matrix, there is no special character, the different Chinese characters of just importing.So-called input Chinese character in general, is just specified an individual character in the character library that prestores.In other words, just select a kind of indexing system of Chinese Characters." character indexing method compiling method " of the present invention is the indexing system for Chinese characters of improvement, according to first or first, second stroke kind searching of the first stroke of a Chinese character, no longer counts stroke.
Every Chinese character just has various strokes, be divided into tens kinds passable, be divided into several also passable.The present invention is categorized as the basis with the stroke of Chinese character, so at first will define stroke.Even the stroke title is identical, but definition is different, and usage is also different.Stroke kind of the present invention is according to geometric viewpoint, the stroke of Chinese character reduce horizontal stroke, straight, tiltedly, five kinds of points, folding.Because Chinese character also is a kind of geometric figure of plane, the figure on plane has only two kinds of Points And lines, and line divides two kinds of straight line and non-rectilinears again, and straight line divides horizontal stroke, straight, oblique three kinds again, is five kinds altogether.Diagramming is its integrality and generality more as can be seen.
These strokes are to be as the criterion with hand-written script, also should be all right if be as the criterion with printing type face, and this is the agreement problem, and are irrelevant with method.Simultaneously, so-called straight line neither strict straight line.Especially casting aside to press down and all be with some arcs, but always work as its straight line on the idea of calligraphy, below is the definition of various strokes:
No matter the horizontal stroke that comprises any horizontal is length.
Directly comprise any rectilinear stroke, no matter band colludes or is not with and colludes, no matter also length.
The stroke that tiltedly comprises the non-straight orthoscopic of any non-horizontal stroke is pressed down no matter cast aside, no matter also length.
Point comprises any type of point, for example various points in " heart " word.
Folding comprises the non-directional stroke of any non-point, and tangible angle and turnover are arranged, and comprises the stroke that band colludes, and is no matter large or small.
Above stroke all has one to represent number, and it is as follows now to arrange its number:
Horizontal=1 directly=2 oblique=3 point=4 rolls over=5
The purpose of the indexing system of Chinese Characters all is the scope of progressively dwindling searching, and last only surplus next word will be looked for." character indexing method compiling method " can be used in indexing system for Chinese characters, also can be used in the phonetic indexing system of Chinese Characters or other the indexing system of Chinese Characters, can be used for dwindling the scope of searching equally, finally specified certain individual character.Below how " character indexing method compiling method " is applied in the Chinese character input of Chinese language computer with regard to accompanying drawings.
Fig. 1 is the radical-code table of " character indexing method compiling method ".
Use Computer Processing Chinese, might not create the method for novel input Chinese character, traditional indexing system for Chinese characters is improved also can be reached same purpose.Indexing system for Chinese characters has two shortcomings: the one, and the difficult bonding part head of some word, the 2nd, count stroke, the former can be listed in several possible radicals by which characters are arranged in traditional Chinese dictionaries to same word, and the latter can solve with " character indexing method compiling method ".
Adopt " character indexing method compiling method " needn't count stroke, only see the stroke kind of the first stroke of a Chinese character, look into radicals by which characters are arranged in traditional Chinese dictionaries and see stroke, stroke also seen in the verification certificate word.Look into radicals by which characters are arranged in traditional Chinese dictionaries and utilize the radical-code table, see accompanying drawing one.It is divided into 23 classes to all radicals by which characters are arranged in traditional Chinese dictionaries by the stroke kind that plays two of nibs, determine the first stroke of a Chinese character after, be easy to just find radical number, again the 'Radical classification ' number is added the number of being annotated under each radicals by which characters are arranged in traditional Chinese dictionaries.In Computer Processing Chinese, " # " mark can appear on the screen, and the meaning will be thrown radical number into, specifies radicals by which characters are arranged in traditional Chinese dictionaries.
Specify after the radicals by which characters are arranged in traditional Chinese dictionaries, have following three kinds of situations:
1. number of words radicals by which characters are arranged in traditional Chinese dictionaries seldom for example " are built " word in Yin portion, and the stroke of the first stroke of a Chinese character is that folding is oblique, and the oblique class of folding at the radical-code table finds Yin portion soon, and its radical-code is 534, throws this number into and has just specified lonely portion.The word of Yin portion seldom, so screen will show automatically that all words are as follows:
The court of a feudal ruler is prolonged and is built
1 2 3
A number is all arranged under every word on the screen, and the number of building word is 3, throws 3 into and just specifies and build word.
2. number of words radicals by which characters are arranged in traditional Chinese dictionaries neither too much or too little, for example " city " word is in towel portion, the first stroke of a Chinese character is straight folding, and radical number is 233, throws this number into, can occur on the screen question mark "? " the meaning is to throw the strokes number of a first stroke of a Chinese character (stroke of radicals by which characters are arranged in traditional Chinese dictionaries is disregarded) into, and the first stroke of a Chinese character of city's word is that some number is 4, throw 4 into, the word of towel portion's point first stroke of a Chinese character just shows as follows on screen:
Supreme Being's large, oblong sheet of silk with an appropriate message attached building, city
1 2 3 4
Throw 1 into and just refer to city's word.
3. a lot of radicals by which characters are arranged in traditional Chinese dictionaries of number of words, for example " curtain " word is in Lv portion, and the first stroke of a Chinese character is anyhow, and radical number is 122, throw number into, two question marks of meeting demonstration on the screen " ", meaning will be thrown two first stroke of a Chinese character numbers of " curtain " word into, the first stroke of a Chinese character of " curtain " word is straight folding, throws 25 into, and screen just shows below:
1 2 3 4 5 6
7 8 9 10 11 12 13 14 15
Day is said to transplant and sprouts Chang Pueraria lobota creeping weed tiny
16 17 18 19 20 21 22
23 24 25 26 27 28 29
30 31 32 33 34 35 36 37
Other English cocoon amaranths
38 39 40 41
The word that Lv portion directly turns up pen still is too many, and 41 words are arranged, and also will classify according to the identical font of part again, just is convenient to check.The title of these groups and the meaning of giving are all very obvious, and according to these class names, curtain word one fixes on " not word " class, needn't see the word of other class again, finds an act word from " not word " class soon, throw 32 into and just specify the curtain word.
Subsidiary statement be that the radical-code table is as a reference, specialize in the beginner and use, skilled a little after, can remember the radical number used always at least, needn't often check the radical-code table.
From above example, can obtain the definition of " character indexing method coding ".So-called " pen " is to say to specify radicals by which characters are arranged in traditional Chinese dictionaries and specify individual character all to be as the criterion with the stroke of the first stroke of a Chinese character.So-called " shape " is to say sometimes according to the identical word classification of part.Alleged " form of a stroke or a combination of strokes " of the present invention is a special noun, and " character indexing method method " also is a special noun, and be different with general usage.
C language and UNIX operating system are undivided, the existing final conclusion of their advantage.Needn't speak more, the present invention selects C language and UNIX operating system for use, is because can not be subjected to the restriction of computer hardware, to the development of computer hardware with rapid changepl. never-ending changes and improvements, has more the adaptability of height.
The present invention just can make " character indexing method compiling method " input Chinese character obtain practical application by set up some programs in UNIX operating system.It is the utility routine of the processing Chinese under the operating system.Set up these programs, just can change into the computing machine of supporting C language and UNIX operating system the computing machine of Chinese and English dual-purpose.As Chinese language computer, as long as under UNIX operating system, increase by four functions of handling Chinese:
1, cscanf this be function from keyboard input Chinese character, the suitable original scanf of UNIX operating system is also promptly from the function of keyboard inputting English letter.
2, cprintf this be that Chinese character is presented at function on the screen, quite original printf also promptly is presented at letter the function of screen.
3, cfprintf this be that Chinese character is stored into a function of specifying the shelves volume, quite original fprintf also promptly is stored into a function of specifying the shelves volume to letter.
4, CIPr this be the function of printing Chinese character, quite original lpr also is the function of letter punch.
Adopt 24 * 24 idea square formation, 72 bytes wanted in each individual character, even adopt 16 * 16 idea square formation, 32 bytes also wanted in each individual character, so internal memory mostly adopts the tilde sign indicating number of four figures.The encode Chinese characters for computer that the national standard message exchange of the People's Republic of China (PRC) is used is exactly a kind of standard memory.The present invention's four-digit number coding that what are arranged according to Chinese-character stroke.The benefit of this coding is how much to arrange Chinese material by stroke with existing " sort program ", just as English can be arranged according to lexicographic order.This internal memory sign indicating number by how many arrangements of stroke, naming is that " Zhang Shi encode Chinese characters for computer " abbreviation " is opened sign indicating number ".Opening sign indicating number and international code has the cross reference relation, with GB and no conflict part.
About the identification problem of a sign indicating number with ASCII character, in software systems of the present invention, be non-existent, every function of handling Chinese of using, handled character must be a sign indicating number, will never be ASCII character.Same, every function of using other, handled character must be ASCII character, open sign indicating number anything but.
The present invention adopts Chinese character " character indexing method compiling method ", avoids the shortcoming of number stroke searching, be the significant improvement that traditional indexing system for Chinese characters is made, and its rule is easy to learn, training that need not be special during use; The searching scope of utilizing screen display to dwindle has one by one replaced big keyboard, forms the chicoder of improvement.Make the user can be with the method Computer Processing Chinese of being familiar with; Only with in the keyboard 1,2,3,4,5 five numerical keys represent respectively horizontal stroke, straight, tiltedly, point, five kinds of strokes of folding, in addition, without other symbolic key in order to operation; The character indexing method compiling method is write as application program with the C language, can adapt to the various novel computers that make rapid progress, and also be easy to the computer software that the C language is write as is changed into Chinese edition, saves the manpower and materials that develop software; What to open sign indicating number, be convenient to the Chinese character data by stroke series arrangement as Chinese character internal memory sign indicating number.The present invention particularly is applicable to the printing Chinese form of business administration and Chinese conversational application of leading body's decision-making.
Claims (8)
1, a kind of computer Chinese input method, wherein:
With first stroke of a Chinese character stroke reduce horizontal stroke, straight, tiltedly, point, five kinds of strokes of folding;
Above-mentioned five kinds of strokes are represented with the numerical code on the keyboard respectively;
It is characterized in that,
All Chinese character radicals are divided into some classes by the stroke that plays two of nibs, and every class is represented with 2 above-mentioned numerical codes;
Above-mentioned each Chinese character radicals is with 1 digitized representation less than two;
Will input to the Chinese character of computing machine for each, the first stroke of its radicals by which characters are arranged in traditional Chinese dictionaries or individual character or the first stroke, second above-mentioned numerical code and above-mentioned numeral less than two are represented in input earlier.Select according to screen display then, thereby with this Chinese character input computing machine.
2, according to the input method of claim 1, the corresponding relation of wherein said five kinds of strokes and numerical code is: horizontal in 1; Directly corresponding to 2; Tiltedly corresponding to 3; O'clock corresponding to 4; Folding is corresponding to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN85105023A CN85105023B (en) | 1985-07-02 | 1985-07-02 | Chinese character stroke indexing coding method and processing method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN85105023A CN85105023B (en) | 1985-07-02 | 1985-07-02 | Chinese character stroke indexing coding method and processing method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN85105023A CN85105023A (en) | 1987-01-07 |
CN85105023B true CN85105023B (en) | 1988-08-17 |
Family
ID=4794206
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN85105023A Expired CN85105023B (en) | 1985-07-02 | 1985-07-02 | Chinese character stroke indexing coding method and processing method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN85105023B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1047447C (en) * | 1994-04-15 | 1999-12-15 | 蔡勇飞 | Computer imput method of figure-sign coding |
CN1048561C (en) * | 1994-07-19 | 2000-01-19 | 肖宝林 | Chinese character input method for computer |
HK1023263A2 (en) | 1999-07-28 | 2000-07-28 | Qcode Information Technology Ltd | Chinese character input method and device |
CN103425261B (en) * | 2013-06-13 | 2016-09-07 | 吴礼明 | Compound code element numeric keyboard shape code Chinese input |
CN103838393B (en) * | 2014-03-03 | 2017-10-13 | 万仁芳 | Hanzi structure number character learning input method |
-
1985
- 1985-07-02 CN CN85105023A patent/CN85105023B/en not_active Expired
Also Published As
Publication number | Publication date |
---|---|
CN85105023A (en) | 1987-01-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5197810A (en) | Method and system for inputting simplified form and/or original complex form of Chinese character | |
US4684926A (en) | Universal system of encoding chinese characters and its keyboard | |
WO2003104963A1 (en) | Input method for optimizing digitize operation code for the world characters information and information processing system thereof | |
GB2221780A (en) | System for encoding a collection of ideographic characters | |
CN1003326B (en) | Optimized five-stroke font coding method and keyboard thereof | |
CN1003890B (en) | An Zijie's Pen Shape Computer Coding Method for Chinese Characters and Its Keyboard | |
CN85105023B (en) | Chinese character stroke indexing coding method and processing method thereof | |
GB2116341A (en) | Interactive chinese typewriter | |
CN1006014B (en) | Non-coding Chinese character processing method and input keyboard | |
CN1045878A (en) | Computing machine Chinese sound-digit code input technology | |
Wu et al. | Computer processing of Chinese characters: An overview of two decades' research and development | |
CN85102473B (en) | Chinese character information processing technology by sequential etymon method | |
CN100373307C (en) | International exchange Chinese character software | |
CN1022350C (en) | Chinese alphabet coding input method | |
CN1288187B (en) | Computer Chinese character input method and its keyboard | |
TWI280491B (en) | Easyten Chinese text processing and inputting method | |
CN1389775A (en) | Chinese character digital code input method | |
CN106951106A (en) | The alphanumeric code of the new binary digits of magnificent word 16 | |
CN85107060B (en) | Dot matrix method for writing and transmitting handwriting | |
WO2019156386A1 (en) | Method for generating chinese character address/idiom | |
KR820000406B1 (en) | Korean(han geul)electronic typewriter and communication equipment system | |
CN107340883A (en) | Handan code input method | |
CN1460913A (en) | One-code two-form quick Chinese digital coding input method | |
CN101833378B (en) | Standard five-stroke input method and keyboard thereof | |
CHEN et al. | New artificial intelligence based method of inputting Chinese characters for computer usage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C13 | Decision | ||
GR02 | Examined patent application | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |