CN1196535A - The method of automatic labeling of pronunciation symbols - Google Patents
The method of automatic labeling of pronunciation symbols
- Publication number
- CN1196535A (application number CN97110364A)
- Authority
- CN
- China
- Prior art keywords
- code
- phonetic
- character
- chinese character
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Document Processing Apparatus (AREA)
Abstract
A method for automatically annotating the pronunciation symbol of a character is disclosed, which automatically displays the character's pronunciation symbol on a display device. First, the internal code of the character is read and the offset of the internal code in an index is generated. A pronunciation code is then read from the index according to the offset, and a pronunciation symbol is obtained from the pronunciation code. Finally, the pronunciation symbol is displayed on the display device.
Description
The present invention relates to a method for automatically annotating text with pronunciation symbols, and in particular to a method for automatically annotating Chinese characters with zhuyin (phonetic notation) symbols and Roman pinyin symbols.
In the Chinese display environment of a conventional computer, the pronunciation symbols of Chinese characters can be displayed only by keying them in manually. Manual entry, however, is highly error-prone. Moreover, when the corresponding Chinese character is changed, the keyed-in pronunciation symbols do not change with it and must be keyed in again by hand, so the approach lacks flexibility in use.
In addition, pronunciation symbols such as zhuyin symbols occupy 42 keys in total, and memorizing their layout is quite difficult. Different machine models may also use different layouts, which causes inconvenience in use. Furthermore, each small key is usually printed with English, zhuyin, Cangjie, and simplified-input symbols, which makes manual entry even more difficult.
In view of the many shortcomings and inconveniences of the conventional pronunciation-symbol annotation methods described above, the main object of the present invention is to display the pronunciation symbols of text on a display device automatically.
Another object of the present invention is to avoid the shortcomings of manual entry of pronunciation symbols, such as keying errors and slow speed. For a polyphonic character, the most common pronunciation is displayed, and a specific pronunciation can be annotated separately.
To achieve the above objects, the invention provides a method for automatically annotating pronunciation symbols, which automatically displays the pronunciation symbol of a character on a display device. The method comprises the following steps: obtaining the internal code of the character; generating the offset of the internal code in an index; reading a pronunciation code from the index according to the offset; generating the pronunciation symbol from the pronunciation code; and displaying the pronunciation symbol on the display device.
In accordance with the above objects, the invention also provides a method for automatically annotating the pronunciation symbols of Chinese characters, which automatically displays a pronunciation symbol of a Chinese character, for example a zhuyin symbol, on a display device. First, the internal code of the Chinese character is obtained from a text file, and it is determined whether the Chinese character is followed by a special symbol denoting a polyphonic character. The index offset of the internal code is generated, and a zhuyin code comprising an initial, a medial, a final, and a tone is read from the index according to the offset. If the Chinese character is followed by the special symbol, the plurality of symbols following the special symbol are instead converted into the zhuyin code. Finally, the zhuyin symbols are generated from the zhuyin code and displayed on the display device.
Another aspect of the present invention provides a method that automatically displays a pronunciation symbol of a Chinese character, for example a Roman pinyin symbol, on a display device. First, the internal code of the Chinese character is obtained from a text file, and it is determined whether the Chinese character is followed by a special symbol denoting a polyphonic character. Then, the index offset of the internal code is generated, and a Roman pinyin code comprising a pinyin code, a tone position number, and a tone mark is read from the index according to the offset. If the Chinese character is followed by the special symbol, the plurality of symbols following the special symbol are instead converted into the Roman pinyin code. Finally, the Roman pinyin symbols are generated from the Roman pinyin code and displayed on the display device.
Fig. 1 is a flowchart of the present invention.
Fig. 2 shows an example display of automatic annotation with zhuyin symbols under BIG_5 encoding, achieved according to the present invention.
Fig. 3 shows an example display of automatic annotation with Roman pinyin symbols under GB2312 encoding, achieved according to the present invention.
Fig. 1 is a flowchart of the present invention. First, in step 10, an index correspondence table is built. The table is built from the internal index codes of a Chinese character system such as BIG_5 or GB2312, which are inverse-converted to generate the required index table; the pronunciation codes in the index are normally arranged in the order of the Chinese characters' internal codes. For zhuyin symbols, the code stored for a Chinese character comprises four parts: initial, medial, final, and tone. For Roman pinyin symbols, the code comprises three parts: pinyin code, tone position number, and tone mark. For Chinese characters with multiple pronunciations, the most common pronunciation is stored in the index. Table 1 lists the initial, medial, final, and tone parts of the 42 zhuyin symbols.
Next, in step 11, a text file containing Chinese characters is opened, and the internal codes of the Chinese characters are obtained from the characters in the file. The Chinese characters can also be obtained from a keyboard, as shown in step 12. Step 13 determines whether a Chinese character is followed by a special symbol denoting a polyphonic character, such as ">". If the Chinese character is not followed by the special symbol, the method proceeds to step 14, where the index offset is generated from the internal code. Taking the BIG_5 Chinese character encoding as an example, the zhuyin code comprises two bytes:
AAAAAABB CCCCDDDD, where the six highest bits AAAAAA of the first byte store the initial, the two lowest bits BB of the first byte store the medial, the four highest bits CCCC of the second byte store the final, and the four lowest bits DDDD of the second byte store the tone. Take the zhuyin of the character "event" as an example. Its initial is the 9th symbol of the initial part (see Table 1), so AAAAAA = 9 = 001001; its medial is the 2nd symbol of the medial part, so BB = 2 = 10; it has no final part, so CCCC = 0 = 0000; and its tone is the 4th symbol of the tone part, so DDDD = 4 = 0100. The code for the character "event" is therefore AAAAAABB CCCCDDDD = 00100110 00000100.
The offset is generated as follows. When the low byte of the internal code is greater than 127, the offset formula is:
offset = 157 * (high byte of internal code - 164) + low byte of internal code - 98
When the low byte of the internal code is less than or equal to 127, the offset formula is:
offset = 157 * (high byte of internal code - 164) + low byte of internal code - 64
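The two-byte layout AAAAAABB CCCCDDDD described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `pack_zhuyin` and `unpack_zhuyin` are hypothetical and do not come from the patent, and the four fields are taken to be the 1-based positions in Table 1, with 0 meaning that the part is absent.

```python
def pack_zhuyin(initial: int, medial: int, final: int, tone: int) -> bytes:
    """Pack four Table 1 positions into the layout AAAAAABB CCCCDDDD.

    Hypothetical helper: 0 is assumed to mean "part absent".
    """
    if not (0 <= initial < 64 and 0 <= medial < 4
            and 0 <= final < 16 and 0 <= tone < 16):
        raise ValueError("field does not fit its bit width")
    first = (initial << 2) | medial   # AAAAAA in the top 6 bits, BB in the low 2
    second = (final << 4) | tone      # CCCC in the top 4 bits, DDDD in the low 4
    return bytes([first, second])


def unpack_zhuyin(code: bytes) -> tuple:
    """Split a two-byte zhuyin code back into (initial, medial, final, tone)."""
    first, second = code[0], code[1]
    return first >> 2, first & 0b11, second >> 4, second & 0b1111


# The worked example from the text: initial 9, medial 2, no final, tone 4.
example = pack_zhuyin(initial=9, medial=2, final=0, tone=4)
print(format(example[0], "08b"), format(example[1], "08b"))  # 00100110 00000100
```

Unpacking the same two bytes returns the four positions, which can then be looked up in Table 1 to obtain the symbols to display.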
If the GB2312 encoding is used, the Roman pinyin code comprises seven bytes: bytes 1 to 6 store the pinyin code, the high 4 bits of byte 7 store the tone position number, and the low 4 bits of byte 7 store the tone mark. The offset formula is:
offset = 94 * (high byte of internal code - 164) + low byte of internal code - 161
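As a minimal sketch, the offset formulas for the two encodings can be written as follows. The function names `big5_offset` and `gb2312_offset` are hypothetical, and every constant, including the base value 164 in the GB2312 formula, is copied from the text as given rather than checked against either character set.

```python
def big5_offset(high: int, low: int) -> int:
    """Index offset for a BIG_5 internal code, per the formulas in the text."""
    if low > 127:
        return 157 * (high - 164) + low - 98
    return 157 * (high - 164) + low - 64


def gb2312_offset(high: int, low: int) -> int:
    """Index offset for a GB2312 internal code, per the formula in the text."""
    return 94 * (high - 164) + low - 161


# With high byte 164, low bytes 64..126 and 161..254 map to consecutive offsets.
print(big5_offset(164, 64), big5_offset(164, 126), big5_offset(164, 161))  # 0 62 63
print(gb2312_offset(164, 161), gb2312_offset(165, 161))  # 0 94
```

The two BIG_5 branches differ only in the constant subtracted, which closes the gap between low bytes 126 and 161 so that the offsets stay contiguous: 63 low-byte values in the first range and 94 in the second give the multiplier 157.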
In step 15, a zhuyin code or Roman pinyin code is read from the index according to the offset; the pronunciation symbol is then obtained from this pronunciation code, Table 1, and the coding rules above (step 16).
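Step 15 can be sketched as a fixed-size record lookup. The patent does not spell out the physical layout of the index, so this example assumes that the index is a flat byte string of equal-sized records (2 bytes for a zhuyin code, 7 bytes for a Roman pinyin code) and that record n starts at byte n times the record size. It also assumes that the six pinyin bytes hold ASCII letters padded with zero bytes. The names `read_record` and `decode_pinyin_record` are hypothetical.

```python
def read_record(index: bytes, offset: int, record_size: int) -> bytes:
    """Return the pronunciation code stored at the given offset.

    Assumes a flat index of fixed-size records (an illustrative layout).
    """
    if offset < 0:
        raise IndexError("negative offset")
    start = offset * record_size
    record = index[start:start + record_size]
    if len(record) != record_size:
        raise IndexError("offset lies outside the index")
    return record


def decode_pinyin_record(record: bytes) -> tuple:
    """Split a 7-byte Roman pinyin code into (letters, tone position, tone mark)."""
    if len(record) != 7:
        raise ValueError("a Roman pinyin code is seven bytes long")
    letters = record[:6].rstrip(b"\x00").decode("ascii")  # assumed NUL padding
    tone_position = record[6] >> 4    # high 4 bits of byte 7
    tone_mark = record[6] & 0x0F      # low 4 bits of byte 7
    return letters, tone_position, tone_mark


# A made-up two-record index, for illustration only.
index = b"ba\x00\x00\x00\x00\x21" + b"zhuang\x34"
print(decode_pinyin_record(read_record(index, 1, 7)))  # ('zhuang', 3, 4)
```

The tone position number presumably identifies the letter that carries the tone mark when the syllable is drawn; the sketch only extracts the two nibbles and leaves rendering aside.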
However, if the Chinese character is followed by the special symbol, the pronunciation code formed by the plurality of symbols following the special symbol is converted into a zhuyin code according to Table 1 and the coding rules above. For example, suppose the text file contains the verse line 青山郭外斜 ("green hills slant beyond the city wall"), followed by the special symbol and the digits 141012. According to Table 1, 14 represents the initial ㄒ, 1 represents the medial ㄧ, 01 represents the final ㄚ, and 2 represents the tone (second tone).
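The polyphone override can be sketched as a small parser. The digit layout used here (two digits for the initial, one for the medial, two for the final, one for the tone) is inferred from the single example 141012 and is an assumption, as are the names `parse_override` and `override_to_code`. The packing follows the AAAAAABB CCCCDDDD layout given earlier for the zhuyin code.

```python
def parse_override(digits: str) -> tuple:
    """Split an override such as "141012" into (initial, medial, final, tone).

    Assumes a fixed 2-1-2-1 digit layout, inferred from the one example given.
    """
    if len(digits) != 6 or not (digits.isascii() and digits.isdigit()):
        raise ValueError("expected exactly six ASCII digits")
    initial = int(digits[0:2])
    medial = int(digits[2])
    final = int(digits[3:5])
    tone = int(digits[5])
    if initial > 63 or medial > 3 or final > 15:
        raise ValueError("field does not fit its bit width")
    return initial, medial, final, tone


def override_to_code(digits: str) -> bytes:
    """Convert the override digits into a two-byte zhuyin code."""
    initial, medial, final, tone = parse_override(digits)
    return bytes([(initial << 2) | medial, (final << 4) | tone])


print(parse_override("141012"))          # (14, 1, 1, 2)
print(override_to_code("141012").hex())  # 3912
```

Because the override produces the same two-byte code as an index lookup, the later steps that turn a code into displayed symbols need no special case for polyphonic characters.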
In step 17, the pronunciation symbol is obtained from the pronunciation code and is either stored in a storage device, for example a disk (step 18), or displayed on a display device, for example a terminal (step 19). Chinese characters and pronunciation symbols are displayed on the terminal using ordinary display methods together with custom functions and some called modules.
Fig. 2 shows an example display of automatic annotation with zhuyin symbols under BIG_5 encoding, achieved according to the present invention, and Fig. 3 shows an example display of automatic annotation with Roman pinyin symbols under GB2312 encoding. According to statistics and experience, cases that require specific annotation of a polyphonic character account for less than 3% in a typical text file. Almost all text files can therefore be annotated with pronunciation symbols automatically.
The above is only a preferred embodiment of the present invention and is not intended to limit the scope of its claims; all other equivalent changes or modifications made without departing from the disclosed spirit shall fall within the scope of the claims. For example, the specification uses automatic annotation of Chinese character pronunciation symbols as its embodiment, but other scripts, especially non-phonetic writing systems, also fall within the spirit of the claimed scope. Likewise, the embodiment uses the BIG_5 and GB2312 Chinese character encodings, but other encodings are equally applicable within the scope claimed by the present invention.
Claims (39)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB97110364XA CN100392640C (en) | 1997-04-15 | 1997-04-15 | method for automatically marking pronunciation symbol |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1196535A true CN1196535A (en) | 1998-10-21 |
CN100392640C CN100392640C (en) | 2008-06-04 |
Family
ID=5171388
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB97110364XA Expired - Fee Related CN100392640C (en) | 1997-04-15 | 1997-04-15 | method for automatically marking pronunciation symbol |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100392640C (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100440207C (en) * | 2004-12-31 | 2008-12-03 | 北京中星微电子有限公司 | Chinese dictionary search engine and method for quick positioning words in Chinese dictionary |
CN101482867B (en) * | 2008-01-09 | 2012-07-04 | 北大方正集团有限公司 | Method and apparatus for automatically adding pinyin for Chinese character |
CN102567296A (en) * | 2011-01-04 | 2012-07-11 | 中国移动通信有限公司 | Chinese character information processing method and Chinese character information processing device |
WO2012092845A1 (en) * | 2011-01-04 | 2012-07-12 | 中国移动通信集团公司 | Chinese character information processing method and chinese character information processing device |
WO2015169134A1 (en) * | 2014-05-07 | 2015-11-12 | Tencent Technology (Shenzhen) Company Limited | Method and apparatus for phonetically annotating text |
US10114809B2 (en) | 2014-05-07 | 2018-10-30 | Tencent Technology (Shenzhen) Company Limited | Method and apparatus for phonetically annotating text |
US11479693B2 (en) | 2018-05-03 | 2022-10-25 | Avery Dennison Corporation | Adhesive laminates and method for making adhesive laminates |
Also Published As
Publication number | Publication date |
---|---|
CN100392640C (en) | 2008-06-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1196535A (en) | The method of automatic labeling of pronunciation symbols | |
CN100501656C (en) | Tone and shape combination method for inputting Chinese character into electronic apparatus | |
CN108304083A (en) | With the method and keyboard of two Chinese character of skeleton symbol Pinyin Input | |
CN87105564A (en) | A Chinese character input method and its input keyboard | |
CN1111373A (en) | Computer Chinese input scheme based on the Chinese Phonetic Alphabet | |
CN85100094A (en) | Phonetic transcriptions of Chinese characters association coding and spelling keyboard | |
CN1079550C (en) | Phonetic Chinese characters | |
CN1027839C (en) | Chinese character encoding input method | |
CN107451105B (en) | Bright braille conversion system based on novel Chinese character holographic coding rule | |
CN1257444C (en) | Complete pronunciation Chinese input method for computer | |
CN1022350C (en) | Chinese alphabet coding input method | |
CN1074146C (en) | Scheme for inputting Chinese characters | |
CN1116336A (en) | Substitution type Chinese phonetic character, word input coding method and keyboard thereof | |
CN1612095A (en) | Double phonetic alphabet input method | |
CN1614539A (en) | Initial consonant and vowel inputting method | |
CN1485716A (en) | Mandarin Chinese phonetic alphabet Chinese input method and equipment thereof | |
CN1034030C (en) | Simple key-in technique for Chinese characters | |
CN1388430A (en) | Modern Chinese pronunciation input method | |
CN1304075A (en) | Nanural phonetic configuration code computer Chiense character code input method | |
CN1609762A (en) | Binary syllabification | |
CN87106019A (en) | Method for encoding Chinese characters by pronunciation and keyboard | |
CN1470978A (en) | Block word input scheme for displaying all Chinese character component and high-frequency individual character on one screen | |
CN103488309A (en) | Chinese character input method combining simple spelling and component figures | |
CN1018774B (en) | Chinese-character, symbol coding method by shape, pronunciation and symbol and its keyboard | |
CN85102847A (en) | The input of computer Chinese-character dynamic coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080604 Termination date: 20110415 |