JPH05216887A

JPH05216887A - Device and method for chinese pronunciation notation/ chinese character conversion

Info

Publication number: JPH05216887A
Application number: JP4299136A
Authority: JP
Inventors: Okikuni Naka; 興国仲
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1991-10-14
Filing date: 1992-10-13
Publication date: 1993-08-27

Abstract

PURPOSE:To input pronunciation by utilizing any of plural kinds of notation method including the PinYin notation method and the ZhuYin notation method and to obtain Chinese character candidates including a desired Chinese character without inputting a tone or even when the input of the tone is erroneous. CONSTITUTION:Input data in the PinYin notation method are converted into corresponding sound codes by means of a PinYin/sound code conversion table 31, while the input data in the ZhuYin notation method are converted into the corresponding sound codes by means of a ZhuYin/sound code conversion table 32. A dictionary 35 stores Chinese character codes corresponding to a sound code train (corresponding to a word). The sound code train is prepared from the input data. The respective sound codes in the input sound code train is compared with the sound codes in the sound code train in the dictionary 35 through a filter masking a certain bit of the sound codes. Based on the comparison, the Chinese character codes corresponding to the coincident sound code train are read out of the dictionary 35 and the word (Chinese character) corresponding to the Chinese character codes are displayed.

Description

Detailed Description of the Invention

【０００１】[0001]

【技術分野】この発明は，キーボードから入力された中
国語発音表記をそれに対応する漢字に変換して出力する
装置および方法に関し，とくに中国語用ワード・プロセ
ッサやワーク・ステーション等で好適に利用される装置
および方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and method for converting Chinese phonetic notation input from a keyboard into corresponding Chinese characters and outputting the same, and is particularly preferably used in a Chinese word processor or work station. Apparatus and method.

【０００２】[0002]

【背景技術】中国語は漢字を用いて表現される。漢字の
発音を表記するためのいつくかの表記法がある。その代
表的なものに，中国政府が1958年に発布したBACKGROUND ART Chinese is expressed using Chinese characters. There are several notations for writing the pronunciation of kanji. The representative one was promulgated by the Chinese government in 1958.

【数１】それ以前に使用され，現在でも台湾で使用されている[Equation 1] Used before then, still used in Taiwan

【数２】がある。[Equation 2] There is.

【０００３】中国漢字一字の発音は，子音に相当する声
母（Sheng Mu）と，母音に相当する韵母（Yun Mu）と，
抑揚を表わす四声（Si Sheng）または声調（Sheng Dia
o）とに分けることができる。韵母と声母とをあわせて
声韵（Sheng Yun ）という。漢字の発音の中には抑揚の
ないものもある。一漢字の発音は１個以下（１個または
０個）の声母と１個の韵母と（必要に応じてさらに声調
を用いて）によって表わされる。The pronunciation of a single Chinese kanji character is made up of a phonetic mother (Sheng Mu) corresponding to a consonant, and a Chinese mother (Yun Mu) corresponding to a vowel.
Four tones (Si Sheng) or tones (Sheng Dia)
o) can be divided into The combination of Kyoko and Kyoko is called Sheng Yun. Some kanji pronunciations have no intonation. The pronunciation of one Kanji is represented by one or less (one or zero) phonetic syllable and one cypress (using further tones as necessary).

【０００４】声調は次の４種類である。一声（Yi Sheng
または１Sheng ）：高音で平坦。There are the following four tone types. One voice (Yi Sheng
Or 1Sheng): High and flat.

【数３】二声（Er Shengまたは２Sheng ）：低音から高音に上が
る。[Equation 3] Two voices (Er Sheng or 2Sheng): Bass to treble.

【数４】三声（Shan Shengまたは３Sheng ）：高音から低音にな
り，再び高音になる。[Equation 4] Three voices (Shan Sheng or 3Sheng): High to low, then high again.

【数５】四声（Si Shengまたは４Sheng ）：高音から低音に下が
る。[Equation 5] Four voices (Si Sheng or 4 Sheng): Decrease from treble to bass.

【数６】 [Equation 6]

【０００５】たとえば，「中国」という漢字はPin Yin
法によると，For example, the Chinese character “China” is Pin Yin
According to the law,

【数７】と表記される。これらの中で，“Ｚｈ”および“Ｇ”が
声母，“ｏｎｇ”および“ｕｏ”が韵母である。また，
「日本」という漢字はPin Yin 法によると，[Equation 7] Is written. Among these, "Zh" and "G" are the vocal mothers, and "ong" and "uo" are the female mothers. Also,
According to the Pin Yin method, the kanji "Japan" is

【数８】と表記される。“Ｒ”および“Ｂ”が声母，“ｉ”およ
び“ｅｎ”が韵母である。[Equation 8] Is written. "R" and "B" are voice carriers, and "i" and "en" are iron carriers.

【０００６】従来の中国語ワード・プロセッサではPin
Yin 法表記による入力のみが許されていた。Pin Yin 法
は比較的新しいので，Zhu Yin 法は知っているがPin Yi
n 法を知らない人々または世代がいる。したがって，よ
り多くの人々に中国語ワード・プロセッサを活用しても
らおうとするとZhu Yin 法による入力も許容するように
しなければならない。Pin in a traditional Chinese word processor
Only input in Yin notation was allowed. The Pin Yin method is relatively new, so I know the Zhu Yin method, but Pin Yi
n Some people or generations do not know the law. Therefore, in order to get more people to use the Chinese word processor, we have to allow Zhu Yin input.

【０００７】また，Pin Yin 法表記は北京語を標準語と
して定められている。広大な中国においては北京語と声
調が異なる発音をもつ言語がある。地域によっては声韵
さえ北京語と異なる場合がある。したがって標準語とし
ての北京語を知らないまたは熟知していない人々にとっ
ては声韵と声調とを正しく入力することが困難となり，
入力誤りがしばしば生じる。北京語圏の人であっても，
声調の区別を意識しながら発音しているとは限らず，声
調を思い出しながらまたは考えながらワード・プロセッ
サへの入力作業を行なわなければならず，入力作業が煩
雑であるばかりでなく，正しい声調を入力できないとき
もある。The Pin Yin system notation is defined with Mandarin as the standard language. In vast China, there are languages with pronunciations that differ from the tone of Mandarin. Depending on the region, even the vocal language may be different from Mandarin. Therefore, it becomes difficult for people who do not know or are familiar with Mandarin as the standard language to input the correct voice and tone,
Input errors often occur. Even if you are a Mandarin-speaking person,
It is not always possible to pronounce while paying attention to the distinction of tones, and it is necessary to perform input work into the word processor while remembering or thinking about the tone, and not only the input work is complicated, but also the correct tone is produced. Sometimes you can't enter.

【０００８】従来の中国語ワード・プロセッサにおいて
は，声韵と声調が正しく入力された場合にのみそれに対
応する正しい漢字が出力され，入力誤りがあると正しい
漢字が得られない。In the conventional Chinese word processor, the correct Chinese character corresponding to the correct input is output only when the vocal and tone are correctly input, and the correct Chinese character cannot be obtained if there is an input error.

【０００９】[0009]

【発明の開示】この発明の目的は，中国語の発音表記／
漢字変換装置において，Pin Yin 表記法，Zhu Yin 表記
法を含む複数種類の表記法のいずれかを用いて発音を入
力することができるようにするものである。DISCLOSURE OF THE INVENTION It is an object of the present invention to provide Chinese phonetic notation /
The Kanji conversion device enables pronunciations to be input using any of several types of notation including Pin Yin notation and Zhu Yin notation.

【００１０】この発明の他の目的は，声調を入力しなく
てもまたは声調の入力に誤りがあっても所望の漢字を含
む候補漢字が得られるようにすることにある。Another object of the present invention is to make it possible to obtain a candidate kanji including a desired kanji even if the tone is not input or there is an error in the tone input.

【００１１】この発明のさらに他の目的は発音の少くと
も一部が正確であれば，その一部発音を含む発音に対応
する漢字の候補が得られるようにすることにある。It is still another object of the present invention to obtain a kanji candidate corresponding to a pronunciation including a part of the pronunciation if at least a part of the pronunciation is accurate.

【００１２】この発明の第１の特徴によると，この発明
による中国語の発音表記／漢字変換装置は，中国語発音
を複数種類の表記法にしたがって入力可能な入力装置，
上記入力装置を用いて入力可能な複数種類の表記法のそ
れぞれについて設けられ，各表記法にしたがう入力デー
タを，その入力データによって表わされる発音に対応す
る音コードに変換する複数の変換テーブル，音コード
と，その音コードによって表わされる発音をもつ漢字を
示す漢字コードとを相互に対応させて記憶した辞書，お
よび上記入力装置から入力された入力データを上記複数
の変換テーブルのいずれかを用いて音コードに変換し，
変換された音コードに対応する漢字コードを上記辞書に
おいて検索する制御手段を備えている。According to the first aspect of the present invention, the Chinese pronunciation notation / Kanji conversion apparatus according to the present invention is capable of inputting Chinese pronunciations according to a plurality of kinds of notations,
A plurality of conversion tables and sounds provided for each of a plurality of types of notations that can be input using the above-mentioned input device, for converting input data according to each notation into a sound code corresponding to the pronunciation represented by the input data. A dictionary in which a code and a Kanji code indicating a Kanji having a pronunciation represented by the phonetic code are stored in association with each other, and input data input from the input device is stored using one of the plurality of conversion tables. Convert to sound code,
The control means for searching the dictionary for the kanji code corresponding to the converted phonetic code is provided.

【００１３】好ましい実施態様においては，この発明の
装置には，複数種類の表記法のいずれかを選択するため
の入力モード選択手段がさらに設けられる。そして，こ
の入力モード選択手段によって選択された表記法に関係
する変換テーブルを用いて入力データが音コードに変換
される。In a preferred embodiment, the apparatus of the present invention is further provided with input mode selection means for selecting one of a plurality of types of notation. Then, the input data is converted into a sound code by using a conversion table related to the notation selected by the input mode selection means.

【００１４】上記入力装置から入力された入力データに
基づいて表記法が自動的に判定され，この判定にしたが
って用いるべき変換テーブルを選択するようにしてもよ
い。The notation may be automatically determined based on the input data input from the input device, and the conversion table to be used may be selected according to this determination.

【００１５】上記入力装置を，入力音声を電気信号に変
換する装置と，音声電気信号に基づいて発音を認識し，
入力音声を音コードに変換する音声認識装置とから構成
することもできる。The input device is a device for converting an input voice into an electric signal, and recognizes pronunciation based on the voice electric signal,
It can also be configured with a voice recognition device that converts an input voice into a sound code.

【００１６】この発明による装置が中国語ワード・プロ
セッサ等に応用された場合には，検索された漢字コード
を，その漢字コードによって示される漢字を表わす表示
データに変換する手段，および表示データに基づいて漢
字を表示する装置がさらに設けられる。When the device according to the present invention is applied to a Chinese word processor or the like, the means for converting the retrieved Kanji code into display data representing the Kanji indicated by the Kanji code, and the display data are used. Further, a device for displaying kanji is provided.

【００１７】この装置にはさらに，表示された漢字の候
補の中からいずれか一つを指定する指定入力手段，およ
び指定された漢字を示す漢字コードを格納するメモリが
設けられる。This device is further provided with a designation input means for designating one of the displayed kanji candidates and a memory for storing a kanji code indicating the designated kanji.

【００１８】より現実的なワード・プロセッサに適用す
るためには，上記各変換テーブルは一漢字について入力
データを音コードに変換するように構成される。これに
対して，上記辞書は，一漢字または複数の漢字からなる
単語について音コード列と漢字コードとを対応させて記
憶するように構成される。上記入力装置から入力された
一連の入力データは一漢字ごとに区切って音コードに変
換される。変換後の一または複数の音コードは単語単位
に配列されて音コード列を構成する。この音コード列に
対応する漢字コードが上記辞書において検索される。In order to apply to a more realistic word processor, each conversion table described above is configured to convert input data into a phonetic code for one Chinese character. On the other hand, the dictionary is configured to store a phonetic code string and a kanji code in association with each other for a word consisting of one kanji or a plurality of kanji. A series of input data input from the input device is divided into each kanji character and converted into a sound code. The converted one or more sound codes are arranged in word units to form a sound code string. The Kanji code corresponding to this phonetic code string is searched in the dictionary.

【００１９】この発明は一つの発音（漢字の読み）に一
つの音コードを対応させることができるという認識に基
礎を置いている。Pin Yin 表記，Zhu Yin 表記などの複
数種類の表記法が存在したとしても，これらの表記法に
よって表現された発音はそれが同一発音のものであれば
必ず一つの音コードに収斂させることができる。したが
って，漢字（または単語）の辞書としては音コードを用
いて検索されるものを用意すれば足りる。このようにし
てこの発明によると，複数種類の表記法のいずれかを用
いて発音を入力することができ，入力された発音がその
発音をもつ漢字に変換される。The present invention is based on the recognition that one phonetic code (kanji reading) can be associated with one phonetic code. Even if there are multiple types of notations such as Pin Yin notation and Zhu Yin notation, the pronunciations expressed by these notations can always be converged into one phonetic chord if they have the same pronunciation. .. Therefore, it suffices to prepare a dictionary for kanji (or words) that is searched using phonetic codes. In this way, according to the present invention, the pronunciation can be input using any of a plurality of types of notation, and the input pronunciation is converted into the kanji having the pronunciation.

【００２０】この発明の実施態様においては，声調を正
しく入力しなくても，または発音表記に多少の誤りがあ
っても，希望する漢字を含む漢字候補を得ることができ
る。このためにこの発明による装置は，音コードを構成
する所定の一または複数ビットをマスクするフィルタリ
ング手段をさらに備えている。そして，上記制御手段
は，入力データに対応する音コードと上記辞書の音コー
ドとを上記フィルタリング手段を用いてフィルタリング
したのちに比較することにより，入力データに対応する
音コードに一致する音コードを上記辞書において捜し出
す。In the embodiment of the present invention, it is possible to obtain a kanji character candidate including a desired kanji character even if the tone is not correctly input or the phonetic transcription has some error. To this end, the device according to the invention further comprises filtering means for masking the predetermined one or more bits that make up the sound code. Then, the control means filters the sound code corresponding to the input data and the sound code of the dictionary by using the filtering means and then compares them to determine a sound code matching the sound code corresponding to the input data. Search in the above dictionary.

【００２１】この実施態様の特徴は，次に述べるこの発
明の第２の特徴の観点から充分に明らかにされよう。The features of this embodiment will be fully clarified in view of the second feature of the invention described below.

【００２２】この発明の第２の特徴によると，中国語の
発音表記／漢字変換装置は，入力された中国語発音を表
わすデータを，その発音に対応する音コードに変換する
変換手段，音コードと，その音コードによって表わされ
る発音をもつ漢字を示す漢字コードとを相互に対応させ
て記憶した辞書，音コードを構成する所定の一または複
数ビットをマスクするフィルタリング手段，上記変換手
段から得られる音コードと上記辞書の音コードとを上記
フィルタリング手段を用いてフィルタリングした上で相
互に比較することにより，上記変換手段から得られる音
コードに一致する音コードを上記辞書において検索し，
一致する音コードに対応する漢字コードを上記辞書から
読み出す制御手段を備えている。According to the second aspect of the present invention, the Chinese pronunciation notation / Kanji conversion apparatus converts the input data representing the Chinese pronunciation into a sound code corresponding to the pronunciation, and a sound code. And a kanji code indicating a kanji having a pronunciation represented by the sound code are stored in association with each other, filtering means for masking a predetermined one or more bits constituting the sound code, and the conversion means. By filtering the sound code and the sound code in the dictionary using the filtering means and comparing them with each other, a sound code matching the sound code obtained from the converting means is searched in the dictionary,
The control means is provided for reading the kanji code corresponding to the matching phonetic code from the dictionary.

【００２３】上記音コードは，一実施態様においては，
声母を表わすビットと，韵母を表わすビットと，声調を
表わすビットとを含むように構成される。この場合，上
記フィルタリング手段は，声母を表わすビット，韵母を
表わすビットまたは声調を表わすビットをマスクするも
のとして構成される。The above sound chord is, in one embodiment,
It is configured so as to include a bit representing a phonetic character, a bit representing a vowel, and a bit representing a tone. In this case, the filtering means is configured as a mask for a bit representing a voice vowel, a bit indicating a vowel vowel, or a bit indicating a tone.

【００２４】上記フィルタリング手段は，音コードをそ
のまま通過させるものを含むと理解されたい。It is to be understood that the filtering means include those that allow the sound chords to pass through as they are.

【００２５】フィルタリング手段の使用の有無，または
複数のフィルタリング手段のいずれかを選択する検索モ
ード選択手段が，必要に応じて，さらに設けられる。Search mode selecting means for selecting whether or not to use the filtering means or for selecting one of the plurality of filtering means is further provided, if necessary.

【００２６】上述したこの発明の第１の特徴のように，
中国語発音を複数種類の表記法にしたがって入力可能な
入力装置をさらに設けてもよい。この場合，上記変換手
段は，上記入力装置を用いて入力可能な複数種類の表記
法のそれぞれについて設けられ，かつ各表記法にしたが
う入力データを，その入力データによって表わされる発
音に対応する音コードに変換する複数の変換テーブルを
備えることになる。Like the first feature of the present invention described above,
You may further provide the input device which can input Chinese pronunciation according to the multiple types of notation. In this case, the conversion means is provided for each of a plurality of types of notation that can be input using the input device, and the input data according to each notation is converted into a sound code corresponding to the pronunciation represented by the input data. It will have a plurality of conversion tables for converting to.

【００２７】上記入力手段および変換手段の代わりに，
音声入力信号に基づいて発音を認識し，その発音に対応
する音コードを出力する音声認識手段を設けてもよい。Instead of the above input means and conversion means,
A voice recognition means may be provided for recognizing a pronunciation based on a voice input signal and outputting a sound code corresponding to the pronunciation.

【００２８】より現実的なワード・プロセッサを実現す
るために，読み出された漢字コードを，その漢字コード
によって示される漢字を表わす表示データに変換する手
段，表示データに基づいて漢字を表示する装置，表示さ
れた漢字の候補の中からいずれか一つを指定する指定入
力手段，および指定された漢字を示す漢字コードを格納
するメモリが設けられよう。In order to realize a more realistic word processor, a means for converting the read kanji code into display data representing the kanji indicated by the kanji code, and a device for displaying the kanji based on the display data. , A designation input means for designating one of the displayed kanji candidates and a memory for storing a kanji code indicating the designated kanji will be provided.

【００２９】さらに，上記変換手段が一漢字について入
力データを音コードに変換するものとして構成されるの
に対して，上記辞書は，一漢字または複数の漢字からな
る単語について音コード列と漢字コードとを対応させて
記憶するものとして構成される。そして，一連の入力デ
ータが一漢字ごとに区切って音コードに変換され，変換
後の一または複数の音コードが単語単位に配列されて音
コード列が構成される。この音コード列に対応する漢字
コードが上記辞書において検索される。Further, while the conversion means is configured to convert the input data into a phonetic code for one Kanji, the above dictionary has a phonetic code string and a Kanji code for a word consisting of one Kanji or a plurality of Kanji. And are associated and stored. Then, a series of input data is divided into each Kanji character and converted into a sound code, and one or a plurality of converted sound codes are arranged in word units to form a sound code string. The Kanji code corresponding to this phonetic code string is searched in the dictionary.

【００３０】この発明によると，入力データを表わす音
コードと辞書における音コードとがともにフィルタリン
グされた後に比較される。一致，不一致を不問に付すべ
き音コードの部分（一または複数のビット）がフィルタ
によってマスクされるので，このマスクされた部分は比
較処理の対象にならない。According to the present invention, the sound code representing the input data and the sound code in the dictionary are both filtered and then compared. Since the filter masks the part (one or more bits) of the sound code that should be matched or unmatched, the masked part is not subject to comparison processing.

【００３１】したがって，声調を不問にしたい場合には
それに適したフィルタを用いることにより，声調の一
致，不一致に関係なく，または声調を入力しなくても，
その他の部分（声韵）で入力データに対応する音コード
と一致する音コードに対応する一または複数の漢字の候
補が得られることになる。このようにして，声調を入力
しなくても，または声調の入力に誤りがあっても所望の
漢字（単語）を含む漢字（単語）の候補が出力されるよ
うになる。Therefore, when it is desired to make the tones unquestioned, by using a filter suitable for them, regardless of whether the tones match or not, or even when the tones are not input,
In the other part (Voice), one or a plurality of Chinese character candidates corresponding to the sound code corresponding to the input data can be obtained. In this way, candidates for kanji (words) including a desired kanji (word) can be output even if no tone is input or there is an error in tone input.

【００３２】フィルタの種類は任意に設定可能である。
したがって，声母のみの一致または韵母のみの一致とい
う条件下での漢字の検索も可能となる。すなわち，発音
の少くとも一部が正確であればその発音を含む発音に対
応する漢字（単語）の候補を得ることができるようにな
る。The type of filter can be set arbitrarily.
Therefore, it is also possible to search for kanji under the condition that only the phonetic mother or only the Chinese mother is matched. That is, if at least part of the pronunciation is correct, it is possible to obtain candidates for kanji (words) corresponding to the pronunciation including the pronunciation.

【００３３】この発明はさらに上述した第１および第２
の特徴による装置にそれぞれ対応する中国語の発音表記
／漢字変換方法を提供している。The present invention further includes the above-mentioned first and second
It provides a phonetic transcription / kanji conversion method for Chinese corresponding to each device according to the above features.

【００３４】第１の特徴による方法は，中国語発音を複
数種類の表記法にしたがって入力可能とし，入力可能な
複数種類の表記法のそれぞれについて，各表記法にした
がう入力データを，その入力データによって表わされる
発音に対応する音コードに変換する複数の変換テーブ
ル，および音コードと，その音コードによって表わされ
る発音をもつ漢字を示す漢字コードとを相互に対応させ
て記憶した辞書をあらかじめ用意し，入力データを上記
複数の変換テーブルのいずれかを用いて音コードに変換
し，変換された音コードに対応する漢字コードを上記辞
書において検索するものである。In the method according to the first feature, Chinese pronunciation can be input according to a plurality of types of notation, and for each of the plurality of types of notations that can be input, the input data according to each notation is converted into the input data. A dictionary is prepared in advance that stores a plurality of conversion tables for converting into the phonetic codes corresponding to the pronunciations represented by, and the phonetic codes and the Kanji codes representing the Kanjis with the pronunciations represented by the phonetic codes in association with each other. The input data is converted into a sound code using any of the plurality of conversion tables, and the Kanji code corresponding to the converted sound code is searched in the dictionary.

【００３５】第２の特徴による方法は，音コードと，そ
の音コードによって表わされる発音をもつ漢字を示す漢
字コードとを相互に対応させて記憶した辞書をあらかじ
め用意しておき，入力された中国語発音を表わすデータ
を，その発音に対応する音コードに変換し，変換により
得られた音コードと，上記辞書の音コードとを，音コー
ドを構成する所定の一または複数ビットをマスクするこ
とによりフィルタリングした上で相互に比較し，変換に
より得られた音コードに一致する音コードを上記辞書に
おいて検索し，一致する音コードに対応する漢字コード
を上記辞書から読み出すものである。In the method according to the second feature, a dictionary in which a phonetic code and a kanji code indicating a kanji having a pronunciation represented by the phonetic code are stored in association with each other is prepared in advance, and the input Chinese Data representing a word pronunciation is converted into a sound code corresponding to the pronunciation, and the sound code obtained by the conversion and the sound code in the dictionary are masked with one or more predetermined bits constituting the sound code. The sound codes that match the sound codes obtained by the conversion are searched in the dictionary, and the Kanji codes corresponding to the matching sound codes are read from the dictionary.

【００３６】この発明の他の特徴および利点は，以下に
示す図面を参照した実施例の説明において明らかになる
であろう。Other features and advantages of the present invention will be apparent from the following description of the embodiments with reference to the drawings.

【００３７】[0037]

【実施例の説明】図１は中国語の発音表記／漢字変換装
置の構成を示している。この装置は，一般には，中国語
ワード・プロセッサ，ワーク・ステーション等の一部と
して実現されよう。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 shows the configuration of a Chinese phonetic transcription / kanji conversion device. This device would typically be implemented as part of a Chinese word processor, work station, etc.

【００３８】中国語の発音表記／漢字変換装置は，中央
処理装置（ＣＰＵ）を含むコンピュータ10，発音表記，
各種モード，その他の機能を入力するためのキーボード
20，辞書および各種変換テーブルを記憶したメモリ装置
30，ならびに変換された漢字，その他の情報またはデー
タを表示するための表示装置14およびその制御装置12か
ら構成されている。The phonetic transcription / kanji conversion device for Chinese includes a computer 10 including a central processing unit (CPU), a phonetic transcription,
Keyboard for entering various modes and other functions
20, memory device storing dictionary and various conversion tables
30 and a display device 14 for displaying converted Chinese characters and other information or data and its control device 12.

【００３９】コンピュータ10としては市販の汎用コンピ
ュータを用いることができる。このコンピュータ10は後
述する入力，編集処理および漢字検索処理を実行するよ
うにプログラムされている。A commercially available general-purpose computer can be used as the computer 10. This computer 10 is programmed to execute the input, edit processing and Kanji search processing described later.

【００４０】この実施例では，入力可能な発音表記とし
てPin Yin 法とZhu Yin 法とがある。また，声調を含め
た完全な発音表記と，声調を除いた声韵のみで行う発音
表記とが許容される。In this embodiment, there are the Pin Yin method and the Zhu Yin method as the phonetic notations that can be input. In addition, a complete phonetic transcription that includes the tone and a phonetic transcription that excludes the tone and that is performed using only the vocal cord are allowed.

【００４１】キーボード20には，Pin Yin 法で発音表記
を入力するためのPin Yin キー21，Zhu Yin 法で発音表
記を入力するためのZhu Yin キー22，入力する発音表記
がPin Yin 法によるものかZhu Yin 法によるものかを選
択するための入力モード・キー23，声調を含めた完全な
発音表記で漢字検索をしてほしい（これを第１モードと
いう）か，声調を除いた声韵のみによる発音表記で漢字
検索をしてほしい（これを第２モードという）かを選択
するための変換モード・キー，ならびに入力した発音表
記を漢字に変換すべきことを指令するための変換キー，
スペース・キー（必要な場合），その他の機能を入力す
るための機能キー25が設けられている。On the keyboard 20, a Pin Yin key 21 for inputting phonetic notation by the Pin Yin method, a Zhu Yin key 22 for inputting phonetic notation by the Zhu Yin method, and a phonetic notation to be input also by the Pin Yin method. No. or Zhu Yin method input mode key 23, Kanji search with complete phonetic notation including tone (this is called first mode), A conversion mode key to select whether you want to search Kanji by phonetic notation (this is called the second mode), and a conversion key to instruct that the entered phonetic notation should be converted to Kanji,
A space key (if required) and a function key 25 for entering other functions are provided.

【００４２】キーボード20から入力された発音表記を表
わすデータ（またはコード）は対応する音コード（Yin
Code）に変換される。このコード変換のためにPin Yin
／音コード変換テーブル31およびZhu Yin ／音コード変
換テーブル32がメモリ装置30に設けられている。入力さ
れた発音表記をその発音表記または他の発音表記で表示
するために，一旦，音コードに変換されたものをPin Yi
n 表記またはZhu Yin表記を表わすデータ（またはコー
ド）に逆変換するために音コード／Pin Yin テーブル33
および音コード／Zhu Yin テーブル34がメモリ装置30に
設けられている。さらに，変換された音コードに基づい
てそれに対応する漢字を表わすコード（漢字コード）を
検索するために用いる辞書35，および変換後の漢字コー
ドを記憶するための変換後データ・エリア36がメモリ装
置30に設けられている。メモリ装置30は，半導体メモリ
（ＲＯＭやＲＡＭ），磁気メモリ（フロッピィ・ディス
クやハード・ディスク）またはこれらの組合せにより実
現される。たとえば，変換テーブル31〜34はＲＯＭ，フ
ロッピィ・ディスクまたはハード・ディスクに格納さ
れ，辞書35はフロッピィ・ディスクまたはハード・ディ
スクに格納され，変換後データ・エリアはＲＡＭに設定
される。The data (or chord) representing the phonetic notation input from the keyboard 20 is the corresponding phonetic chord (Yin
Code). Pin Yin for this code conversion
A / sound code conversion table 31 and a Zhu Yin / sound code conversion table 32 are provided in the memory device 30. In order to display the input phonetic notation with that phonetic notation or another phonetic notation, the one that has been converted into a phonetic code is Pin Yi
Phonetic chord / Pin Yin table 33 for converting back to data (or chords) representing n notation or Zhu Yin notation
A sound code / Zhu Yin table 34 is provided in the memory device 30. Further, a dictionary 35 used for searching a code (Kanji code) representing a corresponding Kanji character based on the converted phonetic code, and a converted data area 36 for storing the converted Kanji code are memory devices. It is provided in 30. The memory device 30 is realized by a semiconductor memory (ROM or RAM), a magnetic memory (floppy disk or hard disk), or a combination thereof. For example, the conversion tables 31 to 34 are stored in the ROM, the floppy disk or the hard disk, the dictionary 35 is stored in the floppy disk or the hard disk, and the converted data area is set in the RAM.

【００４３】表示装置14としては，最も一般的には，Ｃ
ＲＴ表示装置が用いられるが，プラズマ・ディスプレイ
装置や液晶表示装置を利用することもできる。表示制御
装置12にはキャラクタ・ジェネレータ13が内蔵されてい
る。キャラクタ・ジェネレータ13は，Pin Yin またはZh
u Yin を表わすデータ，および漢字コードを表示データ
（ドット・データ）に変換するものである。As the display device 14, most commonly, C
An RT display device is used, but a plasma display device or a liquid crystal display device can also be used. The display controller 12 has a character generator 13 built-in. Character generator 13 is Pin Yin or Zh
u Yin data and Kanji code are converted to display data (dot data).

【００４４】図２は図１に示す構成をその機能および処
理の流れにしたがって配列したものである。切換回路1
5，編集処理装置16および漢字検索処理装置17は実質的
にはコンピュータ10によって実現される。または，発音
表記／漢字変換装置を図２に示されるハードウェア・ア
ーキテクチャをもつように構成することもできる。FIG. 2 is a diagram in which the configuration shown in FIG. 1 is arranged according to its function and processing flow. Switching circuit 1
5, the edit processing device 16 and the kanji search processing device 17 are substantially realized by the computer 10. Alternatively, the phonetic transcription / kanji conversion device can be configured to have the hardware architecture shown in FIG.

【００４５】切換回路15は入力モード・キー23を通した
選択入力に応じてPin Yin キー21またはZhu Yin キー22
を選択する。Pin Yin キー21またはZhu Yin キー22を用
いて入力されたPin Yin またはZhu Yin 表記を表わすデ
ータは編集処理装置16に与えられる。切換回路15はPin
Yin キー入力かZhu Yin キー入力かを自動的に識別して
その入力を選択するようにしてもよい。The switching circuit 15 selects the Pin Yin key 21 or the Zhu Yin key 22 according to the selection input through the input mode key 23.
Select. The data representing the Pin Yin or Zhu Yin notation input using the Pin Yin key 21 or the Zhu Yin key 22 is given to the edit processing device 16. Switching circuit 15 is Pin
It may be possible to automatically discriminate between Yin key input and Zhu Yin key input and select the input.

【００４６】中国語の単語には漢字一字で構成されるも
のや複数（一般的には二または三字）の漢字で構成され
るものがある。編集処理装置16は入力されたPin Yin ま
たはZhu Yin 表記を表わすデータ列を漢字一字を表わす
データごとに区切り，区切られた各データを入力モード
・キー23によって選択された変換テーブル31または32を
参照して音コードに変換する。機能キー25に含まれる変
換キーからの入力があると，それまでに入力されたデー
タから作成された音コード列が編集処理装置16から漢字
検索処理装置17に与えられる。編集処理装置16は，入力
されたPin YinまたはZhu Yin 表記を表わすデータを表
示制御装置12に与えるので，表示装置14の画面上には入
力されたPin Yin またはZhu Yin 表記による文字が入力
された順序で順次表示されていく。この場合には逆変換
テーブル33，34は必要ない。編集処理装置16が入力デー
タを音コードに変換したのちにPin Yin またはZhu Yin
表記による表示を行う場合に，逆変換テーブル33，34が
用いられる。これらの逆変換テーブル33，34はとくに，
Pin Yin 法（またはZhu Yin 法）で入力されたデータを
Zhu Yin 表記（またはPin Yin 表記）で表示する場合に
有効に用いられる。これにより，Pin Yin 表記しか知ら
ないオペレータがZhu Yin 表記を，Zhu Yin表記しか知
らないオペレータがPin Yin 表記を知ることができるの
で便利である。Some Chinese words are composed of one Kanji character, and some are composed of a plurality of (generally two or three) Kanji characters. The edit processing device 16 divides the input data string representing the Pin Yin or Zhu Yin notation into data representing one kanji character, and stores the separated data into the conversion table 31 or 32 selected by the input mode key 23. Refer to and convert to sound code. When there is an input from the conversion key included in the function key 25, the sound code string created from the data input so far is given from the edit processing device 16 to the kanji search processing device 17. Since the edit processing device 16 gives the data representing the input Pin Yin or Zhu Yin notation to the display control device 12, the characters in the input Pin Yin or Zhu Yin notation are input on the screen of the display device 14. It is displayed in order. In this case, the inverse conversion tables 33 and 34 are not necessary. The editing processor 16 converts the input data into sound chords, and then Pin Yin or Zhu Yin
The reverse conversion tables 33 and 34 are used when displaying by notation. These inverse conversion tables 33 and 34 are
Data input by the Pin Yin method (or Zhu Yin method)
It is effectively used when displaying in Zhu Yin notation (or Pin Yin notation). This is convenient because an operator who knows only Pin Yin notation can know Zhu Yin notation and an operator who knows only Zhu Yin notation can know Pin Yin notation.

【００４７】漢字検索処理装置17は与えられた音コード
列に基づいて，変換モード・キー24による選択に応じ
て，辞書35を検索し，音コード列によって表わされる発
音をもつ１または複数の漢字を表わす漢字コードを読み
出し，表示制御装置12に与える。表示制御装置12は与え
られた漢字コードによって表わされる漢字を表示するた
めの表示データをキャラクタ・ジェネレータ13から読み
出し，これに基づいて表示装置14の表示画面に１または
複数の漢字（漢字の候補）を表示する。オペレータがこ
の表示をみて，機能キー25を用いて，表示された漢字を
確認またはいずれかを選択すると，確認または選択され
た漢字を表わす漢字コードが変換後データ・エリア36に
格納される。The kanji search processor 17 searches the dictionary 35 according to the selection by the conversion mode key 24 based on the given phonetic code string, and one or a plurality of kanji having the pronunciation represented by the phonetic code string. The Kanji code representing "" is read and given to the display controller 12. The display control device 12 reads display data for displaying the kanji represented by the given kanji code from the character generator 13, and based on this, one or a plurality of kanji (candidates for kanji) on the display screen of the display device 14. Is displayed. When the operator sees this display and confirms or selects one of the displayed kanji by using the function key 25, the kanji code representing the confirmed or selected kanji is stored in the converted data area 36.

【００４８】図３はPin Yin ／音コード変換テーブルの
一例を，図４はZhu Yin ／音コード変換テーブルの一例
をそれぞれ示している。FIG. 3 shows an example of the Pin Yin / sound code conversion table, and FIG. 4 shows an example of the Zhu Yin / sound code conversion table.

【００４９】一種類の発音についてPin Yin 法やZhu Yi
n 法のように様々な発音表記法があるが，これらの表記
は必ず一種類の発音に対応している。一種類の発音に対
して一種類の音コード（Yin Code）が割当てられる。一
つのPin Yin 表記は必ず一つの音コードに対応し，一つ
のZhu Yin 表記は必ず一つの音コードに対応する。一つ
の発音を表わすPin Yin 表記とこれと同じ発音を表わす
Zhu Yin 表記は共通の一つの音コードに対応する。この
ようにして，発音の入力に用いられる発音表記法が複数
種類あったとしても，同一の発音は一つの音コードを用
いて表現されることになる。Pin Yin 法を用いて入力し
ても，Zhu Yin 法を用いて入力しても，同一の発音のも
のであれば一つの音コードに変換されるので，装置の内
部では音コードを唯一の発音を表わすコードとして統一
的に用いることができる。このため，Pin Yin 法用の辞
書およびZhu Yin 法用の辞書というように，発音表記法
ごとに辞書を用意する必要がなく，すべての表記法に共
通の音コードを用いて検索される辞書を用意すれば足り
る。About one type of pronunciation Pin Yin method and Zhu Yi
There are various phonetic notations such as the n system, but these notations always correspond to one type of pronunciation. One type of sound code (Yin Code) is assigned to one type of pronunciation. One Pin Yin notation always corresponds to one phonetic chord, and one Zhu Yin notation always corresponds to one phonetic chord. Pin Yin notation for one pronunciation and the same pronunciation
The Zhu Yin notation corresponds to one common chord. In this way, even if there are multiple types of phonetic notation used for phonetic input, the same phonetic representation is expressed using one phonetic code. Whether the input is made using the Pin Yin method or the Zhu Yin method, as long as they have the same pronunciation, they will be converted into one sound code, so the sound code will be the only pronunciation inside the device. Can be used uniformly as a code representing. Therefore, it is not necessary to prepare a dictionary for each phonetic notation, such as a dictionary for the Pin Yin method and a dictionary for the Zhu Yin method, and a dictionary searched using a phonetic code common to all the notations is used. All you have to do is prepare.

【００５０】たとえば図３第１行に示されたPin Yin 表
記による発音と，図４第１行のZhuYin 表記による発音
とは同一であるから，同じ音コード５２ｆ８（16進数表
現）に対応する。他の行の発音も同じである。図３およ
び図４においては，分りやすくするために，最も左側の
欄はPin Yin 表記およびZhu Yin 表記でそれぞれ表わさ
れているが，変換テーブル内においてはこれらの表記は
２進数データによって表わされるのはいうまでもない。For example, the pronunciation in Pin Yin notation shown in the first line of FIG. 3 and the pronunciation in ZhuYin notation of the first line in FIG. 4 correspond to the same sound code 52f8 (hexadecimal notation). The pronunciation of the other lines is the same. 3 and 4, the leftmost columns are represented by Pin Yin notation and Zhu Yin notation respectively for the sake of clarity, but these notations are represented by binary data in the conversion table. Needless to say.

【００５１】図５(A) は音コードのデータ・フォーマッ
トを示している。この実施例では音コードは２バイトで
構成される。上位１バイトは主に韵母を表わし，下位１
バイトは主に声母を表わす。上位１バイトにおける最上
位ビット（ｆ）のデータ「０」および下位１バイトにお
ける最上位ビット（７）のデータ「１」は，一つの音コ
ードを構成する上位１バイトと下位１バイトを区別する
とともに，他のデータ（とくに音コード列における他の
音コード）と区別するために用いられる。FIG. 5 (A) shows the data format of the sound code. In this embodiment, the sound code is composed of 2 bytes. The high-order 1 byte mainly represents Yibo, and the low-order 1
The bytes mainly represent the voice mother. The data "0" of the most significant bit (f) in the upper 1 byte and the data "1" of the most significant bit (7) in the lower 1 byte distinguish between the upper 1 byte and the lower 1 byte constituting one sound code. It is also used to distinguish it from other data (in particular, other sound codes in the sound code string).

【００５２】上位１バイトの最下位ビット（第８ビッ
ト）は声調の有無を表わす。声調の無い発音もあるから
である。声調無しが「０」，声調有が「１」で表わされ
る。下位１バイトの下位２ビット（第０〜１ビット）は
声調を表わす。図５(B) に示すように，一声，二声，三
声，四声はそれぞれ「００」，「０１」，「１０」，
「１１」で表わされる。The least significant bit (8th bit) of the upper 1 byte indicates the presence or absence of a tone. This is because some pronunciations have no tone. No tone is represented by "0", and tone is represented by "1". The lower 2 bits (0th to 1st bits) of the lower 1 byte represent a tone. As shown in FIG. 5 (B), one voice, two voices, three voices, and four voices are “00”, “01”, “10”,
It is represented by "11".

【００５３】上位１バイトの中間の６ビット（第９〜ｅ
ビット）が韵母を，下位１バイトの中間の５ビット（第
２〜６ビット）が声母をそれぞれ表わす。韵音は37種
類，声母は24種類であるからこの程度のビット数で充分
である。The middle 6 bits of the upper 1 byte (9th to eth)
Bits) represent the mother, and the middle 5 bits (2nd to 6th bits) of the lower 1 byte represent the mother. Since there are 37 different types of syllabary and 24 types of vowels, this number of bits is sufficient.

【００５４】後に示す漢字検索処理においてフィルタが
用いられる。このフィルタも２バイトで構成され，第０
ビット，第１ビットおよび第８ビットが「０」に，他の
ビットが「１」に設定されている。このフィルタを16進
数で表現すると「ＦＥＦＣ」となる。A filter is used in the Chinese character search processing described later. This filter also consists of 2 bytes,
The bit, the first bit and the eighth bit are set to "0" and the other bits are set to "1". If this filter is expressed in hexadecimal, it becomes "FEFC".

【００５５】図６はコンピュータ10による入力，編集処
理の手順，または編集処理装置16の動作を示している。
コンピュータ10または編集処理装置16には図７に示すよ
うにキー・データ・バッファおよび図８に示すように音
コード列バッファが設けられている。FIG. 6 shows the procedure of the input and edit processing by the computer 10 or the operation of the edit processing device 16.
The computer 10 or the editing processor 16 is provided with a key data buffer as shown in FIG. 7 and a tone code string buffer as shown in FIG.

【００５６】まず，入力モード・キー23によってPin Yi
n 法またはZhu Yin 法のいずれのモードが設定されてい
るかが判定され（ステップ41），Pin Yin 法が選択され
ていればPin Yin ／音コード変換テーブル31が選択され
（ステップ42），Zhu Yin 法が選択されていればZhu Yi
n ／音コード変換テーブル32が選択される（ステップ4
3）。First, using the input mode key 23, Pin Yi
It is determined whether the n method or the Zhu Yin method is set (step 41). If the Pin Yin method is selected, the Pin Yin / sound chord conversion table 31 is selected (step 42) and the Zhu Yin method is selected. Zhu Yi if law is selected
n / sound chord conversion table 32 is selected (step 4
3).

【００５７】続いて，変換モード・キー24によって第１
モードまたは第２モードのいずれが選択されているかが
判定される（ステップ44）。第１モードが選択されてい
る場合には何らの処理も不要であるが，第２モードが選
択されている場合にはフィルタ（レジスタまたはメモリ
・エリアにより実現される）に「ＦＥＦＣ」が設定され
る（ステップ45）。第１モードが選択されている場合
に，フィルタとして全ビット１からなるデータ「ＦＦＦ
Ｆ」を設定してもよい。Then, the conversion mode key 24
It is determined whether the mode or the second mode is selected (step 44). When the first mode is selected, no processing is required, but when the second mode is selected, "FEFC" is set in the filter (implemented by the register or memory area). (Step 45). When the first mode is selected, the data "FFF" consisting of all 1's as a filter
F "may be set.

【００５８】Pin Yin キー21またはZhu Yin キー22から
発音表記を表わす文字または記号のデータが入力される
ごとに，これらのデータがキー・データ・バッファに格
納される（ステップ47）。図７に示すように，一文字が
入力されるとその文字の後段に終端記号データ「φ」が
格納される。これは発音表記を表わすデータは可変長デ
ータであり，そのデータの終端を明示しておくことが必
要であるからである。図７には，第２モードにしたがっ
て，Pin Yin 表記による「Ｚｈｏｎｇ」という発音が入
力されていく様子が示されている。Each time character or symbol data representing a phonetic transcription is input from the Pin Yin key 21 or Zhu Yin key 22, these data are stored in the key data buffer (step 47). As shown in FIG. 7, when one character is input, the terminal symbol data “φ” is stored in the subsequent stage of the character. This is because the data representing the phonetic notation is variable length data, and it is necessary to clearly indicate the end of the data. FIG. 7 shows how the pronunciation “Zhong” in Pin Yin notation is input according to the second mode.

【００５９】一漢字分のキー・データが入力されたかど
うかが判定される（ステップ48）。この判定には様々な
方法がある。その１は，一漢字分のキー・データ入力終
了時にオペレータにスペース・キーを押させることであ
る。スペース・キーによる入力があると一漢字分入力終
了と判定される。その２は，第１モードの場合に有効な
もので，発音記号の入力ののちに声調をオペレータに
１，２，３，４の数字で入力させることである。たとえ
ば「Ｚｈｏｎｇ」という発音で一声の抑揚をもつものに
ついては「Ｚｈｏｎｇ１」と入力させる。数字キー入力
があれば一漢字分のデータの入力終了と判定される。そ
の３は音節分割による自動認識により判定することであ
る。Pin Yin 法による発音の記述には一定の法則がある
ので，この法則を利用すれば，入力されたキー・データ
列のどこで一漢字分の表記が終了するのかを判定するこ
とができる。Zhu Yin 法による表記にも同様に一定の法
則があるからこれを利用できる。It is determined whether key data for one Chinese character has been input (step 48). There are various methods for this determination. The first is to have the operator press the space key at the end of key data input for one Chinese character. When the space key is pressed, it is determined that the input for one kanji has been completed. The second is effective in the case of the first mode, and is to allow the operator to input the tone with the numbers 1, 2, 3, 4 after inputting the phonetic symbols. For example, if the pronunciation is “Zhong” and has one voice intonation, “Zhong1” is input. If there is a numeric key input, it is determined that the input of data for one kanji has been completed. The third is to make a determination by automatic recognition by syllable division. Since there is a certain law in the description of pronunciation by the Pin Yin method, by using this law, it is possible to determine where in the entered key data string the notation for one Kanji character ends. Similarly, the Zhu Yin method can be used because there is a certain law.

【００６０】いずれにしても一漢字分のキー・データが
入力されると，先に選択されたPinYin ／音コード変換
テーブル31またはZhu Yin ／音コード変換テーブル32を
参照して，入力されたキー・データ（Pin Yin またはZh
u Yin 表記）がそれに対応する音コードに変換される。
この音コードは音コード列バッファに格納される（ステ
ップ49）。In any case, when the key data for one Chinese character is input, the input key is referred to with reference to the previously selected PinYin / sound code conversion table 31 or Zhu Yin / sound code conversion table 32.・ Data (Pin Yin or Zh
u Yin notation) is converted into the corresponding sound code.
This sound code is stored in the sound code string buffer (step 49).

【００６１】上述したステップ47の処理は一漢字分のキ
ー・データ入力が終了するまで繰返し続行され（ステッ
プ48），ステップ47〜49の処理は変換キーが押されるま
で繰返し実行される（ステップ46）。そして，変換キー
入力があると，音コード列バッファに蓄えられていた音
コード列が，図９から始まる漢字検索処理に渡される
（ステップ50）。The above-mentioned processing of step 47 is repeatedly continued until the input of key data for one Chinese character is completed (step 48), and the processing of steps 47 to 49 is repeatedly executed until the conversion key is pressed (step 46). ). Then, when the conversion key is input, the sound code string stored in the sound code string buffer is passed to the kanji search process starting from FIG. 9 (step 50).

【００６２】たとえば，Pin Yin 法により第２モードで
「ＺｈｏｎｇＧｕｏ」と入力された場合に，キー入力
データ「Ｚｈｏｎｇ」が音コード「５２ｆ８」に，「Ｇ
ｕｏ」が音コード「６６ｂ４」にそれぞれ変換され，音
コード列「５２ｆ８６６ｂ４」が得られることになる。For example, when "Zhong Guo" is input in the second mode by the Pin Yin method, the key input data "Zhong" is converted into the sound code "52f8" and "G".
"uo" is converted into the sound code "66b4", respectively, and the sound code string "52f866b4" is obtained.

【００６３】第２モードを指定した上で，声調を含めて
発音を入力してもよい。たとえば，「Ｚｈｏｎｇ１Ｇ
ｕｏ２」と入力することができる。この場合には「５３
ｆ８６７ｂ５」という音コード列が生成される。第２モ
ードが指定されているのであるから，フィルタ「ＦＥＦ
Ｃ」が設定されており（ステップ45），後に詳述する第
２モード検索が実行されることになる。It is also possible to specify the second mode and then input the pronunciation including the tone. For example, "Zhong1 G
You can enter "uo2". In this case, "53
The sound code string "f867b5" is generated. Since the second mode is specified, the filter "FEF
"C" is set (step 45), and the second mode search, which will be described in detail later, is executed.

【００６４】図９から図11は，とくに第２モードにおけ
る漢字検索処理の手順を示している。この処理は，フィ
ルタを「ＦＦＦＦ」に設定することにより，第１モード
における漢字検索処理にもそのまま適用される。また，
この処理は図１に示すコンピュータ10または図２に示す
漢字検索処理装置17により実行されよう。9 to 11 particularly show the procedure of the kanji search processing in the second mode. By setting the filter to "FFFF", this processing is also applied as it is to the Kanji search processing in the first mode. Also,
This processing may be executed by the computer 10 shown in FIG. 1 or the kanji search processing device 17 shown in FIG.

【００６５】この漢字検索処理の説明に先だち，辞書35
の構造について図12および図13を参照して説明する。図
12に示すように，辞書35には，インデックスＩテーブル
と，インデックスIIテーブルと，音コード列／漢字コー
ド対応テーブルとが設けられている。Before explaining this kanji search processing, the dictionary 35
The structure of will be described with reference to FIGS. 12 and 13. Figure
As shown in FIG. 12, the dictionary 35 is provided with an index I table, an index II table, and a phonetic code string / kanji code correspondence table.

【００６６】図13に示すように，音コード列／漢字コー
ド対応テーブルは，音コード列とその音コード列によっ
て表わされる発音をもつ単語を構成する一または複数の
漢字を表わす漢字コードとを対応づけて記憶するもので
ある。図13には分りやすくするために，漢字コードの代
わりに漢字そのものが図示されているが，実際には２進
数で表わされるコードが格納されていると理解された
い。As shown in FIG. 13, the phonetic code string / Kanji code correspondence table associates a phonetic code string with a Kanji code representing one or a plurality of Chinese characters forming a word having a pronunciation represented by the phonetic code string. It is something to remember. For the sake of clarity, FIG. 13 shows the kanji itself instead of the kanji code, but it should be understood that the code represented by a binary number is actually stored.

【００６７】「中国人」という単語は三つの漢字から構
成されるので，音コード列は６バイトにより構成され
る。二つの漢字から構成される単語（たとえば「中
国」）には４バイトの音コード列が対応する。一つの漢
字には２バイトの音コード列が対応する。このように，
先頭の漢字（上記の例では「中」）が共通する単語は近
接して配列され，かつ単語を構成する音コード列のバイ
ト数の多いものの方が相対アドレスを示す値が小さくな
るように並べられている。図13において，音コード列の
最後には「００００」を表わす記号「φ」が必ず存在す
る。Since the word "Chinese" is composed of three Chinese characters, the sound code string is composed of 6 bytes. A 4-byte sound code string corresponds to a word composed of two Chinese characters (for example, "China"). A 2-byte sound code string corresponds to one Kanji character. in this way,
Words that share the same Chinese character at the beginning (“middle” in the above example) are arranged close to each other, and arranged so that the value indicating the relative address becomes smaller as the number of bytes in the phonetic code string that constitutes the word becomes smaller. Has been. In FIG. 13, there is always a symbol "φ" representing "0000" at the end of the sound code string.

【００６８】一の発音が二つ以上の漢字を表現すること
がある。たとえば，相対アドレス１０２および１０３の
音コード列はともに「５３ｆ８」であり，これは漢字
「中」，「忠」等に対応する。One pronunciation may express two or more kanji. For example, the tone code strings of the relative addresses 102 and 103 are both “53f8”, which corresponds to the Chinese characters “middle”, “tada”, and the like.

【００６９】音コード列／漢字コード対応テーブルの相
対アドレスをｐで表わす。また，相対アドレスｐの音コ
ード列をＹ０（ｐ，１），Ｙ０（ｐ，２），…，φで表
現する。Ｙ０（ｐ，１），Ｙ０（ｐ，２）等を一般的に
Ｙ０（ｐ，Ｃ）（Ｃ＝１，２，…）で表現する。相対ア
ドレスｐの漢字コードをＫＡ（ｐ）（可変長）で表わ
す。The relative address of the phonetic code string / Kanji code correspondence table is represented by p. Further, the tone code string of the relative address p is expressed by Y0 (p, 1), Y0 (p, 2), ..., φ. Y0 (p, 1), Y0 (p, 2), etc. are generally represented by Y0 (p, C) (C = 1, 2, ...). The Kanji code of the relative address p is represented by KA (p) (variable length).

【００７０】音コード列／漢字コード対応テーブルに
は，できるだけ多くの（できれば，中国で用いられる殆
んどすべての）単語が格納される。これらの単語の配列
順序は，上述した規則を除いて自由である。したがっ
て，任意の記憶場所に任意の音コード列と漢字コードと
の対を配置することができる。音コード列／漢字コード
対応テーブルに配列された単語の数をＭとする。The phonetic code string / Kanji code correspondence table stores as many words as possible (preferably, almost all words used in China). The order of arranging these words is arbitrary except the above-mentioned rules. Therefore, it is possible to arrange a pair of a phonetic code string and a Kanji code in any memory location. Let M be the number of words arranged in the phonetic code string / Kanji code correspondence table.

【００７１】図12に戻って，インデックスＩテーブルお
よびインデックスIIテーブルは，ランダムに配列された
音コード列／漢字コード対応テーブルの音コード列を，
音コード列の表わす数値の大きさの順に検索できるよう
にするためのものである。Returning to FIG. 12, the index I table and the index II table store the sound code strings in the randomly arranged sound code string / kanji code correspondence table,
This is so that the search can be performed in the order of the numerical value represented by the sound code string.

【００７２】インデックスＩテーブルには，Ｎ個のＩｎ
ｄｅｘＩ（ｉ）が一定の順序で配列されている。Ｉｎｄ
ｅｘＩ（ｉ）はインデックスIIテーブルにおける対応す
る要素へのポインタ（インデックスIIテーブルの相対ア
ドレスを示す）である。Ｎは，音コード列／漢字コード
対応テーブルにおける異なる音コード列の個数である。
上述のように，一つの音コード列に二つ以上の単語が対
応することがありうるので，一般にＮ≦Ｍが成立する。The index I table has N Ins.
dexI (i) are arranged in a fixed order. Ind
exI (i) is a pointer (indicating the relative address of the index II table) to the corresponding element in the index II table. N is the number of different sound code strings in the sound code string / Kanji code correspondence table.
As described above, since two or more words may correspond to one phonetic code string, N ≦ M is generally established.

【００７３】インデックスIIテーブルにはＭ個の記憶場
所があり，各記憶場所に３種類の要素，Ｆ１（ｋ），Ｆ
２（ｋ），Ｆ３（ｋ）が格納されている。Ｆ３（ｋ）
は，音コード列／漢字コード対応テーブルにおける対応
する音コード列へのポインタ（対応テーブルの相対アド
レスを示す）である。Ｆ２（ｋ）は，それと同一の記憶
場所にあるＦ３（ｋ）がポイントする音コード列と同一
の音コード列をポイントするＦ３（ｋ）をもつインデッ
クスIIテーブル内の他の記憶場所（の相対アドレス）を
示す。これにより音コード列５３ｆ８について単語
「中」と「忠」の両方の検索が可能となる。同一の音コ
ード列が他にない場合にはＦ２（ｋ）＝φと置かれる。
Ｆ１（ｋ）は，それと同一の記憶場所にあるＦ３（ｋ）
がポイントする音コード列と同一の音コード列を上位に
含む音コード列（すなわち，Ｆ３（ｋ）がポイントする
音コード列よりも長い音コード列）をポイントするＦ３
（ｋ）をもつインデックスIIテーブル内の他の記憶場所
（の相対アドレス）を示す。これにより「中国」を検索
したときに，「中国」を含みかつこれよりも漢字数の多
い「中国人」が自動的に検索されることになる。The index II table has M storage locations, and each storage location has three types of elements, F1 (k) and F1.
2 (k) and F3 (k) are stored. F3 (k)
Is a pointer (indicating the relative address of the correspondence table) to the corresponding sound code sequence in the sound code sequence / Kanji code correspondence table. F2 (k) is the relative location of another storage location in the index II table that has F3 (k) that points to the same sound code sequence as the sound code sequence that F3 (k) points to in the same storage location. Address). As a result, both the words "medium" and "tada" can be searched for the sound code string 53f8. When there is no other same tone code string, F2 (k) = φ is set.
F1 (k) is the same memory location as F3 (k)
F3 that points to a sound code string that includes a sound code string that is the same as the sound code string that points to
FIG. 7 shows (relative address of) another storage location in the Index II table with (k). As a result, when "China" is searched, "Chinese" including "China" and having more kanji characters than this is automatically searched.

【００７４】インデックスＩテーブルにおけるＩｎｄｅ
ｘＩ（ｉ）は，音コード列／漢字コード対応テーブルに
おける音コード列の表わす数値の小さい順にあらかじめ
ソートされている。したがって，音コード列／漢字コー
ド対応テーブルにおける音コード列の配列がランダムで
あっても，インデックスＩテーブルを通して見たとき
に，音コード列／漢字コード対応テーブルにおいてはあ
たかも音コード列がそれの表わす数値が小さい順に配列
されているかのように見えることになる。Inde in the index I table
xI (i) is pre-sorted in ascending order of the numerical value represented by the sound code string in the sound code string / Kanji code correspondence table. Therefore, even if the arrangement of the sound code strings in the sound code string / Kanji code correspondence table is random, when viewed through the index I table, it is as if the sound code string is represented in the sound code string / Kanji code correspondence table. It looks as if the numbers are arranged in ascending order.

【００７５】図９から図11に示す漢字検索処理では２分
検索（binary search またはdichotomizing ）の手法が
用いられる。In the Kanji search processing shown in FIGS. 9 to 11, a binary search (dichotomizing) method is used.

【００７６】この漢字検索処理において，いくつかの変
数が用いられる。それらの変数はＳＴＡＲＴ，ＥＮＤ，
ｆｉｎｄ等である。変数ＳＴＡＲＴおよびＥＮＤはイン
デックスＩテーブルをアクセスするために用いられる。
ｆｉｎｄは見つかった漢字コードを記憶する漢字コード
・バッファ（図14参照）の記憶場所を指し示すために用
いられる。これらの変数はレジスタまたはメモリ・エリ
アに記憶されるデータとして実現される。Several variables are used in this Kanji search processing. These variables are START, END,
find, etc. The variables START and END are used to access the Index I table.
find is used to indicate the memory location of the kanji code buffer (see FIG. 14) that stores the found kanji code. These variables are implemented as data stored in registers or memory areas.

【００７７】入力，編集処理（図６）または編集処理装
置16から漢字検索処理または漢字検索処理装置17に与え
られる入力音コード列をｘ（１），ｘ（２），ｘ
（３），…，φで表わす。たとえば，Pin Yin 表記によ
って「Ｚｈｏｎｇ１Ｇｕｏ２」が入力されたときに
は，入力音コード列は「５３ｆ８６７ｂ５ φ」とな
る。すなわち，ｘ（１）＝５３ｆ８，ｘ（２）＝６７ｂ
５である。入力音コード列を構成する各音コードが入力
音コード列の何番目に位置しているものかを表わすため
に音コード・カウンタＣが用いられる。たとえば，ｘ
（１）はｘ（Ｃ）（Ｃ＝１）で表わされる。Input / edit processing (FIG. 6) or input sound code string given from the edit processing device 16 to the kanji search processing or kanji search processing device 17 is x (1), x (2), x
(3), ..., φ. For example, when “Zhong1 Guo2” is input by the Pin Yin notation, the input sound chord sequence is “53f8 67b5 φ”. That is, x (1) = 53f8, x (2) = 67b
It is 5. The sound code counter C is used to indicate the position of each sound code forming the input sound code string in the input sound code string. For example, x
(1) is represented by x (C) (C = 1).

【００７８】図９において，まず音コード・カウンタＣ
が１に初期化される（Ｃ＝１，ステップ51）。これによ
り，入力音コード列の先頭にある音コードｘ（１）が指
定される。In FIG. 9, first, the sound chord counter C
Are initialized to 1 (C = 1, step 51). As a result, the sound code x (1) at the head of the input sound code string is designated.

【００７９】続いて，変数ＳＴＡＲＴが０に，ＥＮＤが
（Ｎ−１）に，ｆｉｎｄが０にそれぞれ初期化される
（ステップ52）。Subsequently, the variable START is initialized to 0, END is initialized to (N-1), and find is initialized to 0 (step 52).

【００８０】変数ＳＴＡＲＴとＥＮＤを用いてインデッ
クスＩテーブルの相対アドレスが（ＳＴＡＲＴ＋ＥＮ
Ｄ）／２として算出され，これがｉと置かれる（ステッ
プ54）。これは，インデックスＩテーブルの相対アドレ
ス上の丁度中央に位置する相対アドレスを求める処理で
ある。２分探索法は，このように一連の相対アドレス
（一般的にいうと，一群の項目）を２つの部分に分け，
そのどちらかを選択し，選択した部分をさらに２つに分
けるというようにして目的の相対アドレス（項目）に到
達（見い出す）探索方法である。Using the variables START and END, the relative address of the index I table is (START + EN
It is calculated as D) / 2 and is set as i (step 54). This is a process for obtaining the relative address located exactly in the center of the relative address of the index I table. The binary search method divides a series of relative addresses (generally speaking, a group of items) into two parts,
This is a search method of reaching (finding) a target relative address (item) by selecting one of them and further dividing the selected part into two.

【００８１】算出された相対アドレスｉを用いてインデ
ックスＩテーブルがアクセスされ，その相対アドレスｉ
をもつ記憶場所に記憶されているＩｎｄｅｘＩ（ｉ）が
読み出される。このＩｎｄｅｘＩ（ｉ）はｋと置かれる
（ステップ55）。The index I table is accessed using the calculated relative address i, and the relative address i
The IndexI (i) stored in the storage location having is read. This IndexI (i) is placed as k (step 55).

【００８２】続いてＩｎｄｅｘＩ（ｉ）＝ｋを相対アド
レスとして用いてインデックスIIテーブルがアクセスさ
れ，その相対アドレスをもつ記憶場所に記憶されている
要素Ｆ１（ｋ），Ｆ２（ｋ），Ｆ３（ｋ）が読み出さ
れ，これらがそれぞれｐ１，ｐ２，ｐ３と置かれる（ス
テップ56）。Subsequently, the index II table is accessed using IndexI (i) = k as a relative address, and the elements F1 (k), F2 (k), F3 (k stored in the storage location having the relative address are accessed. ) Are read out and placed as p1, p2 and p3, respectively (step 56).

【００８３】図10を参照して，インデックスIIテーブル
から読み出された第３番目の要素Ｆ３（ｋ）＝ｐ３を相
対アドレスとして用いて音コード列／漢字コード対応テ
ーブルがアクセスされ，その相対アドレスをもつ記憶場
所に記憶されている音コードＹ０（ｐ３，Ｃ）が読み出
される。第２モードが選択されている場合にはＦＩＬＴ
ＥＲ＝ＦＥＦＣが設定されている（図６，ステップ4
5）。このＦＩＬＴＥＲと読み出された音コードＹ０
（ｐ３，Ｃ）とのＡＮＤ論理演算が行なわれる。また，
入力音コード列の音コード・カウンタＣによって指定さ
れる音コードｘ（Ｃ）とＦＩＬＴＥＲとのＡＮＤ論理演
算が実行される。これらの２つのＡＮＤ論理演算結果が
等しいかどうか，どちらが大きいかが比較される（ステ
ップ63，64）。Referring to FIG. 10, the tone code string / Kanji code correspondence table is accessed using the third element F3 (k) = p3 read from the index II table as a relative address, and the relative address The tone code Y0 (p3, C) stored in the memory location having the is read. FILT when the second mode is selected
ER = FEFC is set (Fig. 6, Step 4
Five). This FILTER and the read sound code Y0
An AND logical operation with (p3, C) is performed. Also,
The AND logic operation of the sound code x (C) designated by the sound code counter C of the input sound code string and the FILTER is executed. It is compared whether these two AND logical operation results are equal and which is larger (steps 63 and 64).

【００８４】上述したように，インデックスＩテーブル
を通して音コード列／漢字コード対応テーブルを見る
と，音コード列／漢字コード対応テーブルにおいて音コ
ード列はそれの表わす数値が小さい順に配列されてい
る。したがって，ＦＩＬＴＥＲＡＮＤｘ（Ｃ）＜ＦＩＬＴＥＲＡＮＤＹ０（ｐ３，Ｃ）式１が成立つということは，入力音コードｘ（Ｃ）の方が読
み出された音コードＹ０（ｐ３，Ｃ）よりも小さい，す
なわち捜し出すべきｘ（Ｃ）はＹ０（ｐ３，Ｃ）よりも
相対アドレスが小さい記憶場所に格納されているという
ことになる。ｘ（Ｃ）により近づくために相対アドレス
のより小さい記憶場所にアクセスする必要がある。すな
わち，インデックスＩテーブルの上半分をアクセスする
必要がある。そこで変数ＥＮＤにｉが代入され（ステッ
プ67），ステップ53を経てステップ54に戻ることにな
る。As described above, looking at the sound code string / Kanji code correspondence table through the index I table, the sound code strings in the sound code string / Kanji code correspondence table are arranged in ascending numerical order. Therefore, the fact that FILTER AND x (C) <FILTER AND Y0 (p3, C) is satisfied means that the input tone code x (C) is better than the read tone code Y0 (p3, C). It means that x (C) which is small, that is, to be searched for is stored in a memory location having a relative address smaller than that of Y0 (p3, C). In order to get closer to x (C), it is necessary to access a memory location with a smaller relative address. That is, it is necessary to access the upper half of the index I table. Then, i is substituted for the variable END (step 67), and the process returns to step 54 via step 53.

【００８５】ＦＩＬＴＥＲＡＮＤｘ（Ｃ）＞ＦＩＬＴＥＲＡＮＤＹ０（ｐ３，Ｃ）式２の場合には，変数ＳＴＡＲＴにｉが代入され（ステップ
68），同じようにステップ54に戻る。FILTER AND x (C)> FILTER AND Y0 (p3, C) In the case of Expression 2, i is substituted into the variable START (step
68), and similarly return to step 54.

【００８６】このようにして，２分探索法にしたがっ
て，入力音コードｘ（Ｃ）と一致する音コードを音コー
ド列／漢字コード対応テーブルにおいて検索していくこ
とになる。In this way, according to the binary search method, the sound code that matches the input sound code x (C) is searched in the sound code string / Kanji code correspondence table.

【００８７】遂に，ＦＩＬＴＥＲＡＮＤｘ（Ｃ）＝ＦＩＬＴＥＲＡＮＤＹ０（ｐ３，Ｃ）式３が成立すると，目的の音コードｘ（Ｃ）が格納されてい
る記憶場所が対応テーブルにおいて見付かったことにな
る。Finally, FILTER AND x (C) = FILTER AND Y0 (p3, C) When Expression 3 is established, it means that the memory location storing the target sound code x (C) is found in the correspondence table. ..

【００８８】Ｃ＝１であったとすると，次に２番目（Ｃ
＝２）の入力音コードｘ（Ｃ）と対応テーブルにおける
音コード列の２番目の音コードＹ０（ｐ３，Ｃ）とが一
致するかどうかをみるために，音カウンタＣがインクレ
メントされる（ステップ66）。If C = 1, then the second (C
= 2) input sound code x (C) and the second sound code Y0 (p3, C) of the sound code string in the correspondence table match, the sound counter C is incremented ( Step 66).

【００８９】インクレメントされたＣにおいて式３が成
立しなければ，式１または式２のいずれが成立するかに
応じて再び２分検索法にしたがって探索が行なわれてい
くことになる（ステップ64，67，68）。Ｃ＝１において
式３が成立するということは，対応テーブルにおいて見
付かったＹ０（ｐ３，Ｃ）の近傍に目的とする音コード
列が存在する筈であるから，わざわざ２分探索法を用い
ずに，その近傍を捜し出すようにしてもよい。また，後
述するように要素Ｆ１（ｋ）を利用して探索を行なうこ
ともできる。If the expression 3 is not satisfied in the incremented C, the search is again performed according to the binary search method depending on whether the expression 1 or the expression 2 is satisfied (step 64). , 67, 68). If C = 1 holds, it means that the target chord string exists near Y0 (p3, C) found in the correspondence table, so it is not necessary to use the binary search method. , You may search for the neighborhood. Further, as will be described later, the search can be performed using the element F1 (k).

【００９０】式３が成立する音コードＹ０（ｐ３，Ｃ）
を見付け出しながら音コード・カウンタＣをインクレメ
ントしていった結果，入力音コード列のすべての音コー
ドについて探索が終了し，遂にｘ（Ｃ）＝φとなったと
きには（ステップ61），対応テーブルにおける音コード
列の音コードにおいてもＹ０（ｐ３，Ｃ）＝φが成立す
るかどうかがチェックされる（ステップ69）。ここでＹ
ＥＳとなれば，入力音コード列と一致する音コード列が
見付かったことになり，図11に示す処理に移る。音コー
ドＹ０（ｐ３，Ｃ）がφでない場合には，その記憶場所
よりも相対アドレスの大きい位置に目的とする音コード
列が存在するのであるから，相対アドレスより大きい記
憶場所についてさらに検索を続けることになる（ステッ
プ70）。もっとも，入力音コード列をすべて含む音コー
ド列が見付かったのであるから，（「中国」を見付け出
そうとして「中国人」が見付かったような場合であ
る），そのすぐ近くに目的とする音コード列がある筈で
あるから，対応テーブルにおける相対アドレスを１つず
つインクレメントしていけば，すぐに目的とする音コー
ド列が見つかる筈である。Sound code Y0 (p3, C) for which expression 3 holds
As a result of incrementing the sound chord counter C while finding out, the search is completed for all the sound chords in the input sound chord string, and finally x (C) = φ (step 61), the action is taken. It is checked whether Y0 (p3, C) = φ holds in the sound chords of the sound chord string in the table (step 69). Where Y
If it becomes ES, it means that a sound code string that matches the input sound code string is found, and the process moves to the process shown in FIG. If the sound code Y0 (p3, C) is not φ, the target sound code string exists at a position having a relative address larger than that storage location, so the search is continued for storage locations larger than the relative address. This will be the case (step 70). However, since the sound code string that includes all the input sound code strings was found (as if "Chinese" was found in an attempt to find "China"), the target sound was found in the immediate vicinity. Since there should be a chord sequence, if the relative addresses in the correspondence table are incremented one by one, the intended chord sequence should be found immediately.

【００９１】さらに，ｘ（Ｃ）がまだφとなっていない
のに読み出した音コードＹ０（ｐ３，Ｃ）がφとなって
しまったときには（「中国」を捜しているのに対して
「中」が見付かったような場合）（ステップ62），相対
アドレスのより小さい記憶場所のサーチに移る（ステッ
プ65）。この場合にも，到達している記憶場所のすぐ近
くに目的とする文字コード列が存在する筈であるから，
対応テーブルの相対アドレスを１つずつデクレメントし
ていくことにより，迅速に目的とする文字コード列を見
付け出すことができよう。Further, when the read sound code Y0 (p3, C) becomes φ even though x (C) is not φ yet (while searching for "China", "Is found (step 62), the process moves to the search for a memory location with a smaller relative address (step 65). In this case as well, since the target character code string should exist in the immediate vicinity of the reached memory location,
By decrementing the relative addresses of the correspondence table one by one, it will be possible to quickly find the target character code string.

【００９２】ここでＦＩＬＴＥＲ＝ＦＥＦＣの意味につ
いて図15を参照して説明しておく。例えば，「Ｚｈｏｎ
ｇ」という声韵をもつ一声〜四声のいずれかの音コード
とＦＩＬＴＥＲ＝ＦＥＦＣとのＡＮＤを演算すると，声
調を無視した声韵「Ｚｈｏｎｇ」を表わす音コードに変
換される。したがって，第２モードの下で漢字検索を行
った場合には，声調に関係なく声韵が同一のすべての漢
字が見つかることになる。この様子が図16に示されてい
る。「Ｚｈｏｎｇ１Ｇｕｏ２」というように声調を含
めて入力しても，「ＺｈｏｎｇＧｕｏ」というように
声調なしで入力しても，その入力音コードをフィルタを
通すことにより，「中国」という単語について式３が成
立し（Ｃ＝１，Ｃ＝２の両方において），この単語が見
付けられる。これがステップ63，64の処理におけるフィ
ルタを通す意味である。Here, the meaning of FILTER = FEFC will be described with reference to FIG. For example, "Zhon
When an AND of any one of the one-to-four-tone sound code having the voice g and the FILTER = FEFC is calculated, the tone code is converted into a sound code representing the voice "Zhong". Therefore, when a kanji search is performed under the second mode, all kanji with the same vocal gram can be found regardless of the tone. This is shown in FIG. Whether inputting with a tone such as "Zhong1 Guo2" or without inputting a tone such as "Zhong Guo", the input sound code is filtered to obtain the expression "3" for the word "China". Holds (in both C = 1 and C = 2), the word is found. This is the meaning of passing the filter in the processing of steps 63 and 64.

【００９３】第１モードにおいてはフィルタを必ずしも
使用する必要はなく，入力音コードと対応テーブルの音
コードとを直接に比較すればよいが，ＦＩＬＴＥＲ＝Ｆ
ＦＦＦを用いれば，ステップ63，64，式１〜式３をその
まま利用することができる。In the first mode, it is not always necessary to use the filter, and the input sound code and the sound code in the correspondence table may be directly compared. FILTER = F
If FFF is used, steps 63, 64 and equations 1 to 3 can be used as they are.

【００９４】入力音コード列と一致する（フィルタを通
して）音コード列が見つかると（ステップ69），図11を
参照して，変数ｆｉｎｄがインクレメントされて漢字コ
ード・バッファの記憶場所が指定され，見つかった音コ
ード列に対応して格納されている漢字コードＫＡ（ｐ
３）が音コード列／漢字コード対応テーブルから読み出
され，漢字コード・バッファの変数ｆｉｎｄによって指
定された記憶場所に格納される（ステップ71）。When a tone code string that matches the input tone code string (through a filter) is found (step 69), referring to FIG. 11, the variable find is incremented to specify the storage location of the Kanji code buffer. Kanji code KA (p stored in correspondence with the found sound code string
3) is read out from the phonetic code string / Kanji code correspondence table and stored in the memory location designated by the variable find of the Kanji code buffer (step 71).

【００９５】続いて，インデックスIIテーブルの第２番
目の要素Ｆ２（ｋ）を利用して，同じ発音をもつ対応テ
ーブルの音コード列をポイントするインデックスIIテー
ブルの他の記憶場所の要素Ｆ３（ｋ）に基づいて対応テ
ーブルをアクセスし，そこに記憶されている同じ発音を
もつ漢字コードを，変数ｆｉｎｄをインクレメントして
新しい記憶場所を指定した上で，漢字コード・バッファ
に格納する（ステップ73）。要素Ｆ２（ｋ）によってリ
ンクされている同じ発音をもつ漢字コードが順次読み出
されて漢字コード・バッファに格納されていく（ステッ
プ73，74の繰返し）。これにより「中」に加えて「忠」
等が漢字の候補として見付け出されていくことになる。Then, by using the second element F2 (k) of the index II table, the element F3 (k) of another storage location of the index II table that points to the note code string of the corresponding table having the same pronunciation. ), The corresponding kanji code having the same pronunciation stored therein is stored in the kanji code buffer after incrementing the variable find to specify a new storage location (step 73). ). The Kanji codes having the same pronunciation linked by the element F2 (k) are sequentially read out and stored in the Kanji code buffer (repeat steps 73 and 74). As a result, in addition to "Medium"
Will be found as a candidate for kanji.

【００９６】Ｆ２（ｋ）＝ｐ２＝φとなれば（ステップ
72），入力音コード列を含みかつこれよりも長い音コー
ド列が，要素Ｆ１（ｋ）＝ｐ１を用いて検索される。要
素Ｆ１（ｋ）によってポイントされるインデックスIIテ
ーブル内の他の（第２番目の）記憶場所に格納されてい
る要素Ｆ３（ｋ）によってアクセスされる漢字コードが
対応テーブルから読み出され，変数ｆｉｎｄをインクレ
メントした上で，漢字コード・バッファに格納される
（ステップ76）。第２番目の記憶場所の要素Ｆ１（ｋ）
によってリンクされるさらに他の記憶場所があれば，同
じようにして，上記のさらに他の記憶場所の要素Ｆ３
（ｋ）によってアクセスされる漢字コードが読み出され
て漢字コード・バッファに格納される（ステップ75〜7
7）。Ｆ１（ｋ）＝ｐ１＝φであればすべての処理が終
る（ステップ75) 。When F2 (k) = p2 = φ is satisfied (step
72), a sound code string that includes and is longer than the input sound code string is searched using the element F1 (k) = p1. The Kanji code accessed by the element F3 (k) stored in the other (second) storage location in the index II table pointed to by the element F1 (k) is read from the correspondence table and the variable find is used. Is incremented and stored in the Kanji code buffer (step 76). Element F1 (k) of second memory location
If there are other memory locations linked by, in the same way, the element F3 of the further memory location described above.
The kanji code accessed by (k) is read and stored in the kanji code buffer (steps 75 to 7).
7). If F1 (k) = p1 = φ, all processing is finished (step 75).

【００９７】また，２分探索を繰返して遂にＳＴＡＲＴ
＋１≧ＥＮＤになった場合には入力音コード列に該当す
る音コード列が見付からなかったとして漢字検索処理が
終了する（ステップ53）。Moreover, the binary search is repeated and finally START is started.
If + 1 ≧ END, it means that the sound code string corresponding to the input sound code string has not been found, and the Kanji search processing ends (step 53).

【００９８】図17は上述した処理を実現するハードウェ
ア・アーキテクチャを示すものであり，漢字検索処理装
置17の構成例を示している。FIG. 17 shows a hardware architecture for realizing the above-mentioned processing, and shows an example of the configuration of the kanji search processing device 17.

【００９９】音コード列／漢字コード対応テーブル81は
図12または図13に示すものと同じである。検索読出し回
路82は，この対応テーブルから順次，または入力音コー
ド列を参照して音コード列を読み出すものである。フィ
ルタ・レジスタ83ＡにはデータＦＦＦＦが，フィルタ・
レジスタ83ＢにはデータＦＥＦＣがそれぞれ格納されて
いる。変換モード・キー24による変換モード選択に応じ
て，マルチプレクサ84は，第１モードのときにはレジス
タ83Ａを，第２モードのときにはレジスタ83Ｂをそれぞ
れ選択し，そこに格納されているフィルタ・データをＡ
ＮＤ回路85，86に与える。The sound code string / Kanji code correspondence table 81 is the same as that shown in FIG. 12 or FIG. The retrieval / readout circuit 82 reads out the tone code sequence from this correspondence table sequentially or by referring to the input tone code sequence. In the filter register 83A, the data FFFF is
The data FEFC is stored in the register 83B. In response to the conversion mode selection by the conversion mode key 24, the multiplexer 84 selects the register 83A in the first mode and the register 83B in the second mode, and selects the filter data stored therein as A
It is given to the ND circuits 85 and 86.

【０１００】ＡＮＤ回路85には入力音コード列が，ＡＮ
Ｄ回路86には対応テーブル81から読み出された音コード
列がそれぞれ与えられる。ＡＮＤ回路85，86において入
力音コード列，音コード列のフィルタリングがそれぞれ
行なわれ，それらの出力データは比較回路87において比
較される。比較回路87は２つの入力データが一致したと
きのみ一致信号を出力する。この一致信号によってゲー
ト88がイネーブルとされる。また，一致信号は検索読出
し回路82に与えられ，検索読出し回路82は一致した音コ
ードに対応するすべての漢字コードを対応テーブルから
読み出してゲート88に与えるので，この漢字コードはゲ
ート88を通って漢字コード・バッファ89に蓄えられるこ
とになる。In the AND circuit 85, the input tone code string is
The tone code strings read from the correspondence table 81 are given to the D circuits 86, respectively. The AND circuits 85 and 86 respectively filter the input sound code string and the sound code string, and the output data of them are compared in the comparison circuit 87. The comparison circuit 87 outputs the coincidence signal only when the two input data coincide. This match signal enables the gate 88. Further, the coincidence signal is given to the search and read circuit 82, and the search and read circuit 82 reads all the kanji codes corresponding to the matched sound chords from the correspondence table and gives them to the gate 88. Therefore, this kanji code passes through the gate 88. It will be stored in the Kanji code buffer 89.

【０１０１】上記実施例においては，第１モードと第２
モードがあり，フィルタＦＦＦＦおよびＦＥＦＣを用い
て声調を含めた，および声調を無視した漢字検索が行な
われるようになっているが，さらに他のフィルタを用い
て他のモードの漢字検索を行うこともできる。たとえ
ば，16進数表現で００ＦＣというフィルタを用いれば，
入力音コードと声母が一致する音コードをもつすべての
漢字の検索が可能となろう。In the above embodiment, the first mode and the second mode
There is a mode, and the Kanji search including the tone and ignoring the tone is performed by using the filters FFFF and FEFC, but it is also possible to perform the Kanji search of other modes by using other filters. it can. For example, if you use the filter 00FC in hexadecimal notation,
It will be possible to search for all kanji whose phonetic code matches the input phonetic code.

【０１０２】また，入力装置としてはキーボードに限ら
ず，発音そのものを入力するものでもよい。この場合に
は発音を表わす電気信号をそれに対応する音コードに変
換する音声認識装置が利用されよう。The input device is not limited to the keyboard but may be a device for inputting the pronunciation itself. In this case, a voice recognition device which converts an electric signal representing pronunciation into a corresponding sound code may be used.

[Brief description of drawings]

【図１】中国語の発音表記／漢字変換装置の電気的構成
を示すブロック図である。FIG. 1 is a block diagram showing an electrical configuration of a Chinese phonetic transcription / kanji conversion device.

【図２】中国語の発音表記／漢字変換装置の主要部分の
ハードウェア構成を示す，または機能的観点からみた構
成を示すブロック図である。FIG. 2 is a block diagram showing a hardware configuration of a main part of a Chinese phonetic transcription / kanji conversion device, or a configuration viewed from a functional point of view.

【図３】Pin Yin ／音コード変換テーブルの一例を示
す。FIG. 3 shows an example of a Pin Yin / sound code conversion table.

【図４】Zhu Yin ／音コード変換テーブルの一例を示
す。FIG. 4 shows an example of a Zhu Yin / sound code conversion table.

【図５】(A) は音コードのデータ・フォーマットを示
し，(B) は声調を表わすコードを示す。5A shows a data format of a tone code, and FIG. 5B shows a code representing a tone.

【図６】入力，編集処理の手順を示すフロー・チャート
である。FIG. 6 is a flow chart showing a procedure of input / edit processing.

【図７】キー・データ・バッファにキー入力データが格
納されていく様子を示す。FIG. 7 shows how key input data is stored in a key data buffer.

【図８】音コード列バッファに音コードが格納されてい
く様子を示す。FIG. 8 shows how a sound code is stored in a sound code string buffer.

【図９】漢字検索処理手順を示すフロー・チャートであ
る。FIG. 9 is a flow chart showing a kanji search processing procedure.

【図１０】漢字検索処理手順を示すフロー・チャートで
ある。FIG. 10 is a flow chart showing a kanji search processing procedure.

【図１１】漢字検索処理手順を示すフロー・チャートで
ある。FIG. 11 is a flow chart showing a kanji search processing procedure.

【図１２】辞書の構造を示す。FIG. 12 shows the structure of a dictionary.

【図１３】音コード列／漢字コード対応テーブルの一例
を示す。FIG. 13 shows an example of a phonetic code string / Kanji code correspondence table.

【図１４】漢字コード・バッファを示す。FIG. 14 shows a Kanji code buffer.

【図１５】フィルタリングにより音コードが変換される
様子を示す。FIG. 15 shows how a sound code is converted by filtering.

【図１６】フィルタを利用した漢字検索の様子を示す。FIG. 16 shows a state of kanji search using a filter.

【図１７】漢字検索装置のハードウェア・アーキテクチ
ャを示す，または漢字検索処理の機能に着目して表わさ
れたブロック図である。FIG. 17 is a block diagram showing the hardware architecture of a kanji search device or focusing on the function of kanji search processing.

[Explanation of symbols]

10 コンピュータ 12 表示制御装置 13 キャラクタ・ジェネレータ 14 表示装置 15 切換回路 16 編集処理装置 17 漢字検索処理装置 20 キーボード 21 Pin Yin キー 22 Zhu Yin キー 23 入力モード・キー 24 変換モード・キー 25 その他の機能キー 30 メモリ装置 31 Pin Yin ／音コード 32 Zhu Yin ／音コード 33 音コード／Pin Yin 34 音コード／Zhu Yin 35 辞書 36 変換後データ 81 音コード列／漢字コード対応テーブル 82 検索読出し回路 83Ａ，83Ｂフィルタ・レジスタ 84 マルチプレクサ 85，86 ＡＮＤ回路 87 比較回路 88 ゲート 89 バッファ 10 Computer 12 Display control device 13 Character generator 14 Display device 15 Switching circuit 16 Edit processing device 17 Kanji search processing device 20 Keyboard 21 Pin Yin key 22 Zhu Yin key 23 Input mode key 24 Conversion mode key 25 Other function keys 30 Memory device 31 Pin Yin / Sound code 32 Zhu Yin / Sound code 33 Sound code / Pin Yin 34 Sound code / Zhu Yin 35 Dictionary 36 Data after conversion 81 Sound code string / Kanji code correspondence table 82 Search / read circuit 83A, 83B Filter・ Register 84 Multiplexer 85, 86 AND circuit 87 Comparison circuit 88 Gate 89 Buffer

Claims

[Claims]

1. An input device capable of inputting Chinese pronunciations according to a plurality of types of notations, and a plurality of types of notations that can be input using the input device, respectively, and input data according to each notation is provided. , A plurality of conversion tables for converting into a sound code corresponding to the pronunciation represented by the input data, a dictionary storing the sound code and the Kanji code indicating the Kanji having the pronunciation represented by the sound code in association with each other, And control means for converting the input data input from the input device into a sound code using any of the plurality of conversion tables and searching the dictionary for a Kanji code corresponding to the converted sound code. Chinese phonetic transcription / kanji conversion device.

2. An input mode selection means for selecting one of a plurality of types of notation is further provided, and the control means inputs using a conversion table relating to the notation selected by the input mode selection means. The apparatus according to claim 1, which converts data into sound codes.

3. The apparatus according to claim 1, wherein the control means determines a notation based on input data input from the input device, and selects a conversion table to be used according to the determination.

4. The input device comprises a device for converting an input voice into an electric signal and a voice recognition device for recognizing pronunciation based on the voice electric signal and converting the input voice into a sound code. The apparatus according to Item 1.

5. The method according to claim 1, further comprising means for converting the retrieved Kanji code into display data representing the Kanji indicated by the Kanji code, and a device for displaying the Kanji based on the display data. apparatus.

6. The apparatus according to claim 5, further comprising: a designation input means for designating one of the displayed kanji candidates, and a memory for storing a kanji code indicating the designated kanji. .

7. Each of the conversion tables converts input data into a phonetic code for one Kanji character, and the dictionary associates a phonetic code string and a Kanji code with each other for a word consisting of one Kanji character or a plurality of Kanji characters. The control means divides the input data input from the input device into each kanji character and converts it into a sound code, and arranges the converted sound code in word units to create a sound code string. The device according to claim 1, wherein the dictionary is searched for a Kanji code corresponding to this phonetic code string.

8. Further comprising filtering means for masking a predetermined one or a plurality of bits constituting a sound code,
The control means filters the sound code corresponding to the input data and the sound code of the dictionary by using the filtering means, and then compares the sound code corresponding to the input data with the sound code corresponding to the input data. The apparatus of claim 1, located in.

9. A conversion means for converting input data representing a Chinese pronunciation into a sound code corresponding to the pronunciation.
From a dictionary in which a phonetic code and a kanji code indicating a kanji having a pronunciation represented by the phonetic code are stored in association with each other, filtering means for masking a predetermined one or more bits constituting the phonetic code, and the conversion means The obtained sound code and the sound code in the dictionary are filtered by the filtering means and compared with each other, thereby searching the sound code in the dictionary for a sound code that matches the sound code obtained from the conversion means, and matching the sound code. A Chinese phonetic transcription / Kanji conversion device comprising control means for reading out a Kanji code corresponding to a phonetic code from the dictionary.

10. The apparatus according to claim 9, wherein the phonetic code includes a bit representing a voice vowel, a bit representing a claw vowel, and a bit representing a tone.

11. The apparatus according to claim 10, wherein said filtering means masks a bit representing a vocal vowel, a bit representing a vowel or a bit representing a tone.

12. The apparatus according to claim 9, wherein the filtering means includes one that allows a sound code to pass through.

13. The apparatus further comprises an input device capable of inputting Chinese pronunciations according to a plurality of types of notations, and the conversion means is provided for each of a plurality of types of notations that can be input using the input device. 10. The apparatus according to claim 9, further comprising a plurality of conversion tables for converting input data according to each notation into sound codes corresponding to pronunciations represented by the input data.

14. The apparatus according to claim 9, wherein the conversion means is a voice recognition means for recognizing a pronunciation based on a voice input signal and outputting a sound code corresponding to the pronunciation.

15. The apparatus according to claim 9, further comprising search mode selection means for selecting whether to use the filtering means or to select one of the plurality of filtering means.

16. An input mode selecting means for selecting one of a plurality of types of notation is further provided, and the converting means inputs using a conversion table relating to the notation selected by the input mode selecting means. 14. The device according to claim 13, which converts data into phonetic chords.

17. The method according to claim 9, further comprising means for converting the read Kanji code into display data representing the Kanji indicated by the Kanji code, and a device for displaying the Kanji based on the display data. Equipment.

18. The apparatus according to claim 17, further comprising a designation input means for designating one of the displayed kanji candidates, and a memory for storing a kanji code indicating the designated kanji. .

19. The conversion means converts input data for one Chinese character into a phonetic code, and the dictionary stores a phonetic code string and a Chinese character code in association with one Chinese character or a word consisting of a plurality of Chinese characters. The control means controls the conversion means so that the input data is divided into each kanji and converted into a sound code, and the converted sound code is arranged in word units to create a sound code string. 10. The apparatus according to claim 9, wherein the dictionary is searched for a kanji code corresponding to this phonetic code string.

20. Chinese pronunciation can be input according to a plurality of types of notations, and for each of the plurality of types of notations that can be input, the input data according to each notation corresponds to the pronunciation represented by the input data. A plurality of conversion tables for converting to sound codes, and sound codes,
Prepare a dictionary that stores the Kanji code that indicates the Kanji that has the pronunciation represented by the phonetic code in correspondence with each other, and convert the input data to a phonetic code using one of the above conversion tables A phonetic transcription / kanji conversion method for Chinese, in which a kanji code corresponding to the selected phonetic code is searched in the dictionary.

21. A dictionary in which a phonetic code and a Kanji code indicating a Kanji having a pronunciation represented by the phonetic code are stored in association with each other is prepared in advance, and input data representing the Chinese pronunciation is stored. , Converted to the sound code corresponding to the pronunciation, and the sound code obtained by the conversion,
The sound code in the dictionary is filtered by masking one or more bits that make up the sound code and then compared with each other, and a sound code matching the sound code obtained by the conversion is searched in the dictionary. Then, the Chinese phonetic transcription / Kanji conversion method that reads the Kanji code corresponding to the matching phonetic code from the above dictionary.