JPS58159133A

JPS58159133A - Kana (japanese syllabary)-to-kanji (chinese character) conversion processing system

Info

Publication number: JPS58159133A
Application number: JP57042110A
Authority: JP
Inventors: Hirokawa Hayashi; 林　大川; Yoshitoshi Yamauchi; 佐敏山内
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1982-03-17
Filing date: 1982-03-17
Publication date: 1983-09-21

Abstract

PURPOSE:To facilitate the processing of KANA-KANJI conversion, by changing the paragraph in accordance with the atribute for a Chinese character processing system in which an input sentence is decmposed into paragraphs centering on independent words and supplying them in the form of a KANA sentence and then by selecting the paragraph an operator desires. CONSTITUTION:An input sentence is decomposed into paragrahs centering on independent words by means of a control key and then supplied in the form of a KANA sentence. Then the information on English letter and numeric character paragraphs, etc. are recognized at an input preprocessing part 1, and then a KANA letter string to be converted is extracted to be sent to a word extracting part 2. Here the candidates of word strings which can be grammatically allowed are extracted with retrieval of a dictionary. While a homophone discriminating part 3 decides the most prominent candidate among word string candidates. An output control part 4 displays the corresponding characters for each paragraph. If the result of display is opposed to the desire of an operator, the decomposing method of paragraph is changed through the part 1. Then the extraction 2 of words and discrimination 3 of homophones for output control 4. As a result, the KANA-Chinese character conversion processing is facilitated.

Description

【発明の詳細な説明】本発明は邦文ワードプルセッサ等におけるカナ漢字変換
処理方式に関し、特に文節区切り情報を与えるカナ漢字
変換処理方式における文書作成作業の処理速度を向上さ
せ得るカナ漢字弯換処瑠方式に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a kana-kanji conversion processing method in a Japanese word processor, etc., and in particular to a kana-kanji conversion processing method that can improve the processing speed of document creation work in a kana-kanji conversion processing method that provides bunsetsu separation information. Regarding the Ru method.

カナ漠字憂換（以下、単に「変換」ともいう。）処理方
式に関しては従来から種々の方式が提素されている。従
来の変換方式においては、文節指定、単語指定等の差は
あっても、一般には、入力仮名文字列に対する可能な変
換文字列を全て探索して変換結果を出力表示するもので
あった。そして、同音異字飴（以下「同音飴」という。Various methods have been proposed in the past regarding kanabakuji yukan (hereinafter also simply referred to as "conversion") processing methods. Conventional conversion methods generally search for all possible conversion character strings for an input kana character string and output and display the conversion results, even though there are differences in phrase designation, word designation, etc. And homophone candy (hereinafter referred to as ``homophone candy'').

）がある場合には最有力候補から変換文字列を表示し−
（オペレータに選択、ｗｉｈを求め、２ベレータの意図
しない変換文字列のときは次候補キーを押すことにより
、別の変候文字夕すが替って表示される。この過程でオ
ペレータの意図したものがあればオペレータが選択キー
を押すことにより変換文字列が確定するというものであ
った。), display the converted string from the most likely candidates.
(If you ask the operator to select and wih, and if it is an unintended conversion character string of 2 berators, press the next candidate key, and another conversion character string will be displayed instead. If there is one, the operator presses the selection key to confirm the converted character string.

従来の変換方式においては、ある陵換属性で入力文字列
の変換を行った鮎、オペレータの意図するものが出力己
れない場合には、その部分を削除し前記入力文字列を再
度入力しＣ異なった変換属性によ０る変換を試みるよつ
にしていた。しかし、これではオペレータの負担が大き
くなりすぎることから、取消キーを設けて、これを押す
ことにより１つ前の操作を取消すようにする方式も提案
されている。In the conventional conversion method, if an input character string is converted with a certain conversion attribute, and if the output is not what the operator intended, that part is deleted and the input string is input again. I was trying to convert 0 with different conversion attributes. However, since this places too much of a burden on the operator, a method has also been proposed in which a cancel key is provided and the previous operation can be canceled by pressing the cancel key.

本発明はこれを更に改良し、前述の如き変換属性の変更
をより容易にしてオペレータの負担を軽減することが可
能なカナ漢字変換方式を提供することを目的とするもの
である。It is an object of the present invention to further improve this and provide a kana-kanji conversion method that can more easily change the conversion attributes as described above and reduce the burden on the operator.

本発明の上記目的は、入力文を、自立語を中心とする分
かち書き単位に分解して仮名文字で入力し、これに対応
する漢字カナ混じり文を逐次得るカナ漢字変換処理方式
において、入力文節の属性に応じて異なった変換方法を
指定可能な複数の置換キーを設けて、オペレータの最初
の変換キー操作により出力表示された変換候補の中にオ
ペレータの意図する変換文字列がないとき、他の属性を
有する変換キーを操作することにより、入力文字列を最
初とは興なる置換方法で再変換することを特徴とするカ
ナ漢字変換処理方式によって達成される。The above-mentioned object of the present invention is to provide a kana-kanji conversion processing method in which an input sentence is broken down into division units centered on independent words and input as kana characters, and corresponding sentences containing kanji and kana are sequentially obtained. By providing multiple replacement keys that can specify different conversion methods depending on the attribute, when the conversion string that the operator intends is not among the conversion candidates output and displayed by the operator's first conversion key operation, other conversion strings can be used. This is achieved by a kana-kanji conversion processing method characterized in that an input character string is reconverted using a replacement method different from the original one by operating a conversion key having an attribute.

以下、本発明の実施例を図面に基づいて詳細に説明する
。Embodiments of the present invention will be described in detail below with reference to the drawings.

１１１図は本発明の一実施例であるカナ漢字変換処理の
プ田ツタ図である。図において、１は入力前処理部、２
は単−抽出部、８は同音語判別部、番は出力制御部モし
て６は変換制御部である。FIG. 111 is a diagram of kana-kanji conversion processing according to an embodiment of the present invention. In the figure, 1 is an input preprocessing unit, 2
Reference numeral 8 indicates a single extraction section, 8 indicates a homophone discrimination section, numeral 6 indicates an output control section, and 6 indicates a conversion control section.

日本−文が［４文で入力されると、以下の如き処理を経
て漢字力す混じり文として出力される。When a Japanese sentence is input as 4 sentences, it is output as a kanji mixed sentence through the following processing.

入力前処理部１は、入力便名文中の英数字、文節区切り
情報等を■謙して置換対象となる仮名文字列を艙出し、
変換制御部６の制御の下に単語抽出部２に、変換単位と
な４便名文字列を渡す。単語抽出部２は前記仮名文字列
から、カナを見出しとする単一辞書および蒙単語辞書に
付加されている■の品詞情報、品調肩の接続情報を納め
た辞書等を参照し、前記仮鳴文字列と辞書見出しとの一
致を試み、文法的に許容される単語列の候補を抽出する
。同音１１１１１１１１１３は、上記単一列の候補が複
歇傭存夜する場合に、単語の持つ頻度情報等を用いて最
有力単一列を決定する。出力制御部４は、上述の如く決
定された文節の対応文字を出方表示装置に表示する。そ
の後、入力前処理部１が制御キーの次候補を指示する制
御信号を受けると、初めに表示した最有力候補の代りに
次点となった候補を表示する。The input preprocessing unit 1 extracts a kana character string to be replaced by examining alphanumeric characters, bunsetsu delimiter information, etc. in the input flight number sentence,
Under the control of the conversion control section 6, four flight name character strings, which are conversion units, are passed to the word extraction section 2. The word extraction unit 2 extracts the kana character string from the kana character string by referring to a single dictionary with kana as the heading and a dictionary containing part-of-speech information of ■ added to the Mongolian word dictionary, connection information of the tone shoulder, etc. It attempts to match the sound string with the dictionary entry and extracts grammatically acceptable word string candidates. Hoson 1111111113 determines the most likely single string using the frequency information of the words, etc. when the candidates for the single string are present multiple times. The output control unit 4 displays the characters corresponding to the clause determined as described above on the appearance display device. Thereafter, when the input preprocessing section 1 receives a control signal instructing the next candidate for the control key, the second candidate is displayed instead of the most likely candidate that was initially displayed.

単語抽出部２は第２図に示す如く、文節単位仮名文字列
記憶部２人、単語抽出制御部２　’ｓ　ｓ候捕単語記憶
部２０、辞書検索制御部２Ｄおよび置換辞書記憶部２ｍ
から虞っている。文節単位仮名文字列記憶部２人は文節
単位文字列信号を受取り、これを単語抽出制御部２Ｂに
送り、単語抽出制御部２Ｂはこれを単語に分解し辞書検
索制御部２Ｄを通して変換辞書記憶部２ｍに記憶されて
いる内容と照合する。この処理により見出された変換候
補は候補単語記憶部２０に格納される。As shown in FIG. 2, the word extraction section 2 includes two clause unit kana character string storage sections, a word extraction control section 2's s candidate word storage section 20, a dictionary search control section 2D, and a replacement dictionary storage section 2m.
I'm afraid of this. Clause-based kana character string storage unit The two receive the clause-based character string signal and send it to the word extraction control unit 2B, which breaks it down into words and sends it to the conversion dictionary storage unit through the dictionary search control unit 2D. Check with the contents stored in 2m. The conversion candidates found through this process are stored in the candidate word storage section 20.

本実施例装置においては、制御キーとして、次候傭を選
択するものの他に、以下に例を示す如き入力文節中に含
まれる単語の属性に応じた変換キーを設けている。In the device of this embodiment, in addition to the control keys for selecting the next choice, there are also conversion keys according to the attributes of the words contained in the input phrase, as shown in the following example.

■変換キー：仮名文字列を入力してこのキーを押すと、
変換システムが入力仮名文字列に対応する一般語を含む
単−辞書内を探索する。■Conversion key: Enter a kana character string and press this key.
The conversion system searches a single dictionary containing common words corresponding to the input kana string.

■固有名詞変換キー：仮名文字列を入力してこのキーを
拝すと、固有名調を含む単語辞書内を探索する。■Proper noun conversion key: When you enter a kana character string and press this key, the dictionary will search for words that include proper names.

（５）ユーザ辞書変換キー二仮名文字亮を入力してこの
キーを拝すと、ユーザ登録語を含む単語辞書内を探索す
る。(5) User dictionary conversion key When you input the Nikana character Ryo and press this key, the word dictionary containing the user registered words is searched.

（４）専門語辞書変換キー：仮名文字列を入力してこの
キーを押すと、ユーザ別あるいは分野別の指定された専
門語を含む単語辞書内をＩＩ＊する。(4) Technical term dictionary conversion key: When a kana character string is input and this key is pressed, the text in the word dictionary containing the specified technical term by user or field is displayed.

（５）無変換キー：仮名文字列を入力してこのキーを押
すと、入力仮名文字列を置換せずそのまま出力表示する
。(5) No conversion key: When a kana character string is input and this key is pressed, the input kana character string is output and displayed as is without being replaced.

また、例えば前記入力前処理部１内に、入力文字列バッ
ファ領域を設け、ここには前記制御キーを押す前に入力
された仮名文字列を格納する。この文字列は各変換キー
がそれに対応する変換が実行された後も、次の文字列の
入力が行われるまではそのまま保持される。Further, for example, an input character string buffer area is provided in the input preprocessing section 1, and a kana character string inputted before the control key is pressed is stored here. This string remains unchanged even after the conversion corresponding to each conversion key is performed until the next string is input.

以下、本実施例のカナ漢字変換処理を第３図に示した７
０−チャートに従って説明する。Below, the kana-kanji conversion process of this embodiment is shown in Figure 3.
0-Explain according to the chart.

まず、キーを選択する。通常は文字キーを選択し、変換
処理を行う文節の仮名文字列を人力する。First, select a key. Normally, a character key is selected and the kana character string of the phrase to be converted is manually entered.

入力された文字列は前記入力文字列バッファに格納され
る。The input string is stored in the input string buffer.

次に、再びキー選択を行う。この場合のキー選択は、前
記入力文字列が一般語であれば前記「変換キー」を、ま
た前記入力文字列が固有名詞であることが明らかな場合
に１よ、前記「固有名調変換キー」を選択するという如
く、オペレータの判断によって行うようにする。これに
より−、正しい変換を行う率（慶換率）が格段に向上す
るという効果を奏する。Next, perform key selection again. In this case, the key selection is to use the "conversion key" if the input character string is a general word, or 1 to select the "proper name conversion key" if the input character string is clearly a proper noun. ” is selected at the discretion of the operator. This has the effect of significantly improving the rate of correct conversion (conversion rate).

この場合、もし、オペレータが判断を誤っても、最初の
選択キーによる探索終了後、これに気付いた時点で正し
い変換キーを押し直すことにより、簡単に〜他の変換操
作に移行することができる。また、オペレータがどの変
換キーを選択すべきか分らない場合には、一般的なもの
から特殊なものへと順次探索範囲を拡げる如く操作する
ようにしておけば良い。In this case, even if the operator makes a mistake in judgment, he can easily move on to another conversion operation by pressing the correct conversion key again after he has finished searching with the first selection key. . Furthermore, if the operator does not know which conversion key to select, he or she may operate to expand the search range sequentially from general to special keys.

上記各変換キ、−による単語辞書内の探索において、こ
れを例えばいわゆる最長一致法による探索と、総当り法
による探索との２段構えにしても良い。これにより、最
初の変換候補出力までの所要時間を更に短縮することが
できる。In the search in the word dictionary using each of the above conversion keys, -, this may be performed in two stages, for example, a search using a so-called longest match method and a search using a brute force method. Thereby, the time required to output the first conversion candidate can be further shortened.

変換キーとしては上記実施例に示した。他にも、更に細
分された属性の変換キーを設けることができるのは言う
までもない。The conversion key is shown in the above embodiment. It goes without saying that conversion keys for further subdivided attributes can also be provided.

以上述べた如く、本発明によれば、入力文を、自立語を
中心とする分かち書き単位に分解して仮名文字で人力し
、これに対応する漢字カナ混じり文を逐次得るカナ漢字
変換処理方式において、入力文節の属性に応じて異なっ
た変換方法を指定可能な複数の変換キーを設けて、オペ
レータの最初の変換キー操作により出力表示された変換
候補の中にオペレータの意図する変換文字列がないとき
、他の属性を有する変換キーを操作することにより、入
力文字列を最初とは興なる変換方法で再変換するように
したので、変換率が向上するばかりでなく、オペレータ
が操作を誤った場合にも簡単な操作で正しい変換操作に
切換えることが可能となり、実月性の高いカナ漢字変換
処理方式を実現できるという顯着な効果を春するもので
ある。As described above, according to the present invention, in the kana-kanji conversion processing method, an input sentence is broken down into division units centered on independent words, manually inputted using kana characters, and corresponding sentences containing kanji and kana are sequentially obtained. , multiple conversion keys that can specify different conversion methods depending on the attributes of the input clause are provided, and the conversion string intended by the operator is not found among the conversion candidates output and displayed by the operator's first conversion key operation. By operating a conversion key with other attributes, the input character string is reconverted using a conversion method different from the original one, which not only improves the conversion rate but also prevents the operator from making a mistake in the operation. Even in the case of a certain situation, it is possible to switch to the correct conversion operation with a simple operation, and this has the advantageous effect of realizing a highly effective kana-kanji conversion processing method.

[Brief explanation of the drawing]

第１図は本発明の一実施例であるカナ漢字変換処理のブ
ロック図、第２図は単語抽出部の詳細を示す図、第３図
は処理の７０−チャートである。ｌ：入力前処理部、２：単語抽出部、２ム：文節単位仮
名文字列記憶部、２Ｂ＝単語抽出記憶部、２０二候補単
語記憶部、２Ｄ＝辞書検索記憶部、２に：変換辞書記憶
部、３：同音語判別部、４二出力制御部、５：変換制御
部。特許出願人　　株式金社　リ　コ　一代　理　人　　　弁理士　礪　　村　　雅　　−、セ　
１FIG. 1 is a block diagram of a kana-kanji conversion process according to an embodiment of the present invention, FIG. 2 is a diagram showing details of a word extraction section, and FIG. 3 is a 70-chart of the process. 1: Input preprocessing unit, 2: Word extraction unit, 2M: Clause unit kana character string storage unit, 2B = Word extraction storage unit, 20 candidate word storage unit, 2D = Dictionary search storage unit, 2: Conversion dictionary Storage unit, 3: Homophone discrimination unit, 42 output control unit, 5: Conversion control unit. Patent applicant Riko Kinsha Co., Ltd. Patent attorney Miyabi Tsubomura -, SE
1

Claims

[Claims]

In the kana-kanji conversion processing method, the input sentence is broken down into separated units centered on independent words and input as kana characters, and the corresponding sentences containing kanji and kana are sequentially obtained. Different conversions are performed depending on the attributes of the input clause. By setting up multiple conversion keys that can specify methods, when the conversion string that the operator intends is not found among the conversion candidates output and displayed by the operator's first conversion key operation, a conversion key with other attributes can be used. A kana-kanji conversion processing method characterized by re-converting an input character string using a conversion method different from the original one.