JPH01156870A

JPH01156870A - Retrieving system for translation equivalent

Info

Publication number: JPH01156870A
Application number: JP62316735A
Authority: JP
Inventors: Yoshio Nakao; 由雄仲尾
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1987-12-15
Filing date: 1987-12-15
Publication date: 1989-06-20

Abstract

PURPOSE:To automatically acquire a translation equivalent or synonym corresponding to an input word by copying the state of a node point of a primary network on a node point of a secondary network for decision of the initial state of the secondary network and extracting the translation equivalent out of the secondary network. CONSTITUTION:A secondary network generating means 330 extracts the translation equivalent and its related words to a due word of a primary network 371 by reference to a word dictionary 220 and a meaning relation dictionary 400. Then a secondary network 372 where the relation is defined as an arc with the words set at the side of the extracted translation equivalent as node points is produced by connecting the node points set at the side of said translation equivalent via a related arc. A secondary network state setting means 340 copies the state of the node point corresponding on the network 371 on a node point of the network 372. Thus the initial state of the network 372 is decided. Then the states of the peripheral node points are set or corrected with the node point whose state is set defined as a starting point. In such a way, a translation equivalent or a synonym corresponding to the input word is automatically obtained.

Description

【発明の詳細な説明】〔概要〕入力単語に対応する訳語および類義語を出力できる対訳
語検索システムに関し、入力単語に対応する訳語又は類義語の獲得を自動的に行
い得るようにすることを目的とし、入力単語に関係する
単語を求め、求めた原語側′の単語を節点とし関係をア
ークとする１次ネットワークを生成し、１次ネットワー
クの節点に状態を設定し、１次ネットワーク上の単語に
対応する訳語および関係する訳語の単語を求め、求めた
訳語側の単語を節点とし関係をアークとする２次ネ、ッ
トワークを生成し、２次ネットワーク上の節点に対応す
る１次ネットワークの節点の状態を２次ネットワークの
節点にコピーして２次ネットワークの初期状態を決定し
、２次ネットワークの節点の状態やアークの値に基づい
て２次ネットワークの節点の状態を決定し、状態が決定
された２次ネットワークから訳語を抽出することを構成
要件としている。また、状態が決定された１次ネットワ
ークから類義語を抽出することをも構成要件としている
。[Detailed Description of the Invention] [Summary] The present invention relates to a bilingual word search system that can output translations and synonyms corresponding to input words, and an object thereof is to automatically acquire translations or synonyms corresponding to input words. , find words related to the input word, generate a first-order network with the found words on the source language side as nodes and relationships as arcs, set states at the nodes of the first-order network, and Find the corresponding translated words and related translated words, generate a secondary network with the found translated words as nodes and the relationships as arcs, and connect the nodes of the primary network corresponding to the nodes on the secondary network. The initial state of the secondary network is determined by copying the state to the nodes of the secondary network, and the state of the nodes of the secondary network is determined based on the state of the nodes of the secondary network and the value of the arc, and the state is determined. The component is to extract translated words from the secondary network. In addition, a constituent requirement is to extract synonyms from the primary network whose state has been determined.

[Industrial application field]

本発明は、入力単語に対応する訳語および類義語を自動
的に獲得できるようになった対訳語検索システムの改良
に関するものでる。The present invention relates to an improved bilingual word search system that can automatically obtain translations and synonyms corresponding to input words.

[Conventional technology]

機械翻訳には様々な機能があるが、その主要なものの一
つに対訳語の検索機能がある。対訳語検索には、単に与
えられた被翻訳言語の単語に対応する訳語を日英辞書の
ように示すだけでなく、その単語の文脈に応じて、適切
な訳語を選び出す機能が必要である。ここで言う文脈に
は、文構造に関わる文法的な要素と、単語間の相性に近
い意味的な要素があり、その取り扱い方には色々な方式
第１１図に示すように、代表的な方式は、大きく分ける
と、単語ダイレクト方式と、中間表現を使う方式の二つ
がある。単語ダイレクト方式とは、被翻訳語と訳語のペ
アを個別に直接的に定義し、訳文生成を行いつつ適正単
語の選択を行う方式である。中間表現を用いる方式とは
、原文を意味・文法解析により概念を表す中間表現に変
換し、中間表現に合った訳語を選択する方式である。Machine translation has various functions, but one of the main ones is a bilingual word search function. Bilingual word search requires a function that not only shows the translation of a given word in the target language, as in a Japanese-English dictionary, but also selects an appropriate translation according to the context of the word. The context referred to here includes grammatical elements related to sentence structure and semantic elements similar to the compatibility between words.There are various ways to handle this, as shown in Figure 11. Broadly speaking, there are two methods: word direct method and method using intermediate representation. The word direct method is a method in which pairs of translated words and target words are individually and directly defined, and appropriate words are selected while generating translated sentences. The method using intermediate representations is a method in which the original text is converted into an intermediate representation representing a concept through semantic and grammatical analysis, and a translated word that matches the intermediate representation is selected.

中間表現に変換する方式には、中間表現の取り方により
、トランスファ方式と、ピボット方式の代表的な二つの
方式に分類される。トランスファ方式とは、各言語に対
応して固有の中間表現を設定し、被翻訳言語の中間表現
を翻訳言語の中間表現に変換（トランスファ）した後、
訳語を取り出すものである。ピボット方式とは、全ての
言語に共通な中間表現を設定し、それを通じて訳語を選
び出すものである。両方式とも、中間表現としては、生
成文法論などの言語学知識とフレーム理論などの知識工
学的な方法とにより、文構造を体系化したものが良く用
いられている。Methods for converting into intermediate representations are classified into two representative methods: transfer methods and pivot methods, depending on how the intermediate representation is taken. The transfer method involves setting a unique intermediate representation for each language, converting (transferring) the intermediate representation of the target language to the intermediate representation of the translation language, and then
It extracts the translated words. The pivot method sets an intermediate expression that is common to all languages, and uses this to select translated words. In both methods, intermediate representations that systematize sentence structures using linguistic knowledge such as generative grammar theory and knowledge engineering methods such as frame theory are often used.

第１２図は「飛行機がとぶ」における各方式の動作例を
示す図である。同図に示すように、「飛行機」に対して
“ｐｌａｎｅ”の訳語を定義し、「とぶ」に対して“ｆ
ｌｙ”、“ｊｕｍｐ”の訳語を定義し、各単語に定義さ
れている言語情報および文構造から、“ｐｌａｎｅ”と
“ｎｙ”を選び出すのが、単語ダイレクト方式である。FIG. 12 is a diagram showing an example of the operation of each method in "Airplane Flying". As shown in the figure, the translation of “plane” is defined for “airplane”, and the translation of “f” is defined for “flying”.
In the word direct method, translations of "ly" and "jump" are defined, and "plane" and "ny" are selected from the linguistic information and sentence structure defined for each word.

「飛行機」や”ｐｌａｎｅ”に対応する物体概念の中間
表現Ａと、「空を飛ぶ」と言う概念の中間表現Ｂとを用
意し、原文を「ＡがＢする」と言った形の中間表現に変
換し、この中間表現から“ｐｌａｎｅ”と“ｆｌｙ”を
取り出すのが中間表現を用いる方式である。Prepare an intermediate representation A of the object concept corresponding to "airplane" or "plane" and intermediate representation B of the concept "flying in the sky", and create an intermediate representation of the original sentence as "A does B". A method using an intermediate representation is to convert "plane" and "fly" into "plane" and "fly" from this intermediate representation.

[Problem that the invention seeks to solve]

以上の方式に共通した課題として翻訳に使う言語情報を
如何に設定するかと言うことがある。A common issue with the above methods is how to set the language information used for translation.

単語ダイレクト方式においては、翻訳に必要な情報は個
々の単語に直接与えられるか、または訳文生成のための
規則として与えなければならないため、単語の持つ情報
や文生成の規則が複雑になりがちである。In the word direct method, the information necessary for translation must be given directly to each word or as rules for generating translated sentences, so the information contained in words and the rules for generating sentences tend to be complex. be.

中間表現を用いる方式においては、各言語の単語が個別
に持つ情報、中間表現が持つ情報、変換規則として与え
る情報のように、翻訳に使う情報を各言語固有なものと
、全言語に共通なものとに成る程度分散して与えること
が出来る反面、考え得る全ての概念に対して中間表現を
設定しなければならないと言う困難がある。In methods that use intermediate representations, the information used for translation is divided into information that is unique to each language and information that is common to all languages, such as information that words in each language have individually, information that intermediate expressions have, and information that is provided as conversion rules. While it is possible to provide information in a distributed manner to the extent that it becomes a reality, there is the difficulty of having to set up intermediate representations for all possible concepts.

言語情報設定の難しさ・煩雑さが顕著に現れるのが、原
文に未知語が存在したときの処理である。The difficulty and complexity of setting linguistic information becomes apparent when processing unknown words in the original text.

単語ダイレクト方式においては、未知語に対して新たに
訳語を設定し、複雑な言語情報や規則を与えることが必
要である。中間表現を用いる方式においては、既存の概
念（中間表現）に嵌め込むか、未知語（概念）に合わせ
た中間表現を設定しなくてはならない。両者とも、この
作業を人手によらず自動的に行うことは、かなり困難で
ある。特に後者において、中間表現が深い言語解析に基
づいた複雑なものである程、中間表現の設定は難しくな
る。In the word direct method, it is necessary to set new translations for unknown words and provide complex linguistic information and rules. In a method that uses intermediate representations, it is necessary to either fit it into an existing concept (intermediate representation) or set an intermediate representation that matches the unknown word (concept). In both cases, it is quite difficult to perform this work automatically without manual intervention. Particularly in the latter case, the more complex the intermediate representation is based on deep language analysis, the more difficult it becomes to set the intermediate representation.

殆ど無限に近い情報を扱う汎用の翻訳機において、翻訳
の質に関わる言語情報（意味１文法等）の獲得は重要な
問題であるが、この作業の大部分は人手によらざるを得
ない現状である。翻訳の質を向上し、システムを効率化
するために、言語情報獲得の自動化を推進し、獲得効率
を向上させることは、現在の大きな課題である。Acquiring linguistic information (semantics, grammar, etc.) related to the quality of translation is an important issue for general-purpose translation machines that handle almost infinite information, but the majority of this work currently has to be done manually. It is. In order to improve the quality of translation and make the system more efficient, promoting automation of linguistic information acquisition and improving acquisition efficiency are current major challenges.

本発明は、この点に鑑みて創作されたものであって、入
力単語に対応する訳語または類義語を自動的に効率よく
獲得できるようになった対訳語検索システムを一提供す
ることを目的としている。The present invention was created in view of this point, and aims to provide a bilingual word search system that can automatically and efficiently acquire translations or synonyms corresponding to input words. .

[Means for solving problems]

第１図は本発明の原理図である。本発明の対訳語検索シ
ステムは、入力言語を入力する入力部１００と、原語側
の単語辞書２１０と、訳語側の単語辞書２２０と、意味
関係辞書４００と、入力言語の意味連想ネットワークで
ある所の１次ネットワーク３７１を生成する１次ネット
ワーク生成手段３１０と、１次ネットワーク３７１の状
態設定を行う１次ネットワーク状態設定手段３２０と、
翻訳言語の意味連想ネットワークである所の２次ネット
ワーク３７２を生成する２次ネットワーク生成手段３３
０と、２次ネットワーク３７２の状態設定を行う２次ネ
ットワーク状態設定手段３４０と、状態設定の終了した
２次ネットワーク３７２から訳語を抽出する訳語抽出手
段３５０と、状態設定の終了した１次ネットワーク３７
１から類義語を抽出する類義語抽出手段３６０と、訳語
抽出手段３５０によって抽出された訳語又は類義語抽出
手段３６０によって抽出された類義語を出力する出力部
５００とを具備している。FIG. 1 is a diagram showing the principle of the present invention. The bilingual word search system of the present invention includes an input unit 100 for inputting an input language, a word dictionary 210 on the source language side, a word dictionary 220 on the translation side, a semantic relation dictionary 400, and a meaning association network of the input language. a primary network generating means 310 for generating a primary network 371; a primary network state setting means 320 for setting the state of the primary network 371;
Secondary network generation means 33 for generating a secondary network 372 which is a meaning association network of the translated language
0, a secondary network status setting means 340 that sets the status of the secondary network 372, a translated word extracting unit 350 that extracts a translated word from the secondary network 372 whose status has been set, and a primary network 37 whose status has been set.
1, and an output unit 500 that outputs the translated words extracted by the translated word extracting means 350 or the similar synonyms extracted by the similar word extracting means 360.

原語側の単語辞書２１０は、単語の見出しと、意味関係
辞書（４００）を検索するための情報と、訳語辞書（２
２０）を検索するための情報と、文法情報とを有してい
る。訳語側の単語辞書２２０は、単語の見出しと、意味
関係辞書（４００）を検索するための情報と、文法情報
とを有している。意味関係辞書４００は、単語と単語の
意味関係を示す情報を有している。The word dictionary 210 on the source language side contains word headings, information for searching the semantic relationship dictionary (400), and the translation dictionary (210).
20) and grammatical information. The word dictionary 220 on the translation side includes word headings, information for searching the semantic relationship dictionary (400), and grammatical information. The semantic relationship dictionary 400 has information indicating the semantic relationship between words.

１次ネットワーク生成手段３１０は、入力言語に関係す
る単語を単語辞書２１０と意味関係辞書４００とを参照
して抽出し、抽出された原語側の単語を節点とし関係を
アークとする１次ネットワーク３７１を、節点を関係に
応じたアークで結んで行くことにより生成する。The primary network generation means 310 extracts words related to the input language by referring to the word dictionary 210 and the semantic relationship dictionary 400, and creates a primary network 371 in which the extracted words in the original language are nodes and relationships are arcs. is generated by connecting the nodes with arcs according to the relationships.

１次ネットワーク状態設定手段３２０は、１次ネットワ
ーク３７１上の入力単語の節点に初期状態をセットし、
入力単語の節点に連なるアーク及び当該節点の状態に応
じて隣接節点の状態を設定し、次いで、状態が設定され
た節点を起点として、同様な処理を行うことにより、周
囲の節点の状態を設定もしくは修正する。The primary network state setting means 320 sets an initial state to the node of the input word on the primary network 371,
The state of adjacent nodes is set according to the arc connected to the node of the input word and the state of that node, and then the state of surrounding nodes is set by performing the same process starting from the node where the state has been set. Or fix it.

２次ネットワーク生成手段３３０は、１次ネットワーク
３７１上の単語に対応する訳語および当該訳語に関係す
る単語を単語辞書２２０と意味関係辞書４００を参照し
て抽出し、抽出された訳語側の単語を節点とし関係をア
ークとする２次ネットワーク３７２を、訳語側の節点を
関係に応じたアークで結んで行（ことにより生成する。The secondary network generation means 330 extracts the translated words corresponding to the words on the primary network 371 and words related to the translated words by referring to the word dictionary 220 and the semantic relationship dictionary 400, and extracts the extracted translated words. A quadratic network 372 with nodes and relationships as arcs is generated by connecting nodes on the translation side with arcs corresponding to the relationships.

２次ネットワーク状態設定手段３４０は、２次ネットワ
ークの節点に、１次ネットワーク上の対応する節点の状
態をコピーして２次ネットワークの初期状態を決定し、
次いで、状態が設定されている節点を起点として、周囲
の節点の状態を設定もしくは修正する。The secondary network state setting means 340 determines the initial state of the secondary network by copying the state of the corresponding node on the primary network to the node of the secondary network,
Next, starting from the node whose state has been set, the states of surrounding nodes are set or modified.

〔Example〕

本発明の対訳語検索システムは、被翻訳言語の単語・単
語対・単語列の入力に対し翻訳言語の単語・単語対・単
語列の候補を取り出す機能と、成る単語と与えられた情
報から類義語の候゛補を出力する機能とを有しており、
これらはコントロール文の入力により制御される。前者
においては、訳語検索のための情報が不十分であっても
候補単語を類推する機能も含まれている。The bilingual word search system of the present invention has a function of extracting candidates for words, word pairs, and word strings in the translation language based on the input of words, word pairs, and word strings in the language to be translated, and synonyms from the words consisting of the words and given information. It has a function to output candidates for
These are controlled by inputting control statements. The former includes a function for guessing candidate words even if there is insufficient information for searching for translations.

両機能を実現する処理において、意味連想ネットワーク
を中間表現として用い、その構造を簡明なものにするこ
とで、言語情報のうち意味的な情報の自動獲得を成る程
度可能とし、それと共に訳語の類推機能を加えることに
より、言語情報の設定が翻訳言語と被翻訳言語で成る程
度まで分割可能となり、言語情報の設定の効率化を推進
している。In the process of realizing both functions, by using a semantic association network as an intermediate representation and simplifying its structure, it is possible to automatically acquire semantic information of linguistic information to a certain extent, and at the same time, it is possible to automatically acquire semantic information among linguistic information. By adding this function, it is now possible to divide language information settings into a translated language and a translated language, promoting more efficient language information settings.

本発明の基礎となっている考え方は、人間の言語の意味
とは、物体的な概念（モノ：以下、物体側と称す）と、
物体の状態・動作・性質を表す概念（以下、様態詞と称
す）の連鎖であると言うものである。すなわち、モノを
表す単語（物体側）の意味とは、その単語を修飾する形
容詞や当該単語に関わり得る（当該単語を主語に取れる
。目的語に取れる。・・・）動詞などと言ったものの集
合のことであると言う考え方である。例えば、「りんご
」と言う単語は「赤い」や「甘い」、「食べる（食べら
れる）」、「切る」、「へた」、「皮」等々の単語と共
起し得る（同一文中で成る纏まりとして存在し得る）と
言う「意味」を持つと考えるわけである。The idea underlying the present invention is that the meaning of human language is defined by the concept of an object (hereinafter referred to as the object side).
It is said to be a chain of concepts (hereinafter referred to as modal words) that express the state, action, and properties of an object. In other words, the meaning of a word that represents a thing (object side) is an adjective that modifies that word, a verb that can be related to the word (the word can be the subject, it can be the object, etc.), etc. The idea is that it is a collection. For example, the word ``apple'' can co-occur with words such as ``red'', ``sweet'', ``eatable'', ``cut'', ``smelt'', ``skin'', etc. (in the same sentence). In other words, they are considered to have a ``meaning'' that means they can exist as a group.

この考え方は、例えば「具体物」に対し１色」。This idea is based on, for example, "one color for each concrete object."

「味」、・・・等と言った属性（スロット）を予め設定
し、「りんご」に対し「色」が「赤」で「味」が「甘い
」・・・等と言った値を持つデータを対応させるフレー
ム理論的な考え方に比べ、取り扱うデータ（概念構造）
が内部構造を持たなくなると言う点で単純な中間表現を
生成できるものである。Attributes (slots) such as "taste", etc. are set in advance, and for "apple", the "color" is "red", the "taste" is "sweet", etc., and the values are set in advance. Data handled (conceptual structure) compared to the frame theory approach of matching data.
It is possible to generate a simple intermediate representation in that it has no internal structure.

第２図はシステムの構成例を示す図である。同図におい
て、１００は入力部、２１０は原語側の単語辞書、２２
０は訳語側の単語辞書、３００はプロセッサ、３７１は
１次ネットワーク、３７２は２次ネットワーク、４００
は意味関係辞書、５００は出力部をそれぞれ示す。FIG. 2 is a diagram showing an example of the system configuration. In the figure, 100 is an input section, 210 is a word dictionary on the original language side, and 22
0 is a word dictionary on the translation side, 300 is a processor, 371 is a primary network, 372 is a secondary network, 400
5 indicates a semantic relational dictionary, and 500 indicates an output unit.

本発明の対訳語検索システムは、原語側の単語辞書２１
０と、訳語側の単語辞書２２０と、単語（概念と同義）
間の関係を記述した意味関係辞書４００と、辞書を基に
各言語の意味連想ネットワークを生成し訳語や類義語の
抽出（検出）を行うプロセッサ３００と、入力部１００
と、出力部５００とを具備している。゛原語側の単語辞書２１０は、単語（見出し）と、それに
対応する意味関係辞書４００や訳語辞書２２０を検索す
る情報と、文法情報とを有している。訳語側の単語辞書
２２０は、単語（見出し）と、それに対応する意味関係
辞書（４００）を検索する情報と、文法情報とを有して
いる。The bilingual word search system of the present invention has a word dictionary 21 on the source language side.
0, the word dictionary 220 on the translation side, and the word (synonymous with the concept)
a semantic relationship dictionary 400 that describes the relationships between the two; a processor 300 that generates a semantic association network for each language based on the dictionary and extracts (detects) translations and synonyms; and an input unit 100.
and an output section 500. The word dictionary 210 on the source language side includes words (headings), information for searching the corresponding meaning-related dictionary 400 and translation dictionary 220, and grammatical information. The word dictionary 220 on the translation side includes words (headings), information for searching the semantic relationship dictionary (400) corresponding to the words, and grammatical information.

意味関係辞書４００は、同義・類義・反答・反類義（反
答の類義）・上位（概念）・下位（概念）・共起可・共
起不可などの情報が有している。なお、類義・反類義・
共起可・共起不可は、その度合に応じたレベル付けもな
されている。同義は類義の度合が最も強いものであり、
反答は反類義の度合が最も強いものであるとして考えら
れている。The semantic relationship dictionary 400 has information such as synonyms, similar meanings, counter-answers, anti-synonyms (synonyms of counter-answers), superior (concept), subordinate (concept), possible co-occurrence, non-co-occurrence, etc. . In addition, similar meanings, anti-synonyms,
Possible co-occurrence and non-co-occurrence are also graded according to the degree of co-occurrence. Synonyms have the strongest degree of similarity;
The counter-answer is considered to be the one with the strongest degree of anti-synonymy.

意味関係辞書４００とは、自然言語の単語の持つ概念に
ついて、概念と概念の関係を記述した辞書である。記述
内容としては、 ■　人間は動物の下位概念である（人間と動物に含まれ
る）といった概念の上位下位関係。The semantic relationship dictionary 400 is a dictionary that describes the relationship between concepts regarding the concepts of natural language words. The content of the description is as follows: ■ A superior-subordinate relationship between concepts such as humans being a subordinate concept of animals (included in humans and animals).

■　「走る」と「駆ける」は類義であるといった同義・
類義関係。■ Synonyms, such as "running" and "running" are synonyms.
Similarity relationship.

■　「暑い」と「寒い」は反答・反類義関係。■ “Hot” and “cold” have an anti-answer/anti-synonym relationship.

■　「寒い」と「冬」は「涼しい」と「冬」に比べて文
中で共起しやすいといった共起度。■ The degree of co-occurrence, such that "cold" and "winter" are more likely to co-occur in a sentence than "cool" and "winter".

などがある。and so on.

記述形式としては、複数の概念とその間に成立している
関係およびその関係の確かさ（重み）を“「概念」２つ
と「関係」と「重み」”のセットを基本として表現した
ものを用いており、概念と関係の双方で検索可能なもの
となっている。また、日本語・英語といった言語の種類
に依存しない関係と、各言語により異なる関係は分割し
て格納され、共起度についても、対象分野に応じてチュ
ーニングされ、必要に応じて容易に修正が可能なように
分割して格納されている。The description format is one that expresses multiple concepts, the relationships established between them, and the certainty (weight) of those relationships based on a set of "two concepts, a relationship, and a weight." It is possible to search for both concepts and relationships.In addition, relationships that do not depend on the type of language such as Japanese and English, and relationships that differ depending on each language are stored separately, and the degree of co-occurrence is calculated. It is also tuned according to the target field and is stored in sections so that it can be easily modified as needed.

検索は、各単語辞書中の単語に記述されている概念マー
カによる検索を基本とし、通常は単語辞書と組あわせて
使用される。Searches are based on concept markers written in words in each word dictionary, and are usually used in combination with word dictionaries.

反答・反類義については、訳語の尤もらしさを計算する
際に原稿中の同義・類義と略ぼ同じように用いられる（
重みが負の同義・類義語として扱うことで、共起情報か
ら訳語の確かさを計算することができる）。ただし、同
義・類義と異なり、反答・反類義とされた単語を訳語の
候補とすることはない。Counter-answers and counter-synonyms are used in almost the same way as synonyms and similar meanings in the manuscript when calculating the likelihood of translated words (
By treating them as synonyms/synonyms with negative weights, it is possible to calculate the certainty of translations from co-occurrence information). However, unlike synonyms and similar meanings, words that are considered counter-answers and anti-synonyms are not candidates for translation.

上位・下位は、（反）同義・類義関係や共起関係が存在
しないときや不足している場合に、類義と同じような方
式で用いられる。例えば、訳語が存在しないとき、もと
の単語の上位・下位とされている単語の訳語が検索され
ることもある。Superordinate and subordinate are used in the same way as synonyms when (anti)synonymous/similar synonymous relations or co-occurrence relations do not exist or are lacking. For example, when a translated word does not exist, a translated word of a word that is higher or lower than the original word may be searched.

第３図は処理の概要を示す図である。これらの処理はプ
ロセッサ３００によって行われる。処理の概要は、次の
通りである。但し、被翻訳言語の意味連想ネットワーク
（１次ネットワーク）の生成、１次ネットワークの状態
設定、翻訳言語の意味゛連想ネットワーク（２次ネット
ワーク）の生成、２次ネットワークの状態設定は、略ぼ
一括して行われる。なお、１次ネットワークの生成→１
次ネットワークの状態設定→２次ネットワークの生成→
２次ネットワークの状態設定→訳語の抽出と進むのが訳
語検索処理であり、１次ネットワークの生成→１次ネッ
トワークの状態設定→類義語の抽出と進むのが類義語検
索の処理である。FIG. 3 is a diagram showing an outline of the process. These processes are performed by the processor 300. The outline of the process is as follows. However, generation of the meaning association network (primary network) of the translated language, setting of the state of the primary network, generation of the meaning association network (secondary network) of the translated language, and setting of the state of the secondary network are almost done all at once. It will be done as follows. In addition, generation of the primary network → 1
Next Network status settings → Secondary network generation →
Translation search processing proceeds as follows: secondary network state setting -> translation extraction, and synonym search processing proceeds as primary network generation -> primary network state setting -> synonym extraction.

被翻訳言語側の意味連想ネットワークの生成について説
明する。被翻訳言語側の意味連想ネットワークは１次ネ
ットワークと呼ばれる。被翻訳言語の単語（対又は列）
が入力されると、それに関係する（入力文字列と同表記
のもの全てと、それらと同義・類義・反答・反類義・共
起可・共起不可・上位・下位などの）単語を、単語辞書
２１０および意味関係辞書４００を用いて抽出し、関係
に応じたアークで単語（節点）を結んでいくことで、１
次ネットワークを生成する。The generation of a semantic association network on the translated language side will be explained. The meaning association network on the side of the translated language is called a primary network. Words (pairs or sequences) in the target language
When input, words related to it (all the same notations as the input string, synonyms, synonyms, counter answers, anti-synonyms, possible co-occurrence, non-co-occurrence, superiors, inferiors, etc.) are displayed. is extracted using the word dictionary 210 and the semantic relationship dictionary 400, and by connecting the words (nodes) with arcs according to the relationships, 1
Generate the following network.

１次ネットワークの状態設定について説明する。The state setting of the primary network will be explained.

１次ネットワーク上の入力文字列に対応する単語（節点
）に初期の状態をセットし、その節点に連なるアークと
節点の状態に応じたシグナルをアークを通して隣接する
節点に送ることで、隣接する節点の状態を設定する。そ
の後、状態の設定された節点を起点とし、同様な操作に
より、周囲の節点の状態を設定・修正していくことで、
−次ネットワークの全ての節点に状態を設定する。ここ
で言う「状態」とは、ネットワーク（グラフ）のつなが
り方を示す値のセットのことで、次の方針で各節点に与
えられる ■　入力文字列に対応する単語（節点）が最も状態が高
い。By setting the initial state of the word (node) corresponding to the input character string on the primary network, and sending a signal according to the arc connected to that node and the state of the node to the adjacent node through the arc, the adjacent node Set the state of . After that, by using the node whose state has been set as the starting point and performing similar operations to set and modify the state of the surrounding nodes,
− Set the state to all nodes of the next network. The "state" here refers to a set of values that indicate how the network (graph) is connected, and is given to each node according to the following policy: ■ The word (node) corresponding to the input string has the highest state. .

■　入力文字列に対応する単語に似た単語はど状態が高
い。■ Words similar to the word corresponding to the input string have a high status.

■　入力文字列に対応する単語に共起する度合の高い単
語はど状態が高い。■ Words that have a high degree of co-occurrence with words corresponding to the input character string have a high degree of co-occurrence.

■　入力文字列に対応する単語の共起の仕方と類似した
共起の仕方をする単語の状態は高くなる。■ Words that co-occur in a similar manner to the co-occurrence of words corresponding to the input character string have a higher status.

翻訳言語側の意味連想ネットワークの生成について説明
する。１次ネットワークの生成および１次ネットワーク
の状態設定の処理で作られた一次連想ネットワーク上の
単語に対応する訳語およびその訳語に関係のある単語を
、単語辞書２２０および意味関係辞書４００を用いて抽
出し、１次ネットワークの生成と同様な方法で、翻訳言
語側の意味連想ネットワークを生成する。翻訳言語側の
意味連想ネットワークを２次ネットワークと言う。The generation of a semantic association network on the translation language side will be explained. Translations corresponding to words on the primary association network created by the processing of primary network generation and primary network state setting processing and words related to the translations are extracted using the word dictionary 220 and the semantic relationship dictionary 400. Then, a meaning association network on the translation language side is generated in the same way as the first-order network is generated. The meaning association network on the translation language side is called a secondary network.

２次ネットワークの状態の設定について説明する。２次
ネットワーク上の単語（節点）で、２次ネットワーク上
の単語に対応している（訳語として定義されている）も
のの状態を１次ネットワークからコピーし、２次ネット
ワークの初期状態を決定する。次に状態が設定されてい
る節点を起点とし、１次ネットワークの状態設定と同様
な作業を行うことで、２次ネットワーク上の状態を決定
する。Setting the state of the secondary network will be explained. The state of a word (node) on the secondary network that corresponds to a word on the secondary network (defined as a translated word) is copied from the primary network to determine the initial state of the secondary network. Next, starting from the node where the state has been set, the state on the secondary network is determined by performing the same work as setting the state of the primary network.

訳語の抽出について説明する。２次ネットワーク上の状
態を基に、２次ネットワークから単語を抽出する。即ち
、状態の高い単語、単語対、単語列を選ぶ。Explaining the extraction of translated words. Words are extracted from the secondary network based on the state on the secondary network. That is, words, word pairs, and word strings with high status are selected.

類義語の抽出について説明する。１次ネットワーク上の
状態を基に、１次ネットワーク上の単語を抽出する。Extracting synonyms will be explained. Words on the primary network are extracted based on the state on the primary network.

本発明を理解し易くするために、簡略化した処理の仕方
を示す。ここで、関係は同義・類義・共起強・共起間の
四種にとどめる。各状態は、意味ネットワークの状態の
設定のため成る値を持つが、ここでは第４図のように設
定しておく。アークは１、アークの種別とアークの値、
またはアークの値を持つ。節点は、物体側か様態間かの
別、類義度、共起度、推定尤度を状態として持つ。類義
度とは入力単語（入力文字列と同表記の単語）に似てい
る度合を示す値であり、共起度とは入力単語（入力文字
列と同表記の単語）と共起する度合を示す値であり、推
定尤度とは節点のつながり具合がどのくらい入力単語に
近いかを示す値である。反答・反類義・反共起について
は、負の同義・類義・共起を考えることで、ここでの説
明が使用できる。In order to make it easier to understand the present invention, a simplified method of processing will be shown. Here, the relationships are limited to four types: synonymous, similar, strong co-occurrence, and co-occurrence. Each state has a value for setting the state of the semantic network, and here the values are set as shown in FIG. Arc is 1, arc type and arc value,
or have a value of arc. Nodes have states such as whether they are on the object side or between modalities, degree of similarity, degree of co-occurrence, and estimated likelihood. The degree of similarity is a value that indicates the degree of resemblance to an input word (a word with the same notation as the input string), and the degree of co-occurrence is the degree to which it co-occurs with the input word (a word with the same notation as the input string). The estimated likelihood is a value that indicates how close the connection of nodes is to the input word. Regarding counter-answers, anti-synonyms, and anti-co-occurrences, the explanation here can be used by considering negative synonyms, synonyms, and co-occurrences.

なお、以下に示す処理は、物体側と様態間が一つずつペ
アで与えられ、対応する訳語対を出力する場合を中心に
説明したものである。また、第４図には図示しないが、
入力文字列と同表記の単語は、同義と扱い方を同じであ
るが、区別のために−の記号を用いる。Note that the processing described below is mainly described for the case where the object side and the aspect are given in pairs, and corresponding translated word pairs are output. Also, although not shown in Figure 4,
Words with the same notation as the input character string are treated in the same way as synonyms, but a - symbol is used to distinguish them.

ネットワークは、単語に対応する節点と、関係に対応す
るアークとから成り、各節点は物体側か様態間かの別、
類義度、共起度、推定光度を状態として持ち、各アーク
も第４図に示すような値を持つ。The network consists of nodes corresponding to words and arcs corresponding to relationships.
It has states of similarity, co-occurrence, and estimated luminosity, and each arc also has values as shown in Figure 4.

ネットワークの生成法および状態設定について説明する
。先ず、同表記語・類義語（同義語を含む）の抽出を説
明する。辞書を使用して、類義語を類義度を計算しなが
ら類義アークで結んでゆく。The network generation method and state settings will be explained. First, extraction of the same notation and synonyms (including synonyms) will be explained. Use a dictionary to calculate the degree of synonymy between synonyms and connect them with synonymous arcs.

この例では、同表記語を１とし、類義アークを一つ越す
度に類義アークの持つ値を掛算するものとすれば良い。In this example, the same notation word is set to 1, and each time a similar meaning arc is exceeded, the value of the similar meaning arc is multiplied.

なお、類義度が複数与えられるときには、その最大値を
取ることとする。この作業の終了は、新たに抽出された
単語の類義度が成る値より小さくなった時に終了する。Note that when multiple degrees of similarity are given, the maximum value is taken. This work ends when the degree of synonymy of the newly extracted word becomes smaller than a certain value.

また、類義アークで結ばれた一つのルートが既存の類義
度より大きい場合に限り、類義度の修正と言う形で作業
が続けられる。第５図は類義語の抽出の例を示す図であ
る。同図において、単語に添えである数字は類義度を示
す。単語Ａは入力単語に値１の類義アークでつながれ、
単語Ｂは値０．５の類義アークで単語Ａにつながれ、単
語Ｃは値０．５の類義アークで入力単語につながれ、単
語Ｂと単語Ｃは値０．５の類義アークでつながれている
。図示の例では、ルートａに沿って類義度を設定すると
、Ａの類義度は１．０、Ｂの類義度は０．５、Ｃの類義
度は０．２５になる。また、ルートｂに沿って類義度を
設定すると、への類義度は０．１２５、Ｂの類義度は０
゜２５、Ｃの類義度は０．５になる。この場合、最大値
を取ると、図示のような類義度になる。ルートａでＢま
で類義度をセットした状態の下で、ルートｂに沿って単
語を抽出して行くと、Ｂに与える類義度０．２５がＢの
持つ類義度０．５より小さいので、このルートの処理を
終了する。Furthermore, only when one route connected by a synonymous arc is greater than the existing degree of synonymy, work continues in the form of modifying the degree of synonymy. FIG. 5 is a diagram showing an example of similar synonym extraction. In the figure, the numbers attached to words indicate the degree of synonymy. Word A is connected to the input word by a synonymous arc with value 1,
Word B is connected to word A by a synonymous arc with a value of 0.5, word C is connected to the input word by a synonymous arc with a value of 0.5, and words B and C are connected by a synonymous arc with a value of 0.5. ing. In the illustrated example, if the degree of similarity is set along route a, the degree of similarity for A will be 1.0, the degree of similarity for B will be 0.5, and the degree of similarity for C will be 0.25. Also, if you set the degree of similarity along route b, the degree of similarity to is 0.125, and the degree of similarity to B is 0.
゜25, the degree of similarity of C becomes 0.5. In this case, if the maximum value is taken, the degree of similarity will be as shown in the figure. If you extract words along route b with the synonymy set up to B on route a, the synonymy given to B is 0.25, which is smaller than the synonymy 0.5 that B has. Therefore, we finish processing this route.

共起語の抽出について説明する。訳語検索の場合には行
わないが、類義語検索の場合には、入力単語に対して成
る値より大きい共起度を持ち、コントロール文の条件に
合った単語を抽出し、コントロール文により指定された
類義度をセットする。The extraction of co-occurring words will be explained. Although this is not done in the case of a translation search, in the case of a synonym search, words that have a degree of co-occurrence greater than the value for the input word and that meet the conditions of the control sentence are extracted, and words specified by the control sentence are extracted. Set similarity.

コントロール文とは、システムの処理に関するオプショ
ンを指定するもので、システムに対し与えるコマンド列
のことである。A control statement specifies options related to system processing, and is a string of commands given to the system.

共起度のセットについて説明する。入力単語または同表
記語から出ている共起アークの先の節点（単語）に、共
起アークの値を共起度としてセットする。複数の値が存
在するときの処理は、その和をとる。値をＯにする等、
コントロール文の指定による。第６図は共起度セットの
例を示す図である。入力単語すは０．５の共起アークで
入力単語ａにつながれているので、入力単語すの共起度
は０．５となる。また、単語Ｂは０．５の共起アークで
入力単語ａにつながれているので、単語Ｂの共起度は０
．５になる。A set of degrees of co-occurrence will be explained. The value of the co-occurrence arc is set as the degree of co-occurrence at the node (word) beyond the co-occurrence arc originating from the input word or the same written word. When there are multiple values, the sum is calculated. Setting the value to O, etc.
Depends on the control statement specification. FIG. 6 is a diagram showing an example of a co-occurrence degree set. Since the input word S is connected to the input word a by a co-occurrence arc of 0.5, the degree of co-occurrence of the input word S is 0.5. Also, since word B is connected to input word a with a co-occurrence arc of 0.5, the degree of co-occurrence of word B is 0.
．． It becomes 5.

推定光度のセットについて説明する。類義度、共起度の
セットされている単語（Ａとする）から共起アークで結
ばれた単語の推定尤度（入力単語の共起状況との類似性
を示す値）を、への類義度・共起度および共起アークの
値に応じてセットする。ここでは、以下の式で与えるこ
ととする。A set of estimated luminosity will be explained. The estimated likelihood (a value indicating the similarity with the co-occurrence situation of the input word) of the words connected by the co-occurrence arc from the words for which the degree of synonymy and the degree of co-occurrence are set (denoted as A) is Set according to the value of degree of similarity, degree of co-occurrence, and co-occurrence arc. Here, it is given by the following formula.

（１，０−ＩＡの共起度−共起アーク値１）＊Ａの類義
度＊Ａの共起度　　　　（１）代筆７図は推定尤度のセ
ットの例を示す図である。(1,0 - Co-occurrence degree of IA - Co-occurrence arc value 1) * Degree of similarity of A * Degree of co-occurrence of A (1) Ghost-written Figure 7 is a diagram showing an example of a set of estimated likelihoods.

Ａは、物体側、１の類義度、１の共起度を持つ。A is on the object side, has a degree of similarity of 1, and a degree of co-occurrence of 1.

Ｂ′はＡに０．５の共起アークでつながれているので、
（１）式によりＢ′に対しては０．５の推定尤度が与え
られる。同様にして、Ａに対しては０．５の推定尤度、
Ａ′に対しては１の推定先度、已に対しては０．２５の
推定尤度が与えられる。Since B' is connected to A with a co-occurrence arc of 0.5,
Equation (1) gives an estimated likelihood of 0.5 to B'. Similarly, for A, the estimated likelihood is 0.5,
A' is given an estimated likelihood of 1, and A' is given an estimated likelihood of 0.25.

本発明の対訳語検索システムは、訳語検索機能と類義語
検索機能とを有しているが、それぞれについて１〜２例
をあげる。The bilingual word search system of the present invention has a translation word search function and a synonym search function, and one or two examples of each will be given below.

第８図は共起情報のない単語の訳語検索を示すものであ
る。FIG. 8 shows a translation search for words without co-occurrence information.

入カニ「戸」、「あく」の単語対主な辞書情報：原語側の情報 ○「あく」は同表記語として、閉まっていたものが開く
意味のものと、「店があく」と言うときのもの、「席が
あく」と言うときのものの三つがある。Main dictionary information on the words ``to'' and ``aku'': Information on the original language ○ ``Aku'' is the same written word, and it means opening something that was closed, and when saying ``the store is open.'' There are three things: ``A seat is available'' and ``A seat is available''.

○「ドア」、「門」は、開く意の「あくｊと強共起する
。○ “Door” and “gate” strongly co-occur with “akuj”, which means to open.

○「あく」は、その意味に応じて“ｏｐｅｎ　””ごｏ
ｐｅｎ２″どｖａｃａｎｔ”に対応している。○ “Aku” can be used as “open” or “go” depending on its meaning.
It corresponds to pen2"dovacant".

○「ドア」、「門」は“ｄｏｏｒ”、”Ｂａｔｅ’″に
対応している。○ "Door" and "gate" correspond to "door" and "Bate'".

訳語側の情報 ○“ｄｏｏｒ”＋”ｇａｔｅ’　　は開く意の”ｏｐｅ
ｎ”　と強共起する。Information on the translation side ○ “door” + “gate” is “ope” meaning to open
Strongly co-occurs with “n”.

○″ｇａ　ｔｅ”は、始まる意のｏｐｅｎ”　と弱共起
する。○"gate" weakly co-occurs with "open" which means to begin.

出カニ第１候補″ｄｏｏｒ″、“ｏｐｅｎ　”　”第２
候補“ｇａｔｅ”、“ｏｐｅｎ　””この例は、共起情
報、訳語情報のない「戸」に対する訳語を同義・類義の
単語「ドア」、「門」の情報を用いて検索したものであ
る。1st candidate "door", 2nd "open"
Candidates “gate”, “open”” In this example, the translation for “door”, which has no co-occurrence information or translation information, is searched using the information of the synonymous/synonymous words “door” and “gate”. .

第９図は訳語情報のない単語の訳語検索の例を示す図で
ある。FIG. 9 is a diagram showing an example of a translation search for a word without translation information.

入カニ「セスナ」という単語主な辞書情報原語側の情報Ｏ「セスナ」と「飛行機」は類義 ○「セスナ」、「飛行機」は「飛ぶ」と強共起○「セス
ナ」は「小型」と弱共起訳語側の情報 ○「飛行機」、「飛ぶ」、「小型」は“ｐｌａｎｅ”、
　’ｆｌｙ″、　”ｓｍａｌｌ”に対応Ｏ“Ｃｅ５ｓｎ
ａ”と”ｐｌａｎｅ”　は類義Ｏ″Ｃｅ５ｓｎａ”　、
“ｐｌａｎｅ″は”ｒｔｙ″と強共起○”Ｃｅ５ｓｎａ
”は”ｓｍａｌｌ″と弱共起出カニ第１候補“ｐｌａｎ
ｅ”″ 第２候補”Ｃｅ５ｓｎａ″ この例は、訳語が与えられていない「セスナ」に対応す
る訳語を検索するものであるが、原語側と訳語側で独立
に設定される共起情報から“Ｃｅ５ｓｎａ″も候補とし
て出力する。この機能は、独立に辞書に登録した単語の
中から対応するペアを捜し出す手助けになる。Main dictionary information for the word ``Cessna'' Source information O ``Cessna'' and ``Airplane'' are synonyms ○ ``Cessna'' and ``Airplane'' are strongly co-occurring with ``fly'' ○ ``Cessna'' is ``small'' Information on the translation word that weakly co-occurs with ○ "plane", "fly", "small" are "plane",
Compatible with 'fly' and 'small'O'Ce5sn
a" and "plane" have similar meanings O"Ce5sna",
“plane” has strong co-occurrence with “rty”○”Ce5sna
” is weakly co-occurring with “small” The first candidate “plan”
e"" Second candidate "Ce5sna" This example searches for a translation word corresponding to "Cessna" for which no translation word has been given. Ce5sna'' is also output as a candidate. This feature helps you find matching pairs among independently registered words in the dictionary.

第１０図は類義語検索の例を示す図である。FIG. 10 is a diagram showing an example of synonym search.

入カニ「太極拳」　（！ｉ義語を検索する単語）および
「空手」、「合気道」、「剣道」　（類義語を検索する
範囲）主な辞書情報Ｏ「太極拳」は「打つ」、「蹴る」と強共起Ｏ「空手」
は「打つ」、「蹴る」と強共起○「合気道」は「打つ」
、「投げる」と強共起Ｏ「剣道」は「打つ」と強共起出カニ第１候補「空手」第２候補「合気道」第３候補「剣道」この例は、類義語の検索範囲の中の単語に、その共起の
仕方から類似性を推測し、類義語に順位をつけるものあ
る。この他、ネットワークが大きくなるので省略するが
、類義語検索の範囲を限定せずに共起情報の中から類義
語を検索する機能もある。Entering crab "Taijiquan" (! i Word to search for synonyms) and "Karate", "Aikido", "Kendo" (range to search for synonyms) Main dictionary information O "Taijiquan" means "to hit", ""Kicking" and strong co-occurrence O "Karate"
is strongly co-occurring with "hitting" and "kicking" ○ "Aikido" is "hitting"
, "Throw" and strong co-occurrence O "Kendo" strongly co-occur with "Strike" First candidate "Karate" Second candidate "Aikido" Third candidate "Kendo" This example is within the search range of synonyms. There is a system that infers similarities between words based on how they co-occur and ranks synonyms. In addition, there is also a function to search for similar synonyms from co-occurrence information without limiting the scope of the synonym search, although this will be omitted since the network will be large.

〔Effect of the invention〕

本発明の対訳語検索システムには、主機能して訳語（対
２列）の検索機能と、副ａ能として類義語の検索機能と
がある。訳語検索機能は、辞書に記述されている訳語情
報、類義語情報、共起情報を用いて実現されるが、これ
らの情報の何れかが不足または欠落していても、他の情
報を使用して動作可能とする類推機能も備わっている。The bilingual word search system of the present invention has a search function for translated words (pairs of two columns) as a main function, and a search function for synonyms as a secondary function. The translation search function is realized using translation information, synonym information, and co-occurrence information described in the dictionary, but even if any of these information is missing or missing, it cannot be performed using other information. It also has an analogy function that enables it to operate.

例えば、共起情報の記述されていない入力単語について
は類義語情報等から別の単語の記述情報を使って多義語
を選択することが可能であり、訳語情報についても同様
なことが言える。また、共起情報しか定義されていない
単語についても、副機能の類義語検索機能を併用すれば
、訳語の検索が可能になる。For example, for an input word for which co-occurrence information is not described, it is possible to select a polysemous word using description information of another word from synonym information, etc., and the same can be said for translation word information. Furthermore, even for words for which only co-occurrence information is defined, translations can be searched by using the synonym search function, which is a sub-function.

この類推機能と、ネットワーク処理の主要部とを、原語
側（１次ネットワーク）と訳語側（２次ネットワーク）
で分離したことで、言語情報設定を原語側と訳語側で独
立して行うことを可能としている。極端な例として、検
索能率の面から好ましくないが、訳語検索のための情報
を与える単語を少数に絞り、他の単語には訳語情報を与
えられた単語との類義情報・共起情報・上位下位の情報
などを与えることにすれば、殆ど各言語で独立した言語
情報の設定が可能となる。This analogy function and the main part of network processing are divided into the source language side (primary network) and the target language side (secondary network).
This separation allows language information settings to be performed independently on the source language side and the target language side. As an extreme example, although it is undesirable from the viewpoint of search efficiency, the words that provide information for translation searches are narrowed down to a small number of words, and other words are given synonym information, co-occurrence information, and other words that provide translation information. By providing upper and lower information, it becomes possible to set independent language information for almost each language.

更に、類義情報・共起情報の設定をサポートすることが
出来るため、言語情報の設定が能率化される。Furthermore, since the setting of synonym information and co-occurrence information can be supported, the setting of linguistic information can be streamlined.

更に、類推機能においては、辞書に記述されている関連
情報をまとめて使用するため、類推機能の信頼性が高く
なっている。この理由は、少しぐらい変な情報があって
も他の情報により結果に表れ難くなっているためである
。Furthermore, since the analogy function uses related information described in the dictionary all together, the reliability of the analogy function is high. The reason for this is that even if there is slightly strange information, it is difficult to show up in the results due to other information.

以上とは別に、意味ネットワークの処理の大部分を、隣
接する節点とその間のアークの王者の値（状態）だけを
参照する局所的処理の繰り返しで行う方式としたため、
処理の並列化・分散化が進め易くなっている。複数のＣ
ＰＵを用い、各ＣＰＵがネットワークの一部を分担して
処理に当たるようにした場合、各ＣＰＵは大部分独立し
て処理が行なえるようになっている訳である。このこと
は、検索スピードの向上を計る上で、かなり有効な特徴
である。Apart from the above, most of the processing of the semantic network is performed by repeating local processing that refers only to the values (states) of the champions of adjacent nodes and the arcs between them.
It is becoming easier to parallelize and distribute processing. multiple C
When a PU is used and each CPU is assigned a part of the network to perform processing, each CPU is able to perform most of the processing independently. This is a very effective feature in improving search speed.

以上をまとめると、（ａ）　　言語情報の類推の機能があるため、言語情報
の設定が容易になった。To summarize the above, (a) Since there is a function of analogy of linguistic information, setting of linguistic information has become easier.

０）使用する言語情報の多くが統計的な情報であるため
、言語情報の設定を（自動的に）サポー、　　トするシ
ステムの構築が容易である。0) Since most of the language information used is statistical information, it is easy to build a system that (automatically) supports the setting of language information.

（Ｃ）　　単純な構造のネットワークを用い、その処理
を局所的な操作の集まりとしたことで、処理の分散・並
列化が容易である。(C) By using a network with a simple structure and making the processing a collection of local operations, it is easy to distribute and parallelize the processing.

と言うことになる。但し、本発明のシステムは対訳語検
索システムであるため、ここで言う言語情報とは翻訳シ
ステムの使用する情報の一部であることを断っておく。That's what I would say. However, since the system of the present invention is a bilingual word search system, it should be noted that the linguistic information referred to here is part of the information used by the translation system.

[Brief explanation of the drawing]

第１図は本発明の原理図、第２図はシステムの構成例、
第３図は処理の概要を示す図、第４図はアークの値と記
号を示す図、第５図は類義語の抽出の例を示す図、第６
図は共起度セットの例を示す図、第７図は推定光度のセ
ットの例を示す図、第８図は訳語検索の例を示す図、第
９図は訳語検索の他側を示す図、第１０図は類義語検索
の例を示す図、第１１図は従来の訳語検索の方式を示す
図、第１２図は「飛行機が飛ぶ」における各従来方式の
動作例を示す図である。１００・・・入力部、２１０・・・原語側の単語辞書、
２２０・・・訳語側の単語辞書、３００・・・プロセッ
サ、３７１・・・１次ネットワーク、３７２・・・２次
ネットワーク、４００・・・意味関係辞書、５００・・
・出力部。特許出願人　　　富士通株式会社代理人弁理士　　京　谷　四　部九＃／）攪キ隼３ｍ了−フのイ直と記号徒起慶毛・・２トの４グリ鳩ｔ（２）（物７１．θ、０）推定ｒ度の名ット０イタ１１燥−Ｉ（２）づ　又　ア（物、１．ち１５）　　　　　　（様、１．θ、５．θ
５）（１ゴの０．２５，０．Ｄ）　　　　　　　　（７
Ｊシ、１，０．５，０．５ン第ｑ図　　ＰｌｔＬｆＬＬ
Ｃ郡肌（尺待千、（ｑ手７合銖道、愈１道））ｒ７ｚ′８ｆ−
り＝＝：）−ｒ′ｆつ（□・１・０・１）（様、１．θ、５．７）空手漬艙り
剣道預良諺倹紫／）４り′１瑞／ρ図？記士１Figure 1 is a diagram of the principle of the present invention, Figure 2 is an example of the system configuration,
Figure 3 is a diagram showing an overview of the process, Figure 4 is a diagram showing arc values and symbols, Figure 5 is a diagram showing an example of synonym extraction, and Figure 6 is a diagram showing an example of synonym extraction.
The figure shows an example of a co-occurrence degree set, Figure 7 shows an example of a set of estimated luminosity, Figure 8 shows an example of a translated word search, and Figure 9 shows the other side of a translated word search. , FIG. 10 is a diagram showing an example of synonym search, FIG. 11 is a diagram showing a conventional translation search method, and FIG. 12 is a diagram showing an example of the operation of each conventional method for "Airplane flies." 100... Input unit, 210... Word dictionary on the original language side,
220... Word dictionary on the translation side, 300... Processor, 371... Primary network, 372... Secondary network, 400... Semantic relationship dictionary, 500...
・Output section. Patent Applicant Fujitsu Ltd. Representative Patent Attorney Kyotani 4 part 9#/) Stirring Hayabusa 3m Ryo-fu's i-nao and symbol Takuki Keike...2t's 4grihatot(2) (Object 71. θ, 0) Estimated r degree name 0 ita 11 DR-I (2) zu again a (thing, 1.chi15) (like, 1.θ, 5.θ
5) (0.25, 0.D of 1 go) (7
Jshi, 1,0.5,0.5n Figure q PltLfLL
C Gunhada (Shakumachisen, (q hand 7 joint road, yu 1 road)) r7z'8f-
ri==:)-r'f tsu (□・1・0・1) (sama, 1.θ, 5.7) karate pickles kendo yoryo proverb 倹子/)4ri′1 Mizui/ρ diagram ? Journalist 1

Claims

[Claims] An input unit (100) for inputting an input language, a word dictionary (210) on the source language side, a word dictionary (220) on the translation side, a semantic relationship dictionary (400), and a meaning of the input language. A primary network generating means (310) for generating a primary network (371) which is an associative network; a primary network state setting means (320) for setting the state of the primary network (371); A secondary network generating means (330) for generating a secondary network (372) which is a semantic association network; a secondary network state setting means (340) for setting the state of the secondary network (372); and a state setting means. A translation word extraction means (350) for extracting translation words from the secondary network (372) for which the status setting has been completed; 350
) is the translated word or synonym extraction means (360
) outputs the synonyms extracted by
), and the word dictionary (210) on the source side contains information for searching word headings and the semantic relationship dictionary (400), and information for searching the word dictionary (220) on the translation side. The word dictionary (220) on the translation side has word headings, information for searching the semantic relationship dictionary (400), and grammatical information. ) has information indicating the semantic relationship between words, and the primary network generation means (310) stores words related to the input language in a word dictionary (210) and a semantic relationship dictionary (40).
0), and the extracted words in the original language are nodes and the relationships are arcs (371)
The primary network state setting means (320) sets an initial state to the nodes of the input word on the primary network (371). , set the state of adjacent nodes according to the arc connected to the node of the input word and the state of the node, and then perform the same process starting from the node where the state has been set, to determine the state of the surrounding nodes. The secondary network generation means (330) stores translations corresponding to words on the primary network (371) and words related to the translations in a word dictionary (220) and a semantic relationship dictionary (220).
A secondary network (372
) by connecting the nodes on the translation side with arcs according to the relationship, and the secondary network state setting means (340) connects the nodes of the secondary network with the corresponding nodes on the primary network. Determine the initial state of the secondary network by copying the state of the node, then start from the node whose state is set,
A bilingual word search system configured to set or modify states of surrounding nodes.