JP2006209173A

JP2006209173A - Dependency structure analyzing apparatus and computer program

Info

Publication number: JP2006209173A
Application number: JP2005016341A
Authority: JP
Inventors: Masahiro Ono; 誠寛大野; Shigeki Matsubara; 茂樹松原; Hidenori Kashioka; 秀紀柏岡; Naoto Kato; 直人加藤
Original assignee: ATR Advanced Telecommunications Research Institute International
Current assignee: ATR Advanced Telecommunications Research Institute International
Priority date: 2005-01-25
Filing date: 2005-01-25
Publication date: 2006-08-10

Abstract

【課題】発話途中でも節間の係り受けを決定し、後の処理に利用することができるようにする。
【解決手段】係り受け構造解析装置４２は、逐次的に入力される発話４０の係り受け構造を漸進的に解析するもので、発話４０の節境界を発話の局所的情報に基づいて検出し、当該節境界と直前の節境界との間の発話からなる節境界単位を出力する節判定処理部６０と、節境界が検出されたことに応答して、係り先が未決定の節境界単位について節間係り受け確率モデル３８を用いた節間係り受け推定で得られた係り先情報を出力する節間係り受け解析処理部７０と、係り先情報の出力に応答して、係り先が未決定の節境界単位について得られた係り先情報の履歴に基づいて、係り先が未決定の節境界単位のうち、所定の条件を充足するものの係り先を決定して出力する節間係り受け判定処理部７２とを含む。
【選択図】図２
To determine the dependency between clauses even during utterance so that it can be used for later processing.
A dependency structure analyzing device 42 analyzes a dependency structure of utterances 40 sequentially input, detects a node boundary of the utterance 40 based on local information of the utterance, A node determination processing unit 60 that outputs a node boundary unit composed of utterances between the node boundary and the immediately preceding node boundary, and a node boundary unit whose relationship destination is undetermined in response to the detection of the node boundary. Intersection dependency analysis processing unit 70 that outputs dependency destination information obtained by inter-node dependency estimation using inter-subordinate dependency probability model 38, and the dependency destination is undetermined in response to the output of dependency destination information Based on the history of the dependency destination information obtained for each node boundary unit, among the node boundary units for which the dependency destination is not yet determined, the interdependence dependency determination process for determining and outputting the dependency destination that satisfies a predetermined condition Part 72.
[Selection] Figure 2

Description

この発明は文の構成要素の間の係り受け解析技術に関し、特に、同時通訳及び字幕生成のように、発話をその進行と同時に係り受け解析する技術、すなわち漸進的係り受け解析技術の改良に関する The present invention relates to a dependency analysis technique between sentence components, and more particularly to a technique for dependency analysis simultaneously with its progress, such as simultaneous interpretation and subtitle generation, that is, an improvement of a progressive dependency analysis technique.

同時通訳又は映画若しくはテレビジョン放送における字幕自動生成において適切な結果を得るためには、発話を音声認識処理等により一旦テキスト情報に変換し、さらに音声認識結果の入力と同時並行的に係り受け解析することが必要である。そのように入力と同時に係り受け解析する技術を本明細書では漸進的係り受け解析と呼ぶ。 In order to obtain appropriate results in simultaneous interpretation or automatic caption generation in movies or television broadcasts, speech is once converted into text information by voice recognition processing, etc., and dependency analysis is performed simultaneously with input of the speech recognition results. It is necessary to. Such a technique of performing dependency analysis simultaneously with input is referred to as progressive dependency analysis in this specification.

上記したような需要が存在するため、従来から漸進的係り受け解析に関する研究がいくつか行なわれている。そうした従来の手法では、ある言語単位（例えば文、節等）で入力を分割し、分割後の言語単位に対し何らかの手法に基づいて係り受け解析を行なうことを繰返す。例えば非特許文献１には、確率文脈自由文法に基づいて漸進的に係り受け解析を行なう技術が開示されている。 Due to the above-described demand, several studies on gradual dependency analysis have been conducted. In such a conventional method, the input is divided in a certain language unit (for example, sentence, clause, etc.), and dependency analysis is repeatedly performed on the divided language unit based on some method. For example, Non-Patent Document 1 discloses a technique for performing dependency analysis gradually based on a probability context free grammar.

一方、例えば独話では、１文が長く構造が複雑であるという特徴に着目し、文中の節境界を検出して節境界に基づく係り受け解析を行なう手法が、非特許文献２に開示されている。この手法では、節レベルと文レベルとの２段階で係り受け解析を実行する。まず、節境界解析により文を節に分割し、各節に対して係り受け解析を行なうことにより、節内の係り受け関係を同定する。次に節境界をまたぐ係り受け関係を定め、全体の係り受け構造を作り上げる。 On the other hand, for example in monologue, Non-Patent Document 2 discloses a technique for detecting dependency of a clause in a sentence and performing dependency analysis based on the clause boundary, focusing on the feature that a sentence is long and complicated in structure. Yes. In this method, dependency analysis is executed in two stages, a clause level and a sentence level. First, a sentence is divided into clauses by clause boundary analysis, and dependency analysis in each clause is identified by performing dependency analysis on each clause. Next, a dependency relationship across the node boundaries is determined, and the entire dependency structure is created.

一方、発話中の節境界を検出するための節境界検出手法が特許文献１に開示されている。特許文献１に記載の手法は、基本的には形態素列中で局所的に特定のパタンを検出した場合に、そのパタンに関連した所定箇所に節境界を想定するというものである。そのために、形態素列の局所的なパタンと、節境界の位置とをいわゆる正規表現で表したルールが予め準備される。節境界検出時には、検出された節境界と、直前に検出された節境界とではさまれた単位を近似的な節として逐次出力できる。
特開２００４−３１８３００号公報加藤芳秀、松原茂樹、外山勝彦、稲垣康善、「確率文脈自由文法に基づく漸進的構文解析」、電気学会論文誌、Ｖｏｌ．１２２−Ｃ、Ｎｏ．１、２１０９頁〜２１１９頁、２００２年１２月大野誠寛、松原茂樹、丸山岳彦、柏岡秀紀、田中英輝、稲垣康善、「節境界に基づく独話文係り受け解析の効率化」、情報処理学会研究報告、ＮＬ−１６２、２１３頁〜２２０頁、２００４年 On the other hand, Patent Document 1 discloses a node boundary detection method for detecting a node boundary during speech. The method described in Patent Document 1 basically assumes that when a specific pattern is detected locally in a morpheme string, a node boundary is assumed at a predetermined location related to the pattern. For this purpose, a rule in which the local pattern of the morpheme string and the position of the node boundary are expressed by a so-called regular expression is prepared in advance. At the time of detecting a node boundary, a unit between the detected node boundary and the node boundary detected immediately before can be sequentially output as an approximate node.
JP 2004-318300 A Yoshihide Kato, Shigeki Matsubara, Katsuhiko Toyama, Yasuyoshi Inagaki, “Progressive Parsing Based on Stochastic Context Free Grammar”, IEEJ Transactions, Vol. 122-C, no. 1, 2, 109-2119, December 2002 Masanobu Ohno, Shigeki Matsubara, Takehiko Maruyama, Hideki Sasaoka, Hideki Tanaka, Yasuyoshi Inagaki, "Efficiency of dependency analysis based on clause boundaries", IPS-162, pp. 213-220, 2004

ところで、文境界が付与されていない発話データ、特に独話データの場合には、明示的な文末標識がない。そのため、係り受け解析に先立って発話データを文単位に区切ることは容易ではない。そうした状況では、上記した非特許文献１に記載の手法を適用することは難しい。また、非特許文献２に記載の手法は発話データ全体の入力の後に適用することが前提である。従って、この手法はそのままでは漸進的な係り受け解析には用いることができない。 By the way, in the case of utterance data to which no sentence boundary is given, especially monologue data, there is no explicit sentence end indicator. Therefore, it is not easy to divide speech data into sentence units prior to dependency analysis. In such a situation, it is difficult to apply the method described in Non-Patent Document 1 described above. In addition, it is assumed that the method described in Non-Patent Document 2 is applied after input of the entire speech data. Therefore, this method cannot be used for progressive dependency analysis as it is.

しかし、経済及び人の交流が国際化した現代では、ある言語の発話内容をリアルタイムで別の言語に自動的に変換したり、字幕を生成したりするという技術に関する需要は大きい。そうした処理では、発話の進行と並行して精度よく係り受け解析を行なうことができる技術が必要である。 However, in the present age when economics and human exchange are internationalized, there is a great demand for technology for automatically converting the utterance content of one language into another language in real time and generating subtitles. Such processing requires technology that can perform dependency analysis with high accuracy in parallel with the progress of the utterance.

従って、本発明の目的は、発話途中でも節間の係り受けを決定し、後の処理に利用することができる係り受け構造解析装置とそのためのコンピュータプログラムとを提供することである。 Accordingly, an object of the present invention is to provide a dependency structure analyzing apparatus and a computer program therefor, which can determine dependency between clauses even during utterance and can be used for later processing.

本発明の第１の局面に係る係り受け構造解析装置は、逐次的に入力される発話の係り受け構造を漸進的に解析する係り受け構造解析装置であって、逐次入力される発話の節境界を発話の局所的情報に基づいて検出し、当該節境界と直前の節境界との間の発話からなる節境界単位を出力するための節境界検出手段と、節境界検出手段により節境界が検出されたことに応答して、係り先が未決定の節境界単位について所定の節間係り受け解析方法を適用し、得られた係り先情報を出力するための節間係り受け解析手段と、節間係り受け解析手段による係り先情報の出力に応答して、係り先が未決定の節境界単位について節間係り受け解析手段により得られた係り先情報の履歴を保持し、当該履歴に基づいて、係り先が未決定の節境界単位のうち、所定の条件を充足するものの係り先を決定して出力するための節間係り受け判定手段とを含む。 A dependency structure analyzing apparatus according to a first aspect of the present invention is a dependency structure analyzing apparatus that progressively analyzes a dependency structure of utterances that are sequentially input, and is a node boundary of utterances that are sequentially input. Is detected based on the local information of the utterance, and the node boundary is detected by the node boundary detecting means for outputting the node boundary unit consisting of the utterance between the relevant node boundary and the immediately preceding node boundary, and the node boundary detecting means detects the node boundary. In response to this, an inter-node dependency analysis means for applying a predetermined inter-cell dependency analysis method to a node boundary unit whose dependency point is undetermined, and outputting the obtained dependency information, In response to the output of the dependency destination information by the dependency dependency analysis means, a history of dependency destination information obtained by the inter-cell dependency analysis means for the node boundary unit whose dependency is undetermined is retained, and based on the history , Out of the node boundary unit whose relationship is not yet determined, And a internode dependency determination means for determines and outputs one of dependency destination that satisfies certain conditions.

節境界検出手段により、入力される発話の節境界が検出され、それまでの節境界間の発話からなる節境界単位が出力される。節間係り受け解析手段は、節境界が検出されると、係り先が未決定の節境界単位について係り先を推定する。節間係り受け判定手段は、係り先が未決定の節境界単位について、推定された係り先の履歴を保持し、その履歴に基づいて所定の条件を充足する節境界単位の係り先を決定する。節境界検出手段による節境界の検出と節境界単位の出力と節間係り受け解析手段による節間係り受け解析とが逐次的に行なわれる。節間係り受け解析の結果に応じて、所定の条件が充足されると、節間係り受け判定手段が対応する節境界単位の係り先を決定し出力する。これら処理が発話の入力と同時並行的に漸進的に行なわれるので、文単位でなく、発話途中でも節間の係り受けを決定し、後の処理に利用することができる。 The node boundary detection means detects the node boundary of the input utterance, and outputs the node boundary unit consisting of the utterances between the node boundaries so far. When the node boundary is detected, the inter-node dependency analyzing means estimates the dependency for the node boundary unit whose dependency is not yet determined. The inter-node dependency determination means retains the history of the estimated dependency destination for the node boundary unit for which the dependency destination has not been determined, and determines the dependency destination of the node boundary unit that satisfies a predetermined condition based on the history. . The detection of the node boundary by the node boundary detecting means, the output of the node boundary unit, and the inter-node dependency analysis by the inter-node dependency analyzing means are sequentially performed. When a predetermined condition is satisfied according to the result of the inter-node dependency analysis, the inter-cell dependency determining means determines and outputs a corresponding inter-unit boundary unit. Since these processes are performed gradually in parallel with the input of the utterance, the dependency between the clauses can be determined not only in the sentence unit but also in the middle of the utterance and used for the subsequent process.

好ましくは、係り受け構造解析装置はさらに、節境界検出手段により節境界が検出されたことに応答して、節境界検出手段から出力された節境界単位に対して所定の節内係り受け解析方法を適用し、得られた節内係り受け結果を保存するための節内係り受け解析手段を含み、節間係り受け判定手段は、節間係り受け解析手段による係り先情報の出力に応答して、係り先が未決定の節境界単位について節間係り受け解析手段により得られた係り先情報の履歴を保持するための履歴保持手段と、履歴保持手段により保持されている履歴に基づいて、係り先が未決定の節境界単位のうち、所定の条件を充足するものの係り先を決定するための節係り先決定手段と、節係り先決定手段により決定された係り先と、当該節境界単位について節内係り受け解析手段に保持されていた節内係り受け結果とをあわせて当該節境界単位に対する係り受け関係情報として出力するための手段とを含む。 Preferably, the dependency structure analyzing apparatus further includes a predetermined intra-section dependency analysis method for the nodal boundary unit output from the nodal boundary detecting means in response to the nodal boundary being detected by the nodal boundary detecting means. And inter-group dependency analysis means for storing the obtained intra-section dependency result, and the inter-node dependency determination means responds to the output of dependency destination information by the inter-node dependency analysis means. A history holding means for holding the history of the dependency destination information obtained by the inter-node dependency analysis means for the node boundary unit whose dependency destination is undetermined, and a relationship based on the history held by the history holding means Among clause boundary units whose destinations are not yet determined, a clause destination determination means for determining a destination of a clause that satisfies a predetermined condition, a dependency destination determined by the clause destination determination means, and the clause boundary unit Dependency in section Together with intranodal dependency results held in the analyzing means and means for outputting as a dependency relationship information for the clause boundary unit.

節内係り受け解析手段がこの節境界単位に対して節内係り受け解析を行ない、その結果を保持しておく。ある節境界単位に対し節間係り受けが決定すると、当該節境界単位に対する節内係り受け解析の結果とあわせて係り受け関係として出力される。逐次入力される発話に対し、漸進的に各節境界単位に対する係り受け関係が節内及び節間の双方について得られる。その結果、文単位でなく、発話途中でも節内及び節間の係り受けを決定し、後の処理に利用することができる。 The intra-clause dependency analysis means performs intra-clause dependency analysis on the node boundary unit and holds the result. When the inter-node dependency is determined for a certain node boundary unit, the dependency relationship is output together with the result of the intra-node dependency analysis for the node boundary unit. For utterances that are input sequentially, a dependency relationship with respect to each node boundary unit is obtained progressively for both within and between nodes. As a result, it is possible to determine the dependency within the clause and between the clauses even during the utterance instead of the sentence unit, and use it for later processing.

さらに好ましくは、履歴保持手段は、節間係り受け解析手段による係り先情報の出力に応答して、係り先が未決定の節境界単位について節間係り受け解析手段により得られた係り先情報の直前の履歴を各節境界単位に対し所定個数だけ保持するための手段を含み、節係り先決定手段は、ある節境界単位に対して保持するための手段に保持されている係り先情報が全て一致していることに応答して、当該ある節境界単位の係り先を節間係り受け解析手段により最後に得られた係り先に決定するための手段を含む。 More preferably, the history holding means responds to the output of the dependency destination information by the inter-node dependency analysis means, and the dependency destination information obtained by the inter-node dependency analysis means for the node boundary unit whose dependency is not yet determined. Including a means for holding a predetermined number of previous histories for each node boundary unit, and the node destination determination unit includes all the destination information held in the unit for holding a node boundary unit. In response to the coincidence, means for determining a destination of the certain node boundary unit as a last obtained by the inter-node dependency analyzing means is included.

節間係り受け判定手段はさらに、所定の自然数を記憶するための所定値記憶手段を含み、保持するための手段は、節間係り受け解析手段による係り先情報の出力に応答して、係り先が未決定の節境界単位について節間係り受け解析手段により得られた係り先情報の直前の履歴を、各節境界単位に対し所定値記憶手段に記憶された所定の自然数の個数だけ保持するための手段を含んでもよい。 The inter-node dependency determining means further includes a predetermined value storage means for storing a predetermined natural number, and the means for holding is in response to the output of dependency destination information by the inter-node dependency analyzing means. In order to hold the history immediately before the dependency destination information obtained by the inter-node dependency analysis means for the undecided clause boundary unit, by the number of a predetermined natural number stored in the predetermined value storage means for each node boundary unit. These means may be included.

係り受け構造解析装置はさらに、所定値記憶手段に記憶される所定の自然数を入力するための入力手段を含んでもよい。 The dependency structure analyzing apparatus may further include an input unit for inputting a predetermined natural number stored in the predetermined value storage unit.

好ましくは、節係り先決定手段は、節間係り受け解析手段による係り先情報の出力に応答して、係り先が未決定の節境界単位について節間係り受け解析手段により得られた係り先情報と、当該節境界単位について保持するための手段に保持されている直前の係り先情報とが一致するか否かを判定するための判定手段と、判定手段により一致という結果が連続して得られた回数を、係り先が未決定の節境界単位の各々に対してカウントするためのカウンタ手段と、カウンタ手段による、ある節境界単位のカウント結果が所定の値に達したことに応答して、当該ある節境界単位の係り先を、当該ある節境界単位に対して節間係り受け解析手段により最後に得られた係り先に決定するための手段とを含む。 Preferably, the node destination determination means is the response destination information obtained by the node dependency analysis means for the node boundary unit whose dependency is not yet determined in response to the output of the dependency destination information by the internode dependency analysis means. And the determination means for determining whether or not the last relation information held in the means for holding the relevant section boundary unit matches, and the determination means continuously obtains the result of the match. In response to the counter means for counting the number of times for each of the node boundary units for which the relationship is undetermined, and the count result of a certain node boundary unit by the counter means reaching a predetermined value, And a means for determining a dependency point of the certain node boundary unit as a dependency point finally obtained by the inter-node dependency analysis means for the certain node boundary unit.

本発明の第２の局面に係るコンピュータプログラムは、コンピュータにより実行されると、当該コンピュータを、上記したいずれかの係り受け構造解析装置として動作させるものである。従って、上記した係り受け構造解析装置と同様の作用・効果を奏することができる。 The computer program according to the second aspect of the present invention, when executed by a computer, causes the computer to operate as any one of the dependency structure analyzing apparatuses described above. Therefore, the same operation and effect as the above-described dependency structure analyzing apparatus can be obtained.

［節と係り受け］
以下に述べる本発明の実施の形態は、節を基本的な解析単位とする発話の漸進的係り受け解析手法を用いる。そのために、上記した特許文献１に記載の節境界検出がほぼリアルタイムで発話の節境界を検出できることを利用する。さらに、節境界が検出され、解析の対象となる節が確定するたびに、その節の内部の係り受け構造を作り上げるとともに、既に入力されている節の係り先を決定することを試みる。節の係り先となる文節の決定は、後続するいくつかの文節との係り受けの尤度を考慮した動的なタイミングで実行する。 [Section and dependency]
The embodiment of the present invention described below uses a gradual dependency analysis method of utterances with a section as a basic analysis unit. Therefore, the fact that the node boundary detection described in Patent Document 1 described above can detect the node boundary of the utterance in almost real time is utilized. Furthermore, every time a node boundary is detected and a node to be analyzed is determined, a dependency structure inside the node is created, and an attempt is made to determine the dependency of a node that has already been input. The determination of the clauses to which the clauses are related is executed at a dynamic timing in consideration of the likelihood of the dependency with some subsequent clauses.

節とは、述語を中心としたまとまりであり、複文又は重文の場合、文は複数の節から構成される。さらに、節は、統語的及び意味的にまとまった単位である。そのため、文に変わる単位として節を利用できる。なお、以下の実施の形態は、「文は一つ以上の節の連接であり、各節を構成する文節は、節の最終文節を除き、その節の内部の文節に係る」とみなすことにより実現している。 A clause is a group centered on a predicate. In the case of a compound sentence or a heavy sentence, a sentence is composed of a plurality of clauses. In addition, a clause is a syntactically and semantically unit. Therefore, clauses can be used as units that change into sentences. In the following embodiment, it is assumed that “a sentence is a concatenation of one or more clauses, and the clauses constituting each clause are related to the clauses inside the clause except the last clause of the clause”. Realized.

図１に、以下の実施の形態で行なわれる係り受け構造解析を模式的に示す。図１を参照して、発話「先日総理府が発表いたしました世論調査によりますと死刑を支持するという人が八十パーセント近くになっております。」という文は、４つの節「先日総理府が発表いたしました」、「世論調査によりますと」、「死刑を支持するという」、及び「人が八十パーセント近くになっております」から構成される。各節は係り受け構造を形成し、それらが節の最終文節からの係り受け構造でつながっている。図１において節内の係り受けは実線の矢印で、節間の係り受けは点線の矢印で、それぞれ示されている。 FIG. 1 schematically shows a dependency structure analysis performed in the following embodiment. Referring to Figure 1, the sentence "The percentage of people who support the death penalty is nearly 80 percent according to a poll released by the Prime Minister the other day." It was made up of “I did it”, “According to public opinion polls”, “To support the death penalty”, and “Persons are nearly 80 percent”. Each section forms a dependency structure, and they are connected by a dependency structure from the last sentence of the section. In FIG. 1, the dependency in the node is indicated by a solid arrow, and the dependency between the nodes is indicated by a dotted arrow.

節を文に代わる解析単位とするためには、係り受け解析の前処理として発話文を節に分割する必要がある。節には、主節の中に埋め込まれた従属節も存在するため、本来、文を節に一次元的に分割することは困難である。また、漸進的な処理では文の最後まで到達しないうちに発話の前半部分の係り受け解析を行なう必要があり、文の分割はさらに困難である。 In order to make a clause an analysis unit instead of a sentence, it is necessary to divide an utterance sentence into clauses as a preprocessing of dependency analysis. Since there are subordinate clauses embedded in the main clause, it is inherently difficult to divide the sentence into one-dimensional sections. Further, in the gradual processing, it is necessary to perform dependency analysis of the first half of the utterance before the end of the sentence is reached, and the sentence division is further difficult.

しかし、上記した特許文献１に記載の節境界検出手法を用いることにより、検出された節境界で文を近似的に節に分割することができる。しかも特許文献１に記載の手法では、必要な情報は入力された文を形態素解析した結果得られる形態素列の局所的なパタンである。従って、漸進的手法にこの手法を適用することにより、各節ごとに内部の係り受け構造を解析できる。なお、検出された節境界によりはさまれた言語単位を以下の説明では「節境界単位」と呼び、これを新たな解析単位と考える。特許文献１に記載の手法では、局所的な形態素列のみを手がかりとして節境界単位の終端位置と節境界単位の種類とを特定し、１４４種の節ラベルのいずれかを各節境界単位に付与する。 However, by using the node boundary detection method described in Patent Document 1, the sentence can be approximately divided into nodes at the detected node boundary. In addition, in the method described in Patent Document 1, necessary information is a local pattern of a morpheme string obtained as a result of morphological analysis of an input sentence. Therefore, by applying this method to the progressive method, the internal dependency structure can be analyzed for each node. In the following description, a language unit sandwiched between detected clause boundaries is referred to as a “section boundary unit” and is considered as a new analysis unit. In the method described in Patent Document 1, the end position of a node boundary unit and the type of node boundary unit are specified using only a local morpheme string as a clue, and any one of 144 types of node labels is assigned to each node boundary unit. To do.

以下の実施の形態では、このように各節境界単位の内部の係り受け構造を解析した後、さらに各節境界単位の最終文節からの係り先について以下のようにして決定する。すなわち、後続する節境界単位が入力されるたびに、節間の係り受けを繰返し解析し、所定の条件が充足された場合に、各節境界単位の最終文節の係り先を決定する。なお、以下の実施の形態の説明では、説明を簡略にするため、節境界単位を単に節と呼ぶ。 In the following embodiment, after analyzing the dependency structure inside each clause boundary unit as described above, the dependency destination from the final clause of each clause boundary unit is further determined as follows. That is, each time a subsequent clause boundary unit is input, the dependency between clauses is repeatedly analyzed, and when a predetermined condition is satisfied, the destination of the final clause of each clause boundary unit is determined. In the following description of the embodiments, the node boundary unit is simply referred to as a node in order to simplify the description.

［節と係り受け構造の関係］
節と係り受け構造との関係を明らかにするために、独話文コーパスを用いて節と係り受け構造との関係について分析した。分析には、日本放送協会（ＮＨＫ）の解説番組「あすを読む」の書起こしデータ２００文に対して形態素解析、文節まとめ上げ、節境界解析、及び係り受け解析を自動的に行ない、その結果を人手で修正したものを用いた。 [Relationship between section and dependency structure]
In order to clarify the relationship between clauses and dependency structures, we analyzed the relationship between clauses and dependency structures using a monologue corpus. For the analysis, the Japanese Broadcasting Corporation (NHK) commentary program "Read Asu" transcription data 200 sentences, automatically morphological analysis, paragraph summarization, clause boundary analysis, and dependency analysis, the result The one that was manually corrected was used.

この２００文の基礎統計を表１に示す。 Table 1 shows the basic statistics of these 200 sentences.

表１に示すように、総文節数２，４３０のうち、節の最終文節（９５１文節）を除いた１，４７９文節の中で、９４文節のみが節外に位置する文節に係っていた。これは、全体の９３．６％（１，３８５／１，４７９）の係り受けが節で閉じていることを意味している。従って、以下で説明する実施の形態での「文は一つ以上の節の連接であり、各節を構成する文節は、節の最終文節を除き、その節の内部の文節に係る」という仮定がある程度妥当であることを表している。

As shown in Table 1, out of 1,479 clauses excluding the last clause (951 clause) of the total number of clauses of 2,430, only 94 clauses were related to clauses located outside the clause. . This means that 93.6% (1,385 / 1,479) of the dependency is closed at the node. Therefore, in the embodiment described below, it is assumed that “a sentence is a concatenation of one or more clauses, and the clauses constituting each clause are related to the clauses inside the clause except the last clause of the clause”. Indicates that it is appropriate to some extent.

［第１の実施の形態］
−構成−
以下、本発明の第１の実施の形態に係る係り受け構造解析システム３０の機能的構成について、図２〜図５を参照して説明する。図２を参照して、係り受け構造解析システム３０は、節内の係り受けに関する確率モデルである節内係り受け確率モデル３６と、節間の係り受けに関する確率モデルである節間係り受け確率モデル３８とを用い、入力される発話４０に対する係り受け構造を漸進的に解析して係り受け関係４６を節単位で順次出力するための係り受け構造解析装置４２を含む。係り受け構造解析装置４２には、後述するように内部処理で用いられる所定値（Ｎ）を入力する際に利用者が操作する入出力装置４４が接続される。 [First Embodiment]
−Configuration−
Hereinafter, the functional configuration of the dependency structure analyzing system 30 according to the first embodiment of the present invention will be described with reference to FIGS. Referring to FIG. 2, the dependency structure analysis system 30 includes an intra-node dependency probability model 36 that is a probability model related to intra-node dependency and an inter-node dependency probability model that is a probability model related to inter-node dependency. 38, a dependency structure analyzing device 42 for progressively analyzing the dependency structure for the input utterance 40 and sequentially outputting the dependency relationship 46 in units of nodes. The dependency structure analyzing device 42 is connected to an input / output device 44 operated by a user when inputting a predetermined value (N) used in internal processing, as will be described later.

節内係り受け確率モデル３６及び節間係り受け確率モデル３８は、予め行なわれた係り受け構造解析の結果を示す情報が付された学習データ３２を用い、後述する式に従って係り受け学習処理部３４により準備される。 The intra-node dependency probability model 36 and the inter-node dependency probability model 38 use the learning data 32 to which information indicating the result of the dependency structure analysis performed in advance is used, and the dependency learning processing unit 34 according to an expression described later. Prepared by.

係り受け構造解析装置４２は、発話４０に対し形態素解析、文節まとめ上げ、及び特許文献１に記載の節境界検出処理を行なうことにより発話４０を節に逐次分割し、節が分割されるたびに当該節を出力するための節判定処理部６０と、節判定処理部６０がその節判定処理中に発話データを一時記憶するために使用する発話データ一時記憶部６２と、節判定処理部６０により節が出力されるたびに当該節を受け、節内係り受け確率モデル３６を参照して当該節に対する節内係り受け解析処理を行ない、その結果を当該節の形態素列とともに出力するための節内係り受け解析処理部６４と、節内係り受け解析処理部６４による係り受け解析結果を節の形態素列とともに記憶するための解析結果記憶部６６とを含む。 The dependency structure analyzing apparatus 42 sequentially divides the utterance 40 into clauses by performing morphological analysis, sentence summarization, and node boundary detection processing described in Patent Document 1 for each utterance 40, and each time a clause is divided. The node determination processing unit 60 for outputting the clause, the utterance data temporary storage unit 62 used by the node determination processing unit 60 to temporarily store utterance data during the node determination processing, and the node determination processing unit 60 Each time a clause is output, the clause is received, the clause dependency analysis process is performed on the clause with reference to the clause dependency probability model 36, and the result is output together with the morpheme sequence of the clause. A dependency analysis processing unit 64 and an analysis result storage unit 66 for storing the dependency analysis result by the intra-node dependency analysis processing unit 64 together with the morpheme sequence of the nodes are included.

係り受け構造解析装置４２はさらに、入出力装置４４を用いて操作者により入力された所定値（Ｎ）を記憶するための所定数記憶部６８と、節判定処理部６０により節境界が検出された節が節内係り受け解析処理部６４に出力されたことに応答して、それ以前に出力された節であってまだ係り先が確定していない最終文節を持つものに対する節間係り受け解析処理を節間係り受け確率モデル３８を利用して行なうための節間係り受け解析処理部７０とを含む。 The dependency structure analyzing device 42 further detects a node boundary by a predetermined number storage unit 68 for storing a predetermined value (N) input by an operator using the input / output device 44 and a node determination processing unit 60. In response to the fact that the clauses are output to the intra-section dependency analysis processing unit 64, the inter-node dependency analysis is performed on a clause that has been output before and whose final destination has not yet been determined. An inter-node dependency analysis processing unit 70 for performing processing using the inter-cell dependency probability model 38 is included.

係り受け構造解析装置４２はさらに、節間係り受け解析処理部７０による節間係り受け解析処理により各節の最終文節に対して得られた係り先に関する情報を、節ごとに履歴として記憶するための節間係り先一時記憶部７４と、カウンタとして使用される変数を記憶するためのカウンタ記憶部７６と、解析結果記憶部６６に記憶された節内解析結果、節間係り先一時記憶部７４に記憶された節間係り先の履歴、及びカウンタ記憶部７６を用い、節の最終文節の係り先が、所定数記憶部６８に記憶された所定値で定められる回数の間、変化せず一定だったか否かを節間係り受け解析処理部７０による解析の完了に応答して判定し、その結果に従って当該節の最終文節の係り先を特定し、当該節の節内係り受け関係と、当該節に関する節間係り受け関係とを係り受け関係４６として出力するための節間係り受け判定処理部７２とを含む。なお本実施の形態では、節間係り受け判定処理部７２は、上記した判定の後、次の判定の際に新たな節間係り先と比較するために、節間係り先一時記憶部７４に記憶されていた節間係り先に関する情報を記憶する。 The dependency structure analyzing apparatus 42 further stores, as a history for each clause, information on the dependency destination obtained for the final sentence of each clause by the inter-node dependency analysis processing by the inter-node dependency analysis processing unit 70. Inter-node connection destination temporary storage unit 74, counter storage unit 76 for storing variables used as counters, intra-node analysis results stored in analysis result storage unit 66, inter-node connection destination temporary storage unit 74 The inter-node relationship history stored in the memory and the counter storage unit 76 are used, and the destination of the last clause of the clause remains constant for the number of times determined by the predetermined value stored in the predetermined-number storage unit 68. Is determined in response to the completion of the analysis by the inter-part dependency analysis processing unit 70, and the destination of the final clause of the relevant section is specified according to the result, Intersection dependency on clauses Receiving internode dependency for outputting a dependency relation 46 a relationship and a determination processing unit 72. In this embodiment, the inter-node dependency determination processing unit 72 stores the inter-cell dependency destination temporary storage unit 74 in order to compare with a new inter-node relationship destination at the next determination after the above determination. Stores information related to inter-node relationships.

図３に、節間係り受け解析処理部７０の詳細な構成を示す。図３を参照して、節間係り受け解析処理部７０は、節判定処理部６０から節が与えられたことを検出して節入力信号を出力するための節入力検出部９０と、節入力検出部９０からの節入力信号に応答して、新たに入力された節に対する記憶領域を節間係り先一時記憶部７４に、カウンタ領域をカウンタ記憶部７６に、それぞれ確保するための節間係り受け記憶初期化部９６と、節入力検出部９０からの節入力信号に応答して、解析結果記憶部６６に記憶されている各節の係り受け解析結果に対し、節間係り受け確率モデル３８を用いた節間係り受け解析処理を行ない、各節の最終文節の係り先を推定する処理を行なうための節最終文節係り先解析処理部９２と、節最終文節係り先解析処理部９２により推定された各節の最終文節の係り先によって、節間係り先一時記憶部７４に記憶されている各節の節間係り先の履歴を更新するための係り先更新処理部９４とを含む。 FIG. 3 shows a detailed configuration of the inter-node dependency analysis processing unit 70. Referring to FIG. 3, inter-node dependency analysis processing unit 70 detects a node from node determination processing unit 60 and outputs a node input signal. In response to the node input signal from the detection unit 90, the storage region for the newly input node is stored in the node connection temporary storage unit 74, and the counter region is stored in the counter storage unit 76. In response to the node input signals from the reception memory initialization unit 96 and the node input detection unit 90, the inter-node dependency probability model 38 for the dependency analysis results of each node stored in the analysis result storage unit 66. The inter-node dependency analysis processing using the clause is performed, and the clause final clause dependency destination analysis processing unit 92 for performing the processing for estimating the dependency destination of the final clause of each clause, and the section final clause dependency destination analysis processing unit 92 perform the estimation Depending on the destination of the last sentence of each section Te, and a dependency destination update processing unit 94 for updating the internodal dependency destination history of each section stored in the internode relates destination temporary storage unit 74.

以下、上記した各部の処理のうち、係り受け解析の手法の詳細について説明する。 The details of the dependency analysis method will be described below among the processes of the above-described units.

−節境界に基づく文解析−
本実施の形態における係り受け解析では、形態素解析、文節まとめ上げ、及び節境界解析が施された文を入力とし、係り受けの後方修飾性、係り先の唯一性、及び非交差性の３つの性質を絶対的制約とする。解析の手順は以下の通りである。 -Sentence analysis based on clause boundaries-
In the dependency analysis in the present embodiment, a sentence subjected to morphological analysis, sentence grouping, and clause boundary analysis is input, and there are three types of dependency: backward modification of dependency, uniqueness of dependency, and non-crossing The property is an absolute constraint. The analysis procedure is as follows.

（１）節レベルの係り受け解析
１文中の全ての節に対して、その内部の係り受け構造を解析する。図２に示す節内係り受け解析処理部６４が実行する処理である。 (1) Dependency analysis at the node level Analyze the dependency structure inside all the clauses in one sentence. This is a process executed by the intra-node dependency analysis processing unit 64 shown in FIG.

（２）文レベルの係り受け解析
１文中の全ての節に対して、その最終文節の係り先を解析し推定する。図２に示す節間係り受け解析処理部７０が実行する処理である。 (2) Dependency analysis at the sentence level For all clauses in one sentence, analyze and estimate the destinations of the last sentence. This is a process executed by the inter-node dependency analysis processing unit 70 shown in FIG.

以下では、１文を構成する節をＣ₁，Ｃ₂，…，Ｃ_m、節Ｃ_iを構成する文節列をｂ₁ ⁱ，…，ｂ_ni ⁱ、文節ｂ_k ⁱを係り文節とする係り受け関係をｄｅｐ（ｂ_k ⁱ）、１文の係り受け構造を｛dep(ｂ₁ ¹),…,dep(ｂ_nm-1 ^m)｝で表す。 In the following, clauses constituting one sentence are C ₁ , C ₂ ,..., C _m , clause strings constituting clause C _i are b ₁ ⁱ ,..., B _ni ⁱ , and clause b _k ⁱ is a clause. The receiving relationship is represented by dep (b _k ⁱ ), and the dependency structure of one sentence is represented by {dep (b ₁ ¹ ),..., Dep (b _nm−1 ^m )}.

節レベルの係り受け解析
節レベルの係り受け解析では、節Ｃ_i中の文節列ｂ₁ ⁱ，…，ｂ_ni ⁱをＢ_iとするとき、Ｐ（Ｓ_i|Ｂ_i）を最大にする係り受け構造Ｓi（＝｛dep(ｂ₁ ⁱ),…,dep(ｂ_ni-1 ⁱ)｝）を求める。ただし、節の最終文節ｂ_ni ⁱ（１≦ｉ≦ｍ）の受け文節は決定しない。 The dependency analysis section level dependency parsing node level, clause column b ₁ ⁱ in sections C _i, ..., when the b _ni ⁱ and B _i, P | relates to maximize (S _{_i} B _i) The receiving structure Si (= {dep (b ₁ ⁱ ),..., Dep (b _ni-1 ⁱ )}) is obtained. However, the receiving clause of the final clause b _ni ⁱ (1 ≦ i ≦ m) of the clause is not determined.

係り受け関係は互いに独立であると仮定すると、Ｐ（Ｓi｜Ｂi）は以下の式で計算できる。 Assuming that the dependency relationships are independent of each other, P (Si | Bi) can be calculated by the following equation.

ここで、式（１）の右辺の各項は、入力文節列Ｂ_iが与えられたときに、文節ｂ_k ⁱがｂ_l ⁱに係る確率を表す。最尤の係り受け構造は、式（１）の確率を最大とする構造であるとして動的計画法を用いて計算する。

Here, each term on the right side of Equation (1) represents the probability that the phrase b _k ⁱ relates to b _l ⁱ when the input phrase string B _i is given. The maximum likelihood dependency structure is calculated by using dynamic programming, assuming that the maximum likelihood dependency structure is a structure that maximizes the probability of Equation (1).

次に、式（１）の右辺の各項の計算について述べる。係り文節における自立語の原形をｈ_k ⁱ、その品詞をｔ_k ⁱ、係りの種類をｒ_k ⁱとし、受け文節における自立語の原形をｈ_l ⁱ、その品詞をｔ_l ⁱとする。また、受け文節が節の最終文節であるか否かをｅ_l ⁱとし、文節間距離をｄ_kl ⁱⁱとする。ここで、係りの種類とは、係り文節が付属語を伴うときはその付属語の語彙、品詞、活用形であり、そうでないときは文節末の形態素の品詞、活用形である。 Next, calculation of each term on the right side of Equation (1) will be described. The original form of the independent word in the dependency phrase is h _k ⁱ , its part of speech is t _k ⁱ , the type of the dependency is r _k ⁱ , the original form of the independent word in the reception phrase is h _l ⁱ , and its part of speech is t _l ⁱ . Also, e _l ⁱ indicates whether the received phrase is the last phrase in the section, and the distance between phrases is d _kl ⁱⁱ . Here, the type of relationship is the vocabulary, part of speech, or inflected form of the attached word when the related phrase accompanies the attached word, and the morphological part of speech or inflected form of the ending of the clause otherwise.

以上の属性を用いて、確率式（１）の右辺の各項を構成する確率を以下のように計算する。 Using the above attributes, the probability of forming each term on the right side of the probability formula (1) is calculated as follows.

ただしＦは共起頻度関数である。この計算は、図２に示す係り受け学習処理部３４が学習データ３２に基づいて行なう。

Where F is a co-occurrence frequency function. This calculation is performed based on the learning data 32 by the dependency learning processing unit 34 shown in FIG.

文レベルの係り受け解析
この処理では、節の最終文節の係り先の文節（受け文節）を同定する。１文の文節列をＢ（＝Ｂ₁…Ｂ_m）とし、節の最終文節を係り文節とするような係り受け構造｛dep(ｂ_n1 ¹),…,dep(ｂ_nm-1 ^m-1)｝をＳ_lastとするとき、Ｐ（Ｓ_last｜Ｂ）を最大とするＳ_lastを求める。Ｐ（Ｓ_last｜Ｂ）は以下の式で計算できる。 Sentence level dependency analysis In this process, the destination clause (receiving clause) of the final clause of the clause is identified. Dependent structure {dep (b _n1 ¹ ),..., Dep (b _nm-1 ^m-1 , where B (= B ₁ ... B _m ) is a sentence sequence of one sentence and the last phrase of the clause is a related phrase. )} when to the S _{_last,} P (S _last | Request S _last to maximize B). P (S _last | B) can be calculated by the following equation.

ここで、式（３）の右辺の各項は、１文の文節ｂが与えられたときに、Ｃ_iの最終文節ｂ_ni ⁱがｂ_l ^jに係る確率を表し、式（２）と同様に計算する。この計算も、図２に示す係り受け学習処理部３４が学習データ３２に基づいて行なう。最尤の係り受け構造は、式（３）の確率を最大とする構造であるとして動的計画法を用いて計算する。

Here, each term on the right side of Equation (3) represents the probability that the final clause b _ni ⁱ of C _i is related to b _l ^j when one clause b is given, and is similar to Equation (2). To calculate. This calculation is also performed by the dependency learning processing unit 34 shown in FIG. The maximum likelihood dependency structure is calculated using a dynamic programming method as a structure that maximizes the probability of Expression (3).

ただし、本実施の形態では、先に解析した節内部の係り受け構造を前提として節の最終文節の受け文節を決定する。すなわち、後方に位置する全ての文節を受け文節の候補として計算するのではなく、節内部の係り受け候補から非交差性を満たすものだけを受け文節の候補とする。図１に示す例の場合、文節「支持するという」の受け文節は「人が」又は「なっております」のいずれかであるとして計算する。 However, in the present embodiment, the receiving clause of the final clause of the clause is determined on the assumption of the dependency structure inside the clause analyzed earlier. That is, not all the clauses located behind are calculated as candidate phrases, but only those satisfying the non-intersection among the dependency candidates in the clause are taken as candidate phrases. In the case of the example shown in FIG. 1, the received clause of the phrase “I support” is calculated as being either “People” or “I am”.

漸進的係り受け解析
既に述べたように、係り受け構造解析装置４２は節判定処理部６０による節の検出があるごとに漸進的に係り受け解析を行なう。すなわち、入力に対して節判定処理部６０が節境界を随時判定し、節が同定されると、その時点までの入力に対して係り受け解析を実行する。 Progressive Dependency Analysis As already described, the dependency structure analysis device 42 performs a gradual dependency analysis each time a node is detected by the node determination processing unit 60. That is, the node determination processing unit 60 determines node boundaries as needed for the input, and when a node is identified, dependency analysis is executed for the input up to that point.

節間係り受け解析処理部７０による係り受け解析については、その受け文節がいつ決定できるかは明らかではない。そのため、それを決定するタイミングが問題となる。本実施の形態では、前述した通り、節の最終文節が入力されてから後続の節についてある程度解析が進んだ時点でその受け文節を決定する。これは、文節間の係り受け関係が文をまたぐことがないこと、及びその距離が格段に長くなることはないこと、という二つの事実に着目したものである。 Regarding dependency analysis by the inter-node dependency analysis processing unit 70, it is not clear when the received clause can be determined. Therefore, the timing for determining it becomes a problem. In the present embodiment, as described above, the receiving clause is determined when the analysis of the subsequent clause progresses to some extent after the final clause of the clause is input. This focuses on the two facts that the dependency relationship between clauses does not cross sentences and that the distance does not become much longer.

具体的には、節境界単位が入力されるたびにその時点での最尤の係り受け構造を上記「文レベルの係り受け解析」で述べた手法により解析して、その時点でのその節の係り受け関係を解析し、節の最終文節の係り受け関係が一定の入力回数（所定数記憶部６８に記憶される「所定値Ｎ」）の間変わらなかった（この回数を以下「不変回数」と呼ぶ。）場合、その受け文節を当該最終文節の係り先として決定する。つまり、直前の係り先の推定結果の履歴に基づいて、節の係り先を決定する。 Specifically, whenever the clause boundary unit is input, the maximum likelihood dependency structure at that time is analyzed by the method described in the above-mentioned “sentence level dependency analysis”, and the relationship of the clause at that time is analyzed. The dependency relationship was analyzed, and the dependency relationship of the final clause of the clause did not change for a certain number of times of input (“predetermined value N” stored in the predetermined number storage unit 68) (this number is hereinafter referred to as “invariant number”). In this case, the received clause is determined as the destination of the final clause. In other words, the node destination is determined based on the history of the immediately previous destination destination estimation result.

漸進的係り受け解析アルゴリズム
この解析では、節Ｃ_iが入力されるごとに、既に入力された節Ｃ₁，…，Ｃ_iの各最終文節ｂ_ni ⁱ，…，ｂ_ni ^jに対する係り受け構造Ｄ＝｛dep(ｂ_nj ^j),k｜１≦ｊ≦ｉ｝を更新することにより節間の係り受けを定める。ここでｋはdep(ｂ_nj ^j)の不変回数を示す。以下に係り受け解析アルゴリズムを示す。なお，固定値を所定値Ｎ（Ｎは自然数）とする。 The progressive dependency analysis algorithms this analysis, each time the node C _i is input, section C ₁ already entered, ..., each final clause of C _i b _ni ^i, ..., b relates for _ni ^j receiving structure D = {Dep (b _nj ^j ), k | 1 ≦ j ≦ i} is updated to determine the dependency between nodes. Here, k _represents the invariant number of dep (b _nj ^j ). The dependency analysis algorithm is shown below. The fixed value is a predetermined value N (N is a natural number).

（１）内部の係り受け構造が決定された節Ｃ_iを受ける。 (1) A node C _i whose internal dependency structure is determined is received.

（２）節の最終文節のうち、係り先が未決定な文節に対して、それを係り文節とする係り受け関係を「文レベルの係り受け解析」で説明した方法により求める。 (2) Of the final clauses in the clause, for the clauses whose dependency destinations are not yet determined, the dependency relationship using the dependency clause as the dependency clause is obtained by the method described in the “sentence level dependency analysis”.

（３）（２）で生成された係り受け関係dep(ｂ_nj ^j)に基づき、最終文節に対する係り受け関係Ｄを更新する。ここでdep(ｂ_nj ^j)が前回と同一の場合は不変回数に１加算し、異なる場合は不変回数を１とする。 (3) Based on the dependency relationship dep (b _nj ^j ) generated in (2), the dependency relationship D for the final phrase is updated. Here, when dep (b _nj ^j ) is the same as the previous time, 1 is added to the invariant number, and when it is different, the invariant number is set to 1.

（４）不変回数＝所定値Ｎを満たす係り受け関係dep(ｂ_nj ^j)∈Ｄに対して、文節ｂ_nj ^jの係り先が決定したとしてdep(ｂ_nj ^j)を出力する。 (4) With respect to the dependency relationship dep (b _nj ^j ) ∈D satisfying the invariant number = predetermined value N, dep (b _nj ^j ) is output assuming that the dependency destination of the clause b _nj ^j is determined.

（５）全ての節が入力された時点で、不変回数＜所定値Ｎの（dep(ｂ_nj ^j)，ｋ）∈Ｄに対して、その係り受け関係dep(ｂ_nj ^j)を出力する。 (5) At the time when all the clauses are input, the dependency relationship dep (b _nj ^j ) is output for (dep (b _nj ^j ), k) εD where the invariant number <the predetermined value N.

なお本実施の形態では、文末は係り先がないものとして解析する。そのため、節末の解析では係り先なしを候補に含める。具体的には、式（３）において、係り先のない文節はそれ自身に係る（すなわち，ｂ_ni ⁱ＝ｂ_l ^j）とし、係り先なしとなる確率も計算する。 In the present embodiment, the sentence ending is analyzed as having no dependency. Therefore, in the analysis at the end of the paragraph, no destination is included as a candidate. Specifically, in Equation (3), a clause without a relation is related to itself (that is, b _ni ⁱ = b _l ^j ), and the probability of having no relation is also calculated.

−動作−
上記実施の形態に係る係り受け構造解析システム３０は以下のように動作する。この動作には二つの局面が存在する。図２を参照して、第１の局面は係り受け学習処理部３４による節内係り受け確率モデル３６及び節間係り受け確率モデル３８の学習という局面である。第２の局面はこの節内係り受け確率モデル３６及び節間係り受け確率モデル３８を用いた、係り受け構造解析装置４２による漸進的係り受け解析処理である。また、入出力装置４４を用いた所定値Ｎの設定は任意の時により操作者により行なわれるものとする。 -Operation-
The dependency structure analysis system 30 according to the above embodiment operates as follows. There are two aspects to this operation. Referring to FIG. 2, the first aspect is an aspect of learning of the intra-node dependency probability model 36 and the inter-node dependency probability model 38 by the dependency learning processing unit 34. The second aspect is a progressive dependency analysis process by the dependency structure analysis device 42 using the intra-node dependency probability model 36 and the inter-node dependency probability model 38. The predetermined value N using the input / output device 44 is set by the operator at any time.

第１の局面では、まず学習データ３２を準備する。学習データ３２は、予め何らかの方法で係り受け解析を行なった多数の文に対し、係り受け解析を手作業で修正したものを用いる。 In the first aspect, learning data 32 is first prepared. The learning data 32 is obtained by manually modifying the dependency analysis for a large number of sentences that have been subjected to the dependency analysis in advance by some method.

係り受け学習処理部３４による節内係り受け確率モデル３６及び節間係り受け確率モデル３８の計算方法は、上記した計算式に従って行なわれる。 The calculation method of the intra-node dependency probability model 36 and the inter-node dependency probability model 38 by the dependency learning processing unit 34 is performed according to the above-described calculation formula.

節内係り受け確率モデル３６及び節間係り受け確率モデル３８、並びに節判定処理部６０が形態素解析において使用する辞書及び節境界検出のためのルール等の言語的資源が準備できれば、発話４０に対する係り受け構造解析装置４２による漸進的係り受け解析が可能になる。 If the linguistic resources such as the intra-clause dependency probability model 36 and the inter-node dependency probability model 38, and the dictionary used by the node determination processing unit 60 in the morphological analysis and the rules for detecting the node boundary can be prepared, the relationship to the utterance 40 will be described. Gradual dependency analysis by the receiving structure analysis device 42 becomes possible.

まず節判定処理部６０が、逐次入力される発話４０を発話データ一時記憶部６２に蓄積しながら、上記特許文献１に記載の手法に従って形態素解析する。節判定処理部６０はさらに、この処理により得られる形態素列に対して節境界検出のためのルールを適用することにより節境界を逐次的に検出する。節判定処理部６０は、節境界を検出すると直前の節境界との間の形態素列からなる節を節内係り受け解析処理部６４に与える。このとき、節には節識別子（以下「節ＩＤ」と呼ぶ。）が付与される。以後の処理ではこの節ＩＤにより各節が区別される。節ＩＤとしては、本実施の形態では１から始まる連続番号を使用するが、節の区別が可能であればどのような手法で付与してもよい。 First, the node determination processing unit 60 performs morphological analysis according to the method described in Patent Document 1 while accumulating the utterances 40 sequentially input in the utterance data temporary storage unit 62. The node determination processing unit 60 further detects node boundaries sequentially by applying a rule for node boundary detection to the morpheme string obtained by this processing. When detecting the node boundary, the node determination processing unit 60 gives the node including the morpheme sequence between the previous node boundary to the intra-node dependency analysis processing unit 64. At this time, a node identifier (hereinafter referred to as “node ID”) is assigned to the node. In subsequent processing, each section is distinguished by this section ID. As the node ID, a serial number starting from 1 is used in the present embodiment, but any method may be used as long as the node can be distinguished.

図２に示す節内係り受け解析処理部６４は、節判定処理部６０から節を受取ると、節内係り受け確率モデル３６を用いて当該節内の最終文節を除く文節の間の係り受け解析を実行し、その結果を解析結果記憶部６６に記憶させる。 When the intra-node dependency analysis processing unit 64 shown in FIG. 2 receives a node from the node determination processing unit 60, the intra-node dependency analysis processing unit 64 uses the intra-node dependency probability model 36 to perform dependency analysis between the clauses excluding the last clause in the clause. And the result is stored in the analysis result storage unit 66.

図３を参照して、節間係り受け解析処理部７０の節入力検出部９０は、節判定処理部６０から節が入力されると、節入力信号を節間係り受け記憶初期化部９６及び節最終文節係り先解析処理部９２に与える。節入力検出部９０はまた、節を構成する形態素列を節最終文節係り先解析処理部９２に与える。 Referring to FIG. 3, when a node is input from node determination processing unit 60, node input detection unit 90 of inter-node dependency analysis processing unit 70 converts the node input signal into inter-node dependency storage initialization unit 96 and This is given to the clause final sentence dependency destination analysis processing unit 92. The clause input detection unit 90 also supplies the morpheme sequence that constitutes the clause to the clause final clause relation destination analysis processing unit 92.

節間係り受け記憶初期化部９６は、節入力信号に応答して節間係り先一時記憶部７４内に、新たな節のための記憶領域を確保して初期化する。節間係り受け記憶初期化部９６はまた、カウンタ記憶部７６内に、新たな節のためのカウンタ領域を確保し、その値を０に初期化する。 The inter-node dependency memory initialization unit 96 secures and initializes a storage area for a new node in the inter-node dependency destination temporary storage unit 74 in response to the node input signal. The inter-node dependency storage initialization unit 96 also reserves a counter area for a new node in the counter storage unit 76 and initializes its value to zero.

一方、節最終文節係り先解析処理部９２は、節入力検出部９０から節入力信号と節の形態素列とが与えられると、解析結果記憶部６６に記憶されている、それまで入力された節に関する節内解析結果と、節間係り受け確率モデル３８とを用い、まだ係り先が確定していない節の最終文節の係り先を最尤推定により推定する。節最終文節係り先解析処理部９２は、その結果を係り先更新処理部９４に与える。係り先更新処理部９４はその結果を、節間係り先一時記憶部７４内のその節に対応する記憶領域に保存する。節最終文節係り先解析処理部９２による処理が、まだ係り先の決定していない節の全てに対して完了すると、係り先更新処理部９４は、節間係り受け判定処理部７２に対して全ての節について係り先の推定が終了したことを通知する。 On the other hand, when the clause input signal and the morpheme sequence of the clause are given from the clause input detection unit 90, the clause final clause assignment destination analysis processing unit 92 is stored in the analysis result storage unit 66 and has been input so far. Using the intra-node analysis result and the inter-node dependency probability model 38, the destination of the last clause of the clause whose destination is not yet determined is estimated by maximum likelihood estimation. The clause final sentence dependency destination analysis processing unit 92 gives the result to the dependency destination update processing unit 94. The dependency destination update processing unit 94 stores the result in a storage area corresponding to the node in the inter-node relationship destination temporary storage unit 74. When the processing by the clause final sentence dependency destination analysis processing unit 92 is completed for all of the clauses for which the dependency destination has not yet been determined, the dependency destination update processing unit 94 performs all processing for the inter-node dependency determination processing unit 72. Notify that the estimation of the relation destination has been completed for the section in (1).

節間係り受け判定処理部７２は、この通知に応答して以下のような処理を実行する。すなわち節間係り受け判定処理部７２は、係り先の決定していない節に対して自己が記憶している係り先の情報と、節間係り先一時記憶部７４中に記憶されている同じ節に関する係り先の情報とが一致するか否かを判定する。両者が一致していなければ節間係り受け判定処理部７２はカウンタ記憶部７６内の当該節に対応するカウンタの値を１に設定し、節間係り先一時記憶部７４に記憶されている係り先情報を記憶する。両者が一致していれば、節間係り受け判定処理部７２はカウンタ記憶部７６内の当該節に対応するカウンタの値に１を加算する。この場合、節間係り受け判定処理部７２はさらに、加算の結果カウンタの値が所定数記憶部６８に記憶されている所定値Ｎと一致しているか否かを判定する。両者が一致していれば、節間係り受け判定処理部７２はその節に関する係り先を、節間係り先一時記憶部７４に記憶されている係り先に決定し、当該節について解析結果記憶部６６に保存されている節内係り受け解析結果とともに、その節の係り受け関係４６として出力する。カウンタの値が所定値Ｎに一致していなければ（すなわちＮ未満であれば）、節間係り受け判定処理部７２は次の節についての処理を開始する。 In response to this notification, the inter-node dependency determination processing unit 72 executes the following processing. In other words, the inter-node dependency determination processing unit 72 stores the information on the dependency destination stored by itself for the node for which the dependency destination has not been determined, and the same clause stored in the inter-node dependency destination temporary storage unit 74. It is determined whether or not the information on the relationship is related. If they do not match, the inter-node dependency determination processing unit 72 sets the value of the counter corresponding to the relevant node in the counter storage unit 76 to 1 and the relationship stored in the inter-node relationship temporary storage unit 74 Store previous information. If the two match, the inter-node dependency determination processing unit 72 adds 1 to the value of the counter corresponding to the relevant node in the counter storage unit 76. In this case, the inter-node dependency determination processing unit 72 further determines whether or not the value of the addition result counter matches the predetermined value N stored in the predetermined number storage unit 68. If the two match, the inter-node dependency determination processing unit 72 determines an inter-relationship destination related to the node as the inter-relationship destination temporary storage unit 74, and the analysis result storage unit for the relevant node. Together with the intra-section dependency analysis result stored in 66, the dependency relationship 46 of that section is output. If the counter value does not match the predetermined value N (that is, less than N), the inter-node dependency determination processing unit 72 starts processing for the next clause.

このようにして節間係り受け判定処理部７２は係り先が未決定の全ての節についてこの処理を繰返す。 In this way, the inter-node dependency determination processing unit 72 repeats this process for all the nodes for which the dependency destination is not yet determined.

係り受け構造解析装置４２は、発話４０の全体に対して上記した処理を繰返して実行する。その結果、履歴に基づいて係り先が決定された節の係り受け関係４６が係り受け構造解析装置４２から逐次出力される。 The dependency structure analyzing apparatus 42 repeatedly executes the above-described processing on the entire utterance 40. As a result, the dependency relationship 46 of the node whose dependency destination is determined based on the history is sequentially output from the dependency structure analyzing device 42.

仮に所定の終了条件が充足されると、節間係り受け判定処理部７２は、係り先が未決定の全ての節について、解析結果記憶部６６に保存されている節内係り受け解析結果と、節間係り先一時記憶部７４に保存されている節の直前の係り先の推定結果とをまとめ、各節に関する係り受け関係４６として出力する。 If the predetermined termination condition is satisfied, the inter-node dependency determination processing unit 72 includes the intra-node dependency analysis results stored in the analysis result storage unit 66 for all the nodes for which the dependency destinations have not been determined. The estimation results of the connection destination immediately before the node stored in the inter-node connection destination temporary storage unit 74 are collected and output as a dependency relationship 46 for each node.

こうして、発話４０を構成する節の全てについて、節内係り受け解析結果と、節間係り受け解析結果とが得られる。節間係り受け解析結果は、発話４０の入力とともに逐次決定されるので、発話の途中でも、発話に対する係り受け解析結果を用いた機械翻訳又は字幕生成などの処理を適切に行なうことができる。最後に出力される節間係り受け解析結果は、決定条件を充足したものではないが、節間係り受け解析処理部７０による最尤推定により推定されたものであるから、一定の精度が期待できる。 Thus, the intra-section dependency analysis result and the inter-node dependency analysis result are obtained for all the nodes constituting the utterance 40. The inter-node dependency analysis results are sequentially determined together with the input of the utterance 40, so that even during the utterance, processing such as machine translation or subtitle generation using the dependency analysis result for the utterance can be appropriately performed. Although the inter-node dependency analysis result output at the end does not satisfy the determination condition, it is estimated by the maximum likelihood estimation by the inter-cell dependency analysis processing unit 70, so that a certain accuracy can be expected. .

−コンピュータによる実現−
ソフトウェア構成
図２及び図３に示す各ブロックは、コンピュータハードウェアと、当該コンピュータにより実行されるソフトウェアすなわちコンピュータプログラム（以下単に「プログラム」と呼ぶ。）とにより実現できる。図４及び図５に、これらブロックのうち節間係り受け解析処理部７０及び節間係り受け判定処理部７２による処理を実現するためのプログラムのフローチャートを示す。 -Realization by computer-
Software Configuration Each block shown in FIGS. 2 and 3 can be realized by computer hardware and software executed by the computer, that is, a computer program (hereinafter simply referred to as “program”). 4 and 5 show a flowchart of a program for realizing the processing by the inter-node dependency analysis processing unit 70 and the inter-cell dependency determination processing unit 72 among these blocks.

図４を参照して、このプログラムが起動されると、まずステップ１１０で初期設定を行なう。この初期設定では、予めメモリに設定されていた所定値（図２に示す所定数記憶部６８に記憶された所定値Ｎ）をメモリから読出したり、作業に使用する記憶領域を初期設定したり、処理中の節数を表す変数Ｍを初期値０に設定したりする。 Referring to FIG. 4, when this program is started, first, at step 110, initial setting is performed. In this initial setting, a predetermined value (predetermined value N stored in the predetermined number storage unit 68 shown in FIG. 2) previously set in the memory is read from the memory, a storage area used for work is initialized, A variable M indicating the number of nodes being processed is set to an initial value 0.

ステップ１１２において、節判定処理部６０を実現するプログラムにより節境界が検出されたことに応答し、検出された節境界とその直前の節境界との間に挟まれた節をメモリ上から読出す。 In step 112, in response to the detection of the node boundary by the program that implements the node determination processing unit 60, the node sandwiched between the detected node boundary and the immediately preceding node boundary is read from the memory. .

ステップ１１４において、この新たな節を特定するための節ＩＤを割当て、この節の節間係り先（この節の最終文節の係り先）を一時的に記憶する領域と、この節の節間係り先が、係り先推定の繰返しに対し何回同じ値となっているか（この回数を以下「継続数」と呼ぶ。）をカウントするためのカウンタの領域とを、それぞれ節間係り先一時記憶部７４とカウンタ記憶部７６とに確保する処理を行なう。ステップ１１４においてさらに、処理中の節の数を表す変数Ｍに１を加算する。 In step 114, a section ID for identifying this new section is assigned, and an area for temporarily storing the inter-node relation destination of this section (the sub-section destination of this section) and the inter-section relation destination of this section are: A counter area for counting how many times the same value is repeated for the dependency destination estimation (this number is hereinafter referred to as “continuation number”), and the internode relationship temporary storage unit 74 and the counter, respectively. Processing to be secured in the storage unit 76 is performed. In step 114, 1 is further added to the variable M representing the number of nodes being processed.

ステップ１１６で、節間の係り受け構造の解析処理を、処理中の各節について行なう。この結果、処理中の各節について、ステップ１１２で入力された新たな節を考慮して最終文節の係り先が推定される。この処理は図２に示す節間係り受け確率モデル３８を用いて行なわれる。推定結果は節間係り先一時記憶部７４に保存される。節間係り先一時記憶部７４の中の領域には、ステップ１１４において各節の節ＩＤとの対応関係が付与されている。従って、節間係り先一時記憶部７４の中のどの領域に結果を記憶するかは、節ＩＤによって判定できる。 In step 116, the dependency structure between nodes is analyzed for each node being processed. As a result, for each section being processed, the destination of the final phrase is estimated in consideration of the new section input in step 112. This process is performed using the inter-node dependency probability model 38 shown in FIG. The estimation result is stored in the inter-node connection destination temporary storage unit 74. In step 114, a correspondence relationship with the node ID of each node is given to the area in the internode connection destination temporary storage unit 74. Therefore, it can be determined by the clause ID in which area in the inter-node relationship temporary storage unit 74 the result is stored.

ステップ１１８では、処理中の全節に対し、最終文節の係り先の継続数が更新される。本実施の形態では、継続数をＣ（Ｊ）（Ｊ＝１〜Ｍ）で表す。すなわち、ステップ１１８では、ステップ１１６においてある節（Ｊ番目の節とする。）の最終文節について新たに推定された係り先が、前回推定されたものと同一の場合には、当該節の継続数Ｃ（Ｊ）に１が加算される。この節について新たに推定された係り先が前回推定されたものと異なる場合には、当該節の継続数Ｃ（Ｊ）には１が設定される。ステップ１１８の処理の詳細については図５を参照して後述する。なお、継続数を記憶するためのカウンタ領域の各々についても、節ＩＤとの対応関係がステップ１１４で付与されている。どのカウンタを増分すべきかは、処理中の節の節ＩＤで判定できる。 In step 118, the continuation number of the last phrase is updated for all sections being processed. In the present embodiment, the continuation number is represented by C (J) (J = 1 to M). That is, in step 118, if the newly estimated destination for the last phrase of a section (referred to as the J-th section) in step 116 is the same as that previously estimated, the number of continuations of the section. 1 is added to C (J). When the newly estimated destination for this section is different from that previously estimated, 1 is set as the continuation number C (J) of the section. Details of the processing in step 118 will be described later with reference to FIG. It should be noted that each counter area for storing the number of continuations is also given a corresponding relationship with the node ID in step 114. Which counter should be incremented can be determined by the section ID of the section being processed.

ステップ１２０では、ステップ１１８での継続数の更新処理の結果、継続数Ｃ（Ｊ）が所定数記憶部６８に記憶された所定値Ｎ以上となった節があるか否かについての判定が行なわれる。そのような節がなければ制御はステップ１２６に進む。あればステップ１２２に進む。 In step 120, it is determined whether or not there is a clause in which the continuation number C (J) is equal to or greater than a predetermined value N stored in the predetermined number storage unit 68 as a result of the continuation number update process in step 118. It is. If there is no such clause, control proceeds to step 126. If so, go to Step 122.

ステップ１２２では、継続数が所定値Ｎ以上となった節の全てについて節内係り受けだけではなく節間係り受けも確定したと判定し、解析結果記憶部６６に記憶されていた当該節に関する節内係り受けと、ステップ１１６において当該節に対し推定された節間係り受けとがまとめられ、その節の係り受け関係として出力される。 In step 122, it is determined that not only the intra-section dependency but also the inter-node dependency has been confirmed for all the nodes whose continuation number is equal to or greater than the predetermined value N, and the node related to the relevant clause stored in the analysis result storage unit 66 is determined. The inner dependency and the inter-node dependency estimated for the relevant node in step 116 are collected and output as the dependency relationship of the node.

ステップ１２４では、ステップ１２２で係り受け関係が出力された節に対しステップ１１４で確保されていた記憶領域及びカウンタ領域を解放する。さらに、係り受け関係が出力された節の数だけ変数Ｍが減算される。 In step 124, the storage area and the counter area secured in step 114 are released for the node for which the dependency relationship is output in step 122. Further, the variable M is subtracted by the number of clauses for which the dependency relationship is output.

この後、ステップ１２６では発話データの入力が終了したか否かが判定される。この判定は、例えば入力なしで所定時間が経過したか否かを判定したり、操作者による終了指示を検出したりすることにより行なわれる。制御は、入力が終了していればステップ１２８に進み、さもなければステップ１１２に戻る。 Thereafter, in step 126, it is determined whether or not the input of utterance data has been completed. This determination is performed, for example, by determining whether a predetermined time has passed without input or by detecting an end instruction from the operator. Control proceeds to step 128 if the input has been completed, and returns to step 112 otherwise.

ステップ１２８では、節間係り受け関係について未確定の節の全てについて、解析結果記憶部６６に記憶されていた節間係り受け関係と、節間係り先一時記憶部７４に記憶されている推定された節間係り受け関係とがまとめられ、各節の係り受け関係として出力される。この処理が終了するとこのプログラムの実行も終了する。 In step 128, the inter-node dependency relationship stored in the analysis result storage unit 66 and the inter-cell dependency destination temporary storage unit 74 are estimated for all of the undefined clauses regarding the inter-node dependency relationship. The inter-node dependency relationship is collected and output as the dependency relationship of each clause. When this process ends, the execution of this program also ends.

図５に、図４のステップ１１８で実行される処理を実現するためのプログラムのより詳細なフローチャートを示す。この処理は、処理中の全ての節に対し、図１に示す節間係り受け確率モデル３８を参照して最終文節の係り先を推定し、その結果と、１回前の繰返しで得られた推定結果とを比較して当該節の継続数を更新する処理である。 FIG. 5 shows a more detailed flowchart of a program for realizing the processing executed in step 118 of FIG. In this process, for all the nodes being processed, the last clause dependency is estimated with reference to the inter-node dependency probability model 38 shown in FIG. 1, and the result and the previous iteration were obtained. This is a process for comparing the estimation result and updating the number of continuations of the relevant section.

図５を参照して、まずステップ１４０で繰返し制御変数Ｊに０を代入する。この変数Ｊは、処理対象の節の順番を表す。 Referring to FIG. 5, first, at step 140, 0 is substituted for repeated control variable J. This variable J represents the order of the nodes to be processed.

ステップ１４２では、変数Ｊに１を加算する。そしてステップ１４４で変数Ｊが処理中の節数を表す変数Ｍを超えたか否かが判定される。超えていれば処理を終了する。変数Ｊが変数Ｍ以下であれば制御はステップ１４６に進む。 In step 142, 1 is added to the variable J. In step 144, it is determined whether or not the variable J exceeds the variable M representing the number of nodes being processed. If it exceeds, the process is terminated. If variable J is less than or equal to variable M, control proceeds to step 146.

ステップ１４６では、Ｊ番目の節について推定された節間係り先が、前回の繰返しで推定された節間係り先と同じか否かが判定される。この判定は、図２に示す構成では、節間係り先一時記憶部７４に記憶されたある節の節間係り先が、節間係り受け判定処理部７２に記憶された、同じ節の１回前の繰返しにおいて推定された節間係り先と一致するか否かを調べることにより行なわれる。制御は、一致していればステップ１４８に、さもなければステップ１５０に、それぞれ進む。 In step 146, it is determined whether or not the inter-node connection destination estimated for the J-th node is the same as the inter-node connection destination estimated in the previous iteration. In the configuration shown in FIG. 2, this determination is performed once for the same clause stored in the inter-node dependency determination processing unit 72 by the inter-node dependency destination stored in the inter-node dependency destination temporary storage unit 74. This is done by checking whether or not it matches the inter-node relationship estimated in the previous iteration. Control proceeds to step 148 if there is a match, otherwise to step 150.

ステップ１４８では、Ｊ番目の節に対する継続数Ｃ（Ｊ）に１が加算される。この後制御はステップ１４２に戻る。 In step 148, 1 is added to the continuation number C (J) for the J-th clause. Thereafter, the control returns to step 142.

一方、ステップ１５０では、Ｊ番目の節に対する継続数Ｃ（Ｊ）に１が設定される。続いてステップ１５２において、今回のＪ番目の節の最終文節が節間係り先一時記憶部７４から読出され、節間係り受け判定処理部７２内の記憶部に記憶される。この後、制御はステップ１４２に戻る。 On the other hand, in step 150, 1 is set to the continuation number C (J) for the J-th clause. Subsequently, at step 152, the final clause of the current J-th clause is read from the inter-node dependency destination temporary storage unit 74 and stored in the storage unit within the inter-cell dependency determination processing unit 72. Thereafter, control returns to step 142.

ハードウェア構成
上記した実施の形態に係る係り受け構造解析装置４２は、コンピュータシステムにより実現できる。図６は上記した実施の形態による係り受け構造解析装置４２を実現するコンピュータシステム２５０の全体構成を示す外観図である。図６を参照して、コンピュータシステム２５０は、マイクロフォン２６４及びスピーカ２７８の組と、ＣＤ−ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄ−ＯｎｌｙＭｅｍｏｒｙ）ドライブ２７０及びＦＤ（ＦｌｅｘｉｂｌｅＤｉｓｋ）ドライブ２７２を有するコンピュータ２６０と、いずれもコンピュータ２６０に接続されたモニタ２６２、キーボード２６６及びマウス２６８とを含む。 Hardware Configuration The dependency structure analyzing apparatus 42 according to the above-described embodiment can be realized by a computer system. FIG. 6 is an external view showing the overall configuration of a computer system 250 that implements the dependency structure analyzing apparatus 42 according to the above-described embodiment. Referring to FIG. 6, a computer system 250 includes a pair of a microphone 264 and a speaker 278, and a computer 260 having a CD-ROM (Compact Disc Read-Only Memory) drive 270 and an FD (Flexible Disk) drive 272. A monitor 262 connected to the computer 260, a keyboard 266, and a mouse 268 are included.

マイクロフォン２６４とスピーカ２７８とは、必要であれば音声翻訳のための発話の入力及び翻訳結果の出力等に用いられるものであって、この発明の一部を構成するものではない。従って、システムのうちマイクロフォン２６４及びスピーカ２７８に関する部分の詳細はここでは説明しない。 The microphone 264 and the speaker 278 are used for inputting a speech for speech translation and outputting a translation result, if necessary, and do not constitute a part of the present invention. Accordingly, the details of the portion of the system relating to the microphone 264 and speaker 278 will not be described here.

図７はコンピュータ２６０のハードウェアブロック図である。図７を参照して、コンピュータ２６０は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ：中央処理装置）３４０と、ＣＰＵ３４０に接続されたバス３４２と、バス３４２に接続された読出専用メモリ（ＲＯＭ）３４４と、バス３４２に接続されたランダムアクセスメモリ（ＲＡＭ）３４６と、バス３４２に接続されたハードディスク３４８と、ＣＤ−ＲＯＭ（コンパクト・ディスクＲＯＭ）３６０が装着され、ＣＤ−ＲＯＭからデータを読出すＣＤ−ＲＯＭドライブ２７０と、ＦＤ（フレキシブル・ディスク）３６２が装着され、ＦＤからデータを読出し、データを書込むＦＤドライブ２７２と、マイクロフォン２６４及びスピーカ２７８が接続されるサウンドボード３５０と、バス３４２に接続されローカルエリアネットワーク（ＬＡＮ）等のデータ通信ネットワークに接続する機能を提供するネットワークボード３５２とを含む。 FIG. 7 is a hardware block diagram of the computer 260. Referring to FIG. 7, a computer 260 includes a CPU (Central Processing Unit) 340, a bus 342 connected to the CPU 340, a read only memory (ROM) 344 connected to the bus 342, and a bus 342. A CD-ROM drive 270 that is loaded with a random access memory (RAM) 346 connected to the CD-ROM, a hard disk 348 connected to the bus 342, and a CD-ROM (compact disk ROM) 360, and reads data from the CD-ROM. FD (flexible disk) 362 is mounted, FD drive 272 that reads data from FD and writes data, sound board 350 to which microphone 264 and speaker 278 are connected, and local area network connected to bus 342 War And a network board 352 that provides a function of connecting to a data communication network such as a LAN (LAN).

図２〜図５を参照して説明した実施の形態に係る係り受け構造解析装置４２は、上記した通り、コンピュータシステム２５０のハードウェア、その上で実行されるプログラム、及びコンピュータシステム２５０のハードディスク３４８、ＲＡＭ３４６等に格納される節内係り受け確率モデル３６、節間係り受け確率モデル３８、並びに節判定処理部６０のためのルール集及び辞書などのデータにより実現可能である。 As described above, the dependency structure analyzing apparatus 42 according to the embodiment described with reference to FIGS. 2 to 5 includes the hardware of the computer system 250, the program executed thereon, and the hard disk 348 of the computer system 250. This can be realized by data such as an intra-node dependency probability model 36, an inter-node dependency probability model 38, and a rule collection and dictionary for the node determination processing unit 60 stored in the RAM 346 or the like.

コンピュータシステム２５０に係り受け構造解析装置４２としての動作を行なわせるためのプログラムは、ＣＤ−ＲＯＭドライブ２７０又はＦＤドライブ２７２に挿入されるＣＤ−ＲＯＭ３６０又はＦＤ３６２に記憶され、さらにハードディスク３４８に転送される。又は、プログラムは図示しないネットワークを通じてコンピュータ２６０に送信されハードディスク３４８に記憶されてもよい。プログラムは実行の際にＲＡＭ３４６にロードされる。ＣＤ−ＲＯＭ３６０から、ＦＤ３６２から、又はネットワークを介して、直接にＲＡＭ３４６にプログラムをロードしてもよい。 A program for causing the computer system 250 to operate as the dependency structure analyzing apparatus 42 is stored in the CD-ROM 360 or FD 362 inserted in the CD-ROM drive 270 or the FD drive 272 and further transferred to the hard disk 348. . Alternatively, the program may be transmitted to the computer 260 through a network (not shown) and stored in the hard disk 348. The program is loaded into the RAM 346 when executed. The program may be loaded directly into the RAM 346 from the CD-ROM 360, from the FD 362, or via a network.

このプログラムは、コンピュータ２５０にこの実施の形態の係り受け構造解析装置４２として動作を行なわせる複数の命令を含む。この動作を行なわせるのに必要な基本的機能のいくつかはコンピュータ２５０上で動作するオペレーティングシステム（ＯＳ）又はサードパーティのプログラム、若しくはコンピュータ２５０にインストールされる各種ツールキットのモジュールにより提供される。従って、このプログラムはこの実施の形態のシステム及び方法を実現するのに必要な機能全てを必ずしも含まなくてよい。このプログラムは、命令のうち、所望の結果が得られるように制御されたやり方で適切な機能又は「ツール」を呼出すことにより、上記した係り受け構造解析装置４２としての動作を実行する命令のみを含んでいればよい。コンピュータシステム２５０の動作原理は周知であるので、ここでは繰返さない。 This program includes a plurality of instructions for causing the computer 250 to operate as the dependency structure analyzing apparatus 42 of this embodiment. Some of the basic functions necessary to perform this operation are provided by operating system (OS) or third party programs running on the computer 250 or modules of various toolkits installed on the computer 250. Therefore, this program does not necessarily include all functions necessary to realize the system and method of this embodiment. This program calls only an instruction for executing the above-described operation as the dependency structure analyzing apparatus 42 by calling an appropriate function or “tool” in a controlled manner so as to obtain a desired result. It only has to be included. The operating principle of computer system 250 is well known and will not be repeated here.

［解析例］
以下、本実施の形態により、発話「正当な事由がない限り契約期間が切れたといっても明け渡しを請求できない点にあるといわれています」の各節の最終文節の係り先を解析する様子を図８〜図１３を用いて説明する。図８〜図１３はいずれも、上部に係り受け構造を、下部に節の最終文末の係り受け構造を、それぞれ示す。すなわち、（dep(ｂ_nj ^j)，ｋ）∈Ｄのdep(ｂ_nj ^j)が係り文節及び受け文節に、ｋが不変回数に、それぞれ相当する。なお、ここでは所定値Ｎ＝３として説明する。 [Example of analysis]
In the following, according to this embodiment, the state of analyzing the destination of the final sentence of each section of the utterance `` It is said that it is impossible to claim surrender even if the contract period expires unless there is a valid reason '' This will be described with reference to FIGS. 8 to 13 each show a dependency structure at the top and a dependency structure at the end of the last sentence of the section at the bottom. That is, _{^{(dep (b nj j),}} k) ∈D of dep (b _nj ^j) is dependent bunsetsu and receiving clauses, k is unchanged number, corresponding respectively. Here, the description will be made assuming that the predetermined value N = 3.

図８は、最初の節「正当な事由がない限り」が入力されたときの状態を示す。 FIG. 8 shows a state when the first section “unless there is a valid reason” is input.

図９は、第２の節「契約期間が切れたと」が入力され、係り受け構造｛dep(限り)｝が解析された状態を示す。ｄｅｐ（限り）は図９の中央上部の点線矢印に相当し、「限り」の係り先が「切れた」であり、不変回数が１であることが図９の下部に示されている。 FIG. 9 shows a state where the second section “contract period has expired” is input and the dependency structure {dep (limited)} is analyzed. Dep (limited) corresponds to the dotted arrow in the upper center of FIG. 9, and it is shown in the lower part of FIG. 9 that the destination of “limited” is “broken” and the invariant number is 1.

同様にして図１０及び図１１はそれぞれ、第３の節「いっても」及び第４の節「明け渡しを請求できない」が入力されたときの最尤の係り受け構造｛ｄｅｐ（限り），ｄｅｐ（切れた）｝、｛ｄｅｐ（限り），ｄｅｐ（切れた），ｄｅｐ（請求できない）｝が解析により推定された状態を示す。 Similarly, FIG. 10 and FIG. 11 show the maximum likelihood dependency structure {dep (limited), dep when the third section "I can't claim" and the fourth section "I can't claim surrender", respectively. (Broken)}, {dep (limited), dep (broken), dep (cannot be charged)} indicate states estimated by analysis.

図１２は、節「点にあると」が新たに入力され、最尤の構造｛ｄｅｐ（限り），ｄｅｐ（切れた），ｄｅｐ（いっても），ｄｅｐ（請求できない）｝が求められた状態を示す。このとき、図１２に下部に示すように、係り受け関係ｄｅｐ（切れた）の不変回数が所定値３に達する。従って、節「契約期間が切れたと」の係り先が「いっても」であると決定され、この節の係り受け関係が（節内及び節間を含め）出力される。 In FIG. 12, the node “when it is at a point” is newly input, and the maximum likelihood structure {dep (limited), dep (cut), dep (even), dep (cannot be charged)} is obtained. Indicates the state. At this time, as shown in the lower part of FIG. 12, the invariant number of the dependency relationship dep (broken) reaches a predetermined value 3. Therefore, it is determined that the destination of the clause “contract period has expired” is “even”, and the dependency relationship of this clause (including within and between clauses) is output.

図１３は、新たな節「いわれています」が入力され、最尤の係り受け構造｛ｄｅｐ（限り），ｄｅｐ（切れた）｝、｛ｄｅｐ（限り），ｄｅｐ（切れた），ｄｅｐ（いっても），ｄｅｐ（あると）｝が求められた状態を示す。図１２と同様に、不変回数が所定値３に達した節「正当な事由がない限り」、「いっても」について、節の係り先が決定され、係り受け情報が出力される。 In FIG. 13, a new clause “I am called” is input, and the maximum likelihood dependency structure {dep (limited), dep (broken)}, {dep (limited), dep (broken), dep (all Even if) dep (if any)} is obtained. In the same manner as in FIG. 12, for the clauses “when there is no legitimate reason” where the number of invariants has reached the predetermined value 3 and “even if”, the destination of the clause is determined and dependency information is output.

［解析実験］
上記実施の形態で採用された手法の有効性を評価するため、解析実験を行なった。 [Analysis experiment]
In order to evaluate the effectiveness of the technique employed in the above embodiment, an analysis experiment was performed.

−実験の概要−
実験には、ＮＨＫの解説番組「あすを読む」（番組あたりの長さは約１０分）を使用した。使用したデータの概要を表２に示す。 -Outline of the experiment-
In the experiment, NHK's commentary program “Read Asu” (length per program is about 10 minutes) was used. A summary of the data used is shown in Table 2.

テストデータとして、書起こしデータ（非特許文献２参照）に形態素解析、文節まとめ上げを施した７番組（４７０文）を用いた。節境界、及び、係り受けの正解は人手で作成した。一方、学習データとしては、形態素、文節まとめ上げ、節境界、係り受けに関する情報が与えられた９５番組(５，５３２文)を用いた。

As test data, seven programs (470 sentences) in which transcription data (see Non-Patent Document 2) was subjected to morphological analysis and sentence grouping were used. The node boundaries and the correct answers for the dependency were created manually. On the other hand, as learning data, 95 programs (5,532 sentences) to which information on morphemes, sentence grouping, clause boundaries, and dependency are given were used.

これらのデータを用いて解析を行ない、係り受け正解率と解析時間とを求めた。解析システムは、GNU CommonLISPのプログラムとして実装し、クロック周波数２．４０ＧＨｚの高性能ＣＰＵ、及び２ＧＢのメモリ容量を持ち、オープンソースのオペレーティングシステムが稼動しているコンピュータ上で実行した。なお、この実験では、所定値Ｎの値を１から１２まで１刻みで変化させて、計１２回実験した。 Analysis was performed using these data, and the dependency correct answer rate and the analysis time were obtained. The analysis system was implemented as a GNU CommonLISP program, and was executed on a computer having a high-performance CPU with a clock frequency of 2.40 GHz and a memory capacity of 2 GB and running an open source operating system. In this experiment, the predetermined value N was changed from 1 to 12 in increments of 1, and a total of 12 experiments were performed.

−実験結果−
各所定値Ｎに対応する係り受け正解率を表３に示す。 -Experimental results-
Table 3 shows the dependency correct answer rate corresponding to each predetermined value N.

表３の中央の列は、番組末を除く全ての節境界単位末に対する正解率を、最右列は、番組末を除く全ての文節に対する正解率を、それぞれ示す。所定値Ｎが２のとき及び３のときに、節境界単位末に対する正解率が最も高く、そのとき、全体の正解率は７６．２％となった。なお、節末を除く節内に対する解析の正解率は８７．５％であった。

The middle column of Table 3 shows the correct answer rate for all clause boundary unit ends except the program end, and the rightmost column shows the correct answer rate for all phrases except the program end. When the predetermined value N was 2 and 3, the accuracy rate with respect to the end of the node boundary unit was the highest, and at that time, the overall accuracy rate was 76.2%. The accuracy rate of analysis for the inside of the node excluding the end of the node was 87.5%.

表４に、本実施の形態で使用した節境界検出プログラムＣＢＡＰ（特許文献１に記載のものと同様）の節境界解析の精度について、ラベルを無視して節境界の位置のみで評価した結果を示す。表４から明らかなように、本実施の形態によれば適合率、再現率ともに高く、後に行なわれる解析への影響はあまりない。 Table 4 shows the result of evaluating the node boundary analysis accuracy of the node boundary detection program CBAP (similar to that described in Patent Document 1) used in the present embodiment by ignoring the label and only the position of the node boundary. Show. As is apparent from Table 4, according to the present embodiment, both the relevance rate and the recall rate are high, and there is not much influence on the analysis performed later.

所定値Ｎと解析時間の関係を図１４に示す。図１４を参照して、所定値Ｎを大きくするに従って、解析時間が増加している。解析時間が最も短かかったのは、所定値Ｎ＝３のときで、全７番組で１２．５秒、１番組あたり１．８秒であった。なお、この解析時間には、ＣＢＡＰによる節境界解析の時間も含まれている。節境界解析の平均解析時間は１番組あたり０．３秒程度である。 The relationship between the predetermined value N and the analysis time is shown in FIG. Referring to FIG. 14, the analysis time increases as the predetermined value N is increased. The analysis time was the shortest when the predetermined value N = 3, which was 12.5 seconds for all 7 programs and 1.8 seconds per program. This analysis time includes the time of node boundary analysis by CBAP. The average analysis time for node boundary analysis is about 0.3 seconds per program.

本手法では、文末は係り先がないとして解析を実行している。すなわち、係り先なしと判定された文節を文末であるとみなしている。このような観点から、本手法の文末判定性能を評価した。表５に文末判定の適合率、再現率、Ｆ値を示す。 In this method, analysis is performed assuming that there is no dependency at the end of the sentence. That is, the clause determined to have no relation is regarded as the end of the sentence. From this point of view, the end-of-sentence judgment performance of this method was evaluated. Table 5 shows the precision, recall, and F value of the sentence end determination.

表５を参照して、所定値Ｎ＝３のときに最も高いＦ値を示した。独話の文境界判定手法は既にいくつか提案されているが、本手法では漸進的係り受け解析と同時的に文末を判定できるという特徴がある。

Referring to Table 5, the highest F value was shown when the predetermined value N = 3. Several methods have already been proposed for determining the sentence boundaries of monologues, but this method is characterized by the ability to determine the end of a sentence simultaneously with progressive dependency analysis.

以上の結果から、本実験においては所定値Ｎが３のとき、最も高い性能を示しており、文単位を入力とする従来の係り受け解析手法（正解率で７９．０％、処理時間で１番組あたり約２．１秒）（非特許文献２）と比較しても、同程度の解析精度と解析時間とを達成している。 From the above results, in this experiment, when the predetermined value N is 3, the highest performance is shown, and the conventional dependency analysis method using the sentence unit as an input (the accuracy rate is 79.0%, the processing time is 1). Compared with (approximately 2.1 seconds per program) (Non-Patent Document 2), the same analysis accuracy and analysis time are achieved.

［変形例］
上記した第１の実施の形態のシステムでは、節間係り受けの解析の際、不変回数のカウントにカウンタを用い、前回の節間係り受け解析の結果を図２に示す節間係り受け判定処理部７２に履歴として保持している。しかし本発明はそのような実施の形態には限定されない。例えば、節間係り先一時記憶部７４を二次元テーブルとし、処理中の各節に対し、節間係り受け解析の結果の履歴を、所定数記憶部６８に記憶されている所定値Ｎだけ保存するようにしてもよい。この場合、履歴中のＮ個の解析結果が全て等しい場合に、当該節の係り受け関係が決定したものと判定する。 [Modification]
In the system according to the first embodiment described above, a counter is used for counting the number of invariants during the analysis of inter-node dependency, and the result of the previous inter-cell dependency analysis is shown in FIG. This is held in the section 72 as a history. However, the present invention is not limited to such an embodiment. For example, the inter-node temporary storage unit 74 is a two-dimensional table, and the history of inter-cell dependency analysis results is stored for each node being processed by a predetermined value N stored in a predetermined number of storage units 68. You may make it do. In this case, when all the N analysis results in the history are equal, it is determined that the dependency relationship of the section is determined.

また、上記した実施の形態では、図２に示す節間係り受け判定処理部７２は、節間係り受けを決定する際の条件として所定数記憶部６８に記憶された所定値Ｎを用い、その所定値Ｎは入出力装置４４によって設定することができる。しかし、本発明を実施する際には、このように所定値を変化可能にする必要はない。例えば所定値としてある数を定め、節間係り受け判定処理部７２はその数を固定的に使用するようにすることもできる。上記した実験の結果から、この場合にはＮ＝３とするのが適当と思われるが、それ以外の値を採用してもよい。また、処理対象となる発話の内容によってもこの値が変わる可能性がある。 Further, in the above-described embodiment, the inter-node dependency determination processing unit 72 shown in FIG. 2 uses the predetermined value N stored in the predetermined number storage unit 68 as a condition when determining the inter-cell dependency. The predetermined value N can be set by the input / output device 44. However, when carrying out the present invention, it is not necessary to make the predetermined value variable in this way. For example, a certain number may be set as a predetermined value, and the inter-node dependency determination processing unit 72 may use the number fixedly. From the results of the experiment described above, it is considered appropriate to set N = 3 in this case, but other values may be adopted. Also, this value may change depending on the content of the utterance to be processed.

さらに、上記した実施の形態では、節内係り受け解析処理部６４及び節間係り受け解析処理部７０はいずれも、確率モデルに従って係り受け解析を行なう。しかし本発明はそのようなものには限定されない。例えば、文法的な解析によって係り受け解析を行なうもの、予め準備された係り受け解析のためのルールに従って係り受け解析を行なうもの、等を使用することもできる。 Furthermore, in the above-described embodiment, the intra-node dependency analysis processing unit 64 and the inter-node dependency analysis processing unit 70 both perform dependency analysis according to a probability model. However, the present invention is not limited to such. For example, one that performs dependency analysis by grammatical analysis, one that performs dependency analysis according to a previously prepared rule for dependency analysis, and the like can also be used.

今回開示された実施の形態は単に例示であって、本発明が上記した実施の形態のみに制限されるわけではない。本発明の範囲は、発明の詳細な説明の記載を参酌した上で、特許請求の範囲の各請求項によって示され、そこに記載された文言と均等の意味及び範囲内でのすべての変更を含む。 The embodiment disclosed herein is merely an example, and the present invention is not limited to the above-described embodiment. The scope of the present invention is indicated by each claim in the claims after taking into account the description of the detailed description of the invention, and all modifications within the meaning and scope equivalent to the wording described therein are intended. Including.

本発明の一実施の形態で行なわれる係り受け構造解析を模式的に示す図である。It is a figure which shows typically the dependency structure analysis performed by one embodiment of this invention. 本発明の一実施の形態に係る係り受け構造解析装置４２の構成を示すブロック図である。It is a block diagram which shows the structure of the dependency structure analysis apparatus 42 which concerns on one embodiment of this invention. 図２に示す節間係り受け解析処理部７０のより詳細なブロック図である。It is a more detailed block diagram of the inter-node dependency analysis processing unit 70 shown in FIG. 図２に示す節間係り受け解析処理部７０及び節間係り受け判定処理部７２を実現するためのプログラムのフローチャートである。It is a flowchart of the program for implement | achieving the inter-node dependency analysis process part 70 and the inter-node dependency determination process part 72 shown in FIG. 継続数の更新処理の詳細なフローチャートである。It is a detailed flowchart of a continuation number update process. 本発明の一実施の形態に係る係り受け構造解析装置４２を実現するコンピュータシステムの正面図である。It is a front view of the computer system which implement | achieves the dependency structure analysis apparatus 42 which concerns on one embodiment of this invention. 図６に示すコンピュータのハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the computer shown in FIG. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 発話例の各節の最終文節の係り先を解析する様子を図解する模式図である。It is a schematic diagram which illustrates a mode that the destination of the last clause of each clause of an utterance example is analyzed. 本発明の一実施の形態を使用して行なった実験における、所定値Ｎと解析時間との関係を示すグラフである。It is a graph which shows the relationship between the predetermined value N and analysis time in the experiment conducted using one Embodiment of this invention.

Explanation of symbols

３０係り受け構造解析システム
３２学習データ
３４係り受け学習処理部
３６節内係り受け確率モデル
３８節間係り受け確率モデル
４０発話
４２係り受け構造解析装置
４４入出力装置
４６係り受け関係
６０節判定処理部
６２発話データ一時記憶部
６４節内係り受け解析処理部
６６解析結果記憶部
６８所定数記憶部
７０節間係り受け解析処理部
７２節間係り受け判定処理部
７４節間係り先一時記憶部
７６カウンタ記憶部
９０節入力検出部
９２節最終文節係り先解析処理部
９４係り先更新処理部
９６節間係り受け記憶初期化部 DESCRIPTION OF SYMBOLS 30 Dependency structure analysis system 32 Learning data 34 Dependency learning processing part 36 Intersection dependency probability model 38 Internode dependency probability model 40 Utterance 42 Dependency structure analyzer 44 Input / output device 46 Dependency relation 60 Clause determination processing part 62 utterance data temporary storage unit 64 intra-node dependency analysis processing unit 66 analysis result storage unit 68 predetermined number storage unit 70 inter-node dependency analysis processing unit 72 inter-node dependency determination processing unit 74 inter-node dependency temporary storage unit 76 counter Storage section 90 Section input detection section 92 Section final sentence dependency destination analysis processing section 94 Relationship update processing section 96 Intersection dependency storage initialization section

Claims

A dependency structure analysis device that progressively analyzes the dependency structure of utterances that are sequentially input,
A node boundary detecting means for detecting a node boundary of the utterance sequentially input based on local information of the utterance, and outputting a node boundary unit consisting of an utterance between the node boundary and the immediately preceding node boundary;
In response to detection of a node boundary by the node boundary detection means, a predetermined inter-node dependency analysis method is applied to a node boundary unit whose dependency is undetermined, and the obtained dependency destination information is output. Inter-node dependency analysis means,
In response to the output of the dependency destination information by the inter-node dependency analysis means, a history of dependency destination information obtained by the inter-node dependency analysis means for a node boundary unit whose dependency is not yet determined is retained, and the history And a dependency structure determining unit for determining and outputting a dependency point that satisfies a predetermined condition among the node boundary units for which the dependency point has not yet been determined.

In response to detection of a node boundary by the node boundary detection unit, a predetermined intra-node dependency analysis method is applied to the node boundary unit output from the node boundary detection unit, and Further includes an intra-section dependency analysis means for storing the dependency results;
The inter-node dependency determining means is:
A history for holding a history of dependency information obtained by the inter-cell dependency analysis unit for a node boundary unit whose dependency is not yet determined in response to the output of dependency information by the inter-cell dependency analysis unit Holding means;
Based on the history held by the history holding means, among the node boundary units for which the dependency destination is undetermined, a relationship destination determining means for determining an engagement destination that satisfies a predetermined condition;
Dependency relationship information for the node boundary unit is determined by combining the dependency point determined by the node dependency destination determination unit and the intra-node dependency analysis result held in the intra-node dependency analysis unit for the node boundary unit. The dependency structure analyzing apparatus according to claim 1, further comprising means for outputting.

In response to the output of dependency destination information by the inter-node dependency analysis unit, the history holding unit immediately before the dependency destination information obtained by the inter-cell dependency analysis unit for a node boundary unit whose dependency is not yet determined. Means for holding a predetermined number of histories for each clause boundary unit,
The node destination determination means responds to the fact that all the destination information held in the means for holding a certain node boundary unit matches, and determines the destination of the node boundary unit. 3. The dependency structure analyzing apparatus according to claim 2, further comprising means for determining a last obtained dependency by the inter-node dependency analysis means.

The inter-node dependency determining means further includes a predetermined value storage means for storing a predetermined natural number,
The means for holding the dependency destination information obtained by the inter-node dependency analysis means for a node boundary unit whose dependency destination is undetermined in response to the output of the dependency destination information by the inter-node dependency analysis means. 4. The dependency structure analyzing apparatus according to claim 3, further comprising means for holding the history immediately before the number of the predetermined natural number stored in the predetermined value storage means for each node boundary unit.

5. The dependency structure analyzing apparatus according to claim 4, further comprising input means for inputting the predetermined natural number stored in the predetermined value storage means.

The section destination determination means is
In response to the output of the dependency destination information by the inter-node dependency analysis means, the dependency destination information obtained by the inter-node dependency analysis means for the node boundary unit whose dependency is undetermined, and the above-mentioned node boundary unit A determination means for determining whether or not the last-mentioned dependency destination information held in the means for holding matches;
Counter means for counting the number of times that the result of matching is continuously obtained by the judging means for each of the node boundary units for which the relationship is undetermined;
In response to the count result of a certain node boundary unit reaching a predetermined value by the counter means, the dependency of the certain node boundary unit is changed to the inter-node dependency analysis means for the certain node boundary unit. 4. The dependency structure analyzing apparatus according to claim 3, further comprising means for determining the last obtained dependency point.

A computer program that, when executed by a computer, causes the computer to operate as the dependency structure analyzing apparatus according to any one of claims 1 to 6.