JPH02190972A

JPH02190972A - Example retrieving system

Info

Publication number: JPH02190972A
Application number: JP1010158A
Authority: JP
Inventors: Yasuji Kobuchi; 保司小渕; Hideo Mitsuyoshi; 三吉　秀夫; Akira Hamada; 明濱田; Hirokatsu Akiyama; 広勝秋山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1989-01-19
Filing date: 1989-01-19
Publication date: 1990-07-26

Abstract

PURPOSE:To improve the efficiency of extraction and analysis of various data for natural language research by retrieving a data storage means by a defined keyword obtained by adding information for grammatically restricting an object to be retrieved to a keyword. CONSTITUTION:A keyword input part 10 input one or more keywords, specifies grammatical information for defining the category of words constituting a sentence example executed in each keyword and forms a defined keyword by editing the specified grammatical information. The defined keyword is converted into a partial item to be matched, the converted partial item is matched with a sentence information list of a text data base 13 and the text data base 13 is retrieved to obtain the example of the corresponding sentence. A retrieved result output part 12 displays the example obtained by retrieval by a KWIC (context filing word) format in accordance with an output mode specified by a user. Thus, the example can be flexibly retrieved by using a structured retrieval keyword.

Description

【発明の詳細な説明】（イ）産業上の利用分野この発明は言語研究あるいは自然言語処理の研究に有用
な用例検索システムに関する。DETAILED DESCRIPTION OF THE INVENTION (a) Field of Industrial Application This invention relates to an example retrieval system useful for language research or natural language processing research.

（ロ）従来の技術従来、言語の用例を参照したり言語現象の分析を行うた
めに、用例検索システムにおいては“文脈付き見出し語
°と呼ばれているＫＷＩＣ（にｅｙｆｏｒｄ　Ｉｎ　Ｃ
ｏｎｔｅｘｔ）がよく用いられている。このＫＷＩＣは
単語の用例を提供するための言語データベースであり、
言語研究あるいは自然言語処理研究を行うための有用な
言語分析ツールであるとされている。ＫＷＩＣの構成は
、見出し語を、それが用いられている前後の文脈を付け
てソートした表であり、多くのＫＷＩＣが開発されてい
る。そして通常のＫＷＩＣは、１個の単語をキーワード
にして分類した形式のものが多い。一方このような単語
単位でソートしたＫＷＩＣの他に、多少異なる形式を持
つものとして、電子技術総合研究所で開発され新編日本
語品詞列集成、右順編（１９７９）、左順編（１９７９
）に記載されている“日本語品詞列集成”があげられる
。この日本語品詞列集成は、品詞付き単語列を品詞でソ
ートし、所定の長さの品詞列とその品詞列に該当する単
語の並びに前後文脈を付して収録したものである。(b) Conventional technology Conventionally, in order to refer to language examples and analyze linguistic phenomena, example retrieval systems have used KWIC (keyford in C), which is called "headword with context".
ontext) is often used. This KWIC is a language database that provides examples of word usage.
It is said to be a useful language analysis tool for language research or natural language processing research. The structure of KWIC is a table in which headwords are sorted by the context in which they are used, and many KWICs have been developed. Most of the usual KWICs are classified using one word as a keyword. On the other hand, in addition to the KWIC sorted by word, there are KWICs with a slightly different format, developed by the Electronic Technology Research Institute, called the New Collection of Japanese Parts of Speech, Right Order Edition (1979) and Left Order Edition (1979).
) is the “Japanese part-of-speech collection”. This collection of Japanese part-of-speech strings is a collection of word strings with part-of-speech sorted by part-of-speech, and a part-of-speech string of a predetermined length, an arrangement of words corresponding to the part-of-speech string, and context.

（ハ）発明が解決しようとする課題しかし従来のＫＷＩＣはその名の示すとおり、１個の単
語の表記をキーワードとして検索する仕組みであったり
、印刷物の形聾を取っているものが多いため、必ずしも
上記の目的に即した有用な情報は得られなかった。自然
言語処理ＩＥ究者や言語学者が本当に調査したい言語現
象は、個々の単語の用例ではなく、ある制約を持つ語句
同志の共起あるいは複雑なパターンを含む用例であるが
、上記した従来の固定化されたＫＷｒＣでは、このよう
な複雑なパターンによる用例検索は不可能であり、よっ
て従来の用例検索システムでは単語の用例を数多く参照
することや、検索しようとする単語が実際にどのような
文脈で用いられているかを広く調査することができなか
った。このような伏況から、柔軟な用例検索システムが
望まれていた。そして言語学者が言語研究を行う際に検
索したい情報は、（１）ある語が使われているか（２）
ある語がどの作品のどのページ・どの行で使われている
か（３）ある語がどの作品のどのページ・どの行で・ど
ういう文脈の中で使われているか（４）ある語がどの作
品のどのページ・どの行で・どういう文脈の中で・どう
いう品詞情報列の中で使われているか（５）ある意味の
語を文脈つきで出せ（６）連用修飾を文脈付きで出せ（
７）主語を文脈付きで出せ（８）「〜の〜する名詞」構
文の用例を出せ（９）ある語にかかる語を出せ（１０）
明治上期上流階級の女性の会話文において人称代名詞を
使った文を出せなどの項目である。(c) Problems to be solved by the invention However, as the name suggests, conventional KWICs have a mechanism for searching using the notation of a single word as a keyword, or are often printed in deaf form. It was not possible to obtain useful information that met the above objectives. The linguistic phenomena that natural language processing IE researchers and linguists really want to investigate are not examples of individual words, but examples that include the co-occurrence of words with certain constraints or complex patterns. In the standardized KWrC, it is impossible to search for examples using such complex patterns. Therefore, in conventional example search systems, it is necessary to refer to many examples of a word or to understand the actual context of the word to be searched. It was not possible to conduct a wide survey to see if it was used in Under these circumstances, a flexible example retrieval system has been desired. The information that linguists want to search for when conducting language research is (1) whether a certain word is used; and (2)
(3) Which page, in which line, and in what context a certain word is used in which work? (4) Which work does a certain word appear in? On which page, in which line, in what context, and in what part of speech information sequence? (5) Give a word with a certain meaning with a context. (6) Give a conjunctive modification with a context.
7) Give the subject with context (8) Give an example of the "noun of ~" construction (9) Give the word that refers to a certain word (10)
Items include producing sentences that use personal pronouns in conversations written by upper-class women in the early Meiji period.

この発明は以上の事情を考慮してなされたもので、構造
化された検索キーワードを用いて柔軟に用例検索を行う
ことができる用例検索システムを提供する。The present invention has been made in consideration of the above circumstances, and provides an example search system that allows example searches to be performed flexibly using structured search keywords.

（ニ）課題を解決するための手段および作用この発明に
おいて用例とは用い方の例を意味し、文における用い方
、語句における用い方を含む。(d) Means and effects for solving the problem In this invention, the term "example" means an example of how to use it, and includes how it is used in a sentence or in a phrase.

これらの文や語句は、日本語あるいは英語や独語等の外
国語においても適用することができる。These sentences and phrases can be applied to Japanese or foreign languages such as English and German.

テキストデータベースに付される文番号等とは、文番号
と文番号以外に含めることができる例えば、文をカテゴ
リー別に分類するための情報等である。The sentence numbers and the like attached to the text database are, for example, information for classifying sentences into categories that can be included in addition to the sentence numbers and sentence numbers.

定義付きキーワードとは実施例に説明する構造化キーワ
ードを示し、語曇範噴（単！Ｍ）を定義するための文法
素性の部分集合である語愈情報を要素とする順序付リス
トである。The defined keyword refers to a structured keyword explained in the embodiment, and is an ordered list whose elements are word information, which is a subset of grammatical features for defining the word cloud expression (single!M).

ＫＷＩＣ形式とは自動検索において、見出し語だけでな
く、その見出し語が使われている文脈も併せて検索する
形式を示す。The KWIC format refers to a format in which automatic searches include not only headwords but also the context in which the headwords are used.

この発明は、出典、文番号、文情報リストが付された文
の用例を表記にて多数記憶し、かつ文情報リストを、文
の用例を構成する単語の品詞分類、表記、品詞Ｍ類、活
用型、意味等の情報からなる文法情報の集合として表し
たテキストデータベースと、少なくともｌ以」二のキーワードの入力と、各キーワー
ド毎に行われ文の用例を構成する単語の範驕を定義する
ための文法情報の指定と、指定された文法情報を編集す
ることによる定義付きキーワードの作成を行うキーワー
ド入力部と、キーワード入力部から人力される定義付き
キーワードを、整合可能な部分項に変換し、テキストデ
ータベースの文情報リストと整合をとることにより、定
義付きキーワードに該当する文の用例を検索する検索部
と、検索された文の用例をＫＷＩＣ形式にて出力する検索結
果出力部とから構成されることを特徴とする用例検索シ
ステムである。This invention stores a large number of sentence examples in notation with sources, sentence numbers, and sentence information lists attached, and stores the sentence information list in terms of part-of-speech classification, notation, part-of-speech M class, etc. of the words that make up the sentence examples. A text database represented as a collection of grammatical information consisting of information such as conjugation types and meanings, input of at least 1 or 2 keywords, and definition of the range of words constituting examples of sentence usage for each keyword. A keyword input section that specifies grammatical information and creates defined keywords by editing the specified grammatical information, and a keyword input section that converts defined keywords entered manually from the keyword input section into matching subterms. , consists of a search section that searches for examples of sentences corresponding to defined keywords by matching with the sentence information list in the text database, and a search result output section that outputs examples of the searched sentences in KWIC format. This is an example retrieval system characterized by:

また、第２の発明ではキーワード入力部が、少なくとも
１以上の品詞の指定、または１以上の品詞の指定と指定
した各品詞固有の文法情報の指定によって定義付きキー
ワードの作成を行い、また、第３の発明ではキーワード
入力部が、品詞の指定と、指定した品詞が活用を伴う用
言である際に用言の活用に関する情報を指定することに
より、定義付きキーワードの作成を行い、第４の発明で
はテキストデータベースが、名詞の概念相互を上位／下
位関係で結び体系をなすシソーラスからなる意味情報を
有し、キーワード入力部が、少なくと６１以上の名詞の
概念の指定により定義付きキーワードを作成し、検索部
が定義付きキーワードに基づき、概念で結ばれた用例を
テキストデータベースから検索する。また第５の発明で
はキーワード入力部が、助ふ１１の指定と、指定した助
薗の種類を指定することにより、定義付きキーワードの
作成を行い、第６の発明ではキーワード入力部が、助詞
の指定と、指定した助詞の意味情報とを指定することに
より定義付きキーワードの作成を行う。Further, in the second invention, the keyword input unit creates a keyword with a definition by specifying at least one part of speech, or by specifying one or more parts of speech and specifying grammatical information specific to each specified part of speech. In the third invention, the keyword input unit creates a keyword with a definition by specifying the part of speech and, when the specified part of speech is a predicate with conjugation, specifying information regarding the conjugation of the predicate. In the invention, the text database has semantic information consisting of a thesaurus that connects noun concepts in a superior/subordinate relationship to form a system, and the keyword input section creates keywords with definitions by specifying at least 61 or more noun concepts. Then, the search unit searches the text database for examples connected by concepts based on defined keywords. Further, in the fifth invention, the keyword input unit creates a keyword with a definition by specifying the auxiliary term 11 and the type of the specified auxiliary term; A keyword with a definition is created by specifying the specification and the semantic information of the specified particle.

また、この発明の用例検索システムは、次のような特徴
を有する。Further, the example search system of the present invention has the following features.

（１））１敗個の単語から構成されるパターンを検索キ
ーワードとして日本語文の用例を検索することができる
。さらに検索キーワードの各単語に各種の文法的制約を
課することができる。このような性質を持った検索キー
ワード列を構造化キーワードと呼ぶ。文法的制約として
は“品詞、品詞細分類、活用情報、見出し、出現形”の
ような形態的情報のみならず、概念の階層関係を表すシ
ソーラスの概念名を意味的制約として与えることが可能
である。(1)) Examples of Japanese sentences can be searched using a pattern consisting of one word as a search keyword. Furthermore, various grammatical constraints can be imposed on each word of the search keyword. A search keyword string with such characteristics is called a structured keyword. As grammatical constraints, it is possible to provide not only morphological information such as "parts of speech, subdivisions of parts of speech, conjugation information, headings, and occurrences," but also thesaurus concept names that express hierarchical relationships between concepts as semantic constraints. be.

（２）検索された用例は、ユーザーが指定する順序にソ
ートされ、ＫＷＩＣ形式で表示される。また検索結果の
中の特定用例についてさらに詳細の文法情報を提供する
ことができる。(2) The searched examples are sorted in the order specified by the user and displayed in KWIC format. Additionally, more detailed grammatical information can be provided for specific usage examples within the search results.

（３）ユーザーにとって都合の良い構造化キーワード入
力機能を有する。(3) It has a structured keyword input function that is convenient for users.

（４）構造化キーワードの一部に任意の単語列とマッヂ
ング可能なワイルドカードを用いることができる。(4) A wild card that can be matched with any word string can be used as part of the structured keyword.

（ホ）実施例以下図に示す実施例に基づいてこの発明を詳述する。な
お、これによってこの発明は限定されるものではない。(e) Examples The present invention will be described in detail below based on examples shown in the drawings. Note that this invention is not limited by this.

第１図はこの発明の用例検索システムの構成図である。FIG. 1 is a block diagram of the example retrieval system of the present invention.

同図において用例検索システムは、主として入力手段と
してのキーワード入力部１０と、検索部１１と、検索結
果出力部１２と、データ記憶手段としてのテキストデー
タベースＩ３とから構成される。キーワード入力部１０
は検索のための構造化キーワードの単語数や各単！吾の
文法素性等を人力する。検索部１１は与えられた構造化
キーワードをＣＩ　Ｌ　（Ｃｏｍｐｌｅｘ　　Ｉｎｄｅ
ｔｅｒｍｉｎａｔｅＬａｎｇｕａｇｅ：人工知能的分野
の問題に適する関数形ブ［２グラミング言語“ｐｒｏｌ
ｏｇ”の拡張言語）の部分項リストに変換し、テキスト
データベース１３の検索を行う。検索結果出力部＋２は
検索して得られる用例を、ユーザーが指定する出力モー
ドに従ってＫｗｔｃ杉式で表示する。テキストデータベ
ース１３は検索部１１によって検索される対象となるテ
キスト（付加情報付き）が格納されているデータベース
である。なおＩ４はファイル部、１５はプリンタ出力部
を示す。In the figure, the example search system mainly includes a keyword input section 10 as an input means, a search section 11, a search result output section 12, and a text database I3 as a data storage means. Keyword input section 10
Is a structured keyword word count or each unit for searching! Manually check my grammar background etc. The search unit 11 searches the given structured keywords as CI L (Complex Index).
terminateLanguage: A functional language suitable for problems in the field of artificial intelligence.
og" (extended language) and searches the text database 13. The search result output section +2 displays the examples obtained by the search in Kwtc Sugi style according to the output mode specified by the user. The text database 13 is a database that stores text (with additional information) to be searched by the search section 11. Note that I4 indicates a file section, and 15 indicates a printer output section.

以下に上記した各部の詳細について説明する。The details of each part described above will be explained below.

テキストデータベース（ａ）素性集合として表現される単語テキストデータベース１３は検索部ＩＩによって検索さ
れる対象となるテキストが格納されているデータベース
である。前述したこの発明の特徴を実現するためには、
データベースに原文の文字づらだけの情報だけでなく、
“品詞、品詞細分類、活用情報、意味情報”などを付加
しておく必要がある。そのほか“単語の切れ目、出典、
文番号”等の情報も必要である。このような情報を簡便
に記述するために本システムではテキストデータの各単
語を“素性名／素性値゛対の集合としてとらえ、各種文
法情報の集合として１個の単語を表現している。これは
文法範噴を素性集合としてとらえる単一化文法理論の枠
組みに基づいている。モして各文の情報は、その集合の
リストとして表現される。各単語を構成する文法素性は
その単語の品詞によって異なる。Text Database (a) Words Expressed as a Feature Set The text database 13 is a database in which texts to be searched by the search unit II are stored. In order to realize the features of this invention described above,
The database contains not only information about the original text, but also
It is necessary to add "parts of speech, subclassification of parts of speech, conjugation information, semantic information," etc. In addition, “word breaks, sources,
Information such as "sentence number" is also required.In order to easily describe such information, this system treats each word in text data as a set of "feature name/feature value" pairs, and stores it as a set of various grammatical information. Expresses one word. This is based on the framework of unified grammar theory, which considers grammatical ranges as feature sets. The information of each sentence is expressed as a list of sets. The grammatical features that make up each word differ depending on the part of speech of that word.

（ｂ）品詞分類表Ｉに本システムで採用している品詞分類と、各品詞が
持つ分類素性を示す。(b) Part-of-speech classification table I shows the part-of-speech classifications adopted in this system and the classification features of each part of speech.

（以下余白）表１　品詞分類と素性表１かられかるように、１４種類の品詞を導入している。この品詞分類は、一般的な学校文法に亭拠し
ている。ただし、最後の“補助側“は一般に補助動お１
、補助形容詞などと呼ばれているものの総称である。こ
れらの各品詞に対して現在８種類の文法素性を割り当て
ている。ただし、各品詞が８個４°べての素性値を持つ
のではなく、表１中の“Ｏ”印の付いた素性のみを持つ
。各文法素性の役割と値は次のとおりである。(Leaving space below) Table 1 Classification and features of parts of speech As shown in Table 1, 14 types of parts of speech have been introduced. This classification of parts of speech is based on general school grammar. However, the last “auxiliary side” is generally an auxiliary movement or one
, is a general term for what are called auxiliary adjectives. Currently, eight types of grammatical features are assigned to each of these parts of speech. However, each part of speech does not have feature values for all eight 4 degrees, but only features marked with "O" in Table 1. The role and value of each grammatical feature are as follows.

品詞：Ｉ４［！ｌの品詞名のいずれか。Part of speech: I4 [! One of the part-of-speech names of l.

表記二その単語が文中で実際に使用されている表記（出
現形）。Form 2: The form in which the word is actually used in the sentence (occurring form).

見出し：活用語の終止形。複数表記のある場合代表形。Heading: Final form of conjugated word. If there is more than one notation, use the representative form.

表記と同一の場合が多い。Often the same as the notation.

種類：品詞によって異なる値を持つ。名詞の場合は“清
適名詞、す変名詞、数詞、固有名詞、形式名詞“のいず
れか。接頭語および接尾語の場合は、それが接続する語
によって“一般、数詞“のいずれか。助詞の場合は“格
助詞、終助詞、副助詞、並立助！１１１、接続助詞、係
助詞、鵡体助詞”のいずれか。Type: Has different values depending on the part of speech. In the case of nouns, they can be either "proper nouns, slang nouns, numerals, proper nouns, or formal nouns." In the case of prefixes and suffixes, either "general" or "numerical" depending on the word they connect. In the case of particles, either "case particle, final particle, adverbial particle, parallel particle! 111, conjunctive particle, conjunction particle, or corpuscular particle".

活用型：“五段、−段、力変、す変、行く、ある、なさ
る”のいずれか。Conjugation type: One of "godan, -dan, kihen, suhen, go, aru, and do".

活用行二五段活用の動詞のみに付加する。Add only to conjugated verbs with conjugation lines 2 and 5.

活用形：“未然、連用、終止、連体、仮定、命令′のい
ずれか。Conjugated form: “foreign, conjunctive, terminating, adjunct, presupposition, command’.

意味：第２図に示す本システム用シソーラスの中の概念
名を要素とするリスト。複数の値を取り得るのでリスト
にする。Meaning: A list whose elements are concept names in the thesaurus for this system shown in Figure 2. It can take multiple values, so make it a list.

（ｅ）文情報の実装テキストデータベースのそれぞれの文は次のような形式
を持っＣＩＬの１個のユニットクローズとして表現され
ている。(e) Implementation of sentence information Each sentence in the text database has the following format and is expressed as one unit clause in CIL.

５ｅｎｔｅｎｃｅ　　（出典、文番号、文情報リスト）
“文情報リスト”は、各単語を素性集合として表現した
もの（ＣＩＬの部分項）を要素とするリストである。素
性集・合をＣＩＬの部分項として表現する理由は、部分
項はフレームのようなデータ構造を表現しやすく、ＣＩ
Ｌの拡張ユニフィケーションが使用できるからである。5entence (source, sentence number, sentence information list)
The "sentence information list" is a list whose elements are each word expressed as a feature set (subterms of CIL). The reason why feature sets and combinations are expressed as CIL subterms is that subterms are easy to express data structures such as frames, and CI
This is because extended unification of L can be used.

この形式を用いることにより、例えば“自然を守る“と
いう出典の１３番口の文”森という環境を例にとってみ
よう”は第３図のように表現される。By using this format, for example, the 13th sentence in the source ``protecting nature'', ``Let's take the environment of the forest as an example,'' can be expressed as shown in Figure 3.

キーワード入力部キーワード入力部ＩＯは、検索のための構造化キーワー
ドの人力を行うためのものであり、通常キーボードまた
はマウスによって構成され、構造化キーワードの単語数
や各単語の文法素性を入力する。Keyword Input Unit The keyword input unit IO is for inputting structured keywords for searching, and is usually configured with a keyboard or mouse, and inputs the number of words of the structured keyword and the grammatical features of each word.

（ａ）構造化キーワード検索のための構造化キーワードは次のように定義される
。(a) Structured keywords for structured keyword search are defined as follows.

定義１：構造化キーワードは、語棄情報を要素とする順
序付リストである。Definition 1: A structured keyword is an ordered list whose elements are disjunction information.

定義２：語曇情報とは語鴬範噴（単語）を定義するため
の文法素性の部分集合である。ただし文法素性は“素性
名／Ｔ、性値”対として表現される。Definition 2: Word cloud information is a subset of grammatical features for defining words. However, grammatical features are expressed as a "feature name/T, gender value" pair.

このような構造化キーワードを導入することにより、表
記あるいは品詞のみをキーワードとしていた従来のＫＷ
ｆＧでは行えなかった柔軟な検索が可能になる。次に構
造化キーワードの例を示す。By introducing such structured keywords, conventional KW that used only expressions or parts of speech as keywords can be improved.
Flexible searches that were not possible with fG become possible. Below are some examples of structured keywords.

例１：［（品ｉＡ／名詞、種類／普通名詞、意味／抽象
概念ン、（品詞／助詞、見出し／を）］例２：［（品詞
／副詞）、＊、（品ｒＡ／動詞、活用型／五段、活用形
／連用）］例１は抽象概念という意味素性を持つ普通名詞の次に助
詞“を”が用いられている用例を検索するための構造化
キーワードである。例２はワイルドカード“＊“を用い
た例であり、ある副詞の後方に五段活用の動詞の連用形
が現れる用例を検索するための構造化キーワードである
。“＊”は複数の単語列を表し、その部分では何でもよ
いという意味になる。したがって例えば“東京＊“とす
ると、東京都、東京本社、東京都千代田区など“東京“
で始まる文字すべてが当てはまることになる。Example 1: [(article iA/noun, type/common noun, meaning/abstract concept, (part of speech/particle, heading/wo)] Example 2: [(article/adverb), *, (article rA/verb, conjugation Example 1 is a structured keyword to search for examples in which the particle "wo" is used next to a common noun that has the semantic feature of abstract concept. Example 2 is This is an example of using the wildcard "*", which is a structured keyword to search for examples in which the conjunctive form of a five-stage conjugated verb appears after a certain adverb. "*" represents multiple word strings, and The part can mean anything. Therefore, for example, if you say “Tokyo*”, it means “Tokyo”, such as Tokyo, Tokyo Head Office, Chiyoda Ward, Tokyo, etc.
All characters starting with .

（ｂ）構造化キーワード入力手順＋１ｔｌｆ＋化キーワードは次ぎの順序すなわち（１）
出典（２）品詞（３）表記・見出しく４）種類（５）活
用形・活用行・活用形（６）意味の順序で入力される。(b) Structured keyword input procedure + 1tlf + structured keywords in the following order, i.e. (1)
They are input in the order of source (2) part of speech (3) notation/heading 4) type (5) conjugation form/conjugation line/conjugation form (6) meaning.

素性値は、“表記・見出し“については直接入力するが
、他の素性は専用のウィンドウ表示においてメニュー選
択方式により行う。（４）〜（６）の素性は品詞の値に
より不要な入力要求を出さないようにしている。第４図
はこのような構造化キーワード入力中の画面の例を示し
、４ａ〜４ｅはそれぞれウィンドウ表示を示す。For feature values, "notation/heading" is directly input, but other features are entered using a menu selection method in a dedicated window display. The features (4) to (6) prevent unnecessary input requests from being made depending on the part of speech values. FIG. 4 shows an example of a screen during inputting such a structured keyword, and 4a to 4e each show a window display.

検索部検索部１１はＣＰＬＩ（中央処理装置）から構成され、
人力された構造化キーワードをパターンマツチング可能
な部分項に変換し、テキストデータベースＩ３への検索
を行う。そして検索結果を蓄えておく。Search unit The search unit 11 is composed of a CPLI (central processing unit),
The manually generated structured keywords are converted into subterms that can be pattern matched, and the text database I3 is searched. And save the search results.

検索結果出力部検索結果出力部１２は、検索した用例をユーザーが指定
するモードに従ってＫＷｆＣ形式で表示する。そしてユ
ーザーは次のようなモード指定を行うことができる。Search Result Output Unit The search result output unit 12 displays the searched examples in KWfC format according to the mode specified by the user. The user can then specify the following modes.

（１）表示する文法素性（デフォルトは表記のみ）（２
）ソートするか否か（昇順、降順、出現順）（３）ソー
トの優先順位（どのキーワードのどの素性か）（４）検索結果の出力光（ＣＦｔ’ｒ、ファイル、エデ
ィタのバッファ）（５）詳細文法素性表示指定（特定検索文の詳細情報表
示）第５図は構造化キーワード人力手順にて説明した構造化
キーワードによって、検索を実行した結果の例である。(1) Grammar features to display (default is notation only) (2
) Whether or not to sort (ascending order, descending order, appearance order) (3) Sorting priority (which feature of which keyword) (4) Output light of search results (CFt'r, file, editor buffer) (5 ) Detailed grammar feature display specification (detailed information display of specific search sentence) FIG. 5 is an example of the result of executing a search using the structured keyword explained in the structured keyword manual procedure.

この画面において上下左右方向のスクロールを行わせる
ことが可能である。It is possible to scroll this screen in the vertical and horizontal directions.

なお、この実施例におけるテキストデータベース！３は
、現在マニュアル操作によって形態素情報を付加してい
るが、既に試作されているＩＣＯ′ＶのＤＬＩＡＬＳの
ように、表記のみの文章に、形態素解析、構文解析、意
味解析等により自動的に種々の言語情報を付加するシス
テムによって構成されるテキストデータベースを用いて
もよい。したがって当然のことながらこの種のシステム
により生成される用例文章データベースもこの実施例に
おけるテキストデータベースＩ３に含まれる。In addition, the text database in this example! 3 currently adds morphological information through manual operations, but as with ICO'V's DLIALS, which has already been produced as a prototype, various types of information are automatically added to sentences with only notation through morphological analysis, syntactic analysis, semantic analysis, etc. A text database constructed by a system that adds linguistic information may also be used. Therefore, as a matter of course, the example sentence database generated by this type of system is also included in the text database I3 in this embodiment.

（以下余白）次にこの実施例の動作を第６図〜第１２図に示すフロー
チう・−トに従って説明する。(The following is a blank space.) Next, the operation of this embodiment will be explained according to the flow charts shown in FIGS. 6 to 12.

第６図は用例検索における全体の動作を示す図である。FIG. 6 is a diagram showing the overall operation in example search.

同図においてキーワード入力部ｌＯより構造化キーワー
ドの単語数Ｎを入力すると（ステップ２０）、処理終了
が指定されたかどうかを判断しくステップ２１）、処理
終了が指定されていなければキーワード入力部ＩＯから
の構造化キーワードにの人力を受け（ステップ２２）、
次いで検索部Ｉｌは、テキストデータベース１３に対し
て構造化キーワードＫに該当する情報の検索を行う（ス
テップ２３）。そして検索が終了すると、検索結果は検
索結果出力部１２に転送され、検索結果出力部１２は転
送された検索結果を、設定された出カモ−１・に従って
画面上に表示する（ステップ２４）。なお、ステップ２
１において処理終了が指定されると、検索した情報の解
除、ウィンドウのクローズ等の終了時の処理を行い（ス
テップ２５）、用例検索処理を終了１°る。In the same figure, when the number of words N of a structured keyword is input from the keyword input section IO (step 20), it is determined whether or not the end of processing has been specified (step 21). If the end of processing has not been specified, from the keyword input section IO Receive human power for structured keywords (step 22),
Next, the search unit Il searches the text database 13 for information corresponding to the structured keyword K (step 23). When the search is completed, the search results are transferred to the search result output unit 12, and the search result output unit 12 displays the transferred search results on the screen according to the set output mode 1 (step 24). In addition, step 2
When the end of processing is specified in step 1, end processing such as canceling the searched information and closing the window is performed (step 25), and the example search processing is ended by 1°.

次に上記した各処理の詳細フローを説明する。Next, detailed flows of each of the above-mentioned processes will be explained.

構造化キーワード人力処理第７図および第８図に、ステップ２２に示す“構造化キ
ーワード人力”の詳細な処理を示す。第７図において、
まず、ステップ２０において入力した単語数Ｎだけ単語
を人力したかどうかを判断しくステップ３α）、Ｎｏの
場合は単語を続けて人力しくステップ３Ｉ）、次いで単
語数Ｎだけ入力したかを判断４−る（ステップ３２）。Structured Keyword Manual Processing FIGS. 7 and 8 show detailed processing of the "structured keyword manual processing" shown in step 22. In Figure 7,
First, it is determined whether or not the number N of words entered in step 20 has been entered manually (step 3α); if no, the words are continued and manually entered step 3I); then it is determined whether the number N of words has been entered 4- (Step 32).

ここでＮｏの場合ワイルドカード指定の有無を人力しく
ステップ３３）ステップ３１に戻る。ただしワイルドカ
ードの指定の人力は単語数Ｎにはカウントされない。ス
テップ３ｏおよびステップ３２においてＹｅｓの場合は
、キーワードの他の候補ずなわら新たなキーワードを入
力するかどうか判断しくステップ３４）、Ｙｅｓの場合
はステップ３０に戻り、Ｎｏの場合は検索処理（ステッ
プ２３）に移る。If No here, step 33) returns to step 31 to manually determine whether or not a wild card is specified. However, the manpower required to specify a wild card is not counted in the number of words N. If Yes in step 3o and step 32, it is determined whether to enter a new keyword instead of other keyword candidates (step 34); if Yes, the process returns to step 30; if No, the search process (step 34) is performed; 23).

第８図はステップ３１に示す“単語人力″処理の詳細を
示している。まず品詞の入力を行い（ステップ４０）、
次いで単語の表記・見出しの入力を行い（ステップ４１
）、次いで文法素性すなわち入力した品詞の種類、活用
型、活用行、活用形、意味を人力４゛る（ステップ４２
）。ただし素性値は人ツノした品詞の種類によって異な
る。これらの入力は、画面上に表示される人力指示ガイ
ダンスに従って人力され、品詞および文法素性が入力さ
れると、人力した情報は編集され、単語が画面に表示さ
れる（ステップ４３）。次いで単語の他の候補を入力す
るかどうかを判断しくステップ４４）、Ｙｅｓの場合は
ステップ４０に戻り、Ｎｏの場合はステップ３２に移る
。FIG. 8 shows details of the "word manual" processing shown in step 31. First, enter the part of speech (step 40),
Next, input the notation and heading of the word (step 41).
), then the grammatical features, that is, the type, conjugation type, conjugation line, conjugation form, and meaning of the input part of speech, are manually determined (step 42).
). However, the feature value differs depending on the type of part of speech. These inputs are performed manually according to manual instruction guidance displayed on the screen, and once the part of speech and grammatical features are input, the manually entered information is edited and the words are displayed on the screen (step 43). Next, it is determined whether to input another word candidate (step 44), and if YES, the process returns to step 40, and if NO, the process moves to step 32.

検索処理第９図において、ステップ２３にて示した“検索”処理
の詳細を示す。まず検索は検索対象と゛なる１文′■゛
の取込みを行い（ステップ５０）、対象となる文が汀れ
ば（ステップ５１）その１文中に含まれる単語数Ｍを抽
出しくステップ５２）、次に文′ｒに関する単語ポイン
タ’１’　ｉを１にセットしくステップ５３）、キーワ
ードＫに関する単語ポインタＫｊを目こセットする（ス
テップ５４）。次いでマツチング情報エリアをクリアし
くステップ５５）、キーワ−ドに関する単語ポインタの
値が、構造化キーワードの単語数Ｎ以下であるかどうか
を判断しくステップ５６）、Ｙｅｓの場合は次いで文に
関する単語ポインタの値が、文に含まれる単語数Ｍ以下
であるかどうかを判断しくステップ５７）、Ｙｅｓであ
れば、ここで文Ｔにおける単語ポインタによって指示さ
れる要素と、キーワードＫにおける単語ポインタによっ
て指示される要素とが一致すれば（ステップ５８）、文
に関する単語ポインタの値ｉとキーワードに関する単語
ポインタの値ｊとを記憶（ｓａｖｅ）する（ステップ５
９）。次に、引き続き検索を行うためｉおよびｊの値に
それぞれＩを加算して（ステップ６０）ステップ５６に
戻る。ステップ５８において要素が一致しなければ、キ
ーワード間にワイルドカードが設定されているかどうか
判断しくステップ６１）、Ｙｅｓの場合は文Ｔにおける
単語ポインタの値ｉのみをＩＪＩＩ算して（ステップ６
２）ステップ５６に戻る。これはワイルドカードはキー
ワードとしてカウントしないためである。そしてステッ
プ５６においてキーワードＫにおける単語ポインタの値
ｊｈ＜構造化キーワードの単語数Ｎを超えた際は、マツ
チングした情報を検索結果出力部Ｉ２の記憶エリアに記
憶させ（ステップ６３）、さらに検索対象の文中に、キ
ーワードにと一致する要素が他に存在しないか検索する
ため、キーワードＫにおける単語ポインタの値ｊを１に
セットさせて上記したステップ５５〜６２の処理を繰り
返す。Search Process FIG. 9 shows details of the "search" process shown in step 23. First, the search takes in one sentence '■'' to be searched (step 50), and when the target sentences are found (step 51), the number of words M included in that one sentence is extracted (step 52), and then The word pointer '1' i associated with the sentence 'r' is set to 1 (step 53), and the word pointer Kj associated with the keyword K is set (step 54). Next, the matching information area is cleared (Step 55), and it is determined whether the value of the word pointer related to the keyword is less than or equal to the number of words N of the structured keyword (Step 56). If Yes, the value of the word pointer related to the sentence is then cleared. It is determined whether the value is less than or equal to the number of words M included in the sentence (step 57), and if Yes, the element pointed to by the word pointer in sentence T and the word pointer in keyword K is determined here. If the elements match (step 58), the value i of the word pointer regarding the sentence and the value j of the word pointer regarding the keyword are stored (save) (step 5
9). Next, in order to continue the search, I is added to each of the values of i and j (step 60), and the process returns to step 56. If the elements do not match in step 58, it is determined whether a wild card is set between the keywords (step 61), and if Yes, only the value i of the word pointer in the sentence T is calculated by IJII (step 61).
2) Return to step 56. This is because wildcards are not counted as keywords. Then, in step 56, when the value jh of the word pointer for the keyword K<the number of words N of the structured keyword is exceeded, the matched information is stored in the storage area of the search result output unit I2 (step 63), and the information of the search target is In order to search for any other element matching the keyword in the sentence, the value j of the word pointer for the keyword K is set to 1 and the processes of steps 55 to 62 described above are repeated.

なお、マッヂング情報とは、文における単語ポインタｉ
の値と、キーワードにおける単語ポインタの値ｊと、出
典と、文番号等である。Note that the mapping information is the word pointer i in the sentence.
, the value j of the word pointer in the keyword, the source, the sentence number, etc.

検索結果出力第１Ｏ図〜第１２図に、ステップ２４にて示した“検索
結果出力°処理の詳細を示す。Search result output FIGS. 1O to 12 show details of the "search result output process" shown in step 24.

第１０図において、“検索結果出力”は主として出力モ
ード設定（ステップ７０）、ウィンドウ表示（ステップ
７１）、コマンド入力（ステップ７２）、出力処理終了
の指示の有無の判断（ステップ７３）、そしてコマンド
実行（ステップ７４）の順で処理される。第１ｔ図に、
ステップ７０にて示した“出力モード設定″処理を説明
する。まず出力する文法素性を指定し、どの範囲の情報
（表記、品詞分類等）まで必要かを指定する（ステップ
８０）。次にセンタリング（第５図に示す一列に配置さ
れたキー“むしろ”および“らう”を示す）させるキー
を指定する（ステップ８１）。次いでソートするか否か
の判断を行い（ステップ８２）、Ｙｅｓの場合はソート
方向が昇順であるかまたは降順かを指定する（ステップ
８３）。次いでソートの対象となるキーの順序ケなわち
何番目のキーをソートの中心に設定するかを指定しくス
テップ８４）、ソートの対象となるキーの文法素性を指
定する（ステップ８５）。In FIG. 10, "search result output" mainly includes output mode setting (step 70), window display (step 71), command input (step 72), determination of whether there is an instruction to end output processing (step 73), and command Processing is performed in the order of execution (step 74). In Figure 1t,
The "output mode setting" process shown in step 70 will be explained. First, the grammatical features to be output are specified, and the range of information (notation, part-of-speech classification, etc.) required is specified (step 80). Next, the key to be centered (the keys ``rata'' and ``rau'' arranged in a row as shown in FIG. 5) is designated (step 81). Next, it is determined whether or not to sort (step 82), and if yes, the sorting direction is designated as ascending or descending (step 83). Next, the order of the keys to be sorted, that is, the number of keys to be set as the center of sorting is specified (step 84), and the grammatical features of the keys to be sorted are specified (step 85).

この文法素性の指定は、例えばキーの出力を表記でなく
品詞分類によって表示したいときなどに指定される。上
記した処理により出力モードが設定される。This grammatical feature is specified, for example, when it is desired to display key output by part of speech classification rather than notation. The output mode is set by the process described above.

次に第１２図に、ステップ７１にて示した°ウィンドウ
表示”処理を説明する。まず出力モードにソートが設定
されているかどうかを判断しくステップ９０）、Ｙｅｓ
の場合はソートを実行しくステップ９１）、設定された
出力モードに従い、表示データを生成しくステップ９２
）、生成結果を画面上に表示する（ステップ９３）。上
記した処理によりウィンドウ表示が実行される。Next, the "window display" process shown in step 71 will be explained with reference to FIG. 12. First, it is determined whether or not the output mode is set to sort.
If so, perform sorting (step 91) and generate display data according to the set output mode (step 92).
), the generated results are displayed on the screen (step 93). Window display is executed by the above processing.

次にこのような処理を行う用例検索方式において、構造
化キーワードの入力形態と出力としての詔曇抽出例とを
対比させて以下に示す。Next, in the example search method that performs such processing, the input form of the structured keyword and an example of the extraction of 返cloud as output will be compared and shown below.

なお説明を簡単にするためにテキストデータベース１３
に記憶されている語灸を、“森という環境を例にとって
みよう“の−文だけであると仮定する。また抽出例は、
文の表記をもって出力するらのとする。In order to simplify the explanation, we will use the text database 13.
Assume that the only memorized word moxibustion is the sentence ``Let's take the forest environment as an example.'' Also, an example of extraction is
Suppose that we output the notation of the sentence.

（１）形態素情報による抽出キーワードが［（品詞／動詞）、（品詞／助詞）］のと
き、抽出例は“森という環境を例にとりてみよう”であ
る。この検索方式によれば、キーワード毎に定義を設定
できるとともに、品詞が動詞の場合はさらに動詞固有の
情報である活用型、活用行、活用形を定義でき、また品
詞が名詞であれば活用に関する情報は当然不要となり、
その代わりに名詞固有の情報である上位概念情報（シソ
ーラス）等を設定することができる。(1) Extraction based on morphological information When the keyword is [(part of speech/verb), (part of speech/particle)], the extraction example is "Let's take the forest environment as an example." According to this search method, it is possible to set a definition for each keyword, and if the part of speech is a verb, it is also possible to define the conjugation type, conjugation line, and conjugation form, which are verb-specific information, and if the part of speech is a noun, it is possible to define the conjugation type, conjugation line, and conjugation form. Of course, the information is no longer needed,
Instead, superordinate concept information (thesaurus), which is noun-specific information, can be set.

（２）用言の活用情報による抽出キーワードが［（品詞／動詞、活用型／五段）］のとき
、抽出例は“森という環境を例にとってみよう”、森と
いう環境を例にとってみよう”である。動詞、形容詞、
形容動詞など活用のある自立語を用言と呼ぶが、ここで
は活用のある助動詞も用言として扱う。用言は一般に未
然、連用、終止、連体、仮定、命令の６つの活用がある
。これを構造化キーワードの１つの要素として定義する
ことにより検索を行う。またこの検索方式によれば、構
造化キーワードにおいて、表記が“動＜゛、活用形か“
連体形”と設定すると、連体形の°動く”と終止形の“
動く”とを区別することができ、連体形の“動く”に関
連した用例のみを検索することができる。(2) Extraction based on conjugation information of predicates When the keyword is [(part of speech/verb, conjugation type/5 stages)], the extraction examples are "Let's take the environment called a forest as an example" or "Let's take the environment called a forest as an example". There are verbs, adjectives,
Independent words with conjugations, such as adjective verbs, are called pragmatics, but here, auxiliary verbs with conjugations are also treated as pragmatics. There are generally six conjugations of predicates: conjugation, conjunctive, terminating, adjunctive, presupposition, and imperative. A search is performed by defining this as one element of a structured keyword. Also, according to this search method, structured keywords can be written as
If you set ``Administrative form'', the continuous form ``Move'' and the final form ``Move'' will be set.
``move'' can be distinguished, and only examples related to the adjunctive form of ``move'' can be searched.

（３）意味情報による抽出キーワードが［（意味／抽象概念）］のとき、抽出例は
“森という環境を例にとってみよう“である。(3) Extraction based on semantic information When the keyword is [(meaning/abstract concept)], the extraction example is "Let's take the forest environment as an example."

この検索方式によれば、構造化キーワードにおいて意味
を動物とし［（動物）が走る］を設定すると、（動物）
の下位概念をまとめて検索することができる。また、［
が走る］を設定すると多数の用例が検索されるが、［（
無生物）が走る］を設定することにより“汽車が走る”
などのように（生物）を除去した検索を行うことができ
る。According to this search method, if the meaning of a structured keyword is set to animal and [(animal) runs], then (animal)
It is possible to search all the sub-concepts of . Also,[
If you set ``runs'', many examples will be found, but ``(
“The train runs” by setting “Inanimate object) runs”
You can perform a search that excludes (organisms), such as.

（４）助詞の種類ｔｉｔ報による抽出キーワードが［【品ｒＡ／名詞）、（品詞／助詞、ｌｌ
ｌ１格助詞）］のとき、抽出例は“森という環境を例に
とってみよう”、“森という環境を例にとってみよう”
である。この検索方式によれば、格助詞として使われる
゛と”と並列助詞として使われる“と“とを区別して検
索することができる。(4) Extracted keywords based on particle type tit information are [[article rA/noun], (part of speech/particle, ll
l1 case particle)], the extraction examples are "Let's take the environment called a forest as an example", "Let's take the environment called a forest as an example"
It is. According to this search method, it is possible to distinguish between ゛ and '', which are used as case particles, and `` and '', which are used as parallel particles, in a search.

（５）助詞の意味情報による抽出ただしこの場合においては、データベースＩ３に記憶さ
れている語愈は次ぎの２例と仮定する。(5) Extraction based on semantic information of particles However, in this case, it is assumed that the following two examples are the words stored in the database I3.

ｌ！Ａ１１［風（）、で（品詞／助詞、見出し／で１表
記／で、ｆＸ類／格助詞、意味／［原因コ）、会社（）
を（）、休む（）］例２　［紙（）、を（）、鋏（）、で（品詞／助詞、見
出し／で１表記／で１種類／格助詞、意味／［道具〕）
、切る（）〕である。なお、（）内の情報は省略する。l! A11 [Wind (), de (part of speech / particle, heading / 1 notation / de, fX class / case particle, meaning / [cause ko), company ()
wo (), rest ()] Example 2 [Paper (), wo (), scissors (), de (part of speech/particle, heading/one notation/one type/case particle, meaning/[tool])
, cut()]. Note that information in parentheses is omitted.

キーワードが［（品詞／名詞）、（品詞／助詞、意味／
道具）］のとき、抽出例は“紙を鋏で切る“である。The keywords are [(part of speech/noun), (part of speech/particle, meaning/
tool)], the extraction example is "cutting paper with scissors."

助詞の意味は助詞毎に何種類かあるが、格助詞“で“に
関して意味を分類した例を示すと、学校で勉強する（場
所）、車で荷物を運ぶ（手段、方法）、金づちで釘を打
つ（道具）、風で休む（原因）、粉でパンを作る（材料
）、三日で完了する（＠囲、程度）、小声で話す（様ｔ
ｓ）、会社で雇う（主体）などに分類される。したがっ
てこの検索方式によれば、表記が同じ助詞（なおかつ種
類の同じ助詞であってら）を、その使われる役割（この
実施例では意味という）によって区別して検索すること
ができる。There are several meanings for each particle, but examples of the classification of meanings for the case particle ``are'': studying at school (place), carrying luggage by car (means, method), nailing with a hammer. hitting (tool), resting in the wind (cause), making bread with flour (ingredients), completing it in three days (@en, degree), speaking in a whisper (sama t)
s), hired by the company (subject), etc. Therefore, according to this search method, particles with the same notation (and particles of the same type) can be searched by distinguishing them according to the role they are used in (referred to as meaning in this embodiment).

なお、ここに示したキーワードはあくまでも一例であり
、種々のバリエーションが存在する。Note that the keywords shown here are just examples, and there are various variations.

（へ）発明の効果この発明によれば、キーワードに、検索対象を文法的に
制約する情報を付加した定義付きキーワードによってデ
ータ記憶手段の検索を行うので、自然言語研究のための
多様なデータの抽出および解析を効率良（行うことがで
きる。(f) Effects of the Invention According to the present invention, data storage means is searched using defined keywords, which are added with information that grammatically restricts the search target. Extraction and analysis can be performed efficiently.

[Brief explanation of the drawing]

ｍ１図はこの発明に係る用例検索システムの構成図、第
２図は実施例の用例検索処理で用いられるシソーラスの
説明図、第３図は実施例のテキストデータベースに記憶
される文情報例を示す説明図、第４図は実施例の構造化
キーワード入力状態を示す説明図、第５図は実施例の検
索結果を示す説明図、第６図〜第１２図はこの実施例の
動作を示すフローヂャートである。ＩＯ・・・・・・キーワード入力部、Ｉｔ・・・・・・検索部、１２・・・・・・検索結果出力部、１３・・・・・・テキストデータベース。第５図第８図Fig. m1 is a configuration diagram of the example retrieval system according to the present invention, Fig. 2 is an explanatory diagram of a thesaurus used in the example retrieval process of the embodiment, and Fig. 3 shows an example of sentence information stored in the text database of the embodiment. 4 is an explanatory diagram showing the structured keyword input state of the embodiment, FIG. 5 is an explanatory diagram showing the search results of the embodiment, and FIGS. 6 to 12 are flowcharts showing the operation of this embodiment. It is. IO: Keyword input section, It: Search section, 12: Search result output section, 13: Text database. Figure 5 Figure 8

Claims

[Claims] 1. A large number of usage examples are stored, classified into grammatical information such as part-of-speech classification, notation, part-of-speech type, conjugation type, meaning, etc. for each word constituting the usage example, and A text database with sources, sentence numbers, etc. added as desired, at least one piece of grammatical information in a predetermined order, and when grammatical information is specified, a definition can be created by editing the grammatical information. The keyword input section that creates keywords and the defined keyword input from the keyword input section are converted into matching subterms, and by matching the converted subterms with the sentence information in the text database, the defined keywords are created. An example search system comprising: a search section that searches for examples corresponding to a keyword; and a search result output section that outputs the searched examples in KWIC format. 2. In the usage example retrieval system according to claim 1, the keyword input section specifies at least one part of speech, or by specifying one or more parts of speech and specifying information such as conjugation form specific to each specified part of speech. 2. The example search system according to claim 1, wherein keywords with definitions are created. 3. In the example search system according to claim 2, the keyword input section specifies the part of speech and, when the specified part of speech is a predicate with conjugation, specifies information regarding the conjugation of the predicate, thereby generating the keyword with a definition. 3. The example retrieval system according to claim 2, wherein the example retrieval system creates an example retrieval system. 4. In the example retrieval system according to claim 2, the text database has semantic information consisting of a thesaurus that forms a system that connects concepts of nouns in a superior/subordinate relationship, and the keyword input section includes at least one or more nouns. 3. The example search system according to claim 2, wherein a defined keyword is created by specifying a concept, and the search unit searches for examples connected by the concept based on the defined keyword. 5. The example search system according to claim 2, wherein the keyword input section creates a keyword with a definition by specifying a particle and a type of the specified particle. Example search system. 6. The example search system according to claim 2, wherein the keyword input section creates a defined keyword by specifying a particle and semantic information of the specified particle. example search system.