JP4515186B2

JP4515186B2 - Speech dictionary creation device, speech dictionary creation method, and program

Info

Publication number: JP4515186B2
Application number: JP2004222876A
Authority: JP
Inventors: 正浩道脇
Original assignee: 株式会社ジー・エフグループ
Priority date: 2003-09-02
Filing date: 2004-07-30
Publication date: 2010-07-28
Anticipated expiration: 2024-07-30
Also published as: JP2005099741A

Abstract

<P>PROBLEM TO BE SOLVED: To efficiently create a voice dictionary suitable for practical use. <P>SOLUTION: This voice dictionary creation device has a reading processing part connected to the Internet network. The reading processing part reads a site to which character input is attempted to be performed or a site linked from the site or other site which a user specifies as especially having deep relation, transfers the data to a character string extraction part. The character string extraction part analyses the transferred data as character strings to be used for voice recognition of words, etc., and transfers its result to the voice dictionary creation part. The voice dictionary creation part creates the voice dictionary by performing processing such as adjustment of use frequency of the words and addition of unknown words according to the data transferred from the character string extraction part. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

この発明は、音声を認識して文字列に変換する際に用いられる音声辞書データを作成する音声辞書作成装置、音声辞書作成方法、及びプログラムに関する。 The present invention relates to a speech dictionary creation device, a speech dictionary creation method, and a program for creating speech dictionary data used when speech is recognized and converted into a character string.

音声認識装置は、その一部または別個に音声辞書作成装置とともに用いられる。これは、たとえば用途によって、同じ「こうかい」という発音でも「公開」「後悔」「航海」などがあり、日本語の係り受けを使っても「こうかい−する」でいずれも成立することから、使用者側がどの用例を使うか指定してやらないと音声認識した結果の文字が、意図したものが得られないからである。このような音声辞書があらかじめ準備されている場合もあるが、それでも使用者側で優先順位（選択順序）などを調整してやらないと望む結果が得がたいので、やはり音声辞書を操作する必要が生じる。また、そもそも音声認識装置にとって既知でない単語である「未知語」が含まれていると、音声認識に支障を生じるため、そのような未知語を音声辞書作成装置で追加して音声認識装置の認識率を高めるのである。 The speech recognition device is used in part or separately with the speech dictionary creation device. This is because, for example, depending on the application, the same pronunciation “Koukai” can be “public”, “regret”, “voyage”, etc. This is because if the user side does not specify which example to use, the intended character as a result of speech recognition cannot be obtained. In some cases, such a speech dictionary is prepared in advance, but it is still necessary to operate the speech dictionary because it is difficult to obtain a desired result unless the user adjusts the priority (selection order). In addition, if an "unknown word" that is a word that is unknown to the speech recognition device is included in the first place, it will interfere with speech recognition. Therefore, such an unknown word is added by the speech dictionary creation device and the speech recognition device recognizes it. Increase the rate.

このような音声辞書作成を行う場合、従来はひとつひとつの単語を、人間が辞書登録を行っており、実用に耐える辞書を作成するのには多くの工数を必要とした。また、あらかじめ準備された辞書を用いる場合でも、用途によって認識される単語の優先順位が異なるために、使用者による試行錯誤によって音声辞書を操作する手間が膨大に必要であった。また、大規模な長文を認識して効率を上げるような試みも行われていたが、用意する文章が必ずしも話者が意図したものに近似しているとは限らないので、やはり多く修正の工数が必要とされていた。 In the case of creating such a voice dictionary, humans have conventionally registered a dictionary for each word, and it took a lot of man-hours to create a practical dictionary. Further, even when a dictionary prepared in advance is used, the priority order of the words recognized by the application is different, so that it takes a lot of labor to operate the speech dictionary by trial and error by the user. In addition, attempts have been made to increase efficiency by recognizing large, long sentences, but since the prepared text does not always approximate what the speaker intended, there are still many man-hours for correction. Was needed.

なお、音声認識により音声を文字列に変換する処理において、変換する文字列の候補が複数ある場合に、過去に入力された文字列の履歴に基づいて複数の候補の中から適切な候補を選択する音声認識装置及び音声認識方法が特許文献１に開示されている。
特開２００３−１０８１８９号公報 In the process of converting speech into a character string by speech recognition, if there are multiple character string candidates to be converted, an appropriate candidate is selected from a plurality of candidates based on the history of character strings input in the past A speech recognition apparatus and a speech recognition method are disclosed in Patent Document 1.
JP 2003-108189 A

前述した通り、音声認識装置の使用者が音声を文字列に変換する際の優先順位をひとつひとつ登録することにより実用に耐え得る音声辞書を作成するには多くの工数が必要である。また、音声辞書作成装置があらかじめ備える音声辞書を用いる場合であっても、使用者の用途に応じて優先順位を調整するには膨大な手間がかかった。また、特許文献１に開示されている音声認識装置及び音声認識方法を用いる場合であっても、過去に入力された文字列の履歴が無ければ適切な文字列の候補を選択することができない。 As described above, it takes a lot of man-hours to create a speech dictionary that can withstand practical use by registering the priorities for converting speech into character strings one by one by the user of the speech recognition apparatus. Further, even when a speech dictionary provided in advance in the speech dictionary creation device is used, it takes a great deal of time to adjust the priority order according to the user's application. Even when the speech recognition apparatus and speech recognition method disclosed in Patent Document 1 are used, an appropriate character string candidate cannot be selected if there is no history of character strings input in the past.

本発明は前記課題に鑑み案出されたもので、インターネット接続手段がありインターネット接続手段を通じて提供されている文字入力可能なサイトに音声認識装置を使って文字入力を行いたいときに、短期間に実用に耐え得る音声辞書作成を可能にすることを目的とする。 The present invention has been devised in view of the above problems, and when a user has an Internet connection means and wants to input characters using a speech recognition apparatus in a site where characters can be input provided through the Internet connection means, The purpose is to enable the creation of a speech dictionary that can withstand practical use.

インターネット網に接続された読込処理部が文字入力を行おうとするサイトまたはそのサイトからリンクされたサイトまたはその他特に関連が深いとして使用者が指定したサイトを読み込んで、そのデータを文字列抽出部に受け渡す。文字列抽出部は、受け渡されたデータを単語等の音声認識に使用する文字列として解析してその結果を音声辞書作成部に受け渡す。音声辞書作成部は文字列抽出部から受け渡されたデータに従って、単語の使用頻度を調整したり、未知語を追加するなどの処理を行って音声辞書を作成し音声認識装置の音声辞書を作成する。 The reading processing unit connected to the Internet network reads the site where the character input is to be performed, the site linked from the site, or the other site specified by the user as being particularly relevant, and the data is read to the character string extracting unit. Deliver. The character string extraction unit analyzes the received data as a character string used for speech recognition of a word or the like, and passes the result to the speech dictionary creation unit. The speech dictionary creation unit creates a speech dictionary by adjusting the frequency of use of words and adding unknown words according to the data passed from the character string extraction unit, and creates a speech dictionary for the speech recognition device. To do.

従来は１件１件試行錯誤しながら人手によって音声辞書を作成していたが、本発明ではインターネット上から音声辞書を作成するためのデータを得ることとし、その範囲を、音声入力を行おうとするサイト、またそのリンク先、または使用者が特に関連が深いとして指定したサイトをデータ取得対象とする点が異なる。また、単に長文を音声辞書作成に使用するのではなく、その範囲を、音声入力を行おうとするサイト、またそのリンク先、または使用者が特に関連が深いとして指定したサイトをデータ取得対象とする点が異なる。 Conventionally, a voice dictionary is created manually by trial and error one by one. However, in the present invention, data for creating a voice dictionary is obtained from the Internet, and the range is to be used for voice input. The difference is that the site, the link destination, or the site designated by the user as being particularly relevant is targeted for data acquisition. Also, rather than simply using a long sentence to create a speech dictionary, the scope of the data acquisition targets sites that are intended to be used for speech input, links to them, or sites that the user has specified as being particularly relevant. The point is different.

インターネットの性質を利用して、音声入力を行おうとするサイトそのもの又そのリンク先から音声辞書を作成する。また、以上でデータ量が不足である場合には使用者が任意に関連の深いと思われるサイトを読込処理部に追加することができる。このような手段により、効率的に音声辞書の作成を行う手段を提供する。 Utilizing the nature of the Internet, a voice dictionary is created from the site or the link destination where the voice is to be input. In addition, when the amount of data is insufficient as described above, a user can arbitrarily add a site that seems to be closely related to the read processing unit. By such means, means for efficiently creating a speech dictionary is provided.

インターネット上に文字入力を受け付ける文字入力サイトが存在しており、インターネット網に接続されたインターネット接続装置がこの文字入力サイトに接続されており、インターネット接続装置に音声認識装置が付加されていて文字入力サイトに音声認識された文字を送信しようとするときに、インターネット網に接続された読込処理部が文字入力を行おうとするサイトまたはそのサイトからリンクされたサイトまたはその他特に関連が深いとして使用者が指定したサイトを読み込んで、そのデータを文字列抽出部に受け渡し、文字列抽出部は、受け渡されたデータを単語等の音声認識に使用する文字列として解析してその結果を音声辞書作成部に受け渡し、音声辞書作成部は文字列抽出部から受け渡されたデータに従って、単語の使用頻度を調整したり未知語を追加するなどの処理を行って音声辞書を作成して音声認識装置の音声辞書を作成し、この音声辞書を音声認識装置が用いることで効率的な音声認識処理による文字入力処理を行う。 There is a character input site that accepts character input on the Internet. An Internet connection device connected to the Internet network is connected to this character input site, and a voice recognition device is added to the Internet connection device. When trying to send a voice-recognized character to a site, the user may consider that the reading processing unit connected to the Internet network attempts to input the character, the site linked from that site, or other particularly relevant The specified site is read and the data is passed to the character string extraction unit. The character string extraction unit analyzes the received data as a character string used for speech recognition of words and the like, and the result is a voice dictionary creation unit. The voice dictionary creation unit uses words according to the data received from the character string extraction unit. Create a speech dictionary by adjusting the degree or adding unknown words to create a speech dictionary for the speech recognition device. The speech recognition device uses this speech dictionary for efficient speech recognition processing. Perform character input processing.

本発明の音声辞書作成装置は、ＣＰＵ、メモリ、及びインターネットに接続するための通信インタフェースと、前記メモリに設けられた記憶領域である、第一の音声辞書記憶部、第二の音声辞書記憶部、読込ＵＲＬ記憶部、及び出現頻度記憶部と、前記ＣＰＵが前記メモリに記憶されているプログラムを実行することにより実現される、読込処理部、文字列抽出部、及び音声辞書作成部と、を備え、前記第一の音声辞書記憶部には、単語等の文字列を示す文字列データと、当該文字列の読みを示す読みデータとが対応付けて記憶され、前記読込ＵＲＬ記憶部には、ＷｅｂページのＵＲＬが記憶され、前記読込処理部は、前記読込ＵＲＬ記憶部に記憶されている前記ＵＲＬで指定されるＷｅｂページを、インターネット上のＷｅｂサーバにアクセスすることにより取得し、前記文字列抽出部は、前記読込処理部が取得した前記Ｗｅｂページを構成しているデータに含まれている前記文字列データを抽出して前記文字列データごとの出現頻度を求め、前記文字列データ及び前記出現頻度を示すデータである出現頻度データを対応付けて前記出現頻度記憶部に記憶し、前記音声辞書作成部は、前記出現頻度記憶部に記憶されている前記文字列データに対応する前記読みデータを前記第一の音声辞書記憶部から取得し、取得した前記読みデータが同一の複数の前記文字列データの夫々に対して、音声認識によって得られる前記読みデータを前記複数の前記文字列データのうちから選択される一の文字列データに変換する際の前記文字列データの選択順序を前記出現頻度データに基づいて決定し、前記文字列データ、前記読みデータ、及び前記選択順序を示すデータである選択順序データを対応付けて、音声を認識して文字列に変換する際に用いられるデータである音声辞書データを生成し、当該音声辞書データを前記第二の音声辞書記憶部に記憶することとする。 The speech dictionary creation device of the present invention includes a CPU, a memory, a communication interface for connecting to the Internet, and a first speech dictionary storage unit and a second speech dictionary storage unit which are storage areas provided in the memory. A read URL storage unit, an appearance frequency storage unit, and a read processing unit, a character string extraction unit, and a speech dictionary creation unit realized by the CPU executing a program stored in the memory. The first voice dictionary storage unit stores character string data indicating a character string such as a word and reading data indicating a reading of the character string in association with each other, and the reading URL storage unit stores The URL of the Web page is stored, and the read processing unit accesses the Web page specified by the URL stored in the read URL storage unit to a Web server on the Internet. The character string extraction unit extracts the character string data included in the data constituting the Web page acquired by the read processing unit, and appears for each character string data. The frequency is obtained, the character string data and the appearance frequency data which is data indicating the appearance frequency are associated with each other and stored in the appearance frequency storage unit, and the speech dictionary creation unit is stored in the appearance frequency storage unit The reading data corresponding to the character string data is acquired from the first speech dictionary storage unit, and the acquired reading data is obtained by speech recognition for each of the plurality of character string data having the same reading data. The selection order of the character string data when converting the data into one character string data selected from the plurality of character string data is determined based on the appearance frequency data The character string data, the reading data, and the selection order data that is the data indicating the selection order are associated with each other to generate speech dictionary data that is data used when the speech is recognized and converted into a character string. The speech dictionary data is stored in the second speech dictionary storage unit.

また、本発明のプログラムは、ＣＰＵ、メモリ、及びインターネットに接続するための通信インタフェースと、前記メモリに設けられた記憶領域である、単語等の文字列を示す文字列データと、当該文字列の読みを示す読みデータとが対応付けて記憶されている第一の音声辞書記憶部と、ＷｅｂページのＵＲＬが記憶されている読込ＵＲＬ記憶部と、を備えるコンピュータに、前記読込ＵＲＬ記憶部に記憶されている前記ＵＲＬで指定されるＷｅｂページを、インターネット上のＷｅｂサーバにアクセスすることにより取得する手順と、取得した前記Ｗｅｂページに含まれている前記文字列データを抽出して前記文字列データごとの出現頻度を求める手順と、前記文字列データ及び前記出現頻度を示すデータである出現頻度データを対応付けて前記メモリに設けられた記憶領域である出現頻度記憶部に記憶する手順と、前記出現頻度記憶部に記憶されている前記文字列データに対応する前記読みデータを前記第一の音声辞書記憶部から取得する手順と、取得した前記読みデータが同一の複数の前記文字列データの夫々に対して、音声認識によって得られる前記読みデータを前記複数の前記文字列データのうちから選択される一の文字列データに変換する際の前記文字列データの選択順序を前記出現頻度データに基づいて決定する手順と、前記文字列データ、前記読みデータ、及び前記選択順序を示すデータである選択順序データを対応付けて、音声を認識して文字列に変換する際に用いられるデータである音声辞書データを生成し、当該音声辞書データを前記メモリに設けられた記憶領域である第二の音声辞書記憶部に記憶する手順と、を実行させるためのものである。 The program of the present invention includes a CPU, a memory, a communication interface for connecting to the Internet, character string data indicating a character string such as a word, which is a storage area provided in the memory, Stored in the read URL storage unit in a computer comprising a first voice dictionary storage unit storing reading data indicating readings in association with each other and a read URL storage unit storing a URL of a Web page A procedure for acquiring a Web page specified by the URL that is specified by accessing a Web server on the Internet, and extracting the character string data included in the acquired Web page to extract the character string data Associating the procedure for obtaining the appearance frequency for each character with the character string data and the appearance frequency data that is the data indicating the appearance frequency The procedure of storing in the appearance frequency storage unit, which is a storage area provided in the memory, and the reading data corresponding to the character string data stored in the appearance frequency storage unit from the first speech dictionary storage unit One character selected from among the plurality of character string data for the reading data obtained by voice recognition for each of the plurality of character string data having the same reading data and the acquired reading data Corresponds to the procedure for determining the selection order of the character string data when converting to column data based on the appearance frequency data, and the selection order data that is the data indicating the character string data, the reading data, and the selection order In addition, voice dictionary data, which is data used when recognizing voice and converting it into a character string, is generated, and the voice dictionary data is stored in the memory. Is intended for executing the instructions stored in the second speech dictionary storage unit is a region.

音声認識技術の発達に伴い、音声認識プログラムを用いて音声を文字列に変換することが行われている。例えば、営業日報等の文字入力が必要なＷｅｂページへの文字入力や、学術論文等の文書データの作成等において用いられている。このような音声認識プログラムでは、音声を文字列に変換するための音声辞書が用いられる。音声認識プログラムが初めから備える標準の音声辞書（以後、「標準辞書」と称する。本発明の「第一の音声辞書」に該当。）には、例えば、「こうかい」という読みに対して「後悔」や「公開」、「航海」等の文字列が、変換される優先順位とともに記憶されている。音声認識プログラムは、認識した音声に対応する文字列を標準辞書の優先順位に従って決定する。 With the development of speech recognition technology, speech is converted into a character string using a speech recognition program. For example, it is used for character input to a Web page that requires character input such as a daily business report, or creation of document data such as academic papers. In such a speech recognition program, a speech dictionary for converting speech into a character string is used. For example, a standard speech dictionary (hereinafter referred to as “standard dictionary”, which corresponds to the “first speech dictionary” of the present invention) included in the speech recognition program is “ Character strings such as “regret”, “public”, and “voyage” are stored together with the priority order to be converted. The voice recognition program determines a character string corresponding to the recognized voice according to the priority order of the standard dictionary.

したがって、音声認識プログラムの使用者は、音声が使用者の意図する文字に変換されるよう、標準辞書に設定されている優先順位を調整する必要がある。例えば、対象が海洋分野であれば「航海」の優先順位を上げる等の作業が必要となる。 Therefore, the user of the voice recognition program needs to adjust the priority order set in the standard dictionary so that the voice is converted into the character intended by the user. For example, if the target is the marine field, work such as raising the priority of “voyage” is required.

本発明のプログラムは、音声認識プログラムとともにコンピュータ（音声辞書作成装置）にインストールされて用いられる。そして、このような音声辞書作成装置を用いれば、音声認識させて文字入力を行おうとしている分野に関連するサイトのＷｅｂページのＵＲＬを読込ＵＲＬ記憶部に記憶させておくだけで、そのＷｅｂページにおける文字列の出現頻度に応じた優先順位が設定された音声辞書（以後、「ユーザ辞書」と称する。本発明の「第二の音声辞書」に該当。）を得ることができる。 The program of the present invention is installed and used in a computer (voice dictionary creation device) together with a voice recognition program. If such a speech dictionary creation device is used, the URL of a web page of a site related to the field in which character recognition is performed by voice recognition is simply stored in the read URL storage unit. A speech dictionary (hereinafter referred to as “user dictionary”, which corresponds to the “second speech dictionary” of the present invention) in which priorities are set in accordance with the appearance frequency of the character string in FIG.

なお、ユーザ辞書は、標準辞書とは別の新たな辞書として作成されることとしてもよいし、標準辞書を更新することにより作成されることとしてもよい。また、第一の音声辞書は、音声認識プログラムの標準辞書に限らず、文字列と読みが対応付けられているものであればよい。例えば、仮名漢字変換プログラムが備える仮名漢字変換用の辞書を第一の音声辞書として用いることも可能である。また、生成されたユーザ辞書は、音声認識における文字変換に限らず、仮名漢字変換においても使用可能である。 Note that the user dictionary may be created as a new dictionary different from the standard dictionary, or may be created by updating the standard dictionary. Further, the first speech dictionary is not limited to the standard dictionary of the speech recognition program, but may be any one as long as a character string and a reading are associated with each other. For example, a kana-kanji conversion dictionary provided in the kana-kanji conversion program can be used as the first speech dictionary. The generated user dictionary can be used not only for character conversion in speech recognition but also for kana-kanji conversion.

さらに、本発明のプログラムは、前記出現頻度記憶部に記憶されている前記文字列データのうち、前記第一の音声辞書記憶部に記憶されていない前記文字列データをディスプレイやプリンタ等の所定の出力装置に出力する手順と、前記コンピュータが備える所定の入力装置から入力される、前記出力装置に出力された前記文字列データによって表される文字列の読みを示す前記読みデータを受け付け、当該文字列データ及び当該読みデータを対応付けて前記メモリに設けられた記憶領域である読みデータ記憶部に記憶する手順と、を更に備え、前記出現頻度記憶部に記憶されている前記文字列データに対応する前記読みデータを取得する前記手順は、前記出現頻度記憶部に記憶されている前記文字列データに対応する前記読みデータを前記第一の音声辞書記憶部または前記読みデータ記憶部から取得する手順であることとしてもよい。 Furthermore, the program according to the present invention is configured to store the character string data that is not stored in the first phonetic dictionary storage unit among the character string data stored in the appearance frequency storage unit. Receiving the reading data indicating the reading of the character string represented by the procedure of outputting to the output device and the character string data output from the predetermined input device included in the computer and output to the output device; And a procedure for associating the column data and the reading data with each other and storing them in the reading data storage unit that is a storage area provided in the memory, and corresponding to the character string data stored in the appearance frequency storage unit The step of acquiring the reading data includes the reading data corresponding to the character string data stored in the appearance frequency storage unit. Of it may be a procedure for acquiring a speech dictionary storage unit or the read data storage unit.

取得したＷｅｂページ中に、標準辞書に登録されていない文字列（以後、「未登録語」と称する。）が存在する場合、その文字列の読みを登録する必要がある。そこで、音声辞書作成装置によって未登録語がディスプレイ等の出力装置に出力され、ユーザが未登録語の読みをキーボードやマイク等の入力装置から入力する。そして、音声辞書作成装置は、未登録語も含めて同音異義語の優先順位を設定したユーザ辞書を生成する。このように、未登録語が多く存在する新しい技術分野等においても、文字列の出現頻度に応じた優先順位が設定されたユーザ辞書を得ることができ、音声を意図した文字列に変換することができる確率を高めることができる。 If there is a character string that is not registered in the standard dictionary (hereinafter referred to as “unregistered word”) in the acquired Web page, it is necessary to register the reading of the character string. Therefore, the unregistered word is output to an output device such as a display by the speech dictionary creation device, and the user inputs reading of the unregistered word from an input device such as a keyboard or a microphone. Then, the speech dictionary creation device creates a user dictionary in which the priority order of homonyms including unregistered words is set. In this way, even in a new technical field where there are many unregistered words, it is possible to obtain a user dictionary with a priority set according to the appearance frequency of the character string, and to convert the speech into a character string intended Can increase the probability of being able to.

また、本発明のプログラムは、Ｗｅｂページに対応する画像や文字列等をディスプレイに表示するソフトウェアであるＷｅｂブラウザがユーザから指定されたＷｅｂページのＵＲＬを記憶する記憶領域であるＵＲＬ記憶部に記憶されている前記ＵＲＬを取得する手順と、前記ＵＲＬ記憶部から取得した前記ＵＲＬを前記読込ＵＲＬ記憶部に記憶する手順と、を更に備えることとしてもよい。 The program of the present invention is stored in a URL storage unit, which is a storage area for storing a URL of a Web page designated by a user by a Web browser, which is software that displays an image, a character string, or the like corresponding to the Web page on a display. It is also possible to further include a procedure for acquiring the URL that has been stored and a procedure for storing the URL acquired from the URL storage unit in the read URL storage unit.

一般的に、Ｗｅｂブラウザは、ユーザからの指定により特定のＷｅｂページのＵＲＬを記憶しておく記憶部（以後、「ＵＲＬ記憶部」と称する。）を備えている。ユーザは、自分の業務や研究等に関連するＷｅｂページのＵＲＬを、このＵＲＬ記憶部に登録しておくことが多い。そこで、このＵＲＬ記憶部に記憶されているＵＲＬを本発明における読込ＵＲＬ記憶部に記憶することで、ユーザは音声辞書作成用に別途ＵＲＬを登録することなく、業務や研究等に関連する分野のＷｅｂページにおける文字列の出現頻度に応じた優先順位が設定された音声辞書を得ることができる。 In general, a Web browser includes a storage unit (hereinafter referred to as “URL storage unit”) that stores a URL of a specific Web page as designated by a user. In many cases, a user registers a URL of a Web page related to his / her work or research in the URL storage unit. Therefore, by storing the URL stored in the URL storage unit in the read URL storage unit according to the present invention, the user does not register a separate URL for creating a speech dictionary, but in a field related to business or research. It is possible to obtain a speech dictionary in which priorities are set according to the appearance frequency of character strings on a Web page.

さらに、本発明のプログラムは、前記読込ＵＲＬ記憶部が記憶する複数の前記ＵＲＬのうち、ユーザによって指定される前記ＵＲＬを受け付ける手順を更に備え、前記ＵＲＬ記憶部に記憶されている前記ＵＲＬを取得する前記手順は、前記ＵＲＬ記憶部に記憶されている前記ＵＲＬのうち、ユーザによって指定される前記ＵＲＬを前記ＵＲＬ記憶部から取得する手順であることとしてもよい。 Furthermore, the program of the present invention further includes a procedure of accepting the URL specified by the user among the plurality of URLs stored in the read URL storage unit, and acquires the URL stored in the URL storage unit The procedure to be performed may be a procedure for acquiring, from the URL storage unit, the URL specified by a user among the URLs stored in the URL storage unit.

ＵＲＬ記憶部には、多様な分野のＷｅｂページのＵＲＬが登録されている場合がある。そこで、ユーザは、ＵＲＬ記憶部に登録されているＵＲＬの中から、ユーザが文字入力を行おうとしている分野に合うＷｅｂページのＵＲＬを指定する。なお、ユーザがＵＲＬ記憶部に分野ごとのフォルダを設けている場合は、そのフォルダを選択することで、そのフォルダの中に格納されている複数のＵＲＬが指定されるようにすることもできる。 There are cases where URLs of Web pages in various fields are registered in the URL storage unit. Therefore, the user designates the URL of the Web page that matches the field in which the user is trying to input characters from the URLs registered in the URL storage unit. When the user has a folder for each field in the URL storage unit, a plurality of URLs stored in the folder can be specified by selecting the folder.

このように、Ｗｅｂブラウザが備えるＵＲＬ記憶部に多様な分野のＷｅｂページのＵＲＬが登録されている場合においても、ユーザが文字入力を行おうとしている分野に応じた優先順位が設定された音声辞書を得ることができる。また、ユーザがＵＲＬを分野別に整理してＵＲＬ記憶部に登録しておくことにより、目的とする分野に応じた音声辞書を容易に得ることができる。 Thus, even when URLs of Web pages in various fields are registered in the URL storage unit provided in the Web browser, a speech dictionary in which priorities are set according to the fields in which the user is trying to input characters. Can be obtained. In addition, when the user organizes URLs by field and registers them in the URL storage unit, a speech dictionary corresponding to the target field can be easily obtained.

また、Ｗｅｂページに設定された他のＷｅｂページへのリンクを辿る階層数を示すリンク階層データが前記メモリに設けられた記憶領域であるリンク階層記憶部に記憶され、前記読込ＵＲＬ記憶部に記憶されている前記ＵＲＬで指定される前記Ｗｅｂページを取得する前記手順には、取得した前記Ｗｅｂページを構成しているデータに含まれている他のＷｅｂページへのリンクを示すＵＲＬであるリンク先ＵＲＬを抽出して当該リンク先ＵＲＬで指定される前記他のＷｅｂページを取得する処理を、前記リンク階層記憶部に記憶されている前記リンク階層データによって示される前記階層数に応じて繰り返し実行する手順が含まれていることとしてもよい。 In addition, link hierarchy data indicating the number of hierarchies that follow links to other Web pages set in the Web page is stored in a link hierarchy storage unit that is a storage area provided in the memory, and stored in the read URL storage unit In the procedure for acquiring the Web page specified by the URL that is specified, a link destination that is a URL indicating a link to another Web page included in the data constituting the acquired Web page The process of extracting the URL and acquiring the other Web page specified by the link destination URL is repeatedly executed according to the number of levels indicated by the link level data stored in the link level storage unit. Procedures may be included.

Ｗｅｂページには、そのＷｅｂページに関連するＷｅｂページのリンクが設定されていることが多い。そこで、音声辞書作成装置は、読込ＵＲＬ記憶部に記憶されているＵＲＬが示すＷｅｂページを取得する際に、そのＷｅｂページに設定されたリンクを辿って別のＷｅｂページを取得する。ここで、リンクを無限に辿ると、あらゆる分野のＷｅｂページが取得されてしまい、目的とする分野に応じた音声辞書を作成することができない。逆に、リンクを辿る階層数を少なくすると、取得できる文字列数が限られ、実用的な音声辞書を作成することができない。そこで、ユーザが、リンクを辿る階層数をリンク階層記憶部に設定しておくことにより、ユーザが目的とする分野に応じた音声辞書を作成することができる。 In many cases, a web page link related to the web page is set in the web page. Therefore, when acquiring the Web page indicated by the URL stored in the read URL storage unit, the speech dictionary creating apparatus acquires another Web page by following the link set for the Web page. Here, if the link is traced infinitely, Web pages in all fields are acquired, and a speech dictionary corresponding to the target field cannot be created. Conversely, if the number of layers following the link is reduced, the number of character strings that can be acquired is limited, and a practical speech dictionary cannot be created. Therefore, when the user sets the number of layers following the link in the link layer storage unit, a speech dictionary corresponding to the field intended by the user can be created.

さらに、前記リンク階層記憶部には、前記読込ＵＲＬ記憶部に記憶されている前記ＷｅｂページのＵＲＬごとに前記リンク階層データが記憶され、前記他のＷｅｂページを取得する前記処理を、前記階層数に応じて繰り返し実行する前記手順は、前記他のＷｅｂページを取得する前記処理を、前記ＵＲＬ記憶部に記憶されている前記ＵＲＬに対応する前記リンク階層データによって示される前記階層数に応じて繰り返し実行する手順であることとしてもよい。 Further, the link hierarchy storage unit stores the link hierarchy data for each URL of the Web page stored in the read URL storage unit, and the process of acquiring the other Web page is performed according to the number of layers. The procedure of repeatedly executing the process according to the process repeats the process of acquiring the other Web page according to the number of levels indicated by the link level data corresponding to the URL stored in the URL storage unit. It may be a procedure to be executed.

読込ＵＲＬ記憶部に記憶されているＵＲＬが示すＷｅｂページの特性に応じて、リンクを辿る階層数を指定することができる。例えば、ある分野の用語集のＷｅｂページであるような場合、そのＷｅｂページには、ユーザが目的とする分野の文字列が多く含まれており、リンクを更に辿る必要がない場合がある。一方、例えば、ある分野の総合サイトの入口にあるＷｅｂページ（トップページ）であるような場合、そのＷｅｂページに設定されたリンクを辿ることにより、ユーザが目的とする分野の文字列を得ることができる。このように、読込ＵＲＬ記憶部に記憶されているＵＲＬが示すＷｅｂページごとに、リンクを辿る階層数を指定可能とすることで、目的とする分野に一段と合致した音声辞書を作成することができる。 The number of hierarchies to follow the link can be designated according to the characteristics of the Web page indicated by the URL stored in the read URL storage unit. For example, in the case of a web page of a glossary of a certain field, there are cases where the web page includes a large number of character strings in the field targeted by the user, and there is no need to follow the link further. On the other hand, for example, when it is a Web page (top page) at the entrance of a general site in a certain field, the user can obtain a character string in the target field by following the link set in the Web page. Can do. In this way, by making it possible to specify the number of layers following the link for each Web page indicated by the URL stored in the read URL storage unit, it is possible to create a speech dictionary that further matches the target field. .

また、本発明のプログラムが、Ｗｅｂページを表示するソフトウェアが備える表示するＷｅｂページのＵＲＬを入力するエリアに入力されたＵＲＬを取得し、取得した当該ＵＲＬを前記読込ＵＲＬ記憶部に記憶する手順を更に備えることとしてもよい。 In addition, the program of the present invention acquires a URL input to an area for inputting a URL of a Web page to be displayed provided in software for displaying the Web page, and stores the acquired URL in the read URL storage unit. Further, it may be provided.

例えば、ユーザが音声認識を利用して文字入力を行おうとしているＷｅｂページが、営業日報の入力ページである場合、ユーザが入力しようとしている文字列は、そのＷｅｂページに含まれている過去の営業日報等に含まれていることが多い。このような場合、ＷｅｂブラウザのＵＲＬ入力エリアに入力されたＵＲＬを読込ＵＲＬ記憶部に記憶することにより、ユーザが文字入力を行おうとする際に、別途読込対象のＵＲＬを指定することなく、目的に応じた音声辞書を作成することができる。 For example, when a Web page on which a user is trying to input characters using voice recognition is an input page for a daily business report, the character string that the user is trying to input is a past text included in the Web page. It is often included in business daily reports. In such a case, the URL input in the URL input area of the Web browser is stored in the read URL storage unit, so that when the user tries to input characters, the URL to be read is not specified separately. A voice dictionary can be created according to

１件１件試行錯誤を行って実用に耐え得る音声辞書を作成するよりも、効率的に実用に耐え得る音声辞書を作成することができる。また、適当な長文をもとに音声辞書を作成する手段は従来より存在していたが、インターネットの性質を利用してより高い精度で音声辞書の作成を行うことができる。さらに、迅速な音声辞書作成が行えることから、既存のサイトに対して音声認識機能を付加することが容易に行えるようになる。 Rather than creating a speech dictionary that can withstand practical use by performing trial and error on a case-by-case basis, a speech dictionary that can withstand practical use can be created efficiently. In addition, there has been a means for creating a speech dictionary based on an appropriate long sentence, but it is possible to create a speech dictionary with higher accuracy by utilizing the nature of the Internet. Furthermore, since a quick voice dictionary can be created, a voice recognition function can be easily added to an existing site.

このように、本発明によれば、実用に耐え得る音声辞書を効率的に作成する音声辞書作成装置、音声辞書作成方法、及びプログラムを提供することができる。 Thus, according to the present invention, it is possible to provide a speech dictionary creation device, a speech dictionary creation method, and a program that efficiently create a speech dictionary that can be practically used.

＝＝＝第一の実施形態＝＝＝
図１は本発明の実施例の構成図であり、インターネット上に文字入力を受け付ける文字入力サイトが存在しており、インターネット網に接続されたインターネット接続装置がこの文字入力サイトに接続されており、インターネット接続装置に音声認識装置が付加されていて文字入力サイトに音声認識された文字を送信できるようになっており、インターネット網に接続された読込処理部が文字入力を行おうとするサイトまたはそのサイトからリンクされたサイトまたはその他特に関連が深いとして使用者が指定したサイトを読み込み、そのデータを文字列抽出部に受け渡し、文字列抽出部は、受け渡されたデータを単語等の音声認識に使用する文字列として解析してその結果を音声辞書作成部に受け渡し、音声辞書作成部は文字列抽出部から受け渡されたデータに従って、単語の使用頻度を調整したり未知語を追加するなどの処理を行って音声辞書を作成して音声認識装置の音声辞書を作成することで効率的な音声認識による文字入力処理を行うものである。 === First Embodiment ===
FIG. 1 is a configuration diagram of an embodiment of the present invention, where there is a character input site that accepts character input on the Internet, and an Internet connection device connected to the Internet network is connected to the character input site. A site where a speech recognition device is added to the Internet connection device so that the speech-recognized characters can be sent to the character input site, and the site where the reading processing unit connected to the Internet network attempts to input characters or the site Sites linked from or other sites that are specified by the user as particularly relevant are read, and the data is passed to the character string extractor, which uses the received data for speech recognition of words, etc. And the result is passed to the speech dictionary creation unit. The speech dictionary creation unit Characters by efficient speech recognition by creating a speech dictionary by adjusting the frequency of use of words or adding unknown words according to the passed data and creating a speech dictionary of the speech recognition device Input processing is performed.

一般に、音声入力を行おうとするときに問題になるのは、それぞれのサイトによって頻出語、業界用語、慣用句などの用例が異なることが原因である。しかし、これらの言葉はすでに音声入力を行おうとするサイト、またはそのリンク先のサイトにほとんど記述してあるものである。たとえば、船舶業界では「こうかい」は「航海」「公海」として用いられる例が多いはずであり「公開」「更改」などの用例は少なくなるはずである。そこで、「こうかいする」であれば「航海する」となり「公海する」はおかしいので、係り受けの解析から「航海する」が正しい音声認識である確率が高い者と判断できる。そして、インターネットの性質上、すでに文字入力しようとしているサイトまたはリンクされたサイトであれば、やはり船舶業界関連の内容が記述されているはずであり上記のような用例を容易に見出し得る。また、船舶業界で著名なサイトがあれば、そのようなサイトも音声辞書作成に有益であるから必要に応じてそのデータを取得すべきである。また、掲示板のようなサイトにおいては、書き込みしようとする者の前の書き込みを受けたかたちで内容の書き込みが行われるケースが多いはずであり、必然的に音声認識に必要な単語がすでに掲示板サイトに出現していると思われるのである。このようなインターネット自体が有する性質を利用して、読込処理部がインターネットからデータを取得することで結果的に効率的な音声辞書を作成することが可能になるのである。 In general, the problem that arises when trying to input speech is that the usage of frequently used words, industry terms, idioms, etc. differ from site to site. However, these words are mostly written on the site where the voice input is to be made or the linked site. For example, in the marine industry, “Koukai” should be used as “voyage” and “high seas” and there should be fewer examples such as “open” and “renewal”. Therefore, if it is “Like”, it will be “Navigating” and “Going to sea” will be strange, so it can be judged from the analysis of the dependency that the person who has a high probability of “Navigating” is correct speech recognition. In addition, due to the nature of the Internet, if it is a site where characters are already input or a linked site, the contents related to the ship industry should be described, and the above example can be easily found. In addition, if there is a famous site in the ship industry, such a site is also useful for creating a speech dictionary, so that data should be acquired as necessary. In addition, in a site such as a bulletin board, there should be many cases where the content is written in the form of receiving the previous writing of the person trying to write, and the words necessary for speech recognition are already already in the bulletin board site. It seems to have appeared in. Utilizing such properties of the Internet itself, the reading processing unit acquires data from the Internet, and as a result, an efficient speech dictionary can be created.

＝＝＝第二の実施形態＝＝＝
＝＝全体構成＝＝
次に、本発明における第二の実施形態について説明する。図２は、第二の実施形態のネットワーク構成を示す図である。音声辞書作成装置１は、本発明における音声辞書作成プログラムを実行する装置であり、パーソナルコンピュータや音声認識に用いられる専用の装置等のコンピュータである。音声辞書作成装置１は、インターネットやイントラネット等のネットワーク２を介して複数のＷｅｂサーバ３ａ，３ｂ，３ｃと通信することができる。 === Second Embodiment ===
== Overall structure ==
Next, a second embodiment of the present invention will be described. FIG. 2 is a diagram illustrating a network configuration according to the second embodiment. The speech dictionary creation device 1 is a device that executes the speech dictionary creation program according to the present invention, and is a computer such as a personal computer or a dedicated device used for speech recognition. The speech dictionary creation device 1 can communicate with a plurality of Web servers 3a, 3b, 3c via a network 2 such as the Internet or an intranet.

Ｗｅｂサーバ３ａにはＨＴＭＬ（Hyper Text Markup Language）やＸＭＬ（eXtensible Markup Language）等のマークアップ言語で記述されたデータであるＷｅｂページ４ａが保存されている。そして、音声辞書作成装置１はＨＴＴＰ（Hyper Text Transfer Protocol）に則ってネットワーク２を介してＷｅｂサーバ３ａにアクセスすることで、Ｗｅｂページ４ａを取得することができる。図３に、Ｗｅｂページ４ａの一例を示す。Ｗｅｂページ４ａには、“<HTML>”や“<TITLE>”等のタグや、Ｗｅｂブラウザに表示される文書である文書データ５等が含まれている。また、Ｗｅｂページ４ａには、“<A HREF=”＠＠＠”></A>”というタグを用いて他のＷｅｂページへのリンク６が設定されている。「＠＠＠」の部分は、リンク先のＷｅｂページのＵＲＬを示すものである。 The Web server 3a stores a Web page 4a that is data described in a markup language such as HTML (Hyper Text Markup Language) or XML (eXtensible Markup Language). Then, the speech dictionary creation device 1 can acquire the Web page 4a by accessing the Web server 3a via the network 2 in accordance with HTTP (Hyper Text Transfer Protocol). FIG. 3 shows an example of the Web page 4a. The Web page 4a includes tags such as “<HTML>” and “<TITLE>”, document data 5 that is a document displayed on the Web browser, and the like. Further, in the Web page 4a, a link 6 to another Web page is set using a tag "<A HREF="@@@"> </A>". The part “@@@” indicates the URL of the linked web page.

なお、Ｗｅｂブラウザとは、Ｗｅｂページを表示するソフトウェアである。図４は、Ｗｅｂページ４ａを表示したＷｅｂブラウザ７を示す図である。Ｗｅｂブラウザ７は、表示するＷｅｂページのＵＲＬを入力するＵＲＬ入力エリア８および文書データ５や画像データ等が表示される表示エリア９が設けられている。ＵＲＬ入力エリア８にＷｅｂページ４ａのＵＲＬが入力されると、音声辞書作成装置１はＷｅｂサーバ３ａからＷｅｂページ４ａを取得し、Ｗｅｂページ４ａに含まれる文書データ５等を表示エリア９に表示する。表示エリア９には、リンク６も表示されており、ユーザがリンク６をクリックすると、音声辞書作成装置１はリンク先のＷｅｂページを取得する。 A web browser is software that displays a web page. FIG. 4 is a diagram showing the web browser 7 displaying the web page 4a. The Web browser 7 is provided with a URL input area 8 for inputting the URL of a Web page to be displayed and a display area 9 for displaying document data 5 and image data. When the URL of the Web page 4a is input to the URL input area 8, the speech dictionary creation device 1 acquires the Web page 4a from the Web server 3a and displays the document data 5 and the like included in the Web page 4a in the display area 9. . A link 6 is also displayed in the display area 9, and when the user clicks on the link 6, the speech dictionary creation device 1 acquires a linked Web page.

このように、Ｗｅｂページに設定されたリンクを辿ることにより、複数のＷｅｂページを取得することができる。図２の例は、Ｗｅｂページ４ａにはＷｅｂサーバ３ｂに保存されたＷｅｂページ４ｂへのリンクが設定され、Ｗｅｂページ４ｂにはＷｅｂサーバ３ｃに保存されたＷｅｂページ４ｃへのリンクが設定されていることを示している。 In this way, a plurality of Web pages can be acquired by following the links set in the Web pages. In the example of FIG. 2, a link to the Web page 4b stored in the Web server 3b is set in the Web page 4a, and a link to the Web page 4c stored in the Web server 3c is set in the Web page 4b. It shows that.

＝＝ハードウェア構成＝＝
図５は、音声辞書作成装置１のハードウェア構成を示す図である。音声辞書作成装置１は、ＣＰＵ１０、メモリ１１、記憶装置１２、記録媒体読取装置１３、通信インタフェース１４、入力装置１５、出力装置１６、及びマイク１７を備えている。 == Hardware configuration ==
FIG. 5 is a diagram illustrating a hardware configuration of the speech dictionary creation device 1. The speech dictionary creation device 1 includes a CPU 10, a memory 11, a storage device 12, a recording medium reading device 13, a communication interface 14, an input device 15, an output device 16, and a microphone 17.

ＣＰＵ１０は、音声辞書作成装置１の全体を制御するものであり、記憶装置１２に記憶されているプログラムを順次メモリ１１に格納し、メモリ１１に記憶されているプログラムを実行することができる。記憶装置１２は、ハードディスクドライブ等であり、プログラムやデータが記憶される。記録媒体読取装置１３は、ＣＤ−ＲＯＭ等の記録媒体１８に記録されているプログラムを読みとり、記憶装置１２に格納することができる。 The CPU 10 controls the entire speech dictionary creation device 1, and can sequentially store programs stored in the storage device 12 in the memory 11 and execute the programs stored in the memory 11. The storage device 12 is a hard disk drive or the like, and stores programs and data. The recording medium reading device 13 can read a program recorded on a recording medium 18 such as a CD-ROM and store it in the storage device 12.

通信インタフェース１４は、ネットワーク２を介してＷｅｂサーバ３ａ，３ｂ，３ｃとデータの送受信を行うためのインタフェースである。入力装置１５は、ユーザからのデータ入力を受け付ける装置であり、例えば、キーボードやマウス等である。出力装置１６は、ＣＰＵ１０の処理結果等を出力する装置であり、例えば、ディスプレイやプリンタ等である。マイク１７はユーザの発話により発生する音声を取得する装置である。なお、入力装置１５、出力装置１６、及びマイク１７は、音声辞書作成装置１と別体で構成され、ケーブル等を用いて音声辞書作成装置１と接続されることとしてもよい。 The communication interface 14 is an interface for transmitting / receiving data to / from the Web servers 3a, 3b, 3c via the network 2. The input device 15 is a device that receives data input from a user, and is, for example, a keyboard or a mouse. The output device 16 is a device that outputs a processing result of the CPU 10, and is, for example, a display or a printer. The microphone 17 is a device that acquires sound generated by the user's utterance. The input device 15, the output device 16, and the microphone 17 may be configured separately from the speech dictionary creation device 1 and connected to the speech dictionary creation device 1 using a cable or the like.

＝＝機能構成＝＝
図６は、音声辞書作成装置１が備える機能の構成を示す図である。音声辞書作成装置１は、音声認識機能部２０、音声辞書作成機能部３０、及び記憶部４０を備えている。 == Functional configuration ==
FIG. 6 is a diagram illustrating a configuration of functions provided in the speech dictionary creation device 1. The speech dictionary creation device 1 includes a speech recognition function unit 20, a speech dictionary creation function unit 30, and a storage unit 40.

音声認識機能部２０は、音声辞書作成装置１のメモリ１１に記憶されている音声認識プログラムをＣＰＵ１０が実行することにより実現される、音声取得部２１、音声認識部２２、及び文字列変換部２３を備えている。 The voice recognition function unit 20 is realized by the CPU 10 executing a voice recognition program stored in the memory 11 of the voice dictionary creation device 1. The voice acquisition unit 21, the voice recognition unit 22, and the character string conversion unit 23 are realized. It has.

また、音声辞書作成機能部３０は、音声辞書作成装置１のメモリ１１に記憶されている音声辞書作成プログラムをＣＰＵ１０が実行することにより実現される、読込処理部３１、文字列抽出部３２、未登録語出力部３３、読みデータ受付部３４、音声辞書作成部３５、及び入力ＵＲＬ取得部３６を備えている。 The speech dictionary creation function unit 30 is realized by the CPU 10 executing the speech dictionary creation program stored in the memory 11 of the speech dictionary creation device 1. The speech processing unit 31, the character string extraction unit 32, A registered word output unit 33, a reading data reception unit 34, a speech dictionary creation unit 35, and an input URL acquisition unit 36 are provided.

そして、記憶部４０は、メモリ１１に設けられた記憶領域であり、標準辞書記憶部（本発明の「第一の音声辞書記憶部」に該当。）４１、ユーザ辞書記憶部（本発明の「第二の音声辞書記憶部」に該当。）４２、読込ＵＲＬ記憶部４３、リンク階層記憶部４４、出現頻度記憶部４５、及び読みデータ記憶部４６を含んでいる。 The storage unit 40 is a storage area provided in the memory 11, and includes a standard dictionary storage unit (corresponding to the “first speech dictionary storage unit” of the present invention) 41, a user dictionary storage unit (“ Corresponding to the “second voice dictionary storage unit”.

音声取得部２１は、マイク１７から入力された音声データを取得する。音声認識部２２は、音声取得部２１が取得した音声データによって示される文字を認識して文字データを生成する。文字列変換部２３は、音声認識部２２が生成した複数の文字データから平仮名の文字列を生成し、平仮名の文字列を漢字や片仮名等の文字列に変換する。 The voice acquisition unit 21 acquires voice data input from the microphone 17. The voice recognition unit 22 recognizes a character indicated by the voice data acquired by the voice acquisition unit 21 and generates character data. The character string conversion unit 23 generates a hiragana character string from a plurality of character data generated by the speech recognition unit 22 and converts the hiragana character string into a character string such as kanji or katakana.

なお、文字列変換部２３は、文字列を変換する際に、標準辞書記憶部４１に記憶されている標準辞書またはユーザ辞書記憶部４２に記憶されているユーザ辞書を用いることができる。標準辞書とは、音声認識プログラムをインストールした際等に作成される、基本となる音声辞書である。また、ユーザ辞書とは、ユーザが文字列の追加・変更や、平仮名の文字列に対応する漢字や片仮名の文字列の候補が複数ある場合に、漢字や片仮名の文字列の夫々が選択される順序（優先順位）の変更等を行うことができる音声辞書である。なお、標準辞書を上書き更新してユーザ辞書を生成することも可能である。 The character string conversion unit 23 can use a standard dictionary stored in the standard dictionary storage unit 41 or a user dictionary stored in the user dictionary storage unit 42 when converting a character string. A standard dictionary is a basic speech dictionary created when a speech recognition program is installed. The user dictionary is selected when a user adds or changes a character string or when there are a plurality of candidate kanji or katakana character strings corresponding to a hiragana character string. This is a speech dictionary that can be changed in order (priority order). It is also possible to create a user dictionary by overwriting and updating the standard dictionary.

図７は、標準辞書記憶部４１およびユーザ辞書記憶部４２の構造を示す図である。標準辞書記憶部４１およびユーザ辞書記憶部４２には、文字列データ、読みデータ、及び優先順位データが対応付けられたデータである音声辞書データが記憶されている。文字列データには、漢字や片仮名の文字列が設定され、読みデータには、その文字列の読みを示す平仮名の文字列が設定されている。そして、優先順位データには、文字列変換部２３が平仮名の文字列を漢字や片仮名の文字列に変換する際の優先順位を示すデータが設定されている。 FIG. 7 is a diagram showing the structure of the standard dictionary storage unit 41 and the user dictionary storage unit 42. The standard dictionary storage unit 41 and the user dictionary storage unit 42 store voice dictionary data, which is data in which character string data, reading data, and priority order data are associated with each other. A character string of kanji or katakana is set in the character string data, and a hiragana character string indicating the reading of the character string is set in the reading data. In the priority order data, data indicating the priority order when the character string conversion unit 23 converts a hiragana character string into a kanji or katakana character string is set.

図７に示す例では、「こうかい」という読みに対して「航海」、「公開」、「後悔」の優先順位が順に「１」、「２」、「３」となっている。この場合、文字列変換部２３は、「こうかい」という平仮名の文字列を優先順位が最も高い「航海」という漢字の文字列に変換する。 In the example shown in FIG. 7, the priority of “voyage”, “public”, and “regret” is “1”, “2”, and “3” in order with respect to the reading “Koukai”. In this case, the character string conversion unit 23 converts the character string of hiragana “Koukai” into the character string of kanji “voyage” having the highest priority.

読込処理部３１は、読込ＵＲＬ記憶部４３に記憶されているＷｅｂページのＵＲＬを取得し、当該ＵＲＬが示すＷｅｂページをＷｅｂサーバから取得する。図８は、読込ＵＲＬ記憶部４３の構造を示す図である。読込ＵＲＬ記憶部４３には、ユーザによって指定されたＷｅｂページのＵＲＬが記憶されている。読込ＵＲＬ記憶部４３に記憶されているＵＲＬは、一つであってもよいし、複数であってもよい。 The read processing unit 31 acquires the URL of the Web page stored in the read URL storage unit 43, and acquires the Web page indicated by the URL from the Web server. FIG. 8 is a diagram illustrating the structure of the read URL storage unit 43. The read URL storage unit 43 stores the URL of the Web page designated by the user. The URL stored in the read URL storage unit 43 may be one or plural.

また、読込処理部３１は、取得したＷｅｂページに設定されているリンクを取得し、当該リンクに設定されたＵＲＬが示す他のＷｅｂページを取得する。このように、読込処理部３１は、Ｗｅｂページに設定されたリンクを辿って複数のＷｅｂページを取得する。図９は、リンク階層記憶部４４の構造を示す図である。リンク階層記憶部４４には、読込ＵＲＬ記憶部４３に記憶されているＵＲＬごとに、リンクを辿る階層数を示すデータであるリンク階層データが設定されている。なお、リンク階層記憶部４４には、読込ＵＲＬ記憶部４３に記憶されている全てのＵＲＬに共通のリンク階層データが一つだけ記憶されていることとしてもよい。 Further, the read processing unit 31 acquires a link set in the acquired Web page, and acquires another Web page indicated by the URL set in the link. In this manner, the read processing unit 31 acquires a plurality of Web pages by following the links set in the Web page. FIG. 9 is a diagram illustrating the structure of the link hierarchy storage unit 44. In the link hierarchy storage unit 44, for each URL stored in the read URL storage unit 43, link hierarchy data, which is data indicating the number of layers following the link, is set. Note that the link hierarchy storage unit 44 may store only one link hierarchy data common to all URLs stored in the read URL storage unit 43.

文字列抽出部３２は、読込処理部３１が取得したＷｅｂページを構成するデータからタグを除去して得られる文書データから、意味のある単位に分解した文字列データを抽出する。文書データから文字列データを抽出する方法としては、例えば、特開２０００−２５９６７３号公報に開示されている単語分割方法を用いることができる。そして、文字列抽出部３２は、文字列データごとの出現回数をカウントし、文字列データと出現回数を示すデータである出現頻度データとを対応付けて出現頻度記憶部４５に記憶する。図１０は、出現頻度記憶部４５の構造を示す図である。 The character string extraction unit 32 extracts character string data decomposed into meaningful units from document data obtained by removing tags from the data constituting the Web page acquired by the read processing unit 31. As a method for extracting character string data from document data, for example, a word division method disclosed in Japanese Patent Laid-Open No. 2000-259673 can be used. Then, the character string extraction unit 32 counts the number of appearances for each character string data, and stores the character string data in the appearance frequency storage unit 45 in association with appearance frequency data that is data indicating the number of appearances. FIG. 10 is a diagram illustrating the structure of the appearance frequency storage unit 45.

未登録語出力部３３は、出現頻度記憶部４５に記憶されている文字列データのうち、標準辞書に登録されていない文字列データをディスプレイ等の出力装置１６に出力する。読みデータ受付部３４は、ユーザによって入力装置１５やマイク１７から入力される、出力装置１６に出力された文字列データの読みを示す読みデータを受け付け、当該文字列データと当該読みデータとを対応付けて読みデータ記憶部４６に記憶する。図１１は、読みデータ記憶部４６の構造を示す図である。 The unregistered word output unit 33 outputs character string data that is not registered in the standard dictionary among the character string data stored in the appearance frequency storage unit 45 to the output device 16 such as a display. The reading data receiving unit 34 receives reading data indicating reading of character string data output to the output device 16 input from the input device 15 or the microphone 17 by the user, and associates the character string data with the reading data. At the same time, it is stored in the reading data storage unit 46. FIG. 11 is a diagram illustrating the structure of the reading data storage unit 46.

音声辞書作成部３５は、出現頻度記憶部４５に記憶されている文字列データに対応する読みデータを、標準辞書記憶部４１または読みデータ記憶部４６から取得し、読みデータが同一の文字列データの夫々について、出現頻度データに基づいて優先順位を決定する。つまり、音声辞書作成部３５は、読みデータが同一の文字列データに対して、出現回数が多い順に高い優先順位を設定する。そして、音声辞書作成部３５は、文字列データ、読みデータ、及び湯煎順位を示すデータである優先順位データを対応付けて音声辞書データを生成し、当該音声辞書データをユーザ辞書記憶部４２に記憶する。 The speech dictionary creation unit 35 acquires reading data corresponding to the character string data stored in the appearance frequency storage unit 45 from the standard dictionary storage unit 41 or the reading data storage unit 46, and character string data having the same reading data. The priority order is determined based on the appearance frequency data. That is, the speech dictionary creation unit 35 sets a higher priority for character string data having the same reading data in descending order of the number of appearances. Then, the speech dictionary creation unit 35 generates speech dictionary data by associating the character string data, the reading data, and the priority order data that is the data indicating the hot water ranking, and stores the speech dictionary data in the user dictionary storage unit 42. To do.

入力ＵＲＬ取得部３６は、Ｗｅｂブラウザ７のＵＲＬ入力エリア８に入力されたＵＲＬを取得し、当該ＵＲＬを読込ＵＲＬ記憶部４３に記憶する。つまり、入力ＵＲＬ取得部３６は、ユーザがＷｅｂブラウザ７に表示させようとしているＷｅｂページに基づいて文字列の優先順位を設定する場合に用いられる。 The input URL acquisition unit 36 acquires a URL input to the URL input area 8 of the Web browser 7 and stores the URL in the read URL storage unit 43. That is, the input URL acquisition unit 36 is used when the priority order of the character strings is set based on the Web page that the user is trying to display on the Web browser 7.

＝＝音声辞書作成処理＝＝
次に、音声辞書作成装置１における音声辞書作成処理について説明する。図１２は、音声辞書作成処理のフローチャートである。まず、読込処理部３１が、読込ＵＲＬ記憶部４３に記憶されているＵＲＬを一つ取得する（Ｓ１２０１）。さらに、読込処理部３１は、当該ＵＲＬに対応するリンク階層データをリンク階層記憶部４４から取得し（Ｓ１２０２）、リンク階層をカウントするための作業用のデータである作業階層データに０を設定する（Ｓ１２０３）。 == Voice dictionary creation process ==
Next, the speech dictionary creation process in the speech dictionary creation device 1 will be described. FIG. 12 is a flowchart of the voice dictionary creation process. First, the read processing unit 31 acquires one URL stored in the read URL storage unit 43 (S1201). Further, the read processing unit 31 acquires the link hierarchy data corresponding to the URL from the link hierarchy storage unit 44 (S1202), and sets 0 in the work hierarchy data that is work data for counting the link hierarchy. (S1203).

続いて、読込処理部３１は、取得したＵＲＬが示すＷｅｂページをＷｅｂサーバ３ａ，３ｂ，３ｃから取得する（Ｓ１２０４）。そして、読込処理部３１は作業階層データがリンク階層データと等しいかどうか確認する（Ｓ１２０５）。 Subsequently, the read processing unit 31 acquires the Web page indicated by the acquired URL from the Web servers 3a, 3b, and 3c (S1204). Then, the read processing unit 31 confirms whether the work hierarchy data is equal to the link hierarchy data (S1205).

作業階層データがリンク階層データと等しくない場合（Ｓ１２０５：Ｎｏ）、読込処理部３１は、取得したＷｅｂページを構成するデータに含まれるリンク先ＵＲＬを取得し（Ｓ１２０６）、当該リンク先ＵＲＬが示すＷｅｂページを取得する（Ｓ１２０７）。そして、読込処理部３１は、作業階層データに１加算し（Ｓ１２０８）、作業階層データとリンク階層データが等しいかどうか確認するステップ（Ｓ１２０５）に戻る。つまり、読込処理部３１は、Ｓ１２０５〜Ｓ１２０８の処理を繰り返すことによって、読込ＵＲＬ記憶部４３に記憶されているＵＲＬが示すＷｅｂページからリンク階層データによって示されるリンク階層の範囲内にあるＷｅｂページを取得する。 When the work hierarchy data is not equal to the link hierarchy data (S1205: No), the read processing unit 31 acquires a link destination URL included in the data constituting the acquired Web page (S1206), and the link destination URL indicates A web page is acquired (S1207). Then, the read processing unit 31 adds 1 to the work hierarchy data (S1208), and returns to the step of confirming whether the work hierarchy data is equal to the link hierarchy data (S1205). In other words, the read processing unit 31 repeats the processes of S1205 to S1208, so that a Web page within the link hierarchy indicated by the link hierarchy data is displayed from the Web page indicated by the URL stored in the read URL storage unit 43. get.

作業階層データがリンク階層データと等しくなると（Ｓ１２０５：Ｙｅｓ）、読込処理部３１は、読込ＵＲＬ記憶部４３に未取得のＵＲＬがあるかどうか確認する（Ｓ１２０９）。未取得のＵＲＬがある間（Ｓ１２０９：Ｙｅｓ）、読込処理部３１は、未取得のＵＲＬを一つ取得し、Ｗｅｂページを取得する一連の処理（Ｓ１２０１〜Ｓ１２０８）を繰り返し実行する。 When the work hierarchy data becomes equal to the link hierarchy data (S1205: Yes), the read processing unit 31 checks whether there is an unacquired URL in the read URL storage unit 43 (S1209). While there is an unacquired URL (S1209: Yes), the read processing unit 31 acquires one unacquired URL and repeatedly executes a series of processes (S1201 to S1208) for acquiring a Web page.

読込処理部３１によって読込ＵＲＬ記憶部のＵＲＬが全て取得されると（Ｓ１２０９：Ｎｏ）、文字列抽出部３２は、読込処理部３１が取得したＷｅｂページを構成するデータに含まれる文字列データを抽出する（Ｓ１２１０）。そして、文字列抽出部３２は、抽出した文字列データごとの出現回数をカウントし、文字列データと出現回数を示す出現頻度データとを対応付けて出現頻度記憶部４５に記憶する（Ｓ１２１１）。 When all the URLs in the read URL storage unit are acquired by the read processing unit 31 (S1209: No), the character string extraction unit 32 converts the character string data included in the data constituting the Web page acquired by the read processing unit 31. Extract (S1210). Then, the character string extraction unit 32 counts the number of appearances for each extracted character string data, and stores the character string data and the appearance frequency data indicating the number of appearances in association in the appearance frequency storage unit 45 (S1211).

続いて、未登録語出力部３３は、出現頻度記憶部４５に記憶されている文字列データのうち、標準辞書記憶部４１に記憶されていない文字列データを抽出し、当該文字列データをディスプレイ等の出力装置１６に出力する（Ｓ１２１２）。そして、読みデータ受付部３４は、ユーザによってキーボード等の入力装置１５やマイク１７から入力される当該文字列データに対する読みデータを受け付け、当該文字列データと当該読みデータとを対応付けて読みデータ記憶部４６に記憶する（Ｓ１２１３）。 Subsequently, the unregistered word output unit 33 extracts character string data not stored in the standard dictionary storage unit 41 from the character string data stored in the appearance frequency storage unit 45, and displays the character string data on the display. To the output device 16 (S1212). The reading data receiving unit 34 receives reading data for the character string data input from the input device 15 such as a keyboard or the microphone 17 by the user, and stores the reading data in association with the character string data and the reading data. The information is stored in the unit 46 (S1213).

次に、音声辞書作成部３５は、出現頻度記憶部４５に記憶されている文字列データに対応する読みデータを標準辞書記憶部４１または読みデータ記憶部４６から取得する（Ｓ１２１４）。そして、音声辞書作成部３５は、読みデータが同じ複数の文字列データに対する優先順位を出現頻度記憶部４５に記憶されている出現頻度データに基づいて決定する（Ｓ１２１５）。最後に、音声辞書作成部３５は、文字列データ、読みデータ、及び優先順位を示すデータである優先順位データを対応付けて音声辞書データを生成し、当該音声辞書データをユーザ辞書記憶部４２に記憶する（Ｓ１２１６）。 Next, the speech dictionary creation unit 35 acquires reading data corresponding to the character string data stored in the appearance frequency storage unit 45 from the standard dictionary storage unit 41 or the reading data storage unit 46 (S1214). Then, the voice dictionary creation unit 35 determines the priority order for a plurality of character string data having the same reading data based on the appearance frequency data stored in the appearance frequency storage unit 45 (S1215). Finally, the speech dictionary creation unit 35 generates speech dictionary data by associating the character string data, the reading data, and the priority order data that indicates the priority order, and stores the speech dictionary data in the user dictionary storage unit 42. Store (S1216).

ここで、出現頻度データに基づいて優先順位を決定する方法の例を説明する。図１３は、優先順位を決定する方法の第一の例を示す図である。まず、音声辞書作成部３５は、出現頻度記憶部４５に記憶されている出現頻度データの降順に各文字列データの優先順位を設定した音声辞書データをユーザ辞書記憶部４２に記憶する（Ｓ１３０１）。そして、音声辞書作成部３５は、標準辞書記憶部４１に記憶されている文字列データのうち、出現頻度記憶部４５に存在しない文字列データについて、Ｓ１３０１で記憶された文字列データよりも後の優先順位を設定して、ユーザ辞書記憶部４２に記憶する（Ｓ１３０２）。 Here, an example of a method for determining the priority order based on the appearance frequency data will be described. FIG. 13 is a diagram illustrating a first example of a method for determining a priority order. First, the speech dictionary creation unit 35 stores, in the user dictionary storage unit 42, speech dictionary data in which the priority order of each character string data is set in descending order of the appearance frequency data stored in the appearance frequency storage unit 45 (S1301). . Then, the voice dictionary creation unit 35, for the character string data that does not exist in the appearance frequency storage unit 45 among the character string data stored in the standard dictionary storage unit 41, is later than the character string data stored in S1301. Priorities are set and stored in the user dictionary storage unit 42 (S1302).

また、図１４は、優先順位を決定する方法の第二の例を示す図である。まず、音声辞書作成部３５は、標準辞書記憶部４１に記憶されている文字列データの優先順位を、出現頻度記憶部４５に記憶されている出現回数分上昇させた音声辞書データをユーザ辞書記憶部４２に記憶する（Ｓ１４０１）。図１４の例では、「航海」の出現回数は２回であるため、標準辞書の「航海」の優先順位は「３」から「２」上昇し、「１」となる。これに伴い、「後悔」および「公開」の優先順位は１ずつ下げられる。 FIG. 14 is a diagram illustrating a second example of the method for determining the priority order. First, the speech dictionary creation unit 35 stores the speech dictionary data in which the priority order of the character string data stored in the standard dictionary storage unit 41 is increased by the number of appearances stored in the appearance frequency storage unit 45. Store in the unit 42 (S1401). In the example of FIG. 14, since the number of appearances of “voyage” is two, the priority of “voyage” in the standard dictionary increases by “2” from “3” to “1”. Accordingly, the priority order of “regret” and “public” is lowered by one.

次に、音声辞書作成部３５は、未登録語に最も低い優先順位を設定した音声辞書データをユーザ辞書記憶部４２に記憶する（Ｓ１４０２）。図１４の例では、未登録語の「公海」に最も低い優先順位「４」が設定され、ユーザ辞書に追加されている。そして、音声辞書作成部３５は、追加した未登録語の優先順位を出現頻度記憶部４５に記憶されている出現回数分上昇させる（Ｓ１４０３）。図１４の例では、「公海」の出現回数は１回であるため、「公海」の優先順位は「４」から「１」上昇し、「３」となる。そして、「公開」の優先順位が「３」から「４」に下げられる。 Next, the speech dictionary creation unit 35 stores the speech dictionary data in which the lowest priority is set for unregistered words in the user dictionary storage unit 42 (S1402). In the example of FIG. 14, the lowest priority “4” is set for the unregistered word “high seas” and added to the user dictionary. Then, the speech dictionary creation unit 35 increases the priority of the added unregistered word by the number of appearances stored in the appearance frequency storage unit 45 (S1403). In the example of FIG. 14, since the appearance frequency of “high seas” is one, the priority of “high seas” increases by “1” from “4” to “3”. Then, the priority of “public” is lowered from “3” to “4”.

図１３および図１４に例示したように、音声辞書作成部３５は、読みデータが同一の複数の文字列データの夫々に対する優先順位データを出現頻度記憶部４５に記憶されている出現頻度データに基づいて生成し、文字列データ、読みデータ、及び優先順位データを対応付けた音声辞書データをユーザ辞書記憶部４２に記憶することができる。 As illustrated in FIGS. 13 and 14, the speech dictionary creation unit 35 is based on the appearance frequency data stored in the appearance frequency storage unit 45 for the priority order data for each of a plurality of character string data having the same reading data. The voice dictionary data that is generated and associated with the character string data, the reading data, and the priority order data can be stored in the user dictionary storage unit 42.

＝＝ＷｅｂブラウザのＵＲＬ記憶部を利用＝＝
次に、Ｗｅｂブラウザ７が備えるＵＲＬの記憶部を読込対象のＵＲＬとして用いる場合について説明する。図１５は、Ｗｅｂブラウザ７の登録ＵＲＬメニュー５０を表示した図である。ユーザは、自分の気に入ったＷｅｂページがあった場合、そのＷｅｂページがＷｅｂブラウザ７に表示されている際に、登録ＵＲＬメニュー５０で追加ボタン５１を選択する。すると、そのＷｅｂページのタイトル５２が登録ＵＲＬメニュー５０の中に追加される。また、ユーザは登録ＵＲＬメニュー５０の中にフォルダ５３を作成し、そのフォルダ５３の中にＷｅｂページのタイトル５４を追加することもできる。 == Use URL storage part of Web browser ==
Next, a case where the URL storage unit included in the Web browser 7 is used as a URL to be read will be described. FIG. 15 is a diagram showing the registration URL menu 50 of the Web browser 7. When there is a web page that the user likes, when the web page is displayed on the web browser 7, the user selects the add button 51 from the registration URL menu 50. Then, the title 52 of the Web page is added to the registration URL menu 50. The user can also create a folder 53 in the registration URL menu 50 and add a Web page title 54 to the folder 53.

図１６は、ユーザが登録ＵＲＬメニュー５０を用いて登録したＷｅｂページのＵＲＬを記憶しているＵＲＬ記憶部５５の構造を示す図である。ＵＲＬ記憶部５５は、メモリ１１に設けられた記憶領域である。ＵＲＬ記憶部５５には、フォルダ５３、Ｗｅｂページのタイトル５２，５４、及びＷｅｂページのＵＲＬが対応付けられて記憶されている。 FIG. 16 is a diagram showing the structure of the URL storage unit 55 that stores the URL of the Web page registered by the user using the registration URL menu 50. The URL storage unit 55 is a storage area provided in the memory 11. The URL storage unit 55 stores a folder 53, Web page titles 52 and 54, and Web page URLs in association with each other.

読込処理部３１は、このＵＲＬ記憶部５５に記憶されているＵＲＬを取得し、読込ＵＲＬ記憶部４３に記憶することができる。また、読込処理部３１は、ユーザによって指定される登録ＵＲＬメニュー５０に登録されたＷｅｂページのタイトル５２，５４やフォルダ５３を受け付け、そのタイトル５２，５４やフォルダ５３に対応するＵＲＬを読込ＵＲＬ記憶部４３に記憶することも可能である。 The read processing unit 31 can acquire the URL stored in the URL storage unit 55 and store it in the read URL storage unit 43. Further, the read processing unit 31 receives the titles 52 and 54 and the folder 53 of the Web page registered in the registration URL menu 50 designated by the user, and reads the URLs corresponding to the titles 52 and 54 and the folder 53 by reading the URL. It can also be stored in the unit 43.

＝＝Ｗｅｂブラウザで入力されたＵＲＬを利用＝＝
次に、Ｗｅｂブラウザ７のＵＲＬ入力エリア８に入力されたＵＲＬを読込対象のＵＲＬとして用いる場合について説明する。図１７は、入力ＵＲＬ取得部３６の処理を示すフローチャートである。まず、入力ＵＲＬ取得部３６は、ユーザがＵＲＬ入力エリア８に入力したＵＲＬを取得する（Ｓ１７０１）。そして、入力ＵＲＬ取得部３６は、取得したＵＲＬを読込ＵＲＬ記憶部４３に記憶し（Ｓ１７０２）、図１２に示した音声辞書作成処理を起動する（Ｓ１７０３）。これにより、ＵＲＬ入力エリア８に入力されたＵＲＬが示すＷｅｂページ及び当該Ｗｅｂページに設定されたリンクを辿って取得される他のＷｅｂページをもとに、ユーザ辞書が生成される。 == Use URL entered by Web browser ==
Next, a case where the URL input in the URL input area 8 of the Web browser 7 is used as the URL to be read will be described. FIG. 17 is a flowchart showing the processing of the input URL acquisition unit 36. First, the input URL acquisition unit 36 acquires a URL input by the user to the URL input area 8 (S1701). Then, the input URL acquisition unit 36 stores the acquired URL in the read URL storage unit 43 (S1702), and starts the speech dictionary creation process shown in FIG. 12 (S1703). As a result, a user dictionary is generated based on the Web page indicated by the URL input in the URL input area 8 and other Web pages acquired by following links set in the Web page.

以上、音声辞書作成プログラムを実行する音声辞書作成装置１の動作について説明した。前述したように、本発明の音声辞書作成プログラムがインストールされた音声辞書作成装置１を用いれば、音声認識させて文字入力を行おうとしている分野に関連するサイトのＷｅｂページのＵＲＬを読込ＵＲＬ記憶部４３に記憶させておくだけで、そのＷｅｂページにおける文字列の出現頻度に応じた優先順位が設定されたユーザ辞書を得ることができる。なお、生成されたユーザ辞書は、音声認識における文字変換に限らず、仮名漢字変換における文字変換においても使用可能である。 The operation of the speech dictionary creation device 1 that executes the speech dictionary creation program has been described above. As described above, if the speech dictionary creation device 1 in which the speech dictionary creation program of the present invention is installed is used, the URL of a Web page of a site related to the field in which speech recognition is to be performed is read and URL storage is performed. By simply storing it in the unit 43, it is possible to obtain a user dictionary in which priorities are set in accordance with the appearance frequency of character strings on the Web page. The generated user dictionary can be used not only for character conversion in speech recognition but also for character conversion in kana-kanji conversion.

また、音声辞書作成装置１は未登録語をディスプレイ等の出力装置１６に出力する。そして、ユーザが未登録語の読みをキーボードやマイク等の入力装置から入力する。音声辞書作成装置１は、未登録語も含めて同音異義語の優先順位を設定したユーザ辞書を生成する。このように、未登録語が存在する場合においても、文字列の出現頻度に応じた優先順位が設定された音声辞書を得ることができる。 In addition, the voice dictionary creation device 1 outputs unregistered words to the output device 16 such as a display. Then, the user inputs reading of the unregistered word from an input device such as a keyboard or a microphone. The speech dictionary creation device 1 generates a user dictionary in which priorities of homophones including unregistered words are set. In this way, even when there are unregistered words, it is possible to obtain a speech dictionary in which priorities according to the appearance frequency of character strings are set.

また、音声辞書作成装置１は、ユーザがＷｅｂブラウザ７の登録ＵＲＬメニュー５０から登録したＵＲＬが記憶されているＵＲＬ記憶部５５からＵＲＬを取得して読込ＵＲＬ記憶部４３に記憶することもできる。ユーザは、自分の業務や研究等に関連するＷｅｂページのＵＲＬを、このＵＲＬ記憶部５５に登録しておくことが多い。そこで、このＵＲＬ記憶部５５に記憶されているＵＲＬを読込ＵＲＬ記憶部に記憶することで、ユーザは音声辞書作成用に別途ＵＲＬを登録することなく、業務や研究等に関連する分野のＷｅｂページにおける文字列の出現頻度に応じた優先順位が設定された音声辞書を得ることができる。 The voice dictionary creation device 1 can also acquire a URL from the URL storage unit 55 in which a URL registered by the user from the registration URL menu 50 of the Web browser 7 is stored and store it in the read URL storage unit 43. In many cases, the user registers the URL of a Web page related to his / her work or research in the URL storage unit 55. Therefore, by storing the URL stored in the URL storage unit 55 in the read URL storage unit, the user does not register a separate URL for creating a speech dictionary, and the web page in a field related to business or research. A speech dictionary in which priorities are set in accordance with the appearance frequency of the character string can be obtained.

この場合、ＵＲＬ記憶部５５には、多様な分野のＷｅｂページのＵＲＬが登録されている場合がある。そこで、音声辞書作成装置１は、ＵＲＬ記憶部５５に登録されているＵＲＬの中から、ユーザによって指定されたＵＲＬのみを読込ＵＲＬ記憶部４３に記憶することができる。なお、ユーザは、ＵＲＬ記憶部５５に分野ごとのフォルダを設けている場合は、そのフォルダを選択することにより、そのフォルダの中に格納されている複数のＵＲＬを指定することができる。 In this case, URLs of Web pages in various fields may be registered in the URL storage unit 55. Therefore, the speech dictionary creation device 1 can store only the URL designated by the user from the URLs registered in the URL storage unit 55 in the read URL storage unit 43. Note that, when a folder for each field is provided in the URL storage unit 55, the user can designate a plurality of URLs stored in the folder by selecting the folder.

このように、Ｗｅｂブラウザが備えるＵＲＬ記憶部５５に多様な分野のＷｅｂページのＵＲＬが登録されている場合においても、ユーザが文字入力を行おうとしている分野に応じた優先順位が設定された音声辞書を得ることができる。また、ユーザがＵＲＬを分野別に整理してＵＲＬ記憶部５５に登録しておくことにより、目的とする分野に応じた音声辞書を容易に得ることができる。 As described above, even when URLs of Web pages in various fields are registered in the URL storage unit 55 provided in the Web browser, the voice in which the priority order is set according to the field in which the user intends to input characters. You can get a dictionary. In addition, when the user organizes URLs by field and registers them in the URL storage unit 55, a speech dictionary corresponding to the target field can be easily obtained.

また、Ｗｅｂページの特徴として、Ｗｅｂページには関連する他のＷｅｂページへのリンクが設定されていることが多い。そこで、音声辞書作成装置１は、読込ＵＲＬ記憶部４３に記憶されているＵＲＬが示すＷｅｂページを取得する際に、そのＷｅｂページに設定されたリンクを辿って他のＷｅｂページを取得する。ここで、リンクを無限に辿ると、あらゆる分野のＷｅｂページが取得されてしまい、目的とする分野に応じた音声辞書を作成することができない。逆に、リンクを辿る階層数を少なくすると、取得できる文字列数が限られ、実用的な音声辞書を作成することができない。そこで、ユーザが、リンクを辿る階層数をリンク階層記憶部４４に設定しておくことにより、ユーザが目的とする分野に応じた音声辞書を作成することができる。 In addition, as a feature of a Web page, a link to another related Web page is often set in the Web page. Therefore, when acquiring the Web page indicated by the URL stored in the read URL storage unit 43, the voice dictionary creating apparatus 1 acquires another Web page by following the link set for the Web page. Here, if the link is traced infinitely, Web pages in all fields are acquired, and a speech dictionary corresponding to the target field cannot be created. Conversely, if the number of layers following the link is reduced, the number of character strings that can be acquired is limited, and a practical speech dictionary cannot be created. Therefore, when the user sets the number of hierarchies following the link in the link hierarchy storage unit 44, an audio dictionary corresponding to the field intended by the user can be created.

この場合、読込ＵＲＬ記憶部４３に記憶されているＵＲＬが示すＷｅｂページの特性に応じて、リンクを辿る階層数を指定することも可能である。例えば、Ｗｅｂページがある分野の用語集のＷｅｂページであるような場合、そのＷｅｂページには、ユーザが目的とする分野の文字列が多く含まれており、リンクを更に辿る必要がない場合がある。一方、例えば、ある分野の総合サイトの入口にあるＷｅｂページ（トップページ）であるような場合、そのＷｅｂページに設定されたリンクを辿ることにより、ユーザが目的とする分野の文字列を得ることができる。このように、読込ＵＲＬ記憶部４３に記憶されているＵＲＬが示すＷｅｂページごとに、リンクを辿る階層数を指定可能とすることで、目的とする分野に一段と合致した音声辞書を作成することができる。 In this case, it is possible to specify the number of hierarchies to follow the link according to the characteristics of the Web page indicated by the URL stored in the read URL storage unit 43. For example, when a web page is a glossary web page in a certain field, the web page includes a large number of character strings in the field targeted by the user, and there is a case where it is not necessary to follow the link further. is there. On the other hand, for example, when it is a Web page (top page) at the entrance of a general site in a certain field, the user can obtain a character string in the target field by following the link set in the Web page. Can do. As described above, for each Web page indicated by the URL stored in the read URL storage unit 43, it is possible to specify the number of layers to follow the link, thereby creating a speech dictionary that further matches the target field. it can.

また、音声辞書作成装置１は、Ｗｅｂブラウザ７のＵＲＬ入力エリア８に入力されたＵＲＬを取得して、取得した当該ＵＲＬを読込ＵＲＬ記憶部４３に記憶することもできる。例えば、ユーザが音声認識を利用して文字入力を行おうとしているＷｅｂページが、営業日報の入力ページである場合、ユーザが入力しようとしている文字列は、そのＷｅｂページに含まれている過去の営業日報等に含まれていることが多い。このような場合、Ｗｅｂブラウザ７のＵＲＬ入力エリア８に入力されたＵＲＬを読込ＵＲＬ記憶部４３に記憶することにより、ユーザが文字入力を行おうとする際に、別途読込対象のＵＲＬを指定することなく、目的に応じた音声辞書を作成することができる。 The voice dictionary creation device 1 can also acquire the URL input in the URL input area 8 of the Web browser 7 and store the acquired URL in the read URL storage unit 43. For example, when a Web page on which a user is trying to input characters using voice recognition is an input page for a daily business report, the character string that the user is trying to input is a past text included in the Web page. It is often included in business daily reports. In such a case, by storing the URL input in the URL input area 8 of the Web browser 7 in the read URL storage unit 43, when the user tries to input characters, the URL to be read is specified separately. The voice dictionary according to the purpose can be created.

以上本発明の実施形態について説明したが、以上の実施形態の説明は本発明の理解を容易にするためのものであり、本発明を限定するものではない。本発明はその趣旨を逸脱することなく、変更、改良され得ると共に本発明にはその等価物が含まれることは勿論である。 Although the embodiment of the present invention has been described above, the above description of the embodiment is intended to facilitate understanding of the present invention and is not intended to limit the present invention. It goes without saying that the present invention can be changed and improved without departing from the gist thereof, and that the present invention includes equivalents thereof.

本発明の実施例の構成図である。It is a block diagram of the Example of this invention. 本発明の第二の実施形態におけるネットワーク構成を示す図である。It is a figure which shows the network structure in 2nd embodiment of this invention. Ｗｅｂページの一例を示す図である。It is a figure which shows an example of a web page. Ｗｅｂページを表示したＷｅｂブラウザを示す図である。It is a figure which shows the web browser which displayed the web page. 音声辞書作成装置のハードウェア構成を示す図である。It is a figure which shows the hardware constitutions of an audio dictionary creation apparatus. 音声辞書作成装置が備える機能の構成を示す図である。It is a figure which shows the structure of the function with which an audio dictionary creation apparatus is provided. 標準辞書記憶部およびユーザ辞書記憶部の構造を示す図である。It is a figure which shows the structure of a standard dictionary memory | storage part and a user dictionary memory | storage part. 読込ＵＲＬ記憶部の構造を示す図である。It is a figure which shows the structure of a reading URL memory | storage part. リンク階層記憶部の構造を示す図である。It is a figure which shows the structure of a link hierarchy memory | storage part. 出現頻度記憶部の構造を示す図である。It is a figure which shows the structure of an appearance frequency memory | storage part. 読みデータ記憶部の構造を示す図である。It is a figure which shows the structure of a reading data storage part. 音声辞書作成処理のフローチャートである。It is a flowchart of an audio dictionary creation process. 優先順位を決定する方法の第一の例を示す図である。It is a figure which shows the 1st example of the method of determining a priority. 優先順位を決定する方法の第二の例を示す図である。It is a figure which shows the 2nd example of the method of determining a priority. Ｗｅｂブラウザの登録ＵＲＬメニューを表示した図である。It is the figure which displayed the registration URL menu of the Web browser. ユーザが登録ＵＲＬメニューを用いて登録したＷｅｂページのＵＲＬを記憶しているＵＲＬ記憶部の構造を示す図である。It is a figure which shows the structure of the URL memory | storage part which memorize | stored URL of the web page which the user registered using the registration URL menu. 入力ＵＲＬ取得部の処理を示すフローチャートである。It is a flowchart which shows the process of an input URL acquisition part.

Explanation of symbols

１音声辞書作成装置２ネットワーク
３ａ，３ｂ，３ｃＷｅｂサーバ４ａ，４ｂ，４ｃＷｅｂページ
５文書データ６リンク
７Ｗｅｂブラウザ８ＵＲＬ入力エリア
９表示エリア１０ＣＰＵ
１１メモリ１２記憶装置
１３記録媒体読取装置１４通信インタフェース
１５入力装置１６出力装置
１７マイク１８記録媒体
２０音声認識機能部２１音声取得部
２２音声認識部２３文字列変換部
３０音声辞書作成機能部３１読込処理部
３２文字列抽出部３３未登録語出力部
３４読みデータ受付部３５音声辞書作成部
３６入力ＵＲＬ取得部４０記憶部
４１標準辞書記憶部４２ユーザ辞書記憶部
４３読込ＵＲＬ記憶部４４リンク階層記憶部
４５出現頻度記憶部４６読みデータ記憶部
５０登録ＵＲＬメニュー５１追加ボタン
５２，５４Ｗｅｂページのタイトル５３フォルダ
５５ＵＲＬ記憶部 DESCRIPTION OF SYMBOLS 1 Speech dictionary creation apparatus 2 Network 3a, 3b, 3c Web server 4a, 4b, 4c Web page 5 Document data 6 Link 7 Web browser 8 URL input area 9 Display area 10 CPU
DESCRIPTION OF SYMBOLS 11 Memory 12 Storage device 13 Recording medium reader 14 Communication interface 15 Input device 16 Output device 17 Microphone 18 Recording medium 20 Voice recognition function part 21 Voice acquisition part 22 Voice recognition part 23 Character string conversion part 30 Voice dictionary creation function part 31 Reading Processing unit 32 Character string extraction unit 33 Unregistered word output unit 34 Reading data reception unit 35 Speech dictionary creation unit 36 Input URL acquisition unit 40 Storage unit 41 Standard dictionary storage unit 42 User dictionary storage unit 43 Read URL storage unit 44 Link hierarchy storage Unit 45 Appearance frequency storage unit 46 Reading data storage unit 50 Registration URL menu 51 Add button 52, 54 Web page title 53 Folder 55 URL storage unit

Claims

A communication interface for connecting to a CPU, memory, and the Internet;
A first voice dictionary storage unit, a second voice dictionary storage unit, a read URL storage unit, and an appearance frequency storage unit, which are storage areas provided in the memory;
A read processing unit, a character string extraction unit, and a speech dictionary creation unit, which are realized by the CPU executing a program stored in the memory;
With
In the first speech dictionary storage unit, character string data indicating a character string such as a word and reading data indicating a reading of the character string are stored in association with each other,
The read URL storage unit stores a URL of a Web page,
The read processing unit acquires a Web page specified by the URL stored in the read URL storage unit by accessing a Web server on the Internet,
The character string extraction unit
Extracting the character string data included in the data constituting the web page acquired by the read processing unit to obtain the appearance frequency for each character string data;
Storing the character string data and the appearance frequency data, which is data indicating the appearance frequency, in association with each other in the appearance frequency storage unit;
The voice dictionary creation unit
Obtaining the reading data corresponding to the character string data stored in the appearance frequency storage unit from the first speech dictionary storage unit;
For each of the plurality of character string data having the same acquired reading data, the reading data obtained by voice recognition is converted into one character string data selected from the plurality of character string data. The selection order of the character string data at the time is determined based on the appearance frequency data,
Associating the character string data, the reading data, and selection order data that is data indicating the selection order, and generating speech dictionary data that is data used when recognizing speech and converting it to a character string; Storing the voice dictionary data in the second voice dictionary storage unit;
Voice dictionary creation device characterized by.

A communication interface for connecting to a CPU, memory, and the Internet;
A storage area provided in the memory;
A first voice dictionary storage unit in which character string data indicating a character string such as a word and reading data indicating a reading of the character string are stored in association with each other;
A read URL storage unit in which the URL of the Web page is stored;
On a computer with
Obtaining a Web page specified by the URL stored in the read URL storage unit by accessing a Web server on the Internet;
A procedure for extracting the character string data included in the acquired web page and obtaining an appearance frequency for each character string data;
A procedure for associating the character string data and appearance frequency data that is data indicating the appearance frequency and storing them in an appearance frequency storage unit that is a storage area provided in the memory;
A procedure for acquiring the reading data corresponding to the character string data stored in the appearance frequency storage unit from the first speech dictionary storage unit;
For each of the plurality of character string data having the same acquired reading data, the reading data obtained by voice recognition is converted into one character string data selected from the plurality of character string data. A procedure for determining the selection order of the character string data at the time based on the appearance frequency data;
Associating the character string data, the reading data, and selection order data that is data indicating the selection order, and generating speech dictionary data that is data used when recognizing speech and converting it to a character string; A procedure for storing the speech dictionary data in a second speech dictionary storage unit that is a storage area provided in the memory;
A program for running

The program according to claim 2,
Among the character string data stored in the appearance frequency storage unit, a procedure for outputting the character string data not stored in the first speech dictionary storage unit to a predetermined output device such as a display or a printer;
The reading data indicating the reading of the character string represented by the character string data output to the output device, which is input from a predetermined input device included in the computer, is received, and the character string data and the reading data are associated with each other. In addition, a procedure for storing in a reading data storage unit which is a storage area provided in the memory;
Further comprising
The procedure of acquiring the reading data corresponding to the character string data stored in the appearance frequency storage unit includes the reading data corresponding to the character string data stored in the appearance frequency storage unit. A procedure to obtain from one voice dictionary storage unit or the reading data storage unit,
A program characterized by

A CPU, a memory, an input device, an output device, and a communication interface connected to the Internet;
A storage area provided in the memory;
A first voice dictionary storage unit in which character string data indicating a character string such as a word and reading data indicating a reading of the character string are stored in association with each other;
A read URL storage unit in which the URL of the Web page is stored;
On a computer with
Obtaining a Web page specified by the URL stored in the read URL storage unit by accessing a Web server on the Internet;
A procedure for extracting the character string data included in the data constituting the acquired web page and obtaining an appearance frequency for each character string data;
A procedure for associating the character string data and appearance frequency data that is data indicating the appearance frequency and storing them in an appearance frequency storage unit that is a storage area provided in the memory;
A procedure for outputting the extracted character string data to the output device;
The reading data indicating the reading of the character string represented by the character string data output from the input device and output to the output device is received, and the character string data and the reading data are associated with each other in the memory. A procedure of storing in a reading data storage unit which is a storage area provided;
A procedure for acquiring the reading data corresponding to the character string data stored in the appearance frequency storage unit from the reading data storage unit;
For each of the plurality of character string data having the same read data, the read data obtained by voice recognition is selected from the plurality of character string data having the same read data. A procedure for determining the selection order of the character string data when converting to the character string data based on the appearance frequency data;
Associating the character string data, the reading data, and selection order data that is data indicating the selection order, and generating speech dictionary data that is data used when recognizing speech and converting it to a character string; A procedure for storing the speech dictionary data in a second speech dictionary storage unit that is a storage area provided in the memory;
A program for running

The program according to claim 2 or 4,
A Web browser, which is software that displays an image, a character string, or the like corresponding to the Web page on the display, acquires the URL stored in the URL storage unit that is a storage area for storing the URL of the Web page specified by the user. Procedure and
Storing the URL obtained from the URL storage unit in the read URL storage unit;
A program characterized by further comprising:

The speech dictionary creation program according to claim 5,
A step of accepting the URL specified by the user among the plurality of URLs stored in the read URL storage unit;
The procedure for acquiring the URL stored in the URL storage unit is a procedure for acquiring, from the URL storage unit, the URL specified by the user among the URLs stored in the URL storage unit. thing,
A program characterized by

The program according to claim 2 or 4,
Link hierarchy data indicating the number of hierarchies that follow links to other web pages set in a web page is stored in a link hierarchy storage unit that is a storage area provided in the memory,
In the procedure of acquiring the Web page specified by the URL stored in the read URL storage unit,
A link destination URL that is a URL indicating a link to another Web page included in the data constituting the acquired Web page is extracted, and the other Web page specified by the link destination URL is acquired. Including a procedure of repeatedly executing the processing to be performed according to the number of levels indicated by the link level data stored in the link level storage unit,
A program characterized by

The program according to claim 7,
The link hierarchy storage unit stores the link hierarchy data for each URL of the Web page stored in the read URL storage unit,
The procedure of repeatedly executing the process of acquiring the other Web page according to the number of hierarchies corresponds to the URL stored in the URL storage unit of the process of acquiring the other Web page. A procedure that is repeatedly executed according to the number of layers indicated by the link layer data.
A program characterized by

The program according to claim 2 or 4,
A program further comprising a procedure of acquiring a URL input in an area for inputting a URL of a Web page to be displayed provided in software for displaying the Web page, and storing the acquired URL in the read URL storage unit .

A communication interface for connecting to a CPU, memory, and the Internet;
A storage area provided in the memory;
A first voice dictionary storage unit in which character string data indicating a character string such as a word and reading data indicating a reading of the character string are stored in association with each other;
A read URL storage unit in which the URL of the Web page is stored;
A computer comprising
Obtaining a Web page specified by the URL stored in the read URL storage unit by accessing a Web server on the Internet;
Extracting the character string data contained in the acquired web page and determining the appearance frequency for each character string data;
The character string data and the appearance frequency data which is the data indicating the appearance frequency are associated with each other and stored in the appearance frequency storage unit which is a storage area provided in the memory,
Obtaining the reading data corresponding to the character string data stored in the appearance frequency storage unit from the first speech dictionary storage unit;
For each of the plurality of character string data having the same acquired reading data, the reading data obtained by voice recognition is converted into one character string data selected from the plurality of character string data. The selection order of the character string data at the time is determined based on the appearance frequency data,
Associating the character string data, the reading data, and selection order data that is data indicating the selection order, and generating speech dictionary data that is data used when recognizing speech and converting it to a character string; Storing the speech dictionary data in a second speech dictionary storage unit which is a storage area provided in the memory;
A voice dictionary creation method characterized by