JP3884970B2

JP3884970B2 - Information processing apparatus and information processing method

Info

Publication number: JP3884970B2
Application number: JP2002039033A
Authority: JP
Inventors: 雅章山田; 勝彦川崎
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-02-15
Filing date: 2002-02-15
Publication date: 2007-02-21
Anticipated expiration: 2022-02-15
Also published as: JP2003241778A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a convenient and portable information terminal the manufacturing cost of which does not increase as the number of parts such as an operation button increases for various information processing. <P>SOLUTION: When a user gives an instruction for performing various operation for 'playback', 'stop', 'temporary stop', 'fast forward', 'rewind', etc., during an operation for 'playback of sound-recorded data', 'read-aloud of stored document', 'read-aloud of new information', etc., the indication is detected and processing based upon the user's instruction is carried out without increasing the number of components such as an operation button. Consequently, the manufacturing cost can be suppressed. <P>COPYRIGHT: (C)2003,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、情報処理装置、情報処理方法、制御プログラムおよびコンピュータ可読記憶媒体に関し、特に音声合成を用いた情報処理装置、情報処理方法、制御プログラムおよびコンピュータ可読記憶媒体に関する。
【０００２】
【従来の技術】
現在、例えば、図２０に示すような携帯可能な情報端末が実用化されており、この情報端末を用いて様々な情報処理が行われている。この携帯可能な情報端末には、例えば、通信部、記憶部、音声出力部および音声合成部を備えるものがあり、これらの各部では例えば、以下の「録音データ再生」、「蓄積文書読み上げ」、「新着情報読み上げ」などの機能が実現されている。
１）「録音データ再生」機能
通信部を通じてダウンロードした音楽や語学教材といった音響データを記憶部に記憶し、任意の時間・場所で再生する。
２）「蓄積文書読み上げ」機能
データ蓄積部に蓄積された小説などのテキストデータを音声合成(テキスト音声変換) を用いて読み上げることにより、場所を選ばず情報を閲覧する。
３）「新着情報読み上げ」機能
通信部を用いてインターネットなどに接続し、メールやニュースなどのリアルタイム情報（テキストデータ）を得る。さらに、得られた情報を、音声合成(テキスト音声変換) を用いて読み上げる。
【０００３】
さらに、上記「録音データ再生」、「蓄積文書読み上げ」、「新着情報読み上げ」機能を組み合わせる以下の機能を用いることも可能である。
４）「録音データをＢＧＭとした文書読み上げ」機能
録音データを再生しつつ、蓄積文書や新着情報（テキストデータ）を音声合成(テキスト音声変換) を用いて読み上げる。
５）「新着情報の割り込み通知」機能
メールや新着ニュースが到着した際に、音声合成(テキスト音声変換) を用いて読み上げる。音声なので、他の作業中であっても作業の邪魔になり難い。また、音楽などを再生している最中に合成音声を重ねることも可能である。
【０００４】
【発明が解決しようとする課題】
しかしながら、上記説明した従来法には以下の２つの問題点がある。
【０００５】
第１の問題は、操作ボタンが多くなることである。
【０００６】
「録音データ再生」、「蓄積文書読み上げ」、「新着情報読み上げ」のいずれに対しても、その操作中にユーザから「再生」、「停止」、「早送り」、「巻き戻し」といった操作の指示を合わせて行なうことができる。しかしながら、「録音データ再生」、「蓄積文書読み上げ」、「新着情報読み上げ」の操作ボタンに対して、例えば「再生」、「停止」、「早送り」、「巻き戻し」などの操作ボタンを別に設けた場合には、設置場所が大きくなり、結果的に情報端末全体のサイズが大きくなる、部品点数が増え、製造コストが上昇するという問題が生じる。
【０００７】
第２の問題は、音声合成(テキスト音声変換)を用いて読み上げている際に、録音データの再生と同様の「早送り」、「巻き戻し」処理を行なった場合、「早送り」、「巻き戻し」中は、ユーザは音声合成(テキスト音声変換)を用いて読み上げている内容を聞き取ることができず、利便性が悪いことである。
【０００８】
本発明は、上記従来技術の問題点を解決するためのものであり、その目的は、例えば、「録音データ再生」、「蓄積文書読み上げ」、「新着情報読み上げ」の操作中に「再生」、「停止」、「早送り」、「巻き戻し」などの各種操作を行うことができ、かつ操作ボタンなどの部品点数が増えて製造コストが上昇しない携帯可能な情報処理装置および情報処理方法を提供することである。
【０００９】
また本発明の別の目的は、例えば、音声合成(テキスト音声変換)を用いて読み上げている際に、録音データの再生と同様の「早送り」、「巻き戻し」処理を行なった場合でも、ユーザが音声合成を用いて読み上げている内容を聞き取ることができる、利便性のよい携帯可能な情報処理装置および情報処理方法を提供することである。
【００１０】
【課題を解決するための手段】
上記目的を達成するための本発明に係る一実施形態の情報処理装置は、以下の構成を有する。すなわち、ユーザの指示に基づいて、テキストデータを音声合成する音声合成手段に、合成音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第１制御手段と、ユーザの指示に基づいて、音声データから音声を再生する音声再生手段に、再生音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第２制御手段と、ユーザの指示を検出する検出手段と、前記検出手段により検出されたユーザの指示が出力である場合に、前記音声合成手段により音声合成される読み上げ対象のテキストデータの有無を判定する判定手段と、前記判定手段により読み上げ対象のテキストデータが存在すると判定された場合に、前記合成音声の出力の停止を指示するユーザの指示が検出された場合に解除される第１データを設定する設定手段と、前記検出手段によりユーザの指示が検出された場合に、前記第１データが、設定されていた場合は、該ユーザの指示を前記第１制御手段に対して出し、該第１データが解除されていた場合は、前記再生音声の出力を指示するユーザの指示が検出された場合に設定され前記再生音声の出力の停止を指示するユーザの指示が検出された場合に解除される第２データに対応して該ユーザの指示を前記第２制御手段に対して出す指示手段と、を有することを特徴とする。
【００１４】
ここで、例えば、前記検出手段により検出された前記ユーザの指示が早送りであって、前記第１データが設定されていることによって前記指示手段が前記ユーザの指示を前記第１制御手段に出した場合に、前記第１制御手段は、読み上げ対象のテキストデータの各文より所定部分のデータを抽出して要約データを作成し、前記要約データを音声合成して出力するように前記音声合成手段を制御することが好ましい。
【００１５】
ここで、例えば、前記検出手段により検出された前記ユーザの指示が早送りであって、前記第１データが設定されていることによって前記指示手段が前記ユーザの指示を前記第１制御手段に出した場合に、前記第１制御手段は、読み上げ対象のテキストデータの各文より先頭文節を抽出して抽出された文節を順次音声合成して出力するように前記音声合成手段を制御することが好ましい。
【００１６】
ここで、例えば、前記検出手段により検出された前記ユーザの指示が巻き戻しであって、前記第１データが設定されていることによって前記指示手段が前記ユーザの指示を前記第１制御手段に出した場合に、前記第１制御手段は、読み上げ対象のテキストデータの各文より先頭文節を抽出して抽出された文節を前記テキストデータの文の並びとは逆順に音声合成して出力するように前記音声合成手段を制御することが好ましい。
【００１９】
ここで、例えば、前記制御手段は、前記テキストデータのうちで、前記音声合成手段によって音声合成されずに省略されたデータに対応して、所定の音を出力するように前記音声合成手段を制御することが好ましい。
【００２７】
上記目的を達成するための本発明に係る一実施形態の情報処理方法は、以下の構成を有する。すなわち、ユーザの指示を検出する検出工程と、前記検出工程により検出されたユーザの指示が出力である場合に、前記ユーザの指示に基づいてテキストデータを音声合成する音声合成手段により音声合成される読み上げ対象のテキストデータの有無を判定する判定工程と、前記判定工程により読み上げ対象のテキストデータが存在すると判定された場合に、前記合成音声の出力の停止を指示するユーザの指示が検出された場合に解除される第１データを設定する設定工程と、前記検出工程によりユーザの指示が検出された場合に、前記第１データが設定されていた場合は、該ユーザの指示を、前記音声合成手段に、前記合成音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第１制御手段に対して出し、該第１データが解除されていた場合は、再生音声の出力を指示するユーザの指示が検出された場合に設定され前記再生音声の出力の停止を指示するユーザの指示が検出された場合に解除される第２データに対応して該ユーザの指示を、前記ユーザの指示に基づいて音声データから音声を再生する音声再生手段に、再生音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第２制御手段に対して出す指示工程と、を有することを特徴とする。
【００２９】
上記目的を達成するための本発明に係る一実施形態の制御プログラムは、以下の構成を有する。すなわち、コンピュータに、ユーザの指示を検出する検出工程と、前記検出工程により検出されたユーザの指示が出力である場合に、前記ユーザの指示に基づいてテキストデータを音声合成する音声合成手段により音声合成される読み上げ対象のテキストデータの有無を判定する判定工程と、前記判定工程で読み上げ対象のテキストデータが存在すると判定された場合に、前記合成音声の出力の停止を指示するユーザの指示が検出された場合に解除される第１データを設定する設定工程と、前記検出工程によりユーザの指示が検出された場合に、前記第１データが設定されていた場合は、該ユーザの指示を、前記音声合成手段に、前記合成音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第１制御手段に対して出し、該第１データが解除されていた場合は、再生音声の出力を指示するユーザの指示が検出された場合に設定され前記再生音声の出力の停止を指示するユーザの指示が検出された場合に解除される第２データに対応して該ユーザの指示を、前記ユーザの指示に基づいて音声データから音声を再生する音声再生手段に、再生音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第２制御手段に対して出す指示工程と、を実行させることを特徴とする。
【００３０】
上記目的を達成するための本発明に係る一実施形態のコンピュータ可読記憶媒体は、以下の構成を有する。すなわち、コンピュータに、ユーザの指示を検出する検出工程と、前記検出工程により検出されたユーザの指示が出力である場合に、前記ユーザの指示に基づいてテキストデータを音声合成する音声合成手段により音声合成される読み上げ対象のテキストデータの有無を判定する判定工程と、前記判定工程で読み上げ対象のテキストデータが存在すると判定された場合に、前記合成音声の出力の停止を指示するユーザの指示が検出された場合に解除される第１データを設定する設定工程と、前記検出工程によりユーザの指示が検出された場合に、前記第１データが設定されていた場合は、該ユーザの指示を、前記音声合成手段に、前記合成音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第１制御手段に対して出し、該第１データが解除されていた場合は、再生音声の出力を指示するユーザの指示が検出された場合に設定され前記再生音声の出力の停止を指示するユーザの指示が検出された場合に解除される第２データに対応して該ユーザの指示を、前記ユーザの指示に基づいて音声データから音声を再生する音声再生手段に、再生音声の出力、停止、早送り、巻き戻し、一時停止の少なくともいずれかを実行させる第２制御手段に対して出す指示工程と、を実行させるための制御プログラムを格納したことを特徴とする。
【００３８】
【発明の実施の形態】
以下に図面を参照して、本発明に係る一実施形態の情報処理装置について説明する。本実施形態では、情報処理装置の一例として、携帯可能な情報端末を用いて説明するが、本発明の範囲を記載例に限定する趣旨のものではない。
【００３９】
＜第１の実施形態＞
［ハードウェア構成：図１］
図１は、第１の実施形態である携帯可能な情報端末Ｈ１０００のハードウェア構成を説明する図である。
【００４０】
Ｈ１は、数値演算・制御等の処理を行なう中央処部であり、本発明の処理手順を記載した制御プログラムに従って演算を行なう。Ｈ２は、ユーザに対して情報を提示する出力部である。出力部Ｈ２は、スピーカ・ヘッドフォン等の音声出力部Ｈ２０１および液晶などの画面表示部Ｈ２０２を含む。
【００４１】
Ｈ３はユーザが情報端末Ｈ１０００に対して動作の指示を与え、あるいは情報を入力する入力部である。入力部Ｈ３は、再生ボタンＨ３０１、停止ボタンＨ３０２、一時停止ボタンＨ３０３、早送りボタンＨ３０４、巻き戻しボタンＨ３０５およびタッチパネルＨ３０６等の汎用入力部を含む。
【００４２】
Ｈ４は、ＬＡＮカードやＰＨＳカード等のデータ通信部であり、新着メール等のデータの取得に用いられる。Ｈ５はハードディスクや不揮発メモリ等の記憶部であり、録音されたデータや蓄積情報が保持される。Ｈ６は読み取り専用の記憶部であり、本発明の手順を示す制御プログラムや音声合成用辞書等の固定的データが格納される。
【００４３】
Ｈ７は、ＲＡＭ等の一時情報を保持する記憶部であり、一時的なデータや各種フラグ等が保持される。Ｈ８はインタバルなタイマ部であり、タイマ起動後所定の時間経過した後中央処部Ｈ１に割り込みを掛ける働きをする。上記説明した中央処部Ｈ１〜タイマ部Ｈ８は、バスで接続されている。
【００４４】
［情報端末のハードウェア構成の外観：図２０］
図２０は、図１で説明した携帯可能な情報端末Ｈ１０００のハードウェア構成の外観イメージの一例を示したスケッチ図である。
【００４５】
［各イベント処理の概要：図２］
以上説明した情報端末Ｈ１０００におけるイベント処理について、図２〜図１６に示すフローチャートを用いて説明する。なお下記の処理は、中央処部Ｈ１により読み取り専用の記憶部Ｈ６等に格納されているイベント駆動型の制御プログラムに基づいて、ＲＡＭ等の一時情報を保持する記憶部Ｈ７を用いて実行されるものである。また入力部Ｈ３からの入力処理や出力部Ｈ２からのデータ要求、タイマ割り込み等の割り込みは、上記制御プログラムにおける各イベントの開始として処理される。
【００４６】
図２において、まず、イベント取得ステップＳ１において、新たなイベントを取得する。
【００４７】
次に、再生ボタン押下判定ステップＳ２で、イベント取得ステップＳ１で取得したイベントが「再生ボタンの押下」であるか否かを判定し、取得したイベントが「再生ボタンの押下」であるならば、図３に示すステップＳ１０１に処理を移し、「再生ボタンの押下」でなければ、停止ボタン押下判定ステップＳ３に処理を移す。
【００４８】
次に、停止ボタン押下判定ステップＳ３では、イベント取得ステップＳ１で取得したイベントが「停止ボタンの押下」であるか否かを判定し、取得したイベントが「停止ボタンの押下」であるならば、図４に示すステップＳ２０１に処理を移し、「停止ボタンの押下」でなければ、一時停止ボタン押下判定ステップＳ４に処理を移す。
【００４９】
次に、一時停止ボタン押下判定ステップＳ４では、イベント取得ステップＳ１で取得したイベントが「一時停止ボタンの押下」であるか否か判定し、「一時停止ボタンの押下」であるならば、図５に示すステップＳ３０１に処理を移し、「一時停止ボタンの押下」でなければ、早送りボタン押下判定ステップＳ５に処理を移す。
【００５０】
次に、早送りボタン押下判定ステップＳ５では、イベント取得ステップＳ１で取得したイベントが「早送りボタンの押下」であるか否かを判定し、「早送りボタンの押下」であるならば、図６に示すステップＳ４０１に処理を移し、「早送りボタンの押下」でなければ、早送りボタン解除判定ステップＳ６に処理を移す。
【００５１】
次に、早送りボタン解除判定ステップＳ６では、イベント取得ステップＳ１で取得したイベントが「早送りボタンの解除(押下したボタンをはなす操作) 」であるか否かを判定し、「早送りボタンの解除」であるならば、図７に示すステップＳ５０１に処理を移し、「早送りボタンの解除」でなければ、巻き戻しボタン押下判定ステップＳ７に処理を移す。
【００５２】
次に、巻き戻しボタン押下判定ステップＳ７では、イベント取得ステップＳ１で取得したイベントが「巻き戻しボタンの押下」であるか否かを判定し、「巻き戻しボタンの押下」であるならば、図８に示すステップＳ６０１に処理を移し、「巻き戻しボタンの押下」でなければ、巻き戻しボタン解除判定ステップＳ８に処理を移す。
【００５３】
次に、巻き戻しボタン解除判定ステップＳ８では、イベント取得ステップＳ１で取得したイベントが「巻き戻しボタンの解除」であるか否かを判定し、「巻き戻しボタンの解除」であるならば、図９に示すステップＳ７０１に処理を移し、「巻き戻しボタンの解除」でなければ、新規情報到着判定ステップＳ９に処理を移す。
【００５４】
次に、新規情報到着判定ステップＳ９では、イベント取得ステップＳ１で取得したイベントが「新着情報」が到着したことを示すものであるか否かを判定し、「新着情報」の到着であるならば、図１０に示すステップＳ８０１に処理を移し、「新着情報」の到着でなければ、蓄積情報読み上げ指示判定ステップＳ１０に処理を移す。
【００５５】
次に、蓄積情報読み上げ指示判定ステップＳ１０では、イベント取得ステップＳ１で取得したイベントが「ユーザによる蓄積情報読み上げの指示」であるか否かを判定し、「ユーザによる蓄積情報読み上げの指示」であるならば、図１１に示すステップＳ９０１に処理を移し、「ユーザによる蓄積情報読み上げの指示」でなければ、音声合成データ要求判定ステップＳ１１に処理を移す。
【００５６】
次に、音声合成データ要求判定ステップＳ１１では、イベント取得ステップＳ１で取得したイベントが「合成音声出力デバイスからのデータ要求」であるか否かを判定し、「合成音声出力デバイスからのデータ要求」であるならば、図１２に示すステップＳ１００１に処理を移し、「合成音声出力デバイスからのデータ要求」でなければ、録音再生データ要求判定ステップＳ１２に処理を移す。
【００５７】
次に、録音再生データ要求判定ステップＳ１２では、イベント取得ステップＳ１で取得したイベントが「録音データ出力デバイスからのデータ要求」であるか否かを判定し、「録音データ出力デバイスからのデータ要求」であるならば、図１３に示すステップＳ１１０１に処理を移し、「録音データ出力デバイスからのデータ要求」でなければ、タイマイベント判定ステップＳ１３に処理を移す。
【００５８】
次に、タイマイベント判定ステップＳ１３では、イベント取得ステップＳ１で取得したイベントが、タイマ部Ｈ８からのタイマ開始後所定時間経過したことを示す旨の通知であるか否かを判定し、タイマ部Ｈ８からの通知であるならば図１４に示すステップＳ１２０１に処理を移し、タイマ部Ｈ８からの通知でなければ、イベント取得ステップＳ１に処理を戻す。
【００５９】
［「再生ボタンの押下」処理：図３］
次に、上記説明した各イベントの処理について、以下詳細に説明する。まず「再生ボタンの押下」処理について図３を用いて説明する。
【００６０】
［読み上げポインタ］
読み上げポインタ設定判定(再生) ステップＳ１０１では、「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば音声合成ポーズフラグ解除(再生) ステップＳ１０６に処理を移し、「読み上げポインタ」が設定されていなければ、優先読み上げ文存在判定(再生) ステップＳ１０２に処理を移す。なお「読み上げポインタ」は、例えば図１８に一例を示す優先読み上げ文（テキストデータ）の途中に、次に音声合成して読み上げを開始する位置を保持する領域であり、非設定もしくは、文書中の「読み上げポインタ」の位置が値として設定される。
【００６１】
次に、優先読み上げ文存在判定(再生) ステップＳ１０２では、「優先読み上げ文が存在」するか否かを判定し、「優先読み上げ文が存在」すれば優先読み上げ文初期ポインタ設定ステップＳ１０８に処理を移し、「優先読み上げ文が存在」しなければ蓄積読み上げ文存在判定ステップＳ１０３に処理を移す。
【００６２】
次に、蓄積読み上げ文存在判定ステップＳ１０３では、「蓄積読み上げ文が存在」するか否かを判定し、「蓄積読み上げ文が存在」すれば蓄積読み上げ文初期ポインタ設定ステップＳ１０９に処理を移し、「蓄積読み上げ文が存在」しなければ再生ポインタ設定判定(再生) ステップＳ１０４に処理を移す。
【００６３】
［再生ポインタ］
次に、再生ポインタ設定判定(再生) ステップＳ１０４では、「再生ポインタが設定」されているか否かを判定し、「再生ポインタが設定」されていれば再生ポーズフラグ解除(再生) ステップＳ１１１に処理を移し、「再生ポインタが設定」されていなければ、録音データ存在判定ステップＳ１０５に処理を移す。なお「再生ポインタ」は、次に再生される位置を保持する領域で、非設定もしくは、録音データ中の「再生ポインタ」の位置が値として設定される。
【００６４】
次に、録音データ存在判定ステップＳ１０５では、「録音データが存在」するか否かを判定し、「録音データが存在」すれば録音データ再生初期ポインタ設定ステップＳ１１３に処理を移し、「録音データが存在」しなければ図２のイベント取得ステップＳ１に処理を移す。
【００６５】
次に、音声合成ポーズフラグ解除(再生) ステップＳ１０６では、音声合成ポーズフラグを解除する。音声合成ポーズフラグは音声合成が一時状態にあるかどうかを示すフラグであり、設定されているときは「真」、解除されているときは「偽」の値をとる。
【００６６】
次に、音声合成再開(再生) ステップＳ１０７では、図５のステップＳ３０４で一時停止されていた音声合成を再開してから、図２のイベント取得ステップＳ１に処理を移す。「音声合成開始」、「音声合成停止」、「音声合成一時停止」、「音声合成再開」の各ルーチンでの処理については、図１５を用いて後述する。
【００６７】
次に、優先読み上げ文初期ポインタ設定ステップＳ１０８では読み上げポインタを優先読み上げ文の先頭に設定し、音声合成開始ステップＳ１１０に処理を移す。
【００６８】
次に、蓄積読み上げ文初期ポインタ設定ステップＳ１０９では読み上げポインタを蓄積読み上げ文の先頭に設定し、音声合成開始ステップＳ１１０に処理を移す。
【００６９】
次に、優先読み上げ文初期ポインタ設定ステップＳ１０８あるいは蓄積読み上げ文初期ポインタ設定ステップＳ１０９で読み上げポインタを設定した後、音声合成開始ステップＳ１１０で音声合成を開始してから図２のイベント取得ステップＳ１に処理を移す。
【００７０】
次に、再生ポーズフラグ解除(再生) ステップＳ１１１では、再生ポーズフラグを解除する。再生ポーズフラグは録音データ再生が一時状態にあるか否かを示すフラグである。
【００７１】
次に、録音データ再生再開(再生) ステップＳ１１２でステップＳ３０８で一時停止されていた録音データの再生を再開し、前記イベント取得ステップＳ１に処理を移す。「録音データ再生開始」、「録音データ再生停止」、「録音データ再生一時停止」、「録音データ再生再開」の各ルーチンでの処理については図１６を用いて後述する。
【００７２】
次に、録音データ再生初期ポインタ設定ステップＳ１１３では再生ポインタを録音データの先頭に設定し、録音データ再生開始ステップＳ１１４に処理を移す。録音データ再生開始ステップＳ１１４では、録音データの再生を開始してから図２のイベント取得ステップＳ１に処理を移す。
【００７３】
［「停止ボタンの押下」処理：図４］
次に、「停止ボタンの押下」処理について、図４を用いて説明する。
【００７４】
読み上げポインタ設定判定(停止) ステップＳ２０１では、「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば音声合成ポーズフラグ解除(停止) ステップＳ２０３に処理を移し、「読み上げポインタ」が設定されていなければ、再生ポインタ設定判定(停止) ステップＳ２０２に処理を移す。
【００７５】
次に、再生ポインタ設定判定(停止) ステップＳ２０２では、「再生ポインタ」が設定されているか否かを判定し、「再生ポインタ」が設定されていれば再生ポーズフラグ解除(停止) ステップＳ２０６に処理を移し、「再生ポインタ」が設定されていなければイベント取得ステップＳ１に処理を移す。
【００７６】
音声合成ポーズフラグ解除(停止) ステップＳ２０３では、音声合成ポーズフラグを解除する。次に、読み上げポインタ解除(停止) ステップＳ２０４で読み上げポインタを解除する(非設定にする)。次に、音声合成停止ステップＳ２０５で音声合成を停止してから、図２のイベント取得ステップＳ１に処理を移す。
【００７７】
次に、再生ポーズフラグ解除(停止) ステップＳ２０６では、再生ポーズフラグを解除する。次に、再生ポインタ解除(停止) ステップＳ２０７で再生ポインタを解除する(非設定にする)。次に、録音データ再生停止ステップＳ２０８で録音データの再生を停止してから、図２のイベント取得ステップＳ１に処理を移す。
【００７８】
［「一時停止ボタンの押下」処理：図５］
次に、「一時停止ボタンの押下」処理について、図５を用いて説明する。
【００７９】
まず読み上げポインタ設定判定(一時停止) ステップＳ３０１では、「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば音声合成ポーズフラグ設定判定ステップＳ３０２に処理を移し、「読み上げポインタ」が設定されていなければ、再生ポインタ設定判定(一時停止) ステップＳ３０５に処理を移す。
【００８０】
次に、音声合成ポーズフラグ設定判定ステップＳ３０２では、音声合成ポーズフラグが設定されているか、すなわち音声合成が一時停止状態にあるか否かを判定し、音声合成ポーズフラグが設定されていれば、図３の読み上げポインタ設定判定(再生) ステップＳ１０１に処理を移し、音声合成ポーズフラグが設定されていなければ、音声合成ポーズフラグ設定ステップＳ３０３に処理を移す。
【００８１】
次に、音声合成ポーズフラグ設定ステップＳ３０３では、音声合成ポーズフラグを設定する(値を「真」にする)。次に、音声合成一時停止ステップＳ３０４で音声合成を一時停止してから、図２のイベント取得ステップＳ１に処理を移す。
【００８２】
次に、再生ポインタ設定判定(一時停止) ステップＳ３０５では、「再生ポインタ」が設定されているか否かを判定し、「再生ポインタ」が設定されていれば再生ポーズフラグ設定判定ステップＳ３０６に処理を移し、「再生ポインタ」が設定されていなければ、図２のイベント取得ステップＳ１に処理を移す。
【００８３】
次に、再生ポーズフラグ設定判定ステップＳ３０６では、「再生ポーズフラグ」が設定されているか、すなわち録音データの再生が一時停止状態にあるか否かを判定し、「再生ポーズフラグ」が設定されていれば、図３の読み上げポインタ設定判定(再生) ステップＳ１０１に処理を移し、「再生ポーズフラグ」が設定されていなければ、再生ポーズフラグ設定ステップＳ３０７に処理を移す。
【００８４】
次に、再生ポーズフラグ設定ステップＳ３０７では、再生ポーズフラグを設定する(値を「真」にする)。次に、録音データ再生一時停止ステップＳ３０８で録音データの再生を一時停止し、図２のイベント取得ステップＳ１に処理を移す。
【００８５】
［「早送りボタンの押下」処理：図６］
次に、「早送りボタンの押下」処理について、図６を用いて説明する。
【００８６】
読み上げポインタ設定判定(早送り) ステップＳ４０１では、「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば早送り読み上げタイマモード設定ステップＳ４０２に処理を移し、「読み上げポインタ」が設定されていなければ、再生ポインタ設定判定(早送り) ステップＳ４０５に処理を移す。
【００８７】
次に、早送り読み上げタイマモード設定ステップＳ４０２では、タイマモードを「早送り読み上げ」に設定し、早送りイベントマスク設定ステップＳ４０３に処理を移す。タイマモードは、タイマの用途を示すものである。
【００８８】
次に、早送りイベントマスク設定ステップＳ４０３では、イベントマスクを早送り処理用に設定し、前記イベント取得ステップＳ１で取得するイベントを、以降、「早送りボタン解除」、「音声合成データ要求」、「録音再生データ要求」、「タイマイベント」のみに限定する。
【００８９】
次に、タイマ開始(早送り) ステップＳ４０４でタイマを開始し、所定の時間後にタイマイベントが発生するようにする。そして処理を図２のイベント取得ステップＳ１に移す。
【００９０】
再生ポインタ設定判定(早送り) ステップＳ４０５では、再生ポインタが設定されているか否かを判定し、再生ポインタが設定されていれば早送り再生タイマモード設定ステップＳ４０６に処理を移し、再生ポインタが設定されていなければ図２のイベント取得ステップＳ１に処理を移す。
【００９１】
早送り再生タイマモード設定ステップＳ４０６では、タイマモードを「早送り再生」に設定し、早送りイベントマスク設定ステップＳ４０３に処理を移す。
【００９２】
［「早送りボタンの解除」処理：図７］
次に、「早送りボタンの解除」処理について、図７を用いて説明する。
【００９３】
まず、イベントマスク解除(早送り) ステップＳ５０１では、イベントマスクを解除し、以降のイベント取得ステップＳ１で全てのイベントを取得するようにする。
【００９４】
次に、タイマモード初期化/タイマ停止(早送り) ステップＳ５０２でタイマモードを初期化した後、タイマを停止する。
【００９５】
次に、読み上げポインタ設定判定(早送り解除) ステップＳ５０３で「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば読み上げモード判定(早送り) ステップＳ５０４に処理を移し、「読み上げポインタ」が設定されていなければ、再生ポインタ設定判定(早送り解除)ステップＳ５１１に処理を移す。
【００９６】
読み上げモード判定(早送り) ステップＳ５０４では、読み上げモードが「早送り」であるか否かを判定し、「早送り」であるならば読み上げモード初期化(早送り) ステップＳ５０５に処理を移し、「早送り」でなければ音声合成停止(早送り) ステップＳ５０８に処理を移す。
【００９７】
次に、読み上げモード初期化(早送り) ステップＳ５０５では、読み上げモードを初期状態に戻す。次に、読み上げポインタ復帰(早送り) ステップＳ５０６で、図１４のステップＳ１２０７で作成された要約文に設定されている読み上げポインタを元文書中の対応する位置に設定する。
【００９８】
次に、要約文破棄ステップＳ５０７で要約文を破棄してから図２のイベント取得ステップＳ１に処理を移す。
【００９９】
次に、音声合成停止(早送り) ステップＳ５０８では、音声合成を停止する。次に、読み上げポインタ前向きスキップステップＳ５０９で、読み上げポインタを、現在読み上げている文の次の文の先頭に移す。次に音声合成開始(早送り) ステップＳ５１０で音声合成を開始し、イベント取得ステップＳ１に処理を移す。
【０１００】
一方、再生ポインタ設定判定(早送り解除) ステップＳ５１１では、「再生ポインタ」が設定されているか否かを判定し、「再生ポインタ」が設定されていれば録音再生モード判定(早送り) ステップＳ５１２に処理を移し、「再生ポインタ」が設定されていなければ、図２のイベント取得ステップＳ１に処理を移す。
【０１０１】
録音再生モード判定(早送り) ステップＳ５１２では、録音再生モードが「早送り」であるか否かを判定し、「早送り」であるならば録音再生モード初期化(早送り) ステップＳ５１３に処理を移し、「早送り」でなければ録音データ再生停止(早送り) ステップＳ５１４に処理を移す。
【０１０２】
次に、録音再生モード初期化(早送り) ステップＳ５１３では、録音再生モードを初期状態に戻してから図２のイベント取得ステップＳ１に処理を移す。次に、録音データ再生停止(早送り) ステップＳ５１４では、録音データの再生を停止する。次に、再生ポインタ前向きスキップステップＳ５１５で、再生ポインタを１インデックス分進める。例えば録音データが音楽データならば次曲の先頭に再生ポインタが移ることになる。
【０１０３】
次に、録音データ再生開始(早送り) ステップＳ５１６で録音データの再生を開始してから図２のイベント取得ステップＳ１に処理を移す。
【０１０４】
［「巻き戻しボタンの押下」処理：図８］
次に、「巻き戻しボタンの押下」処理について、図８を用いて説明する。
【０１０５】
まず、読み上げポインタ設定判定(巻き戻し) ステップＳ６０１では、「読み上げポインタが設定」されているか否かを判定し、「読み上げポインタが設定」されていれば巻き戻し読み上げタイマモード設定ステップＳ６０２に処理を移し、「読み上げポインタが設定」されていなければ再生ポインタ設定判定(巻き戻し) ステップＳ６０５に処理を移す。
【０１０６】
次に、巻き戻し読み上げタイマモード設定ステップＳ６０２では、タイマモードを「巻き戻し読み上げ」に設定し、巻き戻しイベントマスク設定ステップＳ６０３に処理を移す。
【０１０７】
次に、巻き戻しイベントマスク設定ステップＳ６０３では、イベントマスクを巻き戻し処理用に設定してから、図２のイベント取得ステップＳ１で取得するイベントを、以降、「巻き戻しボタン解除」、「音声合成データ要求」、「録音再生データ要求」、「タイマイベント」のみに限定する。
【０１０８】
次に、タイマ開始(巻き戻し) ステップＳ６０４でタイマを開始し、所定の時間後にタイマイベントが発生するようにしてから、処理を図２のイベント取得ステップＳ１に移す。
【０１０９】
次に、再生ポインタ設定判定(巻き戻し) ステップＳ６０５では、「再生ポインタ」が設定されているか否かを判定し、「再生ポインタ」が設定されていれば巻き戻し再生タイマモード設定ステップＳ６０６に処理を移し、「再生ポインタ」が設定されていなければ、図２のイベント取得ステップＳ１に処理を移す。
【０１１０】
次に、巻き戻し再生タイマモード設定ステップＳ６０６では、タイマモードを「巻き戻し再生」に設定し、巻き戻しイベントマスク設定ステップＳ６０３に処理を移す。
【０１１１】
［「巻き戻しボタンの解除」処理：図９］
次に、「巻き戻しボタンの解除」処理について、図９を用いて説明する。
【０１１２】
まず、イベントマスク解除(巻き戻し) ステップＳ７０１では、イベントマスクを解除し、以降のイベント取得ステップＳ１で全てのイベントを取得するようにする。
【０１１３】
次に、タイマモード初期化/タイマ停止(巻き戻し) ステップＳ７０２でタイマモードを初期化した後、タイマを停止する。
【０１１４】
次に、読み上げポインタ設定判定(巻き戻し解除) ステップＳ７０３で「読み上げポインタ」が設定されているか否かを判定し、「読み上げポインタ」が設定されていれば読み上げモード判定(巻き戻し) ステップＳ７０４に処理を移し、「読み上げポインタ」が設定されていなければ再生ポインタ設定判定(巻き戻し解除) ステップＳ７１１に処理を移す。
【０１１５】
次に、読み上げモード判定(巻き戻し) ステップＳ７０４では、読み上げモードが「巻き戻し」であるか否かを判定し、「巻き戻し」であるならば読み上げモード初期化(巻き戻し) ステップＳ７０５に処理を移し、「巻き戻し」でなければ音声合成停止(巻き戻し) ステップＳ７０８に処理を移す。
【０１１６】
次に、読み上げモード初期化(巻き戻し) ステップＳ７０５では、読み上げモードを初期状態に戻す。次に、読み上げポインタ復帰(巻き戻し) ステップＳ７０６で、図１４のステップＳ１２０４で作成された先頭文節リストに設定されている読み上げポインタを元文書中の対応する位置に設定する。(ステップＳ１２０５で作成された情報を用いる。)
次に、先頭文節リスト破棄ステップＳ７０７で先頭文節リストを破棄してから図２のイベント取得ステップＳ１に処理を移す。
【０１１７】
次に、音声合成停止(巻き戻し) ステップＳ７０８では、音声合成を停止する。次に、読み上げポインタ後向きスキップステップＳ７０９で、読み上げポインタを、現在読み上げている文の前の文の先頭に移す。
【０１１８】
次に音声合成開始(巻き戻し) ステップＳ７１０で音声合成を開始してから図２のイベント取得ステップＳ１に処理を移す。
【０１１９】
次に、再生ポインタ設定判定(巻き戻し解除) ステップＳ７１１では、「再生ポインタ」が設定されているか否かを判定し、「再生ポインタ」が設定されていれば録音再生モード判定(巻き戻し) ステップＳ７１２に処理を移し、「再生ポインタ」が設定されていなければイベント取得ステップＳ１に処理を移す。
【０１２０】
次に、録音再生モード判定(巻き戻し) ステップＳ７１２では、録音再生モードが「巻き戻し」であるか否かを判定し、「巻き戻し」であるならば録音再生モード初期化(巻き戻し) ステップＳ７１３に処理を移し、「巻き戻し」でなければ録音データ再生停止(巻き戻し) ステップＳ７１４に処理を移す。
【０１２１】
次に、録音再生モード初期化(巻き戻し) ステップＳ７１３では、録音再生モードを初期状態に戻し、イベント取得ステップＳ１に処理を移す。
【０１２２】
次に、録音データ再生停止(巻き戻し) ステップＳ７１４では、録音データの再生を停止する。次に、再生ポインタ後向きスキップステップＳ７１５で、再生ポインタを１インデックス分戻す。例えば、録音データが音楽データで再生ポインタとインデックスが重なっていない状態ならば、現在の曲の先頭に再生ポインタが移ることになる。
【０１２３】
次に、録音データ再生開始(巻き戻し) ステップＳ７１６で録音データの再生を開始してから図２のイベント取得ステップＳ１に処理を移す。
【０１２４】
［「新規情報の到着」処理：図１０］
次に、「新規情報の到着」処理について、図１０を用いて説明する。
【０１２５】
まず、優先読み上げ文存在判定(新着) ステップＳ８０１では、優先読み上げ文が存在するか否かを判定し、優先読み上げ文が存在すれば新着読み上げ文追加ステップＳ８０７に処理を移し、優先読み上げ文が存在しなければ新着通知メッセージコピーステップＳ８０２に処理を移す。
【０１２６】
次に、新着通知メッセージコピーステップＳ８０２では、優先読み上げ文の先頭に新着通知メッセージをコピーする。新着通知メッセージの例を図１７に示す。
【０１２７】
次に、新着読み上げ文コピーステップＳ８０３で、新たに到着した読み上げ文を優先読み上げ文中、新着通知メッセージの後ろにコピーする。
【０１２８】
次に、読み上げポインタ設定判定(新着) ステップＳ８０４で読み上げポインタが設定されているか否かを判定し、読み上げポインタが設定されていれば読み上げポインタバックアップ作成(新着) ステップＳ８０５に処理を移し、読み上げポインタが設定されていなければステップＳ１０１に処理を移す。
【０１２９】
次に、読み上げポインタバックアップ作成(新着) ステップＳ８０５では、現在の読み上げポインタの値を、優先読み上げ文に対する付帯情報として保持する。
【０１３０】
次に、新着読み上げ文読み上げポインタ設定ステップＳ８０６において、読み上げポインタを優先読み上げ文の先頭に設定し、イベント取得ステップＳ１に処理を移す。
【０１３１】
次に、新着読み上げ文追加ステップＳ８０７では、新着読み上げ文を優先読み上げ文の末尾にコピーしてから図２のイベント取得ステップＳ１に処理を移す。
【０１３２】
［「蓄積情報の読み上げ指示」処理：図１１］
次に、「蓄積情報の読み上げ指示」処理について、図１１を用いて説明する。
【０１３３】
読み上げポインタ設定判定(蓄積情報読み上げ) ステップＳ９０１では、「読み上げポインタが設定」されているか否かを判定し、「読み上げポインタが設定」されていれば読み上げ中警告表示ステップＳ９０５に処理を移し、「読み上げポインタが設定」されていなければ蓄積読み上げ文コピーステップＳ９０２に処理を移す。
【０１３４】
次に、蓄積読み上げ文コピーステップＳ９０２では、外部の記憶部Ｈ５に記憶された蓄積情報から蓄積情報読み上げ指示判定ステップＳ１０で指示された情報を蓄積読み上げ文にコピーする。
【０１３５】
次に、優先読み上げ文存在判定(蓄積情報読み上げ) ステップＳ９０３で「優先読み上げ文が存在」するか否かを判定し、「優先読み上げ文が存在」すれば読み上げポインタバックアップ設定ステップＳ９０４に処理を移し、「優先読み上げ文が存在」しなければイベント取得ステップＳ１に処理を移す。
【０１３６】
次に、読み上げポインタバックアップ設定ステップＳ９０４では、蓄積読み上げ文の先頭を優先読み上げ文に対する付帯情報として設定してから図２のイベント取得ステップＳ１に処理を移す。
【０１３７】
次に、読み上げ中警告表示ステップＳ９０５では、現在読み上げ中である旨の警告を出力してから図２のイベント取得ステップＳ１に処理を移す。
【０１３８】
［「音声合成の要求指示」処理：図１２］
次に、「音声合成の要求指示」処理について、図１２を用いて説明する。
【０１３９】
まず、合成音声データ存在判定ステップＳ１００１では、テキストから音声波形に変換された「波形データ」がすでに存在するか否かを判定し、「波形データ」が存在すれば合成音声データコピーステップＳ１００７に処理を移し、「波形データ」が存在しなければ、読み上げポインタ設定判定(音声出力) ステップＳ１００２に処理を移す。
【０１４０】
読み上げポインタ設定判定(音声出力) ステップＳ１００２では、「読み上げポインタが設定」されているか否かを判定し、「読み上げポインタが設定」されていれば文書データ終了判定ステップＳ１００３に処理を移し、「読み上げポインタが設定」されていなければ図２のイベント取得ステップＳ１に処理を移す。
【０１４１】
次に、文書データ終了判定ステップＳ１００３では、「読み上げポインタが文書データの終端」に達しているか判定し、「読み上げポインタが文書データの終端」に達しているならば、読み上げポインタバックアップ存在判定ステップＳ１００８に処理を移し、「読み上げポインタが文書データの終端」に達していなければ文書データ切り出しステップＳ１００４に処理を移す。
【０１４２】
文書データ切り出しステップＳ１００４では、文書データより所定量(例えば１文) のデータを切り出す。次に、合成音声データ生成ステップＳ１００５で、前記切り出されたデータに対して音声合成の処理を行ない、合成音声データを得る。
【０１４３】
次に、読み上げポインタ移動ステップＳ１００６で、文書データ切り出しステップＳ１００４で切り出した分だけ読み上げポインタを移動し、合成音声データコピーステップＳ１００７に処理を移す。
【０１４４】
合成音声データコピーステップＳ１００７では、合成音声データから所定量(合成音声出力デバイスのバッファサイズ) のデータを合成音声出力デバイスに出力し、イベント取得ステップＳ１に処理を移す。
【０１４５】
読み上げポインタバックアップ存在判定ステップＳ１００８では、文書データの付帯情報として「読み上げポインタのバックアップが存在」するか否かを判定し、「読み上げポインタのバックアップが存在」するならば、読み上げポインタバックアップ復帰ステップＳ１００９に処理を移し、「読み上げポインタのバックアップが存在」しないならば、読み上げポインタ解除ステップＳ１０１０に処理を移す。
【０１４６】
次に、読み上げポインタバックアップ復帰ステップＳ１００９では、文書データに付帯する読み上げポインタのバックアップを読み上げポインタに設定し、前記文書データ終了判定ステップＳ１００３に処理を移す。
【０１４７】
次に、読み上げポインタ解除ステップＳ１０１０では、読み上げポインタを解除する(非設定にする)。そして、イベント取得ステップＳ１に処理を移す。
【０１４８】
次に、再生ポインタ設定判定(録音再生) ステップＳ１１０１では、「再生ポインタが設定」されているか否かを判定し、「再生ポインタが設定」されていれば録音再生モード判定(巻き戻し２) ステップＳ１１０２に処理を移し、「再生ポインタが設定」されていなければイベント取得ステップＳ１に処理を移す。
【０１４９】
［「録音再生の要求指示」処理：図１３］
次に、「録音再生の要求指示」処理について、図１３を用いて説明する。
【０１５０】
まず、録音再生モード判定(巻き戻し２) ステップＳ１１０２では、録音再生モードが「巻き戻し」であるか否かを判定し、「巻き戻し」であるならば再生ポインタ先頭判定ステップＳ１１０９に処理を移し、「巻き戻し」でないならば再生ポインタ終端判定ステップＳ１１０３に処理を移す。
【０１５１】
次に、再生ポインタ終端判定ステップＳ１１０３では、「再生ポインタが録音データの終端(最後)」であるか安定し、「再生ポインタが録音データの終端」であるならば再生ポインタ解除ステップＳ１１０４に処理を移し、「再生ポインタが録音データの終端」でないならば録音データコピーステップＳ１１０５に処理を移す。
【０１５２】
次に、再生ポインタ解除ステップＳ１１０４では、再生ポインタを解除してからイベント取得ステップＳ１に処理を移す。
【０１５３】
次に、録音データコピーステップＳ１１０５では、録音データから所定量(録音データ出力デバイスのバッファサイズ) のデータを録音データ出力デバイスに出力し、録音再生モード判定(早送り２) ステップＳ１１０６に処理を移す。
【０１５４】
次に、録音再生モード判定(早送り２) ステップＳ１１０６では、「録音再生モードが早送り」であるか否かを判定し、「録音再生モードが早送り」であるならば再生ポインタ早送り移動ステップＳ１１０７に処理を移し、「録音再生モードが早送り」でないならば再生ポインタ移動ステップＳ１１０８に処理を移す。
【０１５５】
次に、再生ポインタ早送り移動ステップＳ１１０７では、再生ポインタを録音データコピーステップＳ１１０５で出力したよりも多く(例えば、前記所定量の１０倍) 進めてから図２のイベント取得ステップＳ１に処理を移す。
【０１５６】
次に、再生ポインタ移動ステップＳ１１０８では、再生ポインタを録音データコピーステップＳ１１０５で出力した分だけ進め、イベント取得ステップＳ１に処理を移す。
【０１５７】
次に、再生ポインタ先頭判定ステップＳ１１０９では、「再生ポインタが録音データの先頭」を指しているか否かを判定し、「再生ポインタが録音データの先頭」であるならばイベント取得ステップＳ１に処理を移し、「再生ポインタが録音データの先頭」でないならば、録音データ逆順コピーステップＳ１１１０に処理を進める。
【０１５８】
録音データ逆順コピーステップＳ１１１０では、前記録音データコピーステップＳ１１０５と同様に所定量(録音データ出力デバイスのバッファサイズ) のデータを録音データ出力デバイスに出力するが、データの順を逆にして出力する。
【０１５９】
次に、再生ポインタ巻き戻し移動ステップＳ１１１１で、再生ポインタを再生時の逆方向に移動してから図２のイベント取得ステップＳ１に処理を移す。
【０１６０】
［「タイマイベント」処理：図１４］
次に、「タイマイベント」処理について、図１４を用いて説明する。
【０１６１】
まず、タイマ停止ステップＳ１２０１では、タイマを停止する。
【０１６２】
次に、タイマモード判定(早送り読み上げ) ステップＳ１２０２でタイマモードが「早送り読み上げ」であるか否かを判定し、「早送り読み上げ」であるならば要約文作成ステップＳ１２０７に処理を移し、「早送り読み上げ」でないならばタイマモード判定(巻き戻し読み上げ) ステップＳ１２０３に処理を移す。
【０１６３】
次に、タイマモード判定(巻き戻し読み上げ) ステップＳ１２０３では、タイマモードが「巻き戻し読み上げ」であるか否かを判定し、「巻き戻し読み上げ」であるならば先頭文節リスト作成ステップＳ１２０４に処理を移し、「巻き戻し読み上げ」でないならば、タイマモード判定(早送り再生) ステップＳ１２１０に処理を移す。
【０１６４】
先頭文節リスト作成ステップＳ１２０４では、読み上げポインタが示す文書の先頭から読み上げポインタまでに存在する各文の先頭の文節のリストを作成する。一例を図１８に示す。図１８中の（ａ）が元文書であり、（ｂ）が作成された文節リストのイメージである。なお読み上げポインタの位置は、読み上げられた文書の末尾に読み上げポインタの位置がくるように設定されており、文書が読み上げられるとその読み上げに同期して読み上げポインタの位置が移動する。
【０１６５】
次に、巻き戻し読み上げポインタバックアップ作成ステップＳ１２０５で、巻き戻しモードから復帰する際に読み上げポインタを移す対応点を作成する。図１８では、先頭文節リストと元文書を結ぶ矢印の関係が対応点である。
【０１６６】
次に、巻き戻し読み上げモード設定ステップＳ１２０６で、読み上げモードを巻き戻しに設定してから図２のイベント取得ステップＳ１に処理を移す。
【０１６７】
次に、要約文作成ステップＳ１２０７では、読み上げポインタが示す場所から文書の末尾までの要約文を作成する。一例を図１９に示す。図１９中の（ａ）が元文書、（ｂ）が作成された要約文のイメージである。なお読み上げポインタの位置は、読み上げられた文書の末尾（すなわち未読部分の先頭）に読み上げポインタの位置がくるように設定されており、文書が読み上げられるとその読み上げに同期して読み上げポインタの位置は移動する。
【０１６８】
次に、早送り読み上げポインタバックアップ作成ステップＳ１２０８で、早送りモードから復帰する際に読み上げポインタを移す対応点を作成する。図１９では、要約文と元文書を結ぶ矢印の関係が対応点である。但し、図１９では、煩雑さを避けるため全てを図示していない。
【０１６９】
次に、早送り読み上げモード設定ステップＳ１２０９で、読み上げモードを早送りに設定してから図２のイベント取得ステップＳ１に処理を移す。
【０１７０】
次に、タイマモード判定(早送り再生) ステップＳ１２１０では、タイマモードが「早送り再生」であるか否かを判定し、「早送り再生」であるならば早送り録音再生モード設定ステップＳ１２１１に処理を移し、「早送り再生」でないならば巻き戻し録音再生モード設定ステップＳ１２１２に処理を移す。
【０１７１】
次に、早送り録音再生モード設定ステップＳ１２１１では、録音再生モードを早送りに設定し、イベント取得ステップＳ１に処理を移す。
【０１７２】
次に、巻き戻し録音再生モード設定ステップＳ１２１２では、録音再生モードを巻き戻しに設定してから図２のイベント取得ステップＳ１に処理を移す。
【０１７３】
［「音声合成」の各処理：図１５Ａ〜１５Ｄ］
次に、「音声合成」の各処理について、図１５Ａ〜１５Ｄを用いて説明する。
【０１７４】
図１５Ａ〜１５Ｄは、「音声合成開始」、「音声合成停止」、「音声合成一時停止」、「音声合成再開」の各ルーチンでの処理を示す。
【０１７５】
まず合成音声出力デバイス設定ステップＳ１３０１では、サンプリングレートの設定などの合成音声出力デバイスの初期設定を行なう。
【０１７６】
次に、合成音声出力デバイス起動ステップＳ１３０２では、合成音声出力デバイスを起動し、合成音声出力の動作を開始する。
【０１７７】
次に、合成音声データクリアステップＳ１３０３では、合成音声データ生成ステップＳ１００５で生成され、保持された合成音声データを消去する。
【０１７８】
次に、合成音声出力デバイス停止ステップＳ１３０４では、合成音声出力デバイスを停止する。
【０１７９】
次に、合成音声出力デバイス一時停止ステップＳ１３０５では、合成音声出力デバイスを一時停止する。
【０１８０】
次に、合成音声出力デバイス再開ステップＳ１３０６では、合成音声出力デバイス一時停止ステップＳ１３０５で一時停止した合成音声出力デバイスの動作を再開する。
【０１８１】
［「録音データ再生」の各処理：図１６Ａ〜１６Ｄ］
次に、「録音データ再生」の各処理について、図１６Ａ〜１６Ｄを用いて説明する。図１６Ａ〜１６Ｄは、「録音データ再生開始」、「録音データ再生停止」、「録音データ再生一時停止」、「録音データ再生再開」の各ルーチンでの処理を示す。
【０１８２】
まず、録音データ出力デバイス設定ステップＳ１４０１では、サンプリングレートの設定などの録音データ出力デバイスの初期設定を行なう。
【０１８３】
次に、録音データ出力デバイス起動ステップＳ１４０２では、録音データ出力デバイスを起動し、録音データ出力の動作を開始する。
【０１８４】
次に、録音データ出力デバイス停止ステップＳ１４０３では、録音データ出力デバイスを停止する。
【０１８５】
次に、録音データ出力デバイス一時停止ステップＳ１４０４では、録音データ出力デバイスを一時停止する。
【０１８６】
次に、録音データ出力デバイス再開ステップＳ１４０５では、録音データ出力デバイス一時停止ステップＳ１４０４で一時停止した録音データ出力デバイスの動作を再開する。
【０１８７】
なお上記説明した第１の実施形態は一例であり、例えば、先頭文節リスト作成ステップＳ１２０４において、先頭文節リストは先頭１文節からのみなるリストとなっているが、これは必ずしも先頭１文節である必要はなく、例えばユーザ設定による複数の文節であっても良い。
【０１８８】
また、要約文作成ステップＳ１２０７に示された要約文の例は、各文の主要部分が抽出された一例であるが必ずしも文毎に要約を行なう必要はなく、例えば、情報の少ない文は全て省略しても良い。
【０１８９】
また要約文作成ステップＳ１２０７の代わりに、早送りの場合には、図２８に示すように先頭文節リストを作成し、作成した先頭文節リストの先頭の「以下」から「Ｈ４は」まで順次先頭から読み出すようにしても良い。
【０１９０】
また巻き戻しの場合に要約を用いる場合には、図２９に一例を示す要約を用いればよい。
【０１９１】
また上記テキストデータのうちで音声合成により読み上げられなかった部分に対して省略する旨を示すビープ音などの音響出力をするようにしてもよい。
【０１９２】
さらに、先頭文節リスト作成ステップＳ１２０４および要約文作成ステップＳ１２０７は、ともに巻き戻し/早送りボタンの解放イベントが取得されてから実行されているが、新着読み上げ文コピーステップＳ８０３、新着読み上げ文追加ステップＳ８０７および蓄積読み上げ文コピーステップＳ９０２の各ステップの後に実行しても良い。これにより、巻き戻し/早送りボタンの解放からのレスポンスタイムが短くなる。
【０１９３】
＜第２の実施形態＞
次に、第２の実施形態について説明する。
【０１９４】
［ハードウェア構成：図２１］
図２１は、第２の実施形態である携帯可能な情報端末Ｈ１２００のハードウェア構成を説明する図である。
【０１９５】
Ｈ１１は、数値演算・制御等の処理を行なう中央処部であり、本発明の処理手順を記載した制御プログラムに従って演算を行なう。Ｈ１２は、ユーザに対して情報を提示する出力部である。出力部Ｈ１２は、スピーカ・ヘッドフォン等の音声出力部Ｈ１２０１および液晶などの画面表示部Ｈ１２０２を含む。
【０１９６】
Ｈ１３は、ユーザが情報端末Ｈ１２００に対して動作の指示を与え、あるいは情報を入力する入力部である。Ｈ１４は、ＬＡＮカードやＰＨＳカード等のデータ通信部であり、新着メール等のデータの取得に用いられる。Ｈ１５はハードディスクや不揮発メモリ等の記憶部であり、録音されたデータや蓄積情報が保持される。
【０１９７】
Ｈ１６は読み取り専用の記憶部であり、本発明の手順を示す制御プログラムや音声合成用辞書等の固定的データが格納される。Ｈ１７はＲＡＭ等の一時情報を保持するの記憶部であり、一時的なデータや各種フラグ等が保持される。
【０１９８】
Ｈ１８は角度に応じた値を出力する角度検出部であり、ダイアル部Ｈ１９の操作量を検出する。Ｈ１９はユーザが操作できるダイアル部であり、角度検出部Ｈ１８に接続されている。上記中央処部Ｈ１〜角度検出部Ｈ１８は、バスで接続されている。
【０１９９】
［情報端末のハードウェア構成の外観：図２７］
図２７は、図２１で説明した携帯可能な情報端末Ｈ１２００のハードウェア構成の外観イメージの一例を示したスケッチ図である。
【０２００】
［イベント処理の概要：図２２］
以上説明した第２の実施形態の情報端末Ｈ１２００におけるイベント処理について、図２２〜図２４に示すフローチャートを用いて説明する。なお下記の処理は、中央処部Ｈ１１により読み取り専用の記憶部Ｈ１６等に格納されているイベント駆動型の制御プログラムに基づいて、ＲＡＭ等の一時情報を保持する記憶部Ｈ１７を用いて実行されるものである。また入力部Ｈ１３からの入力処理や出力部Ｈ１２からのデータ要求、タイマ割り込み等の割り込みは、上記制御プログラムにおける各イベントの開始として処理される。
【０２０１】
まず、図２２の変数初期設定ステップＳ１５０１で、各変数を初期値に設定する。
【０２０２】
次に、音声合成デバイス開始／一時停止ステップＳ１５０２で、音声合成デバイスを一時停止状態にする。
【０２０３】
次に、イベント取得ステップＳ１５０３で、新たなイベントを取得する。
【０２０４】
次に、ダイアル角度変更判定ステップＳ１５０４で、イベント取得ステップＳ１５０３で取得したイベントが「ダイアルの角度変更」によるものか否かを判定し、「ダイアルの角度変更」であればステップＳ１６０１に処理を移し、「ダイアルの角度変更」でなければ音声合成データ要求判定ステップＳ１５０５に処理を移す。
【０２０５】
次に、音声合成要求判定ステップＳ１５０５では、イベント取得ステップＳ１５０３で取得したイベントが「合成音声出力デバイスからのデータ要求」であるか否かを判定し、「合成音声出力デバイスからのデータ要求」であるならばステップＳ１７０１に処理を移し、「合成音声出力デバイスからのデータ要求」でなければ、イベント取得ステップＳ１５０３に処理を戻す。
【０２０６】
［「ダイアル角度変更」処理：図２３］
次に、上記説明した各イベントの処理について、以下詳細に説明する。
【０２０７】
まず「ダイアル角度変更」処理について図２３を用いて説明する。
【０２０８】
新ダイアル角度判定ステップＳ１６０１では、新たなダイアル角度が「０」であるか否かを判定し、「０」であるならば合成音声出力デバイス一時停止ステップＳ１６０５に処理を移し、新たなダイアル角度が「０」でなければダイアル角度変数判定ステップＳ１６０２に処理を移す。
【０２０９】
ダイアル角度変数判定ステップＳ１６０２では、ダイアル角度変数に保持された以前のダイアル角度が「０」か否かを判定し、ダイアル角度変数に保持された以前のダイアル角度が「０」ならば合成音声出力デバイス再開ステップＳ１６０６に処理を移し、ダイアル角度変数に保持された以前のダイアル角度が「０」でないならばダイアル角度変数更新ステップＳ１６０３に処理を移す。
【０２１０】
ダイアル角度変数更新ステップＳ１６０３では、ダイアル角度変数に新たなダイアル角度を代入する。
【０２１１】
次に、読み上げスキップ数設定ステップＳ１６０４で、ダイアル角度変数の値に応じて読み上げスキップ数を設定する。読み上げスキップ数は、ダイアル角度の絶対値が大きいほどスキップ数の絶対値が大きくなるように、また、ダイアル角度の符号とスキップ数の符号が同じになるように設定する。ダイアル角度(単位角度をθとする) とスキップ数の対応表の例を図２５に示す。スキップ数設定後、処理をイベント取得ステップＳ１５０３に移す。
【０２１２】
次に、合成音声出力デバイス一時停止ステップＳ１６０５では、音声合成出力デバイスを一時停止、イベント取得ステップＳ１５０３に処理を移す。
【０２１３】
次に、合成音声出力デバイス再開ステップＳ１６０６では、合成音声出力デバイス一時停止ステップＳ１６０５で一時停止した合成音声出力デバイスを再開し、ダイアル角度変数更新ステップＳ１６０３に処理を移す。
【０２１４】
［「音声合成指示」処理：図２４］
次に、「音声合成指示」処理について図２３を用いて説明する。
【０２１５】
まず、合成音声データ終了判定ステップＳ１７０１では、「文節カウンタが文節数と等しい」か否かを判定し、「文節カウンタが文節数と等しい」ならば文書データ切り出しステップＳ１７０９に処理を移し、「文節カウンタが文節数と等し」くなければダイアル角度絶対値判定ステップＳ１７０２に処理を移す。文節数は、直前に実行された合成音声データ生成ステップＳ１７１０の対象となった文に含まれる文節の数であり、文節カウンタが文節数と等しい状態は、ステップＳ１７１０で得られた合成音声データを全て出力し終ったことを示す。
【０２１６】
次に、ダイアル角度絶対値判定ステップＳ１７０２では、ダイアル角度変数に保持されたダイアル角度の絶対値が「１」より大きいか判定し、ダイアル角度の絶対値が「１」より大ならば読み上げ対象文更新ステップＳ１７１７に処理を移し、ダイアル角度の絶対値が「１」より大でなければ読み上げポインタ判定ステップＳ１７０３に処理を移す。
【０２１７】
次に、読み上げポインタ判定ステップＳ１７０３では、「読み上げポインタが読み上げ対象文と等しい」か否かを判定し、「読み上げポインタが読み上げ対象文と等し」ければ文節カウンタ判定ステップＳ１７０４に処理を移し、「読み上げポインタが読み上げ対象文と等し」くなければ音声合成デバイス停止ステップＳ１７０５に処理を移す。
【０２１８】
次に、文節カウンタ判定ステップＳ１７０４では、文節カウンタが「０」か判定し、文節カウンタが「０」ならば読み上げ対象文更新ステップＳ１７１７に処理を移し、文節カウンタが「０」でないならば音声合成デバイス停止ステップＳ１７０５に処理を移す。
【０２１９】
次に、音声合成デバイス停止ステップＳ１７０５では、音声合成デバイスを停止する。次に、ビープ音出力ステップＳ１７０６でビープ音を出力する。次に、音声合成デバイス開始(２) ステップＳ１７０７で音声合成デバイスを開始する。
【０２２０】
次に、文節カウンタ更新ステップＳ１７０８で文節カウンタに「１」を加え、イベント取得ステップＳ１５０３に処理を移す。
【０２２１】
次に、文書データ切り出しステップＳ１７０９では、読み上げ対象文書から、読み上げポインタを先頭に１文分のデータを切り出す。
【０２２２】
次に、合成音声データ生成ステップＳ１７１０で、文書データ切り出しステップＳ１７０９で切り出した文に対して音声合成を行ない、合成音声データを得る。
【０２２３】
次に、文節数計算ステップＳ１７１１で、文書データ切り出しステップＳ１７０９で切り出した文に含まれる文節数を計算する。
【０２２４】
次に、同期点生成ステップＳ１７１２で、合成音声データ生成ステップＳ１７１０で生成された合成音声と、文書データ切り出しステップＳ１７０９で切り出した文に含まれる各文節の対応関係を求め同期点として保持する。同期点の例を図２６に示す。
【０２２５】
次に、文節カウンタ初期化ステップＳ１７１３で、文節カウンタを「０」にする。
【０２２６】
次に、ダイアル角度正負判定ステップＳ１７１４で、ダイアル角度変数に保持されたダイアル角度が「正」か否かを判定し、「正」ならば読み上げポインタインクリメントステップＳ１７１５に処理を移し、「正」でないならば読み上げポインタデクリメントステップＳ１７１６に処理を移す。
【０２２７】
次に、読み上げポインタインクリメントステップＳ１７１５では、読み上げポインタに１を加え、処理をダイアル角度絶対値判定ステップＳ１７０２に処理を移す。
【０２２８】
次に、読み上げポインタデクリメントステップＳ１７１６では、読み上げポインタから１を減じ、処理をダイアル角度絶対値判定ステップＳ１７０２に処理を移す。
【０２２９】
次に、読み上げ対象文更新ステップＳ１７１７では、読み上げ対象文を、読み上げポインタと読み上げスキップ数設定ステップＳ１６０４で設定したスキップ数の和に設定する。
【０２３０】
次に、合成音声データコピーステップＳ１７１８で、合成音声データ生成ステップＳ１００５で生成した合成音声を１文節分音声合成デバイスのバッファににコピーする。コピーする範囲は、現在の文節カウンタに対応する同期点から１文節分である。データコピーの後、文節カウンタ更新ステップＳ１７０８に処理を進める。
【０２３１】
なお上記説明した第２の実施形態は一例であり、例えば、読み上げスキップ数設定ステップＳ１６０４では、読み上げスキップ数をダイアル角度変数の値に応じた一定の文数としたが、ダイアル角が大きいときは次段落までスキップするようにすることも可能である。これは、読み上げポインタから次段落先頭の文までの文数を数えることで実現できる。また、ダイアル角度が小さいときには一ないし複数の文節をスキップすることも可能である。
【０２３２】
さらに第２の実施形態では、早送り・巻き戻し中のビープ音の回数が、スキップした文節の個数と同じであったが、必ずしも同じである必要はない。また、第２の実施形態では、早送り・巻き戻しを単一のビープ音で表していたが、早送り・巻き戻しの種別あるいはダイアル角度に応じて異なるビープ音あるいは異なる信号で発声しても良い。
【０２３３】
またさらに、第２の実施形態に対して、第１の実施形態で用いた要約による早送りを適用することも可能である。この場合、読み上げスキップ数設定ステップＳ１６０４で設定されたスキップ数に対応して要約の圧縮率を変更することができる。
【０２３４】
【他の実施形態】
なお、本発明は、複数の機器（例えばホストコンピュータ、インタフェイス機器、リーダ、プリンタなど）から構成されるシステムに適用しても、一つの機器からなる装置（例えば、複写機、ファクシミリ装置など）に適用してもよい。
【０２３５】
また、本発明の目的は、前述した実施形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体（または記録媒体）を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはＣＰＵやＭＰＵ）が記憶媒体に格納されたプログラムコードを読み出し実行することによっても、達成されることは言うまでもない。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。また、コンピュータが読み出したプログラムコードを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼働しているオペレーティングシステム（ＯＳ）などが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。
【０２３６】
さらに、記憶媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張カードやコンピュータに接続された機能拡張ユニットに備わるメモリに書込まれた後、そのプログラムコードの指示に基づき、その機能拡張カードや機能拡張ユニットに備わるＣＰＵなどが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。
【０２３７】
本発明を上記記憶媒体に適用する場合、その記憶媒体には、先に説明した（図２〜１６、２２〜２４に示す）フローチャートに対応するプログラムコードが格納されることになる。
【０２３８】
以上説明したように、本発明によれば、少ない操作ボタンでユーザが指示を与えることが可能となり、結果的に、端末全体のサイズが小さくなる。また、部品点数が減り製造コストが現象する。また、早送り・巻き戻し中にユーザが必要とする情報の提供が可能となり、利便性が向上する。
【０２３９】
【発明の効果】
以上説明したように、本発明によれば、各種の情報処理を行う際に、操作ボタンなどの部品点数が増えることによる製造コストの上昇を抑制することができるとともに複数の情報処理対象を該操作ボタンで相互に移行でき利便性よく上記の各種の情報処理を行うことができる携帯可能な情報端末を提供することができる。
【図面の簡単な説明】
【図１】本発明の第１の実施形態に係るハードウェア構成を示したブロック図である。
【図２】本発明の第１の実施形態に係るイベント処理の全体を説明するフローチャートである。
【図３】再生ボタンが押下されたときの処理を説明するフローチャートである。
【図４】停止ボタンが押下されたときの処理を説明するフローチャートである。
【図５】一時停止ボタンが押下されたときの処理を説明するフローチャートである。
【図６】早送りボタンが押下されたときの処理を説明するフローチャートである。
【図７】早送りボタンが解除されたときの処理を説明するフローチャートである。
【図８】巻き戻しボタンが押下されたときの処理を説明するフローチャートである。
【図９】巻き戻しボタンが解除されたときの処理を説明するフローチャートである。
【図１０】新規情報が到着したときの処理を説明するフローチャートである。
【図１１】蓄積情報読み上げを指示されたときの処理を説明するフローチャートである。
【図１２】音声合成を指示されたときの処理を説明するフローチャートである。
【図１３】録音再生を指示されたときの処理を説明するフローチャートである。
【図１４】タイマイベント処理を説明するフローチャートである。
【図１５Ａ】音声合成開始処理を説明するフローチャートである。
【図１５Ｂ】音声合成停止処理を説明するフローチャートである。
【図１５Ｃ】音声合成一時停止処理を説明するフローチャートである。
【図１５Ｄ】音声合成再開処理を説明するフローチャートである。
【図１６Ａ】録音データ再生開始処理を説明するフローチャートである。
【図１６Ｂ】録音データ再生停止処理を説明するフローチャートである。
【図１６Ｃ】録音データ再生一時停止処理を説明するフローチャートである。
【図１６Ｄ】録音データ再生再開処理を説明するフローチャートである。
【図１７】新着通知メッセージの一例を説明する図である。
【図１８】先頭文節リストのイメージの一例を説明する図である。
【図１９】要約文のイメージの一例を説明する図である。
【図２０】本発明に係る第１の実施形態のハードウェア構成のイメージを示したスケッチ図である。
【図２１】本発明の第２の実施形態に係るハードウェア構成を示したブロック図である。
【図２２】本発明の第２の実施形態に係るイベント処理の全体を説明するフローチャートである。
【図２３】ダイヤル角度が変更されたときの処理を説明するフローチャートである。
【図２４】音声合成が要求されたきの処理を説明するフローチャートである。
【図２５】ダイアル角度と読み上げスキップ数の対応を説明する図である。
【図２６】同期点の一例を説明する図である。
【図２７】本発明に係る第２の実施形態のハードウェア構成のイメージを示したスケッチ図である。
【図２８】早送りの場合の先頭文節リストのイメージの一例を説明する図である。
【図２９】巻き戻しの際の要約の一例を示す図である。
【符号の説明】
Ｈ１数値演算・制御処理を行なう中央処部
Ｈ２ユーザに情報提示する出力部
Ｈ２０１音声出力部（スピーカ・ヘッドフォン等）
Ｈ２０２画面表示部（液晶など）
Ｈ３入力部（ユーザによる動作指示／情報入力）
Ｈ３０１再生ボタン（入力部）
Ｈ３０２停止ボタン（入力部）
Ｈ３０３一時停止ボタン（入力部）
Ｈ３０４早送りボタン（入力部）
Ｈ３０５巻き戻しボタン（入力部）
Ｈ３０６タッチパネル
Ｈ４データ通信部（ＬＡＮカードやＰＨＳカード、新着メール等の取得）
Ｈ５記憶部（ハードディスクや不揮発メモリ、録音データの保持）
Ｈ６読み取り専用記憶部（ＲＯＭ、プログラム、音声合成用辞書等の保持）
Ｈ７一時情報を保持する記憶部（ＲＡＭ等、一時的なデータ等の保持）
Ｈ８インタバルタイマ部（中央処部への割り込み処理）[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an information processing apparatus, an information processing method, a control program, and a computer-readable storage medium, and more particularly to an information processing apparatus, information processing method, control program, and computer-readable storage medium using speech synthesis.
[0002]
[Prior art]
Currently, for example, a portable information terminal as shown in FIG. 20 is put into practical use, and various information processing is performed using this information terminal. Some portable information terminals include, for example, a communication unit, a storage unit, a voice output unit, and a voice synthesis unit. In each of these units, for example, the following “recorded data reproduction”, “read stored document reading”, Functions such as “Read New Information” are implemented.
1) “Recorded data playback” function
Sound data such as music and language teaching materials downloaded through the communication unit is stored in the storage unit and played back at any time and place.
2) “Reading stored document” function
By reading out text data such as novels stored in the data storage unit using speech synthesis (text-to-speech conversion), information can be browsed anywhere.
3) “Read new information” function
Connect to the Internet using a communication unit to obtain real-time information (text data) such as e-mails and news. Furthermore, the obtained information is read out using speech synthesis (text-to-speech conversion).
[0003]
Further, it is possible to use the following functions in combination with the functions of “recorded data reproduction”, “accumulated document reading”, and “new arrival information reading”.
4) “Reading document with recorded data as BGM” function
While playing back the recorded data, the stored documents and new arrival information (text data) are read out using speech synthesis (text-to-speech conversion).
5) "New information interrupt notification" function
When an email or new news arrives, it is read out using speech synthesis (text-to-speech conversion). Because it is audio, it is hard to get in the way even during other work. It is also possible to superimpose synthesized speech while playing music or the like.
[0004]
[Problems to be solved by the invention]
However, the conventional method described above has the following two problems.
[0005]
The first problem is that there are many operation buttons.
[0006]
For any of “Recorded data playback”, “Read out stored document”, and “Read out newly arrived information”, the user gives instructions for operations such as “Play”, “Stop”, “Fast forward”, “Rewind” during the operation. Can be performed together. However, operation buttons such as “Play”, “Stop”, “Fast forward”, and “Rewind” are provided separately for the operation buttons of “Recorded data playback”, “Read stored document”, “Read new information”, for example. In such a case, the installation location becomes large, resulting in an increase in the size of the entire information terminal, an increase in the number of parts, and an increase in manufacturing cost.
[0007]
The second problem is that when reading out using speech synthesis (text-to-speech conversion), if the “fast forward” and “rewind” processing is performed in the same way as the playback of the recorded data, “fast forward”, “rewind” "Is not convenient because the user cannot hear the content read out using speech synthesis (text-to-speech conversion).
[0008]
The present invention is to solve the above-mentioned problems of the prior art, and its purpose is, for example, “playback” during the operations of “recorded data playback”, “stored document read-out”, and “new arrival information read-out”. Provided is a portable information processing apparatus and information processing method capable of performing various operations such as “stop”, “fast forward”, “rewind” and the like, and the number of parts such as operation buttons is increased and the manufacturing cost is not increased. That is.
[0009]
Another object of the present invention is that, for example, when reading out using speech synthesis (text-to-speech conversion), even when the “fast forward” and “rewind” processing similar to the reproduction of the recorded data is performed, the user It is to provide a convenient portable information processing apparatus and information processing method that can listen to the contents read out using speech synthesis.
[0010]
[Means for Solving the Problems]
In order to achieve the above object, an information processing apparatus according to an embodiment of the present invention has the following arrangement. That is, a first control unit that causes a speech synthesizer that synthesizes text data based on a user instruction to execute at least one of output, stop, fast forward, rewind, and pause of synthesized speech, and a user instruction Based on the second control means for causing the sound reproduction means for reproducing sound from the sound data to execute at least one of output, stop, fast forward, rewind, and pause of the reproduced sound, and detection for detecting a user instruction And an instruction of the user detected by the detecting means is an output, Text data to be read out and synthesized by the speech synthesizer Determining means for determining the presence or absence of the Text data to be read out And setting means for setting first data to be released when a user instruction to stop the output of the synthesized speech is detected, and the detection means detects the user instruction. When the first data is set, the user's instruction is issued to the first control means, and when the first data is released, the reproduced sound is output. The user instruction is set in response to second data that is set when a user instruction is detected and is canceled when a user instruction to stop the output of the reproduced sound is detected. And 2 indicating means for outputting to the control means.
[0014]
Here, for example, Detected by the detection means The user instruction is fast forward, By setting the first data The indicating means is the User's When an instruction is given to the first control means, the first control means creates a summary data by extracting a predetermined portion of data from each sentence of the text data to be read out, and synthesizes the summary data by speech synthesis. It is preferable to control the voice synthesizing means so as to output the output.
[0015]
Here, for example, Detected by the detection means The user instruction is fast forward, By setting the first data The indicating means is the User When the instruction is issued to the first control means, the first control means extracts the first phrase from each sentence of the text data to be read out and sequentially synthesizes and outputs the extracted phrases. It is preferable to control the speech synthesis means.
[0016]
Here, for example, Detected by the detection means The user instruction is rewind, By setting the first data The indicating means is the User's When the instruction is given to the first control means, the first control means extracts the first phrase from each sentence of the text data to be read out and displays the extracted phrases in the reverse order to the sentence order of the text data. It is preferable to control the voice synthesizing means so as to synthesize and output the voice.
[0019]
Here, for example, the control means controls the voice synthesis means so as to output a predetermined sound corresponding to the data omitted from the text data without being synthesized by the voice synthesis means. It is preferable to do.
[0027]
In order to achieve the above object, an information processing method according to an embodiment of the present invention has the following arrangement. That is, when a detection step for detecting a user instruction and a user instruction detected by the detection step are outputs, Text data to be read out that is synthesized by speech synthesis means that synthesizes text data based on the user's instruction A determination step of determining the presence or absence of the Text data to be read out When the user's instruction to stop the output of the synthesized speech is detected, a setting process for setting first data to be canceled when the user's instruction to stop the output of the synthesized speech is detected, and the user's instruction is detected by the detection process If the first data is set, the user's instruction is Above When the first data is released to the first control means for causing the voice synthesis means to execute at least one of output, stop, fast forward, rewind and pause of the synthesized voice, In response to the second data that is set when the user's instruction for instructing the output is detected and the user's instruction for instructing the stop of the output of the reproduced sound is detected, the user's instruction is given. An instruction to be issued to the second control means for causing the sound reproduction means for reproducing sound from the sound data based on the user's instruction to execute at least one of output, stop, fast forward, rewind, and pause of the reproduced sound. And a process.
[0029]
In order to achieve the above object, a control program according to an embodiment of the present invention has the following arrangement. That is, On the computer, Detection process for detecting user instructions When When the user instruction detected by the detection step is output, Text data to be read out that is synthesized by speech synthesis means that synthesizes text data based on the user's instruction Judgment process to determine the presence or absence of When In the determination step Text data to be read out A setting step of setting first data to be canceled when a user instruction to stop the output of the synthesized speech is detected When If the first data is set when a user instruction is detected by the detection step, the user instruction is Above When the first data is released to the first control means for causing the voice synthesis means to execute at least one of output, stop, fast forward, rewind and pause of the synthesized voice, In response to the second data that is set when the user's instruction for instructing the output is detected and the user's instruction for instructing the stop of the output of the reproduced sound is detected, the user's instruction is given. An instruction to be issued to the second control means for causing the sound reproduction means for reproducing sound from the sound data based on the user's instruction to execute at least one of output, stop, fast forward, rewind, and pause of the reproduced sound. Process When The Execute It is characterized by that.
[0030]
In order to achieve the above object, a computer-readable storage medium according to an embodiment of the present invention has the following arrangement. That is, On the computer, Detection process for detecting user instructions When When the user instruction detected by the detection step is output, Text data to be read out that is synthesized by speech synthesis means that synthesizes text data based on the user's instruction Judgment process to determine the presence or absence of When In the determination step Text data to be read out A setting step of setting first data to be canceled when a user instruction to stop the output of the synthesized speech is detected When When the first data is set when a user instruction is detected by the detection step, the user instruction is output to the speech synthesizer, and the synthesized speech is output, stopped, fast-forwarded, wound. When the first data is released to the first control means for executing at least one of return and pause, and when the first data is released, it is set when the user's instruction for instructing the output of the reproduced sound is detected. The user's instruction corresponding to the second data released when the user's instruction for instructing to stop the output of the reproduced sound is detected, and the sound for reproducing the sound from the sound data based on the user's instruction Instructing step to be issued to second control means for causing reproduction means to execute at least one of output, stop, fast forward, rewind, and pause of reproduction sound When The Stored the control program to be executed It is characterized by that.
[0038]
DETAILED DESCRIPTION OF THE INVENTION
An information processing apparatus according to an embodiment of the present invention will be described below with reference to the drawings. In the present embodiment, a portable information terminal is used as an example of the information processing apparatus. However, the scope of the present invention is not limited to the description example.
[0039]
<First Embodiment>
[Hardware configuration: Fig. 1]
FIG. 1 is a diagram illustrating a hardware configuration of a portable information terminal H1000 according to the first embodiment.
[0040]
H1 is a central processing unit that performs processing such as numerical calculation and control, and performs calculation according to a control program that describes the processing procedure of the present invention. H2 is an output unit that presents information to the user. The output unit H2 includes an audio output unit H201 such as a speaker / headphone and a screen display unit H202 such as a liquid crystal.
[0041]
H3 is an input unit for the user to give an operation instruction to the information terminal H1000 or to input information. The input unit H3 includes general-purpose input units such as a play button H301, a stop button H302, a pause button H303, a fast forward button H304, a rewind button H305, and a touch panel H306.
[0042]
H4 is a data communication unit such as a LAN card or a PHS card, and is used for acquiring data such as new mail. H5 is a storage unit such as a hard disk or a nonvolatile memory, and holds recorded data and stored information. H6 is a read-only storage unit that stores control data indicating the procedure of the present invention and fixed data such as a speech synthesis dictionary.
[0043]
H7 is a storage unit that holds temporary information such as RAM, and holds temporary data, various flags, and the like. H8 is an interval timer unit that works to interrupt the central processing unit H1 after a predetermined time elapses after the timer is started. The central processing unit H1 to the timer unit H8 described above are connected by a bus.
[0044]
[Appearance of hardware configuration of information terminal: FIG. 20]
FIG. 20 is a sketch diagram showing an example of an external appearance image of the hardware configuration of the portable information terminal H1000 described in FIG.
[0045]
[Outline of each event process: Fig. 2]
The event processing in the information terminal H1000 described above will be described using the flowcharts shown in FIGS. The following processing is executed by the central processing unit H1 using the storage unit H7 that holds temporary information such as a RAM based on an event-driven control program stored in the read-only storage unit H6. Is. An input process from the input unit H3, a data request from the output unit H2, and an interrupt such as a timer interrupt are processed as the start of each event in the control program.
[0046]
In FIG. 2, first, in event acquisition step S1, a new event is acquired.
[0047]
Next, in the playback button press determination step S2, it is determined whether or not the event acquired in the event acquisition step S1 is “play button press”, and if the acquired event is “play button press”, The process moves to step S101 shown in FIG. 3, and if it is not “pressing the reproduction button”, the process moves to stop button press determination step S3.
[0048]
Next, in the stop button press determination step S3, it is determined whether or not the event acquired in the event acquisition step S1 is “stop button press”, and if the acquired event is “stop button press”, The process moves to step S201 shown in FIG. 4, and if it is not “pressing the stop button”, the process moves to pause button press determination step S4.
[0049]
Next, in the pause button pressing determination step S4, it is determined whether or not the event acquired in the event acquisition step S1 is “pressing the pause button”. The process moves to step S301, and if it is not “pressing the pause button”, the process moves to a fast-forward button pressing determination step S5.
[0050]
Next, in the fast-forward button pressing determination step S5, it is determined whether or not the event acquired in the event acquisition step S1 is “pressing the fast-forward button”. The process moves to step S401, and if it is not “pressing the fast-forward button”, the process moves to fast-forward button release determination step S6.
[0051]
Next, in the fast forward button release determination step S6, it is determined whether or not the event acquired in the event acquisition step S1 is “release of the fast forward button (operation to release the pressed button)”, and “release of the fast forward button”. If there is, the process moves to step S501 shown in FIG. 7, and if it is not “cancel fast-forward button”, the process moves to rewind button press determination step S7.
[0052]
Next, in the rewind button pressing determination step S7, it is determined whether or not the event acquired in the event acquisition step S1 is “pressing the rewind button”. The process moves to step S601 shown in FIG. 8, and if it is not “pressing the rewind button”, the process moves to rewind button release determination step S8.
[0053]
Next, in rewind button release determination step S8, it is determined whether or not the event acquired in event acquisition step S1 is “release of rewind button”. The process proceeds to step S701 shown in FIG. 9, and if it is not “cancel rewind button”, the process proceeds to new information arrival determination step S9.
[0054]
Next, in the new information arrival determination step S9, it is determined whether or not the event acquired in the event acquisition step S1 indicates that “new arrival information” has arrived. Then, the process moves to step S801 shown in FIG. 10, and if the arrival of “new arrival information” is not reached, the process moves to accumulated information reading instruction determination step S10.
[0055]
Next, in the accumulated information reading instruction determination step S10, it is determined whether or not the event acquired in the event acquisition step S1 is “an instruction to read the stored information by the user”, and is “an instruction to read the stored information by the user”. If so, the process proceeds to step S901 shown in FIG. 11, and if it is not “an instruction to read out stored information by the user”, the process proceeds to speech synthesis data request determination step S11.
[0056]
Next, in the speech synthesis data request determination step S11, it is determined whether or not the event acquired in the event acquisition step S1 is “data request from the synthesized speech output device”, and “data request from the synthesized speech output device”. If so, the process proceeds to step S1001 shown in FIG. 12, and if it is not “data request from synthesized speech output device”, the process proceeds to recording / playback data request determination step S12.
[0057]
Next, in recording / playback data request determination step S12, it is determined whether or not the event acquired in event acquisition step S1 is “data request from recording data output device”, and “data request from recording data output device”. If so, the process proceeds to step S1101 shown in FIG. 13. If it is not “data request from recording data output device”, the process proceeds to timer event determination step S13.
[0058]
Next, in the timer event determination step S13, it is determined whether or not the event acquired in the event acquisition step S1 is a notification indicating that a predetermined time has elapsed from the timer start from the timer unit H8, and the timer unit H8. If not, the process proceeds to step S1201 shown in FIG. 14. If not, the process returns to event acquisition step S1.
[0059]
["Play button press" processing: Fig. 3]
Next, the processing of each event described above will be described in detail below. First, the “play button push” process will be described with reference to FIG.
[0060]
[Reading pointer]
Reading Pointer Setting Determination (Playback) In Step S101, it is determined whether or not “Reading Pointer” is set. If “Reading Pointer” is set, the speech synthesis pause flag is released (playback) and the process proceeds to Step S106. If the “reading pointer” is not set, the priority reading sentence existence determination (reproduction) is transferred to step S102. Note that the “speech pointer” is an area that holds a position where speech synthesis is started next and speech reading is started in the middle of a priority text (text data) shown in FIG. 18, for example. The position of the “read pointer” is set as a value.
[0061]
Next, in the priority reading sentence existence determination (reproduction) step S102, it is determined whether or not “priority reading sentence exists”. If “priority reading sentence exists”, the process proceeds to the priority reading sentence initial pointer setting step S108. If no “priority reading sentence exists”, the process moves to the accumulated reading sentence existence determination step S103.
[0062]
Next, in accumulated read-out sentence existence determination step S103, it is determined whether or not “stored read-out sentence exists”. If “accumulated read-out sentence exists”, the process proceeds to accumulated read-out sentence initial pointer setting step S109. If there is no stored read-out sentence ”, reproduction pointer setting determination (reproduction) proceeds to step S104.
[0063]
[Playback pointer]
Next, in playback pointer setting determination (playback) step S104, it is determined whether or not “playback pointer is set”. If “playback pointer is set”, playback pause flag release (playback) is processed in step S111. If “playback pointer is not set”, the process moves to the recording data existence determination step S105. The “reproduction pointer” is an area for holding the position to be reproduced next, and is not set or the position of the “reproduction pointer” in the recorded data is set as a value.
[0064]
Next, in the recording data existence determination step S105, it is determined whether or not “recording data exists”. If “recording data exists”, the process proceeds to a recording data reproduction initial pointer setting step S113. If it does not exist, the process proceeds to the event acquisition step S1 in FIG.
[0065]
Next, the speech synthesis pause flag is released (reproduction). In step S106, the speech synthesis pause flag is released. The speech synthesis pause flag is a flag indicating whether speech synthesis is in a temporary state, and takes a value of “true” when it is set and “false” when it is released.
[0066]
Next, in the speech synthesis restart (playback) step S107, the speech synthesis that was temporarily stopped in step S304 in FIG. 5 is resumed, and then the process proceeds to the event acquisition step S1 in FIG. The processing in each routine of “speech synthesis start”, “speech synthesis stop”, “speech synthesis pause”, and “speech synthesis restart” will be described later with reference to FIG.
[0067]
Next, in the preferential reading sentence initial pointer setting step S108, the reading pointer is set at the head of the preferential reading sentence, and the process proceeds to the speech synthesis start step S110.
[0068]
Next, in the stored read-out sentence initial pointer setting step S109, the read-out pointer is set at the head of the stored read-out sentence, and the process proceeds to the speech synthesis start step S110.
[0069]
Next, after setting the reading pointer in the priority reading sentence initial pointer setting step S108 or the accumulated reading sentence initial pointer setting step S109, the voice synthesis is started in the voice synthesis start step S110, and then the process proceeds to the event acquisition step S1 in FIG. Move.
[0070]
Next, the reproduction pause flag is released (reproduction). In step S111, the reproduction pause flag is released. The reproduction pause flag is a flag indicating whether or not the recording data reproduction is in a temporary state.
[0071]
Next, the reproduction of the recorded data is resumed (reproduced). In step S112, the reproduction of the recorded data suspended in step S308 is resumed, and the process proceeds to the event acquisition step S1. Processes in the routines of “recorded data reproduction start”, “recorded data reproduction stop”, “recorded data reproduction pause”, and “recorded data reproduction restart” will be described later with reference to FIG.
[0072]
Next, in the recording data reproduction initial pointer setting step S113, the reproduction pointer is set to the head of the recording data, and the process proceeds to the recording data reproduction start step S114. In the recorded data reproduction start step S114, the reproduction of the recorded data is started, and then the process proceeds to the event acquisition step S1 in FIG.
[0073]
["Stop button press" processing: Fig. 4]
Next, the “stop button pressing” process will be described with reference to FIG.
[0074]
Reading Pointer Setting Determination (Stop) In Step S201, it is determined whether or not “Reading Pointer” has been set. If “Reading Pointer” has been set, the speech synthesis pause flag is released (Stop) and the process proceeds to Step S203. If the “reading pointer” is not set, the reproduction pointer setting determination (stop) is transferred to step S202.
[0075]
Next, playback pointer setting determination (stop) In step S202, it is determined whether or not "playback pointer" is set. If "playback pointer" is set, playback pause flag release (stop) is processed in step S206. If “playback pointer” is not set, the process moves to event acquisition step S1.
[0076]
Release voice synthesis pause flag (stop) In step S203, the voice synthesis pause flag is released. Next, the reading pointer is released (stopped). In step S204, the reading pointer is released (not set). Next, after speech synthesis is stopped in speech synthesis stop step S205, the process proceeds to event acquisition step S1 in FIG.
[0077]
Next, the playback pause flag is released (stopped) In step S206, the playback pause flag is released. Next, the playback pointer is released (stopped). In step S207, the playback pointer is released (not set). Next, after the reproduction of the recording data is stopped in the recording data reproduction stop step S208, the processing is shifted to the event acquisition step S1 in FIG.
[0078]
["Press the pause button" processing: Fig. 5]
Next, the process of “pressing the pause button” will be described with reference to FIG.
[0079]
First, reading pointer setting determination (pause) In step S301, it is determined whether or not “reading pointer” is set. If “reading pointer” is set, the process proceeds to speech synthesis pause flag setting determination step S302. If the “readout pointer” is not set, the reproduction pointer setting determination (pause) is transferred to step S305.
[0080]
Next, in the speech synthesis pause flag setting determination step S302, it is determined whether or not the speech synthesis pause flag is set, that is, whether or not the speech synthesis is paused. If the speech synthesis pause flag is set, The reading pointer setting determination (reproduction) of FIG. 3 is shifted to step S101. If the voice synthesis pause flag is not set, the process is shifted to voice synthesis pause flag setting step S303.
[0081]
Next, in a speech synthesis pause flag setting step S303, a speech synthesis pause flag is set (value is set to “true”). Next, after the speech synthesis is paused in the speech synthesis pause step S304, the process proceeds to the event acquisition step S1 in FIG.
[0082]
Next, playback pointer setting determination (pause) In step S305, it is determined whether or not “playback pointer” is set. If “playback pointer” is set, the process proceeds to playback pause flag setting determination step S306. If the “playback pointer” is not set, the process moves to the event acquisition step S1 in FIG.
[0083]
Next, in the playback pause flag setting determination step S306, it is determined whether or not the “playback pause flag” is set, that is, whether or not the playback of the recorded data is paused, and the “playback pause flag” is set. If so, the process proceeds to reading pointer setting determination (reproduction) step S101 in FIG. 3, and if the “reproduction pause flag” is not set, the process proceeds to reproduction pause flag setting step S307.
[0084]
Next, in a reproduction pause flag setting step S307, a reproduction pause flag is set (value is set to “true”). Next, the reproduction of the recording data is paused in the recording data reproduction pause step S308, and the processing is shifted to the event acquisition step S1 in FIG.
[0085]
["Fast-forward button pressed" processing: Fig. 6]
Next, the “pressing fast-forward button” process will be described with reference to FIG.
[0086]
Reading Pointer Setting Determination (Fast-forwarding) In step S401, it is determined whether or not “Reading Pointer” is set. If “pointer” is not set, playback pointer setting determination (fast-forward) moves to step S405.
[0087]
Next, in fast forward reading timer mode setting step S402, the timer mode is set to “fast forward reading”, and the process proceeds to fast forward event mask setting step S403. The timer mode indicates the usage of the timer.
[0088]
Next, in the fast forward event mask setting step S403, the event mask is set for fast forward processing, and the events acquired in the event acquisition step S1 are hereinafter referred to as “fast forward button release”, “voice synthesis data request”, “recording / playback”. Limited to “data request” and “timer event” only.
[0089]
Next, timer start (fast forward) In step S404, the timer is started so that a timer event occurs after a predetermined time. Then, the process proceeds to the event acquisition step S1 in FIG.
[0090]
Playback pointer setting determination (fast forward) In step S405, it is determined whether or not the playback pointer is set. If the playback pointer is set, the process proceeds to fast forward playback timer mode setting step S406, where the playback pointer is set. If not, the process proceeds to the event acquisition step S1 in FIG.
[0091]
In fast forward playback timer mode setting step S406, the timer mode is set to “fast forward playback”, and the process proceeds to fast forward event mask setting step S403.
[0092]
["Cancel fast-forward button" processing: Fig. 7]
Next, the “fast forward button release” process will be described with reference to FIG.
[0093]
First, in the event mask cancellation (fast forward) step S501, the event mask is canceled, and all events are acquired in the subsequent event acquisition step S1.
[0094]
Next, timer mode initialization / timer stop (fast forward) After the timer mode is initialized in step S502, the timer is stopped.
[0095]
Next, reading pointer setting determination (fast forward cancellation) In step S503, it is determined whether or not “reading pointer” is set. If “reading pointer” is set, reading mode determination (fast forward) processing is performed in step S504. If the “readout pointer” is not set, the process proceeds to the reproduction pointer setting determination (fast forward release) step S511.
[0096]
Reading mode determination (fast forward) In step S504, it is determined whether or not the reading mode is "fast forward". If it is "fast forward", the process proceeds to step S505 for reading mode initialization (fast forward). If not, speech synthesis is stopped (fast forward). The process proceeds to step S508.
[0097]
Next, read-out mode initialization (fast forward) In step S505, the read-out mode is returned to the initial state. Next, reading pointer return (fast forward) In step S506, the reading pointer set in the summary sentence created in step S1207 in FIG. 14 is set to a corresponding position in the original document.
[0098]
Next, after the summary sentence is discarded in the summary sentence discarding step S507, the process proceeds to the event acquisition step S1 in FIG.
[0099]
Next, speech synthesis is stopped (fast forward). In step S508, speech synthesis is stopped. Next, in the reading pointer forward skip step S509, the reading pointer is moved to the head of the sentence next to the sentence currently being read out. Next, speech synthesis is started (fast forward). In step S510, speech synthesis is started, and the process proceeds to event acquisition step S1.
[0100]
On the other hand, playback pointer setting determination (fast forward release) In step S511, it is determined whether or not “playback pointer” is set. If “playback pointer” is set, recording playback mode determination (fast forward) is processed in step S512. If “playback pointer” is not set, the process moves to the event acquisition step S1 in FIG.
[0101]
Recording / playback mode determination (fast forward) In step S512, it is determined whether or not the recording / playback mode is “fast forward”. If it is “fast forward”, the recording / playback mode initialization (fast forward) is shifted to step S513. If it is not “fast forward”, the recorded data reproduction is stopped (fast forward). The process proceeds to step S514.
[0102]
Next, in recording / playback mode initialization (fast forward), in step S513, the recording / playback mode is returned to the initial state, and then the process proceeds to event acquisition step S1 in FIG. Next, recording data reproduction stop (fast forward) In step S514, reproduction of the recording data is stopped. Next, in the reproduction pointer forward skip step S515, the reproduction pointer is advanced by one index. For example, if the recorded data is music data, the playback pointer moves to the beginning of the next song.
[0103]
Next, recording data reproduction start (fast forward) After reproduction of recording data is started in step S516, the process proceeds to event acquisition step S1 in FIG.
[0104]
["Rewind button press" processing: Fig. 8]
Next, the process of “pressing the rewind button” will be described with reference to FIG.
[0105]
First, in the reading pointer setting determination (rewinding) step S601, it is determined whether or not “the reading pointer is set”. If “the reading pointer is set”, the process proceeds to the rewind reading timer mode setting step S602. If “read pointer is not set”, the reproduction pointer setting determination (rewind) is transferred to step S605.
[0106]
Next, in rewind reading timer mode setting step S602, the timer mode is set to “rewind reading aloud”, and the process proceeds to rewind event mask setting step S603.
[0107]
Next, in the rewind event mask setting step S603, the event mask is set for rewind processing, and then the event acquired in the event acquisition step S1 in FIG. 2 is referred to as “rewind button release”, “speech synthesis”. Limited to “data request”, “recording / playback data request”, and “timer event”.
[0108]
Next, a timer is started (rewinding) in step S604, a timer event is generated after a predetermined time, and the process proceeds to event acquisition step S1 in FIG.
[0109]
Next, in playback pointer setting determination (rewind) step S605, it is determined whether or not “playback pointer” is set. If “playback pointer” is set, the process proceeds to rewind playback timer mode setting step S606. If “playback pointer” is not set, the process moves to the event acquisition step S1 in FIG.
[0110]
Next, in rewind playback timer mode setting step S606, the timer mode is set to “rewind playback”, and the process proceeds to rewind event mask setting step S603.
[0111]
["Rewind button release" processing: FIG. 9]
Next, the “cancel rewind button” process will be described with reference to FIG.
[0112]
First, in the event mask release (rewind) step S701, the event mask is released, and all events are acquired in the subsequent event acquisition step S1.
[0113]
Next, timer mode initialization / timer stop (rewind) After the timer mode is initialized in step S702, the timer is stopped.
[0114]
Next, reading pointer setting determination (rewind release) In step S703, it is determined whether or not “reading pointer” is set. If “reading pointer” is set, reading mode determination (rewinding) proceeds to step S704. If the “readout pointer” is not set, the playback pointer setting determination (rewind release) is transferred to step S711.
[0115]
Next, in the reading mode determination (rewinding) step S704, it is determined whether or not the reading mode is “rewinding”. If it is “rewinding”, the reading mode initialization (rewinding) is processed in step S705. If it is not “rewinding”, speech synthesis is stopped (rewinding). The process moves to step S708.
[0116]
Next, reading mode initialization (rewinding) In step S705, the reading mode is returned to the initial state. Next, reading pointer return (rewinding) In step S706, the reading pointer set in the first phrase list created in step S1204 in FIG. 14 is set to a corresponding position in the original document. (The information created in step S1205 is used.)
Next, after the first phrase list is discarded in the first phrase list discarding step S707, the process proceeds to the event acquisition step S1 in FIG.
[0117]
Next, speech synthesis is stopped (rewinding) In step S708, speech synthesis is stopped. Next, in the reading pointer backward skip step S709, the reading pointer is moved to the head of the sentence before the sentence currently being read out.
[0118]
Next, speech synthesis is started (rewinding). After speech synthesis is started in step S710, the process proceeds to event acquisition step S1 in FIG.
[0119]
Next, playback pointer setting determination (rewind release) In step S711, it is determined whether or not “playback pointer” is set. If “playback pointer” is set, recording / playback mode determination (rewind) step. The processing moves to S712, and if the “playback pointer” is not set, the processing moves to event acquisition step S1.
[0120]
Next, in recording / reproducing mode determination (rewinding) step S712, it is determined whether or not the recording / reproducing mode is “rewinding”. If it is “rewinding”, a recording / reproducing mode initialization (rewinding) step is performed. The process moves to S713. If it is not “rewind”, the recorded data reproduction is stopped (rewind). The process moves to step S714.
[0121]
Next, recording / reproducing mode initialization (rewinding) In step S713, the recording / reproducing mode is returned to the initial state, and the process proceeds to event acquisition step S1.
[0122]
Next, recording data reproduction stop (rewinding) In step S714, reproduction of the recording data is stopped. Next, in the reproduction pointer backward skip step S715, the reproduction pointer is returned by one index. For example, if the recorded data is music data and the playback pointer does not overlap the index, the playback pointer moves to the beginning of the current song.
[0123]
Next, recording data reproduction start (rewinding) The reproduction of the recording data is started in step S716, and then the process proceeds to the event acquisition step S1 in FIG.
[0124]
["New information arrival" processing: FIG. 10]
Next, the “new information arrival” process will be described with reference to FIG.
[0125]
First, a priority reading sentence existence determination (new arrival) In step S801, it is determined whether or not a priority reading sentence exists. If a priority reading sentence exists, the process proceeds to a new reading sentence addition step S807, and a priority reading sentence exists. If not, the process proceeds to new arrival notification message copy step S802.
[0126]
Next, in a new arrival notification message copy step S802, the new arrival notification message is copied to the head of the preferential reading sentence. An example of a new arrival notification message is shown in FIG.
[0127]
Next, in a newly read-out sentence copy step S803, the newly arrived read-out sentence is copied after the new arrival notification message in the preferential reading sentence.
[0128]
Next, reading pointer setting determination (new arrival) It is determined whether or not a reading pointer is set in step S804. If the reading pointer is set, the reading pointer backup is created (new arrival). If is not set, the process proceeds to step S101.
[0129]
Next, reading pointer backup creation (new arrival) In step S805, the current reading pointer value is held as supplementary information for the priority reading sentence.
[0130]
Next, in the newly read aloud text reading pointer setting step S806, the reading pointer is set at the head of the priority reading text, and the process proceeds to the event acquisition step S1.
[0131]
Next, in a newly read aloud sentence adding step S807, the newly read aloud sentence is copied to the end of the priority read aloud sentence, and then the process proceeds to the event acquisition step S1 in FIG.
[0132]
["Reading instruction for stored information" processing: FIG. 11]
Next, the “stored information reading instruction” process will be described with reference to FIG.
[0133]
Reading Pointer Setting Determination (Accumulation Information Reading) In step S901, it is determined whether or not “reading pointer is set”. If “reading pointer is set”, the process proceeds to reading warning display step S905. If the reading pointer is not set, the process proceeds to the accumulated reading sentence copy step S902.
[0134]
Next, in the accumulated reading sentence copy step S902, the information instructed in the accumulated information reading instruction determining step S10 is copied from the accumulated information stored in the external storage unit H5 to the accumulated reading sentence.
[0135]
Next, it is determined whether or not “priority reading sentence exists” in step S903. If “priority reading sentence exists”, the process proceeds to reading pointer backup setting step S904. If “a preferential reading sentence does not exist”, the process proceeds to the event acquisition step S1.
[0136]
Next, in reading pointer backup setting step S904, the head of the accumulated reading sentence is set as supplementary information for the priority reading sentence, and then the process proceeds to event acquisition step S1 in FIG.
[0137]
Next, in a reading warning display step S905, a warning indicating that reading is currently being performed is output, and then the process proceeds to the event acquisition step S1 in FIG.
[0138]
["Speech synthesis request instruction" processing: FIG. 12]
Next, the “speech synthesis request instruction” process will be described with reference to FIG.
[0139]
First, in synthesized speech data existence determination step S1001, it is determined whether or not “waveform data” converted from text into a speech waveform already exists. If “waveform data” does not exist, the reading pointer setting determination (voice output) moves to step S1002.
[0140]
Reading pointer setting determination (voice output) In step S1002, it is determined whether or not “reading pointer is set”. If “reading pointer is set”, the process proceeds to document data end determination step S1003. If the pointer is not set, the process proceeds to event acquisition step S1 in FIG.
[0141]
Next, in the document data end determination step S1003, it is determined whether or not “the reading pointer has reached the end of the document data”. If “the reading pointer has reached the end of the document data”, the reading pointer backup existence determination step S1008 is determined. If the “read-out pointer has not reached the end of the document data”, the process proceeds to document data extraction step S1004.
[0142]
In document data cutout step S1004, a predetermined amount (for example, one sentence) of data is cut out from the document data. Next, in synthesized speech data generation step S1005, speech synthesis processing is performed on the extracted data to obtain synthesized speech data.
[0143]
Next, in the reading pointer moving step S1006, the reading pointer is moved by the amount cut out in the document data cutting out step S1004, and the processing is shifted to the synthesized voice data copying step S1007.
[0144]
In the synthesized voice data copy step S1007, a predetermined amount of data (buffer size of the synthesized voice output device) is output from the synthesized voice data to the synthesized voice output device, and the process proceeds to the event acquisition step S1.
[0145]
In the reading pointer backup existence determination step S1008, it is determined whether or not “a reading pointer backup exists” as supplementary information of the document data. If “a reading pointer backup exists”, the reading pointer backup return step S1009 is performed. If the processing is shifted and “a backup of the reading pointer does not exist”, the processing shifts to reading pointer release step S1010.
[0146]
In reading pointer backup return step S1009, the reading pointer backup attached to the document data is set as the reading pointer, and the process proceeds to the document data end determination step S1003.
[0147]
Next, in a reading pointer release step S1010, the reading pointer is released (not set). Then, the process proceeds to the event acquisition step S1.
[0148]
Next, playback pointer setting determination (recording / playback) In step S1101, it is determined whether or not “playback pointer is set”. If “playback pointer is set”, recording / playback mode determination (rewind 2) step. The process moves to S1102, and if “reproduction pointer is not set”, the process moves to event acquisition step S1.
[0149]
["Recording / playback request instruction" processing: FIG. 13]
Next, the “recording / playback request instruction” process will be described with reference to FIG.
[0150]
First, recording / playback mode determination (rewind 2) In step S1102, it is determined whether or not the recording / playback mode is “rewind”. If it is “rewind”, the process proceeds to playback pointer head determination step S1109. If it is not “rewind”, the process proceeds to reproduction pointer end determination step S1103.
[0151]
Next, in the reproduction pointer end determination step S1103, whether or not the “reproduction pointer is the end of the recorded data (last)” is stabilized, and if “the reproduction pointer is the end of the recorded data”, the processing is performed in the reproduction pointer release step S1104. If the “playback pointer is not the end of the recorded data”, the process proceeds to the recorded data copy step S1105.
[0152]
Next, in playback pointer release step S1104, the processing is moved to event acquisition step S1 after the playback pointer is released.
[0153]
Next, in a recorded data copy step S1105, a predetermined amount of data (buffer size of the recorded data output device) is output from the recorded data to the recorded data output device, and the process proceeds to step S1106 for recording / playback mode determination (fast forward 2).
[0154]
Next, recording / playback mode determination (fast-forward 2) In step S1106, it is determined whether or not “recording / playback mode is fast-forward”. If “recording / playback mode is fast-forward”, processing proceeds to playback pointer fast-forward movement step S1107. If the “recording / playback mode is not fast forward”, the process moves to a playback pointer moving step S1108.
[0155]
Next, in the reproduction pointer fast-forward movement step S1107, the process proceeds to the event acquisition step S1 in FIG. 2 after the reproduction pointer is advanced more than the reproduction pointer output in the recording data copy step S1105 (for example, 10 times the predetermined amount).
[0156]
Next, in the reproduction pointer moving step S1108, the reproduction pointer is advanced by the amount output in the recording data copying step S1105, and the processing is moved to the event acquisition step S1.
[0157]
Next, in a reproduction pointer head determination step S1109, it is determined whether or not “the reproduction pointer points to the beginning of the recorded data”. If the “playback pointer is not the beginning of the recorded data”, the process advances to the recorded data reverse order copy step S1110.
[0158]
In the recording data reverse order copy step S1110, a predetermined amount of data (buffer size of the recording data output device) is output to the recording data output device as in the recording data copy step S1105, but the data order is output in reverse.
[0159]
Next, in the reproduction pointer rewind movement step S1111, the reproduction pointer is moved in the reverse direction at the time of reproduction, and then the processing is shifted to the event acquisition step S1 in FIG.
[0160]
["Timer event" processing: FIG. 14]
Next, the “timer event” process will be described with reference to FIG.
[0161]
First, in timer stop step S1201, the timer is stopped.
[0162]
Next, in timer mode determination (fast forward reading), it is determined in step S1202 whether or not the timer mode is “fast forward reading”. If it is “fast forward reading”, the process proceeds to summary sentence creating step S1207. If not, the timer mode determination (rewind reading) is shifted to step S1203.
[0163]
Next, in the timer mode determination (rewind reading) step S1203, it is determined whether or not the timer mode is “rewind reading”. If it is “rewind reading”, the process proceeds to the first phrase list creation step S1204. If it is not “rewind reading”, the process proceeds to timer mode determination (fast forward reproduction) step S1210.
[0164]
In a first phrase list creation step S1204, a list of the first phrase of each sentence existing from the beginning of the document indicated by the reading pointer to the reading pointer is created. An example is shown in FIG. In FIG. 18, (a) is the original document, and (b) is an image of the created phrase list. The position of the reading pointer is set so that the reading pointer is positioned at the end of the read-out document. When the document is read out, the position of the reading-out pointer moves in synchronization with the reading-out.
[0165]
Next, in the rewind reading pointer backup creation step S1205, a corresponding point to which the reading pointer is moved when returning from the rewind mode is created. In FIG. 18, the relationship between the arrow connecting the first phrase list and the original document is the corresponding point.
[0166]
Next, in rewind reading mode setting step S1206, the reading mode is set to rewind, and then the process proceeds to event acquisition step S1 in FIG.
[0167]
Next, in a summary sentence creation step S1207, a summary sentence from the location indicated by the reading pointer to the end of the document is created. An example is shown in FIG. In FIG. 19, (a) is an original document and (b) is an image of a summary sentence created. The position of the reading pointer is set so that the position of the reading pointer comes to the end of the read-out document (that is, the head of the unread portion). When the document is read out, the position of the reading pointer is synchronized with the reading-out. Moving.
[0168]
Next, in the fast forward reading pointer backup creation step S1208, a corresponding point to which the reading pointer is moved when returning from the fast forward mode is created. In FIG. 19, the relationship between the arrows connecting the summary sentence and the original document is the corresponding point. However, in FIG. 19, not all are shown in order to avoid complexity.
[0169]
Next, in fast forward reading mode setting step S1209, the reading mode is set to fast forward, and then the process proceeds to event acquisition step S1 in FIG.
[0170]
Next, in the timer mode determination (fast forward playback) step S1210, it is determined whether or not the timer mode is “fast forward playback”. If it is “fast forward playback”, the process proceeds to the fast forward recording / playback mode setting step S1211. If it is not “fast forward playback”, the process proceeds to rewind recording / playback mode setting step S1212.
[0171]
Next, in fast-forward recording / playback mode setting step S1211, the recording / playback mode is set to fast-forward, and the process proceeds to event acquisition step S1.
[0172]
Next, in rewind recording / playback mode setting step S1212, the recording / playback mode is set to rewind, and then the process proceeds to event acquisition step S1 in FIG.
[0173]
[Each process of “speech synthesis”: FIGS. 15A to 15D]
Next, each process of “speech synthesis” will be described with reference to FIGS.
[0174]
FIGS. 15A to 15D show processing in each routine of “speech synthesis start”, “speech synthesis stop”, “speech synthesis pause”, and “speech synthesis restart”.
[0175]
First, in the synthesized voice output device setting step S1301, the synthesized voice output device is initially set up such as setting the sampling rate.
[0176]
Next, in the synthesized speech output device activation step S1302, the synthesized speech output device is activated and the synthesized speech output operation is started.
[0177]
Next, in the synthesized voice data clear step S1303, the synthesized voice data generated and held in the synthesized voice data generating step S1005 is deleted.
[0178]
Next, in the synthesized speech output device stop step S1304, the synthesized speech output device is stopped.
[0179]
Next, in the synthesized voice output device pause step S1305, the synthesized voice output device is paused.
[0180]
Next, in the synthesized voice output device restart step S1306, the operation of the synthesized voice output device suspended in the synthesized voice output device pause step S1305 is resumed.
[0181]
[Each process of “recorded data reproduction”: FIGS. 16A to 16D]
Next, each process of “recorded data reproduction” will be described with reference to FIGS. FIGS. 16A to 16D show processing in each routine of “recorded data playback start”, “recorded data playback stop”, “recorded data playback pause”, and “recorded data playback restart”.
[0182]
First, in a recording data output device setting step S1401, initial setting of the recording data output device such as setting of a sampling rate is performed.
[0183]
Next, in a recording data output device activation step S1402, the recording data output device is activated and the recording data output operation is started.
[0184]
Next, in the recording data output device stop step S1403, the recording data output device is stopped.
[0185]
Next, in a recording data output device pause step S1404, the recording data output device is paused.
[0186]
Next, in the recording data output device restart step S1405, the operation of the recording data output device suspended in the recording data output device pause step S1404 is resumed.
[0187]
The first embodiment described above is an example. For example, in the first phrase list creation step S1204, the first phrase list is a list composed only of the first phrase, but this is not necessarily the first phrase. There may be, for example, a plurality of clauses set by the user.
[0188]
The example of the summary sentence shown in the summary sentence creation step S1207 is an example in which the main part of each sentence is extracted, but it is not always necessary to summarize each sentence. For example, all sentences with little information are omitted. You may do it.
[0189]
Also, instead of the summary sentence creation step S1207, in the case of fast-forwarding, a head phrase list is created as shown in FIG. 28, and from the beginning “below” to “H4” is read sequentially from the head of the created head phrase list. You may do it.
[0190]
In the case of using a summary in the case of rewinding, the summary shown in FIG. 29 may be used.
[0191]
In addition, an acoustic output such as a beep sound indicating that a portion of the text data that has not been read out by speech synthesis is omitted may be output.
[0192]
Further, both the first phrase list creation step S1204 and the summary sentence creation step S1207 are executed after the rewind / fast forward button release event is acquired, but the newly read aloud sentence copy step S803, the newly read aloud sentence addition step S807, and It may be executed after each step of accumulated read-out sentence copy step S902. This shortens the response time from release of the rewind / fast forward button.
[0193]
<Second Embodiment>
Next, a second embodiment will be described.
[0194]
[Hardware configuration: Fig. 21]
FIG. 21 is a diagram illustrating a hardware configuration of a portable information terminal H1200 according to the second embodiment.
[0195]
H11 is a central processing unit that performs processing such as numerical calculation and control, and performs calculation according to a control program that describes the processing procedure of the present invention. H12 is an output unit that presents information to the user. The output unit H12 includes an audio output unit H1201 such as a speaker / headphone and a screen display unit H1202 such as a liquid crystal.
[0196]
H13 is an input unit in which a user gives an operation instruction to the information terminal H1200 or inputs information. H14 is a data communication unit such as a LAN card or a PHS card, and is used to acquire data such as new mail. H15 is a storage unit such as a hard disk or a non-volatile memory, and holds recorded data and stored information.
[0197]
H16 is a read-only storage unit, and stores fixed data such as a control program indicating the procedure of the present invention and a dictionary for speech synthesis. H17 is a storage unit for holding temporary information such as RAM, and holds temporary data, various flags, and the like.
[0198]
H18 is an angle detection unit that outputs a value corresponding to the angle, and detects an operation amount of the dial unit H19. H19 is a dial section that can be operated by the user, and is connected to the angle detection section H18. The central processing unit H1 to the angle detection unit H18 are connected by a bus.
[0199]
[Appearance of information terminal hardware configuration: FIG. 27]
FIG. 27 is a sketch diagram showing an example of an external appearance image of the hardware configuration of the portable information terminal H1200 described in FIG.
[0200]
[Outline of event processing: FIG. 22]
The event processing in the information terminal H1200 according to the second embodiment described above will be described with reference to the flowcharts shown in FIGS. The following processing is executed by the central processing unit H11 using a storage unit H17 that holds temporary information such as a RAM, based on an event-driven control program stored in a read-only storage unit H16 or the like. Is. An input process from the input unit H13, a data request from the output unit H12, and an interrupt such as a timer interrupt are processed as the start of each event in the control program.
[0201]
First, in variable initial setting step S1501 in FIG. 22, each variable is set to an initial value.
[0202]
Next, in step S1502, the voice synthesis device is paused in a voice synthesis device start / pause step S1502.
[0203]
Next, in event acquisition step S1503, a new event is acquired.
[0204]
Next, in the dial angle change determination step S1504, it is determined whether or not the event acquired in the event acquisition step S1503 is due to “dial angle change”. If it is “dial angle change”, the process proceeds to step S1601. If it is not “change dial angle”, the process proceeds to speech synthesis data request determination step S1505.
[0205]
Next, in speech synthesis request determination step S1505, it is determined whether or not the event acquired in event acquisition step S1503 is “data request from synthesized speech output device”, and “data request from synthesized speech output device”. If there is, the process moves to step S1701, and if it is not “data request from synthesized speech output device”, the process returns to event acquisition step S1503.
[0206]
["Dial angle change" processing: FIG. 23]
Next, the processing of each event described above will be described in detail below.
[0207]
First, the “dial angle change” process will be described with reference to FIG.
[0208]
In the new dial angle determination step S1601, it is determined whether or not the new dial angle is “0”. If it is “0”, the process proceeds to the synthesized speech output device pause step S1605, and the new dial angle is determined. If it is not “0”, the process proceeds to dial angle variable determination step S1602.
[0209]
In the dial angle variable determination step S1602, it is determined whether or not the previous dial angle held in the dial angle variable is “0”. If the previous dial angle held in the dial angle variable is “0”, a synthesized voice is output. The process moves to device restart step S1606, and if the previous dial angle held in the dial angle variable is not “0”, the process moves to dial angle variable update step S1603.
[0210]
In the dial angle variable update step S1603, a new dial angle is substituted for the dial angle variable.
[0211]
Next, in a reading skip number setting step S1604, a reading skip number is set according to the value of the dial angle variable. The reading skip number is set so that the larger the absolute value of the dial angle, the larger the absolute value of the skip number, and the same sign of the dial angle and the number of skip numbers. FIG. 25 shows an example of a correspondence table between the dial angle (unit angle is θ) and the number of skips. After setting the skip number, the process proceeds to event acquisition step S1503.
[0212]
Next, in the synthesized speech output device pause step S1605, the speech synthesized output device is paused, and the process proceeds to the event acquisition step S1503.
[0213]
Next, in the synthesized voice output device restart step S1606, the synthesized voice output device paused in the synthesized voice output device pause step S1605 is resumed, and the process proceeds to the dial angle variable update step S1603.
[0214]
[“Speech synthesis instruction” processing: FIG. 24]
Next, the “speech synthesis instruction” process will be described with reference to FIG.
[0215]
First, in synthesized speech data end determination step S1701, it is determined whether or not “the phrase counter is equal to the number of phrases”. If “the phrase counter is equal to the number of phrases”, the process proceeds to document data extraction step S1709. If the counter is not equal to the number of phrases, the process proceeds to dial angle absolute value determination step S1702. The number of phrases is the number of phrases included in the sentence subjected to the immediately preceding synthesized speech data generation step S1710. The state in which the phrase counter is equal to the number of phrases is obtained by combining the synthesized speech data obtained in step S1710. Indicates that all output has been completed.
[0216]
Next, in the dial angle absolute value determination step S1702, it is determined whether the absolute value of the dial angle held in the dial angle variable is larger than “1”. If the absolute value of the dial angle is larger than “1”, the text to be read out is read. The processing moves to update step S1717, and if the absolute value of the dial angle is not larger than “1”, the processing moves to reading pointer determination step S1703.
[0217]
Next, in a reading pointer determination step S1703, it is determined whether or not “the reading pointer is equal to the reading target sentence”. If “the reading pointer is equal to the reading target sentence”, the process proceeds to the phrase counter determination step S1704. If “the reading pointer is not equal to the reading target sentence”, the process proceeds to the speech synthesis device stop step S1705.
[0218]
Next, in the phrase counter determination step S1704, it is determined whether the phrase counter is “0”. If the phrase counter is “0”, the process proceeds to the reading target sentence update step S1717. If the phrase counter is not “0”, speech synthesis is performed. The processing moves to device stop step S1705.
[0219]
Next, in the speech synthesis device stop step S1705, the speech synthesis device is stopped. Next, a beep sound is output in a beep sound output step S1706. Next, speech synthesis device start (2) In step S1707, the speech synthesis device is started.
[0220]
Next, “1” is added to the phrase counter in the phrase counter update step S1708, and the process proceeds to the event acquisition step S1503.
[0221]
Next, in document data cutout step S1709, one sentence of data is cut out from the reading target document with the reading pointer at the head.
[0222]
Next, in synthesized speech data generation step S1710, speech synthesis is performed on the sentence extracted in document data extraction step S1709 to obtain synthesized speech data.
[0223]
Next, in the phrase number calculation step S1711, the number of phrases included in the sentence extracted in the document data extraction step S1709 is calculated.
[0224]
Next, in a synchronization point generation step S1712, the correspondence between the synthesized speech generated in the synthesized speech data generation step S1710 and each phrase included in the sentence extracted in the document data extraction step S1709 is obtained and held as a synchronization point. An example of the synchronization point is shown in FIG.
[0225]
Next, in the phrase counter initialization step S1713, the phrase counter is set to “0”.
[0226]
Next, in the dial angle positive / negative determination step S1714, it is determined whether or not the dial angle held in the dial angle variable is “positive”. If “positive”, the process proceeds to the reading pointer increment step S1715, and is not “positive”. If so, the process proceeds to reading pointer decrement step S1716.
[0227]
Next, in a reading pointer increment step S1715, 1 is added to the reading pointer, and the process proceeds to the dial angle absolute value determination step S1702.
[0228]
Next, in the reading pointer decrement step S1716, 1 is subtracted from the reading pointer, and the process proceeds to the dial angle absolute value determination step S1702.
[0229]
Next, in the reading target sentence update step S1717, the reading target sentence is set to the sum of the reading pointer and the skip count set in the read skip count setting step S1604.
[0230]
Next, in the synthesized speech data copy step S1718, the synthesized speech generated in the synthesized speech data generation step S1005 is copied to the buffer of the one-speech speech synthesizer device. The range to be copied is one phrase from the synchronization point corresponding to the current phrase counter. After data copying, the process proceeds to clause counter update step S1708.
[0231]
Note that the second embodiment described above is an example. For example, in the reading skip number setting step S1604, the reading skip number is set to a fixed number of sentences according to the value of the dial angle variable, but when the dial angle is large. It is also possible to skip to the next paragraph. This can be realized by counting the number of sentences from the reading pointer to the first sentence of the next paragraph. It is also possible to skip one or more phrases when the dial angle is small.
[0232]
Furthermore, in the second embodiment, the number of beeps during fast forward / rewind is the same as the number of skipped phrases, but it is not necessarily the same. In the second embodiment, fast forward / rewind is represented by a single beep sound, but different beep sounds or different signals may be produced according to the type of fast forward / rewind or the dial angle.
[0233]
Furthermore, it is also possible to apply fast-forwarding based on the summary used in the first embodiment to the second embodiment. In this case, the compression ratio of the summary can be changed in accordance with the skip number set in the reading skip number setting step S1604.
[0234]
[Other Embodiments]
Note that the present invention can be applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, and a printer), and a device (for example, a copying machine and a facsimile device) including a single device. You may apply to.
[0235]
Another object of the present invention is to supply a storage medium (or recording medium) in which a program code of software that realizes the functions of the above-described embodiments is recorded to a system or apparatus, and the computer (or CPU or CPU) of the system or apparatus. Needless to say, this can also be achieved by the MPU) reading and executing the program code stored in the storage medium. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an operating system (OS) running on the computer based on the instruction of the program code. It goes without saying that a case where the function of the above-described embodiment is realized by performing part or all of the actual processing and the processing is included.
[0236]
Furthermore, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the function is determined based on the instruction of the program code. It goes without saying that the CPU or the like provided in the expansion card or the function expansion unit performs part or all of the actual processing and the functions of the above-described embodiments are realized by the processing.
[0237]
When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the flowcharts described above (shown in FIGS. 2 to 16 and 22 to 24).
[0238]
As described above, according to the present invention, it becomes possible for a user to give an instruction with few operation buttons, and as a result, the size of the entire terminal is reduced. In addition, the number of parts is reduced, resulting in a manufacturing cost. In addition, it is possible to provide information required by the user during fast forward / rewind, and convenience is improved.
[0239]
【The invention's effect】
As described above, according to the present invention, when various information processing is performed, an increase in manufacturing cost due to an increase in the number of parts such as operation buttons is suppressed. Multiple information processing targets can be transferred to each other using the operation buttons. It is possible to provide a portable information terminal that can perform the above-described various types of information processing with convenience.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a hardware configuration according to a first embodiment of the present invention.
FIG. 2 is a flowchart illustrating an entire event process according to the first embodiment of the present invention.
FIG. 3 is a flowchart illustrating processing when a playback button is pressed.
FIG. 4 is a flowchart illustrating processing when a stop button is pressed.
FIG. 5 is a flowchart for explaining processing when a pause button is pressed.
FIG. 6 is a flowchart illustrating processing when a fast-forward button is pressed.
FIG. 7 is a flowchart for explaining processing when a fast-forward button is released.
FIG. 8 is a flowchart for explaining processing when a rewind button is pressed.
FIG. 9 is a flowchart illustrating processing when a rewind button is released.
FIG. 10 is a flowchart illustrating processing when new information arrives.
FIG. 11 is a flowchart for explaining processing when an instruction to read out stored information is given.
FIG. 12 is a flowchart for explaining processing when speech synthesis is instructed.
FIG. 13 is a flowchart for explaining processing when recording / reproduction is instructed.
FIG. 14 is a flowchart illustrating timer event processing.
FIG. 15A is a flowchart for describing speech synthesis start processing;
FIG. 15B is a flowchart for explaining speech synthesis stop processing;
FIG. 15C is a flowchart illustrating speech synthesis pause processing.
FIG. 15D is a flowchart illustrating speech synthesis resumption processing.
FIG. 16A is a flowchart illustrating a recorded data reproduction start process.
FIG. 16B is a flowchart illustrating a recorded data reproduction stop process.
FIG. 16C is a flowchart illustrating a recorded data reproduction pause process.
FIG. 16D is a flowchart for explaining a recording data reproduction restart process;
FIG. 17 is a diagram illustrating an example of a new arrival notification message.
FIG. 18 is a diagram illustrating an example of an image of a head phrase list.
FIG. 19 is a diagram for explaining an example of an image of a summary sentence.
FIG. 20 is a sketch diagram showing an image of a hardware configuration according to the first embodiment of the present invention.
FIG. 21 is a block diagram showing a hardware configuration according to a second embodiment of the present invention.
FIG. 22 is a flowchart illustrating an entire event process according to the second embodiment of the present invention.
FIG. 23 is a flowchart illustrating processing when a dial angle is changed.
FIG. 24 is a flowchart illustrating processing when speech synthesis is requested.
FIG. 25 is a diagram for explaining the correspondence between the dial angle and the number of reading skips.
FIG. 26 is a diagram illustrating an example of a synchronization point.
FIG. 27 is a sketch diagram showing an image of the hardware configuration of the second embodiment according to the present invention;
FIG. 28 is a diagram illustrating an example of an image of a head phrase list in the case of fast forward.
FIG. 29 is a diagram illustrating an example of a summary during rewinding.
[Explanation of symbols]
H1 Central processing unit for numerical computation and control processing
Output unit for presenting information to H2 users
H201 Audio output unit (speakers, headphones, etc.)
H202 Screen display (liquid crystal etc.)
H3 input unit (operation instruction / information input by user)
H301 Playback button (input unit)
H302 Stop button (input unit)
H303 Pause button (input section)
H304 Fast forward button (input part)
H305 Rewind button (input unit)
H306 touch panel
H4 data communication unit (acquisition of LAN card, PHS card, new mail, etc.)
H5 storage (Hard disk, non-volatile memory, recording data retention)
H6 read-only memory (holding ROM, program, speech synthesis dictionary, etc.)
H7 Storage unit for holding temporary information (RAM and other temporary data)
H8 interval timer section (interrupt processing to central processing section)

Claims

First control means for causing speech synthesis means for synthesizing text data based on a user's instruction to execute at least one of output, stop, fast forward, rewind, and pause of synthesized speech;
Second control means for causing a sound reproduction means for reproducing sound from sound data to execute at least one of reproduction sound output, stop, fast forward, rewind, and pause based on a user instruction;
Detecting means for detecting a user instruction;
A determination unit that determines whether or not there is text data to be read out to be synthesized by the speech synthesis unit when the user instruction detected by the detection unit is an output;
Setting means for setting first data to be canceled when an instruction of a user instructing to stop the output of the synthesized speech is detected when it is determined by the determination means that there is text data to be read out ;
If the first data is set when a user instruction is detected by the detection means, the user instruction is issued to the first control means, and the first data is released. Corresponding to the second data that is set when the user's instruction for instructing the output of the reproduced sound is detected and is canceled when the user's instruction for instructing the stop of the output of the reproduced sound is detected. And an instruction means for issuing an instruction of the user to the second control means.

When the user instruction detected by the detection means is fast forward and the first data is set, the instruction means issues the user instruction to the first control means. 1 control means extracts the data of a predetermined part from each sentence of the text data to be read out, creates summary data, and controls the speech synthesis means to synthesize and output the summary data. The information processing apparatus according to claim 1.

When the user instruction detected by the detection means is fast forward and the first data is set, the instruction means issues the user instruction to the first control means. 2. The control unit according to claim 1, wherein the control unit controls the speech synthesis unit so as to extract the first phrase from each sentence of the text data to be read out and sequentially synthesize and output the extracted phrase. The information processing apparatus described.

When the user instruction detected by the detection means is rewind and the instruction means issues the user instruction to the first control means because the first data is set , The first control means is configured to extract the first phrase from each sentence of the text data to be read out and output the extracted phrase by synthesizing and outputting the extracted phrase in the reverse order of the sentence sequence of the text data. The information processing apparatus according to claim 2, wherein the information processing apparatus is controlled.

The first control means controls the voice synthesizing means so as to output a predetermined sound corresponding to data omitted from the text data without being synthesized by the voice synthesizing means. The information processing apparatus according to any one of claims 1 to 4, wherein the information processing apparatus is characterized.

A detection step of detecting a user instruction;
A determination step of determining whether or not there is text data to be read out by a speech synthesizer that performs speech synthesis of text data based on the user instruction when the user instruction detected by the detection step is an output. When,
A setting step of setting first data to be canceled when a user instruction to stop the output of the synthesized speech is detected when it is determined by the determination step that text data to be read out exists;
If the first data is set when a user instruction is detected in the detection step, the user instruction is output to the speech synthesizer to output, stop, fast forward, and rewind the synthesized speech. When the first data is released to the first control means for executing at least one of the pauses, and the first data is released, it is set when the user's instruction for instructing the output of the reproduction sound is detected. Audio reproduction that reproduces audio from audio data based on the user instruction corresponding to the second data to be released when a user instruction to stop the output of reproduced audio is detected. An instruction step to be issued to the second control means for causing the means to execute at least one of output, stop, fast forward, rewind, and pause of the reproduced sound;
An information processing method characterized by comprising:

On the computer,
A detection step of detecting a user's instruction,
A determination step of determining whether or not there is text data to be read out by a speech synthesizer that performs speech synthesis of text data based on the user instruction when the user instruction detected by the detection step is an output. And
When the text data of the object read by said determining step is determined to exist, a setting step of instructing the user to instruct the stop of output of the synthesized speech is to set the first data to be released when it is detected,
If the first data is set when a user instruction is detected by the detection step, the user instruction is output to the speech synthesizer to output, stop, fast forward, and rewind the synthesized speech. When the first data is released to the first control means for executing at least one of the pauses, and the first data is released, it is set when the user's instruction for instructing the output of the reproduction sound is detected. Audio reproduction that reproduces audio from audio data based on the user instruction corresponding to the second data to be released when a user instruction to stop the output of reproduced audio is detected. the means, and an instruction step of issuing the playback audio output, stop, fast forward, rewind, second control means for performing at least one of pause,
A control program to execute .

On the computer,
A detection step of detecting a user's instruction,
A determination step of determining whether or not there is text data to be read out by a speech synthesizer that performs speech synthesis of text data based on the user instruction when the user instruction detected by the detection step is an output. And
When the text data of the object read by said determining step is determined to exist, a setting step of instructing the user to instruct the stop of output of the synthesized speech is to set the first data to be released when it is detected,
If the first data is set when a user instruction is detected by the detection step, the user instruction is output to the speech synthesizer to output, stop, fast forward, and rewind the synthesized speech. When the first data is released to the first control means for executing at least one of the pauses, and the first data is released, it is set when the user's instruction for instructing the output of the reproduction sound is detected. Audio reproduction that reproduces audio from audio data based on the user instruction corresponding to the second data to be released when a user instruction to stop the output of reproduced audio is detected. the means, and an instruction step of issuing the playback audio output, stop, fast forward, rewind, second control means for performing at least one of pause,
The computer-readable storage medium which stored the control program for performing .