JPH0511796A

JPH0511796A - Voice recognizer

Info

Publication number: JPH0511796A
Application number: JP3162889A
Authority: JP
Inventors: Kikumi Kaburagi; 喜久美鏑木
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1991-07-03
Filing date: 1991-07-03
Publication date: 1993-01-22

Abstract

(57)【要約】【目的】入力音声の単位に左右されることなく、様々
な音声入力単位に対して安定な音声認識処理を行うこと
が出来る音声認識装置を実現することである。【構成】音声認識装置は入力された音声に対し、音声
入力単位識別情報を基に、音声入力単位識別部１２にお
いて音声入力単位を識別し、その音声入力単位識別結果
に合った辞書情報を選択し音声認識処理に用いて音声認
識処理を行う。 (57) [Abstract] [Purpose] To realize a voice recognition device capable of performing stable voice recognition processing for various voice input units without being influenced by the unit of input voice. A voice recognition device identifies a voice input unit in a voice input unit identification unit 12 based on voice input unit identification information for an input voice, and selects dictionary information suitable for the voice input unit identification result. Then, the voice recognition process is performed using the voice recognition process.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】音声認識装置に係わる。[Industrial application] The present invention relates to a voice recognition device.

【０００２】[0002]

【従来の技術】従来の音声認識装置について図３を用い
て説明する。2. Description of the Related Art A conventional voice recognition device will be described with reference to FIG.

【０００３】従来の音声認識装置では、音響分析部１に
おいて入力された音声の分析を行い、特徴を出力する。
音響分析部１からの出力に基づいて、音声認識部２にお
いて音声認識用辞書とマッチングすることによって入力
音声の認識を行う。音声認識部２にて認識された結果は
操作者が確認できるように出力される。入力音声を分析
した特徴抽出の結果は、記憶部３に記憶される。In the conventional voice recognition device, the voice input by the acoustic analysis unit 1 is analyzed and the characteristics are output.
Based on the output from the acoustic analysis unit 1, the voice recognition unit 2 matches the voice recognition dictionary to recognize the input voice. The result recognized by the voice recognition unit 2 is output so that the operator can confirm it. The result of the feature extraction that analyzes the input voice is stored in the storage unit 3.

【０００４】[0004]

【発明が解決しようとする課題】音声による入力は、キ
ー入力操作をすることなくデータ入力を行うことがで
き、キー入力装置のキー配置位置、キー操作方法等を知
る必要がなく、誰でもが簡便に使用できる入力方法であ
る。しかし、音声による入力方法は、キー操作による入
力方法と異なり、操作者が正確に発話をしても誤って認
識される場合がある。In voice input, data input can be performed without key input operation, and there is no need to know the key arrangement position of the key input device, the key operation method, etc. This is a convenient input method. However, the voice input method is different from the key input method, and may be erroneously recognized even if the operator speaks accurately.

【０００５】音声認識装置における音声認識誤りは認識
の対象となる発話の文法的な単位が「音節」、「単
語」、「文節」、「文章」と複雑になるにつれて多くな
る。また、その認識装置の使用環境における背景雑音の
レベルが上がるにつれても多くなる。Speech recognition errors in the speech recognition apparatus increase as the grammatical unit of speech to be recognized becomes complicated, such as "syllable", "word", "bunsetsu", and "sentence". Further, the background noise level increases in the environment in which the recognition device is used.

【０００６】音声認識単位が「音節」、「単語」である
場合は、認識処理は単なるパタンマッチング処理である
と考えられる。一方、音声認識単位が「文節」、「文
章」となると、これは単なるパタンマッチングであると
考える訳にはいかなくなる。When the voice recognition unit is "syllable" or "word", the recognition process is considered to be a simple pattern matching process. On the other hand, when the speech recognition unit is "bunsetsu" or "sentence", this cannot be considered as merely pattern matching.

【０００７】音声認識装置に「文節」、「文章」等の文
法的な単位で音声入力を行った場合に、音声認識率の低
下の原因を招く一つの原因として助詞の問題がある。単
純な音素、音節や単語単位での音声認識率は高いのだ
が、文章あるいは文節単位での入力を受け付けた場合
に、極端に音声認識率が下がってしまうような場合は、
正に助詞の取り扱いが原因であると思われる。助詞は他
の品詞に比べ、はっきりと明快に発音されることは少な
く、助詞にアクセントが付けられることはない。このた
め、単語単位での音声入力に対する音声認識処理とは異
なる助詞に対する意味的、文法的な知識が有効になる。When a voice is input to a voice recognition device in a grammatical unit such as "bunsetsu" or "sentence", there is a problem of a particle as one of the causes of a decrease in the voice recognition rate. Although the voice recognition rate for simple phonemes, syllables, and words is high, if the voice recognition rate is extremely reduced when input is accepted for sentences or syllables,
It seems that the reason is the handling of particles. Particles are less pronounced clearly and clearly than other parts of speech, and particles are not accented. For this reason, semantic and grammatical knowledge for particles, which is different from the voice recognition processing for voice input in word units, is effective.

【０００８】文章単位の入力に対して音声認識率が低下
する他の原因としては、文章単位での入力音声に対し
て、文法的知識や文の意味的知識を用いて処理していな
い点が挙げられる。Another cause of the decrease in the voice recognition rate with respect to sentence-based input is that input speech in sentence units is not processed using grammatical knowledge or sentence semantic knowledge. Can be mentioned.

【０００９】このように認識単位が「文節」、「文章」
である場合には、より複雑な認識処理が要求される。ま
た、背景雑音のレベルが上昇すると誤認識率も急激に上
昇する。しかし、音声認識のための条件が良く、「文
節」、「文章」を認識単位としても充分な認識が得られ
る場合には、「文節」、「文章」を認識単位とした音声
認識装置は「音節」、「単語」を認識単位としたものよ
りも格段に使いやすいものである。As described above, the recognition units are “bunsetsu” and “sentence”.
Then a more complex recognition process is required. Also, as the background noise level rises, the false recognition rate also rises sharply. However, if the conditions for speech recognition are good and sufficient recognition can be obtained even with “bunsetsu” and “sentence” as recognition units, a speech recognition device using “bunsetsu” and “sentence” as recognition units It is much easier to use than the one using syllables and words as recognition units.

【００１０】従来技術を用いた音声認識装置において
は、入力音声の文法的な単位が「音節」、「単語」、
「文節」、「文章」等と変化しても、音声入力単位に基
づく固有の情報を利用することなく、音声の音響的な特
徴のみによって音声認識処理が行われていた。また、そ
のような音声認識単位は、その音声認識装置の使用にお
いて固定されたものであり、環境に応じて変化させられ
るものではなかった。そのため、従来技術を用いた音声
認識装置では、音声入力単位に依存して音声認識率が大
きく変動したり、操作者が音声を入力した単位と異なっ
た単位での入力として音声認識をする場合があり、きわ
めて奇怪な音声認識結果を出力することがあった。In the speech recognition apparatus using the prior art, the grammatical unit of the input speech is "syllable", "word",
Even if it changes to "bunsetsu", "sentence", etc., the voice recognition processing is performed only by the acoustic characteristics of the voice without using the unique information based on the voice input unit. Moreover, such a voice recognition unit is fixed in use of the voice recognition device, and cannot be changed according to the environment. Therefore, in the voice recognition device using the conventional technology, the voice recognition rate may greatly change depending on the voice input unit, or the voice recognition may be performed as an input in a unit different from the unit in which the operator inputs the voice. There was a case where a very strange voice recognition result was output.

【００１１】[0011]

【課題を解決するための手段】本発明の音声認識装置
は、入力された音声の特徴を出力する音響分析部と、前
記入力された音声の文法的な単位を判断する音声入力単
位識別部と、前記音声入力単位識別部の出力を手がかり
に前記音響分析部の出力を符号列に変換する音声認識部
と、前記音響分析部の出力を記憶する音響分析記憶部
と、前記音声認識部の出力を記憶する音声認識記憶部と
からなることを特徴とする。A speech recognition apparatus according to the present invention comprises an acoustic analysis section for outputting the characteristics of an input speech, and a speech input unit identification section for determining a grammatical unit of the input speech. A voice recognition unit that converts the output of the acoustic analysis unit into a code string based on the output of the voice input unit identification unit; an acoustic analysis storage unit that stores the output of the acoustic analysis unit; and an output of the voice recognition unit And a voice recognition storage unit for storing

【００１２】[0012]

【実施例】以下、本発明について実施例に基づいて詳細
に説明する。EXAMPLES The present invention will be described in detail below based on examples.

【００１３】（実施例１）図１は本発明の音声認識装置
の原理ブロック図である。(Embodiment 1) FIG. 1 is a block diagram showing the principle of a voice recognition apparatus according to the present invention.

【００１４】図２は本発明の音声認識装置のブロック図
である。FIG. 2 is a block diagram of the voice recognition apparatus of the present invention.

【００１５】人間が発話出来る音声の文法的な単位とし
ては、「音節」、「単語」、「文節」、「文章」等が考
えられる。本発明において図１音声入力単位識別部１２
において識別を行っている単位とは、これらの文法的な
単位を示している。As grammatical units of speech that can be spoken by humans, "syllable", "word", "bunsetsu", "sentence", etc. can be considered. In the present invention, FIG.
The unit which is identified in (1) means these grammatical units.

【００１６】入力された音声は、図１音響分析部１１の
構成要素であるマイク、高域強調フィルタ、ＡＤ変換器
より構成される図２音声入力部２１によってデジタル信
号としてサンプリングされる。更に同じく図１音響分析
部１１の構成要素である図２特徴抽出回路２２におい
て、デジタル信号に変換された音声信号を周波数変換
し、周波数領域での特徴パラメータを抽出し、発声され
た単語の特徴パラメータ列として表される。図２特徴抽
出回路２２で抽出された入力音声の特徴パラメータ列
は、図２特徴パラメータ列記憶回路２３に記憶される。The input voice is sampled as a digital signal by the voice input unit 21 shown in FIG. 2, which includes a microphone, a high-frequency emphasis filter, and an AD converter, which are components of the acoustic analysis unit 11 shown in FIG. Further, in the feature extraction circuit 22 of FIG. 2 which is also a component of the acoustic analysis unit 11 of FIG. 1, the voice signal converted into a digital signal is frequency-converted, the feature parameters in the frequency domain are extracted, and the feature of the uttered word is extracted. It is represented as a parameter string. The characteristic parameter sequence of the input voice extracted by the characteristic extraction circuit 22 of FIG. 2 is stored in the characteristic parameter sequence storage circuit 23 of FIG.

【００１７】特徴パラメータに変換された入力音声は、
図２マッチング回路２５において図２音声入力単位識別
情報２４を用いて音声入力単位の識別が行われる。図２
音声入力単位識別情報２４と図２マッチング回路２５は
図１音声入力単位識別部１２を構成している。図２マッ
チング回路２５において判断された音声入力単位は、図
２音声入力単位記憶回路２６に記憶される。そして、次
の音声入力に関しては、図２音声入力単位記憶回路２６
の情報を基に音声認識処理を行い、図１音声入力単位識
別部１２での処理は行わない。図２音声入力単位記憶回
路２６の情報を基に音声認識処理を行った結果が失敗に
終わった際に、再び図１音声入力単位識別部１２におい
て入力単位の識別から処理を進める。The input voice converted into the characteristic parameter is
In the FIG. 2 matching circuit 25, the voice input unit is identified using the voice input unit identification information 24 of FIG. Figure 2
The voice input unit identification information 24 and the matching circuit 25 of FIG. 2 constitute the voice input unit identification unit 12 of FIG. The voice input unit determined by the matching circuit 25 of FIG. 2 is stored in the voice input unit storage circuit 26 of FIG. Then, regarding the next voice input, the voice input unit storage circuit 26 shown in FIG.
The voice recognition processing is performed on the basis of the above information, and the processing in the voice input unit identification unit 12 in FIG. 1 is not performed. When the result of the voice recognition processing based on the information of the voice input unit storage circuit 26 in FIG. 2 is unsuccessful, the voice input unit identification unit 12 in FIG.

【００１８】この処理の流れは、人間が雑音レベルの高
い環境で電話をする場合と類似している。例えば、「明
日の３時に成田に着く。」と発話して、相手が聞き取れ
なかった場合に、「明日の」、「３時に」、「成田
に」、「着く」と文節で発話したりする。或は、それで
も聞き取れなかった場合には単に、「明日」、「３
時」、「成田」と発話し、更に相手が「成田」と「羽
田」を聞き間違えた場合は「ナ」、「リ」、「タ」と音
節で発話したりする。The flow of this processing is similar to the case where a person makes a call in an environment with a high noise level. For example, if you say "I will arrive at Narita at 3 o'clock tomorrow" and the other party cannot hear you, say "Tomorrow", "3 o'clock", "To Narita", or "Arrived" in a phrase. .. Or, if you still can't hear, simply say "Tomorrow", "3
When the other person mistakenly hears "Narita" and "Haneda", he or she speaks "na", "ri", and "ta" in syllables.

【００１９】本発明の音声認識装置はこれらの発話単位
を識別し、それに最適の手法で音声認識処理を行う。The speech recognition apparatus of the present invention identifies these utterance units and performs speech recognition processing by an optimum method.

【００２０】図１音声入力単位識別部１２において音声
の入力単位が識別された結果にそった情報を備えている
のが、図２音声認識用辞書２８である。図２音声認識用
辞書２８は、図２音声認識情報３２、図２文法情報３
３、図２意味情報３４を備えている。図２ＤＰマッチン
グ回路２７において、図２特徴パラメータ列記憶回路２
３に記憶された結果と、図２音声認識用辞書２８とをＤ
Ｐマッチングし、符号列として音声認識する。図２ＤＰ
マッチング回路２７は図１音声認識部１３を構成してい
る。また図２ＤＰマッチング２７において音声認識され
た結果は、図２音声認識記憶回路２９に記憶される。図
２音声認識記憶回路２９は、図１音声認識記憶部１４を
構成する。The voice recognition dictionary 28 shown in FIG. 2 is provided with information according to the result of the voice input unit identifying section 12 identifying the voice input unit. The voice recognition dictionary 28 shown in FIG. 2 includes the voice recognition information 32 shown in FIG.
3, the FIG. 2 semantic information 34 is provided. In the DP matching circuit 27 shown in FIG.
The result stored in FIG. 3 and the voice recognition dictionary 28 shown in FIG.
P matching is performed and voice recognition is performed as a code string. Figure 2DP
The matching circuit 27 constitutes the voice recognition unit 13 in FIG. The result of voice recognition in the DP matching 27 of FIG. 2 is stored in the voice recognition storage circuit 29 of FIG. The voice recognition storage circuit 29 shown in FIG. 2 constitutes the voice recognition storage unit 14 shown in FIG.

【００２１】音声認識処理の結果は、図２音声認識記憶
回路２９に記憶されるとともに、図２表示部制御回路３
０の制御により図２表示部３１に出力され、操作者が確
認することが出来る。The result of the voice recognition processing is stored in the voice recognition storage circuit 29 shown in FIG.
It is output to the display unit 31 shown in FIG. 2 under the control of 0 and can be confirmed by the operator.

【００２２】本発明の構成要素である図１音声入力単位
識別部１２について、さらに詳しく説明する。The voice input unit identification unit 12 shown in FIG. 1, which is a constituent element of the present invention, will be described in more detail.

【００２３】図１音声入力単位識別部１２は先述のとお
り、図２音声入力単位識別情報２４と図２マッチング回
路２５よりなる。図２音声入力単位識別情報２４におい
て用いている情報は、入力音声の時間の長さ情報、及び
入力音声が含む音素数情報等である。As described above, the voice input unit identification section 12 of FIG. 1 comprises the voice input unit identification information 24 of FIG. 2 and the matching circuit 25 of FIG. The information used in the voice input unit identification information 24 in FIG. 2 is time length information of the input voice, phoneme number information included in the input voice, and the like.

【００２４】これらの図２音声入力単位識別情報２４を
用いて、入力音声時間の長さ情報があるしきい値以下の
長さであれば、「単語」或はそれよりも小さい単位での
入力であると判断する。また、入力音声時間の長さ情報
があるしきい値以上の長さであれば、「単語」よりも大
きな単位での入力であると判断する。If the length information of the input voice time is less than a certain threshold value by using the voice input unit identification information 24 shown in FIG. 2, the input is made by a "word" or a unit smaller than that. It is determined that If the length information of the input voice time is longer than a certain threshold value, it is determined that the input is in a unit larger than the "word".

【００２５】さらに、入力音声の単位が入力音声時間長
では識別しにくい場合には、入力音声が含む音素個数に
よっても音声入力単位を識別することが出来る。入力さ
れた音声のなかに存在する音素個数が、ある個数の範囲
であれば、「単語」、他の範囲であれば「文節」等と判
断できるように、音素数によっても入力単位をさらに限
定し、識別することが出来る。このようにして、容易に
しかも正確に入力音声の単位を識別することが出来る。Furthermore, if the input voice unit is difficult to identify by the input voice time length, the voice input unit can also be identified by the number of phonemes included in the input voice. The input unit is further limited by the number of phonemes so that it can be determined as "word" if the number of phonemes existing in the input speech is within a certain number range and "bunsetsu" if it is within another range. Can be identified. In this way, the unit of the input voice can be identified easily and accurately.

【００２６】図１音声入力単位識別部１２において識別
された入力単位の情報に基づいて、図１音声認識部１３
において音声認識処理に用いる知識を選択する。つま
り、図２音声入力単位識別部１２において、音声入力単
位が「単語」であると判断された場合には、図２音声認
識用辞書２８に記憶されている「単語」に関する情報
を、つまり、図２音声認識情報３２の中の「単語」に関
する情報を用いて音声認識処理を行い、図２文法情報３
３、図２意味情報３４は用いない。また、音声入力単位
が「文節」と識別されれば、「文節」に関する情報を、
つまり図２音声認識情報３２、図２文法情報３３、図２
意味情報３４の「文節」に関する情報を用いて、図２Ｄ
Ｐマッチング回路２７において音声認識処理が行われ
る。Based on the information of the input unit identified by the voice input unit identification unit 12 of FIG. 1, the voice recognition unit 13 of FIG.
In, the knowledge used for the voice recognition process is selected. That is, when the voice input unit identification unit 12 in FIG. 2 determines that the voice input unit is a “word”, the information regarding the “word” stored in the voice recognition dictionary 28 in FIG. FIG. 2 The grammatical information 3 shown in FIG.
3, the semantic information 34 of FIG. 2 is not used. Also, if the voice input unit is identified as "bunsetsu", information about "bunsetsu"
That is, FIG. 2 voice recognition information 32, FIG. 2 grammar information 33, FIG.
2D using the information about the “bunsetsu” of the semantic information 34.
A voice recognition process is performed in the P matching circuit 27.

【００２７】本発明について、本発明の（実施例１）に
基づいて更に説明する。本発明の一実施例である音声認
識装置に、単語単位で入力した場合の処理について説明
する。操作者が音声認識装置に「花笠音頭」と単語単
位で音声入力を行った。入力された音声は図２音声入力
部２１より受け付けられ、図２特徴抽出回路２２におい
て音声の特徴を抽出され、図２特徴パラメータ列記憶回
路２３に記憶される。音声特徴のひとつである音声入力
時間長は、この音声パワーの情報を用いてノイズと明ら
かに異なる音声パワーが観測された時間を計測すること
で、容易に知ることが出来る。ここで入力された音声の
音声入力時間長は０．８秒である。この音声入力時間の
情報によって、音声入力単位は単語もしくは、単語より
も小さな単位であると識別される。The present invention will be further described based on (Example 1) of the present invention. Processing when inputting in units of words in the voice recognition device that is an embodiment of the present invention will be described. The operator inputs a voice into the voice recognition device in units of words "Hanagasha Ondo". The input voice is received from the voice input unit 21 of FIG. 2, the feature of the voice is extracted by the feature extraction circuit 22 of FIG. 2, and is stored in the feature parameter string storage circuit 23 of FIG. The voice input time length, which is one of the voice characteristics, can be easily known by measuring the time when the voice power obviously different from noise is observed by using the voice power information. The voice input time length of the voice input here is 0.8 seconds. The voice input time information identifies the voice input unit as a word or a unit smaller than the word.

【００２８】次に音声入力単位識別情報の一つである入
力音声中に存在する音素数を調べる。７音素が存在して
いることが分かった。この情報からも音声入力単位が単
語もしくは、単語よりも小さな単位であると判定され
る。Next, the number of phonemes present in the input voice, which is one of the voice input unit identification information, is checked. It turns out that seven phonemes exist. Also from this information, it is determined that the voice input unit is a word or a unit smaller than the word.

【００２９】そこで、図１音声入力単位識別部１２より
得られた入力単位情報をもとに、図２音声認識用辞書２
８から音声入力単位にそった情報を選び、図１音声認識
部１３において音声認識処理が進められる。ここでは、
音声入力単位として識別された結果が「単語」であるか
ら、図２音声認識用辞書２８の中の図２音声認識情報３
２の単語に関する情報を用いて、図２ＤＰマッチング回
路２７において図２特徴パラメータ列記憶回路２３に記
憶されていた特徴パラメータ列とＤＰマッチングし、入
力音声は符号列として認識処理される。この際に、音声
入力単位が「文節」、「文章」単位の場合にのみ有効で
ある、情報は一切使用せずに音声認識処理を進めること
ができ、非常に効率的である。Therefore, based on the input unit information obtained from the voice input unit identifying section 12 of FIG. 1, the voice recognition dictionary 2 of FIG.
The information corresponding to the voice input unit is selected from 8, and the voice recognition processing is performed in the voice recognition unit 13 in FIG. here,
Since the result identified as the voice input unit is a “word”, the voice recognition information 3 of FIG. 2 in the voice recognition dictionary 28 of FIG.
The DP matching circuit 27 in FIG. 2 performs DP matching with the feature parameter sequence stored in the feature parameter sequence storage circuit 23 using the information about the two words, and the input speech is recognized as a code sequence. At this time, it is effective only when the voice input unit is a “bunsetsu” or “sentence” unit. The voice recognition process can proceed without using any information, which is very efficient.

【００３０】以上述べてきたような処理を経て入力音声
は音声認識され、図２表示部制御回路３０の制御におい
て、図２表示部３１より操作者が確認できる形態で出力
される。The input voice is voice-recognized through the processing as described above and is output from the display unit 31 of FIG. 2 in a form that can be confirmed by the operator under the control of the display unit control circuit 30 of FIG.

【００３１】また、音声入力単位を識別しない従来の音
声認識装置に、同じく「花笠音頭」と入力した場合に
は、「花が左辺だ」という意味がまったくとおらない非
文を音声認識結果として出力してしまう場合もある。こ
のように誤った音声認識結果を出力する原因は、音声入
力単位に対して不適切な辞書、知識が用いられ、音声認
識処理が進められたことであると考えられる。When "Hanakasa Ondo" is also input to the conventional voice recognition device which does not identify the voice input unit, a non-sentence meaning "flower is on the left side" is not recognized at all as a voice recognition result. It may be output. It is considered that the cause of outputting the erroneous voice recognition result is that the voice recognition processing is advanced due to the use of an inappropriate dictionary or knowledge for the voice input unit.

【００３２】尚、（実施例１）では音声入力部として、
マイク、高域強調フィルタ、ＡＤ変換器より構成し、デ
ジタル信号としてサンプリングしたものを用いたが、迅
速に入力音声をサンプリングできるものであれば、それ
以外の構成であってもかまわない。また、特徴抽出回路
では、デジタル信号に変換された音声信号を周波数変換
し、周波数領域での特徴パラメータを抽出し、発声され
た単語の特徴パラメータ列として表す方法を用いたが、
これ以外の方法であっても特徴を的確に抽出できる方法
であればかまわない。また、音声入力単位を識別する情
報として、音声入力時間情報、音素個数情報を用いる方
法を示したが、迅速に正確に音声入力単位を識別できる
方法であれば、これ以外の方法であってもかまわない。
また、音声認識結果を操作者に知らせる手段として、
（実施例１）では表示部に音声認識結果を表示する方法
を用いたが、これ以外の方法であっても、音声認識結果
を迅速に操作者に知らせることが出来る方法であれば構
わない。In the first embodiment, as the voice input section,
A microphone, a high-frequency emphasis filter, and an AD converter, which are sampled as a digital signal, are used, but any other structure may be used as long as the input voice can be sampled quickly. Further, in the feature extraction circuit, the method of frequency-converting the voice signal converted into the digital signal, extracting the feature parameter in the frequency domain, and expressing it as the feature parameter string of the uttered word is used.
Any other method may be used as long as it can accurately extract the features. Although the method of using the voice input time information and the phoneme number information as the information for identifying the voice input unit is shown, any other method may be used as long as it is a method capable of quickly and accurately identifying the voice input unit. I don't care.
Also, as a means of notifying the operator of the voice recognition result,
Although the method of displaying the voice recognition result on the display unit is used in the first embodiment, any method other than this may be used as long as it can promptly notify the operator of the voice recognition result.

【００３３】[0033]

【発明の効果】以上述べてきたように本発明の音声認識
装置は、入力音声の入力単位を識別し、音声入力単位に
あった的確な情報のみを用いることで、極めて速やかに
音声認識処理を行うことが出来るようになった。そのた
め、音声認識装置に異なった入力単位で音声入力を行う
ことが可能になり、使用環境の変化や、音声入力データ
の変化により、音声入力単位が頻繁に変化するような状
況にも非常に柔軟に対応できるようになり、音声入力装
置使用の用途を大幅に広げることが出来るようになっ
た。As described above, the voice recognition apparatus of the present invention recognizes the input unit of the input voice, and uses only the correct information that is in the voice input unit, so that the voice recognition process can be performed very quickly. I can do it now. As a result, it is possible to input voice to the voice recognition device in different input units, and it is very flexible even in situations where the voice input unit changes frequently due to changes in the usage environment or voice input data. Now, it is possible to greatly expand the use of voice input devices.

【００３４】また、あらゆる単位での音声入力を迅速に
使用できるため、音声認識装置を使用する操作者が音声
入力操作方法を修得するために要する時間が極めて削減
できるようになった。そのため、多くの人が音声認識装
置を使用する環境ができやすくなった。Further, since the voice input in every unit can be quickly used, the time required for the operator using the voice recognition device to master the voice input operation method can be extremely reduced. Therefore, it has become easier for many people to create an environment in which the voice recognition device is used.

[Brief description of drawings]

【図１】本発明の音声認識装置の原理ブロック図であ
る。FIG. 1 is a principle block diagram of a voice recognition device of the present invention.

【図２】本発明の一実施例のブロック図である。FIG. 2 is a block diagram of an embodiment of the present invention.

【図３】従来の音声認識訂正装置のブロック図であ
る。FIG. 3 is a block diagram of a conventional voice recognition correction device.

[Explanation of symbols]

１音響分析部２音声認識部３記憶部１１音響分析部１２音声入力単位識別部１３音声認識部１４音声認識記憶部２１音声入力部２２特徴抽出回路２３特徴パラメータ列記憶回路２４音声入力単位識別情報２５マッチング回路２６音声入力単位記憶回路２７ＤＰマッチング回路２８音声認識用辞書２９音声認識記憶回路３０表示部制御回路３１表示部３２音声認識情報３３文法情報３４意味情報 DESCRIPTION OF SYMBOLS 1 acoustic analysis unit 2 speech recognition unit 3 storage unit 11 acoustic analysis unit 12 speech input unit identification unit 13 speech recognition unit 14 speech recognition storage unit 21 speech input unit 22 feature extraction circuit 23 characteristic parameter sequence storage circuit 24 speech input unit identification information 25 Matching Circuit 26 Voice Input Unit Storage Circuit 27 DP Matching Circuit 28 Voice Recognition Dictionary 29 Voice Recognition Storage Circuit 30 Display Control Circuit 31 Display 32 Voice Recognition Information 33 Grammar Information 34 Semantic Information

Claims

Claim: What is claimed is: 1. An acoustic analysis unit that outputs characteristics of an input voice, a voice input unit identification unit that determines a grammatical unit of the input voice, and the voice input unit identification. A speech recognition unit that converts the output of the acoustic analysis unit into a code string based on the output of the unit, an acoustic analysis storage unit that stores the output of the acoustic analysis unit, and a speech recognition storage that stores the output of the speech recognition unit. A voice recognition device comprising: