JPH07244498A

JPH07244498A - Basic frequency extractor

Info

Publication number: JPH07244498A
Application number: JP3340894A
Authority: JP
Inventors: Tadashi Yonezaki; 正米崎
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1994-03-03
Filing date: 1994-03-03
Publication date: 1995-09-19

Abstract

(57)【要約】【目的】Ａ−ｂ−Ｓ手法を導入して、正確な基本周波
数を抽出する。【構成】入力音声自己相関関数算出器１５で算出した
入力音声の自己相関関数と合成音声自己相関関数算出器
１７で算出した合成音声の自己相関関数を誤差算出器１
６により比較し、周波数調整器１３で基本周波数を調整
していくことで基本周波数抽出する。 (57) [Abstract] [Purpose] The A-B-S method is introduced to extract an accurate fundamental frequency. An error calculator 1 calculates an autocorrelation function of input speech calculated by the input speech autocorrelation function calculator 15 and an autocorrelation function of synthesized speech calculated by the synthesized speech autocorrelation function calculator 17.
6, and the fundamental frequency is adjusted by the frequency adjuster 13 to extract the fundamental frequency.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ディジタル移動通信な
どに利用し、音声信号を分析および合成する際に必要と
なる基本周波数抽出装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a fundamental frequency extraction device which is used in digital mobile communication or the like and is required when analyzing and synthesizing a voice signal.

【０００２】[0002]

【従来の技術】近年、音声信号を分析及び合成する技術
は、データ伝送・蓄積媒体を効率的に活用する情報圧縮
の基礎技術として注目されている。中でも、音声信号の
分析合成処理において重要なパラメータである基本周波
数を誤りなく抽出する基本周波数抽出装置は合成音声の
品質を向上する上で、広く求められている。2. Description of the Related Art In recent years, a technique of analyzing and synthesizing a voice signal has been attracting attention as a basic technique of information compression which efficiently utilizes a data transmission / storage medium. Above all, a fundamental frequency extraction device that extracts a fundamental frequency, which is an important parameter in the analysis and synthesis processing of a speech signal, without error is widely required in order to improve the quality of synthesized speech.

【０００３】以下、従来の基本周波数抽出装置について
説明する。図１７は、従来の基本周波数抽出装置のブロ
ック図を示す。図１７において、４１は音声を入力する
音声入力器である。４２は自己相関算出器であり、入力
された音声の自己相関関数を算出する。４３は誤差関数
算出器であり、求められた自己相関から基本周波数とし
ての確からしさを求める。誤差関数算出器４３で求めら
れた誤差関数は、バッファ４４にｎフレーム分（ｎは任
意）蓄積される。４５は基本周波数抽出器であり、蓄積
されている誤差関数値の和が最小となる周波数を抽出す
る。A conventional fundamental frequency extraction device will be described below. FIG. 17 shows a block diagram of a conventional fundamental frequency extraction device. In FIG. 17, reference numeral 41 is a voice input device for inputting voice. An autocorrelation calculator 42 calculates the autocorrelation function of the input voice. Reference numeral 43 denotes an error function calculator, which obtains the certainty as a fundamental frequency from the obtained autocorrelation. The error function obtained by the error function calculator 43 is accumulated in the buffer 44 for n frames (n is arbitrary). Reference numeral 45 is a fundamental frequency extractor, which extracts the frequency at which the sum of the accumulated error function values is the minimum.

【０００４】以上のように構成され基本周波数抽出装置
において、音声入力器４１により入力された音声から自
己相関算出器４２で自己相関関数が求められる。求めら
れた自己相関関数に基づいて図１８に示す誤差関数が誤
差関数算出器４３によって求められる。また、図１９
（ａ）に示すように、フレーム毎に求められた誤差関数
はバッファ４４にｎフレーム蓄積され、この関数の和が
最小となるように推移する図１９（ｂ）に示す基本周波
数を基本周波数抽出器４５により抽出する。In the fundamental frequency extraction device configured as described above, the autocorrelation calculator 42 obtains the autocorrelation function from the voice input by the voice input device 41. The error function calculator 43 calculates the error function shown in FIG. 18 based on the calculated autocorrelation function. In addition, FIG.
As shown in (a), the error function obtained for each frame is accumulated in the buffer 44 for n frames, and the fundamental frequency extracted from the fundamental frequency shown in FIG. 19 (b) is changed so that the sum of these functions is minimized. It is extracted by the device 45.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記の
ような従来の基本周波数抽出装置では、入力された音声
信号の位相情報を考慮せずに音声信号を分析・合成する
システムの場合、入力音声信号の波形と合成音声の波形
は、図２０および図２１に示すように全く異なるため、
これらの波形を直接比較することができず、抽出された
基本周波数の妥当性を確認することが困難である。特に
真の基本周波数に対して整数倍の周波数（倍ピッチ）や
整数分の一の周波数（半ピッチ）を基本周波数として、
誤って抽出することが多いという問題を有していた。However, in the conventional fundamental frequency extraction device as described above, in the case of a system for analyzing and synthesizing an audio signal without considering the phase information of the input audio signal, the input audio signal Since the waveform of and the waveform of the synthesized voice are completely different as shown in FIGS. 20 and 21,
Since it is not possible to directly compare these waveforms, it is difficult to confirm the validity of the extracted fundamental frequency. Especially, the frequency that is an integral multiple of the true fundamental frequency (double pitch) or the frequency that is an integral fraction (half pitch) is the basic frequency,
It had a problem that it was often mistakenly extracted.

【０００６】本発明は上記従来の問題を解決するもので
あり、いかなる音声分析合成手法を用いた音声合成器に
おいても正確な基本周波数を抽出する基本周波数抽出装
置を提供することを目的とする。The present invention solves the above-mentioned conventional problems, and an object of the present invention is to provide a fundamental frequency extraction device for extracting an accurate fundamental frequency in a speech synthesizer using any speech analysis and synthesis method.

【０００７】また、本発明は高調波周波数毎に音声を合
成するシステムに対して、倍ピッチ、半ピッチ誤りを避
けることを可能とした優れた基本周波数抽出装置を提供
することを目的とする。Another object of the present invention is to provide an excellent fundamental frequency extraction device capable of avoiding double-pitch and half-pitch errors in a system that synthesizes voice for each harmonic frequency.

【０００８】[0008]

【課題を解決するための手段】上記目的を達成するため
に、請求項１の発明は、音声を入力する音声入力器と、
前記入力された音声の自己相関関数を算出する入力音声
自己相関算出器と、前記入力された音声の周期を分析し
初期基本周波数を求める初期基本周波数分析器と、前記
初期基本周波数を誤差算出器の出力によって調整し基本
周波数を定める基本周波数調整装置と、前記基本周波数
に従って音声を分析・合成する音声分析合成装置と、前
記合成音声の自己相関関数を算出する合成音声自己相関
算出器と、前記入力音声および合成音声の自己相関関数
の誤差を算出する誤差算出器とを備えてなる構成にし
た。In order to achieve the above object, the invention of claim 1 is a voice input device for inputting voice,
An input speech autocorrelation calculator that calculates an autocorrelation function of the input speech, an initial fundamental frequency analyzer that analyzes the period of the input speech to obtain an initial fundamental frequency, and an error calculator that calculates the initial fundamental frequency. A fundamental frequency adjusting device that determines a fundamental frequency by adjusting the output of the device, a speech analysis and synthesis device that analyzes and synthesizes speech according to the fundamental frequency, a synthesized speech autocorrelation calculator that calculates an autocorrelation function of the synthesized speech, An error calculator for calculating an error of the autocorrelation function of the input voice and the synthesized voice is provided.

【０００９】また、請求項２の発明は、前記音声合成手
段が基本周波数に基づいて周波数を高調波帯域に分割す
る高調波帯域分割器と、前記分割された高調波毎に高調
波の振幅を求める高調波振幅算出器と、前記算出された
振幅に従い高調波毎に信号を生成する高調波音源発生器
とから構成され、前記入力音声自己相関算出器と前記合
成音声自己相関算出器とを、前記高調波音源発生器で生
成された全ての高調波を加算することによって第１の合
成音声を生成する第１の高調波音源合成器と、得られた
高調波音源のうち基本周波数を除いた高調波音源を加算
することによって第２の合成音声を生成する第２の高調
波音源合成器とに置き換えてなるものである。[0009] According to a second aspect of the present invention, the voice synthesizing means divides a frequency into harmonic bands based on a fundamental frequency, and a harmonic amplitude for each of the divided harmonics. A harmonic amplitude calculator to be obtained, and a harmonic sound source generator that generates a signal for each harmonic according to the calculated amplitude, the input speech autocorrelation calculator and the synthesized speech autocorrelation calculator, A first harmonic sound source synthesizer that generates a first synthesized voice by adding all the harmonics generated by the harmonic sound source generator, and a fundamental frequency of the obtained harmonic sound source is removed. This is replaced with a second harmonic sound source synthesizer that generates a second synthesized voice by adding harmonic sound sources.

【００１０】[0010]

【作用】上記請求項１の構成により、入力音声信号の位
相情報を考慮せずに音声を分析・合成するために入力音
声と合成音声を直接比較することができないシステムに
おいても、自己相関関数を比較することによって抽出さ
れた基本周波数を補正することができる。そして、この
ようなＡ−ｂ−Ｓ手法（合成による分析法）を導入した
閉ループ構造を用いることで、入力音声に最も近い基本
周波数を抽出することができる。According to the structure of claim 1, even in a system in which the input voice and the synthesized voice cannot be directly compared to analyze and synthesize the voice without considering the phase information of the input voice signal, the autocorrelation function is The extracted fundamental frequency can be corrected by comparing. Then, the fundamental frequency closest to the input voice can be extracted by using a closed loop structure in which such an A-B-S method (analysis method by synthesis) is introduced.

【００１１】また、請求項２の構成により、入力音声信
号が高城ろ波されている場合、抽出された基本周波数を
基準として分析した周波数特性を持つ合成音声と、この
合成音声から基本周波数成分だけを取り除いた周波数特
性をもつ合成音声の自己相関関数を比較することによっ
て、倍ピッチ、半ピッチ誤りを検出することが可能とな
り、さらに正確な基本周波数を抽出することができる。According to the structure of claim 2, when the input voice signal is filtered by Takashiro, only the synthesized voice having the frequency characteristic analyzed with the extracted fundamental frequency as a reference and the fundamental frequency component from this synthesized voice. By comparing the autocorrelation functions of the synthesized speech having the frequency characteristics with the removed, it is possible to detect double pitch and half pitch errors, and it is possible to extract a more accurate fundamental frequency.

【００１２】[0012]

【実施例】以下本発明の一実施例について、図面を参照
しながら説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【００１３】図１において、１１は音声を入力する音声
入力器、１２は基本周波数分析器であり、入力音声を分
析して以降の処理における基本周波数の初期値を求め
る。１３は周波数調整器であり、基本周波数分析器１２
から得られた基本周波数を初期値として、誤差算出器１
６からの出力に応じて調整する。１４は音声分析合成器
であり、周波数調整器１３により定められた基本周波数
に基づいて音声信号を分析し合成音声を生成する。１５
は入力音声自己相関関数算出器であり、入力音声の自己
相関関数を算出する。In FIG. 1, 11 is a voice input device for inputting voice, and 12 is a fundamental frequency analyzer, which analyzes the input voice and obtains an initial value of a fundamental frequency in the subsequent processing. Reference numeral 13 is a frequency adjuster, which is a fundamental frequency analyzer 12
Error calculator 1 with the fundamental frequency obtained from
Adjust according to the output from 6. Reference numeral 14 is a voice analysis / synthesis device, which analyzes a voice signal based on the fundamental frequency determined by the frequency adjuster 13 to generate a synthetic voice. 15
Is an input speech autocorrelation function calculator, which calculates an autocorrelation function of the input speech.

【００１４】誤差算出器１６は、その誤差算出関数によ
って入力音声自己相関関数算出器１５から得られる入力
音声の自己相関関数と、合成音声自己相関関数算出器１
７から得られる合成音声の自己相関関数との誤差を算出
する。合成音声自己相関関数算出器１７では、音声分析
合成器１４により生成された合成音声の自己相関関数を
求める。The error calculator 16 calculates the input speech autocorrelation function obtained from the input speech autocorrelation function calculator 15 by the error calculation function and the synthesized speech autocorrelation function calculator 1.
The error from the autocorrelation function of the synthetic speech obtained from 7 is calculated. The synthetic speech autocorrelation function calculator 17 obtains the autocorrelation function of the synthetic speech generated by the speech analysis / synthesis unit 14.

【００１５】図２は、高調波周波数帯域毎に音声信号を
分析・合成するシステムを用いることによって倍ピッチ
誤りを検出することを可能とする音声分析合成器の構成
を示すブロック図である。FIG. 2 is a block diagram showing the arrangement of a voice analysis / synthesis device that enables detection of double-pitch errors by using a system for analyzing / synthesizing voice signals for each harmonic frequency band.

【００１６】図２において、１３は周波数調整器であ
り、入力音声信号を分析する際の基本周波数を調整す
る。２２は高調波帯域分割器であり、周波数調整器１３
で定められた基本周波数に基づいて周波数帯域を分割す
る。２３は高調波振幅算出器であり、高調波帯域分割器
２２によって分割された周波数帯域毎に高調波スペクト
ルの振幅を算出し、その振幅値および高調波周波数に基
づいて各高調波周波数毎の音源を高調波音源生成器２４
で生成する。２５は第１の高調波音源合成器であり、高
調波音源生成器２４で得られる音源を全て加算して合成
音声ａを生成し合成音声ａの自己相関関数ａを第１の
自己相関関数算出器２６で求める。In FIG. 2, reference numeral 13 is a frequency adjuster, which adjusts the fundamental frequency when analyzing the input voice signal. 22 is a harmonic band divider, which is a frequency adjuster 13
The frequency band is divided based on the fundamental frequency defined in. Reference numeral 23 is a harmonic amplitude calculator, which calculates the amplitude of the harmonic spectrum for each frequency band divided by the harmonic band divider 22, and based on the amplitude value and the harmonic frequency, the sound source for each harmonic frequency. The harmonic source generator 24
Generate with. Reference numeral 25 denotes a first harmonic sound source synthesizer, which adds all the sound sources obtained by the harmonic sound source generator 24 to generate a synthetic speech a. Obtained with the vessel 26.

【００１７】２７は第２の高調波音源合成器であり、高
調波音源生成器２４で得られる音源のうち、基本周波数
音源を除く全ての音源を加算して合成音声ｂを生成し、
この合成音声ｂの自己相関関数ｂを第２の自己相関関数
算出器２８で求める。２９は誤差算出器であり、自己相
関関数ａと自己相関関数ｂを比較し、周波数調整器１３
で定められた基本周波数が倍ピッチであるか否かを判定
する。Reference numeral 27 denotes a second harmonic sound source synthesizer, which adds all the sound sources except the fundamental frequency sound source among the sound sources obtained by the harmonic sound source generator 24 to generate a synthesized voice b,
The autocorrelation function b of the synthesized voice b is obtained by the second autocorrelation function calculator 28. An error calculator 29 compares the autocorrelation function a and the autocorrelation function b, and the frequency adjuster 13
It is determined whether or not the fundamental frequency defined in step 2 is a double pitch.

【００１８】以上のように構成された本実施例の基本周
波数抽出装置について、その動作を説明する。The operation of the fundamental frequency extraction device of this embodiment constructed as described above will be described.

【００１９】図１において、音声入力器１１により入力
された図３に示す音声信号から基本周波数分析器１２に
よって以降の処理における基本周波数の初期値が求めら
れる。周波数調整器１３では、得られた初期値を基準に
して、音声合成器１１で音声信号を分析する際に用いる
基本周波数を定める。音声分析合成器１４では、定めら
れた基本周波数に基づいて音声信号を分析し図５に示す
ような合成音声を生成する。入力音声自己相関関数算出
器１５によって、図４に示す入力音声の自己相関関数を
算出し、合成音声自己相関関数算出器１７によって、図
５に示す合成音声の自己相関関数（図６参照）を求め
る。これらの入力音声、および合成音声の自己相関関数
の誤差を誤差算出器１６によって求め、誤差が最小とな
る基本周波数に周波数調整器１３により基本周波数を調
整する。In FIG. 1, the fundamental frequency analyzer 12 obtains the initial value of the fundamental frequency in the subsequent processing from the speech signal shown in FIG. The frequency adjuster 13 determines the basic frequency used when the voice synthesizer 11 analyzes the voice signal based on the obtained initial value. The voice analysis / synthesis unit 14 analyzes the voice signal based on the determined fundamental frequency to generate a synthetic voice as shown in FIG. The input speech autocorrelation function calculator 15 calculates the input speech autocorrelation function shown in FIG. 4, and the synthetic speech autocorrelation function calculator 17 calculates the synthesized speech autocorrelation function (see FIG. 6) shown in FIG. Ask. The error of the autocorrelation function of the input voice and the synthesized voice is obtained by the error calculator 16, and the fundamental frequency is adjusted by the frequency adjuster 13 to the fundamental frequency at which the error is minimized.

【００２０】次に、高調波周波数帯域毎に音声信号を分
析・合成するシステムを用いることによって倍ピッチ誤
りを検出する場合の動作について、図７の流れ図および
図８〜図１２を参照して説明する。Next, the operation in the case of detecting the double pitch error by using the system for analyzing and synthesizing the voice signal for each harmonic frequency band will be described with reference to the flowchart of FIG. 7 and FIGS. 8 to 12. To do.

【００２１】まず、図７のステップ３１において、基本
周波数抽出アルゴリズムの終了条件（ｆｌａｇ←０、し
きい値Ｅｍ）を設定する。次のステップ３２では周波数
調整器１３により、入力音声信号を分析する際の基本周
波数を設定する。このときの入力音声のパワー・スペク
トルと基本周波数を図８に示す。ここで定められた周波
数に基づいて周波数帯域を基本高調波帯域分割器２２で
図９に示すように分割する（ステップ３３）。高調波振
幅算出器２３では、分割された周波数帯域毎に、それぞ
れの高調波スペクトルの振幅を算出する（ステップ３
４）。この振幅値および高調波周波数に基づいて各高調
波周波数毎の音源を高調波音源生成器２４で図１０に示
すように生成する。（ステップ３６）。得られた高調波
音源から次の２種類の合成音声を求める。First, in step 31 of FIG. 7, the end condition (flag ← 0, threshold value Em) of the fundamental frequency extraction algorithm is set. In the next step 32, the frequency adjuster 13 sets the fundamental frequency for analyzing the input audio signal. FIG. 8 shows the power spectrum and fundamental frequency of the input voice at this time. Based on the frequency determined here, the frequency band is divided by the fundamental harmonic band divider 22 as shown in FIG. 9 (step 33). The harmonic amplitude calculator 23 calculates the amplitude of each harmonic spectrum for each of the divided frequency bands (step 3).
4). Based on this amplitude value and the harmonic frequency, the harmonic sound source generator 24 generates a sound source for each harmonic frequency as shown in FIG. (Step 36). The following two types of synthesized speech are obtained from the obtained harmonic sound source.

【００２２】まず、第１の高調波音源合成器２５では、
全ての高調波音源から音声を図１１に示すように合成し
（ステップ３８）、第２の高調波音源合成器２７では、
基本周波数音源を除く全ての音源から音声を図１２に示
すように合成する（ステップ３９）。ここに得られた２
つの合成音声の自己相関関数を、それぞれの自己相関関
数算出器２６、２８で求める。（ステップ４０、４
１）。First, in the first harmonic sound source synthesizer 25,
Speech is synthesized from all the harmonic sound sources as shown in FIG. 11 (step 38), and in the second harmonic sound source synthesizer 27,
Speech is synthesized from all sound sources except the fundamental frequency sound source as shown in FIG. 12 (step 39). 2 obtained here
The autocorrelation function of each synthetic speech is obtained by the autocorrelation function calculator 26, 28. (Steps 40, 4
1).

【００２３】誤差算出器２９では、まず、ステップ４２
において、２つの自己相関関数の差ｅを求め、次のステ
ップ４３において、その差ｅをしきい値Ｅｍと比較す
る。最後に、ステップ４４で、基本周波数抽出アルゴリ
ズム完了フラグがセットされているか否かを判定する。
ステップ４３において、ｅ＜Ｅｍでないと判定された場
合は、基本周波数抽出アルゴリズム完了フラグを「１」
にセットし、基本周波数調整器１３で基本周波数を半分
にし、（ステップ３７）、上記プロセスをくり返す。ま
た、ステップ４４において、ｆｌａｇ＝１でないと判定
された場合は、基本周波数調整器１３で基本周波数を倍
にし（ステップ３７）、上記プロセスを繰り返す。ステ
ップ４４が真であれば、ステップ４５において基本周波
数を２倍して、抽出した基本周波数とする。In the error calculator 29, first, step 42
In, the difference e between the two autocorrelation functions is obtained, and in the next step 43, the difference e is compared with the threshold value Em. Finally, in step 44, it is determined whether the fundamental frequency extraction algorithm completion flag is set.
When it is determined in step 43 that e <Em is not satisfied, the fundamental frequency extraction algorithm completion flag is set to "1".
, The fundamental frequency adjuster 13 halves the fundamental frequency (step 37), and the above process is repeated. When it is determined in step 44 that flag = 1 is not satisfied, the fundamental frequency adjuster 13 doubles the fundamental frequency (step 37), and the above process is repeated. If step 44 is true, the fundamental frequency is doubled in step 45 to obtain the extracted fundamental frequency.

【００２４】図１３、図１４および図１５、図１６はス
テップ４０、４１において、それぞれの自己相関関数算
出器２６、２８で求められた出力波形を示すもので、図
１３、図１４は基本周波数が正しく抽出されている場合
において得られる自己相関関数を表し、基本周波数を倍
ピッチで誤った場合において得られる自己相関関数を表
す。図１５、図１６から明らかなように、倍ピッチ誤り
を起こしたときの自己相関関数は、他と比較し、大きく
異なることがわかる。FIGS. 13, 14 and 15, and 16 show the output waveforms obtained by the autocorrelation function calculators 26 and 28 in steps 40 and 41, respectively, and FIGS. 13 and 14 show the fundamental frequency. Represents the autocorrelation function obtained when is correctly extracted, and the autocorrelation function obtained when the fundamental frequency is erroneous in double pitch. As is apparent from FIGS. 15 and 16, the autocorrelation function when a double pitch error occurs is significantly different from the others.

【００２５】上記本実施例による基本周波数抽出装置に
おいては、入力音声と合成音声を直接比較することがで
きない音声分析合成器においても、閉ループを構成しＡ
−ｂ−Ｓ手法を導入することが可能になる点で優れた効
果が得られる。In the fundamental frequency extracting apparatus according to the present embodiment, even in the voice analysis / synthesizer which cannot directly compare the input voice and the synthesized voice, a closed loop is formed.
An excellent effect is obtained in that the -bS method can be introduced.

【００２６】また、２種類の合成音声の自己相関関数を
比較することで、基本周波数の倍ピッチ抽出誤りを検出
でき、正確な基本周波数を抽出することができる点で優
れた効果が得られる。Further, by comparing the autocorrelation functions of the two types of synthesized speech, the double pitch extraction error of the fundamental frequency can be detected, and an excellent effect is obtained in that the accurate fundamental frequency can be extracted.

【００２７】以上のように本実施例によれば、自己相関
関数算出器を設けることにより、音声分析合成器に依存
することなくＡ−ｂ−Ｓ手法を可能とする閉ループ構成
を導入することができ、正確に基本周波数を抽出するこ
とができる。As described above, according to the present embodiment, by providing the autocorrelation function calculator, it is possible to introduce a closed loop configuration that enables the A-B-S method without depending on the voice analysis / synthesis device. Therefore, the fundamental frequency can be accurately extracted.

【００２８】[0028]

【発明の効果】以上のように本発明においては、自己相
関関数算出器を設けることにより、音声合成装置に依存
することなくＡ−ｂ−Ｓ手法を可能とする閉ループ構成
を導入することができ、正確に基本周波数を抽出するこ
とができる優れた基本周波数抽出装置を実現できる。As described above, in the present invention, by providing the autocorrelation function calculator, it is possible to introduce a closed loop configuration that enables the A-B-S method without depending on the speech synthesizer. Therefore, it is possible to realize an excellent fundamental frequency extracting device that can accurately extract the fundamental frequency.

【００２９】また、本発明においては、２種類の高調波
音源合成器、および自己相関関数算出器を設けることに
より、基本周波数の倍ピッチ抽出誤りを検出し、正確に
基本周波数を抽出することができる優れた基本周波数抽
出装置を実現できる。Further, in the present invention, by providing two kinds of harmonic sound source synthesizers and an autocorrelation function calculator, a double pitch extraction error of the fundamental frequency can be detected and the fundamental frequency can be accurately extracted. An excellent fundamental frequency extraction device that can be realized can be realized.

[Brief description of drawings]

【図１】本発明の実施例における基本周波数抽出装置の
ブロック図FIG. 1 is a block diagram of a fundamental frequency extraction device according to an embodiment of the present invention.

【図２】本発明の実施例における音声分析合成器の構成
を示すブロック図FIG. 2 is a block diagram showing a configuration of a voice analysis / synthesis device according to an exemplary embodiment of the present invention.

【図３】本実施例における入力音声の波形図FIG. 3 is a waveform diagram of an input voice in this embodiment.

【図４】本実施例における入力音声の自己相関関数を表
す図FIG. 4 is a diagram showing an autocorrelation function of input speech in the present embodiment.

【図５】本実施例における合成音声の波形図FIG. 5 is a waveform diagram of synthesized speech in this embodiment.

【図６】本実施例における合成音声の自己相関関数を表
す図FIG. 6 is a diagram showing an autocorrelation function of synthesized speech in the present embodiment.

【図７】本実施例における基本周波数抽出装置の動作手
順を示す流れ図FIG. 7 is a flowchart showing an operation procedure of the fundamental frequency extraction device in the present embodiment.

【図８】本実施例における入力音声のパワー・スペクト
ルと基本周波数を表す図FIG. 8 is a diagram showing a power spectrum and a fundamental frequency of input speech in the present embodiment.

【図９】本実施例における基本周波数に基づく高調波帯
域の分割を表す図FIG. 9 is a diagram showing division of a harmonic band based on a fundamental frequency in the present embodiment.

【図１０】本実施例における各高調波音源の波形図FIG. 10 is a waveform diagram of each harmonic sound source in this embodiment.

【図１１】本実施例における高調波音源合成器２５によ
る合成音声の波形図FIG. 11 is a waveform diagram of synthesized speech by the harmonic sound source synthesizer 25 in the present embodiment.

【図１２】本実施例における高調波音源合成器２７によ
る合成音声の波形図FIG. 12 is a waveform diagram of synthesized speech by the harmonic sound source synthesizer 27 in the present embodiment.

【図１３】本実施例における基本周波数が正しいときの
自己相関関数算出器２６の出力波形図FIG. 13 is an output waveform diagram of the autocorrelation function calculator 26 when the fundamental frequency is correct in the present embodiment.

【図１４】本実施例における基本周波数が正しいときの
自己相関関数算出器２８の出力波形図FIG. 14 is an output waveform diagram of the autocorrelation function calculator 28 when the fundamental frequency is correct in the present embodiment.

【図１５】本実施例における基本周波数が誤ったときの
自己相関関数算出器２６の出力波形図FIG. 15 is an output waveform diagram of the autocorrelation function calculator 26 when the fundamental frequency is incorrect in the present embodiment.

【図１６】本実施例における基本周波数が誤ったときの
自己相関関数算出器２８の出力波形図FIG. 16 is an output waveform diagram of the autocorrelation function calculator 28 when the fundamental frequency is incorrect in the present embodiment.

【図１７】従来の基本周波数抽出装置のブロック図FIG. 17 is a block diagram of a conventional fundamental frequency extraction device.

【図１８】従来の誤差関数を表す説明図FIG. 18 is an explanatory diagram showing a conventional error function.

【図１９】従来の累積誤差と基本周波数を表す説明図FIG. 19 is an explanatory diagram showing a conventional accumulated error and a fundamental frequency.

【図２０】従来の入力信号の波形を示す図FIG. 20 is a diagram showing a waveform of a conventional input signal.

【図２１】従来の合成音声信号の波形を示す図FIG. 21 is a diagram showing a waveform of a conventional synthesized speech signal.

[Explanation of symbols]

１１音声入力器１２基本周波数分析器１３周波数調整器１４音声分析合成器１５入力音声自己相関関数算出器１６誤差算出器１７合成音声自己相関関数算出器２２高調波帯域分割器２３高調波振幅算出器２４高調波音源生成器２５第１の高調波音源合成器２６第１の自己相関関数算出器２７第２の高調波音源合成器２８第２の自己相関関数算出器２９誤差算出器４１音声入力器４２自己相関算出器４３誤差関数算出器４４バッファ４５基本周波数抽出器 11 voice input device 12 fundamental frequency analyzer 13 frequency adjuster 14 voice analysis synthesizer 15 input voice autocorrelation function calculator 16 error calculator 17 synthetic voice autocorrelation function calculator 22 harmonic band divider 23 harmonic amplitude calculator 24 Harmonic Source Generator 25 First Harmonic Source Synthesizer 26 First Autocorrelation Function Calculator 27 Second Harmonic Source Synthesizer 28 Second Autocorrelation Function Calculator 29 Error Calculator 41 Voice Input Device 42 autocorrelation calculator 43 error function calculator 44 buffer 45 fundamental frequency extractor

Claims

[Claims]

1. A voice input means for inputting voice, an input voice autocorrelation calculating means for calculating an autocorrelation function of the input voice, and an initial stage for obtaining an initial fundamental frequency by analyzing a cycle of the input voice. A fundamental frequency analyzing means, a fundamental frequency adjusting device for determining the fundamental frequency by adjusting the initial fundamental frequency by the output of the error calculating means, a voice analysis / synthesizing device for analyzing and synthesizing a voice according to the fundamental frequency, and a self-synthesizing voice A fundamental frequency extraction device comprising: synthetic speech autocorrelation calculation means for calculating a correlation function; and error calculation means for calculating an error of the autocorrelation function of the input speech and the synthesized speech.

2. A harmonic band dividing means for dividing the frequency into harmonic bands based on a fundamental frequency by the voice synthesizing means, and a harmonic amplitude calculating means for obtaining an amplitude of the harmonic for each of the divided harmonics. A harmonic sound source generating means for generating a signal for each harmonic according to the calculated amplitude, and the input sound autocorrelation calculating means and the synthesized sound autocorrelation calculating means are First harmonic sound source synthesizing means for generating a first synthesized voice by adding all generated harmonic waves, and adding a harmonic sound source excluding the fundamental frequency of the obtained harmonic sound sources The fundamental frequency extraction device according to claim 1, wherein the fundamental frequency extraction device is replaced with a second harmonic sound source synthesizing means for generating a second synthetic voice.