JPH0449960B2

JPH0449960B2 -

Info

Publication number: JPH0449960B2
Application number: JP59132385A
Authority: JP
Inventors: Satoru Taguchi; Takayuki Ishikawa
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1984-06-27
Filing date: 1984-06-27
Publication date: 1992-08-12
Also published as: JPS6111800A

Description

【発明の詳細な説明】〔技術分野〕本発明は残差励振型ボコーダ、特にスペクトル
包絡情報をパタンマツチング手段により伝送する
手段と残差波形の高域周波数成分を分析側で再生
する手段とを備えた残差励振型ボコーダに関す
る。[Detailed Description of the Invention] [Technical Field] The present invention relates to a residual excitation type vocoder, particularly a means for transmitting spectral envelope information using a pattern matching means, and a means for reproducing high frequency components of a residual waveform on an analysis side. The present invention relates to a residual excitation type vocoder equipped with a residual excitation type vocoder.

〔従来技術〕入力音声信号をLPC（Linear Prediction
Coefficient、線形予測係数）分析して得られる
LPC係数とともに波形情報としての残差波形の
うち予め設定する領域の低域周波数成分に関する
情報を分析側から合成側に伝送し、合成側では入
力した低域周波数成分を利用して復元して高域周
波数成分とともに残差波形を合成し、これと
LPC係数とによつて合成フイルタを動作せしめ
て高品質の入力音声信号の再生を行なう残差励振
型ボコーダはRELP（Residual Exciting Linear
Prediction vocoder）の略称によつてもよく知ら
れている。[Prior art] The input audio signal is processed by LPC (Linear Prediction).
Coefficient, linear prediction coefficient) obtained by analyzing
Along with the LPC coefficients, information regarding the low frequency components of a preset region of the residual waveform as waveform information is transmitted from the analysis side to the synthesis side, and the synthesis side uses the input low frequency components to restore and enhance the The residual waveform is synthesized with the frequency component, and
RELP (Residual Exciting Linear) is a residual excitation type vocoder that operates a synthesis filter using LPC coefficients and reproduces high-quality input audio signals.
It is also known by the abbreviation ``Prediction Vocoder''.

このRELPは、LPC係数とともに、モデル化し
た残差波形を分析側から合成側に送出する従来の
一般的なボコーダに比して、残差波形の１部の低
域周波数成分をモデル化を介することなく分析側
から合成側に送出するという形式で波形の一部伝
送を行なつている点で基本的に再生音質が優れて
り、4.8Kb／ｓ（キロビツト／秒）程度の従来タ
イプのボコーダの伝送ビツトレート帯域でも高品
質の再生音が得られるCODEC（COder
DECorder、コーデツク）として近時多用されつ
つある。 Compared to a conventional general vocoder, which sends a modeled residual waveform from the analysis side to the synthesis side together with the LPC coefficients, RELP sends a part of the low frequency components of the residual waveform through modeling. Basically, the playback quality is excellent in that part of the waveform is transmitted from the analysis side to the synthesis side without any noise, and it is comparable to conventional vocoders with a speed of about 4.8Kb/s (kilobits per second). CODEC (COder
It has recently become widely used as a DECorder (codec).

しかしながら、従来のこの種のRELPにあつて
は、分析側から合成側に供給する残差波形情報は
利用可能なデータビツトレートの関係から予め設
定する周波数領域の低域周波数成分に限定されて
おり、高域周波数成分はこの低域周波数成分を合
成側で非線形回路に印加してその高調波成分とし
て得られるものを位相情報は無視して単純加算す
るのが基本的手法となつており、位相の再現性が
得られないため再生音声の波形が入力音声信号と
異なつたいわゆるRELP音と言われる再生歪が発
生する欠点があることはよく知られている。 However, in conventional RELP of this type, the residual waveform information supplied from the analysis side to the synthesis side is limited to low frequency components in a preset frequency domain due to the available data bit rate. The basic method for high frequency components is to apply this low frequency component to a nonlinear circuit on the synthesis side and simply add the resulting harmonic components, ignoring phase information. It is well known that this method has the drawback that reproduction distortion, so-called RELP sound, occurs in which the waveform of the reproduced audio differs from the input audio signal because the reproducibility of the input audio signal cannot be obtained.

このようなRELP音を改善するために分析側に
高域周波数再生手段を設けた残差励振型ボコーダ
が近時利用されつつある。これは合成側で残差波
形を再生するために使用されるトランスバーサル
型の高域再生フイルタのタツプ係数を分析側で推
定したうえ合成側に供給するものであり、合成側
で使用する高域再生フイルタと同じ構成のトラン
スバーサルフイルタを対象とし、低域周波数と合
成すべき高域周波数に対する位相条件を含むイン
パルスレスポンス特性の設定すなわちタツプ係数
の設定を実施したうえ、これを合成側に供給する
ものである。 In order to improve such RELP sound, residual excitation type vocoders equipped with high frequency reproduction means on the analysis side have recently been used. This is because the analysis side estimates the tap coefficient of the transversal high-frequency reproduction filter used to reproduce the residual waveform on the synthesis side, and then supplies it to the synthesis side. For a transversal filter with the same configuration as the reproduction filter, set the impulse response characteristics including the phase conditions for the low frequency and high frequency to be synthesized, that is, set the tap coefficient, and then supply this to the synthesis side. It is something.

しかしながら、従来のこの種の残差励振型ボコ
ーダにおいては、合成側で発生する高周波成分は
分析側から供給を受けた低域周波数成分を非線形
回路に印加することによつて生成される高調波分
を利用しているため所望の高域周波数再生が得難
いことがしばしば発生するという欠点がある。 However, in conventional residual excitation type vocoders of this type, the high frequency components generated on the synthesis side are harmonic components generated by applying the low frequency components supplied from the analysis side to a nonlinear circuit. Since this method utilizes the same frequency range, it is often difficult to obtain the desired high frequency reproduction.

又、従来のこの種の残差励振型ボコーダにおい
てはLPC係数を直接的に量子化して伝送してい
るために、残差波形に十分な情報量を割当てるこ
とが困難となり、高音質の再生音を得難いことが
しばしば発生するという欠点がある。 In addition, in conventional residual excitation vocoders of this type, the LPC coefficients are directly quantized and transmitted, making it difficult to allocate a sufficient amount of information to the residual waveform, making it difficult to reproduce high-quality sound. The disadvantage is that it is often difficult to obtain.

[Purpose of the invention]

本発明の目的は上述した欠点を除去しスペクト
ル包絡情報をパタンマツチング手段により伝送す
る手段を有し、且つ、高域周波数成分再生手段を
分析側に備えた残差励振型ボコーダにおいて、残
差波形低域周波数成分のダウンサンプル値を零レ
べルサンプルの内挿を介して基準サンプリング変
換標本系列としたサンプルを利用しかつ非線形回
路を利用することなく高域再生フイルタのタツプ
係数を推定する手段を備えて入力音声信号の分析
ならびに合成を図ることにより、大幅なRELP音
の改善が安定かつ確実に図れる残差励振型ボコー
ダを提供することにある。 An object of the present invention is to provide a residual excitation type vocoder which eliminates the above-mentioned drawbacks and has a means for transmitting spectral envelope information by a pattern matching means, and is equipped with a high frequency component reproducing means on the analysis side. Estimating the tap coefficient of a high-frequency reproduction filter using a sample sequence in which a down-sampled value of a waveform low-frequency component is made into a reference sampling conversion sample sequence through interpolation of a zero-level sample and without using a nonlinear circuit. It is an object of the present invention to provide a residual excitation type vocoder that can stably and reliably improve RELP sound to a large extent by analyzing and synthesizing input audio signals.

[Structure of the invention]

本発明によれば、入力音声信号のスペクトル包
絡特性を予め準備した標準パタンと照合し、標準
パタンに付されたラベルをスペクトル包絡特性を
示す情報として分析側から合成側へ伝送する第１
の手段と、予め設定する基準サンプリング周波数
に対応して入力音声信号の残差波形の基準サンプ
ル系列を求める第２の手段と、前記基準サンプリ
ング周波数を予め設定する割合で減周したダウン
サンプリング周波数に対応して残差波形の低域周
波数成分のダウンサンプル系列を求める第３の手
段と、前記基準サンプリング周波数のタイミング
で前記ダウンサンプル系列に零レベルの内挿点を
設定しつつ形成する前記残差波形の前記低域周波
数成分の基準サンプリング変換標本系列と前記残
差波形の前記基準サンプル系列とにもとづきかつ
非線形回路を介することなく合成側に備えるべき
トランスバーサル型の高域再生フイルタの係数を
推定した合成側へ伝送される前記係数を出力する
第４の手段を備えて成ることを特徴とする残差励
振型ボコーダが得られる。 According to the present invention, the first step compares the spectral envelope characteristic of an input audio signal with a standard pattern prepared in advance, and transmits the label attached to the standard pattern from the analysis side to the synthesis side as information indicating the spectral envelope characteristic.
means for determining a reference sample sequence of a residual waveform of an input audio signal corresponding to a preset reference sampling frequency; and a downsampling frequency obtained by reducing the reference sampling frequency at a preset rate. Correspondingly, third means for obtaining a down-sampled sequence of low frequency components of the residual waveform, and the residual is formed while setting a zero-level interpolation point in the down-sampled sequence at the timing of the reference sampling frequency. Estimating the coefficients of a transversal high frequency reproduction filter to be provided on the synthesis side based on the reference sampling conversion sample sequence of the low frequency component of the waveform and the reference sample sequence of the residual waveform and without going through a nonlinear circuit. A residual excitation type vocoder is obtained, characterized in that it comprises fourth means for outputting the coefficients to be transmitted to the combining side.

また本発明の他の特徴によれば、入力音声信号
のスペクトル包絡特性を予め準備した標準パタン
と照合し、複数のパタン候補を予備的に選択し、
各パタン候補についてそれぞれ合成側に備えるべ
きトランスバーサル型の高域再生フイルタの係数
を分析側で推定する第１の手段と、更に、各パタ
ン候補に対応する入力音声信号と合成音声信号と
のＳ／Ｎ比を求める第２の手段と、前記第２の手
段の出力を受け最もＳ／Ｎ比のよいパタン候補に
付されたラベルをスペクトル包絡特性を示す情報
として分析側から合成側へ伝送する第３の手段
と、予め設定する基準サンプリング周波数に対応
して入力音声信号の残差波形の基準サンプル系列
を求める第４の手段と、前記基準サンプリング周
波数を予め設定する割合で減周したダウンサンプ
リング周波数に対応して前記残差波形の低域周波
数成分のダウンサンプル系列を求める第５の手段
と、前記基準サンプリング周波数のタイミングで
前記ダウンサンプル系列に零レベルの内挿点を設
定しつつ形成する前記残差波形の低域周波数成分
の基準サンプリング変換標本系列と前記残差波形
の前記基準サンプル系列とにもとづきかつ非線形
回路を介することなく合成側に備えるべきトラン
スバーサル型の高域再生フイルタの係数を推定し
合成側へ伝送される前記係数を出力する第６の手
段を備えて成ることを特徴とする残差励振型ボコ
ーダが得られる。 According to another feature of the present invention, the spectral envelope characteristic of the input audio signal is compared with a standard pattern prepared in advance, and a plurality of pattern candidates are preliminarily selected;
A first means for estimating coefficients of a transversal high-frequency reproduction filter to be provided on the synthesis side for each pattern candidate on the analysis side; a second means for determining the /N ratio; and a label attached to a pattern candidate with the best S/N ratio upon receiving the output of the second means is transmitted from the analysis side to the synthesis side as information indicating the spectral envelope characteristic. a third means, a fourth means for determining a reference sample sequence of a residual waveform of an input audio signal corresponding to a preset reference sampling frequency, and downsampling in which the reference sampling frequency is reduced at a preset rate. a fifth means for obtaining a down-sampled sequence of low-frequency components of the residual waveform corresponding to a frequency; and forming a down-sampled sequence while setting a zero-level interpolation point in the down-sampling sequence at the timing of the reference sampling frequency. Coefficients of a transversal high-frequency reproduction filter to be provided on the synthesis side based on the reference sampling conversion sample sequence of the low frequency component of the residual waveform and the reference sample sequence of the residual waveform and without going through a nonlinear circuit. There is obtained a residual excitation type vocoder characterized in that it comprises a sixth means for estimating the coefficients and outputting the coefficients to be transmitted to the synthesis side.

〔Example〕

次に図面を参照して本発明を詳細に説明する。
第１図Ａは本発明による残差励振型ボコーダの第
一の実施例の分析側の構成を示すブロツク図、第
１図Ｂは合成側の構成を示すブロツク図である。 Next, the present invention will be explained in detail with reference to the drawings.
FIG. 1A is a block diagram showing the structure of the analysis side of a first embodiment of the residual excitation type vocoder according to the present invention, and FIG. 1B is a block diagram showing the structure of the synthesis side.

第１の実施例は入力音声信号のスペクトル包絡
情報をパタンマツチング手段により伝送するに当
り、入力音声信号を分析して得られるLPC係数
と最もスペクトル距離の小さな標準パタンを選択
し、前記パタンに対応するラベルを伝送する形式
のものである。 In the first embodiment, when the spectral envelope information of an input audio signal is transmitted by a pattern matching means, the LPC coefficient obtained by analyzing the input audio signal and a standard pattern having the smallest spectral distance are selected, and the standard pattern is applied to the pattern. This is a format that transmits the corresponding label.

第１図Ａに示す分析側はＡ／Ｄコンバータ１、
LPC分析器２、量子化／復号化器３、LPC逆フ
イルタ４、LPF（Low Pass Filter、ローパスル
イルタ）５、ダウンサンプラ６，量子化／復号化
器７、フイルタ係数推定器(1)８、量子化器９、電
力算出器１０、量子化器１１および多重化器１２
を備えて構成され、また第１図Ｂに示す合成側は
多重分離・復号化器１３、高域再生フイルタ１
４、乗算器１５、LPC合成フイルタ１６および
Ｄ／Ａコンバータ１７を備えて構成される。 The analysis side shown in FIG. 1A is an A/D converter 1,
LPC analyzer 2, quantizer/decoder 3, LPC inverse filter 4, LPF (Low Pass Filter) 5, down sampler 6, quantizer/decoder 7, filter coefficient estimator (1) 8 , quantizer 9, power calculator 10, quantizer 11 and multiplexer 12
The synthesis side shown in FIG.
4, a multiplier 15, an LPC synthesis filter 16, and a D/A converter 17.

入力端子１０１から入力した音声信号は、Ａ／
Ｄコンバータ１に供給され予め設定する高域周波
数、本実施例では3.4KHz以上の成分をカツトす
る低域フイルタリングを実施したのち、予め設定
する基準サンプリング周波数8KHzでサンプリン
グしたうえ所定のビツト数のデジタル量に変換し
量子化入力音声データとしてLPC分析器２およ
びLPC逆フイルタ４に送出される。8KHzのサン
プリング周波数は入力音声信号の最高周波数を
3.4KHzと設定した条件のもとでナイキストレー
トを勘案して決定されたものである。LPC分析
器２は、このようにして入力する量子化入力音声
データに対し予め設定する窓関数を乗算するウイ
ンドウ処理を基本フレーム周期ごとに次次に実施
したうえ基本フレームごとに計測する自己相関係
数を利用しよく知られるAuto Correlation（自己
相関）法による手法で所定の次数のLPC係数を
抽出するLPC分析を行なう。 The audio signal input from the input terminal 101 is
After performing low-pass filtering to cut out components of a preset high frequency, in this example, 3.4KHz or higher, which is supplied to the D converter 1, sampling is performed at a preset reference sampling frequency of 8KHz, and a predetermined number of bits is sampled. It is converted into a digital quantity and sent as quantized input audio data to an LPC analyzer 2 and an LPC inverse filter 4. The 8KHz sampling frequency matches the highest frequency of the input audio signal.
This was determined by considering the Nyquist rate under the conditions set at 3.4KHz. The LPC analyzer 2 sequentially performs window processing in which the input quantized input audio data is multiplied by a preset window function for each basic frame period, and then measures the autocorrelation for each basic frame. LPC analysis is performed to extract LPC coefficients of a predetermined order using the well-known Auto Correlation method.

このよにしてLPC分析器２から出力される
LPC係数は量子化／復号化器３によつて量子化
され多重化器１２に供給されるとともに、またこ
の量子化LPC係数データをいつたん復号化した
うえLPC逆フイルタ４に供給する。量子化／復
号化器３はパタンマツチング処理によりLPC係
数を分析し、LPC係数データをパタンのラベル
という形式で符号化し、更にLPC係数に復号化
するものである。第１０図は量子化／復号化器３
の構成の一例を示すブロツク図である。以下、第
１０図を用いて量子化／復号化器３を詳しく説明
する。 In this way, the output from LPC analyzer 2 is
The LPC coefficients are quantized by the quantizer/decoder 3 and supplied to the multiplexer 12, and this quantized LPC coefficient data is once decoded and supplied to the LPC inverse filter 4. The quantizer/decoder 3 analyzes the LPC coefficients by pattern matching processing, encodes the LPC coefficient data in the form of a pattern label, and further decodes it into LPC coefficients. Figure 10 shows the quantizer/decoder 3
FIG. 2 is a block diagram showing an example of the configuration. The quantizer/decoder 3 will be explained in detail below using FIG.

LPC分析器２から出力されるLPC係数はパタ
ン照合プロセツサ１００１へ供給される。パタン
照合プロセツサ１００１は入力されたLPC係数
と最も空間ベクトル距離が近い標準パタンを標準
パタンメモリ１００２に記憶されている複数の標
準パタン（本実施例では4096＝2¹²）から選択す
る。（以後、この選択処理を「マツチング」と云
う）。 The LPC coefficients output from the LPC analyzer 2 are supplied to the pattern matching processor 1001. The pattern matching processor 1001 selects a standard pattern having the closest spatial vector distance to the input LPC coefficient from a plurality of standard patterns (4096=2 ¹² in this embodiment) stored in the standard pattern memory 1002. (Hereinafter, this selection process will be referred to as "matching").

次にマツチングの原理を簡単に説明する。この
マツチングはマツチング特性が全般的にすぐれて
いるもの、すなわち全周波数範囲にわたつて空間
ベクトル距離、いわゆるスペクトル距離が接近し
ているものを標準パタンから選択する。 Next, the principle of matching will be briefly explained. For this matching, a standard pattern is selected that has generally excellent matching characteristics, that is, one in which the spatial vector distance, so-called spectral distance, is close over the entire frequency range.

Ｎ次のLPC係数のそれぞれはＮ次のパラメー
タ空間におけるひとつの空間ベクトルを表わすも
のと考えることができ、従つてLPC係数による
周波数間の空間ベクトルを表わすスペクトル距離
は二つのLSP係数の近似度を表わすマツチング尺
度として利用されることはよく知られている。 Each of the Nth-order LPC coefficients can be considered to represent one spatial vector in the Nth-order parameter space, and therefore, the spectral distance representing the space vector between frequencies by the LPC coefficients can be expressed as the degree of approximation between two LSP coefficients. It is well known that it is used as a matching measure.

上述した標準パタンは予め音声資料をLPC分
析して得られる標準的LPC係数の分布パタンで
あり、予め設定する種類、本実施例では2¹²種類
を用意し、またスペクトル距離は次の(1)式に示す
Dijによつて基本的に示される。 The above-mentioned standard pattern is a distribution pattern of standard LPC coefficients obtained by LPC analysis of audio materials in advance, and there are 2 ^{to 12} preset types in this example, and the spectral distance is as follows (1) shown in the formula
Basically indicated by Dij.

Dij＝1π／π₀｛Si（ｗ）−Sj（ｗ）｝²dw ……(1) 言(1)式においてｉ，ｊはそれぞれLPC分析にお
ける処理単位区間としてのフレーム番号、Si
（ｗ），Sj（ｗ）はフレームｉ，ｊの対数スペクト
ルである。(1)式は通常次の(2)式の近似式に変換利
用される。 Dij=1π/π ₀ {Si(w)−Sj(w)} ² dw...(1) In equation (1), i and j are the frame numbers as processing unit intervals in LPC analysis, and Si
(w) and Sj(w) are logarithmic spectra of frames i and j. Equation (1) is usually converted to the following approximate equation (2).

Dij_N 〓^k=1 Wk｛Pk(i)−Pk^(j)｝² ……(2) (2)式においてPk(i)，Pk^(j)はフレームｉおよび
ｊにおけるＮ次のLPC係数、WkはＮ次のLPCス
ペクトル感度である。上述したＮは全極型の
LPCデジタルフイルタの次数、本実施例にあつ
ては10と対応し、そのLPC係数α₁，α₂，…α₁₀を
示す。さらにＮ次のスペクトル感度WkはＮ次、
本実施例の場合は10次のLPC係数の微少変化に
よつて発生するスペクトル変化の程度を示す。 Dij _N 〓 ^k=1 Wk{Pk(i)−Pk ^(j) } ² ...(2) In equation (2), Pk(i) and Pk ^(j) are the Nth-order LPC coefficients in frames i and j, Wk is the Nth-order LPC spectral sensitivity. The above N is an all-pole type
The order of the LPC digital filter corresponds to 10 in this embodiment, and its LPC coefficients α ₁ , α ₂ , . . . α ₁₀ are shown. Furthermore, the Nth-order spectral sensitivity Wk is Nth-order,
In the case of this example, the degree of spectral change caused by a minute change in the 10th-order LPC coefficient is shown.

再び第１０図を用いて説明する。パタン照合プ
ロセツサ１００１はLPC係数が入力されると、
最小距離レジスタ１００４を経験的に知られてい
る前記(2)式に示すDijの最大値よりも大きな値Do
に初期設定する。更にラベルレジスタ１００５を
例えば“4096”に初期設定する。次にパタン照合
プロセツサ１００１は標準パタンに付されたラベ
ル“０”に相等するスタート番地０〓10＝０（０
〓10の10はLPCの次数10に相等する）を標準パ
タンメモリ１００２へ、LPC係数α₁に対応する
スペクトル感度W₁を指定する番地０をスペクト
ル感度メモリ１００３へ出力し、スペクトル距離
Ｄを“０”に設定する。次にパタン照合プロセツ
サ１００１は標準パタンメモリ１００２より出力
されるラベル“０”の標準パタンのα₁ ^(o)（前記(2)
式に於いてP₁ ⁽ⁱ⁾に相等する）とスペクトル感度メ
モリ１００３より出力されるスペクトル感度W₁
と、LPC分析器２より供給されたLPC係数の第
１番目の係数α₁ ^(I)（前記(2)式に於いてP₁ ⁽ⁱ⁾に相等
する）とから、Ｄ＝W₁（α₁ ⁽⁰⁾−α₁ ^(I)）² を算出する。次にパタン照合プロセツサ１００１
は標準パタンメモリ１００２に出力している番地
データをインクリメントし、即ち０＋１＝１と
し、更にスペクトル感度メモリ１００３へ出力し
ている番地データをインクリメントし、即ち０＋
１＝１とし、α₂ ⁽⁰⁾，W₂を入力する。パタン照合
プロセツサ１００１は前記求めたＤにW₂（α₂ ⁽⁰⁾−
α₂ ^(I)）²で算出される量を加算しＤ＝W₁（α₁ ⁽⁰⁾−α₁ ^(I)）²＋W₂（α₂ ⁽⁰⁾−α₂ ^(I)）² ＝₂ 〓^K=1 Wk（α_k ⁽⁰⁾−α_k ^(I)）² を求める。以下同様にパタン照合プロセツサ１０
０１は標準パタンメモリ１００２へ番地データを
２，３，４，…10と次々に、スペクトル感度メモ
リ１００３へ番地データを２，３，４，…と次々
に与えＤ＝₁₀ 〓^k=1 Wk（α_k ⁽⁰⁾−α_k ^(I)）² …… (3) を算出する。 This will be explained using FIG. 10 again. When the pattern matching processor 1001 receives the LPC coefficients,
The minimum distance register 1004 is set to a value Do larger than the empirically known maximum value of Dij shown in equation (2) above.
Initialize to . Furthermore, the label register 1005 is initialized to, for example, "4096". Next, the pattern matching processor 1001 selects the start address 0〓10=0(0
〓10 of 10 is equivalent to the order 10 of LPC) is output to the standard pattern memory 1002, the address 0 specifying the spectral sensitivity W ₁ corresponding to the LPC coefficient α ₁ is output to the spectral sensitivity memory 1003, and the spectral distance D is output as “ Set to 0”. Next, the pattern matching processor 1001 calculates α ₁ ^(o) of the standard pattern with the label “0” output from the standard pattern memory 1002 ((2)
P ₁ ⁽ⁱ⁾ in the equation) and the spectral sensitivity W ₁ output from the spectral sensitivity memory 1003
and the first coefficient α ₁ ^(I) of the LPC coefficients supplied from the LPC analyzer 2 (equivalent to P ₁ ⁽ⁱ⁾ in equation (2) above), D=W ₁ (α ₁ ⁽⁰⁾ −α ₁ ^(I) ) ² is calculated. Next, the pattern matching processor 1001
Increments the address data output to the standard pattern memory 1002, that is, 0+1=1, and further increments the address data output to the spectral sensitivity memory 1003, that is, 0+
1=1, and input α ₂ ⁽⁰⁾ and W ₂ . The pattern matching processor 1001 applies W ₂ (α ₂ ⁽⁰⁾ −
Add the quantities calculated in α ₂ ^(I) ) ² and get D=W ₁ (α ₁ ⁽⁰⁾ − _{α 1} ^(I) ) ² + W ₂ (α ₂ ⁽⁰⁾ − α ₂ ^(I) ) ² = Find ₂ 〓 ^K=1 Wk (α _k ⁽⁰⁾ − α _k ^(I) ) ² . Similarly, the pattern matching processor 10
01 gives address data 2, 3, 4, ... 10 one after another to the standard pattern memory 1002, and gives address data 2, 3, 4, ... one after another to the spectral sensitivity memory 1003. D= ₁₀ 〓 ^k=1 Wk ( α _k ⁽⁰⁾ −α _k ^(I) ) ² ...... (3) is calculated.

(3)式で与えられるＤは標準パタンα⁽⁰⁾と入力
LPC係数データα^(I)との距離D_0Iである。 D given by equation (3) is input as the standard pattern α ⁽⁰⁾
This is the distance D _0I from the LPC coefficient data α ^(I) .

次にパタン照合プロセツサ１００１は最小距離
レジスタ１００４に記憶されているデータD₀と
前記D_0Iとを比較し、D₀＞D_0Iの場合には、最小距
離レジスタ１００４にD₀としてD_0Iを再設定し、
更にラベルレジスタ１００５にラベル”０”を示
す”０”を再設定する。D₀＞D_0Iの場合にはパタ
ン照合プロセツサ１００１は最小距離レジスタ１
００４とラベルレジスタ１００５とに対する再設
定は実施しない。D₀＝D_0Iの場合は、前記再設定
は実施しても、しなくともよい。 Next, the pattern matching processor 1001 compares the data D ₀ stored in the minimum distance register 1004 with the D _0I , and if D ₀ > D _0I , rewrites D _0I as D ₀ in the minimum distance register 1004. Set,
Further, the label register 1005 is reset to "0" indicating the label "0". If D ₀ > D _0I , the pattern matching processor 1001 registers the minimum distance register 1.
004 and the label register 1005 are not reset. In the case of D ₀ =D _0I , the resetting may or may not be performed.

次にパタン照合プロセツサ１００１は標準パタ
ンに付されたラベル”１”に相等するスタート番
地１〓10＝10を標準パタンメモリ１００２へ、
LPC係数α₁に対応するスペクトル感度W₁を指定
する番地”０”をスペクトル感度メモリ１００３
へ出力し、スペクトル距離Ｄを”０”に設定す
る。更にパタン照合プロセツサ１００１はラベ
ル”１”の標準パタンのα₁ ⁽¹⁾と前記のW₁と前記
のα₁ ^(I)とからＤ＝W₁（α₁ ⁽¹⁾−α₁ ^(I)）²を算出する
。次
にパタン照合プロセツサ１００１は標準パタンメ
モリ１００２へ供給する番地データを11，12，…
20と次々に与え、スペクトル感度メモリ１００３
へ供給する番地データを１，２，…10と次々と与
えてＤ＝₁₀ 〓^k=1 Wk（α_k ⁽¹⁾−α_k ^(I)）² ……(4) を算出する。(4)式で与えられるＤは標準パタン
α⁽¹⁾と入力LPC係数データα^(I)との距離D_1Iである。
次にパタン照合プロセツサ９０１は最小距離レジ
スタ９０４に記憶されているデータD₀と前記D_1I
とを比較し適宜、最小距離レジスタ１００４とラ
ベルレジスタ１００５とを書替える。 Next, the pattern matching processor 1001 stores the start address 1〓10=10, which is equivalent to the label "1" attached to the standard pattern, into the standard pattern memory 1002.
The address “0” specifying the spectral sensitivity W ₁ corresponding to the LPC coefficient α ₁ is stored in the spectral sensitivity memory 1003.
and set the spectral distance D to "0". Furthermore, the pattern matching processor 1001 calculates D= _{W 1} ₍ _α ₁ ₍ ¹ ⁾ ⁻ α ₁ ^(I) ) Calculate ² . Next, the pattern matching processor 1001 supplies address data 11, 12, . . . to the standard pattern memory 1002.
20 and given one after another, the spectral sensitivity memory 1003
The address data to be supplied to 1, 2, . . . 10 is given one after another, and D= ₁₀ 〓 ^k=1 Wk (α _k ⁽¹⁾ − α _k ^(I) ) ² …… (4) is calculated. D given by equation (4) is the distance D _1I between the standard pattern α ⁽¹⁾ and the input LPC coefficient data α ^(I) .
Next, the pattern matching processor 901 uses the data D ₀ stored in the minimum distance register 904 and the data D _1I
The minimum distance register 1004 and label register 1005 are rewritten as appropriate.

以下、次々にラベル”２”，”３”，…，”4095”
の標準パタンとの照合を実施する。 Below, the labels "2", "3", ..., "4095" one after another
Perform comparison with the standard pattern.

最後にパタン照合プロセツサ１００１はラベル
レジスタの内容を符号化データとして多重化器１
２へ出力し、ラベルレジスタの内容に対応する標
準パタンデータを復号化データとしてLPC逆フ
イルタ４へ出力する。 Finally, the pattern matching processor 1001 sends the contents of the label register to the multiplexer 1 as encoded data.
The standard pattern data corresponding to the contents of the label register is output to the LPC inverse filter 4 as decoded data.

再び第１図に戻つて説明する。 The explanation will be given by returning to FIG. 1 again.

LPC逆フイルタ４は、第１図Ｂに示す合成側
のLPC合成フイルタ１６とはインパルス応答特
性が逆特性のフイルタであり、かつLPC合成フ
イルタ１６と量子化誤差による影響を同一のもの
とするため量子化／復号化器３でいつたん復号化
したLPC係数データを受け、これによりＡ／Ｄ
コンバータ１の出力から受ける量子化入力音声デ
ータの残差波形に関するデータのみを抽出し、こ
れをLPF５、フイルタ係数推定器(1)８、電力算
出器１０に供給する。 The LPC inverse filter 4 is a filter whose impulse response characteristic is opposite to that of the LPC synthesis filter 16 on the synthesis side shown in FIG. The quantizer/decoder 3 receives the decoded LPC coefficient data, and then the A/D
Only data related to the residual waveform of the quantized input audio data received from the output of the converter 1 is extracted and supplied to the LPF 5, filter coefficient estimator (1) 8, and power calculator 10.

LPF５はボコーダのデータビツトレートその
他の条件を勘案して設定する高域遮断特性を有す
るローパスフイルタであり、LPC逆フイルタ４
から受ける残差波形のうち、1KHzを高域遮断周
波数として残差波形のうち1KHz以下の低域周波
数成分をダウンサンプラ６に供給する。 LPF5 is a low-pass filter with high-frequency cutoff characteristics that is set taking into consideration the data bit rate of the vocoder and other conditions.
Among the residual waveforms received from , 1 KHz is set as a high cut-off frequency, and low frequency components of 1 KHz or less of the residual waveforms are supplied to the down sampler 6 .

ダウンサンプラ６は、入力した1KHz以下の低
域周波数成分を基準サンプリング周波数を予め設
定する割合、本実施例では1/4にダウンしたダウ
ンサンプリング周波数2KHzによつてサンプリン
グしてこれを量子化／復号化器７に送出する。 The down sampler 6 samples the input low frequency components of 1KHz or less at a preset rate of the reference sampling frequency, in this example, the downsampling frequency is 2KHz, which is down to 1/4, and quantizes/decodes this. It is sent to the converter 7.

量子化／復号化器７は、こうして入力した残差
波形低域周波数成分を量子化してこれを多重化器
１２に送出するとともに、これを復号化したデー
タをフイルタ係数推定器(1)８に供給する。このフ
イルタ係数推定器(1)８に供給するダウンサンプリ
ングデータを量子化／復号化器７による復号化処
理ずみのデータとするのは、合成側における高域
再生フイルタ１４における量子化誤差の影響とフ
イルタ係数推定器(1)８における量子化誤差の影響
とをほぼ同一の状態とするためである。 The quantizer/decoder 7 quantizes the input residual waveform low frequency component and sends it to the multiplexer 12, and also sends the decoded data to the filter coefficient estimator (1) 8. supply The reason why the downsampled data supplied to the filter coefficient estimator (1) 8 is the data that has been decoded by the quantizer/decoder 7 is to avoid the influence of quantization errors in the high frequency reproduction filter 14 on the synthesis side. This is to make the influence of the quantization error on the filter coefficient estimator (1) 8 almost the same.

なお、量子化／復号化器７における残差波形低
域周波数成分の量子化、復号化処理は本実施例で
は後述する電力算出器１０より供給される電力デ
ータにより、正規化して実行される。正規化処理
により残差波形低域周波数成分の量子化を効率よ
く行なつている。 Note that, in this embodiment, the quantization and decoding processing of the residual waveform low frequency component in the quantization/decoder 7 is normalized and executed using power data supplied from a power calculator 10, which will be described later. The normalization process efficiently quantizes the low frequency components of the residual waveform.

フイルタ係数推定器(1)８はLPC逆フイルタ４
からは8KHzの基準サンプリング周波数による残
差波形のサンプル系列を入力し、また量子化／復
号化器７からは2KHzのダウンサンプリング周波
数による残差波形低域周波数成分のダウンサンプ
ル系列を入力し次のようにして合成側に備えるべ
き高域再生フイルタのタツプ係数の推定を実施す
る。 Filter coefficient estimator (1) 8 is LPC inverse filter 4
A sample sequence of the residual waveform with a reference sampling frequency of 8KHz is input from the quantizer/decoder 7, and a downsample sequence of the low frequency component of the residual waveform with a downsampling frequency of 2KHz is input from the quantizer/decoder 7. In this way, the tap coefficient of the high frequency reproduction filter to be provided on the synthesis side is estimated.

第２図ａは第１図Ａ，Ｂに示す本発明第一の実
施例において入力音声信号の残差波形を基準サン
プリング周波数でサンプリングした残差波形の一
例を示す残差波形基準サンプル標本図、第２図ｂ
は残差波形の低域周波数成分をダウンサンプリン
グ周波数でサンプリングしたダウンサンプルの一
例を示すダウンサンプル標本図、第２図ｃは第２
図ｃに示す残差波形低域周波数成分のダウンサン
プル標本に対し基準サンプリング周波数に対応す
る基準サンプリング変換を実施した残差波形低域
周波数成分基準サンプリング変換標本系列であ
る。以下に第２図ａ，ｂおよびｃを参照しつつ第
１図Ａ，Ｂの実施例について説明する。 FIG. 2a is a residual waveform reference sample diagram showing an example of the residual waveform obtained by sampling the residual waveform of the input audio signal at the reference sampling frequency in the first embodiment of the present invention shown in FIGS. 1A and B; Figure 2b
Figure 2c is a downsample sample diagram showing an example of a downsample in which the low frequency component of the residual waveform is sampled at the downsampling frequency.
This is a residual waveform low frequency component reference sampling conversion sample sequence obtained by performing reference sampling conversion corresponding to the reference sampling frequency on the down sample sample of the residual waveform low frequency component shown in FIG. The embodiments shown in FIGS. 1A and 1B will be described below with reference to FIGS. 2A, 2B, and 2C.

第２図ａに示す残差波形標本列は、LPC逆フ
イルタ４からの8KHzサンプリング出力であり、
第２図ｂに示す残差波形低域周波数成分ダウンサ
ンプル標本は量子化／復号化器７による2KHzサ
ンプリング出力で、これをさらに8KHzの基準サ
ンプリング周波数による標本に変換したのが第２
図ｃに示すa₁，a₂，a₃，a₄，a₅，a₆およびb₁〜b₅
の標本群である。これらの標本群のうちa₁〜a₆は
第２図ｂに示す標本に対応するものであり、また
b₁〜b₅は標本a₁〜a₆間に間に内挿され全体として
8KHzの基準サンプリング変換を図るために等間
隔に配列された零レベルの標本点であり、このよ
うにして恰も8KHzの基準サンプリング周波数に
よる標本と同様なイメージで残差波形低域周波数
成分のダウンサンプル標本の基準サンプリング変
換が図れる。 The residual waveform sample sequence shown in FIG. 2a is the 8KHz sampling output from the LPC inverse filter 4,
The residual waveform low frequency component down-sampled sample shown in Figure 2b is the 2KHz sampling output from the quantizer/decoder 7, which is further converted into a sample with a reference sampling frequency of 8KHz as the second sample.
a ₁ , a ₂ , a ₃ , a ₄ , a ₅ , a ₆ and b ₁ to b ₅ shown in figure c
This is the sample group. Among these sample groups, a ₁ to a ₆ correspond to the samples shown in Figure 2 b, and
b ₁ to _{b 5} are interpolated between samples a ₁ to _{a 6} as a whole
These are zero-level sampling points arranged at equal intervals in order to convert the standard sampling frequency to 8KHz, and in this way, the low-frequency components of the residual waveform can be down-sampled with an image similar to that of a sample using the standard sampling frequency of 8KHz. Standard sampling conversion of samples can be achieved.

フイルタ係数推定器(1)８はこのようにして2K
Hzのダウンサンプル周波数によるダウンサンプル
標本化列を8KHzの基準サンプリング周波数によ
る標本化イメージで系列化し、これを合成側に備
えるべき高域再生フイルタとしてのトランスバー
サルフイルタと同様な内蔵フイルタの入力として
供給する。このトランスバーサルフイルタのタツ
プ係数は音声資料等にもとづいて予め設定する初
期値から次のようにして学習同定法あるいはＡ−
ｂ−Ｓ（Analysis−by−Synthesis）法等の手法
にもとづき分析フレームごとに制御されつつ決定
される。 The filter coefficient estimator (1)8 is thus 2K
A downsampling sampling sequence with a downsampling frequency of Hz is converted into a sequence of sampling images with a reference sampling frequency of 8KHz, and this is supplied as an input to a built-in filter similar to the transversal filter as a high-frequency reproduction filter that should be provided on the synthesis side. do. The tap coefficients of this transversal filter are determined using the learning identification method or the A-
It is controlled and determined for each analysis frame based on a method such as the b-S (Analysis-by-Synthesis) method.

すなわち、フイルタ係数推定器(1)８に第２図ｂ
に示す2KHzサンプリングの残差波形低域周波数
成分の標本化系列が入力されるこれは第２図ｂに
示す8KHzサンプリングイメージ標本化系列に変
換されて内蔵トランスバーサルフイルタに印加さ
れ、音声資料等にもとづいて予め設定するタツプ
係数初期値を付与したトランスバーサルフイルタ
出力と、別に入力する第２図ａに示す如き残差波
形の8KHzサンプリング標本化系列とを比較する
ことによつて得られる誤差出力を得る。本実施例
の場合はこの誤差出力の電力を算出しこの誤差電
力の大きさに対応してトランスバーサルフイルタ
の係数を分析フレームごとに制御しつつ誤差電力
の最小化が図れるようなフイルタ係数の推定を学
習同定法やＡ−ｂ−Ｓ法等の手法によつて実施す
る。 That is, in the filter coefficient estimator (1) 8,
The sampling sequence of the low frequency component of the residual waveform of 2KHz sampling shown in Figure 2b is input.This is converted into the 8KHz sampling image sampling sequence shown in Figure 2b, which is applied to the built-in transversal filter, and is used as audio material, etc. The error output obtained by comparing the output of the transversal filter to which the initial value of the tap coefficient set in advance is given and the 8KHz sampling sequence of the residual waveform as shown in Figure 2a, which is input separately, is obtained. obtain. In the case of this embodiment, the power of this error output is calculated, and the filter coefficients are estimated so that the error power can be minimized while controlling the transversal filter coefficients for each analysis frame in accordance with the magnitude of the error power. is carried out using methods such as the learning identification method and the A-b-S method.

第３図は第１図Ａに示すフイルタ係数推定器(1)
８の構成を詳細に示すブロツク図である。 Figure 3 shows the filter coefficient estimator (1) shown in Figure 1A.
FIG. 8 is a block diagram showing in detail the configuration of FIG.

フイルタ係数推定器(1)８は合成側に備えるべき
高域再生フイルタと基本的に同一構成のトランス
バーサルフイルタ８１、減算器８２および電力算
出器８３を備えて構成され、トランスバーサルフ
イルタ８１には量子化／復号化器７の出力のうち
第２図ｂに示す如き残差波形低域周波数成分ダウ
ンサンプルが入力され、また8KHzサンプリング
周波数の供給をうけつつ次のようにしてフイルタ
係数の推定を行なう。 The filter coefficient estimator (1) 8 includes a transversal filter 81, a subtracter 82, and a power calculator 83, which have basically the same configuration as the high-frequency reproduction filter to be provided on the synthesis side. Out of the output of the quantizer/decoder 7, the down-sampled residual waveform low frequency component as shown in Figure 2b is input, and while receiving the 8KHz sampling frequency, the filter coefficients are estimated as follows. Let's do it.

すなわち、トランスバーサルフイルタ８１は、
第２図ｂに示すような残差波形低域周波数成分ダ
ウンサンプルを入力するとこれにもとづいて第２
図ｃの如き残差波形低域周波数成分の基準サンプ
リング変換標本化系列に変換、初期設定タツプ係
数によるトランスバーサルフイルタ出力を出力ラ
イン８１１を介して減算器８２に送出、LPC逆
フイルタ４の出力する残差波形との差をとりこの
誤差出力を出力ライン８２１を介して電力算出器
８３に供給、その電力を算出する。 That is, the transversal filter 81 is
When the residual waveform low frequency component down sample as shown in Figure 2b is input, the second
Convert the residual waveform low frequency component into a reference sampling conversion sampling sequence as shown in Figure c, send the transversal filter output based on the initial setting tap coefficient to the subtracter 82 via the output line 811, and output it to the LPC inverse filter 4. The difference from the residual waveform is taken, and this error output is supplied to a power calculator 83 via an output line 821, and its power is calculated.

フイルタ係数推定器(1)８はこうして算出された
誤差電力にもとづいてトランスバーサルフイルタ
８１のタツプ係数の制御を実施しつつ誤差電力の
最小化が図れたときのタツプ係数推定フイルタ係
数として出力し量子化器９によつて量子化データ
としたのち多重化器１２に供給する。 The filter coefficient estimator (1) 8 controls the tap coefficient of the transversal filter 81 based on the error power calculated in this way and outputs the tap coefficient as the filter coefficient estimated when the error power can be minimized. After being converted into quantized data by the quantizer 9, it is supplied to the multiplexer 12.

このようにして残差波形の低域周波数成分のダ
ウンサンプルの8KHzサンプリングイメージ標本
化系列を図つたうえ学習同定法やＡ−ｂ−Ｓ法に
もとづき非線形回路を用いることなくフイルタの
係数の推定が可能となる。 In this way, the 8KHz sampling image sampling sequence of down-sampling of the low frequency component of the residual waveform is created, and the coefficients of the filter can be estimated without using a nonlinear circuit based on the learning identification method or the A-b-S method. It becomes possible.

電力算出器１０は残差波形を入力し、分析フレ
ームごとの短時間平均音声電力を算出しこれを量
子化器１１と量子化／復号化器７とに送出、所定
の量子化を施したうえ多重化器１２に供給する。 The power calculator 10 inputs the residual waveform, calculates the short-term average audio power for each analysis frame, sends it to the quantizer 11 and the quantizer/decoder 7, performs predetermined quantization, and then A multiplexer 12 is provided.

この短時間平均音声電力は、合成側でLPC合
成フイルタ１６によつて入力音声信号を再生する
場合、再生すべき音声のレベル設定のために利用
されるが、入力音声信号のレベルが長時間にわた
つてほぼ一定であるときなどはこのデータ送出は
不要である。 This short-time average audio power is used to set the level of the audio to be reproduced when the input audio signal is reproduced by the LPC synthesis filter 16 on the synthesis side. This data transmission is not necessary when the value is approximately constant over time.

多重化器１２はこうして供給された各量子化デ
ータを予め設定する方式で多重化して伝送路を介
して合成側に送出する。 The multiplexer 12 multiplexes each of the quantized data thus supplied using a preset method and sends the multiplexed data to the combining side via a transmission path.

第１図Ｂの合成側では多重化されて分析側から
供給された各量子化データを多重分離・復号化器
１３によつて多重化分離し、さらに復号化したう
え、第２図ｂに示す如き残差波形低域周波数成分
のダウンサンプリング標本化データは出力ライン
１３０１を介して高域再生フイルタ１４に、また
推定フイルタ係数データも出力ライン１３０２を
介して高域再生フイルタ１４に供給される。また
短時間平均音声電力データは入力ライン１３０３
を介して乗算器１６に、更にラベルの形式に変換
されているLPC係数データは多重分離・復号化
器１３に内蔵されている、分析側に有する標準パ
タンメモリ１００２と同一のメモリより復号化さ
れたLPC係数データを読出し、出力ライン１３
０４を介してLPC合成フイルタ１６にそれぞれ
供給される。 On the synthesis side in Figure 1B, each quantized data that has been multiplexed and supplied from the analysis side is demultiplexed and demultiplexed by the demultiplexer/decoder 13, and further decoded, as shown in Figure 2B. The downsampled sampled data of the low frequency component of the residual waveform is supplied to the high frequency reproduction filter 14 via an output line 1301, and the estimated filter coefficient data is also supplied to the high frequency reproduction filter 14 via an output line 1302. In addition, the short-time average audio power data is input to the input line 1303.
The LPC coefficient data, which has been converted into a label format, is sent to the multiplier 16 via the multiplier 16, and is decoded by the same memory as the standard pattern memory 1002 on the analysis side, which is built into the demultiplexer/decoder 13. Reads out the LPC coefficient data and sends it to output line 13.
04 to the LPC synthesis filter 16.

高域再生フイルタ１４は、推定フイルタ係数を
タツプ係数とするトランスバーサルフイルタを内
蔵し、入力したダウンサンプリング標本化サンプ
ルをいつたん第２図ｃに示す如く8KHzサンプリ
ングイメージの標本に変換してこれをフイルタ入
力として第２図ａに示す如き8KHzサンプリング
の残差波形データを再生する。この残差波形デー
タは乗算器１５に供給され、入力ライン１３０３
を介して入力する短時間平均音声電力との乗算を
介して再生残差波形データのレベル修正を行なつ
たうえこれをLPC合成フイルタ１６に供給する。 The high frequency reproduction filter 14 has a built-in transversal filter that uses the estimated filter coefficients as tap coefficients, and converts the input downsampling sample into an 8KHz sampling image sample as shown in Figure 2c. As a filter input, residual waveform data of 8KHz sampling as shown in FIG. 2a is reproduced. This residual waveform data is supplied to the multiplier 15 and input line 1303
The level of the reproduced residual waveform data is corrected through multiplication with the short-term average voice power inputted via the LPC synthesis filter 16.

LPC合成フイルタ１６は、入力ライン１３０
４を介して入力するLPC係数をフイルタ係数と
し、再生残差波形データを駆動音源情報として入
力音声信号をデジタル的に再生しこれをＤ／Ａコ
ンバータ１７に供給、アナログ量に変換したうえ
さらに内蔵LPFによつて所定の低域周波数成分
だけを出力端子１７０１に出力する。 The LPC synthesis filter 16 is connected to the input line 130.
The input audio signal is digitally reproduced using the LPC coefficients input through 4 as filter coefficients and the reproduced residual waveform data as drive sound source information, and is supplied to the D/A converter 17, which converts it into an analog quantity and further incorporates it. Only a predetermined low frequency component is outputted to the output terminal 1701 by the LPF.

第４図は本発明第二の実施例における分析側の
構成を示すブロツク図である。第４図に示す分析
側の構成はフイルタ係数推定器(2)１８の内容のみ
が第１図Ａ、第３図と異なるのみであり、他の同
一記号のものは全く同様であるのでこれらに関す
る詳細な説明は省略する。 FIG. 4 is a block diagram showing the configuration of the analysis side in the second embodiment of the present invention. The configuration of the analysis side shown in FIG. 4 differs from FIGS. 1A and 3 only in the contents of the filter coefficient estimator (2) 18, and the other components with the same symbols are exactly the same, so we will not be concerned about these. Detailed explanation will be omitted.

また第二の実施例における合成側は第１図Ｂに
示す第一の実施例における合成側と同一の構成で
あるのでこれについても詳細な説明は省略する。 Further, since the composition side in the second embodiment has the same configuration as the composition side in the first embodiment shown in FIG. 1B, detailed explanation thereof will also be omitted.

第４図に示す第二の実施例におけるフイルタ係
数推定器(2)１８は、第１図Ａに示すフイルタ係数
推定器(1)８がいわゆる学習同定法医やＡ−ｂ−Ｓ
法等によつて入力音声信号の残差波形との差を最
小ならしめるインパルス応答をもつトランスバー
サルフイルタのフイルタ係数を推定しようとする
ものであるのに対し、フイルタ係数を遂次的に変
化しつつ最適インパルス応答をもつトランスバー
サルフイルタを決定しようとする、いわゆるフオ
ワード的フイルタ係数設定手法によつてフイルタ
係数を推定する点が異る。 The filter coefficient estimator (2) 18 in the second embodiment shown in FIG. 4 is different from the filter coefficient estimator (1) 8 shown in FIG.
This method attempts to estimate the filter coefficients of a transversal filter that has an impulse response that minimizes the difference from the residual waveform of the input audio signal by using methods such as the method of changing the filter coefficients sequentially. The difference is that the filter coefficients are estimated by a so-called forward filter coefficient setting method, which attempts to determine a transversal filter with an optimal impulse response.

第５図は第４図に示すフイルタ係数推定器(2)１
８の構成を詳細に示すブロツク図である。 Figure 5 shows the filter coefficient estimator (2) 1 shown in Figure 4.
FIG. 8 is a block diagram showing in detail the configuration of FIG.

第５図のフイルタ係数推定器(2)１８は、相互相
関回路１８１、自己相関回路１８２および最大値
検索回路１８３を備えて構成される。 The filter coefficient estimator (2) 18 in FIG. 5 includes a cross-correlation circuit 181, an autocorrelation circuit 182, and a maximum value search circuit 183.

第２図ｂに示すような残差波形低域周波数成分
ダウンサンプルが量子化／復号化器７から相互相
関回路１８１と自己相関回路１８２に供給される
と、相互相関回路１８１は2KHzのダウンサンプ
リング周波数によるこのダウンサンプルを第２図
ｃの如く零レベルサンプルを内挿させながら見掛
上8KHzサンプリング標本化系列に変換したうえ
これとLPC逆フイルタ４から出力される第２図
ａに示すような8KHzサンプリングの残差波形と
の相互相関係数を算出しこれを最大値検索回路１
８３に送出する。 When the residual waveform low frequency component downsampling as shown in FIG. This down-sampling by frequency is converted to an apparent 8KHz sampling sequence while interpolating zero-level samples as shown in Figure 2c, and then the output from the LPC inverse filter 4 as shown in Figure 2a is Calculate the cross-correlation coefficient with the residual waveform of 8KHz sampling and use this as maximum value search circuit 1
83.

また自己相関回路１８２は、2KHzのダウンサ
ンプリング周波数によるダウンサンプル波形の零
レベル内挿による8KHzサンプルイメージ化を図
つて第２図ｃに示す如き標本化系列を発生したう
え、この8KHzイメージ化サンプルの自己相関係
数列を算出する。これら相互相関係数列と自己相
関係数列の算出には8KHzサンプリング周波数が
利用される。 Further, the autocorrelation circuit 182 generates a sampling sequence as shown in FIG. Calculate the autocorrelation coefficient sequence. An 8KHz sampling frequency is used to calculate these cross-correlation coefficient sequences and autocorrelation coefficient sequences.

さて、最大値検索回路１８３に供給される相互
相関係数列は予め設定する限定された遅れ時間内
でその最大値を検索する。この場合前記遅れ時間
に対応するタツプ位置がたとえばタツプγであり
かつこのタツプγにおける相互相関係数をφγと
し、また自己相関回路１８２において得られる自
己相関係数列の遅れ零における自己相関係数を
ψοとすると、φγをψοで除した値Cγ＝φγ／ψο
が
トランスバーサルフイルタのタツプγにおけるタ
ツプ係数となる。 Now, the cross-correlation coefficient sequence supplied to the maximum value search circuit 183 searches for its maximum value within a preset limited delay time. In this case, the tap position corresponding to the delay time is, for example, tap γ, the cross-correlation coefficient at this tap γ is φγ, and the auto-correlation coefficient at zero delay of the auto-correlation coefficient sequence obtained in the auto-correlation circuit 182 is If ψο, then the value of φγ divided by ψο is Cγ = φγ/ψο
is the tap coefficient at tap γ of the transversal filter.

次いで、前述した相互相関係数列からCγ倍さ
れた前記自己相関係数列をタツプ位置γを自己相
関値ψοの位置に一致して減ずる演算を実施し、
このような演算を繰返しつつCγを次次に求めて
トランスバーサルフイルタの全フイルタ係数を決
定しこれを推定フイルタ係数として出力する。第
５図に示す補正メツセージはこのようにしてCγ
を次次に演算、出力するために必要とするデータ
である。 Next, an operation is performed to subtract the autocorrelation coefficient sequence multiplied by Cγ from the cross-correlation coefficient sequence described above so that the tap position γ coincides with the position of the autocorrelation value ψο,
While repeating such calculations, Cγ is successively obtained, all filter coefficients of the transversal filter are determined, and these are output as estimated filter coefficients. In this way, the correction message shown in FIG.
This is the data required to perform subsequent calculations and outputs.

こうしてトランスバーサルフイルタの係数の連
続的変化のもとに最大インパルス応答を得るフオ
ワード的係数設定が実施でき、RELP音を大幅に
抑圧した残差励振型ボコーダの分析と合成処理が
安定、確実に行なうことができる。 In this way, forward coefficient settings can be performed to obtain the maximum impulse response based on continuous changes in the coefficients of the transversal filter, and the analysis and synthesis processing of the residual excitation type vocoder, which significantly suppresses RELP sound, can be performed stably and reliably. be able to.

第６図Ａ，Ｂはそれぞれ本発明の第三の実施例
における分析側Ａおよび合成側Ｂの構成を示すブ
ロツク図である。 FIGS. 6A and 6B are block diagrams showing the configurations of analysis side A and synthesis side B, respectively, in a third embodiment of the present invention.

第６図Ａにおいてはフイルタ係数推定器(3)１９
が、また第６図Ｂにおいては係数再生器(1)２０お
よび高域再生フイルタ２１がそれぞれ第１図Ａ，
Ｂにおけるものと異なるかもしくは追加されてい
るのみで、他はすべで同一であるのでこれらに関
する詳細な説明は省略する。 In FIG. 6A, filter coefficient estimator (3) 19
However, in FIG. 6B, the coefficient regenerator (1) 20 and the high frequency regeneration filter 21 are the same as those in FIG. 1A, respectively.
The only difference from or addition to that in B is that everything else is the same, so detailed explanations regarding these will be omitted.

第６図Ａに示すフイルタ係数推定器(3)１９は、
第１図Ａに示すフイルタ係数推定器(1)８のほかに
零位相化回路等を備えて構成され、前者で得られ
た合成側の高域再生フイルタの推定係数をさらに
位相線形フイルタの係数に変換し、零位相位をは
かつたうえ出力する。 The filter coefficient estimator (3) 19 shown in FIG. 6A is
In addition to the filter coefficient estimator (1) 8 shown in FIG. Convert it to , remove the zero phase position, and output it.

第７図Ａは第６図Ａに示す第三の実施例の分析
側において形成する高域再生フイルタ予測係数特
性図、第７図Ｂは第７図Ａの零位相化予測係数特
性図である。以下第７図Ａ，Ｂを参照しながら第
６図Ａ，Ｂの実施例を説明する。 FIG. 7A is a high-frequency reproduction filter prediction coefficient characteristic diagram formed on the analysis side of the third embodiment shown in FIG. 6A, and FIG. 7B is a zero-phase prediction coefficient characteristic diagram of FIG. 7A. . The embodiment shown in FIGS. 6A and 6B will be described below with reference to FIGS. 7A and 7B.

フイルタ係数推定器(3)１９は、まず第１図Ａに
示すフイルタ係数推定器(1)８とほぼ同様にして第
７図Ａに示すような推定フイルタ係数を発生す
る。次にこの推定フイルタ係数はトランスバーサ
ルフイルタによる複素スペクトルとしての係数群
からほぼ同じインパルス応答特性を示す位相線形
フイルタによるパワースペクトルとしての第７図
Ｂに示す係数群に変換される。この変換によつて
形成される係数群は位相情報は失つてしまうが第
７図Ａにおけるトランスバーサルフイルタ係数群
と同一のタツプ数でエネルギー中心位置は不変、
かつ左右対象なタツプ係数すなわちインパルス応
答列が得られる。このようにエネルギー中心位置
が不変であれば位相情報を失つても高域再生フイ
ルタの係数として実用上殆んど問題はない。 The filter coefficient estimator (3) 19 first generates estimated filter coefficients as shown in FIG. 7A in substantially the same manner as the filter coefficient estimator (1) 8 shown in FIG. 1A. Next, the estimated filter coefficients are converted from a group of coefficients as a complex spectrum by a transversal filter to a group of coefficients shown in FIG. 7B as a power spectrum by a phase linear filter having substantially the same impulse response characteristics. The coefficient group formed by this conversion loses phase information, but has the same number of taps as the transversal filter coefficient group in Figure 7A, and the energy center position remains unchanged.
Moreover, a symmetrical tap coefficient, that is, an impulse response sequence can be obtained. In this way, if the energy center position remains unchanged, even if phase information is lost, there is practically no problem as a coefficient of a high frequency reproduction filter.

さて第７図Ｂに示す位相線形フイルタ係数群は
左右対称性を有するのでエネルギー中心位置から
左右いずれか、全体のほぼ1/2のみを推定係数と
して合成側に送出しても容易に全体を再生するこ
とが可能である。フイルタ係数推定器(3)１９から
はこのような理由から全体係数群のほぼ1/2の係
数群のみが量子化器９に送出され、所定の量子化
を施されて多重化器１２に供給され他の量子化デ
ータとともに多重化されて第６図Ｂの合成側に伝
送される。 Now, since the phase linear filter coefficient group shown in Figure 7B has left-right symmetry, the whole can be easily reproduced even if only approximately 1/2 of the total, either on the left or right of the energy center position, is sent to the synthesis side as estimated coefficients. It is possible to do so. For this reason, from the filter coefficient estimator (3) 19, only approximately half of the coefficient group of the entire coefficient group is sent to the quantizer 9, where it is subjected to predetermined quantization and supplied to the multiplexer 12. The quantized data is multiplexed with other quantized data and transmitted to the combining side in FIG. 6B.

第６図Ｂに示す合成側では多重分離・復号化器
１３から出力ライン１３０１を介して多重化分
離、復号化した推定係数データを係数再生器(1)２
０に供給する。このようにして供給される推定係
数データは第６図Ｂに示す係数群の中心タツプ位
置ぶんを含みほぼ1/2である。係数再生器(1)２０
はこの係数群にもとづき再び第７図Ｂに示す対称
化係数群を再生したのちこれら係数群をタツプ係
数として構成されるトランスバーサルフイルタを
利用する高域再生フイルタ２１に供給し、他は第
１図Ｂ等によつて説明したとほぼ同様にして再生
入力音声信号を出力端子１７０１に出力し、かく
して推定係数の伝送データビツトレートを大幅に
低減した分析、合成が可能となる。 On the synthesis side shown in FIG.
Supply to 0. The estimated coefficient data thus supplied is approximately 1/2 including the center tap position of the coefficient group shown in FIG. 6B. Coefficient regenerator (1) 20
On the basis of this coefficient group, the symmetrization coefficient group shown in FIG. The reproduced input audio signal is outputted to the output terminal 1701 in substantially the same manner as described with reference to FIG.

第８図Ａ，Ｂは本発明の第四の実施例における
分析側Ａおよび合成側Ｂの構成を示すブロツク図
である。 FIGS. 8A and 8B are block diagrams showing the configurations of analysis side A and synthesis side B in a fourth embodiment of the present invention.

第８図Ａはフイルタ係数推定器(4)２２が、また
第８図Ｂは係数再生器(2)２３および高域再生フイ
ルタ２４以外はそれぞれ第１図Ａ，Ｂと同様であ
るのでこれらに関する詳細な説明は省略する。 Figure 8A is the same as Figure 1A and B except for the filter coefficient estimator (4) 22, and Figure 8B is the same as Figure 1A and B except for the coefficient regenerator (2) 23 and the high frequency reproduction filter 24, Detailed explanation will be omitted.

第８図Ａ，Ｂに示す第四の実施例は、第１図Ａ
に示すフイルタ係数推定器(1)８と同様にして得ら
れる高域再生フイルタのタツプ係数群をこれとほ
ぼ同一の電力スペクトルを有しかつエネルギー中
心位置は不変である三角形包絡配列の推定係数列
となるように変換し、分析側から合成側にはこの
三角波包絡を構成する変換推定係数列と実変換系
数列との差の成分のみを伝送するという方法で伝
送データレートの減少を図るものである。 The fourth embodiment shown in FIGS. 8A and 8B is as shown in FIG. 1A.
The tap coefficient group of the high-frequency reproducing filter obtained in the same manner as the filter coefficient estimator (1) 8 shown in Fig. 8 is an estimated coefficient sequence of a triangular envelope array having almost the same power spectrum and the energy center position unchanged. This method aims to reduce the transmission data rate by converting the triangular wave envelope so that be.

第９図Ａは第８図Ａに示す第四の実施例の分析
側において形成する高域再生フイルタ予測係数特
性図であり、第７図Ａと基本的にほぼ同一の内容
を示し、第９図Ｂは第９図Ａの推定係数群をほぼ
同一電力スペクトルに保ちつつ得られる三角形包
絡分布変換特性図である。 FIG. 9A is a high frequency reproduction filter prediction coefficient characteristic diagram formed on the analysis side of the fourth embodiment shown in FIG. 8A, and shows basically the same contents as FIG. FIG. 9B is a triangular envelope distribution transformation characteristic diagram obtained while keeping the estimated coefficient group of FIG. 9A in substantially the same power spectrum.

第９図Ｂは第９図Ａの係数群に対しその概略的
な包絡線が所望の三角形状となるように変換され
る。この変換は第９図Ａに示す係数群をフーリエ
変換し、所望の三角形状の係数が有する位相推移
に一致するように、第９図Ａのフーリエ変換値を
位相調整することにより実現される。 In FIG. 9B, the coefficient group in FIG. 9A is transformed so that its approximate envelope becomes a desired triangular shape. This transformation is realized by Fourier transforming the coefficient group shown in FIG. 9A, and adjusting the phase of the Fourier transformed value in FIG. 9A so as to match the phase shift of the desired triangular coefficient.

変換係数群の包絡が三角形状となるようにする
動機は、一般的に声帯振動波形が三角形の波形を
とることにもとづいて配慮したものであり、この
ような変換によつて位相情報を喪失しても、この
包絡形状とエネルギー中心位置不変の係数配列特
性から合成による再生音声品質には殆んど影響を
与えないで済む。 The motivation for making the envelope of the transformation coefficient group triangular is based on the fact that the vocal fold vibration waveform generally takes a triangular waveform, and this kind of transformation causes loss of phase information. However, due to this envelope shape and the characteristic of the coefficient array in which the energy center position remains unchanged, there is almost no effect on the quality of the reproduced audio by synthesis.

こうして、三角形の包絡分布を示しかつほぼ同
一の電力スペクトルを有するような係数系列に変
換されたあと所望の三角形状との差成分のみをフ
イルタ係数推定器(4)２２から出力し量子化器９を
介して量子化データとして多重化器１２に送出、
これから合成側に伝送する。 In this way, after being converted into a coefficient series that shows a triangular envelope distribution and has almost the same power spectrum, only the difference component from the desired triangular shape is outputted from the filter coefficient estimator (4) 22, and the quantizer 9 Send it to the multiplexer 12 as quantized data via
Now it will be transmitted to the synthesis side.

第８図Ｂに示す合成側は、多重分離・復号化器
１３によつて復号再生した所望の三角形状との差
成分を出力ライン１３０１を介して係数再生器(2)
２３に供給し、これにより分析側で減じた所望の
三角形状を付加して係数群を再生、これらを高域
再生フイルタ２４に供給する。 On the synthesis side shown in FIG.
23, thereby adding the desired triangular shape subtracted on the analysis side to reproduce the coefficient group, and supplying these to the high frequency reproduction filter 24.

高域再生フイルタ２４は、これら係数群をその
タツプ係数とし、かつ出力ライン１３０２を介し
て8KHzサンプリングイメージ化を図つた残差波
形低域周波数成分を入力として高域周波数成分を
含む全残差波形を再生する。 The high frequency reproduction filter 24 uses these coefficient groups as its tap coefficients, and inputs the residual waveform low frequency component which has been converted into an 8KHz sampling image via the output line 1302, and generates the total residual waveform including the high frequency component. Play.

合成側はこのあと、第１図Ｂに示す内容と同一
の再生動作を実施して入力音声信号を再生し、か
くしてRELP音を基本的に除去するとともに高域
再生フイルタのタツプ係数の伝送データビツトレ
ートを著しく低減した分析、合成処理が可能とな
る。 The synthesis side then performs the same playback operation as shown in Figure 1B to play back the input audio signal, thus basically removing the RELP sound and also changing the transmission data bits of the tap coefficient of the high-frequency playback filter. Analysis and synthesis processing can be performed at significantly reduced rates.

尚、上述の説明では第１図Ａ、第４図、第６図
Ａ、第８図Ａに各々電力算出器１０を備え、第１
図Ｂ、第６図Ｂ、第８図Ｂに各々乗算器１５を備
えていたが、量子化／復号化器７の残差波形低域
周波数成分の量子化、復号化処理に関する電力デ
ータによる正規化処理を行なわず、絶対レベルの
量子化、復号化処理を行なうのであれば、前記電
力算出器１０と乗算器１５とは不要である。 In the above explanation, the power calculators 10 are provided in each of FIGS. 1A, 4, 6A, and 8A, and the first
Although multipliers 15 were provided in each of FIG. B, FIG. 6B, and FIG. If absolute level quantization and decoding processing are performed without performing quantization processing, the power calculator 10 and multiplier 15 are unnecessary.

第１１図は本発明第五の実施例における分析側
の構成を示すブロツク図である。第１１図に示す
分析側に於いて第１図に示す第１の実施例と同一
の構成のものは同一の記号が付されており、これ
らに関する詳細な説明は省略する。 FIG. 11 is a block diagram showing the configuration of the analysis side in the fifth embodiment of the present invention. On the analysis side shown in FIG. 11, the same components as in the first embodiment shown in FIG. 1 are denoted by the same symbols, and detailed explanations thereof will be omitted.

また第五の実施例における合成側は第１図Ｂに
示す第１の実施例における合成側と同一の構成で
あるので、これについても詳細な説明は省略す
る。 Further, since the composition side in the fifth embodiment has the same configuration as the composition side in the first embodiment shown in FIG. 1B, detailed explanation thereof will also be omitted.

第１１図に示す第５の実施例は入力音声信号の
スペクトル包絡情報をパタンマツチング手段によ
り伝送するに当り、１入力音声を分析して得られるLPC係数と最
もスペクトル距離の小さな標準パタンと、２番
目にスペクトル距離の小さな標準パタンとの２
種類の標準パタン候補を予備的に選択し、２前記選択された２種類の標準パタン候補の
各々を用いて、各々独立に前述の第１の実施例
に於ける残差波形の算出、フイルタ係数推定等
の一連の分析処理を実行し、更に本来、合成側
で実行される音声再生処理を実行する。 The fifth embodiment shown in FIG. 11 transmits the spectral envelope information of the input audio signal using the pattern matching means. 1. LPC coefficients obtained by analyzing the input audio and a standard pattern with the smallest spectral distance; 2 with the standard pattern with the second smallest spectral distance
2. Using each of the two selected standard pattern candidates, each independently calculates the residual waveform and filters the filter coefficients in the first embodiment. It executes a series of analysis processes such as estimation, and further executes audio reproduction processing that would normally be executed on the synthesis side.

３更に２種類の予備的に選択された標準パタン
候補に対応する２種類の再生音声各々と入力音
声とのＳ／Ｎ比を測定する。3. Furthermore, measure the S/N ratio between each of the two types of reproduced sounds corresponding to the two types of preliminarily selected standard pattern candidates and the input sound.

４前記Ｓ／Ｎ比の良好な標準パタン候補を標準
パタンと決定する。4. Determine the standard pattern candidate with a good S/N ratio as the standard pattern.

ことにより、残差励振型ボコーダに於いて、より
適切なパタンマツチングを行なうものである。As a result, more appropriate pattern matching can be performed in the residual excitation type vocoder.

Ａ／Ｄコンバータ１は出力をLPC分析器２お
よび波形メモリ２９へ出力する。LPC分析器２
は分析したLPC係数を量子化／復号化器２５へ
出力する。量子化／復号化器２５は例えば第１２
図に示す構成であり、量子化／復号化器３に補助
ラベルレジスタ１２０１を追加して構成されるも
のである。以下、第１２図を用いて量子化／復号
化器２５を説明する。パタン照合プロセツサ１０
０１はLPC係数が入力されると、最小距離レジ
スタ１００４を経験的に知られている前記(2)式に
示すDijの最大値よりも大きな値D₀に初期設定す
る。更にラベルレジスタ１００５と補助ラベルレ
ジスタ１２０１を例えば”4096”に初期設定す
る。次にパタン照合プロセツサ１００１は標準パ
タンに付されたラベル”０”に相等するスタート
番地０〓10＝０（０〓10の10はLPCの次数10に相
等する）を標準パタンメモリ１００２へ、LPC
係数α₁に対応するスペクトル感度W₁を指定する
番地０をスペクトル感度メモリ１００３へ出力
し、スペクトル距離Ｄを”０”に設定する。次に
パタン照合プロセツサ１００１は標準パタンメモ
リ１００２より出力されるラベル”０”の標準パ
タンのα₁ ⁽⁰⁾（前記(2)式に於いてP₁ ⁽ⁱ⁾に相等する）
とスペクトル感度メモリ１００３より出力される
スペクトル感度W₁と、LPC分析器２より供給さ
れたLPC係数の第１番目の係数α₁ ^(I)（前記(2)式に
於いてP₁ ^(j)に相等する）とから、Ｄ＝W₁（α₁ ⁽⁰⁾−α₁ ^(I)）² を算出する。次にパタン照合プロセツサ１００１
は標準パタンメモリ１００２に出力している番地
データをインクリメントし、即ち０＋１＝１と
し、更にスペクトル感度メモリ１００３へ出力し
ている番地データをインクリメントし、即ち０＋
１＝１とし、α₂ ⁽⁰⁾，W₂を入力する。パタン照合
プロセツサ１００１は前記求めたＤにW₂（α₂ ⁽⁰⁾−
α₂ ^(I)）²で算出される量を加算しＤ＝W₁（α₁ ⁽⁰⁾−α₁ ^(I)）²＋W₂（α₂ ⁽⁰⁾−α₂ ^(I)）² ＝₂ 〓^k=1 W_k（α_k ⁽⁰⁾−α_k ^(I)）² を求める。以下同様にパタン照合プロセツサ１０
０１は標準パタンメモリ１００２へ番地データを
２，３，４，…10と次々に、スペクトル感度メモ
リ１００３へ番地データを２，３，４，…10と
次々に与えＤ＝₁₀ 〓^k=1 W_k（α_k ⁽⁰⁾−α_k ^(I)）² ……(3) を算出する。 A/D converter 1 outputs an output to LPC analyzer 2 and waveform memory 29. LPC analyzer 2
outputs the analyzed LPC coefficients to the quantizer/decoder 25. The quantizer/decoder 25 is, for example, the 12th
The configuration shown in the figure is one in which an auxiliary label register 1201 is added to the quantizer/decoder 3. The quantizer/decoder 25 will be explained below using FIG. 12. Pattern matching processor 10
01 initializes the minimum distance register 1004 to a value D ₀ larger than the empirically known maximum value of Dij shown in equation (2) above when the LPC coefficient is input. Furthermore, the label register 1005 and the auxiliary label register 1201 are initialized to, for example, "4096". Next, the pattern matching processor 1001 transfers the start address 0〓10=0 (10 of 0〓10 is equivalent to the order 10 of the LPC) to the standard pattern memory 1002, which is equivalent to the label "0" attached to the standard pattern, to the standard pattern memory 1002.
The address 0 specifying the spectral sensitivity W ₁ corresponding to the coefficient α ₁ is output to the spectral sensitivity memory 1003, and the spectral distance D is set to "0". Next, the pattern matching processor 1001 calculates α ₁ ⁽⁰⁾ (equivalent to P ₁ ⁽ⁱ⁾ in equation (2) above) of the standard pattern with the label “0” output from the standard pattern memory 1002.
and the spectral sensitivity W ₁ output from the spectral sensitivity memory 1003, and the first coefficient α ₁ ^(I) of the LPC coefficients supplied from the LPC analyzer 2 (P ₁ ( ^{j) in the above equation (2)} ), calculate D=W ₁ (α ₁ ⁽⁰⁾ − α ₁ ^(I) ) ² . Next, the pattern matching processor 1001
Increments the address data output to the standard pattern memory 1002, that is, 0+1=1, and further increments the address data output to the spectral sensitivity memory 1003, that is, 0+
1=1, and input α ₂ ⁽⁰⁾ and W ₂ . The pattern matching processor 1001 applies W ₂ (α ₂ ⁽⁰⁾ −
Add the quantities calculated in α ₂ ^(I) ) ² and get D=W ₁ (α ₁ ⁽⁰⁾ − _{α 1} ^(I) ) ² + W ₂ (α ₂ ⁽⁰⁾ − α ₂ ^(I) ) ² = Find ₂ 〓 ^k=1 W _k (α _k ⁽⁰⁾ − α _k ^(I) ) ² . Similarly, the pattern matching processor 10
01 gives address data 2, 3, 4, ... 10 one after another to the standard pattern memory 1002, and gives address data 2, 3, 4, ... 10 one after another to the spectral sensitivity memory 1003, D= ₁₀ 〓 ^k=1 W Calculate _k (α _k ⁽⁰⁾ − α _k ^(I) ) ² ...(3).

次にパタン照合プロセツサ１００１は最小距離
レジスタ１００４に記憶されているデータD₀と
前記D_0Iとを比較し、D₀＞D_0Iの場合には、最小距
離レジスタ１００４にD₀としてD_0Iを再設定し、
更にラベルレジスタ１００５にラベル“０”を示
す“０”を再設定する。次にパタン照合プロセツ
サ１００１は、ラベルレジスタ１００５に記憶さ
れていた“4096”を補助ラベルレジスタ１２０１
へ出力する。D₀＞D_0Iの場合にはパタン照合プロ
セツサ１００１は最小距離レジスタ１００４とラ
ベルレジスタ１００５とに対する再設定は実施し
ない。D₀＝D_0Iの場合は、前記再設定は実施して
も、しなくともよい。 Next, the pattern matching processor 1001 compares the data D ₀ stored in the minimum distance register 1004 with the D _0I , and if D ₀ > D _0I , rewrites D _0I as D ₀ in the minimum distance register 1004. Set,
Further, the label register 1005 is reset to "0" indicating the label "0". Next, the pattern matching processor 1001 transfers “4096” stored in the label register 1005 to the auxiliary label register 1201.
Output to. In the case of D ₀ >D _0I , the pattern matching processor 1001 does not reset the minimum distance register 1004 and the label register 1005. In the case of D ₀ =D _0I , the resetting may or may not be performed.

次にパタン照合プロセツサ１００１は標準パタ
ンに付されたラベル“１”に相等するスタート番
地１〓10＝10を標準パタンメモリ１００２へ、
LPC係数α₁に対応するスペクトル感度W₁を指定
する番地“０”をスペクトル感度メモリ１００３
へ出力し、スペクトル距離Ｄを“０”に設定す
る。更にパタン照合プロセツサ１００１はラベル
“１”の標準パタンのα₁ ⁽¹⁾と前記のW₁と前記のα₁ ⁽
^Ｉ）とからＤ＝W₁（α₁ ⁽¹⁾−α₁ ^(I)）²を算出する。次
にパ
タン照合プロセツサ１００１は標準パタンメモリ
１００２へ供給する番地データを11，12，…，20
と次々に与え、スペクトル感度メモリ１００３へ
供給する番地データを１，２，…10と次々と与え
てＤ＝₁₀ 〓^k=1 W_k（α_k ⁽¹⁾−α_k ^(I)）² ……(4) を算出する。(4)式で与えられるＤは標準パタン
α⁽¹⁾と入力LPC係数データα^(I)との距離D_1Iである。
次にパタン照合プロセツサ１００１は最小距離レ
ジスタ１００４に記憶されているデータD₀と前
記D_1Iとを比較し適宜、最小距離レジスタ１００
５とラベルレジスタ１００５とを書替え、更にラ
ベルレジスタ１００５の内容を補助ラベルレジス
タ１２０１へ退避する。 Next, the pattern matching processor 1001 stores the start address 1〓10=10, which is equivalent to the label "1" attached to the standard pattern, into the standard pattern memory 1002.
The address “0” specifying the spectral sensitivity W ₁ corresponding to the LPC coefficient α ₁ is stored in the spectral sensitivity memory 1003.
and set the spectral distance D to "0". Further, the pattern matching processor 1001 uses α ₁ ⁽¹⁾ of the standard pattern labeled “1”, the above W ₁ and the above α ₁ ⁽
D=W ₁ (α ₁ ⁽¹⁾ − α ₁ ^(I) ) ² is calculated from ^I) . Next, the pattern matching processor 1001 supplies address data 11, 12,..., 20 to the standard pattern memory 1002.
are given one after another, and the address data to be supplied to the spectral sensitivity memory 1003 is given one after another as 1, 2, ...10, D= ₁₀ 〓 ^k=1 W _k (α _k ⁽¹⁾ − α _k ^(I) ) ² ... ...(4) is calculated. D given by equation (4) is the distance D _1I between the standard pattern α ⁽¹⁾ and the input LPC coefficient data α ^(I) .
Next, the pattern matching processor 1001 compares the data D ₀ stored in the minimum distance register 1004 with the data D _1I and stores the data in the minimum distance register 100 as appropriate.
5 and the label register 1005, and further saves the contents of the label register 1005 to the auxiliary label register 1201.

以下、次々にラベル“２”，“３”，…，“4095”
の標準パタンとの照合を実施する。 Below, the labels "2", "3", ..., "4095" are displayed one after another.
Perform comparison with the standard pattern.

パタン照合プロセツサ１００１は必要に応じて
ラベルレジスタ１００５又は補助ラベルレジスタ
１２０１の内容を符号化データとして多重化器２
６へ出力し、ラベルレジスタ１００５又は補助ラ
ベルレジスタ１２０１の内容に対応する標準パタ
ンデータをLPC逆フイルタ４とLPC合成フイル
タ１６とへ出力する。 The pattern matching processor 1001 converts the contents of the label register 1005 or auxiliary label register 1201 into coded data to the multiplexer 2 as necessary.
6, and outputs standard pattern data corresponding to the contents of the label register 1005 or the auxiliary label register 1201 to the LPC inverse filter 4 and the LPC synthesis filter 16.

量子化／復号化器７は残差波形低域周波数成分
を量子化してこれを多重化器２６に送出するとと
もに、これを復号化したデータをフイルタ係数推
定器(1)８と高域再生フイルタ１４とに供給する。
フイルタ係数推定器(1)８は推定フイルタ係数を量
子化器９へ出力する。量子化器９は前記係数を量
子化し多重化器２６と復号化器２８とへ供給す
る。復号化器２８は量子化されたフイルタ係数を
復号化し乗算器１５へ出力する。電力算出器１０
は残差波形を入力し分析フレームごとの短時間平
均音声電力を算出し、これを量子化器１１と量子
化／復号化器１７とへ出力する。量子化器１１は
短時間平均音声電力を量子化し多重化器２６と復
号化器２８とへ出力する。復号化器２８は短時間
平均電力を復号化し乗算器１５へ出力する。 The quantizer/decoder 7 quantizes the residual waveform low frequency component and sends it to the multiplexer 26, and also sends the decoded data to the filter coefficient estimator (1) 8 and the high frequency reproduction filter. 14.
Filter coefficient estimator (1) 8 outputs estimated filter coefficients to quantizer 9. A quantizer 9 quantizes the coefficients and supplies them to a multiplexer 26 and a decoder 28. The decoder 28 decodes the quantized filter coefficients and outputs them to the multiplier 15. Power calculator 10
inputs the residual waveform, calculates the short-term average audio power for each analysis frame, and outputs this to the quantizer 11 and the quantizer/decoder 17. The quantizer 11 quantizes the short-term average voice power and outputs it to the multiplexer 26 and the decoder 28. The decoder 28 decodes the short-time average power and outputs it to the multiplier 15.

高域再生フイルタ１４は量子化／復号化器７よ
り供給された残差波形低域周波数成分から高域を
再生し、乗算器１５へ出力する。乗算器１５は前
記波形に短時間平均音声電力との乗算を実施して
LPC合成フイルタ１６へ出力する。LPC合成フ
イルタ１６は音声を再生しＳ／Ｎ比算出器３０へ
出力する。波形メモリ２９はＳ／Ｎ比算出器３０
へ入力音声波形を出力する。Ｓ／Ｎ比算出器３０
は再生音声波形と入力音声波形との差の波形（以
下「誤差波形」と云う）を求め、これの電力P_N
を算出する。更に入力音声波形の電力P_Iを算出
し、Ｓ／Ｎ比即ちP_N／P_Iを算出し多重化器２６
へ出力する。 The high frequency reproduction filter 14 reproduces the high frequency from the residual waveform low frequency component supplied from the quantizer/decoder 7 and outputs it to the multiplier 15 . The multiplier 15 multiplies the waveform by the short-term average audio power.
Output to LPC synthesis filter 16. The LPC synthesis filter 16 reproduces the audio and outputs it to the S/N ratio calculator 30. The waveform memory 29 is an S/N ratio calculator 30
Outputs the input audio waveform to. S/N ratio calculator 30
calculates the waveform of the difference between the reproduced audio waveform and the input audio waveform (hereinafter referred to as the "error waveform"), and calculates its power P _N
Calculate. Furthermore, the power P _I of the input audio waveform is calculated, the S/N ratio, that is, P _N /P _I is calculated, and the multiplexer 26
Output to.

以上のSN算出処理は前述のラベルレジスタ１
００５の内容と補助ラベルレジスタ１００４の内
容とについて各々１回づつ実行される。多重化器
２６はラベルレジスタ１００５の内容に関する
Ｓ／Ｎ比と補助ラベルレジスタ１２０１の内容に
関するＳ／Ｎ比とを比較し、Ｓ／Ｎ比の良好なラ
ベルと前記ラベルに対応するフイルタ係数等を合
成側へ出力する。 The above SN calculation process is performed using the label register 1 mentioned above.
005 and the contents of the auxiliary label register 1004 are executed once each. The multiplexer 26 compares the S/N ratio regarding the contents of the label register 1005 and the S/N ratio regarding the contents of the auxiliary label register 1201, and selects a label with a good S/N ratio and a filter coefficient etc. corresponding to the label. Output to the synthesis side.

無論Ｓ／Ｎ比を算出する代りに誤差波形の電力
を判定パラメータに利用しても本発明は実施でき
る。又、量子化／復号化器２５は補助ラベルレジ
スタを高々１ケ有しているが必ずしも１ケに限定
されない。更にフイルタ係数推定器(1)８はフイル
タ係数推定器(2)１８又はフイルタ係数推定器(3)１
９と置換し得る。 Of course, the present invention can also be implemented by using the power of the error waveform as the determination parameter instead of calculating the S/N ratio. Further, the quantizer/decoder 25 has at most one auxiliary label register, but the number is not necessarily limited to one. Furthermore, the filter coefficient estimator (1) 8 is the filter coefficient estimator (2) 18 or the filter coefficient estimator (3) 1
9 can be replaced.

なお、第１〜第５の実施例においてパタンマツ
チングのパラメータとしてαパラメータを使用し
ていたが、これはＫパラメータ、LSPパラメータ
等でも実施し得ることは明らかである。 In the first to fifth embodiments, the α parameter was used as a pattern matching parameter, but it is clear that this can also be implemented using the K parameter, LSP parameter, etc.

〔Effect of the invention〕

以上説明した如く本発明によれば、スペクトル
包絡情報をパタンマツチング手段により伝送する
手段を有し、且つ、高域周波数成分再生手段を分
析側に備えた残差励振型ボコーダにおいて、残差
波形低域周波数成分のダウンサンプル値を零レべ
ルサンプルの内挿を介して基準サンプリング変換
標本系列としたサンプルを利用しかつ非線形回路
を利用することなく高域再生フイルタのタツプ係
数を推定する手段を備えて入力音声信号の分析な
らびに合成を図ることにより、大幅なRELP音の
改善がきわめて安定かつ確実に図れる残差励振型
ボコーダが実現できるという効果がある。 As explained above, according to the present invention, in a residual excitation type vocoder having a means for transmitting spectral envelope information by a pattern matching means and having a high frequency component reproducing means on the analysis side, the residual waveform Means for estimating the tap coefficient of a high-frequency reproduction filter by using a sample sequence in which a down-sampled value of a low-frequency component is made into a reference sampling conversion sample sequence through interpolation of a zero-level sample and without using a nonlinear circuit. By analyzing and synthesizing the input audio signal using the system, it is possible to realize a residual excitation type vocoder that can significantly improve the RELP sound in an extremely stable and reliable manner.

[Brief explanation of the drawing]

第１図Ａは本発明第一の実施例における分析側
の構成を示すブロツク図、第１図Ｂは合成側の構
成を示すブロツク図、第２図ａは第１図Ａ，Ｂに
示す本発明第一の実施例において入力音声信号の
残差波形を基準サンプリング周波数でサンプリン
グした残差波形の一例を示す残差波形基準サンプ
ル標本図、第２図ｂは残差波形の低域周波数成分
をダウンサンプリング周波数でサンプリングした
ダウンサンプルの一例を示すダウンサンプル標本
図、第２図ｃは第２図ｂに示す残差波形低域周波
数成分のダウンサンプル標本に対し基準サンプリ
ング周波数に対応する基準サンプリング変換を実
施した残差波形低域周波数成分の基準サンプリン
グ変換標本系列、第３図は第１図Ａに示す第一の
実施例における分析側のフイルタ係数推定器(1)８
の構成を詳細に示すブロツク図、第４図は本発明
の第二の実施例における分析側の構成を示すブロ
ツク図、第５図は第４図に示す本発明の第二の実
施例におけるフイルタ係数推定器(2)１８の構成を
詳細に示すブロツク図、第６図Ａは本発明第三の
実施例における分析側の構成を示すブロツク図、
第６図Ｂは合成側の構成を示すブロツク図、第７
図Ａ，Ｂは第６図Ａに示す本発明の第三の実施例
の分析側において形成する高域再生フイルタ予測
係数特性図Ａおよびその零位相化予測係数特性図
Ｂ、第８図Ａ，Ｂは本発明の第四の実施例におけ
る分析側Ａおよび合成側Ｂの構成を示すブロツク
図、第９図Ａ，Ｂは第８図Ａの第四の実施例にお
ける分析側において形成する高域再生フイルタ予
測係数特性図Ａおよびその同一電力スペクトルに
よる三角形包絡分布変換特性図Ｂ、第１０図は第
１図Ａに示す第１の実施例における分析側の量子
化／復号化器３の構成を詳細に示すブロツク図、
第１１図は本発明第５の実施例における分析側の
構成を示すブロツク図、第１２図は第１１図に示
す第５の実施例における分析側の量子化／復号化
器２５の構成を詳細に示すブロツク図である。１……Ａ／Ｄコンバータ、２……LPC分析器、
３……量子化／復号化器、４……LPC逆フイル
タ、５……LPF、６……ダウンサンプラ、７…
…量子化／復号化器、８……フイルタ係数推定器
(1)、９……量子化器、１０……電力算出器、１１
……量子化器、１２……多重化器、１３……多重
分離・復号化器、１４……高域再生フイルタ、１
５……乗算器、１６……LPC合成フイルタ、１
７……Ｄ／Ａコンバータ、１８……フイルタ係数
推定器(2)、１９……フイルタ係数推定器(3)、２０
……係数再生器(1)、２１……高域再生フイルタ、
２２……フイルタ係数推定器(4)、２３……係数再
生器(2)、２４……高域再生フイルタ、２５……量
子化／復号化器、２６……多重化器、２７……復
号化器、２８……復号化器、２９……波形メモ
リ、３０……Ｓ／Ｎ比算出器、８１……トランス
バーサルフイルタ、８２……減算器、８３……電
力算出器、１８１……相互相関回路、１８２……
自己相関回路、１８３……最大値検索回路、１０
０１……パタン照合プロセツサ、１００２……標
準パタンメモリ、１００３……スペクトル感度メ
モリ、１００４……最小距離レジスタ、１００５
……ラベルレジスタ、１２０１……補助ラベルレ
ジスタ。 FIG. 1A is a block diagram showing the configuration on the analysis side in the first embodiment of the present invention, FIG. 1B is a block diagram showing the configuration on the synthesis side, and FIG. A residual waveform standard sample diagram showing an example of a residual waveform obtained by sampling the residual waveform of an input audio signal at a standard sampling frequency in the first embodiment of the invention, FIG. 2b shows a low frequency component of the residual waveform. A down-sampling sample diagram showing an example of down-sampling sampled at the down-sampling frequency, Fig. 2 c is a reference sampling conversion corresponding to the reference sampling frequency for the down-sampling sample of the residual waveform low frequency component shown in Fig. 2 b. FIG. 3 shows the reference sampling conversion sample sequence of the low frequency component of the residual waveform obtained by performing the following steps.
4 is a block diagram showing the configuration of the analysis side in the second embodiment of the present invention, and FIG. 5 is a block diagram showing the configuration of the filter in the second embodiment of the present invention shown in FIG. A block diagram showing in detail the configuration of the coefficient estimator (2) 18, FIG. 6A is a block diagram showing the configuration of the analysis side in the third embodiment of the present invention,
Figure 6B is a block diagram showing the configuration of the synthesis side, Figure 7
Figures A and B are a high frequency reproduction filter prediction coefficient characteristic diagram A formed on the analysis side of the third embodiment of the present invention shown in Figure 6A, its zero-phase prediction coefficient characteristic diagram B, Figure 8A, B is a block diagram showing the configuration of the analysis side A and the synthesis side B in the fourth embodiment of the present invention, and FIGS. 9A and B are the high regions formed on the analysis side in the fourth embodiment of FIG. The reproduction filter prediction coefficient characteristic diagram A and the triangular envelope distribution transformation characteristic diagram B based on the same power spectrum, FIG. 10 show the configuration of the quantizer/decoder 3 on the analysis side in the first embodiment shown in FIG. Detailed block diagram,
FIG. 11 is a block diagram showing the configuration of the analysis side in the fifth embodiment of the present invention, and FIG. 12 shows the detailed configuration of the quantizer/decoder 25 on the analysis side in the fifth embodiment shown in FIG. FIG. 1...A/D converter, 2...LPC analyzer,
3...Quantizer/decoder, 4...LPC inverse filter, 5...LPF, 6...Down sampler, 7...
...quantizer/decoder, 8...filter coefficient estimator
(1), 9...Quantizer, 10...Power calculator, 11
... Quantizer, 12 ... Multiplexer, 13 ... Demultiplexer/decoder, 14 ... High frequency reproduction filter, 1
5... Multiplier, 16... LPC synthesis filter, 1
7...D/A converter, 18...Filter coefficient estimator (2), 19...Filter coefficient estimator (3), 20
... Coefficient regenerator (1), 21 ... High frequency regeneration filter,
22... Filter coefficient estimator (4), 23... Coefficient regenerator (2), 24... High frequency reproduction filter, 25... Quantizer/decoder, 26... Multiplexer, 27... Decoding encoder, 28... decoder, 29... waveform memory, 30... S/N ratio calculator, 81... transversal filter, 82... subtractor, 83... power calculator, 181... mutual Correlation circuit, 182...
Autocorrelation circuit, 183... Maximum value search circuit, 10
01...Pattern matching processor, 1002...Standard pattern memory, 1003...Spectral sensitivity memory, 1004...Minimum distance register, 1005
... Label register, 1201 ... Auxiliary label register.

Claims

[Claims] 1. A first method that compares the spectral envelope characteristic of an input audio signal with a standard pattern prepared in advance, and transmits a label attached to the standard pattern from the analysis side to the synthesis side as information indicating the spectral envelope characteristic. means for determining a reference sample sequence of a residual waveform of an input audio signal corresponding to a preset reference sampling frequency; and a second means corresponding to a downsampling frequency obtained by reducing the reference sampling frequency at a preset rate. a third means for obtaining a down-sampled sequence of low frequency components of the residual waveform; and the residual waveform is formed while setting a zero-level interpolation point in the down-sampled sequence at the timing of the reference sampling frequency. Estimating the coefficients of a transversal high-frequency reproduction filter to be provided on the synthesis side based on the reference sampling conversion sample sequence of the low-frequency component of and the reference sample sequence of the residual waveform and without using a nonlinear circuit. A residual excitation type vocoder comprising a fourth means for outputting the coefficients to be transmitted to a synthesis side. 2. Compare the spectral envelope characteristics of the input audio signal with a standard pattern prepared in advance, select a plurality of pattern candidates in advance, and determine the coefficients of a transversal high-frequency reproduction filter to be provided on the synthesis side for each pattern candidate. a first means for estimating on the analysis side; a second means for calculating the S/N ratio between the input speech signal and the synthesized speech signal corresponding to each pattern candidate; A third means for transmitting labels attached to pattern candidates with a good S/N ratio from the analysis side to the synthesis side as information indicating spectral envelope characteristics, and a third means for transmitting labels attached to pattern candidates with a good S/N ratio from the analysis side to the synthesis side, and a fourth means for obtaining a reference sample sequence of the difference waveform; and a fourth means for obtaining a downsample sequence of low frequency components of the residual waveform corresponding to a downsampling frequency obtained by reducing the reference sampling frequency at a preset rate. and a reference sampling conversion sample sequence of the low frequency component of the residual waveform formed while setting a zero-level interpolation point in the down-sampling sequence at the timing of the reference sampling frequency and the residual waveform. A sixth means for estimating coefficients of a transversal high-frequency reproduction filter to be provided on the synthesis side based on the reference sample sequence and without going through a nonlinear circuit, and outputting the coefficients to be transmitted to the synthesis side. A residual excitation type vocoder characterized by: