JP3103108B2

JP3103108B2 - Audio coding device

Info

Publication number: JP3103108B2
Application number: JP02340411A
Authority: JP
Inventors: 公生三関; 政巳赤嶺
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1990-02-05
Filing date: 1990-11-30
Publication date: 2000-10-23
Anticipated expiration: 2015-10-23
Also published as: JPH03273300A

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）この発明は、音声信号を高能率に圧縮する音声符号化
装置に係り、特に伝送ビットレートを例えば10kb/s以下
にできる適応密度駆動パルス列に基づく音声符号化装置
に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Field of Industrial Application) The present invention relates to a speech coding apparatus for compressing a speech signal with high efficiency, and in particular, can reduce a transmission bit rate to, for example, 10 kb / s or less. The present invention relates to a speech coding apparatus based on an adaptive density driving pulse train.

（従来の技術）現在、音声信号を10kb/s以下の低ビットレートで伝送
できる符号化技術が盛んに検討されているが、その具体
的方法として一定間隔で並んだパルス列で合成フィルタ
の駆動信号を表現し、これを用いて符号化を行う方式が
ある。この詳細については、PETER KROON氏らによるIEE
E 1986年10月、Vol.ASSP−34,pp.1054−1063に掲載の
“Regular Pulse Excitation A Novel Approach to Eff
ective and Efficient Multipulse Coding of Speech"
と題した論文（文献１）に説明されている。(Prior art) At present, coding techniques capable of transmitting audio signals at a low bit rate of 10 kb / s or less are being actively studied. As a specific method, a driving signal for a synthesis filter is composed of a pulse train arranged at regular intervals. And encoding is performed using this. For more information on this, see IEE by PETER KROON et al.
E October 1986, “Regular Pulse Excitation A Novel Approach to Eff” published in Vol.ASSP-34, pp.1054-1063.
ective and Efficient Multipulse Coding of Speech "
(Reference 1).

この音声符号化方式を第25図および第26図を用いて説
明する。第25図および第26図は符号器と復号器のブロッ
ク図である。This speech coding method will be described with reference to FIGS. 25 and 26. FIG. 25 and FIG. 26 are block diagrams of the encoder and the decoder.

第25図において、入力端子INにはA/D変換された音声
信号系列ｓ（ｎ）が入力される。予測フィルタ１はｓ
（ｎ）の過去の系列と予測パラメータa_i（１≦ｉ≦ｐ）
を用い、次式に示されるような予測残差信号ｒ（ｎ）を
計算し、出力する。In FIG. 25, an A / D converted audio signal sequence s (n) is input to an input terminal IN. The prediction filter 1 is s
(N) past series and prediction parameters a _i (1 ≦ i ≦ p)
Is used to calculate and output a prediction residual signal r (n) as shown in the following equation.

ここで、ｐは予測フィルタ１の次数であり、文献１で
はｐ＝12としている。予測フィルタ１の伝達関数Ａ
（ｚ）は次のように表わされる。 Here, p is the order of the prediction filter 1, and in Reference 1, p = 12. Transfer function A of prediction filter 1
(Z) is expressed as follows.

駆動信号発生回路２は、予め定められた間隔で並んだ
駆動パルス列Ｖ（ｎ）を駆動信号として発生する。この
駆動パルス列Ｖ（ｎ）のパターンの一例を第27図に示
す。この図におけるＫはパルス系列の位相を示し、フレ
ームにおける最初のパルスの位置を表わす。また、横軸
は離散的な時刻を表わす。ここでは、１フレームの長さ
を40サンプル（サンプリング周波数8kHzで5ms）、パル
スの間隔を４サンプルとした場合について示してある。 The drive signal generation circuit 2 generates drive pulse trains V (n) arranged at predetermined intervals as drive signals. FIG. 27 shows an example of the pattern of the driving pulse train V (n). In this figure, K indicates the phase of the pulse sequence, and indicates the position of the first pulse in the frame. The horizontal axis represents discrete times. Here, a case where the length of one frame is 40 samples (5 ms at a sampling frequency of 8 kHz) and the pulse interval is 4 samples is shown.

減算器３は、予測残差信号ｒ（ｎ）と駆動信号Ｖ
（ｎ）との差を計算し、重み付けフィルタ４へ出力す
る。このフィルタ14は聴覚のマスキング効果を利用する
ためにｅ（ｎ）を周波数領域で整形するためのものであ
り、その伝達関数Ｗ（ｚ）は次式で与えられる。The subtractor 3 calculates the prediction residual signal r (n) and the driving signal V
The difference from (n) is calculated and output to the weighting filter 4. This filter 14 is for shaping e (n) in the frequency domain in order to utilize the masking effect of hearing, and its transfer function W (z) is given by the following equation.

重み付けフィルタやマスキング効果に関しては、例え
ば古井貞照著「ディジタル音声処理」1985年東海大学出
版会発行（文献２）に記述されているので、ここでは説
明を省略する。 The weighting filter and the masking effect are described in, for example, "Digital Speech Processing" by Sadateru Furui, published by Tokai University Press in 1985 (Reference 2), and will not be described here.

重み付けフィルタ４によって重み付けされた誤差ｅ′
（ｎ）は、誤差最小化回路５に入力される。この誤差最
小化回路５は、ｅ′（ｎ）の２乗誤差が最小となるよう
に駆動パルス列の振幅と位相を決定する。駆動信号発生
回路２は、これら振幅と位相の情報を基に駆動信号を発
生させる。誤差最小化回路５における駆動パルス列の振
幅と位相の決定手順を文献１の記述に従って、以下に簡
単に説明する。Error e ′ weighted by weighting filter 4
(N) is input to the error minimizing circuit 5. The error minimizing circuit 5 determines the amplitude and phase of the drive pulse train so that the square error of e '(n) is minimized. The drive signal generation circuit 2 generates a drive signal based on the information on the amplitude and the phase. The procedure for determining the amplitude and phase of the drive pulse train in the error minimizing circuit 5 will be briefly described below according to the description in Reference 1.

まず、フレーム長をＬサンプル、１フレーム中の駆動
パルス数をＱ個とし、駆動パルスの位置を表わすＱ×Ｌ
の行例をM_Kとおく。M_Kの要素m_ijは、次のように表わさ
れる。また、Ｋは前述したように駆動パルス列の位相で
ある。First, the frame length is L samples, the number of drive pulses in one frame is Q, and Q × L representing the position of the drive pulse
Let M _K be a line example of. The element m _{ij of} M _K is expressed as follows. K is the phase of the drive pulse train as described above.

m_ij＝1;j＝ｉ×N;K−１のとき m_ij＝0;その他０≦１≦Ｑ−１０≦ｊ≦Ｌ−１（但し、Ｎ＝L/Q）（４）次に、位相Ｋの駆動信号（駆動パルス列）の非零の振
幅を要素とする行ベクトルをb^(K)とおくと、位相Ｋの駆
動信号を表わす行ベクトルu^(K)は、次式のように表わさ
れる。When m _ij = 1; j = i × N; K−1, m _ij = 0; other 0 ≦ 1 ≦ Q−1 0 ≦ j ≦ L−1 (where N = L / Q) (4) Next When a row vector having a non-zero amplitude of a drive signal (drive pulse train) of the phase K as an element is denoted by b ^(K) , a row vector u ^(K) representing the drive signal of the phase K is represented by the following equation. Is represented.

u^(K)＝b^(K)M_K （５）重み付けフィルタ４のインパルス応答を要素とする次
のＬ×Ｌ行列をＨとおく。u ^(K) = b ^(K) M _K (5) Let H be the next L × L matrix having the impulse response of the weighting filter 4 as an element.

このとき、重み付けされた誤差ｅ′（ｎ）を要素とす
る誤差ベクトルe^(k)は、次式で記述される。 At this time, an error vector e ^(k) having the weighted error e '(n) as an element is described by the following equation.

e^(K)＝e⁽⁰⁾−b^(K) （７）（Ｋ＝1,…,N）ここで、e⁽⁰⁾e₀＋rH （８） H_K＝M_KH （９）ベクトルe₀は前フレームにおける重み付けフィルタの
内部状態による重み付けフィルタの出力であり、ベクト
ルｒは予測残差信号ベクトルである。最適な駆動パルス
の振幅を表わすベクトルb^(K)は、次式の２乗誤差Ｅ＝e^(K)e^(K)t （10）をb^(K)で偏微分し、零とおくことにより、次式のように
得られる。e ^(K) = e ⁽⁰⁾ −b ^(K) (7) (K = 1,..., N) where e ⁽⁰⁾ e ₀ + rH (8) H _K = M _K H (9) Vector e ₀ is the output of the weighting filter based on the internal state of the weighting filter in the previous frame, and vector r is the prediction residual signal vector. A vector b ^(K) representing the optimum drive pulse amplitude is obtained by partially differentiating the square error E = e ^(K) e ^{(K) t} (10) of the following equation with b ^(K) and setting it to zero. Is obtained as follows:

b^(k)＝e⁽⁰⁾H_K ^t［H_KH_K ^t］^-1 （11）但し、ｔは転置を示す。 ^{^{b (k) = e (0}} ) H K t [H K H K t] -1 (11) where, t denotes the transpose.

このとき、駆動パルス列の位相Ｋは、次式を各Ｋにつ
いて計算し、E^(K)が最小となるように選ぶ。At this time, the phase K of the drive pulse train is calculated so that the following equation is calculated for each K, and is selected so that E ^(K) is minimized.

E^(K) ＝e⁽⁰⁾［H_K ^t［H_KH_K ^t］^-1H_K］・e^(0)t （12）このようにして駆動パルス列の振幅および位相が決定
される。 ^{^{E (K) = e (0}} ) [H K t [H K H K t] -1 H K] · e (0) t (12) amplitude and phase of the thus drive pulse train is determined.

次に、復号器について説明する。 Next, the decoder will be described.

第26図において、駆動信号発生回路７は第25図の駆動
信号発生回路２と同じものであり、符号器から伝送さ
れ、入力端子６に入力された駆動パルス列の振幅と位相
を基に駆動信号を発生する。合成フィルタ８は、この駆
動信号を入力として合成音声信号ｓ（ｎ）を生成し、出
力端子９へ出力する。この合成フィルタ８は第13図の予
測フィルタ１と逆フィルタの関係にあり、その伝達関数
は1/A（ｚ）である。26, the drive signal generation circuit 7 is the same as the drive signal generation circuit 2 in FIG. 25, and is driven based on the amplitude and phase of the drive pulse train transmitted from the encoder and input to the input terminal 6. Occurs. The synthesis filter 8 receives the drive signal as input, generates a synthesized voice signal s (n), and outputs the synthesized voice signal s (n) to the output terminal 9. This synthesis filter 8 has a relationship of an inverse filter with the prediction filter 1 in FIG. 13, and its transfer function is 1 / A (z).

以上説明した従来の符号化方式において、伝送すべき
情報は合成フィルタ22のパラメータa_i（１≦ｉ≦ｐ）と
駆動パルス列の振幅および位相であり、駆動パルス列の
間隔Ｎ＝L/Qを変えることによって、伝送レートを自由
に設定できる。しかしながら、この従来方式による実験
結果では、伝送レートが10kb/s以下になると合成音声に
雑音が目立ち、品質が悪くなる。特にピッチ周期の短い
女性の声で実験を行ったときの品質劣化が目立つ。この
原因は、駆動パルス列を常に一定の間隔のパルス列で表
現しているためであることを突き止めた。すなわち、音
声信号は有声音の時ピッチによる周期的な信号となるた
め、その予測残差信号もピッチ周期毎にパワーが大きく
なる周期的な信号となる。このように周期的にパワーが
大きくなる予測残差信号では、パワーの大きい部分が重
要な情報を含んでいる。また、音韻等の劣化に伴って音
声信号の相関が変わる部分や、発声の開始部分等、音声
信号のパワーが大きくなる部分では、予測残差信号のパ
ワーもフレーム内で大きくなる。この場合も、残差信号
のパワーの大きい部分は、音声信号の性質が変化した部
分であるので、重要である。In the conventional coding method described above, the information to be transmitted is the parameter a _i (1 ≦ i ≦ p) of the synthesis filter 22 and the amplitude and phase of the drive pulse train, and changes the drive pulse train interval N = L / Q. This allows the transmission rate to be set freely. However, according to the experimental results of the conventional method, when the transmission rate is 10 kb / s or less, noise is conspicuous in the synthesized speech, and the quality is deteriorated. In particular, quality degradation is noticeable when an experiment is performed with a female voice having a short pitch cycle. It has been found that this is because the drive pulse train is always represented by pulse trains at a constant interval. That is, since the audio signal is a periodic signal based on the pitch of the voiced sound, the prediction residual signal is also a periodic signal in which the power increases at each pitch cycle. As described above, in the prediction residual signal whose power periodically increases, a portion where the power is large contains important information. In addition, in a portion where the power of the audio signal increases, such as a portion where the correlation of the audio signal changes due to deterioration of the phoneme or the like, or a start portion of utterance, the power of the prediction residual signal also increases in the frame. Also in this case, the part where the power of the residual signal is large is important because it is the part where the property of the audio signal has changed.

ところが、従来の方式では予測残差信号のパワーがフ
レーム内で変化しているにも関わらず、フレーム内で常
に一定の間隔を持つ駆動パルス列で合成フィルタを駆動
して合成音声を得ているため、合成音声の品質が著しく
劣化してしまう。However, in the conventional method, although the power of the prediction residual signal changes in the frame, the synthesis filter is driven by a drive pulse train having a constant interval in the frame to obtain the synthesized voice. However, the quality of the synthesized speech is significantly deteriorated.

（発明が解決しようとする課題）上述したように、従来の音声符号化方式はフレーム内
で常の一定の間隔を持つ駆動パルス列で合成フィルタを
駆動しているため、伝送レートが例えば10kb/s以下にな
ると合成音声の品質が劣化するという問題があった。(Problems to be Solved by the Invention) As described above, in the conventional speech coding method, since the synthesis filter is driven by a driving pulse train having regular constant intervals in a frame, the transmission rate is, for example, 10 kb / s. Below this, there is a problem that the quality of synthesized speech deteriorates.

本発明はこのような問題に鑑みてなされたもので、10
kb/s以下というような低い伝送レートにおいても高品質
の合成音声が得られる音声符号化装置を提供することを
目的とする。The present invention has been made in view of such a problem.
It is an object of the present invention to provide a speech encoding device capable of obtaining high-quality synthesized speech even at a low transmission rate such as kb / s or less.

［発明の構成］（課題を解決するための手段）本発明は駆動信号により合成フィルタを駆動して合成
音声を得る音声符号化装置において、パルス密度（パル
ス間隔）が所定の区間単位で可変である駆動パルス列、
具体的には駆動信号のフレームを複数のサブフレームに
分割し、パルス密度が各サブフレーム内または各フレー
ム内では等間隔で、サブフレーム単位またはフレーム単
位で可変である駆動パルス列によって駆動信号を構成す
る。駆動パルス列の振幅または振幅および位相は、駆動
信号により駆動される合成フィルタの出力信号と入力音
声信号との聴感重み付き誤差信号のパワーが最小となる
ように決定される。（ａ）予測残差信号（短期予測残差
信号、ピッチ予測残差信号、または短期予測残差信号に
ピッチ予測を施して得られるピッチ予測残差信号）を聴
感重みフィルタに通した信号、（ｂ）短期予測残差信号
にピッチ予測を施して得られるピッチ予測残差信号、
（ｃ）予測残差信号（短期予測残差信号、ピッチ予測残
差信号、または短期予測残差信号にピッチ予測を施して
得られるピッチ予測残差信号）に基づいて算出された駆
動信号の各サブフレームに割り当てられるビット配分
値、（ｄ）予測残差信号（短期予測残差信号、ピッチ予
測残差信号、または短期予測残差信号にピッチ予測を施
して得られるピッチ予測残差信号）のパワーまたは零交
差数に非線形の重み付けを施した関数値、に基づいて決
定される。[Constitution of the Invention] (Means for Solving the Problems) The present invention relates to a speech coding apparatus for obtaining a synthesized speech by driving a synthesis filter with a drive signal, wherein a pulse density (pulse interval) is variable in a predetermined section unit. A certain drive pulse train,
Specifically, the frame of the drive signal is divided into a plurality of subframes, and the drive signal is constituted by a drive pulse train in which the pulse density is variable in each subframe or in each frame at equal intervals and in subframe or frame units. I do. The amplitude or the amplitude and the phase of the drive pulse train are determined so that the power of the perceptually weighted error signal between the output signal of the synthesis filter driven by the drive signal and the input audio signal is minimized. (A) a signal obtained by passing a prediction residual signal (short-term prediction residual signal, pitch prediction residual signal, or pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal) through an auditory weighting filter, b) pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal;
(C) Each of the drive signals calculated based on the prediction residual signal (short-term prediction residual signal, pitch prediction residual signal, or pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal) (D) prediction residual signal (short-term prediction residual signal, pitch prediction residual signal, or pitch prediction residual signal obtained by performing pitch prediction on short-term prediction residual signal) The power or the number of zero crossings is determined based on a non-linear weighted function value.

また、パルス密度を所定の密度パターンに従ってサブ
フレーム単位に設定し、かつフレーム単位で密度パター
ンが可変である駆動パルス列により駆動信号を構成して
もよい。その場合、（ａ）〜（ｄ）によって得られた結
果に基づいて駆動パルス列の密度パターンが予め容易さ
れた複数のパターンから選択されて決定される。Alternatively, the pulse density may be set in sub-frame units according to a predetermined density pattern, and the drive signal may be configured by a drive pulse train whose density pattern is variable in frame units. In this case, the density pattern of the drive pulse train is selected and determined from a plurality of patterns that are facilitated in advance based on the results obtained in (a) to (d).

（作用）予測残差信号（短期予測残差信号、ピッチ予測残差信
号、または短期予測残差信号にピッチ予測を施して得ら
れるピッチ予測残差信号）を聴感重みフィルタに通した
信号に基づいて駆動パルス列のパルス密度を決定する
と、聴感重みフィルタから聴感的に重要な情報の振幅が
強調された重み付きの残差信号が得られるため、これを
用いることで聴感的により好適なパルス密度を決定する
ことができ、合成音声の品質がより向上する。(Operation) Based on a signal obtained by passing a prediction residual signal (a short-term prediction residual signal, a pitch prediction residual signal, or a pitch prediction residual signal obtained by performing pitch prediction on a short-term prediction residual signal) through an auditory weighting filter. When the pulse density of the drive pulse train is determined by the above method, a weighted residual signal in which the amplitude of audibly important information is emphasized is obtained from the audibility weighting filter. Can be determined, and the quality of the synthesized speech is further improved.

また、短期予測残差信号にピッチ予測を施して得られ
るピッチ予測残差信号、すなわち通常の短期予測にピッ
チ予測を組み合わせた場合の残差信号に基づいて駆動パ
ルス列のパルス密度を決定することにより、単に短期予
測残差信号のみに基づいてパルス密度を決定する方式に
比較して、合成音声をより低歪とするようなパルス密度
を決定でき、結果的に合成音声の品質が飛躍的に向上す
る。Further, by determining the pulse density of the driving pulse train based on the pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal, that is, the residual signal obtained by combining pitch prediction with normal short-term prediction. Compared to the method that determines the pulse density based only on the short-term prediction residual signal, it is possible to determine the pulse density that reduces the distortion of the synthesized speech, resulting in a dramatic improvement in the quality of the synthesized speech. I do.

また、予測残差信号（短期予測残差信号、ピッチ予測
残差信号、または短期予測残差信号にピッチ予測を施し
て得られるピッチ予測残差信号）に基づいて算出された
駆動信号の各サブフレームに割り当てられるビット配分
値に基づいて駆動パルス列のパルス密度を決定すると、
ビット配分値の大小により聴感的に重要な区間（サブフ
レーム）であるか否かを判断することが可能であるた
め、これに基づいてパルス密度を決定することで、合成
音声において重要な区間をより精度よく表現できるよう
なパルス密度を決定でき、合成音声の品質が向上する。Each sub-signal of the drive signal calculated based on the prediction residual signal (short-term prediction residual signal, pitch prediction residual signal, or pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal). When the pulse density of the drive pulse train is determined based on the bit allocation value assigned to the frame,
Since it is possible to determine whether or not the section (subframe) is audibly important depending on the size of the bit allocation value, the pulse density is determined based on this, so that the important section in the synthesized speech can be determined. A pulse density that can be expressed more accurately can be determined, and the quality of synthesized speech is improved.

さらに、予測残差信号（短期予測残差信号、ピッチ予
測残差信号、または短期予測残差信号にピッチ予測を施
して得られるピッチ予測残差信号）のパワーまたは零交
差数に非線形の重み付けを施した関数値に基づいて駆動
パルス列のパルス密度を決定すると、通常の線形の重み
付け操作では得られない微細な重み調整が可能となるた
め、この重み付けの関数値に基づくと、より精度よくパ
ルス密度を決定することができ、結果的として単純に短
期予測残差信号のみに基づいてパルス密度を決定する方
式に比較して合成音声の品質が向上する。Further, the power or the number of zero crossings of the prediction residual signal (short-term prediction residual signal, pitch prediction residual signal, or pitch prediction residual signal obtained by performing pitch prediction on the short-term prediction residual signal) is nonlinearly weighted. If the pulse density of the drive pulse train is determined based on the applied function value, fine weight adjustment that cannot be obtained by a normal linear weighting operation can be performed. Therefore, based on the weighted function value, the pulse density can be more accurately determined. As a result, the quality of the synthesized speech is improved as compared with the method of simply determining the pulse density based on only the short-term prediction residual signal.

すなわち、本発明では駆動パルス列の密度が、聴感的に
重要な情報または多くの情報が含まれるサブフレームま
たはフレームでは密に、そうでないサブフレームまたは
フレームでは粗くというようにサブフレームまたはフレ
ーム毎に適応的に変化することにより、合成音声の品質
が向上する。特に、駆動パルス列の密度をサブフレーム
単位で変化させれば、音声の性質が急激に変化する部分
での合成音声の品質が向上し、フレーム単位で変化させ
れば音声の性質が比較的緩やかに変化する部分での合成
音声の品質が向上する。特に、駆動パルス列の密度をサ
ブフレーム単位で変化させれば、音声の性質が急激に変
化する部分での合成音声の品質が向上し、フレーム単位
で変化させれば音声の性質が比較的緩やかに変化する部
分での合成音声の品質が向上する。That is, in the present invention, the density of the drive pulse train is adapted for each sub-frame or frame such that the density of the drive pulse train is dense in a sub-frame or frame containing information that is perceptually significant or a lot of information, and coarse in a sub-frame or frame that is not. As a result, the quality of synthesized speech is improved. In particular, if the density of the drive pulse train is changed in units of subframes, the quality of the synthesized voice is improved in the portion where the characteristics of the voice changes rapidly, and if the density is changed in units of frames, the characteristics of the voice are relatively moderate. The quality of the synthesized speech in the changing part is improved. In particular, if the density of the drive pulse train is changed in units of subframes, the quality of the synthesized voice is improved in the portion where the characteristics of the voice changes rapidly, and if the density is changed in units of frames, the characteristics of the voice are relatively moderate. The quality of the synthesized speech in the changing part is improved.

また、パルス密度を所定の密度パターンに従ってサブ
フレーム単位に設定し、フレーム単位で密度パターンが
可変である駆動パルス列により駆動信号を構成して、駆
動パルス列の密度パターンを音声の性質に応じて適応的
に変化させれば、音声の性質がどのように変化する部分
でも合成音声の品質が全体的に向上することになる。In addition, the pulse density is set in subframe units according to a predetermined density pattern, a drive signal is formed by a drive pulse train whose density pattern is variable in frame units, and the density pattern of the drive pulse train is adaptively adjusted according to the nature of audio. , The quality of the synthesized speech is improved as a whole in any part where the properties of the speech change.

（実施例）以下、本発明の実施例を図面を参照しながら説明す
る。(Example) Hereinafter, an example of the present invention will be described with reference to the drawings.

第１図および第２図はそれぞれ本発明の原理的構成を
示す第１の実施例に係る符号化装置および復号化装置の
ブロック図である。FIG. 1 and FIG. 2 are block diagrams of an encoding device and a decoding device according to a first embodiment, respectively, showing the basic configuration of the present invention.

第１図において、フレームバッファ11は入力端子10に
入力される音声信号を１フレーム分蓄積する回路であ
り、第１図の各ブロックはフレームバッファ11を用いて
フレーム毎またはサブフレーム毎に以下の処理を行う。In FIG. 1, a frame buffer 11 is a circuit for accumulating one frame of the audio signal input to the input terminal 10. Each block in FIG. 1 uses the frame buffer 11 for each frame or each subframe. Perform processing.

予測パラメータ計算回路12は、予測パラメータを公知
の方法を用いて計算する。予測フィルタ14が第３図に示
すような長時間予測フィルタ41と短時間予測フィルタ42
を縦続接続して構成される場合、予測パラメータ計算回
路12はピッチ周期とピッチ予測係数および線形予測係数
（αパラメータまたはＫパラメータ）を自己相関法や共
分散法等の公知の方法で計算する。計算法については、
例えば前記文献２（古井貞照著「ディジタル音声処理」
1985年東海大学出版会発行）に記述されている。計算さ
れた予測パラメータは、予測パラメータ符号化回路13へ
入力される。予測パラメータ符号化回路13は、予測パラ
メータを予め定められた量子化ビット数に基づいて符号
化し、この符号をマルチプレクサ25に出力すると共に、
復号値を予測フィルタ14と合成フィルタ15および聴感重
みフィルタ20に出力する。予測フィルタ14は、入力の音
声信号と予測パラメータを入力として予測残差信号を計
算し、それを密度パターン選択回路15へ出力する。The prediction parameter calculation circuit 12 calculates a prediction parameter using a known method. The prediction filter 14 includes a long-term prediction filter 41 and a short-term prediction filter 42 as shown in FIG.
Are connected in cascade, the prediction parameter calculation circuit 12 calculates the pitch period, the pitch prediction coefficient, and the linear prediction coefficient (α parameter or K parameter) by a known method such as an autocorrelation method or a covariance method. For the calculation method,
For example, Ref. 2 (Sadateru Furui, "Digital Audio Processing")
1985, published by Tokai University Press). The calculated prediction parameters are input to the prediction parameter coding circuit 13. The prediction parameter encoding circuit 13 encodes the prediction parameter based on a predetermined number of quantization bits, and outputs this code to the multiplexer 25.
The decoded value is output to the prediction filter 14, the synthesis filter 15, and the perceptual weight filter 20. The prediction filter 14 receives the input speech signal and the prediction parameter as inputs, calculates a prediction residual signal, and outputs the signal to the density pattern selection circuit 15.

密度パターン選択回路15は、駆動パルス列の密度パタ
ーンまたは密度をサブフレーム単位またはフレーム単位
で選択する回路である。例えばサブフレーム単位で選択
する場合を例にとると、まず１フレームの予測残差信号
を複数個のサブフレームに分割し、それぞれのサブフレ
ームの予測残差信号の２乗和を計算する。次に、予測残
差信号の２乗和を基に各サブフレームでの駆動パルス列
信号の密度（パルス間隔）パターンを求める。その具体
的な方法の一例は、密度パターンとしてパルス間隔が最
短の２種類、パルス間隔が長いサブフレームの個数とパ
ルス間隔が短いサブフレームの個数を予め設定してお
き、予測残差信号の２乗和が大きいサブフレームの順に
パルス間隔が短くなる密度パターンを選択する方法であ
る。駆動パルス列のフレーム単位で変える場合は、予測
残差信号の２乗和が閾値より大きい場合に短いパルス間
隔を選択し、閾値より小さい場合に長いパルス間隔を選
択すればよい。The density pattern selection circuit 15 is a circuit that selects the density pattern or density of the drive pulse train in subframe units or frame units. For example, in a case where selection is performed in units of subframes, for example, a prediction residual signal of one frame is first divided into a plurality of subframes, and the sum of squares of the prediction residual signal of each subframe is calculated. Next, a density (pulse interval) pattern of the drive pulse train signal in each subframe is obtained based on the sum of squares of the prediction residual signal. One example of the specific method is that two types of density patterns having the shortest pulse interval, the number of subframes having a long pulse interval and the number of subframes having a short pulse interval are set in advance, and the density of the prediction residual signal is set to two. This is a method of selecting a density pattern in which the pulse interval becomes shorter in the order of the subframe having a larger sum of squares. When the drive pulse train is changed in frame units, a short pulse interval may be selected when the sum of squares of the prediction residual signal is larger than a threshold, and a long pulse interval may be selected when the sum of squares is smaller than the threshold.

ゲイン計算回路27は、密度パターン選択回路15でサブ
フレーム単位で密度パターンが選択される場合、選択さ
れた密度パターンの情報を入力とし、駆動信号のゲイン
を例えばパルス間隔の短い全サブフレームの予測残差信
号の標準偏差とパルス間隔の長い全サブフレームの予測
残差信号の標準偏差を用いて２種類求める。また、密度
パターン選択回路15でフレーム単位で密度が選択される
場合、選択された密度の情報を入力とし、駆動信号のゲ
インを予測残差信号の標準偏差を用いて求める。この標
準偏差をσ_ｅとおくと、ゲインＧはにより計算される。得られた密度パターンまたは密度
（パルス間隔）とゲインはそれぞれ符号化回路16,28で
符号化され、マルチプレクサ25に入力されると共に、そ
れらの復号値が駆動信号生成回路17へ入力される。駆動
信号生成回路17は、符号化回路16,28から入力される密
度パターンとゲイン、コードブック24から入力される駆
動パルスの正規化された振幅、および位相探索回路22か
ら入力される駆動パルスの位相を基に、サブフレーム単
位またはフレーム単位で密度が可変の駆動信号を生成す
る。When a density pattern is selected in units of subframes in the density pattern selection circuit 15, the gain calculation circuit 27 receives information of the selected density pattern as input and predicts the gain of the drive signal for all subframes having a short pulse interval, for example. Two types are obtained using the standard deviation of the residual signal and the standard deviation of the predicted residual signal of all subframes having a long pulse interval. When the density is selected in frame units by the density pattern selection circuit 15, information on the selected density is input, and the gain of the drive signal is obtained using the standard deviation of the prediction residual signal. If this standard deviation is σ _e , the gain G is Is calculated by The obtained density pattern or density (pulse interval) and gain are encoded by encoding circuits 16 and 28, respectively, and input to multiplexer 25, and their decoded values are input to drive signal generation circuit 17. The drive signal generation circuit 17 includes a density pattern and a gain input from the encoding circuits 16 and 28, a normalized amplitude of the drive pulse input from the codebook 24, and a drive pulse input from the phase search circuit 22. Based on the phase, a drive signal having a variable density is generated in subframe units or frame units.

第４図に駆動信号生成回路17で作成される駆動信号の
例を示す。ｍ番目のサブフレームにおける駆動パルスの
ゲインをG^(m)、駆動パルスの正規化された振幅を
g_i ^(m)、パルス数をQ_m、パルス間隔をD_m、パルスの位相
をK_m、サブフレームの長さをＬとおくと、密度がサブフ
レーム単位で選択される駆動信号e_X ^(m)（ｎ）は次式で
記述できる。FIG. 4 shows an example of the drive signal generated by the drive signal generation circuit 17. G ^(m) is the gain of the drive pulse in the m-th subframe, and the normalized amplitude of the drive pulse is G ^(m) .
Assuming that g _i ^(m) , the number of pulses is Q _m , the pulse interval is D _m , the pulse phase is K _m , and the length of the subframe is L, the drive signal e _X ^(the density of which is selected in subframe units ^{) m)} (n) can be described by the following equation.

なお、位相K_mはサブフレームにおけるパルスの先頭位
置であり、σ（ｎ）はクロネッカのデルタ関数である。 Note that the phase K _m is the leading position of the pulse in the subframe, and σ (n) is the Kronecker delta function.

また、コードブック24に格納されているコードベクト
ルをC_i＝｛C_i（１）,C_i（２），…C_i（ｌ）｝ｂとする
と、駆動信号e_X（ｎ）は次式で記述できる。Further, the codebook 24 code vectors stored in _{_{C i = {C i (1}} ), C i (2), ... C i (l)} When is b, the driving signal e _X (n) by the following equation Can be described by

同様に、位相Ｋはサブフレームにおけるパルスの先頭
位置、σはクロネッカのデルタ関数である。 Similarly, the phase K is the leading position of the pulse in the subframe, and σ is the Kronecker delta function.

駆動信号生成回路17で生成された駆動信号は合成フィ
ルタ18に入力され、合成信号が出力される。合成フィル
タ18は、予測フィルタ14と逆フィルタの関係にある。減
算回路19の出力である入力音声信号と合成信号との誤差
は、聴感重みフィルタ20によりそのスペクトルが変形さ
れた後、２乗誤差計算回路21へ入力される。聴感重みフ
ィルタ20は、伝達関数がで表わされるフィルタで、従来例における重み付けフィ
ルタと同様に聴感のマスキング効果を利用するためのも
のであり、文献２に詳述されているので説明は省略す
る。The drive signal generated by the drive signal generation circuit 17 is input to the synthesis filter 18, and the synthesized signal is output. The synthesis filter 18 has a relationship of an inverse filter with the prediction filter 14. The error between the input speech signal and the synthesized signal, which is the output of the subtraction circuit 19, is input to the square error calculation circuit 21 after its spectrum is transformed by the audibility weighting filter 20. The perceptual weight filter 20 has a transfer function This filter is used to utilize the masking effect of the audibility like the weighting filter in the conventional example, and is described in detail in the literature 2, so the description is omitted.

２乗誤差計算回路21は、聴感重み付けされた誤差信号
の２乗和をコードブック24に蓄積されたコードベクトル
毎に、および位相探索回路22から出力される駆動パルス
の位相毎に計算し、計算結果を位相探索回路22と振幅探
索回路23へ出力する。振幅探索回路23は、位相探索回路
22から出力される駆動パルスの位相１個毎に、誤差信号
の２乗和を最小とするコードワードのインデックスをコ
ードブック24から探索し、２乗和の最小値を位相探索回
路22へ出力すると共に、２乗和を最小とするコードワー
ドのインデックスを保持する。位相探索回路22は、選択
された密度パターンの情報を入力とし、駆動パルス列の
位相K_mを１≦K_m≦D_mの範囲で変化させ、その値を駆動信
号生成回路17に与え、D_m個の位相に対してそれぞれ決ま
る誤差信号の２乗和の最小値を振幅探索回路23から受
け、そのD_m個の最小値の中で最も小さい２乗和に対応す
る位相をマルチプレクサ25に出力すると同時に、振幅探
索回路23にその位相を知らせる。振幅探索回路23では、
その位相に対応するコードワードのインデックスをマル
チプレクサ25に出力する。The square error calculation circuit 21 calculates the sum of squares of the perceptually weighted error signal for each code vector stored in the codebook 24 and for each phase of the drive pulse output from the phase search circuit 22. The result is output to the phase search circuit 22 and the amplitude search circuit 23. The amplitude search circuit 23 is a phase search circuit
For each phase of the drive pulse output from 22, the index of the code word that minimizes the sum of squares of the error signal is searched from the codebook 24, and the minimum value of the sum of squares is output to the phase search circuit 22. In addition, an index of a code word that minimizes the sum of squares is held. The phase search circuit 22 receives the information on the selected density pattern as input, changes the phase K _m of the drive pulse train in the range of 1 ≦ K _m ≦ D _m , gives the value to the drive signal generation circuit 17, and provides D _m The minimum value of the sum of squares of the error signal determined for each of the phases is received from the amplitude search circuit 23, and the phase corresponding to the smallest sum of the squares of the D _m minimum values is output to the multiplexer 25. At the same time, the amplitude search circuit 23 is notified of the phase. In the amplitude search circuit 23,
The index of the codeword corresponding to the phase is output to the multiplexer 25.

マルチプレクサ25は予測パラメータ、密度パターン、
ゲイン、駆動パルスの位相および振幅の符号を多重化
し、出力端子26を介して伝送路へ出力する。なお、減算
回路19の出力を聴感重みフィルタ20を介さずに直接２乗
誤差計算回路21へ入力してもよい。Multiplexer 25 contains prediction parameters, density patterns,
The gain, the code of the phase and the amplitude of the drive pulse are multiplexed and output to the transmission line via the output terminal 26. Note that the output of the subtraction circuit 19 may be directly input to the square error calculation circuit 21 without passing through the audibility weighting filter 20.

次に、第２図に示す復号化装置について説明する。第
２図において、デマルチプレクサ31は入力端子30から入
力された符号を予測パラメータ、密度パターン、ゲイ
ン、駆動パルスの位相・振幅の符号に分離する。復号化
回路32、37はそれぞれ前記駆動パルスの密度パターンと
駆動パルスのゲインの符号を復号し、駆動信号生成回路
33へ出力する。コードブック35は第１図の符号化装置内
のコードブック24と同じものであり、送られた駆動パル
スの振幅のインデックスに対応するコードワードを駆動
信号生成回路33へ出力する。予測パラメータ復号化回路
36は、第１図の予測パラメータ符号化回路13で符号化さ
れた予測パラメータの符号を復号し、合成フィルタ34へ
出力する。駆動信号生成回路33は、符号化装置内の駆動
信号生成回路17と同様に、入力された駆動パルスの正規
化された振幅と駆動パルスの位相を基に、サブフレーム
単位で密度が可変の駆動信号を生成する。合成フィルタ
34は符号化装置内の合成フィルタ18と同じものであり、
駆動信号と予測パラメータを受けて、合成信号をバッフ
ァ38へ出力する。バッファ38は、入力される信号をフレ
ーム毎に結合し、合成信号を出力端子39へ出力する。Next, the decoding device shown in FIG. 2 will be described. In FIG. 2, a demultiplexer 31 separates a code input from an input terminal 30 into a prediction parameter, a density pattern, a gain, and a code of a phase and an amplitude of a driving pulse. Decoding circuits 32 and 37 decode the density pattern of the driving pulse and the sign of the gain of the driving pulse, respectively, and generate a driving signal generation circuit.
Output to 33. The code book 35 is the same as the code book 24 in the encoding apparatus of FIG. 1, and outputs a code word corresponding to the index of the amplitude of the transmitted drive pulse to the drive signal generation circuit 33. Prediction parameter decoding circuit
36 decodes the code of the prediction parameter coded by the prediction parameter coding circuit 13 in FIG. The drive signal generation circuit 33, similar to the drive signal generation circuit 17 in the encoding device, has a drive whose density is variable in subframe units based on the normalized amplitude of the input drive pulse and the phase of the drive pulse. Generate a signal. Synthesis filter
34 is the same as the synthesis filter 18 in the encoding device,
Upon receiving the drive signal and the prediction parameter, the composite signal is output to the buffer 38. The buffer 38 combines the input signals for each frame and outputs the combined signal to the output terminal 39.

第５図は本発明の第２の実施例に係る符号化装置のブ
ロック図である。この実施例は第１図の符号化装置と同
一の機能を有しながら、駆動信号のパルス列の符号化に
要する計算量を約1/2に低減できるようにしたものであ
る。FIG. 5 is a block diagram of an encoding device according to a second embodiment of the present invention. This embodiment has the same function as that of the encoding apparatus shown in FIG. 1, but is capable of reducing the amount of calculation required for encoding the pulse train of the drive signal to about half.

以下、この計算量低減の原理について簡単に説明す
る。第１図の２乗誤差計算回路21に入力される聴感重み
付き誤差信号e_W（ｎ）は e_W（ｎ）＝［ｓ（ｎ）−ex_C（ｎ）＊ｈ（ｎ）］＊Ｗ（ｎ）（15）ここで、ｓ（ｎ）は入力音声信号、ex_C（ｎ）は駆動
信号の候補、ｈ（ｎ）は合成フィルタ18のインパルス応
答、Ｗ（ｎ）は聴感重みフィルタ20のインパルス応答を
それぞれ示し、また＊は時間域の畳み込み演算を表わ
す。Hereinafter, the principle of the calculation amount reduction will be briefly described. The perceptually weighted error signal e _W (n) input to the square error calculating circuit 21 in FIG. 1 is e _W (n) = [s (n) −ex _C (n) * h (n)] * W (N) (15) Here, s (n) is an input voice signal, ex _C (n) is a candidate for a driving signal, h (n) is an impulse response of the synthesis filter 18, and W (n) is an audibility weighting filter 20. , And * represents a time domain convolution operation.

（15）式の両辺をｚ変換すると、 E_W（ｚ）＝［Ｓ（ｚ）−Ex_C（ｚ）Ｈ（ｚ）］Ｗ（ｚ）（16）となる。When both sides of the equation (15) are z-transformed, E _W (z) = [S (z) −Ex _C (z) H (z)] W (z) (16)

（16）式におけるＨ（ｚ）,W（ｚ）はそれぞれ予測フ
ィルタ14の伝達関数Ａ（ｚ）を用いて、と定義されるので、（17）（18）式を（16）式に代入す
ると、次式が得られる。H (z) and W (z) in the equation (16) are obtained by using the transfer function A (z) of the prediction filter 14, respectively. By substituting equations (17) and (18) into equation (16), the following equation is obtained.

これを逆ｚ変換すると、次式が得られる。 When this is inverse z-transformed, the following equation is obtained.

e_W（ｎ）＝ｘ（ｎ）ex_C（ｎ）＊h_W（ｎ）（20）ここで、ｘ（ｎ）は聴感重み付け入力信号、ex
_C（ｎ）は駆動信号の候補、h_W（ｎ）は1/A（z/γ）の伝
達関数を持つ聴感重みフィルタのインパルス応答を示
す。e _W (n) = x (n) ex _C (n) * h _W (n) (20) where x (n) is an auditory weighting input signal, ex
_C (n) is a candidate of the driving signal, h _W (n) represents the impulse response of the perceptual weighting filter having a transfer function 1 / A (z / γ) .

（15）式と（20）式を比較すると、（15）式では聴感
重み付き誤差信号e_W（ｎ）を計算するのに１つの駆動信
号候補ex_C（ｎ）につき２つのフィルタの畳み込み演算
が必要であるが、（20）式では１つのフィルタの畳み込
み演算でよいことが分かる。実際の符号化処理では、数
百〜数千の駆動信号の候補について聴感重み付き誤差信
号を計算するため、この部分の計算量が符号化装置の全
計算量の大部分を占めている。従って、（15）式の代わ
りに（20）式を用いるように符号化装置の構成を変えた
場合、1/のオーダで符号化に要する計算量が低減される
ため、符号化装置の実用化が一層容易となる。Comparing Expressions (15) and (20), in Expression (15), the convolution operation of two filters per one drive signal candidate ex _C (n) is used to calculate the perceptual weighted error signal e _W (n). Is required, but it can be seen that the convolution operation of one filter is sufficient in the equation (20). In an actual encoding process, perceptually weighted error signals are calculated for hundreds to thousands of drive signal candidates, and this calculation amount occupies most of the total calculation amount of the encoding device. Therefore, when the configuration of the encoding apparatus is changed to use equation (20) instead of equation (15), the amount of calculation required for encoding is reduced on the order of 1 / Becomes easier.

第５図において、第１図と同一番号が付されているブ
ロックは第１図と同一機能を有するため、ここでは説明
を省略する。伝達関数が1/A（z/γ）の第１の聴感重み
フィルタ51は予測パラメータを入力とし、予測フィルタ
14からの予測残差信号ｒ（ｎ）を受け、聴感重み付き入
力信号ｘ（ｎ）を出力する。一方、第１の聴感重みフィ
ルタ51と同一特性の第２の聴感重みフィルタ52は予測パ
ラメータを入力とし、駆動信号生成回路17からの駆動信
号の候補ex_C（ｎ）を受けて聴感重み付き合成信号候補x
_C（ｎ）を出力する。減算回路53は聴感重み付き入力信
号ｘ（ｎ）と聴感重み付き合成信号候補x_C（ｎ）との誤
差、すなわち聴感重み付き誤差信号e_W（ｎ）を２乗誤差
計算回路21へ出力する。In FIG. 5, the blocks denoted by the same reference numerals as those in FIG. 1 have the same functions as those in FIG. A first auditory weighting filter 51 having a transfer function of 1 / A (z / γ) receives a prediction parameter as an input,
It receives the prediction residual signal r (n) from 14 and outputs a perceptually weighted input signal x (n). On the other hand, a second perceptual weight filter 52 having the same characteristics as the first perceptual weight filter 51 receives prediction parameters as input, receives a drive signal candidate ex _C (n) from the drive signal generation circuit 17, and receives perceptual weighted synthesis. Signal candidate x
_C (n) is output. The subtraction circuit 53 outputs an error between the perceptually weighted input signal x (n) and the perceptually weighted synthesized signal candidate x _C (n), that is, a perceptually weighted error signal e _W (n) to the square error calculating circuit 21. .

第６図は本発明の第３の実施例に係る符号化装置のブ
ロック図である。この符号化装置は、第５図の符号化装
置と同一の機能を有しながら、駆動パルスのゲインを閉
ループ型で最適に決定することができるようにし、さら
に合成音声の品質を改良したものである。FIG. 6 is a block diagram of an encoding device according to a third embodiment of the present invention. This encoder has the same function as the encoder shown in FIG. 5, but allows the gain of the drive pulse to be optimally determined in a closed-loop manner, and further improves the quality of synthesized speech. is there.

第１図と第５図の符号化装置においては、駆動パルス
のゲインは入力信号の予測残差信号の標準偏差を用いて
正規化されたコードブックから引き出される全てのコー
ドベクトルに共通のゲインＧを乗じて、位相Ｊ、コード
ブックのインデックスＩを探索する構成となっていた。
この方法は、確定したゲインＧに対して最適な位相Ｊと
インデックスＩが選ばれるが、ゲイン、位相およびイン
デックスが同時に最適化されている訳ではない。ゲイ
ン、位相およびインデックスを同時に最適化できれば、
さらに駆動パルスを精度良く表現できるため、合成音声
の品質が大幅に向上する。In the encoding apparatus of FIGS. 1 and 5, the gain of the driving pulse is the gain G common to all code vectors extracted from the codebook normalized using the standard deviation of the prediction residual signal of the input signal. , And the phase J and the index I of the codebook are searched.
In this method, the optimum phase J and index I are selected for the determined gain G, but the gain, phase and index are not simultaneously optimized. If gain, phase and index can be optimized simultaneously,
Further, since the drive pulse can be expressed with high accuracy, the quality of the synthesized speech is greatly improved.

以下にゲイン、位相およびインデックスを効率良く同
時に最適化する方法の原理について説明する。Hereinafter, the principle of a method for simultaneously optimizing the gain, phase, and index efficiently will be described.

前記（20）式は、次の（21）式に書き替えることがで
きる。The above equation (20) can be rewritten as the following equation (21).

e_W（ｎ）＝ｘ（ｎ）−G_ij・x_j ⁽ⁱ⁾（ｎ）（21）ここで、e_W（ｎ）は聴感重み付き誤差信号、ｘ（ｎ）
は聴感重み付き入力信号、G_ijはインデックスi,位相ｊ
の駆動パルスに最適なゲイン、x_j ⁽ⁱ⁾（ｎ）はインデッ
クスi,位相ｊのゲインを乗じない駆動パルスを、前述の
1/A（z/γ）の伝達関数の聴感重みフィルタで重み付け
した聴感重み付け合成信号候補を示す。聴感重み付き誤
差信号のパワーを最適ゲインで偏微分した値∂E_W/∂G_ijを零とおくこと
により、最適ゲインG_ijが決定される。e _W (n) = x (n) −G _ij · x _j ⁽ⁱ⁾ (n) (21) where e _W (n) is an error signal weighted by auditory sense, x (n)
Is the input signal with auditory weight, G _ij is the index i, phase j
X _j ⁽ⁱ⁾ (n) is a drive pulse that is not multiplied by the gain of the index i and the phase j.
FIG. 9 shows auditory weighted synthesized signal candidates weighted by an auditory weight filter of a transfer function of 1 / A (z / γ). FIG. Power of perceptually weighted error signal The optimal gain G _ij is determined by setting the value ∂E _W / ∂G _ij obtained by partially differentiating with the optimal gain to zero.

とおくと、（23）式はと表わすことができる。（26）式を（22）式に代入する
と、聴感重み付き誤差信号のパワーの極小値は次式で表
わすことができる。 In other words, equation (23) is Can be expressed as By substituting equation (26) into equation (22), the minimum value of the power of the perceptually weighted error signal can be expressed by the following equation.

（27）式の聴感重み付き誤差信号のパワーを最小化す
るインデックスｉおよび位相ｊは、｛A_j ⁽ⁱ⁾｝²/B_j ⁽ⁱ⁾を
最大にするインデックスｉおよび位相ｊと等しい。従っ
て、最適なインデックスＩと位相ＪおよびゲインG_IJを
同時に求めるには、一例として、まずインデックスi,位
相ｊの候補について（24）（25）式によりA⁽ⁱ⁾ _j,B⁽ⁱ⁾ _j
をそれぞれ求め、次に｛A_j ⁽ⁱ⁾｝²/B_j ⁽ⁱ⁾が最大となるイ
ンデックスI,位相Ｊの組を探索し、（26）式を用いてG
_IJを求め符号化すればよい。 The index i and the phase j that minimize the power of the perceptually weighted error signal in equation (27) are equal to the index i and the phase j that maximize ｛A _j ⁽ⁱ⁾ ｝ ² / B _j ⁽ⁱ⁾ . Therefore, in order to simultaneously obtain the optimal index I, the phase J, and the gain _GIJ , as an example, first, for the candidate for the index i and the phase j, A ⁽ⁱ⁾ _j , B ⁽ⁱ⁾ _{j are obtained} by the equations (24) and (25)
Then, a set of an index I and a phase J that maximizes ｛A _j ⁽ⁱ⁾ ｝ ² / B _j ⁽ⁱ⁾ is searched, and G is calculated using equation (26).
_What is necessary is _just to obtain _IJ and encode it.

第６図の符号化装置は、上記のインデックス、位相お
よびゲインを同時に最適化する方法を組み込んでいる点
だけが第５図の符号化装置と異なるため、第５図と同一
機能を有するブロックは第５図と同一番号を付して説明
を省略する。第６図において、位相探索回路22は密度パ
ターン情報とインデックス・位相選択回路56よりの位相
更新情報を入力として、正規化駆動信号生成回路58に位
相情報ｊを出力する。正規化駆動信号生成回路58は、コ
ードブック24に格納される予め正規化されたコードベク
トルC⁽ⁱ⁾（ｉはコードベクトルのインデックス）と密度
パターン情報および位相情報ｊを入力とし、密度パター
ン情報に基づき上記コードベクトルの各要素の後ろに所
定数の零を内挿することでサブフレーム内で一定のパル
ス間隔となる正規化駆動信号を生成し、入力された位相
情報ｊに基づき正規化駆動信号を時間軸の正方向へシフ
トさせたものを最終出力として聴感重みフィルタ52へ出
力する。The encoding device of FIG. 6 differs from the encoding device of FIG. 5 only in that it incorporates the above-described method of simultaneously optimizing the index, phase, and gain. The same reference numerals as in FIG. 5 denote the same parts, and a description thereof will be omitted. 6, the phase search circuit 22 receives the density pattern information and the phase update information from the index / phase selection circuit 56, and outputs the phase information j to the normalized drive signal generation circuit 58. The normalized drive signal generating circuit 58 receives the pre-normalized code vector C ⁽ⁱ⁾ (i is an index of the code vector ⁾ stored in the code book 24, the density pattern information and the phase information j, and outputs the density pattern information A normalized drive signal having a constant pulse interval within a subframe is generated by interpolating a predetermined number of zeros after each element of the code vector based on the above-described code vector, and the normalized drive signal is generated based on the input phase information j. A signal obtained by shifting the signal in the positive direction of the time axis is output to the audibility weighting filter 52 as a final output.

内積計算回路54は聴感重み付き入力信号ｘ（ｎ）と聴
感重み付き合成信号候補x_j ⁽ⁱ⁾（ｎ）との内積値A_j ⁽ⁱ⁾を
（24）式により計算し、これをインデックス・位相選択
回路56へ出力する。パワー計算回路55は聴感重み付き合
成信号候補x_j ⁽ⁱ⁾（ｎ）のパワーB_j ⁽ⁱ⁾を（25）式により
計算し、これをインデックス・位相選択回路56へ出力す
る。インデックス・位相選択回路56は入力された内積値
の２乗とパワーの比｛A_j ⁽ⁱ⁾｝²/B_j ⁽ⁱ⁾が最大となるイン
デックスＩと位相Ｊを探索するために、順次インデック
スと位相の更新情報をコードブック24および位相探索回
路22へ出力する。この探索によって得られる最適なイン
デックスＩと位相Ｊの情報は、マルチプレクサ25へ出力
され、A_J ^(I),B_J ^(I)は一時保存される。ゲイン符号化回
路57はインデックス・位相選択回路56よりのA^(I) _J,B^(I)
_Jを入力とし、最適ゲインA_J ^(I)/B_J ^(I)の量子化・符号化
を行い、このゲインの情報をマルチプレクサ25へ出力す
る。The inner product calculation circuit 54 calculates the inner product value A _j ⁽ⁱ⁾ of the perceptually weighted input signal x (n) and the perceived weighted synthesized signal candidate x _j ⁽ⁱ⁾ (n) according to equation (24), and calculates this as an index. • Output to the phase selection circuit 56. The power calculation circuit 55 calculates the power B _j ⁽ⁱ⁾ of the perceptually weighted synthesized signal candidate x _j ⁽ⁱ⁾ (n) by equation (25), and outputs this to the index / phase selection circuit 56. The index / phase selection circuit 56 sequentially searches for an index I and a phase J that maximize the ratio of the square of the input inner product value to the power ｛A _j ⁽ⁱ⁾ ｝ ² / B _j ^(i). And the phase update information to the codebook 24 and the phase search circuit 22. Information on the optimal index I and phase J obtained by this search is output to the multiplexer 25, and A _J ^(I) and B _J ^(I) are temporarily stored. The gain encoding circuit 57 outputs A ^(I) _J and B ^(I) from the index / phase selection circuit 56.
_{With J} as an input, the quantization and encoding of the optimal gains A _J ^(I) / B _J ^(I) are performed, and the information on the gain is output to the multiplexer 25.

第７図は本発明の第４の実施例に係る符号化装置のブ
ロック図である。この符号化装置は、第６図の符号化装
置と同一の機能を有しながら、駆動信号の位相探索に要
する計算量を減らすことができるように構成したもので
ある。FIG. 7 is a block diagram of an encoding device according to a fourth embodiment of the present invention. This encoder has the same function as the encoder of FIG. 6, but is configured to be able to reduce the amount of calculation required for the phase search of the drive signal.

第７図において、位相シフト回路59は聴感重みフィル
タ52から出力される位相１の聴感重み付き合成信号候補
x₁ ⁽ⁱ⁾（ｎ）を入力とし、インデックスｉについて取り
得る全ての位相状態を単にx₁ ⁽ⁱ⁾（ｎ）のサンプル点を
時間軸の正方向にシフトさせることで簡単に作り出すこ
とができる。In FIG. 7, a phase shift circuit 59 outputs a perceptually weighted synthesized signal candidate of phase 1 output from the perceptual weight filter 52.
With x ₁ ⁽ⁱ⁾ (n) as input, all possible phase states for index i can be easily created by simply shifting the sample point of x ₁ ⁽ⁱ⁾ (n) in the positive direction of the time axis. it can.

今、コードブック24内のインデックスの候補数をN_I、
位相の候補数をN_Jとすれば、第６図における聴感重みフ
ィルタ52の使用回数は１回の駆動信号探索につきN_I×N_J
のオーダであるにのに対し、第７図における聴感重みフ
ィルタ52の使用回数は１回の駆動信号探索につきN_Iのオ
ーダとなり、約１×N_Jに計算量を減らすことができる。Now, let N _{I be} the number of index candidates in codebook 24,
Assuming that the number of phase candidates is N _J , the number of times the auditory weighting filter 52 in FIG. 6 is used is N _I × N _J per drive signal search.
Whereas the a of the order, the number of uses of the perceptual weighting filter 52 in FIG. 7 become the order of one drive signal search every N _I, can reduce the amount of computation to about 1 × N _J.

次に、密度パターン選択回路15をその前処理部分を含
めてより具体的に示した第５および第６の実施例を説明
する。また、以上の第１〜第４の実施例では、予測フィ
ルタ14を第３図に示したように長時間予測フィルタ41と
短時間予測フィルタ42の縦続構成とし、それらの予測パ
ラメータを入力音声信号の分析により求めていたが、以
下の実施例では長期予測フィルタおよびの逆フィルタで
ある長期合成フィルタのパラメータを閉ループで、すな
わち入力音声信号と合成信号との２乗平均誤差が最小と
なるように求める構成とする。この構成によると、合成
信号のレベルで誤差が最小となるようにパラメータを求
めるので、合成音声の品質がさらに向上する。Next, fifth and sixth embodiments will be described in which the density pattern selection circuit 15 is shown in more detail, including its preprocessing portion. In the first to fourth embodiments, the prediction filter 14 has a cascade configuration of a long-time prediction filter 41 and a short-time prediction filter 42 as shown in FIG. In the following embodiment, the parameters of the long-term synthesis filter, which is the inverse filter of the long-term prediction filter, are set in a closed loop, that is, such that the root-mean-square error between the input speech signal and the synthesized signal is minimized. The required configuration is adopted. According to this configuration, the parameters are determined so that the error is minimized at the level of the synthesized signal, so that the quality of the synthesized speech is further improved.

第８図および第９図は本発明の第５の実施例に係る音
声符号化装置および復号化装置のブロック図である。FIGS. 8 and 9 are block diagrams of a speech encoding apparatus and a decoding apparatus according to a fifth embodiment of the present invention.

第８図において、フレームバッファ101は入力端子100
に入力される音声信号を１フレーム分蓄積する回路であ
り、第８図の各ブロックはフレームバッファ101を用い
て１フレーム毎に以下の処理を行う。In FIG. 8, a frame buffer 101 has an input terminal 100.
Is a circuit for accumulating one frame of the audio signal inputted to each block. Each block in FIG. 8 performs the following processing for each frame using the frame buffer 101.

まず、１フレーム分の音声信号に対し予測パラメータ
計算回路102において、公知の方法を用いて短時間予測
パラメータを計算する（通常、この予測パラメータは８
〜12個計算される）。計算法については、例えば前記文
献２（古井貞照著「ディジタル音声処理」）に記述され
ている。計算された予測パラメータは、予測パラメータ
符号化回路103に入力される。予測パラメータ符号化回
路103は、予測パラメータを予め定められた量子化ビッ
ト数に基づいて符号化し、その符号をマルチプレクサ11
7へ出力すると共に、復号値Ｐを予測フィルタ104、聴感
重みフィルタ105、影響信号作成回路107、長期ベクトル
量子化回路109および短期ベクトル量子化回路111へ出力
する。First, the prediction parameter calculation circuit 102 calculates a short-term prediction parameter for one frame of a speech signal using a known method (normally, this prediction parameter is 8
~ 12 are calculated). The calculation method is described in, for example, the aforementioned reference 2 (“Digital Speech Processing” by Sadateru Furui). The calculated prediction parameters are input to the prediction parameter coding circuit 103. The prediction parameter encoding circuit 103 encodes the prediction parameter based on a predetermined number of quantization bits, and
7, and outputs the decoded value P to the prediction filter 104, the perceptual weight filter 105, the influence signal creation circuit 107, the long-term vector quantization circuit 109, and the short-term vector quantization circuit 111.

予測フィルタ104は、フレームバッファ101からの入力
音声信号と符号化回路103からの予測パラメータの復号
値から短期予測残差信号ｒを計算し、それを聴感重みフ
ィルタ105へ出力する。The prediction filter 104 calculates a short-term prediction residual signal r from the input speech signal from the frame buffer 101 and the decoded value of the prediction parameter from the encoding circuit 103, and outputs the signal to the auditory weighting filter 105.

聴感重みフィルタ105は、予測パラメータの復号値Ｐ
を基に構成されるフィルタで短期予測残差信号ｒのスペ
クトルを変形した信号ｘを減算回路106へ出力する。こ
の聴感重みフィルタ105は従来例における重み付けフィ
ルタと同様に聴覚のマスキング効果を利用するためのも
のであり、その詳細は前記文献２に記載されているの
で、説明は省略する。The perceptual weight filter 105 calculates the decoded value P of the prediction parameter.
A signal x obtained by transforming the spectrum of the short-term prediction residual signal r with a filter configured based on The audibility weighting filter 105 is for utilizing the auditory masking effect in the same manner as the weighting filter in the conventional example.

影響信号作成回路107は、加算回路112からの過去の重
み付けされた合成信号と、予測パラメータの復号値Ｐ
を入力とし、過去の影響信号ｆを出力する。具体的には
過去の重み付けされた合成信号をフィルタの内部状態
とする聴感重みフィルタの零入力応答を計算し、それを
影響信号ｆとして、予め設定されるサブフレーム単位で
出力する。8kHzサンプリング時のサブフレーム中の典型
的な値としては、１フレーム（160サンプル）を４分割
した40サンプル程度が使用される。影響信号作成回路10
7は、第１サブフレームにおいては前フレームで決定し
た密度パターンＫに基づいて作成された前フレームの合
成信号を入力として影響信号ｆを作成する。減算回路
106は、サブフレーム単位で聴感重み付き入力信号ｘか
ら過去の影響信号ｆを差し引いた信号ｕを減算回路108
および長期ベクトル量子化回路109へ出力する。The influence signal creation circuit 107 includes the past weighted synthesized signal from the addition circuit 112 and the decoded value P of the prediction parameter.
And outputs a past influence signal f. Specifically, it calculates a zero input response of an auditory weighting filter that uses the past weighted synthesized signal as an internal state of the filter, and outputs it as an influence signal f in preset subframe units. As a typical value in a subframe at the time of 8 kHz sampling, about 40 samples obtained by dividing one frame (160 samples) into four are used. Influence signal creation circuit 10
In the first sub-frame, an influence signal f is created by inputting a synthesized signal of the previous frame created based on the density pattern K determined in the previous frame. Subtraction circuit
A subtraction circuit 108 subtracts a signal u obtained by subtracting the past influence signal f from the perceptually weighted input signal x in subframe units.
And outputs it to the long-term vector quantization circuit 109.

分割回路113は、予測フィルタ104の出力である短期予
測残差信号ｒをサブフレーム単位で切り出し、それを重
みフィルタ114へ出力する。重みフィルタ114は基本的に
は聴感重みフィルタ105と同様の構成であるが、サブフ
レーム単位で短期予測差信号ｒが入力されるとき、内部
状態をリセットする点が聴感重みフィルタ105と異なっ
ている。The dividing circuit 113 cuts out the short-term prediction residual signal r, which is the output of the prediction filter 104, in subframe units, and outputs it to the weight filter 114. The weight filter 114 has basically the same configuration as the auditory weight filter 105, but differs from the auditory weight filter 105 in that when the short-term prediction difference signal r is input in subframe units, the internal state is reset. .

パワー計算回路115は重みフィルタ114の出力信号のパ
ワー（２乗和）をフレームに渡って計算する。ｍ番目の
サブフレームの短期予測残差信号をr^(m)（ｎ）、r
^(m)（ｎ）が入力された場合の重みフィルタ114の出力信
号をr_W（ｎ）とおくと、サブフレームｍに対応する重み
フィルタ114の出力信号のパワーP^(m)は、次式により計
算される。The power calculation circuit 115 calculates the power (sum of squares) of the output signal of the weight filter 114 over the frame. Let the short-term prediction residual signal of the m-th subframe be r ^(m) (n), r
^(m) If the output signal of the weight filter 114 when (n) is input is r _w (n), the power P ^(m) of the output signal of the weight filter 114 corresponding to the subframe m is expressed by the following equation. Is calculated by

但し、ｎは各サブフレーム間で定義されるサンプル
点、Ｌはサブフレーム長（サンプル）である。 Here, n is a sample point defined between subframes, and L is a subframe length (sample).

密度パターン選択回路116は、パワー計算回路115から
出力される重みフィルタ114の出力信号のパワーを基
に、予め設定された駆動信号の密度パターンの中から一
つを選択する。具体的には、パワーの大きいサブフレー
ムの順に密度が高くなるように密度パターンを選択す
る。例えば、等長のサブフレームが４個、密度の種類が
２つで、密度パターンを次表のように設定した場合、密
度パターン選択回路116はサブフレーム毎の上記パワー
を比較し、パワーが最大となるサブフレームが密となる
密度パターンの番号Ｋを選択し、それを密度パターン情
報として短期ベクトル量子化回路111とマルチプレクサ1
17へ出力する。The density pattern selection circuit 116 selects one of the preset drive signal density patterns based on the power of the output signal of the weight filter 114 output from the power calculation circuit 115. More specifically, a density pattern is selected such that the density increases in the order of the subframes having higher power. For example, if there are four equal-length sub-frames, two types of density, and density patterns set as shown in the following table, the density pattern selection circuit 116 compares the powers for each sub-frame, The number K of the density pattern in which the subframe becomes dense is selected, and this is used as the density pattern information as the short-term vector quantization circuit 111 and the multiplexer 1.
Output to 17.

長期ベクトル量子化回路109は、減算回路106からの差
信号、後述の駆動信号保持回路110からの過去の駆動
信号e_Xおよび符号化回路103からの予測パラメータＰを
入力とし、サブフレーム単位で差信号ｕの量子化出力信
号ｕを減算回路108および加算回路112へ、ベクトルゲイ
ンβおよびインデックスＴをマルチプレクサ117へ、長
期駆動信号ｔを駆動信号保持回路110へそれぞれ出力す
る。このときｔととの間には、＝ｔ＊ｈ（ｈは聴感
重みフィルタ105のインパルス応答、＊は畳み込みを表
わす）という関係がある。 The long-term vector quantization circuit 109 receives the difference signal from the subtraction circuit 106, the past drive signal e _X from the drive signal holding circuit 110 described later, and the prediction parameter P from the encoding circuit 103, and calculates the difference in subframe units. The quantized output signal u of the signal u is output to the subtraction circuit 108 and the addition circuit 112, the vector gain β and the index T are output to the multiplexer 117, and the long-term drive signal t is output to the drive signal holding circuit 110. At this time, there is a relationship between t and t = t * h (h represents the impulse response of the audibility weighting filter 105 and * represents convolution).

サブフレーム単位のベクトルゲインβ^(m)とインデッ
クスT^(m)（ｍはサブフレームの番号）の詳細な求め方の
一例を以下に示す。An example of a detailed method of obtaining the vector gain β ^(m) and index T ^(m) (m is the number of a subframe ^{) in} subframe units will be described below.

予め設定されるインデックスＴとゲインβと過去の駆
動信号を用いて現サブフレームの駆動信号候補を作成
し、これを聴感重みフィルタに入力して差信号ｕの量子
化信号の候補を作成し、差信号ｕと量子化信号の候補と
の誤差が最小となるように最適なインデックスT^(m)と最
適なβ^(m)を決定する。このときT^(m)と最適なβ^(m)を用
いて作成される現サブフレームの駆動８信号をｔとし、
ｔを聴感重みフィルタに入力して得られる信号を差信号
の量子化出力信号ｕとする。A drive signal candidate for the current subframe is created using the preset index T, gain β, and past drive signal, and is input to an audibility weighting filter to create a quantized signal candidate for the difference signal u. An optimal index T ^(m) and an optimal β ^(m) are determined so that an error between the difference signal u and the quantized signal candidate is minimized. At this time, t is the drive 8 signal of the current subframe created using T ^(m) and the optimal β ^(m) ,
A signal obtained by inputting t to the perceptual weighting filter is defined as a quantized output signal u of the difference signal.

これと同様な方法は、例えばPETER KROON氏らによるI
EEE 1988年２月、Vol.SAC−6,pp.353−363に掲載の“A
class of Analysis−by−Synthesic Predicative Coder
s for High Quality Speech Coding at Rates Between
4.8 and 16kbits/s"と題する論文（文献３）中の閉ルー
プでピッチ予測器の係数を求める方法と同様の公知の方
法を用いることができるので、ここでは説明を省略す
る。A similar method is described, for example, by PETER KROON et al.
EEE February 1988, Vol.SAC-6, pp.353-363
class of Analysis-by-Synthesic Predicative Coder
s for High Quality Speech Coding at Rates Between
4.8 and 16 kbits / s "(Reference 3), a known method similar to the method of obtaining the coefficient of the pitch predictor in a closed loop can be used, and the description is omitted here.

一方、減算回路108ではサブフレーム単位で差信号
から量子化出力信号ｕを減じた差信号Ｖを短期ベクトル
量子化回路111へ出力する。On the other hand, the subtraction circuit 108 outputs to the short-term vector quantization circuit 111 a difference signal V obtained by subtracting the quantized output signal u from the difference signal in subframe units.

短期ベクトル量子化回路111は、差信号Ｖ、予測パラ
メータＰおよび密度パターン選択回路116より出力され
る密度パターン番号Ｋを入力とし、サブフレーム単位で
差信号の量子化出力信号Ｖを加算回路112へ、短期駆
動信号ｙを駆動信号保持回路110へそれぞれ出力する。
ここでとｙとの間には、＝ｙ＊ｈという関係があ
る。The short-term vector quantization circuit 111 receives the difference signal V, the prediction parameter P, and the density pattern number K output from the density pattern selection circuit 116, and sends the difference signal quantized output signal V in subframe units to the addition circuit 112. , And outputs the short-term drive signal y to the drive signal holding circuit 110.
Here, there is a relationship of = y * h between and y.

また、これと共に短期ベクトル量子化回路111は駆動
パルス列のゲインＧ、位相情報Ｊおよびコードベクトル
のインデックスＩをマルチプレクサ117へ出力する。こ
のとき、サブフレーム単位で出力されるパラメータG,J,
Iは、密度パターン番号Ｋで決まる現サブフレーム（第
ｍサブフレーム）の密度（パルス間隔）に応じたパルス
数N^(m)をサブフレーム内で符号化しなければならないの
で、予め設定されるコードベクトルの次元数N_D（１つず
つのコードベクトルを構成するパルス数）に応じた個
数、すなわちN^(m)/N_D個ずつ現サブフレームで出力され
る。At the same time, the short-term vector quantization circuit 111 outputs the gain G of the drive pulse train, the phase information J, and the index I of the code vector to the multiplexer 117. At this time, the parameters G, J,
Since the number of pulses N ^(m) corresponding to the density (pulse interval) of the current sub-frame (m-th sub-frame) determined by the density pattern number K must be encoded in the sub-frame, number corresponding to the number of dimensions N _D vector (number of pulses constituting one each codevector), that is, output by N ^(m) / N _D amino each current sub-frame.

例えばフレーム長が160サンプル、サブフレームが４
つの等長の40サンプルで構成され、コードベクトルの次
元が20であるとする。この場合、予め用意される密度パ
ターンの１つが第１サブフレームのパルス間隔１、第２
〜第４サブフレームのパルス間隔２とすると、この密度
パターンに対して短期ベクトル量子化回路111から出力
されるゲイン、位相およびインデックスの個数は、それ
ぞれ第１サブフレームで40/20＝２個（ただし、この場
合はパルス間隔が１なので、位相情報は出力しない）、
第２〜第４サブフレームで20/20＝１個となる。For example, the frame length is 160 samples and the subframe is 4
It is assumed that the code vector is composed of 40 samples of equal length and the dimension of the code vector is 20. In this case, one of the density patterns prepared in advance is the pulse interval 1 of the first subframe, the second
Assuming that the pulse interval is 2 to the fourth subframe, the number of gains, phases, and indices output from the short-term vector quantization circuit 111 for this density pattern is 40/20 = 2 in the first subframe ( However, in this case, since the pulse interval is 1, no phase information is output.)
20/20 = 1 in the second to fourth subframes.

短期ベクトル量子化回路111の具体的な構成例を第10
図に示す。第10図において、合成ベクトル生成回路301
は予測パラメータＰと、予め設定されるコードブック30
2内のコードベクトルC⁽ⁱ⁾（ｉはコードベクトルのイン
デックス）および密度パターン情報Ｋとから、密度パタ
ーン情報Ｋに対応する予め設定されたパルス間隔となる
ようにC⁽ⁱ⁾の第１サンプル以降に所定の周期で零を内挿
して密度情報を持つパルス列を作成し、このパルス列を
予測パラメータＰから生成される聴感重みフィルタで合
成することにより、合成ベクトルV₁ ⁽ⁱ⁾を生成する。The specific configuration example of the short-term vector quantization circuit 111 is
Shown in the figure. In FIG. 10, a synthetic vector generation circuit 301
Represents a prediction parameter P and a preset code book 30
2 from the code vector C ⁽ⁱ⁾ (i is an index of the code vector) and the density pattern information K so that the first sample of C ⁽ⁱ⁾ becomes a preset pulse interval corresponding to the density pattern information K. Thereafter, a pulse train having density information is created by interpolating zero at a predetermined cycle, and this pulse train is synthesized by an audibility weighting filter generated from the prediction parameter P, thereby generating a synthesized vector V ₁ ⁽ⁱ⁾ .

位相シフト回路303は、この合成ベクトルV₁ ⁽ⁱ⁾を密度
パターン情報Ｋに基づいて所定のサンプル数だけ遅延さ
せて位相の異なる合成ベクトルV₂ ⁽ⁱ⁾,V₃ ⁽ⁱ⁾,…V_j ⁽ⁱ⁾,…
を作成し、内積計算回路304およびパワー計算回路305へ
出力する。コードブック302は適応密度パルスの振幅情
報を格納し、インデックスｉに対して予め定められたコ
ードベクトルC⁽ⁱ⁾が引き出し可能なメモリ回路またはベ
クトル発生回路で構成される。内積計算回路304は、第
８図の減算回路108からの差信号Ｖと、合成ベクトルV_j
⁽ⁱ⁾との内積値A_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。パワー計算回路305は、合成ベクトル
V_j ⁽ⁱ⁾のパワーB_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。The phase shift circuit 303 delays the combined vector V ₁ ⁽ⁱ⁾ by a predetermined number of samples based on the density pattern information K, and combines the combined vectors V ₂ ⁽ⁱ⁾ , V ₃ ⁽ⁱ⁾ _,. ⁽ⁱ⁾ ,…
Is generated and output to the inner product calculation circuit 304 and the power calculation circuit 305. The codebook 302 stores the amplitude information of the adaptive density pulse, and is configured by a memory circuit or a vector generation circuit from which a predetermined code vector C ⁽ⁱ⁾ can be derived for an index i. Inner product computation circuit 304, and a difference signal V from Figure 8 of the subtracting circuit 108, the resultant vector V _j
obtains an inner product value A _j ⁽ⁱ⁾ and ^(i), and outputs the index phase selection circuit 306. The power calculation circuit 305 calculates the composite vector
Power B obtains the _j ⁽ⁱ⁾ of V _j ^(i), and outputs the index phase selection circuit 306.

インデックス・位相選択回路306では、内積値A_j ⁽ⁱ⁾と
パワーB_j ⁽ⁱ⁾を用いて、次式の評価値｛A_j ⁽ⁱ⁾｝²/B_j ⁽ⁱ⁾ （28）が最も大きくなるような位相ＪとインデックスＩを位相
候補ｊとインデックス候補ｉの中から選択し、対応する
内積値A_J ^(I)とパワーB_J ^(I)の組をゲイン符号化回路307
へ出力する。また、インデックス・位相選択回路306は
さらに位相Ｊの情報を短期駆動信号生成回路308および
第８図のマルチプレクサ117へ出力し、インデックスＩ
の情報をコードブック302および第８図のマルチプレク
サ117へ出力する。The index / phase selection circuit 306 uses the inner product value A _j ⁽ⁱ⁾ and the power B _j ⁽ⁱ⁾ to find the evaluation value ｛A _j ⁽ⁱ⁾ ｝ ² / B _j ⁽ⁱ⁾ (28) A phase J and an index I that become larger are selected from the phase candidate j and the index candidate i, and a set of the corresponding inner product value A _J ^(I) and power B _J ^(I) is obtained by the gain encoding circuit 307.
Output to Further, the index / phase selection circuit 306 further outputs the information of the phase J to the short-term drive signal generation circuit 308 and the multiplexer 117 of FIG.
Is output to the code book 302 and the multiplexer 117 of FIG.

ゲイン符号化回路307では、インデックス・位相選択
回路306からの内積値A_J ^(I)とパワーB_J ^(I)との比 A_J ^(I)/B_J ^(I) （29）を所定の方法で符号化して、そのゲイン情報Ｇを短期駆
動信号生成回路308および第８図のマルチプレクサ117へ
出力する。In the gain encoding circuit 307, a ratio A _J ^(I) / B _J ^(I) (29) between the inner product value A _J ^(I) from the index / phase selection circuit 306 and the power B _J ^(I) is determined by a predetermined method. And outputs the gain information G to the short-term drive signal generation circuit 308 and the multiplexer 117 in FIG.

上の（28）（29）式は、例えばI.M.Trancoso氏らによ
るInternational Conference on Acoustic,Speech and
Signal Processingの論文“EFFICIENT PROCEDURES FOR
FINDING THE OPTIMUM INNOVATION IN STOCHATIC CODER
S"（文献４）によって提案されたものを用いてもよい。Equations (28) and (29) above are described, for example, by International Conference on Acoustic, Speech and
Signal Processing paper “EFFICIENT PROCEDURES FOR
FINDING THE OPTIMUM INNOVATION IN STOCHATIC CODER
S "(Reference 4) may be used.

短期駆動信号生成回路308は、密度パターン情報Ｋ、
ゲイン情報Ｇ、位相情報ＪおよびインデックスＩに対応
するコードベクトルC^(I)を入力とし、ＫおよびC^(I)を用
いて上記合成ベクトル生成回路301での方法と同様の方
法で密度情報を持つパルス列を作成し、そのパルス振幅
にゲイン情報Ｇに対応する値を乗じ、位相情報Ｊに基づ
き所定のサンプル数だけパルス列を遅延することによ
り、短期駆動信号ｙを生成する。この短期駆動信号ｙ
は、聴感重みフィルタ309および第８図の駆動信号保持
回路110へ出力される。聴感重みフィルタ309は第８図の
聴感重みフィルタ105と同様の特性を持つフィルタであ
り、予測パラメータＰを基にして作られ、短期駆動信号
ｙを入力として差信号Ｖの量子化出力を第８図の加算
回路112へ出力する。The short-term drive signal generation circuit 308 includes density pattern information K,
Gain information G, phase information J, and code vector C ^(I) corresponding to index I are input, and density information is stored using K and C ^(I) in the same manner as in synthetic vector generation circuit 301 described above. A short-term drive signal y is generated by creating a pulse train, multiplying the pulse amplitude by a value corresponding to the gain information G, and delaying the pulse train by a predetermined number of samples based on the phase information J. This short-term drive signal y
Are output to the audibility weighting filter 309 and the drive signal holding circuit 110 in FIG. The perceptual weighting filter 309 is a filter having the same characteristics as the perceptual weighting filter 105 of FIG. 8, is formed based on the prediction parameter P, and receives the short-term drive signal y as input and outputs the quantized output of the difference signal V to the eighth input. It is output to the adder circuit 112 in the figure.

第８図に説明を戻すと、駆動信号保持回路110は長期
ベクトル量子化回路109より出力される長期駆動信号ｔ
および短期ベクトル量子化回路111より出力される短期
駆動信号ｙを入力とし、駆動信号e_Xをサブフレーム単位
で長期ベクトル量子化回路109へ出力する。具体的に
は、例えばｔとｙをサブフレーム単位でサンプル毎に加
算したものを駆動信号e_Xとすればよい。現サブフレーム
の駆動信号e_Xは、次のサブフレームにおいて過去の駆動
信号として長期ベクトル量子化回路109において使用で
きるように、駆動信号保持回路110内のバッファメモリ
に保持される。加算回路112は、サブフレーム単位で量
子化出力^(m)および^(m)と、現サブフレームで作成さ
れた過去の影響信号ｆとの和信号ｘを求め、影響信号作
成回路107へ出力する。Returning to FIG. 8, the drive signal holding circuit 110 outputs the long-term drive signal t output from the long-term vector quantization circuit 109.
The short-term drive signal y output from the short-term vector quantization circuit 111 is input, and the drive signal e _X is output to the long-term vector quantization circuit 109 in subframe units. Specifically, for example, those obtained by adding the t and y for each sample in the sub-frames may be used as the driving signal e _X. The drive signal e _X of the current sub-frame is held in a buffer memory in the drive signal holding circuit 110 so that it can be used in the long-term vector quantization circuit 109 as a past drive signal in the next sub-frame. The addition circuit 112 obtains a sum signal x of the quantized outputs ^(m) and ^{(m) in} units of subframes and the past influence signal f created in the current subframe, and outputs the sum signal x to the influence signal creation circuit 107.

以上のようにして求められた各パラメータP,β,T,G,
I,J,Kの情報がマルチプレクサ117により多重化され、伝
送符号として出力端子118より伝送される。The parameters P, β, T, G,
The information of I, J, and K is multiplexed by the multiplexer 117 and transmitted from the output terminal 118 as a transmission code.

次に、第８図の符号化装置から伝送された符号を復号
する第９図の復号化装置について説明する。Next, the decoding device shown in FIG. 9 for decoding the code transmitted from the encoding device shown in FIG. 8 will be described.

第９図において、入力端子200には伝送された符号が
入力される。デマルチプレクサ201はこの入力符号をま
ず予測パラメータ、密度パターン情報Ｋ、ゲインβ、ゲ
インＧ、位相Ｊ、インデックスＴ、インデックスＩおよ
び位相情報Ｊの符号に分離する。復号化回路202〜207
は、それぞれ密度パターン情報Ｋ、ゲインＧ、インデッ
クスＩ、ゲインβおよびインデックスＴの符号を復号
し、駆動信号生成回路209へ出力する。もう一つの復号
化回路208は、符号化された予測パラメータを復号し、
合成フィルタ210へ出力する。駆動信号生成回路209は、
復号された各パラメータを入力とし、密度パターン情報
Ｋに基づいてサブフレーム単位で密度の異なる駆動信号
を生成する。In FIG. 9, a transmitted code is input to an input terminal 200. The demultiplexer 201 first separates the input code into codes of a prediction parameter, density pattern information K, gain β, gain G, phase J, index T, index I, and phase information J. Decoding circuits 202-207
Decodes the codes of the density pattern information K, the gain G, the index I, the gain β, and the index T, respectively, and outputs the codes to the drive signal generation circuit 209. Another decoding circuit 208 decodes the encoded prediction parameter,
Output to the synthesis filter 210. The drive signal generation circuit 209 is
Drive signals having different densities are generated in subframe units based on the density pattern information K by using the decoded parameters as inputs.

駆動信号生成回路209は、具体的には例えば第11図に
示すように構成される。第11図において、コードブック
500は符号化装置内の第10図に示すコードブック302と同
一機能を有するものであり、インデックスＩに対応する
コードベクトルC^(I)を短期駆動信号生成回路501へ出力
する。短期駆動信号生成回路501は、符号化装置内の第1
0図に示す短期駆動信号生成回路308と同一機能を有する
ものであり、密度パターン情報Ｋ、位相情報Ｊおよびゲ
インＧを入力とし、短期駆動信号ｙを加算回路506へ出
力する。加算回路506は、短期駆動信号ｙと長期駆動信
号生成回路502で生成された長期駆動信号ｔとの和信
号、すなわち駆動信号e_Xを駆動信号バッファ503および
第９図の合成フィルタ210へ出力する。The drive signal generation circuit 209 is specifically configured, for example, as shown in FIG. In FIG. 11, the code book
Reference numeral 500 has the same function as the code book 302 shown in FIG. 10 in the encoder, and outputs a code vector C ^(I) corresponding to the index I to the short-term drive signal generation circuit 501. The short-term drive signal generation circuit 501
It has the same function as the short-term drive signal generation circuit 308 shown in FIG. 0, receives the density pattern information K, the phase information J and the gain G, and outputs the short-term drive signal y to the addition circuit 506. Adding circuit 506 outputs the short drive signal y and the long-term drive signal generating circuit sum signal of the long-term drive signal t generated by 502, that is, the driving signal e synthesis filter 210 of the _X drive signal buffer 503 and Figure 9 .

駆動信号バッファ503は、加算回路506から出力される
駆動信号を現在から所定のサンプル数だけ過去のものま
で保持し、インッデックスＴが入力されるとＴサンプル
過去の駆動信号から順にサブフレーム長に相当するサン
プル数だけ出力する構成となっている。長期駆動信号生
成回路502は、インデックスＴに基づき駆動信号バッフ
ァ503より出力される信号を入力とし、この入力信号に
ゲインβを乗じると共に、Ｔサンプルの周期で繰り返す
長期駆動信号を生成し、加算回路506へサブフレーム単
位で出力する。The drive signal buffer 503 holds the drive signal output from the adder circuit 506 up to a predetermined number of samples from the present to the past, and when an index T is input, the drive signal buffer 503 sequentially increases the sub-frame length from the drive signal T samples past. It is configured to output the corresponding number of samples. The long-term drive signal generation circuit 502 receives a signal output from the drive signal buffer 503 based on the index T, multiplies the input signal by a gain β, and generates a long-term drive signal that repeats at a period of T samples. Output to 506 in subframe units.

第９図に説明を戻すと、合成フィルタ210は符号化装
置内の第８図に示す予測フィルタ104と逆の周波数特性
を持つフィルタであり、駆動信号と予測パラメータを入
力として、合成信号を出力する。Returning to FIG. 9, the synthesis filter 210 is a filter having a frequency characteristic opposite to that of the prediction filter 104 shown in FIG. 8 in the encoding device, and outputs the synthesized signal by inputting the drive signal and the prediction parameter. I do.

ポストフィルタ211は予測パラメータ、ゲインβおよ
びインデックスＴを用いて合成フィルタ210から出力さ
れる合成信号のスペクトルを主観的に雑音が減少するよ
うに整形して、バッファ212へ出力する。ポストフィル
タの具体的な構成法については例えば前記文献５に記載
されているような方法を用いればよい。また、ポストフ
ィルタ211を用いずに直接合成フィルタ210の出力をバッ
ファ212へ供給する構成としてもよい。バッファ212は入
力される信号をフレーム毎に結合し、合成音声信号を出
力端子213へ出力する。The post filter 211 shapes the spectrum of the synthesized signal output from the synthesis filter 210 using the prediction parameter, the gain β, and the index T so as to subjectively reduce noise, and outputs the resultant to the buffer 212. As a specific configuration method of the post-filter, for example, a method described in the above-mentioned reference 5 may be used. Alternatively, the output of the synthesis filter 210 may be directly supplied to the buffer 212 without using the post filter 211. The buffer 212 combines the input signals on a frame-by-frame basis and outputs the synthesized audio signal to the output terminal 213.

なお、上述した実施例では駆動信号の密度パターン選
択を短期予測残差信号を重みフィルタに通した信号のパ
ワーに基づいて行ったが、短期予測残差信号にピッチ予
測を適用して得られるピッチ予測残差信号を重みフィル
タに通した信号のパワーに基づいて行うこともできる。In the above-described embodiment, the density pattern of the drive signal is selected based on the power of the signal obtained by passing the short-term prediction residual signal through the weighting filter. It can also be performed based on the power of the signal obtained by passing the prediction residual signal through a weight filter.

第12図は本発明の第６の実施例に係る符号化装置のブ
ロック図であり、ピッチ予測残差信号を重みフィルタに
通した信号のパワーに基づいて密度パターン選択を行う
例である。第12図は、第８図における分割回路113の前
にピッチ分析回路119、ピッチ予測フィルタ120を配置し
た構成となっている。ピッチ分析回路119はピッチ周期
とピッチゲインを計算する回路であり、その計算結果を
ピッチ予測フィルタ120へ出力し、ピッチ予測フィルタ1
20はピッチ予測残差信号を分割回路113へ出力する。ピ
ッチ周期とピッチゲインは、公知の方法、例えば自己相
関法や共分散法で求めることができる。FIG. 12 is a block diagram of an encoding device according to a sixth embodiment of the present invention, in which a density pattern is selected based on the power of a signal obtained by passing a pitch prediction residual signal through a weighting filter. FIG. 12 shows a configuration in which a pitch analysis circuit 119 and a pitch prediction filter 120 are arranged before the division circuit 113 in FIG. The pitch analysis circuit 119 is a circuit that calculates a pitch period and a pitch gain, outputs the calculation result to the pitch prediction filter 120, and outputs the pitch prediction filter 1
20 outputs the pitch prediction residual signal to the dividing circuit 113. The pitch period and the pitch gain can be obtained by a known method, for example, an autocorrelation method or a covariance method.

第13図は本発明の第７の実施例に係る音声符号化装置
のブロック図である。FIG. 13 is a block diagram of a speech coding apparatus according to a seventh embodiment of the present invention.

第13図において、フレームバッファ101は入力端子100
に入力される音声信号を１フレーム分蓄積する回路であ
り、第13図の各ブロックはフレームバッファ101を用い
て１フレーム毎に以下の処理を行う。In FIG. 13, the frame buffer 101 has an input terminal 100.
Is a circuit for accumulating the audio signal for one frame, which is input to each block. Each block in FIG. 13 performs the following processing for each frame using the frame buffer 101.

まず、１フレーム分の音声信号に対し予測パラメータ
計算回路102において、公知の方法を用いて短時間予測
パラメータを計算する（通常、この予測パラメータは８
〜12個計算される）。計算法については、例えば前記文
献２に記述されている。計算された予測パラメータは、
予測パラメータ符号化回路103に入力される。予測パラ
メータ符号化回路103は、予測パラメータを予め定めら
れた量子化ビット数に基づいて符号化し、その符号をマ
ルチプレクサ117へ出力すると共に、復号値Ｐを予測フ
ィルタ104、聴感重みフィルタ105、影響信号作成回路10
7、長期ベクトル量子化回路109および短期ベクトル量子
化回路111へ出力する。First, the prediction parameter calculation circuit 102 calculates a short-term prediction parameter for one frame of a speech signal using a known method (normally, this prediction parameter is 8
~ 12 are calculated). The calculation method is described in, for example, the aforementioned reference 2. The calculated prediction parameters are
It is input to the prediction parameter coding circuit 103. The prediction parameter encoding circuit 103 encodes the prediction parameter based on a predetermined number of quantization bits, outputs the code to the multiplexer 117, and outputs the decoded value P to the prediction filter 104, the perceptual weight filter 105, the influence signal Creation circuit 10
7. Output to the long-term vector quantization circuit 109 and the short-term vector quantization circuit 111.

聴感重みフィルタ105は、予測パラメータの復号値Ｐ
を基に構成されるフィルタで短期予測残差信号ｒのスペ
クトルを変形した信号ｘを減算回路106へ出力する。こ
の聴感重みフィルタ105は従来例における重み付けフィ
ルタと同様に聴覚のマスキング効果を利用するためのも
のでり、その詳細は前記文献２に記載されているので、
説明は省略する。The perceptual weight filter 105 calculates the decoded value P of the prediction parameter.
A signal x obtained by transforming the spectrum of the short-term prediction residual signal r with a filter configured based on This auditory weighting filter 105 is for using the masking effect of the auditory sense similarly to the weighting filter in the conventional example.
Description is omitted.

影響信号作成回路107は、加算回路112からの過去の重
み付けされた合成信号と、予測パラメータの復号値Ｐ
を入力とし、過去の影響信号ｆを出力する。具体的には
過去の重み付けされた合成信号をフィルタの内部状態
をする聴感重みフィルタの零入力応答を計算し、それを
影響信号ｆとして、予め設定されるサブフレーム単位で
出力する。8kHzサンプリング時のサブフレーム中の典型
的な値としては、１フレーム（160サンプル）を４分割
した40サンプル程度が使用される。影響信号作成回路10
7は、第１サブフレームにおいては前フレームで決定し
た密度パターンＫに基づいて作成された前フレームの合
成信号を入力として影響信号ｆを作成する。減算回路
106は、サブフレーム単位で聴感重み付き入力信号ｘか
ら過去の影響信号ｆを差し引いた信号ｕを減算回路108
および長期ベクトル量子化回路109へ出力する。The influence signal creation circuit 107 includes the past weighted synthesized signal from the addition circuit 112 and the decoded value P of the prediction parameter.
And outputs a past influence signal f. More specifically, a quiescent response of a perceptual weighting filter that performs the internal state of the filter on the past weighted synthesized signal is calculated, and is output as an influence signal f in preset subframe units. As a typical value in a subframe at the time of 8 kHz sampling, about 40 samples obtained by dividing one frame (160 samples) into four are used. Influence signal creation circuit 10
In the first sub-frame, an influence signal f is created by inputting a synthesized signal of the previous frame created based on the density pattern K determined in the previous frame. Subtraction circuit
A subtraction circuit 108 subtracts a signal u obtained by subtracting the past influence signal f from the perceptually weighted input signal x in subframe units.
And outputs it to the long-term vector quantization circuit 109.

パワー計算回路113は、予測フィルタ104の出力である
短期予測残差信号のパワー（２乗和）をサブフレーム単
位で計算し、各サブフレームのパワーを密度パターン選
択回路122へ出力する。The power calculation circuit 113 calculates the power (sum of squares) of the short-term prediction residual signal output from the prediction filter 104 in subframe units, and outputs the power of each subframe to the density pattern selection circuit 122.

密度パターン選択回路122は、パワー計算回路117から
出力されるサブフレーム毎の短期予測残差信号のパワー
を基に、予め設定された駆動信号の密度パターンの中か
ら一つを選択する。具体的には、パワーの大きいサブフ
レームの順に密度が高くなるように密度パターンを選択
する。例えば、等長のサブフレームが４個、密度の種類
が２つで、密度パターンを前記表１のように設定した場
合、密度パターン選択回路117はサブフレーム毎の上記
パワーを比較し、パワーが最大となるサブフレームが密
となる密度パターンの番号Ｋを選択し、それを密度パタ
ーン情報として短期ベクトル量子化回路111とマルチプ
レクサ117へ出力する。The density pattern selection circuit 122 selects one of preset drive signal density patterns based on the power of the short-term prediction residual signal for each subframe output from the power calculation circuit 117. More specifically, a density pattern is selected such that the density increases in the order of the subframes having higher power. For example, when there are four equal-length subframes, two types of density, and density patterns set as shown in Table 1, the density pattern selection circuit 117 compares the power for each subframe, and determines the power. The number K of the density pattern in which the maximum subframe is dense is selected, and is output to the short-term vector quantization circuit 111 and the multiplexer 117 as density pattern information.

長期ベクトル量子化回路109は、減算回路106からの差
信号ｕ、後述の駆動信号保持回路110からの過去の駆動
信号e_Xおよび符号化回路103からの予測パラメータＰを
入力とし、サブフレーム単位で差信号ｕの量子化出力信
号を減算回路108および加算回路112へ、ベクトルゲイ
ンβおよびインデックスＴをマルチプレクサ117へ、長
期駆動信号ｔを駆動信号保持回路110へそれぞれ出力す
る。このときｔととの間には、＝ｔ＊ｈ（ｈは聴感
重みフィルタ105のインパルス応答、＊は畳み込みを表
わす）という関係がある。The long-term vector quantization circuit 109 receives the difference signal u from the subtraction circuit 106, the past drive signal e _X from the drive signal holding circuit 110 described later, and the prediction parameter P from the encoding circuit 103, and receives a sub-frame unit. The quantized output signal of the difference signal u is output to the subtraction circuit 108 and the addition circuit 112, the vector gain β and the index T are output to the multiplexer 117, and the long-term drive signal t is output to the drive signal holding circuit 110. At this time, there is a relationship between t and t = t * h (h represents the impulse response of the audibility weighting filter 105 and * represents convolution).

サブフレーム単位のベクトルゲインβ^(m)とインデッ
クスT^(m)（ｍはサブフレームの番号）の詳細な求め方
は、第５の実施例と同様でよいため、説明を省略する。A detailed method of obtaining the vector gain β ^(m) and the index T ^(m) (m is the number of the subframe ^{) in} subframe units may be the same as in the fifth embodiment, and thus the description is omitted.

一方、減算回路108ではサブフレーム単位で差信号ｕ
からの量子化出力信号を減じた差信号Ｖを短期ベクト
ル量子化回路111へ出力する。On the other hand, in the subtraction circuit 108, the difference signal u is
And outputs the difference signal V obtained by subtracting the quantized output signal from the short-term vector quantization circuit 111.

短期ベクトル量子化回路111は、差信号Ｖ、予測パラ
メータＰおよび密度パターン選択回路122より出力され
る密度パターン番号Ｋを入力とし、サブフレーム単位で
差信号Ｖの量子化出力信号を加算回路112へ、短期駆
動信号ｙを駆動信号保持回路110へそれぞれ出力する。
ここでとｙとの間には、＝ｙ＊ｈという関係があ
る。The short-term vector quantization circuit 111 receives the difference signal V, the prediction parameter P, and the density pattern number K output from the density pattern selection circuit 122, and sends the quantization output signal of the difference signal V to the addition circuit 112 in subframe units. , And outputs the short-term drive signal y to the drive signal holding circuit 110.
Here, there is a relationship of = y * h between and y.

位相シフト回路303は、この合成ベクトルV₁ ⁽ⁱ⁾を密度
パターン情報Ｋに基づいて所定のサンプル数だけ遅延さ
せて位相の異なる合成ベクトルV₂ ⁽ⁱ⁾,V₃ ⁽ⁱ⁾,…V_j ⁽ⁱ⁾,…
を作成し、内積計算回路304およびパワー計算回路305へ
出力する。コードブック302は適応密度パルスの振幅情
報を格納し、インデックスｉに対して予め定められたコ
ードベクトルC⁽ⁱ⁾が引き出し可能なメモリ回路またはベ
クトル発生回路で構成される。内積計算回路304は、第1
3図の減算回路108からの差信号Ｖと、合成ベクトルV_j
⁽ⁱ⁾との内積値A_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。パワー計算回路305は、合成ベクトル
V_j ⁽ⁱ⁾のパワーB_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。The phase shift circuit 303 delays the combined vector V ₁ ⁽ⁱ⁾ by a predetermined number of samples based on the density pattern information K, and combines the combined vectors V ₂ ⁽ⁱ⁾ , V ₃ ⁽ⁱ⁾ _,. ⁽ⁱ⁾ ,…
Is generated and output to the inner product calculation circuit 304 and the power calculation circuit 305. The codebook 302 stores the amplitude information of the adaptive density pulse, and is configured by a memory circuit or a vector generation circuit from which a predetermined code vector C ⁽ⁱ⁾ can be derived for an index i. The inner product calculation circuit 304
And a difference signal V from the three view subtraction circuit 108, the resultant vector V _j
obtains an inner product value A _j ⁽ⁱ⁾ and ^(i), and outputs the index phase selection circuit 306. The power calculation circuit 305 calculates the composite vector
Power B obtains the _j ⁽ⁱ⁾ of V _j ^(i), and outputs the index phase selection circuit 306.

インデックス・位相選択回路306では、内積値A_j ⁽ⁱ⁾と
パワーB_j ⁽ⁱ⁾を用いて、前記（28）式に示した評価値が
最も大きくなるような位相ＪとインデックスＩを位相候
補ｊとインデックス候補ｉの中から選択し、対応する内
積値A_J ^(I)とパワーB_J ^(I)の組をゲイン量子化回路307へ
出力する。また、インデックス・位相選択回路306はさ
らに位相Ｊの情報を短期駆動信号生成回路308および第1
3図のマルチプレクサ117ヘ出力し、インデックスＩの情
報をコードブック302および第13図のマルチプレクサ117
へ出力する。The index / phase selection circuit 306 uses the inner product value A _j ⁽ⁱ⁾ and the power B _j ⁽ⁱ⁾ to determine the phase J and index I for which the evaluation value shown in the above equation (28) is the largest, as a phase candidate. j and an index candidate i, and outputs a corresponding set of the inner product value A _J ^(I) and the power B _J ^(I) to the gain quantization circuit 307. The index / phase selection circuit 306 further outputs the information on the phase J to the short-term drive signal generation circuit 308 and the first
Output to the multiplexer 117 shown in FIG. 3 and the information of the index I are stored in the codebook 302 and the multiplexer 117 shown in FIG.
Output to

ゲイン符号化回路307では、前記（29）式に示したイ
ンデックス・位相選択回路306からの内積値A_J ^(I)とパワ
ーB_J ^(I)との比を所定の方法で符号化して、そのゲイン
情報Ｇを短期駆動信号生成回路308および第13図のマル
チプレクサ117へ出力する。In the gain encoding circuit 307, the ratio between the inner product value A _J ^(I) and the power B _J ^(I) from the index / phase selection circuit 306 shown in the above equation (29) is encoded by a predetermined method. The gain information G is output to the short-term drive signal generation circuit 308 and the multiplexer 117 in FIG.

短期駆動信号生成回路308は、密度パターン情報Ｋ、
ゲイン情報Ｇ、位相情報ＪおよびインデックスＩに対応
するコードベクトルC^(I)を入力とし、ＫおよびC^(I)を用
いて上記合成ベクトル生成回路301での方法と同様の方
法で密度情報を持つパルス列を作成し、そのパルス振幅
にゲイン情報Ｇに対応する値を乗じ、位相情報Ｊに基づ
き所定のサンプル数だけパルス列を遅延することによ
り、短期駆動信号ｙを生成する。この短期駆動信号ｙ
は、聴感重みフィルタ309および第13図の駆動信号保持
回路110へ出力される。聴感重みフィルタ309は第13図の
聴感重みフィルタ105と同様の特性を持つフィルタであ
り、予測パラメータＰを基にして作られ、短期駆動信号
ｙを入力として差信号Ｖの量子化出力を第13図の加算
回路112へ出力する。The short-term drive signal generation circuit 308 includes density pattern information K,
Gain information G, phase information J, and code vector C ^(I) corresponding to index I are input, and density information is stored using K and C ^(I) in the same manner as in synthetic vector generation circuit 301 described above. A short-term drive signal y is generated by creating a pulse train, multiplying the pulse amplitude by a value corresponding to the gain information G, and delaying the pulse train by a predetermined number of samples based on the phase information J. This short-term drive signal y
Are output to the audibility weighting filter 309 and the drive signal holding circuit 110 in FIG. The perceptual weight filter 309 is a filter having the same characteristics as the perceptual weight filter 105 of FIG. 13, is formed based on the prediction parameter P, and receives the short-term drive signal y as an input and converts the quantized output of the difference signal V into the thirteenth. It is output to the adder circuit 112 in the figure.

第13図に説明を戻すと、駆動信号保持回路110は長期
ベクトル量子化回路109より出力される長期駆動信号ｔ
および短期ベクトル量子化回路111より出力される短期
駆動信号ｙを入力とし、駆動信号e_Xをサブフレーム単位
で長期ベクトル量子化回路109へ出力する。具体的に
は、例えばｔとｙをサブフレーム単位でサンプル毎に加
算したものを駆動信号e_Xとすればよい。現サブフレーム
の駆動信号e_Xは、次のサブフレームにおいて過去の駆動
信号として長期ベクトル量子化回路109において使用で
きるように、駆動信号保持回路110内のバッファメモリ
に保持される。Returning to FIG. 13, the drive signal holding circuit 110 outputs the long-term drive signal t output from the long-term vector quantization circuit 109.
The short-term drive signal y output from the short-term vector quantization circuit 111 is input, and the drive signal e _X is output to the long-term vector quantization circuit 109 in subframe units. Specifically, for example, those obtained by adding the t and y for each sample in the sub-frames may be used as the driving signal e _X. The drive signal e _X of the current sub-frame is held in a buffer memory in the drive signal holding circuit 110 so that it can be used in the long-term vector quantization circuit 109 as a past drive signal in the next sub-frame.

加算回路112は、サブフレーム単位で量子化出力u^(m)
および^(m)と、現サブフレームで作成された過去の影
響信号ｆとの和信号を求め、影響信号作成回路107へ
出力する。The adder circuit 112 outputs a quantized output u ^{(m) in} subframe units.
And a sum signal of ^(m) and the past influence signal f created in the current subframe, and outputs the sum signal to the influence signal creation circuit 107.

なお、上述した第７の実施例では駆動信号の密度パタ
ーン選択を短期予測残差信号のパワーに基づいて行った
が、短期予測残差信号の零交差数に基づいて行うことも
できる。この方式に基づく第８の実施例に係る符号化装
置のブロック図を第14図に示す。In the above-described seventh embodiment, the density pattern of the drive signal is selected based on the power of the short-term prediction residual signal, but may be determined based on the number of zero crossings of the short-term prediction residual signal. FIG. 14 is a block diagram of an encoding apparatus according to the eighth embodiment based on this method.

第14図において、零交差数計算回路123はサブフレー
ム単位で短期予測残差信号ｒがｒ＝０を交差する回数を
カウントし、その値を密度パターン選択回路122へ出力
する。この場合、密度パターン選択回路122はサブフレ
ーム毎の零交差数の大きさに基づいて、予め設定された
密度パターンの中から一つのパターンを選択する。In FIG. 14, the zero-crossing number calculation circuit 123 counts the number of times that the short-term prediction residual signal r crosses r = 0 in subframe units, and outputs the value to the density pattern selection circuit 122. In this case, the density pattern selection circuit 122 selects one of the preset density patterns based on the number of zero crossings for each subframe.

また、密度パターン選択を短期予測残差信号にピッチ
予測を適用して得られるピッチ予測残差信号のパワーま
たは零交差数に基づいて行うこともできる。第15図に、
ピッチ予測残差信号のパワーに基づいて密度パターン選
択を行うようにした第９の実施例に係る符号化装置を示
し、また第16図にピッチ予測残差信号の零交差数に基づ
いて密度パターン選択を行うようにした第10の実施例に
係る符号化装置を示す。第15図および第16図は、それぞ
れ第13図および第14図におけるパワー計算回路113およ
び零交差数計算回路123の前にピッチ分析回路124、ピッ
チ予測フィルタ125を配置した構成となっている。ピッ
チ分析回路124はピッチ周期とピッチゲインを計算する
回路であり、その計算結果をピッチ予測フィルタ125へ
出力し、ピッチ予測フィルタ125はピッチ予測残差信号
をパワー計算回路113または零交差数計算回路123へ出力
する。ピッチ周期とピッチゲインは、公知の方法、例え
ば自己相関法や共分散法で求めることができる。Further, the density pattern can be selected based on the power or the number of zero crossings of the pitch prediction residual signal obtained by applying the pitch prediction to the short-term prediction residual signal. In Figure 15,
FIG. 16 shows an encoding apparatus according to a ninth embodiment in which a density pattern is selected based on the power of a pitch prediction residual signal. FIG. 16 shows a density pattern based on the number of zero crossings of the pitch prediction residual signal. 15 shows an encoding apparatus according to a tenth embodiment for performing selection. FIGS. 15 and 16 show a configuration in which a pitch analysis circuit 124 and a pitch prediction filter 125 are arranged before the power calculation circuit 113 and the zero-crossing number calculation circuit 123 in FIGS. 13 and 14, respectively. The pitch analysis circuit 124 is a circuit that calculates the pitch period and the pitch gain, and outputs the calculation result to the pitch prediction filter 125. The pitch prediction filter 125 outputs the pitch prediction residual signal to the power calculation circuit 113 or the number of zero-crossings calculation circuit. Output to 123. The pitch period and the pitch gain can be obtained by a known method, for example, an autocorrelation method or a covariance method.

第17図は本発明の第11の実施例に係る音声符号化装置
のブロック図である。FIG. 17 is a block diagram of a speech coding apparatus according to an eleventh embodiment of the present invention.

第17図において、フレームバッファ101は入力端子100
に入力される音声信号を１フレーム分蓄積する回路であ
り、第17図の各ブロックはフレームバッファ101を用い
て１フレーム毎に以下の処理を行う。In FIG. 17, a frame buffer 101 has an input terminal 100.
Is a circuit for accumulating one frame of the audio signal input to each block. Each block in FIG. 17 performs the following processing for each frame using the frame buffer 101.

影響信号作成回路107は、加算回路112からの過去の重
み付けされた合成信号と、予測パラメータの復号値Ｐ
を入力とし、過去の影響信号ｆを出力する。具体的には
過去の重み付けされた合成信号をフィルタの内部状態
とする聴感重みフィルタの零入力応答を計算し、それを
影響信号ｆとして、予め設定されるサブフレーム単位で
出力する。8kHzサンプリング時のサブフレーム中の典型
的な値としては、１フレーム（160サンプル）を４分割
した40サンプル程度が使用される。影響信号作成回路10
7は、第１サブフレームにおいては前フレームで決定し
た密度パターンＫに基づいて作成された前フレームの合
成信号を入力して影響信号ｆを作成する。減算回路10
6は、サブフレーム単位で聴感重み付き入力信号ｘから
過去の影響信号ｆを差し引いた信号ｕを減算回路108お
よび長期ベクトル量子化回路109へ出力する。The influence signal creation circuit 107 includes the past weighted synthesized signal from the addition circuit 112 and the decoded value P of the prediction parameter.
And outputs a past influence signal f. Specifically, it calculates a zero input response of an auditory weighting filter that uses the past weighted synthesized signal as an internal state of the filter, and outputs it as an influence signal f in preset subframe units. As a typical value in a subframe at the time of 8 kHz sampling, about 40 samples obtained by dividing one frame (160 samples) into four are used. Influence signal creation circuit 10
7 inputs the synthesized signal of the previous frame generated based on the density pattern K determined in the previous frame in the first sub-frame to generate the influence signal f. Subtraction circuit 10
6 outputs, to the subtraction circuit 108 and the long-term vector quantization circuit 109, a signal u obtained by subtracting the past influence signal f from the perceptually weighted input signal x in subframe units.

パワー計算回路131は、予測フィルタ104の出力である
短期予測残差信号の２乗平均値をサブフレーム単位で計
算し、その値をビット配分計算回路132へ出力する。The power calculation circuit 131 calculates the mean square value of the short-term prediction residual signal output from the prediction filter 104 in subframe units, and outputs the value to the bit allocation calculation circuit 132.

ビット配分計算回路132は、短期予測残差信号の２乗
平均値を基に各サブフレームに割り当てる駆動信号のビ
ット配分の計算する。ビット配分の計算は、割り当てら
れるビット数の総和が一定である条件の下で、量子化誤
差の２乗平均値を最小化する最適ビット配分の式を用い
て行うことができる。最適ビット配分については、N.S.
Tayant and P.Nall:“DIGITAL CODING OF WAVEFORMS",P
RENTICE−HALL,1984（文献４）に記述されているので、
ここでは説明を省略する。割り当てられるビット数の総
和をＲビット、サブフレーム数をＭ＝４、各サブフレー
ムの短期予測残差信号の２乗平均値をσi²とおくと、ｉ
番目のサブフレームのビット配分biは次式により計算さ
れる。The bit allocation calculation circuit 132 calculates the bit allocation of the drive signal allocated to each subframe based on the mean square value of the short-term prediction residual signal. The calculation of the bit allocation can be performed using an optimum bit allocation formula that minimizes the root mean square value of the quantization error under the condition that the total number of allocated bits is constant. For optimal bit allocation, NS
Tayant and P. Nall: “DIGITAL CODING OF WAVEFORMS”, P
RENTICE-HALL, 1984 (Reference 4)
Here, the description is omitted. If the total number of allocated bits is R bits, the number of subframes is M = 4, and the mean square value of the short-term prediction residual signal of each subframe is σi ² , i
The bit distribution bi of the subframe is calculated by the following equation.

ビット配分テーブル134は、予め設定された駆動信号
の密度パターンと１対１に対応したサブフレームのビッ
ト配分値を格納するものであり、パターン番号とピット
配分値を密度パターン選択回路133へ出力する。次表２
に、密度パターンとビット配分値の例を示す。但し、こ
こではＲ＝50,M＝４としている。 The bit allocation table 134 stores the bit allocation value of the subframe corresponding to the density pattern of the drive signal set in advance, and outputs the pattern number and the pit allocation value to the density pattern selection circuit 133. . Next Table 2
Shows examples of density patterns and bit allocation values. Here, R = 50 and M = 4.

密度パターン選択回路133は、ビット配分計算回路132
の出力b_iを要素とするベクトルＢ＝（b₁,b₂,…b_M）と、
ビット配分テーブル134に格納されている各サブフレー
ムのビット配分値を要素とするベクトルB_K（Ｋ＝1,2,,
…,M、Ｋは密度パターン番号）との距離を計算し、その
距離が最小となる密度パターンの番号Ｋを密度パターン
情報として短期ベクトル量子化回路111とマルチプレク
サ117へ出力する。 The density pattern selection circuit 133 includes a bit allocation calculation circuit 132
A vector B = (b ₁ , b ₂ ,... B _M ) whose elements are the output b _i of
A vector B _K (K = 1, 2 ,,...) Having the bit allocation value of each subframe stored in the bit allocation table 134 as an element
, M, and K are the density pattern numbers), and outputs the number K of the density pattern having the minimum distance to the short-term vector quantization circuit 111 and the multiplexer 117 as density pattern information.

長期ベクトル量子化回路109は、減算回路106からの差
信号ｕ、後述の駆動信号保持回路110からの過去の駆動
信号e_Xおよび符号化回路103からの予測パラメータＰを
入力とし、サブフレーム単位で差信号ｕの量子化出力信
号を減算回路108および加算回路112へ、ベクトルゲイ
ンβおよびインデックスＴをマルチプレクサ117へ、長
期駆動信号ｔを駆動信号保持回路110へそれぞれ出力す
る。このときｔととの間には、＝ｔ＊ｈ（ｈは聴感
重みフィルタ105のインパルス応答、＊は畳み込みを表
わす）という関係がある。サブフレーム単位のベクトル
ゲインβ^(m)とインデックスT^(m)（ｍはサブフレームの
番号）の詳細な求め方は、第５の実施例と同様でよい。The long-term vector quantization circuit 109 receives the difference signal u from the subtraction circuit 106, the past drive signal e _X from the drive signal holding circuit 110 described later, and the prediction parameter P from the encoding circuit 103, and receives a sub-frame unit. The quantized output signal of the difference signal u is output to the subtraction circuit 108 and the addition circuit 112, the vector gain β and the index T are output to the multiplexer 117, and the long-term drive signal t is output to the drive signal holding circuit 110. At this time, there is a relationship between t and t = t * h (h represents the impulse response of the audibility weighting filter 105 and * represents convolution). The detailed method of obtaining the vector gain β ^(m) and index T ^(m) (m is the number of the subframe ^{) in} subframe units may be the same as in the fifth embodiment.

一方、減算回路108ではサブフレーム単位で差信号ｕ
から量子化出力信号ｕを減じた差信号Ｖを短期ベクトル
量子化回路111へ出力する。On the other hand, in the subtraction circuit 108, the difference signal u is
And outputs a difference signal V obtained by subtracting the quantized output signal u from the short-term vector quantization circuit 111.

短期ベクトル量子化回路111は、差信号Ｖ、予測パラ
メータＰおよび密度パターン選択回路133より出力され
る密度パターン番号Ｋを入力とし、サブフレーム単位で
差信号Ｖの量子化出力信号を加算回路112へ、短期駆
動信号ｙを駆動信号保持回路110へそれぞれ出力する。
ここでとｙとの間には、＝ｙ＊ｈという関係があ
る。The short-term vector quantization circuit 111 receives as input the difference signal V, the prediction parameter P, and the density pattern number K output from the density pattern selection circuit 133, and sends the quantization output signal of the difference signal V to the addition circuit 112 in subframe units. , And outputs the short-term drive signal y to the drive signal holding circuit 110.
Here, there is a relationship of = y * h between and y.

また、これと共に短期ベクトル量子化回路111は駆動
パルス列のゲインＧ、位相情報Ｊおよびコードベクトル
のインデックスＩをマルチプレクサ117へ出力する。こ
のとき、サブフレーム単位で出力されるパラメータG,J,
Iは、密度パターン番号Ｋで決まる現サブフレーム（第
ｍサブフレーム）の密度（パルス間隔）に応じたパルス
数N^(m)をサブフレーム内で符号化しなければならないの
で、予め設定されるコードベクトルの次元数N_D（１つず
つのコードベクトルを構成するパルス数）に応じた個
数、すなわちN^(m)/N_D個ずつ出力される。At the same time, the short-term vector quantization circuit 111 outputs the gain G of the drive pulse train, the phase information J, and the index I of the code vector to the multiplexer 117. At this time, the parameters G, J,
Since the number of pulses N ^(m) corresponding to the density (pulse interval) of the current sub-frame (m-th sub-frame) determined by the density pattern number K must be encoded in the sub-frame, number corresponding to the number of dimensions N _D vector (number of pulses constituting one each codevector), that is, output N ^(m) / N _D pieces by.

位相シフト回路303は、この合成ベクトルV₁ ⁽ⁱ⁾を密度
パターン情報Ｋに基づいて所定のサンプル数だけ遅延さ
せて位相の異なる合成ベクトルV₂ ⁽ⁱ⁾,V₃ ⁽ⁱ⁾,…V_j ⁽ⁱ⁾,…
を作成し、内積計算回路304およびパワー計算回路305へ
出力する。コードブック302は適応密度パルスの振幅情
報を格納し、インデックスｉに対して予め定められたコ
ードベクトルC⁽ⁱ⁾が引き出し可能なメモリ回路またはベ
クトル発生回路で構成される。内積計算回路304は、第1
7図の減算回路108からの差信号Ｖと、合成ベクトルV_j
⁽ⁱ⁾との内積値A_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。パワー計算回路305は、合成ベクトル
V_j ⁽ⁱ⁾のパワーB_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。The phase shift circuit 303 delays the combined vector V ₁ ⁽ⁱ⁾ by a predetermined number of samples based on the density pattern information K, and combines the combined vectors V ₂ ⁽ⁱ⁾ , V ₃ ⁽ⁱ⁾ _,. ⁽ⁱ⁾ ,…
Is generated and output to the inner product calculation circuit 304 and the power calculation circuit 305. The codebook 302 stores the amplitude information of the adaptive density pulse, and is configured by a memory circuit or a vector generation circuit from which a predetermined code vector C ⁽ⁱ⁾ can be derived for an index i. The inner product calculation circuit 304
And a difference signal V from the 7 Figure subtraction circuit 108, the resultant vector V _j
obtains an inner product value A _j ⁽ⁱ⁾ and ^(i), and outputs the index phase selection circuit 306. The power calculation circuit 305 calculates the composite vector
Power B obtains the _j ⁽ⁱ⁾ of V _j ^(i), and outputs the index phase selection circuit 306.

インデックス・位相選択回路306では、内積値A_j ⁽ⁱ⁾と
パワーB_j ⁽ⁱ⁾を用いて、（28）式に示した評価値が最も
大きくなるような位相ＪとインデックスＩを位相候補ｊ
とインデックス候補ｉの中から選択し、対応する内積値
A_J ^(I)とパワーB_J ^(I)の組をゲイン符号化回路307へ出力
する。また、インデックス・位相選択回路306はさらに
位相Ｊの情報を短期駆動信号生成回路308および第17図
のマルチプレクサ117へ出力し、インデックスＩの情報
をコードブック302および第17図のマルチプレクサ117へ
出力する。The index / phase selection circuit 306 uses the inner product value A _j ⁽ⁱ⁾ and the power B _j ⁽ⁱ⁾ to determine the phase J and index I for which the evaluation value shown in Expression (28) is the largest, as the phase candidate j
And index candidate i, and the corresponding inner product value
A set of A _J ^(I) and power B _J ^(I) is output to gain encoding circuit 307. The index / phase selection circuit 306 further outputs the information of the phase J to the short-term drive signal generation circuit 308 and the multiplexer 117 of FIG. 17, and outputs the information of the index I to the codebook 302 and the multiplexer 117 of FIG. .

ゲイン符号化回路307では、（29）式に示したインデ
ックス・位相選択回路306からの内積値A_J ^(I)とパワーB_J
^(I)との比を所定の方法で符号化して、そのゲイン情報
Ｇを短期駆動信号生成回路308および第17図のマルチプ
レクサ117へ出力する。In the gain encoding circuit 307, the inner product value A _J ^(I) and the power B _J from the index / phase selection circuit 306 shown in Expression (29) are obtained.
The ratio to ^(I) is encoded by a predetermined method, and the gain information G is output to the short-term drive signal generation circuit 308 and the multiplexer 117 in FIG.

短期駆動信号生成回路308は、密度パターン情報Ｋ、
ゲイン量子化値Ｇ、位相情報ＪおよびインデックスＩに
対応するコードベクトルC^(I)を入力とし、ＫおよびC^(I)
を用いて上記合成ベクトル生成回路301での方法と同様
の方法で密度情報を持つパルス列を作成し、そのパルス
振幅にゲイン情報に対応する値Ｇを乗じ、位相情報Ｊに
基づき所定のサンプル数だけパルス列を遅延することに
より、短期駆動信号ｙを生成する。この短期駆動信号ｙ
は、聴感重みフィルタ309および第17図の駆動信号保持
回路110へ出力される。聴感重みフィルタ309は第17図の
聴感重みフィルタ105と同様の特性を持つフィルタであ
り、予測パラメータＰを基にして作られ、短期駆動信号
ｙを入力として差信号Ｖの量子化出力Ｖを第１図の加算
回路112へ出力する。The short-term drive signal generation circuit 308 includes density pattern information K,
A gain quantization value G, phase information J and a code vector C ^(I) corresponding to an index I are input, and K and C ^(I)
To generate a pulse train having density information in the same manner as in the synthetic vector generation circuit 301, multiply the pulse amplitude by a value G corresponding to the gain information, and obtain a predetermined number of samples based on the phase information J. The short-term drive signal y is generated by delaying the pulse train. This short-term drive signal y
Are output to the audibility weighting filter 309 and the drive signal holding circuit 110 in FIG. The perceptual weighting filter 309 is a filter having the same characteristics as the perceptual weighting filter 105 of FIG. 17 and is formed based on the prediction parameter P, and receives the short-term drive signal y as input and outputs the quantized output V of the difference signal V. The signal is output to the adding circuit 112 shown in FIG.

第17図に説明を戻すと、駆動信号保持回路110は長期
ベクトル量子化回路109より出力される長期駆動信号ｔ
および短期ベクトル量子化回路111より出力される短期
駆動信号ｙを入力とし、駆動信号e_Xをサブフレーム単位
で長期ベクトル量子化回路109へ出力する。具体的に
は、例えばｔとｙをサブフレーム単位でサンプル毎に加
算したものを駆動信号e_Xとすればよい。現サブフレーム
の駆動信号e_Xは、次のサブフレームにおいて過去の駆動
信号として長期ベクトル量子化回路109において使用で
きるように、駆動信号保持回路110内のバッファメモリ
に保持される。Referring back to FIG. 17, the drive signal holding circuit 110 outputs the long-term drive signal t output from the long-term vector quantization circuit 109.
The short-term drive signal y output from the short-term vector quantization circuit 111 is input, and the drive signal e _X is output to the long-term vector quantization circuit 109 in subframe units. Specifically, for example, those obtained by adding the t and y for each sample in the sub-frames may be used as the driving signal e _X. The drive signal e _X of the current sub-frame is held in a buffer memory in the drive signal holding circuit 110 so that it can be used in the long-term vector quantization circuit 109 as a past drive signal in the next sub-frame.

なお、上述した第11の実施例では駆動信号の密度パタ
ーン選択を短期予測残差信号のパワーに基づいて行った
が、短期予測残差信号の零交差数に基づいて行うことも
できる。この方式に基づく第12の実施例に係る符号化装
置のブロック図を第18図に示すう。In the above-described eleventh embodiment, the selection of the density pattern of the drive signal is performed based on the power of the short-term prediction residual signal. However, the selection may be performed based on the number of zero crossings of the short-term prediction residual signal. FIG. 18 shows a block diagram of an encoding device according to a twelfth embodiment based on this method.

第18図において、零交差数計算回路135はサブフレー
ム単位で短期予測残差信号ｒがｒ＝０を交差する回数を
カウントし、その値をビット配分計算回路132を介して
密度パターン選択回路133へ出力する。この場合、密度
パターン選択回路133はサブフレーム毎の零交差数の大
きさに基づいて、予め設定された密度パターンの中から
一つのパターンを選択する。In FIG. 18, a zero-crossing number calculation circuit 135 counts the number of times that the short-term prediction residual signal r crosses r = 0 for each subframe, and outputs the value via a bit allocation calculation circuit 132 to a density pattern selection circuit 133. Output to In this case, the density pattern selection circuit 133 selects one of the preset density patterns based on the number of zero crossings for each subframe.

また、密度パターン選択を短期予測残差信号にピッチ
予測を適用して得られるピッチ予測残差信号のパワーま
たは零交差数に基づいて行うこともできる。第19図に、
ピッチ予測残差信号のパワーに基づいて密度パターン選
択を行うようにした第13の実施例に係る符号化装置を示
し、また第20図にピッチ予測残差信号の零交差数に基づ
いて密度パターン選択を行うようにした第14の実施例に
係る符号化装置の実施例を示す。第19図および第20図
は、それぞれ第17図および第18図におけるパワー計算回
路131および零交差数計算回路135の前にピッチ分析回路
136、ピッチ予測フィルタ137を配置した構成となってい
る。ピッチ分析回路136はピッチ周期とピッチゲインを
計算する回路であり、その計算結果をピッチ予測フィル
タ137へ出力し、ピッチ予測フィルタ137はピッチ予測残
差信号をパワー計算回路131または零交差数計算回路135
へ出力する。ピッチ周期とピッチゲインは、公知の方
法、例えば自己相関法や共分散法で求めることができ
る。Further, the density pattern can be selected based on the power or the number of zero crossings of the pitch prediction residual signal obtained by applying the pitch prediction to the short-term prediction residual signal. In FIG. 19,
FIG. 20 shows an encoding apparatus according to a thirteenth embodiment in which a density pattern is selected based on the power of a pitch prediction residual signal, and FIG. 20 shows a density pattern based on the number of zero crossings of the pitch prediction residual signal. An example of the encoding device according to the fourteenth example in which selection is performed is shown. FIGS. 19 and 20 show a pitch analysis circuit before the power calculation circuit 131 and the zero-crossing number calculation circuit 135 in FIGS. 17 and 18, respectively.
136 and a pitch prediction filter 137 are arranged. The pitch analysis circuit 136 is a circuit that calculates the pitch period and the pitch gain, and outputs the calculation result to the pitch prediction filter 137. The pitch prediction filter 137 outputs the pitch prediction residual signal to the power calculation circuit 131 or the number of zero crossings. 135
Output to The pitch period and the pitch gain can be obtained by a known method, for example, an autocorrelation method or a covariance method.

第21図は本発明の第15の実施例に係る音声符号化装置
のブロック図である。FIG. 21 is a block diagram of a speech coding apparatus according to a fifteenth embodiment of the present invention.

第21図において、フレームバッファ101は入力端子100
に入力される音声信号を１フレーム分蓄積する回路であ
り、第21図の各ブロックはフレームバッファ101を用い
て１フレーム毎に以下の処理を行う。In FIG. 21, a frame buffer 101 has an input terminal 100.
21 is a circuit for accumulating one frame of audio signal input thereto. Each block in FIG. 21 performs the following processing for each frame using the frame buffer 101.

まず、１フレーム分の音声信号に対し予測パラメータ
計算回路102において、公知の方法を用いて短時間予測
パラメータを計算する（通常、この予測パラメータは８
〜12個計算される）。計算法については、例えば前記文
献２に記述されている。計算された予測パラメータは、
予測パラメータ符号化回路103に入力される。予測パラ
メータ符号化回路103は、予測パラメータを予め定めら
れた量子化ビット数に基づいて符号化し、その符号をマ
ルチプレクサ117へ出力すると共に、復号値Ｐを予測フ
ィルタ104、聴感重みフィルタ105、影響信号作成回路10
7、長期ベクトル量子化回路109及び短期ベクトル量子化
回路111へ出力する。First, the prediction parameter calculation circuit 102 calculates a short-term prediction parameter for one frame of a speech signal using a known method (normally, this prediction parameter is 8
~ 12 are calculated). The calculation method is described in, for example, the aforementioned reference 2. The calculated prediction parameters are
It is input to the prediction parameter coding circuit 103. The prediction parameter encoding circuit 103 encodes the prediction parameter based on a predetermined number of quantization bits, outputs the code to the multiplexer 117, and outputs the decoded value P to the prediction filter 104, the perceptual weight filter 105, the influence signal Creation circuit 10
7. Output to the long-term vector quantization circuit 109 and the short-term vector quantization circuit 111.

パワー計算回路141は、予測フィルタ104の出力である
短期予測残差信号のパワー（２乗和）をサブフレーム単
位で計算し、各サブフレームのパワーを重み付け回路14
2へ出力する。重み付け回路142は、サブフレーム毎のパ
ワーに非線形の重み付けを施した関数値を出力するもの
であり、サブフレーム数が４のとき、例えば重み関数W_i
（ｉ＝1,2,3,4）を次のように設定する。The power calculation circuit 141 calculates the power (sum of squares) of the short-term prediction residual signal output from the prediction filter 104 for each subframe, and weights the power of each subframe.
Output to 2. The weighting circuit 142 outputs a function value obtained by performing non-linear weighting on the power of each subframe. When the number of subframes is 4, for example, the weighting function W _i
(I = 1,2,3,4) is set as follows.

W₁＝0.4 W₂＝0.25 W₃＝0.20 W₄＝0.15 この重み付けは、サブフレームの中で各サブフレーム
の重要性が均一でないために行う処理である。重み付け
回路142の出力信号は、密度パターン選択回路143へ出力
される。 _{_{W 1 = 0.4 W 2 = 0.25}} W 3 = 0.20 W 4 = 0.15 The weighting is a process for the importance of each sub-frame in the sub-frame is not uniform. The output signal of the weighting circuit 142 is output to the density pattern selection circuit 143.

密度パターン選択回路143は、重み付け回路142から出
力されるサブフレーム毎の短期予測段差信号のパワーの
重み付け関数値を基に、予め設定された駆動信号の密度
パターンの中から一つを選択する。具体的には、重み付
け関数値の大きいサブフレームの順に密度が高くなるよ
うに密度パターンを選択する。例えば、等長のサブフレ
ームが４個、密度の種類が２つで、密度パターンを前記
表１のように設定した場合、密度パターン選択回路143
はサブフレーム毎の上記パワーの重み付け関数値を比較
し、その値が最大となるサブフレームが密となる密度パ
ターンの番号Ｋを選択し、それを密度パターン情報とし
て短期ベクトル量子化回路111とマルチプレクサ117へ出
力する。The density pattern selection circuit 143 selects one of preset drive signal density patterns based on a weighting function value of the power of the short-term prediction step signal for each sub-frame output from the weighting circuit 142. Specifically, a density pattern is selected such that the density increases in the order of subframes having the largest weighting function value. For example, when there are four equal-length subframes, two types of density, and density patterns set as shown in Table 1, the density pattern selection circuit 143
Compares the power weighting function value for each sub-frame, selects the number K of the density pattern in which the sub-frame with the maximum value is dense, and uses it as the density pattern information as the short-term vector quantization circuit 111 and the multiplexer. Output to 117.

長期ベクトル量子化回路109は、減算回路106からの差
信号ｕ、後述の駆動信号保持回路110からの過去の駆動
信号e_Xおよび符号化回路103からの予測パラメータＰを
入力とし、サブフレーム単位で差信号ｕの量子化出力信
号を減算回路108および加算回路112へ、ベクトルゲイ
ンβおよびインデックスＴをマルチプレクサ117へ、長
期駆動信号ｔを駆動信号保持回路110へそれぞれ出力す
る。このときｔととの間には、ｕ＝ｔ＊ｈ（ｈは聴感
重みフィルタ105のインパルス応答、＊は畳み込んで表
わす）という関係がある。The long-term vector quantization circuit 109 receives the difference signal u from the subtraction circuit 106, the past drive signal e _X from the drive signal holding circuit 110 described later, and the prediction parameter P from the encoding circuit 103, and receives a sub-frame unit. The quantized output signal of the difference signal u is output to the subtraction circuit 108 and the addition circuit 112, the vector gain β and the index T are output to the multiplexer 117, and the long-term drive signal t is output to the drive signal holding circuit 110. At this time, there is a relationship of u = t * h (h is the impulse response of the auditory weighting filter 105 and * is convoluted) with t.

サブフレーム単位のベクトルゲインβ^(m)とインデッ
クスT^(m)（ｍはサブフレームの番号）の詳細な求め方
は、第５の実施例と同様でよい。The detailed method of obtaining the vector gain β ^(m) and index T ^(m) (m is the number of the subframe ^{) in} subframe units may be the same as in the fifth embodiment.

一方、減算回路108ではサブフレーム単位で差信号ｕ
から量子化出力信号を減じた差信号Ｖを短期ベクトル
量子化回路111へ出力する。On the other hand, in the subtraction circuit 108, the difference signal u is
And outputs the difference signal V obtained by subtracting the quantization output signal from the short-term vector quantization circuit 111.

短期ベクトル量子化回路111は、差信号Ｖ、予測パラ
メータＰおよび密度パターン選択回路143より出力され
る密度パターン番号Ｋを入力とし、サブフレーム単位で
差信号Ｖの量子化出力信号を加算回路112へ、短期駆
動信号ｙを駆動信号保持回路110へそれぞれ出力する。
ここでとｙとの間には、＝ｙ＊ｈという関係があ
る。The short-term vector quantization circuit 111 receives as input the difference signal V, the prediction parameter P, and the density pattern number K output from the density pattern selection circuit 143, and sends the quantized output signal of the difference signal V to the addition circuit 112 in subframe units. , And outputs the short-term drive signal y to the drive signal holding circuit 110.
Here, there is a relationship of = y * h between and y.

位相シフト回路303は、この合成ベクトルV₁ ⁽ⁱ⁾を密度
パターン情報Ｋに基づいて所定のサンプル数だけ遅延さ
せて位相の異なる合成ベクトルV₂ ⁽ⁱ⁾,V₃ ⁽ⁱ⁾,…V_j ⁽ⁱ⁾,…
を作成し、内積計算回路304およびパワー計算回路305へ
出力する。コードブック302は適応密度パルスの振幅情
報を格納し、インデックスｉに対して予め定められたコ
ードベクトルC⁽ⁱ⁾が引き出し可能なメモリ回路またはベ
クトル発生回路で構成される。内積計算回路304は、第2
1図の減算回路108からの差信号Ｖと、合成ベクトルV_j
⁽ⁱ⁾との内積値A_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。パワー計算回路305は、合成ベクトル
V_j ⁽ⁱ⁾のパワーB_j ⁽ⁱ⁾を求め、インデックス・位相選択回
路306へ出力する。The phase shift circuit 303 delays the combined vector V ₁ ⁽ⁱ⁾ by a predetermined number of samples based on the density pattern information K, and combines the combined vectors V ₂ ⁽ⁱ⁾ , V ₃ ⁽ⁱ⁾ _,. ⁽ⁱ⁾ ,…
Is generated and output to the inner product calculation circuit 304 and the power calculation circuit 305. The codebook 302 stores the amplitude information of the adaptive density pulse, and is configured by a memory circuit or a vector generation circuit from which a predetermined code vector C ⁽ⁱ⁾ can be derived for an index i. The inner product calculation circuit 304
And a difference signal V from the 1 Figure subtraction circuit 108, the resultant vector V _j
obtains an inner product value A _j ⁽ⁱ⁾ and ^(i), and outputs the index phase selection circuit 306. The power calculation circuit 305 calculates the composite vector
Power B obtains the _j ⁽ⁱ⁾ of V _j ^(i), and outputs the index phase selection circuit 306.

インデックス・位相選択回路306では、内積値A_j ⁽ⁱ⁾と
パワーB_j ⁽ⁱ⁾を用いて、前記（28）式に示した評価値が
最も大きくなるような位相ＪとインデックスＩを位相候
補ｊとインデックス候補ｉの中から選択し、対応する内
積値A_J ^(I)とパワーB_J ^(I)の組をゲイン符号化回路307へ
出力する。また、インデックス・位相選択回路306はさ
らに位相Ｊの情報を短期駆動信号生成回路308および第2
1図のマルチプレクサ117へ出力し、インデックスＩの情
報をコードブック302および第21図のマルチプレクサ117
へ出力する。The index / phase selection circuit 306 uses the inner product value A _j ⁽ⁱ⁾ and the power B _j ⁽ⁱ⁾ to determine the phase J and index I for which the evaluation value shown in the above equation (28) is the largest, as a phase candidate. j and an index candidate i, and outputs a corresponding set of the inner product value A _J ^(I) and the power B _J ^(I) to the gain encoding circuit 307. The index / phase selection circuit 306 further outputs the information of the phase J to the short-term drive signal generation circuit 308 and the second
The information of the index I is output to the multiplexer 117 shown in FIG.
Output to

ゲイン符号化回路307では、（29）式に示したインデ
ックス・位相選択回路306からの内積値A_J ^(I)とパワーB_J
^(I)との比を所定の方法で符号化して、そのゲイン情報
Ｇを短期駆動信号生成回路308および第21図のマルチプ
レクサ117へ出力する。In the gain encoding circuit 307, the inner product value A _J ^(I) and the power B _J from the index / phase selection circuit 306 shown in Expression (29) are obtained.
The ratio with ^(I) is encoded by a predetermined method, and the gain information G is output to the short-term drive signal generation circuit 308 and the multiplexer 117 in FIG.

短期駆動信号生成回路308は、密度パターン情報Ｋ、
ゲイン情報Ｇ、位相情報ＪおよびインデックスＩに対応
するコードベクトルC^(I)を入力とし、ＫおよびC^(I)を用
いて上記合成ベクトル生成回路301での方法と同様の方
法で密度情報を持つパルス列を作成し、そのパルス振幅
にゲイン情報Ｇに対応する値を乗じ、位相情報Ｊに基づ
き所定のサンプル数だけパルス列を遅延することによ
り、短期駆動信号ｙを生成する。この短期駆動信号ｙ
は、聴感重みフィルタ309および第21図の駆動信号保持
回路110へ出力される。聴感重みフィルタ309は第21図の
聴感重みフィルタ105と同様の特性を持つフィルタであ
り、予測パラメータＰを基にして作られ、短期駆動信号
ｙを入力として差信号Ｖの量子化出力を第21図の加算
回路112へ出力する。The short-term drive signal generation circuit 308 includes density pattern information K,
Gain information G, phase information J, and code vector C ^(I) corresponding to index I are input, and density information is stored using K and C ^(I) in the same manner as in synthetic vector generation circuit 301 described above. A short-term drive signal y is generated by creating a pulse train, multiplying the pulse amplitude by a value corresponding to the gain information G, and delaying the pulse train by a predetermined number of samples based on the phase information J. This short-term drive signal y
Are output to the audibility weighting filter 309 and the drive signal holding circuit 110 in FIG. The perceptual weight filter 309 is a filter having the same characteristics as the perceptual weight filter 105 of FIG. 21 and is formed based on the prediction parameter P, and receives the short-term drive signal y as an input and outputs the quantized output of the difference signal V to the 21st. It is output to the adder circuit 112 in the figure.

第21図に説明を戻すと、駆動信号保持回路110は長期
ベクトル量子化回路109より出力される長期駆動信号ｔ
および短期ベクトル量子化回路111より出力される短期
駆動信号ｙを入力とし、駆動信号e_Xをサブフレーム単位
で長期ベクトル量子化回路109へ出力する。具体的に
は、例えばｔとｙをサブフレーム単位でサンプル毎に加
算したものを駆動信号e_Xとすればよい。現サブフレーム
の駆動信号e_Xは、次のサブフレームにおいて過去の駆動
信号として長期ベクトル量子化回路109において使用で
きるように、駆動信号保持回路110内のバッファメモリ
に保持される。Returning to FIG. 21, the drive signal holding circuit 110 outputs the long-term drive signal t output from the long-term vector quantization circuit 109.
The short-term drive signal y output from the short-term vector quantization circuit 111 is input, and the drive signal e _X is output to the long-term vector quantization circuit 109 in subframe units. Specifically, for example, those obtained by adding the t and y for each sample in the sub-frames may be used as the driving signal e _X. The drive signal e _X of the current sub-frame is held in a buffer memory in the drive signal holding circuit 110 so that it can be used in the long-term vector quantization circuit 109 as a past drive signal in the next sub-frame.

加算回路112は、サブフレーム単位で量子化出力u^(m)
およびV^(m)と、現サブフレームで作成された過去の影響
信号ｆとの和信号を求め、影響信号作成回路107へ出
力する。The adder circuit 112 outputs a quantized output u ^{(m) in} subframe units.
And a sum signal of V ^(m) and the past influence signal f created in the current subframe, and outputs the sum signal to the influence signal creation circuit 107.

なお、上述した第15の実施例では駆動信号の密度パタ
ーン選択を短期予測残差信号のパワーに非線形の重み付
けを施した関数値に基づいて行ったが、短期予測残差信
号の零交差数に非線形の重み付けを施した関数値に基づ
いて行うこともできる。されたものを用いてもよい。こ
の方式に基づく第16の実施例に係る音声符号化装置のブ
ロック図を第22図に示す。In the fifteenth embodiment described above, the density pattern of the drive signal is selected based on the function value obtained by applying nonlinear weighting to the power of the short-term prediction residual signal. It can also be performed based on a function value subjected to nonlinear weighting. You may use what was done. FIG. 22 is a block diagram of a speech coding apparatus according to a sixteenth embodiment based on this method.

第22図において、零交差数計算回路144はサブフレー
ム単位で短期予測残差信号ｒがｒ＝０を交差する回数を
カウントし、その値を重み付け回路142へ出力する。こ
の場合、密度パターン選択回路143はサブフレーム毎の
零交差数の重み付け関数値に基づいて、予め設定された
密度パターンの中から一つのパターンを選択する。In FIG. 22, the zero-crossing number calculation circuit 144 counts the number of times that the short-term prediction residual signal r crosses r = 0 in subframe units, and outputs the value to the weighting circuit 142. In this case, the density pattern selection circuit 143 selects one of the preset density patterns based on the weighting function value of the number of zero crossings for each subframe.

また、密度パターン選択を短期予測残差信号にピッチ
予測を適用して得られるピッチ予測残差信号のパワーま
たは零交差数に重み付けを施した関数値に基づいて行う
こともできる。第23図に、ピッチ予測残差信号のパワー
に非線形の重み付けを施した関数値に基づいて密度パタ
ーン選択を行うようにした第17の実施例に係る符号化装
置を示し、また第24図にピッチ予測残差信号の零交差数
に非線形を重み付けを施した関数値に基づいて密度パタ
ーン選択を行うようにした第18の実施例に係る符号化装
置を示す。Further, the density pattern can be selected based on a function value obtained by weighting the power or the number of zero crossings of the pitch prediction residual signal obtained by applying pitch prediction to the short-term prediction residual signal. FIG. 23 shows an encoding device according to a seventeenth embodiment in which density pattern selection is performed based on a function value obtained by performing nonlinear weighting on the power of the pitch prediction residual signal, and FIG. An encoding apparatus according to an eighteenth embodiment is configured to select a density pattern based on a function value obtained by weighting the number of zero crossings of a pitch prediction residual signal nonlinearly.

第23図および第24図は、それぞれ第21図および第22図
におけるパワー計算回路141および零交差数計算回路144
の前にピッチ分析回路145、ピッチ予測フィルタ146を配
置した構成となっている。ピッチ分析回路145はピッチ
周期とピッチゲインを計算する回路であり、その計算結
果をピッチ予測フィルタ146へ出力し、ピッチ予測フィ
ルタ146はピッチ予測残差信号をパワー計算回路141また
は零交差数計算回路144へ出力する。ピッチ周期とピッ
チゲインは、公知の方法、例えば自己相関法や共分散法
で求めることができる。FIGS. 23 and 24 show the power calculation circuit 141 and the zero-crossing number calculation circuit 144 in FIGS. 21 and 22, respectively.
, A pitch analysis circuit 145 and a pitch prediction filter 146 are arranged. The pitch analysis circuit 145 is a circuit that calculates a pitch period and a pitch gain, and outputs the calculation result to the pitch prediction filter 146. The pitch prediction filter 146 outputs the pitch prediction residual signal to the power calculation circuit 141 or the zero-crossing number calculation circuit. Output to 144. The pitch period and the pitch gain can be obtained by a known method, for example, an autocorrelation method or a covariance method.

なお、上述した第５〜第18の実施例では、駆動パルス
列の密度をサブフレーム単位で選択したが、フレーム単
位で選択してもよく、その場合も表１または表２に示し
たような密度パターンを用いればよい。その場合、表1,
表２におけるサブフレーム番号をフレーム番号と読み替
えることとする。但し、第1,2では密度パターンの数が
４となっているが、密度パターンの数は任意に選べるこ
とはいうまでもない。In the above-described fifth to eighteenth embodiments, the density of the drive pulse train is selected in units of subframes, but may be selected in units of frames. A pattern may be used. In that case, Table 1,
The subframe number in Table 2 is read as a frame number. However, although the number of density patterns is four in the first and second, it goes without saying that the number of density patterns can be arbitrarily selected.

また、駆動パルス列の密度選択に関して、第１〜第18
の実施例で説明したサブフレーム単位の選択と、上記フ
レーム単位の選択を組み合わせて実施することも可能で
ある。すなわち、パルス密度を所定の密度パターンに従
ってサブフレーム単位に設定し、かつフレーム単位で密
度パターンが可変である駆動パルス列により駆動信号を
構成する。Further, regarding the selection of the density of the drive pulse train, the first to eighteenth
It is also possible to combine the selection of the sub-frame unit described in the embodiment with the selection of the frame unit. That is, the pulse density is set for each sub-frame in accordance with a predetermined density pattern, and the drive signal is constituted by a drive pulse train whose density pattern is variable for each frame.

具体的には各実施例において説明した密度パターン選
択回路15,116,122,133,143に、例えば次表３に示すよう
に予め設定した複数個（この例では６個）の密度パター
ンを格納したテーブルを用意しておく。但し、この表３
は先と同様サブフレーム数が４の場合であり、表１に示
した４つのパターンに、１フレーム内の全サブフレーム
の駆動パルス列の密度が全て疎のパターンと、全て密の
パターンとが加わり、合計６個のパターンが設定されて
いる。More specifically, the density pattern selection circuits 15, 116, 122, 133, and 143 described in each embodiment are provided with tables storing a plurality of (predetermined in this example, six) density patterns, for example, as shown in Table 3 below. However, this Table 3
Is a case where the number of sub-frames is four, as in the previous case. In addition to the four patterns shown in Table 1, a pattern in which the drive pulse trains of all the sub-frames in one frame are all sparse and a pattern in which all are dense , A total of six patterns are set.

そして、これらの密度パターンの中から、各実施例に
おいて密度パターンを選択した方法と同様の方法によ
り、一つの密度パターンを選択する。例えば第１の実施
例と同様の方法を用いる場合を例にとると、各密度パタ
ーンについて最適なパルス振幅のコードベクトルと位相
を探索し、その時の誤差の２乗和の最小値をフレームに
渡って求める。この結果、得られた最小値の中で最も小
さい最小値を与える密度パターンを選択する。 Then, from these density patterns, one density pattern is selected by a method similar to the method of selecting the density pattern in each embodiment. For example, taking the case where the same method as in the first embodiment is used as an example, a code vector and a phase of the optimum pulse amplitude are searched for each density pattern, and the minimum value of the sum of squares of the error at that time is transmitted over the frame. Ask. As a result, a density pattern that gives the smallest minimum value among the obtained minimum values is selected.

この実施例によると、駆動パルス列の密度をサブフレ
ーム単位で変化させた場合の利点である音声の性質が急
激に変化する部分での合成音声の品質向上と、フレーム
単位で変化させた場合の利点である音声の性質が比較的
緩やかに変化する部分での合成音声の品質向上の効果が
両方得られ、音声がどのように変化する部分でも合成音
声の品質が一様に向上するという利点がある。According to this embodiment, the advantage of changing the density of the driving pulse train on a subframe basis is the improvement of the quality of the synthesized speech in the portion where the nature of the speech changes abruptly, and the advantage of changing on the frame basis There is an advantage that the effect of improving the quality of the synthesized voice is obtained both in the portion where the nature of the voice changes relatively slowly, and the quality of the synthesized voice is uniformly improved in the portion where the voice changes. .

なお、以上の実施例では駆動パルス列の密度（パルス
間隔）を疎密の２種類としたが、３以上の多段階に密度
を変えるようにしてもよい。In the above-described embodiment, the density (pulse interval) of the drive pulse train is set to two types, ie, sparse and dense. However, the density may be changed in three or more stages.

［発明の効果］本発明によれば、合成フィルタを駆動するための駆動
信号を構成する駆動パルスのパルス密度を、重要な情報
または多くの情報が含まれるサブフレームまたはフレー
ムでは密に、そうでないサブフレームまたはフレームで
は粗くというようにサブフレームまたはフレーム毎に変
えるか、または適応的にサブフレーム単位かフレーム単
位で変えることにより、例えば10kb/s以下というような
低ビットレートの場合でも、品質の高い合成音声を再生
することができる。[Effects of the Invention] According to the present invention, the pulse density of a drive pulse constituting a drive signal for driving a synthesis filter is densely set in a sub-frame or a frame including important information or a large amount of information, and not so. By changing each sub-frame or frame such that it is coarse in a sub-frame or frame, or adaptively changing each sub-frame or frame, even at a low bit rate such as 10 kb / s or less, the quality can be improved. High synthesized speech can be reproduced.

[Brief description of the drawings]

第１図および第２図は本発明の第１の実施例に係る符号
化装置および復号化装置の構成をそれぞれ示すブロック
図、第３図は第１図における予測フィルタの一構成例を
示すブロック図、第４図は同実施例で生成される駆動信
号の一例を示す図、第５図、第６図および第７図はそれ
ぞれ本発明の第２、第３および第４の実施例に係る符号
化装置の構成を示すブロック図、第８図および第９図は
本発明の第５の実施例に係る符号化装置および復号化装
置の構成をそれぞれ示すブロック図、第10図は第８図に
おける短期ベクトル量子化回路の一構成例を示すブロッ
ク図、第11図は第９図における駆動信号生成回路の一構
成例を示すブロック図、第12図乃至第24図はそれぞれ本
発明の第６乃至第18の実施例に係る符号化装置の構成を
示すブロック図、第25図は従来技術による符号器の構成
を示すブロック図、第26図は同じく復号器の構成を示す
ブロック図、第27図は従来方式による駆動信号の例を示
す図である。 14,104……予測フィルタ 15,116,122,133,143……密度パターン選択回路 17……駆動信号生成回路 20,105……聴感重みフィルタ 21……２乗誤差計算回路、 55,115,121,141……パワー計算回路 114,142……重みフィルタ 119,124,136,145……ピッチ分析回路 120,125,137,146……ピッチ予測回路 123,135,144……零交差数計算回路 132……ビット配分計算回路 134……ビット配分テーブル1 and 2 are block diagrams respectively showing the configuration of an encoding device and a decoding device according to a first embodiment of the present invention, and FIG. 3 is a block diagram showing an example of the configuration of a prediction filter in FIG. FIG. 4 is a diagram showing an example of a drive signal generated in the embodiment, and FIGS. 5, 6, and 7 are diagrams according to the second, third, and fourth embodiments of the present invention, respectively. 8 and 9 are block diagrams showing the configurations of an encoding device and a decoding device, respectively, according to a fifth embodiment of the present invention. FIG. 10 is a block diagram showing the configuration of the encoding device. FIG. 11 is a block diagram showing a configuration example of a short-term vector quantization circuit in FIG. 11, FIG. 11 is a block diagram showing a configuration example of a drive signal generation circuit in FIG. 9, and FIGS. 25 is a block diagram illustrating a configuration of a coding apparatus according to the thirteenth to eighteenth embodiments. Block diagram showing the configuration of an encoder according to the prior art, Figure 26 is a block diagram similarly showing the configuration of a decoder, Figure 27 is a diagram showing an example of a driving signal according to a conventional method. 14,104 prediction filter 15,116,122,133,143 density pattern selection circuit 17 drive signal generation circuit 20,105 hearing weight filter 21 square error calculation circuit 55,115,121,141 power calculation circuit 114,142 weight filter 119,124,136,145 pitch analysis Circuits 120, 125, 137, 146 Pitch prediction circuits 123, 135, 144… Zero crossing number calculation circuit 132… Bit allocation calculation circuit 134… Bit allocation table

───────────────────────────────────────────────────── フロントページの続き (31)優先権主張番号特願平2−25840 (32)優先日平成２年２月５日(1990．2．5) (33)優先権主張国日本（ＪＰ） (56)参考文献 1989年電子情報通信学会秋季全国大会講演論文集，分冊１，「Ａ−３適応密度パルス列モデルに基づくＣＥＬＰ符号化方式」，ｐ．１−３，（1989年８月15 日発行) 昭和63年電子情報通信学会秋季全国大会講演論文集，分冊Ａ−１，「Ａ−６ＡＲＭＡモデルを用いた８ｋｂｐｓ音声符号化方式の検討」，ｐ．Ａ−１−６, （昭和63年８月15日発行) 1989年電子情報通信学会春季全国大会講演論文集，分冊１，「Ａ−５ＡＲＭＡモデルと適応密度パルス列モデルに基づく音声符号化方式」，ｐ．１−５, （1989年３月15日発行) 1990年電子情報通信学会春季全国大会講演論文集，分冊１，「Ａ−226 ＡＤＰ−ＣＥＬＰ方式におけるパラメータの符号誤り感度」，ｐ．１−226，（1990 年３月５日発行) 1990年電子情報通信学会春季全国大会講演論文集，分冊１，「ＳＡ−５−７ＡＤＰ−ＣＥＬＰ符号化方式のＤＳＰによる実現の検討」，ｐ．１−433, （1990年３月５日発行) 1990年電子情報通信学会秋季全国大会講演論文集，分冊３，「４．８ｋｂｐｓＡＤＰ−ＣＥＬＰ音声コーデックの開発」，ｐ．３−285，（1990年９月15日発行) ＰｒｏｃｅｅｄｉｎｇｓｏｆＩＥＥＥ 1989 ＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ，Ｖｏｌ．１，”Ｓ４．８ＡＲＭＡＭｏｄｅｌＢａｓｅｄＳｐｅｅｃｈＣｏｄｉｎｇａｔ８ｋｂ／ｓ" ｐ．148−151 ＰｒｏｃｅｅｄｉｎｇｓｏｆＩＥＥＥ 1990 ＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ，Ｖｏｌ．１，”Ｓ１．８ＣＥＬＰＣｏｄｉｎｇｗｉｔｈａｎＡｄａｐｔｉｖｅＤｅｎｓｉｔｙＰｕｌｓｅＥｘｃｉｔａｔｉｏｎＭｏｄｅｌ”ｐ．29−32 ＣｏｎｆｅｒｅｎｃｅＲｅｃｏｒｄｏｆＩＥＥＥＧｌｏｂａｌＴｅｌｅｃｏｍｍｕｎｉｃａｔｉｏｎｓＣｏｎｆｅｒｅｎｃｅ，ＧＬＯＢＥＣＯＭ ’91，Ｖｏｌ．１ｏｆ３，”ＩｍｐｒｏｖｅｍｅｎｔｏｆＡＤＰ−ＣＥＬＰＳｐｅｅｃｈＣｏｄｉｎｇａｔ４Ｋｂｉｔｓ／ｓ”，ｐ．53．２. １−53．２．５，Ｐｈｏｅｎｉｘ，Ａｒｉｚｏｎａ，Ｄｅｃｅｍｂｅｒ２− ５，1991 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 11/00 - 13/08 G10L 19/00 - 21/06 ＩＮＳＰＥＣ（ＤＩＡＬＯＧ) ＪＩＣＳＴファイル（ＪＯＩＳ) ＩＥＥＥ／ＩＥＥＥｌｅｃｔｒｏｎｉｃＬｉｂｒａｒｙＯｎｌｉｎｅ──────────────────────────────────────────────────続き Continued on the front page (31) Priority claim number Japanese Patent Application No. 2-25840 (32) Priority date February 5, 1990 (1990.2.5) (33) Priority claim country Japan (JP) (56) References 1989 IEICE Autumn National Convention Lecture Papers, Supplement 1, “A-3 CELP Coding Scheme Based on Adaptive Density Pulse Train Model”, p. 1-3, (August 15, 1989) Proceedings of the IEICE Autumn Meeting, 1988, Supplement A-1, “Study on 8 kbps speech coding using A-6 ARMA model” , P. A-1-6, (published August 15, 1988) Proceedings of the 1989 IEICE Spring Conference, Volume 1, "A-5 Speech Coding Based on ARM A Model and Adaptive Density Pulse Train Model" Method ", p. 1-5, (issued March 15, 1989) Proceedings of the 1990 IEICE Spring Conference, Volume 1, “Code Error Sensitivity of Parameters in A-226 ADP-CELP System,” p. 1-226, (published March 5, 1990) Proceedings of the 1990 IEICE Spring Conference, Volume 1, “Analysis of DSP Implementation of SA-5-7 ADP-CELP Coding Scheme,” p. . 1-433, (published March 5, 1990) Proceedings of the 1990 IEICE National Conference, Volume 3, “Development of 4.8 kbps ADP-CELP Speech Codec”, p. 3-285, (published September 15, 1990) Proceedings of IE 1989 International Conference on Acoustics, Speech and Signal Processing, Vol. 1, "S4.8 ARMA Model Based Speed Coding at 8 kb / s" p. 148-151 Proceedings of IEEE 1990 International Conference on Acoustics, Speech and Signal Processing, Vol. 1, "S1.8 CELP Coding with an Adaptive Density Pulse Excitation Model" p. 29-32 Conference Record of IEEE Global Telecommunications Conference, GLOBECOM '91, Vol. 1 of 3, "Improvement of ADP-CE LP Speech Coding at 4 Kbits / s", p. 53.2.1-53.2.5, Phoenix, Arizona, December 2-5, 1991 (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 11/00-13/08 G10L 19 / 00-21/06 INSPEC (DIALOG) JICST File (JOIS) IEEE / IEEE Electronic Library Online

Claims

(57) [Claims]

1. A drive signal generating means for generating a drive signal comprising a drive pulse train whose pulse density is variable in a predetermined section unit, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input Means for determining the amplitude or the amplitude and phase of the drive pulse train so that the power of the perceptual weighted error signal with the audio signal is minimized, and passing the short-term prediction residual signal to the input audio signal through a perceptual weight filter Density determining means for determining a pulse density of the driving pulse train based on the obtained signal.

2. A drive signal generating means for generating a drive signal composed of a drive pulse train whose pulse density is variable in predetermined intervals, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptual weighted error signal with the voice signal is minimized, and passing the pitch prediction residual signal for the input voice signal through a perceptual weight filter Density determining means for determining a pulse density of the driving pulse train based on the obtained signal.

3. A drive signal generating means for generating a drive signal comprising a drive pulse train whose pulse density is variable in a predetermined section unit; a synthesis filter driven by the drive signal; an output signal of the synthesis filter; Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptually weighted error signal with the audio signal is minimized, and performing pitch prediction on the short-term prediction residual signal for the input audio signal. And a density determining means for determining a pulse density of the drive pulse train based on a signal obtained by passing a pitch prediction residual signal obtained through an auditory weighting filter.

4. A drive signal generating means for generating a drive signal consisting of a drive pulse train whose pulse density is variable in predetermined intervals, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input. Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptually weighted error signal with the audio signal is minimized, and performing pitch prediction on the short-term prediction residual signal for the input audio signal. And a density determining means for determining a pulse density of the drive pulse train based on a pitch prediction residual signal obtained.

5. The pulse density is equally spaced within a predetermined section,
A drive signal generating means for generating a drive signal composed of a drive pulse train that is variable in units of the predetermined section; a synthesis filter driven by the drive signal; and an audible weight of an output signal of the synthesis filter and an input audio signal. Means for determining the amplitude or the amplitude and phase of the drive pulse train so that the power of the error signal is minimized; and assigned to each subframe of the drive signal based on a short-term prediction residual signal for the input audio signal. A speech coding apparatus comprising: means for calculating a bit allocation value; and density determining means for determining a pulse density of the drive pulse train based on the bit allocation value calculated by the means.

6. A drive signal generating means for generating a drive signal comprising a drive pulse train whose pulse density is variable in a predetermined section unit; a synthesis filter driven by the drive signal; an output signal of the synthesis filter; Means for determining the amplitude or the amplitude and phase of the drive pulse train so that the power of the perceptual weighted error signal with the audio signal is minimized; and the drive signal based on a pitch prediction residual signal for the input audio signal. And a density determining means for determining a pulse density of the driving pulse train based on the bit allocation value calculated by the means. Device.

7. A drive signal generating means for generating a drive signal composed of a drive pulse train whose pulse density is variable in a predetermined section unit; a synthesis filter driven by the drive signal; an output signal of the synthesis filter; Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptually weighted error signal with the audio signal is minimized, and performing pitch prediction on the short-term prediction residual signal for the input audio signal. Means for calculating a bit allocation value assigned to each subframe of the drive signal based on the obtained pitch prediction residual signal; and determining a pulse density of the drive pulse train based on the bit allocation value calculated by the means. A speech coding apparatus comprising: a density determining unit.

8. A drive signal generating means for generating a drive signal composed of a drive pulse train whose pulse density is variable in predetermined intervals, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input. Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptual weighted error signal with the audio signal is minimized, and the power or the number of zero crossings of the short-term prediction residual signal for the input audio signal. And a density determining means for determining a pulse density of the drive pulse train based on a function value subjected to nonlinear weighting.

9. A drive signal generating means for generating a drive signal composed of a drive pulse train whose pulse density is variable in predetermined intervals, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input. Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptual weighted error signal with the audio signal is minimized, and the power or the number of zero crossings of the pitch prediction residual signal for the input audio signal And a density determining means for determining a pulse density of the drive pulse train based on a function value subjected to nonlinear weighting.

10. A drive signal generating means for generating a drive signal comprising a drive pulse train whose pulse density is variable in predetermined intervals, a synthesis filter driven by the drive signal, an output signal of the synthesis filter and an input. Means for determining the amplitude or amplitude and phase of the drive pulse train so that the power of the perceptually weighted error signal with the audio signal is minimized, and performing pitch prediction on the short-term prediction residual signal for the input audio signal. And a density determining means for determining a pulse density of the driving pulse train based on a function value obtained by nonlinearly weighting the power or the number of zero crossings of the pitch prediction residual signal to be obtained.

11. The driving signal generating means according to claim 1, wherein said driving signal generating means comprises: a driving pulse train in which a frame is divided into a plurality of sub-frames, and a pulse density is variable in sub-frame units or frame units at equal intervals in each sub-frame or each frame. 3. A driving signal comprising:
11. The speech encoding device according to any one of 10.

12. The drive signal generating means, wherein the frame is divided into a plurality of subframes, the pulse density is set in subframe units according to a predetermined density pattern, and the density pattern is variable in frame units. 11. The speech encoding apparatus according to claim 1, wherein a drive signal including a pulse train is generated, and the density determination unit determines a density pattern of the drive pulse train.