JPH034300A

JPH034300A - Voice encoding and decoding system

Info

Publication number: JPH034300A
Application number: JP1139524A
Authority: JP
Inventors: Kazunori Ozawa; 一範小澤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-05-31
Filing date: 1989-05-31
Publication date: 1991-01-10
Anticipated expiration: 2014-02-03
Also published as: JP2853170B2

Abstract

PURPOSE:To reproduce a voice excellently with a small arithmetic quantity irrelevantly to a bit rate by dividing a frame into subframes corresponding to a pitch period and using multipulses which are found by pitch interpolation and multipulses which are found on the whole frame without pitch prediction. CONSTITUTION:The voice signal from an input terminal 100 is divided into specific frames and an LPC and a pitch analysis part 150 finds an LPC coefficient of specific degree as a spectrum parameter representing a spectrum envelope from the voice signal of a frame. A pitch interpolation multipulse calculator 250 divides the frame into subframes by using a pitch period, finds the amplitude and position of multipulses in one specific section, and finds gain correction coefficients and phase correction coefficients of other subframes. Then a sound source signal is restored by specific arithmetic operation through a subtracter 260 and a multipulse calculation part 270 and then processed by composition. Consequently, the excellent voice can be reproduced with a small arithmetic quantity irrelevantly to the bit rate.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は音声信号を低いビットレートで効率的に符号化
し、復号化するための音声符号化復号北方式に関する。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to an audio encoding and decoding method for efficiently encoding and decoding audio signals at low bit rates.

（従来の技術）音声信号を低いビットレート、例えば１６Ｋｂ／ｓ程度
以下で伝送する方式としては、マルチパルス符号化法な
どが知られている。これらは音源信号を複数個のパルス
組合せ（マルチパルス）で表し、声道の特徴をデジタル
フィルタで表し、音源パルスの情報とフィルタの係数を
、一定時間区間（フレーム）毎に求めて伝送している。(Prior Art) As a method for transmitting audio signals at a low bit rate, for example, about 16 Kb/s or less, a multipulse encoding method is known. These represent the sound source signal as a combination of multiple pulses (multi-pulse), represent the characteristics of the vocal tract with a digital filter, and transmit the information on the sound source pulse and the filter coefficients after determining them for each fixed time interval (frame). There is.

この方法の詳細については、例えばＡｒａｓｅｋｉ、　
Ｏｚａｗａ、　Ｏｎｏ、　０ｃｈｉａｉ氏による“Ｍｕ
ｌｔｉ−ｐｕｌｓｅ　Ｅｘｃｉｔｅｄ　５ｐｅｅｃｈ　
Ｃｏｄｅｒ　Ｂａ５ｅｄ　ｏｎＭａｘｉｍｕｍ　　Ｃｒ
ｏｓｓｃｏｒｒｅｌａｔｉｏｎ　　５ｅａｒｃｈ　　Ａ
ｌｇｏｒｉｔｈｍ”。For details of this method, see for example Araseki,
“Mu” by Ozawa, Ono, Ochiai
lti-pulse Excited 5peech
Coder Ba5ed on Maximum Cr
osscorrelation 5earch A
lgorithm”.

（ＧＬＯＢＥＣＯＭ　８３．　ＩＥＥＥ　Ｇｌｏｂａｌ
　Ｔｅｌｅｃｏｍｍｕｎｉｃａｔｉｏｎ。(GLOBECOM 83. IEEE Global
Telecommunication.

講演番号２３．３．１９８３Ｘ文献１）に記載されてい
る。この方法では、声道情報と音源信号を分離してそれ
ぞれ表現すること、および音源信号を表現する手段とし
て複数のパルス列の組合せ（マルチパルス）を用いるこ
とにより、復号後に良好な音声信号を出力できる。音源
信号を表すパルス列を求める基本的な考え方については
第５図を用いて説明する。図中の入力端子９００からは
フレーム毎に分割された音声信号が人力される。合成フ
ィルタ９２０には現フレームの音声信号から求められた
スペクトルパラメータが入力されている。音源計算回路
９１０において初期マルチパルスを発生し、これを前記
合成フィルタ９２０に入力することによって出力として
合成音声波形が得られる。減算器９４０で前記人力信号
から合成音声波形を減する。この結果を重み付は回路９
５０へ入力し、現フレームでの重み付は誤差電力を得る
。そしてこの重み付は誤差電力を最小とするように、音
源計算回路９１０において規定個数のマルチパルスの振
幅と位置を求める。It is described in lecture number 23.3.1983X document 1). This method can output a good audio signal after decoding by separately expressing the vocal tract information and the sound source signal, and by using a combination of multiple pulse trains (multipulse) as a means of expressing the sound source signal. . The basic idea of finding a pulse train representing a sound source signal will be explained using FIG. 5. An audio signal divided into frames is manually inputted from an input terminal 900 in the figure. Spectral parameters determined from the audio signal of the current frame are input to the synthesis filter 920. An initial multipulse is generated in the sound source calculation circuit 910 and inputted to the synthesis filter 920 to obtain a synthesized speech waveform as an output. A subtracter 940 subtracts the synthesized speech waveform from the human input signal. This result is weighted by circuit 9.
50 and weighting in the current frame obtains the error power. Then, the amplitude and position of a specified number of multipulses are determined in the sound source calculation circuit 910 so that this weighting minimizes the error power.

（発明が解決しようとする課題）しかしながら、この従来法ではビットレートが充分に高
く音源パルスの数が充分なときは音質が良好であったが
、ビットレートを下げて行くと音質が低下するという問
題点が合った。(Problem to be solved by the invention) However, with this conventional method, the sound quality was good when the bit rate was high enough and the number of sound source pulses was sufficient, but as the bit rate was lowered, the sound quality deteriorated. I agree with the problem.

この問題点を改善するために、マルチパルス音源のピッ
チ毎の準周期性（ピッチ相関）を利用したピッチ予測マ
ルチパルス法が提案されている。この方法の詳細は、例
えば、特願昭５８−１３９０２２号明細書（文献２）に
詳しいのでここでは説明を省略する。In order to improve this problem, a pitch prediction multi-pulse method has been proposed that utilizes the pitch-wise quasi-periodicity (pitch correlation) of a multi-pulse sound source. The details of this method are detailed in, for example, Japanese Patent Application No. 139022/1982 (Document 2), so the explanation will be omitted here.

しかしながら、マルチパルス音源のピッチ毎の準周期性
は大振幅のパルスでは大きいと考えられるが、全てのパ
ルスについてこのような周期性が存在するわけではなく
、振幅の小さなパルスはピッチ毎の周期性は少ないと考
えられる。前記文献２のピッチ予測マルチパルス法では
、フレーム内で予め定められたすべての個数のパルスに
ついてピッチ毎の周期性を仮定して全てのパルスをピッ
チ予測により求めているので、特に周期性の少ないパル
スに対してはピッチ予測によりかえって特性が悪化する
という問題点があった。特にこのことは、母音同士の遷
移区間や過渡部において顕著であり、このような部分で
音質が劣化するという問題点があった。However, although the pitch-wise quasi-periodicity of a multipulse sound source is considered to be large for large-amplitude pulses, such periodicity does not exist for all pulses, and small-amplitude pulses have pitch-wise periodicity. is considered to be small. In the pitch prediction multi-pulse method of Document 2, all pulses are determined by pitch prediction assuming periodicity for each pitch for a predetermined number of pulses within a frame, so all pulses are determined by pitch prediction. For pulses, there is a problem in that pitch prediction actually worsens the characteristics. This is particularly noticeable in transition sections and transitional parts between vowels, and there is a problem in that the sound quality deteriorates in such parts.

さらに、前記文献２の方法では、ピッチ情報をインパル
ス応答に含ませているため非常に時間長の長いインパル
ス応答（例えば２０ｍ５ｅｃ以上）を必要とし、予め定
められた個数の全てのパルスをピッチ予測により求めて
いるので、パルスの探索に要する演算量は非常に多く、
現在のＬＳＩ技術をもってしても装置をコンパクトに実
現することは円錐であった。Furthermore, in the method of Document 2, since pitch information is included in the impulse response, a very long impulse response (for example, 20 m5ec or more) is required, and all pulses of a predetermined number are determined by pitch prediction. Since we are searching for a pulse, the amount of calculation required to search for a pulse is extremely large.
Even with current LSI technology, it has been difficult to realize a compact device.

本発明の目的は、ビットレートが高いところでも、下げ
ていっても従来よりも良好な音声を再生することが可能
で、すくない演算量で実現可能な音声符号化復号化方式
を提供することにある。The purpose of the present invention is to provide an audio encoding/decoding method that can reproduce better audio than before even when the bit rate is high or lower, and that can be realized with a small amount of calculation. be.

（課題を解決するための手段）本発明の音声符号化復号化方式は、送信側では離散的な
音声信号を入力し前記音声信号からフレーム毎にスペク
トル包絡を表すスペクトルパラメータとピッチ周期を表
すピッチパラメータとを抽出し、前記フレームの音声信
号を前記ピッチパラメータに応じた小区間に分割し、前
記小区間のうちの１つの区間の音声信号に対して前記ピ
ッチパラメータと前記スペクトルパラメータを用いて第
１のマルチパルスを求め、他の区間では前記マルチパル
スを補正する係数を求め、前記マルチパルスと前記係数
により求めた信号を前記音声信号から除去して得られる
信号に対してスペクトルパラメータを用いて第２のマル
チパルスを求め、受信側では前記第１のマルチパルスと
前記ピッチパラメータと前記係数と前記第２のマルチパ
ルスを用いて音源信号を復元し、さらに前記スペクトル
パラメータを用いて構成される合成フィルタを駆動して
合成音声信号を求めることを特徴とする。(Means for Solving the Problem) The audio encoding/decoding method of the present invention inputs a discrete audio signal on the transmitting side, and extracts a spectral parameter representing a spectral envelope and a pitch representing a pitch period from the audio signal for each frame. The audio signal of the frame is divided into small sections according to the pitch parameter, and the audio signal of one section of the small sections is extracted using the pitch parameter and the spectrum parameter. 1 multi-pulse is obtained, coefficients for correcting the multi-pulse are obtained in other sections, and the signal obtained by removing the multi-pulse and the coefficient from the audio signal is obtained using spectral parameters. A second multi-pulse is obtained, and on the receiving side, a sound source signal is restored using the first multi-pulse, the pitch parameter, the coefficient, and the second multi-pulse, and further configured using the spectral parameter. It is characterized by driving a synthesis filter to obtain a synthesized speech signal.

また本発明による音声符号化方式は、送信側では離散的
な音声信号を入力し前記音声信号からフレーム毎にスペ
クトル包絡を表すスペクトルパラメータとピッチ周期を
表すピッチパラメータを抽出し、前記フレームの音声信
号を前記ピッチパラメータに応じた小区間に分割し、前
記音声信号の音源信号として前記小区間のうち１つの区
間において前記ピッチパラメータと前記スペクトルパラ
メータを用いて第１のマルチパルスを求め、他の区間で
は前記マルチパルスを補正する係数を求め、前記マルチ
パルスと前記係数により求めた信号を前記音声信号から
除去して得られる信号に対して前記スペクトルパラメー
タを用いて第２のマルチパルスを求めて得られるマルチ
パルス音源か、予め定められた種類の雑音信号から構成
される符号帳から前記音声信号と合成信号との誤差電力
を小さくするように選択した雑音信号を用いて表し、受
信側では前記第１のマルチパルスと前記ピッチパラメー
タと前記係数と前記第２のマルチパルスを用いて音源信
号を復元するか、前記選択した雑音信号を用いて音源信
号を復元し、前記スペクトルパラメータを用いて構成さ
れる合成フィルタを前記音源信号により駆動して合成音
声信号を求めることを特徴とする。Further, in the audio encoding method according to the present invention, a discrete audio signal is input on the transmitting side, and a spectral parameter representing a spectral envelope and a pitch parameter representing a pitch period are extracted from the audio signal for each frame, and the audio signal of the frame is is divided into small sections according to the pitch parameter, a first multi-pulse is obtained as a sound source signal of the audio signal using the pitch parameter and the spectrum parameter in one section among the small sections, and the first multi-pulse is obtained in the other sections. Then, a coefficient for correcting the multi-pulse is obtained, and a second multi-pulse is obtained by using the spectral parameter for the signal obtained by removing the multi-pulse and the signal obtained by the coefficient from the audio signal. The receiver side uses a multi-pulse sound source selected from a codebook consisting of predetermined types of noise signals to reduce the error power between the speech signal and the synthesized signal. 1 multi-pulse, the pitch parameter, the coefficient and the second multi-pulse, or the selected noise signal is used to restore the sound source signal and the spectral parameter is used to reconstruct the sound source signal. The synthesized speech signal is obtained by driving a synthesis filter according to the sound source signal.

（作用）第１の発明による音声符号化復号化方式は、フレーム区
間（例えば２０ｍ５）の音声信号の音源信号を、有音区
間ではフレームを分割した小区間において、ピッチ補間
により求めたマルチパルス（第１のマルチパルス）と、
フレーム全体においてピッチ予測無しで求めたマルチパ
ルス（第２のマルチパルス）とを用いて表すことを特徴
としている。前記第１のマルチパルスの計算は次のよう
に行う。マルチパルス音源のピッチ毎の単周期性を・非
常に効率よく利用すると共に演算量を大きく低減するた
めに、フレームをあらかじめピッチ周期に応じた小区間
（サブフレーム）に分割し、前記サブフレームのうちの
１つのサブフレーム（代表区間）についてのみマルチパ
ルスを求める。他のサブフレームについては前記代表区
間で求めたマルチパルスのゲインと位相を補正する補正
係数を求め、この係数を用いて他のサブフレームにおい
て、前記代表区間のマルチパルスのゲインと位相を補正
してパルスを発生させ、フレーム全体のパルスを復元す
る。そして前記パルスによりフレームで信号を再生して
前記音声信号から前記信号を減算した後に、前記フレー
ムにおいて前記文献１と同様の方法により、マルチパル
ス（第２のマルチパルス）を求めるわけである。(Operation) The audio encoding/decoding method according to the first invention converts the sound source signal of an audio signal in a frame section (for example, 20 m5) into a multi-pulse ( first multi-pulse);
It is characterized by representing the entire frame using a multi-pulse (second multi-pulse) obtained without pitch prediction. The calculation of the first multi-pulse is performed as follows. In order to utilize the monoperiodic nature of each pitch of a multipulse sound source very efficiently and to greatly reduce the amount of calculation, a frame is divided in advance into small sections (subframes) according to the pitch period, and each of the subframes is Multipulses are obtained only for one subframe (representative section). For other subframes, find a correction coefficient that corrects the gain and phase of the multipulse found in the representative section, and use this coefficient to correct the gain and phase of the multipulse in the representative section in other subframes. to generate a pulse and restore the pulse for the entire frame. Then, after reproducing a signal in frames using the pulses and subtracting the signals from the audio signal, a multipulse (second multipulse) is obtained in the frame using the same method as in Document 1.

以下で本方式の基本的な処理を第３図を用いて説明する
。第３図は、本発明の作用を示すブロック図である。入
力端子１００から音声信号を入力し、前記音声信号を予
め定められた時間長の（例えば２０ｍ５）フレームに分
割する。ＬＰＧ、ピッチ分析部１５０はフレームの音声
信号からスペクトル包絡を表すスペクトルパラメータと
して、予め定められた次数のＬＰＧ係数を衆知のＬＰＣ
分析によゆもとめる。ＬＰＧ係数としては、ここで用い
る線形予測係数ａ、の他にＬＳＰ、ホルマント、ＬＰＣ
ケプストラムなどの他の良好なパラメータを用いること
もできる。また、ＬＰＣ以外の分析法、例えばケプスト
ラムやＰＳＥ、ＡＲＭＡ法などを用いることもできる。The basic processing of this method will be explained below using FIG. FIG. 3 is a block diagram showing the operation of the present invention. An audio signal is input from an input terminal 100, and the audio signal is divided into frames of a predetermined time length (for example, 20 m5). The LPG and pitch analysis unit 150 converts the LPG coefficients of a predetermined order into well-known LPC signals from the frame audio signal as spectral parameters representing the spectral envelope.
Stop for analysis. In addition to the linear prediction coefficient a used here, LPG coefficients include LSP, formant, and LPC.
Other good parameters such as cepstrum can also be used. Furthermore, analysis methods other than LPC, such as cepstrum, PSE, and ARMA methods, can also be used.

以下では線形予測係数を用いるものとして説明を行う。The following explanation assumes that linear prediction coefficients are used.

また１５０は、フレームの音声からピッチパラメータと
してピッチ周期Ｍを計算する。これには衆知の自己相関
法を用いることができる。Further, 150 calculates a pitch period M as a pitch parameter from the audio of the frame. The well-known autocorrelation method can be used for this.

ピッチ補間マルチパルス計算部２５０及びマルチパルス
計算部２７０の動作を第４図を引用して説明する。第４
図（ａ）はフレームの音声信号を表す。ここでは−例と
してフレーム長を２０ｍ５としている。ピッチ補間マル
チパルス計算部２５０では、まず、（ｂ）のように、フ
レームをピッチ周期Ｍを用いて小区間（サブフレーム）
に分割する。ここではサブフレームの長さはピッチ周期
Ｍと同一としている。The operations of the pitch interpolation multipulse calculation section 250 and the multipulse calculation section 270 will be explained with reference to FIG. Fourth
Figure (a) represents the audio signal of a frame. Here, as an example, the frame length is 20 m5. The pitch interpolation multipulse calculation unit 250 first divides the frame into small sections (subframes) using the pitch period M, as shown in (b).
Divide into. Here, the length of the subframe is the same as the pitch period M.

次に、前記文献１と同一の方法により、前記線形予測係
数から構成される合成フィルタのインパルス応答ｈ（ｎ
）の自己相関関数”ｈｈ（ｍ）、聴感重みすけ音声信号
と前記インパルス応答ｈ（ｎ）との相互相関関数ｏｈｘ
（ｍ）を求める。次に、前記サブフレームのうちの予め
定められた１つの区間（以下、代表区間と呼ぶ。ここで
は例えば第４図（ｂ）の区間■）についてのみ、予め定
められた個数Ｋ（ここでは４としている）のマルチパル
ス（第１のマルチパルス）の振幅ｇ４、位置ｍ。Next, using the same method as in Document 1, the impulse response h(n
), the autocorrelation function ``hh(m)'', the cross-correlation function ohx between the perceptually weighted speech signal and the impulse response h(n)
Find (m). Next, a predetermined number K (here, 4 amplitude g4 and position m of the multi-pulse (first multi-pulse).

を求める。ここでマルチパルスの求め方は前記文献１を
参照できる。第４図（Ｃ）は求めたマルチパルスを示す
。次に、代表区間以外のサブフレームでは、代表区間で
求めたマルチパルスのゲイン、位相を補正してパルスを
発生するためのゲイン補正係数、位相補正係数を求める
。フレーム内のｊ番目のサブフレームにおけるゲイン補
正係数Ｃ１、位相補」正係数ｄ、は次式の誤差電力を最小化するように求める
。seek. Here, reference can be made to the above-mentioned document 1 for how to obtain the multi-pulse. FIG. 4(C) shows the obtained multipulse. Next, in subframes other than the representative section, gain correction coefficients and phase correction coefficients for generating pulses by correcting the gain and phase of the multi-pulse obtained in the representative section are determined. The gain correction coefficient C1 and phase correction positive coefficient d in the j-th subframe within the frame are determined to minimize the error power according to the following equation.

Ｅ＝Ｅ［（ｘｔ（ｎ）　−ｙ（ｎ））＊ｗ（ｎ）］　　
　　　　　　　　　　　（１）ここでｘ、（ｎ）、ｓ、
（ｎ）はｊ番目のサブフレームにおけ」　　　　　」る音７ｊｇ信号、マルチパルスのゲイン、位相を補正し
て求めた合成音声をそれぞれ示す。ただしＳ、（、ｎ）
＝ｃｉｉｇ、ｇ、・ｈ（ｎ−ｍ、７Ｌ−Ｍ−ｄ、）　　
（Ｌは整数）（２）ここでｈ（ｎ）ｌよ合成フィルタの
インパルス応答である。（２）式を（１）式に代入して
Ｃ０で偏微分してＯとおくことにより、（１）式を最小
化するＣ５、ｄ、を求める事ができる。詳細は特願昭６
３−２０８２０１号明細書（文献３）等を参照できる。E=E[(xt(n) −y(n))*w(n)]
(1) Here x, (n), s,
(n) shows the synthesized speech obtained by correcting the sound 7jg signal, multi-pulse gain, and phase in the j-th subframe. However, S, (, n)
=ciig, g, ・h (n-m, 7L-M-d,)
(L is an integer) (2) where h(n)l is the impulse response of the synthesis filter. By substituting equation (2) into equation (1), partially differentiating it with respect to C0, and setting it as O, C5,d, which minimizes equation (1), can be found. For details, please see the special request
Reference can be made to the specification of No. 3-208201 (Document 3).

このようにして基本的にはフレーム内の他のサブフレー
ム区間すべてについてゲイン補正係数、位相補正係数を
求める。そして代表区間のマルチパルスとゲイン補正数
、位相補正係数を用いて第４図（ｄ）のようにフレーム
全体のパルスを再生する。なお、代表区間のフレーム内
位置は、いくつかのサブフレームを探索して決定しても
よいし、あらかじめ決めておいてもよい。前者の方法の
詳細は例えば前記文献３等を参照できる。In this way, gain correction coefficients and phase correction coefficients are basically obtained for all other subframe sections within the frame. Then, using the multi-pulse of the representative section, the gain correction number, and the phase correction coefficient, the pulse of the entire frame is reproduced as shown in FIG. 4(d). Note that the intra-frame position of the representative section may be determined by searching several subframes, or may be determined in advance. For details of the former method, reference can be made to, for example, the above-mentioned document 3.

次に、再生したパルスｖ（ｎ）を用いて（３）式で定義
される合成フィルタを駆動して再生信号ｘ’（ｎ）を得
る。Next, a synthesis filter defined by equation (3) is driven using the reproduced pulse v(n) to obtain a reproduced signal x'(n).

ｘ’（ｎ）＝ｖ（ｎ）＋　、ｆｉ　ａｉｘ’（ｎ−ｉ）
　　　　　　　（３）ｓ＝１ここでａ、は線形予測係数である。x'(n)=v(n)+, fi aix'(ni)
(3) s=1 where a is a linear prediction coefficient.

減算器２６０は次式にしたがい音声信号ｘ（ｎ）からＸ
・（ｎ）を減算してｅ（ｎ）を得る。The subtracter 260 converts the audio signal x(n) to X according to the following equation:
- Subtract (n) to obtain e(n).

ｅ（ｎ）　＝ｘ（ｎ）　−ｘ’（ｎ）　　　　　　　　
　　　　　　（４）次に、マルチパルス計算部２７０は
ｅ（ｎ）に対して、前記文献１と同一の方法を用いてｅ
（ｎ）に聴感重み付けをした信号と合成フィルタの重み
ずけインパルス応答との相互相関関数と、前記重みすけ
インパルス応答の自己相関関数を用いて、フレーム内で
予め定められた個数Ｑのマルチパルス（第２のマルチパ
ルス）を求める。これを第４図（ｅ）に示す。図ではＱ
を４としている。e(n) = x(n) −x'(n)
(4) Next, the multipulse calculation unit 270 calculates e(n) using the same method as in Document 1.
A predetermined number Q of multi-pulses are generated within a frame using a cross-correlation function between the perceptually weighted signal (n) and the weighted impulse response of the synthesis filter, and an autocorrelation function of the weighted impulse response. (second multipulse). This is shown in FIG. 4(e). In the diagram, Q
is set as 4.

一方、無声フレームでは、フレーム全体に対してマルチ
パルスの振幅、位置を求める。On the other hand, for an unvoiced frame, the amplitude and position of multipulses are determined for the entire frame.

送信側の伝送情報は、合成フィルタのスペクトルパラメ
ータの他に、有声フレームでは、スペクトル包絡を表す
スペクトルパラメータａ、、ピッチＭ、代表区間のに個
のマルチパルスの振幅と位置、ゲイン補正係数、位相補
正係数、代表区間のフレーム内位置、Ｑ個のマルチパル
スの振幅と位置である。また、無声フレームでは、マル
チパルスの振幅、位置を伝送する。In addition to the spectral parameters of the synthesis filter, the transmission information on the transmitting side includes, in voiced frames, the spectral parameter a representing the spectral envelope, the pitch M, the amplitude and position of the multipulses in the representative section, the gain correction coefficient, and the phase. These are the correction coefficient, the position within the frame of the representative section, and the amplitude and position of the Q multipulses. Furthermore, in the unvoiced frame, the amplitude and position of the multipulse are transmitted.

第２の発明では、有声フレームでは第１の発明と同じ動
作をするが、無声フレームではマルチバルスではなくて
、予め定められた種類の雑音信号からなる符号帳から一
種類を選択した雑音信号を用いて音源信号を表すことを
特徴とする。雑音信号としては、例えばガウス性の統計
分布を有する乱数を用いることができる。雑音信号の時
間方向の長さ（次元数）は通常フレームよりも短い長さ
（例えば５〜１０ｍ５）とする。また雑音信号の種類は
２種類とする。このような符号帳から入力音声に対して
最もよい雑音信号を選択する方法としては、雑音信号を
用いて合成フィルタを駆動して音声を合成して原音声と
の誤差電力を求め、誤差電力を最小化する雑音信号を選
択する方法が知られている。この方法の詳細は、例えば
５ｃｈｒｏｅｄｅｒ、　Ａｔａ１氏による”Ｃｏｄｅ−
ｅｘｃｉｔｅｄ　１ｉｎｅａｒ　ｐｒｅｄｉｃｔｉｏｎ
　（ＣＥＬＰ）　：　Ｈｉｇｈｑｕａｌｉｔｙ　５ｐｅ
ｅｃｈ　ａｔ　ｖｅｒｙ　ｌｏｗ　ｂｉｔ　ｒａｔｅｓ
”と題した論文（Ｐｒｏｃ、　ＩＣＡＳＳＰ、　ｐｐ、
　９３７−９４０．１９８５Ｘ文献４）等を参照するこ
とができる。In the second invention, the same operation as in the first invention is performed in voiced frames, but in unvoiced frames, a noise signal selected from a codebook of predetermined types of noise signals is used instead of a multi-pulse. It is characterized in that it represents a sound source signal using As the noise signal, for example, a random number having a Gaussian statistical distribution can be used. The length (number of dimensions) of the noise signal in the time direction is shorter than the normal frame (for example, 5 to 10 m5). Furthermore, there are two types of noise signals. The method of selecting the best noise signal for input speech from such a codebook is to use the noise signal to drive a synthesis filter to synthesize speech, find the error power with the original speech, and calculate the error power. Methods of selecting noise signals to be minimized are known. Details of this method can be found, for example, in “Code-
Excited 1inear prediction
(CELP): Highquality 5pe
ech at very low bit rates
” (Proc, ICASSP, pp.
937-940.1985X document 4), etc. can be referred to.

無声フレームでは、選択された雑音信号を示すインデッ
クス、ゲイン、ピッチ再生フィルタのピッチゲイン、ピ
ッチ周期、合成フィルタのスペクトルパラメータを受信
側へ伝送する。In the unvoiced frame, an index indicating the selected noise signal, a gain, a pitch gain of a pitch recovery filter, a pitch period, and a spectrum parameter of a synthesis filter are transmitted to the receiving side.

（実施例）第１の発明の一実施例を示す第１図において、入力端子
５００から離散的な音声信号ｘ（ｎ）を入力する。(Embodiment) In FIG. 1 showing an embodiment of the first invention, a discrete audio signal x(n) is input from an input terminal 500.

スペクト・ル、ピッチパラメータ計算回路５２０では分
割したフレーム区間（例えば２０ｍ５）の音声信号スペ
クトル包絡を表す合成フィルタのスペクトルパラメータ
ａｉを、衆知のＬＰＣ分析法によって求める。また、ピ
ッチ周期Ｍを衆知の自己相関法により求める。The spectral and pitch parameter calculation circuit 520 calculates the spectral parameter ai of the synthesis filter representing the audio signal spectral envelope of the divided frame section (for example, 20 m5) using the well-known LPC analysis method. Further, the pitch period M is determined by the well-known autocorrelation method.

求められたスペクトルパラメータ及びピッチ周期に対し
て、量子化器５２５において量子化を行う。A quantizer 525 performs quantization on the obtained spectrum parameters and pitch periods.

量子化の方法は、特願昭５９−２７２４３５号明Ｍｌ書
（文献５）に示されているようなスカラー量子化や、あ
るいはベクトル量子化を行ってもよい。ベクトル量子化
の具体的な方法については、例えば、Ｍａｋｈｏｕ１氏
らによる“Ｖｅｃｔｏｒ　ｑｕａｎｔｉｚａｔｉｏｎ　
ｉｎ　５ｐｅｅｃｈ　ｃｏｄｉｎｇ”（Ｐｒｏｃ、　Ｉ
ＥＥＥ、　ｐｐ、　１５５１−１５５８．１９８５Ｘ文
献６）などの論文を参照できる。The quantization method may be scalar quantization as shown in Japanese Patent Application No. 59-272435 (Reference 5), or vector quantization. For a specific method of vector quantization, see, for example, “Vector quantization” by Makhou et al.
in 5peech coding” (Proc, I
You can refer to papers such as EEE, pp. 1551-1558.1985X Reference 6).

逆量子化器５３０は、量子化した結果を用いて逆量子化
して出力する。The dequantizer 530 dequantizes and outputs the quantized result.

減算器５３５はフレームの音声信号から影響信号を減算
して出力する。A subtracter 535 subtracts the influence signal from the audio signal of the frame and outputs the result.

重み付は回路５４０は、音声信号と逆量子化されたスペ
クトルパラメータを用いて前記信号に聴感重み付けを行
う。重み付けの方法は、前記文献２の重み付は回路２０
０を参照することができる。Weighting circuit 540 perceptually weights the audio signal using the dequantized spectral parameters. The weighting method of the above-mentioned document 2 is based on the weighting circuit 20.
0 can be referenced.

インパルス応答計算回路５５０は、逆量子化されたスペ
クトルパラメータａ１．を用いて聴感重みずけをした合
成フィルタのインパルス応答ｈ（ｎ）を計算する。The impulse response calculation circuit 550 calculates the dequantized spectrum parameters a1. The impulse response h(n) of the synthesis filter subjected to auditory weighting is calculated using .

具体的な方法は前記文献２のインパルス応答計算回路を
参照できる。For a specific method, refer to the impulse response calculation circuit in Document 2.

自己相関関数計算回路５６０は前記インパルス応答に対
して自己相関関数”ｈｈ（ｍ）を計算し、それぞれ音源
パルス計算回路５８０とパルス計算回路５８６へ出力す
る。自己相関関数の計算法は前記文献２の自己相関関数
計算回路１８０を参照することができる。The autocorrelation function calculation circuit 560 calculates the autocorrelation function "hh(m)" for the impulse response, and outputs it to the sound source pulse calculation circuit 580 and the pulse calculation circuit 586, respectively.The method for calculating the autocorrelation function is described in the above-mentioned document 2. The autocorrelation function calculation circuit 180 of FIG.

相互相関関数計算回路５７０は前記聴感重み付けられた
信号と、前記インパルス応答ｈ（ｎ）との相互相関関数
Φ、、（ｍ）を計算する。A cross-correlation function calculation circuit 570 calculates a cross-correlation function Φ, , (m) between the perceptually weighted signal and the impulse response h(n).

音源パルス計算回路５８０では、まず、フレームを逆量
子化したピッチ周期Ｍ′を用いて前記第４図（ｂ）のよ
うにサブフレーム区間に分割する。そして予め定められ
た１つのサブフレーム区間（代表区間）（例えば第４図
（ｂ）のサブフレーム■）について、Φ、、（ｍ）とＲ
，、（ｍ）とを用いてに個のマルチパルス列（第１のマ
ルチパルス）の振幅ｇ該位置ｍｉを求める。パルス列の
計算方法については、前記文献２の音源パルス計算回路
を参照することができる。The sound source pulse calculation circuit 580 first divides the frame into subframe sections as shown in FIG. 4(b) using the inversely quantized pitch period M'. Then, for one predetermined subframe section (representative section) (for example, subframe ■ in Fig. 4(b)), Φ, , (m) and R
, , (m) to find the amplitude g of the multi-pulse train (first multi-pulse) at the position mi. Regarding the pulse train calculation method, reference can be made to the sound source pulse calculation circuit in Document 2.

補正係数計算回路５８３では作用の項で示した（１）。The correction coefficient calculation circuit 583 is shown in the function section (1).

（２）式に従い、代表区間以外のサブフレーム区間にお
いてゲイン補正係数Ｃ１、位相補正係数ｄ、を計算しＪ
　　　　　　　　　　　　　　　　」て出力する。According to equation (2), calculate the gain correction coefficient C1 and the phase correction coefficient d in subframe sections other than the representative section.
” is output.

量子化器５８５は、前記マルチパルス列の振幅と位置を
量子化して符号を出力する。具体的な方法は前記文献１
．２などを参照できる。またゲイン補正係数、位相補正
係数、代表区間のフレーム内位置を量子化して符号を出
力する。具体的な方法は例えば前記文献３などを参照で
きる。これらの出力はさらに逆量子化され、ピッチ補間
回路６０５に出力され第４図（ｄ）のようにフレーム全
体のパルスが復元される。A quantizer 585 quantizes the amplitude and position of the multi-pulse train and outputs a code. The specific method is in the above document 1.
．． 2 etc. can be referred to. It also quantizes the gain correction coefficient, phase correction coefficient, and position within the frame of the representative section, and outputs a code. For a specific method, reference can be made to the above-mentioned document 3, for example. These outputs are further dequantized and output to a pitch interpolation circuit 605 to restore the pulses of the entire frame as shown in FIG. 4(d).

前記復元されたパルスは、合成フィルタ６１０に通すこ
とによって、前記（３）式に従い合成音声信号ｘ’（ｎ
）が求まる。The restored pulse is passed through a synthesis filter 610 to produce a synthesized speech signal x'(n
) can be found.

減算器６１５は、前記音声信号ｘ（ｎ）から合成音声信
号ｘ’（ｎ）を（４）式に従い減することによって、残
差信号ｅ（ｎ）を得る。The subtracter 615 obtains a residual signal e(n) by subtracting the synthesized speech signal x'(n) from the speech signal x(n) according to equation (4).

重み付は回路６００は前記残差信号に対して聴感重みず
けを行う。The weighting circuit 600 performs perceptual weighting on the residual signal.

相互相関関数計算回路６０３は重み付は回路６００の出
力と前記インパルス応答ｈ（ｎ）との相互相関関数を計
算する。A cross-correlation function calculation circuit 603 calculates a cross-correlation function between the output of the weighting circuit 600 and the impulse response h(n).

パルス計算回路５８６では、前記相互相関関数とインパ
ルス応答ｈ（ｎ）の自己相関関数を用いて、予め定めら
れた個数のマルチパルス（第２のマルチパルス）の振幅
と位置を求める。The pulse calculation circuit 586 uses the cross-correlation function and the autocorrelation function of the impulse response h(n) to find the amplitude and position of a predetermined number of multipulses (second multipulse).

量子化器６２０は前記マルチパルスの振幅、位置を量子
化して出力するとともに、これらを逆量子化して合成フ
ィルタ６２５へ出力する。The quantizer 620 quantizes and outputs the amplitude and position of the multi-pulse, and also dequantizes and outputs them to the synthesis filter 625.

合成フィルタ６２５は残差信号を合成して出力する。A synthesis filter 625 synthesizes and outputs the residual signals.

加算器６２７は合成フィルタ６２５と合成フィルタ６１
０の出力を加算してフレームの再生信号を求め、さらに
次フレームに対する影響信号をもとめて出力する。影響
信号計算の具体的な方法は前記文献２を参照できる。The adder 627 includes the synthesis filter 625 and the synthesis filter 61.
The reproduced signal of the frame is obtained by adding the outputs of 0, and the influence signal for the next frame is also determined and output. For a specific method of calculating the influence signal, refer to the above-mentioned document 2.

マルチプレクサ６３５は、量子化器５８５，６２０の出
力であるマルチパルス列の振幅、位置、補正係数、代表
区間の位置を表す符号、パラメータ量子化器５２５の出
力であるスペクトルパラメータ、ピッチ周期を表す符号
を組み合せて出力する。The multiplexer 635 receives the amplitude, position, correction coefficient, and code representing the position of the representative section of the multi-pulse train output from the quantizers 585 and 620, the spectrum parameter output from the parameter quantizer 525, and the code representing the pitch period. Combine and output.

一方、受信側では、デマルチプレクサ７１０は、ピッチ
補間マルチパルス（第１のマルチパルス）の振幅、位置
、補正係数、代表区間の位置を表す符号、マルチパルス
（第２のマルチパルス）の振幅、位置を表す符号、スペ
クトルパラメータ、ピッチ周期を表す符号を分離して出
力する。On the other hand, on the receiving side, the demultiplexer 710 outputs the amplitude, position, and correction coefficient of the pitch interpolation multipulse (first multipulse), a code representing the position of the representative section, the amplitude of the multipulse (second multipulse), The code representing the position, the spectrum parameter, and the code representing the pitch period are separated and output.

第１のパルス復号器７２０はピッチ補間マルチパルスの
振幅、位置を復号する。第２のパルス復号器７２５は第
２のマルチパルスの振幅、位置を復号する。パラメータ
復号器７５０は、送信側の逆量子化器５３０と同じ働き
をして、スペクトルパラメータａ”１、ピッチ周期Ｍ′
を復号して出力する。The first pulse decoder 720 decodes the amplitude and position of the pitch interpolated multi-pulse. A second pulse decoder 725 decodes the amplitude and position of the second multi-pulse. The parameter decoder 750 has the same function as the inverse quantizer 530 on the transmitting side, and has a spectral parameter a''1 and a pitch period M'.
Decode and output.

ピッチ補間回路７２６は、送信側のピッチ補間回路６０
５と同一の動作を行う。The pitch interpolation circuit 726 is the pitch interpolation circuit 60 on the transmission side.
Perform the same operation as 5.

パルス発生器７２７は前記第２のマルチパルスによる音
源信号をフレーム長だけ発生させる。The pulse generator 727 generates a sound source signal based on the second multi-pulse for a frame length.

加算器７４０はパルス発生器７２７とピッチ補間回路７
２６の出力信号を加算してフレームの駆動音源信号を求
め、合成フィルタ回路７６０を駆動する。Adder 740 includes pulse generator 727 and pitch interpolation circuit 7
26 output signals are added to obtain a frame driving sound source signal, and the synthesis filter circuit 760 is driven.

合成フィルタ回路７６０は、前記駆動音源信号及び前記
復号されたスペクトルパラメータを用いて、フレーム毎
に合成音声波形を求めて出力する。The synthesis filter circuit 760 uses the driving sound source signal and the decoded spectral parameters to obtain and output a synthesized speech waveform for each frame.

以上で第１の発明の一実施例の説明を終える。This concludes the description of one embodiment of the first invention.

第２図は第２の発明の一実施例を示すブロック図である
。図において第１図と同一の番号を付した構成要素は、
第１図と同一の動作を行うので説明は省略する。FIG. 2 is a block diagram showing an embodiment of the second invention. In the figure, the components numbered the same as in Figure 1 are as follows:
Since the operation is the same as in FIG. 1, the explanation will be omitted.

図において、スペクトル、ピッチパラメータ計算回路５
２２はスペクトルパラメータａを衆知のＬＰＣ分析を用
いて求め、ピッチパラメータとしてピッチ周期Ｍ、ピッ
チゲインｂを衆知の自己相関法を用いて求める。In the figure, spectrum and pitch parameter calculation circuit 5
22, a spectral parameter a is determined using a well-known LPC analysis, and pitch parameters such as a pitch period M and a pitch gain b are determined using a well-known autocorrelation method.

量子化器５２２は、スペクトルパラメータａをＰＡＲＣ
ＯＲ係数あるいはＬＳＰ係数に変換した後に量子化する
。ここではＰＡＲＣＯＲ係数を用いる。またピンチ周期
Ｍ、ピッチゲインｂを量子化する。またこれらの量子化
値を復号化して復号値ａ、ｔ、Ｍ′、ｂ′を出力する。The quantizer 522 converts the spectral parameter a into PARC
After converting into OR coefficients or LSP coefficients, quantization is performed. Here, PARCOR coefficients are used. Also, the pinch period M and pitch gain b are quantized. Furthermore, these quantized values are decoded to output decoded values a, t, M', and b'.

Ｂ　　　　。B.

コードブック８００は、２　（Ｂはヒツト数を示す）種
類の雑音信号をあらかじめ格納している。雑音信号の発
生の方法は前記文献４を参照できる。このうちから一種
類ずつたたみこみ回路８１０へ出力する。The codebook 800 stores two types of noise signals (B indicates the number of hits) in advance. For the method of generating the noise signal, refer to the above-mentioned document 4. Of these, one type is output to the convolution circuit 810.

畳み込み回路８１０は、一種類の雑音信号ｃ（ｎ）と前
記インパルス応答ｈ（ｎ）を次式に従いたたみこみ、結
果をスイッチ８２０に出力する。The convolution circuit 810 convolves one type of noise signal c(n) and the impulse response h(n) according to the following equation, and outputs the result to the switch 820.

ｆ（ｎ）＝ｃ（ｎ）＊ｈ（ｎ）　　　　　　　　　　　
　　　（５）ここで記号＊は畳み込み和を表す。f(n)=c(n)*h(n)
(5) Here, the symbol * represents a convolution sum.

スイッチ８２０は有声フレームではインパルス応答計算
回路５５０の出力を相関関数計算回路５６０へ出力し、
無声フレームでは畳み込み回路８１０の出力を自己相関
関数計算回路５６０へ出力する。ここで有声、無声の判
別は例えば、復号化したピンチゲインｂ′の値が予めか
ためられたしきい値を越えたときは有声、そうでないと
きは無声と判別することができる。The switch 820 outputs the output of the impulse response calculation circuit 550 to the correlation function calculation circuit 560 in a voiced frame,
For unvoiced frames, the output of the convolution circuit 810 is output to the autocorrelation function calculation circuit 560. Here, voiced or unvoiced can be determined, for example, when the value of the decoded pinch gain b' exceeds a preset threshold, it is determined that there is voice, and otherwise, it is determined that voice is unvoiced.

スイッチ８２５は自己相関関数計算回路５６０の出力を
、有声フレームでは音源パルス計算回路５８０へ出力し
、無声フレームでは信号選択回路８３０へ出力する。The switch 825 outputs the output of the autocorrelation function calculation circuit 560 to the excitation pulse calculation circuit 580 in a voiced frame, and to the signal selection circuit 830 in an unvoiced frame.

信号選択回路８３０は相互相関関数Φｘｈと自己相関関
数Ｒｈｈとを用いて次式の計算を行う。The signal selection circuit 830 uses the cross-correlation function Φxh and the autocorrelation function Rhh to calculate the following equation.

Ｇ＝（ΦＸｈ）／Ｒｈｈ（６）（６）式の計算を全ての雑音信号に対して行い、（６）
式を最大化する雑音信号を選択し、選択された雑音信号
を表すインデックスと（６）式で求めたゲインＧを出力
する。G=(ΦXh)/Rhh(6) Calculate equation (6) for all noise signals, and (6)
The noise signal that maximizes the equation is selected, and the index representing the selected noise signal and the gain G obtained from equation (6) are output.

符号器８４０は、ゲインＧを予め定められたビット数で
量子化しマルチプレクサ６３５へ出力する。また量子化
値を復号化してピッチ再生フィルタ８５０へ出力する。Encoder 840 quantizes gain G using a predetermined number of bits and outputs it to multiplexer 635. It also decodes the quantized value and outputs it to the pitch recovery filter 850.

ピッチ再生フィルタ８５０は次式に従い音源信号ｖ（ｎ
）を求めて出力する。The pitch recovery filter 850 receives the sound source signal v(n
) and output it.

Ｖ（ｎ）＝ｃ（ｎ）＋ｂ’・ｖ（ｎ−Ｍ）　　　　　　
　　　　（７）ここでｃ（ｎ）は選択された雑音信号で
ある。V(n)=c(n)+b'・v(n-M)
(7) where c(n) is the selected noise signal.

合成フィルタ８６０はｖ（ｎ）を人力して合成音声を求
めて出力する。The synthesis filter 860 manually calculates v(n) to obtain and output synthesized speech.

スイッチ８６５は、減算器５３５に対して有声フレーム
では加算器６２７の出力を出力し、無声フレームでは合
成フィルタ８６０の出力を出力する。Switch 865 outputs the output of adder 627 to subtracter 535 in voiced frames, and outputs the output of synthesis filter 860 in unvoiced frames.

受信側では、復号回路８７５は、雑音信号のゲイン、イ
ンデックスを復号する。On the receiving side, a decoding circuit 875 decodes the gain and index of the noise signal.

パラメータ復号回路８７０は、ピッチゲインｂ′、ピッ
チ周期Ｍ’、スペクトルパラメータａ、ｌを復号する。Parameter decoding circuit 870 decodes pitch gain b', pitch period M', and spectral parameters a and l.

ピッチ再生フィルタ８８０は、送信側のピッチ再生フィ
ルタ８５０と同一の動作を行ない、無声フレームにおけ
る音源信号を復号する。The pitch recovery filter 880 performs the same operation as the pitch recovery filter 850 on the transmission side, and decodes the sound source signal in the unvoiced frame.

スイッチ８７０は有声フレームと無声フレームで音源信
号を切り替える。A switch 870 switches the sound source signal between voiced frames and unvoiced frames.

以上で第２の発明の一実ｈＩ＆例の説明を終了する。This concludes the explanation of one example of the second invention.

以上述べた構成は本発明の一実施例に過ぎず、種々の変
形も可能である。The configuration described above is only one embodiment of the present invention, and various modifications are possible.

マルチパルスの計算方法としては、前記文献１に示した
方法の他に、種々の衆知な方法を用いることができる。As a method for calculating multi-pulses, in addition to the method shown in Document 1, various well-known methods can be used.

これには、例えば、Ｏｚａｗａ氏らによる“Ａ　５ｔｕ
ｄｙ　ｏｎ　Ｐｕ１ｓｅ　５ｅａｒｃｈ　Ａｌｇｏｒｉ
ｔｈｍｓ　ｆｏｒ　Ｍｕｌｔｉ−ｐｕｌｓｅ　５ｐｅｅ
ｃｈ　Ｃｏｄｅｒ　Ｒｅａｌｉｚａｔｉｏｎ”　（ＩＥ
ＥＥ　ＪＳＡＣ，ｐｐ。For example, “A 5tu” by Ozawa et al.
dy on Pulse 5earch Algori
thms for Multi-pulse 5pee
ch Coder Realization” (IE
EE JSAC, pp.

１３３−１４１．１９８６Ｘ文献７）を参照することが
できる。133-141.1986X Document 7).

また、ピッチ周期、ピッチゲインの計算法としては、前
述の実施例で示した方法の他に、例えば、下記（８）式
のように、過去の音源信号ｖ（ｎ）とピッチ再生フィル
タ、合成フィルタで再生した信号と、現サブフレームの
入力音声信号ｘ（ｎ）との誤差電力Ｅを最小化するよう
な位置Ｍを探索し、そのときの係数すを求めることもで
きる。In addition, as a method for calculating the pitch period and pitch gain, in addition to the method shown in the above embodiment, for example, as shown in the following equation (8), the past sound source signal v(n) and the pitch reproduction filter, It is also possible to search for a position M that minimizes the error power E between the signal reproduced by the filter and the input audio signal x(n) of the current subframe, and find the coefficients at that time.

Ｅ＝Σ［（ｘ（ｎ）−ｂ−Ｖ（ｎ−Ｔ）＊ｈ（ｎ））本
ｗ（ｎ）］　　　　　　　（８）ここで、ｈ（ｎ）は合
成フィルタのインパルス応答、ｗ（ｎ）は聴感重みすけ
回路のインパルス応答を示す。E=Σ[(x(n)-b-V(n-T)*h(n)) w(n)] (8) Here, h(n) is the impulse response of the synthesis filter, w(n ) shows the impulse response of the auditory weighting circuit.

また、送信側の合成フィルタ６１０では重みすけ信号を
再生するようにして、重みずけ回路５４０がらこれを減
算するような構成とすると、重みすけ回路６００を省略
することができる。Further, if the transmitting side synthesis filter 610 is configured to reproduce the weighted signal and the weighted signal is subtracted from the weighted signal, the weighted signal can be omitted.

また送信側における合成フィルタ６１０．６２５．８６
０を共通化することもできる。Also, the synthesis filter 610.625.86 on the transmitting side
0 can also be made common.

また、特性は少し低下するが、送信側で影響信号の減算
を省略することもできる。このような構成とすると、減
算器５３５、合成フィルタ６２５、加算器６２７、ピッ
チ再生フィルタ８５０、合成フィルタ８６０が不要とな
り、構成を簡略化できる。Furthermore, the subtraction of the influence signal can be omitted on the transmitting side, although the characteristics are slightly degraded. With such a configuration, the subtracter 535, the synthesis filter 625, the adder 627, the pitch recovery filter 850, and the synthesis filter 860 become unnecessary, and the configuration can be simplified.

（発明の効果）第１の発明によれば、有声フレームでは、ピッチ毎の周
期性の強いパルスについては、ピッチ補間により１つの
サブフレーム区間のパルスを求めることにより非常に効
率的に表し、ピッチ毎の相関のそれほど強くないパルス
についてはピッチ補間を用いずにマルチパルスを求めて
いるので、全てのパルスに対してピッチ予測を用いて求
める従来法と比較して、母音遷移部や過渡部など周期性
が少し弱くなる部分で音質を大きく改善することができ
るという効果がある。さらにピッチ補間では一つのサブ
フレームに対してのみマルチパルスを求めているので、
ピッチ予測マルチパルスに比べ必要な演算量を大幅に低
減することが可能という大きな効果がある。さらに、第
２の発明によれば、周期性がなく音源信号が雑音的な無
声フレームでは、最も良好な雑音信号を選択して音源を
表しているので従来方式に比べ音質がさらに改善される
という効果がある。(Effects of the Invention) According to the first invention, in voiced frames, pulses with strong periodicity for each pitch can be expressed very efficiently by determining pulses in one subframe section by pitch interpolation, and Since multi-pulses are obtained without using pitch interpolation for pulses whose correlation is not very strong, compared to the conventional method that uses pitch prediction for all pulses, it is possible to obtain multi-pulses without using pitch interpolation. This has the effect of greatly improving the sound quality in areas where the periodicity is slightly weakened. Furthermore, since pitch interpolation requires multipulses only for one subframe,
This has the great effect of significantly reducing the amount of calculation required compared to pitch prediction multi-pulse. Furthermore, according to the second invention, in unvoiced frames where there is no periodicity and the sound source signal is noise, the best noise signal is selected to represent the sound source, so the sound quality is further improved compared to the conventional method. effective.

[Brief explanation of the drawing]

第１図は第１の発明による音声符号化復号化方式の一実
施例の構成を示すブロック図、第２図は第２の発明によ
る音声符号化復号化方式の一実施例の構成を示すブロッ
ク図、第３図は本発明の作用を示すブロック図である。第４図はピッチ補間マルチパルスの例を表すブロック図
である。第５図は従来方式の例を示すブロック図である
。図において、１５０・・・ＬＰＧ、ピッチ分析部、２５
０・・・音源パルス計算部、２７０・・・パルス計８部
、５２０，５２２・・・スペクトル、ピッチパラメータ
計算回路、５２５・・・パラメータ量子化器、５３０・
・逆量子化器、５３５．２６０・・・減算器、５４０・
・・重みずけ回路、５５０・０．インパルス応答計算回
路、５６０・・・自己相関関数計算回路、５７０．６０
３・・・相互相関関数計算回路、５８５．６２０・・・
量子化器、６２７・・・加算器、５８６・・・パルス計
算回路、６０５．７２６・・・ピッチ補間回路、６１０
．６２５．７６０．８６０・・・合成フィルタ、６３５
・・・マルチプレクサ、７１０・・・デマルチプレクサ
、７２０・・・第１のパルス復号器、７２５・・・第２
のパルス復号器、７５０．８７０・・・パラメータ復号
器、７２７・・・パルス発生器、８００・・・コードブ
ック、８１０・・・畳み込み回路、８２０．８２５．８
６５・・・スイッチ、８３０・・・信号選択回路、８５
０．８８０・・・ピッチ再生フィルタ、８７５・・・復
号回路。FIG. 1 is a block diagram showing the configuration of an embodiment of the audio encoding/decoding method according to the first invention, and FIG. 2 is a block diagram showing the configuration of an embodiment of the audio encoding/decoding method according to the second invention. 3 are block diagrams showing the operation of the present invention. FIG. 4 is a block diagram showing an example of pitch interpolation multi-pulse. FIG. 5 is a block diagram showing an example of a conventional method. In the figure, 150...LPG, pitch analysis section, 25
0... Sound source pulse calculation unit, 270... Pulse meter 8 unit, 520, 522... Spectrum, pitch parameter calculation circuit, 525... Parameter quantizer, 530...
・Inverse quantizer, 535.260...Subtractor, 540・
...Weighting circuit, 550.0. Impulse response calculation circuit, 560...Autocorrelation function calculation circuit, 570.60
3... Cross-correlation function calculation circuit, 585.620...
Quantizer, 627...Adder, 586...Pulse calculation circuit, 605.726...Pitch interpolation circuit, 610
．． 625.760.860...Synthesis filter, 635
... multiplexer, 710 ... demultiplexer, 720 ... first pulse decoder, 725 ... second
pulse decoder, 750.870...parameter decoder, 727...pulse generator, 800...codebook, 810...convolution circuit, 820.825.8
65... Switch, 830... Signal selection circuit, 85
0.880...Pitch reproduction filter, 875...Decoding circuit.

Claims

[Claims]

(1) On the transmitting side, a discrete audio signal is input, a spectral parameter representing a spectral envelope and a pitch parameter representing a pitch period are extracted from the audio signal for each frame, and the audio signal of the frame is adjusted according to the pitch parameter. dividing the audio signal into small sections, using the pitch parameter and the spectrum parameter to obtain a first multi-pulse for the audio signal in one of the small sections, and coefficients for correcting the multi-pulse in other sections. After removing the signal obtained from the multi-pulse and the coefficient from the audio signal, a second multi-pulse is obtained using the spectral parameter, and on the receiving side, the first multi-pulse, the pitch parameter, and the signal are removed from the audio signal. A speech encoding/decoding method characterized in that a sound source signal is restored using a correction coefficient and the second multi-pulse, and a synthesized speech signal is obtained by driving a synthesis filter configured using the spectral parameter. .

(2) On the transmitting side, a discrete audio signal is input, a spectral parameter representing a spectral envelope and a pitch parameter representing a pitch period are extracted from the audio signal for each frame, and the audio signal of the frame is adjusted according to the pitch parameter. dividing the audio signal into small sections, and calculating a first multipulse using the pitch parameter and the spectrum parameter in one of the small sections as a sound source signal of the audio signal, and correcting the multipulse in other sections. a multipulse sound source obtained by calculating a second multipulse using the spectral parameter for the signal obtained by removing the multipulse and the signal calculated by the coefficient from the audio signal, It is expressed using a noise signal selected from a codebook consisting of predetermined types of noise signals so as to reduce the error power between the speech signal and a composite signal obtained from the noise signals, and the reception side 1 multipulse, the pitch parameter, the correction coefficient, and the second multipulse, or restore the sound source signal using the selected noise signal and configure using the spectral parameter. A speech encoding/decoding method characterized in that a synthesized filter is driven by the sound source signal to obtain a synthesized speech signal.