JPH10124093A

JPH10124093A - Method and device for speech compressive encoding

Info

Publication number: JPH10124093A
Application number: JP8273835A
Authority: JP
Inventors: Atsushi Yamane; 淳山根; Masanaka Sagara; 政仲相良
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1996-10-16
Filing date: 1996-10-16
Publication date: 1998-05-15

Abstract

PROBLEM TO BE SOLVED: To decrease the arithmetic quantity for encoding (quantization) and to reduce the memory capacity for storing noise code vectors by encoding (quantizing) a secondary error signal itself for the encoding (quantization) of the secondary error signal which was carried out by a noise code vector search in a process of CELP system encoding. SOLUTION: When noise source information is extracted and encoded, a secondary error signal constitution part 301 constitutes the secondary error signal of a subframe signal, extracted spectrum envelope information, pitch information, gain information and the residue signal up to the least subframe, a DCT transformation part 302 performs the discrete cosine transformation of the extracted secondary error signal, and a coefficient conversion part 303 converts a DCT transformed coefficient sequence to specific bit length, thereby extracting and encoding noise source information.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は，留守番電話，音声
応答システム，ボイスメール等に適用される音声圧縮符
号化方法および装置に関し，より詳細には，アナログ音
声波形を入力してディジタル音声波形に変換した後，該
ディジタル音声波形を所定の符号化方式で符号化するこ
とにより，データ量を圧縮する音声圧縮符号化方法およ
び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice compression encoding method and apparatus applied to an answering machine, a voice response system, a voice mail, and the like. The present invention relates to an audio compression encoding method and apparatus for compressing the amount of data by encoding the digital audio waveform by a predetermined encoding method after conversion.

【０００２】[0002]

【従来の技術】近年，自動車電話等の移動体通信におけ
るチャンネル容量の拡大や，マルチメディア通信におけ
る膨大な情報の蓄積・伝送の必要性から，実用的な低ビ
ットレート音声符号化に対する要求が高まっている。2. Description of the Related Art In recent years, there has been an increasing demand for practical low bit rate speech coding due to the expansion of channel capacity in mobile communications such as automobile telephones and the necessity of storing and transmitting enormous information in multimedia communications. ing.

【０００３】また，ファクシミリ・モデムの付加機能と
して，留守番電話のための音声符号化／復号化機能を備
えたものが求められており，この音声符号化／復号化た
めの低ビットレートの音声符号化手法の開発も望まれて
いる。As an additional function of the facsimile modem, a function having a voice encoding / decoding function for an answering machine is required, and a low bit rate voice code for the voice encoding / decoding is required. There is also a desire for the development of a method for the conversion.

【０００４】現在，１０ｋｂｐｓ以下の低ビットレート
音声符号化方式では，ＣＥＬＰ（ＣｏｄｅＥｘｃｉｔ
ｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎｃｏｄｉ
ｎｇｓｙｓｔｅｍ）方式が主流になっている。このＣＥ
ＬＰ方式は，線形予測に基づく音声のＡＲ（Ａｕｔｏ−
Ｒｅｇｒｅｓｓｉｖｅ：自己回帰）モデルに基づいた符
号化方式である。At present, in a low bit rate speech coding system of 10 kbps or less, CELP (Code Exit) is used.
ed Linear Prediction codi
ngsystem) method has become mainstream. This CE
The LP method uses a speech AR (Auto-
This is an encoding method based on a regressive (autoregressive) model.

【０００５】具体的には，符号化側において，音声をフ
レームまたはサブフレームと呼ばれる単位に分割し，そ
れぞれの単位についてスペクトル包絡を表すＬＰＣ（Ｌ
ｉｎｅａｒＰｒｅｄｉｃｔｉｏｎＣｏｄｉｎｇ：線
形予測）係数と，そのピッチ情報を表すピッチラグと，
音源情報である雑音源情報と，利得とを抽出し，それぞ
れ符号化（量子化）を行い，格納または伝送するもので
ある。Specifically, on the encoding side, speech is divided into units called frames or subframes, and LPC (LPC (LPC)
inner Prediction Coding (linear prediction) coefficient, a pitch lag representing the pitch information thereof,
It extracts noise source information, which is sound source information, and gain, performs encoding (quantization), and stores or transmits them.

【０００６】また，復号側では，符号化された各情報を
復元し，雑音源情報にピッチ情報を加えることによって
励振源信号を生成し，この励振源信号をＬＰＣ係数で構
成される線形予測合成フィルタに通し，合成音声を得る
ものである。On the decoding side, the encoded information is restored, an excitation source signal is generated by adding pitch information to the noise source information, and this excitation source signal is subjected to linear prediction synthesis composed of LPC coefficients. The synthesized speech is obtained through a filter.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら，上記従
来のＣＥＬＰ方式では，１０ｋｂｐｓの低ビットレート
において，良好な音声を得ることができるという利点を
有する反面，それぞれのパラメータの符号化過程におけ
る演算量が多いという問題点があった。However, the above-mentioned conventional CELP system has an advantage that good speech can be obtained at a low bit rate of 10 kbps, but the amount of calculation in the encoding process of each parameter is small. There was a problem that there were many.

【０００８】特に，ピッチラグの符号化や雑音源情報の
符号化については，符号化された励振源信号を線形予測
合成フィルタに通した合成音声を生成し，原音声と比較
する必要があるが，フィルタ演算には多くの演算を必要
とするため，全ての励振源信号をフィルタに通すのは非
現実的であるという問題点があった。In particular, for pitch lag encoding and noise source information encoding, it is necessary to generate a synthesized speech obtained by passing the encoded excitation source signal through a linear prediction synthesis filter, and to compare the synthesized speech with the original speech. Since many operations are required for the filter operation, it is impractical to pass all the excitation source signals through the filter.

【０００９】また，従来のＣＥＬＰ方式では，二次誤差
信号の符号帳を持ち，符号帳に属する各符号ベクトルと
スペクトル包絡とから二次誤差信号を合成し，入力信号
から得られた二次誤差信号と比較し，そのひずみが最小
となる符号を選択することによって符号化を行っている
ため，符号帳探索のための演算量および符号帳を蓄える
ためのメモリ量が多くなるという問題点もあった。Further, the conventional CELP system has a codebook of a secondary error signal, synthesizes a secondary error signal from each code vector belonging to the codebook and a spectral envelope, and obtains a secondary error signal obtained from an input signal. Since encoding is performed by selecting a code that minimizes the distortion as compared with the signal, the amount of computation for searching for the codebook and the amount of memory for storing the codebook also increase. Was.

【００１０】なお，ＣＥＬＰ方式における演算量を削減
する従来技術として，例えば，フィルタ演算を行って比
較するのではなく，近似的に原音声との比較を行うこと
のできるパラメータによって絞り込むという予備選択手
法が提案されている。As a conventional technique for reducing the amount of calculation in the CELP system, for example, a preliminary selection method of narrowing down by a parameter that can be compared with the original voice approximately, instead of performing a filter calculation and comparing. Has been proposed.

【００１１】また，雑音源は，与えられたビット数に相
当する雑音ベクトルを蓄えているのが一般的であり，そ
の構成を工夫することにより，演算量を削減する方法も
提案されている。具体的には，雑音ベクトルをビット数
だけ持ち，それらの和や差で雑音源を表すＶＳＥＬＰ
（ＶｅｃｔｏｒＳｕｍＥｘｃｉｔｅｄＬｉｎｅａ
ｒＰｒｅｄｉｃｔｉｏｎｃｏｄｉｎｇ）方式や，オ
ーバーラッピング型の符号長を用いる方式（特開昭６４
−５４４９７号公報『コード励振線形予測ボコーダ及び
その方法』）等がある。Further, a noise source generally stores a noise vector corresponding to a given number of bits, and a method of reducing the amount of calculation by devising the configuration has been proposed. Specifically, VSELP which has a noise vector by the number of bits and represents a noise source by the sum or difference thereof
(Vector Sum Excited Linea
r Prediction coding) method and a method using an overlapping type code length (Japanese Patent Laid-Open No.
-54497, "Code Excited Linear Prediction Vocoder and Method Thereof").

【００１２】ところが，実用的な低ビットレート音声符
号化に対する要求から，上記従来のＣＥＬＰ方式におけ
る演算量を削減する方法（予備選択手法，ＶＳＥＬＰ方
式等）の他にも，それらとは異なる方法で演算量を削減
可能なものが要望されている。However, due to the demand for practical low-bit-rate speech coding, in addition to the above-mentioned methods for reducing the amount of computation in the conventional CELP method (preliminary selection method, VSELP method, etc.), methods different from those methods are used. What can reduce the amount of calculation is demanded.

【００１３】本発明は上記に鑑みてなされたものであっ
て，ＣＥＬＰ方式の符号化の過程において，雑音符号ベ
クトル探索によって行っていた二次誤差信号の符号化
（量子化）を，二次誤差信号そのものを符号化（量子
化）することにより，符号化（量子化）のための演算量
を削減すると共に，雑音符号ベクトルを蓄えるためのメ
モリ量の低減を図ることを目的とする。The present invention has been made in view of the above, and in the process of coding in the CELP system, the coding (quantization) of the secondary error signal performed by the noise code vector search is performed by the secondary error signal. An object of the present invention is to encode (quantize) a signal itself to reduce the amount of computation for encoding (quantization) and to reduce the amount of memory for storing a noise code vector.

【００１４】[0014]

【課題を解決するための手段】上記の目的を達成するた
めに，請求項１に係る音声圧縮符号化方法は，アナログ
音声波形を入力してディジタル音声波形に変換する第１
の工程と，前記ディジタル音声波形を所定の符号化方式
で符号化する第２の工程と，前記符号化されたディジタ
ル音声波形を蓄積する第３の工程と，前記蓄積されたデ
ィジタル音声波形を取り出して復号化する第４の工程
と，前記復号化されたディジタル音声波形をアナログ音
声波形に変換する第５の工程と，を有する音声圧縮符号
化方法において，前記第２の工程が，前記ディジタル音
声波形をフレームと呼ばれる処理単位に分割するフレー
ム分割工程と，前記分割したフレームのそれぞれからス
ペクトル包絡を表すスペクトル包絡情報を抽出して符号
化するスペクトル包絡情報抽出工程と，前記分割したフ
レームのそれぞれをサブフレームと呼ばれる処理単位に
分割するサブフレーム分割工程と，前記抽出したスペク
トル包絡情報を用いて，前記分割したサブフレームのそ
れぞれからピッチ情報を抽出して符号化するピッチ情報
抽出工程と，前記抽出したピッチ情報から利得情報を抽
出して符号化する利得情報抽出工程と，前記スペクトル
包絡情報，サブフレーム，ピッチ情報および利得情報か
ら音源情報である雑音源情報を抽出して符号化する雑音
源情報抽出工程と，を含み，前記第４の工程が，前記符
号化されたスペクトル包絡情報，ピッチ情報，利得情報
および雑音源情報を復元する復元工程と，前記復元した
ピッチ情報，利得情報および雑音源情報から励振源信号
を生成する励振源信号生成工程と，前記励振源信号と前
記復元されたスペクトル包絡情報とから合成音声を生成
する合成音声生成工程と，を含み，さらに，前記雑音源
抽出工程が，前記抽出したスペクトル包絡情報，サブフ
レーム，ピッチ情報および利得情報を用いて，二次誤差
信号を抽出する二次誤差信号抽出工程と，前記抽出した
二次誤差信号を離散コサイン変換する離散コサイン変換
工程と，前記離散コサイン変換工程で得られた離散コサ
イン変換係数列を所定のビット長に変換する係数列変換
工程と，を含むことにより，前記雑音源情報を抽出して
符号化するものである。According to a first aspect of the present invention, there is provided a speech compression encoding method according to the first aspect of the present invention, wherein an analog speech waveform is inputted and converted into a digital speech waveform.
And a second step of encoding the digital audio waveform by a predetermined encoding method, a third step of storing the encoded digital audio waveform, and extracting the accumulated digital audio waveform. And a fifth step of converting the decoded digital audio waveform into an analog audio waveform, wherein the second step comprises: A frame dividing step of dividing a waveform into processing units called frames; a spectrum envelope information extracting step of extracting and encoding spectrum envelope information representing a spectrum envelope from each of the divided frames; Using a subframe division step of dividing into processing units called subframes and the extracted spectral envelope information, A pitch information extracting step of extracting and encoding pitch information from each of the divided subframes; a gain information extracting step of extracting and encoding gain information from the extracted pitch information; Extracting noise source information as sound source information from subframes, pitch information and gain information, and encoding the extracted noise source information, wherein the fourth step comprises the steps of: Information, gain information and noise source information, an excitation source signal generation step of generating an excitation source signal from the restored pitch information, gain information and noise source information, and an excitation source signal and the restored And a synthesized speech generating step of generating a synthesized speech from the spectrum envelope information. A second-order error signal extracting step of extracting a second-order error signal using the short-circuit information, the subframe, the pitch information, and the gain information; a discrete cosine transform step of performing a discrete cosine transform of the extracted second-order error signal; A coefficient sequence converting step of converting the discrete cosine transform coefficient sequence obtained in the cosine transform step into a predetermined bit length, thereby extracting and encoding the noise source information.

【００１５】また，請求項２に係る音声圧縮符号化方法
は，請求項１記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換係数列を所定
のビット長に変換する際に，前記離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択した後，前記選択した係数の位置および
係数値と前記選択した係数の前後に位置する予め定めた
数の係数の係数値とを所定のビット長に符号化すること
により，前記離散コサイン変換係数列を所定のビット長
に変換するものである。According to a second aspect of the present invention, in the first aspect, the coefficient sequence converting step converts the discrete cosine transform coefficient sequence into a predetermined bit length. At this time, after selecting a predetermined number of coefficients from the plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, the position and coefficient value of the selected coefficient and the coefficient located before and after the selected coefficient are determined. The discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding a coefficient value of a predetermined number of coefficients into a predetermined bit length.

【００１６】また，請求項３に係る音声圧縮符号化方法
は，請求項２記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換工程で得られ
た複数の離散コサイン変換係数から予め定めた数の係数
を選択する際に，係数値の絶対値が最大の係数を選択す
るものである。According to a third aspect of the present invention, in the voice compression encoding method according to the second aspect, the coefficient sequence conversion step includes the step of converting a plurality of discrete cosine transforms obtained in the discrete cosine conversion step. When a predetermined number of coefficients are selected from the coefficients, the coefficient having the largest absolute value of the coefficient value is selected.

【００１７】また，請求項４に係る音声圧縮符号化方法
は，請求項２記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換工程で得られ
た複数の離散コサイン変換係数から予め定めた数の係数
を選択する際に，係数値の絶対値が最大のものから予め
定めた数の係数を選択するものである。According to a fourth aspect of the present invention, in the voice compression encoding method according to the second aspect, the coefficient sequence conversion step includes the step of converting a plurality of discrete cosine transforms obtained in the discrete cosine conversion step. When a predetermined number of coefficients are selected from the coefficients, a predetermined number of coefficients are selected from those having the largest absolute value of the coefficient value.

【００１８】また，請求項５に係る音声圧縮符号化方法
は，請求項２記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換工程で得られ
た複数の離散コサイン変換係数から予め定めた数の係数
を選択する際に，予め設定した閾値を超える係数値を有
する係数を選択するものである。According to a fifth aspect of the present invention, there is provided the voice compression encoding method according to the second aspect, wherein the coefficient sequence conversion step is performed by a plurality of discrete cosine transforms obtained in the discrete cosine transform step. When a predetermined number of coefficients are selected from the coefficients, a coefficient having a coefficient value exceeding a preset threshold is selected.

【００１９】また，請求項６に係る音声圧縮符号化方法
は，請求項１記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換係数列を所定
のビット長に変換する際に，前記離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
閾値を超える係数値を有する係数を選択し，前記選択し
た係数の位置と係数値とを所定のビット長に符号化する
ことにより，前記離散コサイン変換係数列を所定のビッ
ト長に変換するものである。According to a sixth aspect of the present invention, in the first aspect, the coefficient sequence converting step converts the discrete cosine transform coefficient sequence into a predetermined bit length. At this time, a coefficient having a coefficient value exceeding a predetermined threshold is selected from a plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and the position and coefficient value of the selected coefficient are set to a predetermined bit length. By encoding, the discrete cosine transform coefficient sequence is converted into a predetermined bit length.

【００２０】また，請求項７に係る音声圧縮符号化方法
は，請求項１記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換係数列を所定
のビット長に変換する際に，前記離散コサイン変換工程
で得られた複数の離散コサイン変換係数から一つの係数
を選択し，前記選択した係数の位置および係数値と前記
選択した係数の前後に位置する予め定めた数の係数の係
数値とを所定のビット長に符号化した後，さらに前記符
号化した係数を除いた前記複数の離散コサイン変換係数
から一つの係数を選択し，前記選択した係数の位置およ
び係数値と前記選択した係数の前後に位置する予め定め
た数の係数の係数値とを所定のビット長に符号化する工
程を予め設定した回数実行することにより，前記離散コ
サイン変換係数列を所定のビット長に変換するものであ
る。According to a seventh aspect of the present invention, in the voice compression encoding method according to the first aspect, the coefficient sequence converting step converts the discrete cosine transform coefficient sequence into a predetermined bit length. At this time, one coefficient is selected from the plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and a predetermined number of positions and coefficient values of the selected coefficient and a predetermined number of positions before and after the selected coefficient are selected. After encoding the coefficient value of the coefficient to a predetermined bit length, one coefficient is further selected from the plurality of discrete cosine transform coefficients excluding the encoded coefficient, and the position and coefficient value of the selected coefficient and The step of encoding a predetermined number of coefficient values before and after the selected coefficient to a predetermined bit length is executed a predetermined number of times, whereby the discrete cosine transform coefficient sequence And converts it into a predetermined bit length.

【００２１】また，請求項８に係る音声圧縮符号化方法
は，請求項７記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換工程で得られ
た複数の離散コサイン変換係数から一つの係数を選択す
る際に，係数値の絶対値が最大の係数を選択し，前記符
号化した係数を除いた前記複数の離散コサイン変換係数
から一つの係数を選択する際に，係数値の絶対値が最大
の係数を選択するものである。According to an eighth aspect of the present invention, in the voice compression encoding method according to the seventh aspect, the coefficient sequence conversion step includes the step of converting the plurality of discrete cosine transforms obtained in the discrete cosine conversion step. When selecting one coefficient from the coefficients, the coefficient having the largest absolute value of the coefficient value is selected, and when selecting one coefficient from the plurality of discrete cosine transform coefficients excluding the coded coefficient, The coefficient whose absolute value is the largest is selected.

【００２２】また，請求項９に係る音声圧縮符号化方法
は，請求項１記載の音声圧縮符号化方法において，前記
係数列変換工程が，前記離散コサイン変換係数列を所定
のビット長に変換する際に，前記離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択し，前記選択した係数の係数値と前記選
択した係数の前後に位置する予め定めた数の係数の係数
値との第１の比のパターンを求めた後，予め用意した複
数の第２の比のパターンと符号との対応関係情報から前
記第１の比のパターンに最も近い性質の前記第２の比の
パターンを選択し，前記選択した第２の比のパターンに
対応する符号を用いて前記第１の比のパターンを符号化
することにより，前記離散コサイン変換係数列を所定の
ビット長に変換するものである。According to a ninth aspect of the present invention, in the audio compression encoding method according to the first aspect, the coefficient sequence converting step converts the discrete cosine transform coefficient sequence into a predetermined bit length. At this time, a predetermined number of coefficients are selected from the plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and a coefficient value of the selected coefficient and a predetermined number of coefficients located before and after the selected coefficient are selected. After obtaining the pattern of the first ratio with the coefficient value of the coefficient of the above, from the correspondence relationship information between the plurality of patterns of the second ratio and the code prepared in advance, the pattern having the property closest to the pattern of the first ratio is obtained. By selecting a pattern of the second ratio and encoding the pattern of the first ratio using a code corresponding to the selected pattern of the second ratio, the discrete cosine transform coefficient sequence is converted to a predetermined bit. Convert to long It is intended.

【００２３】また，請求項１０に係る音声圧縮符号化方
法は，請求項９記載の音声圧縮符号化方法において，前
記係数列変換工程が，前記第２の比のパターンを選択す
る際に，前記第１の比のパターンとのユークリッド距離
が最も小さい前記第２の比のパターンを選択するもので
ある。According to a tenth aspect of the present invention, in the voice compression encoding method according to the ninth aspect, when the coefficient sequence conversion step selects the pattern of the second ratio, The pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio is selected.

【００２４】また，請求項１１に係る音声圧縮符号化方
法は，請求項１０記載の音声圧縮符号化方法において，
前記予め用意した複数の第２の比のパターンと符号との
対応関係情報が，多次元データ構造により管理されてお
り，前記係数列変換工程が，前記第２の比のパターンを
選択する際に，最近点探索を用いて前記第１の比のパタ
ーンとのユークリッド距離が最も小さい前記第２の比の
パターンを選択するものである。[0024] According to a still further aspect of the present invention, there is provided a voice compression encoding method as defined in claim 10, wherein
Correspondence information between the plurality of second ratio patterns and codes prepared in advance is managed by a multidimensional data structure, and when the coefficient sequence conversion step selects the second ratio pattern, , Using the nearest point search to select the pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio.

【００２５】また，請求項１２に係る音声圧縮符号化方
法は，請求項９記載の音声圧縮符号化方法において，前
記係数列変換工程が，前記第２の比のパターンを選択す
る際に，前記第１の比のパターンとのノルムが最も小さ
い前記第２の比のパターンを選択するものである。According to a twelfth aspect of the present invention, in the audio compression encoding method according to the ninth aspect, when the coefficient sequence conversion step selects the pattern of the second ratio, The second ratio pattern having the smallest norm with respect to the first ratio pattern is selected.

【００２６】また，請求項１３に係る音声圧縮符号化装
置は，アナログ音声波形を入力してディジタル音声波形
に変換するＡ／Ｄ変換手段と，前記ディジタル音声波形
を所定の符号化方式で符号化する音声符号化手段と，前
記符号化されたディジタル音声波形を蓄積する蓄積手段
と，前記蓄積されたディジタル音声波形を取り出して復
号化する音声復号化手段と，前記復号化されたディジタ
ル音声波形をアナログ音声波形に変換するＤ／Ａ変換手
段と，を有する音声圧縮符号化装置において，前記音声
符号化手段が，前記ディジタル音声波形をフレームと呼
ばれる処理単位に分割するフレーム分割手段と，前記分
割したフレームのそれぞれからスペクトル包絡を表すス
ペクトル包絡情報を抽出して符号化するスペクトル包絡
情報抽出手段と，前記分割したフレームのそれぞれをサ
ブフレームと呼ばれる処理単位に分割するサブフレーム
分割手段と，前記抽出したスペクトル包絡情報を用い
て，前記分割したサブフレームのそれぞれからピッチ情
報を抽出して符号化するピッチ情報抽出手段と，前記抽
出したピッチ情報から利得情報を抽出して符号化する利
得情報抽出手段と，前記スペクトル包絡情報，サブフレ
ーム，ピッチ情報および利得情報から音源情報である雑
音源情報を抽出して符号化する雑音源情報抽出手段と，
を含み，前記音声復号化手段が，前記符号化されたスペ
クトル包絡情報，ピッチ情報，利得情報および雑音源情
報を復元する復元手段と，前記復元したピッチ情報，利
得情報および雑音源情報から励振源信号を生成する励振
源信号生成手段と，前記励振源信号と前記復元されたス
ペクトル包絡情報とから合成音声を生成する合成音声生
成手段と，を含み，さらに，前記雑音源抽出手段が，前
記抽出したスペクトル包絡情報，サブフレーム，ピッチ
情報および利得情報を用いて，二次誤差信号を抽出する
二次誤差信号抽出手段と，前記抽出した二次誤差信号を
離散コサイン変換する離散コサイン変換手段と，前記離
散コサイン変換手段で得られた離散コサイン変換係数列
を所定のビット長に変換する係数列変換手段と，を含む
ことにより，前記雑音源情報を抽出して符号化するもの
である。A speech compression encoding apparatus according to a thirteenth aspect of the present invention provides an A / D conversion means for inputting an analog speech waveform and converting it into a digital speech waveform, and encoding the digital speech waveform by a predetermined encoding method. Voice encoding means, a storage means for storing the coded digital voice waveform, a voice decoding means for taking out and decoding the stored digital voice waveform, and a voice decoding means for storing the decoded digital voice waveform. And a D / A converting means for converting the digital audio waveform into an analog audio waveform. The audio encoding means comprises: a frame dividing means for dividing the digital audio waveform into processing units called frames; Spectrum envelope information extraction means for extracting and encoding spectrum envelope information representing a spectrum envelope from each of the frames, A sub-frame dividing unit that divides each of the divided frames into processing units called sub-frames, and a pitch that extracts pitch information from each of the divided sub-frames and encodes the pitch information using the extracted spectral envelope information. Information extraction means, gain information extraction means for extracting and encoding gain information from the extracted pitch information, and noise source information as sound source information from the spectrum envelope information, subframe, pitch information and gain information. Source information extraction means for encoding
Wherein the speech decoding means restores the encoded spectrum envelope information, pitch information, gain information and noise source information, and an excitation source based on the restored pitch information, gain information and noise source information. An excitation source signal generation unit for generating a signal; and a synthesized speech generation unit for generating a synthesized speech from the excitation source signal and the restored spectrum envelope information. A second-order error signal extracting means for extracting a second-order error signal using the extracted spectral envelope information, subframe, pitch information, and gain information; a discrete cosine transform means for performing a discrete cosine transform of the extracted second-order error signal; A coefficient sequence converting means for converting a discrete cosine transform coefficient sequence obtained by the discrete cosine transform means into a predetermined bit length, It is intended to encode by extracting the sound source information.

【００２７】また，請求項１４に係る音声圧縮符号化装
置は，請求項１３記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換係数列を
所定のビット長に変換する際に，前記離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた数の係数を選択した後，前記選択した係数の位置お
よび係数値と前記選択した係数の前後に位置する予め定
めた数の係数の係数値とを所定のビット長に符号化する
ことにより，前記離散コサイン変換係数列を所定のビッ
ト長に変換するものである。According to a fourteenth aspect of the present invention, in the voice compression encoding apparatus according to the thirteenth aspect,
When the coefficient sequence converting means converts the discrete cosine transform coefficient sequence into a predetermined bit length, after selecting a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means The discrete cosine transform coefficient sequence is encoded by encoding the position and coefficient value of the selected coefficient and the coefficient values of a predetermined number of coefficients before and after the selected coefficient into a predetermined bit length. Is converted to the bit length.

【００２８】また，請求項１５に係る音声圧縮符号化装
置は，請求項１４記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換手段で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，係数値の絶対値が最大の係数を選
択するものである。According to a fifteenth aspect of the present invention, in the voice compression encoding apparatus according to the fourteenth aspect,
The coefficient sequence converting means selects a coefficient having a maximum absolute value of a coefficient value when selecting a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means. .

【００２９】また，請求項１６に係る音声圧縮符号化装
置は，請求項１４記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換手段で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，係数値の絶対値が最大のものから
予め定めた数の係数を選択するものである。According to a sixteenth aspect of the present invention, in the audio compression encoding apparatus according to the fourteenth aspect,
When the coefficient sequence conversion means selects a predetermined number of coefficients from the plurality of discrete cosine transform coefficients obtained by the discrete cosine conversion means, the coefficient sequence conversion means selects a predetermined number of coefficients from the largest absolute value of the coefficient value. This is for selecting the coefficient.

【００３０】また，請求項１７に係る音声圧縮符号化装
置は，請求項１４記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換手段で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，予め設定した閾値を超える係数値
を有する係数を選択するものである。According to a seventeenth aspect of the present invention, in the voice compression encoding apparatus according to the fourteenth aspect,
The coefficient sequence conversion means, when selecting a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by the discrete cosine conversion means, selects a coefficient having a coefficient value exceeding a preset threshold value It is.

【００３１】また，請求項１８に係る音声圧縮符号化装
置は，請求項１３記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換係数列を
所定のビット長に変換する際に，前記離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた閾値を超える係数値を有する係数を選択した後，前
記選択した係数の位置と係数値とを所定のビット長に符
号化することにより，前記離散コサイン変換係数列を所
定のビット長に変換するものである。[0031] The speech compression encoding apparatus according to claim 18 is the speech compression encoding apparatus according to claim 13,
When the coefficient sequence converting means converts the discrete cosine transform coefficient sequence into a predetermined bit length, the coefficient sequence converting means has a coefficient value exceeding a predetermined threshold from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform coefficient. After selecting a coefficient, the discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding the position and coefficient value of the selected coefficient to a predetermined bit length.

【００３２】また，請求項１９に係る音声圧縮符号化装
置は，請求項１３記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換係数列を
所定のビット長に変換する際に，前記離散コサイン変換
手段で得られた複数の離散コサイン変換係数から一つの
係数を選択し，前記選択した係数の位置および係数値と
前記選択した係数の前後に位置する予め定めた数の係数
の係数値とを所定のビット長に符号化した後，さらに前
記符号化した係数を除いた前記複数の離散コサイン変換
係数から一つの係数を選択し，前記選択した係数の位置
および係数値と前記選択した係数の前後に位置する予め
定めた数の係数の係数値とを所定のビット長に符号化す
る工程を予め設定した回数実行することにより，前記離
散コサイン変換係数列を所定のビット長に変換するもの
である。[0032] The speech compression encoding apparatus according to claim 19 is the speech compression encoding apparatus according to claim 13,
When the coefficient sequence converting means converts the discrete cosine transform coefficient sequence into a predetermined bit length, the coefficient sequence converting means selects one coefficient from the plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, and After encoding the position and coefficient value of a coefficient and the coefficient values of a predetermined number of coefficients positioned before and after the selected coefficient to a predetermined bit length, the plurality of discrete values excluding the encoded coefficient are further encoded. Selecting one coefficient from the cosine transform coefficients and encoding the position and coefficient value of the selected coefficient and the coefficient values of a predetermined number of coefficients positioned before and after the selected coefficient into a predetermined bit length; Is executed a predetermined number of times to convert the discrete cosine transform coefficient sequence into a predetermined bit length.

【００３３】また，請求項２０に係る音声圧縮符号化装
置は，請求項１９記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換手段で得
られた複数の離散コサイン変換係数から一つの係数を選
択する際に，係数値の絶対値が最大の係数を選択し，前
記符号化した係数を除いた前記複数の離散コサイン変換
係数から一つの係数を選択する際に，係数値の絶対値が
最大の係数を選択するものである。According to a twentieth aspect of the present invention, in the voice compression encoding apparatus according to the nineteenth aspect,
When the coefficient sequence converting means selects one coefficient from the plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, the coefficient sequence converting means selects the coefficient having the largest absolute value of the coefficient value, and selects the coded coefficient. When selecting one coefficient from the plurality of discrete cosine transform coefficients except for the coefficient, the coefficient having the largest absolute value of the coefficient value is selected.

【００３４】また，請求項２１に係る音声圧縮符号化装
置は，請求項１３記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記離散コサイン変換係数列を
所定のビット長に変換する際に，前記離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた数の係数を選択し，前記選択した係数の係数値と前
記選択した係数の前後に位置する予め定めた数の係数の
係数値との第１の比のパターンを求めた後，予め用意し
た複数の第２の比のパターンと符号との対応関係情報か
ら前記第１の比のパターンに最も近い性質の前記第２の
比のパターンを選択し，前記選択した第２の比のパター
ンに対応する符号を用いて前記第１の比のパターンを符
号化することにより，前記離散コサイン変換係数列を所
定のビット長に変換するものである。According to a twenty-first aspect of the present invention, in the audio compression encoding apparatus according to the thirteenth aspect,
When the coefficient sequence converting means converts the discrete cosine transform coefficient sequence into a predetermined bit length, a predetermined number of coefficients are selected from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, After obtaining a first ratio pattern between a coefficient value of the selected coefficient and coefficient values of a predetermined number of coefficients positioned before and after the selected coefficient, a plurality of second ratio patterns prepared in advance And selecting the second ratio pattern having the property closest to the first ratio pattern from the correspondence information between the first ratio pattern and the first ratio pattern using the code corresponding to the selected second ratio pattern. The discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding a pattern having a ratio of.

【００３５】また，請求項２２に係る音声圧縮符号化装
置は，請求項２１記載の音声圧縮符号化装置において，
前記係数列変換手段が，前記第２の比のパターンを選択
する際に，前記第１の比のパターンとのユークリッド距
離が最も小さい前記第２の比のパターンを選択するもの
である。The speech compression encoding apparatus according to claim 22 is the speech compression encoding apparatus according to claim 21,
The coefficient sequence converting means selects the second ratio pattern having the shortest Euclidean distance from the first ratio pattern when selecting the second ratio pattern.

【００３６】また，請求項２３に係る音声圧縮符号化装
置は，請求項２２記載の音声圧縮符号化装置において，
前記予め用意した複数の第２の比のパターンと符号との
対応関係情報が，多次元データ構造により管理されてお
り，前記係数列変換手段が，前記第２の比のパターンを
選択する際に，最近点探索を用いて前記第１の比のパタ
ーンとのユークリッド距離が最も小さい前記第２の比の
パターンを選択するものである。According to a twenty-third aspect of the present invention, in the voice compression encoding apparatus according to the twenty-second aspect,
Correspondence information between the plurality of second ratio patterns and codes prepared in advance is managed by a multidimensional data structure, and when the coefficient sequence conversion means selects the second ratio pattern, , Using the nearest point search to select the pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio.

【００３７】さらに，請求項２４に係る音声圧縮符号化
装置は，請求項２１記載の音声圧縮符号化装置におい
て，前記係数列変換手段が，前記第２の比のパターンを
選択する際に，前記第１の比のパターンとのノルムが最
も小さい前記第２の比のパターンを選択するものであ
る。Further, in the speech compression encoding apparatus according to claim 24, in the speech compression encoding apparatus according to claim 21, when the coefficient sequence converting means selects the pattern of the second ratio, The second ratio pattern having the smallest norm with respect to the first ratio pattern is selected.

【００３８】[0038]

【発明の実施の形態】以下，本発明の音声圧縮符号化方
法および装置について，〔実施の形態１〕，〔実施の形
態２〕，〔実施の形態３〕，〔実施の形態４〕，〔実施
の形態５〕，〔実施の形態６〕，〔実施の形態７〕，
〔実施の形態８〕，〔実施の形態９〕，〔実施の形態１
０〕，〔実施の形態１１〕，〔実施の形態１２〕の順
で，図面を参照して詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A speech compression encoding method and apparatus according to the present invention will be described below with reference to [Embodiment 1], [Embodiment 2], [Embodiment 3], [Embodiment 4], [ Fifth Embodiment, Sixth Embodiment, Seventh Embodiment,
[Embodiment 8], [Embodiment 9], [Embodiment 1]
0], [Embodiment 11], and [Embodiment 12] will be described in detail with reference to the drawings.

【００３９】〔実施の形態１〕図１は，実施の形態１の
音声圧縮符号化装置１００の概略構成図を示す。音声圧
縮符号化装置１００は，アナログ信号（アナログ音声波
形）を入力してディジタル信号（ディジタル音声波形）
に変換するＡ／Ｄ変換手段としてのＡ／Ｄ変換部１０１
と，Ａ／Ｄ変換部１０１からディジタル信号を入力し
て，圧縮符号化する音声符号化手段としての音声符号化
部１０２と，圧縮符号化された圧縮符号化信号（量子化
信号）を蓄積する蓄積手段としての蓄積部１０３と，圧
縮符号化信号（量子化信号）を伸長復号する音声復号化
手段としての音声復号化部１０４と，復号化されたディ
ジタル信号をアナログ信号に変換するＤ／Ａ変換手段と
してのＤ／Ａ変換部１０５と，から構成される。[Embodiment 1] FIG. 1 is a schematic block diagram of a speech compression encoding apparatus 100 according to Embodiment 1. The audio compression encoding apparatus 100 receives an analog signal (analog audio waveform) and inputs a digital signal (digital audio waveform).
A / D conversion unit 101 as A / D conversion means for converting to A / D
And a digital signal input from the A / D conversion unit 101, and stores an audio encoding unit 102 as audio encoding means for performing compression encoding, and stores a compression-encoded compression-encoded signal (quantized signal). A storage unit 103 serving as storage means, a voice decoding unit 104 serving as voice decoding means for expanding and decoding the compressed coded signal (quantized signal), and a D / A converting the decoded digital signal into an analog signal. And a D / A conversion unit 105 as conversion means.

【００４０】図２は，音声符号化部１０２のブロック構
成図を示す。音声符号化部１０２は，入力したディジタ
ル信号を予め定められたサンプル数のフレーム単位に分
割し，フレーム信号を出力するフレーム構成部２０１
と，フレーム構成部２０１で分割したフレーム（フレー
ム信号）から，フレーム単位でスペクトル包絡を表すス
ペクトル包絡情報を抽出して符号化するスペクトル包絡
抽出部２０２と，フレーム構成部２０１で分割したフレ
ームを更に予め定められたサンプル数のサブフレーム単
位に分割し，サブフレーム信号を出力するサブフレーム
構成部２０３と，スペクトル包絡抽出部２０２で抽出し
たスペクトル包絡情報を用いて，サブフレーム構成部２
０３で分割したサブフレームからピッチ情報を抽出して
符号化するピッチ情報抽出部２０４と，ピッチ情報から
利得情報を抽出して符号化する利得抽出部２０５と，ス
ペクトル包絡情報，サブフレーム，ピッチ情報および利
得情報から音源情報である雑音源情報を抽出して符号化
する雑音源抽出部２０６と，から構成される。FIG. 2 is a block diagram showing the arrangement of the speech encoding unit 102. The speech encoding unit 102 divides the input digital signal into frames each having a predetermined number of samples, and outputs a frame signal.
And a spectrum envelope extraction unit 202 for extracting and encoding spectrum envelope information representing a spectrum envelope in frame units from the frame (frame signal) divided by the frame configuration unit 201, and further dividing the frame divided by the frame configuration unit 201. The sub-frame forming unit 203 divides the data into sub-frame units of a predetermined number of samples and outputs a sub-frame signal, and the spectral envelope information extracted by the spectral envelope extracting unit 202.
03, a pitch information extracting unit 204 for extracting and encoding pitch information from the subframe, a gain extracting unit 205 for extracting and encoding gain information from the pitch information, a spectrum envelope information, a subframe, and pitch information. And a noise source extracting unit 206 that extracts noise source information, which is sound source information, from the gain information and encodes it.

【００４１】また，図３は，雑音源抽出部２０６のブロ
ック構成図を示す。雑音抽出部２０６は，スペクトル包
絡抽出部２０２で抽出されたスペクトル包絡情報，サブ
フレーム構成部２０３から出力されたサブフレーム信
号，ピッチ情報抽出部２０４で抽出されたピッチ情報お
よび利得抽出部２０５で抽出された利得情報を用いて，
二次誤差信号を抽出する二次誤差信号構成部３０１と，
二次誤差信号を離散コサイン変換するＤＣＴ（Ｄｉｓｃ
ｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ：離散コ
サイン変換）変換部３０２と，ＤＣＴ変換部３０２によ
るＤＣＴ変換で得られたＤＣＴ変換係数列を所定のビッ
ト長に変換する係数変換部３０３と，から構成される。FIG. 3 shows a block diagram of the noise source extraction unit 206. The noise extraction unit 206 extracts the spectrum envelope information extracted by the spectrum envelope extraction unit 202, the subframe signal output from the subframe construction unit 203, the pitch information extracted by the pitch information extraction unit 204, and the gain extraction unit 205. Using the gain information obtained,
A secondary error signal composing unit 301 for extracting a secondary error signal;
DCT (Discc) for performing a discrete cosine transform of the secondary error signal
The DCT transform unit 302 includes a transform unit 302 and a coefficient transform unit 303 that transforms a DCT transform coefficient sequence obtained by the DCT transform by the DCT transform unit 302 into a predetermined bit length.

【００４２】更に，図４は，音声復号化部１０４のブロ
ック構成図を示す。音声復号化部１０４は，蓄積部１０
３から取り出した圧縮符号化信号を入力し，符号化され
たスペクトル包絡情報を復元するスペクトル包絡復号部
４０１と，符号化されたピッチ情報を復元するピッチ情
報復号部４０２と，符号化された雑音源情報から雑音源
情報を復元する雑音源復号部４０３と，符号化された利
得情報を復元する利得復号部４０４と，復元されたスペ
クトル包絡情報と復元されたピッチ情報，利得情報およ
び雑音源情報から生成された励振源信号とから合成音声
を生成する音声合成部４０５と，から構成される。FIG. 4 is a block diagram of the speech decoding unit 104. The audio decoding unit 104 stores
3, a spectrum envelope decoding unit 401 for restoring encoded spectrum envelope information, a pitch information decoding unit 402 for restoring encoded pitch information, and encoded noise. Noise source decoding section 403 for restoring noise source information from source information, gain decoding section 404 for restoring encoded gain information, restored spectrum envelope information and restored pitch information, gain information, and noise source information And a speech synthesis unit 405 that generates a synthesized speech from the excitation source signal generated from the speech.

【００４３】以上の構成において，その動作を説明す
る。図１において，アナログ音声入力装置（図示せず）
から入力されたアナログ信号（アナログ音声波形）はＡ
／Ｄ変換部１０１によってディジタル信号に変換され
る。ここで，アナログ音声入力装置としては，例えば，
マイクロフォンや，ＣＤプレーヤ，カセットデッキ等が
挙げられる。The operation of the above configuration will be described. In FIG. 1, an analog voice input device (not shown)
The analog signal (analog sound waveform) input from is A
The signal is converted into a digital signal by the / D converter 101. Here, as an analog voice input device, for example,
Examples include a microphone, a CD player, and a cassette deck.

【００４４】続いて，ディジタル信号は，音声符号化部
１０２に送られ，図２に示すように，フレーム構成部２
０１によって予め定められたサンプル数（例えば，２５
６サンプル）のフレームと呼ばれる単位に分割される。
このフレームはフレーム信号としてスペクトル包絡抽出
部２０２およびサブフレーム構成部２０３に出力され
る。Subsequently, the digital signal is sent to the speech encoding unit 102, and as shown in FIG.
01 (for example, 25
(Six samples).
This frame is output as a frame signal to spectrum envelope extracting section 202 and subframe forming section 203.

【００４５】スペクトル包絡抽出部２０２は，該フレー
ム信号からスペクトル包絡情報を抽出して符号化（量子
化）し，ピッチ情報抽出部２０４および雑音源抽出部２
０６へ出力する。スペクトル包絡情報としては，例え
ば，線形予測分析に基づく線形予測係数，ＰＡＲＣＯＲ
係数，ＬＳＰ係数等が挙げられる。またスペクトル包絡
情報の符号化（量子化）には，ベクトル量子化や，スカ
ラー量子化，分割ベクトル量子化，多段ベクトル量子
化，あるいはそれらの複数の量子化の組み合わせが挙げ
られる。The spectrum envelope extraction unit 202 extracts and encodes (quantizes) the spectrum envelope information from the frame signal, and extracts the pitch information extraction unit 204 and the noise source extraction unit 2.
06. As the spectrum envelope information, for example, a linear prediction coefficient based on linear prediction analysis, PARCOR
Coefficients, LSP coefficients, and the like. The encoding (quantization) of the spectral envelope information includes vector quantization, scalar quantization, split vector quantization, multi-stage vector quantization, or a combination of these multiple quantizations.

【００４６】一方，サブフレーム構成部２０３は，フレ
ーム構成部２０１からフレーム信号を入力し，該フレー
ム信号を予め定められたサンプル数（例えば，６４サン
プル）に分割し，サブフレーム信号として出力する。On the other hand, the sub-frame forming section 203 receives a frame signal from the frame forming section 201, divides the frame signal into a predetermined number of samples (for example, 64 samples), and outputs it as a sub-frame signal.

【００４７】各サブフレームは，ピッチ情報抽出部２０
４において，スペクトル包絡抽出部２０２によって抽出
されたスペクトル包絡情報を用いて，ピッチ情報が抽出
され，符号化される。ピッチ情報の抽出には，ＣＥＬＰ
方式で用いられる適応符号帳探索，あるいはフーリエ変
換係数，ウェーブレット変換係数等のスペクトル情報か
ら求める方法が適用できる。また，上記適応符号帳探索
には，聴覚重みづけフィルタを用いる場合もある。な
お，聴覚重みづけフィルタは，前述した線形予測係数か
ら構成することができる。Each sub-frame is provided with a pitch information extracting unit 20.
In 4, the pitch information is extracted and encoded using the spectrum envelope information extracted by the spectrum envelope extraction unit 202. CELP for pitch information extraction
An adaptive codebook search used in the scheme or a method of obtaining from spectral information such as Fourier transform coefficients and wavelet transform coefficients can be applied. In addition, the adaptive codebook search may use an auditory weighting filter. The auditory weighting filter can be composed of the above-described linear prediction coefficients.

【００４８】図２は，適応符号帳探索によってピッチ情
報を抽出する場合の装置構成を示しており，ピッチ情報
抽出部２０４で抽出されたピッチ情報は，利得抽出部２
０５に入力され，利得情報（利得成分）が抽出されて符
号化される。FIG. 2 shows an apparatus configuration for extracting pitch information by adaptive codebook search. The pitch information extracted by the pitch information
05, gain information (gain component) is extracted and encoded.

【００４９】雑音源抽出部２０６では，図３に示すよう
に，二次誤差信号構成部３０１がサブフレーム信号，ス
ペクトル包絡情報，ピッチ情報および利得情報を用い
て，二次誤差信号を構成する。この際，前サブフレーム
までの残差信号と，ピッチ情報抽出部２０４において抽
出したピッチ情報と，利得抽出部２０５において抽出し
た利得情報と，によってピッチ成分残差信号を構成し，
さらに，構成したピッチ成分残差信号とスペクトル包絡
情報とからピッチ成分信号を構成した後，サブフレーム
信号からピッチ成分信号を差し引くことによって二次誤
差信号を得ることができる。なお，ピッチ成分残差信号
とスペクトル包絡情報とからピッチ成分信号を得るに
は，スペクトル包絡情報によって得られる合成フィルタ
に残差信号を通す等の方法を用いることができる。この
ようにして構成された二次誤差信号は，ＤＣＴ変換部３
０２に出力される。In noise source extracting section 206, as shown in FIG. 3, secondary error signal forming section 301 forms a secondary error signal using the subframe signal, spectrum envelope information, pitch information and gain information. At this time, a pitch component residual signal is formed by the residual signal up to the previous subframe, the pitch information extracted by the pitch information extracting unit 204, and the gain information extracted by the gain extracting unit 205.
Further, a second-order error signal can be obtained by constructing a pitch component signal from the composed pitch component residual signal and the spectrum envelope information and then subtracting the pitch component signal from the subframe signal. In order to obtain a pitch component signal from the pitch component residual signal and the spectrum envelope information, a method of passing the residual signal through a synthesis filter obtained from the spectrum envelope information can be used. The secondary error signal thus configured is supplied to the DCT converter 3
02 is output.

【００５０】ＤＣＴ変換部３０２は，二次誤差信号を入
力してＤＣＴ変換し，ＤＣＴ変換によって得られた複数
のＤＣＴ変換係数を係数変換部３０３へ出力する。The DCT transform unit 302 receives the secondary error signal, performs DCT transform, and outputs a plurality of DCT transform coefficients obtained by the DCT transform to the coefficient transform unit 303.

【００５１】係数変換部３０３は，複数のＤＣＴ変換係
数を入力し，ＤＣＴ変換係数列を所定のビット長に変換
する。すなわち，ＤＣＴ変換係数列を符号化し，雑音源
情報として出力する。The coefficient transforming unit 303 receives a plurality of DCT transform coefficients and converts the DCT transform coefficient sequence into a predetermined bit length. That is, the DCT transform coefficient sequence is encoded and output as noise source information.

【００５２】このようにして音声符号化部１０２は，デ
ィジタル信号からスペクトル包絡情報，ピッチ情報，利
得情報および雑音源情報を抽出して符号化し，これらを
量子化信号として出力する。これらの量子化信号は，圧
縮符号化信号として蓄積部１０３によって蓄積される。As described above, the speech encoding unit 102 extracts and encodes spectral envelope information, pitch information, gain information, and noise source information from the digital signal, and outputs these as a quantized signal. These quantized signals are accumulated by the accumulation unit 103 as compression-encoded signals.

【００５３】蓄積部１０３に蓄積された量子化信号は，
必要に応じて，音声復号化部１０４によって読み出され
て復号化（復元）される。音声復号化部１０４において
は，図４に示すように，スペクトル包絡復号部４０１で
スペクトル包絡情報が復元され，ピッチ情報復号部４０
２でピッチ情報が復元され，雑音源復号部４０３で雑音
源情報が復元され，さらに，利得復号部４０４で利得情
報が復元される。ここで，復元されたピッチ情報，雑音
源情報および利得情報は，残差信号（励振源信号）を構
成する。音声合成部４０５は，復元されたスペクトル包
絡情報と残差信号とからディジタル信号である復号音声
（合成音声）を生成して，Ｄ／Ａ変換部１０５に出力す
る。The quantized signal stored in the storage unit 103 is
The audio data is read out and decoded (restored) by the audio decoding unit 104 as necessary. In the speech decoding unit 104, as shown in FIG. 4, the spectrum envelope information is restored by the spectrum envelope decoding unit 401, and the pitch information decoding unit 40
2, the pitch information is restored, the noise source information is restored by the noise source decoding unit 403, and the gain information is restored by the gain decoding unit 404. Here, the restored pitch information, noise source information and gain information constitute a residual signal (excitation source signal). The speech synthesizer 405 generates a decoded speech (synthesized speech) as a digital signal from the restored spectrum envelope information and the residual signal, and outputs the decoded speech to the D / A converter 105.

【００５４】音声合成部４０５から出力されたディジタ
ル信号は，図１に示すように，Ｄ／Ａ変換部１０５でア
ナログ信号（アナログ音声波形）に変換される。The digital signal output from the voice synthesizer 405 is converted to an analog signal (analog voice waveform) by the D / A converter 105 as shown in FIG.

【００５５】なお，本発明に用いている音声符号化方法
は，ＣＥＬＰ音声符号化に属する符号化方法である。従
来のＣＥＬＰ方式では，二次誤差信号の符号帳を持ち，
符号帳に属する各符号ベクトルとスペクトル包絡情報と
から二次誤差信号を合成し，入力信号から得られた二次
誤差信号と比較し，そのひずみが最小となる符号を選択
することによって符号化を行っている。因みに，この探
索においては聴覚重みづけフィルタを用いることができ
る。The speech coding method used in the present invention is a coding method belonging to CELP speech coding. The conventional CELP method has a codebook for the secondary error signal,
The secondary error signal is synthesized from each code vector belonging to the codebook and the spectral envelope information, compared with the secondary error signal obtained from the input signal, and the code having the minimum distortion is selected to perform the encoding. Is going. Incidentally, an auditory weighting filter can be used in this search.

【００５６】ところが，ＣＥＬＰ方式は，低ビットレー
トで高品質の音声圧縮符号化技術であるものの，符号帳
探索のための演算量および符号帳を蓄えるためのメモリ
量の多さが問題となっている。これに対して，実施の形
態１の音声圧縮符号化装置によれば，二次誤差信号その
ものを符号化（量子化）するため，演算量を削減でき，
また符号帳を記憶する必要がないため，低メモリ量のＣ
ＥＬＰ方式を提供することができる。However, although the CELP system is a high-quality speech compression coding technique at a low bit rate, the CELP system has a problem of a large amount of calculation for searching a codebook and a large amount of memory for storing a codebook. I have. On the other hand, according to the audio compression encoding apparatus of the first embodiment, since the secondary error signal itself is encoded (quantized), the amount of calculation can be reduced,
Also, since there is no need to store a codebook, a low memory C
An ELP method can be provided.

【００５７】〔実施の形態２〕実施の形態２の音声圧縮
符号化装置は，実施の形態１の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換係数列を所定の
ビット長に変換する際に，複数のＤＣＴ変換係数から予
め定めた数の係数を選択した後，選択した係数の位置お
よび係数値と選択した係数の前後に位置する予め定めた
数の係数の係数値とを所定のビット長に符号化すること
により，ＤＣＴ変換係数列を所定のビット長に変換する
ものである。[Second Embodiment] The voice compression encoding apparatus according to the second embodiment is the same as the voice compression encoding apparatus according to the first embodiment, except that the coefficient conversion unit 303 converts the DCT-transformed coefficient sequence into a predetermined bit length. In this case, after selecting a predetermined number of coefficients from a plurality of DCT transform coefficients, the position and the coefficient value of the selected coefficient and the coefficient values of the predetermined number of coefficients located before and after the selected coefficient are determined. The DCT transform coefficient sequence is converted into a predetermined bit length by encoding to a predetermined bit length.

【００５８】実施の形態１の音声圧縮符号化装置１００
においては，選択／符号化されるＤＣＴ変換係数の数が
多ければ多い程再生音の音質を向上させることができ
る。そのためには，より少ないビット数で多くの係数を
選択／符号化する処理を行う必要がある。そのため，実
施の形態２の係数変換部３０３では，予め定めた数の係
数を選択し，選択した係数の位置および係数値と選択し
た係数の前後に位置する予め定めた数の係数の係数値と
を所定のビット長に符号化することにし，その結果，選
択した係数の前後に位置する係数については位置情報を
符号化する必要をなくし，より多くの係数について符号
化を可能としたものである。Speech compression encoding apparatus 100 of the first embodiment
In, the larger the number of DCT transform coefficients to be selected / encoded, the more the sound quality of the reproduced sound can be improved. For that purpose, it is necessary to perform a process of selecting / encoding a large number of coefficients with a smaller number of bits. Therefore, the coefficient conversion unit 303 of the second embodiment selects a predetermined number of coefficients, and determines the position and coefficient value of the selected coefficient and the coefficient values of the predetermined number of coefficients located before and after the selected coefficient. Is encoded into a predetermined bit length. As a result, it is not necessary to encode the position information for the coefficients located before and after the selected coefficient, and it is possible to encode more coefficients. .

【００５９】図５は，実施の形態２の係数変換部３０３
の動作手順を示すフローチャートである。なお，実施の
形態２の音声圧縮符号化装置の基本的な構成および動作
は，図１から図４で説明した実施の形態１の音声圧縮符
号化装置と同様に付き，ここではその説明を省略する。FIG. 5 shows a coefficient conversion unit 303 according to the second embodiment.
6 is a flowchart showing the operation procedure of FIG. The basic configuration and operation of the audio compression encoding apparatus according to the second embodiment are the same as those of the audio compression encoding apparatus according to the first embodiment described with reference to FIGS. 1 to 4, and description thereof is omitted here. I do.

【００６０】図５において，係数変換部３０３は，図２
に示すＤＣＴ変換部３０２からＤＣＴ変換によって得ら
れた複数のＤＣＴ変換係数を入力し，予め定めたＭ個の
係数（Ｃｉ（ｉ＝０，・・・，Ｍ−１））を選択する
（Ｓ５０１）。In FIG. 5, the coefficient conversion unit 303
, Input a plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302, and select predetermined M coefficients (Ci (i = 0,..., M−1)) (S501). ).

【００６１】続いて，係数変換部３０３は，ステップＳ
５０１で選択した係数の中から一つの係数，例えばＣｉ
（ｉ＝０）の係数を選択する（Ｓ５０２）。そして，ス
テップＳ５０３において，係数の位置および係数値と選
択した係数の前後に位置する予め定めたＮ個の係数の係
数値とを所定のビット長に符号化する。Subsequently, the coefficient conversion unit 303 determines in step S
One of the coefficients selected in 501, for example, Ci
The coefficient of (i = 0) is selected (S502). Then, in step S503, the position and coefficient value of the coefficient and the coefficient values of the predetermined N coefficients positioned before and after the selected coefficient are encoded into a predetermined bit length.

【００６２】係数（例えばＣｉ（ｉ＝０））について符
号化処理を行った後，ステップＳ５０４において，ステ
ップＳ５０１で選択した全ての係数（Ｃｉ（ｉ＝０，・
・・，Ｍ−１））についてステップＳ５０３の処理を実
行したか否かを判定する。選択した全ての係数について
符号化処理を行った場合は処理を終了し，一方，全ての
係数について符号化が終了していない場合はステップＳ
５０５に進み，残りの係数それぞれについてステップＳ
５０３の処理を実行する。すなわち，係数変換部３０３
は，図５に示すように，選択した全ての係数についてス
テップＳ５０３の処理を実行して符号化することによ
り，ＤＣＴ変換係数列を所定のビット長に変換する。After the encoding process is performed on the coefficients (for example, Ci (i = 0)), in step S504, all the coefficients (Ci (i = 0,...) Selected in step S501 are selected.
.., M-1)), it is determined whether or not the process of step S503 has been executed. If the encoding process has been performed for all the selected coefficients, the process ends. On the other hand, if the encoding has not been completed for all the coefficients, step S
Proceeding to step 505, for each of the remaining coefficients, step S
503 is executed. That is, the coefficient conversion unit 303
Converts the DCT-transformed coefficient sequence into a predetermined bit length by executing the processing of step S503 for all selected coefficients and encoding the selected coefficients, as shown in FIG.

【００６３】このように，実施の形態２の音声圧縮符号
化装置によれば，選択した係数の前後に位置する係数に
ついて位置情報を符号化する必要をなくし，より多くの
係数について符号化を可能としたため，変換後のＤＣＴ
変換係数列のビット数を増加させることなく，再生音の
音質を向上させることができる。As described above, according to the audio compression encoding apparatus of the second embodiment, it is not necessary to encode position information for coefficients located before and after the selected coefficient, and it is possible to encode more coefficients. DCT after conversion
The sound quality of the reproduced sound can be improved without increasing the number of bits of the conversion coefficient sequence.

【００６４】例えば，サブフレーム長が６４サンプル，
係数値の符号化に＋／−の符号を含め５ビットを付与す
る場合，一つのＤＣＴ変換係数を符号化するには１１ビ
ット必要である。したがって，一般に，所定の係数２本
を符号化するには２２ビット必要となる。一方，実施の
形態２に係る音声圧縮符号化装置を用いた場合は，２１
ビットで３本の係数を符号化することができる。For example, if the subframe length is 64 samples,
When 5 bits including +/- sign are added to the coefficient value encoding, 11 bits are required to encode one DCT transform coefficient. Therefore, generally, 22 bits are required to encode two predetermined coefficients. On the other hand, when the audio compression encoding apparatus according to Embodiment 2 is used, 21
Bits can encode three coefficients.

【００６５】〔実施の形態３〕実施の形態３の音声圧縮
符号化装置は，実施の形態２の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換部３０２で得ら
れた複数のＤＣＴ変換係数から予め定めた数の係数を選
択する際に，係数値の絶対値が最大の係数を選択するも
のである。[Third Embodiment] The speech compression encoding apparatus according to the third embodiment is the same as the speech compression encoding apparatus according to the second embodiment, except that the coefficient transforming section 303 includes a plurality of DCTs obtained by the DCT transforming section 302. When a predetermined number of coefficients are selected from the conversion coefficients, the coefficient having the largest absolute value of the coefficient value is selected.

【００６６】図６は，実施の形態３の係数変換部３０３
の動作手順を示すフローチャートである。なお，基本的
な動作は，図５で示した実施の形態２の音声圧縮符号化
装置と同様に付き，ここでは異なる部分のみを説明す
る。FIG. 6 shows a coefficient conversion unit 303 according to the third embodiment.
6 is a flowchart showing the operation procedure of FIG. The basic operation is the same as that of the audio compression encoding apparatus according to the second embodiment shown in FIG. 5, and only different parts will be described here.

【００６７】図６において，係数変換部３０３は，図２
に示すＤＣＴ変換部３０２からＤＣＴ変換によって得ら
れた複数のＤＣＴ変換係数を入力し，係数値の絶対値が
最大の係数Ｃｉを選択する（Ｓ６０１）。In FIG. 6, the coefficient conversion unit 303
A plurality of DCT transform coefficients obtained by the DCT transform are input from the DCT transform unit 302 shown in (1), and the coefficient Ci having the largest absolute value of the coefficient value is selected (S601).

【００６８】続いて係数変換部３０３は，実施の形態２
で説明したように，選択した係数の位置および係数値と
選択した係数の前後に位置する予め定めたＮ個の係数の
係数値とを所定のビット長に符号化することにより，Ｄ
ＣＴ変換係数列を所定のビット長に変換する（Ｓ５０
３）Subsequently, the coefficient conversion unit 303 performs the processing according to the second embodiment.
As described above, by coding the position and coefficient value of the selected coefficient and the coefficient values of predetermined N coefficients before and after the selected coefficient to a predetermined bit length,
The CT conversion coefficient sequence is converted into a predetermined bit length (S50).
3)

【００６９】このように，実施の形態３の音声圧縮符号
化装置によれば，複数のＤＣＴ変換係数から係数値の絶
対値が最大の係数を選択するため，符号化すべき係数を
容易に選択することができる。As described above, according to the audio compression encoding apparatus of the third embodiment, since the coefficient having the largest absolute value of the coefficient value is selected from the plurality of DCT transform coefficients, the coefficient to be encoded is easily selected. be able to.

【００７０】〔実施の形態４〕実施の形態４の音声圧縮
符号化装置は，実施の形態２の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換部３０２で得ら
れた複数のＤＣＴ変換係数から予め定めた数の係数を選
択する際に，係数値の絶対値が最大のものから予め定め
た数の係数を選択するものである。[Fourth Embodiment] The speech compression encoding apparatus according to the fourth embodiment is the same as the speech compression encoding apparatus according to the second embodiment, except that the coefficient transforming section 303 includes a plurality of DCTs obtained by the DCT transforming section 302. When a predetermined number of coefficients are selected from the conversion coefficients, a predetermined number of coefficients are selected from those having the largest absolute value of the coefficient value.

【００７１】図７は，実施の形態４の係数変換部３０３
の動作手順を示すフローチャートである。なお，基本的
な動作は，図５で示した実施の形態２の音声圧縮符号化
装置と同様に付き，ここでは異なる部分のみを説明す
る。FIG. 7 shows a coefficient conversion unit 303 according to the fourth embodiment.
6 is a flowchart showing the operation procedure of FIG. The basic operation is the same as that of the audio compression encoding apparatus according to the second embodiment shown in FIG. 5, and only different parts will be described here.

【００７２】図７において，係数変換部３０３は，図２
に示すＤＣＴ変換部３０２からＤＣＴ変換によって得ら
れた複数のＤＣＴ変換係数を入力し，係数値の絶対値が
最大のものから予め定めたＭ個の係数（Ｃｉ（ｉ＝０，
・・・，Ｍ−１））を選択する（Ｓ７０１）。In FIG. 7, the coefficient conversion unit 303
, A plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302, and a predetermined number of M coefficients (Ci (i = 0,
.., M-1)) are selected (S701).

【００７３】その後，係数変換部３０３は，ステップＳ
７０１で選択した係数それぞれについて，係数の位置お
よび係数値と選択した係数の前後に位置する予め定めた
Ｎ個の係数の係数値とを所定のビット長に符号化するこ
とにより，ＤＣＴ変換係数列を所定のビット長に変換す
る（Ｓ５０２〜Ｓ５０５）。Thereafter, the coefficient conversion unit 303 determines in step S
For each of the coefficients selected in step 701, the DCT transform coefficient sequence is obtained by encoding the position and coefficient value of the coefficient and the coefficient values of predetermined N coefficients located before and after the selected coefficient into a predetermined bit length. Is converted to a predetermined bit length (S502 to S505).

【００７４】このように，実施の形態５の音声圧縮符号
化装置によれば，複数のＤＣＴ変換係数から，係数値の
絶対値が最大のものから予め定めた数の係数を選択する
ため，符号化すべき係数を容易に選択することができ
る。加えて，ＤＣＴ変換係数においては，一般に，係数
値の絶対値が高い係数の近傍に位置する係数も係数値の
絶対値も高いという性質があるため，この性質を活かし
て絶対値の最大の係数から選択していくことにより，よ
り効率よくＤＣＴ変換係数列を所定のビット長に変換す
ることができる。As described above, according to the audio compression encoding apparatus of the fifth embodiment, a predetermined number of coefficients are selected from a plurality of DCT transform coefficients from those having the largest absolute value of the coefficient value. The coefficient to be converted can be easily selected. In addition, the DCT transform coefficient generally has a property that both the coefficient located near the coefficient having a high absolute value and the absolute value of the coefficient value are high. , The DCT transform coefficient sequence can be more efficiently converted into a predetermined bit length.

【００７５】〔実施の形態５〕実施の形態５の音声圧縮
符号化装置は，実施の形態２の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換部３０２で得ら
れた複数のＤＣＴ変換係数から予め定めた数の係数を選
択する際に，予め設定した閾値を超える係数値を有する
係数を選択するものである。[Fifth Embodiment] The voice compression encoding apparatus according to the fifth embodiment is the same as the voice compression encoding apparatus according to the second embodiment, except that the coefficient transforming section 303 includes a plurality of DCTs obtained by the DCT transforming section 302. When a predetermined number of coefficients are selected from the conversion coefficients, a coefficient having a coefficient value exceeding a preset threshold is selected.

【００７６】図８は，実施の形態５の係数変換部３０３
の動作手順を示すフローチャートである。なお，基本的
な動作は，図５で示した実施の形態２の音声圧縮符号化
装置と同様に付き，ここでは異なる部分のみを説明す
る。FIG. 8 is a diagram showing a coefficient conversion unit 303 according to the fifth embodiment.
6 is a flowchart showing the operation procedure of FIG. The basic operation is the same as that of the audio compression encoding apparatus according to the second embodiment shown in FIG. 5, and only different parts will be described here.

【００７７】図５において，係数変換部３０３は，図２
に示すＤＣＴ変換部３０２からＤＣＴ変換によって得ら
れた複数のＤＣＴ変換係数を入力し，予め設定した閾値
Ｔｈを超える係数値を有する係数（Ｃｉ（ｉ＝０，・・
・，Ｍ−１））を選択する（Ｓ８０１）。In FIG. 5, the coefficient conversion unit 303
, A plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302, and a coefficient (Ci (i = 0,...) Having a coefficient value exceeding a preset threshold Th.
., M-1)) is selected (S801).

【００７８】その後，係数変換部３０３は，ステップＳ
８０１で選択した係数それぞれについて，係数の位置お
よび係数値と選択した係数の前後に位置する予め定めた
Ｎ個の係数の係数値とを所定のビット長に符号化するこ
とにより，ＤＣＴ変換係数列を所定のビット長に変換す
る（Ｓ５０２〜Ｓ５０５）。Thereafter, the coefficient conversion unit 303 determines in step S
For each of the coefficients selected in 801, the DCT transform coefficient sequence is obtained by encoding the position and coefficient value of the coefficient and the coefficient values of predetermined N coefficients located before and after the selected coefficient into a predetermined bit length. Is converted to a predetermined bit length (S502 to S505).

【００７９】このように，実施の形態５の音声圧縮符号
化装置によれば，複数のＤＣＴ変換係数から予め設定し
た閾値を超える係数値を有する係数を選択するため，符
号化すべき係数を容易に選択することができる。As described above, according to the speech compression encoding apparatus of the fifth embodiment, since a coefficient having a coefficient value exceeding a preset threshold value is selected from a plurality of DCT transform coefficients, the coefficient to be encoded can be easily determined. You can choose.

【００８０】〔実施の形態６〕実施の形態６の音声圧縮
符号化装置は，実施の形態１の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換係数列を所定の
ビット長に変換する際に，複数のＤＣＴ変換係数から予
め定めた閾値を超える係数値を有する係数を選択し，選
択した係数の位置と係数値とを所定のビット長に符号化
することにより，ＤＣＴ変換係数列を所定のビット長に
変換するものである。[Embodiment 6] The speech compression encoding apparatus according to the sixth embodiment is the same as the speech compression encoding apparatus according to the first embodiment, except that the coefficient transforming unit 303 transforms the DCT transform coefficient sequence into a predetermined bit length. In this case, a coefficient having a coefficient value exceeding a predetermined threshold value is selected from a plurality of DCT transform coefficients, and a position and a coefficient value of the selected coefficient are encoded into a predetermined bit length, thereby obtaining a DCT transform coefficient sequence. Is converted into a predetermined bit length.

【００８１】図９は，実施の形態６の係数変換部３０３
の動作手順を示すフローチャートである。なお，実施の
形態６の音声圧縮符号化装置の基本的な構成および動作
は，図１から図４で説明した実施の形態１の音声圧縮符
号化装置と同様に付き，ここではその説明を省略する。FIG. 9 shows a coefficient conversion unit 303 according to the sixth embodiment.
6 is a flowchart showing the operation procedure of FIG. The basic configuration and operation of the speech compression encoding apparatus according to the sixth embodiment are the same as those of the speech compression encoding apparatus according to the first embodiment described with reference to FIGS. 1 to 4, and description thereof is omitted here. I do.

【００８２】図９において，係数変換部３０３は，図２
に示すＤＣＴ変換部３０２からＤＣＴ変換によって得ら
れた複数のＤＣＴ変換係数を入力し，予め設定した閾値
Ｔｈを超える係数値を有する係数（Ｃｉ（ｉ＝０，・・
・，Ｍ−１））を選択する（Ｓ９０１）。In FIG. 9, the coefficient conversion unit 303
, A plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302, and a coefficient (Ci (i = 0,...) Having a coefficient value exceeding a preset threshold Th.
., M-1)) is selected (S901).

【００８３】続いて，係数変換部３０３は，ステップＳ
９０１で選択した係数の中から一つの係数，例えばＣｉ
（ｉ＝０）の係数を選択する（Ｓ９０２）。そして，ス
テップＳ９０３において，選択した係数の位置および係
数値を所定のビット長に符号化する。Subsequently, the coefficient conversion unit 303 determines in step S
One of the coefficients selected in 901, for example, Ci
The coefficient of (i = 0) is selected (S902). Then, in step S903, the position and coefficient value of the selected coefficient are encoded into a predetermined bit length.

【００８４】ステップＳ９０３で，係数（例えばＣｉ
（ｉ＝０））について符号化を行った後，ステップＳ９
０４において，ステップＳ９０１で選択した係数それぞ
れについてステップＳ９０３の処理を実行したか否かを
判定する。選択した全ての係数について符号化を行った
場合は処理を終了し，一方，全ての係数について符号化
が終了していない場合はステップＳ９０５に進み，残り
の係数についてステップＳ９０３の処理を実行する。す
なわち，符号変換部３０３は，図９に示すように，選択
した全ての係数についてステップＳ９０３の処理を実行
して符号化することにより，ＤＣＴ変換係数列を所定の
ビット長に変換する。In step S903, the coefficient (for example, Ci
(I = 0)), and then, step S9
In 04, it is determined whether or not the processing of step S903 has been performed for each coefficient selected in step S901. If the coding has been performed on all the selected coefficients, the process ends. On the other hand, if the coding has not been completed on all the coefficients, the process proceeds to step S905, and the process of step S903 is performed on the remaining coefficients. In other words, as shown in FIG. 9, the code conversion unit 303 executes the process of step S903 for all the selected coefficients to perform coding, thereby converting the DCT-transformed coefficient sequence into a predetermined bit length.

【００８５】このように，実施の形態６の音声圧縮符号
化装置によれば，ＤＣＴ変換係数列を所定のビット長に
変換する際に，複数のＤＣＴ変換係数から予め定めた閾
値を超える係数値を有する係数を選択するため，変換後
のＤＣＴ変換係数列のビット数を増加させることなく，
再生音の音質を向上させることができると共に，符号化
すべき係数を容易に選択することができる。As described above, according to the speech compression encoding apparatus of the sixth embodiment, when converting a DCT transform coefficient sequence into a predetermined bit length, a coefficient value exceeding a predetermined threshold value is obtained from a plurality of DCT transform coefficients. Is selected without increasing the number of bits in the transformed DCT coefficient sequence.
The sound quality of the reproduced sound can be improved, and the coefficient to be encoded can be easily selected.

【００８６】〔実施の形態７〕実施の形態７の音声圧縮
符号化装置は，実施の形態１の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換係数列を所定の
ビット長に変換する際に，複数のＤＣＴ変換係数から一
つの係数を選択し，選択した係数の位置および係数値と
選択した係数の前後に位置する予め定めた数の係数の係
数値とを所定のビット長に符号化した後，さらに符号化
した係数を除いた複数のＤＣＴ変換係数から一つの係数
を選択し，選択した係数の位置および係数値と選択した
係数の前後に位置する予め定めた数の係数の係数値とを
所定のビット長に符号化する工程を予め設定した回数実
行することにより，ＤＣＴ変換係数列を所定のビット長
に変換するものである。[Seventh Embodiment] The speech compression encoding apparatus according to the seventh embodiment is the same as the speech compression encoding apparatus according to the first embodiment, except that the coefficient transforming unit 303 transforms the DCT transform coefficient sequence into a predetermined bit length. In this case, one coefficient is selected from a plurality of DCT transform coefficients, and the position and coefficient value of the selected coefficient and the coefficient values of a predetermined number of coefficients before and after the selected coefficient are set to a predetermined bit length. After encoding, one coefficient is selected from a plurality of DCT transform coefficients excluding the encoded coefficient, and the position and coefficient value of the selected coefficient and a predetermined number of coefficients before and after the selected coefficient are selected. The step of encoding the coefficient value into a predetermined bit length is performed a predetermined number of times, thereby converting the DCT transform coefficient sequence into a predetermined bit length.

【００８７】図１０は，実施の形態７の係数変換部３０
３の動作手順を示すフローチャートである。なお，実施
の形態７の音声圧縮符号化装置の基本的な構成および動
作は，図１から図４で説明した実施の形態１の音声圧縮
符号化装置と同様に付き，ここではその説明を省略す
る。FIG. 10 is a diagram showing a coefficient conversion unit 30 according to the seventh embodiment.
6 is a flowchart showing an operation procedure of No. 3; The basic configuration and operation of the speech compression encoding apparatus according to the seventh embodiment are the same as those of the speech compression encoding apparatus according to the first embodiment described with reference to FIGS. 1 to 4, and description thereof is omitted here. I do.

【００８８】係数変換部３０３には，複数のＤＣＴ変換
係数から選択して符号化を行う係数の数が予め設定され
ている（例えばＭ個）。図１０において，係数変換部３
０３は，図２に示すＤＣＴ変換部３０２からＤＣＴ変換
によって得られた複数のＤＣＴ変換係数を入力し，１つ
目の係数の処理に入ることを確認して（Ｓ１００１），
ＤＣＴ変換係数から一つの係数を選択する（Ｓ１００
２）。In the coefficient transforming section 303, the number of coefficients to be selected and encoded from a plurality of DCT transform coefficients is set in advance (for example, M). In FIG. 10, the coefficient conversion unit 3
03 inputs a plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302 shown in FIG. 2 and confirms that the process enters the first coefficient process (S1001).
One coefficient is selected from the DCT transform coefficients (S100
2).

【００８９】続いて，係数変換部３０３は，ステップＳ
１００２で選択した係数の位置および係数値と選択した
係数の前後に位置する予め定めたＮ個の係数の係数値と
を所定のビット長に符号化する（Ｓ１００３）。Subsequently, the coefficient conversion unit 303 determines in step S
The position and coefficient value of the coefficient selected in 1002 and the coefficient values of the predetermined N coefficients positioned before and after the selected coefficient are encoded into a predetermined bit length (S1003).

【００９０】ステップＳ１００２で選択した係数につい
て符号化を行った後，ステップＳ１００４において，予
め設定されたＭ個の係数全ての処理が終了したか否かを
判定する。処理すべき全ての係数について符号化を行っ
た場合は処理を終了する。After coding the coefficients selected in step S1002, it is determined in step S1004 whether or not the processing of all of the M previously set coefficients has been completed. If encoding has been performed for all coefficients to be processed, the process ends.

【００９１】一方，全ての係数について符号化が終了し
ていない場合はステップＳ１００５に進み，符号化すべ
き係数の数に１加算する。そして，ステップＳ１００２
に進み，既に符号化した係数を除いたＤＣＴ変換係数か
ら一つの係数を選択し，ステップＳ１００３で符号化を
行う。そして，ステップＳ１００４で，Ｍ個の係数につ
いて上記処理を行ったことが確認されるまで繰り返し係
数の選択・符号化処理を実行する。On the other hand, if the coding has not been completed for all the coefficients, the flow advances to step S1005 to add 1 to the number of coefficients to be coded. Then, step S1002
Then, one coefficient is selected from the DCT transform coefficients excluding the already coded coefficients, and encoding is performed in step S1003. Then, in step S1004, the selection and coding processing of the repetition coefficient is executed until it is confirmed that the above processing has been performed on the M coefficients.

【００９２】このように，実施の形態７の音声圧縮符号
化装置によれば，選択した係数の前後に位置する係数に
ついては位置情報を符号化する必要をなくし，より多く
の係数について符号化を可能としたため，変換後のＤＣ
Ｔ変換係数列のビット数を増加させることなく，再生音
の音質を向上させることができる。また，既に符号化さ
れたＤＣＴ変換係数を再度符号化するということがない
ため，より効率的に符号化処理を実行することができ
る。As described above, according to the audio compression encoding apparatus of the seventh embodiment, it is not necessary to encode position information for coefficients located before and after the selected coefficient, and encoding is performed for more coefficients. Enabled, the converted DC
The sound quality of the reproduced sound can be improved without increasing the number of bits of the T conversion coefficient sequence. In addition, since the already encoded DCT transform coefficients are not re-encoded, the encoding process can be executed more efficiently.

【００９３】〔実施の形態８〕実施の形態８の音声圧縮
符号化装置は，実施の形態７の音声圧縮符号化装置にお
いて，係数変換部３０３が，複数のＤＣＴ変換係数から
一つの係数を選択する際に，係数値の絶対値が最大の係
数を選択し，符号化した係数を除いた複数のＤＣＴ変換
係数から一つの係数を選択する際に，係数値の絶対値が
最大の係数を選択するものである。[Eighth Embodiment] An audio compression encoding apparatus according to the eighth embodiment is the same as the audio compression encoding apparatus according to the seventh embodiment, except that the coefficient conversion unit 303 selects one coefficient from a plurality of DCT conversion coefficients. When the coefficient is selected, the coefficient with the largest absolute value is selected, and when selecting one coefficient from a plurality of DCT transform coefficients excluding the encoded coefficient, the coefficient with the largest absolute value is selected. Is what you do.

【００９４】図１１は，実施の形態８の係数変換部３０
３の動作手順を示すフローチャートである。なお，基本
的な動作は，図１０で示した実施の形態７の音声圧縮符
号化装置と同様に付き，ここでは異なる部分のみを説明
する。FIG. 11 shows the coefficient conversion unit 30 according to the eighth embodiment.
6 is a flowchart showing an operation procedure of No. 3; The basic operation is the same as that of the speech compression encoding apparatus according to the seventh embodiment shown in FIG. 10, and only different parts will be described here.

【００９５】係数変換部３０３には，ＤＣＴ変換係数か
ら選択して符号化を行う係数の数が予め設定されている
（例えばＭ個）。図１１において，係数変換部３０３
は，図２に示すＤＣＴ変換部３０２からＤＣＴ変換によ
って得られた複数のＤＣＴ変換係数を入力し，１つ目の
係数の処理に入ることを確認して（Ｓ１００１），ＤＣ
Ｔ変換係数から係数値の絶対値が最大の係数を選択する
（Ｓ１１０１）。In the coefficient transforming section 303, the number of coefficients to be selected and coded from the DCT transform coefficients is set in advance (for example, M). In FIG. 11, a coefficient conversion unit 303
Inputs a plurality of DCT transform coefficients obtained by the DCT transform from the DCT transform unit 302 shown in FIG. 2 and confirms that the process enters the first coefficient process (S1001).
The coefficient having the largest absolute value of the coefficient value is selected from the T-transform coefficients (S1101).

【００９６】続いて，係数変換部３０３は，ステップＳ
１００２で選択した係数の位置および係数値と選択した
係数の前後に位置する予め定めたＮ個の係数の係数値と
を所定のビット長に符号化した後（Ｓ１００３），更
に，既に符号化した係数を除いた複数のＤＣＴ変換係数
から係数値の絶対値が最大の係数を選択し（Ｓ１１０
１），ステップＳ１００３で符号化を行う。そして，ス
テップＳ１００４で，Ｍ個の係数について上記処理を行
ったことが確認されるまで繰り返し係数の選択・符号化
処理を実行する。Subsequently, the coefficient conversion unit 303 determines in step S
After the position and coefficient value of the coefficient selected in 1002 and the coefficient values of the predetermined N coefficients positioned before and after the selected coefficient are coded to a predetermined bit length (S1003), the coded data are further coded. A coefficient having the largest absolute value of the coefficient value is selected from a plurality of DCT transform coefficients excluding the coefficient (S110).
1), encoding is performed in step S1003. Then, in step S1004, the selection and coding processing of the repetition coefficient is executed until it is confirmed that the above processing has been performed on the M coefficients.

【００９７】このように，実施の形態８の音声圧縮符号
化装置によれば，複数のＤＣＴ変換係数から一つの係数
を選択する際に，係数値の絶対値が最大の係数を選択
し，符号化した係数を除いた複数のＤＣＴ変換係数から
一つの係数を選択する際に，係数値の絶対値が最大の係
数を選択するため，符号化すべき係数を容易に選択する
ことができる。また，ＤＣＴ変換係数においては，一般
に，係数値の絶対値が高い係数の近傍に位置する係数も
係数値の絶対値も高いという性質があるため，この性質
を活かして絶対値の最大の係数から選択していくことに
より，より効率よくＤＣＴ変換係数列を所定のビット長
に変換することができる。As described above, according to the audio compression encoding apparatus of the eighth embodiment, when one coefficient is selected from a plurality of DCT transform coefficients, the coefficient having the largest absolute value of the coefficient value is selected, When one coefficient is selected from the plurality of DCT transform coefficients excluding the converted coefficients, the coefficient having the largest absolute value of the coefficient value is selected, so that the coefficient to be coded can be easily selected. Also, DCT transform coefficients generally have a property that both the coefficient located near a coefficient having a high absolute value and the absolute value of the coefficient value are high. By making a selection, the DCT transform coefficient sequence can be more efficiently converted to a predetermined bit length.

【００９８】〔実施の形態９〕実施の形態９の音声圧縮
符号化装置は，実施の形態１の音声圧縮符号化装置にお
いて，係数変換部３０３が，ＤＣＴ変換係数列を所定の
ビット長に変換する際に，複数のＤＣＴ変換係数から予
め定めた数の係数を選択し，選択した係数の係数値と選
択した係数の前後に位置する予め定めた数の係数の係数
値との比のパターン（第１の比のパターン）を求めた
後，予め用意した複数の比のパターン（第２の比のパタ
ーン）と符号との対応関係情報（パターンデータベー
ス）から，第１の比のパターンに最も近い性質の第２の
比のパターンを選択し，選択した第２の比のパターンに
与えられた符号を用いて第１の比のパターンを符号化す
ることにより，ＤＣＴ変換係数列を所定のビット長に変
換するものである。[Embodiment 9] The speech compression encoding apparatus according to the ninth embodiment is the same as the speech compression encoding apparatus according to the first embodiment, except that the coefficient transform unit 303 transforms the DCT transform coefficient sequence into a predetermined bit length. In this case, a predetermined number of coefficients are selected from a plurality of DCT transform coefficients, and a pattern of the ratio between the coefficient value of the selected coefficient and the coefficient value of a predetermined number of coefficients located before and after the selected coefficient (see FIG. After obtaining the first ratio pattern), the closest relationship to the first ratio pattern is obtained from the correspondence information (pattern database) between a plurality of ratio patterns (second ratio patterns) and codes prepared in advance. By selecting the pattern of the second ratio having the property and encoding the pattern of the first ratio using the code given to the selected pattern of the second ratio, the DCT transform coefficient sequence has a predetermined bit length. Is converted to

【００９９】係数変換部３０３には，選択した係数の係
数値と選択した係数の前後に位置する予め定めた数の係
数の係数値との比のパターン（第１の比のパターン）に
対応する複数の比のパターン（第２の比のパターン）を
蓄積したパターンデータベース（図示せず）が予め設け
られている。このパターンデータベースに蓄積された第
２の比のパターンにはそれぞれ符号が付されており，係
数変換部３０３は，パターンデータベースに蓄積された
複数の第２の比のパターンと求めた第１の比のパターン
とをそれぞれ比較し，第１の比のパターンに最も近い性
質の第２の比のパターンを選択することにより，選択し
た第２の比のパターンに与えられた符号を用いて符号化
を行う。The coefficient conversion unit 303 corresponds to a ratio pattern (first ratio pattern) between the coefficient value of the selected coefficient and the coefficient value of a predetermined number of coefficients located before and after the selected coefficient. A pattern database (not shown) storing a plurality of ratio patterns (second ratio patterns) is provided in advance. Each of the patterns of the second ratio stored in the pattern database is given a sign, and the coefficient conversion unit 303 determines the plurality of patterns of the second ratio stored in the pattern database and the obtained first ratio. Are compared with each other, and the second ratio pattern having the property closest to the first ratio pattern is selected, whereby encoding is performed using the code given to the selected second ratio pattern. Do.

【０１００】図１２は，実施の形態９の係数変換部３０
３の動作手順を示すフローチャートである。なお，実施
の形態９の音声圧縮符号化装置の基本的な構成および動
作は，図１から図４で説明した実施の形態１の音声圧縮
符号化装置と同様に付き，ここではその説明を省略す
る。FIG. 12 is a diagram showing a coefficient conversion unit 30 according to the ninth embodiment.
6 is a flowchart showing an operation procedure of No. 3; The basic configuration and operation of the speech compression encoding apparatus according to the ninth embodiment are the same as those of the speech compression encoding apparatus according to the first embodiment described with reference to FIGS. 1 to 4, and description thereof is omitted here. I do.

【０１０１】図１２において，係数変換部３０３は，図
２に示すＤＣＴ変換部３０２からＤＣＴ変換によって得
られた複数のＤＣＴ変換係数を入力し，予め定めたＭ個
の係数（Ｃｉ（ｉ＝０，・・・，Ｍ−１））を選択する
（Ｓ１２０１）。In FIG. 12, a coefficient transforming unit 303 receives a plurality of DCT transform coefficients obtained by DCT transform from the DCT transforming unit 302 shown in FIG. 2, and sets M predetermined coefficients (Ci (i = 0) ,..., M-1)) are selected (S1201).

【０１０２】続いて，係数変換部３０３は，ステップＳ
１２０１で選択した係数の中から一つの係数（例えばＣ
ｉ（ｉ＝０））を選択し，その係数の係数値と選択した
係数の前後に位置する予め定めた数の係数の係数値との
比のパターン（第１の比のパターン）を求める（Ｓ１２
０２）。そして，係数変換部３０３は，上述したパター
ンデータベースに蓄積されたＪ個の比のパターン（第２
の比のパターン：Ｊ（ｊ＝０，・・・，ｊ−１））から
一つの第２の比のパターン（例えばＪ（ｊ＝０））を読
み出す（Ｓ１２０３）。Subsequently, the coefficient conversion unit 303 determines in step S
One coefficient (for example, C
i (i = 0)), and a ratio pattern (first ratio pattern) between the coefficient value of the coefficient and the coefficient value of a predetermined number of coefficients before and after the selected coefficient is obtained ( S12
02). Then, the coefficient conversion unit 303 determines that the J ratio patterns (second patterns) stored in the pattern database described above.
A second ratio pattern (for example, J (j = 0)) is read from the ratio pattern: J (j = 0,..., J−1)) (S1203).

【０１０３】ステップＳ１２０４において，係数変換部
３０３は，ステップＳ１２０２で求めた第１の比のパタ
ーンと，ステップＳ１２０３でパターンデータベースか
ら読み出した第２の比のパターンとを比較し，比較結果
としての評価値Ｄｊを求める。In step S1204, the coefficient conversion unit 303 compares the pattern of the first ratio obtained in step S1202 with the pattern of the second ratio read from the pattern database in step S1203, and evaluates as a comparison result. Find the value Dj.

【０１０４】評価値Ｄｊを求めた後，係数変換部３０３
は，求めた評価値ＤｊとＤｍｉｎとの大小関係を比較す
る（Ｓ１２０５）。例えば，係数Ｃｉ（ｉ＝０）に関す
る第１の比のパターンについて，パターンデータベース
に蓄積された第２の比のパターンと比較している最中で
あるとした場合，Ｄｍｉｍには，係数Ｃｉ（ｉ＝０）に
ついて現在のところ最も良い評価値Ｄｊの値が設定され
ている。ただし，係数Ｃｉ（ｉ＝０）に関する第１の比
のパターンについて初めて比較を行う場合には，予め設
定した初期値がＤｍｉｎとして用いられる。After obtaining the evaluation value Dj, the coefficient conversion unit 303
Compares the magnitude relationship between the obtained evaluation values Dj and Dmin (S1205). For example, if it is assumed that the pattern of the first ratio relating to the coefficient Ci (i = 0) is being compared with the pattern of the second ratio stored in the pattern database, the coefficient Ci ( For i = 0), the best evaluation value Dj at present is set. However, when comparing for the first time a pattern of the first ratio with respect to the coefficient Ci (i = 0), a preset initial value is used as Dmin.

【０１０５】ステップＳ１２０５でＤｊとＤｍｉｎとを
比較した結果，ＤｊがＤｍｉｍ小さい場合は，ステップ
Ｓ１２０６に進み，Ｄｍｉｍにこの評価値Ｄｊが設定さ
れると共に，求めた第１の比のパターンに現在最も近い
性質の第２の比のパターンＭＩＮとして，この評価値Ｄ
ｊが得られた第２の比のパターンＪが設定される。一
方，ＤｊがＤｍｉｎより大きい場合，ステップＳ１２０
７に進む。なお，一つの係数（例えばＣｉ（ｉ＝０））
に関する第１の比のパターンについて，パターンデータ
ベースに蓄積された全ての第２の比のパターンとの比較
が終了した結果，ＭＩＮに設定されている第２の比のパ
ターンが求めた第１の比のパターンに最も近い性質を有
するということになり，この第２の比のパターンに付さ
れた符号が，係数（Ｃｉ（ｉ＝０））に関する第１の比
のパターンの符号として選択される。As a result of comparing Dj and Dmin in step S1205, if Dj is smaller by Dmim, the flow advances to step S1206 to set this evaluation value Dj to Dmim, and to determine the current pattern in the first ratio pattern thus obtained. The evaluation value D is defined as a pattern MIN of a second ratio having similar properties.
A second ratio pattern J from which j is obtained is set. On the other hand, if Dj is greater than Dmin, step S120
Go to 7. In addition, one coefficient (for example, Ci (i = 0))
As a result of completing the comparison of the first ratio pattern with respect to all the second ratio patterns stored in the pattern database, the second ratio pattern set in MIN is the first ratio obtained. And the code assigned to the second ratio pattern is selected as the code of the first ratio pattern for the coefficient (Ci (i = 0)).

【０１０６】ステップＳ１２０７において，第１の比の
パターンとパターンデータベースに蓄積された全ての第
２の比のパターンとの比較が終了したか否かが判定され
る。全ての第２の比のパターンとの比較が終了していな
い場合は，ステップＳ１２０９に進み，第１の比のパタ
ーンと次の第２の比のパターン（例えばＪ＝１）とが比
較される。In step S1207, it is determined whether the comparison between the pattern of the first ratio and all the patterns of the second ratio stored in the pattern database has been completed. If the comparison with all the second ratio patterns has not been completed, the process proceeds to step S1209, and the first ratio pattern and the next second ratio pattern (for example, J = 1) are compared. .

【０１０７】一方，第１の比のパターンとパターンデー
タベースに蓄積された全ての第２の比のパターンとの比
較が終了した場合は，ステップＳ１２０８に進み，ステ
ップＳ１２０１で選択した全ての係数Ｃｉ（ｉ＝０，・
・・，Ｍ−１）について，パターンデータベースに蓄積
された第２の比のパターンとの比較が終了したか否かが
判定される。On the other hand, if the comparison between the pattern of the first ratio and all the patterns of the second ratio stored in the pattern database is completed, the process proceeds to step S1208, where all the coefficients Ci ( i = 0, ...
.., M-1), it is determined whether the comparison with the second ratio pattern stored in the pattern database has been completed.

【０１０８】選択した全ての係数Ｃｉについて比較が終
了した場合には，符号化処理を終了し，一方，選択した
全ての係数Ｃｉについて比較が終了していない場合に
は，次の係数（例えばｉ＝１）についてパターンデータ
ベースに蓄積された第２の比のパターンとの比較が行わ
れる。When the comparison has been completed for all the selected coefficients Ci, the encoding process is completed. On the other hand, when the comparison has not been completed for all the selected coefficients Ci, the next coefficient (for example, i = 1) is compared with the second ratio pattern stored in the pattern database.

【０１０９】このように，実施の形態９の音声圧縮符号
化装置によれば，選択した係数の前後に位置する係数に
ついては位置情報を符号化する必要をなくし，より多く
の係数について符号化を可能としたため，変換後のＤＣ
Ｔ変換係数列のビット数を増加させることなく，再生音
の音質を向上させることができる。なお，予め用意され
た比のパターンと求めた比のパターンとの整合を行うと
なると，結局符号帳探索を行うのと同等であるように見
えるが，実施の形態９の場合はＣＥＬＰにおける雑音符
号帳探索より次元数が非常に小さいため，演算量ははる
かに小さい。As described above, according to the speech compression encoding apparatus of the ninth embodiment, it is not necessary to encode position information for coefficients located before and after the selected coefficient, and encoding is performed for more coefficients. Enabled, the converted DC
The sound quality of the reproduced sound can be improved without increasing the number of bits of the T conversion coefficient sequence. It should be noted that when matching the pattern of the ratio prepared in advance and the pattern of the obtained ratio is performed, it looks like it is equivalent to performing a codebook search. However, in the case of the ninth embodiment, the noise code in CELP is used. Since the number of dimensions is much smaller than book search, the amount of computation is much smaller.

【０１１０】〔実施の形態１０〕実施の形態１０の音声
圧縮符号化装置は，実施の形態９の音声圧縮符号化装置
において，係数変換部３０３が，第２の比のパターンを
選択する際に，第１の比のパターンとのユークリッド距
離（二乗距離：それぞれの座標軸における差の自乗を加
えてその平方根をとるもの）が最も小さい第２の比のパ
ターンを選択するものである。[Embodiment 10] The speech compression encoding apparatus according to the tenth embodiment differs from the speech compression encoding apparatus according to the ninth embodiment in that the coefficient conversion unit 303 selects a pattern having the second ratio. , The second ratio pattern having the smallest Euclidean distance (square distance: the square root of the difference in each coordinate axis and taking the square root thereof) with the first ratio pattern is selected.

【０１１１】図１３は，実施の形態１０の係数変換部３
０３の動作手順を示すフローチャートである。なお，基
本的な動作は，図１２で示した実施の形態９の音声圧縮
符号化装置と同様に付き，ここでは異なる部分のみを説
明する。FIG. 13 shows the coefficient conversion unit 3 according to the tenth embodiment.
11 is a flowchart showing an operation procedure of the third embodiment. Note that the basic operation is the same as that of the speech compression encoding apparatus according to the ninth embodiment shown in FIG. 12, and only different parts will be described here.

【０１１２】実施の形態１０の係数変換部３０３は，ス
テップＳ１２０２で求めた第１の比のパターンと，ステ
ップＳ１２０３でパターンデータベースから読み出した
第２の比のパターンとを比較し，評価値Ｄｊとしてユー
クリッド距離を求める（Ｓ１３０１）。すなわち，パタ
ーンデータベースに蓄積された比のパターンの探索にお
いて，比のパターンを多次元空間におけるベクトルと考
え，求めた第１の比のパターンと蓄積された第２の比の
パターンとのユークリッド距離を求める。そして，第１
の比のパターンとパターンデータベースに蓄積された全
ての第２の比のパターンとを比較した結果，パターンデ
ータベースからユークリッド距離が最も小さい第２の比
のパターンを選択し，選択した第２の比のパターンに与
えられた符号を用いて符号化処理を実行する。The coefficient conversion unit 303 according to the tenth embodiment compares the pattern of the first ratio obtained in step S1202 with the pattern of the second ratio read from the pattern database in step S1203, and generates an evaluation value Dj. The Euclidean distance is obtained (S1301). That is, in searching for the ratio pattern stored in the pattern database, the ratio pattern is considered as a vector in a multidimensional space, and the Euclidean distance between the obtained first ratio pattern and the stored second ratio pattern is calculated. Ask. And the first
As a result of comparing the pattern of the second ratio with the patterns of all the second ratios stored in the pattern database, the pattern of the second ratio having the smallest Euclidean distance is selected from the pattern database, and the selected second ratio is selected. An encoding process is performed using the code given to the pattern.

【０１１３】このように，実施の形態１０の音声圧縮符
号化装置によれば，第２の比のパターンを選択する際
に，第１の比のパターンとのユークリッド距離が最も小
さい第２の比のパターンを選択するため，第１の比のパ
ターンと最も近い性質の第２の比のパターンを容易に選
択することができる。As described above, according to the speech compression encoding apparatus of the tenth embodiment, when selecting the pattern of the second ratio, the second ratio having the smallest Euclidean distance from the pattern of the first ratio is selected. In this case, the pattern of the second ratio having the property closest to the pattern of the first ratio can be easily selected.

【０１１４】〔実施の形態１１〕実施の形態１１の音声
圧縮符号化装置は，実施の形態１０の音声圧縮符号化装
置において，予め用意した複数の第２の比のパターンと
符号との対応関係情報（パターンデータベース）が，多
次元データ構造により管理されており，係数変換部３０
３が，第２の比のパターンを選択する際に，最近点探索
を用いて第１の比のパターンとのユークリッド距離が最
も小さい第２の比のパターンを選択するものである。[Eleventh Embodiment] The speech compression encoding apparatus according to the eleventh embodiment is the same as the speech compression encoding apparatus according to the tenth embodiment, except that the correspondence between a plurality of second ratio patterns prepared in advance and the code. Information (pattern database) is managed by a multi-dimensional data structure, and the coefficient conversion unit 30
3 selects the second ratio pattern having the shortest Euclidean distance from the first ratio pattern by using the nearest point search when selecting the second ratio pattern.

【０１１５】実施の形態１１におけるパターンデータベ
ースは，例えばＢ木やｋ−ｄ木，四分木等の多次元デー
タ構造を有している。係数変換部３０３は，図１３のス
テップＳ１３０１において，この多次元データ構造を有
するパターンデータベースを用いて最近点探索を行い，
求めた比のパターンとのユークリッド距離が最小となる
比のパターンに対応する符号をパターンデータベースか
ら選択することによって，符号化処理を実行する。The pattern database according to the eleventh embodiment has a multidimensional data structure such as a B-tree, kd-tree, and quadtree. In step S1301 in FIG. 13, the coefficient conversion unit 303 performs a nearest point search using the pattern database having the multidimensional data structure.
An encoding process is executed by selecting, from the pattern database, a code corresponding to the pattern having the ratio that minimizes the Euclidean distance from the obtained pattern of the ratio.

【０１１６】このように，実施の形態１１の音声圧縮符
号化装置によれば，例えばＢ木やｋ−ｄ木，四分木等の
多次元データ構造を有するパターンデータベースを用い
るため，蓄積した比のパターンの効率的な管理が可能と
なると共に，検索時間を減少させることができる。ま
た，第２の比のパターンを選択する際に，最近点探索を
用いて第１の比のパターンとのユークリッド距離が最も
小さい第２の比のパターンを選択するため，第１の比の
パターンと最も近い性質の第２の比のパターンを容易に
選択することができる。As described above, according to the speech compression encoding apparatus of the eleventh embodiment, since a pattern database having a multidimensional data structure such as a B-tree, kd-tree, or quadtree is used, the stored ratio Can be efficiently managed, and the search time can be reduced. Further, when selecting the pattern of the second ratio, the closest point search is used to select the pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio. The pattern of the second ratio having the property closest to the above can be easily selected.

【０１１７】〔実施の形態１２〕実施の形態１２の音声
圧縮符号化装置は，実施の形態９の音声圧縮符号化装置
において，係数変換部３０３が，第２の比のパターンを
選択する際に，第１の比のパターンとのノルム（絶対値
距離：それぞれの座標軸における差の絶対値を加えるも
の）が最も小さい第２の比のパターンを選択するもので
ある。[Twelfth Embodiment] The speech compression encoding apparatus according to the twelfth embodiment is different from the speech compression encoding apparatus according to the ninth embodiment in that the coefficient conversion unit 303 selects a pattern having the second ratio. , The second ratio pattern having the smallest norm (absolute value distance: adding the absolute value of the difference in each coordinate axis) with the first ratio pattern is selected.

【０１１８】図１４は，実施の形態１２の係数変換部３
０３の動作手順を示すフローチャートである。なお，基
本的な動作は，図１２で示した実施の形態９の音声圧縮
符号化装置と同様に付き，ここでは異なる部分のみを説
明する。FIG. 14 is a diagram showing a coefficient conversion unit 3 according to the twelfth embodiment.
11 is a flowchart showing an operation procedure of the third embodiment. Note that the basic operation is the same as that of the speech compression encoding apparatus according to the ninth embodiment shown in FIG. 12, and only different parts will be described here.

【０１１９】実施の形態１２の係数変換部３０３は，ス
テップＳ１２０２で求めた第１の比のパターンと，ステ
ップＳ１２０３でパターンデータベースから読み出した
第２の比のパターンとを比較し，評価値Ｄｊとしてノル
ムを求める（Ｓ１４０１）。すなわち，パターンデータ
ベースに蓄積された比のパターンの探索において，比の
パターンを多次元空間におけるベクトルと考え，求めた
第１の比のパターンと蓄積された第２の比のパターンと
のノルムを求める。そして，第１の比のパターンとパタ
ーンデータベースに蓄積された全ての第２の比のパター
ンとを比較した結果，パターンデータベースからノルム
が最も小さい第２の比のパターンを選択し，選択した第
２の比のパターンに与えられた符号を用いて符号化処理
を実行する。The coefficient conversion unit 303 according to the twelfth embodiment compares the pattern of the first ratio obtained in step S1202 with the pattern of the second ratio read from the pattern database in step S1203, and generates an evaluation value Dj. A norm is obtained (S1401). That is, in the search for the ratio pattern stored in the pattern database, the ratio pattern is considered as a vector in a multidimensional space, and the norm between the obtained first ratio pattern and the stored second ratio pattern is calculated. . Then, as a result of comparing the pattern of the first ratio with all the patterns of the second ratio stored in the pattern database, the pattern of the second ratio having the smallest norm is selected from the pattern database, and the selected second pattern is selected. The encoding process is performed using the code given to the pattern of the ratio of.

【０１２０】このように，実施の形態１２の音声圧縮符
号化装置によれば，第２の比のパターンを選択する際
に，第１の比のパターンとのノルムが最も小さい第２の
比のパターンを選択するため，第１の比のパターンと最
も近い性質の第２の比のパターンを容易に選択すること
ができる。As described above, according to the speech compression encoding apparatus of the twelfth embodiment, when the second ratio pattern is selected, the second ratio pattern having the smallest norm with the first ratio pattern is selected. In order to select a pattern, it is possible to easily select a pattern having a second ratio having properties closest to the pattern having the first ratio.

【０１２１】[0121]

【発明の効果】以上説明したように，本発明の音声圧縮
符号化方法（請求項１）によれば，雑音源抽出工程が，
抽出したスペクトル包絡情報，サブフレーム，ピッチ情
報および利得情報を用いて，二次誤差信号を抽出する二
次誤差信号抽出工程と，抽出した二次誤差信号を離散コ
サイン変換する離散コサイン変換工程と，離散コサイン
変換工程で得られた離散コサイン変換係数列を所定のビ
ット長に変換する係数列変換工程と，を含むことによ
り，雑音源情報を抽出して符号化するため，ＣＥＬＰ方
式の符号化の過程において，演算量を削減できると共
に，メモリ量の低減を図ることができる。すなわち，本
発明の音声圧縮符号化方法によれば，二次誤差信号の量
子化において，符号帳を持たず，フィルタ計算を用いて
符号帳探索を行わないため，低演算量，低メモリ量を実
現する高音質かつ低ビットレート音声圧縮符号化方法を
得ることができる。As described above, according to the speech compression / encoding method of the present invention (claim 1), the noise source extracting step includes:
A second-order error signal extracting step of extracting a second-order error signal using the extracted spectral envelope information, subframe, pitch information, and gain information; a discrete cosine transform step of performing a discrete cosine transform of the extracted second-order error signal; And a coefficient sequence converting step of converting the discrete cosine transform coefficient sequence obtained in the discrete cosine transform step into a predetermined bit length, thereby extracting and encoding noise source information. In the process, the amount of calculation can be reduced and the amount of memory can be reduced. That is, according to the speech compression encoding method of the present invention, in the quantization of the secondary error signal, the codebook search is not performed by using the filter calculation without using the codebook. A high-quality and low-bit-rate voice compression encoding method to be realized can be obtained.

【０１２２】また，本発明の音声圧縮符号化方法（請求
項２）によれば，請求項１記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換係数列を
所定のビット長に変換する際に，離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択した後，選択した係数の位置および係数
値と選択した係数の前後に位置する予め定めた数の係数
の係数値とを所定のビット長に符号化することにより，
離散コサイン変換係数列を所定のビット長に変換するた
め，選択した係数の前後に位置する係数について位置情
報を符号化する必要をなくし，より多くの係数について
符号化することができる。したがって，変換後のＤＣＴ
変換係数列のビット数を増加させることなく，再生音の
音質を向上させることができる。Further, according to the audio compression encoding method of the present invention (claim 2), in the audio compression encoding method according to claim 1, the coefficient sequence conversion step includes the step of converting the discrete cosine transform coefficient sequence to a predetermined bit length. In the conversion to a discrete cosine transform step, after selecting a predetermined number of coefficients from the plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, the positions of the selected coefficients and the coefficient values and the positions before and after the selected coefficients are determined. By coding the coefficient value of the specified number of coefficients and the predetermined bit length,
Since the discrete cosine transform coefficient sequence is converted to a predetermined bit length, it is not necessary to encode position information for coefficients located before and after the selected coefficient, and more coefficients can be encoded. Therefore, the transformed DCT
The sound quality of the reproduced sound can be improved without increasing the number of bits of the conversion coefficient sequence.

【０１２３】また，本発明の音声圧縮符号化方法（請求
項３）によれば，請求項２記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換工程で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，係数値の絶対値が最大の係数を選
択するため，符号化すべき係数を容易に選択することが
できる。Further, according to the audio compression encoding method of the present invention (claim 3), in the audio compression encoding method according to claim 2, the coefficient sequence conversion step includes a plurality of steps obtained by the discrete cosine conversion step. When a predetermined number of coefficients are selected from the discrete cosine transform coefficients, the coefficient having the maximum absolute value of the coefficient value is selected, so that the coefficient to be coded can be easily selected.

【０１２４】また，本発明の音声圧縮符号化方法（請求
項４）によれば，請求項２記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換工程で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，係数値の絶対値が最大のものから
予め定めた数の係数を選択するため，符号化すべき係数
を容易に選択することができる。加えて，ＤＣＴ変換係
数においては，一般に，係数値の絶対値が高い係数の近
傍に位置する係数も係数値の絶対値も高いという性質が
あるため，この性質を活かして絶対値の最大の係数から
選択していくことにより，より効率よくＤＣＴ変換係数
列を所定のビット長に変換することができる。Further, according to the voice compression encoding method of the present invention (claim 4), in the voice compression encoding method according to claim 2, the coefficient sequence conversion step includes a plurality of steps obtained by the discrete cosine conversion step. When selecting a predetermined number of coefficients from the discrete cosine transform coefficients, the coefficient to be coded can be easily selected because a predetermined number of coefficients having the largest absolute value of the coefficient value are selected. . In addition, the DCT transform coefficient generally has a property that both the coefficient located near the coefficient having a high absolute value and the absolute value of the coefficient value are high. , The DCT transform coefficient sequence can be more efficiently converted into a predetermined bit length.

【０１２５】また，本発明の音声圧縮符号化方法（請求
項５）によれば，請求項２記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換工程で得
られた複数の離散コサイン変換係数から予め定めた数の
係数を選択する際に，予め設定した閾値を超える係数値
を有する係数を選択するため，符号化すべき係数を容易
に選択することができる。Further, according to the audio compression encoding method of the present invention (claim 5), in the audio compression encoding method according to claim 2, the coefficient sequence conversion step includes a plurality of steps obtained by the discrete cosine conversion step. When a predetermined number of coefficients are selected from the discrete cosine transform coefficients, a coefficient having a coefficient value exceeding a preset threshold is selected, so that a coefficient to be coded can be easily selected.

【０１２６】また，本発明の音声圧縮符号化方法（請求
項６）によれば，請求項１記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換係数列を
所定のビット長に変換する際に，離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
閾値を超える係数値を有する係数を選択し，選択した係
数の位置と係数値とを所定のビット長に符号化すること
により，離散コサイン変換係数列を所定のビット長に変
換するため，変換後のＤＣＴ変換係数列のビット数を増
加させることなく，再生音の音質を向上させることがで
きると共に，符号化すべき係数を容易に選択することが
できる。According to the voice compression encoding method of the present invention (claim 6), in the voice compression encoding method according to claim 1, the coefficient sequence conversion step includes the step of converting the discrete cosine transform coefficient sequence to a predetermined bit length. When transforming to a coefficient, a coefficient having a coefficient value exceeding a predetermined threshold is selected from a plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and the position of the selected coefficient and the coefficient value are determined by a predetermined bit length. , The discrete cosine transform coefficient sequence is converted to a predetermined bit length, so that the sound quality of the reproduced sound can be improved without increasing the number of bits of the converted DCT transform coefficient sequence. The coefficients to be coded can be easily selected.

【０１２７】また，本発明の音声圧縮符号化方法（請求
項７）によれば，請求項１記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換係数列を
所定のビット長に変換する際に，離散コサイン変換工程
で得られた複数の離散コサイン変換係数から一つの係数
を選択し，選択した係数の位置および係数値と選択した
係数の前後に位置する予め定めた数の係数の係数値とを
所定のビット長に符号化した後，さらに符号化した係数
を除いた複数の離散コサイン変換係数から一つの係数を
選択し，選択した係数の位置および係数値と選択した係
数の前後に位置する予め定めた数の係数の係数値とを所
定のビット長に符号化する工程を予め設定した回数実行
することにより，離散コサイン変換係数列を所定のビッ
ト長に変換するため，選択した係数の前後に位置する係
数については位置情報を符号化する必要をなくし，より
多くの係数について符号化を行うことができる。したが
って，変換後のＤＣＴ変換係数列のビット数を増加させ
ることなく，再生音の音質を向上させることができる。
また，既に符号化されたＤＣＴ変換係数を再度符号化す
るということがないため，より効率的に符号化処理を実
行することができる。According to the audio compression encoding method of the present invention (claim 7), in the audio compression encoding method according to claim 1, the coefficient sequence conversion step includes the step of converting the discrete cosine transform coefficient sequence to a predetermined bit length. When converting to a discrete cosine transform, one coefficient is selected from the plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and the position of the selected coefficient and a predetermined number of coefficients located before and after the selected coefficient are selected. After the coefficient value of the coefficient is encoded to a predetermined bit length, one coefficient is selected from a plurality of discrete cosine transform coefficients excluding the encoded coefficient, and the position and coefficient value of the selected coefficient and the selected coefficient are selected. The discrete cosine transform coefficient sequence is converted to a predetermined bit length by executing a process of encoding a predetermined number of coefficient values of a predetermined number of coefficients positioned before and after the predetermined number of bits into a predetermined bit length. For coefficients located before and after the selected coefficients eliminates the need to encode the position information, it is possible to perform encoding for more coefficients. Therefore, the sound quality of the reproduced sound can be improved without increasing the number of bits of the converted DCT coefficient sequence.
Further, since the already encoded DCT transform coefficients are not re-encoded, the encoding process can be executed more efficiently.

【０１２８】また，本発明の音声圧縮符号化方法（請求
項８）によれば，請求項７記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換工程で得
られた複数の離散コサイン変換係数から一つの係数を選
択する際に，係数値の絶対値が最大の係数を選択し，符
号化した係数を除いた複数の離散コサイン変換係数から
一つの係数を選択する際に，係数値の絶対値が最大の係
数を選択するため，符号化すべき係数を容易に選択する
ことができる。また，ＤＣＴ変換係数においては，一般
に，係数値の絶対値が高い係数の近傍に位置する係数も
係数値の絶対値も高いという性質があるため，この性質
を活かして絶対値の最大の係数から選択していくことに
より，より効率よくＤＣＴ変換係数列を所定のビット長
に変換することができる。Further, according to the audio compression encoding method of the present invention (claim 8), in the audio compression encoding method according to claim 7, the coefficient sequence conversion step comprises a plurality of steps obtained by the discrete cosine conversion step. When selecting one coefficient from the discrete cosine transform coefficients, the coefficient having the largest absolute value of the coefficient value is selected, and when selecting one coefficient from a plurality of discrete cosine transform coefficients excluding the coded coefficients, Since the coefficient having the largest absolute value of the coefficient value is selected, the coefficient to be coded can be easily selected. Also, DCT transform coefficients generally have a property that both the coefficient located near a coefficient having a high absolute value and the absolute value of the coefficient value are high. By making a selection, the DCT transform coefficient sequence can be more efficiently converted to a predetermined bit length.

【０１２９】また，本発明の音声圧縮符号化方法（請求
項９）によれば，請求項１記載の音声圧縮符号化方法に
おいて，係数列変換工程が，離散コサイン変換係数列を
所定のビット長に変換する際に，離散コサイン変換工程
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択し，選択した係数の係数値と選択した係
数の前後に位置する予め定めた数の係数の係数値との第
１の比のパターンを求めた後，予め用意した複数の第２
の比のパターンと符号との対応関係情報から第１の比の
パターンに最も近い性質の第２の比のパターンを選択
し，選択した第２の比のパターンに対応する符号を用い
て第１の比のパターンを符号化することにより，離散コ
サイン変換係数列を所定のビット長に変換するため，選
択した係数の前後に位置する係数については位置情報を
符号化する必要をなくし，より多くの係数について符号
化を行うことができる。したがって，変換後のＤＣＴ変
換係数列のビット数を増加させることなく，再生音の音
質を向上させることができる。Further, according to the audio compression encoding method of the present invention (claim 9), in the audio compression encoding method according to claim 1, the coefficient sequence conversion step includes converting the discrete cosine transform coefficient sequence to a predetermined bit length. When transforming to a discrete cosine transform step, a predetermined number of coefficients are selected from a plurality of discrete cosine transform coefficients obtained in the discrete cosine transform step, and a coefficient value of the selected coefficient and a predetermined number of coefficients located before and after the selected coefficient are selected. After obtaining the pattern of the first ratio with the coefficient value of the coefficient of
The second ratio pattern having the property closest to the first ratio pattern is selected from the correspondence information between the ratio pattern and the code, and the first pattern is selected using the code corresponding to the selected second ratio pattern. In order to convert a discrete cosine transform coefficient sequence into a predetermined bit length by encoding a pattern having a ratio of, it is not necessary to encode position information for coefficients located before and after the selected coefficient. Coding can be performed on the coefficients. Therefore, the sound quality of the reproduced sound can be improved without increasing the number of bits of the converted DCT coefficient sequence.

【０１３０】また，本発明の音声圧縮符号化方法（請求
項１０）によれば，請求項９記載の音声圧縮符号化方法
において，係数列変換工程が，第２の比のパターンを選
択する際に，第１の比のパターンとのユークリッド距離
が最も小さい第２の比のパターンを選択するため，第１
の比のパターンと最も近い性質の第２の比のパターンを
容易に選択することができる。According to the audio compression encoding method of the present invention (claim 10), in the audio compression encoding method according to claim 9, the coefficient sequence conversion step is performed when a pattern having the second ratio is selected. First, in order to select the second ratio pattern having the smallest Euclidean distance from the first ratio pattern, the first ratio pattern is selected.
The pattern of the second ratio having the property closest to the pattern of the ratio can be easily selected.

【０１３１】また，本発明の音声圧縮符号化方法（請求
項１１）によれば，請求項１０記載の音声圧縮符号化方
法において，予め用意した複数の第２の比のパターンと
符号との対応関係情報が，多次元データ構造により管理
されており，係数列変換工程が，第２の比のパターンを
選択する際に，最近点探索を用いて前記第１の比のパタ
ーンとのユークリッド距離が最も小さい第２の比のパタ
ーンを選択するため，蓄積した比のパターンの効率的な
管理が可能となると共に，検索時間を減少させることが
できる。また，第２の比のパターンを選択する際に，最
近点探索を用いて第１の比のパターンとのユークリッド
距離が最も小さい第２の比のパターンを選択するため，
第１の比のパターンと最も近い性質の第２の比のパター
ンを容易に選択することができる。According to the audio compression encoding method of the present invention (claim 11), in the audio compression encoding method according to claim 10, the correspondence between a plurality of second ratio patterns and codes prepared in advance is provided. The relation information is managed by a multi-dimensional data structure, and when the coefficient sequence conversion step selects a pattern of the second ratio, the Euclidean distance from the pattern of the first ratio is determined using nearest point search. Since the pattern with the smallest second ratio is selected, the stored ratio pattern can be efficiently managed, and the search time can be reduced. Further, when selecting the pattern of the second ratio, since the closest point search is used to select the pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio,
The pattern of the second ratio having the property closest to the pattern of the first ratio can be easily selected.

【０１３２】また，本発明の音声圧縮符号化方法（請求
項１２）によれば，請求項９記載の音声圧縮符号化方法
において，係数列変換工程が，第２の比のパターンを選
択する際に，第１の比のパターンとのノルムが最も小さ
い第２の比のパターンを選択するため，第１の比のパタ
ーンと最も近い性質の第２の比のパターンを容易に選択
することができる。Further, according to the audio compression encoding method of the present invention, the coefficient sequence conversion step is performed when the second ratio pattern is selected. Since the second ratio pattern having the smallest norm with respect to the first ratio pattern is selected, the second ratio pattern having properties closest to the first ratio pattern can be easily selected. .

【０１３３】また，本発明の音声圧縮符号化装置（請求
項１３）によれば，雑音源抽出手段が，抽出したスペク
トル包絡情報，サブフレーム，ピッチ情報および利得情
報を用いて，二次誤差信号を抽出する二次誤差信号抽出
手段と，抽出した二次誤差信号を離散コサイン変換する
離散コサイン変換手段と，離散コサイン変換手段で得ら
れた離散コサイン変換係数列を所定のビット長に変換す
る係数列変換手段と，を含むことにより，雑音源情報を
抽出して符号化するため，ＣＥＬＰ方式の符号化の過程
において，演算量を削減できると共に，メモリ量の低減
を図ることができる。すなわち，本発明の音声圧縮符号
化装置によれば，二次誤差信号の量子化において，符号
帳を持たず，フィルタ計算を用いて符号帳探索を行わな
いため，低演算量，低メモリ量を実現する高音質かつ低
ビットレート音声圧縮符号化装置を得ることができる。Further, according to the speech compression encoding apparatus of the present invention (claim 13), the noise source extracting means uses the extracted spectral envelope information, subframe, pitch information and gain information to obtain a second error signal. , A discrete cosine transform unit for performing a discrete cosine transform of the extracted secondary error signal, and a coefficient for converting a discrete cosine transform coefficient sequence obtained by the discrete cosine transform unit into a predetermined bit length By including the column conversion means, the noise source information is extracted and encoded, so that the amount of calculation and the amount of memory can be reduced in the CELP encoding process. That is, according to the speech compression encoding apparatus of the present invention, in the quantization of the second-order error signal, the codebook is not provided and the codebook search is not performed by using the filter calculation. A high-quality and low-bit-rate audio compression / encoding device that can be realized can be obtained.

【０１３４】また，本発明の音声圧縮符号化装置（請求
項１４）によれば，請求項１３記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換係数
列を所定のビット長に変換する際に，離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた数の係数を選択した後，選択した係数の位置および
係数値と選択した係数の前後に位置する予め定めた数の
係数の係数値とを所定のビット長に符号化することによ
り，離散コサイン変換係数列を所定のビット長に変換す
るため，選択した係数の前後に位置する係数について位
置情報を符号化する必要をなくし，より多くの係数につ
いて符号化することができる。したがって，変換後のＤ
ＣＴ変換係数列のビット数を増加させることなく，再生
音の音質を向上させることができる。According to the audio compression coding apparatus of the present invention, the coefficient sequence conversion means converts the discrete cosine transform coefficient sequence to a predetermined bit length. In the conversion into a plurality of discrete cosine transform coefficients, a predetermined number of coefficients are selected from the plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means. In order to convert a discrete cosine transform coefficient sequence to a predetermined bit length by encoding a predetermined number of coefficient values and a coefficient value to a predetermined bit length, position information is encoded for coefficients located before and after the selected coefficient. Therefore, it is possible to encode more coefficients. Therefore, the converted D
The sound quality of the reproduced sound can be improved without increasing the number of bits of the CT conversion coefficient sequence.

【０１３５】また，本発明の音声圧縮符号化装置（請求
項１５）によれば，請求項１４記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換手段
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択する際に，係数値の絶対値が最大の係数
を選択するため，符号化すべき係数を容易に選択するこ
とができる。Further, according to the audio compression coding apparatus of the present invention, the coefficient sequence conversion means may include a plurality of coefficient sequences obtained by the discrete cosine conversion means. When a predetermined number of coefficients are selected from the discrete cosine transform coefficients, the coefficient having the maximum absolute value of the coefficient value is selected, so that the coefficient to be coded can be easily selected.

【０１３６】また，本発明の音声圧縮符号化装置（請求
項１６）によれば，請求項１４記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換手段
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択する際に，係数値の絶対値が最大のもの
から予め定めた数の係数を選択するため，符号化すべき
係数を容易に選択することができる。加えて，ＤＣＴ変
換係数においては，一般に，係数値の絶対値が高い係数
の近傍に位置する係数も係数値の絶対値も高いという性
質があるため，この性質を活かして絶対値の最大の係数
から選択していくことにより，より効率よくＤＣＴ変換
係数列を所定のビット長に変換することができる。Further, according to the speech compression encoding apparatus of the present invention, the coefficient sequence conversion means may include a plurality of coefficient sequences obtained by the discrete cosine transformation means. When selecting a predetermined number of coefficients from the discrete cosine transform coefficients, the coefficient to be coded can be easily selected because a predetermined number of coefficients having the largest absolute value of the coefficient value are selected. . In addition, the DCT transform coefficient generally has a property that both the coefficient located near the coefficient having a high absolute value and the absolute value of the coefficient value are high. , The DCT transform coefficient sequence can be more efficiently converted into a predetermined bit length.

【０１３７】また，本発明の音声圧縮符号化装置（請求
項１７）によれば，請求項１４記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換手段
で得られた複数の離散コサイン変換係数から予め定めた
数の係数を選択する際に，予め設定した閾値を超える係
数値を有する係数を選択するため，符号化すべき係数を
容易に選択することができる。Further, according to the speech compression encoding apparatus of the present invention, the coefficient sequence conversion means may include a plurality of coefficient sequences obtained by the discrete cosine transformation means. When a predetermined number of coefficients are selected from the discrete cosine transform coefficients, a coefficient having a coefficient value exceeding a preset threshold is selected, so that a coefficient to be coded can be easily selected.

【０１３８】また，本発明の音声圧縮符号化装置（請求
項１８）によれば，請求項１３記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換係数
列を所定のビット長に変換する際に，離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた閾値を超える係数値を有する係数を選択した後，選
択した係数の位置と係数値とを所定のビット長に符号化
することにより，離散コサイン変換係数列を所定のビッ
ト長に変換するため，変換後のＤＣＴ変換係数列のビッ
ト数を増加させることなく，再生音の音質を向上させる
ことができると共に，符号化すべき係数を容易に選択す
ることができる。Further, according to the audio compression encoding apparatus of the present invention, the coefficient sequence conversion means converts the discrete cosine transform coefficient sequence to a predetermined bit length. In the conversion to a coefficient, after selecting a coefficient having a coefficient value exceeding a predetermined threshold from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, the position of the selected coefficient and the coefficient value are converted into predetermined bits. Since the discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding to a long length, the sound quality of the reproduced sound can be improved without increasing the number of bits of the converted DCT transform coefficient sequence. , Coefficients to be encoded can be easily selected.

【０１３９】また，本発明の音声圧縮符号化装置（請求
項１９）によれば，請求項１３記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換係数
列を所定のビット長に変換する際に，離散コサイン変換
手段で得られた複数の離散コサイン変換係数から一つの
係数を選択し，選択した係数の位置および係数値と選択
した係数の前後に位置する予め定めた数の係数の係数値
とを所定のビット長に符号化した後，さらに符号化した
係数を除いた複数の離散コサイン変換係数から一つの係
数を選択し，選択した係数の位置および係数値と選択し
た係数の前後に位置する予め定めた数の係数の係数値と
を所定のビット長に符号化する工程を予め設定した回数
実行することにより，離散コサイン変換係数列を所定の
ビット長に変換するため，選択した係数の前後に位置す
る係数については位置情報を符号化する必要をなくし，
より多くの係数について符号化を行うことができる。し
たがって，変換後のＤＣＴ変換係数列のビット数を増加
させることなく，再生音の音質を向上させることができ
る。また，既に符号化されたＤＣＴ変換係数を再度符号
化するということがないため，より効率的に符号化処理
を実行することができる。According to the audio compression encoding apparatus of the present invention, the coefficient sequence converting means converts the discrete cosine transform coefficient sequence to a predetermined bit length. When transforming to a coefficient, one coefficient is selected from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, and the position of the selected coefficient and a predetermined number of coefficients located before and after the coefficient are selected. After the coefficient value of the coefficient is encoded to a predetermined bit length, one coefficient is selected from a plurality of discrete cosine transform coefficients excluding the encoded coefficient, and the position and coefficient value of the selected coefficient and the selected coefficient are selected. The discrete cosine transform coefficient sequence is converted into a predetermined bit length by executing a process of coding a predetermined number of coefficient values of a predetermined number of coefficients positioned before and after the predetermined number of times into a predetermined bit length. Therefore, the coefficients located before and after the selected coefficients eliminates the need to encode the position information,
Encoding can be performed for more coefficients. Therefore, the sound quality of the reproduced sound can be improved without increasing the number of bits of the converted DCT coefficient sequence. Further, since the already encoded DCT transform coefficients are not re-encoded, the encoding process can be executed more efficiently.

【０１４０】また，本発明の音声圧縮符号化装置（請求
項２０）によれば，請求項１９記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換手段
で得られた複数の離散コサイン変換係数から一つの係数
を選択する際に，係数値の絶対値が最大の係数を選択
し，符号化した係数を除いた複数の離散コサイン変換係
数から一つの係数を選択する際に，係数値の絶対値が最
大の係数を選択するため，符号化すべき係数を容易に選
択することができる。また，ＤＣＴ変換係数において
は，一般に，係数値の絶対値が高い係数の近傍に位置す
る係数も係数値の絶対値も高いという性質があるため，
この性質を活かして絶対値の最大の係数から選択してい
くことにより，より効率よくＤＣＴ変換係数列を所定の
ビット長に変換することができる。Further, according to the audio compression encoding apparatus of the present invention, the coefficient sequence converting means may include a plurality of coefficient sequences obtained by the discrete cosine converting means. When selecting one coefficient from the discrete cosine transform coefficients, the coefficient having the largest absolute value of the coefficient value is selected, and when selecting one coefficient from a plurality of discrete cosine transform coefficients excluding the coded coefficients, Since the coefficient having the largest absolute value of the coefficient value is selected, the coefficient to be coded can be easily selected. Further, DCT transform coefficients generally have a property that both the coefficient located in the vicinity of the coefficient having a high absolute value of the coefficient value and the absolute value of the coefficient value are high.
By taking advantage of this property and selecting from the coefficient having the largest absolute value, the DCT transform coefficient sequence can be more efficiently converted to a predetermined bit length.

【０１４１】また，本発明の音声圧縮符号化装置（請求
項２１）によれば，請求項１３記載の音声圧縮符号化装
置において，係数列変換手段が，離散コサイン変換係数
列を所定のビット長に変換する際に，離散コサイン変換
手段で得られた複数の離散コサイン変換係数から予め定
めた数の係数を選択し，選択した係数の係数値と選択し
た係数の前後に位置する予め定めた数の係数の係数値と
の第１の比のパターンを求めた後，予め用意した複数の
第２の比のパターンと符号との対応関係情報から第１の
比のパターンに最も近い性質の第２の比のパターンを選
択し，選択した第２の比のパターンに対応する符号を用
いて第１の比のパターンを符号化することにより，離散
コサイン変換係数列を所定のビット長に変換するため，
選択した係数の前後に位置する係数については位置情報
を符号化する必要をなくし，より多くの係数について符
号化を行うことができる。したがって，変換後のＤＣＴ
変換係数列のビット数を増加させることなく，再生音の
音質を向上させることができる。According to the audio compression encoding apparatus of the present invention, the coefficient sequence conversion means converts the discrete cosine transform coefficient sequence to a predetermined bit length. When transforming to a discrete cosine transform means, a predetermined number of coefficients are selected from a plurality of discrete cosine transform coefficients obtained by the discrete cosine transform means, and a coefficient value of the selected coefficient and a predetermined number of coefficients located before and after the selected coefficient are selected. After obtaining the pattern of the first ratio with the coefficient value of the coefficient of the second ratio, from the correspondence information between the plurality of patterns of the second ratio and the code prepared in advance, the second pattern having the property closest to the pattern of the first ratio To convert a discrete cosine transform coefficient sequence into a predetermined bit length by selecting a pattern having a ratio of the first ratio and encoding the pattern having the first ratio using a code corresponding to the selected second ratio pattern. ,
It is not necessary to encode position information for coefficients located before and after the selected coefficient, and it is possible to encode more coefficients. Therefore, the transformed DCT
The sound quality of the reproduced sound can be improved without increasing the number of bits of the conversion coefficient sequence.

【０１４２】また，本発明の音声圧縮符号化装置（請求
項２２）によれば，請求項２１記載の音声圧縮符号化装
置において，係数列変換手段が，第２の比のパターンを
選択する際に，第１の比のパターンとのユークリッド距
離が最も小さい第２の比のパターンを選択するため，第
１の比のパターンと最も近い性質の第２の比のパターン
を容易に選択することができる。According to the speech compression encoding apparatus of the present invention, the coefficient sequence conversion means may select a pattern having the second ratio. In addition, since the second ratio pattern having the smallest Euclidean distance from the first ratio pattern is selected, it is possible to easily select the second ratio pattern having the property closest to the first ratio pattern. it can.

【０１４３】また，本発明の音声圧縮符号化装置（請求
項２３）によれば，請求項２２記載の音声圧縮符号化装
置において，予め用意した複数の第２の比のパターンと
符号との対応関係情報が，多次元データ構造により管理
されており，係数列変換手段が，第２の比のパターンを
選択する際に，最近点探索を用いて第１の比のパターン
とのユークリッド距離が最も小さい第２の比のパターン
を選択するため，蓄積した比のパターンの効率的な管理
が可能となると共に，検索時間を減少させることができ
る。また，第２の比のパターンを選択する際に，最近点
探索を用いて第１の比のパターンとのユークリッド距離
が最も小さい第２の比のパターンを選択するため，第１
の比のパターンと最も近い性質の第２の比のパターンを
容易に選択することができる。According to the speech compression encoding apparatus of the present invention (claim 23), in the speech compression encoding apparatus according to claim 22, correspondence between a plurality of second ratio patterns and codes prepared in advance. The relation information is managed by a multi-dimensional data structure, and when the coefficient sequence conversion means selects the pattern of the second ratio, the nearest Euclidean distance from the pattern of the first ratio is determined using nearest point search. Since the pattern having the small second ratio is selected, the stored ratio pattern can be efficiently managed, and the search time can be reduced. Further, when selecting the pattern of the second ratio, since the closest point search is used to select the pattern of the second ratio having the smallest Euclidean distance from the pattern of the first ratio, the first ratio is selected.
The pattern of the second ratio having the property closest to the pattern of the ratio can be easily selected.

【０１４４】また，本発明の音声圧縮符号化装置（請求
項２４）によれば，請求項２１記載の音声圧縮符号化装
置において，係数列変換手段が，第２の比のパターンを
選択する際に，第１の比のパターンとのノルムが最も小
さい第２の比のパターンを選択するため，第１の比のパ
ターンと最も近い性質の第２の比のパターンを容易に選
択することができる。Further, according to the audio compression encoding apparatus of the present invention, in the audio compression encoding apparatus according to the present invention, when the coefficient sequence converting means selects the pattern of the second ratio. Since the second ratio pattern having the smallest norm with respect to the first ratio pattern is selected, the second ratio pattern having properties closest to the first ratio pattern can be easily selected. .

[Brief description of the drawings]

【図１】実施の形態１の音声圧縮符号化装置の概略構成
図である。FIG. 1 is a schematic configuration diagram of an audio compression encoding device according to a first embodiment.

【図２】実施の形態１の音声符号化部のブロック構成図
である。FIG. 2 is a block diagram of a speech encoding unit according to the first embodiment.

【図３】実施の形態１の雑音源抽出部の概略ブロック図
である。FIG. 3 is a schematic block diagram of a noise source extraction unit according to the first embodiment.

【図４】実施の形態１の音声復号化部の一部構成を示す
ブロック図である。FIG. 4 is a block diagram illustrating a partial configuration of a speech decoding unit according to the first embodiment.

【図５】実施の形態２の係数変換部の動作手順を示すフ
ローチャートである。FIG. 5 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the second embodiment.

【図６】実施の形態３の係数変換部の動作手順を示すフ
ローチャートである。FIG. 6 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the third embodiment.

【図７】実施の形態４の係数変換部の動作手順を示すフ
ローチャートである。FIG. 7 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the fourth embodiment.

【図８】実施の形態５の係数変換部の動作手順を示すフ
ローチャートである。FIG. 8 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the fifth embodiment.

【図９】実施の形態６の係数変換部の動作手順を示すフ
ローチャートである。FIG. 9 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the sixth embodiment.

【図１０】実施の形態７の係数変換部の動作手順を示す
フローチャートである。FIG. 10 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the seventh embodiment.

【図１１】実施の形態８の係数変換部の動作手順を示す
フローチャートである。FIG. 11 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the eighth embodiment.

【図１２】実施の形態９の係数変換部の動作手順を示す
フローチャートである。FIG. 12 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the ninth embodiment;

【図１３】実施の形態１０の係数変換部の動作手順を示
すフローチャートである。FIG. 13 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the tenth embodiment.

【図１４】実施の形態１２の係数変換部の動作手順を示
すフローチャートである。FIG. 14 is a flowchart illustrating an operation procedure of a coefficient conversion unit according to the twelfth embodiment.

[Explanation of symbols]

１００音声圧縮符号化装置１０１Ａ／Ｄ変換部１０２音声符号化部１０３蓄積部１０４音声復号化部１０５Ｄ／Ａ変換部２０１フレーム構成部２０２スペクトル包絡抽出部２０３サブフレーム構成部２０４ピッチ情報抽出部２０５利得抽出部２０６雑音源抽出部３０１二次誤差信号構成部３０２ＤＣＴ変換部３０３係数変換部４０１スペクトル包絡復号部４０２ピッチ情報復号部４０３雑音源復号部４０４利得復号部４０５音声合成部 REFERENCE SIGNS LIST 100 audio compression encoding apparatus 101 A / D conversion section 102 audio encoding section 103 storage section 104 audio decoding section 105 D / A conversion section 201 frame configuration section 202 spectrum envelope extraction section 203 subframe configuration section 204 pitch information extraction section 205 Gain extraction unit 206 Noise source extraction unit 301 Secondary error signal configuration unit 302 DCT conversion unit 303 Coefficient conversion unit 401 Spectrum envelope decoding unit 402 Pitch information decoding unit 403 Noise source decoding unit 404 Gain decoding unit 405 Voice synthesis unit

Claims

[Claims]

A first step of inputting an analog audio waveform and converting it into a digital audio waveform; a second step of encoding the digital audio waveform by a predetermined encoding method; A third step of storing an audio waveform, a fourth step of extracting and decoding the stored digital audio waveform, and a fifth step of converting the decoded digital audio waveform to an analog audio waveform. ,
Wherein the second step comprises: a frame dividing step of dividing the digital speech waveform into processing units called frames; and extracting spectrum envelope information representing a spectrum envelope from each of the divided frames. Spectral envelope information extracting step of performing coding and encoding, subframe dividing step of dividing each of the divided frames into processing units called subframes, and using the extracted spectral envelope information, A pitch information extracting step of extracting and encoding pitch information from each of them, a gain information extracting step of extracting and encoding gain information from the extracted pitch information, and the spectrum envelope information, subframe, pitch information and gain. Extracts and encodes noise source information, which is sound source information, from information A sound source information extracting step, wherein the fourth step is a restoring step of restoring the encoded spectrum envelope information, pitch information, gain information and noise source information, and the restored pitch information, gain information and An excitation source signal generation step of generating an excitation source signal from noise source information; and a synthesized speech generation step of generating a synthesized speech from the excitation source signal and the restored spectrum envelope information. An extraction step of extracting a secondary error signal using the extracted spectral envelope information, subframe, pitch information and gain information, and a discrete cosine transform of the extracted secondary error signal A discrete cosine transform step, and a coefficient sequence transforming step of transforming the discrete cosine transform coefficient sequence obtained in the discrete cosine transform step into a predetermined bit length. By,
A speech compression encoding method, wherein the noise source information is extracted and encoded.

2. The voice compression encoding method according to claim 1, wherein the coefficient sequence conversion step is performed by converting the discrete cosine transform coefficient sequence into a predetermined bit length. After selecting a predetermined number of coefficients from the plurality of discrete cosine transform coefficients, the position and the coefficient value of the selected coefficient and the coefficient values of the predetermined number of coefficients located before and after the selected coefficient are determined by a predetermined number. A speech compression encoding method characterized in that said discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding to a bit length.

3. The audio compression encoding method according to claim 2, wherein said coefficient sequence converting step selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained in said discrete cosine transform step. And selecting a coefficient having the largest absolute value of the coefficient value.

4. The voice compression encoding method according to claim 2, wherein said coefficient sequence converting step selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained in said discrete cosine transform step. And a method of selecting a predetermined number of coefficients from those having the largest absolute value of the coefficient values.

5. A speech compression encoding method according to claim 2, wherein said coefficient sequence converting step selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained in said discrete cosine transform step. And selecting a coefficient having a coefficient value exceeding a preset threshold value.

6. The audio compression encoding method according to claim 1, wherein the coefficient sequence conversion step is performed by converting the discrete cosine transform coefficient sequence into a predetermined bit length. Selecting a coefficient having a coefficient value exceeding a predetermined threshold from a plurality of discrete cosine transform coefficients;
A speech compression encoding method characterized in that the discrete cosine transform coefficient sequence is converted into a predetermined bit length by coding the selected coefficient position and the coefficient value into a predetermined bit length.

7. The audio compression encoding method according to claim 1, wherein the coefficient sequence conversion step is performed by converting the discrete cosine transform coefficient sequence into a predetermined bit length. One coefficient is selected from a plurality of discrete cosine transform coefficients, and the position and coefficient value of the selected coefficient and the coefficient values of a predetermined number of coefficients positioned before and after the selected coefficient are encoded into a predetermined bit length. After that, one coefficient is selected from the plurality of discrete cosine transform coefficients excluding the coded coefficient, and a position and a coefficient value of the selected coefficient and a predetermined coefficient located before and after the selected coefficient are selected. By performing the step of encoding the coefficient value of the number coefficient and the coefficient value to a predetermined bit length a preset number of times,
A speech compression encoding method, wherein the discrete cosine transform coefficient sequence is converted into a predetermined bit length.

8. A speech compression encoding method according to claim 7, wherein said coefficient sequence converting step includes selecting one coefficient from a plurality of discrete cosine transform coefficients obtained in said discrete cosine transform step. Selecting the coefficient having the largest absolute value of the coefficient when selecting the coefficient having the largest absolute value of the numerical value and selecting one coefficient from the plurality of discrete cosine transform coefficients excluding the encoded coefficient; A voice compression encoding method characterized by the following.

9. The audio compression encoding method according to claim 1, wherein the coefficient sequence conversion step is performed by converting the discrete cosine transform coefficient sequence into a predetermined bit length. A predetermined number of coefficients are selected from a plurality of discrete cosine transform coefficients, and a first ratio of a coefficient value of the selected coefficient to a coefficient value of a predetermined number of coefficients located before and after the selected coefficient is selected. After obtaining the pattern, a pattern of the second ratio having a property closest to the pattern of the first ratio is selected from correspondence information between a plurality of patterns of the second ratio and codes prepared in advance, and And encoding the pattern of the first ratio using a code corresponding to the pattern of the second ratio to convert the discrete cosine transform coefficient sequence into a predetermined bit length. Method.

10. The voice compression encoding method according to claim 9, wherein the coefficient sequence conversion step comprises selecting the second ratio pattern such that the Euclidean distance from the first ratio pattern is the smallest. A voice compression encoding method characterized by selecting a pattern having a small second ratio.

11. The audio compression encoding method according to claim 10, wherein the correspondence information between the plurality of second ratio patterns and the codes prepared in advance is managed by a multidimensional data structure, and The column conversion step uses the nearest point search to select the second ratio pattern.
And selecting the second ratio pattern having the smallest Euclidean distance from the ratio pattern.

12. The voice compression encoding method according to claim 9, wherein the coefficient sequence conversion step has a minimum norm with the first ratio pattern when selecting the second ratio pattern. A speech compression encoding method, wherein the pattern having the second ratio is selected.

13. A / D conversion means for inputting an analog voice waveform and converting it into a digital voice waveform, voice coding means for coding the digital voice waveform by a predetermined coding method, Storage means for storing the digital voice waveform; voice decoding means for extracting and decoding the stored digital voice waveform; and D for converting the decoded digital voice waveform into an analog voice waveform.
/ A conversion means, wherein the speech encoding means divides the digital speech waveform into processing units called frames, and obtains a spectral envelope from each of the divided frames. Using spectral envelope information extracting means for extracting and encoding the represented spectral envelope information, subframe dividing means for dividing each of the divided frames into processing units called subframes, and using the extracted spectral envelope information, Pitch information extracting means for extracting and encoding pitch information from each of the divided subframes; gain information extracting means for extracting and encoding gain information from the extracted pitch information; Noise source information as sound source information from frame, pitch information and gain information Noise source information extracting means for extracting and encoding, wherein the speech decoding means restores the encoded spectrum envelope information, pitch information, gain information, and noise source information; Excitation source signal generation means for generating an excitation source signal from the obtained pitch information, gain information, and noise source information; and synthesized speech generation means for generating a synthesized speech from the excitation source signal and the restored spectral envelope information. The noise source extracting means for extracting a secondary error signal using the extracted spectral envelope information, subframe, pitch information, and gain information; A discrete cosine transform unit for performing a discrete cosine transform on the error signal; and a discrete cosine transform coefficient sequence obtained by the discrete cosine transform unit, which is transformed into a predetermined bit length. By including a coefficient sequence converting means, the speech compression encoding apparatus characterized by encoding by extracting the noise source information.

14. The audio compression encoding apparatus according to claim 13, wherein said coefficient sequence converting means obtains said discrete cosine transform coefficient sequence when converting said discrete cosine transform coefficient sequence into a predetermined bit length. After selecting a predetermined number of coefficients from the plurality of discrete cosine transform coefficients, the position and the coefficient value of the selected coefficient and the coefficient values of the predetermined number of coefficients located before and after the selected coefficient are determined by a predetermined number. An audio compression encoding apparatus characterized in that the discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding to a bit length.

15. A speech compression encoding apparatus according to claim 14, wherein said coefficient sequence converting means selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by said discrete cosine transform means. An audio compression encoding apparatus characterized by selecting a coefficient having a maximum absolute value of the coefficient value.

16. A speech compression encoding apparatus according to claim 14, wherein said coefficient sequence converting means selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by said discrete cosine transform means. An audio compression encoding apparatus, wherein a predetermined number of coefficients are selected from those having the largest absolute value of coefficient values.

17. A speech compression encoding apparatus according to claim 14, wherein said coefficient sequence converting means selects a predetermined number of coefficients from a plurality of discrete cosine transform coefficients obtained by said discrete cosine transform means. And a coefficient having a coefficient value exceeding a preset threshold value is selected.

18. The audio compression encoding apparatus according to claim 13, wherein said coefficient sequence converting means obtains said discrete cosine transform coefficient sequence when converting said discrete cosine transform coefficient sequence into a predetermined bit length. After selecting a coefficient having a coefficient value exceeding a predetermined threshold value from a plurality of discrete cosine transform coefficients, the position and the coefficient value of the selected coefficient are encoded into a predetermined bit length, thereby obtaining the discrete cosine transform coefficient. An audio compression encoding device for converting a sequence into a predetermined bit length.

19. The audio compression encoding apparatus according to claim 13, wherein said coefficient sequence converting means obtains said discrete cosine transform coefficient sequence when converting said discrete cosine transform coefficient sequence into a predetermined bit length. One coefficient is selected from a plurality of discrete cosine transform coefficients, and the position and coefficient value of the selected coefficient and the coefficient values of a predetermined number of coefficients positioned before and after the selected coefficient are encoded into a predetermined bit length. After that, one coefficient is selected from the plurality of discrete cosine transform coefficients excluding the coded coefficient, and a position and a coefficient value of the selected coefficient and a predetermined coefficient located before and after the selected coefficient are selected. Converting the discrete cosine transform coefficient sequence into a predetermined bit length by executing a step of encoding a coefficient value of a number coefficient and a predetermined bit length to a predetermined bit length. Audio compression encoding device.

20. A speech compression encoding apparatus according to claim 19, wherein said coefficient sequence converting means selects one coefficient from a plurality of discrete cosine transform coefficients obtained by said discrete cosine transform means. Selecting the coefficient having the largest absolute value of the coefficient when selecting the coefficient having the largest absolute value of the numerical value and selecting one coefficient from the plurality of discrete cosine transform coefficients excluding the encoded coefficient; An audio compression encoding device characterized by the following.

21. The audio compression encoding apparatus according to claim 13, wherein the coefficient sequence converting means obtains the discrete cosine transform coefficient sequence when converting the discrete cosine transform coefficient sequence into a predetermined bit length. A predetermined number of coefficients are selected from a plurality of discrete cosine transform coefficients, and a first ratio of a coefficient value of the selected coefficient to a coefficient value of a predetermined number of coefficients located before and after the selected coefficient is selected. After finding the pattern,
A pattern of the second ratio having the property closest to the pattern of the first ratio is selected from correspondence information between a plurality of patterns of the second ratio and codes prepared in advance, and a pattern of the selected second ratio is selected. An audio compression encoding device, wherein the discrete cosine transform coefficient sequence is converted into a predetermined bit length by encoding the pattern of the first ratio using a code corresponding to the pattern.

22. The audio compression encoding apparatus according to claim 21, wherein the coefficient sequence converting means selects the second ratio pattern such that the Euclidean distance from the first ratio pattern is the shortest. An audio compression encoding apparatus, wherein a pattern having a small second ratio is selected.

23. The audio compression encoding apparatus according to claim 22, wherein the correspondence information between the plurality of second ratio patterns and codes prepared in advance is managed by a multidimensional data structure, and When the column conversion means selects the pattern of the second ratio, the first conversion is performed by using a nearest point search.
A second ratio pattern having the smallest Euclidean distance from the second ratio pattern.

24. The audio compression encoding apparatus according to claim 21, wherein the coefficient sequence conversion means has a minimum norm with the first ratio pattern when selecting the second ratio pattern. An audio compression encoding apparatus, wherein the pattern having the second ratio is selected.