JP2003271199A

JP2003271199A - Audio signal encoding method and encoding device

Info

Publication number: JP2003271199A
Application number: JP2002072667A
Authority: JP
Inventors: Tomoyasu Komori; 智康小森; Kaoru Watanabe; 馨渡辺
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2002-03-15
Filing date: 2002-03-15
Publication date: 2003-09-25

Abstract

(57)【要約】【課題】効率的に符号化を行い、オーディオ符号化に
おいて、使用できるビット数が少ない場合にも検知され
る音質の劣化を軽減することができるオーディオ信号の
符号化方法及び符号化装置を提供することを目的とす
る。【解決手段】オーディオ信号の符号化方法において、
時間領域の信号を周波数領域の信号に変換する変換手順
と、前記変換ステップにて変換された周波数係数群を複
数の帯域で分割する分割手順と、ゲインと量子化値の積
で表現される周波数係数の値における前記ゲイン又は前
記量子化値をステップ状に制御する制御手順と、前記量
子化値を符号化する符号化手順とを有することにより上
記課題を解決する。 (57) Abstract: An audio signal encoding method capable of performing efficient encoding and reducing deterioration of detected sound quality even when the number of usable bits is small in audio encoding. It is an object to provide an encoding device. SOLUTION: In the encoding method of the audio signal,
A conversion procedure for converting a signal in the time domain into a signal in the frequency domain, a division procedure for dividing the frequency coefficient group converted in the conversion step into a plurality of bands, and a frequency expressed by a product of a gain and a quantization value. The above object is achieved by including a control procedure for controlling the gain or the quantization value in a coefficient value in a step-like manner and an encoding procedure for encoding the quantization value.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、オーディオ信号の
符号化方法及び符号化装置に係り、特に、効率的にオー
ディオ信号の符号化を行うためのオーディオ信号の符号
化方法及び符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal encoding method and an encoding apparatus, and more particularly to an audio signal encoding method and an encoding apparatus for efficiently encoding an audio signal.

【０００２】[0002]

【従来の技術】一般に、ＡＡＣ（Advanced Audio Codin
g）符号化等に代表されるオーディオ信号の変換符号化
では、時間領域の信号を周波数領域のＤＣＴ（Discrete
Cosine Transform）係数に変換する。また、符号化す
る場合にＤＣＴ係数の値をスケールファクターと呼ばれ
る量子化精度を制御する値（以下、スケールファクター
値という）と量子化値の積で表現し、ハフマン符号等の
符号語により符号化が行われる。ここで、量子化値と
は、ＤＣＴ係数を浮動小数点形式で表現した場合の仮数
値を指し、スケールファクター値は、指数値に対応する
値を表す。また、指数部自体をゲインと呼ぶ。ここで、
上述の関係を簡単な数式で表現すると、ＤＣＴ係数を
Ｋ、量子化値をＲ、スケールファクター値をＳ、ゲイン
をＧ^Ｓとすると、Ｋ＝Ｒ×Ｇ^Ｓとなる。2. Description of the Related Art Generally, AAC (Advanced Audio Codin
g) In transform coding of an audio signal typified by coding, a time domain signal is converted into a frequency domain DCT (Discrete
Cosine Transform) Convert to coefficient. Also, when encoding, the value of the DCT coefficient is expressed by the product of a value that controls the quantization accuracy called scale factor (hereinafter referred to as scale factor value) and the quantized value, and is encoded by a code word such as Huffman code. Is done. Here, the quantized value refers to a mantissa value when the DCT coefficient is represented in a floating point format, and the scale factor value represents a value corresponding to the exponent value. Further, the exponent part itself is called a gain. here,
If the above relationship is expressed by a simple mathematical expression, K = R × G ^S , where DCT coefficient is K, quantized value is R, scale factor value is S, and gain is G ^S.

【０００３】また、その他のオーディオ符号化方式とし
ては、国際標準機関であるＩＳＯ／ＩＥＣＪＴＣ１／
ＳＣ２９／ＷＧ１１で標準化されたＩＳＯ／ＩＥＣ１
３８１８（ＭＰＥＧ−２）がある。この標準方式は、符
号化されたビットストリーム（圧縮データ）の解釈とそ
の復号処理について規定しているだけであるため、符号
化でのビット割当てに関する処理については自由に行う
ことができる。Another audio encoding system is ISO / IEC JTC1 / which is an international standard organization.
ISO / IEC 1 standardized by SC29 / WG11
3818 (MPEG-2). Since this standard system only defines the interpretation of a coded bit stream (compressed data) and its decoding process, the process related to bit allocation in coding can be freely performed.

【０００４】ところで、オーディオ信号の符号化におい
て、符号化に使用できるビット数が不足している場合、
聴覚的に許された量子化ノイズよりも大きな量子化ノイ
ズが発生するビット配分が行われるため、周波数領域全
体の量子化歪みが大きくなり、符号化の劣化が検知され
ることがある。By the way, in encoding an audio signal, when the number of bits available for encoding is insufficient,
Bit allocation is performed in which quantization noise larger than that permitted perceptually is generated, so that quantization distortion in the entire frequency domain becomes large, and deterioration of coding may be detected.

【０００５】そこで、従来の技術では、必要な量子化ビ
ット数に足りない場合に各周波数係数群のゲインを決定
するスケールファクター値を１ずつ変化させて最適値を
求め、その値に基づいて符号化を行う符号化方法が提案
されている。Therefore, in the prior art, when the required number of quantization bits is insufficient, the scale factor value that determines the gain of each frequency coefficient group is changed by 1 to obtain an optimum value, and the code is coded based on that value. An encoding method for performing encoding has been proposed.

【０００６】上述の方法により、符号化に必要な量子化
ビット数を算出して割当てることができる。By the above method, the number of quantization bits required for encoding can be calculated and assigned.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上述の
量子化ビットの割当て方法は、スケールファクター値を
１ずつ変化させる度に量子化値、及び必要なビット数を
再計算するため、計算量が膨大になってしまう。また、
符号語に変換際の効率的なビット配分という点に関して
考慮されていないので音質劣化の一因となっている。However, in the above-mentioned method of assigning quantized bits, the quantized value and the required number of bits are recalculated each time the scale factor value is changed by 1, so that the calculation amount is enormous. Become. Also,
This is one of the causes of sound quality deterioration because no consideration is given to efficient bit allocation when converting to codewords.

【０００８】また、これまでは、割当ての決定をフレー
ム全体のビット数で判断し、特に大きな量子化値に割当
てるビット数の効率は考慮されていなかった。そのた
め、帯域によってはビット数の過不足で音質が劣化して
しまうことがあった。Up to now, the efficiency of the number of bits to be assigned to a large quantized value has not been taken into consideration by making the determination of the assignment based on the number of bits of the entire frame. Therefore, depending on the band, the sound quality may be deteriorated due to the excess or deficiency of the number of bits.

【０００９】本発明は、上述した問題に鑑みなされたも
のであり、効率的に符号化を行いオーディオ符号化にお
いて、使用できるビット数が少ない場合にも検知される
音質の劣化を軽減することができるオーディオ信号の符
号化方法及び符号化装置を提供することを目的とする。The present invention has been made in view of the above-mentioned problems, and it is possible to reduce the deterioration of the sound quality that is detected even when the number of bits that can be used is small in audio encoding by performing efficient encoding. An object of the present invention is to provide an audio signal encoding method and an audio signal encoding apparatus which can be performed.

【００１０】[0010]

【課題を解決するための手段】上記課題を解決するため
に、本件発明は、以下の特徴を有する課題を解決するた
めの手段を採用している。In order to solve the above problems, the present invention employs means for solving the problems having the following features.

【００１１】請求項１に記載された発明は、オーディオ
信号の符号化方法において、時間領域の信号を周波数領
域の信号に変換する変換手順と、前記変換ステップにて
変換された周波数係数群を複数の帯域で分割する分割手
順と、ゲインと量子化値の積で表現される周波数係数の
値における前記ゲイン又は前記量子化値をステップ状に
制御する制御手順と、前記量子化値を符号化する符号化
手順とを有することを特徴とする。According to a first aspect of the invention, in the audio signal encoding method, a plurality of conversion procedures for converting a time domain signal into a frequency domain signal and a plurality of frequency coefficient groups converted in the conversion step are provided. , A division procedure for dividing in the band, a control procedure for stepwise controlling the gain or the quantized value in the value of the frequency coefficient represented by the product of the gain and the quantized value, and encoding the quantized value. And an encoding procedure.

【００１２】請求項１記載の発明によれば、ゲイン又は
量子化値をステップ状に制御することにより、ビット数
を削減することができる。また、削減できたビットは、
音質にとって重要な部分に割当てることが可能であるた
め、相対的に聴覚的な音質を向上させることができる。According to the first aspect of the invention, the number of bits can be reduced by controlling the gain or the quantized value stepwise. Also, the bits that have been reduced are
Since it can be assigned to a portion that is important for sound quality, it is possible to improve a relatively audible sound quality.

【００１３】請求項２に記載された発明は、前記ゲイン
をスケールファクターバンド毎に設定し、当該スケール
ファクターバンド内で最大の周波数係数の符号語が最大
の量子化値になるように符号化を行うことを特徴とす
る。According to a second aspect of the present invention, the gain is set for each scale factor band, and encoding is performed so that the code word of the maximum frequency coefficient in the scale factor band has the maximum quantization value. It is characterized by performing.

【００１４】請求項２記載の発明によれば、ゲインの値
をハフマン符号化等の符号語に基づいて量子化値を決定
するため、符号語の長さを短くすることができ、結果と
してビット数を削減することができる。According to the second aspect of the present invention, since the quantized value of the gain value is determined based on the code word of Huffman coding or the like, the length of the code word can be shortened and, as a result, the bit value can be reduced. The number can be reduced.

【００１５】請求項３に記載された発明は、前記ゲイン
をスケールファクターバンド毎に設定し、当該スケール
ファクターバンド内で最大の周波数係数の符号語の量子
化値を１又は任意の整数とすることを特徴とする。According to a third aspect of the present invention, the gain is set for each scale factor band, and the quantization value of the code word of the maximum frequency coefficient in the scale factor band is set to 1 or any integer. Is characterized by.

【００１６】請求項３記載の発明によれば、スケールフ
ァクターバンド毎の音質を判断して必要な量子化値を設
定することで、符号化の制御を容易に行うことができ
る。According to the third aspect of the present invention, it is possible to easily control the encoding by determining the sound quality for each scale factor band and setting the required quantization value.

【００１７】請求項４に記載された発明は、オーディオ
信号にＭＳステレオを適用した場合、前記スケールファ
クターバンドにおけるＭ成分とＳ成分のエナジー、若し
くは最大周波数係数の大きさ、又は聴覚エントロピーを
用いて、スケールファクターバンド内のＭ成分とＳ成分
夫々の最大周波数係数として異なる量子化値を用いるこ
とを特徴とする。According to a fourth aspect of the present invention, when MS stereo is applied to an audio signal, the energy of the M and S components in the scale factor band, or the magnitude of the maximum frequency coefficient, or the auditory entropy is used. , And different quantized values are used as the maximum frequency coefficients of the M component and the S component in the scale factor band.

【００１８】請求項４記載の発明によれば、ＭＳステレ
オを用いて、Ｍ成分、Ｓ成分毎にビット数を別々に割り
振ることができ、ビット数を減らした符号化を行うこと
ができる。According to the invention described in claim 4, the number of bits can be separately allocated to each of the M component and the S component by using the MS stereo, and the encoding with the reduced number of bits can be performed.

【００１９】請求項５に記載された発明は、オーディオ
信号の符号化装置において、時間領域の信号を周波数領
域の信号に変換する変換手段と、前記変換ステップにて
変換された周波数係数群を複数の帯域で分割する分割手
段と、ゲインと量子化値の積で表現される周波数係数の
値における前記ゲイン又は前記量子化値をステップ状に
制御する制御手段と、前記量子化値を符号化する符号化
手段とを有することを特徴とする。According to a fifth aspect of the present invention, in the audio signal encoding device, a plurality of converting means for converting a time domain signal into a frequency domain signal and a plurality of frequency coefficient groups converted in the converting step are provided. Division means for dividing in the band of, the control means for controlling the gain or the quantized value in the value of the frequency coefficient represented by the product of the gain and the quantized value in steps, and encoding the quantized value. And an encoding unit.

【００２０】請求項５記載の発明によれば、ゲイン又は
量子化値をステップ状に制御することにより、ビット数
を削減することができる。また、削減できたビットは、
音質にとって重要な部分に割当てることが可能であるた
め、相対的に聴覚的な音質を向上させることができる。According to the fifth aspect of the invention, the number of bits can be reduced by controlling the gain or the quantized value stepwise. Also, the bits that have been reduced are
Since it can be assigned to a portion that is important for sound quality, it is possible to improve a relatively audible sound quality.

【００２１】請求項６に記載された発明は、前記制御手
段は、前記ゲインをスケールファクターバンド毎に設定
し、前記符号化手段は、当該スケールファクターバンド
内で最大の周波数係数の符号語が最大の量子化値になる
ように符号化を行うことを特徴とする。According to a sixth aspect of the present invention, the control means sets the gain for each scale factor band, and the encoding means has a maximum code word of a frequency coefficient within the scale factor band. The encoding is performed so that the quantized value of

【００２２】請求項６記載の発明によれば、ゲインの値
をハフマン符号化等の符号語に基づいて量子化値を決定
するため、符号語の長さを短くすることができ、結果と
してビット数を削減することができる。According to the invention of claim 6, since the quantized value of the gain value is determined based on the code word of Huffman coding or the like, the length of the code word can be shortened, and as a result, the bit value can be reduced. The number can be reduced.

【００２３】請求項７に記載された発明は、前記制御手
段は、前記ゲインをスケールファクターバンド毎に設定
し、前記符号化手段は、当該スケールファクターバンド
内で最大の周波数係数の符号語の量子化値を１又は任意
の整数とすることを特徴とする。In a seventh aspect of the present invention, the control means sets the gain for each scale factor band, and the encoding means quantizes a code word having a maximum frequency coefficient in the scale factor band. The characteristic value is 1 or an arbitrary integer.

【００２４】請求項７記載の発明によれば、スケールフ
ァクターバンド毎の音質を判断して必要な量子化値を設
定することで、符号化の制御を容易に行うことができ
る。According to the seventh aspect of the invention, it is possible to easily control the coding by determining the sound quality for each scale factor band and setting the required quantization value.

【００２５】請求項８に記載された発明は、オーディオ
信号にＭＳステレオを適用した場合、前記スケールファ
クターバンドにおけるＭ成分とＳ成分のエナジー、若し
くは最大周波数係数の大きさ、又は聴覚エントロピーを
用いて、スケールファクターバンド内のＭ成分とＳ成分
夫々の最大周波数係数として異なる量子化値を用いるこ
とを特徴とする。In the invention described in claim 8, when MS stereo is applied to an audio signal, the energy of the M component and the S component in the scale factor band, or the magnitude of the maximum frequency coefficient, or the auditory entropy is used. , And different quantized values are used as the maximum frequency coefficients of the M component and the S component in the scale factor band.

【００２６】請求項８記載の発明によれば、ＭＳステレ
オを用いて、Ｍ成分、Ｓ成分毎にビット数を別々に割り
振ることができ、ビット数を減らした符号化を行うこと
ができる。According to the invention described in claim 8, the number of bits can be separately allocated to each of the M component and the S component by using the MS stereo, and the encoding with the reduced number of bits can be performed.

【００２７】請求項９に記載された発明は、符号化方法
が異なる複数の符号化手段と、スケールファクターバン
ド毎の必要なビットレート数に基づいて、前記複数の符
号化手段の中から１つの符号化手段を評価・選択する評
価・選択手段を有することを特徴する。According to a ninth aspect of the present invention, one of the plurality of encoding means is selected based on a plurality of encoding means having different encoding methods and the required number of bit rates for each scale factor band. It is characterized by having an evaluation / selection means for evaluating / selecting the encoding means.

【００２８】請求項９記載の発明によれば、ビット数を
最小に減らすことができる符号化手段を用いることで、
効率的に符号化を行うことができる。According to the ninth aspect of the invention, by using the encoding means capable of reducing the number of bits to a minimum,
Encoding can be performed efficiently.

【００２９】[0029]

【発明の実施の形態】本発明は、ＤＣＴ変換により、４
９個のスケールファクターバンドから出力される周波数
係数の量子化精度を制御してビット数を減らし、効率的
にハフマン符号等による符号化を行い、オーディオ符号
化において使用できるビット数が少ない場合でも検知さ
れる音質の劣化を軽減させることを主眼とする。BEST MODE FOR CARRYING OUT THE INVENTION The present invention uses a DCT transformation to
Controls the quantization accuracy of the frequency coefficients output from the 9 scale factor bands to reduce the number of bits, efficiently encodes using Huffman code, and detects even when the number of bits that can be used in audio encoding is small. The main purpose is to reduce the deterioration of the sound quality.

【００３０】また、各帯域周波数群の量子化値の値をハ
フマン符号に代表される符号語への変換を行う際、その
周波数帯域（バンド）内の最大量子化値によって異なる
符号語のテーブル或いは数式を使用するという特徴を利
用する。例えば、ＡＡＣの場合は、１１のハフマン符号
語テーブルを持っているが、夫々のテーブルの量子化値
の最大値は飛び飛びの値（例えば、１６，１２，７，
４，２，１）となっている。この最大値を考慮してビッ
ト数を割当てることで、効率的な符号化を行う。When converting the quantized value of each band frequency group into a code word represented by a Huffman code, a table of code words different depending on the maximum quantized value in the frequency band (band) or Utilizes the feature of using mathematical formulas. For example, AAC has 11 Huffman codeword tables, but the maximum quantized value of each table is a discrete value (for example, 16, 12, 7,
4, 2, 1). By assigning the number of bits in consideration of this maximum value, efficient encoding is performed.

【００３１】ここで、上述した１１種類のハフマン符号
語テーブルの一例を図１に示す。図１に示すテーブル
は、０から８０まで（８１種類）の「ｉｎｄｅｘ」と、
「ｌｅｎｇｔｈ」と、「ｃｏｄｅｗｏｒｄ（１６進
数）」とで構成されている。「ｉｎｄｅｘ」に対応する
「ｌｅｎｇｔｈ」と「ｃｏｄｅｗｏｒｄ」を用いてステ
ップ毎にビット数を調整することで、ビット数を減少さ
せて伝送効率のよいオーディオ信号を生成することがで
きる。Here, an example of the 11 kinds of Huffman code word tables described above is shown in FIG. The table shown in FIG. 1 has 0 to 80 (81 types) of "index",
It is composed of "length" and "codeword (hexadecimal number)". By adjusting the number of bits for each step using “length” and “codeword” corresponding to “index”, it is possible to reduce the number of bits and generate an audio signal with good transmission efficiency.

【００３２】例えば、符号化ビットが不足している場合
を考えた場合に、あるスケールファクターでの最大量子
化値が、例えば「１２」であったとする。この時スケー
ルファクター値を１ずつ変化させるのでなく、最大量子
化値が「７」となるようにすることにより、量子化誤差
は大きくなるかわりに、符号化される周波数係数を減ら
すことができる。また、最大量子化係数が小さくなった
場合、それぞれのハフマン符号に割当てられる符号語の
長さも短くなることが期待され、結果としてビット数を
削減する効果が期待できる。こうした手法により，削減
できたビットは音質にとって重要な部分に割当てること
も可態であるため相対的に聴覚的な音質を向上させるこ
とができる。For example, when considering the case where the number of encoded bits is insufficient, it is assumed that the maximum quantized value at a certain scale factor is, for example, "12". At this time, the maximum quantization value is set to "7" instead of changing the scale factor value by one, so that the frequency error to be encoded can be reduced although the quantization error increases. Further, when the maximum quantization coefficient becomes small, it is expected that the length of the code word assigned to each Huffman code also becomes short, and as a result, the effect of reducing the number of bits can be expected. By such a method, it is possible to allocate the reduced bits to the important parts for the sound quality, so that the relatively perceptual sound quality can be improved.

【００３３】また、スケールファクター値を1ずつ変化
させる手法と比較すると、スケールファクター値を大き
く変化させる場合が多いことから計算量を減らすことが
できる。上述した内容により、音質を劣化させずにビッ
ト数を削減することができる。Further, as compared with the method of changing the scale factor value by one, the amount of calculation can be reduced since the scale factor value is often changed greatly. With the contents described above, the number of bits can be reduced without deteriorating the sound quality.

【００３４】次に、本発明における実施の形態につい
て、図面に基づいて説明する。なお、実施例では、２０
４８サンプルを１０２４本のＤＣＴ係数に変換した例で
説明する。Next, embodiments of the present invention will be described with reference to the drawings. In the example, 20
An example in which 48 samples are converted into 1024 DCT coefficients will be described.

【００３５】図２は、本発明における符号化装置の構成
を示すブロック図の一例である。FIG. 2 is an example of a block diagram showing the configuration of the encoding apparatus according to the present invention.

【００３６】図２の符号化装置は、聴覚モデル１１と、
フィルターバンク１２と、スケールファクター１３と、
量子化器１４と、ノイズレスコーディング１５と、レー
ト／歪みコントローラ１６と、ステップ制御コントロー
ラ１７と、ビットストリームマルチプレクサ１８とを有
するように構成されている。The encoding apparatus shown in FIG. 2 includes an auditory model 11,
Filter bank 12, scale factor 13,
It is configured to have a quantizer 14, noiseless coding 15, a rate / distortion controller 16, a step control controller 17, and a bitstream multiplexer 18.

【００３７】図１において、聴覚モデル１１は、入力さ
れる音声信号の量子化雑音のマスキングパターンの計算
を行う。つまり、音声信号の聴覚的なマスキングスレシ
ョルドの計算を行う。In FIG. 1, the auditory model 11 calculates a masking pattern of quantization noise of an input voice signal. That is, the auditory masking threshold of the audio signal is calculated.

【００３８】更に具体的に説明すれば、後述するフィル
ターバンクでのＤＣＴの分析位置と一致するように、入
力される音声信号をＦＦＴ（Fast Fourier Transform）
を用いて分析し、音声信号がマスキングできる最大のノ
イズ量（スレッシュホールド）を計算して、スケールフ
ァクターバンド毎のSignal to Mask比とスレッシュホー
ルド値を出力する。また、ロング、スタート、ストッ
プ、ショートのどのブロックタイプを選択するかの結果
を出力する。なお、上述した聴覚モデルの詳細は、ＩＳ
Ｏ／ＩＥＣ１３８１７−７ＡＮＮＥＸＢ２章ＥＮＣ
ＯＲＤＥＲ２．１ＰｓｙｃｈｏａｃｏｕｓｔｉｃＭｏ
ｄｅｌに記載されている。More specifically, the input audio signal is FFT (Fast Fourier Transform) so as to coincide with the DCT analysis position in the filter bank described later.
The maximum amount of noise (threshold) that can be masked by the voice signal is calculated by using, and the Signal to Mask ratio and threshold value for each scale factor band are output. It also outputs the result of whether to select the long, start, stop, or short block type. For details of the above-mentioned auditory model, see IS
O / IEC 13817-7 ANNEX B2 Chapter ENC
ORDER 2.1 Psychoacoustic Mo
It is described in del.

【００３９】フィルターバンク１２は、入力される音声
信号の時間領域の信号をＦＦＴ変換或いは、ＤＣＴ変換
等を使用して周波数領域の信号に変換する。The filter bank 12 transforms the input voice signal in the time domain into a signal in the frequency domain using FFT transformation or DCT transformation.

【００４０】ここで、変換方法について更に具体的に説
明すると、ＡＡＣにおける符号化の基本処理は、エンコ
ーダにおいて、時間領域の信号を周波数領域の信号に変
換する。また、逆にデコーダにおいて、周波数領域の信
号を時間領域の信号に変換することである。これをＭＤ
ＣＴ（Modified Discrete Cosine Transform）及びＩＭ
ＤＣＴ（Inverse Modified Discrete Cosine Transfor
m）によって実行している。また、ＭＤＣＴ、ＩＭＤＣ
Ｔにはブロック歪みを減少させるためにＴＤＡＣ（時間
領域折り返し歪み除去技術）を利用する。The conversion method will be described more specifically. In the basic processing of encoding in AAC, an encoder converts a time domain signal into a frequency domain signal. On the contrary, in the decoder, the signal in the frequency domain is converted into the signal in the time domain. MD this
CT (Modified Discrete Cosine Transform) and IM
DCT (Inverse Modified Discrete Cosine Transfor
m) running by. In addition, MDCT, IMDC
For T, TDAC (time domain aliasing distortion removal technique) is used to reduce block distortion.

【００４１】なお、詳細は、ＩＳＯ／ＩＥＣ１３８１
７−７ＡＮＮＥＸＢ２章ＥＮＣＯＲＤＥＲ２．３Ｆ
ｉｌｔｅｒｂａｎｋａｎｄｂｌｏｃｋｓｗｉｔｃｈ
ｉｎｇに記載されている。For details, refer to ISO / IEC 1381.
7-7 ANNEX B2 Chapter ENCORDER 2.3 F
ilterbank and block switch
It is described in ing.

【００４２】スケールファクター１３は、周波数係数を
変換するゲインを表現するスケールファクター値を作成
する。The scale factor 13 creates a scale factor value expressing a gain for converting a frequency coefficient.

【００４３】ここで、スケールファクター値を示した一
例の数式を式（１）に示す。Here, an example of the formula showing the scale factor value is shown in the formula (1).

【００４４】[0044]

【数１】なお、ｓｃｆ[ｓｂ]は、ｓｂ番目のスケールファクター
バンドのスケールファクター値であり、この値を量子化
器１４に出力する。また、ＱＵＡＮＴ_ＳＴＥＰは、ス
テップ制御コントローラ１７から入力される量子化ステ
ップ値である。また、ｃｏｍｍｏｎ_ｓｃｆは補正項で
あり、例えば、１００がセットされる。式（１）内のｐ
ｏｗ_ｓｐｅｃｔｒｕｍ[ｓｂ]は、フィルターバンク１
２より入力される値ｍｄｃｔ[ｓｂ]を用いて、式（２）
のように計算する。[Equation 1] Note that scf [sb] is the scale factor value of the sbth scale factor band, and this value is output to the quantizer 14. QUANT_STEP is a quantization step value input from the step controller 17. Also, common_scf is a correction term, and for example, 100 is set. P in equation (1)
ow_spectrum [sb] is the filter bank 1
Using the value mdct [sb] input from 2, the formula (2)
Calculate as.

【００４５】[0045]

【数２】なお、式（２）のｍｄｃｔ[ｓｂ]は、ｓｂ番目のスケー
ルファクターバンド内の任意のｍｄｃｔ係数であり、例
えば、最大のｍｄｃｔ値が設定される。[Equation 2] Note that mdct [sb] in the equation (2) is an arbitrary mdct coefficient in the sbth scale factor band, and for example, the maximum mdct value is set.

【００４６】量子化器１４は、周波数係数を量子化値に
変換する。なお、変換手段は、詳細は、ＩＳＯ／ＩＥＣ
１３８１７−７ＡＮＮＥＸＢ２章ＥＮＣＯＲＤＥＲ
２．７Ｑｕａｎｔｉｚａｔｉｏｎに記載されている。
具体的には、式（３）、式（４）に示すような数式で表
現される。The quantizer 14 converts the frequency coefficient into a quantized value. The details of the conversion means are ISO / IEC.
13817-7 ANNEX B2 Chapter ENCORDER
2.7 Quantization.
Specifically, it is expressed by a mathematical expression as shown in Expressions (3) and (4).

【００４７】[0047]

【数３】ここで、x_ｑｕａｎｔ[ｉ]は、ｉ番目のインデックス
（図１の「ｉｎｄｅｘ」）を持つ周波数係数の量子化値
であり、ノイズレスコーディング１５に出力される。ま
た、ｍｄｃｔ＿ｌｉｎｅ[ｉ]は、フィルターバンクでＤ
ＣＴ変換されたｉ番目のインデックスをもつ係数であ
り、スケールファクター１３より入力される。また、Ｍ
ＡＧＩＣ＿ＮＵＭＢＥＲには、一般値として０．４０５
４（固定値）がセットされる。[Equation 3] Here, x_quant [i] is the quantized value of the frequency coefficient having the i-th index (“index” in FIG. 1) and is output to the noiseless coding 15. Also, mdct_line [i] is a filter bank D
The coefficient is the CT-transformed i-th index and is input from the scale factor 13. Also, M
0.405 as a general value for AGIC_NUMBER
4 (fixed value) is set.

【００４８】ノイズレスコーディング１５は、ハフマン
符号化等、スケールファクター値、量子化値を符号語に
変換する。詳細は、ＩＳＯ／ＩＥＣ１３８１７−７Ａ
ＮＮＥＸＢ２章ＥＮＣＯＲＤＥＲ２．８Ｎｏｉｓｅ
ｌｅｓｓＣｏｄｉｎｇに記載されている。また、参照
までに４つの係数をハフマン符号語に変換する一例の数
式の手順をフローチャートを用いて説明する。なお、こ
こで、使用するハフマンコード表は、図１を用いるが、
ハフマンコード表は、発明の範囲においてこの限りでは
ない。The noiseless coding 15 converts scale factor values and quantized values into codewords such as Huffman coding. For details, see ISO / IEC 13817-7A.
NNEX B2 Chapter ENCORDER 2.8 Noise
It is described in less Coding. In addition, a procedure of an example of a mathematical expression for converting four coefficients into a Huffman code word will be described with reference to a flowchart before reference. The Huffman code table used here is as shown in FIG.
The Huffman code table is not limited to this within the scope of the invention.

【００４９】図３は、ハフマン符号語に変換する処理の
流れをプログラム的に示す一例の図である。FIG. 3 is a diagram showing an example of the program flow of the process of converting to a Huffman code word.

【００５０】図３において、スケールファクターバンド
毎に繰り返し処理を行う（Ｓ１）。まず、初期値をセッ
トして（Ｓ２）、ｏｆｆｓｅｔ(ｓｂ)からｔｏｐ(ｓｂ)
になるまで、以下のＳ３からＳ８まで処理を行う（Ｓ
３）。また、４つの係数毎に処理を行うので、Ｓ３にお
ける増加分は＋４となる。また、ｏｆｆｓｅｔ(ｓｂ)と
は、各スケールファクターバンドの下限のＤＣＴ係数の
インデックス（ｉ）を表し、ｔｏｐ(ｓｂ)はｓｂバンド
の上限のＤＣＴ係数のインデックスを表す。なお、上述
したｓｂとｏｆｆｓｅｔ(ｓｂ)とｔｏｐ(ｓｂ)のＤＣＴ
係数のバンド分けの一例を図４に示す。図４に示すよう
に、１つのｓｂに対して、４つのＤＣＴ係数が割り振ら
れている（０〜１０２３の計１０２４本）。In FIG. 3, iterative processing is performed for each scale factor band (S1). First, the initial value is set (S2), and from offset (sb) to top (sb)
The following processes from S3 to S8 are performed until
3). Further, since the process is performed for each of the four coefficients, the increment in S3 is +4. Further, offset (sb) represents the index (i) of the lower limit DCT coefficient of each scale factor band, and top (sb) represents the index of the upper limit DCT coefficient of the sb band. The DCT of sb, offset (sb) and top (sb) described above
An example of band division of coefficients is shown in FIG. As shown in FIG. 4, four DCT coefficients are assigned to one sb (a total of 1024 0 to 1023).

【００５１】次に、図１を参照するためのインデックス
を計算する（Ｓ４）。ここで、x＿ｑｕａｎｔ[ｉ]とは
ｉ番目の係数の量子化値であり、量子化器１４から入力
される。Ｓ４にて計算されたｉｎｄｅｘ値は、図１を参
照して、ｃｏｄｅｗｏｒｄ（ハフマン符号）とｔｍｐを
抽出する（Ｓ５、Ｓ６）。例えば、Ｓ４にてｉｎｄｅｘ
が「１０」とすると、ｃｏｄｅｗｏｒｄには「７２」が
セットされｔｍｐ「７」がセットされる。また、量子化
値を符号化するのに必要なビット数の計算を行う（Ｓ
７）。Ｓ７の出力は、レート／歪みコントローラ１６へ
出力する。これを、スケールファクターバンド毎（ステ
ップ毎）に分割した全ての帯域で繰り返し行うことによ
り、全ての係数をハフマン符号語に変換することができ
る（Ｓ８、Ｓ９）。Next, the index for referring to FIG. 1 is calculated (S4). Here, x_quant [i] is the quantized value of the i-th coefficient, and is input from the quantizer 14. For the index value calculated in S4, the codeword (Huffman code) and tmp are extracted with reference to FIG. 1 (S5, S6). For example, in S4, index
Is "10", the codeword is set to "72" and the tmp is set to "7". In addition, the number of bits required to encode the quantized value is calculated (S
7). The output of S7 is output to the rate / distortion controller 16. By repeating this for all bands divided for each scale factor band (for each step), all the coefficients can be converted into Huffman code words (S8, S9).

【００５２】次に、レート／歪みコントローラ１６と、
ステップ制御コントローラ１７の動作について、数式の
フローチャートを用いて説明を行う。Next, the rate / distortion controller 16 and
The operation of the step controller 17 will be described with reference to a flowchart of mathematical expressions.

【００５３】図５は、レート／歪みコントローラ及びス
テップ制御コントローラの動作の一例を示すフローチャ
ートである。FIG. 5 is a flow chart showing an example of the operation of the rate / distortion controller and the step controller.

【００５４】図５において、ステップ制御コントローラ
１７に設定してある値に基づいて、レート／歪みコント
ローラ１６の動作を行う（Ｓ１１）。なお、ステップ制
御コントローラに設定される値は、量子化値の最大値を
設定することが好ましい。なお、本発明では、設定した
値を｛１６，１２，７，４，２，１｝としたが、設定す
る値については、この限りではない。In FIG. 5, the rate / distortion controller 16 operates based on the value set in the step controller 17 (S11). The value set in the step controller is preferably the maximum value of the quantized value. In the present invention, the set value is set to {16,12,7,4,2,1}, but the set value is not limited to this.

【００５５】まず、フィルターバンクでＤＣＴ変換され
たｉ番目のインデックスをもつ係数(ｍｄｃｔ＿ｌｉｎ
ｅ[ｉ]と聴覚モデルから入力される許容される量子化ノ
イズの大きさ（ａｌｌｗｅｄ＿ｄｉｓｔ(ｓｂ)）と比較
を行い（Ｓ１３）、量子化ノイズの方が大きかった場
合、ｍｄｃｔ＿ｌｉｎｅ[ｉ]に０をセットする（Ｓ１
４）。これを、インデックスが最大になるまで、繰り返
し行う（Ｓ１２、Ｓ１５）。First, the coefficient (mdct_lin) having the i-th index that has been DCT-transformed by the filter bank
e (i) is compared with the size of the allowable quantization noise (allwed_dist (sb)) input from the auditory model (S13), and if the quantization noise is larger, 0 is set in mdct_line [i]. Set (S1
4). This is repeated until the index becomes maximum (S12, S15).

【００５６】次に、レート／歪みの制御を行う。まず、
Ｓ１１のＳＴＥＰ＿ＭＡＴテーブルの添え字ｊを０（初
期化）にする（Ｓ１６）。次に、スケールファクターバ
ンド内で使用できるビット数（ａｖｅｒａｇｅ＿ｂｉｔ
ｓ）を、ビット数のカウント（ｂｉｔ＿ｃｏｕｎｔ()）
が超えないで処理ができるかの判断をＳ１７からＳ２３
までの処理を繰り返し行うことで確認する（Ｓ１７）。
なお、ａｖｅｒａｇｅ＿ｂｉｔｓは、予め設定してお
き、ｂｉｔ＿ｃｏｕｎｔ()は、ノイズレスコーディング
１５で計算されたビット数の総和であり、スケールファ
クター又はハフマンテーブルのコードブック番号等、伝
送のために必要なビット数も含む。Next, rate / distortion control is performed. First,
The subscript j in the STEP_MAT table in S11 is set to 0 (initialization) (S16). Next, the number of bits that can be used in the scale factor band (average_bit)
s) is the number of bits (bit_count ())
Is determined from S17 to S23.
This is confirmed by repeating the above process (S17).
It should be noted that average_bits is set in advance, and bit_count () is the sum of the number of bits calculated by the noiseless coding 15, and the number of bits required for transmission such as the scale factor or the codebook number of the Huffman table is also included. Including.

【００５７】まず、ＱＵＡＮＴ＿ＳＴＥＰに量子化値の
最大値をセットする（Ｓ１８）。なお、Ｓ１１の設定値
の大きい値からセットする。次に、スケールファクター
バンド毎に処理を行う（Ｓ１９）。まず、スケールファ
クター値（ｃａｌｃ＿ｓｃａｌｅ()）の計算を行う（Ｓ
２０）。計算されたスケールファクター値はスケールフ
ァクター１３に出力される。また、Ｓ２１では量子化値
（ｃａｌｃ＿ｑｕａｎｔ()）の計算を行い、量子化器１
４に出力される。First, the maximum quantized value is set in QUANT_STEP (S18). It should be noted that the setting value is set from a larger value in S11. Next, processing is performed for each scale factor band (S19). First, the scale factor value (calc_scale ()) is calculated (S
20). The calculated scale factor value is output to the scale factor 13. Further, in S21, the quantization value (calc_quant ()) is calculated, and the quantizer 1
4 is output.

【００５８】これを、スケールファクターバンド毎に行
い（Ｓ２２）、ビットカウント（ｂｉｔ＿ｃｏｕｎ
ｔ()）がａｖｅｒａｇｅ＿ｂｉｔｓを超なくなるまで、
繰り返し行う（Ｓ２３）。This is performed for each scale factor band (S22), and the bit count (bit_count) is calculated.
until t ()) no longer exceeds average_bits,
Repeatedly (S23).

【００５９】もし、ａｖｅｒａｇｅ＿ｂｉｔｓを超える
ことがあれば、必要なビット数を得ることができず、符
号化ができない場合や符号化した際に音質が歪んでしま
う等の問題が発生してしまうため、その場合は、ステッ
プ制御コントローラ内の量子化値の値を低く設定して再
度処理を行う。If the average_bits is exceeded, the required number of bits cannot be obtained, and there arises a problem that encoding cannot be performed or sound quality is distorted when encoded. In that case, the value of the quantized value in the step controller is set low and the process is performed again.

【００６０】つまり、Ｓ１７からＳ２３の処理でＳ１７
の条件に満たなければ、次の値（例えば、１６がセット
されて条件に合わなければ１２）がセットされ、再度Ｓ
１７からＳ２３までの処理を行う。このようにして、処
理を行い条件が最初にあったＱＵＡＮＴ＿ＳＴＥＰが最
適値ということになり、この値を用いて符号化すること
で、必要なビット数で最高の音質を得ることができる。That is, in the processing from S17 to S23, S17
If the condition is not satisfied, the next value (for example, 16 is set and 12 is set if the condition is not satisfied) is set, and S is again set.
The processing from 17 to S23 is performed. In this way, the QUANT_STEP having the first condition under the processing is the optimum value, and by encoding using this value, the highest sound quality can be obtained with the required number of bits.

【００６１】図５で示したフローチャートにより、ゲイ
ン又は前記量子化値の最適値を求めることができる。The gain or the optimum value of the quantized value can be obtained by the flowchart shown in FIG.

【００６２】次に、ビットストリームマルチプレクサ１
８で、入力された符号化データ及びスケールファクター
又はハフマンテーブルのコードブック番号等の制御情報
をビットストリームに変換する。これにより、オーディ
オ信号の符号化を効率よく行うことができる。なお、ビ
ットストリームマルチプレクサ１８の詳細については、
ＩＳＯ／ＩＥＣ１３８１７−７１章ｓｙｎｔａｘに記
載されている。つまり、符号化装置の各ブロックで出力
されたパラメータをｓｙｎｔａｘで詳解された形式に並
べ変えて出力する。Next, the bit stream multiplexer 1
At 8, the input coded data and the control information such as the scale factor or the codebook number of the Huffman table are converted into a bitstream. As a result, it is possible to efficiently encode the audio signal. For details of the bitstream multiplexer 18,
It is described in ISO / IEC13817-7 Chapter 1, syntax. That is, the parameters output from each block of the encoding device are rearranged into the format detailed in syntax and output.

【００６３】また、符号化装置の各ブロックの動作内容
はこの限りではなく、また、同ブロックに関して違う数
式による計算をさせてもよい。The operation content of each block of the encoding device is not limited to this, and the calculation may be performed by different mathematical expressions for the same block.

【００６４】例えば、式（１）を下記式（５）に変更す
ることにより、１又は複数のスケールファクターバンド
群で最大の周波数係数値に基づいて、スケールファクタ
ー値を導出してもよい。For example, the scale factor value may be derived based on the maximum frequency coefficient value in one or a plurality of scale factor band groups by changing the formula (1) into the following formula (5).

【００６５】[0065]

【数４】なお、ｍａｘ＿ｐｏｗ[ｓｂｍ，ｓｂｎ]は式（６）で計
算することができる。[Equation 4] Note that max_pow [sbm, sbn] can be calculated by equation (6).

【００６６】[0066]

【数５】また、ｍａｘ＿ｍｄｃｔ[ｓｂｍ，ｓｂｎ]は、ｍからｎ
番目のスケールファクターバンド内で最大のＭＤＣＴ係
数である（ｍ、ｎ：整数）。[Equation 5] Also, max_mdct [sbm, sbn] is from m to n
The largest MDCT coefficient in the second scale factor band (m, n: integer).

【００６７】更に、式（１）を下記式（７）に変更する
ことにより、任意のスケールファクターバンドのスケー
ルファクター値を各スケールファクターバンドで最大の
周波数係数値を用いて、更に分子を１或いは任意の整数
値として導出することもできる。Further, by changing the expression (1) to the following expression (7), the scale factor value of an arbitrary scale factor band is used as the maximum frequency coefficient value in each scale factor band, and the numerator is further set to 1 or It can also be derived as an arbitrary integer value.

【００６８】[0068]

【数６】なお、式（７）のｋの値を１或いは、ｎ＜ＱＵＡＮＴ_
ＳＴＥＰを満たす正の整数とする。また、ｍａｘ＿ｐｏ
ｗ＿ｓｐｅｃｔｒｕｍは式（８）で計算することができ
る。[Equation 6] Note that the value of k in equation (7) is 1 or n <QUANT_
It is a positive integer that satisfies STEP. Also, max_po
w_spectrum can be calculated by equation (8).

【００６９】[0069]

【数７】これにより、オーディオ信号の符号化における音質の制
御を行うことができる。[Equation 7] Thereby, it is possible to control the sound quality in encoding the audio signal.

【００７０】次に、本発明における符号化装置にＭＳコ
ントローラが具備された場合の符号化装置の動作例をブ
ロック構成図を用いて説明する。Next, an operation example of the encoding device in the case where the encoding device according to the present invention is equipped with the MS controller will be described with reference to a block diagram.

【００７１】図６は、本発明におけるＭＳコントローラ
を含む符号化装置の構成を示すブロック図の一例であ
る。FIG. 6 is an example of a block diagram showing a configuration of an encoding device including an MS controller according to the present invention.

【００７２】図６の符号化装置は、聴覚モデル１１と、
スケールファクター１３と、量子化器１４と、ノイズレ
スコーディング１５と、レート／歪みコントローラ１６
と、ステップ制御コントローラ１７と、ビットストリー
ムマルチプレクサ１８と、ＭＳコントローラ１９と、Ｍ
／Ｓステレオツール２０とを有するように構成されてい
る。The encoding apparatus shown in FIG. 6 includes an auditory model 11,
Scale factor 13, quantizer 14, noiseless coding 15, rate / distortion controller 16
, Step controller 17, bitstream multiplexer 18, MS controller 19, M
/ S stereo tool 20.

【００７３】ここで、各ブロックの動作について、主に
図２を用いて説明した各ブロック説明とことなる部分の
説明を行う。Here, the operation of each block will be described with respect to parts different from the description of each block mainly with reference to FIG.

【００７４】図６において、聴覚モデル１１は、量子化
雑音のマスキングパターンの計算を行う。ＭＳコントロ
ーラ１９は、Ｍ／Ｓエントロピーの計算を行う。つま
り、時間周波数変換された係数からＬ成分、Ｒ成分、Ｍ
成分（Ｌ＋Ｒ成分）、Ｓ成分（Ｌ−Ｒ成分）のエナジー
を計算し、聴覚モデル１１から入力されるマスキングパ
ターンの計算結果やエンコードに必要なビット数等の出
力情報に基づいて、ＭＳモード、ＬＲモードの切替えの
判定、又はＭＳモード時のチャンネル間のビット割当て
のための補助情報を作成する。In FIG. 6, the auditory model 11 calculates a masking pattern of quantization noise. The MS controller 19 calculates M / S entropy. That is, from the time-frequency converted coefficients, the L component, R component, M
The energy of the component (L + R component) and the S component (LR component) are calculated, and based on the calculation result of the masking pattern input from the auditory model 11 and output information such as the number of bits required for encoding, the MS mode, Auxiliary information is created for determination of LR mode switching or bit allocation between channels in MS mode.

【００７５】ここで、ＭＳコントローラ１９におけるエ
ナジーの計算によるＭＳモードとＬＲモードの判定内容
について説明する。まず、下記式（９）〜（１２）に夫
々のスケールファクターバンドの係数のＭ成分、Ｓ成
分、Ｌ成分、Ｒ成分のエナジー（ｅＭ(ｓｂ)、ｅＳ(ｓ
ｂ)、ｅＬ(ｓｂ)、ｅＲ(ｓｂ)）の計算式を示す。Here, the determination contents of the MS mode and the LR mode by the energy calculation in the MS controller 19 will be described. First, in the following equations (9) to (12), the energy (eM (sb), eS (s) of the M component, S component, L component, and R component of each scale factor band coefficient is calculated.
b), eL (sb), eR (sb)) are shown.

【００７６】[0076]

【数８】また、ｅＭ(ｓｂ)、ｅＳ(ｓｂ)、ｅＬ(ｓｂ)、及びｅＲ
(ｓｂ)成分の夫々のエネルギー比を式（１３）、（１
４）を用いて計算し、ＭＳモードとＬＲモードの切替え
を行う判定式を式（１５）に示す。[Equation 8] Also, eM (sb), eS (sb), eL (sb), and eR
The energy ratio of each of the (sb) components is calculated by using equations (13) and (1
Equation (15) shows the determination formula for switching between the MS mode and the LR mode calculated by using 4).

【００７７】[0077]

【数９】式（１５）におけるｋは正の定数であり、例えば、ｋ＝
１とする。[Equation 9] K in Expression (15) is a positive constant, for example, k =
Set to 1.

【００７８】なお、ＭＳモードとＬＳモードの切替え方
法は、この限りではなく、本出願人にて出願されている
特願２００１−７０９２６号「ステレオ信号の符号化方
法及び符号化装置」に記載されているように聴覚的なエ
ントロピーを使ってＭ／Ｓエントロピーの計算を行い、
その結果によりＬＲモードとＭＳモードの切替えを行う
こともできる。The method of switching between the MS mode and the LS mode is not limited to this, and is described in Japanese Patent Application No. 2001-70926 “Stereo signal encoding method and encoding apparatus” filed by the present applicant. As you can see, we use auditory entropy to calculate M / S entropy,
Depending on the result, it is possible to switch between the LR mode and the MS mode.

【００７９】次に、ＭＳモードが適用された場合のスケ
ールファクター１３におけるＭ成分とＳ成分の夫々のス
ケールファクター値（ｓｃｆ＿Ｍ[ｓｂ]、ｓｃｆ＿Ｓ
[ｓｂ]）を計算する計算式を式（１６）、式（１７）に
示す。Next, the scale factor values (scf_M [sb], scf_S of the M component and the S component in the scale factor 13 when the MS mode is applied).
The formulas for calculating [sb]) are shown in formulas (16) and (17).

【００８０】[0080]

【数１０】なお、ＱＵＡＮＴ＿Ｍ及びＱＵＡＮＴ＿Ｓは、Ｍ成分、
Ｓ成分の夫々の量子化ステップ値であり、レート／歪み
コントローラ１６から入力される。[Equation 10] QUANT_M and QUANT_S are M components,
It is the quantization step value of each S component and is input from the rate / distortion controller 16.

【００８１】次に、レート／歪みコントローラ１６と、
ステップ制御コントローラ１７のプログラム動作をフロ
ーチャートを用いて説明する。Next, the rate / distortion controller 16
The program operation of the step controller 17 will be described with reference to a flowchart.

【００８２】図７は、ＭＳコントローラを有する場合の
レート／歪みコントローラ及びステップ制御コントロー
ラの動作の一例を示すフローチャートである。なお、Ｍ
Ｓモード以外の場合は、図５と同様の動作を行うため説
明を省略する。FIG. 7 is a flow chart showing an example of the operation of the rate / distortion controller and the step control controller having the MS controller. In addition, M
The operations other than the S mode are the same as those in FIG.

【００８３】図７において、ステップ制御コントローラ
１７に設定してある値に基づいて、レート／歪みコント
ローラ１６の動作を行う（Ｓ３１）。まず、フィルター
バンク１２でＤＣＴ変換されたｉ番目のインデックスを
持つＭ成分、Ｓ成分夫々の係数（ｄＭ[ｉ]、dＳ[ｉ]）
と聴覚モデル１１（ＭＳコントロール）から入力される
許容されるＭ成分及びＳ成分の量子化ノイズの大きさ
（ａｌｌｗｅｄ＿ｄｉｓｔ＿Ｍ(ｓｂ)、ａｌｌｗｅｄ＿
ｄｉｓｔ＿Ｓ(ｓｂ)）とＭ成分、Ｓ成分を対応させて比
較を行い、許容される量子化ノイズの方が大きかった場
合、ｄＭ[ｉ]、dＳ[ｉ]に０をセットする（Ｓ３３〜Ｓ
３７）。これを、インデックスが最大になるまで繰り返
し行う（Ｓ３２、Ｓ３７）。In FIG. 7, the rate / distortion controller 16 operates based on the value set in the step controller 17 (S31). First, the coefficients (dM [i], dS [i]) of the M component and the S component having the i-th index that are DCT-transformed by the filter bank 12 respectively.
And the magnitudes of allowable M and S component quantization noises input from the auditory model 11 (MS control) (allwed_dist_M (sb), allwed_
dist_S (sb)) is compared with the M component and the S component, and when the allowable quantization noise is larger, 0 is set to dM [i] and dS [i] (S33 to S).
37). This is repeated until the index becomes maximum (S32, S37).

【００８４】次に、レート／歪みの制御を行う。まず、
Ｓ１１のＳＴＥＰ_ＭＡＴテーブルの添え字ｊを０（初
期化）にする（Ｓ３８）。次に、スケールファクターバ
ンド内で使用できるビット数（ａｖｅｒａｇｅ＿ｂｉｔ
ｓ）を、ビット数のカウント（ｂｉｔ＿ｃｏｕｎｔ()）
が超えないで処理ができるかの判断をＳ３９からＳ４５
までの処理を繰り返し行うことで確認する（Ｓ３９）。Next, rate / distortion control is performed. First,
The subscript j of the STEP_MAT table in S11 is set to 0 (initialization) (S38). Next, the number of bits that can be used in the scale factor band (average_bit)
s) is the number of bits (bit_count ())
It is judged from S39 to S45 whether processing can be performed without exceeding
This is confirmed by repeating the above process (S39).

【００８５】なお、ａｖｅｒａｇｅ＿ｂｉｔｓは、予め
設定しておき、ｂｉｔ＿ｃｏｕｎｔ()は、ノイズレスコ
ーディング１５で計算されたビット数の総和であり、ス
ケールファクター又はハフマンテーブルのコードブック
番号等、伝送のために必要なビット数（制御情報分）も
含む。It should be noted that the average_bits is set in advance, and the bit_count () is the sum of the number of bits calculated by the noiseless coding 15, which is necessary for transmission such as the scale factor or the codebook number of the Huffman table. It also includes the number of bits (control information).

【００８６】まず、ＱＵＡＮＴ＿ＳＴＥＰ＿ＭとＱＵＡ
ＮＴ＿ＳＴＥＰ＿ＳにＳ３１で設定した量子化値の最大
値をセットする（Ｓ４０）。First, QUANT_STEP_M and QUA
The maximum value of the quantized value set in S31 is set in NT_STEP_S (S40).

【００８７】なお、Ｓ３１の設定値の大きい値をＱＵＡ
ＮＴ＿ＳＴＥＰ＿Ｍにセットする。同時に、その次に大
きい値をＱＵＡＮＴ＿ＳＴＥＰ＿Ｓにセットする。次
に、スケールファクターバンド毎に処理を行う（Ｓ４
１）。まず、スケールファクター値（ｃａｌｃ＿ｓｃａ
ｌｅ()）の計算を行う（Ｓ４２）。計算されたスケール
ファクター値はスケールファクター１３に出力される。
また、Ｓ４３では量子化値（ｃａｌｃ＿ｑｕａｎｔ()）
の計算を行い量子化器１４に出力される。It should be noted that a large value of the set value of S31 is set to QUA.
Set to NT_STEP_M. At the same time, the next largest value is set in QUANT_STEP_S. Next, processing is performed for each scale factor band (S4
1). First, the scale factor value (calc_sca
le ()) is calculated (S42). The calculated scale factor value is output to the scale factor 13.
In S43, the quantized value (calc_quant ())
Is calculated and output to the quantizer 14.

【００８８】これを、スケールファクターバンド毎に行
い（Ｓ４４）、ビットカウント（ｂｉｔ＿ｃｏｕｎ
ｔ()）がａｖｅｒａｇｅ＿ｂｉｔｓを超なくなるまで繰
り返し行う（Ｓ４５）。This is performed for each scale factor band (S44), and the bit count (bit_count) is calculated.
It is repeated until t () does not exceed average_bits (S45).

【００８９】もし、ａｖｅｒａｇｅ＿ｂｉｔｓを超える
ことがあれば、必要なビット数を得ることができず、符
号化ができない場合や符号化した際に符号化音質が歪ん
でしまう等の問題が発生してしまうため、その場合は、
ステップ制御コントローラ内の量子化値の値を低く設定
して再度処理を行う。If the average_bits is exceeded, the required number of bits cannot be obtained, and there arises a problem that encoding cannot be performed or the encoded sound quality is distorted when encoded. So in that case,
The value of the quantized value in the step controller is set low and the process is performed again.

【００９０】つまり、Ｓ３９からＳ４５の処理で条件に
満たなければ、次の値（例えば、１６がセットされて条
件に合わなければ１２）がセットされ、再度Ｓ３９から
Ｓ４５までの処理を繰り返し行う。このようにして処理
を行い条件が最初にあったＱＵＡＮＴ＿ＳＴＥＰが最適
値ということになり、この値を用いて符号化すること
で、必要なビット数で最高の音質を得ることができる。That is, if the conditions are not satisfied in the processes of S39 to S45, the next value (for example, 16 is set and 12 is not satisfied) is set, and the processes of S39 to S45 are repeated. In this way, the QUANT_STEP with the first condition is the optimum value, and by encoding using this value, the highest sound quality can be obtained with the required number of bits.

【００９１】図７で示したフローチャートにより、Ｍ成
分及びＳ成分のゲイン又は前記量子化値の最適値を求め
ることができる。The gain of the M component and the S component or the optimum value of the quantized value can be obtained by the flowchart shown in FIG.

【００９２】なお、図７で示したフローチャートは、全
スケールファクターバンドについてＭＳモードを適用し
た場合であるが、本発明においてはこの限りではなく、
例えば、ＭＳモードとＬＳモードの判定をスケールファ
クターバンド毎に行うことで、スケールファクターバン
ド毎にモードの切替えを行い、効率的にゲイン及び量子
化値の最適値を求めることができる。The flow chart shown in FIG. 7 shows the case where the MS mode is applied to all scale factor bands, but the present invention is not limited to this.
For example, by determining the MS mode and the LS mode for each scale factor band, the mode can be switched for each scale factor band, and the optimum values of the gain and the quantized value can be efficiently obtained.

【００９３】ＭＳステレオツール２０は、周波数帯域の
信号の和信号（Ｌ＋Ｒ）、と差信号（Ｌ−Ｒ）を作成
し、ＭＳコントローラ１９からの制御信号によりＭＳモ
ード或いはＬＲモードの切替えを行う。また、ＭＳモー
ドのバンドの係数は、ＬＲ成分の夫々のｍｄｃｔ＿ｌｉ
ｎｅ[ｉ]をｄＬ(ｉ)、ｄＲ(ｉ)として、ｄＭ(ｉ)、ｄＳ
(ｉ)は夫々式（１８）、式（１９）で計算することがで
きる。The MS stereo tool 20 creates a sum signal (L + R) and a difference signal (LR) of frequency band signals, and switches between the MS mode and the LR mode by a control signal from the MS controller 19. The coefficient of the band in the MS mode is mdct_li of each LR component.
Let ne [i] be dL (i), dR (i), dM (i), dS
(i) can be calculated by equations (18) and (19), respectively.

【００９４】[0094]

【数１１】なお、その他の各ブロックは、図２のブロック説明と同
様であるため、詳細な説明は省略する。[Equation 11] Note that the other blocks are the same as the block description in FIG. 2, and thus detailed description will be omitted.

【００９５】上述により、任意のスケールファクターバ
ンドにＭＳステレオを適用した場合、スケールファクタ
ーバンドにおけるＭ成分とＳ成分のエナジー若しくは最
大周波数係数の大きさ、又は聴覚エントロピーに基づい
て、同じインデックス番号を持つスケールファクターバ
ンドのＭ成分とＳ成分の夫々の最大周波数係数の量子化
値を異なる大きさにすることができ、ビット数を成分に
より調整することができるため効率的なオーディオ信号
の符号化が可能となる。As described above, when MS stereo is applied to an arbitrary scale factor band, it has the same index number based on the magnitude of the energy or maximum frequency coefficient of the M component and S component in the scale factor band, or the auditory entropy. Since the quantized values of the maximum frequency coefficients of the M component and S component of the scale factor band can be made different and the number of bits can be adjusted by the component, efficient audio signal encoding is possible. Becomes

【００９６】また、スケールファクターバンド毎にＭＳ
モードとＬＳモードの判定及び切替えを行うことで、更
に効率的なオーディオ信号の符号化を行うことができ
る。In addition, MS for each scale factor band
By determining and switching between the mode and the LS mode, more efficient audio signal encoding can be performed.

【００９７】なお、図２、図６における各ブロックの動
作内容は上述した限りではなく、同一ブロックに関して
違う数式による計算をさせてもよい。The operation contents of each block in FIG. 2 and FIG. 6 are not limited to those described above, and the same block may be calculated by different mathematical expressions.

【００９８】例えば、スケールファクター１３におい
て、スケールファクター値を求める計算式、式（５）と
式（７）とを所定の条件で分けて処理をさせてもよい。
ここで、一例として、条件１とした場合の数式を式（２
０）に、また、条件２とした場合の数式を式（２１）に
示す。For example, in the scale factor 13, the calculation formula for obtaining the scale factor value, the formula (5) and the formula (7) may be divided and processed under predetermined conditions.
Here, as an example, the mathematical expression under the condition 1 is expressed by the formula (2
0) and the mathematical expression under the condition 2 are shown in Expression (21).

【００９９】[0099]

【数１２】 [Equation 12]

【０１００】[0100]

【数１３】なお、ｓｃｆ＿Ｉ[ｓｂ]は、条件１の場合のｓｂ番目の
スケールファクター値を示し、ｓｃｆ＿ＩＩ[ｓｂ]は、
条件２の場合のｓｂ番目のスケールファクター値を示
す。[Equation 13] Note that scf_I [sb] represents the sb-th scale factor value under condition 1, and scf_II [sb] is
The sb-th scale factor value in the case of condition 2 is shown.

【０１０１】なお、切替える条件としては、量子化器１
４の数式による計算式（式（３）、式（４））におい
て、最初に条件１としてx＿ｑｕａｎｔ[ｉ]を計算し
て、全てのスケールファクターバンドにおける量子化値
が０になった場合に、条件２に変更するように制御を行
う。The condition for switching is that the quantizer 1
In the calculation formulas (Formulas (3) and (4)) based on Formula 4, when x_quant [i] is first calculated as the condition 1 and the quantized values in all scale factor bands become 0, The control is performed so that the condition 2 is changed.

【０１０２】次に、条件１及び２を含む符号化装置のレ
ート／歪みコントローラ１６と、ステップコントローラ
１７の動作について、数式のフローチャートを用いて説
明を行う。Next, the operation of the rate / distortion controller 16 and the step controller 17 of the coding apparatus including the conditions 1 and 2 will be described with reference to a flow chart of mathematical expressions.

【０１０３】図８、図９は、切替え条件を有するレート
／歪みコントローラ及びステップ制御コントローラの動
作の一例を示すフローチャートである。ここで、図８に
おいて、Ｓ５１からＳ５６までの処理は、図５に示した
Ｓ１１からＳ１６と同様であるため、ここでの説明は省
略する。8 and 9 are flow charts showing an example of the operations of the rate / distortion controller and the step controller having the switching condition. Here, in FIG. 8, the processing from S51 to S56 is the same as the processing from S11 to S16 shown in FIG. 5, so description thereof will be omitted here.

【０１０４】レート／歪みの制御において、スケールフ
ァクターバンド内で使用できるビット数（ａｖｅｒａｇ
ｅ＿ｂｉｔｓ）を、ビット数のカウント（ｂｉｔ＿ｃｏ
ｕｎｔ()）が超えないで処理ができるかの判断を条件１
と条件２への切替え処理を含めて、Ｓ５７からＳ７２ま
での処理を繰り返し行うことで確認する（Ｓ５７）。In rate / distortion control, the number of bits available in the scale factor band (averag)
e_bits) is a bit number count (bit_co
Condition 1 to judge whether processing can be performed without exceeding unt ())
This is confirmed by repeating the processing from S57 to S72 including the processing for switching to Condition 2 and (S57).

【０１０５】なお、ａｖｅｒａｇｅ＿ｂｉｔｓは、予め
設定しておき、ｂｉｔ＿ｃｏｕｎｔ()は、ノイズレスコ
ーディング１５で計算されたビット数の総和であり、ス
ケールファクター又はハフマンテーブルのコードブック
番号等、伝送のために必要なビット数も含む。It should be noted that the average_bits is set in advance, and the bit_count () is the sum of the number of bits calculated by the noiseless coding 15, and is necessary for transmission such as the scale factor or the codebook number of the Huffman table. Including the number of bits.

【０１０６】まず、ＱＵＡＮＴ＿ＳＴＥＰに量子化値の
最大値をセットする（Ｓ５８）。次に、分割した全ての
スケールファクターバンドについて処理を行う（Ｓ５
９）。まず、初期値としてｆｌａｇ[ｓｂ]に１をセット
する（Ｓ６０）。次に、条件１の数式（式（２０））に
て、ｓｂ番目のスケールファクターバンドのスケールフ
ァクター値（ｓｃｆ[ｓｂ]）を求め（Ｓ６１）、量子化
値（ｃａｌｃ＿ｑｕａｎｔ()）の計算を行う（Ｓ６
２）。First, the maximum quantized value is set in QUANT_STEP (S58). Next, processing is performed on all the divided scale factor bands (S5).
9). First, 1 is set to flag [sb] as an initial value (S60). Next, the scale factor value (scf [sb]) of the sb-th scale factor band is obtained by the mathematical expression (Equation (20)) of Condition 1 (S61), and the quantized value (calc_quant ()) is calculated. (S6
2).

【０１０７】次に、ｆｌａｇ[ｓｂ]に０をセットし（Ｓ
６３）、Ｓ６２までに計算された量子化値がｏｆｆｓｅ
ｔ(ｓｂ)からｔｏｐ(ｓｂ)までで全て０か否かを確認を
行う（Ｓ６４〜Ｓ６７）。最初に、量子化値が０か否か
を判断し（Ｓ６５）、０でない量子化値があればｆｌａ
ｇ[ｓｂ]に１をセットする（Ｓ６６）。Next, 0 is set in flag [sb] (S
63), the quantized value calculated up to S62 is offse
Whether t (sb) to top (sb) are all 0 is confirmed (S64 to S67). First, it is judged whether or not the quantized value is 0 (S65), and if there is a quantized value other than 0, fla is determined.
1 is set to g [sb] (S66).

【０１０８】次に、ｆｌａｇ[ｓｂ]が０か否かを判断し
（Ｓ６８）、０であれば、式（２１）を用いてｓｂ番目
のスケールファクターバンドのスケールファクター値を
計算する（Ｓ６９）。Ｓ６８にて「ＮＯ」の場合、又は
Ｓ６９の処理が終了後、量子化値の計算を行う（Ｓ７
０）。これを、スケールファクターバンド毎に行い（Ｓ
７１）、ビットカウント（ｂｉｔ＿ｃｏｕｎｔ()）がａ
ｖｅｒａｇｅ＿ｂｉｔｓを超なくなるまで繰り返し行う
（Ｓ７２）。Next, it is judged whether or not flag [sb] is 0 (S68), and if it is 0, the scale factor value of the sbth scale factor band is calculated using the equation (21) (S69). . If "NO" in S68 or after the process of S69 is completed, the quantized value is calculated (S7).
0). Do this for each scale factor band (S
71) and the bit count (bit_count ()) is a
It is repeated until the number of average_bits is exceeded (S72).

【０１０９】もし、ａｖｅｒａｇｅ＿ｂｉｔｓを超える
ことがあれば、必要なビット数を得ることができず、符
号化ができない場合や符号化した際に音質が歪んでしま
う等の問題が発生してしまうため、その場合は、ステッ
プ制御コントローラ内の量子化値の値を低く設定して再
度処理を行う。If the average_bits is exceeded, the required number of bits cannot be obtained, and there arises a problem that the encoding cannot be performed or the sound quality is distorted when encoded. In that case, the value of the quantized value in the step controller is set low and the process is performed again.

【０１１０】図８、図９で示したフローチャートによ
り、ゲイン又は前記量子化値の最適値を求めることがで
きる。The gain or the optimum value of the quantized value can be obtained by the flow charts shown in FIGS. 8 and 9.

【０１１１】更に、符号化装置のその他のブロック構成
例として、上述した式（１）、式（６）、式（２０）及
び式（２１）の数式による計算を夫々行う量子化器１４
を有する各モジュール（スケールファクター１３、量子
化器１４、ノイズレスコーディング１５及びレート／歪
みコントローラ１６及びステップ制御コントローラ１
７）を符号化装置内に具備し、その中でビットカウント
数が少ないもので符号化を行うことにより、効率よく符
号化を行うことができる。Further, as another block configuration example of the encoding device, the quantizer 14 which respectively performs the calculation by the formulas of the above formulas (1), (6), (20) and (21).
Each module having (scale factor 13, quantizer 14, noiseless coding 15, rate / distortion controller 16 and step controller 1
7) is provided in the encoding device, and the one having a small bit count is used for the encoding, whereby the encoding can be performed efficiently.

【０１１２】上述したモジュールを有する符号化装置の
各ブロック構成の一例について、図１０を用いて説明す
る。An example of each block configuration of the coding apparatus having the above-mentioned module will be described with reference to FIG.

【０１１３】図１０の符号化装置は、聴覚モデル１１
と、ＭＳコントローラ１９と、フィルターバンク１２
と、Ｍ／Ｓステレオツール２０と、モジュールＡ２１
と、モジュールＢ２２と、モジュールＣ２３と、評価・
選択部２４と、ビットストリームマルチプレクサ１８と
を有するよう構成されている。また、モジュールＡ２１
とモジュールＢ２２とモジュールＣ２３の夫々のブロッ
クの構成例を図１１に示す。The coding apparatus shown in FIG.
, MS controller 19 and filter bank 12
, M / S stereo tool 20, and module A21
, Module B22, module C23, evaluation
It is configured to have a selection unit 24 and a bitstream multiplexer 18. In addition, the module A21
FIG. 11 shows a configuration example of each block of the module B22 and the module C23.

【０１１４】ここで、例えば、モジュールＡ２１には、
上述した条件１及び条件２による切替え（式（２０）、
式（２１）を行って量子化値を計算する上述の図８、図
９を用いて説明した動作を有し、モジュールＢ２２に
は、式（７）及び式（８）の数式により量子化値を計算
する動作を有し、また、モジュールＣ２３には、式
（５）、式（６）の数式により量子化値を計算する処理
を有する。Here, for example, in the module A21,
Switching according to the conditions 1 and 2 described above (equation (20),
The module B22 has the operation described with reference to FIGS. 8 and 9 for calculating the quantized value by performing the expression (21), and the module B22 uses the quantized value according to the expressions (7) and (8). In addition, the module C23 has a process of calculating a quantized value according to the formulas (5) and (6).

【０１１５】評価・選択部２４は、モジュールＡ２１、
モジュールＢ２２及びモジュールＣの出力に基づいて、
どのモジュールの制御信号とデータ信号をビットストリ
ームマルチプレクサ１８に出力するかを評価し選択する
機能を有する。なお、評価は、モジュールＡ２１、モジ
ュールＢ２２及びモジュールＣ２３から夫々出力された
使用ビット数と、各モジュールから出力されるデータを
比較して、使用ビット数とデータ量が最小となるモジュ
ールを用いて符号化を行う。The evaluation / selection unit 24 uses the module A21,
Based on the outputs of module B22 and module C,
It has a function of evaluating and selecting which module control signal and data signal are output to the bitstream multiplexer 18. The evaluation is performed by comparing the number of used bits output from each of the modules A21, B22, and C23 with the data output from each module, and using the module having the smallest number of used bits and the minimum data amount To convert.

【０１１６】図１０に示すブロック構成により、多種多
様なオーディオ信号において、効率のよい符号化を選択
して符号化を行うことができる。With the block configuration shown in FIG. 10, efficient encoding can be selected and encoded for a wide variety of audio signals.

【０１１７】上述したように本発明は、例えば、符号化
ビットが不足している場合等にハフマン符号化等の符号
語の特徴に基づいて量子化値を選定し符号化を行うこと
により、符号語の長さを短くすることができ、同時にビ
ット数を削減することができる。本発明により、削減で
きたビットは、音質にとって重要な部分に割当てられる
ことが可能であるため、相対的に聴覚的な音質を向上さ
せることができる。As described above, according to the present invention, for example, when a coding bit is insufficient, a quantized value is selected based on the characteristics of a code word such as Huffman coding, and the coding is performed. The word length can be reduced and at the same time the number of bits can be reduced. According to the present invention, the reduced bits can be assigned to a portion that is important for sound quality, so that the relatively perceptual sound quality can be improved.

【０１１８】また、スケールファクター値を量子化値の
最大値に基づいて、従来より大きく変化させる場合が多
いことから、計算量を減少させることができる。なお、
こうした変換符号化の規格は、デコーダのｓｙｎｔａｘ
であるので、本発明でエンコードしたビットストリーム
も既存のデコーダでデコードすることができる。Further, since the scale factor value is often changed largely based on the maximum value of the quantized value, the calculation amount can be reduced. In addition,
The standard of such transform coding is syntax of the decoder.
Therefore, the bitstream encoded by the present invention can be decoded by the existing decoder.

【０１１９】なお、本発明にて用いられた動作内容のフ
ローは一例であり、発明の範囲においては上述した限り
ではない。Note that the flow of operation contents used in the present invention is an example, and is not limited to the above description within the scope of the invention.

【０１２０】[0120]

【発明の効果】上述の如く本発明によれば、オーディオ
信号の符号化を効率的に行い、使用できるビット数を削
減することができる。また、オーディオ符号化において
使用できるビット数が少ない場合にも検知される音質の
劣化を軽減することができる。As described above, according to the present invention, it is possible to efficiently encode an audio signal and reduce the number of usable bits. In addition, even when the number of bits that can be used in audio encoding is small, it is possible to reduce deterioration in detected sound quality.

[Brief description of drawings]

【図１】ハフマン符号語テーブルの一例を示す図であ
る。FIG. 1 is a diagram showing an example of a Huffman codeword table.

【図２】本発明における符号化装置の構成を示すブロッ
ク図の一例である。FIG. 2 is an example of a block diagram showing a configuration of an encoding device according to the present invention.

【図３】ハフマン符号語に変換する処理の流れをプログ
ラム的に示す一例の図である。FIG. 3 is an example of programmatically showing a flow of processing for converting into a Huffman code word.

【図４】ＤＣＴ係数のバンド分けの一例を示す図であ
る。FIG. 4 is a diagram showing an example of band division of DCT coefficients.

【図５】レート／歪みコントローラ及びステップ制御コ
ントローラの動作の一例を示すフローチャートである。FIG. 5 is a flowchart showing an example of operations of a rate / distortion controller and a step controller.

【図６】本発明におけるＭＳコントローラを含む符号化
装置の構成を示すブロック図の一例である。FIG. 6 is an example of a block diagram showing a configuration of an encoding device including an MS controller according to the present invention.

【図７】ＭＳコントローラを有する場合のレート／歪み
コントローラ及びステップ制御コントローラの動作の一
例を示すフローチャートである。FIG. 7 is a flowchart showing an example of operations of a rate / distortion controller and a step control controller having an MS controller.

【図８】切替え条件を有するレート／歪みコントローラ
及びステップ制御コントローラの動作の一例を示すフロ
ーチャート（１）である。FIG. 8 is a flowchart (1) showing an example of operations of a rate / distortion controller having a switching condition and a step controller.

【図９】切替え条件を有するレート／歪みコントローラ
及びステップ制御コントローラの動作の一例を示すフロ
ーチャート（２）である。FIG. 9 is a flowchart (2) showing an example of the operations of the rate / distortion controller having a switching condition and the step controller.

【図１０】モジュールを有する符号化装置の各ブロック
構成の一例を示す図である。FIG. 10 is a diagram showing an example of each block configuration of an encoding device having a module.

【図１１】本発明における夫々のモジュールのブロック
構成例を示す図である。FIG. 11 is a diagram showing a block configuration example of each module in the present invention.

[Explanation of symbols]

１１聴覚モデル１２フィルターバンク１３スケールファクター１４量子化器１５ノイズレスコーディング１６レート／歪みコントローラ１７ステップ制御コントローラ１８ビットストリームマルチプレクサ１９ＭＳコントローラ２０Ｍ／Ｓステレオツール２１モジュールＡ２２モジュールＢ２３モジュールＣ２４評価・選択部 11 Hearing model 12 filter banks 13 Scale factor 14 Quantizer 15 Noiseless coding 16 rate / distortion controller 17 step controller 18 bitstream multiplexer 19 MS controller 20 M / S Stereo Tool 21 Module A 22 Module B 23 Module C 24 Evaluation / Selection Section

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5D045 DA20 5J064 AA02 BA09 BA16 BB05 BC01 BC08 BC09 BC11 BC16 BC23 BD01 ─────────────────────────────────────────────────── ─── Continued front page F-term (reference) 5D045 DA20 5J064 AA02 BA09 BA16 BB05 BC01 BC08 BC09 BC11 BC16 BC23 BD01

Claims

[Claims]

1. An audio signal encoding method, comprising: a conversion procedure for converting a time domain signal into a frequency domain signal; and a division procedure for dividing the frequency coefficient group converted in the conversion step into a plurality of bands. A control procedure for controlling the gain or the quantized value at a value of a frequency coefficient represented by a product of a gain and a quantized value in a stepwise manner, and an encoding procedure for encoding the quantized value. A characteristic audio signal encoding method.

2. The gain is set for each scale factor band, and encoding is performed so that the code word of the maximum frequency coefficient in the scale factor band has the maximum quantized value. 1. The audio signal encoding method according to 1.

3. The gain is set for each scale factor band, and the quantization value of the code word of the maximum frequency coefficient in the scale factor band is set to 1 or an arbitrary integer. 2. The audio signal encoding method as described in 2.

4. When MS stereo is applied to an audio signal, the energy of M component and S component in the scale factor band, or the magnitude of the maximum frequency coefficient, or the auditory entropy is used to determine the M component in the scale factor band. 4. The audio signal encoding method according to claim 2, wherein different quantization values are used as the maximum frequency coefficients of the S component and the S component, respectively.

5. An audio signal coding apparatus, comprising: a transforming unit that transforms a time domain signal into a frequency domain signal; and a splitting unit that splits the frequency coefficient group transformed in the transforming step into a plurality of bands. A control unit that controls the gain or the quantized value at a value of a frequency coefficient represented by a product of a gain and a quantized value in a stepwise manner, and an encoding unit that encodes the quantized value. A characteristic audio signal encoding device.

6. The control means sets the gain for each scale factor band, and the encoding means sets the code word of the maximum frequency coefficient within the scale factor band to be the maximum quantized value. The audio signal encoding apparatus according to claim 5, wherein the audio signal encoding apparatus performs encoding.

7. The control means sets the gain for each scale factor band, and the encoding means sets the quantized value of the code word of the maximum frequency coefficient in the scale factor band to 1 or an arbitrary integer. 7. The audio signal encoding device according to claim 5, wherein:

8. When MS stereo is applied to an audio signal, the energy of M component and S component in the scale factor band, or the magnitude of the maximum frequency coefficient, or the auditory entropy is used to determine the M component in the scale factor band. 9. The audio signal coding apparatus according to claim 6, wherein different quantization values are used as the maximum frequency coefficients of the S component and the S component, respectively.

9. One of the plurality of encoding means is evaluated / selected based on a plurality of encoding means having different encoding methods and a required number of bit rates for each scale factor band. 9. The audio signal encoding apparatus according to claim 5, further comprising an evaluation / selection unit.