JP2003005797A

JP2003005797A - Audio signal encoding method and apparatus, and encoding and decoding system

Info

Publication number: JP2003005797A
Application number: JP2001188067A
Authority: JP
Inventors: Sen Chon Kok; セン・チョンコク; Hon Neo Sua; ホン・ネオスア; Han Kua Kim; ハン・クアキム
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2001-06-21
Filing date: 2001-06-21
Publication date: 2003-01-08

Abstract

(57)【要約】【課題】修正関数をより正確に生成して利得制御を実
行することにより、プリエコー雑音及びポストエコー雑
音を確実に抑圧する。【解決手段】オーディオサンプル信号を含むサブバン
ドフレームを複数の区分に分けて各区分のピークの絶対
値を取得する。急激な音の立ち上がりを含む区分（アタ
ック信号）を識別し、識別された上記アタック信号の区
分と、前に修正された区分との間の区分を増幅してアタ
ック信号を処理し、また、急激な音の立下がりを含む区
分（リリース信号）を識別し、識別された上記リリース
信号の区分と、前に修正された区分との間の区分を減衰
してリリース信号を処理し、それによってプリエコーと
ポストエコーの発生を防止する。 (57) [Problem] To accurately suppress a pre-echo noise and a post-echo noise by generating a correction function more accurately and performing gain control. A subband frame including an audio sample signal is divided into a plurality of sections, and an absolute value of a peak in each section is obtained. Identify the segment (attack signal) that includes the sudden onset of sound, amplify the segment between the identified segment of the attack signal and the previously modified segment, process the attack signal, and The release signal is identified by attenuating the section between the identified release signal section and the previously modified section, and processing the release signal by attenuating the section between the identified release signal section and the previously modified section. And prevent the occurrence of post-echo.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、例えばＡＴＲＡＣ
３のような符号化方法に適用される、オーディオ信号の
符号化方法及び装置、並びに符号化及び復号化システム
に関し、特に、急激な立ち上がり信号であるアタック信
号の前に発生するプリエコー雑音、及び、急激な立ち下
がり信号であるリリース信号の後に発生するポストエコ
ー雑音を抑圧するためのオーディオ信号の符号化方法及
び装置、並びに符号化及び復号化システムに関する。TECHNICAL FIELD The present invention relates to, for example, ATRAC.
The present invention relates to an audio signal encoding method and apparatus, and an encoding and decoding system, which are applied to an encoding method such as 3), and in particular, pre-echo noise generated before an attack signal which is a sharp rising signal, and The present invention relates to an audio signal encoding method and apparatus, and an encoding and decoding system, for suppressing post-echo noise generated after a release signal which is a sharp falling signal.

【０００２】[0002]

【従来の技術】プリエコー雑音及びポストエコー雑音に
関する古典的な問題は、オーディオ信号の変換符号化に
おいて公知である（例えば、特開２０００−２５９１９
７号公報参照。）。図１１（ａ）は利得制御をせず、プ
リエコーやポストエコーが発生するときの従来技術のオ
ーディオ信号の符号化及び復号化装置の構成を示すブロ
ック図であり、図１１（ｂ）は利得制御を行い、プリエ
コーやポストエコーを抑圧するときの従来技術のオーデ
ィオ信号の符号化及び復号化装置の構成を示すブロック
図である。BACKGROUND OF THE INVENTION The classical problem of pre-echo noise and post-echo noise is well known in transform coding of audio signals (e.g. JP 2000-25919 A).
See Publication No. 7. ). FIG. 11A is a block diagram showing the configuration of a conventional audio signal encoding / decoding device when pre-echo and post-echo occur without gain control, and FIG. 11B shows gain control. FIG. 3 is a block diagram showing a configuration of a conventional audio signal encoding / decoding device when performing pre-echo and post-echo.

【０００３】図１１（ａ）に図示された、変形離散コサ
イン変換（以下、ＭＤＣＴという。）処理部１０１及び
量子化器１０２を備えた符号化器と、逆量子化器１０３
及び逆ＭＤＣＴ処理部１０４を備えた復号化器とを参照
すると、アタック信号（すなわち、急激な音の立ち上が
りを含む信号）を含む典型的な変換ブロックが符号化器
に入力されるとき、量子化器１０２は雑音を導入し、次
いで、復号化器は復元されたブロックにおいて上記雑音
を時間領域について均等に分布させる。すなわち、復元
された変換ブロック内にプリエコー雑音とポストエコー
雑音が発生する現象は、アタック信号を量子化すること
によって発生する量子化雑音を、アタック信号を含む変
換ブロック上に均等に分布させることによって引き起こ
される。アタック信号は、信号の強度の急激な増大の前
に、小さい振幅の期間が存在し、それらすべてが変換ブ
ロック内に存在することによって特徴づけられる。An encoder having a modified discrete cosine transform (hereinafter referred to as MDCT) processing unit 101 and a quantizer 102 shown in FIG. 11A, and an inverse quantizer 103.
And a decoder including the inverse MDCT processing unit 104, quantization when a typical transform block including an attack signal (that is, a signal including a rapid rising of a sound) is input to the encoder. The unit 102 introduces noise, and the decoder then distributes the noise evenly in the time domain in the reconstructed block. That is, the phenomenon in which the pre-echo noise and the post-echo noise occur in the restored transform block is that the quantization noise generated by quantizing the attack signal is evenly distributed on the transform block including the attack signal. Is triggered. The attack signal is characterized by the presence of periods of small amplitude, all within the transform block, before the sudden increase in signal strength.

【０００４】プリエコーは、２つの理由によって、聴覚
的に、ポストエコーよりも耳障りである。第１に、プリ
エコーは、大きな強度のアタック信号の部分の下に隠れ
てしまいがちなポストエコーとは異なり、それが出現す
るところの小さい振幅の信号を完全に覆い被してしま
う。第２に、人間の耳は、マスキングされる音がアタッ
ク信号の部分の時間的後方に存在する順方向マスキング
継続時間と比較して、マスキングされる音がアタック信
号の部分の時間的前方に存在する逆方向マスキング継続
時間は、はるかに短いという特徴を有する。具体的に
は、順方向マスキング継続時間は約２００ｍｓｅｃであ
り、逆方向マスキング継続時間は約５ｍｓｅｃである。Pre-echo is audibly more annoying than post-echo for two reasons. First, the pre-echo completely covers the small amplitude signal in which it appears, unlike the post-echo, which tends to hide under the portion of the attack signal of high intensity. Second, the human ear has the masked sound in front of the attack signal portion in time compared to the forward masking duration, in which the masked sound lies behind the attack signal portion in time. The reverse masking duration to be characterized is much shorter. Specifically, the forward masking duration is about 200 msec and the backward masking duration is about 5 msec.

【０００５】この問題を扱うために、３つのよく知られ
た以下の方法が存在する。（ａ）ブロックサイズ切り換え法；（ｂ）時間方向の雑音波形整形法（temporal noise sha
ping；ＴＮＳ）；及び（ｃ）利得制御法。There are three well-known methods for dealing with this problem: (A) Block size switching method; (b) Temporal noise shaping method
ping; TNS); and (c) gain control method.

【０００６】ここで、ブロックサイズ切り換え法は、標
準の変換ブロックをより短い部分ブロックに分割し、各
部分ブロックにより短い変換を適用する。このことは、
プリエコー雑音の拡散を、短い部分ブロック内に限定す
ることになる。また、時間方向の雑音波形整形法は、ア
タック信号が時間領域におけるその非均一性によって特
徴付けられているので、スペクトルデータに予測符号化
を適用することによって、時間−周波数の双対性を利用
する。最後に、利得制御法は、修正関数を用いて、小さ
い振幅の部分における信号強度を増幅することによっ
て、及び／又は大きい振幅部分の信号強度を抑制するこ
とによって、時間領域における信号を修正する。Here, the block size switching method divides a standard conversion block into shorter partial blocks and applies a shorter conversion to each partial block. This is
The pre-echo noise spread will be limited to within short sub-blocks. Also, the noise waveform shaping method in the time direction utilizes the time-frequency duality by applying predictive coding to the spectral data, since the attack signal is characterized by its non-uniformity in the time domain. . Finally, the gain control method uses a modification function to modify the signal in the time domain by amplifying the signal strength in the small amplitude part and / or suppressing the signal strength in the large amplitude part.

【０００７】従来技術に係る利得制御法の１つの例が図
１１（ｂ）に図示されている。ここでは、図１１（ａ）
の符号化器及び復号化器に対して、さらに利得制御部１
０５と逆利得制御部１０６が設けられている。図１１
（ａ）と同様のアタック信号を含む変換ブロックが入力
されたとき、利得制御部１０５は弱い信号を増幅し、及
び／又は強い信号を抑圧する。この増幅及び／又は抑圧
の情報は、符号化された信号とともに復号化器に送ら
れ、上記情報に基づいて、逆利得制御部１０６は、利得
制御部１０５において増幅された部分を抑圧し、及び／
又は抑圧された部分を増幅する。量子化器１０２は雑音
を導入し、復元されたブロックにおいて上記雑音を時間
領域について均等に分布させるが、逆利得制御部１０６
の後段において、雑音は、図１１（ａ）の利得制御なし
の例と比較すると、信号が予め小さな振幅だった領域に
おいて抑圧されている。One example of a gain control method according to the prior art is shown in FIG. 11 (b). Here, FIG. 11 (a)
Gain control unit 1 for the encoder and the decoder of
05 and an inverse gain control unit 106 are provided. Figure 11
When a conversion block including an attack signal similar to that in (a) is input, the gain control unit 105 amplifies a weak signal and / or suppresses a strong signal. This amplification and / or suppression information is sent to the decoder together with the encoded signal, and based on the above information, the inverse gain control unit 106 suppresses the amplified part in the gain control unit 105, and /
Alternatively, the suppressed part is amplified. The quantizer 102 introduces noise and evenly distributes the noise in the restored block in the time domain, but the inverse gain controller 106
In the latter stage, noise is suppressed in a region where the signal has a small amplitude in advance, as compared with the example without gain control in FIG. 11A.

【０００８】[0008]

【発明が解決しようとする課題】プリエコー雑音及びポ
ストエコー雑音の問題に取り組むためのよい方法は、い
くつかの要因を考慮しなければならない。アタック信号
に先行する小さい振幅の信号が、早すぎず、かつ遅すぎ
ないように増幅されるように、信号中のアタック信号の
開始部分（onset）を捕捉することが重要である。信号
をあまり早く増幅すると、プリエコーは効果的に除去さ
れない。信号をあまり遅く増幅すると、アタック信号そ
れ自体が増幅される危険があり、従って、プリエコーの
問題をさらに悪化させる。同様に、信号エネルギーの増
大（サージ）が、後に続くスペクトル量子化の処理に過
度の負担をかけないように、小さい振幅の信号を増幅し
た結果としての信号強度の増大は、大きい振幅の信号を
適当な程度に減衰することによってバランスを取る必要
がある。減衰された信号の、復号化器で復元されたとき
の正確さを減少させないように、減衰は過度であっては
ならない。最悪の場合には、過度の減衰は、われわれが
除去しようとするポストエコーよりも耳障りな、付加的
な人工物（アーティファクト）を導入することがある。
ゆえに、減衰が適用されるべき範囲を設定するための方
法も同様に重要である。上述の要因がすべて考慮された
ならば、そのとき、利得制御を達成するために、修正関
数を生成して、生成した修正関数を上記フレームに対し
て乗算することができる。上記修正関数は、前のフレー
ムと現在のフレームとのサンプル信号のオーバーフロー
を引き起こさないことが必要である。A good way to address the problems of pre-echo noise and post-echo noise must consider several factors. It is important to capture the onset of the attack signal in the signal so that the small amplitude signal preceding the attack signal is amplified not too early and not too late. If the signal is amplified too quickly, the pre-echo will not be effectively removed. Amplifying the signal too slowly risks amplifying the attack signal itself, thus exacerbating the pre-echo problem. Similarly, the increase in signal strength as a result of amplifying a signal of small amplitude causes the increase of signal of large amplitude so that the increase in signal energy (surge) does not overwhelm the subsequent processing of spectral quantization. It must be balanced by a suitable amount of damping. The attenuation should not be excessive so as not to reduce the accuracy of the attenuated signal when it is reconstructed at the decoder. In the worst case, excessive attenuation can introduce additional artifacts that are less annoying than the post-echo we are trying to eliminate.
Therefore, the method for setting the range to which the damping should be applied is also important. If all of the above factors are taken into account, then a correction function can be generated and the generated correction function can be multiplied to the frame in order to achieve gain control. The correction function should not cause overflow of the sampled signal between the previous frame and the current frame.

【０００９】アタック信号がプリエコー雑音の発生に関
与するのと同様に、急激な音の立下がりを含む信号であ
るリリース信号はポストエコー雑音の発生に関与するの
で、そのようなリリース信号の影響もまた抑圧しなけれ
ばならない。Since the release signal, which is a signal including a sudden fall of sound, is involved in the generation of post-echo noise, just as the attack signal is involved in the generation of pre-echo noise, the influence of such release signal is also exerted. You must also suppress it.

【００１０】本発明の目的は以上の問題点を解決し、修
正関数をより正確に生成して利得制御を実行することに
より、プリエコー雑音及びポストエコー雑音を確実に抑
圧することができるオーディオ信号の符号化方法及び装
置、並びに符号化及び復号化システムを提供することに
ある。An object of the present invention is to solve the above problems and to more accurately generate a correction function to execute gain control, thereby making it possible to reliably suppress pre-echo noise and post-echo noise. An object of the present invention is to provide an encoding method and apparatus, and an encoding and decoding system.

【００１１】[0011]

【課題を解決するための手段】修正関数を用いる利得制
御は、多くのオーディオ用符号化器においてプリエコー
及びポストエコーのアーティファクトを緩和するための
ポピュラーな方法である。本発明は、上記修正関数を生
成することにおいて、利得制御するポイント（位置）、
利得制御するレベル（増幅率及び減衰率）、及びオーバ
ーフローに対するセーフガードを決定することを実現す
る効率的な方法を提示する。Gain control using a correction function is a popular method for mitigating pre-echo and post-echo artifacts in many audio encoders. In the present invention, in generating the correction function, a point (position) for gain control,
We present an efficient way to realize determining gain control level (amplification factor and attenuation factor) and safeguard against overflow.

【００１２】本発明は、フレームの境界にまたがるアタ
ック信号を処理するために、フレームの複数の区分をバ
ッファメモリに記憶して遅延させるステップと、上記ア
タック信号の開始部分を検出するステップと、上記アタ
ック信号に先行する信号に適用される増幅係数を決定す
るステップと、減衰が適用されるべき位置を決定するス
テップと、上記減衰のための値を計算するステップと、
現在のフレームと前のフレームとにおけるオーバーフロ
ーを防止するステップとを含む。The present invention includes the steps of storing and delaying a plurality of sections of a frame in a buffer memory to process an attack signal that crosses frame boundaries; detecting the beginning of the attack signal; Determining the amplification factor applied to the signal preceding the attack signal, determining the position where the attenuation should be applied, calculating a value for said attenuation,
Preventing overflow in the current frame and the previous frame.

【００１３】本発明に係るオーディオ信号の符号化方法
は、入力されたオーディオ信号に基づいて修正関数を計
算して、計算された修正関数に従って上記オーディオ信
号に対して利得制御するステップと、上記利得制御され
たオーディオ信号に対してフレーム単位で直交変換処理
を行いかつ符号化処理を行うことにより符号化されたビ
ットストリーム信号を得るステップとを含むオーディオ
信号の符号化方法において、上記利得制御するステップ
は、入力されたオーディオ信号をフレームの時間よりも
短い区分の時間に分割し、少なくとも１個の区分の時間
だけオーディオ信号を遅延させ、遅延されたオーディオ
信号の処理すべきフレームとその前後のフレームとを含
む少なくとも３つのフレームに基づいて修正関数を計算
することを特徴とする。The audio signal encoding method according to the present invention comprises the steps of calculating a correction function based on the input audio signal and controlling the gain of the audio signal according to the calculated correction function; A step of performing an orthogonal transform process on the controlled audio signal on a frame-by-frame basis and performing an encoding process to obtain an encoded bitstream signal. Divides the input audio signal into time sections that are shorter than the frame time, delays the audio signal by at least one section time, and processes the delayed audio signal and the frames before and after that frame. Calculating a correction function based on at least three frames including That.

【００１４】上記オーディオ信号の符号化方法におい
て、上記利得制御するステップは、好ましくは、（ａ）
上記各区分のピークの絶対値MaxPeakを得るステップ
と、（ｂ）所定のアタック信号の基準を用いて、上記オ
ーディオ信号中の、急激な音の立ち上がりを含む信号の
部分であるアタック信号を識別するステップと、（ｃ）
処理すべきフレームとその前後のフレームとを含む少な
くとも３つのフレームに基づいて修正関数を計算し、計
算された修正関数に基づいて利得制御することにより、
上記識別された上記アタック信号の区分と、上記以前の
修正位置である区分との間の複数の区分を増幅するよう
に上記アタック信号を処理するステップとを含み、上記
アタック信号の基準は、次の区分に係る上記MaxPeak
と、上記次の区分に先行する予め決められた数の区分の
うちの最高値のMaxPeakとに基づいてアタック信号を識
別することを特徴とする。In the audio signal encoding method, the step of controlling the gain is preferably (a).
Using the step of obtaining the absolute value MaxPeak of the peak of each section, and (b) using a predetermined attack signal reference, identify an attack signal that is a part of the signal that includes a rapid sound rise. Step, (c)
By calculating a correction function based on at least three frames including a frame to be processed and frames before and after the frame, and performing gain control based on the calculated correction function,
Processing the attack signal to amplify a plurality of partitions between the identified partition of the attack signal and the partition that is the previous modified position, the criteria for the attack signal being: Max Peak related to the above categories
And an attack signal based on the highest value MaxPeak of a predetermined number of sections preceding the next section.

【００１５】また、上記オーディオ信号の符号化方法に
おいて、上記利得制御するステップは、好ましくは、上
記少なくとも３つのフレームに基づいて、上記修正関数
を、現在処理すべき区分から次の区分に係る上記MaxPea
kと、上記現在処理すべき区分から先行する予め決めら
れた数の区分における最高値MaxPeakとに基づいて計算
することを特徴とする。In the audio signal encoding method, the step of controlling the gain is preferably based on the at least three frames, and the correction function is related to the next section from the section to be currently processed. MaxPea
The calculation is based on k and the maximum value MaxPeak in a predetermined number of sections preceding the section to be currently processed.

【００１６】さらに、上記オーディオ信号の符号化方法
において、好ましくは、（ａ）現在処理すべき区分と、
それよりも以前の修正位置である区分との間のピークの
絶対値InterModMaxPeakを得るステップと、（ｂ）所定
のリリース信号の基準を用いて、上記オーディオ信号中
の、急激な音の立下がりを含む信号の部分であるリリー
ス信号を識別するステップと、（ｃ）処理すべきフレー
ムとその前後のフレームとを含む少なくとも３つのフレ
ームに基づいて修正関数を計算し、計算された修正関数
に基づいて利得制御することにより、識別されたリリー
ス信号の区分と、それよりも後ろの修正位置である区分
との間の複数の区分を増幅して上記リリース信号を処理
するステップとをさらに含み、上記リリース信号の基準
は、最新の上記InterModMaxPeakと、後続する予め決め
られた数の区分のうちの最高値のMaxPeakと、現在処理
すべき区分と最後にアタック信号が識別された区分との
間の時間とに基づいてリリース信号を識別することを特
徴とする。Further, in the above audio signal encoding method, preferably, (a) a segment to be currently processed,
Using the step of obtaining the absolute value InterModMaxPeak of the peak with respect to the section which is the correction position before that, and (b) using a predetermined release signal standard, a sudden fall of the sound in the audio signal is caused. Identifying a release signal that is a portion of the signal that includes, and (c) calculating a correction function based on at least three frames including the frame to be processed and the frames before and after it, and based on the calculated correction function. Gain controlling to further amplify a plurality of sections between the identified section of the release signal and a section that is a correction position behind the section to process the release signal. The signal is based on the latest InterModMaxPeak, the maximum value MaxPeak of the following predetermined number of divisions, the division to be currently processed and the attack signal at the end. It is characterized in that the release signal is identified based on the time to and from the identified section.

【００１７】またさらに、上記オーディオ信号の符号化
方法において、上記リリース信号を処理する修正関数
は、好ましくは、上記InterModMaxPeakと、上記後続す
る予め決められた数の区分のうちの最高値のMaxPeakと
に基づいて計算されることを特徴とする。Still further, in the audio signal encoding method, the modification function for processing the release signal is preferably the InterModMaxPeak and the maximum value MaxPeak of the subsequent predetermined number of partitions. It is characterized in that it is calculated based on.

【００１８】本発明に係るオーディオ信号の符号化装置
は、入力されたオーディオ信号に基づいて修正関数を計
算して、計算された修正関数に従って上記オーディオ信
号に対して利得制御手段と、上記利得制御されたオーデ
ィオ信号に対してフレーム単位で直交変換処理を行いか
つ符号化処理を行うことにより符号化されたビットスト
リーム信号を得る手段とを備えたオーディオ信号の符号
化装置において、上記利得制御手段は、入力されたオー
ディオ信号をフレームの時間よりも短い区分の時間に分
割し、少なくとも１個の区分の時間だけオーディオ信号
を遅延させ、遅延されたオーディオ信号の処理すべきフ
レームとその前後のフレームとを含む少なくとも３つの
フレームに基づいて修正関数を計算することを特徴とす
る。The audio signal encoding apparatus according to the present invention calculates a correction function based on the input audio signal, and gain control means and gain control means for the audio signal according to the calculated correction function. An audio signal encoding device, which comprises means for performing an orthogonal transformation process on the encoded audio signal in frame units and performing an encoding process to obtain an encoded bit stream signal, wherein the gain control means comprises: , The input audio signal is divided into time sections which are shorter than the frame time, the audio signal is delayed by at least one section time, and the frame to be processed of the delayed audio signal and the frames before and after it are processed. Characterized in that the correction function is calculated based on at least three frames including.

【００１９】上記オーディオ信号の符号化装置におい
て、上記利得制御手段は、好ましくは、（ａ）上記各区
分のピークの絶対値MaxPeakを得る手段と、（ｂ）所定
のアタック信号の基準を用いて、上記オーディオ信号中
の、急激な音の立ち上がりを含む信号の部分であるアタ
ック信号を識別する手段と、（ｃ）処理すべきフレーム
とその前後のフレームとを含む少なくとも３つのフレー
ムに基づいて修正関数を計算し、計算された修正関数に
基づいて利得制御することにより、上記識別された上記
アタック信号の区分と、上記以前の修正位置である区分
との間の複数の区分を増幅するように上記アタック信号
を処理する手段とを備え、上記アタック信号の基準は、
次の区分に係る上記MaxPeakと、上記次の区分に先行す
る予め決められた数の区分のうちの最高値のMaxPeakと
に基づいてアタック信号を識別することを特徴とする。In the audio signal encoding apparatus, the gain control means preferably uses (a) means for obtaining the absolute value MaxPeak of the peak of each section and (b) a predetermined attack signal reference. A means for identifying an attack signal which is a part of a signal including a sudden rise of sound in the audio signal, and (c) a modification based on at least three frames including a frame to be processed and frames before and after the frame. Calculating a function and controlling gain based on the calculated correction function so as to amplify a plurality of sections between the section of the identified attack signal and the section which is the previous correction position. And a means for processing the attack signal, the reference of the attack signal,
It is characterized in that the attack signal is identified based on the MaxPeak relating to the next division and the highest value MaxPeak of a predetermined number of divisions preceding the next division.

【００２０】また、上記オーディオ信号の符号化装置に
おいて、好ましくは、上記利得制御手段は、上記少なく
とも３つのフレームに基づいて、上記修正関数を、現在
処理すべき区分から次の区分に係る上記MaxPeakと、上
記現在処理すべき区分から先行する予め決められた数の
区分における最高値MaxPeakとに基づいて計算すること
を特徴とする。Further, in the audio signal encoding apparatus, preferably, the gain control means, based on the at least three frames, the correction function, the MaxPeak relating to a next section from a section to be currently processed. And a maximum value MaxPeak in a predetermined number of sections preceding the section to be currently processed.

【００２１】さらに、上記オーディオ信号の符号化装置
において、好ましくは、（ａ）現在処理すべき区分と、
それよりも以前の修正位置である区分との間のピークの
絶対値InterModMaxPeakを得る手段と、（ｂ）所定のリ
リース信号の基準を用いて、上記オーディオ信号中の、
急激な音の立下がりを含む信号の部分であるリリース信
号を識別する手段と、（ｃ）処理すべきフレームとその
前後のフレームとを含む少なくとも３つのフレームに基
づいて修正関数を計算し、計算された修正関数に基づい
て利得制御することにより、識別されたリリース信号の
区分と、それよりも後ろの修正位置である区分との間の
複数の区分を増幅して上記リリース信号を処理する手段
とをさらに備え、上記リリース信号の基準は、最新の上
記InterModMaxPeakと、後続する予め決められた数の区
分のうちの最高値のMaxPeakと、現在処理すべき区分と
最後にアタック信号が識別された区分との間の時間とに
基づいてリリース信号を識別することを特徴とする。Further, in the above audio signal encoding apparatus, preferably, (a) a segment to be currently processed,
A means for obtaining an absolute value InterModMaxPeak of a peak between a section which is a correction position before that and (b) a predetermined release signal reference, and in the audio signal,
Calculating means for calculating a correction function based on at least three frames including a frame to be processed and frames before and after the means for identifying a release signal which is a part of a signal including a sharp fall of sound; Means for amplifying a plurality of sections between the identified section of the release signal and the section which is the correction position behind the section by controlling the gain based on the corrected correction function, and processing the release signal. The release signal criterion further comprises the latest InterModMaxPeak, the maximum MaxPeak of a predetermined number of subsequent segments, the segment to be currently processed and the last attack signal identified. It is characterized in that the release signal is identified based on the time to and from the partition.

【００２２】またさらに、上記オーディオ信号の符号化
装置において、上記リリース信号を処理する修正関数
は、上記InterModMaxPeakと、上記後続する予め決めら
れた数の区分のうちの最高値のMaxPeakとに基づいて計
算されることを特徴とする。Furthermore, in the audio signal encoding device, the modification function for processing the release signal is based on the InterModMaxPeak and the maximum value MaxPeak of the following predetermined number of sections. It is calculated.

【００２３】本発明に係るオーディオ信号の符号化及び
復号化システムは、上記オーディオ信号の符号化装置
と、オーディオ信号の復号化装置とを備えたオーディオ
信号の符号化及び復号化システムであって、上記オーデ
ィオ信号の復号化装置は、上記符号化装置により符号化
されたビットストリーム信号を復号化しかつ直交変換処
理を行うことにより、複数のフレームからなるオーディ
オ信号を得る手段と、上記得られたオーディオ信号に対
して、上記修正関数とは逆の修正関数を用いて逆利得制
御を行って逆利得制御されたオーディオ信号を得て出力
する手段とを備えたことを特徴とする。An audio signal encoding / decoding system according to the present invention is an audio signal encoding / decoding system including the audio signal encoding device and the audio signal decoding device. The audio signal decoding device includes means for obtaining an audio signal composed of a plurality of frames by decoding the bit stream signal encoded by the encoding device and performing orthogonal transform processing; and the obtained audio signal. Means for performing inverse gain control on a signal using a correction function opposite to the above-mentioned correction function to obtain and output an audio signal subjected to inverse gain control.

【００２４】[0024]

【発明の実施の形態】以下、図面を参照して本発明の一
実施形態について説明する。本願明細書では、ＡＴＲＡ
Ｃ３のためのミニディスク記録再生システムを一実施形
態として参照して説明するが、本発明の符号化方法は、
利得制御を用いる他の符号化器にも適用することができ
る。DETAILED DESCRIPTION OF THE INVENTION An embodiment of the present invention will be described below with reference to the drawings. In the present specification, ATRA
A mini disk recording / reproducing system for C3 will be described with reference to an embodiment, but the encoding method of the present invention is
It can also be applied to other encoders that use gain control.

【００２５】図１は、本発明の一実施形態に係るミニデ
ィスク記録再生システムの構成を示すブロック図であ
る。この実施形態は、特に、オーディオエンコーダ２内
の詳細構成に特徴を有している。FIG. 1 is a block diagram showing the configuration of a mini disk recording / reproducing system according to an embodiment of the present invention. This embodiment is particularly characterized by the detailed configuration inside the audio encoder 2.

【００２６】図１において、Ａ／Ｄコンバータ１はオー
ディオ入力信号をＡ／Ｄ変換してディジタル化されたオ
ーディオサンプル信号に変換し、次いで、オーディオエ
ンコーダ２は、詳細後述するような利得制御方法を用い
て上記変換されたオーディオサンプル信号を圧縮して符
号化し、ＡＴＲＡＣ３のビットストリーム信号を発生す
る。次いで、ミニディスク記録装置３は、ＡＴＲＡＣ３
のビットストリーム信号を所定の変調記録信号に変調し
た後、上記変調記録信号をミニディスク４に記録する。
一方、ミニディスク再生装置５は、ミニディスク４から
記録信号を再生して復調することにより、ＡＴＲＡＣ３
のビットストリーム信号を再生する。さらに、オーディ
オデコーダ６は圧縮されているＡＴＲＡＣ３のビットス
トリーム信号に対して、上記利得制御方法とは逆の逆利
得制御方法を用いてその利得を制御しながらディジタル
オーディオ信号に復号化した後、最後にＤ／Ａコンバー
タ７はディジタルオーディオ信号をＤ／Ａ変換してオー
ディオ出力信号を出力する。In FIG. 1, an A / D converter 1 A / D converts an audio input signal to convert it into a digitized audio sample signal, and then an audio encoder 2 uses a gain control method as will be described later in detail. It is used to compress and encode the converted audio sample signal to generate an ATRAC3 bitstream signal. Next, the mini disk recording device 3 is set to ATRAC3.
After modulating the bit stream signal of 1 to a predetermined modulation recording signal, the modulation recording signal is recorded on the mini disk 4.
On the other hand, the mini-disc reproducing device 5 reproduces the recording signal from the mini-disc 4 and demodulates it so that the ATRAC3
Play the bitstream signal of. Further, the audio decoder 6 decodes the compressed bit stream signal of the ATRAC3 into a digital audio signal while controlling its gain by using an inverse gain control method which is the reverse of the above gain control method. The D / A converter 7 D / A converts the digital audio signal and outputs an audio output signal.

【００２７】上述の利得制御及び逆利得制御によって、
ディジタルオーディオ信号の符号化及び復号化の際に発
生するプリエコー雑音及びポストエコー雑音を抑圧する
ことができる。この実施形態では、記録再生システム
は、ＡＴＲＡＣ３のビットストリーム信号をミニディス
ク４に記録するように構成されているが、本発明はこれ
に限らず、その他の記録媒体または伝送媒体を用いるよ
うに記録再生システムや伝送システムを構成してもよ
い。By the above gain control and inverse gain control,
It is possible to suppress pre-echo noise and post-echo noise that occur when encoding and decoding a digital audio signal. In this embodiment, the recording / reproducing system is configured to record the bit stream signal of the ATRAC3 on the mini disc 4, but the present invention is not limited to this, and recording is performed using another recording medium or transmission medium. A reproduction system and a transmission system may be configured.

【００２８】図２は、図１のオーディオエンコーダ２の
詳細構成を示すブロック図である。図２において、オー
ディオエンコーダ２は、ＡＴＲＡＣ３の音声信号圧縮技
術を用いた典型的な変換符号化器であるが、オーディオ
エンコーダ２の詳細構成のうち、特に、利得検出部１４
−１乃至１４−４及び利得制御部１５−１乃至１５−４
の詳細構成において特徴を有している。ＡＴＲＡＣ３
は、基本的に、４４．１ｋＨｚでサンプリングされたオ
ーディオサンプル信号のストリームに対して構成され
た、サブバンド符号化と変換符号化の技術のハイブリッ
ド符号化方法である。FIG. 2 is a block diagram showing the detailed arrangement of the audio encoder 2 shown in FIG. In FIG. 2, the audio encoder 2 is a typical conversion encoder using the audio signal compression technique of the ATRAC 3, but among the detailed configuration of the audio encoder 2, in particular, the gain detection unit 14
-1 to 14-4 and gain control units 15-1 to 15-4
It has a feature in the detailed configuration of. ATRAC3
Is basically a hybrid coding method of sub-band coding and transform coding technology, which is configured for a stream of audio sample signals sampled at 44.1 kHz.

【００２９】図２に示すように、オーディオエンコーダ
２は、２ステージのＱＭＦフィルタ１０を用いて、４つ
の部分帯域（サブバンド）信号に周波数帯域分割し、次
に、各部分帯域毎に時間方向に、それぞれ２５６個のサ
ンプル信号を有するフレームに分割した信号（以下、サ
ブバンドフレームという。４つの部分帯域の合計で１０
２４個のサンプル信号を有する。）を出力する。ここ
で、ＱＭＦ分割フィルタ１０は、詳しくは、オーディオ
サンプル信号を低域信号と高域信号に等分するローパス
フィルタ１１及びハイパスフィルタ１２と、上記分割さ
れた低域及び高域信号を周波数についてさらに等分する
フィルタバンク１３−１乃至１３−４とを備えて構成さ
れる。従って、ＱＭＦフィルタ１０の出力は、それぞれ
２５６個のサンプル信号を有する、４個のダウンサンプ
リングされたサブバンドフレームである。それらの各々
は、ＭＤＣＴ変換のための変換ブロックの半分の長さで
あり、０乃至５．５ｋＨｚ、５．５ｋＨｚ乃至１１ｋＨ
ｚ、１１ｋＨｚ乃至１６．５ｋＨｚ、及び１６．５ｋＨ
ｚ乃至２２ｋＨｚの４つの周波数帯のそれぞれから得ら
れるオーディオ信号である。As shown in FIG. 2, the audio encoder 2 uses the 2-stage QMF filter 10 to divide the frequency band into four partial band (subband) signals, and then, in the time direction for each partial band. , A signal divided into frames each having 256 sample signals (hereinafter referred to as a subband frame. A total of four partial bands is 10
It has 24 sample signals. ) Is output. Here, in detail, the QMF division filter 10 further includes a low-pass filter 11 and a high-pass filter 12 that equally divide the audio sample signal into a low-pass signal and a high-pass signal, and further divides the divided low-pass and high-pass signals with respect to frequency. It is configured by including filter banks 13-1 to 13-4 that equally divide. Therefore, the output of QMF filter 10 is four downsampled subband frames, each having 256 sample signals. Each of them is half the length of the transform block for the MDCT transform, 0 to 5.5 kHz, 5.5 kHz to 11 kHz.
z, 11 kHz to 16.5 kHz, and 16.5 kHz
It is an audio signal obtained from each of the four frequency bands from z to 22 kHz.

【００３０】なお、フィルタバンク１３−１乃至１３−
４の後段にそれぞれ、ジョイントステレオ処理を実行す
る回路部をさらに備えてもよい。ジョイントステレオ処
理の回路部は、例えば、ステレオ信号として入力された
オーディオ信号を、右チャンネル信号と左チャンネル信
号との平均信号と、右チャンネル信号のゲインデータ
と、左チャンネル信号のゲインデータとに変換し、オー
ディオ信号のさらなる圧縮を達成することができる。The filter banks 13-1 to 13-
Each of the subsequent stages of 4 may further include a circuit unit that executes joint stereo processing. The circuit part of the joint stereo processing converts, for example, an audio signal input as a stereo signal into an average signal of the right channel signal and the left channel signal, gain data of the right channel signal, and gain data of the left channel signal. However, further compression of the audio signal can be achieved.

【００３１】次いで、修正関数を生成して、各サブバン
ドフレームに対して利得制御が実行される。フィルタバ
ンク１３−１乃至１３−４から出力された各サブバンド
フレームの信号は、利得検出部１４−１乃至１４−４と
利得制御部１５−１乃至１５−４にそれぞれ入力され、
各利得検出部１４−１乃至１４−４は、バッファメモリ
１４ａ−１乃至１４ａ−４と修正関数計算部１４ｂ−１
乃至１４ｂ−４を備えて構成され、各サブバンドフレー
ムに対する修正関数を計算して出力し、次いで、各利得
制御部１５−１乃至１５−４は、上記生成された修正関
数を用いて各サブバンドフレームに対して利得制御を実
行する。以下、図４を参照して、特に、フィルタバンク
１３−１から出力される０乃至５．５ｋＨｚの周波数帯
のサブバンドフレームについて説明する。Then, a modification function is generated and gain control is performed for each subband frame. The signals of the sub-band frames output from the filter banks 13-1 to 13-4 are input to the gain detection units 14-1 to 14-4 and the gain control units 15-1 to 15-4, respectively.
Each of the gain detection units 14-1 to 14-4 includes a buffer memory 14a-1 to 14a-4 and a correction function calculation unit 14b-1.
To 14b-4, calculates and outputs a correction function for each subband frame, and then each gain control unit 15-1 to 15-4 uses each correction function generated above to generate a correction function for each subband. Perform gain control on band frames. Hereinafter, with reference to FIG. 4, a subband frame in the frequency band of 0 to 5.5 kHz output from the filter bank 13-1 will be described in particular.

【００３２】図４は、ＭＤＣＴ処理部１５−１に先行す
る利得検出部１４−１及び利得制御部１５−１における
変換ブロックの生成を示すブロック図である。FIG. 4 is a block diagram showing the generation of conversion blocks in the gain detection unit 14-1 and the gain control unit 15-1 preceding the MDCT processing unit 15-1.

【００３３】図４において、各オーディオサンプル信号
のストリームから構成される入力されたオーディオ信号
のうちで、利得制御されるべきｎ番目のサブバンドフレ
ーム（以下、サブバンドフレームｎという。）は、修正
関数を計算するために前後のサブバンドフレームｎ−１
及びｎ＋１とともにバッファメモリ１４ａ−１に格納さ
れる。修正関数計算部１４ｂ−１ｂはサブバンドフレー
ムｎ−１乃至ｎ＋１を詳細後述するように１つの区分だ
け遅延させた後、遅延後の３つのサブバンドフレームｎ
−１乃至ｎ＋１に基づいてサブバンドフレームｎのため
の修正関数ＭＦｎを計算し、次いで、乗算器ＭＰ１ｂは
上記修正関数ＭＦｎをサブバンドフレームｎに乗算して
乗算結果のサブバンドフレームｎを縦続接続演算子ＣＯ
１及び乗算器ＭＰ２ｂに出力する。また、修正関数計算
部１４ｂ−１ａは、サブバンドフレームｎ−２，ｎ−１
及びｎを詳細後述するように１つの区分だけ遅延させた
後、遅延後の３つのサブバンドフレームｎ−２乃至ｎに
基づいてサブバンドフレームｎ−１のための修正関数Ｍ
Ｆｎ−１を計算し、次いで、乗算器ＭＰ１ａは上記修正
関数ＭＦｎ−１をサブバンドフレームｎ−１に乗算して
乗算結果のサブバンドフレームｎ−１を乗算器ＭＰ２ａ
に出力する。さらに、同様に、修正関数計算部１４ｂ−
１ｃは、サブバンドフレームｎ，ｎ＋１及びｎ＋２を詳
細後述するように１つの区分だけ遅延させた後、遅延後
の３つのサブバンドフレームｎ乃至ｎ＋２に基づいてサ
ブバンドフレームｎ＋１のための修正関数ＭＦｎ＋１を
計算し、次いで、乗算器ＭＰ１ｃは上記修正関数ＭＦｎ
＋１をサブバンドフレームｎ＋１に乗算して乗算結果の
サブバンドフレームｎ＋１を縦続接続演算子ＣＯ２及び
乗算器ＭＰ２ｃに出力する。In FIG. 4, the n-th sub-band frame (hereinafter referred to as sub-band frame n) to be gain-controlled among the input audio signals composed of streams of audio sample signals is modified. Previous subband frame n-1 to compute the function
And n + 1 are stored in the buffer memory 14a-1. The correction function calculator 14b-1b delays the sub-band frames n-1 to n + 1 by one segment as described later in detail, and then delays the three sub-band frames n.
The correction function MFn for the sub-band frame n is calculated based on -1 to n + 1, and then the multiplier MP1b multiplies the correction function MFn by the sub-band frame n to cascade the sub-band frame n of the multiplication result. Operator CO
1 and the multiplier MP2b. In addition, the correction function calculation unit 14b-1a determines the subband frames n-2 and n-1.
And n are delayed by one partition as described in detail below, and then a modification function M for subband frame n-1 is based on the delayed three subband frames n-2 to n.
Fn-1 is calculated, and then the multiplier MP1a multiplies the modified function MFn-1 by the subband frame n-1 and the resulting subband frame n-1 is multiplied by the multiplier MP2a.
Output to. Further, similarly, the correction function calculation unit 14b-
1c delays the subband frames n, n + 1 and n + 2 by one section as described in detail later, and then modifies the correction function MFn + 1 for the subband frame n + 1 based on the delayed three subband frames n to n + 2. And then the multiplier MP1c calculates the correction function MFn
The subband frame n + 1 is multiplied by +1 and the resulting subband frame n + 1 is output to the cascade connection operator CO2 and the multiplier MP2c.

【００３４】ＭＤＣＴ処理部１６−１における変形離散
コサイン変換（ＭＤＣＴ）処理に先行して、サブバンド
フレームの境界における修正関数の連続性を保証しなが
ら、２つの連続的なサブバンドフレームが互いに縦続接
続される。乗算器ＭＰ２ａは、修正関数ＭＦｎ−１によ
って利得制御され、乗算器ＭＰ１ａから出力されたサブ
バンドフレームｎ−１を、次のサブバンドフレームｎの
ための修正関数ＭＦｎの最初の区分（本実施形態におい
て、区分とは、１つのサブバンドフレームを３２個の区
分に分割したときの当該区分をいう。）の利得係数ＭＦ
ｎ［０］と乗算することにより正規化して、乗算結果の
サブバンドフレームｎ−１を縦続接続演算子ＣＯ１に出
力する。次いで、縦続接続演算子ＣＯ１は、乗算器ＭＰ
２ａから出力されたサブバンドフレームｎ−１と、乗算
器ＭＰ１ｂから出力されたサブバンドフレームｎとを縦
続接続して、ＭＤＣＴ処理部１６−１に出力する。同様
に、乗算器ＭＰ２ｂは、乗算器ＭＰ１ａから出力された
サブバンドフレームｎを、次のサブバンドフレームｎ＋
１のための修正関数ＭＦｎ＋１の最初の区分の利得係数
ＭＦｎ＋１［０］と乗算することにより正規化して、縦
続接続演算子ＣＯ２に出力する。次いで、縦続接続演算
子ＣＯ２は、乗算器ＭＰ２ｂから出力されたサブバンド
フレームｎと、乗算器ＭＰ１ｃから出力されたサブバン
ドフレームｎ＋１とを縦続接続して、ＭＤＣＴ処理部１
６−１に出力する。すなわち、ＡＴＲＡＣ３の標準は、
修正関数ＭＦｎの値がサブバンドフレームｎの端点にお
いて１に等しい（すなわち、利得が１である）ことを必
要とし、そのため、次の修正関数ＭＦｎ＋１と結合され
たとき、それは、次の修正関数の最初の区分の利得係数
ＭＦｎ＋１［０］との乗算によって、サブバンドフレー
ムｎ及びｎ＋１の境界において同一のレベルにするよう
に正規化することができる。ゆえに、結合された２つの
サブバンドフレームは、隣接するブロックがそれぞれ５
０％の重複を有するＭＤＣＴのための変換ブロック全体
を形成した後、ＭＤＣＴ処理部１６−１に出力される。Prior to the modified discrete cosine transform (MDCT) processing in the MDCT processing unit 16-1, two consecutive subband frames are cascaded with each other while guaranteeing the continuity of the correction function at the boundary of the subband frames. Connected. The gain of the multiplier MP2a is controlled by the correction function MFn-1, and the sub-band frame n-1 output from the multiplier MP1a is divided into the first section (the present embodiment) of the correction function MFn for the next sub-band frame n. In the above, the term "division" refers to the division when one subband frame is divided into 32 divisions.)
Normalization is performed by multiplying by n [0], and the subband frame n-1 as the multiplication result is output to the cascade connection operator CO1. Then the cascade operator CO1
The subband frame n-1 output from 2a and the subband frame n output from the multiplier MP1b are cascade-connected and output to the MDCT processing unit 16-1. Similarly, the multiplier MP2b converts the subband frame n output from the multiplier MP1a into the next subband frame n +.
The modified function MFn + 1 for 1 is normalized by being multiplied by the gain coefficient MFn + 1 [0] of the first section and is output to the cascade operator CO2. Next, the cascade connection operator CO2 cascade-connects the subband frame n output from the multiplier MP2b and the subband frame n + 1 output from the multiplier MP1c, and the MDCT processing unit 1
Output to 6-1. That is, the ATRAC3 standard is
It requires that the value of the modification function MFn be equal to 1 (ie, the gain is 1) at the endpoints of subband frame n, so that when combined with the next modification function MFn + 1, it is The multiplication with the gain factor MFn + 1 [0] of the first partition can be normalized to the same level at the boundaries of subband frames n and n + 1. Therefore, two sub-band frames that are combined have five adjacent blocks each.
After forming the entire transform block for MDCT having 0% overlap, it is output to the MDCT processing unit 16-1.

【００３５】図４を参照して以上に説明した乗算器ＭＰ
１ａ，ＭＰ１ｂ，ＭＰ１ｃ，ＭＰ２ａ，ＭＰ２ｂ及びＭ
Ｐ２ｃと、縦続接続演算子ＣＯ１，ＣＯ２とは、本実施
形態においては、利得制御部１５−１に含まれるように
構成されている。図４においては、第１のバンドに関す
る処理のみを図示しているが、第２乃至第４のバンドに
関する処理も同様に実行される。The multiplier MP described above with reference to FIG.
1a, MP1b, MP1c, MP2a, MP2b and M
The P2c and the cascade connection operators CO1 and CO2 are configured to be included in the gain control unit 15-1 in the present embodiment. In FIG. 4, only the processing regarding the first band is illustrated, but the processing regarding the second to fourth bands is similarly executed.

【００３６】再び図２を参照すると、ＭＤＣＴ処理部１
６−１乃至１６−４は、各周波数帯域で利得制御され、
かつＭＤＣＴ処理するための変換ブロックに縦続接続さ
れた少なくとも２つのサブバンドフレーム毎の信号に対
してそれぞれ変形離散コサイン変換（ＭＤＣＴ）の処理
を実行し、その結果として生じるスペクトル情報信号を
トーン成分及びスペクトル検出部１７に出力する。トー
ン成分及びスペクトル検出部１７は、ＭＤＣＴ処理され
た情報信号からトーン成分信号と非トーン性のスペクト
ル信号とを分離して、それぞれをトーン成分符号化器１
８とスペクトル符号化器１９に出力し、後段のトーン成
分符号化器１８とスペクトル符号化器１９はトーン成分
信号と非トーン性のスペクトル信号とに分けて個別に量
子化等の符号化の処理をする。ここで、トーン成分符号
化器１８とスペクトル符号化器１９は、例えば、それぞ
れスケールファクタ処理部、ビット割り当て部、及び量
子化器を備えて構成される。次いで、ビットストリーム
マルチプレクサ２０は、ハフマン符号化器とマルチプレ
クサプレクサとを備えて構成され、トーン成分符号化器
１８とスペクトル符号化器１９において符号化された信
号を、複数個（例えば１４個）のハフマン表を用いて圧
縮し、次いで、符号化されたトーン成分と、符号化され
た非トーン成分と、利得検出部１４−１乃至１４−４か
ら出力された修正関数のデータ（詳細後述するように、
サブバンドフレーム内の位置のデータと、レベルに関す
るデータと、変化点の個数のデータとを含む。）を含む
ＡＴＲＡＣ３標準に従うサイド情報とを多重化すること
により、ＡＴＲＡＣ３のビットストリーム信号を得て出
力する。Referring again to FIG. 2, the MDCT processing unit 1
6-1 to 16-4 are gain controlled in each frequency band,
Further, a modified discrete cosine transform (MDCT) process is performed on at least two sub-band frame signals that are cascade-connected to a transform block for MDCT processing, and the resulting spectrum information signal is converted into tone components and The spectrum is output to the spectrum detector 17. The tone component / spectrum detector 17 separates the tone component signal and the non-tone spectrum signal from the MDCT-processed information signal, and separates them into the tone component encoder 1
8 and the spectrum coder 19 and the tone component coder 18 and the spectrum coder 19 in the subsequent stage are divided into a tone component signal and a non-tone spectrum signal, and individually perform coding processing such as quantization. do. Here, the tone component encoder 18 and the spectrum encoder 19 are each configured to include, for example, a scale factor processing unit, a bit allocation unit, and a quantizer. Next, the bit stream multiplexer 20 is configured to include a Huffman encoder and a multiplexer plexer, and a plurality of signals (for example, 14) are encoded by the tone component encoder 18 and the spectrum encoder 19. Of the Huffman table, and then the encoded tone component, the encoded non-tone component, and the data of the correction function output from the gain detection units 14-1 to 14-4 (details will be described later). like,
It includes position data in the subband frame, level data, and the number of change points. , And side information according to the ATRAC3 standard, which includes a), is multiplexed to obtain and output a bitstream signal of ATRAC3.

【００３７】図３は、図１のオーディオデコーダ６の詳
細構成を示すブロック図である。最初に、ビットストリ
ームデマルチプレクサ２１は、ミニディスク４等の記録
媒体、又は伝送媒体から読み出された後復調された、Ａ
ＴＲＡＣ３のビットストリーム信号を、トーン成分の符
号列と、スペクトルの符号列（非トーン成分の符号列）
と、修正関数のデータを含むサイド情報とに分離する。
次いで、トーン成分復号化器２２とスペクトル復号化器
２３はそれぞれ、各符号列をスペクトル係数に復号化
し、トーン成分及びスペクトル合成部２４は分離され復
号化されたトーン成分信号と非トーン性のスペクトル信
号とを合成する。次いで、逆ＭＤＣＴ処理部２５−１乃
至２５−４は、各周波数帯域ごとに逆ＭＤＣＴによって
時間領域のサブバンドフレームを生成し、次いで、逆利
得制御部２６−１乃至２６−４は、ビットストリームデ
マルチプレクサ２１から入力されたサイド情報内の修正
関数のデータに基づいて、各サブバンドフレームに対し
て利得制御とは逆の修正関数で逆利得制御し、すなわ
ち、利得制御によって増幅された部分を抑圧し、抑圧さ
れた部分を増幅し、逆利得制御された各バンドのサブバ
ンドフレームをＱＭＦ合成フィルタ２７に出力する。最
後に、ＱＭＦ合成フィルタ２７は各バンド（周波数帯
域）毎のサブバンドフレームを合成して、合成されたデ
ィジタルオーディオ信号を出力する。FIG. 3 is a block diagram showing the detailed structure of the audio decoder 6 shown in FIG. First, the bit stream demultiplexer 21 is read from a recording medium such as a mini disk 4 or a transmission medium and then demodulated, A
The bit stream signal of TRAC3 is a code string of tone components and a code string of spectra (code string of non-tone components)
And side information including the data of the correction function.
Next, the tone component decoder 22 and the spectrum decoder 23 respectively decode each code string into spectrum coefficients, and the tone component / spectrum synthesis unit 24 separates the decoded tone component signal and the non-tone spectrum. Combine with the signal. Next, the inverse MDCT processing units 25-1 to 25-4 generate subband frames in the time domain by inverse MDCT for each frequency band, and then the inverse gain control units 26-1 to 26-4 perform bitstream decoding. Based on the data of the correction function in the side information input from the demultiplexer 21, inverse gain control is performed on each subband frame by a correction function opposite to the gain control, that is, the portion amplified by the gain control is It suppresses, amplifies the suppressed part, and outputs the sub-band frame of each band subjected to inverse gain control to the QMF synthesis filter 27. Finally, the QMF synthesis filter 27 synthesizes the sub-band frames for each band (frequency band) and outputs the synthesized digital audio signal.

【００３８】サブバンドフレームの基準によって、サブ
バンドフレームに対してオーディオコーディングが実行
されるとき、現在のサブバンドフレームの開始部（begi
nning）と未来のサブバンドフレームの開始部において
発生する複数のアタック信号を考慮しなければならな
い。実際の処理の前に、それぞれ２５６個のオーディオ
サンプル信号を備えるサブバンドフレームは、３２個の
区分（パーティション：partition）に分割される。現
在のサブバンドフレームｎの開始部におけるアタック信
号を除去するために、前のサブバンドフレームｎ−１か
らの最後の８個の区分が調べられ、現在のサブバンドフ
レームｎの開始部（start）における信号と比較され
る。図５を参照すると、未来のサブバンドフレームｎ＋
１の第１の区分において発生することがあるアタック信
号を除去するために、修正関数計算部において、１個の
区分の時間の遅延が導入され、そのため、現在のサブバ
ンドフレームｎの最後の区分は、常に、未来のサブバン
ドフレームｎ＋１の最初の３１個の区分と共に処理され
る。上記遅延は、現在のサブバンドフレームｎの最後の
区分を、「擬似未来」の区分に変化させ、従って、この
遅延は利得制御を補助して、サブバンドフレームの境界
で発生するアタック信号に対する安全装置となる。すな
わち、修正関数計算部による１個の区分の時間の遅延
は、ＭＤＣＴ処理部１６−１乃至１６−４において少な
くとも２つのサブバンドフレーム毎にＭＤＣＴ処理する
サブバンドフレームが、ＱＭＦ分割フィルタ１０の出力
であるサブバンドフレームに比較して１個の区分だけ遅
延することを与えている。When audio coding is performed on a subband frame according to the subband frame reference, the beginning of the current subband frame (begi
nning) and multiple attack signals that occur at the beginning of future subband frames must be considered. Prior to the actual processing, the subband frame, which comprises 256 audio sample signals each, is divided into 32 partitions. To remove the attack signal at the beginning of the current subband frame n, the last eight partitions from the previous subband frame n-1 are examined and the start of the current subband frame n is started. Compared to the signal at. Referring to FIG. 5, future subband frame n +
In order to eliminate the attack signal that may occur in the first partition of 1, the modification function calculator introduces a time delay of one partition, so that the last partition of the current subband frame n Are always processed with the first 31 partitions of future subband frame n + 1. The delay changes the last section of the current subband frame n into a "pseudo-future" section, thus this delay assists gain control and is safer for attack signals occurring at subband frame boundaries. It becomes a device. That is, the time delay of one section by the correction function calculation unit is the output of the QMF division filter 10 for the subband frame to be MDCT processed at least every two subband frames in the MDCT processing units 16-1 to 16-4. Is delayed by one segment as compared with the subband frame.

【００３９】サブバンドフレームのシフト及び「擬似未
来」の区分の概念を示す図５において、物理的なサブバ
ンドフレームの参照記号４ａで示された部分は、ＭＤＣ
Ｔ処理するサブバンドフレームｎに対する「擬似未来」
の区分である。それは、物理的なサブバンドフレームｎ
から引き出され、物理的なサブバンドフレームｎ＋１が
取得されるまで、一時的にバッファメモリ１４ａ−１乃
至１４ａ−４に記憶されている。物理的なサブバンドフ
レームｎ＋１が取得されると、ＭＤＣＴ処理するサブバ
ンドフレームｎ＋１は図６に示すように構成され、物理
的なサブバンドフレームｎ＋１の３１番目の区分は、一
時的にバッファメモリ１４ａ−１乃至１４ａ−４に記憶
される。また、参照記号４ｂで示される、ＭＤＣＴ処理
するサブバンドフレームｎ＋１に含まれる「アタック信
号」は、ＭＤＣＴ処理するサブバンドフレームｎに対し
て、検出可能な状態になる。In FIG. 5 showing the concept of shift of the sub-band frame and division of the "pseudo future", the portion indicated by the reference symbol 4a of the physical sub-band frame is MDC.
"Pseudo-future" for subband frame n to be processed
It is a category of. It is a physical subband frame n
Are temporarily stored in the buffer memories 14a-1 to 14a-4 until the physical subband frame n + 1 is obtained. When the physical subband frame n + 1 is acquired, the subband frame n + 1 to be MDCT processed is configured as shown in FIG. 6, and the 31st section of the physical subband frame n + 1 is temporarily buffered by the buffer memory 14a. -1 to 14a-4. The “attack signal” included in the subband frame n + 1 to be MDCT-processed, which is indicated by the reference symbol 4b, is in a detectable state with respect to the subband frame n to be MDCT-processed.

【００４０】サブバンドフレーム内でアタック信号の開
始部分を区分に基づいて検出するために、区分に含まれ
る複数個のサンプル信号のピークの絶対値（以下、ピー
ク値MaxPeakと呼ぶ。）が、８個の先行する区分のピー
ク値MaxPeakと比較される。そのピーク値MaxPeakが、８
個の先行する区分のピーク値MaxPeakよりも、ある比率
だけまさるときのみ、それはアタック信号として類別さ
れる。アタック信号に先行する、以前の利得制御が実行
された位置までのすべてのサンプル信号は、増幅の対象
となる。上記増幅の程度は、上記比率から導出される。In order to detect the start portion of the attack signal in the subband frame based on the section, the absolute value of the peak of a plurality of sample signals included in the section (hereinafter referred to as peak value MaxPeak) is 8. The peak value MaxPeak of the preceding sections is compared. The peak value MaxPeak is 8
It is categorized as an attack signal only when it exceeds the peak value MaxPeak of the preceding sections by a certain ratio. All sample signals preceding the attack signal up to the position where the previous gain control was performed are subject to amplification. The degree of amplification is derived from the ratio.

【００４１】必要な条件が適合するとき、各アタック信
号には減衰が付随し、その減衰が開始する位置を本発明
では「リリースポイント」、又は「リリース信号」と呼
ぶ。以前のアタック信号の位置と、潜在的にリリース信
号と識別されうる区分（すなわち現在の区分）との間
で、それらの間の区分におけるピークの絶対値が探索さ
れる（以下では、このピークの絶対値を、ピーク値Inte
rModMaxPeakと呼ぶ。）。ピーク値InterModMaxPeakが、
現在の区分のピーク値MaxPeakを予め決められた比率だ
け超過するとき、減衰が実行される。減衰係数は、上記
ピーク値InterModMaxPeakの値と、上記ピーク値MaxPeak
の値とに従って設定される。さらに、減衰の実行は、以
前のアタック信号の発生の時間を条件とする。以前のア
タック信号と、現在の潜在的なリリースポイントとの間
の時間的な分離が設定された継続時間より大きいとき
は、リリース信号に係る減衰は開始されない。When the necessary conditions are met, each attack signal is accompanied by attenuation, and the position where the attenuation starts is called "release point" or "release signal" in the present invention. Between the position of the previous attack signal and the segment that could potentially be identified as the release signal (ie the current segment), the absolute value of the peak in the segment between them is sought (below this peak Absolute value, peak value Inte
Call it rModMaxPeak. ). The peak value InterModMaxPeak is
Damping is performed when the peak value MaxPeak of the current segment is exceeded by a predetermined ratio. The damping coefficient is the peak value InterModMaxPeak and the peak value MaxPeak.
And the value of. Furthermore, the execution of the attenuation is conditional on the time of the occurrence of the previous attack signal. If the temporal separation between the previous attack signal and the current potential release point is greater than the set duration, then the decay of the release signal is not started.

【００４２】任意の瞬間において計算されるすべての増
幅係数及び減衰係数（すなわち、利得係数）は、同一の
サブバンドフレームに対して以前に決定された複数の増
幅係数及び複数の減衰係数に付け加えられる。All amplification and attenuation factors (ie gain factors) calculated at any instant are added to the previously determined amplification factors and attenuation factors for the same subband frame. .

【００４３】オーバーフローを防止するために、本発明
は、すべての増幅係数及び減衰係数に最終的な調整を実
行する。それによって、ＭＤＣＴの処理の前に縦続接続
されるとき、次のサブバンドフレームの最初の増幅係数
が、以前に修正されたサブバンドフレームをオーバーフ
ローさせないことと、それぞれ後続する増幅係数が、そ
れらの以前のピーク値InterModMaxPeakをオーバーフロ
ーさせないこととが保証される。To prevent overflow, the present invention makes a final adjustment to all amplification and attenuation factors. Thereby, when cascaded before the processing of the MDCT, the first amplification factor of the next subband frame does not overflow the previously modified subband frame and each subsequent amplification factor is It is guaranteed not to overflow the previous peak value InterModMaxPeak.

【００４４】本発明に係る実施形態は、サブバンドフレ
ームの境界にまたがるアタック信号を処理するために、
サブバンドフレームの複数の区分をバッファメモリ１４
ａ−１乃至１４ａ−４に記憶して遅延させるステップ
と、上記アタック信号の開始部分を検出するステップ
と、上記アタック信号に先行する信号に適用される増幅
係数を決定するステップと、減衰が適用されるべき位置
を決定するステップと、上記減衰のための減衰係数を計
算するステップと、現在のサブバンドフレームと前のサ
ブバンドフレームとにおけるオーバーフローを防止する
ステップとを含む。The embodiment according to the present invention is for processing an attack signal that crosses the boundary of a subband frame,
The buffer memory 14 stores a plurality of sections of the subband frame.
a-1 to 14a-4 for storing and delaying, detecting the beginning of the attack signal, determining an amplification factor applied to the signal preceding the attack signal, and applying attenuation The steps include: determining a position to be performed, calculating an attenuation coefficient for the attenuation, and preventing overflow in the current subband frame and the previous subband frame.

【００４５】図７乃至図１０は、図２の修正関数計算部
１４ｂ−１乃至１４ｂ−４によって実行される修正関数
計算処理の第１の部分を示すフローチャートである。こ
の利得制御処理は、利得検出部１４ｂ−１乃至１４ｂ−
４が各周波数帯域のサブバンドフレームｎに対してそれ
ぞれ実行するものとして以下に説明される。7 to 10 are flow charts showing the first part of the correction function calculation processing executed by the correction function calculation units 14b-1 to 14b-4 of FIG. This gain control processing is performed by the gain detection units 14b-1 to 14b-
4 is performed below for each subband frame n of each frequency band.

【００４６】図７のステップＳ１において、まず、バッ
ファメモリ１４ａ−１乃至１４ａ−４から、次に処理す
べき処理対象のサブバンドフレームとその前後のサブバ
ンドフレームとを含む３つのサブバンドフレームのサン
プル信号を読み出して、上記処理対象のサブバンドフレ
ームのための修正関数を計算するために用いるサブバン
ドフレームのデータとする。なお、バッファメモリ１４
ａ−１乃至１４ａ−４には、フィルタバンク１３−１乃
至１３−４から符号化すべきオーディオサンプル信号が
順次時系列で入力されて格納されているものとする。次
いで、ステップＳ２において、２５６個のサンプル信号
を含む各サブバンドフレームを３２個の区分（０番目乃
至３１番目）に分割した後、ステップＳ２Ａにおいて、
各サブバンドフレームを１つの区分だけ遅延するように
シフトすることにより、図５に示すような「擬似未来」
の区分を処理に用いるサブバンドフレームに対して供給
する。In step S1 of FIG. 7, first, from the buffer memories 14a-1 to 14a-4, three sub-band frames including the sub-band frame to be processed next and the sub-band frames before and after the sub-band frame are processed. The sample signal is read and used as the data of the subband frame used to calculate the correction function for the subband frame to be processed. The buffer memory 14
It is assumed that the audio sample signals to be encoded from the filter banks 13-1 to 13-4 are sequentially input in time series and stored in a-1 to 14a-4. Then, in step S2, each subband frame including 256 sample signals is divided into 32 sections (0th to 31st), and then in step S2A,
By shifting each subband frame so that it is delayed by one section, a "pseudo future" as shown in FIG.
Is supplied to the subband frame used for processing.

【００４７】次いで、ステップＳ３において、次式を用
いて各区分に対する対数スケールのピーク値MaxPeakを
計算する。ここで、ｍａｘ（ｘ，ｙ）は、ｘ，ｙのうち
の大きいほうを選択する関数である。Next, in step S3, the peak value MaxPeak on the logarithmic scale for each section is calculated using the following equation. Here, max (x, y) is a function that selects the larger one of x and y.

【００４８】[0048]

【数１】MaxPeak［ｉ］＝ｌｏｇ_２（ｍａｘ（１，｜ｉ
番目の区分における８個のサンプル信号のピーク値
｜）），０≦ｉ≦３１## EQU1 ## MaxPeak [i] = log ₂ (max (1, | i
Peak values of eight sample signals in the second section |)), 0 ≦ i ≦ 31

【００４９】ステップＳ４において、修正前のサブバン
ドフレームのピーク値UnModFramePeakに、サブバンドフ
レームｎ内の最大のピーク値MaxPeak［ｉ］を設定す
る。ステップＳ５において、種々のループ変数を初期化
する。変化点の数adjust_numは、サブバンドフレームｎ
に対して修正が実行されるべき変化点（増幅及び減衰を
含む）の個数を示し、それは０で開始する。すなわち、
変化点の数adjust_numは、各バンドに対する利得の変化
点の数を示す３ビットのフィールドデータであり、その
最大値は７である。現在の区分と前の修正ポイントとの
間の最高のピーク値MaxPeakとして定義されたピーク値I
nterModMaxPeak［０］は、前のサブバンドフレームｎ−
１の最後のピーク値InterModMaxPeakに初期化される。
増幅又は減衰の変化点の変数を表すnum_mod_ptは、ゼロ
に初期化される。In step S4, the maximum peak value MaxPeak [i] in the subband frame n is set to the peak value UnModFramePeak of the subband frame before modification. In step S5, various loop variables are initialized. The number of change points adjust_num is the subband frame n
Indicates the number of change points (including amplification and attenuation) for which the correction should be performed, which starts at 0. That is,
The number of change points adjust_num is 3-bit field data indicating the number of change points of the gain for each band, and its maximum value is 7. Peak value I defined as the highest peak value MaxPeak between the current segment and the previous correction point
nterModMaxPeak [0] is the previous subband frame n−
The last peak value of 1 is initialized to InterModMaxPeak.
Num_mod_pt, which represents the variable of the change point of amplification or attenuation, is initialized to zero.

【００５０】ステップＳ６において、区分のインデック
スｊを第１の区分を示す値、すなわち０に初期化し、区
分のインデックスｊは、アルゴリズムが区分から区分へ
と移動するように、フローチャートの主ループにおいて
インクリメントされる。ステップＳ７において、処理中
の最新のピーク値MaxPeak［ｊ］を現在のピーク値Inter
ModMaxPeak［num_mod_pt］と比較して、大きいほうを選
択することにより、ピーク値InterModMaxPeak［num_mod
_pt］を更新する。In step S6, the partition index j is initialized to a value indicating the first partition, ie 0, and the partition index j is incremented in the main loop of the flowchart so that the algorithm moves from partition to partition. To be done. In step S7, the latest peak value MaxPeak [j] being processed is set to the current peak value Inter.
Compared with ModMaxPeak [num_mod_pt], select the larger one to obtain the peak value InterModMaxPeak [num_mod
_pt] is updated.

【００５１】ステップＳ８及びＳ１０の分岐条件はアタ
ック信号の基準を示す。次の区分であるｊ＋１番目の区
分が確実にアタック信号であるためには、ピーク値MaxP
eak［ｊ＋１］は、８（線形スケールでは２^８）と、
（１＋MaxPeak［ｊ］）とを超過しなければならない。
ステップＳ９及びＳ１０において、ｊ−１乃至ｊ−７番
目の区分のピーク値MaxPeakの最大値MaxValueを計算
し、ピーク値MaxPeak［ｊ＋１］がその８個の先行する
区分のすべてよりも１だけ大きいことを要求する条件を
さらに課す。ｊ＝０乃至ｊ＝７に対して、前のサブバン
ドフレームｎ−１の最後のいくつかの区分を用いて比較
する必要がある。ｊ＝３１に対して、ｊ＋１番目の区分
は、上記一時的に記憶するためのバッファメモリ１４ａ
−１乃至１４ａ−４中の区分を参照する。図６に、上述
された特別な場合が図示されている。上述のアタック信
号の条件（ステップＳ８及びＳ１０）がいずれも成立し
ないときは、利得制御処理のアルゴリズムは図８のステ
ップＳ１３のリリース基準に進む。The branching conditions of steps S8 and S10 indicate the reference of the attack signal. To ensure that the next division, the j + 1th division, is an attack signal, the peak value MaxP
eak [j + 1] is 8 (2 ^{8 on the} linear scale),
(1 + MaxPeak [j]) must be exceeded.
In steps S9 and S10, the maximum value MaxValue of the peak value MaxPeak of the j-1 to j-7th segment is calculated, and the peak value MaxPeak [j + 1] is larger than all of the eight preceding segments by one. Further impose conditions that require For j = 0 to j = 7, we need to compare using the last few partitions of the previous subband frame n-1. For j = 31, the (j + 1) th section is the buffer memory 14a for temporarily storing the above.
-1 to 14a-4. FIG. 6 illustrates the special case described above. If none of the conditions of the above-mentioned attack signal (steps S8 and S10) are satisfied, the algorithm of the gain control process proceeds to the release reference of step S13 in FIG.

【００５２】ステップＳ８及びＳ１０のアタック信号の
基準が満たされるとき、ｊ＋１番目の区分は、アタック
信号の開始部分と判断され、図８のステップＳ１１の式
を用いて、さまざまな変数が更新される。expoは、次式
によって修正のべき乗（修正関数のレベルを示す）とし
て定義される。When the attack signal criteria of steps S8 and S10 are met, the j + 1 th partition is determined to be the beginning of the attack signal and the various variables are updated using the equation of step S11 of FIG. . expo is defined as the power of correction (indicating the level of the correction function) by

【００５３】[0053]

【数２】 expo←（int）（MaxPeak［ｊ＋１］−MaxValue）[Equation 2] expo ← (int) (MaxPeak [j + 1] −MaxValue)

【００５４】ここで、（ｉｎｔ）は引数の整数値を示す
関数である。修正レベルalevcodeは、ｊ＋１番目の区分
よりも前かつ以前の修正ポイントの後のすべての区分に
適用される修正のべき乗として定義される。すなわち、
修正レベルalevcodeは、１つの利得変化に対する利得値
を示す４ビットのフィールドデータである。修正位置al
occodeは、修正が実行される最後の区分、すなわちその
位置まで上記修正が実行される区分のインデックス（す
なわち、ｊ）である。すなわち、修正位置aloccodeは、
１つの利得変化に対する位置を示す５ビットのフィール
ドデータである。インデックスlast_attack_ptは、アタ
ック信号の直前の区分のインデックスを追跡する（それ
はリリース信号を追跡しない）。変化点の数adjust_num
及び変化点の変数num_mod_ptはそれぞれ１ずつインクリ
メントされる。次に、ピーク値InterModMaxPeak［num_m
od_pt］はピーク値MaxPeak［ｊ＋１］に更新（初期化）
され、ｊ＋１番目の区分から前方へ、最高のピーク値Ma
xPeakを追跡することを開始する。ステップＳ１２にお
いて、アタック信号に対して計算された上記修正のべき
乗expoの値を、以前に計算されたすべての修正レベルal
evcodeに加算する。Here, (int) is a function indicating the integer value of the argument. The modification level alevcode is defined as the power of modification applied to all partitions before the j + 1th partition and after the previous modification point. That is,
The modification level alevcode is 4-bit field data indicating a gain value for one gain change. Correction position al
The occode is the index (i.e. j) of the last partition where the modification is performed, i.e. the partition where the modification is performed up to that position. That is, the corrected position aloccode is
It is 5-bit field data indicating the position for one gain change. The index last_attack_pt tracks the index of the partition immediately preceding the attack signal (it does not track the release signal). Number of change points adjust_num
And the variable num_mod_pt at the change point are incremented by one. Next, the peak value InterModMaxPeak [num_m
od_pt] is updated (initialized) to the peak value MaxPeak [j + 1]
And the highest peak value Ma from the j + 1th section forward
Start tracking xPeak. In step S12, the value of the power expo of the above correction calculated for the attack signal is set to all the previously calculated correction levels al.
Add to evcode.

【００５５】ステップＳ１３及びＳ１５は、リリース信
号の基準である。減衰が開始されるためには、最新のピ
ーク値InterModMaxPeak［num_mod_pt］は（１＋MaxPeak
［ｊ］）よりも大きくなければならず、アタック信号の
直後の区分のインデックスlast_attack_ptによって追跡
されたような、最後のアタック信号のポイントからのそ
の時間的な分離は３２個の区分の長さよりも小さくなけ
ればならない。条件が合うならば、ステップＳ１４及び
Ｓ１５において、ｊ＋１乃至ｊ＋７番目の区分のピーク
値MaxPeakの最大値MaxValueを計算し、ピーク値InterMo
dMaxPeak［num_mod_pt］を次の７個のピーク値MaxPeak
とさらに比較する。ピーク値InterModMaxPeak［num_mod
_pt］がそれらすべてを超過するときは、ステップＳ１
６において、ステップＳ１１と同様の変数の更新が後に
続く。初めに、検出されたリリース信号に対する修正の
べき乗expoが、次式を用いて計算される。Steps S13 and S15 are the reference of the release signal. In order to start the attenuation, the latest peak value InterModMaxPeak [num_mod_pt] is (1 + MaxPeak
[J]) and its temporal separation from the point of the last attack signal, as tracked by the index last_attack_pt of the partition immediately after the attack signal, is greater than the length of 32 partitions. Must be small. If the conditions are met, in steps S14 and S15, the maximum value MaxValue of the peak values MaxPeak of the j + 1 to j + 7th sections is calculated, and the peak value InterMo is calculated.
dMaxPeak [num_mod_pt] is the next 7 peak values MaxPeak
Compare further with. Peak value InterModMaxPeak [num_mod
_pt] exceeds all of them, step S1
At 6, the updating of variables similar to step S11 follows. First, the exponentiation of the correction for the detected release signal is calculated using the following equation:

【００５６】[0056]

【数３】expo←（int）（InterModMaxPeak［num_mod_p
t］−MaxValue）[Equation 3] expo ← (int) (InterModMaxPeak [num_mod_p
t] −MaxValue)

【００５７】ここで、ステップＳ１１に対して２つの顕
著な違いがある。まず、−４の下限値（フロア値）が修
正のべき乗expoの値に課される。また、いったんリリー
ス信号が発見されると、次のアタック信号が到来するま
でさらなる減衰を防止するために、その値をリセットす
るように、アタック信号の直後の区分のインデックスla
st_attack_ptから３２が減算される。このステップは、
処理されている信号の正確さを損なうような、リリース
信号の検出、すなわち、アタック信号の後で連続的に複
数のリリース信号を検出することを防止するようになっ
ている。また、修正レベルalevcodeは、上記計算された
修正のべき乗expoを代入され、修正位置aloccodeは、現
在の区分のインデックスｊを代入される。ピーク値Inte
rModMaxPeak［num_mod_pt］はピーク値MaxPeak［ｊ］を
代入されて、更新される。またステップＳ１２と同様
に、図９のステップＳ１７において、リリース信号に対
して計算された上記修正のべき乗expoの値を、前に計算
されたすべての修正レベルalevcodeに加算する。Here, there are two significant differences with respect to step S11. First, a lower limit value (floor value) of -4 is imposed on the value of exponentiation of the correction. Also, once a release signal is found, its index is reset immediately after the attack signal so that its value is reset to prevent further attenuation until the next attack signal arrives.
32 is subtracted from st_attack_pt. This step
It is intended to prevent the detection of release signals, i.e. the detection of multiple release signals in succession after the attack signal, which would impair the accuracy of the signal being processed. Further, the correction level alevcode is assigned the exponentiation of the correction calculated above, and the correction position aloccode is assigned the index j of the current section. Peak value Inte
rModMaxPeak [num_mod_pt] is updated by substituting the peak value MaxPeak [j]. Similarly to step S12, in step S17 of FIG. 9, the value of the exponentiation of the correction calculated for the release signal is added to all the correction levels alevcode calculated previously.

【００５８】ステップＳ１８において、現在の区分ｊが
最後の区分であるか否かがチェックされる。そうでない
ときは、アルゴリズムはステップＳ１９に進み、次の区
分を処理するためにループがステップＳ７に戻る前に、
区分のインデックスｊを１だけインクリメントする。そ
うでないときは、アルゴリズムはステップＳ２０に進
み、アタック信号の直後の区分のインデックスlast_att
ack_ptから３２を減算して、無効化する。In step S18, it is checked whether the current section j is the last section. Otherwise, the algorithm proceeds to step S19, before the loop returns to step S7 to process the next partition,
The partition index j is incremented by 1. If not, the algorithm proceeds to step S20, where the index last_att of the partition immediately after the attack signal.
Subtract 32 from ack_pt to invalidate.

【００５９】ステップＳ２１は、信号の振幅が３２７６
７（すなわち、２^１５−１）を上に超過するか、−３２
７６８（すなわち、−２^１５）よりも下に超過すること
を防止するために、すべての修正レベルalevcode［ｉ］
（０≦ｉ≦adjust_num）を調整するマルチステップの処
理である。それは、標準に適合させるために、修正レベ
ルalevcodeの値を−４と１１の間（両端を含む）に制限
する。また、それは、ＭＤＣＴ処理のために隣接したサ
ブバンドフレームと結合されるときに、結合されたピー
ク値InterModMaxPeakがオーバーフローすることを防止
するために、次式により、その値を制限する。In step S21, the signal amplitude is 3276.
7 (ie 2 ¹⁵ −1) above or −32
768 (i.e., ^{-2 15)} in order to prevent exceeding the below, any modification level alevcode [i]
This is a multi-step process for adjusting (0 ≦ i ≦ adjust_num). It limits the value of the modification level alevcode to between -4 and 11 (inclusive) in order to comply with the standard. It also limits the combined peak value InterModMaxPeak when it is combined with adjacent subband frames for MDCT processing, by the following equation, to prevent overflow.

【００６０】[0060]

【数４】alevcode［ｉ］←ｍｉｎ（alevcode［ｉ］，１
５−InterModMaxPeak［ｉ］）[Formula 4] alevcode [i] ← min (alevcode [i], 1
5-InterModMaxPeak [i])

【００６１】最後に、第１の修正レベルalevcode［０］
に対して、その値が、以前に修正されたサブバンドフレ
ームをオーバーフローさせてはならない。これは、便利
なことには、以前に修正されたサブバンドフレームのピ
ーク値であるPrevFramePeakを追跡しつづけることによ
って次式のように実行される。Finally, the first modification level alevcode [0]
, Its value must not overflow a previously modified subband frame. This is conveniently done by keeping track of PrevFramePeak, the peak value of the previously modified subband frame, as follows:

【００６２】[0062]

【数５】alevcode［０］←ｍｉｎ（alevcode［０］，１
５−PrevFramePeak）[Formula 5] alevcode [0] ← min (alevcode [0], 1
5-PrevFramePeak)

【００６３】ステップＳ２２において、修正されたサブ
バンドフレームのピーク値MaxPeakとして定義されるMod
FramePeakが計算される。上記値は、処理を通じて生成
されたピーク値InterModMaxPeakと、修正レベルalevcod
eとの対から以下のように計算できる。In step S22, Mod defined as the peak value MaxPeak of the modified subband frame
FramePeak is calculated. The above value is the peak value InterModMaxPeak generated through the process and the modification level alevcod.
It can be calculated from the pair with e as follows.

【００６４】[0064]

【数６】ModFramePeak←ｍａｘ（InterModMaxPeak
［ｉ］＋alevcode［ｉ］）（０≦ｉ≦adjust_num）[Equation 6] ModFramePeak ← max (InterModMaxPeak
[I] + alevcode [i]) (0 ≦ i ≦ adjust_num)

【００６５】ステップＳ２３において、標準の要求によ
り、−４乃至１１の範囲で広がっているすべての修正レ
ベルalevcodeに対して、＋４のオフセットを加算する。In step S23, an offset of +4 is added to all modification levels alevcode that are spread in the range of -4 to 11 according to the standard request.

【００６６】ステップＳ２４において、最後の修正レベ
ルalevcodeが４であるか否か、すなわち、利得１の変化
であるか否かがチェックされる。最後の修正レベルalev
codeが４でないときはステップＳ２６に進み、修正レベ
ルalevcodeが４であるときは、最後の変化点（修正ポイ
ント）の後に、ＡＴＲＡＣ３の標準が修正関数を１に補
間するので、ステップＳ２５において、最後の変化点
（修正ポイント）が冗長さを考慮され、除去される。次
いで、ステップＳ２６において、変化点の数adjust_num
の値がチェックされ、現在のサブバンドフレームに対し
て利得制御が実行されたるべきか否かが判断される。利
得制御が必要でないときは、ステップＳ２７において、
以前に修正されたサブバンドフレームのピーク値PrevFr
amePeak及びピーク値InterModMaxPeak［０］を、修正前
のサブバンドフレームのピーク値UnModFramePeak（ステ
ップＳ４を参照）に設定し、また、変化点の変数num_mo
d_ptを０に設定する。利得制御が必要なときは、ステッ
プＳ２８で、以前の修正されたサブバンドフレームのピ
ーク値PrevFramePeakは、修正されたサブバンドフレー
ムのピーク値MaxPeakであるModFramePeakに設定され
る。In step S24, it is checked whether or not the last correction level alevcode is 4, that is, whether or not it is a change in gain 1. Last modification level alev
When the code is not 4, the process proceeds to step S26, and when the modification level alevcode is 4, the standard of ATRAC3 interpolates the modification function to 1 after the last change point (correction point). The change points (correction points) of are considered for redundancy and removed. Next, in step S26, the number of change points adjust_num
Value is checked to determine if gain control should be performed for the current subband frame. If gain control is not necessary, in step S27,
Precorrected subband frame peak value PrevFr
The amePeak and peak value InterModMaxPeak [0] are set to the peak value UnModFramePeak (see step S4) of the subband frame before modification, and the variable num_mo of the change point is set.
Set d_pt to 0. When gain control is required, the peak value PrevFramePeak of the previously modified subband frame is set to ModFramePeak which is the peak value MaxPeak of the modified subband frame in step S28.

【００６７】サブバンドフレームｎのすべての区分が検
査されると、変化点の数adjust_num、修正レベルalevco
de、及び修正位置aloccodeに基づいて、利得制御が実行
される。上記の３つのデータは、図４において修正関数
ＭＦｎとして表され、また、サイド情報としてビットス
トリームマルチプレクサ２０に送られる。When all partitions of subband frame n have been examined, the number of change points adjust_num, modification level alevco
Gain control is executed based on de and the corrected position aloccode. The above three data are represented as a modification function MFn in FIG. 4, and are also sent to the bitstream multiplexer 20 as side information.

【００６８】次に、図の１０のステップＳ２９におい
て、処理対象のサブバンドフレームｎが、入力されたオ
ーディオ信号において最後のサブバンドフレームであっ
たか否かが判断される。サブバンドフレームｎが入力信
号の最後のサブバンドフレームでないときは（ステップ
Ｓ２９でＮＯ）、次のサブバンドフレームｎ＋１を次の
処理すべき処理対象のサブバンドフレームとするため
に、ステップＳ１に戻る。一方、サブバンドフレームｎ
が入力信号の最後のサブバンドフレームのとき（ステッ
プＳ２９でＹＥＳ）、修正関数計算処理を終了する。Next, in step S29 of FIG. 10, it is determined whether or not the subband frame n to be processed is the last subband frame in the input audio signal. When the subband frame n is not the last subband frame of the input signal (NO in step S29), the process returns to step S1 to set the next subband frame n + 1 as the next subband frame to be processed. . On the other hand, subband frame n
Is the last subband frame of the input signal (YES in step S29), the correction function calculation process ends.

【００６９】本発明の実施形態に係るオーディオ信号の
符号化方法及び装置は、オーディオ信号の変換符号化器
のために、サンプル信号中のアタック信号及びリリース
信号を検出し、続いてプリエコー及びポストエコーのア
ーティファクトを減少させることに効果的である。この
相対的に低い計算量及びメモリの要求は、低コストのＬ
ＳＩ実装のための優れた候補を提供する。An audio signal encoding method and apparatus according to an embodiment of the present invention detects an attack signal and a release signal in a sample signal, and subsequently detects a pre-echo and a post-echo for an audio signal conversion encoder. It is effective in reducing the artifacts of. This relatively low computational complexity and memory requirement is due to the low cost of L
Provides excellent candidates for SI implementation.

【００７０】以上の実施形態においては、ＭＤＣＴ処理
を行っているが、本発明はこれに限らず、種々の直交変
換処理を行ってもよい。Although MDCT processing is performed in the above embodiments, the present invention is not limited to this, and various orthogonal transformation processing may be performed.

【００７１】以上の実施形態においては、修正関数を計
算するときに３つのサブバンドフレームに基づいて計算
しているが、本発明はこれに限らず、少なくとも３つの
サブバンドフレーム、又はサブバンドに分割されていな
い少なくとも３つのフレームに基づいて計算してもよ
い。In the above embodiment, the correction function is calculated based on three subband frames, but the present invention is not limited to this, and at least three subband frames or subbands are calculated. It may be calculated based on at least three non-divided frames.

【００７２】以上の実施形態においては、修正関数を計
算する前に、３つのサブバンドフレームを１つの区分だ
け遅延させているが、本発明はこれに限らず、サブバン
ドフレームよりも時間が短い少なくとも１つの区分だけ
遅延させるように構成してもよい。In the above embodiment, the three subband frames are delayed by one segment before the correction function is calculated, but the present invention is not limited to this, and the time is shorter than the subband frame. It may be configured to delay by at least one segment.

【００７３】[0073]

【発明の効果】以上詳述したように本発明に係るオーデ
ィオ信号の符号化方法又は装置によれば、入力されたオ
ーディオ信号に基づいて修正関数を計算して、計算され
た修正関数に従って上記オーディオ信号に対して利得制
御し、上記利得制御されたオーディオ信号に対してフレ
ーム単位で直交変換処理を行いかつ符号化処理を行うこ
とにより符号化されたビットストリーム信号を得るオー
ディオ信号の符号化方法又は装置において、上記利得制
御することは、入力されたオーディオ信号をフレームの
時間よりも短い区分の時間に分割し、少なくとも１個の
区分の時間だけオーディオ信号を遅延させ、遅延された
オーディオ信号の処理すべきフレームとその前後のフレ
ームとを含む少なくとも３つのフレームに基づいて修正
関数を計算する。As described in detail above, according to the audio signal encoding method or apparatus of the present invention, a correction function is calculated based on an input audio signal, and the audio signal is calculated according to the calculated correction function. A method of encoding an audio signal, in which gain control is performed on a signal, orthogonal transform processing is performed on the gain-controlled audio signal in frame units, and encoding processing is performed to obtain an encoded bit stream signal, or In the device, the gain control is performed by dividing the input audio signal into time segments which are shorter than a frame time, delaying the audio signal by at least one segment time, and processing the delayed audio signal. A correction function is calculated based on at least three frames including the frame to be performed and the frames before and after it.

【００７４】従って、入力されたオーディオ信号を少な
くとも１個の区分の時間だけ遅延させることにより、い
わゆる「擬似未来」を含む、少なくとも３つのフレーム
に基づいて修正関数を計算するので、互いに隣接する２
つのフレームの境界にアタック信号が存在しても、正確
にアタック信号とリリース信号とを検出することがで
き、より精度の高い修正関数を得ることができる。これ
により、修正関数をより正確に生成して利得制御を実行
して、プリエコー雑音及びポストエコー雑音を確実に抑
圧することができる。Therefore, since the input audio signal is delayed by the time of at least one section, the correction function is calculated based on at least three frames including the so-called "pseudo-future", so that the adjacent two are adjacent to each other.
Even if an attack signal is present at the boundary between two frames, the attack signal and the release signal can be accurately detected, and a more accurate correction function can be obtained. As a result, the correction function can be generated more accurately, the gain control can be executed, and the pre-echo noise and the post-echo noise can be surely suppressed.

[Brief description of drawings]

【図１】本発明の一実施形態に係るミニディスク記録
再生システムの構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a mini disk recording / reproducing system according to an embodiment of the present invention.

【図２】図１のオーディオエンコーダ２の詳細構成を
示すブロック図である。FIG. 2 is a block diagram showing a detailed configuration of the audio encoder 2 of FIG.

【図３】図１のオーディオデコーダ６の詳細構成を示
すブロック図である。3 is a block diagram showing a detailed configuration of the audio decoder 6 of FIG.

【図４】ＭＤＣＴ処理部１５−１に先行する利得検出
部１４−１及び利得制御部１５−１における変換ブロッ
クの生成を示すブロック図である。FIG. 4 is a block diagram showing generation of a conversion block in a gain detection unit 14-1 and a gain control unit 15-1 preceding the MDCT processing unit 15-1.

【図５】サブバンドフレームのシフト及び「擬似未
来」の区分の概念を示す図である。FIG. 5 is a diagram showing the concept of subband frame shift and “pseudo future” division.

【図６】区分の番号ｊの異なる値に対してアタック信
号を決定することに寄与する複数の区分を示す図であ
る。FIG. 6 shows a plurality of partitions that contribute to determining attack signals for different values of partition number j.

【図７】図２の修正関数計算部１４ｂ−１乃至１４ｂ
−４によって実行される修正関数計算処理の第１の部分
を示すフローチャートである。FIG. 7 is a diagram showing correction function calculation units 14b-1 to 14b in FIG.
4 is a flowchart showing a first part of a correction function calculation process executed by -4.

【図８】図２の修正関数計算部１４ｂ−１乃至１４ｂ
−４によって実行される修正関数計算処理の第２の部分
を示すフローチャートである。FIG. 8 is a diagram showing correction function calculators 14b-1 to 14b in FIG.
4 is a flowchart showing a second part of the correction function calculation process executed by -4.

【図９】図２の修正関数計算部１４ｂ−１乃至１４ｂ
−４によって実行される修正関数計算処理の第３の部分
を示すフローチャートである。FIG. 9 is a diagram showing correction function calculators 14b-1 to 14b in FIG.
14 is a flowchart showing a third part of the correction function calculation process executed by -4.

【図１０】図２の修正関数計算部１４ｂ−１乃至１４
ｂ−４によって実行される修正関数計算処理の第４の部
分を示すフローチャートである。FIG. 10 is a schematic diagram of correction function calculation units 14b-1 to 14b of FIG.
It is a flowchart which shows the 4th part of the correction function calculation process performed by b-4.

【図１１】（ａ）利得制御をせず、プリエコーやポス
トエコーが発生するときの従来技術のオーディオ信号の
符号化及び復号化装置の構成を示すブロック図であり、
（ｂ）利得制御を行い、プリエコーやポストエコーを抑
圧するときの従来技術のオーディオ信号の符号化及び復
号化装置の構成を示すブロック図である。FIG. 11 (a) is a block diagram showing the configuration of a conventional audio signal encoding / decoding device when pre-echo and post-echo occur without gain control;
(B) is a block diagram showing a configuration of a conventional audio signal encoding / decoding apparatus for performing gain control and suppressing pre-echo and post-echo.

[Explanation of symbols]

１…Ａ／Ｄコンバータ、２…オーディオエンコーダ、３…ミニディスク記録装置、４…ミニディスク、５…ミニディスク再生装置、６…オーディオデコーダ、７…Ｄ／Ａコンバータ、１０…ＱＭＦ分割フィルタ、１１…ローパスフィルタ、１２…ハイパスフィルタ、１３−１乃至１３−４…フィルタバンク、１４−１乃至１４−４…利得検出部、１４ａ−１乃至１４ａ−４…バッファメモリ、１４ｂ−１乃至１４ｂ−４，１４ｂ−１ａ，１４ｂ−１
ｂ，１４ｂ−１ｃ，１４ｂ−１ｄ…修正関数計算部、１５−１乃至１５−４…利得制御部、１６−１乃至１６−４…ＭＤＣＴ処理部、１７…トーン成分及びスペクトル検出部、１８…トーン成分符号化器、１９…スペクトル符号化器、２０…ビットストリームマルチプレクサ、２１…ビットストリームデマルチプレクサ、２２…トーン成分復号化器、２３…スペクトル復号化器、２４…トーン成分及びスペクトル合成部、２５−１乃至２５−４…逆ＭＤＣＴ処理部、２６−１乃至２６−４…逆利得制御部、２７…ＱＭＦ合成フィルタ、ＣＯ１，ＣＯ２…縦続接続演算子、ＭＰ１ａ，ＭＰ１ｂ，ＭＰ１ｃ，ＭＰ２ａ，ＭＰ２ｂ，
ＭＰ２ｃ…乗算器。DESCRIPTION OF SYMBOLS 1 ... A / D converter, 2 ... Audio encoder, 3 ... Mini disc recording device, 4 ... Mini disc, 5 ... Mini disc reproducing device, 6 ... Audio decoder, 7 ... D / A converter, 10 ... QMF division filter, 11 ... low-pass filter, 12 ... high-pass filter, 13-1 to 13-4 ... filter bank, 14-1 to 14-4 ... gain detection section, 14a-1 to 14a-4 ... buffer memory, 14b-1 to 14b-4 , 14b-1a, 14b-1
b, 14b-1c, 14b-1d ... Correction function calculation unit, 15-1 to 15-4 ... Gain control unit, 16-1 to 16-4 ... MDCT processing unit, 17 ... Tone component and spectrum detection unit, 18 ... Tone component encoder, 19 ... Spectral encoder, 20 ... Bitstream multiplexer, 21 ... Bitstream demultiplexer, 22 ... Tone component decoder, 23 ... Spectral decoder, 24 ... Tone component and spectrum synthesis unit, 25-1 to 25-4 ... Inverse MDCT processing unit, 26-1 to 26-4 ... Inverse gain control unit, 27 ... QMF synthesis filter, CO1, CO2 ... Cascade operator, MP1a, MP1b, MP1c, MP2a, MP2b ，
MP2c ... Multiplier.

───────────────────────────────────────────────────── フロントページの続き (72)発明者スアホン・ネオシンガポール534415シンガポール、タイ・セン・アベニュー、ブロック1022、04− 3530番、タイ・セン・インダストリアル・エステイト、パナソニック・シンガポール研究所株式会社内 (72)発明者キムハン・クアシンガポール534415シンガポール、タイ・セン・アベニュー、ブロック1022、04− 3530番、タイ・セン・インダストリアル・エステイト、パナソニック・シンガポール研究所株式会社内Ｆターム(参考） 5D045 DA08 DA20 5J064 AA01 BA16 BB07 BC01 BC08 BC09 BC11 BC25 BD03 ─────────────────────────────────────────────────── ─── Continued front page (72) Inventor Suahone Neo Singapore 534415 Singapore, Thailand Sen Avenue, Block 1022, 04- No. 3530, Thai Sen Industrial Estate, Panasonic Singapore Research Institute Co., Ltd. (72) Inventor Kim Han Kua Singapore 534415 Singapore, Thailand Sen Avenue, Block 1022, 04- No. 3530, Thai Sen Industrial Estate, Panasonic Singapore Research Institute Co., Ltd. F-term (reference) 5D045 DA08 DA20 5J064 AA01 BA16 BB07 BC01 BC08 BC09 BC11 BC25 BD03

Claims

[Claims]

1. A step of calculating a correction function based on an input audio signal, and gain-controlling the audio signal according to the calculated correction function; and a frame unit for the gain-controlled audio signal. In the method of encoding an audio signal, the method comprises the step of performing an orthogonal transform process and an encoding process to obtain an encoded bitstream signal. Based on at least three frames including a frame to be processed and a frame to be processed of the delayed audio signal, the audio signal being delayed by at least one segment time A method for encoding an audio signal, characterized by calculating a correction function.

2. The step of controlling the gain comprises the steps of: (a) obtaining an absolute value MaxPeak of a peak of each section;
(B) a step of identifying an attack signal, which is a part of a signal including a sudden sound rise, in the audio signal by using a predetermined attack signal reference; and (c) a frame to be processed and a frame before and after the frame. Calculating a correction function based on at least three frames including a frame and gain control based on the calculated correction function, thereby identifying the identified section of the attack signal and the section that is the previous modified position. A step of processing the attack signal to amplify a plurality of sections between and, the reference of the attack signal is the MaxPea for the next section.
2. The audio signal coding method according to claim 1, wherein the attack signal is identified based on k and MaxPeak, which is the highest value among a predetermined number of partitions preceding the next partition.

3. The gain controlling step comprises: based on the at least three frames, applying the correction function to the MaxPeak relating to a next partition from a partition to be currently processed,
3. The audio signal encoding method according to claim 2, wherein the calculation is performed based on the maximum value MaxPeak in a predetermined number of sections preceding the section to be currently processed.

4. (a) Absolute value In of the peak between the segment to be currently processed and the segment which is the correction position before that.
terModMaxPeak, (b) using a predetermined release signal criterion, identifying a release signal that is a portion of the signal in the audio signal that includes a sharp fall of sound, and (c) processing By calculating the correction function based on at least three frames including the frame to be performed and the frames before and after it, and performing gain control based on the calculated correction function, the identified release signal segment and Processing the release signal by amplifying a plurality of sections between a section that is a modification position behind and the reference of the release signal is the latest InterModMaxPea.
Identify the release signal based on k, the highest value MaxPeak of the following predetermined number of partitions, and the time between the partition currently to be processed and the partition at which the last attack signal was identified. The audio signal encoding method according to claim 2 or 3, wherein

5. The modification function for processing the release signal is calculated based on the InterModMaxPeak and the highest value MaxPeak of the subsequent predetermined number of partitions. 4. The audio signal encoding method as described in 4.

6. A correction function is calculated based on the input audio signal, and gain control means is provided for the audio signal according to the calculated correction function, and frame units are provided for the gain-controlled audio signal. In the audio signal coding apparatus, which comprises means for performing an orthogonal transform process and for obtaining a coded bitstream signal by performing a coding process, the gain control means controls the input audio signal by a time of a frame. The audio signal is delayed by at least one segment time, and is corrected based on at least three frames including the frame to be processed and the frames before and after the delayed audio signal. An audio signal encoding device characterized by calculating a function.

7. The gain control means uses (a) means for obtaining the absolute value MaxPeak of the peak of each section, and (b) a predetermined attack signal reference, whereby a sudden sound in the audio signal is generated. Means for identifying an attack signal, which is a part of the signal including the rising edge of, and (c) a correction function is calculated based on at least three frames including the frame to be processed and the frames before and after it, and the calculated correction function is calculated. Controlling the gain of the attack signal to amplify a plurality of partitions between the identified partition of the attack signal and the partition that is the previous correction position. , The standard of the attack signal is the MaxPea according to the following categories.
7. The audio signal coding apparatus according to claim 6, wherein the attack signal is identified based on k and MaxPeak, which is the highest value among a predetermined number of partitions preceding the next partition.

8. The gain control means comprises at least the at least three gain control means.
Based on one frame, the correction function is based on the MaxPeak relating to the next division from the division to be currently processed and the maximum value MaxPeak in a predetermined number of divisions preceding the division to be currently processed. 8. The audio signal encoding apparatus according to claim 7, wherein the encoding is performed.

9. (a) Absolute value In of a peak between a segment to be currently processed and a segment which is a correction position earlier than that.
means for obtaining terModMaxPeak, (b) means for identifying a release signal which is a portion of a signal including a sudden fall of sound in the audio signal by using a predetermined release signal reference, and (c) processing Calculating a correction function based on at least three frames including a frame to be performed and a frame before and after the frame, and performing gain control based on the calculated correction function, thereby identifying the identified release signal;
Means for amplifying a plurality of sections between the section which is a modification position after that and processing the release signal, and the reference of the release signal is the latest InterModMaxPea.
Identify the release signal based on k, the highest value MaxPeak of the following predetermined number of partitions, and the time between the partition currently to be processed and the partition at which the last attack signal was identified. 9. The audio signal encoding device according to claim 7, wherein the audio signal encoding device is an audio signal encoding device.

10. A modification function for processing the release signal is calculated based on the InterModMaxPeak and the highest value MaxPeak of the subsequent predetermined number of partitions. 9. The audio signal encoding device according to item 9.

11. Any one of claims 6 to 10
An audio signal encoding and decoding system comprising: an audio signal encoding device and an audio signal decoding device, wherein the audio signal decoding device is encoded by the encoding device. Means for obtaining an audio signal composed of a plurality of frames by decoding the coded bit stream signal and performing orthogonal transformation processing; and a correction function opposite to the correction function for the obtained audio signal. And a means for performing inverse gain control by using the same to obtain and output an inversely gain-controlled audio signal, the audio signal encoding and decoding system.