JP6099602B2

JP6099602B2 - Information signal converter using duplicate conversion

Info

Publication number: JP6099602B2
Application number: JP2014158475A
Authority: JP
Inventors: シュネール、マルクス; ガイガー、ラルフ; ラヴェリ、エマニュエル; フォトポウロー、エレニ
Original assignee: フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン
Priority date: 2011-02-14
Filing date: 2014-08-04
Publication date: 2017-03-22
Anticipated expiration: 2032-02-14
Also published as: US20130064383A1; RU2012148250A; EP2550653B1; JP2014240973A; TWI564882B; PL2550653T3; CA2799343A1; BR112012029132A2; JP5712288B2; KR101424372B1; WO2012110478A1; US9536530B2; KR20130007651A; CN102959620B; AU2012217158A1; BR112012029132B1; MX2012013025A; HK1181541A1; RU2580924C2; AU2012217158B2

Description

本発明は重複変換を使用した情報信号変換装置に関し、詳しくは、例えばオーディオ圧縮技術で使用されるようなエイリアシング解消を必要とする情報信号の重複変換を使用した情報信号変換装置に関する。 The present invention relates to an information signal conversion apparatus using overlap conversion, and more particularly to an information signal conversion apparatus using overlap conversion of an information signal that needs to eliminate aliasing as used in, for example, an audio compression technique.

ほとんどの圧縮技術は、特定の種類の情報信号や、最大許容遅延や可能な送信ビットレートというような圧縮データストリームの特定の条件に合うように設計されている。例えば高いビットレートでスピーチではなく音楽を符号化する場合には、オーディオ圧縮に関して、ＡＡＣのような変換に基づくコーデックの方が、ＡＣＥＬＰのような線形予測に基づく時間領域コーデックよりも優れた性能を示す傾向がある。例えばＵＳＡＣは、様々なオーディオコーディング原理を一つのコーデックに統合することにより、非常に多様な応用場面に対応しようとするものである。しかし、その利点を利用して、例えばより高いコーディング効率を達成するためには、送信ビットレートの変更というような様々なコーディング条件への適応性をさらに上げることが好ましい。 Most compression techniques are designed to meet specific conditions of a particular type of information signal and compressed data stream such as maximum allowable delay and possible transmission bit rate. For example, when encoding music rather than speech at high bit rates, codecs based on transforms such as AAC have better performance than audio time compression based on time domain codecs based on linear prediction such as ACELP. There is a tendency to show. For example, the USAC intends to cope with a wide variety of application scenes by integrating various audio coding principles into one codec. However, in order to achieve higher coding efficiency by using the advantages, it is preferable to further improve adaptability to various coding conditions such as changing the transmission bit rate.

３ＧＰＰ「オーディオコーデック処理機能、拡張適応マルチレート−広帯域（ＡＭＲ−ＷＢ）コーデック、トランスコーディング機能」２００９年、３ＧＰＰＴＳ２６．２９０3GPP “Audio Codec Processing Function, Extended Adaptive Multirate-Wideband (AMR-WB) Codec, Transcoding Function” 2009, 3GPP TS 26.290 ＵＳＡＣコーデック（音声合成コーデック）、ＩＳＯ／ＩＥＣＣＤ２３００３−３、２０１０年９月２４日USAC codec (speech synthesis codec), ISO / IEC CD 23003-3, September 24, 2010

従って、本発明の目標は、重複変換表記を実際の要求に適合させることが可能となるように、エイリアシング解消を必要とする重複変換による情報信号の表記を可能にする情報信号変換装置を提示することにより、このような概念を提供することであり、これにより、より高いコーディング効率を達成することができるであろう。 Accordingly, an object of the present invention is to provide an information signal conversion apparatus that enables information signal notation by duplicate conversion that requires aliasing cancellation so that the duplicate conversion notation can be adapted to actual requirements. By providing such a concept, a higher coding efficiency could be achieved.

この目標は独立請求項の内容によって達成される。 This goal is achieved by the content of the independent claims.

本発明につながる主な思想は以下のようなものである。例えばレートと歪みの比の点で情報信号を効率的に符号化する際にプリステートを形成するために、情報信号の重複変換表記がしばしば使用される。このようなコーデックの例はＡＡＣやＴＣＸ等である。しかし重複変換表記はまた、変換と再変換を様々なスペクトル分解能で連結させることによりリサンプリングを実行するのに使用され得る。一般的に、情報信号の連続する時間領域のウィンドウバージョンの変換形の個々の再変換形の重複部分でエイリアシングが生じる重複変換表記は、重複変換表記をするために符号化されるべき変換係数レベルの個数が少なくなるという点で有利である。極端な状態では、重複変換は「じっくりとサンプリング」されている。つまり、情報信号の時間サンプルの個数に比較して、重複変換表記の係数の個数は増加しない。重複変換表記の一例は、ＭＤＣＴ（修正離散余弦変換）またはＱＭＦ（直交ミラーフィルター）フィルターバンクである。従って、情報信号を効率的に符号化する際に、このような重複変換表記をプリステートとして使用することが好ましい場合がしばしばある。しかし、情報信号が重複変換表記される際のサンプリングレートが、例えば可能な送信ビットレートまたは他の環境条件に適合するように時間変動可能であれば、これもまた好ましい。可能送信ビットレートが変動的であると仮定すると、例えば可能送信ビットレートが所定の閾値よりも下がった場合は常にサンプリングレートを下げることが好ましく、可能送信レートが再び上がった場合には、情報信号を重複変換表記するサンプリングレートが上昇可能であることが好ましい。悪いことに、重複変換表記の再変換の重複エイリアシング部分が、このようなサンプリングレート変更に対して障害を設けており、サンプリングレート変更の場合に重複変換表記を完全に遮断することによってのみ、この障害を打開することが可能であるように思われる。しかし、本発明の発明者たちは上述の問題に対する解決を実現し、これにより、エイリアシングと懸案のサンプリングレート変更を伴う重複変換表記の効率的な使用が可能となる。特に、補間によって、情報信号の先行領域及び／または後続領域が、これらの領域の境界でのサンプリングレート変更に従い、エイリアシング解消部分でリサンプリングされる。そして、結合装置は、エイリアシング解消部分でのリサンプリングにより得られるような先行領域の再変換と後続領域の再変換との境界で、エイリアシング解消を行うことができる。この方法により、サンプリングレート変更／推移時点での重複変換信号の中断を引き起こすことなく、サンプリングレート変更に対して効率的に対処することができる。重複変換を適切に生成するために、変換側での同様の方法も可能である。 The main idea that leads to the present invention is as follows. In order to form a pre-state when encoding an information signal efficiently in terms of, for example, a ratio of rate and distortion, an information signal overlap transform notation is often used. Examples of such codecs are AAC and TCX. However, duplicate transform notation can also be used to perform resampling by concatenating transform and retransform at various spectral resolutions. In general, a duplicate transform notation in which aliasing occurs in the overlapping part of individual retransformed forms of a continuous time domain window version of an information signal is the transform coefficient level to be encoded to represent the duplicate transform notation. This is advantageous in that the number of the is reduced. In extreme situations, duplicate conversions are “sampled carefully”. That is, compared to the number of time samples of the information signal, the number of coefficients in the overlap conversion notation does not increase. An example of an overlap transform notation is MDCT (Modified Discrete Cosine Transform) or QMF (Orthogonal Mirror Filter) filter bank. Therefore, it is often desirable to use such a duplicate transform notation as a pre-state when encoding information signals efficiently. However, this is also preferred if the sampling rate at which the information signal is represented in duplicate conversion can be time-varying, for example to match possible transmission bit rates or other environmental conditions. Assuming that the possible transmission bit rate is variable, it is preferable to reduce the sampling rate whenever the possible transmission bit rate falls below a predetermined threshold, for example, and if the possible transmission rate rises again, the information signal It is preferable that the sampling rate that represents the overlap conversion can be increased. Unfortunately, the duplicate aliasing part of the reconversion of the duplicate conversion notation poses an obstacle to such a sampling rate change, and only by completely blocking the duplicate conversion notation in the case of a sampling rate change. It seems possible to overcome obstacles. However, the inventors of the present invention have realized a solution to the above-mentioned problem, which allows efficient use of duplicate conversion notation with aliasing and pending sampling rate changes. In particular, the preceding region and / or the subsequent region of the information signal are resampled at the aliasing elimination portion according to the sampling rate change at the boundary between these regions by interpolation. The coupling device can cancel aliasing at the boundary between reconversion of the preceding region and reconversion of the subsequent region as obtained by resampling at the aliasing canceling portion. By this method, it is possible to efficiently cope with the sampling rate change without causing the interruption of the duplicate conversion signal at the sampling rate change / transition time. A similar method on the conversion side is also possible in order to properly generate the duplicate conversion.

上述の考えを用いて、オーディオ圧縮技術のような情報信号圧縮技術を提供することが可能であり、これらの技術は、符号化の環境条件の広範囲にわたって、例えば可能転送帯域幅全体にわたって、サンプリングレート変更そのものによる不利益を全く被ることなく、与えられたサンプリングレートをこれらの条件に適合させることにより、高い符号化効率を得ることができるものである。 Using the above considerations, it is possible to provide information signal compression techniques, such as audio compression techniques, which can be sampled over a wide range of encoding environmental conditions, eg, over the entire possible transfer bandwidth. High coding efficiency can be obtained by adapting a given sampling rate to these conditions without incurring any disadvantages due to the change itself.

本発明の利点は、従属請求項に記載の内容である。さらに、本発明の好ましい実施形態を、添付図面を参照し以下に説明する。 Advantages of the invention are the subject-matter of the dependent claims. Further preferred embodiments of the present invention will be described below with reference to the accompanying drawings.

本発明の実施形態が適用可能である情報信号エンコーダのブロック図を示す。1 shows a block diagram of an information signal encoder to which an embodiment of the present invention is applicable. 本発明の実施形態が適用可能である情報信号デコーダのブロック図を示す。1 shows a block diagram of an information signal decoder to which an embodiment of the present invention can be applied. FIG. 図１Ａのコアエンコーダの可能な内部構成のブロック図を示す。1B shows a block diagram of a possible internal configuration of the core encoder of FIG. 1A. 図１Ｂのコアデコーダの可能な内部構成のブロック図を示す。1B shows a block diagram of a possible internal configuration of the core decoder of FIG. 1B. 図１Ａのリサンプラーの可能な実施例のブロック図を示す。1B shows a block diagram of a possible embodiment of the resampler of FIG. 1A. 図１Ｂのリサンプラーの可能な内部構成のブロック図を示す。1B shows a block diagram of a possible internal configuration of the resampler of FIG. 1B. 本発明の実施形態が適用された情報信号エンコーダのブロック図を示す。1 shows a block diagram of an information signal encoder to which an embodiment of the present invention is applied. FIG. 本発明の実施形態が適用された情報信号デコーダのブロック図を示す。1 is a block diagram of an information signal decoder to which an embodiment of the present invention is applied. FIG. 本発明の一実施形態に係る情報信号再構築装置のブロック図を示す。1 shows a block diagram of an information signal reconstruction device according to an embodiment of the present invention. FIG. 本発明の一実施形態に係る情報信号変換装置のブロック図を示す。1 is a block diagram of an information signal conversion apparatus according to an embodiment of the present invention. 図５の情報信号再構築装置が使用された別の実施形態に係る情報信号エンコーダのブロック図を示す。FIG. 6 shows a block diagram of an information signal encoder according to another embodiment in which the information signal reconstruction device of FIG. 5 is used. 図５の情報信号再構築装置が使用された別の実施形態に係る情報信号デコーダのブロック図を示す。FIG. 6 shows a block diagram of an information signal decoder according to another embodiment in which the information signal reconstruction device of FIG. 5 is used. 本発明の一実施形態に係る図６の情報信号エンコーダ及びデコーダで発生するサンプリングレート変更を示す概略図である。FIG. 7 is a schematic diagram illustrating a sampling rate change that occurs in the information signal encoder and decoder of FIG. 6 according to an embodiment of the present invention.

以下に説明する本発明の実施形態の動機付けために、前もって、本願の実施形態が使用でき、以下で述べるような本発明と本願の実施形態の利点を明らかにする実施形態について議論する。 In order to motivate embodiments of the present invention described below, embodiments of the present application can be used in advance, and embodiments that clarify the advantages of the present invention and embodiments of the present application as described below will be discussed.

図１Ａ，１Ｂは、例えば、以下に説明する実施形態を有利に使用し得る一対のエンコーダとデコーダを示している。図１Ａはエンコーダを示し、図１Ｂはデコーダを示す。図１Ａの情報信号エンコーダ１０は、情報信号が入力される入力部１２と、リサンプラー１４と、コアエンコーダ１６とを含み、リサンプラー１４とコアエンコーダ１６は、エンコーダ１０の入力部１２と出力部１８との間で連続的に接続されている。出力部１８で、エンコーダ１０は入力部１２の情報信号を表すデータストリームを出力する。同様に、参照符号２０で示された図１Ｂのデコーダは、コアデコーダ２２とリサンプラー２４とを含み、コアデコーダ２２とリサンプラー２４は、図１Ｂに示されているように、デコーダ２０の入力部２６と出力部２８との間で連続的に接続されている。 1A and 1B show, for example, a pair of encoders and decoders that may advantageously use the embodiments described below. FIG. 1A shows an encoder, and FIG. 1B shows a decoder. 1A includes an input unit 12 to which an information signal is input, a resampler 14, and a core encoder 16. The resampler 14 and the core encoder 16 include an input unit 12 and an output unit of the encoder 10. 18 is connected continuously. In the output unit 18, the encoder 10 outputs a data stream representing the information signal of the input unit 12. Similarly, the decoder of FIG. 1B, indicated by reference numeral 20, includes a core decoder 22 and a resampler 24, which are the inputs of the decoder 20, as shown in FIG. 1B. The unit 26 and the output unit 28 are continuously connected.

出力部１８で出力されたデータストリームをデコーダ２０の入力部２６に送る際に可能な転送ビットレートが高い場合には、データストリーム内で高サンプリングレートで情報信号１２を表すことが符号化効率の点で好ましく、これにより、情報信号のスペクトルの広スペクトル帯域をカバーすることができる。つまり、レート／歪み比尺度のような符号化効率尺度によると、情報信号の低サンプリングレートでの圧縮と比較して、コアエンコーダ１６がそれよりも高いサンプリングレートで入力信号１２を圧縮する場合には、符号化効率が高くなることが示されている。一方、可能転送ビットレートのうちの低い方のビットレートでは、情報信号１２を低サンプリングレートで符号化する際に符号化効率尺度はより高くなり得る。この点に関して、歪みは心理音響的に動機づけられた方法で、つまり、知覚的にあまり関係のない周波数領域（それに対する人間の耳の感度が低い周波数領域）内の歪みよりも、知覚的により関連のある周波数領域内の歪みを集中的に考慮して測定してもよいことに留意すべきである。一般的に、低周波領域は高周波領域よりも知覚的に関連深い傾向があり、従って、低サンプリングレート符号化では、入力部１２での信号のナイキスト周波数よりも高い周波数成分は符号化の対象から除外される。しかし、その結果ビットレートの節約を得ることができるので、レート／歪み比の点で、この低サンプリングレート符号化は高サンプリングレート符号化よりも好ましいものとなり得る。低周波部分と高周波部分との間の歪みの重要性に関するこれに類似の矛盾は、測定信号などのような他の情報信号内にも存在する。 When the transfer bit rate that is possible when the data stream output from the output unit 18 is sent to the input unit 26 of the decoder 20 is high, the encoding signal can be expressed in a high sampling rate in the data stream. It is preferable at this point, and thereby, a wide spectrum band of the spectrum of the information signal can be covered. That is, according to a coding efficiency measure, such as a rate / distortion ratio measure, when the core encoder 16 compresses the input signal 12 at a higher sampling rate compared to compression of the information signal at a lower sampling rate. Indicates that the encoding efficiency is increased. On the other hand, at the lower of the possible transfer bit rates, the encoding efficiency measure may be higher when encoding the information signal 12 at a low sampling rate. In this regard, distortion is a psycho-acoustically motivated way, that is, perceptually more than distortion in a frequency domain that is less perceptually relevant (a frequency domain in which the human ear is less sensitive). It should be noted that the distortion in the relevant frequency domain may be measured intensively. In general, the low frequency region tends to be perceptually more relevant than the high frequency region. Therefore, in the low sampling rate encoding, a frequency component higher than the Nyquist frequency of the signal at the input unit 12 is determined from the encoding target. Excluded. However, as a result, bit rate savings can be obtained, and this low sampling rate encoding can be preferable to high sampling rate encoding in terms of rate / distortion ratio. Similar inconsistencies regarding the importance of distortion between the low and high frequency portions also exist in other information signals such as measurement signals.

従って、リサンプラー１４は情報信号１２をサンプリングする際のサンプリングレートを変更するためのものである。とりわけ出力部１８と入力部２６との間の可能転送ビットレートにより規定されるような外部転送条件に応じてサンプリングレートを適切に制御することにより、外部転送条件が時間と共に変化するにもかかわらず、エンコーダ１０は符号化効率を向上させることができる。そして、デコーダ２０はデータストリームを展開するコアデコーダ２２を含み、また、リサンプラー２４は、出力部２８で出力される再構築情報信号が再び一定のサンプリングレートを有するように処理する。 Therefore, the resampler 14 is for changing the sampling rate when the information signal 12 is sampled. In particular, by appropriately controlling the sampling rate according to the external transfer condition as defined by the possible transfer bit rate between the output unit 18 and the input unit 26, the external transfer condition changes with time. The encoder 10 can improve the encoding efficiency. The decoder 20 includes a core decoder 22 that develops a data stream, and the resampler 24 performs processing so that the reconstructed information signal output from the output unit 28 has a constant sampling rate again.

しかし、図１Ａ，１Ｂのエンコーダ／デコーダ対で重複変換が使用される場合には必ず問題が発生する。再変換の重複領域でエイリアシングが発生する重複変換表記は、効率的な符号化ツールではあるが、時間的エイリアシング解消を必要とするので、サンプリングレート変更の際に問題が発生する。例えば図２Ａ，２Ｂを参照して下さい。図２Ａ，２Ｂは、変換符号化タイプのものであると仮定した場合のコアエンコーダ１６とコアデコーダ２２の可能な実施例を示す。従って、コアエンコーダ１６は変換装置３０を含み、この後に圧縮装置３２が設けられている。図２Ｂのコアデコーダは展開装置３４を含み、この後に今度は再変換装置３６が設けられている。図２Ａ，２Ｂに関して、コアエンコーダ１６とコアデコーダ２２内には、他に何のモジュールも存在し得ないというように理解すべきではない。例えば、変換装置３０の前にフィルターがあってもよく、この場合、変換装置３０はリサンプラー１４によって与えられたリサンプル情報信号そのままではなく、事前にフィルタリングされた形のものを変換することになる。同様に、再変換装置３６の後に、逆変換関数を有するフィルターがあってもよく、この場合、再変換信号は続いて逆フィルタリングされることになる。 However, problems always arise when duplicate transforms are used in the encoder / decoder pairs of FIGS. 1A and 1B. Although the overlap conversion notation in which aliasing occurs in the overlap region of the reconversion is an efficient encoding tool, it requires time aliasing cancellation, which causes a problem when changing the sampling rate. For example, see Figures 2A and 2B. 2A and 2B show possible embodiments of the core encoder 16 and the core decoder 22 assuming that they are of the transform coding type. Therefore, the core encoder 16 includes a conversion device 30, and a compression device 32 is provided after this. The core decoder of FIG. 2B includes a decompressor 34, followed by a retransformer 36. 2A and 2B, it should not be understood that there can be no other modules in the core encoder 16 and the core decoder 22. For example, there may be a filter in front of the conversion device 30, in which case the conversion device 30 converts the pre-filtered form rather than the resampled information signal provided by the resampler 14. Become. Similarly, there may be a filter with an inverse transform function after the retransformer 36, in which case the retransformed signal will be subsequently inverse filtered.

圧縮装置３２は、変換装置３０によって出力された重複変換表記を、ハフマン符号化または算術符号化のような例を含むエントロピーコーディングのようなロスレスコーディングを使用して圧縮し、展開装置３４は、再変換装置３６へ送られるべき重複変換表記を得るために、例えばハフマン復号または算術復号のようなエントロピーデコーディングにより、まさに逆の処理つまり展開を行う。 The compression device 32 compresses the overlapped transformation notation output by the transformation device 30 using lossless coding such as entropy coding, including examples such as Huffman coding or arithmetic coding, and the decompression device 34 In order to obtain the duplicate conversion notation to be sent to the conversion device 36, the exact reverse process or expansion is performed, for example by entropy decoding such as Huffman decoding or arithmetic decoding.

図２Ａ，２Ｂに示した変換符号化環境において、リサンプラー１４がサンプリングレートを変更するたびに問題が発生する。この問題は、情報信号１２が存在する符号化側においてはあまり深刻ではない。従って、変換装置３０には、サンプリングレート変更の瞬間を跨いでも、それぞれの領域のウィンドウバージョンを使用したそれぞれの変換のために継続的にサンプリングされた領域が与えられる。変換装置３０の可能な実施形態を、図６を参照して以下に説明する。一般的に、変換装置３０には情報信号の先行領域のウィンドウバージョンが現在のサンプリングレートで与えられ、その後、リサンプラー１４によって、情報信号の次の部分的重複領域が変換装置３０に与えられ、そして、情報信号のウィンドウバージョンの変換形が変換装置３０によって生成される。必要な時間的エイリアシング解消処理は変換装置３０よりもむしろ再変換装置３６において行われなければならないので、さらに追加の問題は起こらない。しかし再変換装置３６においては、前述のすぐ後に続く領域の再変換形は様々な異なるサンプリングレートに関連するので、再変換装置３６は時間的エイリアシング解消を実行することができないという点で、サンプリングレートの変更が問題を発生させる。以下に説明する実施形態はこれらの問題を解決するものである。これらの実施形態によると、再変換装置３６は、以下に記載のような情報信号再構築装置に置き代えられてもよい。 In the transform coding environment shown in FIGS. 2A and 2B, a problem occurs every time the resampler 14 changes the sampling rate. This problem is not so serious on the encoding side where the information signal 12 exists. Accordingly, the conversion device 30 is provided with continuously sampled regions for each conversion using the window version of each region, even across the instant of changing the sampling rate. A possible embodiment of the conversion device 30 is described below with reference to FIG. In general, the converter 30 is given a window version of the preceding region of the information signal at the current sampling rate, and then the resampler 14 gives the next partial overlap region of the information signal to the converter 30; Then, the conversion form of the window version of the information signal is generated by the conversion device 30. Since the necessary temporal aliasing elimination process must be performed in the reconversion device 36 rather than in the conversion device 30, no additional problems occur. However, in the reconversion unit 36, the reconversion unit 36 cannot perform temporal aliasing elimination because the reconversion form of the region immediately following is associated with a variety of different sampling rates. Changes cause problems. The embodiment described below solves these problems. According to these embodiments, the reconversion device 36 may be replaced by an information signal reconstruction device as described below.

しかし図１Ａ，１Ｂに関して説明した環境においては、コアエンコーダ１６とコアデコーダ２２が変換符号化タイプのものである場合にのみ、問題が発生する。より正確には、リサンプラー１４と２４をそれぞれ形成する重複変換に基づくフィルターバンクを使用した場合にも問題は発生する。例えば図３Ａ，３Ｂを参照して下さい。図３Ａ，３Ｂは、リサンプラー１４，２４を実現するための具体的な一実施形態を示している。図３Ａ，３Ｂの実施形態によると、どちらのリサンプラーも、解析フィルターバンク３８，４０とその後に配置された合成フィルターバンク４２，４４とをそれぞれ連結することにより形成されている。図３Ａ，３Ｂに示されているように、解析及び合成フィルターバンク３８〜４４は、ＱＭＦフィルターバンク、つまり情報信号を事前に分解し、そして再び信号を結合するためのＱＭＦを使用したＭＤＣＴに基づくフィルターバンクとして実施してもよい。このＱＭＦは、１０個のブロック重複している（１０個は単に一例である）マルチチャンネル変調フィルターバンクを意味するＭＰＥＧＨＥ−ＡＡＣまたはＡＡＣ−ＥＬＤのＳＢＲ部分で使用されているＱＭＦと同様に実施されてもよい。このように、重複変換表記は解析フィルターバンク３８，４０によって生成され、合成フィルターバンク４２，４４で、リサンプリングされた信号がこの重複変換表記から再構築される。サンプリングレート変更を可能にするために、合成フィルターバンク４２と解析フィルターバンク４０は様々な変換長で動作するよう構成されていてもよい。しかし、フィルターバンクまたはＱＭＦのレート、つまり、一方では解析フィルターバンク３８，４０それぞれによって連続的な変換形が生成され、他方では合成フィルターバンク４２，４４それぞれによって再変換が行われるレートは一定であり、全ての素子３８〜４０に関して同じである。変換長の変更は、しかし、サンプリングレートの変更をもたらす。例えば、解析フィルターバンク３８と合成フィルターバンク４２の対を考えてみる。解析フィルターバンク３８は、一定の変換長と一定のフィルターバンクまたは変換レートを使用して動作するものと仮定する。この場合、解析フィルターバンク３８によって出力される入力信号の重複変換表記は、入力信号の、連続重複し、一定のサンプル長さを有する領域のそれぞれに関して、それぞれの領域のウィンドウバージョンの変換形を含み、これらの変換形も一定長さを有する。換言すれば、解析フィルターバンク３８は一定の時間／周波数分解能のスペクトログラムを合成フィルターバンク４２へ送る。しかし、合成フィルターバンクの変換長は変動する。例えば、解析フィルターバンク３８の入力部での入力サンプリングレートと合成フィルターバンク４２の出力部での信号出力サンプリングレートとの間で、第１のダウンサンプリングレートから第２のダウンサンプリングレートに下げる場合を考える。第１のダウンサンプリングレートが有効である限り、解析フィルターバンク３８によって出力された重複変換表記またはスペクトログラムは単に部分的に使用され、合成フィルターバンク４２内で再変換をもたらす。合成フィルターバンク４２の再変換は、解析フィルターバンク３８のスペクトログラム内で連続する変換形の低周波部分に単純に適用される。合成フィルターバンク４２の再変換に使用される変換長が短いために、合成フィルターバンク４２の再変換形内のサンプル数もまた、それまでフィルターバンク３８での変換の対象となっていた重複時間部分でのサンプル数よりも少なく、これにより、解析フィルターバンク３８の入力部に入力された情報信号のオリジナルのサンプリングレートに比べて低いサンプリングレートとなる。ダウンサンプリングレートが一定に保たれる限り、合成フィルターバンク４２が、連続する再変換形の間の重複部分と、フィルターバンク４２の出力部での出力信号の連続重複領域とで、時間的エイリアシング解消を実行することに何の問題もないままなので、何も問題は起こらないだろう。 However, in the environment described with reference to FIGS. 1A and 1B, a problem occurs only when the core encoder 16 and the core decoder 22 are of the transform coding type. More precisely, problems also arise when using filter banks based on duplicate transformations forming resamplers 14 and 24, respectively. For example, see Figures 3A and 3B. 3A and 3B show a specific embodiment for realizing the resamplers 14 and 24. According to the embodiment of FIGS. 3A and 3B, both resamplers are formed by connecting the analysis filter banks 38 and 40 and the synthesis filter banks 42 and 44 disposed thereafter, respectively. As shown in FIGS. 3A and 3B, analysis and synthesis filter banks 38-44 are based on QMF filter banks, ie MDCT using QMF to pre-decompose information signals and recombine the signals. You may implement as a filter bank. This QMF is implemented in the same way as the QMF used in the SBR portion of MPEG HE-AAC or AAC-ELD, which means a multi-channel modulation filter bank with 10 blocks overlapping (10 is just an example) May be. In this way, duplicate conversion notations are generated by the analysis filter banks 38, 40, and the resampled signals are reconstructed from the duplicate conversion notations in the synthesis filter banks 42, 44. In order to enable changing the sampling rate, the synthesis filter bank 42 and the analysis filter bank 40 may be configured to operate with various conversion lengths. However, the rate of the filter bank or QMF, that is, the rate at which a continuous conversion form is generated by the analysis filter banks 38 and 40 on the one hand and the re-conversion is performed by the synthesis filter banks 42 and 44 on the other hand, is constant. The same for all elements 38-40. Changing the transform length, however, results in changing the sampling rate. For example, consider a pair of analysis filter bank 38 and synthesis filter bank 42. It is assumed that the analysis filter bank 38 operates using a constant conversion length and a constant filter bank or conversion rate. In this case, the overlap conversion notation of the input signal output by the analysis filter bank 38 includes, for each of the regions of the input signal that are continuously overlapping and having a certain sample length, the conversion form of the window version of the respective region. These conversion forms also have a certain length. In other words, the analysis filter bank 38 sends a spectrogram with a constant time / frequency resolution to the synthesis filter bank 42. However, the conversion length of the synthesis filter bank varies. For example, a case where the first down-sampling rate is lowered to the second down-sampling rate between the input sampling rate at the input unit of the analysis filter bank 38 and the signal output sampling rate at the output unit of the synthesis filter bank 42. Think. As long as the first downsampling rate is valid, the duplicate transform notation or spectrogram output by the analysis filter bank 38 is only partially used, resulting in retransformation within the synthesis filter bank 42. The retransformation of the synthesis filter bank 42 is simply applied to the low frequency part of the transform form that is continuous in the spectrogram of the analysis filter bank 38. Since the conversion length used for re-conversion of the synthesis filter bank 42 is short, the number of samples in the re-conversion form of the synthesis filter bank 42 is also the overlap time portion that has been subject to conversion in the filter bank 38 so far. Thus, the sampling rate is lower than the original sampling rate of the information signal input to the input part of the analysis filter bank 38. As long as the downsampling rate is kept constant, the synthesis filter bank 42 eliminates temporal aliasing between the overlap between successive reconversion forms and the continuous overlap region of the output signal at the output of the filter bank 42. There will be no problems in running, so no problems will occur.

ダウンサンプリングレートが第１のダウンサンプリングレートからそれよりも高い第２のダウンサンプリングレートに変更される場合には、常に問題が発生する。この場合、合成フィルターバンク４２内の再変換で使用される変換長はさらに縮小され、それにより、このサンプリングレート変更時点よりも後のそれぞれの領域に関しては、さらに低いサンプリングレートとなる。このサンプリングレート変更時点の直前の領域に関する再変換と、このサンプリングレート変更時点の直後にリサンプリングされた領域に関する再変換との間での時間的エイリアシング解消が妨げられるので、ここでも、合成フィルターバンク４２にとって問題が発生する。従って、変換長可変の解析フィルターバンク４０が変換長一定の合成フィルターバンク４４の前に備えられている場合には、デコーディングの側でこのような問題は起こらないという考えは、あまり助けにはならない。ここで、合成フィルターバンク４４は、様々な異なる周波数分解能ではない一定のＱＭＦ／変換レートのスペクトログラムに、つまり、解析フィルターバンク４０から合成フィルターバンク４４へ異なるまたは時間変動の変換長ではなく、一定のレートで送られた連続する変換形に適合し、全体変換長の高周波部分を０にして、合成フィルターバンク４４の全体変換長の低周波部分を保つ。合成フィルターバンク４４の出力部で出力される再構築信号のサンプリングレートは一定のサンプリングレートであるので、合成フィルターバンク４４によって出力された連続する再変換形の間の時間的エイリアシング解消は問題ではない。 Problems always arise when the downsampling rate is changed from the first downsampling rate to a higher second downsampling rate. In this case, the conversion length used for the re-conversion in the synthesis filter bank 42 is further reduced, so that a lower sampling rate is obtained for each region after the sampling rate change time. Again, this eliminates temporal aliasing between the reconversion for the region immediately before the change in sampling rate and the reconversion for the region resampled immediately after the change in sampling rate. Problems arise for 42. Therefore, if the analysis filter bank 40 with variable conversion length is provided in front of the synthesis filter bank 44 with constant conversion length, the idea that such a problem does not occur on the decoding side is not very helpful. Don't be. Here, the synthesis filter bank 44 has a constant QMF / conversion rate spectrogram that is not of various different frequency resolutions, i.e., a constant or different conversion length from the analysis filter bank 40 to the synthesis filter bank 44 instead of a different conversion length. The high frequency part of the total conversion length is set to 0, and the low frequency part of the total conversion length of the synthesis filter bank 44 is maintained. Since the sampling rate of the reconstructed signal output from the output unit of the synthesis filter bank 44 is a constant sampling rate, it is not a problem to eliminate temporal aliasing between successive reconversion forms output by the synthesis filter bank 44. .

このように、ここでも、図１Ａ，１Ｂに関して上述したようなサンプリングレートの変更／適合を実行しようとする際に問題があるが、これらの問題は、情報信号の再構築装置のための以下に説明する実施形態に従い図３Ａの逆フィルターバンクまたは合成フィルターバンク４２を実施することにより、克服できる。 Thus, again, there are problems in trying to perform the sampling rate change / adaptation as described above with respect to FIGS. 1A and 1B, but these problems are described below for the information signal reconstruction apparatus. This can be overcome by implementing the inverse or synthetic filter bank 42 of FIG. 3A according to the described embodiment.

サンプリングレートの適合／変更に関する上記の考えは、符号化されるべき情報信号の高周波部分がそれに従うパラメトリック手法で（例えば、その信号の低周波部分が変換符号化及び／または予測符号化などを使用して符号化されるスペクトル帯域複製（ＳＢＲ）を使用して）符号化されるコーディング概念を考慮すると、より興味深くなる。例えば情報信号エンコーダと情報信号デコーダの対を示す図４Ａ，４Ｂを参照して下さい。エンコーディング側では、コアエンコーダ１６が、図３Ａに示すように実施されるリサンプラー、つまり解析フィルターバンク３８と変換長可変の合成フィルターバンク４２の連結によって実施されるリサンプラーの後に続く。上述のように、解析フィルターバンク３８の入力と合成フィルターバンク４２の出力との間の時間変動ダウンサンプリングレートを達成するために、合成フィルターバンク４２は一定領域のスペクトルの一部（合成フィルターバンク４２の変換長の時間変動長を有する部分）に対して再変換、つまり、解析フィルターバンク３８によって出力された一定長と一定変換レート４６の変換を行う。時間変動は両矢印によって示されている。解析フィルターバンク３８と合成フィルターバンク４２の連結によってリサンプリングされた低周波部分５０はコアエンコーダ１６によって符号化されるが、残りの部分、つまりスペクトル４６の残りの周波数部分である高周波部分５２は、パラメトリックエンベロープコーダー５４でのエンベロープのパラメトリック符号化の対象とされてもよい。このように、コアデータストリーム５６は、パラメトリックエンベロープコーダー５４によって出力されたパラメトリック符号化データストリーム５８を伴う。 The above idea of adapting / changing the sampling rate is a parametric approach that the high frequency part of the information signal to be encoded follows (for example, the low frequency part of the signal uses transform coding and / or predictive coding, etc. It is more interesting to consider the coding concept encoded using spectral band replication (SBR). For example, see Figures 4A and 4B, which show a pair of information signal encoder and information signal decoder. On the encoding side, the core encoder 16 follows a resampler implemented as shown in FIG. 3A, that is, a resampler implemented by concatenating the analysis filter bank 38 and the variable conversion length synthesis filter bank 42. As described above, in order to achieve a time-varying downsampling rate between the input of the analysis filter bank 38 and the output of the synthesis filter bank 42, the synthesis filter bank 42 is a portion of the spectrum in a certain region (the synthesis filter bank 42). The portion having the time variation length of the conversion length) is reconverted, that is, conversion between the constant length output by the analysis filter bank 38 and the constant conversion rate 46 is performed. Time variation is indicated by double arrows. The low frequency portion 50 resampled by the concatenation of the analysis filter bank 38 and the synthesis filter bank 42 is encoded by the core encoder 16, but the remaining portion, that is, the high frequency portion 52 which is the remaining frequency portion of the spectrum 46, It may be a target of parametric encoding of the envelope by the parametric envelope coder 54. Thus, the core data stream 56 is accompanied by a parametric encoded data stream 58 output by the parametric envelope coder 54.

デコーディング側では、デコーダも同様にコアデコーダ２２を含み、この後に、図３Ｂに示されているようなリサンプラー、つまり、解析フィルターバンク４０とそれに続く合成フィルターバンク４４から成り、解析フィルターバンク４０は、エンコーディング側での合成フィルターバンク４２の変換長の時間変動に同期した時間変動変換長を有するリサンプラーが続く。コアデコーダ２２はコアデータストリーム５６を受信しそれを復号するが、パラメトリックデータストリーム５８を受信し、そこから高周波部分５２’を導き出し、可変変換長の低周波部分５０、つまりエンコーディング側で合成フィルターバンク４２によって使用された変換長の時間変動に同期し、コアデコーダ２２によって出力されたサンプリングレートの変動に同期した変換長の低周波部分５０を完全なものとするために、パラメトリックエンベロープデコーダ６０が設置されている。 On the decoding side, the decoder also includes a core decoder 22, followed by a resampler as shown in FIG. 3B, ie, an analysis filter bank 40 followed by a synthesis filter bank 44. Is followed by a resampler having a time variation conversion length synchronized with the time variation of the conversion length of the synthesis filter bank 42 on the encoding side. The core decoder 22 receives and decodes the core data stream 56, but receives the parametric data stream 58, derives a high frequency portion 52 'therefrom, and combines the variable transform length low frequency portion 50, ie, the synthesis filter bank at the encoding side. The parametric envelope decoder 60 is installed to complete the low frequency portion 50 of the transform length synchronized with the variation of the sampling rate output by the core decoder 22 in synchronism with the time variation of the transform length used by 42. Has been.

図４Ａのエンコーダの場合、解析フィルターバンク３８が存在していることが利点であり、これにより、リサンプラーの形成のためには単に合成フィルターバンク４２を追加するだけでよい。サンプリングレートの切り換えにより、スペクトル４６の低周波部分（低周波部分は、単にパラメトリックエンベロープコーディングの対象となる高周波部分と比較して、より正確なコアエンコーディングの対象となる）の割合を適合させることが可能である。特に、この割合は、データストリーム全体を送信するための可能送信帯域幅などのような外部条件に応じて、効率的に制御してもよい。エンコーディング側で制御される時間変動は、例えばそれぞれのサイド情報データによって、デコーディング側に簡単に信号伝達できる。 In the case of the encoder of FIG. 4A, it is an advantage that an analysis filter bank 38 is present, so that a synthesis filter bank 42 may simply be added to form a resampler. By switching the sampling rate, the proportion of the low frequency part of the spectrum 46 (the low frequency part is subject to more accurate core encoding compared to the high frequency part which is simply subject to parametric envelope coding) can be adapted. Is possible. In particular, this ratio may be efficiently controlled according to external conditions such as possible transmission bandwidth for transmitting the entire data stream. Time variations controlled on the encoding side can be easily signaled to the decoding side, for example, by the respective side information data.

このように、図１Ａ〜４Ｂは、時間的エイリアシング解消が必要となる重複変換表記を使用しているにもかかわらず、サンプリングレートの変更を事実上可能にする概念を持っていることが好ましいということを示している。図５は、合成フィルターバンク４２または図２Ｂの再変換装置３６を実施するために使用される場合には、上述のような問題を克服し、既に述べたようにサンプリングレート変更などの利点を活用することができる情報信号再構築装置の一実施形態を示す。 Thus, it is preferable that FIGS. 1A to 4B have a concept that makes it possible to change the sampling rate practically, even though it uses a duplicate conversion notation that requires elimination of temporal aliasing. It is shown that. FIG. 5 overcomes the above-described problems when used to implement the synthesis filter bank 42 or the re-conversion device 36 of FIG. 2B and takes advantage of the sampling rate change as described above. 1 shows an embodiment of an information signal reconstruction device that can be used.

図５に示された情報信号再構築装置は再変換装置７０とリサンプラー７２と結合装置７４を含み、これらは、この順番に、情報信号再変換装置８０の入力部７６と出力部７８との間に連続的に接続されている。 The information signal reconstructing device shown in FIG. 5 includes a reconverter 70, a resampler 72, and a combiner 74, which in this order are an input unit 76 and an output unit 78 of the information signal reconverter 80. Connected continuously between.

図５に示された情報信号再構築装置は、エイリアシング解消を使用して、入力部７６で入力された情報信号の重複変換表記から情報信号を再構築するためのものである。つまり、情報信号再構築装置は、入力部７６で入力された情報信号の重複変換表記を使用して、出力部７８で、時間変動サンプリングレートで情報信号を出力するためのものである。情報信号の重複変換表記は、情報信号のうちの連続する重複時間領域（または時間間隔）のそれぞれに関して、それぞれの領域のウィンドウバージョンの変換形を含む。以下により詳細に説明するように、情報信号再構築装置８０は、情報信号９０の先行領域８４と後続領域８６の境界部８２で変化するサンプリングレートで情報信号を再構築するよう構成されている。 The information signal reconstructing apparatus shown in FIG. 5 is for reconstructing an information signal from overlapping conversion notation of the information signal input at the input unit 76 using aliasing cancellation. That is, the information signal reconstructing apparatus is for outputting the information signal at the time varying sampling rate at the output unit 78 using the overlapping conversion notation of the information signal input at the input unit 76. The overlapping conversion notation of the information signal includes a converted version of the window version of each region for each successive overlapping time region (or time interval) of the information signal. As will be described in more detail below, the information signal reconstruction device 80 is configured to reconstruct the information signal at a sampling rate that changes at the boundary 82 between the preceding region 84 and the subsequent region 86 of the information signal 90.

情報信号再構築装置８０の個々のモジュール７０〜７４の機能性を説明するために、入力部７６で入力された情報信号の重複変換表記は一定の時間／周波数分解能を有する、つまり時間と周波数に関して分解能は一定であると予め仮定する。以降、別の状況について説明する。 In order to explain the functionality of the individual modules 70 to 74 of the information signal reconstruction device 80, the duplicate conversion notation of the information signal input at the input unit 76 has a certain time / frequency resolution, ie in terms of time and frequency. Assume that the resolution is constant. Hereinafter, another situation will be described.

この仮定によると、重複変換表記は図５の９２であると考えられる。そこに示されているように、重複変換表記は、ある変換レートΔｔで時間的に連続する一連の変換形を含む。各変換形９４は、情報信号のそれぞれの時間領域ｉのウィンドウバージョンの変換形を表す。特に、表記９２のための周波数分解能は時間に関して一定であるので、各変換形９４は一定個数の変換係数Ｎ_kを含む。これは、表記９２は、図５に示されているようにス
ペクトル軸ｋに沿って厳密に並べられていてもよいＮ_k個のスペクトル成分またはサブバ
ンドを含む情報信号のスペクトログラムであることを、事実上示している。それぞれのスペクトル成分またはサブバンドにおいて、スペクトログラム内の変換係数は変換レートΔｔで発生する。 According to this assumption, the overlap conversion notation is considered to be 92 in FIG. As shown therein, the overlap conversion notation includes a series of conversion forms that are temporally continuous at a certain conversion rate Δt. Each conversion form 94 represents a conversion form of the window version of the respective time domain i of the information signal. In particular, since the frequency resolution for notation 92 is constant with respect to time, each conversion type 94 includes a fixed number of conversion coefficients N _k . This means that the notation 92 is a spectrogram of an information signal including N _k spectral components or subbands that may be strictly aligned along the spectral axis k as shown in FIG. Demonstrates virtually. For each spectral component or subband, the transform coefficients in the spectrogram occur at the transform rate Δt.

このような一定の時間／周波数分解能を有する重複変換表記９２は、例えば、図３Ａに示されているようなＱＭＦ解析フィルターバンクによって出力される。この場合、各変換係数は複素数値となる。つまり、各変換係数は例えば実部と虚部を有することになる。しかし、重複変換表記９２の変換係数は、必ずしも複素数値である必要はなく、純粋なＭＤＣＴの場合のように、もっぱら実数値であってもよい。これ以外にも、図５の実施形態はまた、時間領域の重複部分でエイリアシングが発生する他の重複変換表記（その変換形９４が重複変換表記９２内に連続的に配列されるもの）にも適用可能であることに留意すべきである。 Such a duplicate conversion notation 92 having a constant time / frequency resolution is output by, for example, a QMF analysis filter bank as shown in FIG. 3A. In this case, each transform coefficient is a complex value. That is, each conversion coefficient has, for example, a real part and an imaginary part. However, the conversion coefficient of the overlap conversion notation 92 is not necessarily a complex value, and may be a real value exclusively as in the case of pure MDCT. In addition to this, the embodiment of FIG. 5 is also applicable to other overlapped conversion notations in which aliasing occurs in overlapping portions of the time domain (the conversion forms 94 are continuously arranged in the overlapped conversion notation 92). It should be noted that it is applicable.

再変換装置７０は、各変換形９４に関して、連続時間領域８４，８６に対して各時間エンベロープ９６によって示される再変換形を得るために、変換形９４に対して再変換を行うよう構成されている。時間エンベロープは、一連の変換形９４を生成するために情報信号の前述のような時間領域に適用されるウィンドウに大体相当するものである。先行する時間領域８４に関して、図５では、再変換装置７０は、重複変換表記９２内の領域８４に関連する変換形９４全体に対して再変換を行ったと仮定している。領域８４の再変換形９６は、時間領域８４全体の時間的長さΔｔ・ａ（ａは連続する時間領域間の重複部を決定する係数）をサンプリングした例えばＮ_k個のサンプルまたはＮ_kの二倍の個数のサンプルを含む（表記９２の変換形９４はΔｔ・ａを単位として生成された）。つまり、いずれの場合も、各変換形９４を得る元となったウィンドウバージョンを作り上げたのと同じ個数のサンプルを含む。ここで、時間領域８４内の時間サンプルの個数と同一の個数（または二倍の個数）と、その時間領域８４に属する変換形９４内の変換係数の個数は、単に説明のために選択されたものであり、別の実施形態においては、同一（または二倍）は、使用される重複変換の詳細に応じて、両方の数値間の別の一定比に置き代えられてもよい。 Reconversion device 70 is configured to perform reconversion on conversion form 94 to obtain the reconversion form indicated by each time envelope 96 for continuous time regions 84 and 86 for each conversion form 94. Yes. The time envelope roughly corresponds to the window applied to the time domain as described above of the information signal to generate a series of transforms 94. With respect to the preceding time region 84, FIG. 5 assumes that the reconversion device 70 has reconverted the entire conversion form 94 associated with the region 84 in the overlap conversion notation 92. The retransformation type 96 of the region 84 is obtained by sampling, for example, N _k samples or N _k of the time length Δt · a (a is a coefficient that determines an overlap between successive time regions) of the entire time region 84. It contains twice as many samples (the conversion form 94 of the notation 92 was generated in units of Δt · a). That is, in each case, the same number of samples as the window version from which each conversion form 94 is obtained is included. Here, the same number (or twice the number) of time samples in the time domain 84 and the number of transform coefficients in the transform form 94 belonging to the time domain 84 are selected merely for explanation. In other embodiments, the same (or double) may be replaced with another constant ratio between both numbers, depending on the details of the overlapping transformation used.

ここで、情報信号再構築装置は時間領域８４と時間領域８６の間で情報信号のサンプリングレートを変更しようとしていることを前提としている。そうすることの動機は外部信号９８から生じる。例えば、情報信号再構築装置８０が図３Ａ、図４Ａの合成フィルターバンク４２を実施するために使用される場合、データストリームの転送条件の変更の場合のように、サンプリングレートの変更がより効率的なコーディングを約束する場合には必ず信号９８が与えられ得る。 Here, it is assumed that the information signal reconstruction apparatus is changing the sampling rate of the information signal between the time domain 84 and the time domain 86. The motivation to do so arises from the external signal 98. For example, if the information signal reconstructing device 80 is used to implement the synthesis filter bank 42 of FIGS. 3A and 4A, changing the sampling rate is more efficient, as in the case of changing the data stream transfer conditions. A signal 98 can be provided whenever a correct coding is promised.

本件の場合、情報信号再構築装置８０が時間領域８４と８６の間でサンプリングレートを下げようとしている前提は、説明のためである。従って、再変換装置７０はまた、後続領域８６の再変換形１００を得るために、この後続領域８６のウィンドウバージョンの変換形に対して再変換を行うが、この時、再変換装置７０はこの再変換を行うのに短い方の変換長を使用する。より正確には、再変換装置７０は、後続領域８６の変換に関してだけ、変換係数１…Ｎ_k’のうちの最も低い値Ｎ_k’への再変換を実行し、これにより得られた再変換形１００はより低いサンプリングレートを有することになる。つまり、再変換形１００は、Ｎ_k（またはＮ_kに相当する割合）の代わりに単にＮ_k’でサンプリングされるこ
とになる。 In this case, the assumption that the information signal reconstruction device 80 is going to lower the sampling rate between the time regions 84 and 86 is for explanation. Therefore, the re-converter 70 also performs re-conversion on the window-version conversion form of the succeeding area 86 in order to obtain the re-converted form 100 of the succeeding area 86. The shorter conversion length is used to perform the reconversion. More precisely, the reconversion device 70 performs the reconversion to the lowest value N _k ′ of the conversion coefficients 1... N _k ′ only for the conversion of the subsequent area 86 and the reconversion obtained thereby. Form 100 will have a lower sampling rate. That is, the reconversion type 100 is simply sampled at N _k ′ instead of N _k (or a ratio corresponding to N _k ).

図５に示されているように、再変換形９６と１００の間に以下のような問題が起こる。先行領域８４の再変換形９６と後続領域８６の再変換形１００は、これらの先行領域８４と後続領域８６との間の境界部８２でのエイリアシング解消部分１０２で重なる。エイリアシング解消部分の時間的な長さは例えば（ａ−１）・Δｔであるが、このエイリアシング解消部分１０２内の再変換形９６のサンプルの個数は、同じエイリアシング解消部分１０２内の再変換形１００のサンプルの個数とは異なる（この例では、再変換形１００のサンプルの個数よりも高くなる）。従って、この時間間隔１０２で両方の再変換形９６と１００を重複加算することによる時間的エイリアシング解消は、単純なことではない。 As shown in FIG. 5, the following problem occurs between the reconversion forms 96 and 100. The retransformation type 96 of the preceding region 84 and the reconversion type 100 of the subsequent region 86 overlap at the aliasing elimination portion 102 at the boundary 82 between the preceding region 84 and the subsequent region 86. The time length of the aliasing elimination portion is, for example, (a-1) · Δt. The number of samples of the reconversion type 96 in the aliasing elimination portion 102 is equal to the reconversion type 100 in the same aliasing elimination portion 102. (In this example, it is higher than the number of samples of the reconversion type 100). Thus, the elimination of temporal aliasing by overlapping and adding both reconversion forms 96 and 100 in this time interval 102 is not simple.

従って、リサンプラー７２は再構築装置７０と結合装置７４の間に接続され、結合装置７４は時間的エイリアシング解消を実行する。特に、リサンプラー７２は、エイリアシング解消部分１０２における先行領域８４の再変換形９６及び／または後続領域８６の再変換形１００を、境界部８２でのサンプリングレート変更に従い、補間により、リサンプリングするよう構成されている。再変換形９６が再変換形１００よりも早くリサンプラー７２の入力部に到達するので、リサンプラー７２は先行領域８４の再変換形９６に対するリサンプリングを行うことが好ましい。つまり、補間１０４により、エイリアシング解消部分１０２に含まれている再変換形９６の部分が、同じエイリアシング解消部分１０２内の再変換形１００のサンプリング条件またはサンプル位置に相当するように、リサンプリングされる。そして、その時間間隔１０２内で新しいサンプリングレートでの再構築信号９０を得るために、結合装置７４は、再変換形９６のリサンプルバージョンと再変換形１００の同一場所のサンプルを単純に加算するだけでもよい。この場合、出力再構築信号は、時間領域８６の最初の部分で前のサンプリングレートから新しいサンプリングレートに変換されたものとなる。しかし、再構築信号９０におけるサンプリングレート変更に間に合う別のポイント８２を得るために、補間はまた、時間間隔１０２の前半と後半とで違う方法で行われてもよい。このように、瞬間８２は図５では領域８４と８６の重複部分の中間に示されているが、それは単に説明のためであり、他の実施形態においては、この同じ時間的ポイントは、領域８６の最初の部分と領域８４の最後の部分との間のどちらも含む部分のどこかにあればよい。 Accordingly, the resampler 72 is connected between the reconstruction device 70 and the coupling device 74, and the coupling device 74 performs temporal aliasing cancellation. In particular, the resampler 72 resamples the retransformation type 96 of the preceding region 84 and / or the reconversion type 100 of the subsequent region 86 in the aliasing elimination portion 102 by interpolation according to the change in the sampling rate at the boundary 82. It is configured. Since the reconverter 96 reaches the input of the resampler 72 earlier than the reconverter 100, the resampler 72 preferably performs resampling on the reconverter 96 in the preceding region 84. That is, by the interpolation 104, the part of the reconversion type 96 included in the aliasing elimination part 102 is resampled so as to correspond to the sampling condition or the sample position of the reconversion type 100 in the same aliasing elimination part 102. . Then, to obtain the reconstructed signal 90 at the new sampling rate within that time interval 102, the combiner 74 simply adds the resampled version of the reconversion 96 and the co-located sample of the reconversion 100. Just be fine. In this case, the output reconstructed signal is the first part of the time domain 86 converted from the previous sampling rate to the new sampling rate. However, in order to obtain another point 82 in time for the sampling rate change in the reconstructed signal 90, the interpolation may also be performed differently in the first half and second half of the time interval 102. Thus, while the instant 82 is shown in FIG. 5 in the middle of the overlap of regions 84 and 86, it is merely illustrative, and in other embodiments, this same temporal point is the region 86. Between the first part and the last part of the region 84 may be anywhere in the part.

従って、結合装置７４は、エイリアシング解消部分１０２でのリサンプリングによって得られた先行領域８４と後続領域８６それぞれの再変換形９６と１００の間でのエイリアシング解消を実行することができる。より正確には、エイリアシング解消部分１０２でエイリアシングを解消するために、結合装置７４は、リサンプラー７２によって得られたリサンプルバージョンを使用して、部分１０２内の再変換形９６と１００の重複加算処理を行う。情報信号９０のサンプリングレートが時間ポイント８２で高いサンプリングレートから低いサンプリングレートに変化しても、この重複加算処理により、変換形９４を生成するためのウィンドウ処理に沿って、境界部８２を渡っても、エイリアシングフリーで連続的に再構築された情報信号９０を出力部７８で出力することができる。 Accordingly, the combiner 74 can perform aliasing cancellation between the retransformation forms 96 and 100 of the preceding region 84 and the subsequent region 86 obtained by resampling in the aliasing cancellation portion 102, respectively. More precisely, in order to eliminate aliasing in the aliasing elimination part 102, the combiner 74 uses the resampled version obtained by the resampler 72 to overlap the retransformed forms 96 and 100 in the part 102. Process. Even if the sampling rate of the information signal 90 changes from a high sampling rate to a low sampling rate at the time point 82, this overlap addition process crosses the boundary 82 along the window process for generating the conversion form 94. In addition, the information signal 90 reconstructed continuously in an aliasing-free manner can be output from the output unit 78.

このように、図５に関する上述の説明から分かるように、先行時間領域８４のウィンドウバージョンの変換形９４に対して行われた再変換の変換長の、その先行領域８４の時間的長さに対する比は、後続時間領域８６のウィンドウバージョンに対して行われた再変換の変換長の、その後続領域８６の時間的長さに対する比とは、これらの領域８４と８６との間の境界部８２でのサンプリングレート変更に相当する係数分だけ異なっている。上述した例では、この比の変化は外部信号９８によって引き起こされたものである。先行領域８４と後続領域８６の時間的長さは互いに同じであり、再変換装置７０は、後続領域８６のウィンドウバージョンの変換形９４に対する再変換の適用を、例えばＮ_k’番目の変換
係数までの低周波部分に制限するよう構成されたものであるという前提で、説明してきた。もちろん、このような処理は、先行領域８４のウィンドウバージョンの変更結果９４に対しても可能である。さらに、上述の説明とは対照的に、境界部８２でのサンプリングレート変更は逆方向でも可能であり、従って、後続領域８６に関しては何の取得も行われず、先行領域８４のウィンドウバージョンの変換形９４に関してだけ処理が行われてもよい。 Thus, as can be seen from the above description with respect to FIG. 5, the ratio of the transform length of the retransformation performed on the window version transform form 94 of the preceding time region 84 to the temporal length of the preceding region 84. Is the ratio of the transformation length of the retransformation performed on the window version of the subsequent time region 86 to the temporal length of the subsequent region 86 at the boundary 82 between these regions 84 and 86. Are different by a factor corresponding to the change in sampling rate. In the example described above, this change in ratio is caused by the external signal 98. The temporal lengths of the preceding area 84 and the succeeding area 86 are the same, and the reconversion apparatus 70 applies reconversion to the window 94 conversion form 94 of the subsequent area 86, for example, up to the N _k 'th conversion coefficient. It has been described on the assumption that it is configured so as to be limited to the low frequency part. Of course, such processing is also possible for the window version change result 94 in the preceding area 84. Further, in contrast to the above description, the sampling rate change at the boundary 82 is possible in the reverse direction, so that no acquisition is performed for the succeeding area 86, and the conversion of the window version of the preceding area 84 is performed. Processing may be performed only for 94.

より正確には、ここまで、情報信号の領域のウィンドウバージョンの変換形９４の変換長と情報信号の領域の時間的長さが一定である場合、つまり、重複変換表記９２は一定の時間／周波数分解能を有するスペクトログラムである場合に対する図５の情報信号再構築装置の動作モードを説明してきた。境界部８２の位置設定の際に、情報信号再構築装置８０は、一例として制御信号９８に反応するものとして説明した。 More precisely, so far, when the transform length of the window version transform form 94 of the information signal region and the time length of the information signal region are constant, that is, the overlap transform notation 92 has a constant time / frequency. The operation mode of the information signal reconstruction apparatus of FIG. 5 for the case of a spectrogram having resolution has been described. The information signal reconstruction device 80 has been described as responding to the control signal 98 as an example when setting the position of the boundary portion 82.

従って、この構成において、図５の情報信号再構築装置８０は図３Ａのリサンプラー１４の一部となり得る。換言すれば、図３Ａのリサンプラー１４は、情報信号の重複変換表記を出力するフィルターバンク３８と、今まで説明してきたような情報信号の重複変換表記からエイリアシング解消を使用して情報信号を再構築するよう構成された情報信号再構築装置８０を含む逆フィルターバンクとの連結から成る。従って、例えば、図５の再変換装置７０はＱＭＦ合成フィルターバンクとして構成することができ、フィルターバンク３８はＱＭＦ解析フィルターバンクとして実施することができる。 Therefore, in this configuration, the information signal reconstruction device 80 of FIG. 5 can be a part of the resampler 14 of FIG. 3A. In other words, the resampler 14 in FIG. 3A regenerates the information signal using the filter bank 38 that outputs the duplicate conversion notation of the information signal and the alias conversion cancellation from the duplicate conversion notation of the information signal as described above. Consists of a connection with an inverse filter bank including an information signal reconstruction device 80 configured to be constructed. Thus, for example, the reconversion device 70 of FIG. 5 can be configured as a QMF synthesis filter bank, and the filter bank 38 can be implemented as a QMF analysis filter bank.

図１Ａ〜４Ａの説明から明らかなように、情報信号エンコーダは、コアエンコーダ１６または集隗コアエンコーダ１６のような圧縮ステージとパラメトリックエンベロープコーダー５４に加えて、このようなリサンプラーを含み得る。圧縮ステージは再構築情報信号を圧縮するよう構成されている。図１Ａ〜４Ａに示されているように、このような情報信号エンコーダは、例えば可能転送ビットレートに関する外部情報に応じて制御信号９８を制御するよう構成されたサンプリングレートコントローラをさらに含み得る。 As is apparent from the description of FIGS. 1A-4A, the information signal encoder may include such a resampler in addition to a compression stage and parametric envelope coder 54 such as the core encoder 16 or the concentrated core encoder 16. The compression stage is configured to compress the reconstructed information signal. As shown in FIGS. 1A-4A, such an information signal encoder may further include a sampling rate controller configured to control the control signal 98 in response to, for example, external information regarding possible transfer bit rates.

しかし別の例では、図５の情報信号再構築装置は、重複変換表記内で情報信号の領域のウィンドウバージョンの変換長の変化を検出することにより、領域８２の位置を特定するよう構成可能である。この可能な実施例をより明確にするために、入力された重複変換表記の一例が示されている図５の９２’を参照して下さい。それによると、表記９２’内の連続する変換形９４は一定の変換レートΔｔで再変換装置７０に到着するが、それぞれの変換形の変換長は変化している。図５において、例えば、先行時間領域８４のウィンドウバージョンの変換形の変換長（Ｎ_k）は、後続領域８６のウィンドウバージョンの変換形
の変換長（Ｎｋ’）よりも大きいと仮定する。ともかく、再変換装置７０は入力データストリームから重複変換表記９２’に関する情報をパースし、それに従い、再変換装置７０は、情報信号の連続領域のウィンドウバージョンの変換形に対して行われる再変換の変換長を、重複変換表記９２’の連続する変換形の変換長に適合させてもよい。従って、再変換装置７０は先行時間領域８４のウィンドウバージョンの変換形９４の再変換のために変換長Ｎ_kを使用し、後続時間領域８６のウィンドウバージョンの変換形の再変換のために
変換長Ｎ_k’を使用してもよい。これにより、前述し、図５の上部中央に示されているよ
うな再変換形の間のサンプリングレートの違いが生じる。従って、図５の情報信号再構築装置８０の動作モードに関して、この動作モードは、再変換の変換長を重複変換表記９２’内の変換形の変換長に適合させる際の今述べたような違いに加えて、上記説明と一致する。 In another example, however, the information signal reconstruction device of FIG. 5 can be configured to identify the location of the region 82 by detecting a change in the transform length of the window version of the region of the information signal within the overlap transform notation. is there. To make this possible embodiment more clear, see 92 'in Figure 5 where an example of the input duplicate conversion notation is shown. According to this, the continuous conversion forms 94 in the notation 92 ′ arrive at the reconversion device 70 at a constant conversion rate Δt, but the conversion length of each conversion form changes. In FIG. 5, for example, it is assumed that the conversion length (N _k ) of the conversion type of the window version in the preceding time region 84 is larger than the conversion length (Nk ′) of the conversion type of the window version in the subsequent region 86. In any case, the reconverter 70 parses information about the duplicate conversion notation 92 'from the input data stream, and accordingly the reconverter 70 performs the reconversion performed on the converted version of the window version of the continuous region of the information signal. The transform length may be adapted to the transform length of the continuous transform form of the duplicate transform notation 92 ′. Accordingly, the reconversion device 70 uses the conversion length N _k for the reconversion of the window version conversion form 94 in the preceding time region 84 and the conversion length for the reconversion of the window version conversion form in the subsequent time region 86. N _k ′ may be used. This causes a difference in sampling rate between the reconversion types as described above and shown in the upper center of FIG. Therefore, with respect to the operation mode of the information signal reconstruction apparatus 80 in FIG. 5, this operation mode is different from the one just described in adapting the conversion length of the reconversion to the conversion length of the conversion type in the overlap conversion notation 92 ′. In addition to the above description.

このように、後者の機能性に従えば、情報信号再構築装置は外部制御信号９８に反応する必要はない。むしろ、サンプリングレート変更時点に関する情報を情報信号再構築装置に通知するには、入力されてくる重複変換表記９２’で十分である。 Thus, according to the latter functionality, the information signal reconstructing device need not react to the external control signal 98. Rather, the input duplicate conversion notation 92 'is sufficient to notify the information signal reconstruction device of information relating to the sampling rate change point.

今説明したように動作する情報信号再構築装置８０は、図２Ｂの再変換装置３６を形成するために使用できる。つまり、情報信号デコーダは、データストリームから情報信号の重複変換表記９２’を再構築するよう構成された展開装置３４を含んでいてもよい。前述したように、この再構築はエントロピーデコーディングを伴う。変換形９４の時間変動変換長は、展開装置３４に入力されるデータストリーム内で適切な方法で信号伝達できる。図５に示されているような情報信号再構築装置は再構築装置３６として使用できる。図５の情報信号再構築装置は、展開装置によって与えられたような重複変換表記から、エイリアシング解消を使用して情報信号を再構築するよう構成できる。後者の場合、再変換装置７０は、再変換を実行するために、例えばＩＭＤＣＴを使用することもでき、変換形９４は複素数値係数よりもむしろ実数値係数によって表される。 An information signal reconstruction device 80 that operates as just described can be used to form the reconversion device 36 of FIG. 2B. That is, the information signal decoder may include a decompressor 34 that is configured to reconstruct the duplicate conversion representation 92 'of the information signal from the data stream. As described above, this reconstruction involves entropy decoding. The time variation conversion length of the conversion form 94 can be signaled in an appropriate manner within the data stream input to the decompressor 34. An information signal reconstruction device as shown in FIG. 5 can be used as the reconstruction device 36. The information signal reconstruction device of FIG. 5 can be configured to reconstruct the information signal using aliasing cancellation from the duplicate conversion notation as provided by the decompression device. In the latter case, the reconversion device 70 can also use, for example, IMDCT to perform the reconversion, and the transform form 94 is represented by real-valued coefficients rather than complex-valued coefficients.

このように、上記の実施形態によると多くの利点が達成できる。例えば毎秒８ｋｂから毎秒１２８ｋｂに渡るような広い範囲の様々なビットレートで動作するオーディオコーデックに関して、最適なサンプリングレートは、図４Ａ，４Ｂに関して上述したように、ビットレートに依存する場合もある。低いビットレートでは、低周波だけが、例えばＡＣＥＬＰや変換コーディングのような、より正確なコーディング方法で符号化されるべきであり、高周波はパラメトリック方法で符号化されるべきである。高いビットレートでは、スペクトル域全体が例えば正確な方法で符号化される。これは、例えば、これらの正確な方法は常に最適な表記で信号を符号化すべきであることを意味している。これらの信号のサンプリングレートは、ナイキスト原理に準じた最も関連性のある信号周波数成分の変換が可能となるよう、最適化されるべきである。ここで示されているサンプリングレートコントローラ１２０は、情報信号がコアエンコーダ１６に送られる際のサンプリングビットレートを、可能転送ビットレートに応じて制御するよう構成され得る。これは、解析フィルターバンクのスペクトルの低周波部分だけをコアエンコーダ１６に送ることを意味している。残りの高周波部分はパラメトリックエンベロープコーダー５４に送られる。上述したように、サンプリングレートの時間変動と転送ビットレートは問題ではない。 Thus, many advantages can be achieved according to the above embodiment. For audio codecs that operate at a wide range of bit rates, for example ranging from 8 kb per second to 128 kb per second, the optimal sampling rate may depend on the bit rate, as described above with respect to FIGS. 4A and 4B. At low bit rates, only low frequencies should be encoded with more accurate coding methods, such as ACELP and transform coding, and high frequencies should be encoded with parametric methods. At high bit rates, the entire spectral range is encoded, for example, in an accurate manner. This means, for example, that these exact methods should always encode the signal with an optimal notation. The sampling rate of these signals should be optimized so that the most relevant signal frequency components can be converted according to the Nyquist principle. The sampling rate controller 120 shown here may be configured to control the sampling bit rate at which the information signal is sent to the core encoder 16 according to the possible transfer bit rate. This means that only the low frequency part of the spectrum of the analysis filter bank is sent to the core encoder 16. The remaining high frequency part is sent to the parametric envelope coder 54. As described above, the sampling rate temporal variation and the transfer bit rate are not a problem.

図５の説明は、サンプリングレート変更時に時間的エイリアシング解消の問題に対処するために使用できる情報信号再構築装置に関するものである。また、図１Ａ〜図４Ｂに関して前述したように、変換器が重複変換表記を生成し、そして図５の情報信号再構築装置にそれを送る図１Ａ〜４Ｂの装置内で、連続するモジュール間のインターフェースで何らかの対策が行われなければならない。 The description of FIG. 5 relates to an information signal reconstruction device that can be used to address the problem of resolution of temporal aliasing when changing the sampling rate. Also, as described above with respect to FIGS. 1A-4B, between the successive modules in the apparatus of FIGS. 1A-4B, where the converter generates a duplicate conversion notation and sends it to the information signal reconstruction apparatus of FIG. Some measures must be taken at the interface.

図６は情報信号変換装置のこのような一実施形態を示す。図６の情報信号変換装置は、一連のサンプルという形態で情報信号を受け取る入力部１０５と、情報信号の連続重複領域を取得するよう構成された取込み器１０６と、各連続重複領域が一定のサンプリングレートを有するように（しかし連続重複領域の間ではサンプリングレートは異なっている）、連続重複領域の少なくとも一部に対してリサンプリングを行うよう構成されたリサンプラー１０７と、連続重複領域に対してウィンドウ処理を行うよう構成されたウィンドウ処理部１０８と、図６の情報信号変換器の出力部１１０で出力される重複変換表記９２’を構成する一連の変換形９４を得るために、ウィンドウ処理された部分に対して個々に変換を行うよう構成された変換器を含む。ウィンドウ処理部１０８はハフマンウィンドウ等を使用してもよい。 FIG. 6 shows such an embodiment of the information signal converter. The information signal conversion apparatus of FIG. 6 includes an input unit 105 that receives an information signal in the form of a series of samples, a capture unit 106 configured to acquire a continuous overlap region of the information signal, and a sampling in which each continuous overlap region is constant. A resampler 107 configured to perform resampling on at least a portion of the continuous overlap region to have a rate (but the sampling rate is different between the continuous overlap regions), and In order to obtain a series of conversion forms 94 constituting the overlap conversion notation 92 ′ output from the window processing unit 108 configured to perform window processing and the output unit 110 of the information signal converter of FIG. Including a converter configured to individually convert each portion. The window processing unit 108 may use a Huffman window or the like.

取込み器１０６は、情報信号の連続重複領域が同じ時間的長さを有するように、例えばそれぞれ２０ｍｓとなるように、取込みを行う。 The take-in device 106 takes in so that, for example, the continuous overlap regions of information signals have the same time length, for example, 20 ms each.

取込み器１０６は一連の情報信号部分をリサンプラー１０７に送る。入力情報信号が所定の瞬間に第１のサンプリングレートから第２のサンプリングレートに変わる時間変動サンプリングレートであると仮定すると、例えば、リサンプラー１０７は、図６の１１１で示されているように、サンプリングレートが第１のサンプリングレートから第２のサンプリングレートに一度変化するように時間的に所定の時点を含む入力されてくる情報信号部分を補間によってリサンプルするよう構成されていてもよい。これをより明確にするために、図６は、サンプリングレートが瞬間１１３で変わる一連のサンプル１１２を説明的に示しており、一例として、一定の時間的長さを有する領域１１４ａ〜１１４ｄが一定の領域オフセット１１５Δｔで取り込まれる。この領域オフセット１１５Δｔは、一定の領域時間的長さと共に、連続領域１１４ａ〜１１４ｄの間の所定の重複部分を例えば連続する二つの領域ごとに５０％の重複となるように規定する。しかし、これは単に一例にすぎない。この瞬間１１３より前の第１のサンプリングレートはδｔ₁で示され、この瞬間１１
３より後のサンプリングレートはδｔ₂で示されている。１１１で示されているように、リサンプラー１０７は、例えば、領域１１４ｂを一定のサンプリングレートδｔ₁を有するようリサンプリングするが、時間的後続領域１１４ｃに対しては、一定のサンプリングレートδｔ₂を有するようにリサンプリングするよう構成されていてもよい。原則的に、リサンプラー１０７が、時間的に瞬間１１３を含むそれぞれの領域１１４ｂ，１１４ｃの一部分を補間によってリサンプリングすれば十分であり、それがまだ目標サンプリングレートでなくても構わない。例えば領域１１４ｂの場合、リサンプラー１０７が、領域１１４ｂの時間的に瞬間１１３より後の部分をリサンプリングし、１１４ｃの場合には、瞬間１１３より前の部分だけをリサンプリングすれば十分である。その場合、取り込まれた領域１１４ａ〜１１４ｄは一定の時間的長さであるので、リサンプリングされた各領域は、それぞれの一定サンプリングレートδｔ₁，δｔ₂に対応した個数の時間サンプルＮ₁，Ｎ₂を有する。ウィンドウ処理部１０８は、そのウィンドウまたはウィンドウ長さを各入力部でのこのサンプルの個数に適合させてもよい。同じことが変換器１０９にも当てはまり、変換器１０９もその変換長または変換を同じように適合させてもよい。つまり、図６の１１１で示されている例では、出力部１１０での重複変換表記は一連の変換形から成り、変換形の長さはそれぞれ異なり、連続領域のサンプルの個数に対して一次従属的に、つまりそれぞれの領域に対して行ったリサンプリングの際のサンプリングレートに対して一次従属的に増加減少する。 The grabber 106 sends a series of information signal portions to the resampler 107. Assuming that the input information signal has a time-varying sampling rate that changes from the first sampling rate to the second sampling rate at a given moment, for example, the resampler 107 is as shown at 111 in FIG. The input information signal portion including a predetermined time point in time may be resampled by interpolation so that the sampling rate changes once from the first sampling rate to the second sampling rate. To make this clearer, FIG. 6 illustratively shows a series of samples 112 where the sampling rate changes at the instant 113, and as an example, regions 114a-114d having a constant time length are constant. Captured at region offset 115Δt. This region offset 115Δt defines a predetermined overlapping portion between the continuous regions 114a to 114d with a certain region temporal length so that, for example, two consecutive regions have 50% overlap. However, this is just an example. The first sampling rate prior to this instant 113 is denoted by δt ₁ and this instant 11
The sampling rate after 3 is denoted by δt ₂ . As shown at 111, the resampler 107, for example, resamples the region 114b to have a constant sampling rate δt ₁ , but for the temporal successor region 114c, it has a constant sampling rate δt ₂ . You may be comprised so that it may resample. In principle, it is sufficient for the resampler 107 to resample a part of each region 114b, 114c including the instant 113 in time by interpolation, and it does not have to be the target sampling rate yet. For example, in the case of the region 114b, it is sufficient for the resampler 107 to resample the portion of the region 114b after the instant 113 in time, and in the case of 114c, it is sufficient to resample only the portion before the instant 113. In that case, since the captured regions 114a to 114d have a certain time length, each of the resampled regions has a number of time samples N ₁ and N corresponding to the respective constant sampling rates δt ₁ and δt _2. _{Has two} . The window processing unit 108 may adapt the window or window length to the number of samples in each input unit. The same applies to the converter 109, which may also adapt its conversion length or conversion in the same way. That is, in the example shown by 111 in FIG. 6, the overlapped conversion notation in the output unit 110 is composed of a series of conversion forms, each of which has a different length, and is linearly dependent on the number of samples in the continuous region. In other words, it increases and decreases in a first-order manner with respect to the sampling rate at the time of resampling performed for each region.

リサンプラー１０７は、それぞれの連続領域１１４ａ〜１１４ｄ内のリサンプリングされるべきサンプル個数が最小となるように、これらの連続領域１１４ａ〜１１４ｄの間のサンプリングレート変更を記録するよう構成されていてもよい。あるいは、リサンプラー１０７はこれとは異なるように構成されていてもよい。例えば、リサンプラー１０７はダウンサンプリングよりもアップサンプリングを選択するまたはその逆であるように構成されていてもよく、つまり、瞬間１１３と重なる全ての領域が第１のサンプリングレートδｔ₁または第２のサンプリングレートδｔ₂でリサンプリングされるように、リサンプリングを実行するよう構成されていてもよい。 The resampler 107 may be configured to record the sampling rate change between these continuous regions 114a-114d so that the number of samples to be resampled in each continuous region 114a-114d is minimized. Good. Alternatively, the resampler 107 may be configured differently. For example, the resampler 107 may be configured to select upsampling rather than downsampling, or vice versa, that is, all regions that overlap the instant 113 are either the _first sampling rate δt ₁ or the second sampling rate Re-sampling may be performed so that re-sampling is performed at the sampling rate δt ₂ .

図６の情報信号変換装置は、例えば図２Ａの変換装置３０を実施するのに使用してもよい。その場合、例えば変換器１０９はＭＤＣＴを実行するよう構成されていてもよい。 The information signal converter of FIG. 6 may be used, for example, to implement the converter 30 of FIG. 2A. In that case, for example, the converter 109 may be configured to perform MDCT.

これに関して、変換器１０９によって行われる変換の変換長は、リサンプリングされたサンプルの個数で測定した領域１１４ｃのサイズよりも大きくてもよいことに留意すべきである。その場合、ウィンドウ処理部１０８から出力されたウィンドウ領域を超える変換長の部分は、変換器１０９による変換を行う前に０にセットされてもよい。 In this regard, it should be noted that the transform length of the transform performed by the converter 109 may be larger than the size of the region 114c measured by the number of resampled samples. In that case, the part of the conversion length exceeding the window area output from the window processing unit 108 may be set to 0 before conversion by the converter 109.

図５の補間１０４と図６のリサンプラー１０７内での補間を実現するための可能な実施例を詳細に説明する前に、図１Ａ，１Ｂのエンコーダとデコーダの可能な実施形態を示す図７Ａ，７Ｂを参照して下さい。特に、リサンプラー１４，２４は図３Ａ，３Ｂに示されているように実施されているが、コアエンコーダ１６とコアデコーダ２２は、それぞれ、ＭＤＣＴに基づく変換コーディングとＡＣＥＬＰコーディングのようなＣＥＬＰコーディングとの間で切り換え可能なコーデックとして実施されている。ＭＤＣＴに基づくコーディング／デコーディングブランチ１２２，１２４は、それぞれ、例えばＴＣＸエンコーダとＴＣＸデコーダであってもよい。あるいは、ＡＡＣコーダー／デコーダ対が使用されてもよい。ＣＥＬＰコーディングのために、ＡＣＥＬＰエンコーダ１２６がコアエンコーダ１６の他方のコーディングブランチとなり、ＡＣＥＬＰデコーダ１２８がコアデコーダ２２の他方のデコーディングブランチとなっていてもよい。これら両方のコーディングブランチ間での切り換えは、これらのコーディングモジュールの詳細についてその標準テキストに記載しているＵＳＡＣ［２］またはＡＭＲ−ＷＢ＋［１］の場合のように、フレーム毎に行われ得る。 Before describing in detail a possible implementation for implementing interpolation in the interpolator 104 of FIG. 5 and the resampler 107 of FIG. 6, FIG. 7A shows a possible embodiment of the encoder and decoder of FIGS. 1A and 1B. Refer to 7B. In particular, the resamplers 14 and 24 are implemented as shown in FIGS. 3A and 3B, but the core encoder 16 and the core decoder 22 are respectively converted into transform coding based on MDCT and CELP coding such as ACELP coding. It is implemented as a codec that can be switched between. The MDCT-based coding / decoding branches 122 and 124 may be, for example, a TCX encoder and a TCX decoder, respectively. Alternatively, an AAC coder / decoder pair may be used. For CELP coding, the ACELP encoder 126 may be the other coding branch of the core encoder 16, and the ACELP decoder 128 may be the other decoding branch of the core decoder 22. Switching between both of these coding branches can be done on a frame-by-frame basis, as in the case of USAC [2] or AMR-WB + [1], whose details are described in their standard text.

図７Ａ，７Ｂのエンコーダとデコーダをさらに詳しい具体例として考え、コーディングブランチ１２２，１２６への入力とデコーディングブランチ１２４，１２８による再構築のために内部サンプリングレートの切り換えを可能にするスキームを、以下に詳細に説明する。特に、入力部１２での入力信号の入力は、例えば３２ｋＨｚという一定のサンプリングレートであってもよい。この信号は、上述のような方法でＱＭＦ解析／合成フィルターバンク対３８，４２を使用して、すなわち、帯域数に関して１．２５または２．５というような適切な解析及び合成比でリサンプリングされてもよく、これは、例えば２５．６ｋＨｚまたは１２．８ｋＨｚの専用サンプリングレートを有するコアデコーダ１６に入力してくる内部時間信号となる。そして、ダウンサンプリングされた信号は、コーディングブランチのうちのコーディングモードに応じたものを使用して符号化される。コーディングブランチ１２２では、ＭＤＣＴ表記及び標準的な変換コーディングスキームを使用して符号化され、または、コーディングブランチ１２６ではＡＣＥＬＰを使用して時間領域で符号化される。コアエンコーダ１６のコーディングブランチ１２６，１２２によってこのように生成されたデータストリームは出力され、デコーディング側に送られ、そこで再構築される。 Considering the encoder and decoder of FIGS. 7A and 7B as a more specific example, a scheme that allows switching of the internal sampling rate for input to the coding branches 122 and 126 and reconstruction by the decoding branches 124 and 128 is as follows: Will be described in detail. In particular, the input signal at the input unit 12 may be input at a constant sampling rate of 32 kHz, for example. This signal is resampled using the QMF analysis / synthesis filter bank pair 38, 42 in the manner described above, ie, with an appropriate analysis and synthesis ratio such as 1.25 or 2.5 for the number of bands. This may be an internal time signal that is input to the core decoder 16 having a dedicated sampling rate of, for example, 25.6 kHz or 12.8 kHz. The downsampled signal is encoded using a signal corresponding to the coding mode in the coding branch. The coding branch 122 is encoded using MDCT notation and a standard transform coding scheme, or the coding branch 126 is encoded in the time domain using ACELP. The data stream thus generated by the coding branches 126, 122 of the core encoder 16 is output and sent to the decoding side where it is reconstructed.

内部サンプリングレートを切り換えるために、フィルターバンク３８，４４は、コアエンコーダ１６とコアデコーダ２２が動作するであろう内部サンプリングレートに従い、フレーム毎に適合されなければならない。図８は考えられるいくつかの切り換え場面を示しているが、ここでは、単にエンコーダとデコーダのＭＤＣＴコーディングの道筋を示しているだけである。 In order to switch the internal sampling rate, the filter banks 38, 44 must be adapted from frame to frame according to the internal sampling rate at which the core encoder 16 and core decoder 22 will operate. FIG. 8 shows some possible switching situations, but here only shows the MDCT coding path of the encoder and decoder.

特に、図８は、３２ｋＨｚであると想定されている入力サンプリングレートが２５．６ｋＨｚ、１２．８ｋＨｚ、８ｋＨｚのいずれかにダウンサンプリングされるか、その入力サンプリングレートが維持される可能性があることを示している。入力サンプリングレートと内部サンプリングレートとの間の選択されたサンプリングレート比に応じて、フィルターバンク解析とフィルターバンク合成との間の変換長の比が決まる。これらの比は図８の影付きの部分（フィルターバンク３８，４４では、選択された内部サンプリングレートとは関係なく、それぞれ４０個のサブバンド、フィルターバンク４２，４０では、選択された内部サンプリングレートに応じて、それぞれ４０個、３２個、１６個または１０個のサブバンド）から導き出すことができる。コアエンコーダ内で使用されるＭＤＣＴの変換長はこのように決定された内部サンプリングレートに適合され、結果的に変換レートまたは変換ピッチ時間間隔が一定または選択された内部サンプリングレートとは無関係となるように適合される。これは例えば常に２０ｍｓであってもよく、その結果、選択された内部サンプリングレートに応じて、それぞれ６４０、５１２、２５６、１６０の変換長となる。 In particular, FIG. 8 shows that the input sampling rate assumed to be 32 kHz may be downsampled to either 25.6 kHz, 12.8 kHz, or 8 kHz, or the input sampling rate may be maintained. Is shown. Depending on the selected sampling rate ratio between the input sampling rate and the internal sampling rate, the ratio of transform lengths between filter bank analysis and filter bank synthesis is determined. These ratios are shaded in FIG. 8 (in the filter banks 38 and 44, regardless of the internal sampling rate selected, each of the 40 subbands and in the filter banks 42 and 40, the selected internal sampling rate). Respectively, 40, 32, 16 or 10 subbands). The conversion length of the MDCT used in the core encoder is adapted to the internal sampling rate determined in this way, so that the conversion rate or the conversion pitch time interval is constant or independent of the selected internal sampling rate. Is adapted to. This may for example always be 20 ms, resulting in a conversion length of 640, 512, 256, 160, respectively, depending on the selected internal sampling rate.

上述のような原理を使用して、フィルターバンク切り換えに関する以下の規制に従い、内部サンプリングレートを切り換えることができる。
−切り換えの間にいかなる遅延も追加されない。
−この切り換えつまりサンプリングレート変更は瞬時に行われる。
−切り換えアーチファクトは最低限に抑えられるかまたは少なくとも低減される。
−計算量が小さい。 Using the principle as described above, the internal sampling rate can be switched according to the following regulations regarding filter bank switching.
-No delay is added during switching.
-This switching or sampling rate change is instantaneous.
-Switching artifacts are minimized or at least reduced.
-The amount of calculation is small.

基本的に、フィルターバンク３８〜４４とコアコーダー内のＭＤＣＴは、フィルターバンクにおいて、コアエンコーダとデコーダのＭＤＣＴと比較して、ウィンドウ領域の重複度が高くてもよい重複変換である。例えば、フィルターバンクにおいて１０回の重複が適用されてもよく、ＭＤＣＴ１２２，１２４において２回の重複が適用されてもよい。重複変換のために、ステートバッファが、解析フィルターバンクとＭＤＣＴのための解析ウィンドウバッファとして、また合成フィルターバンクとＩＭＤＣＴのための重複加算バッファとして説明できる。レート切り換えの際に、これらのステートバッファは、図５，６に関して上述したような方法で、サンプリングレートの切り換えに応じて調整されるべきである。以下に、図５に関して説明した合成側よりもむしろ、図６に関して説明した解析側でも実行され得る補間に関して、以下に詳細に説明する。重複変換のプロトタイプまたはウィンドウが適合されてもよい。切り換えアーチファクトを低減するには、重複変換部のエイリアシング解消特性を保持するためにステートバッファ内の信号成分を保存すべきである。 Basically, the MDCTs in the filter banks 38 to 44 and the core coder are overlapping transforms in which the overlapping degree of the window region may be higher than that of the core encoder and the decoder in the filter bank. For example, 10 overlaps may be applied in the filter bank, and 2 overlaps may be applied in the MDCTs 122 and 124. For duplicate conversion, the state buffer can be described as an analysis window buffer for the analysis filter bank and MDCT, and as a duplicate addition buffer for the synthesis filter bank and IMDCT. Upon rate switching, these state buffers should be adjusted in response to sampling rate switching in the manner described above with respect to FIGS. In the following, interpolation will be described in detail below that can be performed on the analysis side described with respect to FIG. 6, rather than the synthesis side described with respect to FIG. A duplicate transform prototype or window may be adapted. In order to reduce switching artifacts, the signal components in the state buffer should be preserved in order to preserve the aliasing elimination characteristics of the duplicate transform.

以下に、リサンプラー７２内での補間１０４の実行方法について詳細に説明する。 Hereinafter, a method for executing the interpolation 104 in the resampler 72 will be described in detail.

以下のように２種類の場合に区分できる。
１）スイッチアップは、先行時間部分８４から後続時間部分８６へサンプリングレートが増加される処理である。
２）スイッチダウンは、先行時間部分８４から後続時間部分８６へサンプリングレートが減少される処理である。 It can be divided into two cases as follows.
1) Switch-up is a process in which the sampling rate is increased from the preceding time portion 84 to the succeeding time portion 86.
2) Switch down is a process in which the sampling rate is reduced from the preceding time portion 84 to the following time portion 86.

例えば１２．８ｋＨｚ（２０ｍｓごとに２５６個のサンプル）から３２ｋＨｚ（２０ｍｓごとに６４０個のサンプル）へのようなスイッチアップを想定すると、図５に参照符号１３０で示されているようなリサンプラー７２のステートバッファまたはその容量は、サンプリングレート変更に相当する係数（上述の例では２．５）分だけ拡張される必要がある。追加遅延を発生させない拡張のための可能な方法は、例えば、線形補間またはスプライン補間である。つまり、リサンプラー７２は、先行時間領域８４に関する再変換形９６の後部の（時間間隔１０２に存在するような）サンプルを、ステートバッファ１３０内ですぐに補間してもよい。ステートバッファは、図５に示されているように、先入れ先出しバッファとして機能してもよい。当然、完全なエイリアシング解消のために必要な全ての周波数成分がこの処理によって得られるわけではないが、例えば０〜６．４ｋＨｚのような少なくとも低周波域が何の歪みもなく生成可能であり、これらの周波数は心理音響的な点で最も関連深いものである。 For example, assuming a switch-up from 12.8 kHz (256 samples every 20 ms) to 32 kHz (640 samples every 20 ms), a resampler 72 as shown at 130 in FIG. The state buffer or its capacity needs to be expanded by a factor (2.5 in the above example) corresponding to the sampling rate change. Possible methods for expansion without causing additional delay are, for example, linear interpolation or spline interpolation. That is, the resampler 72 may immediately interpolate the samples (as present in the time interval 102) of the retransformation type 96 for the preceding time region 84 within the state buffer 130. The state buffer may function as a first-in first-out buffer, as shown in FIG. Naturally, not all frequency components necessary for complete aliasing cancellation are obtained by this process, but at least a low frequency range such as 0 to 6.4 kHz can be generated without any distortion, These frequencies are most relevant in psychoacoustic terms.

低いサンプリングレートへのスイッチダウンの場合には、線形またはスプライン補間は、また、追加遅延を発生させずにステートバッファを縮小するためにも使用できる。つまり、リサンプラー７２は補間によりサンプリングレートを減少させてもよい。しかし、大きい縮小係数でのサンプリングレートへのスイッチダウン、例えば３２ｋＨｚ（２０ｍｓごとに６４０個のサンプル）から１２．８ｋＨｚ（２０ｍｓごとに２５６個のサンプル）への切り換え（この場合、縮小係数は２．５）は、高周波成分が除去されなければエイリアシング解消をひどく妨害する可能性がある。合成フィルタリングがこの現象に対処してもよく、この合成フィルタリングでは、フィルターバンクまたは再変換装置を「フラッシュ」することにより、高周波成分を除去することができる。これは、切り換えの瞬間にフィルターバンクが少ない周波数成分を合成し、従って、重複加算バッファから高スペクトル成分を取り除いてきれいにすることを意味している。より正確には、先行時間領域８４のための第１のサンプリングレートから後続時間領域８６のための第２のサンプリングレートへのスイッチダウンを想像して下さい。上記説明から離れて、再変換装置７０は、先行時間領域８４のウィンドウバージョンの変換形９４の周波数成分の全てを再変換の対象とするわけではなく、そうすることによりスイッチダウンに備えるよう構成されている。むしろ、再変換装置７０は、変換形９４のあまり関係のない高周波成分を例えば０にセットすることにより、あるいは、これらの高周波成分を次第に減衰させるなどしてそれらの再変換に対する影響を減じることで、高周波成分を再変換から除外してもよい。例えば、この処理の対象となる高周波成分は、周波数成分Ｎ_k’よりも高いものであってもよい。
従って、結果的に生じた情報信号内では、時間領域８４は、意図的に入力部７６で入力された重複変換表記で入手可能であった帯域幅よりも低いスペクトル帯域で再構築されたものとなる。しかし、補間処理１０４にもかかわらず、高周波部分を気付かずに結合装置７４内でのエイリアシング解消処理に導入した場合に重複加算処理で起こるであろうエイリアシング問題を避けることができる。 In the case of a switch down to a lower sampling rate, linear or spline interpolation can also be used to shrink the state buffer without incurring additional delay. That is, the resampler 72 may reduce the sampling rate by interpolation. However, switching down to a sampling rate with a large reduction factor, eg switching from 32 kHz (640 samples every 20 ms) to 12.8 kHz (256 samples every 20 ms) (in this case the reduction factor is 2. In 5), if the high-frequency component is not removed, there is a possibility that aliasing may be severely disturbed. Synthetic filtering may address this phenomenon, where high frequency components can be removed by “flashing” the filter bank or reconversion device. This means that at the instant of switching, the frequency components with fewer filter banks are synthesized, thus removing the high spectral components from the overlap-add buffer and cleaning. More precisely, imagine a switch down from a first sampling rate for the preceding time region 84 to a second sampling rate for the following time region 86. Apart from the above description, the reconversion device 70 is configured not to subject all of the frequency components of the transform version 94 of the window version of the preceding time domain 84 to reconversion, and thereby to be prepared for switchdown. ing. Rather, the re-conversion device 70 reduces the influence on the re-conversion by setting the high-frequency components of the conversion type 94 that are not very relevant to 0, for example, or by gradually attenuating these high-frequency components. The high frequency component may be excluded from the reconversion. For example, the high frequency component to be processed may be higher than the frequency component N _k ′.
Therefore, in the resulting information signal, the time domain 84 is intentionally reconstructed with a lower spectral bandwidth than the bandwidth available in the duplicate transform notation entered at the input 76. Become. However, in spite of the interpolation process 104, the aliasing problem that may occur in the overlap addition process when the high-frequency part is introduced into the aliasing elimination process in the coupling device 74 without noticing can be avoided.

別の例として、高サンプリングレート表記からの切り換えのために適当なステートバッファ内で使用できるように、さらに低サンプリングレート表記も同時に生成可能である。これにより、デシメーション係数（デシメーションが必要な場合）が常に比較的低く（つまり２より小さく）保たれ、妨害となるようなアーチファクトがエイリアシングから起こることはない。前述したように、これが全ての周波数成分を維持するわけではないが、少なくとも、心理音響的に関連のある低周波を維持することになる。 As another example, a lower sampling rate notation can be generated simultaneously so that it can be used in a suitable state buffer for switching from a higher sampling rate notation. This keeps the decimation factor (if decimation is necessary) always relatively low (i.e. less than 2) and no disturbing artifacts arise from aliasing. As described above, this does not maintain all frequency components, but at least maintains a psychoacoustically relevant low frequency.

従って、特定の実施形態によれば、ＵＳＡＣの低遅延型を得るために、以下の方法でＵＳＡＣコーデックを修正することができる。最初に、ＴＣＸコーディングモードとＡＣＥＬＰコーディングモードのみが許可される。ＡＡＣモードは回避できる。２０ｍｓのフレーミングを得るために、そのフレーム長を選択できる。そして、動作モード（超広帯域（ＳＷＢ）、広帯域（ＷＢ）、狭帯域（ＮＢ）または全帯域幅）とビットレートに応じて、以下のようなシステムパラメータが選択可能である。システムパラメータの概略を以下の表１に示す。 Thus, according to a particular embodiment, the USAC codec can be modified in the following manner to obtain a low latency version of USAC. Initially, only TCX coding mode and ACELP coding mode are allowed. AAC mode can be avoided. The frame length can be selected to obtain 20 ms framing. Then, the following system parameters can be selected according to the operation mode (ultra-wide band (SWB), wide band (WB), narrow band (NB) or full bandwidth) and the bit rate. A summary of the system parameters is shown in Table 1 below.

狭帯域（ＮＢ）モードに関して、サンプリングレートの増加を避けることができ、内部サンプリングレートを入力サンプリングレートと等しくなるように、つまり８ｋＨｚにセットし、それに応じたフレーム長つまりサンプル数１６０のフレーム長を選択することにより、元に戻すことができる。同様に、広帯域（ＷＢ）動作モードの場合には、１６ｋＨｚを選択し、ＴＣＸのためのＭＤＣＴのフレーム長を、サンプル数２５６ではなく、３２０とすることができる。 For narrowband (NB) mode, an increase in sampling rate can be avoided, the internal sampling rate is set equal to the input sampling rate, ie 8 kHz, and the corresponding frame length, ie the frame length of 160 samples, is set. By selecting, it can be restored. Similarly, in the wideband (WB) mode of operation, 16 kHz may be selected and the MDCT frame length for TCX may be 320 instead of 256 samples.

特に、動作ポイントのリスト全体を通して、つまりサポートされているサンプリングレート、ビットレート及び帯域幅を通して変更動作を支えることができる。以下の表２に、ＵＳＡＣコーデックの予想低遅延型の内部サンプリングレートに関する様々な構成を示す。 In particular, the change operation can be supported throughout the list of operation points, i.e. through supported sampling rates, bit rates and bandwidths. Table 2 below shows various configurations for the expected low-latency internal sampling rate of the USAC codec.

サイド情報として、図２Ａ，２Ｂのリサンプラーを使用する必要はないことに留意すべきである。入力サンプリングレートから専用のコアサンプリング周波数へのリサンプリング機能を負うために、代わりにＩＩＲフィルターセットを設けることができる。これらのＩＩＲフィルター遅延は０．５ｍｓ未満であるが、入力周波数と出力周波数との間の比が半端なものであるので、その複雑さは相当なものである。全てのＩＩＲフィルターに関して遅延が同じであると仮定すると、違うサンプリングレート間での変更が可能となる。 It should be noted that it is not necessary to use the resampler of FIGS. 2A and 2B as side information. To take the resampling function from the input sampling rate to a dedicated core sampling frequency, an IIR filter set can be provided instead. These IIR filter delays are less than 0.5 ms, but the complexity is considerable because the ratio between the input frequency and the output frequency is odd. Assuming that the delay is the same for all IIR filters, it is possible to change between different sampling rates.

従って、図２Ａ，２Ｂのリサンプラーの実施例を使用することが好ましい。パラメトリックエンベロープモジュール（つまりＳＢＲ）のＱＭＦフィルターバンクが、上述したようなリサンプリング機能を実現するための共同作業に加わってもよい。ＳＷＢの場合、このことは、ＳＢＲエンコーダモジュールにより既に解析ステージが実現されている一方で、合成フィルターバンクステージをエンコーダに付加することになる。デコーダ側では、ＳＢＲが使用可能である場合にＱＭＦがアップサンプリング機能を既に負っている。このスキームは他の全ての帯域幅モードにおいても使用可能である。以下の表３に、必要なＱＭＦ構成の概略を示す。 Therefore, it is preferred to use the resampler embodiment of FIGS. 2A and 2B. A parametric envelope module (ie, SBR) QMF filter bank may participate in collaborative work to implement the resampling function as described above. In the case of SWB, this means that an analysis stage has already been realized by the SBR encoder module while a synthesis filter bank stage is added to the encoder. On the decoder side, the QMF already has an upsampling function when SBR is available. This scheme can also be used in all other bandwidth modes. Table 3 below outlines the required QMF configuration.

入力サンプリング周波数が一定であると仮定すると、ＱＭＦ合成プロトタイプを変えることにより、内部サンプリングレート間での変更が可能となる。デコーダ側には逆の動作が適用できる。ＱＭＦ帯域の帯域幅は動作ポイントの全域を通して同じであることに留意すべきである。 Assuming that the input sampling frequency is constant, changing between internal sampling rates is possible by changing the QMF synthesis prototype. The reverse operation can be applied to the decoder side. It should be noted that the bandwidth of the QMF band is the same throughout the operating point.

本発明のいくつかの態様を装置に関して説明してきたが、これらの態様はまたこれらに相当する方法の説明でもあり、ブロックや装置は方法ステップや方法ステップの特徴に対応する。同様に、方法ステップに関して説明した態様はまた、これらに相当するブロックやアイテムまたはこれらに相当する装置の特徴の説明でもある。これらの方法ステップのうちのいくつかまたは全てが、例えばマイクロプロセッサ、プログラム制御可能なコンピュータや電子回路のようなハードウェア装置により（またはそれを使用して）実施してもよい。いくつかの実施形態において、最も重要な方法ステップのうちの一つまたはそれ以上のものが、このような装置によって実行されてもよい。 Although some aspects of the present invention have been described with respect to apparatus, these aspects are also descriptions of corresponding methods, where blocks and apparatus correspond to method steps and features of method steps. Similarly, the aspects described with respect to the method steps are also descriptions of the corresponding blocks and items or the features of the apparatus corresponding thereto. Some or all of these method steps may be performed by (or using) a hardware device such as, for example, a microprocessor, programmable computer or electronic circuit. In some embodiments, one or more of the most important method steps may be performed by such an apparatus.

実施条件に応じて、本発明の実施形態はハードウェアまたはソフトウェアで実現可能である。これは、例えばフロッピーディスク、ＤＶＤ、ブルーレイ、ＣＤ、ＲＯＭ、ＰＲＯＭ、ＥＰＲＯＭ、ＥＥＰＲＯＭやＦＬＡＳＨメモリーなどの、電子読み取り制御可能な信号が中に保存されたデジタル記憶媒体を使用して実施することができ、これらの電子読み取り制御可能な信号は、それぞれの方法が実行できるように、プログラム可能なコンピュータシステムと協働する（または協働可能である）。従って、このようなデジタル記憶媒体はコンピュータ読み取り可能なものであってもよい。 Depending on implementation conditions, embodiments of the present invention can be implemented in hardware or software. This can be done using a digital storage medium in which signals that can be read electronically are stored, such as floppy disk, DVD, Blu-ray, CD, ROM, PROM, EPROM, EEPROM or FLASH memory. These electronic reading controllable signals cooperate (or can cooperate) with a programmable computer system so that the respective methods can be performed. Accordingly, such digital storage media may be computer readable.

本発明のいくつかの実施形態は、電子読み取り制御可能な信号を有するデータキャリアを含み、これらの電子読み取り制御可能な信号は、ここで説明した方法のうちの一つを実行できるように、プログラム可能なコンピュータシステムと協働可能である。 Some embodiments of the present invention include a data carrier having electronic read controllable signals that can be programmed to perform one of the methods described herein. Can collaborate with possible computer systems.

一般的に、本発明の実施形態は、プログラムコードを備えたコンピュータプログラム製品として実施でき、このプログラム製品がコンピュータで動作した際、このプログラムコードは前述の方法のうちの一つを実行するためのものである。このようなプログラムコードは、例えば機械読み取り可能なキャリアに保存されていてもよい。 In general, embodiments of the present invention can be implemented as a computer program product with program code, which when run on a computer, the program code performs one of the methods described above. Is. Such a program code may be stored on a machine-readable carrier, for example.

他の実施形態は、ここで説明した方法のうちの一つを実行するためのものであり、機械読み取り可能なキャリアに保存されているコンピュータプログラムを含む。 Another embodiment is for performing one of the methods described herein and includes a computer program stored on a machine readable carrier.

換言すれば、本発明の方法の一実施形態は、従って、コンピュータで動作した際、前述の方法のうちの一つを実行するためのプログラムコードを有するコンピュータプログラムである。 In other words, one embodiment of the method of the present invention is therefore a computer program having program code for performing one of the aforementioned methods when run on a computer.

本発明の方法の別の実施形態は、従って、前述の方法のうちの一つを実行するためのコンピュータプログラムを格納しているデータキャリア（またはデジタル媒体またはコンピュータ読み取り可能な媒体）である。 Another embodiment of the method of the present invention is therefore a data carrier (or digital or computer readable medium) that stores a computer program for performing one of the aforementioned methods.

本発明の方法の別の実施形態は、ここで説明した方法のうちの一つを実行するためのコンピュータプログラムを表すデータストリームまたは一連の信号である。このデータストリームまたは一連の信号は、例えばインターネットのようなデータ通信接続を介して送信されるように構成されていてもよい。 Another embodiment of the method of the present invention is a data stream or a series of signals representing a computer program for performing one of the methods described herein. This data stream or series of signals may be configured to be transmitted over a data communication connection, such as the Internet.

さらに別の実施形態は、ここで説明した方法のうちの一つを実行するように構成された、例えばコンピュータやプログラム可能な論理装置のような処理手段を含む。 Yet another embodiment includes a processing means, such as a computer or programmable logic device, configured to perform one of the methods described herein.

本発明のさらに別の実施形態は、ここで説明した方法のうちの一つを実行するためのコンピュータプログラムがインストールされているコンピュータを含む。 Yet another embodiment of the invention includes a computer having a computer program installed for performing one of the methods described herein.

本発明の別の実施形態は、ここで説明した方法のうちの一つを実行するためのコンピュータプログラムを受信機に転送する（例えば電子的にまたは光学的に）よう構成された装置またはシステムを含む。 Another embodiment of the present invention provides an apparatus or system configured to transfer (eg, electronically or optically) a computer program for performing one of the methods described herein to a receiver. Including.

いくつかの実施形態において、ここで説明した方法の機能性のうちのいくつかまたは全てを実行するために、プログラム可能な論理装置（例えばフィールドプログラマブルゲートアレイ）を使用してもよい。いくつかの実施形態において、ここで説明した方法のうちの一つを実行するために、フィールドプログラマブルゲートアレイがマイクロプロセッサと協働してもよい。一般的に、これらの方法は何らかのハードウェア装置によって実行されることが好ましい。 In some embodiments, programmable logic devices (eg, field programmable gate arrays) may be used to perform some or all of the functionality of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor to perform one of the methods described herein. In general, these methods are preferably performed by some hardware device.

上述の実施形態は単に本発明の原理を説明しているにすぎない。ここで説明した配置や詳細に関して様々な修正や変更が当業者には明らかであろう。従って、本発明は以下の特許請求項の範囲によってのみ制限され、上述の実施形態で示された詳細によっては制限されない。 The above-described embodiments are merely illustrative of the principles of the invention. Various modifications and changes to the arrangements and details described herein will be apparent to those skilled in the art. Accordingly, the invention is limited only by the scope of the following claims and not by the details shown in the above-described embodiments.

Claims

An information signal conversion device configured to generate an overlap conversion representation of an information signal using aliasing occurrence overlap conversion,
An input unit (105) for receiving an information signal in the form of a series of samples;
A grabber (106) configured to obtain a continuous overlap region of the information signal;
A resampler (107) configured to perform resampling by interpolation on at least a part of the continuous overlapping regions so that each continuous overlapping region has a constant sampling rate, but the sampling rate is different between the continuous overlapping regions. )When,
A window processing unit (108) configured to perform window processing on a continuous overlapping region of information signals;
Converter configured to perform individually transform to windowed region (109) seen including,
The capture unit (106) is configured to capture the continuous overlap region of the information signal so that the continuous overlap region of the information signal has a certain length in time.

2. The information signal conversion device according to claim 1, wherein the capture unit (106) is configured to capture the continuous overlap region of the information signal so that the continuous overlap region of the information signal has a constant offset in time. Has been.

The information signal conversion device according to claim 1 or 2 ,
The series of samples has a variable sampling rate that changes from a first sampling rate to a second sampling rate at a predetermined instant (113);
The resampler (107) is configured so that the continuous overlapping region (114b, c) overlapping at a predetermined moment is such that the constant sampling rate of the continuous overlapping region changes only once from the first sampling rate to the second sampling rate. ) For resampling.

4. The information signal conversion apparatus according to claim 3 , wherein the converter is configured to adapt the conversion length of the conversion type of each windowed area to the number of samples of each windowed area.

A method for generating a double conversion representation of an information signal using aliasing generation double conversion,
Receiving an information signal in the form of a series of samples;
Obtaining a continuous overlap region of information signals;
Resampling by interpolation on at least a portion of the continuous overlap areas so that each continuous overlap area has a constant sampling rate, but the sampling rate is different between the continuous overlap areas;
Performing window processing on continuous overlapping areas of information signals;
It looks including that performed individually transform to windowed area,
In the acquisition of the continuous overlapping area, the continuous overlapping area of the information signal is captured so that the continuous overlapping area of the information signal has a certain length in time.

A computer program having program code for causing a computer to execute the method according to claim 5 when started on the computer.