CN100339886C - Encoder capable of detecting transient position of sound signal and encoding method - Google Patents
Encoder capable of detecting transient position of sound signal and encoding method Download PDFInfo
- Publication number
- CN100339886C CN100339886C CNB031103685A CN03110368A CN100339886C CN 100339886 C CN100339886 C CN 100339886C CN B031103685 A CNB031103685 A CN B031103685A CN 03110368 A CN03110368 A CN 03110368A CN 100339886 C CN100339886 C CN 100339886C
- Authority
- CN
- China
- Prior art keywords
- sub
- data
- frequency
- sampling data
- subband
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
技术领域technical field
本发明提供一种编码器,尤指一种可以检测声音信号的暂态位置的编码器。本发明的编码器亦可以进一步判断频域编码时使用视窗数据的块长度。The invention provides an encoder, especially an encoder capable of detecting the transient position of a sound signal. The encoder of the present invention can further determine the block length of the window data used in the frequency domain encoding.
背景技术Background technique
目前有许多编码器依据人类听觉系统的特性而采用特殊的编码演算法,可将数字声音信号数据压缩至十倍以上,如MP3、AAC、WWA及Dolby Digital等,这些编码器采用了知觉编码、频域编码、视窗切换及动态位分配等技术来消除原始声音信号数据中不必要的内容。At present, many encoders use special encoding algorithms based on the characteristics of the human auditory system, which can compress digital audio signal data to more than ten times, such as MP3, AAC, WWA and Dolby Digital, etc. These encoders use perceptual encoding, Frequency domain coding, window switching and dynamic bit allocation and other technologies are used to eliminate unnecessary content in the original sound signal data.
知觉编码是通过消除一般人类听觉系统所感受不到的声音信号数据来进行压缩。一般来说,人类所能听到的声音频率约为20Hz到20kHz之间,而其他频域的声音一般人类是感受不到的。另一方面,人类的听觉系统在某些情况下也会产生听觉的屏蔽(mask),而无法分辨出量化的噪声,例如当有一个音量或音色特别突出的声音出现时,其邻近的细小的声音会比较难被察觉,因此在编码时不需要把所有的声音细节都编进去。Perceptual coding is compression by eliminating sound signal data that is not perceived by the general human auditory system. Generally speaking, the sound frequency that humans can hear is about 20Hz to 20kHz, while the sound in other frequency domains is generally not felt by humans. On the other hand, the human auditory system also produces an auditory mask in some cases, and cannot distinguish quantized noise. The sound will be more difficult to detect, so you don't need to encode all the sound details into it.
频域编码是一种可以有效消除不必要数据的方法,将有很强相关性的时域数据转换到各元素几乎不相关的频域内,来除去除数据中不必要的内容,一般可分为变换编码或子带(subband)编码。变换编码的频谱解析度较高,而子带编码的解析度低但效率较高,所以可以将这两种编码结合成一个混合滤波器,在不同频率处有不同的解析度。然而,频域编码有一个显著的现象称为前向回波(preechoes),举例来说,一般静音之后倘若突然出现很大的声音,可能会使得量化误差增大。在变换编码和子带编码中都会产生这种现象,导致数据在转换回时域之后出现声音的前向回波。Frequency domain coding is a method that can effectively eliminate unnecessary data. It converts time domain data with strong correlation into frequency domain where each element is almost irrelevant to remove unnecessary content in the data. Generally, it can be divided into Transform coding or subband coding. Transform coding has higher spectral resolution, while subband coding has lower resolution but higher efficiency, so the two codes can be combined into a hybrid filter with different resolutions at different frequencies. However, frequency domain coding has a remarkable phenomenon called preechoes. For example, if there is a sudden loud sound after general silence, the quantization error may increase. This phenomenon occurs in both transform coding and subband coding, causing a forward echo of the sound after the data has been converted back to the time domain.
消除前向回波的一种方法是将误差限制在一个较小的时间段内,把声音的其它部分与前向回波分开,使前向回波产生于屏蔽区之中。将误差限制在一个较小的时间段内需要使用较小的块来进行频域变换,这种方法称为视窗切换,当信号稳定时使用较大的块来进行频域编码,而当信号有大幅度的暂态(Transient)时,就使用较小的块来进行频域编码。视窗切换的缺点是表示相同数据时需要更多的位数,因为随着编码数据数量的增加需要更多的信息。One way to eliminate the forward echo is to limit the error to a smaller time period, separate the other parts of the sound from the forward echo, and make the forward echo occur in the shielded area. Limiting the error to a smaller time period requires the use of smaller blocks for frequency domain transformation. This method is called window switching. When the signal is stable, larger blocks are used for frequency domain coding, and when the signal has When there is a large transient (Transient), smaller blocks are used for frequency domain coding. The disadvantage of window switching is that more bits are required to represent the same data, because more information is required as the amount of encoded data increases.
一个编码器是否有好的编码品质、与位在各个子带或系数之间的分配有很大的关系。为有效地分配位,必须不断地分析输入信号,并根据对人类听觉系统的知识所建立的模型,将较多位分配到人的听觉最有效的区域,在人耳不敏感的区域就不用分配或只分配很少的编码位。因为信号不停变化,人的听觉系统在不同条件下对信号也会有不同的反应,这就是动态位分配的技术。好的位分配方案需要精确的心理声学模型(psychoacoustic model)。Whether an encoder has good encoding quality has a lot to do with the allocation of bits between subbands or coefficients. In order to effectively allocate bits, the input signal must be continuously analyzed, and based on the model established by the knowledge of the human auditory system, more bits are allocated to the most effective area of human hearing, and no allocation is required in areas where the human ear is not sensitive Or only allocate few encoding bits. Because the signal is constantly changing, the human auditory system will respond differently to the signal under different conditions. This is the technology of dynamic bit allocation. A good bit allocation scheme requires an accurate psychoacoustic model.
请参考图1,图1为已知MPEG layer-3声音信号编码的示意图。首先,脉冲码调制(pulse code modulation,PCM)的输入信号10由多相滤波器组(polyphase filter bank)12分成32个等宽的频率子带(frequencysubbands),多相滤波器组12可以简易的分析频率对时间的关系,但是等宽的频率子带并不能准确地反映出人类听觉系统的听觉特性,此外,邻近的频率子带会有较多的重叠部分,所以多相滤波器组12的输出需使用修正离散余弦变换(modified discrete cosine transform,MDCT)14来补偿。修正离散余弦变换14进一步将频率子带做细分,以获得较好的频谱解析度,而且可以将一些由多相滤波器组12所产生的重叠消除掉。修正离散余弦变换14包含两个不同长度的视窗块,分别为一个十八采样的长块和一个六采样的短块,因为连续的转移视窗块有百分之五十的重叠,所以块的长度分别是三十六和十二。在声音信号稳定时,长块有较高的频率解析度及较好的压缩率,而短块则提供较好的时间解析度。由于长块的时间解析度较低,若在处理的块中发生暂态现象,因量化噪声(Quantization Noise)会扩散到整个块,使得能量较小的信号因本身屏蔽效应(Mask)较低无法遮蔽量化噪声而产生失真,如前向回波。为避免前向回波,已知MPEG声音信号编码使用心理声学模型16来检测声音信号的暂态(Transient)位置,以使用短块进行修正离散余弦变换14来避免前向回波。在将输入信号10使用频域编码的技术转换到频域后,接着进行量化程序18,根据心理声学模型16来量化数据,然后进行封装程序20,将数据封装后输出数据位流(bitstream)的输出信号22。Please refer to FIG. 1, which is a schematic diagram of known MPEG layer-3 audio signal encoding. First, the
由上述可知,在进行频域编码时,为避免前向回波,视窗切换是一种常用的技术,这时检测声音信号暂态位置的机制便很重要。已知MPEG声音信号编码使用心理声学模型16来检测声音信号的暂态位置,虽然很准确,但由于心理声模型16相当复杂,所需的成本也很高,若因为使用视窗切换需要检测声音信号的暂态位置而使用高成本的心理声学模型16,是相当不经济的。It can be seen from the above that, in the frequency domain encoding, in order to avoid the forward echo, window switching is a commonly used technique, and the mechanism for detecting the transient position of the sound signal is very important at this time. It is known that MPEG sound signal coding uses
发明内容Contents of the invention
因此本发明的主要目的是提供一种可检测声音信号暂态位置的编码器。另一方面,本发明亦提供一种可判断频域编码时使用视窗数据的块长度的编码器及编码方法,以解决上述问题。It is therefore the main object of the present invention to provide an encoder which detects the transient position of an acoustic signal. On the other hand, the present invention also provides an encoder and an encoding method capable of determining the block length of window data used in frequency domain encoding, so as to solve the above problems.
本发明是提供一种编码器,用来将输入信号编码为输出信号。该编码器包含多相滤波器组,用来根据该输入信号产生多个子带样本,不同的子带样本对应于不同时段的输入信号波形,而每一子带样本中包含多个频率子带;暂态检测器,连接到该多相滤波器组,用来决定视窗数据的块长度,该视窗数据中包含有多个加权值,该暂态检测器包含子带选择器,用来选择该多个子带样本作为参考采样数据;能量计算器,连接到该子带选择器,用来计算该参考采样数据中频率子带的能量总和;分区器,连接到该子带选择器与该能量计算器之间,用来将该参考采样数据分成数组子采样数据,每一组子采样数据包含至少一个子带样本;以及比较器,连接到该能量计算器,用来将能量计算器的输出值与第一临界值作比较,根据该比较结果输出表示视窗数据的块长度的信号;以及编码处理单元,连接到该多相滤波器组与该暂态检测器,用来将该多个频率子带乘以该暂态视窗数据中的多个加权值以产生加权结果,再以预设的转换演算法根据该加权结果产生该输出信号。The present invention provides an encoder for encoding an input signal into an output signal. The encoder includes a polyphase filter bank, which is used to generate a plurality of subband samples according to the input signal, and different subband samples correspond to input signal waveforms of different periods, and each subband sample includes a plurality of frequency subbands; A transient detector, connected to the polyphase filter bank, is used to determine the block length of the window data, the window data contains a plurality of weighted values, and the transient detector includes a subband selector, which is used to select the multiple weighted values sub-band samples as reference sampling data; an energy calculator, connected to the sub-band selector, used to calculate the energy sum of the frequency sub-bands in the reference sampling data; a partitioner, connected to the sub-band selector and the energy calculator between, used to divide the reference sampling data into an array of sub-sampling data, each group of sub-sampling data includes at least one sub-band sample; and a comparator, connected to the energy calculator, used to compare the output value of the energy calculator with The first critical value is compared, and a signal representing the block length of the window data is output according to the comparison result; and an encoding processing unit, connected to the polyphase filter bank and the transient detector, is used for the plurality of frequency subbands Multiple weighted values in the transient window data are multiplied to generate a weighted result, and then a preset conversion algorithm is used to generate the output signal according to the weighted result.
本发明另提供一种编码方法,用来将输入信号编码为输出信号。该编码方法包含有进行子带编码步骤,以根据该输入信号产生多个子带样本,不同的子带样本对应于不同时段的输入信号波形,而每一子带样本中包含多个频率子带;进行选择步骤,以提供对应于预设块长度的视窗数据,该视窗数据中包含有多个加权值;而该选择步骤中包含有:在该多个子带样本中,选出多个子带样本作为参考采样数据,并根据该参考采样数据在预设频率范围内的频率子带的能量总和来决定该视窗数据的块长度;以及进行变换编码步骤,将该多个频率子带乘以该选择步骤所决定的视窗数据的多个加权值以产生加权结果,并以预设的转换演算法根据该加权结果产生该输出信号。The present invention further provides an encoding method for encoding an input signal into an output signal. The encoding method includes a subband encoding step to generate a plurality of subband samples according to the input signal, different subband samples correspond to input signal waveforms of different periods, and each subband sample includes a plurality of frequency subbands; A selection step is performed to provide window data corresponding to a preset block length, the window data includes a plurality of weighted values; and the selection step includes: among the plurality of sub-band samples, selecting a plurality of sub-band samples as Referring to the sampled data, and determining the block length of the window data according to the energy sum of the frequency subbands of the reference sampled data within the preset frequency range; and performing a transform encoding step, multiplying the plurality of frequency subbands by the selection step A plurality of weighted values of the determined window data are used to generate a weighted result, and a preset conversion algorithm is used to generate the output signal according to the weighted result.
相对已知技术,本发明提供一种编码器及编码方法可用来决定进行修正离散余弦变换时使用的视窗数据的块长度,利用编码器的过程中所产生的子带样本中频率子带所含的能量值为判断声音信号数据是否发生暂态,远比已知使用心理声学模型需要较低的成本,符合经济效益。Compared with the known technology, the present invention provides an encoder and an encoding method that can be used to determine the block length of the window data used when performing Modified Discrete Cosine Transformation. The energy value of judging whether the sound signal data has a transient state requires far lower cost than the known psychoacoustic model, which is in line with economic benefits.
附图说明Description of drawings
图1为已知MPEG layer-3声音信号编码的示意图。Fig. 1 is the schematic diagram of known MPEG layer-3 sound signal encoding.
图2为本发明实施例的编码器的示意图。Fig. 2 is a schematic diagram of an encoder according to an embodiment of the present invention.
图3为本实施例的子带样本的示意图。FIG. 3 is a schematic diagram of subband samples in this embodiment.
图4为本发明实施例中编码器检测声音信号的暂态位置方法的流程图。FIG. 4 is a flow chart of a method for an encoder to detect a transient position of a sound signal in an embodiment of the present invention.
附图符号说明Description of reference symbols
10 输入信号 12 多相滤波器组10
14 修正离散余弦变换 16 心理声学模型14 Modified
18 量化程序 20 封装程序18
22 输出信号 30 本发明编码器22 Output signal 30 Encoder of the present invention
32 暂态检测器 34 编码处理单元32 Transient detector 34 Encoding processing unit
36 子带选择器 38 能量计算器36 Subband Selector 38 Energy Calculator
40 分区器 42 比较器40 Partitioner 42 Comparator
50 参考采样数据50 reference sampling data
具体实施方式Detailed ways
请参考图2,图2为本发明实施例的编码器30的示意图。编码器30用来将脉冲码调制的输入信号10编码为位流的输出信号22。编码器20包含多相滤波器组12、暂态检测器32以及编码处理单元34。多相滤波器组12根据该输入信号10产生多个子带样本,不同的子带样本对应于不同时段的输入信号10波形,而每一子带样本中包含多个频率子带。编码处理单元34可对该多个频率子带进行修正离散余弦变换。暂态检测器32连接到多相滤波器组12及编码处理单元34之间,可决定编码处理单元34进行修正离散余弦变换时所使用的视窗数据的块长度。暂态检测器32包含子带选择器36、能量计算器38、分区器40以及比较器42。子带选择器36会在预设频率范围选择该多个子带样本中部分的子带样本作为参考采样数据,接着能量计算器38会计算参考采样数据中所含的能量值,之后将该能量值交给比较器42与临界值作比较。若是参考采样数据的总能量超过该临界值时,也就是在参考采样数据中可能存在暂态的情形,则再由分区器40将参考采样数据分成数组等宽的子采样数据,而每一组子采样数据至少包含一子带样本,此时能量计算器38会计算相邻两组子采样数据在预设频率范围内的频率子带的能量差值,再将该能量差值传送至比较器42与预定的临界值作比较。如果该能量差值大于预定的临界值时,则可决定编码处理单元34使用短块的视窗数据进行修正离散余弦变换,如此反覆直到分区器42完成所有可能的子采样数据组合。若此时相邻两组的子采样数据的能量差值仍小于预定的临界值,则可决定编码处理单元34使用长块的视窗数据进行修正离散余弦变换。Please refer to FIG. 2 , which is a schematic diagram of an encoder 30 according to an embodiment of the present invention. The encoder 30 is used to encode the pulse code modulated
请参考图3,图3为本实施例的子带样本的示意图。多相滤波器组12在一个时段t1中输出十八个子带样本,每一个子带样本中含有三十二个频率子带。编码处理单元34对重叠时段中的每一个频率子带进行修正离散余弦变换,也就是三十六个子带样本。暂态检测器32针对发生声音信号暂态的位置作检测以决定编码处理单元34应使用何种视窗块来进行修正离散余弦变换。所谓的预设频率范围通常指的是介于截止子带与编码限制子带之间的频率,子带选择器36会选择这个频率范围内的频率子带来作为参考采样数据50。截止子带可以根据经验或是实验值来选择第一个子带或是更高频的子带。在本实施例中,截止子带的频率大约为4kHz。编码限制子带就必须要根据编码规则来决定。由于位率(bitrate)以及带宽(bandwidth)都有其限制,编码器30必须舍弃部分高频子带的信息,而被舍弃的频率子带的数据就不再列入考虑。假设没有信息被舍弃的话,则最后一个子带就是编码限制子带。在参考采样数据50选定后,能量计算器38会计算出参考采样数据50中所含的能量值,再由比较器42来判断是否对参考采样数据50继续作检测,分区器40可将参考采样数据50再分成数组等宽的子采样数据,然后能量计算器38会计算相邻两组子采样数据的能量差值,由比较器42决定视窗数据的块长度。举例来说,首先能量计算器38计算子带选择器36选出的参考采样数据50中所有频率子带的总能量,若总能量大于-60dB,则参考采样数据中可能存在有暂态的情形发生,由分区器40将参考采样数据50中的子带样本分成六组等宽的子采样数据,接着由能量计算器38计算相邻两组子采样数据的能量差值交给比较器42进行比较,若两子采样数据的能量差值并未大于20dB,表示这两此子采样数据之间其实并无暂态的情形发生,分区器40会重新将参考采样数据50中的子带样本分成3组等宽的子采样数据,此时再由能量计算器38计算相邻两组子采样数据的能量差值交给比较器42判断是否大于12dB。若大于12dB,则表示数据中含有暂态的情形,因此判断应使用短块视窗;若并未大于12dB,则使用长块视窗。Please refer to FIG. 3 , which is a schematic diagram of subband samples in this embodiment. The
请参考图4,图4为本发明实施例中,编码器30检测声音信号暂态位置的方法的流程图。本实施例的编码方法可检测声音信号的暂态位置。本实施例的编码方法首先进行子带编码步骤,根据输入信号10产生多个子带样本,不同的子带样本对应于不同时段的输入信号10波形,而每一子带样本中包含多个频率子带。接着进行选择步骤,以决定下一步骤所需使用的视窗数据的块长度,视窗数据中含有多个加权值,选择步骤的方法为在该多个子带样本中,选出多个子带样本作为参考采样数据,并根据参考采样数据在预设频率范围内的频率子带的能量总和来决定该视窗数据的块长度。最后进行变换编码步骤,将该多个频率子带乘以选择步骤所决定的视窗数据的多个加权值以产生加权结果,并根据加权结果使用修正离散余弦变换产生输出信号。而检测声音信号暂态位置的详细步骤如下:Please refer to FIG. 4 . FIG. 4 is a flowchart of a method for the encoder 30 to detect the transient position of the sound signal in an embodiment of the present invention. The encoding method of this embodiment can detect the transient position of the sound signal. The encoding method of this embodiment first performs the sub-band encoding step, and generates a plurality of sub-band samples according to the
步骤110:开始进行检测声音信号的暂态位置;Step 110: start to detect the transient position of the sound signal;
步骤120:计算选择作为参考采样数据中的频率子带的总能量是否大于预定的临界值,若是,则进行步骤130,若否,则进行步骤170;Step 120: Calculate whether the total energy of the frequency sub-band selected as the reference sampling data is greater than a predetermined critical value, if so, proceed to step 130, if not, proceed to step 170;
步骤130:将参考采样数据分成数组等宽的子采样数据,每一组子采样数据包含一个以上的子带样本,计算每一组子采样数据中所有的频率子带在预设频率范围中的能量值,接着进行步骤140;Step 130: Divide the reference sampling data into arrays of sub-sampling data of equal width, each group of sub-sampling data contains more than one sub-band sample, and calculate the frequency sub-bands in each group of sub-sampling data in the preset frequency range Energy value, then proceed to step 140;
步骤140:判断相邻两组子采样数据的能量差值是否大于预定的临界值,若是,则进行步骤160,若否,则进行步骤150;Step 140: judging whether the energy difference between two adjacent groups of sub-sampling data is greater than a predetermined critical value, if so, proceed to step 160, if not, proceed to step 150;
步骤150:判断参考采样数据是否还可以分成不同的子采样数据,若是,则回到步骤130,若否,则进行步骤170;Step 150: judging whether the reference sampling data can also be divided into different sub-sampling data, if so, then return to step 130, if not, then proceed to step 170;
步骤160:参考采样数据中含有暂态位置,输出使用短块的视窗数据信号,进行步骤180;Step 160: Refer to the transient position contained in the sampling data, output the window data signal using a short block, and proceed to step 180;
步骤170:参考采样数据中不含暂态位置,输出使用长块的视窗数据信号,进行步骤180;Step 170: Referring to the sampling data that does not contain the transient position, output the window data signal using a long block, and proceed to Step 180;
步骤180:输出判断结果,结束检测声音信号的暂态位置。Step 180: output the judgment result, and end detecting the transient position of the sound signal.
相对已知技术,本发明提供一种编码器及编码方法可用来决定进行修正离散余弦变换时使用的视窗数据的块长度,利用编码器的过程中所产生的子带样本中频率子带所含的能量值为判断声音信号数据是否发生暂态,远比已知使用心理声学模型需要较低的成本,符合经济效益。Compared with the known technology, the present invention provides an encoder and an encoding method that can be used to determine the block length of the window data used when performing Modified Discrete Cosine Transformation. The energy value of judging whether the sound signal data has a transient state requires far lower cost than the known psychoacoustic model, which is in line with economic benefits.
以上所述仅为本发明的优选实施例,凡依本发明权利要求书所做的相似变化与改进,皆应属本发明专利的涵盖范围。The above descriptions are only preferred embodiments of the present invention, and all similar changes and improvements made according to the claims of the present invention shall fall within the scope of the patent of the present invention.
Claims (18)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB031103685A CN100339886C (en) | 2003-04-10 | 2003-04-10 | Encoder capable of detecting transient position of sound signal and encoding method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB031103685A CN100339886C (en) | 2003-04-10 | 2003-04-10 | Encoder capable of detecting transient position of sound signal and encoding method |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1536559A CN1536559A (en) | 2004-10-13 |
| CN100339886C true CN100339886C (en) | 2007-09-26 |
Family
ID=34319676
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB031103685A Expired - Fee Related CN100339886C (en) | 2003-04-10 | 2003-04-10 | Encoder capable of detecting transient position of sound signal and encoding method |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN100339886C (en) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101247129B (en) * | 2004-09-17 | 2012-05-23 | 广州广晟数码技术有限公司 | Signal processing method |
| US8204261B2 (en) * | 2004-10-20 | 2012-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Diffuse sound shaping for BCC schemes and the like |
| CN100573665C (en) * | 2005-05-10 | 2009-12-23 | 凌阳科技股份有限公司 | Energy detection device and method |
| DE102006047197B3 (en) * | 2006-07-31 | 2008-01-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Device for processing realistic sub-band signal of multiple realistic sub-band signals, has weigher for weighing sub-band signal with weighing factor that is specified for sub-band signal around subband-signal to hold weight |
| CN101308655B (en) * | 2007-05-16 | 2011-07-06 | 展讯通信(上海)有限公司 | Audio coding and decoding method and layout design method of static discharge protective device and MOS component device |
| CN101308651B (en) * | 2007-05-17 | 2011-05-04 | 展讯通信(上海)有限公司 | Detection method of audio transient signal |
| US8630848B2 (en) * | 2008-05-30 | 2014-01-14 | Digital Rise Technology Co., Ltd. | Audio signal transient detection |
| EP2214165A3 (en) * | 2009-01-30 | 2010-09-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus, method and computer program for manipulating an audio signal comprising a transient event |
| JP5799707B2 (en) * | 2011-09-26 | 2015-10-28 | ソニー株式会社 | Audio encoding apparatus, audio encoding method, audio decoding apparatus, audio decoding method, and program |
| WO2014123926A1 (en) | 2013-02-05 | 2014-08-14 | Interdigital Patent Holdings, Inc. | Pulse-shaped orthogonal frequency division multiplexing |
| CN106683687B (en) * | 2016-12-30 | 2020-02-14 | 杭州华为数字技术有限公司 | Abnormal sound classification method and device |
| CN118800251A (en) * | 2023-04-13 | 2024-10-18 | 华为技术有限公司 | Method and device for encoding scene audio signal |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5197087A (en) * | 1989-07-19 | 1993-03-23 | Naoto Iwahashi | Signal encoding apparatus |
| CN1093843A (en) * | 1993-02-02 | 1994-10-19 | 索尼公司 | Method and device for high-efficiency encoding and decoding |
| CN1125010A (en) * | 1993-04-22 | 1996-06-19 | 弗兰克·乌达尔·莱昂哈德 | Method and system for detecting and generating transient conditions in auditory signals |
| JPH08286699A (en) * | 1995-04-14 | 1996-11-01 | Tech Res & Dev Inst Of Japan Def Agency | Transient sound frequency analysis method and device |
| CN1208489A (en) * | 1995-12-01 | 1999-02-17 | 数字剧场系统股份有限公司 | Multi-channel Predictive Subband Encoder Using Psychoacoustic Adaptive Bit Allocation |
| CN1312977A (en) * | 1998-05-27 | 2001-09-12 | 微软公司 | Scalable audio coder and decoder |
| CN1361594A (en) * | 2000-12-25 | 2002-07-31 | 松下电器产业株式会社 | Equipment and method for coding frequency signal and computer program products |
-
2003
- 2003-04-10 CN CNB031103685A patent/CN100339886C/en not_active Expired - Fee Related
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5197087A (en) * | 1989-07-19 | 1993-03-23 | Naoto Iwahashi | Signal encoding apparatus |
| CN1093843A (en) * | 1993-02-02 | 1994-10-19 | 索尼公司 | Method and device for high-efficiency encoding and decoding |
| CN1125010A (en) * | 1993-04-22 | 1996-06-19 | 弗兰克·乌达尔·莱昂哈德 | Method and system for detecting and generating transient conditions in auditory signals |
| JPH08286699A (en) * | 1995-04-14 | 1996-11-01 | Tech Res & Dev Inst Of Japan Def Agency | Transient sound frequency analysis method and device |
| CN1208489A (en) * | 1995-12-01 | 1999-02-17 | 数字剧场系统股份有限公司 | Multi-channel Predictive Subband Encoder Using Psychoacoustic Adaptive Bit Allocation |
| CN1312977A (en) * | 1998-05-27 | 2001-09-12 | 微软公司 | Scalable audio coder and decoder |
| CN1361594A (en) * | 2000-12-25 | 2002-07-31 | 松下电器产业株式会社 | Equipment and method for coding frequency signal and computer program products |
Also Published As
| Publication number | Publication date |
|---|---|
| CN1536559A (en) | 2004-10-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20040181403A1 (en) | Coding apparatus and method thereof for detecting audio signal transient | |
| JP5539203B2 (en) | Improved transform coding of speech and audio signals | |
| US7548850B2 (en) | Techniques for measurement of perceptual audio quality | |
| KR100348368B1 (en) | A digital acoustic signal coding apparatus, a method of coding a digital acoustic signal, and a recording medium for recording a program of coding the digital acoustic signal | |
| JP4676139B2 (en) | Multi-channel audio encoding and decoding | |
| US7613603B2 (en) | Audio coding device with fast algorithm for determining quantization step sizes based on psycho-acoustic model | |
| EP1808851B1 (en) | System and method for low power stereo perceptual audio coding using adaptive masking threshold | |
| US7917369B2 (en) | Quality improvement techniques in an audio encoder | |
| KR102081043B1 (en) | Companding apparatus and method to reduce quantization noise using advanced spectral extension | |
| Sinha et al. | Audio compression at low bit rates using a signal adaptive switched filterbank | |
| CN100339886C (en) | Encoder capable of detecting transient position of sound signal and encoding method | |
| CN1926608A (en) | Device and method for processing a multi-channel signal | |
| CN1702974B (en) | Method and apparatus for encoding/decoding a digital signal | |
| KR100472442B1 (en) | Method for compressing audio signal using wavelet packet transform and apparatus thereof | |
| EP1228506A1 (en) | Method of encoding an audio signal using a quality value for bit allocation | |
| CN1487746A (en) | Method and device for encoding or decoding audio signals | |
| CN1689372A (en) | Method and apparatus for selecting a sound algorithm | |
| JP3894722B2 (en) | Stereo audio signal high efficiency encoding device | |
| WO2009129822A1 (en) | Efficient encoding and decoding for multi-channel signals | |
| JP2021535426A (en) | Coding of dense transient events by companding | |
| CN110998722B (en) | Low complexity dense transient event detection and decoding | |
| JP2008129250A (en) | Window switching method for AAC and band determination method for M / S encoding | |
| JP2002182695A (en) | High efficiency coding method and apparatus | |
| RU2801156C2 (en) | Companding system and method for reducing quantization noise using improved spectral expansion | |
| CN1199959A (en) | Audio coding method and device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20070926 Termination date: 20200410 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |