CN101115051B

CN101115051B - Audio signal processing method, system and audio signal transmitting/receiving device

Info

Publication number: CN101115051B
Application number: CN2006101035713A
Authority: CN
Inventors: 马鸿飞; 徐明亮; 唐骏; 郭艳雯; 闫相斌; 余水安; 詹五洲
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2006-07-25
Filing date: 2006-07-25
Publication date: 2011-08-10
Anticipated expiration: 2026-07-25
Also published as: CN101115051A

Abstract

The invention discloses an audio signal processing method, system, and audio signal transceiver device. At the audio signal sending end, the residual signal and the masking threshold signal obtained by processing the audio signal are coded into multiple descriptions of the residual signal and multiple descriptions of the masking threshold signal. , and then respectively combine the residual signal descriptions of each channel with one of the multi-channel masking threshold signal descriptions to generate a multi-channel description that contains both the residual signal and the masking threshold signal; at the audio signal receiving end, all the descriptions received The residual signal description and masking threshold signal description contained in each description are split, and then all the residual signal descriptions generated after splitting are decoded into one residual signal, and all the masking threshold signal descriptions generated after splitting are decoded as Mask the threshold signal all the way. The invention can effectively improve the audio quality in the communication process and improve user satisfaction.

Description

Audio signal processing method, system and audio signal transceiving device

技术领域technical field

本发明涉及通信领域，具体涉及一种音频信号处理方法、系统以及音频信号收发装置。The present invention relates to the communication field, in particular to an audio signal processing method, system, and audio signal transceiving device.

背景技术Background technique

随着通信技术的迅猛发展，信道带宽和传输速率得以极大提高，网络设备和通信终端处理能力也不断增强；同时，各种音频编码算法在编码质量和编码效率上都获得了相当的提高，这使得实时高质量音频业务也迅速地向各种现代通信系统融合。然而，在实际运营中，由于网络拥塞、信道干扰和噪声等原因，以包交换为基础的通信网络都不可避免的面临丢包和较长迟延的问题，这导致通过网际协议(IP)网络和移动通信系统传输的音频信号质量无疑会受到丢包和迟延的严重影响。With the rapid development of communication technology, the channel bandwidth and transmission rate have been greatly improved, and the processing capabilities of network equipment and communication terminals have also been continuously enhanced; at the same time, various audio coding algorithms have been considerably improved in coding quality and coding efficiency. This makes real-time high-quality audio services rapidly merged into various modern communication systems. However, in actual operation, due to reasons such as network congestion, channel interference and noise, communication networks based on packet switching inevitably face the problems of packet loss and long delay, which leads The quality of audio signals transmitted by mobile communication systems will undoubtedly be severely affected by packet loss and delay.

目前，用于减小因丢包而导致音频质量下降的音频信号处理方法大体有两种，下面对这两种方法分别介绍。At present, there are generally two audio signal processing methods for reducing audio quality degradation caused by packet loss, and the two methods will be introduced respectively below.

第一种音频信号处理方法为：将信源按一定准则划分为优先级不同的层次，再根据通信信道特性对各层进行分层优化；之后将完成分层优化的信源按各自所对应的不同层次分别进行数据打包处理，形成分层多描述编码比特流，包括基本层多描述编码比特流以及增强层多描述编码比特流；最后，将数据打包处理所形成的上述比特流发送给接收端。The first audio signal processing method is: divide the information sources into levels with different priorities according to certain criteria, and then optimize each layer according to the characteristics of the communication channel; Data packaging processing is performed at different levels to form a layered multi-description coded bit stream, including the basic layer multi-description coded bit stream and the enhancement layer multi-description coded bit stream; finally, the above-mentioned bit stream formed by the data packaging process is sent to the receiving end .

当收到基本层多描述编码比特流时，接收端可以恢复出来自信源的基本信息；当收同时到基本层多描述编码比特流和增强层多描述编码比特流时，接收端则能以较小的失真更好的恢复出来自信源的信息。When receiving the base layer multi-description coded bit stream, the receiving end can recover the basic information from the source; Small distortions are better at recovering information from the source.

第一种音频信号处理方法的目的是使接收端以尽量小的失真恢复出来自信源的信息，但其操作方式会不可避免地引入下列问题：The purpose of the first audio signal processing method is to enable the receiver to recover the information from the source with as little distortion as possible, but its operation method will inevitably introduce the following problems:

1、只有在收到来自信源的基本层多描述编码比特流的情况下，接收端才能进行正常解码；一旦基本层多描述编码比特流发生丢包，那么接收端将无法正常进行解码过程；1. Only when receiving the base layer multi-description coded bit stream from the source, the receiving end can perform normal decoding; once the base layer multi-description coded bit stream loses packets, the receiving end will not be able to normally perform the decoding process;

2、接收端在只收到增强层多描述编码比特流时，只能恢复来自信源的增强层部分信息，无法重构来自信源的主要信息；2. When the receiver only receives the enhancement layer multi-description coded bit stream, it can only recover part of the enhancement layer information from the source, and cannot reconstruct the main information from the source;

3、为了使接收端能够正常接收到基本层多描述编码比特流，并防止误码或丢包的发生，在传输时通常要对基本层多描述编码比特流进行前向纠错或丢包重传等处理；这将明显降低通信效率，并增加通信迟延；3. In order to enable the receiving end to receive the base layer multi-description coded bit stream normally, and to prevent bit errors or packet loss, it is usually necessary to perform forward error correction or packet loss repetition on the base layer multi-description coded bit stream during transmission. transmission and other processing; this will significantly reduce communication efficiency and increase communication delay;

4、由于信源被按照优先级进行了分层，因此生成的各分层编码比特流也将具有不同的优先级，并且低优先级分层编码比特流的正确解码必须以高优先级分层编码比特流的正确解码为基础；当高优先级分层编码比特流的解码过程出现问题时，低优先级分层编码比特流的正确解码根本无从谈起。4. Since the source is layered according to priority, the generated layered coded bit streams will also have different priorities, and the correct decoding of low priority layered coded bit streams must be layered with high priority The correct decoding of the coded bit stream is based on; when there is a problem in the decoding process of the high-priority layered coded bit stream, the correct decoding of the low-priority layered coded bit stream is impossible.

综上所述，由于第一种音频信号处理方法对信源按照优先级进行了分层，因此提高通信过程中音频质量的可靠性较低，无法有效减小因丢包、迟延而导致的通信过程中音频质量下降的问题；并且，音频质量下降将明显降低用户满意度。To sum up, since the first audio signal processing method stratifies the source according to the priority, the reliability of improving the audio quality in the communication process is low, and it cannot effectively reduce the communication loss caused by packet loss and delay. The problem of audio quality degradation during the process; and, audio quality degradation will significantly reduce user satisfaction.

第二种音频信号处理方法为：利用目前普遍应用的由心理声学模型控制的预处理器对音频信号进行处理，得到去除了与听觉不相干的时域白化的剩余信号，并对得到的剩余信号进行多描述编码以得到两路或多路描述编码；再对得到的描述编码进行无失真熵编码以去除信源的多余度，最后将无失真熵编码后所生成的编码比特流送入发射信道。The second audio signal processing method is: use the commonly used preprocessor controlled by the psychoacoustic model to process the audio signal to obtain the residual signal that has been removed from the time-domain whitening that is irrelevant to the auditory sense, and the obtained residual signal Perform multi-description coding to obtain two or more description codes; then perform distortion-free entropy coding on the obtained description codes to remove the redundancy of the information source, and finally send the coded bit stream generated after distortion-free entropy coding to the transmission channel .

第二种音频信号处理方法力图以心理声学模型提高音频信号质量，但其操作方式会不可避免地引入下列问题：The second audio signal processing method seeks to improve audio signal quality with psychoacoustic models, but it operates in such a way that it inevitably introduces the following problems:

1、第二种音频信号处理方法主要采用时域预测和时域编码方法，频率分辨率较低，不能较好地去除听觉不相干度和音频信号各频率分量间的相关性；1. The second audio signal processing method mainly adopts time-domain prediction and time-domain coding methods, the frequency resolution is low, and the correlation between the auditory irrelevance and the frequency components of the audio signal cannot be removed well;

2、由心理声学模型处理后的信号只是作为边信息被传输给接收端，而并不是作为描述编码的一部分进行传输；所以一旦心理声学模型中的参数丢失，接收端将无法对收到的音频信号正确解码，因而抗丢包性能较差。2. The signal processed by the psychoacoustic model is only transmitted to the receiving end as side information, not as part of the description encoding; so once the parameters in the psychoacoustic model are lost, the receiving end will not be able to understand the received audio. The signal is decoded correctly and thus less resistant to packet loss.

综上所述，由于第二种音频信号处理方法只是以抗丢包性能较差的心理声学模型为音频信号处理的基础，因此提高通信过程中音频质量的可靠性较低，无法有效减小因丢包、迟延而导致的通信过程中音频质量下降的问题；并且，音频质量下降将明显降低用户满意度。To sum up, since the second audio signal processing method only uses the psychoacoustic model with poor anti-packet loss performance as the basis of audio signal processing, the reliability of improving audio quality in the communication process is low, and it cannot effectively reduce the The problem of audio quality degradation in the communication process caused by packet loss and delay; moreover, audio quality degradation will significantly reduce user satisfaction.

发明内容Contents of the invention

有鉴于此，本发明的主要目的在于提供一种音频信号处理方法和系统，以有效提高通信过程中的音频质量，提高用户满意度。In view of this, the main purpose of the present invention is to provide an audio signal processing method and system, so as to effectively improve audio quality during communication and improve user satisfaction.

本发明的另一目的在于提供一种音频信号收发装置，以有效提高通信过程中的音频质量，提高用户满意度。Another object of the present invention is to provide an audio signal transceiving device, so as to effectively improve audio quality during communication and improve user satisfaction.

为达到上述目的，本发明的技术方案是这样实现的：In order to achieve the above object, technical solution of the present invention is achieved in that way:

一种音频信号处理方法，包括：An audio signal processing method, comprising:

在音频信号发送端，分别将针对音频信号处理所得的剩余信号和掩蔽门限信号编码为多路剩余信号描述和多路掩蔽门限信号描述，再分别将各路剩余信号描述与多路掩蔽门限信号描述中的一路合路，生成多路均包含有剩余信号描述和掩蔽门限信号描述的描述；At the audio signal sending end, the residual signal and the masking threshold signal obtained by processing the audio signal are encoded into multiple residual signal descriptions and multi-channel masking threshold signal descriptions, and then the residual signal descriptions of each channel and the multi-channel masking threshold signal description are respectively In one path combination, the generated multiple paths all contain descriptions of residual signal description and masking threshold signal description;

在音频信号接收端，将接收到的所有均包含有剩余信号描述和掩蔽门限信号描述的描述中的每个描述中所包含的剩余信号描述和掩蔽门限信号描述分路，再将分路后所生成的所有剩余信号描述解码为一路剩余信号，还将分路后所生成的所有掩蔽门限信号描述解码为一路掩蔽门限信号，对解码后得到的剩余信号和掩蔽门限信号进行参数重构和时频合成处理，生成重构的音频信号。At the audio signal receiving end, the remaining signal description and the masking threshold signal description contained in each of the received descriptions containing the description of the residual signal and the description of the masking threshold signal are branched, and then the branched Decode all the generated residual signal descriptions into one residual signal, and decode all the masking threshold signal descriptions generated after demultiplexing into one masking threshold signal, and perform parameter reconstruction and time-frequency analysis on the decoded residual signal and masking threshold signal Synthesis processing to generate a reconstructed audio signal.

将剩余信号编码为多路剩余信号描述的方法为：The method of encoding the residual signal into a multiple residual signal description is:

对所述剩余信号进行奇偶分离处理，并对完成该处理所生成的多路剩余信号描述进行熵编码处理；Perform parity separation processing on the residual signal, and perform entropy coding processing on the multi-channel residual signal description generated by completing the processing;

将掩蔽门限信号编码为多路掩蔽门限信号描述的方法为：The method of encoding the masking threshold signal into a multi-channel masking threshold signal description is:

对所述掩蔽门限信号进行奇偶分离处理，并对完成该处理所生成的多路掩蔽门限信号描述进行熵编码处理。Parity separation processing is performed on the masking threshold signal, and entropy coding processing is performed on the multi-channel masking threshold signal description generated by completing the processing.

将所有剩余信号描述解码为一路剩余信号的方法为：The method of decoding all residual signal descriptions into one residual signal is:

对所有剩余信号描述进行熵解码处理，并对完成解码处理后的所有剩余信号描述进行奇偶合成处理，将进行奇偶合成处理后的所有剩余信号描述合成为一路剩余信号；Perform entropy decoding processing on all remaining signal descriptions, and perform parity synthesis processing on all remaining signal descriptions after decoding processing, and synthesize all remaining signal descriptions after parity synthesis processing into one residual signal;

将所有掩蔽门限信号描述解码为一路掩蔽门限信号的方法为：The method of decoding all masking threshold signal descriptions into one masking threshold signal is:

对所有掩蔽门限信号描述进行熵解码处理，并对完成解码处理后的所有掩蔽门限信号描述进行奇偶合成处理，将进行奇偶合成处理后的所有掩蔽门限信号描述合成为一路掩蔽门限信号。Entropy decoding processing is performed on all masking threshold signal descriptions, parity synthesis processing is performed on all masking threshold signal descriptions after decoding processing, and all masking threshold signal descriptions after parity synthesis processing are synthesized into one masking threshold signal.

对所述剩余信号进行信号分对处理，并对完成处理所生成的多路剩余信号分别进行对偶变换并生成相应路数的多路剩余信号描述，再对生成的多路剩余信号描述进行熵编码处理；Perform signal pairing processing on the residual signal, and perform dual transformation on the multi-channel residual signal generated after the processing is completed to generate a corresponding multi-channel residual signal description, and then perform entropy coding on the generated multi-channel residual signal description deal with;

对所述掩蔽门限信号进行信号分对处理，并对完成处理所生成的多路掩蔽门限信号分别进行对偶变换并生成相应路数的多路掩蔽门限信号描述，再对生成的多路掩蔽门限信号描述进行熵编码处理。Perform signal pairing processing on the masking threshold signal, perform dual transformation on the multi-channel masking threshold signal generated after the processing, and generate a multi-channel masking threshold signal description corresponding to the number of channels, and then describe the generated multi-channel masking threshold signal Describes entropy encoding processing.

对所有剩余信号描述进行熵解码处理，并对完成解码处理后的所有剩余信号描述进行对偶反变换处理，再将完成对偶反变换处理后的所有剩余信号描述合成为一路剩余信号；performing entropy decoding processing on all remaining signal descriptions, and performing dual inverse transform processing on all remaining signal descriptions after the decoding process is completed, and then synthesizing all remaining signal descriptions after completing the dual inverse transform processing into one residual signal;

对所有掩蔽门限信号描述进行熵解码处理，并对完成解码处理后的所有掩蔽门限信号描述进行对偶反变换处理，再将完成对偶反变换处理后的所有掩蔽门限信号描述合成为一路掩蔽门限信号。Perform entropy decoding processing on all masking threshold signal descriptions, and perform dual inverse transform processing on all masking threshold signal descriptions after the decoding process, and then synthesize all masking threshold signal descriptions after completing the dual inverse transform processing into one masking threshold signal.

所述多路描述为两路描述。The multi-way description is a two-way description.

对所述剩余信号进行多描述标量量化编码处理，并对完成处理所生成的多路剩余信号描述进行熵编码处理；performing multi-description scalar quantization encoding processing on the residual signal, and performing entropy encoding processing on the multi-channel residual signal description generated after the processing;

对所述掩蔽门限信号进行多描述标量量化编码处理，并对完成处理所生成的多路掩蔽门限信号描述进行熵编码处理。Perform multi-description scalar quantization encoding processing on the masking threshold signal, and perform entropy encoding processing on the multi-channel masking threshold signal description generated after the processing.

对所有剩余信号描述进行熵解码处理，并对完成解码处理后的所有剩余信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有剩余信号描述解码为一路剩余信号；Perform entropy decoding processing on all remaining signal descriptions, and perform multi-description scalar quantization decoding processing on all remaining signal descriptions after decoding processing, and decode all remaining signal descriptions after performing multi-description scalar quantization decoding processing into one residual signal;

对所有掩蔽门限信号描述进行熵解码处理，并对完成解码处理后的所有掩蔽门限信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有掩蔽门限信号描述解码为一路掩蔽门限信号。Perform entropy decoding processing on all masking threshold signal descriptions, and perform multi-description scalar quantization decoding processing on all masking threshold signal descriptions after decoding processing, and decode all masking threshold signal descriptions after multi-description scalar quantization decoding processing into one masking threshold signal.

所述多路描述为两路以上描述。The multi-way description is more than two-way descriptions.

所述剩余信号是对原始音频信号进行时频分析、剩余信号分析后所得到的；The residual signal is obtained after performing time-frequency analysis and residual signal analysis on the original audio signal;

所述时频分析的方法为：对所述原始音频信号进行包括修正离散余弦变换MDCT在内的处理，得到时频变换参数；The method of the time-frequency analysis is: performing processing including modified discrete cosine transform (MDCT) on the original audio signal to obtain time-frequency transform parameters;

所述剩余信号分析的方法为：去除所述时频变换参数中的听觉不相干信息或不相干度。The method for analyzing the residual signal is: removing the auditory irrelevant information or irrelevant degree in the time-frequency transformation parameters.

所述掩蔽门限信号是对原始音频信号进行心理声学模型分析后所得到的。The masking threshold signal is obtained by analyzing the original audio signal with a psychoacoustic model.

进一步对发送端所生成的多路剩余信号描述和多路掩蔽门限信号描述进行无失真编码和音频包处理。Further, lossless encoding and audio packet processing are performed on the multi-channel residual signal description and the multi-channel masking threshold signal description generated by the sending end.

在接收端进行分路之前，进一步进行音频包解包和无失真解码处理。Before splitting at the receiving end, the audio packet unpacking and distortion-free decoding are further performed.

一种音频信号处理系统，该系统包括位于音频信号发送端的多描述编码器，由均与合路器相连的剩余信号多描述编码器、掩蔽门限信号多描述编码器组成；还包括位于音频信号接收端的多描述解码器，由均与分路器相连的剩余信号多描述解码器、掩蔽门限信号多描述解码器组成；An audio signal processing system, the system includes a multi-description encoder located at the audio signal sending end, composed of a residual signal multi-description encoder and a masking threshold signal multi-description encoder all connected to a combiner; The multi-description decoder at the end is composed of a multi-description decoder for residual signals and a multi-description decoder for masked threshold signals, both of which are connected to the splitter;

其中，剩余信号/掩蔽门限信号多描述编码器，用于将收到的剩余信号编码为多路剩余信号描述，将接收到的掩蔽门限信号编码为多路掩蔽门限信号描述，并将编码后所生成的各路描述发送给合路器；Among them, the residual signal/concealment threshold signal multi-description encoder is used to encode the received residual signal into a multi-channel residual signal description, encode the received masking threshold signal into a multi-channel masking threshold signal description, and convert the encoded The generated descriptions are sent to the combiner;

合路器，用于分别将收到的各路剩余信号描述与所有掩蔽门限信号描述中的一路合路，生成多路均包含有剩余信号描述和掩蔽门限信号描述的描述；A combiner, configured to respectively combine the received residual signal descriptions with one of the masking threshold signal descriptions to generate multiple descriptions that include both the residual signal description and the masking threshold signal description;

分路器，用于将接收到的所有均包含有剩余信号描述和掩蔽门限信号描述的描述中的每个描述中所包含的剩余信号描述和掩蔽门限信号描述分路，并将分路后所生成的所有剩余信号描述和掩蔽门限信号描述发送给剩余信号/掩蔽门限信号多描述解码器；A splitter, configured to split the remaining signal descriptions and masking threshold signal descriptions contained in each of the received descriptions containing the remaining signal descriptions and masking threshold signal descriptions, and split the splitting All generated residual signal descriptions and masked threshold signal descriptions are sent to the residual signal/masked threshold signal multiple description decoder;

剩余信号/掩蔽门限信号多描述解码器，用于将收到的所有剩余信号描述解码为一路剩余信号，将接收到的所有掩蔽门限信号描述解码为一路掩蔽门限信号；A residual signal/concealed threshold signal multi-description decoder, configured to decode all received residual signal descriptions into one residual signal, and decode all received masked threshold signal descriptions into one concealed threshold signal;

所述多描述解码器进一步与连接有时频合成模块的参数重构模块相连；The multiple description decoder is further connected to a parameter reconstruction module connected to a time-frequency synthesis module;

其中，参数重构模块，用于接收多描述解码器所生成的剩余信号和掩蔽门限信号，并对收到的信号进行参数重构处理，再将完成处理后所生成的时频变换参数发送给时频合成模块；Among them, the parameter reconstruction module is used to receive the residual signal and the masking threshold signal generated by the multi-description decoder, and perform parameter reconstruction processing on the received signal, and then send the time-frequency transformation parameters generated after the processing to Time-frequency synthesis module;

时频合成模块，用于对收到的时频变换参数进行时频合成处理，生成重构的音频信号。The time-frequency synthesis module is used to perform time-frequency synthesis processing on the received time-frequency transformation parameters to generate reconstructed audio signals.

所述剩余信号/掩蔽门限信号多描述编码器，包括相连的奇偶分离模块、熵编码器；The residual signal/masking threshold signal multi-descriptive encoder includes a connected parity separation module and an entropy encoder;

其中，奇偶分离模块，用于对所述剩余信号进行奇偶分离处理，并将完成该处理所生成的多路剩余信号描述发送给所述熵编码器，对所述掩蔽门限信号进行奇偶分离处理，并将完成该处理所生成的多路掩蔽门限信号描述发送给所述熵编码器；Wherein, the parity separation module is configured to perform parity separation processing on the residual signal, and send the multi-channel residual signal description generated by completing the processing to the entropy encoder, and perform parity separation processing on the masked threshold signal, and sending the description of the multi-channel masking threshold signal generated by completing the processing to the entropy encoder;

熵编码器，用于对收到的多路剩余信号描述和多路掩蔽门限信号描述进行熵编码处理。An entropy coder is used for performing entropy coding processing on the received multi-channel residual signal description and multi-channel masking threshold signal description.

所述剩余信号/掩蔽门限信号多描述解码器，包括相连的熵解码器、奇偶合成模块；The residual signal/masking threshold signal multi-description decoder includes a connected entropy decoder and a parity synthesis module;

其中，熵解码器，用于对所有剩余信号描述进行熵解码处理，并将完成解码处理后的所有剩余信号描述发送给奇偶合成模块，对所有掩蔽门限信号描述进行熵解码处理，并将完成解码处理后的所有掩蔽门限信号描述发送给奇偶合成模块；Among them, the entropy decoder is used to perform entropy decoding processing on all remaining signal descriptions, and send all remaining signal descriptions after decoding processing to the parity synthesis module, perform entropy decoding processing on all masking threshold signal descriptions, and complete decoding All processed masking threshold signal descriptions are sent to the parity synthesis module;

奇偶合成模块，用于对收到的所有剩余信号描述进行奇偶合成处理，将进行奇偶合成处理后的所有剩余信号描述合成为一路剩余信号，对收到的所有掩蔽门限信号描述进行奇偶合成处理，将进行奇偶合成处理后的所有掩蔽门限信号描述合成为一路掩蔽门限信号。The parity synthesis module is used to perform parity synthesis processing on all received residual signal descriptions, synthesize all residual signal descriptions after parity synthesis processing into one residual signal, and perform parity synthesis processing on all received masking threshold signal descriptions, The descriptions of all the masking threshold signals after parity combining processing are combined into one masking threshold signal.

所述剩余信号/掩蔽门限信号多描述编码器，包括依次相连的剩余信号/掩蔽门限信号分对模块、对偶变换模块和熵编码器；The residual signal/masking threshold signal multi-description encoder includes a residual signal/masking threshold signal pairing module, a dual transformation module, and an entropy encoder connected in sequence;

其中，剩余信号/掩蔽门限信号分对模块，用于对所述剩余信号进行信号分对处理，并将完成处理所生成的多路剩余信号发送给多描述对偶变换模块，对所述掩蔽门限信号进行信号分对处理，并将完成处理所生成的多路掩蔽门限信号发送给多描述对偶变换模块；Wherein, the residual signal/masking threshold signal pairing module is used to perform signal pairing processing on the residual signal, and send the multi-channel residual signal generated after the processing to the multi-description dual transformation module, and process the masking threshold signal Perform signal pairing processing, and send the multi-channel masking threshold signal generated by completing the processing to the multiple description dual transformation module;

多描述对偶变换模块，用于对收到的多路剩余信号分别进行对偶变换并生成相应路数的多路剩余信号描述，再将所生成的多路剩余信号描述发送给熵编码器，对收到的多路掩蔽门限信号分别进行对偶变换并生成相应路数的多路掩蔽门限信号描述，再将所生成的多路掩蔽门限信号描述发送给熵编码器；The multi-description dual transformation module is used to perform dual transformation on the received multi-channel residual signals and generate corresponding multi-channel residual signal descriptions, and then send the generated multi-channel residual signal descriptions to the entropy encoder, and the received multi-channel residual signal descriptions Dual transformation is performed on the received multi-channel masking threshold signals respectively to generate corresponding multi-channel masking threshold signal descriptions, and then the generated multi-channel masking threshold signal descriptions are sent to the entropy encoder;

熵编码器，用于对收到多路剩余信号描述和多路掩蔽门限信号描述进行熵编码处理。An entropy coder is used for performing entropy coding processing on the received multi-channel residual signal description and multi-channel masking threshold signal description.

所述剩余信号/掩蔽门限信号多描述解码器，包括依次相连的熵解码器、对偶反变换解码器和剩余信号/掩蔽门限信号合成模块；The residual signal/masking threshold signal multi-description decoder includes an entropy decoder, a dual inverse transform decoder and a residual signal/masking threshold signal synthesis module connected in sequence;

其中，熵解码器，用于对所有剩余信号描述进行熵解码处理，并将完成解码处理后所生成的所有剩余信号描述发送给多描述对偶反变换解码器，对所有掩蔽门限信号描述进行熵解码处理，并将完成解码处理后的所有掩蔽门限信号描述发送给多描述对偶反变换解码器；Among them, the entropy decoder is used to perform entropy decoding processing on all remaining signal descriptions, and send all remaining signal descriptions generated after the decoding process to the multi-description dual inverse transform decoder, and perform entropy decoding on all masked threshold signal descriptions. Decoding processing, and sending all masked threshold signal descriptions after the decoding processing to the multiple description dual inverse transform decoder;

多描述对偶反变换解码器，用于对收到的所有剩余信号描述进行对偶反变换处理，并将完成对偶反变换处理后的所有剩余信号描述发送给剩余信号/掩蔽门限信号合成模块，对收到的所有掩蔽门限信号描述进行对偶反变换处理，并将完成对偶反变换处理后的所有掩蔽门限信号描述发送给剩余信号/掩蔽门限信号合成模块；The multi-description dual inverse transform decoder is used to perform dual inverse transform processing on all remaining signal descriptions received, and send all remaining signal descriptions after the dual inverse transform processing to the residual signal/masking threshold signal synthesis module for receiving Perform dual inverse transform processing on all received masking threshold signal descriptions, and send all masking threshold signal descriptions after dual inverse transform processing to the remaining signal/masking threshold signal synthesis module;

剩余信号/掩蔽门限信号合成模块，用于将收到的所有剩余信号描述合成为一路剩余信号，将接收到的所有掩蔽门限信号描述合成为一路掩蔽门限信号。The residual signal/masking threshold signal synthesis module is used to synthesize all received residual signal descriptions into one residual signal, and synthesize all received masking threshold signal descriptions into one masking threshold signal.

所述剩余信号/掩蔽门限信号多描述编码器，包括相连的多描述标量量化器、熵编码器；The residual signal/masking threshold signal multi-description encoder includes a connected multi-description scalar quantizer and an entropy encoder;

其中，多描述标量量化器，用于对所述剩余信号进行多描述标量量化编码处理，并将完成处理所生成的多路剩余信号描述发送给熵编码器，对所述掩蔽门限信号进行多描述标量量化编码处理，并将完成处理所生成的多路掩蔽门限信号描述发送给熵编码器；Wherein, the multi-description scalar quantizer is used to perform multi-description scalar quantization and encoding processing on the residual signal, and send the multi-channel residual signal description generated after the processing to the entropy encoder to perform multi-description on the masked threshold signal Scalar quantization encoding processing, and sending the multi-channel masking threshold signal description generated by the processing to the entropy encoder;

所述剩余信号/掩蔽门限信号多描述解码器，包括相连的熵解码器、多描述标量量化解码器；The residual signal/masking threshold signal multi-description decoder includes a connected entropy decoder and a multi-description scalar quantization decoder;

其中，熵解码器，用于对所有剩余信号描述进行熵解码处理，并将完成解码处理后的所有剩余信号描述发送给多描述标量量化解码器，对所有掩蔽门限信号描述进行熵解码处理，并将完成解码处理后的所有掩蔽门限信号描述发送给多描述标量量化解码器；Wherein, the entropy decoder is used to perform entropy decoding processing on all remaining signal descriptions, and send all remaining signal descriptions after decoding processing to the multi-description scalar quantization decoder, perform entropy decoding processing on all masked threshold signal descriptions, and Send all masked threshold signal descriptions after decoding processing to the multi-description scalar quantization decoder;

多描述标量量化解码器，用于对收到的所有剩余信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有剩余信号描述解码为一路剩余信号，对收到的所有掩蔽门限信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有掩蔽门限信号描述解码为一路掩蔽门限信号。The multi-description scalar quantization decoder is used to perform multi-description scalar quantization and decoding processing on all received residual signal descriptions, decode all residual signal descriptions after multi-description scalar quantization and decoding processing into one residual signal, and decode all received The masking threshold signal description is subjected to multi-description scalar quantization decoding processing, and all masking threshold signal descriptions after the multi-description scalar quantization decoding processing are decoded into one masking threshold signal.

所述多描述标量量化器是能够处理两个以上描述的多描述标量量化器，所述多描述标量量化解码器是能够处理两个以上描述的多描述标量量化解码器。The multiple description scalar quantizer is a multiple description scalar quantizer capable of processing two or more descriptions, and the multiple description scalar quantization decoder is a multiple description scalar quantization decoder capable of processing two or more descriptions.

所述多描述编码器进一步与连接有时频分析模块的剩余信号分析模块相连，还与心理声学模块相连；The multi-description encoder is further connected to the residual signal analysis module connected to the time-frequency analysis module, and also connected to the psychoacoustic module;

其中，时频分析模块，用于对原始音频信号进行时频分析，并将完成分析后所生成的时频变换参数发送给剩余信号分析模块；Wherein, the time-frequency analysis module is used for performing time-frequency analysis on the original audio signal, and sending the time-frequency transformation parameters generated after the analysis to the remaining signal analysis module;

剩余信号分析模块，用于对收到的时频变换参数和掩蔽门限信号进行剩余信号分析，并将分析后所生成的剩余信号发送给所述多描述编码器；A residual signal analysis module, configured to perform residual signal analysis on received time-frequency transformation parameters and masking threshold signals, and send the residual signal generated after the analysis to the multi-description encoder;

心理声学模块，用于对原始音频信号进行心理声学模型分析，将分析后所生成的掩蔽门限信号发送给所述多描述编码器和剩余信号分析模块。The psychoacoustic module is configured to perform psychoacoustic model analysis on the original audio signal, and send the masking threshold signal generated after analysis to the multiple description encoder and residual signal analysis module.

所述多描述编码器进一步和无失真编码及音频包处理模块相连；The multi-description encoder is further connected with the lossless encoding and audio packet processing module;

无失真编码及音频包处理模块，用于接收多描述编码器所生成的描述，并对收到的描述进行无失真编码和音频包处理。The lossless encoding and audio packet processing module is used to receive the description generated by the multi-description encoder, and perform lossless encoding and audio packet processing on the received description.

所述多描述解码器进一步和音频包解包及无失真解码模块相连；The multi-description decoder is further connected with the audio packet unpacking and distortion-free decoding module;

音频包解包及无失真解码模块，用于对收到的多路描述进行音频包解包和无失真解码处理。The audio packet unpacking and lossless decoding module is used for unpacking the audio packet and decoding without distortion for the received multi-channel description.

一种音频信号发送装置，该装置由均与合路器相连的剩余信号多描述编码器、掩蔽门限信号多描述编码器组成；An audio signal sending device, which is composed of a multi-descriptive coder for residual signals and a multi-descriptive coder for masking threshold signals, both of which are connected to a combiner;

剩余信号/掩蔽门限信号多描述编码器，用于将收到的剩余信号编码为多路剩余信号描述，将接收到的掩蔽门限信号编码为多路掩蔽门限信号描述，并将编码后所生成的各路描述发送给合路器；Residual signal/masking threshold signal multi-description encoder, used to encode the received residual signal into multiple residual signal descriptions, encode the received masking threshold signal into multiple masking threshold signal descriptions, and encode the generated The description of each channel is sent to the combiner;

合路器，用于分别将收到的各路剩余信号描述与所有掩蔽门限信号描述中的一路合路，生成多路均包含有剩余信号描述和掩蔽门限信号描述的描述。The combiner is used to respectively combine the received residual signal descriptions with one of all masking threshold signal descriptions to generate multiple channels of descriptions that both include the residual signal descriptions and the masking threshold signal descriptions.

所述剩余信号/掩蔽门限信号多描述编码器，包括依次相连的剩余信号/掩蔽门限信号分对模块、多描述对偶变换模块和熵编码器；The residual signal/masking threshold signal multi-description encoder includes a residual signal/masking threshold signal pairing module, a multi-describing dual transformation module, and an entropy encoder that are sequentially connected;

所述多描述标量量化器是能够处理两个以上描述的多描述标量量化器。The multiple description scalar quantizer is a multiple description scalar quantizer capable of processing more than two descriptions.

无失真编码及音频包处理模块，用于接收多描述编码器所生成的多路描述，并对收到的多路描述进行无失真编码和音频包处理。The lossless encoding and audio package processing module is used to receive the multi-channel description generated by the multi-description encoder, and perform lossless encoding and audio package processing on the received multi-channel description.

一种音频信号接收装置，该装置由均与分路器相连的剩余信号多描述解码器、掩蔽门限信号多描述解码器组成；An audio signal receiving device, which is composed of a multi-descriptive decoder for residual signals and a multi-descriptive decoder for masking threshold signals, both of which are connected to splitters;

其中，分路器，用于将接收到的所有均包含有剩余信号描述和掩蔽门限信号描述的描述中的每个描述中所包含的剩余信号描述和掩蔽门限信号描述分路，并将分路后所生成的所有剩余信号描述和掩蔽门限信号描述发送给剩余信号/掩蔽门限信号多描述解码器；Wherein, the splitter is used for splitting the remaining signal description and the masking threshold signal description included in each of the received descriptions containing the description of the remaining signal and the description of the masking threshold signal, and splitting the All remaining signal descriptions and masking threshold signal descriptions generated after are sent to the residual signal/masking threshold signal multi-description decoder;

所述剩余信号/掩蔽门限信号多描述解码器，包括相连的熵解码器、奇偶合成模块；The residual signal/masking threshold signal multi-descriptive decoder includes connected entropy decoders and parity synthesis modules;

奇偶合成模块，用于对收到的所有剩余信号描述进行奇偶合成处理，将进行奇偶合成处理后的所有剩余信号描述合成为一路剩余信号，对收到的所有掩蔽门限信号描述进行奇偶合成处理，敬进行奇偶合成处理后的所有掩蔽门限信号描述合成为一路掩蔽门限信号。The parity synthesis module is used to perform parity synthesis processing on all received residual signal descriptions, synthesize all residual signal descriptions after parity synthesis processing into one residual signal, and perform parity synthesis processing on all received masking threshold signal descriptions, Respectfully, all masking threshold signals after parity combining processing are described and synthesized into one masking threshold signal.

附图说明Description of drawings

所述剩余信号/掩蔽门限信号多描述解码器，包括依次相连的熵解码器、多描述对偶反变换解码器和剩余信号/掩蔽门限信号合成模块；The residual signal/masking threshold signal multi-description decoder includes an entropy decoder, a multi-description dual inverse transform decoder and a residual signal/masking threshold signal synthesis module connected in sequence;

其中，熵解码器，用于对所有剩余信号描述进行熵解码处理，并将完成解码处理后所生成的所有剩余信号描述发送给多描述对偶反变换解码器，对所有掩蔽门限信号描述进行熵解码处理，并将完成解码处理后的所有掩蔽门限信号描述发送给多描述对偶反变换解码器；Among them, the entropy decoder is used to perform entropy decoding processing on all remaining signal descriptions, and send all remaining signal descriptions generated after the decoding process to the multiple description dual inverse transform decoder, and perform entropy decoding on all masking threshold signal descriptions processing, and send all masked threshold signal descriptions after decoding processing to the multiple description dual inverse transform decoder;

多描述标量量化解码器，用于对收到的所有剩余信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有剩余信号描述解码为一路剩余信号，对收到的所有掩蔽门限信号描述进行多描述标量量化解码处理，将进行多描述标量量化解码处理后的所有掩蔽门限信号描述解码为一路掩蔽门限信号。The multi-description scalar quantization decoder is used to perform multi-description scalar quantization and decoding processing on all received residual signal descriptions, decode all residual signal descriptions after multi-description scalar quantization and decoding processing into one residual signal, and decode all received The masking threshold signal description is subjected to multi-description scalar quantization decoding processing, and all masking threshold signal descriptions after multi-description scalar quantization decoding processing are decoded into one masking threshold signal.

所述多描述标量量化解码器是能够处理两个以上描述的多描述标量量化解码器。The multiple description scalar quantization decoder is a multiple description scalar quantization decoder capable of processing two or more descriptions.

音频包解包及无失真解码模块，用于对收到的描述进行音频包解包和无失真解码处理。The audio packet unpacking and distortion-free decoding module is used to unpack the audio packet and decode the received description without distortion.

与现有技术相比，本发明所提供的音频信号处理方法、系统以及音频信号收发装置，在音频信号发送端，分别将针对音频信号处理所得的剩余信号和掩蔽门限信号编码为剩余信号多描述和掩蔽门限信号多描述，再分别将各路剩余信号描述与多路掩蔽门限信号描述中的一路合路，生成均包含有剩余信号和掩蔽门限信号的多路描述；在音频信号接收端，将接收到的所有描述中每个描述所包含的剩余信号描述和掩蔽门限信号描述分路，再将分路后所生成的所有剩余信号描述解码为一路剩余信号，还将分路后所生成的所有掩蔽门限信号描述解码为一路掩蔽门限信号。Compared with the prior art, the audio signal processing method, system and audio signal transceiving device provided by the present invention, at the audio signal sending end, respectively encode the residual signal and the masking threshold signal obtained from the audio signal processing into the residual signal. More description and masking threshold signal multiple descriptions, and then respectively combine the remaining signal descriptions of each channel with one of the multi-channel masking threshold signal descriptions to generate a multi-channel description that includes both the residual signal and the masking threshold signal; at the audio signal receiving end, the The remaining signal descriptions and masking threshold signal descriptions contained in each description in all the received descriptions are split, and then all the remaining signal descriptions generated after splitting are decoded into one residual signal, and all the generated after splitting The masking threshold signal description is decoded into one masking threshold signal.

可见，本发明可有效提高通信过程中的音频质量，提高用户满意度。It can be seen that the present invention can effectively improve the audio quality in the communication process and improve user satisfaction.

图1是本发明一较佳实施例的音频信号编码器结构及原理图；Fig. 1 is the audio signal coder structure and schematic diagram of a preferred embodiment of the present invention;

图2是本发明一较佳实施例的音频信号解码器结构及原理图；Fig. 2 is the audio signal decoder structure and schematic diagram of a preferred embodiment of the present invention;

图3是本发明多描述编码器结构及原理图；Fig. 3 is the multi-describing encoder structure and schematic diagram of the present invention;

图4是本发明多描述解码器结构及原理图；Fig. 4 is the multi-descriptive decoder structure and schematic diagram of the present invention;

图5是本发明剩余信号多描述解码器结构及原理图；Fig. 5 is the structural and schematic diagram of the multi-descriptive decoder for the residual signal of the present invention;

图6是本发明掩蔽门限信号多描述解码器结构及原理图；Fig. 6 is the structure and schematic diagram of the masking threshold signal multi-describing decoder of the present invention;

图7是本发明剩余信号多描述编码器结构及原理图；Fig. 7 is the structural and schematic diagram of the residual signal multi-describing encoder of the present invention;

图8是与图7中的多描述编码器相配合的多描述解码器结构及原理图；Fig. 8 is the multi-description decoder structure and schematic diagram that cooperates with the multi-description encoder in Fig. 7;

图9是本发明剩余信号多描述编码器结构及原理图；Fig. 9 is a structural and schematic diagram of a residual signal multi-describing encoder of the present invention;

图10是与图9中的多描述编码器相配合的多描述解码器结构及原理图；Fig. 10 is the multi-description decoder structure and schematic diagram that cooperates with the multi-description encoder in Fig. 9;

图11是本发明一较佳实施例的音频信号处理原理图；Fig. 11 is a schematic diagram of audio signal processing in a preferred embodiment of the present invention;

图12是本发明一较佳实施例的量化区间示意图；Fig. 12 is a schematic diagram of a quantization interval in a preferred embodiment of the present invention;

图13是本发明一较佳实施例的单描述和多描述量化区间对比示意图；Fig. 13 is a schematic diagram of a comparison of single-description and multiple-description quantization intervals in a preferred embodiment of the present invention;

图14是以图11为基础的多描述编码器结构及原理图；Fig. 14 is a multi-description encoder structure and schematic diagram based on Fig. 11;

图15是以图11为基础的多描述解码器结构及原理图；Fig. 15 is a multi-description decoder structure and schematic diagram based on Fig. 11;

图16是本发明另一较佳实施例的音频信号处理原理图；Fig. 16 is a schematic diagram of audio signal processing in another preferred embodiment of the present invention;

图17是以图16为基础的多描述编码器结构及原理图；Fig. 17 is a multi-description encoder structure and schematic diagram based on Fig. 16;

图18是以图16为基础的多描述解码器结构及原理图；Fig. 18 is a multi-description decoder structure and schematic diagram based on Fig. 16;

图19是本发明一较佳实施例的多描述标量量化器编解码立体图表。Fig. 19 is a stereo diagram of encoding and decoding of a multi-description scalar quantizer according to a preferred embodiment of the present invention.

具体实施方式下面结合附图及具体实施例对本发明详细说明。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described in detail below in conjunction with the drawings and specific embodiments.

本发明所提供的音频信号处理方法，在音频信号发送端，分别将针对音频信号处理所得的剩余信号和掩蔽门限信号编码为剩余信号多描述和掩蔽门限信号多描述，再分别将各路剩余信号描述与多路掩蔽门限信号描述中的一路合路，生成均包含有剩余信号和掩蔽门限信号的多路描述；在音频信号接收端，将接收到的所有描述中每个描述所包含的剩余信号描述和掩蔽门限信号描述分路，再将分路后所生成的所有剩余信号描述解码为一路剩余信号，还将分路后所生成的所有掩蔽门限信号描述解码为一路掩蔽门限信号。In the audio signal processing method provided by the present invention, at the audio signal sending end, the residual signal and the masking threshold signal obtained by processing the audio signal are respectively coded into multiple descriptions of the residual signal and multiple descriptions of the masking threshold signal, and then the residual signals of each channel are respectively encoded into The description is combined with one of the multi-channel masking threshold signal descriptions to generate a multi-channel description that contains both the residual signal and the masking threshold signal; at the audio signal receiving end, the residual signal contained in each description in all received descriptions Describing and masking threshold signal descriptions are split, and then all remaining signal descriptions generated after splitting are decoded into one residual signal, and all masking threshold signal descriptions generated after splitting are decoded into one masking threshold signal.

本发明所提供的音频信号处理系统，包括位于音频信号发送端的多描述编码器，由均与合路器相连的剩余信号多描述编码器、掩蔽门限信号多描述编码器组成；还包括位于音频信号接收端的多描述解码器，由均与分路器相连的剩余信号多描述解码器、掩蔽门限信号多描述解码器组成；The audio signal processing system provided by the present invention includes a multi-description encoder located at the audio signal sending end, composed of a residual signal multi-description encoder and a masking threshold signal multi-description encoder all connected to the combiner; The multi-description decoder at the receiving end is composed of a residual signal multi-description decoder and a masked threshold signal multi-description decoder all connected to the splitter;

其中，剩余信号/掩蔽门限信号多描述编码器，用于将收到的剩余信号/掩蔽门限信号编码为剩余信号/掩蔽门限信号多描述，并将编码后所生成的多描述发送给合路器；合路器，用于分别将收到的各路剩余信号描述与所有掩蔽门限信号描述中的一路合路，生成均包含有剩余信号和掩蔽门限信号的多描述；分路器，用于将接收到的所有描述中每个描述所包含的剩余信号描述和掩蔽门限信号描述分路，并将分路后所生成的剩余信号/掩蔽门限信号多描述发送给剩余信号/掩蔽门限信号多描述解码器；剩余信号/掩蔽门限信号多描述解码器，用于将收到的剩余信号/掩蔽门限信号多描述解码为一路剩余信号/掩蔽门限信号。Among them, the residual signal/masked threshold signal multi-description encoder is used to encode the received residual signal/masked threshold signal into a residual signal/masked threshold signal multi-description, and send the multi-description generated after encoding to the combiner The combiner is used to respectively combine the received residual signal descriptions with one of the masked threshold signal descriptions to generate multiple descriptions that contain the residual signal and the masked threshold signal; the splitter is used to combine The residual signal description and masking threshold signal description contained in each description in all received descriptions are divided into branches, and the residual signal/masking threshold signal multi-description generated after splitting is sent to the residual signal/masking threshold signal multi-description decoding The residual signal/masking threshold signal multi-description decoder is used to decode the received residual signal/masking threshold signal multi-describing into one path of residual signal/masking threshold signal.

本发明所提供的音频信号发送装置由均与合路器相连的剩余信号多描述编码器、掩蔽门限信号多描述编码器组成；其中，剩余信号/掩蔽门限信号多描述编码器，用于将收到的剩余信号/掩蔽门限信号编码为剩余信号/掩蔽门限信号多描述，并将编码后所生成的多描述发送给合路器；合路器，用于分别将收到的各路剩余信号描述与所有掩蔽门限信号描述中的一路合路，生成均包含有剩余信号和掩蔽门限信号的多路描述。The audio signal sending device provided by the present invention is composed of residual signal multi-description encoders and masking threshold signal multi-description encoders which are both connected to the combiner; wherein, the residual signal/masking threshold signal multi-description encoders are used to The received residual signal/masking threshold signal is coded into multiple descriptions of the residual signal/masking threshold signal, and the multiple description generated after encoding is sent to the combiner; the combiner is used to separately describe the received residual signals Combined with one path in all masking threshold signal descriptions to generate a multiplexed description that includes both the remaining signal and the masking threshold signal.

本发明所提供的音频信号接收装置由均与分路器相连的剩余信号多描述解码器、掩蔽门限信号多描述解码器组成；其中，分路器，用于将接收到的所有描述中每个描述所包含的剩余信号描述和掩蔽门限信号描述分路，并将分路后所生成的剩余信号/掩蔽门限信号多描述发送给剩余信号/掩蔽门限信号多描述解码器；剩余信号/掩蔽门限信号多描述解码器，用于将收到的剩余信号/掩蔽门限信号多描述解码为一路剩余信号/掩蔽门限信号。The audio signal receiving device provided by the present invention is composed of a multi-description decoder for residual signals and a multi-description decoder for masked threshold signals, both of which are connected to splitters; wherein, the splitter is used to divide each of the received descriptions into The residual signal description and masking threshold signal description contained in the description are divided into branches, and the residual signal/masking threshold signal multi-description generated after the splitting is sent to the residual signal/masking threshold signal multi-description decoder; the residual signal/masking threshold signal The multi-description decoder is used for multi-description decoding the received residual signal/masking threshold signal into one path of residual signal/masking threshold signal.

从原理上讲，本发明可大致分为两个层面，在音频信号处理层面：需要对音频信号进行多描述的分析与合成，如：将音频信号分解为互不相关的掩蔽门限信号和剩余信号；在量化和编码层面：分别对掩蔽门限信号和剩余信号进行多描述多解码器的多描述编解码处理。再有，在信道丢包严重时，还可以根据不同描述的历史记录对丢包进行差错隐藏。In principle, the present invention can be roughly divided into two levels. At the level of audio signal processing: it is necessary to analyze and synthesize the audio signal with multiple descriptions, such as: decomposing the audio signal into mutually irrelevant masking threshold signals and residual signals. ; At the level of quantization and coding: perform multiple description codec processing with multiple description and multiple decoders on the masked threshold signal and the residual signal respectively. Furthermore, when the channel packet loss is serious, the error concealment of the packet loss can also be carried out according to the historical records of different descriptions.

参见图1，图1是本发明一较佳实施例的音频信号编码器结构及原理图。图1中，经过窗选的原始音频信号被分成两路，将其中一路输入时频分析模块110，将另一路输入心理声学模块120。Referring to FIG. 1, FIG. 1 is a structural and schematic diagram of an audio signal encoder according to a preferred embodiment of the present invention. In FIG. 1 , the window-selected original audio signal is divided into two channels, one of which is input to the time-frequency analysis module 110 , and the other is input to the psychoacoustics module 120 .

心理声学模块120对收到的音频信号进行心理声学模型分析，得到与当前收到的音频帧相关的掩蔽门限信号，并将得到的掩蔽门限信号分别发送给剩余信号分析模块130、多描述编码器140和比特分配模块150。时频分析模块110对收到的音频信号进行修正离散余弦变换(MDCT)等时频分析处理，并将时频分析处理所得到的频域MDCT系数等时频变换参数发送给剩余信号分析模块130；剩余信号分析模块130利用收到的掩蔽门限信号去除收到的时频变换参数中的听觉不相干信息或不相干度，得到去除了听觉不相干性的剩余信号并发送给多描述编码器140。The psychoacoustic module 120 performs psychoacoustic model analysis on the received audio signal to obtain a masking threshold signal related to the currently received audio frame, and sends the obtained masking threshold signal to the residual signal analysis module 130 and the multi-description encoder respectively. 140 and bit allocation module 150. The time-frequency analysis module 110 performs time-frequency analysis processing such as Modified Discrete Cosine Transform (MDCT) on the received audio signal, and sends time-frequency transformation parameters such as frequency domain MDCT coefficients obtained by the time-frequency analysis processing to the residual signal analysis module 130 ; The residual signal analysis module 130 uses the received masking threshold signal to remove the auditory irrelevant information or degree of incoherence in the received time-frequency transformation parameters, obtains the residual signal from which the auditory irrelevance has been removed and sends it to the multi-description encoder 140 .

多描述编码器140对收到的能表征当前音频信号信息的剩余信号和掩蔽门限信号进行多描述编码，得到两个可以单独或联合处理的描述：描述1和描述2，并将描述1和描述2发送给无失真编码及音频包处理模块160。比特分配模块150将收到的掩蔽门限信号作为比特分配的控制信息，根据该掩蔽门限信号确定比特分配方式、描述标识和量化器标识等边信息，并将确定的边信息发送给无失真编码及音频包处理模块160。The multi-description encoder 140 performs multi-description encoding on the received residual signal and the masking threshold signal that can represent the information of the current audio signal, to obtain two descriptions that can be processed separately or jointly: description 1 and description 2, and description 1 and description 2 to the lossless encoding and audio packet processing module 160. The bit allocation module 150 uses the received masking threshold signal as the control information of bit allocation, determines side information such as bit allocation mode, description identifier and quantizer identifier according to the masking threshold signal, and sends the determined side information to the lossless coding and Audio package processing module 160.

无失真编码及音频包处理模块160对收到的描述1和描述2进行Huffman编码、算术编码或游程长度编码等无失真编码处理，以消除信源多余度，进一步降低比特率；之后将所述边信息加入完成编码处理的描述1和描述2并进行比特封装，再将封装好的描述1和描述2的编码比特流送入信道。Lossless encoding and audio packet processing module 160 performs lossless encoding processes such as Huffman encoding, arithmetic encoding or run-length encoding to the received description 1 and description 2, to eliminate the redundancy of information sources and further reduce the bit rate; The side information is added to the description 1 and description 2 after encoding processing and bit-encapsulated, and then the encoded bit streams of the encapsulated description 1 and description 2 are sent to the channel.

参见图2，图2是本发明一较佳实施例的音频信号解码器结构及原理图。图2中，音频包解包及无失真解码模块210、多描述解码器220、参数重构模块230和时频合成模块240依次相连。其中，音频包解包及无失真解码模块210对收到的描述1和描述2进行解包和解码操作，并将得到的描述1、描述2和边信息发送给多描述解码器220；多描述解码器220对收到的描述 1、描述2和边信息进行多描述解码，将解码所得到的掩蔽门限信号和剩余信号发送给参数重构模块230；参数重构模块230利用收到的掩蔽门限信号和剩余信号重构时频变换参数，并将完成重构所得到的时频变换参数发送给时频合成模块240，由时频合成模块240根据时频变换参数重构出音频信号。Referring to FIG. 2, FIG. 2 is a structural and schematic diagram of an audio signal decoder in a preferred embodiment of the present invention. In FIG. 2 , an audio packet unpacking and distortion-free decoding module 210 , a multi-description decoder 220 , a parameter reconstruction module 230 and a time-frequency synthesis module 240 are connected in sequence. Among them, the audio packet unpacking and lossless decoding module 210 performs unpacking and decoding operations on the received description 1 and description 2, and sends the obtained description 1, description 2 and side information to the multi-description decoder 220; The decoder 220 performs multi-description decoding on the received description 1, description 2 and side information, and sends the decoded masking threshold signal and the remaining signal to the parameter reconstruction module 230; the parameter reconstruction module 230 utilizes the received masking threshold The time-frequency transformation parameters are reconstructed from the signal and the remaining signal, and the time-frequency transformation parameters obtained after the reconstruction are completed are sent to the time-frequency synthesis module 240, and the audio signal is reconstructed by the time-frequency synthesis module 240 according to the time-frequency transformation parameters.

在上述的图1、图2中，心理声学模块120中设置有心理声学模型，该模型描述有人耳对音频信号的感知特性，该描述主要体现在掩蔽特性；从音频压缩和编码的角度看，心理声学模型决定了临界频带内人耳感觉不到的最大量化噪声的能量，或者说是在听觉感知意义上的噪声掩蔽门限。具体而言，所述心理声学模型可以有不同的实现方法，如：采用目前常用的MPEG-1和MPEG-2的第一、二层音频编码所采用的模型1，或采用作为第三层的MP3音频编码所采用的模型2，或采用Ogg Vorbis音频编码中的基底曲线(Floor)，也可以采用AC3音频编码中的心理声学模型等其它种类的心理声学模型。In above-mentioned Fig. 1, Fig. 2, psychoacoustic model is set in the psychoacoustic module 120, and this model describes the perceptual characteristic of human ear to audio signal, and this description is mainly embodied in masking characteristic; From the perspective of audio compression and coding, The psychoacoustic model determines the energy of the maximum quantization noise that cannot be perceived by the human ear in the critical frequency band, or the noise masking threshold in the sense of auditory perception. Specifically, the psychoacoustic model can have different implementation methods, such as: adopting the model 1 adopted by the first and second layers of audio coding of MPEG-1 and MPEG-2 commonly used at present, or adopting the model 1 used as the third layer The model 2 used in MP3 audio coding, or the base curve (Floor) in Ogg Vorbis audio coding, can also use other types of psychoacoustic models such as the psychoacoustic model in AC3 audio coding.

时频分析模块110的功能是对时域音频信号进行变换或滤波，以去除原始音频信号中的相关性所带来的多余度。基于变换的时频分析可以采用MDCT、调制重叠变换(MLT)或离散小波变换(DWT)等时频变换方式，基于变换的时频分析得到的是变换域或频域的音频参数；基于滤波的时频分析可以采用类似于MPEG-1和MPEG-2音频编码采用的子带滤波算法，并在各个子带内进行如MDCT、MLT或DWT等的时频变换处理，得到的也是变换域或频域的音频参数。另外，图2中的时频合成模块240的功能与时频分析模块110的功能相反，即：利用频域音频参数进行反变换，得到重构后的音频信号。The function of the time-frequency analysis module 110 is to transform or filter the time-domain audio signal, so as to remove the redundancy brought about by the correlation in the original audio signal. Transform-based time-frequency analysis can use time-frequency transformation methods such as MDCT, modulated lapped transform (MLT) or discrete wavelet transform (DWT), and transform-based time-frequency analysis obtains audio parameters in the transform domain or frequency domain; filter-based Time-frequency analysis can adopt the sub-band filtering algorithm similar to MPEG-1 and MPEG-2 audio coding, and perform time-frequency transformation processing such as MDCT, MLT or DWT in each sub-band, and obtain the transform domain or frequency Audio parameters for the domain. In addition, the function of the time-frequency synthesis module 240 in FIG. 2 is opposite to that of the time-frequency analysis module 110 , that is, the frequency-domain audio parameters are used for inverse transformation to obtain the reconstructed audio signal.

剩余信号分析模块130的主要功能是剔除存留在经过时频分析后的频域音频信号中的听觉不相关度。如果这种处理是在线性域进行的，可以用音频频域参数除以掩蔽门限信号得到剩余信号；如果是在对数域进行的，可以用音频频域参数减去掩蔽门限信号得到剩余信号。另外，图2中的参数重构模块230的功能与剩余信号分析模块130的功能相反，即：用多描述解码得到的剩余信号和掩蔽门限信号重构音频频域参数。如果是线性域，可以用听觉剩余信号与掩蔽门限信号相乘得到时频变换参数；如果是对数域，可以用剩余信号与掩蔽门限信号相加得到时频变换参数。The main function of the remaining signal analysis module 130 is to remove the auditory irrelevance remaining in the frequency domain audio signal after the time-frequency analysis. If this processing is performed in the linear domain, the residual signal can be obtained by dividing the audio frequency domain parameters by the masking threshold signal; if it is performed in the logarithmic domain, the residual signal can be obtained by subtracting the masking threshold signal from the audio frequency domain parameters. In addition, the function of the parameter reconstruction module 230 in Fig. 2 is opposite to the function of the residual signal analysis module 130, that is: the residual signal obtained by multi-description decoding and the masking threshold signal are used to reconstruct the audio frequency domain parameters. If it is in the linear domain, the time-frequency transformation parameters can be obtained by multiplying the auditory residual signal and the masking threshold signal; if it is in the logarithmic domain, the time-frequency transformation parameters can be obtained by adding the residual signal and the masking threshold signal.

比特分配模块150的功能是根据收到的掩蔽门限信号控制多描述编码的量化器量化精度，同时根据可用比特数对无失真编码和音频包的形成进行动态比特分配，用迭代方法反复调整量化精度和比特分配，直到可用比特数用完，或者已达到预设编码质量。在实际应用中，比特分配模块150支持固定速率(CBR)、可变速率(VBR)和平均速率(ABR)等编码模式。The function of the bit allocation module 150 is to control the quantization precision of the multi-description coding quantizer according to the received masking threshold signal, and at the same time perform dynamic bit allocation to the distortion-free coding and the formation of audio packets according to the available bits, and repeatedly adjust the quantization precision with an iterative method and bits are allocated until the number of available bits is exhausted, or the preset encoding quality has been reached. In practical applications, the bit allocation module 150 supports coding modes such as constant rate (CBR), variable rate (VBR) and average rate (ABR).

无失真编码及音频包处理模块160用于分别对描述1和描述2进行无失真熵编码，之后再加上边信息形成两个编码描述比特流输出。输出的所述两个描述比特流在重要性方面可以是等同的或不等同的，上述两个描述的编码所需的比特数可以是相同的或不同的，这两个描述比特流的编码速率可以是相同的或不同的。另外，音频包解包及无失真解码模块210的功能与无失真编码及音频包处理模块160的功能相反，即：对两个描述比特流进行解包和无失真解码，得到两个音频描述信息。The lossless encoding and audio packet processing module 160 is used to perform lossless entropy encoding on description 1 and description 2 respectively, and then add side information to form two encoded description bit stream outputs. The two output description bit streams may be equal or not equal in importance, the number of bits required for the encoding of the above two descriptions may be the same or different, and the encoding rates of the two description bit streams Can be the same or different. In addition, the functions of the audio packet unpacking and lossless decoding module 210 are opposite to those of the lossless encoding and audio packet processing module 160, that is, two description bit streams are unpacked and lossless decoded to obtain two audio description information .

在实际应用中，图1所示的多描述编码器140可以如图3所示，图2所示的多描述解码器220则可以如图4所示。In practical applications, the multi-description encoder 140 shown in FIG. 1 may be shown in FIG. 3 , and the multi-description decoder 220 shown in FIG. 2 may be shown in FIG. 4 .

参见图3，图3是本发明多描述编码器结构及原理图。图3中，剩余信号多描述编码器310和掩蔽门限信号多描述编码器320均分别与合路器330、合路器340相连。实际应用时，剩余信号多描述编码器310对收到的剩余信号进行编码处理，并将编码处理所形成的剩余信号描述1和剩余信号描述2分别发送给合路器330、合路器340；掩蔽门限信号多描述编码器320对收到的掩蔽门限信号进行编码处理，并将编码处理所形成的掩蔽门限信号描述1和掩蔽门限信号描述2分别发送给合路器330、合路器340。合路器330对收到的剩余信号描述1和掩蔽门限信号描述1进行合路处理，并将完成合路后所生成的描述1发送出去；合路器340对收到的剩余信号描述2和掩蔽门限信号描述2进行合路处理，并将完成合路后所生成的描述2发送出去。Referring to FIG. 3 , FIG. 3 is a structure and principle diagram of a multi-descriptive encoder according to the present invention. In FIG. 3 , the multi-description encoder 310 for the residual signal and the multi-description encoder 320 for the masked threshold signal are connected to a combiner 330 and a combiner 340 respectively. In practical application, the residual signal multi-description encoder 310 encodes the received residual signal, and sends the residual signal description 1 and residual signal description 2 formed by the encoding process to the combiner 330 and the combiner 340 respectively; The masking threshold signal multiple description encoder 320 encodes the received masking threshold signal, and sends the masking threshold signal description 1 and masking threshold signal description 2 formed by the encoding process to the combiner 330 and the combiner 340 respectively. The combiner 330 performs combination processing on the received residual signal description 1 and the masking threshold signal description 1, and sends the generated description 1 after the combination is completed; the combiner 340 combines the received residual signal description 2 and The description 2 of the masked threshold signal is combined, and the description 2 generated after the combination is completed is sent out.

剩余信号多描述编码器310和掩蔽门限信号多描述编码器320可以采用的编码算法有多种，如：目前常用的多描述标量量化算法(MDSQ)、多描述变换编码算法(MDTC)或多描述矢量量化(VQ)方法。值得注意的是：相对于剩余信号而言，由于掩蔽门限信号可能只包含较小的数据量，所以掩蔽门限信号所对应的多描述编码方式还可以是直接拷贝。There are many encoding algorithms that can be used by the multi-description encoder 310 for the residual signal and the multi-description encoder 320 for the masked threshold signal, such as: the currently commonly used multi-description scalar quantization algorithm (MDSQ), the multi-description transform coding algorithm (MDTC) or the multi-description Vector quantization (VQ) method. It is worth noting that: compared with the remaining signal, since the masking threshold signal may only contain a small amount of data, the multiple description encoding method corresponding to the masking threshold signal can also be a direct copy.

当然，剩余信号多描述编码器310和掩蔽门限信号多描述编码器320还可以接收并处理边信息，再将完成处理的边信息发送给合路器330和合路器340，由合路器330、合路器340对收到的边信号以及其它描述进行合路处理。Of course, the multi-description encoder 310 for the residual signal and the multi-description encoder 320 for the masked threshold signal can also receive and process the side information, and then send the processed side information to the combiner 330 and the combiner 340, and the combiner 330, The combiner 340 combines the received side signals and other descriptions.

参见图4，图4是本发明多描述解码器结构及原理图。图4中，分路器410、分路器420均分别与剩余信号多描述解码器430和掩蔽门限信号多描述解码器440相连。实际应用时，分路器410对收到的描述1进行分路处理，并将分路处理所形成的剩余信号描述1和掩蔽门限信号描述1分别发送给剩余信号多描述解码器430、掩蔽门限信号多描述解码器440；分路器420对收到的描述2进行分路处理，并将分路处理所形成的剩余信号描述2和掩蔽门限信号描述2分别发送给剩余信号多描述解码器430、掩蔽门限信号多描述解码器440。剩余信号多描述解码器430对收到的剩余信号描述1和剩余信号描述2进行解码处理，并将完成解码后所生成的完成重构的剩余信号发送出去；掩蔽门限信号多描述解码器440对收到的掩蔽门限信号描述1和掩蔽门限信号描述2进行解码处理，并将完成解码后所生成的完成重构的掩蔽门限信号发送出去。Referring to FIG. 4, FIG. 4 is a structural and schematic diagram of a multi-descriptive decoder in the present invention. In FIG. 4 , the splitter 410 and the splitter 420 are respectively connected to the residual signal multi-description decoder 430 and the masked threshold signal multi-description decoder 440 . In practical application, the demultiplexer 410 performs demultiplexing processing on the received description 1, and sends the residual signal descriptive 1 and the masking threshold signal descriptive 1 formed by the demultiplexing processing to the residual signal multi-description decoder 430, the masking threshold Signal multi-description decoder 440; splitter 420 splits the received description 2, and sends the remaining signal description 2 and masked threshold signal description 2 formed by the splitting process to the remaining signal multi-description decoder 430 , The masking threshold signal multi-descriptor decoder 440 . The residual signal multi-description decoder 430 decodes the received residual signal description 1 and residual signal description 2, and sends out the reconstructed residual signal generated after decoding; the masked threshold signal multi-description decoder 440 pairs The received masking threshold signal description 1 and masking threshold signal description 2 are decoded, and the reconstructed masking threshold signal generated after decoding is sent out.

当然，分路器410和分路器420还可以接收并处理边信息，再将完成处理的边信息发送给剩余信号多描述解码器430、掩蔽门限信号多描述解码器440，由剩余信号多描述解码器430、掩蔽门限信号多描述解码器440对收到的边信号以及其它描述进行解码处理。再有，在实际应用中，发送给所述分路器的可能只有描述1或描述2；在这种情况下，接收到所述描述的分路器会正常对该描述进行分路等后续处理，完成分路的描述也会被正常进行后续的解码处理。Of course, the splitter 410 and the splitter 420 can also receive and process the side information, and then send the processed side information to the residual signal multiple description decoder 430 and the masking threshold signal multiple description decoder 440, and the residual signal multiple description The decoder 430 and the masked threshold signal multi-description decoder 440 decode the received side signals and other descriptions. Furthermore, in practical applications, only description 1 or description 2 may be sent to the splitter; in this case, the splitter that receives the description will normally perform subsequent processing such as splitting on the description , the completion of the branch description will also be followed by the normal decoding process.

在实际应用中，图4所示的剩余信号多描述解码器430可以如图5所示，而掩蔽门限信号多描述解码器440则可以如图6所示。In practical applications, the residual signal multi-description decoder 430 shown in FIG. 4 may be shown in FIG. 5 , and the masked threshold signal multi-description decoder 440 may be shown in FIG. 6 .

参见图5，图5是本发明剩余信号多描述解码器结构及原理图，图5所示的解码器是采用三解码器结构对接收到的描述进行多描述解码的。具体而言，如果只接收到一个描述，就用收到该描述的作为边解码器的剩余信号多描述解码器a 510或剩余信号多描述解码器c 530对所述描述解码；如果接收到两个描述，就用收到该描述的作为中心解码器的剩余信号多描述解码器b 520对所述描述解码。Referring to Fig. 5, Fig. 5 is a structure and schematic diagram of a residual signal multi-description decoder according to the present invention. The decoder shown in Fig. 5 adopts a three-decoder structure to perform multi-description decoding on received descriptions. Specifically, if only one description is received, the description is decoded by the residual signal multi-description decoder a 510 or the residual signal multi-description decoder c 530 as a side decoder receiving the description; if two A description, the multi-description decoder b 520 decodes the description with the remaining signal received as the central decoder.

参见图6，图6是本发明掩蔽门限信号多描述解码器结构及原理图，图6所示的解码器是采用三解码器结构对接收到的描述进行多描述解码的。具体而言，如果只接收到一个描述，就用收到该描述的作为边解码器的掩蔽门限信号多描述解码器a 610或掩蔽门限信号多描述解码器c 630对所述描述解码；如果接收到两个描述，就用收到该描述的作为中心解码器的掩蔽门限信号多描述解码器b 620对所述描述解码。Referring to Fig. 6, Fig. 6 is a structure and schematic diagram of a multi-description decoder for a masked threshold signal according to the present invention. The decoder shown in Fig. 6 adopts a three-decoder structure to perform multi-description decoding on received descriptions. Specifically, if only one description is received, the description is decoded with the masked threshold signal multi-description decoder a 610 or the masked threshold signal multi-description decoder c 630 that has received the description as a side decoder; When two descriptions are received, the description is decoded by the multi-description decoder b 620 that receives the description as the masked threshold signal of the central decoder.

图5、图6中所示的多描述解码器输出信号的开关位置可根据接收到描述的情况自动进行选择。The switch position of the output signal of the multi-description decoder shown in Fig. 5 and Fig. 6 can be automatically selected according to the situation of the received description.

在实际应用中，图3、图4中的剩余信号和掩蔽门限信号多描述编解码器可以由多描述奇偶分离算法实现，如图7、图8所示；也可以由多描述对偶变换算法实现，如图9、图10所示；还可以由多描述标量量化算法实现，如图11至图18所示。In practical applications, the residual signal and masked threshold signal multi-description codec in Fig. 3 and Fig. 4 can be realized by multi-description parity separation algorithm, as shown in Fig. 7 and Fig. 8; it can also be realized by multi-description dual transformation algorithm , as shown in FIG. 9 and FIG. 10 ; it can also be realized by a multi-description scalar quantization algorithm, as shown in FIG. 11 to FIG. 18 .

下面，以附图为例，分别针对上述的不同多描述编解码算法进行描述。需要说明的是，在以下描述中，多描述编解码的对象主要是剩余信号；在实际应用中，也可以应用相同的多描述编解码算法对掩蔽门限信号或其它音频信号分量进行处理。In the following, the accompanying drawings are taken as an example to describe the above-mentioned different multi-description encoding and decoding algorithms respectively. It should be noted that, in the following description, the object of multi-description coding and decoding is mainly the residual signal; in practical applications, the same multi-description coding and decoding algorithm can also be applied to process masking threshold signals or other audio signal components.

针对由多描述奇偶分离算法实现多描述编解码的情况而言，其操作原则为：将时域或频域音频参数按照其索引值或自然顺序奇偶分开。由于奇偶分开的两个描述完全不相关；因此在两个描述之间引入的冗余度为零，总的编码速率也不会因此而增加。For the case where the multi-description codec is realized by the multi-description parity separation algorithm, its operating principle is: the time-domain or frequency-domain audio parameters are separated according to their index values or natural order. Since the parity-separated two descriptions are completely uncorrelated; therefore the redundancy introduced between the two descriptions is zero and the overall encoding rate will not increase as a result.

参见图7，图7是本发明剩余信号多描述编码器结构及原理图。图7中，奇偶分离模块710对收到的剩余信号进行奇偶分离处理，并将完成处理所生成的描述1、描述2分别发送给熵编码器720、熵编码器730；熵编码器720对收到的描述1进行编码处理，并将完成编码处理后所生成的描述1的比特流发送出去；熵编码器730对收到的描述2进行编码处理，并将完成编码处理后所生成的描述2的比特流发送出去。Referring to FIG. 7, FIG. 7 is a structural and schematic diagram of a multi-descriptive encoder for residual signals according to the present invention. In Fig. 7, the parity separation module 710 performs parity separation processing on the remaining signals received, and sends the description 1 and description 2 generated by completing the processing to the entropy encoder 720 and the entropy encoder 730 respectively; The received description 1 is encoded, and the generated description 1 bit stream is sent after the encoding process is completed; the entropy encoder 730 is encoded for the received description 2, and the generated description 2 is generated after the encoding process is completed The bit stream is sent out.

具体而言，剩余信号用R(k)表示：R(k)，k＝1，2，3，......NSpecifically, the residual signal is represented by R(k): R(k), k=1, 2, 3,...N

其中，N为剩余信号的个数(一般为偶数)，同时也是音频分析窗长的一半。Wherein, N is the number of remaining signals (usually an even number), and is also half of the audio analysis window length.

剩余信号的两个多描述算法信号分别用两个描述M₁(k₁)和M₂(k₂)表示：The two multi-description algorithm signals of the remaining signal are represented by two descriptions M ₁ (k ₁ ) and M ₂ (k ₂ ) respectively:

描述1：M₁(k₁)，k₁＝1，2，3，......N/2Description 1: M ₁ (k ₁ ), k ₁ =1, 2, 3, ... N/2

描述2：M₂(k₂)，k₂＝1，2，3，......N/2Description 2: M ₂ (k ₂ ), k ₂ =1, 2, 3, ... N/2

那么奇偶多描述变换算法及结果如下：Then the odd-even multi-description transformation algorithm and the results are as follows:

当k为奇数，即k＝1，3，5，......N-1时，M₁(k₁)＝R(k)When k is an odd number, namely k=1, 3, 5, ... N-1, M ₁ (k ₁ )=R(k)

其中，k₁＝(k+1)/2；Among them, k ₁ =(k+1)/2;

当k为偶数，即k＝2，4，6，......N时，M₂(k₂)＝R(k)When k is an even number, namely k=2, 4, 6, ... N, M ₂ (k ₂ )=R(k)

其中，k₂＝k/2。Among them, k ₂ =k/2.

在上述奇偶多描述变换后，熵编码器720、730分别对多描述信号M₁(k₁)和M₂(k₂)进行编码，并把编码所得的数据和各自的掩蔽门限信号组合形成描述比特流发送出去。After the above odd-even multi-description transformation, the entropy encoders 720 and 730 respectively encode the multi-description signals M ₁ (k ₁ ) and M ₂ (k ₂ ), and combine the encoded data with their respective masking threshold signals to form a description The bitstream is sent out.

参见图8，图8是与图7中的多描述编码器相配合的多描述解码器结构及原理图。图8中，熵解码器810对收到的描述1的比特流进行解码处理，并将完成解码处理后所生成的描述1发送给奇偶合成模块830、奇偶合成模块840；熵解码器820对收到的描述2的比特流进行解码处理，并将完成解码处理后所生成的描述2发送给奇偶合成模块850、奇偶合成模块840。Referring to FIG. 8 , FIG. 8 is a structural and schematic diagram of a multi-description decoder cooperating with the multi-description encoder in FIG. 7 . In Fig. 8, the entropy decoder 810 decodes the bit stream of the received description 1, and sends the generated description 1 to the parity synthesis module 830 and the parity synthesis module 840 after completing the decoding process; The received bit stream of description 2 is decoded, and the description 2 generated after the decoding process is completed is sent to the parity synthesis module 850 and the parity synthesis module 840.

图8中所示的奇偶合成模块输出信号的开关位置可根据接收到描述的情况自动进行选择，如：当只接收到一个描述时，就用收到该描述的奇偶合成模块对该描述进行解码，并连通该奇偶合成模块的输出端以输出经奇偶合成后所生成的完成重构的剩余信号；当两个描述都接收到时，就用收到这两个描述的奇偶合成模块840对这两个描述进行解码，并连通奇偶合成模块840的输出端以输出经奇偶合成后所生成的完成重构的剩余信号。The switch position of the output signal of the parity synthesis module shown in Figure 8 can be automatically selected according to the situation of receiving the description, such as: when only one description is received, the parity synthesis module that receives the description is used to decode the description , and connected to the output end of the parity synthesis module to output the remaining signal after the parity synthesis to complete the reconstruction; when both descriptions are received, the parity synthesis module 840 that receives the two descriptions The two descriptions are decoded and connected to the output terminal of the parity combination module 840 to output the reconstructed residual signal generated after the parity combination.

具体而言，如果两个描述都接收到了，就分别对两个描述进行解码，得到两个多描述信号：描述1(M₁)和描述2(M₂)，并将这两个描述合成为剩余信号，具体的合成算法如下：Specifically, if both descriptions are received, the two descriptions are decoded separately to obtain two multi-description signals: description 1 (M ₁ ) and description 2 (M ₂ ), and these two descriptions are synthesized into The residual signal, the specific synthesis algorithm is as follows:

当k为奇数，即k＝1，3，5，......N-1时，使R(k)＝M₁(k₁)，其中k₁＝(k+1)/2；When k is an odd number, that is, k=1, 3, 5, ... N-1, make R(k)=M ₁ (k ₁ ), where k ₁ =(k+1)/2;

当k为偶数，即k＝2，4，6，......N时，使R(k)＝M₂(k₂)，其中k₂＝k/2。When k is an even number, that is, k=2, 4, 6, ... N, make R(k)=M ₂ (k ₂ ), where k ₂ =k/2.

如果只接收到一个描述，就对这个描述进行解码，得到一个多描述信号。If only one description is received, this description is decoded to obtain a multi-description signal.

当这个多描述信号为描述1(M₁)时：When this multiple description signal is description 1(M ₁ ):

当k为奇数，即k＝1，3，5，......N-1时，使R(k)＝M₁(k₁)，其中k₁＝(k+1)/2，When k is an odd number, namely k=1, 3, 5, ... N-1, make R(k)=M ₁ (k ₁ ), where k ₁ =(k+1)/2,

当k为偶数，即k＝2，4，6，......N时，使R(k)＝0，其中k₂＝k/2。When k is an even number, that is, k=2, 4, 6, . . . N, set R(k)=0, where k ₂ =k/2.

当这个描述信号为描述2(M₂)时：When this description signal is description 2(M ₂ ):

当k为奇数，即k＝1，3，5，......N-1时，使R(k)＝0，其中k₁＝(k+1)/2，When k is an odd number, that is, when k=1, 3, 5, ... N-1, make R(k)=0, wherein k ₁ =(k+1)/2,

至此，就成功重构出了剩余信号。So far, the remaining signal has been successfully reconstructed.

针对由多描述对偶变换算法实现多描述编解码的情况而言，其操作原则为：通过变换矩阵T把两个不相关的变量A和B变换成两个具有一定相关性的变量C和D。变量C和D之间相关性的大小由变换矩阵T决定。具体而言，所述变换矩阵通常被分为正交矩阵和非正交矩阵，这两种变换矩阵所引入的相关性的范围不相同，而且对应的对偶变换算法的实现方法也完全不同。For the case of multi-description encoding and decoding realized by the multi-description dual transformation algorithm, its operating principle is: through the transformation matrix T, two irrelevant variables A and B are transformed into two variables C and D with certain correlation. The magnitude of the correlation between variables C and D is determined by the transformation matrix T. Specifically, the transformation matrix is generally divided into an orthogonal matrix and a non-orthogonal matrix. The ranges of correlations introduced by the two transformation matrices are different, and the implementation methods of the corresponding dual transformation algorithms are also completely different.

下面，只以非正交矩阵为例描述多描述对偶变换算法。In the following, only non-orthogonal matrices are taken as an example to describe the multiple description dual transformation algorithm.

所述对偶变换算法中各变换参数的定义为：The definition of each transformation parameter in the dual transformation algorithm is:

输入矩阵： $[\begin{matrix} A \\ B \end{matrix}]$ Input matrix: $[\begin{matrix} A \\ B \end{matrix}]$

对偶变换矩阵： $T = [\begin{matrix} a & b \\ c & d \end{matrix}]$ Dual transformation matrix: $T = [\begin{matrix} a & b \\ c & d \end{matrix}]$

对偶变换输出矩阵： $[\begin{matrix} C \\ D \end{matrix}] = T [\begin{matrix} A \\ B \end{matrix}]$ Dual transformation output matrix: $[\begin{matrix} C \\ D. \end{matrix}] = T [\begin{matrix} A \\ B \end{matrix}]$

其中，a，b，c，d为变换矩阵的系数，决定着在两个描述之间的引入的冗余度，且ad-bc＝1；A，B为输入信号；C，D为输出信号。Among them, a, b, c, d are the coefficients of the transformation matrix, which determine the redundancy introduced between the two descriptions, and ad-bc=1; A, B are input signals; C, D are output signals .

整型变换的具体算法如下：在编码端，编码器对信号A和B进行对偶正变换：The specific algorithm of integer transformation is as follows: At the encoding end, the encoder performs dual positive transformation on signals A and B:

$\overset{&OverBar; &OverBar;}{A A} = = [[\frac{A A}{Q Q}]],,$ $\overset{&OverBar; &OverBar;}{B B} = = [[\frac{B B}{Q Q}]]$

$W W = = \overset{&OverBar; &OverBar;}{B B} + + [[\frac{11 + + c c}{d d} \overset{&OverBar; &OverBar;}{A A}]]$

$\overset{&OverBar; &OverBar;}{D D.} = = [[dW wxya]] - - \overset{&OverBar; &OverBar;}{A A}$

$\overset{&OverBar; &OverBar;}{C C} = = W W - - [[\frac{11 - - b b}{d d} \overset{&OverBar; &OverBar;}{D D.}]]$

其中，Q为量化步长，W为中间变量，符号‘[]’表示取整操作。

和

分别为A和B量化后的整型变量，

和

分别为两个整型输出变量，

和

将被分别编码以形成两个描述比特流。Among them, Q is the quantization step size, W is an intermediate variable, and the symbol '[]' represents the rounding operation.

and

are the quantized integer variables of A and B respectively,

and

are two integer output variables,

and

will be encoded separately to form two description bitstreams.

在解码端，具体的解码过程因接收到的描述的不同而存在以下三种情况：At the decoding end, the specific decoding process has the following three situations depending on the received description:

(1)

和

两个描述都被正确接收时，由收到这两个描述的解码器对信号

和进行下面对偶反变换：(1)

and

When both descriptions are received correctly, the signal is signaled by the decoder receiving both descriptions

and Perform the following dual inverse transformation:

$W W = = \overset{&OverBar; &OverBar;}{C C} + + [[\frac{11 - - b b}{d d} \overset{&OverBar; &OverBar;}{D D.}]]$

$\overset{&OverBar; &OverBar;}{A A} = = [[dW wxya]] - - \overset{&OverBar; &OverBar;}{D D.}$

$\overset{&OverBar; &OverBar;}{B B} = = W W - - [[\frac{11 + + c c}{d d} \overset{&OverBar; &OverBar;}{A A}]]$

$\overset{^^}{A A} = = \overset{&OverBar; &OverBar;}{A A} Q Q,,$ $\overset{^^}{B B} = = \overset{&OverBar; &OverBar;}{B B} Q Q$

其中，Q为量化步长；W为中间变量；和

分别为

和

逆量化后的值；‘[]’符号表示取整操作。Wherein, Q is the quantization step size; W is an intermediate variable; and

respectively

and

The value after inverse quantization; the '[]' symbol indicates a rounding operation.

(2)只有一个描述

被正确接收时，首先对丢失的信号

进行预测：(2) Only one description

When correctly received, the missing signal is first

Make predictions:

$\overset{~ ~}{C C} = = \overset{&OverBar; &OverBar;}{C C} Q Q$

$\overset{^^}{D D.} = = \frac{{σ σ}_{c c} * * {σ σ}_{d d}}{{σ σ}_{c c}^{22} + + {σ σ}_{q q}^{22}} * * \overset{~ ~}{C C} * * cos cos φ φ$

再对

和

进行反变换：again

and

Do the inverse transformation:

$[\begin{matrix} \overset{^^}{A A} \\ B B \end{matrix}] = = {T T}^{- - 11} [\begin{matrix} \overset{~ ~}{C C} \\ \overset{^^}{D D.} \end{matrix}]$

其中，σ_c、σ_d和σ_q分别为变量C、D和量化误差的标准差；

为

经过解码器逆量化后的值；

为解码器对

预测恢复出来的值；矩阵T^-1为矩阵T的逆矩阵；和为解码器重构出来值；cosφ是变量C、D之间的相关系数。(3)只有一个描述

被正确接收时，首先对丢失的信号

进行预测，Among them, σ _c , σ _d and σ _q are the standard deviations of variables C, D and quantization error, respectively;

for

The value after inverse quantization by the decoder;

for the decoder pair

Predict the recovered value; matrix T ^-1 is the inverse matrix of matrix T; and is the value reconstructed by the decoder; cosφ is the correlation coefficient between variables C and D. (3) Only one description

When correctly received, the missing signal is first

make predictions,

$\overset{~ ~}{D D.} = = \overset{&OverBar; &OverBar;}{D D.} Q Q$

$\overset{^^}{C C} = = \frac{{σ σ}_{c c} * * {σ σ}_{d d}}{{σ σ}_{c c}^{22} + + {σ σ}_{q q}^{22}} * * \overset{~ ~}{D D.} * * cos cos φ φ$

再对和

进行反变换：again and

Do the inverse transformation:

$[\begin{matrix} \overset{^^}{A A} \\ \overset{^^}{B B} \end{matrix}] = = {T T}^{- - 11} [\begin{matrix} \overset{^^}{C C} \\ \overset{~ ~}{D D.} \end{matrix}]$

其中，σ_c、σ_d和σ_q分别为变量C、D和量化误差的标准差；为经过解码器逆量化后的值；为解码器经过对

预测恢复出来的值；矩阵T^-1为矩阵T的逆矩阵；和为解码器重构出来值；cosφ是变量C、D之间的相关系数。Among them, σ _c , σ _d and σ _q are the standard deviations of variables C, D and quantization error, respectively; for The value after dequantization by the decoder;

Predict the recovered value; matrix T ^-1 is the inverse matrix of matrix T; and is the value reconstructed by the decoder; cosφ is the correlation coefficient between variables C and D.

需要说明的是：C、D是由输入信号A、B直接和矩阵T相乘得到的：

而

是把量化后的A、B经过整型变换得到的。It should be noted that: C and D are obtained by multiplying the input signals A and B directly with the matrix T:

and

It is obtained by transforming quantized A and B through integer transformation.

参见图9，图9是本发明剩余信号多描述编码器结构及原理图。图9中，剩余信号分对模块910对收到的剩余信号进行信号分对处理，并将完成处理所生成的两路剩余信号发送给对偶变换模块920；对偶变换模块920对收到的两路剩余信号分别进行对偶变换处理并生成两路剩余信号描述M₁、M₂，再将M₁、M₂分别发送给熵编码器930、熵编码器940；熵编码器930对收到的M₁进行编码处理，并将完成编码处理后所生成的M₁的比特流发送出去；熵编码器940对收到的M₂进行编码处理，并将完成编码处理后所生成的M₂的比特流发送出去。Referring to Fig. 9, Fig. 9 is a structural and principle diagram of a multi-descriptive encoder for residual signals according to the present invention. In Fig. 9, the remaining signal pairing module 910 performs signal pairing processing on the received remaining signals, and sends the two-way remaining signals generated by completing the processing to the dual transformation module 920; The remaining signals are subjected to dual transformation processing respectively to generate two residual signals describing M ₁ and M ₂ , and then M ₁ and M ₂ are sent to the entropy encoder 930 and the entropy encoder ₉₄₀ respectively; Perform encoding processing, and send the bit stream of _M1 generated after the encoding process is completed; the entropy encoder 940 performs encoding processing on the received _M2 , and send the bit stream of _M2 generated after the encoding process is completed go out.

其中，N为剩余信号的个数(N一般为偶数)，同时也是音频分析窗长的一半。Wherein, N is the number of remaining signals (N is generally an even number), and is also half of the audio analysis window length.

图9所示原理可以表述为以下步骤：The principle shown in Figure 9 can be expressed as the following steps:

步骤1、初始化循环变量k＝1和其它对偶变换的参数。Step 1. Initialize the loop variable k=1 and other parameters of the dual transformation.

步骤2、剩余信号分对模块910对收到的剩余信号进行分对处理，生成变量A和B并发送给对偶变换模块920；其中，A＝R(k)，B＝R(k+1)。Step 2, the residual signal pairing module 910 performs pairing processing on the received residual signal, generates variables A and B and sends them to the dual transformation module 920; wherein, A=R(k), B=R(k+1) .

步骤3、对偶变换模块920对A和B进行多描述对偶变换，得到与描述1对应的

以及与描述2对应的

Step 3, the dual transformation module 920 performs multiple description dual transformation on A and B, and obtains the corresponding to description 1

and corresponding to description 2

$M_{1} (k_{1}) = \overset{&OverBar;}{C},$ 其中k₁＝(k+1)/2， $m_{1} (k_{1}) = \overset{&OverBar;}{C},$ where k ₁ =(k+1)/2,

$M_{2} (k_{2}) = \overset{&OverBar;}{D},$ 其中k₂＝(k+1)/2； $m_{2} (k_{2}) = \overset{&OverBar;}{D.},$ where k ₂ =(k+1)/2;

当然，还要将

发送给熵编码器930，将

发送给熵编码器940。Of course, the

Sent to the entropy encoder 930, the

Sent to entropy encoder 940.

步骤4、令k＝k+2，如果k＜N-1，转到步骤2；否则，转到步骤5。Step 4. Let k=k+2, if k<N-1, go to step 2; otherwise, go to step 5.

步骤5、熵编码器930、940分别对收到的多描述信号M₁(k₁)、M₂(k₂)进行编码，并把编码所得的数据和各自的掩蔽门限信号一起形成描述的比特流发送出去。Step 5. The entropy encoders 930 and 940 respectively encode the received multi-description signals M ₁ (k ₁ ) and M ₂ (k ₂ ), and form the described bits together with the encoded data and the respective masking threshold signals The stream is sent out.

需要说明的是，在多描述对偶变换中，变换矩阵起着重要的作用，它控制着两个描述之间所引入的冗余度。It should be noted that in the dual transformation of multiple descriptions, the transformation matrix plays an important role, which controls the redundancy introduced between the two descriptions.

参见图10，图10是与图9中的多描述编码器相配合的多描述解码器结构及原理图。图10中，熵解码器1001对收到的描述1的比特流进行解码处理，并将完成解码处理后所生成的描述1发送给对偶反变换边解码器1003、对偶反变换中心解码器1004；熵解码器1002对收到的描述2的比特流进行解码处理，并将完成解码处理后所生成的描述2发送给对偶反变换边解码器1005、对偶反变换中心解码器1004。再有，对偶反变换边解码器、对偶反变换中心解码器均将自身处理描述所得的剩余信号发送给剩余信号合成模块；由剩余信号合成模块对收到的剩余信号进行合成处理，生成合成的剩余信号。Referring to FIG. 10 , FIG. 10 is a structural and schematic diagram of a multi-description decoder cooperating with the multi-description encoder in FIG. 9 . In FIG. 10 , the entropy decoder 1001 decodes the received bit stream of description 1, and sends the generated description 1 after decoding to the dual inverse transform side decoder 1003 and the dual inverse transform central decoder 1004; The entropy decoder 1002 decodes the received bit stream of description 2, and sends the generated description 2 to the dual inverse transform side decoder 1005 and the dual inverse transform central decoder 1004 after the decoding process is completed. Furthermore, both the dual inverse transform side decoder and the dual inverse transform central decoder send the residual signal obtained by their own processing and description to the residual signal synthesis module; the residual signal synthesis module synthesizes the received residual signal to generate a synthesized remaining signal.

图10中所示的剩余信号合成模块输出信号的开关位置可根据接收到描述的情况自动进行选择，如：当只接收到一个描述时，就用收到该描述的对偶反变换边解码器对该描述进行解码，并连通该对偶反变换边解码器所连接的剩余信号合成模块的输出端，以输出经合成后所生成的完成重构的剩余信号；当两个描述都接收到时，就用收到这两个描述的对偶反变换中心解码器1004对这两个描述进行解码，并连通对偶反变换边解码器1004所连接的剩余信号合成模块的输出端，以输出经合成后所生成的完成重构的剩余信号。The switch position of the output signal of the residual signal synthesis module shown in Figure 10 can be selected automatically according to the situation of receiving the description, such as: when only one description is received, the dual inverse transform side decoder that receives the description is used to pair The description is decoded, and connected to the output end of the residual signal synthesis module connected to the dual inverse transform side decoder, so as to output the reconstructed residual signal generated after synthesis; when both descriptions are received, the Decode these two descriptions with the dual inverse transform center decoder 1004 that receives these two descriptions, and connect the output end of the residual signal synthesis module connected to the dual inverse transform side decoder 1004, to output the generated after synthesis The remaining signal of the completed reconstruction.

具体的解码算法可以表述为以下步骤：The specific decoding algorithm can be expressed as the following steps:

步骤2、根据接收到的描述的个数，可以应用不同的解码算法对接收到的描述进行解码。具体而言，根据接收到描述的情况分别进行以下处理：Step 2. Depending on the number of received descriptions, different decoding algorithms may be applied to decode the received descriptions. Specifically, the following processing is performed according to the received description:

(1)如果熵解码器1001、1002共接收到了两个描述：M₁和M₂，那么，接收到上述描述的熵解码器分别确定自身收到的描述所对应的

(1) If the

entropy decoders

1001 and 1002 have received two descriptions: M ₁ and M ₂ , then the entropy decoders receiving the above descriptions respectively determine the corresponding

$\overset{&OverBar;}{C} = M_{1} (k_{1}),$ 其中k₁＝(k+1)/2， $\overset{&OverBar;}{C} = m_{1} (k_{1}),$ where k ₁ =(k+1)/2,

$\overset{&OverBar;}{D} = M_{2} (k_{2}),$ 其中k₂＝(k+1)/2。 $\overset{&OverBar;}{D.} = m_{2} (k_{2}),$ where k ₂ =(k+1)/2.

接着，将

和

发送给对偶反变换中心解码器1004，由对偶反变换中心解码器1004根据多描述对偶变换算法求出和

并发送给剩余信号合成模块1007；之后，由剩余信号合成模块1007对收到的和进行合成处理，并将完成合成处理所生成的完成重构的剩余信号R(k)发送出去。Next, put

and

Sent to the dual inverse transform central decoder 1004, the dual inverse transform central decoder 1004 calculates and

And sent to the residual signal synthesis module 1007; Afterwards, the residual signal synthesis module 1007 performs synthesis processing on the received and , and sends out the reconstructed residual signal R(k) generated after the synthesis processing.

其中， $R (k) = \hat{A},$ $R (k + 1) = \hat{B} .$ in, $R (k) = \hat{A},$ $R (k + 1) = \hat{B} .$

(2)如果只接收到了M₁，那么，接收到M₁的熵解码器确定M₁所对应的

( ₂ ) If only M ₁ is received, then the entropy decoder that receives M ₁ determines the corresponding

所述熵解码器将

发送给对偶反变换边解码器，对偶反变换边解码器根据多描述对偶变换算法预测出并对和

进行对偶反变换求出和

再将其发送给剩余信号合成模块，由该剩余信号合成模块对收到的和进行合成处理，并将完成合成处理所生成的完成重构的剩余信号R(k)发送出去。The entropy decoder will

Sent to the dual inverse transform side decoder, the dual inverse transform side decoder predicts and to and

Perform dual inverse transformation to find and

It is then sent to the residual signal synthesis module, which performs synthesis processing on the received and , and sends out the reconstructed residual signal R(k) generated after the synthesis processing.

(3)如果只接收到了M₂，那么，接收到M₂的熵解码器确定M₂所对应的

( ₃ ) If only M ₂ is received, then the entropy decoder that receives M ₂ determines the corresponding

所述熵解码器将

发送给对偶反变换边解码器；对偶反变换边解码器根据多描述对偶变换算法预测出

并对

和

进行对偶反变换求出和

Sent to the dual inverse transform side decoder; the dual inverse transform side decoder predicts

and to

and

Perform dual inverse transformation to find and

步骤3，令k＝k+2，如果k＜N-1，转到步骤2；否则，转到步骤4。Step 3, let k=k+2, if k<N-1, go to step 2; otherwise, go to step 4.

步骤4，结束。Step 4, end.

至此，剩余信号就被成功重构出来了。So far, the remaining signal has been successfully reconstructed.

针对由多描述标量量化算法实现的多描述编解码而言，其操作原则为：用一个单输入多输出的标量量化器对信源进行量化编码，并将量化所得的多个输出结果作为这个信源的多描述；用一个多输入单输出的标量量化解码器对多描述输入进行编码，并将解码所得的单个输出结果作为这个信源的重构信号。For the multi-description codec implemented by the multi-description scalar quantization algorithm, its operating principle is: use a single-input multi-output scalar quantizer to quantize and encode the information source, and use the multiple output results obtained by quantization as the information Multiple descriptions of the source; a multi-input single-output scalar quantization decoder is used to encode the multi-description input, and the single output result obtained by decoding is used as the reconstructed signal of the source.

下面以两个描述为例阐述多描述标量量化的原理和方法。The principle and method of multi-description scalar quantization are described below by taking two descriptions as examples.

参见图11，图11是本发明一较佳实施例的音频信号处理原理图。图11中，x是原始音频信号；l是经过量化编码后所得到的索引；匹配器a(·)1103用于将索引l匹配成索引对(i，j)；边解码器1104、1106分别用于对接收到的i和j进行解码，产生完成解码后的信号和

中心解码器1105用于对接收到的i和j进行解码，产生完成解码后的信号

Referring to Fig. 11, Fig. 11 is a schematic diagram of audio signal processing in a preferred embodiment of the present invention. In Fig. 11, x is the original audio signal; l is the index obtained after quantization encoding; the matcher a(·) 1103 is used to match the index l into an index pair (i, j); the

side decoders

1104, 1106 respectively Used to decode the received i and j to generate a decoded signal and

The central decoder 1105 is used to decode the received i and j to generate a decoded signal

在整个多描述标量量化的过程中，最重要的就是匹配器a(·)1103的实现，也就是怎样把l匹配成(i，j)。下面举例说明：In the whole process of multi-description scalar quantization, the most important thing is the realization of the matcher a(·)1103, that is, how to match l into (i, j). The following example illustrates:

x经过编码器1102的编码处理后，所生成的索引l的范围如图12所示，为：1～10，即量化x的量化区间总共有十个。After x undergoes encoding processing by the encoder 1102 , the range of the generated index l is as shown in FIG. 12 , which is: 1-10, that is, there are ten quantization intervals for quantizing x.

这时，可以使用表1来实现匹配器a(·)1103的功能，即把l匹配成i和j：At this time, Table 1 can be used to realize the function of the matcher a(·)1103, that is, to match l into i and j:

表1多描述标量量化表Table 1 Multi-description scalar quantization table

由表1所实现的匹配过程的本质是用两个多描述的量化器(分别对应量化后的索引值i和j)来代替单描述时的量化器(对应量化后的索引值l)，这三个量化器的量化区间之间的对应关系如图13所示。在实际应用中，可以通过调整多描量化精度来控制在多描述之间所引入的冗余度，通常可以通过设计不同表格的方式来实现这种调整。一般情况下，多描述量化精度越高，所引入的冗余度就越高。The essence of the matching process realized by Table 1 is to replace the single-description quantizer (corresponding to the quantized index value l) with two multi-description quantizers (corresponding to the quantized index value i and j respectively), which The correspondence between the quantization intervals of the three quantizers is shown in FIG. 13 . In practical applications, the redundancy introduced between multiple descriptions can be controlled by adjusting the quantization precision of multiple descriptions. Usually, this adjustment can be realized by designing different tables. In general, the higher the quantization accuracy of multiple descriptions, the higher the redundancy introduced.

在解码器一侧，解码处理过程可以会有三种具体形式：On the decoder side, the decoding process can take three specific forms:

(1)当接收到i和j时，中心解码器1105可以根据表1查找到唯一的l值，可无损的解出信号l。(1) When receiving i and j, the central decoder 1105 can find a unique value of l according to Table 1, and can decode the signal l losslessly.

具体而言，中心解码器1105首先在表1中找到i和j分别对应的行和列，然后再找该行和该列在表1中相交处的值，并将该值作为多描述标量量化前的值l。Specifically, the central decoder 1105 first finds the row and column corresponding to i and j in Table 1, and then finds the value at the intersection of the row and the column in Table 1, and quantizes the value as a multi-description scalar The previous value l.

(2)当只接收到i时，收到i的解码器可以根据表1中的数据估算出l的值。估算的方法有很多种，如：将表1中的i所对应的每行数据的平均值作为l的值；也可以将表1中的i所对应的每行数据的最大值或最小值作为l的值。(2) When only i is received, the decoder that receives i can estimate the value of l according to the data in Table 1. There are many methods of estimation, such as: take the average value of each row of data corresponding to i in Table 1 as the value of l; you can also use the maximum or minimum value of each row of data corresponding to i in Table 1 as The value of l.

(3)当只接收到j时，具体的解码过程与只接收到i时的解码过程相同。(3) When only j is received, the specific decoding process is the same as when only i is received.

需要说明的是，在设计解码算法时，可以根据信源的特点来具体确定最优的估算方法。It should be noted that when designing a decoding algorithm, the optimal estimation method can be specifically determined according to the characteristics of the information source.

图11中所描述的编解码原理可以应用于图14、15中；实际上，图14、15所描述的编解码过程只是对图11中的编解码原理的一个具体应用。The encoding and decoding principle described in Figure 11 can be applied to Figures 14 and 15; in fact, the encoding and decoding process described in Figures 14 and 15 is only a specific application of the encoding and decoding principle in Figure 11.

参见图14，图14是以图11为基础的多描述编码器结构及原理图。图14中，双描述标量量化器1401对收到的剩余信号进行标量量化处理，并将完成处理所生成的描述1、描述2分别发送给熵编码器1402、熵编码器1403；熵编码器1402对收到的描述1进行编码处理，并将完成编码处理后所生成的描述1的比特流发送出去；熵编码器1403对收到的描述2进行编码处理，并将完成编码处理后所生成的描述2的比特流发送出去。Referring to FIG. 14 , FIG. 14 is a structure and schematic diagram of a multi-description encoder based on FIG. 11 . In Fig. 14, the dual-description scalar quantizer 1401 performs scalar quantization processing on the received residual signal, and sends the description 1 and description 2 generated after the processing to the entropy encoder 1402 and the entropy encoder 1403 respectively; the entropy encoder 1402 Encode the received description 1, and send the bit stream of description 1 generated after the encoding process is completed; the entropy encoder 1403 encodes the received description 2, and send the bit stream generated after the encoding process is completed. The bitstream describing 2 is sent.

其中，N为剩余信号的个数，同时也是音频分析窗长的一半。Wherein, N is the number of remaining signals, and is also half of the audio analysis window length.

M₁(k₁)，k₁＝1，2，3，......N；M ₁ (k ₁ ), k ₁ =1, 2, 3, . . . N;

M₂(k₂)，k₂＝1，2，3，......N。M ₂ (k ₂ ), k ₂ =1, 2, 3, . . . N.

在实际应用中，图14所示原理可以表述为以下步骤：In practical application, the principle shown in Figure 14 can be expressed as the following steps:

步骤1、初始化循环变量k：k＝1。Step 1. Initialize the loop variable k: k=1.

步骤2、将剩余信号R(k)作为索引值，根据表2查找匹配的索引对M₁(k₁)，M₂(k₂)；其中k₁＝k₂＝k。M₁(k₁)，M₂(k₂)索引对正是需要进行编码处理的两个描述信号，并且该索引对相当于图11中的索引对(i，j)。Step 2. Using the remaining signal R(k) as an index value, search for a matching index pair M ₁ (k ₁ ), M ₂ (k ₂ ) according to Table 2; where k ₁ =k ₂ =k. The M ₁ (k ₁ ), M ₂ (k ₂ ) index pair are exactly the two description signals that need to be encoded, and this index pair is equivalent to the index pair (i, j) in FIG. 11 .

表2多描述标量量化表Table 2 Multi-description scalar quantization table

步骤3、令k＝k+1，如果k＜N，转到步骤2；否则，转到步骤4。Step 3. Let k=k+1, if k<N, go to step 2; otherwise, go to step 4.

步骤4、由收到M₁(k₁)、M₂(k₂)的熵编码器分别对M₁(k₁)、M₂(k₂)进行编码处理，并将编码所得的数据和各自的掩蔽门限信号一起形成描述的比特流发送出去。Step 4. The entropy encoder that receives M ₁ (k ₁ ) and M ₂ ( _k ₂ ) encodes M ₁ (k 1 ) and M ₂ (k ₂ ) respectively, and combines the encoded data with their respective The masking threshold signals together form the described bit stream and send it out.

在多描述标量量化的过程中，表2的设计起着关键的作用。表2中对角线上的各数据之间的差值越小，描述的量化精度就越高，相同音质的编码速率就越高。In the process of multi-description scalar quantization, the design of Table 2 plays a key role. The smaller the difference between the data on the diagonal in Table 2, the higher the described quantization accuracy and the higher the encoding rate with the same sound quality.

与图14中描述的编码过程相对应的解码过程如图15所示。参见图15，图15是以图11为基础的多描述解码器结构及原理图。图15中，熵解码器1501对收到的描述1的比特流进行解码处理，并将完成解码处理后所生成的描述1发送给多描述标量量化边解码器1503、多描述标量量化中心解码器1504；熵解码器1502对收到的描述2的比特流进行解码处理，并将完成解码处理后所生成的描述2发送给多描述标量量化边解码器1505、多描述标量量化中心解码器1504。The decoding process corresponding to the encoding process described in FIG. 14 is shown in FIG. 15 . Referring to FIG. 15 , FIG. 15 is a structural and schematic diagram of a multi-description decoder based on FIG. 11 . In Fig. 15, the entropy decoder 1501 decodes the received bit stream of description 1, and sends the description 1 generated after the decoding processing to the multi-description scalar quantization side decoder 1503 and the multi-description scalar quantization central decoder 1504 ; the entropy decoder 1502 decodes the received bit stream of description 2, and sends the generated description 2 to the multi-description scalar quantization side decoder 1505 and the multi-description scalar quantization central decoder 1504 .

图15中所示的多描述标量量化边解码器和多描述标量量化中心解码器输出信号的开关位置可根据接收到描述的情况自动进行选择，如：当只接收到一个描述时，就用收到该描述的多描述标量量化边解码器对该描述进行标量量化解码，并连通该多描述标量量化边解码器的输出端以输出经解码后所生成的完成重构的剩余信号；当两个描述都接收到时，就用收到这两个描述的多描述标量量化中心解码器1504对这两个描述进行双描述标量量化解码，并连通多描述标量量化中心解码器1504的输出端以输出经解码后所生成的完成重构的剩余信号。The switch positions of the multi-description scalar quantization side decoder and the multi-description scalar quantization center decoder output signal shown in Figure 15 can be automatically selected according to the situation of the received description, such as: when only one description is received, the received The multi-description scalar quantization side decoder to the description performs scalar quantization decoding on the description, and connects the output end of the multi-description scalar quantization side decoder to output the reconstructed residual signal generated after decoding; when two When the descriptions are all received, the multi-description scalar quantization central decoder 1504 that receives the two descriptions is used to perform dual-description scalar quantization decoding on the two descriptions, and the output terminal of the multi-description scalar quantization central decoder 1504 is connected to output The reconstructed residual signal generated after decoding.

具体而言，图15所示原理可以表述为以下步骤：Specifically, the principle shown in Figure 15 can be expressed as the following steps:

步骤1、设置k＝1。Step 1. Set k=1.

步骤2、根据接收到描述比特流的情况分别处理：Step 2. Process respectively according to the situation of receiving the description bit stream:

如果接收到了两个描述：M₁(k₁)和M₂(k₂)，根据表2可以唯一查到剩余信号R(k)的值；If two descriptions are received: M ₁ (k ₁ ) and M ₂ (k ₂ ), the value of the remaining signal R(k) can be found uniquely according to Table 2;

如果只接收到了描述1：M₁(k₁)，根据表2查到描述1所对应的行，并将该行中绝对值最小的值作为剩余信号R(k)的值；If only description 1: M ₁ (k ₁ ) is received, find the row corresponding to description 1 according to Table 2, and use the smallest absolute value in this row as the value of the remaining signal R(k);

如果只接收到了描述2：M₂(k₂)，根据表2查到描述2所对应的列，并将该列中绝对值最小的值作为剩余信号R(k)的值；If only description 2: M ₂ (k ₂ ) is received, find the column corresponding to description 2 according to Table 2, and use the minimum absolute value in this column as the value of the remaining signal R(k);

其中k₁＝k₂＝k。where k ₁ =k ₂ =k.

步骤4、结束。Step 4, end.

以上所述为针对两个描述的标量量化原理和方法；在实际应用中，当信道丢包率大于25％甚至更高时，如果仍采用两个描述的标量量化方式，那么当两个描述间断甚至连续同时丢失时，音频通信所必需的音频连续性将受到严重影响。在这种情况下，如果能够增加描述的个数，将可以有效避免较高丢包率所带来的音质下降和音频不连续等问题；可见，包含三个或者三个以上描述的算法构架在实际应用中通常也是十分必要的。The above is the principle and method of scalar quantization for the two descriptions; in practical applications, when the channel packet loss rate is greater than 25% or even higher, if the scalar quantization method of the two descriptions is still used, then when the two descriptions are intermittent Even when consecutive simultaneous losses occur, the audio continuity necessary for audio communication will be severely affected. In this case, if the number of descriptions can be increased, problems such as sound quality degradation and audio discontinuity caused by higher packet loss rates can be effectively avoided; it can be seen that the algorithm framework containing three or more descriptions is in It is often necessary in practical applications.

为此，下面以三个描述为例阐述多描述标量量化的原理和方法；至于其它类型以及更多描述的算法可以此为参考进行设计。并且，在这些算法描述中，多描述编解码的对象主要是剩余信号；当然，在实际应用中，多描述编解码的对象也可以是掩蔽门限信号或其它音频信号分量。To this end, the following three descriptions are taken as examples to illustrate the principles and methods of multi-description scalar quantization; as for other types and algorithms with more descriptions, this can be used as a reference for design. Moreover, in these algorithm descriptions, the object of multi-description coding and decoding is mainly the residual signal; of course, in practical applications, the object of multi-description coding and decoding can also be the masking threshold signal or other audio signal components.

参见图16，图16是本发明另一较佳实施例的音频信号处理原理图。图16中，x是原始音频信号，l是经过量化编码后所得到的索引；匹配器a(·) 1603用于将索引l匹配成索引集合(h，i，j)；单描述边解码器用于对接收到的h、i或j进行解码，产生完成解码后的信号

或

双描述边解码器用于对接收到的h和i、h和j、i和j这样的两个描述进行解码，产生完成解码后的信号

中心解码器用于对接收到的全部三个描述信号h、i和j进行解码，产生完成解码后的信号

Referring to Fig. 16, Fig. 16 is a schematic diagram of audio signal processing in another preferred embodiment of the present invention. In Fig. 16, x is the original audio signal, and l is the index obtained after quantization and encoding; the matcher a(·) 1603 is used to match the index l into an index set (h, i, j); It is used to decode the received h, i or j, and generate the decoded signal

or

The dual description side decoder is used to decode the received two descriptions of h and i, h and j, i and j, and generate the decoded signal

The central decoder is used to decode all three received description signals h, i, and j to generate a decoded signal

图16所示的多描述标量量化过程与图11所示的多描述标量量化过程原理相同，在此不再赘述。实际上，无论是双描述、三描述还是更多描述的编解码过程，其操作原理都是相同的：The multi-description scalar quantization process shown in FIG. 16 is the same in principle as the multi-description scalar quantization process shown in FIG. 11 , and will not be repeated here. In fact, the principle of operation is the same whether it is a double-description, triple-description or more-description encoding and decoding process:

针对编码过程而言，根据要生成的描述的数量，分别将剩余信号和掩蔽门限信号编码为与所述描述数量相同的剩余信号描述和掩蔽门限信号描述，再分别将各剩余信号描述与其中一路掩蔽门限信号描述合路；最终生成多个描述，并且每个描述中都包含有剩余信号和掩蔽门限信号。For the encoding process, according to the number of descriptions to be generated, the residual signal and the masking threshold signal are respectively encoded into the remaining signal description and the masking threshold signal description with the same number of descriptions, and then each residual signal description is combined with one of the The masked-threshold signal descriptions are combined; multiple descriptions are finally generated, and each description contains the residual signal and the masked-threshold signal.

针对解码过程而言，将接收到的各描述中的剩余信号描述和掩蔽门限信号描述分路，再将分路后所生成的所有剩余信号描述解码为一路剩余信号，还将分路后所生成的所有掩蔽门限信号描述解码为一路掩蔽门限信号。For the decoding process, the remaining signal descriptions and masking threshold signal descriptions in the received descriptions are split, and then all the remaining signal descriptions generated after splitting are decoded into one residual signal, and the generated after splitting All masking threshold signal descriptions of are decoded into one masking threshold signal.

图16中所描述的编解码原理可以应用于图17、18中；实际上，图17、18所描述的编解码过程只是对图16中的编解码原理的一个具体应用。The encoding and decoding principle described in Figure 16 can be applied to Figures 17 and 18; in fact, the encoding and decoding process described in Figures 17 and 18 is only a specific application of the encoding and decoding principle in Figure 16.

参见图17，图17是以图16为基础的多描述编码器结构及原理图。图17中，三描述标量量化器1701对收到的剩余信号进行三描述标量量化处理，并将完成处理所生成的描述1、描述2、描述3分别发送给熵编码器1702、熵编码器1703、熵编码器1704；熵编码器1702、1703、1704分别对收到的描述进行编码处理，并将完成编码处理后所生成的描述比特流发送出去。Referring to FIG. 17 , FIG. 17 is a structure and principle diagram of a multi-description encoder based on FIG. 16 . In Fig. 17, the three-description scalar quantizer 1701 performs three-description scalar quantization processing on the received residual signal, and sends the description 1, description 2, and description 3 generated after the processing to the entropy encoder 1702 and the entropy encoder 1703 respectively . Entropy encoder 1704; entropy encoders 1702, 1703, and 1704 respectively encode the received description, and send out the description bit stream generated after the encoding process is completed.

具体而言，剩余信号用R(k)表示：R(k)，k＝1，2，3，......N；Specifically, the remaining signal is represented by R(k): R(k), k=1, 2, 3,...N;

其中，N为剩余信号的个数，并且也是音频分析窗长的一半。Wherein, N is the number of remaining signals, and is also half of the audio analysis window length.

剩余信号的三个多描述算法信号分别用两个描述M₁(k₁)、M₂(k₂)和M₃(k₃)表示：The three multi-description algorithm signals of the remaining signal are respectively represented by two descriptions M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ):

M₂(k₂)，k₂＝1，2，3，......N；M ₂ (k ₂ ), k ₂ =1, 2, 3, ... N;

M₃(k₃)，k₃＝1，2，3，......N。M ₃ (k ₃ ), k ₃ =1, 2, 3, . . . N.

在实际应用中，图17所示原理可以表述为以下步骤：In practical application, the principle shown in Figure 17 can be expressed as the following steps:

步骤2、将剩余信号R(k)作为索引值，并查找匹配的三维坐标点M₁(k₁)，M₂(k₂)和M₃(k₃)；其中k₁＝k₂＝k₃＝k。三维坐标点M₁(k₁)，M₂(k₂)和M₃(k₃)正是需要进行编码处理的三个描述信号；并且三维坐标点M₁(k₁)，M₂(k₂)和M₃(k₃)相当于图16中的三个描述信号(h，i，j)。Step 2. Use the remaining signal R(k) as an index value, and search for matching three-dimensional coordinate points M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ); where k ₁ =k ₂ =k ₃ = k. The three-dimensional coordinate points M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ) are just the three description signals that need to be encoded; and the three-dimensional coordinate points M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ) correspond to the three description signals (h, i, j) in FIG. 16 .

步骤4、由收到M₁(k₁)、M₂(k₂)、M₃(k₃)的熵编码器分别对M₁(k₁)、M₂(k₂)、M₃(k₃)进行编码处理，并将编码所得的数据和各自的掩蔽门限信号一起形成描述的比特流发送出去。Step 4. Entropy encoders that receive M ₁ (k ₁ ), M ₂ (k ₂ ), and M ₃ (k ₃ ) respectively encode M ₁ (k ₁ ), M ₂ (k ₂ ), M ₃ (k 3 ₃ ) Perform coding processing, and send the coded data together with their respective masking threshold signals to form a described bit stream.

与图17中描述的编码过程相对应的解码过程如图18所示。参见图18，图18是以图16为基础的多描述解码器结构及原理图。图18中，熵解码器1801对收到的描述1的比特流进行解码处理，并将完成解码处理后所生成的描述1发送给多描述标量量化边解码器1804、1806、1809和多描述标量量化中心解码器1807；熵解码器1802对收到的描述2的比特流进行解码处理，并将完成解码处理后所生成的描述2发送给多描述标量量化边解码器1805、1806、1808和多描述标量量化中心解码器1807；熵解码器1803对收到的描述3的比特流进行解码处理，并将完成解码处理后所生成的描述3发送给多描述标量量化边解码器1808、1809、1810和多描述标量量化中心解码器1807。The decoding process corresponding to the encoding process described in FIG. 17 is shown in FIG. 18 . Referring to FIG. 18 , FIG. 18 is a structure and principle diagram of a multi-description decoder based on FIG. 16 . In Fig. 18, the entropy decoder 1801 decodes the received bit stream of description 1, and sends the generated description 1 to the multi-description scalar quantization side decoders 1804, 1806, 1809 and multi-description scalar Quantization center decoder 1807; entropy decoder 1802 decodes the received bitstream of description 2, and sends description 2 generated after decoding processing to multi-description scalar quantization side decoders 1805, 1806, 1808 and multiple Descriptor scalar quantization central decoder 1807; entropy decoder 1803 decodes the received bit stream of description 3, and sends description 3 generated after decoding processing to multi-description scalar quantization side decoders 1808, 1809, 1810 and multi-description scalar quantization center decoder 1807.

图18中所示的多描述标量量化边解码器和多描述标量量化中心解码器输出信号的开关位置可根据接收到描述的情况自动进行选择，如：当只接收到一个或两个描述时，就用收到所述描述的多描述标量量化边解码器对该描述进行解码，并连通该多描述标量量化边解码器的输出端以输出经解码后所生成的完成重构的剩余信号；当三个描述都接收到时，就用收到这三个描述的多描述标量量化中心解码器1807对这三个描述进行解码，并连通多描述标量量化中心解码器1807的输出端以输出经解码后所生成的完成重构的剩余信号。The switch positions of the multi-description scalar quantization side decoder and the multi-description scalar quantization center decoder output signal shown in Figure 18 can be automatically selected according to the situation of the received description, such as: when only one or two descriptions are received, Decode the description with the multi-description scalar quantization side decoder receiving the description, and connect the output end of the multi-description scalar quantization side decoder to output the reconstructed residual signal generated after decoding; when When all three descriptions are received, the multi-description scalar quantization central decoder 1807 that receives these three descriptions is used to decode the three descriptions, and the output port of the multi-description scalar quantization central decoder 1807 is connected to output the decoded The remaining signal generated after the completion of the reconstruction.

具体而言，图18所示原理可以表述为以下步骤：Specifically, the principle shown in Figure 18 can be expressed as the following steps:

步骤1、设置k＝1。Step 1. Set k=1.

如果接收到了三个描述：M₁(k₁)、M₂(k₂)和M₃(k₃)，根据收到的描述可唯一查找到剩余信号R(k)的值；If three descriptions are received: M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ), the value of the remaining signal R(k) can be found uniquely according to the received descriptions;

如果只接收到M₁(k₁)、M₂(k₂)和M₃(k₃)中的两个描述，根据收到的描述查到对应的轴，并把该轴上绝对值最小的值作为剩余信号R(k)的值；If only two descriptions in M ₁ (k ₁ ), M ₂ (k ₂ ) and M ₃ (k ₃ ) are received, find the corresponding axis according to the received description, and put the axis with the smallest absolute value value as the value of the residual signal R(k);

如果只接收到了一个描述：M₁(k₁)、M₂(k₂)或M₃(k₃)，根据收到的描述查到对应的平面，并把该平面上绝对值最小的值作为剩余信号R(k)的值；If only one description is received: M ₁ (k ₁ ), M ₂ (k ₂ ) or M ₃ (k ₃ ), find the corresponding plane according to the received description, and take the value with the smallest absolute value on the plane as the value of the residual signal R(k);

其中k₁＝k₂＝k₃＝k。where k ₁ =k ₂ =k ₃ =k.

步骤4、结束。Step 4, end.

需要说明的是：在三描述标量量化的过程中，图19所示的设计在根据描述查找剩余信号的值时起着关键的作用。参见图19，图19是本发明一较佳实施例的多描述标量量化器编解码立体图表。图19中，三个正交的坐标轴分别代表着三个描述：M₁(k₁)、M₂(k₂)、M₃(k₃)。图19的设计原理和表2的基本一样，区别只在于由表2中的对角线变成了分别与三个坐标轴成60度角的一条直线，这条直线上的各数据之间的差值越小，说明其周围分布的数据越少，每个描述的值对应的剩余信号的值的个数就越少，描述的量化精度就越高，相同音质的编码速率也就越高。It should be noted that in the process of three-description scalar quantization, the design shown in Figure 19 plays a key role in finding the value of the remaining signal according to the description. Referring to Fig. 19, Fig. 19 is a three-dimensional diagram of encoding and decoding of a multi-description scalar quantizer according to a preferred embodiment of the present invention. In Fig. 19, three orthogonal coordinate axes respectively represent three descriptions: M ₁ (k ₁ ), M ₂ (k ₂ ), and M ₃ (k ₃ ). The design principle of Figure 19 is basically the same as that of Table 2, the only difference is that the diagonal line in Table 2 becomes a straight line at an angle of 60 degrees to the three coordinate axes, and the distance between the data on this straight line The smaller the difference, the less data is distributed around it, the less the number of remaining signal values corresponding to each described value, the higher the quantization accuracy of the description, and the higher the encoding rate of the same sound quality.

在实际应用中，以上所述的所有多描述编解码算法均可保证：在理想无丢包的情况下利用接收到的两个描述正常地对音频信号进行解码，也可以在一定的丢包率范围内利用接收到的一个描述去估算另外一个丢掉的描述，进而利用描述之间的相关性恢复和重构音频信号。然而，当丢包率继续增大时，两个描述可能会同时丢失，这时多描述解码器也很难较好地进行音频解码与音频重构。因此，为了在丢包严重时改善音质，可以进一步对多描述编解码算法进行改进和完善；为此，下面提出几种基于多描述编码的丢包隐藏处理算法：In practical applications, all the multi-description encoding and decoding algorithms mentioned above can guarantee: in the ideal case of no packet loss, the audio signal can be decoded normally by using the two descriptions received, and can be In the scope, one received description is used to estimate another lost description, and then the correlation between descriptions is used to recover and reconstruct the audio signal. However, when the packet loss rate continues to increase, two descriptions may be lost at the same time, and it is difficult for a multi-description decoder to perform audio decoding and audio reconstruction well. Therefore, in order to improve the sound quality when the packet loss is serious, the multi-description coding and decoding algorithm can be further improved and perfected; therefore, several packet loss concealment processing algorithms based on multi-description coding are proposed below:

1.在前述的奇偶分离多描述编码算法中，由于奇描述与偶描述是完全独立的两个描述，它们之间没有任何相关性；因此，在其中的一个描述丢失时，无法利用接收到的另一个描述估算出丢失的描述。为此，可以采用如下丢包隐藏处理算法：1. In the aforementioned odd-even separation multi-description encoding algorithm, since the odd and even descriptions are completely independent two descriptions, there is no correlation between them; therefore, when one of the descriptions is lost, the received Another description estimates the missing description. To this end, the following packet loss concealment processing algorithm can be used:

用前一帧正常接收的描述代替当前帧丢失的描述；或者，用前一帧正常接收的描述乘以一个衰减因子(可以设置为：在0.5至0.9之间)，用乘得的值代替当前帧丢失的描述；或者，用当前帧正常接收的描述与0的线性插值代替当前帧丢失的描述。Replace the description of the current frame loss with the description of the normal reception of the previous frame; or, multiply the description of the normal reception of the previous frame by an attenuation factor (can be set to: between 0.5 and 0.9), and replace the current frame with the multiplied value The description of frame loss; or, replace the description of current frame loss with the description of normal reception of the current frame and the linear interpolation of 0.

2.当因丢包严重而导致两个描述同时丢失时，分别采用前一帧正常接收的描述乘以一个衰减因子(可以设置为：在0.5至0.9之间)，用乘得的值分别代替当前帧丢失的描述，并以此估算当前帧音频参数或音频信号。2. When two descriptions are lost at the same time due to severe packet loss, the descriptions received normally in the previous frame are multiplied by an attenuation factor (can be set between 0.5 and 0.9), and the multiplied values are used to replace The description of the current frame loss, and use it to estimate the audio parameters or audio signals of the current frame.

3.当因丢包严重而导致某个描述连续丢失时，采用按帧递减的算法，利用最后一帧正常接收的描述按帧乘以一个衰减因子(可以设置为：在0.5至0.9之间)，用乘得的值分别代替当前帧丢失的描述。3. When a certain description is continuously lost due to severe packet loss, the frame-by-frame decrement algorithm is used, and the description of the last frame received normally is multiplied by an attenuation factor by frame (can be set to: between 0.5 and 0.9) , replace the description of the current frame loss with the multiplied value respectively.

以上各图中，如出现熵编/解码器的，实际上可以只应用一个熵编/解码器对描述进行编/解码处理，而不是像图中那样应用两个甚至多个熵编/解码器；并且，也可以将熵编/解码器替换为其它类型的编/解码器；再有，在对所述剩余信号进行处理之前，可以进一步对其进行取整、量化和/或编码处理。In the above figures, if there is an entropy encoder/decoder, in fact, only one entropy encoder/decoder can be used to encode/decode the description, instead of two or more entropy encoders/decoders as shown in the figure and, the entropy encoder/decoder can also be replaced by another type of encoder/decoder; moreover, before processing the residual signal, it can be further rounded, quantized and/or encoded.

由以上所述可以看出，本发明所提供的音频信号处理方法、系统以及音频信号收发装置，均可有效提高通信过程中的音频质量，提高用户满意度。It can be seen from the above description that the audio signal processing method, system and audio signal transceiving device provided by the present invention can effectively improve the audio quality in the communication process and improve user satisfaction.

Claims

1. An audio signal processing method characterized by:

at an audio signal sending end, coding a residual signal and a masking threshold signal obtained by processing the audio signal into a plurality of paths of residual signal descriptions and a plurality of paths of masking threshold signal descriptions respectively, and combining the residual signal descriptions and one path of the masking threshold signal descriptions respectively to generate a plurality of paths of descriptions which all comprise the residual signal descriptions and the masking threshold signal descriptions;

at an audio signal receiving end, branching all the received residual signal description and masking threshold signal description contained in each description in the descriptions which all contain residual signal description and masking threshold signal description, decoding all the residual signal descriptions generated after branching into a path of residual signal, decoding all the masking threshold signal descriptions generated after branching into a path of masking threshold signal, and performing parameter reconstruction and time-frequency synthesis processing on the residual signal and the masking threshold signal obtained after decoding to generate a reconstructed audio signal.

2. A method as claimed in claim 1, characterized in that the residual signal is encoded into the multi-residual signal description by:

carrying out odd-even separation processing on the residual signals, and carrying out entropy coding processing on the multi-channel residual signal description generated by the processing;

the method for encoding the masking threshold signal into a multi-path masking threshold signal description comprises the following steps:

and carrying out odd-even separation processing on the masking threshold signal, and carrying out entropy coding processing on the multi-path masking threshold signal description generated by the processing.

3. A method as claimed in claim 2, characterized in that the decoding of all residual signal descriptions into one residual signal is performed by:

entropy decoding all the residual signal descriptions, performing parity synthesis processing on all the residual signal descriptions after decoding processing, and synthesizing all the residual signal descriptions after parity synthesis processing into one residual signal;

the method for decoding all the masking threshold signal descriptions into a path of masking threshold signals comprises the following steps:

and performing entropy decoding processing on all the masking threshold signal descriptions, performing parity synthesis processing on all the masking threshold signal descriptions after the decoding processing is completed, and synthesizing all the masking threshold signal descriptions after the parity synthesis processing into a masking threshold signal.

4. A method as claimed in claim 1, characterized in that the residual signal is encoded into the multi-residual signal description by:

carrying out signal pair division processing on the residual signals, carrying out dual transformation on the multi-channel residual signals generated after the processing is finished, generating multi-channel residual signal descriptions with corresponding channels, and carrying out entropy coding processing on the generated multi-channel residual signal descriptions;

and carrying out signal pair division processing on the masking threshold signals, carrying out dual transformation on the multi-path masking threshold signals generated after the processing is finished, generating multi-path masking threshold signal descriptions of corresponding paths, and carrying out entropy coding processing on the generated multi-path masking threshold signal descriptions.

5. The method of claim 4, wherein decoding all residual signal descriptions into one residual signal comprises:

entropy decoding all the residual signal descriptions, dual inverse transformation processing all the residual signal descriptions after decoding processing, and synthesizing all the residual signal descriptions after the dual inverse transformation processing into a path of residual signal;

and performing entropy decoding processing on all the masking threshold signal descriptions, performing dual inverse transformation processing on all the masking threshold signal descriptions after the decoding processing is completed, and synthesizing all the masking threshold signal descriptions after the dual inverse transformation processing into a masking threshold signal.

6. The method of any of claims 2 to 5, wherein the multi-path description is a two-path description.

7. A method as claimed in claim 1, characterized in that the residual signal is encoded into the multi-residual signal description by:

carrying out multi-description scalar quantity quantization coding processing on the residual signals, and carrying out entropy coding processing on the multi-path residual signal description generated after the processing is finished;

and carrying out multi-description scalar quantity quantization coding processing on the masking threshold signal, and carrying out entropy coding processing on the multi-path masking threshold signal description generated after the processing is finished.

8. The method of claim 7 wherein decoding all residual signal descriptions into a residual signal comprises:

entropy decoding all residual signal descriptions, performing multi-description scalar quantization decoding on all residual signal descriptions after decoding processing, and decoding all residual signal descriptions after the multi-description scalar quantization decoding processing into one path of residual signal;

and performing entropy decoding processing on all the masking threshold signal descriptions, performing multi-description scalar quantity quantization decoding processing on all the masking threshold signal descriptions after the decoding processing is completed, and decoding all the masking threshold signal descriptions after the multi-description scalar quantity quantization decoding processing into one path of masking threshold signals.

9. The method of claim 7 or 8, wherein the multiple descriptions are two or more descriptions.

10. The method of claim 1, wherein the residual signal is obtained by performing time-frequency analysis on an original audio signal and residual signal analysis;

11. the method of claim 10, wherein:

the time-frequency analysis method comprises the following steps: processing the original audio signal including Modified Discrete Cosine Transform (MDCT) to obtain time-frequency transform parameters;

the method for analyzing the residual signal comprises the following steps: and removing auditory incoherent information or incoherent degree in the time-frequency transformation parameters.

12. The method of claim 1, wherein the masking threshold signal is obtained by performing a psychoacoustic model analysis on an original audio signal.

13. The method of claim 1, further comprising performing distortion-free encoding and audio packet processing on the plurality of residual signal descriptions and the plurality of masking threshold signal descriptions generated at the transmitting end.

14. The method of claim 13, wherein audio packet de-packetization and distortion-free decoding are further performed before splitting at the receiving end.

15. An audio signal processing system is characterized by comprising a multiple description encoder positioned at an audio signal transmitting end, a multi-description encoder for residual signals and a multi-description encoder for masking threshold signals, wherein the multiple description encoder for residual signals and the multi-description encoder for masking threshold signals are connected with a combiner; the multi-description decoder is positioned at the audio signal receiving end and consists of a residual signal multi-description decoder and a masking threshold signal multi-description decoder which are connected with the splitter;

the device comprises a residual signal/masking threshold signal multi-description encoder, a combiner and a residual signal/masking threshold signal multi-description encoder, wherein the residual signal/masking threshold signal multi-description encoder is used for encoding a received residual signal into a plurality of paths of residual signal descriptions, encoding a received masking threshold signal into a plurality of paths of masking threshold signal descriptions and sending each path of descriptions generated after encoding to the combiner;

the combiner is used for respectively combining the received residual signal descriptions with one of all the masking threshold signal descriptions to generate a plurality of paths of descriptions which all contain residual signal descriptions and masking threshold signal descriptions;

a splitter, configured to split the residual signal description and the masking threshold signal description contained in each of the descriptions that all contain the residual signal description and the masking threshold signal description, and send all the residual signal descriptions and the masking threshold signal descriptions generated after splitting to a residual signal/masking threshold signal multiple description decoder;

a residual signal/masking threshold signal multi-description decoder, for decoding all residual signal descriptions received into a path of residual signal, and decoding all masking threshold signal descriptions received into a path of masking threshold signal;

the multi-description decoder is further connected with a parameter reconstruction module connected with a time-frequency synthesis module;

the parameter reconstruction module is used for receiving the residual signal and the masking threshold signal generated by the multi-description decoder, performing parameter reconstruction processing on the received signal, and sending the time-frequency transformation parameter generated after the processing to the time-frequency synthesis module;

and the time-frequency synthesis module is used for carrying out time-frequency synthesis processing on the received time-frequency transformation parameters to generate a reconstructed audio signal.

16. The system of claim 15, wherein said residual signal/masking threshold signal multiple description coder comprises a connected parity separation module, an entropy coder;

the parity separation module is used for performing parity separation processing on the residual signals, sending the multi-channel residual signal description generated by the processing to the entropy coder, performing parity separation processing on the masking threshold signal, and sending the multi-channel masking threshold signal description generated by the processing to the entropy coder;

and the entropy coder is used for performing entropy coding processing on the received multi-path residual signal description and the multi-path masking threshold signal description.

17. The system of claim 16, wherein the residual signal/masking threshold signal multiple description decoder comprises a connected entropy decoder, parity synthesis module;

the entropy decoder is used for performing entropy decoding processing on all residual signal descriptions, sending all residual signal descriptions subjected to decoding processing to the parity synthesis module, performing entropy decoding processing on all masking threshold signal descriptions, and sending all masking threshold signal descriptions subjected to decoding processing to the parity synthesis module;

the parity synthesis module is used for performing parity synthesis processing on all the received residual signal descriptions, synthesizing all the residual signal descriptions subjected to the parity synthesis processing into a path of residual signal, performing parity synthesis processing on all the received masking threshold signal descriptions, and synthesizing all the masking threshold signal descriptions subjected to the parity synthesis processing into a path of masking threshold signal.

18. The system of claim 15, wherein said residual signal/masking threshold signal multiple description coder comprises a residual signal/masking threshold signal pair-dividing module, a dual transform module and an entropy coder connected in sequence;

the device comprises a residual signal/masking threshold signal pair division module, a multi-description dual transformation module and a multi-description dual transformation module, wherein the residual signal/masking threshold signal pair division module is used for carrying out signal pair division processing on the residual signal, sending the multi-path residual signal generated after the processing is finished to the multi-description dual transformation module, carrying out signal pair division processing on the masking threshold signal and sending the multi-path masking threshold signal generated after the processing is finished to the multi-description dual transformation module;

the multi-description dual transformation module is used for respectively carrying out dual transformation on the received multi-channel residual signals to generate multi-channel residual signal descriptions with corresponding paths, then sending the generated multi-channel residual signal descriptions to the entropy coder, respectively carrying out dual transformation on the received multi-channel masking threshold signals to generate multi-channel masking threshold signal descriptions with corresponding paths, and then sending the generated multi-channel masking threshold signal descriptions to the entropy coder;

19. The system of claim 18, wherein the residual signal/masking threshold signal multiple description decoder comprises an entropy decoder, a dual inverse transform decoder, and a residual signal/masking threshold signal synthesis module connected in series;

the entropy decoder is used for performing entropy decoding processing on all residual signal descriptions, sending all residual signal descriptions generated after the decoding processing to the multi-description dual inverse transform decoder, performing entropy decoding processing on all masking threshold signal descriptions, and sending all the masking threshold signal descriptions after the decoding processing to the multi-description dual inverse transform decoder;

the multi-description dual inverse transformation decoder is used for carrying out dual inverse transformation processing on all received residual signal descriptions, sending all residual signal descriptions subjected to the dual inverse transformation processing to the residual signal/masking threshold signal synthesis module, carrying out dual inverse transformation processing on all received masking threshold signal descriptions, and sending all masking threshold signal descriptions subjected to the dual inverse transformation processing to the residual signal/masking threshold signal synthesis module;

and the residual signal/masking threshold signal synthesis module is used for describing and synthesizing all the received residual signals into a residual signal and describing and synthesizing all the received masking threshold signals into a masking threshold signal.

20. The system of claim 15, wherein said residual signal/masking threshold signal multiple description encoder comprises a concatenated multiple description scalar quantizer, entropy encoder;

the multi-description scalar quantizer is used for performing multi-description scalar quantization coding processing on the residual signals, sending the multi-path residual signal description generated after the processing to the entropy coder, performing multi-description scalar quantization coding processing on the masking threshold signals, and sending the multi-path masking threshold signal description generated after the processing to the entropy coder;

21. The system of claim 20, wherein the residual signal/masking threshold signal multiple description decoder comprises a concatenated entropy decoder, multiple description scalar quantization decoder;

the entropy decoder is used for performing entropy decoding processing on all residual signal descriptions, sending all residual signal descriptions subjected to decoding processing to the multi-description scalar quantization decoder, performing entropy decoding processing on all masking threshold signal descriptions, and sending all masking threshold signal descriptions subjected to decoding processing to the multi-description scalar quantization decoder;

and the multi-description scalar quantization decoder is used for performing multi-description scalar quantization decoding processing on all the received residual signal descriptions, decoding all the residual signal descriptions subjected to the multi-description scalar quantization decoding processing into one path of residual signals, performing multi-description scalar quantization decoding processing on all the received masking threshold signal descriptions, and decoding all the masking threshold signal descriptions subjected to the multi-description scalar quantization decoding processing into one path of masking threshold signals.

22. The system of claim 20, wherein:

the multiple description scalar quantizer is a multiple description scalar quantizer capable of processing two or more descriptions, and the multiple description scalar quantization decoder is a multiple description scalar quantization decoder capable of processing two or more descriptions.

23. The system of claim 15, wherein the multiple description coder is further coupled to a residual signal analysis module coupled to a time-frequency analysis module, and to a psychoacoustic module;

the time-frequency analysis module is used for performing time-frequency analysis on the original audio signal and sending time-frequency transformation parameters generated after the analysis is finished to the residual signal analysis module;

a residual signal analysis module, configured to perform residual signal analysis on the received time-frequency transform parameter and masking threshold signal, and send a residual signal generated after the analysis to the multi-description encoder;

and the psychoacoustic module is used for performing psychoacoustic model analysis on the original audio signal and sending the masking threshold signal generated after the analysis to the multi-description encoder and the residual signal analysis module.

24. The system of any of claims 15 to 23, wherein the multiple description coder is further coupled to an undistorted coding and audio packet processing module;

and the undistorted coding and audio packet processing module is used for receiving the description generated by the multi-description coder and carrying out undistorted coding and audio packet processing on the received description.

25. The system of claim 24, wherein the multiple description decoder is further coupled to an audio packet unpacking and distortion-free decoding module;

and the audio packet unpacking and distortion-free decoding module is used for carrying out audio packet unpacking and distortion-free decoding processing on the received multi-channel description.

26. An audio signal transmitting device is characterized in that the device consists of a residual signal multi-description encoder and a masking threshold signal multi-description encoder which are connected with a combiner;

a residual signal/masking threshold signal multi-description encoder for encoding the received residual signal into a plurality of residual signal descriptions, encoding the received masking threshold signal into a plurality of masking threshold signal descriptions, and sending each path of description generated after encoding to a combiner;

and the combiner is used for respectively combining the received residual signal descriptions with one of all the masking threshold signal descriptions to generate a plurality of paths of descriptions which all contain residual signal descriptions and masking threshold signal descriptions.

27. The apparatus of claim 26, wherein said residual signal/masking threshold signal multiple description coder comprises a connected parity separation module, an entropy coder;

28. The apparatus of claim 26, wherein said residue signal/masking threshold signal multiple description coder comprises a residue signal/masking threshold signal pair division module, a multiple description dual transform module and an entropy coder connected in sequence;

29. The apparatus of claim 26, wherein said residual signal/masking threshold signal multiple description encoder comprises a concatenated multiple description scalar quantizer, entropy encoder;

30. The apparatus of claim 29, wherein the multiple description scalar quantizer is a multiple description scalar quantizer capable of processing more than two descriptions.

31. The apparatus of claim 26, wherein said multiple description coder is further coupled to a residual signal analysis module coupled to a time-frequency analysis module, and to a psychoacoustic module;

32. The apparatus of any of claims 26 to 31, wherein the multiple description coder is further coupled to an undistorted coding and audio packet processing module;

and the undistorted coding and audio packet processing module is used for receiving the multi-path description generated by the multi-description coder and carrying out undistorted coding and audio packet processing on the received multi-path description.

33. An audio signal receiving apparatus, characterized in that the apparatus is composed of a residual signal multiple description decoder and a masking threshold signal multiple description decoder both connected to a splitter;

the device comprises a splitter, a residual signal/masking threshold signal multi-description decoder and a residual signal/masking threshold signal multi-description decoder, wherein the splitter is used for splitting residual signal descriptions and masking threshold signal descriptions contained in all descriptions which contain residual signal descriptions and masking threshold signal descriptions and sending all residual signal descriptions and masking threshold signal descriptions generated after splitting to the residual signal/masking threshold signal multi-description decoder;

34. The apparatus of claim 33, wherein the residual signal/masking threshold signal multiple description decoder comprises an entropy decoder, a parity synthesis module, connected;

and the parity synthesis module is used for performing parity synthesis processing on all the received residual signal descriptions, synthesizing all the residual signal descriptions subjected to the parity synthesis processing into a path of residual signal, performing parity synthesis processing on all the received masking threshold signal descriptions, and synthesizing all the masking threshold signal descriptions subjected to the parity synthesis processing into a path of masking threshold signal.

35. The apparatus of claim 33, wherein the residual signal/masking threshold signal multiple description decoder comprises an entropy decoder, a multiple description dual inverse transform decoder, and a residual signal/masking threshold signal synthesis module connected in sequence;

36. The apparatus of claim 33, wherein the residual signal/masking threshold signal multiple description decoder comprises a concatenated entropy decoder, multiple description scalar quantization decoder;

37. The apparatus of claim 36, wherein the multiple description scalar quantization decoder is a multiple description scalar quantization decoder capable of processing more than two descriptions.

38. The apparatus of claim 33, wherein the multiple description decoder is further coupled to an audio packet unpacking and distortion-free decoding module;

and the audio packet unpacking and distortion-free decoding module is used for carrying out audio packet unpacking and distortion-free decoding processing on the received description.