CN101964189B

CN101964189B - Audio signal switching method and device

Info

Publication number: CN101964189B
Application number: CN2010101634063A
Authority: CN
Inventors: 刘泽新; 苗磊; 胡晨; 吴文海; 郎玥; 张清
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2010-04-28
Filing date: 2010-04-28
Publication date: 2012-08-08
Anticipated expiration: 2030-04-28
Also published as: EP3249648A1; ES2635212T3; BR112012013306B8; JP2015045888A; JP6027081B2; KR20120074303A; ES2718947T3; WO2011134415A1; JP2013512468A; JP2017033015A; KR101377547B1; EP2485029A1; EP2485029A4; AU2011247719B2; JP5667202B2; BR112012013306A2; EP2485029B1; EP3249648B1; BR112012013306B1; CN101964189A

Abstract

The invention provides a voice and audio signal switching method and device. The voice and audio signal switching method includes: when the voice and audio signal is switched, weighting the first high frequency band signal of the current frame of voice and audio signal and the second high frequency band signal of the previous M frames of voice and audio signal, so as to be processed The processed first high frequency band signal; wherein, M is greater than or equal to 1; the processed first high frequency band signal and the first low frequency band signal of the current frame speech and audio signal are synthesized into a broadband signal. Process the first high-band signal of the current frame according to the second high-band signal of the speech and audio signal of the previous M frames, so that the second high-band signal of the speech and audio signal of the previous M frames can smoothly transition to the processed The first high-frequency band signal can smoothly switch voice and audio signals of different bandwidths, improving the quality of the user's listening to voice and audio signals.

Description

Speech and audio signal switching method and device

技术领域 technical field

本发明实施例涉及通信技术领域，尤其涉及一种语音频信号切换方法及装置。Embodiments of the present invention relate to the field of communication technologies, and in particular, to a voice and audio signal switching method and device.

背景技术 Background technique

目前，语音频信号在网络状态传输过程中，由于网络状态的不同，网络会对从编码端传输到网络的语音频信号的码流做不同码率的截断，从而解码端就会根据截断后的码流解码出不同带宽的语音频信号。At present, during the transmission of voice and audio signals in the network state, due to the different network conditions, the network will truncate the code stream of the voice and audio signal transmitted from the encoding end to the network at different bit rates, so that the decoding end will use the truncated The code stream decodes voice and audio signals with different bandwidths.

在现有技术中，由于网络中传输的语音频信号的带宽不同，在语音频信号传输过程中，存在窄频带语音频信号向宽频带语音频信号切换，以及宽频带语音频信号向窄频带语音频信号切换的现象。本发明中提到的窄频带信号为通过上采样和低通滤波，切换为只有低频带成分而高频带成分为空的宽频带信号，而宽频带语音频信号既有低频带信号成分又有高频带信号成分。In the prior art, due to the different bandwidths of voice and audio signals transmitted in the network, during the transmission of voice and audio signals, there are switching from narrowband voice and audio signals to wideband voice and audio signals, and switching from wideband voice and audio signals to narrowband voice signals. A phenomenon in which audio signals switch. The narrow-band signal mentioned in the present invention is through up-sampling and low-pass filtering, is switched to the wide-band signal that has only low-band components and the high-band components are empty, and the wide-band voice and audio signals have both low-band signal components and High frequency band signal components.

在实现本发明过程中，发明人发现现有技术中至少存在如下问题：由于窄频带语音频信号与宽频带语音频信号之间相差高频带信号中的信息，在切换不同带宽的语音频信号时，会出现音频信号能量激变的现象，从而会导致用户听觉上感到不舒服，造成用户接听音频信号的质量变差。In the process of realizing the present invention, the inventor has found that there are at least the following problems in the prior art: due to the difference in the information in the high-band signal between the narrowband speech and audio signal and the wideband speech and audio signal, when switching speech and audio signals of different bandwidths , there will be a phenomenon of sudden change in the energy of the audio signal, which will cause the user to feel uncomfortable in hearing, and cause the quality of the user to listen to the audio signal to deteriorate.

发明内容 Contents of the invention

本发明实施例提供一种语音频信号切换方法及装置，实现平滑的进行不同带宽的语音频信号切换，以提高用户接听音频信号的质量。Embodiments of the present invention provide a method and device for switching voice and audio signals, so as to realize smooth switching of voice and audio signals of different bandwidths, so as to improve the quality of audio signals listened to by users.

本发明实施例提供一种语音频信号切换方法，包括：An embodiment of the present invention provides a voice and audio signal switching method, including:

当语音频信号出现切换时，将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理，以得到处理后的第一高频带信号；其中，M大于等于1；When the voice and audio signal is switched, the first high frequency band signal of the current frame voice and audio signal and the second high frequency band signal of the previous M frames of voice and audio signal are weighted to obtain the processed first high frequency band signal ; Among them, M is greater than or equal to 1;

将所述处理后的第一高频带信号与所述当前帧语音频信号的第一低频带信号合成宽频带信号。Combining the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

本发明实施例提供一种音频信号切换装置，包括：An embodiment of the present invention provides an audio signal switching device, including:

处理模块，用于当语音频信号出现切换时，将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理，以得到处理后的第一高频带信号；其中，M大于等于1；The processing module is used for weighting the first high-band signal of the current frame of speech and audio signals and the second high-band signal of the previous M frames of speech and audio signals when the speech and audio signals are switched, so as to obtain the processed first A high frequency band signal; wherein, M is greater than or equal to 1;

第一合成模块，用于将所述处理后的第一高频带信号与所述当前帧语音频信号的第一低频带信号合成宽频带信号。The first synthesizing module is configured to synthesize the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

本发明实施例的语音频信号切换方法及装置，通过根据前M帧的语音频信号的第二高频带信号对当前帧语音频信号的第一高频带信号进行处理，以使前M帧的语音频信号的第二高频带信号能平滑过渡到处理后的第一高频带信号，并将处理后的第一高频带信号与第一低频带信号合成宽频带信号，从而在切换不同带宽的语音频信号过程中，能够平滑的进行不同带宽的语音频信号切换，减小了能量激变造成语音频信号的主观听觉质量差的影响，提高了用户接听语音频信号的质量。In the speech and audio signal switching method and device of the embodiment of the present invention, the first high frequency band signal of the speech and audio signal of the current frame is processed according to the second high frequency band signal of the speech and audio signal of the previous M frames, so that the previous M frames The second high-frequency band signal of the speech and audio signal can smoothly transition to the processed first high-frequency band signal, and the processed first high-frequency band signal and the first low-frequency band signal are synthesized into a wide-band signal, thereby switching In the process of voice and audio signals of different bandwidths, it can smoothly switch between voice and audio signals of different bandwidths, which reduces the impact of poor subjective auditory quality of voice and audio signals caused by energy shocks, and improves the quality of users listening to voice and audio signals.

附图说明 Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍，显而易见地，下面描述中的附图是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description These are some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained according to these drawings without any creative effort.

图1为本发明语音频信号切换方法实施例一流程图；Fig. 1 is a flow chart of Embodiment 1 of the voice and audio signal switching method of the present invention;

图2为本发明语音频信号切换方法实施例二流程图；Fig. 2 is the flow chart of Embodiment 2 of the voice and audio signal switching method of the present invention;

图3为图2中步骤201一个实施例的流程图；Fig. 3 is a flowchart of an embodiment of step 201 in Fig. 2;

图4为图3中步骤302一个实施例的流程图；Fig. 4 is a flowchart of an embodiment of step 302 in Fig. 3;

图5为图3中步骤302另一个实施例的流程图二；FIG. 5 is a second flowchart of another embodiment of step 302 in FIG. 3;

图6为图2中步骤202一个实施例的流程图；Fig. 6 is a flowchart of an embodiment of step 202 in Fig. 2;

图7为图2中步骤201另一个实施例的流程图二；FIG. 7 is a second flowchart of another embodiment of step 201 in FIG. 2;

图8为图2中步骤201另一个实施例的流程图三；FIG. 8 is a third flowchart of another embodiment of step 201 in FIG. 2;

图9为本发明语音频信号切换装置实施例一的结构示意图；FIG. 9 is a schematic structural diagram of Embodiment 1 of the voice and audio signal switching device of the present invention;

图10为本发明语音频信号切换装置实施例二的结构示意图；FIG. 10 is a schematic structural diagram of Embodiment 2 of the voice and audio signal switching device of the present invention;

图11为本发明语音频信号切换装置实施例二中的处理模块的结构示意图一；FIG. 11 is a first structural schematic diagram of the processing module in Embodiment 2 of the voice and audio signal switching device of the present invention;

图12为本发明语音频信号切换装置实施例二中的第一模块的结构示意图；12 is a schematic structural diagram of the first module in Embodiment 2 of the voice and audio signal switching device of the present invention;

图13a为本发明语音频信号切换装置实施例二中的处理模块的结构示意图二；Fig. 13a is a second structural schematic diagram of the processing module in the second embodiment of the voice and audio signal switching device of the present invention;

图13b为本发明语音频信号切换装置实施例二中的处理模块的结构示意图三。Fig. 13b is a third structural schematic diagram of the processing module in the second embodiment of the voice and audio signal switching device of the present invention.

具体实施方式 Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

图1为本发明语音频信号切换方法实施例一流程图。如图1所示，本实施例语音频信号切换方法，当语音频信号出现切换时，对切换帧之后的每一帧采用如下方式进行处理：FIG. 1 is a flow chart of Embodiment 1 of the voice and audio signal switching method of the present invention. As shown in Figure 1, in the voice and audio signal switching method of this embodiment, when the voice and audio signal is switched, each frame after the switching frame is processed in the following manner:

步骤101、当语音频信号出现切换时，将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理，以得到处理后的第一高频带信号；其中，M大于等于1。Step 101, when the speech and audio signals switch, weighting the first high frequency band signal of the current frame speech and audio signal and the second high frequency band signal of the previous M frames of speech and audio signals, to obtain the processed first high frequency band signal A frequency band signal; wherein, M is greater than or equal to 1.

步骤102、将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。Step 102: Synthesize the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a wide frequency band signal.

本发明提供的实施例中的前M帧语音频信号指当前帧之前的M帧语音频信号。切换前L帧语音频信号指出现语音频信号切换时切换帧之前的L帧语音频信号。当前语音帧为宽频带信号而前一帧语音帧为窄频带信号；或者当前语音帧为窄频带信号而前一帧语音帧为宽频带信号，则语音频信号出现切换，当前语音帧为切换帧。The previous M frames of speech and audio signals in the embodiments provided by the present invention refer to M frames of speech and audio signals before the current frame. The L frames of speech and audio signals before switching refer to the L frames of speech and audio signals before the switching frame when the speech and audio signals are switched. The current speech frame is a wideband signal and the previous speech frame is a narrowband signal; or the current speech frame is a narrowband signal and the previous speech frame is a broadband signal, then the speech and audio signals are switched, and the current speech frame is a switching frame .

本发明实施例的语音频信号切换方法，通过根据前M帧语音频信号中的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理，以使的前M帧语音频信号中的第二高频带信号能平滑的过渡到处理后的第一高频带信号，从而在切换不同带宽的语音频信号过程中，使不同带宽的语音频信号的高频带信号能够平滑的过渡切换；最后，处理后的第一高频带信号与第一低频带信号合成宽频带信号，将该宽频带信号传输到用户终端，使用户享受到高质量的语音频信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换，减小了能量激变造成语音频信号的主观听觉质量差的影响，提高了用户接听语音频信号的质量。In the speech and audio signal switching method of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed according to the second high frequency band signal in the speech and audio signal of the previous M frames, so that the first M frames The second high frequency band signal in the speech and audio signal can smoothly transition to the processed first high frequency band signal, so that in the process of switching speech and audio signals of different bandwidths, the high frequency band signals of speech and audio signals of different bandwidths Smooth transition switching is possible; finally, the processed first high-frequency signal and the first low-frequency signal synthesize a wide-band signal, and transmit the wide-band signal to the user terminal, so that the user can enjoy high-quality voice and audio signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, reduces the impact of poor subjective auditory quality of speech and audio signals caused by energy shocks, and improves the quality of speech and audio signals received by users.

图2为本发明语音频信号切换方法实施例二流程图。如图2所示，本实施例语音频信号切换方法，包括：FIG. 2 is a flow chart of Embodiment 2 of the voice and audio signal switching method of the present invention. As shown in Figure 2, the voice and audio signal switching method of this embodiment includes:

步骤200、当语音频信号没有出现切换时，将当前帧语音频信号的第一高频带信号与第一低频带信号合成宽频带信号。Step 200: When the speech and audio signal does not switch, synthesize the first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

具体的，本实施例中的第一频带语音频信号可以是宽频带语音频信号或者是窄频带语音频信号。在语音频信号的传输过程中，当第一频带语音频信号不发生切换时，分以下两种情况进行处理：一、若第一频带语音频信号为宽频带语音频信号时，则将宽频带语音频信号中的低频带信号和高频带信号合成宽频带信号；二、若第一频带语音频信号为窄频带语音频信号时，则将窄频带语音频信号中的低频带信号和高频带信号合成宽频带信号，此时，虽然为宽频带信号，但高频带是空，没有信息。Specifically, the first frequency-band speech and audio signal in this embodiment may be a wide-band speech and audio signal or a narrow-band speech and audio signal. In the transmission process of the speech and audio signal, when the first frequency band speech and audio signal does not switch, the following two situations are processed: 1. If the first frequency band speech and audio signal is a wideband speech and audio signal, the wideband The low-band signal and the high-band signal in the voice-audio signal are synthesized into a broadband signal; two, if the first-band voice-audio signal is a narrow-band voice-audio signal, then the low-band signal and the high-frequency signal in the narrow-band voice-audio signal are The band signal is synthesized into a broadband signal. At this time, although it is a broadband signal, the high frequency band is empty and has no information.

步骤201、当语音频信号出现切换时，将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理，以得到处理后的第一高频带信号。其中，M大于等于1。Step 201, when the speech and audio signal switches, perform weighting processing on the first high frequency band signal of the speech and audio signal of the current frame and the second high frequency band signal of the previous M frames of speech and audio signals, so as to obtain the first high frequency band signal after processing band signal. Wherein, M is greater than or equal to 1.

具体的，当出现不同带宽的语音频信号进行切换时，根据前M帧语音频信号的第二高频带信号，对当前帧语音频信号的第一高频带信号进行处理，以使得前M帧语音频信号的第二高频带信号能平滑过渡到处理后的第一高频带信号，例如，当宽频带语音频信号切换到窄频带语音频信号，由于窄频带语音频信号对应的的高频带信号为空，因此为了使宽频带语音频信号平滑切换到窄频带语音频信号，需要恢复窄频带语音频信号对应的的高频带信号的成分，而当窄频带语音频信号切换到宽频带语音频信号，由于宽频带语音频信号中的高频带信号不为空，因此为了使窄频带语音频信号平滑切换到宽频带语音频信号，需要减弱切换后连续多帧宽频带语音频信号中的高频带信号的能量，使得宽频带语音频信号的高频带信号逐渐过渡到真实的高频带信号。通过步骤201对当前帧语音频信号进行处理，使得不同带宽的语音频信号中的高频带信号能够平滑的进行过渡，解决了宽频带语音频信号与窄频带语音频信号之间进行切换时，由于能量激变而造成用户的听觉不舒适，从而使用户接收到高质量的音频信号。其中，为了简化得到处理后的第一高频带信号的过程，可以将第一高频带信号与前M帧语音频信号的第二高频带信号直接进行加权处理，处理后获得的结果即为处理后的第一高频带信号。Specifically, when voice and audio signals of different bandwidths are switched, the first high frequency band signal of the voice and audio signal of the current frame is processed according to the second high frequency band signal of the voice and audio signal of the previous M frames, so that the first M The second high-band signal of the frame speech and audio signal can smoothly transition to the processed first high-band signal. For example, when the wideband speech and audio signal is switched to the narrowband speech and audio signal, due to the corresponding The high-band signal is empty, so in order to smoothly switch the wide-band speech and audio signal to the narrow-band speech and audio signal, it is necessary to restore the components of the high-band signal corresponding to the narrow-band speech and audio signal, and when the narrow-band speech and audio signal is switched to For wideband voice and audio signals, since the high frequency band signal in wideband voice and audio signals is not empty, in order to smoothly switch narrowband voice and audio signals to wideband voice and audio signals, it is necessary to weaken the continuous multi-frame wideband voice and audio after switching The energy of the high-frequency band signal in the signal makes the high-band signal of the broadband speech and audio signal gradually transition to the real high-band signal. By step 201, the current frame of voice and audio signal is processed, so that the high frequency band signal in the voice and audio signals of different bandwidths can smoothly transition, and when switching between the wideband voice and audio signal and the narrowband voice and audio signal, The user's hearing is uncomfortable due to the sudden energy change, so that the user receives high-quality audio signals. Wherein, in order to simplify the process of obtaining the processed first high-band signal, the first high-band signal and the second high-band signal of the previous M frames of speech and audio signals can be directly weighted, and the obtained result after processing is is the processed first high frequency band signal.

步骤202、将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。Step 202: Synthesize the processed first high-frequency signal and the first low-frequency signal of the speech and audio signal of the current frame into a wide-band signal.

具体的，当前帧语音频信号通过步骤201处理后，使前M帧语音频信号的第二高频带信号能够平滑的过渡到当前帧的处理后的第一高频带信号，然后通过步骤202将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号，使用户接收到的语音频信号均为宽频带语音频信号，实现了不同带宽的语音频信号平稳的切换，有利于提高用户接听音频信号的质量。Specifically, after the current frame speech and audio signal is processed by step 201, the second high frequency band signal of the previous M frames of speech and audio signal can be smoothly transitioned to the processed first high frequency band signal of the current frame, and then by step 202 Synthesize the processed first high-frequency signal and the first low-frequency signal of the current frame voice and audio signal into a broadband signal, so that the voice and audio signals received by the user are all broadband voice and audio signals, realizing voice and audio with different bandwidths The smooth switching of the signal is conducive to improving the quality of the user's listening to the audio signal.

本发明实施例的语音频信号切换方法，通过根据前M帧语音频信号的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理，以使的前M帧语音频信号的第二高频带信号能平滑的过渡到处理后的第一高频带信号，从而在切换不同带宽的语音频信号过程中，使不同带宽的语音频信号的高频带信号能够平滑的过渡切换；最后，处理后的第一高频带信号与第一低频带信号合成宽频带信号，该宽频带信号传输到用户终端，使用户享受到高质量的语音信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换，减小了能量激变造成音频信号的主观听觉质量差的影响，提高了用户接听音频信号的质量。另外，通过在不发生不同带宽的语音频信号切换时，将当前帧语音频信号的第一高频带信号和第一低频带信号合成宽频带信号，使用户获得高质量的音频信号。In the speech and audio signal switching method of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed according to the second high frequency band signal of the speech and audio signal of the previous M frames, so that the speech of the previous M frames The second high frequency band signal of the audio signal can smoothly transition to the processed first high frequency band signal, so that the high frequency band signals of the voice and audio signals of different bandwidths can be smoothed during the process of switching voice and audio signals of different bandwidths Transition switching; Finally, the processed first high-frequency signal and the first low-frequency signal synthesize a wide-band signal, and the wide-band signal is transmitted to the user terminal, so that the user can enjoy high-quality voice signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, which reduces the impact of poor subjective auditory quality of audio signals caused by energy shocks, and improves the quality of audio signals listened to by users. In addition, by synthesizing the first high frequency band signal and the first low frequency band signal of the current frame voice and audio signal into a broadband signal when switching between voice and audio signals of different bandwidths does not occur, users can obtain high-quality audio signals.

基于上述技术方案，可选的，当宽频带语音频信号向窄频带语音频信号转换时，如图3所示，本实施例中的步骤201包括：Based on the above technical solution, optionally, when the wideband voice and audio signal is converted to the narrowband voice and audio signal, as shown in Figure 3, step 201 in this embodiment includes:

步骤301、预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。Step 301. Predict predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal.

具体的，语音频信号可以分解为精细结构信息和包络信息两部分，从而可以根据精细结构信息和包络信息恢复语音频信号。在由宽频带语音频信号切换到窄频带语音频信号的过程中，由于窄频带语音频信号中只有低频带信号，其对应的高频带信号为空，为了使宽频带语音频信号能够平滑的切换到窄频带语音频信号，需要恢复出当前的窄频带语音频信号所需要的高频带信号，以实现语音频信号的平滑切换。本实施例中的步骤301将预测窄频带语音频信号中的第一高频带信号对应的预测的精细结构信息和预测的包络信息。Specifically, the speech and audio signal can be decomposed into fine structure information and envelope information, so that the speech and audio signal can be restored according to the fine structure information and the envelope information. In the process of switching from wideband speech and audio signals to narrowband speech and audio signals, since there are only low-band signals in the narrowband speech and audio signals, the corresponding high-band signals are empty. In order to make the wideband speech and audio signals smooth To switch to the narrow-band speech and audio signals, it is necessary to restore the high-band signals required by the current narrow-band speech and audio signals, so as to realize smooth switching of the speech and audio signals. Step 301 in this embodiment is to predict the predicted fine structure information and the predicted envelope information corresponding to the first high-band signal in the narrow-band speech and audio signal.

为了更加准确的预测当前帧语音频信号对应的预测的精细结构信息和预测的包络信息，步骤301还可以对当前帧语音频信号的第一低频带信号进行信号分类；再根据第一低频带信号对应的信号类型预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。例如，当前帧的窄频带语音频信号可以为谐波信号、或非谐波信号或瞬态信号等信息类型，则可以根据窄频带语音频信号对应的信息类型，得知该种类型的信号应具有的精细结构信息和包络信息，从而更加准确的预测当前帧的高频带信号对应的精细结构信息和包络信息。其中，本发明语音频信号切换方法对窄频带语音频信号的信号类型不做限制。In order to more accurately predict the predicted fine structure information and the predicted envelope information corresponding to the current frame speech and audio signal, step 301 can also perform signal classification on the first low frequency band signal of the current frame speech and audio signal; then according to the first low frequency band The signal type prediction corresponding to the signal corresponds to the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal. For example, the narrow-band speech and audio signal of the current frame can be information types such as harmonic signals, or non-harmonic signals or transient signals, and then it can be known that the signal of this type should be based on the information type corresponding to the narrow-band speech and audio signal. It has fine structure information and envelope information, so as to more accurately predict the fine structure information and envelope information corresponding to the high frequency band signal of the current frame. Wherein, the speech and audio signal switching method of the present invention does not limit the signal type of the narrowband speech and audio signal.

步骤302、将预测的包络信号和前M帧语音频信号的第二高频带信号对应的前M帧包络信息进行加权处理，以得到第一高频带信号对应的第一包络信息。Step 302: Perform weighting processing on the predicted envelope signal and the envelope information of the previous M frames corresponding to the second high-band signal of the previous M frames of speech and audio signals, so as to obtain the first envelope information corresponding to the first high-band signal .

具体的，在步骤301预测当前帧的第一高频带信号对应的预测的精细结构信息和预测的包络信息后，可以根据预测的包络信息和前M帧语音频信号的第二高频带信号对应的前M帧包络信息，生成第一高频带信号对应的第一包络信息。Specifically, after predicting the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal of the current frame in step 301, the second high-frequency The envelope information of the first M frames corresponding to the band signal is used to generate the first envelope information corresponding to the first high frequency band signal.

具体而言，步骤302中生成第一高频带信号对应的第一包络信息的过程可以通过如下两种方式实现，具体如下：Specifically, the process of generating the first envelope information corresponding to the first high-frequency band signal in step 302 can be realized in the following two ways, specifically as follows:

方式一，如图4所示，通过步骤302获得第一包络信息的一个实施例可以包括：Way 1, as shown in Figure 4, an embodiment of obtaining the first envelope information through step 302 may include:

步骤401、根据第一低频带信号和前N帧的语音频信号的低频带信号，计算第一低频带信号与前N帧的语音频信号的低频带信号之间的相关性系数；其中，N大于等于1。Step 401. Calculate the correlation coefficient between the first low-frequency signal and the low-frequency signal of the speech and audio signal of the previous N frames according to the first low-frequency signal and the low-frequency signal of the speech and audio signal of the previous N frames; wherein, N greater than or equal to 1.

具体的，对当前帧语音频信号的第一低频带信号与前N帧的语音频信号的低频带信号进行比较，以得出当前帧语音频信号的第一低频带信号与前N帧的语音频信号的低频带信号之间的相关性系数，例如，可以通过判断当前帧语音频信号的第一低频带信号中的某频段信息，与前N帧的语音频信号的低频带信号的相同频段信息的能量大小或信息类型的差异等，确定它们之间的相关性，以计算得出所要的相关性系数。其中，前N帧的语音频信号可以为窄频带语音频信号、宽频带语音频信号或者窄频带语音频信号和宽频带语音频信号组成的混合信号。Specifically, compare the first low frequency band signal of the speech and audio signal of the current frame with the low frequency band signal of the speech and audio signal of the previous N frames to obtain the first low frequency band signal of the speech and audio signal of the current frame and the speech and audio signals of the previous N frames. The correlation coefficient between the low-frequency band signals of the audio signal, for example, can be determined by judging a certain frequency band information in the first low-frequency band signal of the current frame speech and audio signal, and the same frequency band of the low-frequency band signal of the speech and audio signal of the previous N frames The energy size of the information or the difference in the type of information, etc., determine the correlation between them, and calculate the desired correlation coefficient. Wherein, the speech and audio signals of the first N frames may be narrowband speech and audio signals, wideband speech and audio signals, or a mixed signal composed of narrowband speech and audio signals and wideband speech and audio signals.

步骤402、判断相关性系数是否在给定的第一阈值范围内。Step 402, judging whether the correlation coefficient is within a given first threshold range.

具体的，在步骤401计算出相关性系数后，判断该相关性系数是否在给定的阈值范围内。计算相关性系数的作用是为了得知当前帧语音频信号是从前N帧的语音频信号渐变过来的还是突变过来的，也就是说看它们的特性是否相同，进而判断预测当前帧语音频信号的高频带信号时前面帧的高频带信号所占的权重。例如，如果当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号能量相当，且类型相同，则说明前一帧的语音频信号与当前帧语音频信号有较高的相关性，因此，为了准确的恢复当前帧语音频信号对应的第一包络信息，恢复当前帧语音频信号对应的第一包络信息时，前一帧的语音频信号对应的高频带包络信息或过渡的包络信息占较大的权重；否则，如果当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号能量相差很大，且类型不同，则说明前一帧的语音频信号与当前帧语音频信号有较低的相关性，因此，为了准确的恢复当前帧语音频信号对应的第一包络信息，恢复当前帧语音频信号对应的第一包络信息时，前一帧的语音频信号对应的高频带包络信息或过渡的包络信息占较小的权重；Specifically, after the correlation coefficient is calculated in step 401, it is judged whether the correlation coefficient is within a given threshold range. The function of calculating the correlation coefficient is to know whether the speech and audio signal of the current frame is gradually changed or mutated from the speech and audio signal of the previous N frames, that is to say, to see whether their characteristics are the same, and then to judge and predict the speech and audio signal of the current frame The weight of the high-band signal in the previous frame when the high-band signal is used. For example, if the first low frequency band signal of the speech and audio signal of the current frame is equivalent to the low frequency signal energy of the speech and audio signal of the previous frame, and the type is the same, then the speech and audio signal of the previous frame is compared with the speech and audio signal of the current frame. High correlation, therefore, in order to accurately restore the first envelope information corresponding to the current frame speech and audio signal, when restoring the first envelope information corresponding to the current frame speech and audio signal, the high frequency corresponding to the speech and audio signal of the previous frame Envelope information with envelope information or transition envelope information takes a larger weight; otherwise, if the energy of the first low-frequency band signal of the current frame of speech and audio signal is very different from that of the speech and audio signal of the previous frame, and the types are different , it means that the speech and audio signal of the previous frame has a low correlation with the speech and audio signal of the current frame. Therefore, in order to accurately restore the first envelope information corresponding to the speech and audio signal of the current frame, restore the corresponding During the first envelope information, the high-frequency band envelope information corresponding to the speech and audio signal of the previous frame or the transitional envelope information accounted for a smaller weight;

步骤403、如果相关性系数不在给定的第一阈值范围内，则根据已设定好的第一权重一和第一权重二进行加权处理，以计算出第一包络信息。其中，第一权重一为前一帧语音频信号的高频带信号对应的前一帧包络信息的权重值，第一权重二为预测的包络信息的权重值。Step 403 , if the correlation coefficient is not within the given first threshold range, perform weighting processing according to the set first weight 1 and first weight 2 to calculate the first envelope information. Wherein, the first weight one is the weight value of the envelope information of the previous frame corresponding to the high frequency band signal of the voice and audio signal of the previous frame, and the first weight two is the weight value of the predicted envelope information.

具体的，当步骤402得出相关性系数不在给定的第一阈值范围内时，则可以得知当前帧语音频信号与前N帧语音频信号有较小的相关性，所以，前M帧的第一频带语音频信号对应的前M帧包络信息或过渡的包络信息或前一帧的语音频信号对应的高频带包络信息对第一包络信息影响较小，在恢复当前帧语音频信号对应的第一包络信息时，前M帧的第一频带语音频信号对应的前M帧包络信息或过渡的包络信息或前一帧的语音频信号对应的高频带包络信息占的权重较小。因此，根据已设定好的第一权重一和第一权重二，便可以计算出当前帧的第一包络信息。其中，第一权重一为前一帧语音频信号的高频带信号对应的包络信息的权重值，该前一帧语音频信号可以是宽频带语音频信号或者是已处理过的窄频带语音频信号，当第一次切换时，前一帧语音频信号即为宽频带语音频信号；而第一权重二为预测的包络信息的权重值。将预测的包络信息与第一权重二的乘积，加上前一帧的包络信息与第一权重一的乘积之和，所求得的加权之和为当前帧的第一包络信息。另外，以后传输的语音频信号都按此方式和权重，恢复该语音频信号对应的第一包络信息，直到语音频信号再次发生切换。Specifically, when step 402 shows that the correlation coefficient is not within the given first threshold range, it can be known that the speech and audio signal of the current frame has a small correlation with the speech and audio signal of the previous N frames, so the previous M frames The first M frame envelope information corresponding to the speech and audio signal of the first frequency band or the envelope information of the transition or the high frequency band envelope information corresponding to the speech and audio signal of the previous frame have little influence on the first envelope information, and the current When the first envelope information corresponding to the frame speech and audio signal, the first M frame envelope information corresponding to the first frequency band speech and audio signal of the previous M frames or the transitional envelope information or the high frequency band corresponding to the speech and audio signal of the previous frame Envelope information takes less weight. Therefore, according to the set first weight 1 and first weight 2, the first envelope information of the current frame can be calculated. Wherein, the first weight one is the weight value of the envelope information corresponding to the high-band signal of the previous frame of speech and audio signal, and the speech and audio signal of the previous frame can be a wideband speech and audio signal or a processed narrowband speech signal. For the audio signal, when switching for the first time, the speech and audio signal of the previous frame is the broadband speech and audio signal; and the first weight 2 is the weight value of the predicted envelope information. The product of the predicted envelope information and the first weight two is added to the sum of the product of the envelope information of the previous frame and the first weight one, and the obtained weighted sum is the first envelope information of the current frame. In addition, the speech and audio signals to be transmitted later will be restored according to the method and weight, and the first envelope information corresponding to the speech and audio signals will be restored until the speech and audio signals are switched again.

步骤404、如果相关性系数在第一阈值范围内，根据已设定好的第二权重一和第二权重二进行加权处理，以计算出过渡包络信息。其中，第二权重一为切换前的包络信息的权重值，第二权重二为前M帧包络信息的权重值；其中，L大于等于1。Step 404, if the correlation coefficient is within the first threshold range, perform weighting processing according to the set second weight 1 and second weight 2 to calculate the transition envelope information. Wherein, the second weight one is the weight value of the envelope information before switching, and the second weight two is the weight value of the envelope information of the previous M frames; wherein, L is greater than or equal to 1.

具体的，当步骤402得出相关性系数在给定的阈值范围内时，则可以得知当前帧语音频信号与前连续N帧的语音频信号特性相似，当前帧语音频信号对应的第一包络信息受前连续N帧的语音频信号的包络信息影响较大，同时考虑到前M帧包络的真实性，因此，需要根据前M帧包络信息和切换前的包络信息求解当前帧语音频信号对应的过渡包络信息，在恢复当前帧语音频信号的第一包络信息时，让前M帧包络信息和切换前L帧的包络信息占较大的权重；再通过过渡包络信息求解第一包络信息。其中，第二权重一为切换前的包络信息的权重值，而第二权重二为前M帧包络信息的权重值。则切换前的包络信息与第二权重一的乘积，加上前M帧包络信息与第二权重二的乘积之和，所求得的加权值即为过渡包络信息。Specifically, when step 402 shows that the correlation coefficient is within a given threshold range, it can be known that the speech and audio signals of the current frame are similar to the speech and audio signals of the previous N consecutive frames, and the speech and audio signals of the current frame correspond to the first The envelope information is greatly affected by the envelope information of the voice and audio signals of the previous N consecutive frames. At the same time, considering the authenticity of the envelope of the previous M frames, it is necessary to solve the problem based on the envelope information of the previous M frames and the envelope information before switching The transition envelope information corresponding to the current frame speech and audio signal, when recovering the first envelope information of the current frame speech and audio signal, allow the envelope information of the previous M frame envelope information and the envelope information of the L frame before switching to account for a larger weight; The first envelope information is solved by the transition envelope information. Wherein, the second weight one is the weight value of the envelope information before switching, and the second weight two is the weight value of the envelope information of the previous M frames. Then, the product of the envelope information before switching and the second weight 1 is added to the sum of the products of the envelope information of the previous M frames and the second weight 2, and the obtained weight value is the transition envelope information.

步骤405、以第一权重步长为单位减小第二权重一，以第一权重步长为单位增加第二权重二。Step 405: Decrease the second weight one by the unit of the first weight step, and increase the second weight two by the unit of the first weight step.

具体的，随着语音频信号的传输，后续的窄频带语音频信号受切换前的宽频带语音频信号的影响逐渐变小，为了使计算得出的第一包络信息更加准确，需要对第二权重一和第二权重二进行适用性的调整。由于后续的音频信号受切换前L帧的宽频带语音频信号的影响逐渐变小，因此，第二权重一的数值逐渐变小，而第二权重二的数值逐渐增大，从而减弱切换前的包络信息对第一包络信息的影响。其中，步骤405对第二权重一和第二权重二进行修改可以通过如下方法：新的第二权重一等于旧的第二权重一减去第一权重步长，新的第二权重二等于旧的第二权重二加上第一权重步长；其中，第一权重步长为已设定好的值。Specifically, with the transmission of voice and audio signals, the subsequent narrowband voice and audio signals are gradually less affected by the wideband voice and audio signals before switching. In order to make the calculated first envelope information more accurate, it is necessary to The second weight one and the second weight two are adjusted for applicability. Since the follow-up audio signal is gradually reduced by the influence of the broadband speech and audio signal of L frames before the switch, the value of the second weight 1 gradually decreases, while the value of the second weight 2 gradually increases, thereby weakening the value before the switch. The influence of the envelope information on the first envelope information. Among them, step 405 can modify the second weight 1 and the second weight 2 through the following method: the new second weight 1 is equal to the old second weight 1 minus the first weight step size, and the new second weight 2 is equal to the old The second weight of 2 plus the first weight step; wherein, the first weight step is a set value.

步骤406、判断已设定好的第三权重一是否大于第一权重一。Step 406 , judging whether the set third weight one is greater than the first weight one.

具体的，第三权重一为过渡包络信息的权重值，通过比较第三权重一与第二权重一的大小，可以得知当前帧的第一包络信息受过渡包络信息的影响大小。其中，过渡包络信息中由前M帧包络信息和切换前的包络信息计算而来，因此，第三权重一实际上代表了第一包络信息受切换前的包络信息的影响程度。Specifically, the third weight one is a weight value of the transition envelope information, and by comparing the third weight one with the second weight one, it can be known how much the first envelope information of the current frame is affected by the transition envelope information. Among them, the transition envelope information is calculated from the envelope information of the previous M frames and the envelope information before switching, therefore, the third weight one actually represents the influence degree of the first envelope information by the envelope information before switching .

步骤407、如果第三权重一不大于第一权重一，则根据已设定好的第一权重一和第一权重二进行加权处理，以计算出所述第一包络信息。Step 407. If the third weight one is not greater than the first weight one, perform weighting processing according to the set first weight one and first weight two to calculate the first envelope information.

具体的，在步骤406判断得知第三权重一小于等于第一权重一时，说明当前帧语音频信号离切换前L帧的语音频信号较远，第一包络信息主要受前M帧包络信息的影响，因此，根据已设定好的第一权重一和第一权重二，便可以计算出当前帧的第一包络信息。Specifically, when it is judged in step 406 that the third weight 1 is less than or equal to the first weight 1, it means that the speech and audio signal of the current frame is far from the speech and audio signal of L frames before switching, and the first envelope information is mainly affected by the envelope of the previous M frames. Therefore, according to the set first weight 1 and first weight 2, the first envelope information of the current frame can be calculated.

步骤408、如果第三权重一大于第一权重一，根据已设定好的第三权重一和第三权重二进行加权处理，以计算出第一包络信息。其中，第三权重一为过渡包络信息的权重值，第三权重二为预测的包络信息的权重值。Step 408, if the third weight 1 is greater than the first weight 1, perform weighting processing according to the set third weight 1 and third weight 2 to calculate the first envelope information. Wherein, the third weight one is the weight value of the transitional envelope information, and the third weight two is the weight value of the predicted envelope information.

具体的，在步骤406判断得知第三权重一大于第一权重一时，说明当前帧语音频信号离切换前L帧语音频信号较近，第一包络信息受切换前的包络信息影响较大，因此，需要根据过渡包络信息求解当前帧的第一包络信息。其中，第三权重一为过渡包络信息的权重值，而第三权重二为预测的包络信息的权重值。则过渡包络信息与第三权重一的乘积，加上预测的包络信息与第三权重二的乘积之和所求得的加权值即为第一包络信息。Specifically, when it is judged in step 406 that the third weight one is greater than the first weight one, it means that the current frame speech and audio signal is closer to the L frame speech and audio signal before switching, and the first envelope information is less affected by the envelope information before switching. Therefore, it is necessary to obtain the first envelope information of the current frame according to the transition envelope information. Wherein, the third weight one is the weight value of the transition envelope information, and the third weight two is the weight value of the predicted envelope information. The weighted value obtained by adding the product of the transition envelope information and the third weight one to the sum of the product of the predicted envelope information and the third weight two is the first envelope information.

步骤409、以第二权重步长为单位减小第三权重一，以第二权重步长为单位增加第三权重二，直至第三权重一等于零。Step 409: Decrease the third weight 1 by the unit of the second weight step, increase the third weight 2 by the unit of the second weight step, until the third weight 1 is equal to zero.

具体的，步骤409中修改第三权重一和第三权重二的目的，与步骤405中修改第二权重一和第二权重二的目的相同，都是为了在后续传输的语音频信号受切换前L帧语音频信号的影响逐渐变小的情况下，为了使计算得出的第一包络信息更加准确，对第三权重一和第三权重二进行适用性的调整。由于后续的音频信号受切换前L帧语音频信号的影响逐渐变小，因此，第三权重一的数值逐渐变小，而第三权重二的数值逐渐增大，从而也实现了减弱切换前的包络信息对第一包络信息的影响。其中，步骤409对第三权重一和第三权重二进行修改可以通过如下方法：新的第三权重一等于旧的第三权重一减去第二权重步长，新的第三权重二等于旧的第三权重二加上第二权重步长；其中，第二权重步长为已设定好的值。Specifically, the purpose of modifying the third weight 1 and the third weight 2 in step 409 is the same as the purpose of modifying the second weight 1 and the second weight 2 in step 405. When the influence of the voice and audio signal of the L frame gradually decreases, in order to make the calculated first envelope information more accurate, the third weight 1 and the third weight 2 are adjusted for applicability. Since the subsequent audio signal is gradually reduced by the influence of the L-frame speech and audio signal before the switch, the value of the third weight 1 gradually decreases, and the value of the third weight 2 gradually increases, thereby also reducing the value of the voice signal before the switch. The influence of the envelope information on the first envelope information. Among them, step 409 can modify the third weight 1 and the third weight 2 through the following method: the new third weight 1 is equal to the old third weight 1 minus the second weight step size, and the new third weight 2 is equal to the old The third weight of 2 plus the second weight step; wherein, the second weight step is a set value.

第一权重一和第一权重二的和为一，第二权重一和第二权重二的和为一，第三权重一和第三权重二的和为一；第三权重一的初始值大于第一权重一的初始值；第一权重一和第一权重二是固定的常数。具体的，本实施例中的权重一和权重二，实际上代表了切换前的包络信息和前M帧包络信息组成当前帧的第一包络信息时所占的百分比。对于当前帧语音频信号，离切换前L帧的语音频信号越近且相关性较大时，切换前的包络信息所占的百分比就越高，而相反的前M帧包络信息所占的百分比就越低。在当前帧语音频信号离切换前L帧的语音频信号较远时，说明网络中已经稳定的传输语音频信号，或者当前帧语音频信号与切换前L帧语音频信号相关性较低时，说明当前帧语音频信号特性已发生改变，因此，当前帧语音频信号受切换前L帧的语音频信号影响较小，切换前的包络信息所占的百分比就越低。The sum of the first weight one and the first weight two is one, the sum of the second weight one and the second weight two is one, the sum of the third weight one and the third weight two is one; the initial value of the third weight one is greater than The initial value of the first weight one; the first weight one and the first weight two are fixed constants. Specifically, weight 1 and weight 2 in this embodiment actually represent the percentages when the envelope information before switching and the envelope information of the previous M frames form the first envelope information of the current frame. For the speech and audio signal of the current frame, the closer the speech and audio signal of L frames before switching and the greater the correlation, the higher the percentage of the envelope information before switching is, while the envelope information of M frames before the opposite. the lower the percentage. When the voice and audio signal of the current frame is far from the voice and audio signal of L frames before switching, it means that the voice and audio signal has been transmitted stably in the network, or when the correlation between the voice and audio signal of the current frame and the voice and audio signal of L frames before switching is low, It shows that the characteristics of the speech and audio signal of the current frame have changed. Therefore, the speech and audio signal of the current frame is less affected by the speech and audio signal of L frames before switching, and the percentage of the envelope information before switching is lower.

另外，本实施例中的步骤404和步骤405的执行顺序可以互换，即可以先修改第二权重一和第二权重二，再根据第二权重一和第二权重二，计算出过渡包络信息。同样的，本实施例中的步骤408和步骤409的执行顺序可以互换，即可以先修改修改第三权重一和第三权重二，再根据修改第三权重一和第三权重二，计算出第一包络信息。In addition, the execution order of step 404 and step 405 in this embodiment can be interchanged, that is, the second weight 1 and the second weight 2 can be modified first, and then the transition envelope can be calculated according to the second weight 1 and the second weight 2 information. Similarly, the execution order of step 408 and step 409 in this embodiment can be interchanged, that is, the third weight 1 and the third weight 2 can be modified first, and then the third weight 1 and the third weight 2 can be modified to calculate First envelope information.

方式二，如图5所示，通过步骤302获得第一包络信息的另一个实施例还可以包括：Way 2, as shown in Figure 5, another embodiment of obtaining the first envelope information through step 302 may also include:

步骤501、根据当前帧语音频信号的第一低频带信号和前一帧的语音频信号的低频信号，计算第一低频带信号与前一帧的语音频信号的低频信号之间的相关性系数。Step 501, according to the first low frequency band signal of the speech and audio signal of the current frame and the low frequency signal of the speech and audio signal of the previous frame, calculate the correlation coefficient between the first low frequency band signal and the low frequency signal of the speech and audio signal of the previous frame .

具体的，为了更加准确的得出第一包络信息，求解当前帧语音频信号的第一低频带信号某频段和前一帧的语音频信号的低频信号相同频段的能量的关系。本实施例可以以“corr”代表相关性系数，通过当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号相同频段的能量关系，得出当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号之间的相关性系数corr，能量相差越小，corr越大，否则，corr越小。具体过程可以参见步骤401中关于前N帧的语音频信号相关性计算的介绍。Specifically, in order to obtain the first envelope information more accurately, the relationship between the energy of a certain frequency band of the first low frequency band signal of the speech and audio signal of the current frame and the same frequency band of the low frequency signal of the speech and audio signal of the previous frame is solved. In this embodiment, "corr" can be used to represent the correlation coefficient, and the current frame speech and audio signal can be obtained through the energy relationship of the first low frequency band signal of the current frame speech and audio signal and the low frequency band signal of the speech and audio signal of the previous frame in the same frequency band The correlation coefficient corr between the first low-frequency band signal of and the low-frequency band signal of the speech audio signal of the previous frame, the smaller the energy difference is, the larger the corr is, otherwise, the smaller the corr is. For the specific process, please refer to the introduction about the calculation of the correlation of speech and audio signals of the previous N frames in step 401 .

步骤502、判断相关性系数是否在给定的第二阈值范围内。Step 502, judging whether the correlation coefficient is within a given second threshold range.

具体的，在步骤501计算出corr的值后，判断计算出的corr是否在给定的第二阈值范围内。例如，本实施例可以将第二阈值范围用c1～c2代表。Specifically, after the value of corr is calculated in step 501, it is determined whether the calculated corr is within a given second threshold range. For example, in this embodiment, the second threshold range may be represented by c1-c2.

步骤503、如果相关性系数不在第二阈值范围内，则根据已设定好的第一权重一和第一权重二进行加权处理，以计算出第一包络信息。其中，第一权重一为前一帧的语音频信号的高频带信号对应的前一帧包络信息的权重值，第一权重二为预测的包络信息的权重值；第一权重一和第一权重二是固定的常数。Step 503 , if the correlation coefficient is not within the second threshold range, perform weighting processing according to the set first weight 1 and first weight 2 to calculate the first envelope information. Wherein, the first weight one is the weight value of the previous frame envelope information corresponding to the high frequency band signal of the speech audio signal of the previous frame, and the first weight two is the weight value of the predicted envelope information; the first weight one and The first weight two is a fixed constant.

具体的，当步骤502得出corr小于c1或大于c2时，得知当前帧语音频信号对应的第一包络信息受切换前一帧语音频信号的包络信息影响较小，因此，根据已设定好的第一权重一和第一权重二，便可以计算出当前帧的第一包络信息。预测的包络信息与第一权重二的乘积，加上前一帧包络信息与第一权重一的乘积之和，所求得的加权和即为当前帧的第一包络信息。另外，以后传输的窄带语音频信号都按此方式和权重恢复该窄带语音频信号对应的第一包络信息，直到不同带宽的语音频信号再次发生切换。例如：本实施例中的第一权重一可以用a1代表、第一权重二可以用b1代表、前一帧包络信息可以用pre_fenv代表、预测的包络信息可以用fenv代表、第一包络信息可以用cur_fenv代表。则步骤503可以利用以下公式表示：cur_fenv＝pre_fenv*a1+fenv*b1。Specifically, when it is obtained in step 502 that corr is less than c1 or greater than c2, it is known that the first envelope information corresponding to the speech and audio signal of the current frame is less affected by the envelope information of the speech and audio signal of the previous frame before switching. Therefore, according to the After setting the first weight 1 and the first weight 2, the first envelope information of the current frame can be calculated. The product of the predicted envelope information and the first weight 2 is added to the sum of the product of the envelope information of the previous frame and the first weight 1, and the obtained weighted sum is the first envelope information of the current frame. In addition, the narrowband speech and audio signals to be transmitted later will restore the first envelope information corresponding to the narrowband speech and audio signals in this manner and weight until the speech and audio signals of different bandwidths are switched again. For example: in this embodiment, the first weight one can be represented by a1, the first weight two can be represented by b1, the envelope information of the previous frame can be represented by pre_fenv, the predicted envelope information can be represented by fenv, the first envelope Information can be represented by cur_fenv. Then step 503 can be expressed by the following formula: cur_fenv=pre_fenv*a1+fenv*b1.

步骤504、如果相关性系数在第二阈值范围内，判断已设定好的第二权重一是否大于第一权重一。其中，第二权重一为切换前一帧语音频信号的高频带信号对应的切换前的包络信息的权重值，Step 504, if the correlation coefficient is within the second threshold range, determine whether the set second weight one is greater than the first weight one. Wherein, the second weight one is the weight value of the envelope information before switching corresponding to the high frequency band signal of the voice and audio signal of the previous frame,

具体的，如果c1＜corr＜c2，则通过比较第二权重一与第一权重一的大小，可以得知当前帧的第一包络信息受切换前的包络信息和前一帧包络信息的影响程度。Specifically, if c1<corr<c2, by comparing the size of the second weight one with the first weight one, it can be known that the first envelope information of the current frame is affected by the envelope information before switching and the envelope information of the previous frame degree of influence.

步骤505、如果第二权重一不大于第一权重一，则根据已设定好的第一权重一和第一权重二，计算出第一包络信息。Step 505. If the second weight one is not greater than the first weight one, calculate the first envelope information according to the set first weight one and first weight two.

具体的，在步骤504判断得知第二权重一小于第一权重一时，说明当前帧语音频信号离切换前一帧的语音频信号较远，第一包络信息受切换前的包络信息的影响较小，因此，根据已设定好的第一权重一和第一权重二，便可以计算出当前帧的第一包络信息。则步骤505可以利用以下公式表示：cur_fenv＝pre_fenv*a1+fenv*b1。Specifically, when it is judged in step 504 that the second weight one is less than the first weight one, it means that the speech and audio signal of the current frame is far from the speech and audio signal of the frame before switching, and the first envelope information is influenced by the envelope information before switching. Therefore, according to the set first weight 1 and first weight 2, the first envelope information of the current frame can be calculated. Then step 505 can be expressed by the following formula: cur_fenv=pre_fenv*a1+fenv*b1.

步骤506、如果第二权重一大于第一权重一，根据第二权重一和已设定好的第二权重二进行加权处理，以计算出第一包络信息。其中，第二权重二为预测的包络信息的权重值。例如：第二权重一可以用a2代表，第二权重二可以用b2代表。Step 506, if the second weight one is greater than the first weight one, perform weighting processing according to the second weight one and the set second weight two to calculate the first envelope information. Wherein, the second weight two is the weight value of the predicted envelope information. For example: the second weight one can be represented by a2, and the second weight two can be represented by b2.

具体的，在步骤504判断得知第二权重一大于第一权重一时，说明当前帧语音频信号离切换前一帧的第一频带语音频信号较近，第一包络信息受切换前一帧语音频信号对应的切换前的包络信息影响较大。因此，根据已设定好的第二权重一和第二权重二，便可以计算出当前帧的第一包络信息。则预测的包络信息与第二权重二的乘积，加上切换前的包络信息与第二权重一的乘积之和，所求得的加权和即为当前帧的第一包络信息。其中，切换前的包络信息可以用con_fenv代表，则步骤506可以利用以下公式表示：cur_fenv＝con_fenv*a2+fenv*b2。Specifically, when it is judged in step 504 that the second weight one is greater than the first weight one, it means that the speech and audio signal of the current frame is closer to the speech and audio signal of the first frequency band of the previous frame before switching, and the first envelope information is affected by the previous frame of switching. The envelope information corresponding to the speech and audio signals before switching has a greater influence. Therefore, according to the set second weight 1 and second weight 2, the first envelope information of the current frame can be calculated. Then the product of the predicted envelope information and the second weight 2 is added to the sum of the products of the envelope information before switching and the second weight 1, and the obtained weighted sum is the first envelope information of the current frame. Wherein, the envelope information before switching can be represented by con_fenv, and step 506 can be expressed by the following formula: cur_fenv=con_fenv*a2+fenv*b2.

步骤507、以第二权重步长为单位减小第二权重一，以第二权重步长为单位增加第二权重二。Step 507: Decrease the second weight one by the unit of the second weight step, and increase the second weight two by the unit of the second weight step.

具体的，随着语音频信号的传输，后续的当前帧语音频信号受切换前一语音频信号的影响逐渐变小，为了使计算得出的第一包络信息更加准确，需要对第二权重一和第二权重二进行适用性的调整。由于后续的音频信号受切换前一帧语音频信号的影响逐渐变小；而靠近当前帧语音频信号的前一帧语音频信号的影响变大。因此，第二权重一的数值逐渐变小，而第二权重二的数值逐渐增大，从而减弱切换前的包络信息对第一包络信息的影响，而增强预测的包络信息对第一包络信息的影响。其中，步骤507对第二权重一和第二权重二进行修改可以通过如下方法：新的第二权重一等于旧的第二权重一减去第一权重步长，新的第二权重二等于旧的第二权重二加上第一权重步长；其中，第一权重步长为已设定好的值。Specifically, with the transmission of speech and audio signals, the subsequent speech and audio signals of the current frame are gradually less affected by the speech and audio signals before switching. In order to make the calculated first envelope information more accurate, the second weight One and the second weight two are adjusted for applicability. Since the subsequent audio signal is gradually less affected by the voice and audio signal of the previous frame after switching; and the influence of the previous frame of voice and audio signal close to the current frame of the voice and audio signal becomes greater. Therefore, the value of the second weight 1 gradually decreases, while the value of the second weight 2 gradually increases, thereby weakening the influence of the envelope information before switching on the first envelope information, and enhancing the predicted envelope information on the first envelope information. The impact of envelope information. Among them, step 507 can modify the second weight 1 and the second weight 2 through the following method: the new second weight 1 is equal to the old second weight 1 minus the first weight step size, and the new second weight 2 is equal to the old The second weight of 2 plus the first weight step; wherein, the first weight step is a set value.

其中，第一权重一和第一权重二的和为一，第二权重一和第二权重二的和为一；第二权重一的初始值大于第一权重一的初始值。Wherein, the sum of the first weight one and the first weight two is one, and the sum of the second weight one and the second weight two is one; the initial value of the second weight one is greater than the initial value of the first weight one.

步骤303、根据第一包络信息和预测的精细结构信息，生成处理后的第一高频带信号。Step 303: Generate a processed first high frequency band signal according to the first envelope information and the predicted fine structure information.

具体的，通过步骤302得出当前帧的第一包络信息后，可以根据第一包络信息和预测的精细结构信息，生成所需要的处理后的第一高频带信号，从而使第二高频带信号能平滑过渡到处理后的第一高频带信号。Specifically, after obtaining the first envelope information of the current frame through step 302, the required processed first high-frequency band signal can be generated according to the first envelope information and the predicted fine structure information, so that the second The high frequency band signal can smoothly transition to the processed first high frequency band signal.

本实施例语音频信号切换方法，在语音频信号发生从宽频带语音频信号向窄频带语音频信号切换的过程中，通过预测的精细结构信息和第一包络信息获得当前帧的处理后的第一高频带信号，从而能够使切换前的宽频带语音频信号的第二高频带信号能够平滑的过渡到窄频带语音频信号所对应的处理后的第一高频带信号，更有利于提高用户接听音频信号的质量。In the speech and audio signal switching method of this embodiment, in the process of switching the speech and audio signal from the broadband speech and audio signal to the narrowband speech and audio signal, the processed information of the current frame is obtained through the predicted fine structure information and the first envelope information. The first high-band signal, so that the second high-band signal of the wide-band speech and audio signal before switching can smoothly transition to the processed first high-band signal corresponding to the narrow-band speech and audio signal, and more It is beneficial to improve the quality of the user listening to the audio signal.

基于上述技术方案，可选的，如图6所示，本实施例中的步骤202包括：Based on the above technical solution, optionally, as shown in FIG. 6, step 202 in this embodiment includes:

步骤601、根据当前帧语音频信号与切换前一帧的语音频信号，判断处理后的第一高频带信号是否需要衰减。Step 601 : According to the speech and audio signal of the current frame and the speech and audio signal of the frame before switching, determine whether the processed first high frequency band signal needs to be attenuated.

具体的，由于窄带语音频信号的第一高频带信号为空，在宽频带语音频信号切换到窄频带语音频信号的过程中，为了防止恢复出的窄频带语音频信号对应的处理后的第一高频带信号带来不好的影响，在由窄频带语音频信号扩展为宽频带信号的帧数达到给定的帧数后，将处理后的第一高频带信号的能量逐帧进行衰减，直到衰减系数达到给定的阈值。通过当前帧语音频信号与切换前一帧的语音频信号，可以得知当前帧语音频信号与切换前一帧的语音频信号之间的间隔时间，例如，可以通过计数器记录窄频带语音频信号传输的帧数，此帧数可以为已预定好的大于等于0的值。Specifically, since the first high-band signal of the narrowband speech and audio signal is empty, in the process of switching the wideband speech and audio signal to the narrowband speech and audio signal, in order to prevent the recovered narrowband speech and audio signal from processing The first high-frequency band signal has a bad influence. After the number of frames extended from the narrow-band speech and audio signal to the wide-band signal reaches a given number of frames, the energy of the processed first high-band signal is frame by frame. Decay until the decay coefficient reaches the given threshold. Through the voice and audio signal of the current frame and the voice and audio signal of the previous frame, the interval time between the voice and audio signal of the current frame and the voice and audio signal of the previous frame can be known. For example, the narrowband voice and audio signal can be recorded by the counter The number of frames to be transmitted, which can be a predetermined value greater than or equal to 0.

步骤602、如果不需要衰减，将处理后的第一高频带信号与第一低频带信号合成宽频带信号。Step 602. If no attenuation is required, synthesize the processed first high-frequency signal and the first low-frequency signal into a wide-band signal.

具体的，如果步骤601判断得出处理后的第一高频带信号无需进行衰减，则直接将处理后的第一高频带信号与第一低频带信号合成宽频带信号。Specifically, if it is determined in step 601 that the processed first high-frequency band signal does not need to be attenuated, the processed first high-frequency band signal and the first low-frequency band signal are directly synthesized into a wide-band signal.

步骤603、如果需要衰减，判断处理后的第一高频带信号对应的衰减因子是否大于阈值。Step 603. If attenuation is required, determine whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a threshold.

具体的，衰减因子的初始值为一；阈值小于一并大于等于零。如果步骤601判断得出处理后的第一高频带信号需要进行衰减，则通过步骤603判断处理后的第一高频带信号对应的衰减因子是否大于给定的阈值。Specifically, the initial value of the attenuation factor is one; the threshold value is less than one and greater than or equal to zero. If it is determined in step 601 that the processed first high frequency band signal needs to be attenuated, then in step 603 it is determined whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a given threshold.

步骤604、如果衰减因子不大于给定的阈值，则将处理后的第一高频带信号乘以阈值，然后与第一低频带信号合成宽频带信号。Step 604: If the attenuation factor is not greater than a given threshold, multiply the processed first high-frequency signal by the threshold, and then synthesize a broadband signal with the first low-frequency signal.

具体的，如果步骤603得出衰减因子的值不大于给定的阈值时，说明处理后的第一高频带信号的能量已经衰减到一定的程度，处理后的第一高频带信号已经不会带来不好的影响，以后就可以保持此衰减比例。则将处理后的第一高频带信号乘以阈值，然后与第一低频带信号合成宽频带信号。Specifically, if it is obtained in step 603 that the value of the attenuation factor is not greater than the given threshold, it means that the energy of the processed first high-frequency band signal has been attenuated to a certain extent, and the processed first high-frequency band signal has no longer It will bring bad effects, and this attenuation ratio can be maintained in the future. Then the processed first high frequency band signal is multiplied by a threshold, and then synthesized with the first low frequency band signal into a broadband signal.

步骤605、如果衰减因子大于给定的阈值，则将处理后的第一高频带信号乘以衰减因子后，再与第一低频带信号合成宽频带信号。Step 605: If the attenuation factor is greater than a given threshold, multiply the processed first high-frequency signal by the attenuation factor, and then synthesize a broadband signal with the first low-frequency signal.

具体的，如果步骤603得出衰减因子的值大于给定的阈值时，则说明在此衰减因子时，处理后的第一高频带信号还有可能引起不好的听觉影响，还需要继续衰减，直到给定的阈值。则将处理后的第一高频带信号乘以衰减因子后，再与第一低频带信号合成宽频带信号。Specifically, if step 603 shows that the value of the attenuation factor is greater than the given threshold, it means that at this attenuation factor, the processed first high-frequency band signal may cause bad auditory effects, and further attenuation is required. , up to a given threshold. After the processed first high frequency band signal is multiplied by the attenuation factor, the broadband signal is synthesized with the first low frequency band signal.

步骤606、修改衰减因子，以使衰减因子减小。Step 606. Modify the attenuation factor to reduce the attenuation factor.

具体的，随着语音频信号的传输，后续的窄带语音频信号受切换前的语音频信号的影响逐渐变小，相对应的，衰减因子也应该逐渐变小。Specifically, as the voice and audio signals are transmitted, the subsequent narrowband voice and audio signals are gradually less affected by the voice and audio signals before switching, and correspondingly, the attenuation factor should also be gradually reduced.

基于上述技术方案，可选的，当窄频带语音频信号向宽频带语音频信号切换时，如图7所示，本实施例中通过步骤201获得处理后的第一高频带信号的一个实施例包括：Based on the above technical solution, optionally, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 7, an implementation of obtaining the processed first high frequency band signal through step 201 in this embodiment Examples include:

步骤701、根据已设定好的第四权重一和第四权重二进行加权处理，以计算出处理后的第一高频带信号。其中，第四权重一为第二高频带信号的权重值，第四权重二为当前帧语音频信号的第一高频带信号的权重值。Step 701. Perform weighting processing according to the set fourth weight 1 and fourth weight 2 to calculate the processed first high frequency band signal. Wherein, the fourth weight 1 is the weight value of the second high frequency band signal, and the fourth weight 2 is the weight value of the first high frequency band signal of the current frame audio signal.

具体的，在由窄频带语音频信号切换到宽频带语音频信号的过程中，由于宽频带语音频信号中的高频带信号不为空，而窄频带语音频信号对应的高频带信号为空或者处理后的高频带信号，为了使窄频带语音频信号能够平滑的切换到宽频带语音频信号，需要对宽频带语音频信号中的高频带信号进行能量衰减，以实现语音频信号的平滑的进行切换。通过第二高频带信号与第四权重一的乘积，加上第一高频带信号与第四权重二的乘积之和，所求得的加权值即为处理后的第一高频带信号。Specifically, in the process of switching from the narrowband speech and audio signal to the wideband speech and audio signal, since the high frequency signal in the wideband speech and audio signal is not empty, the high frequency signal corresponding to the narrowband speech and audio signal is Empty or processed high-frequency signal, in order to make the narrow-band speech and audio signal can be smoothly switched to the wide-band speech and audio signal, it is necessary to attenuate the energy of the high-band signal in the wide-band speech and audio signal to realize speech and audio signal to switch smoothly. Through the product of the second high frequency band signal and the fourth weight one, plus the sum of the product of the first high frequency band signal and the fourth weight two, the obtained weighted value is the processed first high frequency band signal .

步骤702、以第三权重步长为单位减小第四权重一，以第三权重步长为单位增加第四权重二，直至第四权重一等于零。其中，第四权重一和第四权重二的和为一。Step 702: Decrease the fourth weight 1 by the unit of the third weight step, increase the fourth weight 2 by the unit of the third weight step, until the fourth weight 1 is equal to zero. Wherein, the sum of the fourth weight one and the fourth weight two is one.

具体的，随着语音频信号的传输，后续的宽频带语音频信号受切换前的窄频带语音频信号的影响逐渐变小。因此，第四权重一逐渐变小，而第四权重二逐渐增大，直至第四权重一变为零，而第四权重二变为一，即传输的语音频信号一直为宽频带语音频信号。Specifically, with the transmission of the voice and audio signals, the subsequent wideband voice and audio signals are gradually less affected by the narrowband voice and audio signals before switching. Therefore, the fourth weight one gradually becomes smaller, and the fourth weight two gradually increases until the fourth weight one becomes zero, and the fourth weight two becomes one, that is, the transmitted voice and audio signal is always a broadband voice and audio signal .

同样的，如图8所示，本实施例中通过步骤201获得处理后的第一高频带信号的另一个实施例还可以包括：Similarly, as shown in FIG. 8, another embodiment of obtaining the processed first high-frequency band signal through step 201 in this embodiment may also include:

步骤801、根据已设定好的第五权重一和第五权重二进行加权处理，以计算出处理后的第一高频带信号。其中，第五权重一为已设定好的固定参数的权重值，第五权重二为当前帧语音频信号的第一高频带信号的权重值。Step 801: Perform weighting processing according to the set fifth weight 1 and fifth weight 2 to calculate the processed first high frequency band signal. Wherein, the fifth weight 1 is the weight value of the fixed parameter that has been set, and the fifth weight 2 is the weight value of the first high frequency band signal of the speech and audio signal of the current frame.

具体的，由于窄频带语音频信号的第一高频带信号为空，因此可以设定一固定参数来代替窄频带语音频信号的高频带信号，其中，该固定参数为一个大于等于零小于第一高频带信号能量的常数。通过固定参数与第五权重一的乘积，加上第一高频带信号与第五权重二的乘积之和，所求得的加权值即为处理后的第一高频带信号。Specifically, since the first high-band signal of the narrow-band speech and audio signal is empty, a fixed parameter can be set to replace the high-band signal of the narrow-band speech and audio signal, wherein the fixed parameter is one greater than or equal to zero and less than the first A constant for high-band signal energy. The weighted value obtained by adding the product of the fixed parameter and the fifth weight one to the sum of the product of the first high frequency band signal and the fifth weight two is the processed first high frequency band signal.

步骤802、以第四权重步长为单位减小第五权重一，以第四权重步长为单位增加第五权重二，直至第五权重一等于零；；其中，第五权重一和第五权重二的和为一。Step 802: Decrease the fifth weight 1 by the unit of the fourth weight step, increase the fifth weight 2 by the unit of the fourth weight step, until the fifth weight 1 is equal to zero; wherein, the fifth weight 1 and the fifth weight The sum of two is one.

具体的，随着语音频信号的传输，后续的宽频带语音频信号受切换前的窄频带语音频信号的影响逐渐变小。因此，第五权重一逐渐变小，而第五权重二逐渐增大，直至第五权重一变为零，而第五权重二变为一，即传输的语音频信号一直为真实的宽频带语音频信号。Specifically, with the transmission of the voice and audio signals, the subsequent wideband voice and audio signals are gradually less affected by the narrowband voice and audio signals before switching. Therefore, the fifth weight 1 gradually decreases, and the fifth weight 2 gradually increases until the fifth weight 1 becomes zero, and the fifth weight 2 becomes 1, that is, the transmitted voice and audio signal is always a real broadband voice signal. audio signal.

本实施例语音频信号切换方法，在语音频信号发生从窄频带语音频信号向宽频带语音频信号切换的过程中，通过对宽频带语音频信号的高频带信号进行衰减处理得到处理后的高频带信号，从而能够使切换前的窄频带语音频信号对应的高频带信号能够平滑的过渡到宽频带语音频信号所对应的处理后的高频带信号，更有利于提高用户接听音频信号的质量。In the speech and audio signal switching method of this embodiment, in the process of switching the speech and audio signal from the narrowband speech and audio signal to the wideband speech and audio signal, the processed signal is obtained by attenuating the high frequency band signal of the wideband speech and audio signal. High-band signal, so that the high-band signal corresponding to the narrow-band voice and audio signal before switching can smoothly transition to the processed high-band signal corresponding to the wide-band voice and audio signal, which is more conducive to improving the user's ability to listen to audio The quality of the signal.

其中，本实施例中的包络信息还可以通过能够代表高频带信号的其他参数代替，例如：线性预测编码(Linear Predictive Coding，以下简称：LPC)参数，或幅度参数等。Wherein, the envelope information in this embodiment may also be replaced by other parameters that can represent high-frequency band signals, such as: Linear Predictive Coding (Linear Predictive Coding, hereinafter referred to as: LPC) parameters, or amplitude parameters, etc.

本领域普通技术人员可以理解：实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成，前述的程序可以存储于一计算机可读取存储介质中，该程序在执行时，执行包括上述方法实施例的步骤；而前述的存储介质包括：ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps for realizing the above-mentioned method embodiments can be completed by hardware related to program instructions, and the aforementioned program can be stored in a computer-readable storage medium. When the program is executed, the It includes the steps of the above method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other various media that can store program codes.

图9为本发明语音频信号切换装置实施例一的结构示意图。如图9所示，本实施例音频信号切换装置，包括：处理模块91和第一合成模块92。FIG. 9 is a schematic structural diagram of Embodiment 1 of the speech and audio signal switching device of the present invention. As shown in FIG. 9 , the audio signal switching device of this embodiment includes: a processing module 91 and a first synthesis module 92 .

处理模块91用于当语音频信号出现切换时，将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理，以得到处理后的第一高频带信号。其中，M大于等于1。The processing module 91 is used to perform weighting processing on the first high-band signal of the current frame of speech and audio signal and the second high-frequency signal of the previous M frames of speech and audio signal when the speech and audio signal is switched, so as to obtain the processed first high frequency band signal. A high frequency band signal. Wherein, M is greater than or equal to 1.

第一合成模块92用于将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。The first synthesizing module 92 is used for synthesizing the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

本发明实施例的语音频信号切换装置，通过处理模块根据前M帧语音频信号中的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理，以使的第二高频带信号能平滑的过渡到处理后的第一高频带信号，从而在切换不同带宽的语音频信号过程中，使不同带宽的语音频信号的高频带信号能够平滑的过渡切换；最后，处理后的第一高频带信号与第一低频带信号通过第一合成模块合成宽频带信号，将该宽频带信号传输到用户终端，使用户享受到高质量的语音频信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换，减小了能量激变造成语音频信号的主观听觉质量差的影响，提高了用户接听语音频信号的质量。In the speech and audio signal switching device of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed by the processing module according to the second high frequency band signal in the speech and audio signal of the previous M frames, so that the first The second high frequency band signal can smoothly transition to the processed first high frequency band signal, so that in the process of switching voice and audio signals of different bandwidths, the high frequency band signals of voice and audio signals of different bandwidths can be switched smoothly; Finally, the processed first high-frequency signal and the first low-frequency signal are synthesized into a wide-band signal through the first synthesis module, and the wide-band signal is transmitted to the user terminal, so that the user can enjoy high-quality voice and audio signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, reduces the impact of poor subjective auditory quality of speech and audio signals caused by energy shocks, and improves the quality of speech and audio signals received by users.

图10为本发明语音频信号切换装置实施例二的结构示意图。如图10所示，本实施例音频信号切换装置基于上述音频信号切换装置实施例一，其区别在于：本实施例音频信号切换装置还包括：第二合成模块103。FIG. 10 is a schematic structural diagram of Embodiment 2 of the speech and audio signal switching device of the present invention. As shown in FIG. 10 , the audio signal switching device of this embodiment is based on the first embodiment of the audio signal switching device above, and the difference is that the audio signal switching device of this embodiment further includes: a second synthesis module 103 .

第二合成模块103用于当语音频信号没有出现切换时，将第一高频带信号与第一低频带信号合成宽频带信号。The second synthesizing module 103 is used for synthesizing the first high-frequency signal and the first low-frequency signal into a wide-band signal when the voice and audio signals are not switched.

本实施例音频信号切换装置，通过设置第二合成模块，可以在不发生不同带宽的语音频信号切换的前提下，由第二合成模块将当前帧的第一频带语音频信号中的第一低频带信号和第一高频带信号合成宽频带信号，从而有利于提高用户接听语音频信号的质量。In the audio signal switching device of this embodiment, by setting the second synthesis module, the second synthesis module can combine the first low-frequency audio signal in the first frequency band audio and video signal of the current frame without switching audio and audio signals of different bandwidths. The broadband signal is synthesized from the band signal and the first high frequency band signal, thereby improving the quality of the user listening to the voice and audio signals.

基于上述技术方案，可选的，当宽频带语音频信号向窄频带语音频信号切换时，如图10和图11所示，本实施例中的处理模块101包括：Based on the above technical solution, optionally, when the wideband voice and audio signal is switched to the narrowband voice and audio signal, as shown in Figure 10 and Figure 11, the processing module 101 in this embodiment includes:

预测模块1011用于预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。The prediction module 1011 is configured to predict the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal.

第一生成模块1012用于根据预测的包络信息和前M帧语音频信号的第二高频带信号对应的前M帧包络信息进行加权处理，以得到第一高频带信号对应的第一包络信息。The first generating module 1012 is used to perform weighting processing according to the predicted envelope information and the envelope information of the previous M frames corresponding to the second high-band signal of the previous M frames of speech and audio signals, so as to obtain the first high-band signal corresponding to the first An envelope of information.

第二生成模块1013用于根据第一包络信息和预测的精细结构信息，生成处理后的第一高频带信号。The second generating module 1013 is configured to generate the processed first high frequency band signal according to the first envelope information and the predicted fine structure information.

更进一步的，本实施例音频信号切换装置可以还包括：分类模块1010，用于对当前帧语音频信号的第一低频带信号进行信号分类；而预测模块1011还用于根据第一低频带信号对应的信号类型预测当前帧语音频信号的第一低频带信号对应的预测的精细结构信息和预测的包络信息。Furthermore, the audio signal switching device in this embodiment may further include: a classification module 1010, configured to perform signal classification on the first low-frequency band signal of the current frame of speech and audio signals; and the prediction module 1011 is also configured to The corresponding signal type predicts the predicted fine structure information and the predicted envelope information corresponding to the first low frequency band signal of the speech and audio signal of the current frame.

本实施例音频信号切换装置，通过预测模块预测出第一高频带信号对应的预测的精细结构信息和预测的包络信息，从而可以通过第一生成模块和第二生成模块准确的生成处理后的第一高频带信号，从而使第一高频带信号能够更加平滑的过渡到处理后的第一高频带信号，更有利于提高用户接听语音频信号的质量。另外，通过分类模块对当前帧语音频信号的第一低频带信号进行信号分类，然后，预测模块根据信号类型得到预测的精细结构信息和预测的包络信息，从而使预测的精细结构信息和预测的包络信息更加准确，使用户接听到的语音频信号的质量更高。The audio signal switching device in this embodiment uses the prediction module to predict the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal, so that the first generation module and the second generation module can accurately generate the processed The first high-frequency band signal, so that the first high-frequency band signal can transition to the processed first high-frequency band signal more smoothly, which is more conducive to improving the quality of the user's listening to the voice and audio signals. In addition, signal classification is performed on the first low-frequency band signal of the current frame speech and audio signal through the classification module, and then the prediction module obtains the predicted fine structure information and the predicted envelope information according to the signal type, so that the predicted fine structure information and the predicted The envelope information is more accurate, so that the quality of the voice and audio signals received by the user is higher.

基于上述技术方案，可选的，如图10和图12所示，本实施例中的第一合成模块102包括：Based on the above technical solution, optionally, as shown in Figure 10 and Figure 12, the first synthesis module 102 in this embodiment includes:

第一判断模块1021用于根据当前帧语音频信号与切换前一帧的语音频信号，判断处理后的第一高频带信号是否需要衰减。The first judging module 1021 is configured to judge whether the processed first high frequency band signal needs to be attenuated according to the speech and audio signal of the current frame and the speech and audio signal of the previous frame before switching.

第三合成模块1022用于若第一判断模块1021得出处理后的第一高频带信号不需要衰减，将处理后的第一高频带信号与第一低频带信号合成宽频带信号。The third synthesis module 1022 is used for synthesizing the processed first high frequency signal and the first low frequency signal into a broadband signal if the first judging module 1021 finds that the processed first high frequency signal does not need to be attenuated.

第二判断模块1023用于若第一判断模块1021得出处理后的第一高频带信号需要衰减，判断处理后的第一高频带信号对应的衰减因子是否大于给定的阈值。The second judging module 1023 is configured to judge whether the attenuation factor corresponding to the processed first high-frequency signal is greater than a given threshold if the first judging module 1021 finds that the processed first high-frequency signal needs to be attenuated.

第四合成模块1024用于若第二判断模块1023得出衰减因子不大于给定的阈值，则将处理后的第一高频带信号乘以阈值，然后与第一低频带信号合成宽频带信号。The fourth synthesizing module 1024 is used for if the second judging module 1023 obtains that the attenuation factor is not greater than a given threshold value, then multiply the processed first high frequency band signal by the threshold value, and then synthesize the broadband signal with the first low frequency band signal .

第五合成模块1025用于若第二判断模块1023得出衰减因子大于给定的阈值，则将处理后的第一高频带信号乘以衰减因子后，再与第一低频带信号合成宽频带信号。The fifth synthesizing module 1025 is used for if the second judging module 1023 obtains that the attenuation factor is greater than a given threshold, then multiply the processed first high-frequency band signal by the attenuation factor, and then synthesize the broadband with the first low-frequency band signal Signal.

第一修改模块1026用于修改衰减因子，以使衰减因子减小。The first modification module 1026 is used for modifying the attenuation factor, so as to reduce the attenuation factor.

其中，衰减因子的初始值为一；阈值小于一并大于等于零。Wherein, the initial value of the attenuation factor is one; the threshold value is less than one and greater than or equal to zero.

本实施例音频信号切换装置，通过对处理后的第一高频带信号进行衰减处理，可以使当前帧语音频信号经过处理后得到的宽频带信号更加准确，更有利于提高用户接听音频信号的质量。The audio signal switching device of this embodiment, by attenuating the processed first high-frequency band signal, can make the broadband signal obtained after the current frame speech and audio signal is processed more accurate, which is more conducive to improving the user's ability to listen to the audio signal. quality.

基于上述技术方案，可选的，当窄频带语音频信号向宽频带语音频信号切换时，如图10和图13a所示，本实施例中的处理模块101包括：Based on the above technical solution, optionally, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 10 and Figure 13a, the processing module 101 in this embodiment includes:

第一计算模块1011a用于根据已设定好的第四权重一和第四权重二进行加权处理，以计算出处理后的第一高频带信号；其中，第四权重一为第二高频带信号的权重值，第四权重二为第一高频带信号的权重值；The first calculation module 1011a is used to perform weighting processing according to the set fourth weight one and fourth weight two to calculate the processed first high frequency band signal; wherein, the fourth weight one is the second high frequency The weight value of the band signal, the fourth weight two is the weight value of the first high frequency band signal;

第二修改模块1012a用于以第三权重步长为单位减小第四权重一，以第三权重步长为单位增加第四权重二，直至第四权重一等于零；其中，第四权重一和第四权重二的和为一。The second modifying module 1012a is used to reduce the fourth weight 1 with the third weight step as a unit, and increase the fourth weight 2 with the third weight step as a unit, until the fourth weight 1 is equal to zero; wherein, the fourth weight 1 and The sum of the fourth weight two is one.

同样的，当窄频带语音频信号向宽频带语音频信号切换时，如图10和图13b所示，本实施例中的处理模块101可以还包括：Similarly, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 10 and Figure 13b, the processing module 101 in this embodiment may further include:

第二计算模块1011b用于根据已设定好的第五权重一和第五权重二进行加权处理，以计算出处理后的第一高频带信号；其中，第五权重一为已设定好的固定参数的权重值，第五权重二为第一高频带信号的权重值；The second calculation module 1011b is used to perform weighting processing according to the set fifth weight 1 and the fifth weight 2 to calculate the processed first high-frequency band signal; wherein, the fifth weight 1 is set The weight value of the fixed parameter, the fifth weight two is the weight value of the first high frequency band signal;

第三修改模块1012b用于以第四权重步长为单位减小第五权重一，以第四权重步长为单位增加第五权重二，直至第五权重一等于零；；其中，第五权重一和第五权重二的和为一；其中，固定参数为一个大于等于零并小于第一高频带信号能量值的常数。The third modification module 1012b is used to reduce the fifth weight 1 in units of the fourth weight step size, increase the fifth weight 2 in units of the fourth weight step size, until the fifth weight 1 is equal to zero; wherein, the fifth weight 1 The sum of the fifth weight 2 and the fifth weight 2 is one; wherein, the fixed parameter is a constant greater than or equal to zero and less than the energy value of the first high frequency band signal.

本实施例语音频信号切换装置，在语音频信号发生从窄频带语音频信号向宽频带语音频信号切换的过程中，通过对宽频带语音频信号的高频带信号进行衰减处理得到处理后的高频带信号，从而能够使切换前的窄频带语音频信号对应的高频带信号能够平滑的过渡到宽频带语音频信号所对应的处理后的高频带信号，更有利于提高用户接听音频信号的质量。The speech and audio signal switching device of this embodiment, in the process of switching the speech and audio signal from the narrowband speech and audio signal to the wideband speech and audio signal, obtains the processed signal by attenuating the high frequency band signal of the wideband speech and audio signal High-band signal, so that the high-band signal corresponding to the narrow-band voice and audio signal before switching can smoothly transition to the processed high-band signal corresponding to the wide-band voice and audio signal, which is more conducive to improving the user's ability to listen to audio The quality of the signal.

最后应说明的是：以上实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent replacements are made to some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the various embodiments of the present invention.

Claims

1. A voice and audio signal switching method, characterized in that, comprising:

When the voice and audio signal is switched, the first high frequency band signal of the current frame voice and audio signal and the second high frequency band signal of the previous M frames of voice and audio signal are weighted to obtain the processed first high frequency band signal ; Among them, M is greater than or equal to 1;

Combining the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

2. The speech and audio signal switching method according to claim 1, is characterized in that, also comprises:

When the speech and audio signals do not switch, the first high frequency band signal and the first low frequency band signal are synthesized into the broadband signal.

3. The voice and audio signal switching method according to claim 1 or 2, wherein, when the broadband voice and audio signal is switched to the narrowband voice and audio signal;

The first high-band signal of the current frame speech and audio signal and the second high-band signal of the previous M frames of speech and audio signal are weighted to obtain the processed first high-band signal as follows:

Predicting the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal of the speech and audio signal of the current frame;

performing weighting processing on the predicted envelope information and the envelope information of the previous M frames corresponding to the second high-band signal of the speech and audio signal of the previous M frames, so as to obtain the first envelope information;

The processed first high frequency band signal is generated according to the first envelope information and the predicted fine structure information.

4. The voice-audio signal switching method according to claim 3, wherein the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal are specifically:

performing signal classification on the first low frequency band signal of the speech and audio signal of the current frame;

Predicting the predicted fine structure information and the predicted envelope information according to the signal type corresponding to the first low-frequency signal.

5. voice signal switching method according to claim 3, is characterized in that, the envelope signal of described prediction and the first M frame corresponding to the second high frequency band signal of described preceding M frame voice signal The envelope information is weighted to obtain the first envelope information corresponding to the first high-frequency band signal, specifically:

According to the first low frequency band signal and the low frequency band signal of the speech audio signal of the previous N frames, calculate the correlation coefficient between the first low frequency band signal and the low frequency band signal of the speech audio signal of the previous N frames; Among them, N is greater than or equal to 1;

judging whether the correlation coefficient is within a given first threshold range;

If the correlation coefficient is not within the first threshold range, perform weighting processing according to the set first weight one and first weight two to calculate the first envelope information; wherein, the The first weight one is the weight value of the envelope information of the previous frame corresponding to the high frequency band signal of the voice and audio signal of the previous frame, and the first weight two is the weight value of the envelope information;

If the correlation coefficient is within the first threshold range, perform weighting processing according to the set second weight 1 and second weight 2 to calculate transition envelope information; wherein, the second weight 1 It is the weight value of the envelope information before the switch corresponding to the high frequency band signal of the L frame voice and audio signal before the switch, and the second weight two is the weight value of the envelope information of the M frame before; wherein, L is greater than equal to 1;

Decreasing the second weight one by the unit of the first weight step, and increasing the second weight two by the unit of the first weight step;

judging whether the set third weight one is greater than the first weight one;

If the third weight 1 is not greater than the first weight 1, perform weighting processing according to the set first weight 1 and the first weight 2 to calculate the first envelope information ;

If the third weight 1 is greater than the first weight 1, perform weighting processing according to the set third weight 1 and third weight 2 to calculate the first envelope information; wherein, the first 3. Weight 1 is the weight value of the transition envelope information, and the third weight 2 is the weight value of the predicted envelope information;

Decreasing the third weight 1 in units of the second weight step, increasing the third weight 2 in units of the second weight step, until the third weight 1 is equal to zero;

Wherein, the sum of the first weight one and the first weight two is one, the sum of the second weight one and the second weight two is one, the third weight one and the third weight The sum of two is one; the initial value of the third weight one is greater than the initial value of the first weight one; the first weight one and the first weight two are fixed constants.

6. The speech and audio signal switching method according to claim 3, characterized in that, the envelope signal of said prediction and the corresponding front M frame envelope of the second high frequency band signal of the previous M frame speech and audio signal The information is weighted to obtain the first envelope information corresponding to the first high-frequency band signal, which is specifically:

Calculate the correlation coefficient between the first low frequency band signal and the low frequency signal of the speech and audio signal of the previous frame according to the first low frequency band signal of the current frame and the low frequency signal of the speech and audio signal of the previous frame ;

judging whether the correlation coefficient is within a given second threshold range;

If the correlation coefficient is not within the second threshold range, perform weighting processing according to the set first weight 1 and first weight 2 to calculate the first envelope information; wherein, the The first weight one is the weight value of the envelope information of the previous frame corresponding to the high frequency band signal of the speech and audio signal of the previous frame, and the first weight two is the weight value of the predicted envelope information; The first weight one and the first weight two are fixed constants;

If the correlation coefficient is within the second threshold range, it is judged whether the set second weight one is greater than the first weight one; wherein, the second weight one is the speech and audio signal of the previous frame before switching The weight value of the envelope information before switching corresponding to the high frequency band signal,

If the second weight one is not greater than the first weight one, performing weighting processing according to the set first weight one and first weight two to calculate the first envelope information;

If the second weight 1 is greater than the first weight 1, perform weighting processing according to the second weight 1 and the set second weight 2 to calculate the first envelope information; wherein, The second weight two is the weight value of the predicted envelope information;

Decreasing the second weight one by the unit of the second weight step, increasing the second weight two by the unit of the second weight step;

Wherein, the sum of the first weight one and the first weight two is one, and the sum of the second weight one and the second weight two is one; the initial value of the second weight one is greater than the The initial value of the first weight one.

7. The voice-audio signal switching method according to claim 3, wherein the first low-band signal of the processed first high-band signal and the current frame voice-audio signal is synthesized into a broadband The signals are specifically:

According to the voice and audio signal of the current frame and the voice and audio signal of the previous frame before switching, it is judged whether the processed first high frequency band signal needs to be attenuated;

If attenuation is not required, combining the processed first high-frequency signal with the first low-frequency signal to synthesize the broadband signal;

If attenuation is required, judging whether the attenuation factor corresponding to the processed first high-frequency band signal is greater than a given threshold;

If the attenuation factor is not greater than the given threshold, multiplying the processed first high-frequency signal by a threshold, and then synthesizing the broadband signal with the first low-frequency signal;

If the attenuation factor is greater than the given threshold, multiplying the processed first high-frequency signal by the attenuation factor, and then synthesizing the broadband signal with the first low-frequency signal;

modifying the attenuation factor such that the attenuation factor is reduced;

Wherein, the initial value of the attenuation factor is one; the threshold is less than one and greater than or equal to zero.

8. The voice and audio signal switching method according to claim 1 or 2, characterized in that,

When the narrowband speech and audio signal is switched to the wideband speech and audio signal;

The first high frequency band signal of the current frame speech and audio signal and the second high frequency band signal of the previous M frame speech and audio signals are weighted to obtain the processed first high frequency band signal as follows:

Perform weighting processing according to the set fourth weight 1 and fourth weight 2 to calculate the processed first high-frequency band signal; wherein, the fourth weight 1 is the second high-frequency band A weight value of the signal, the fourth weight two being the weight value of the first high frequency band signal;

Decreasing the fourth weight 1 with the third weight step as a unit, increasing the fourth weight 2 with the third weight step as a unit, until the fourth weight 1 is equal to zero; wherein, the fourth weight 1 and the sum of the fourth weight two is one.

9. The voice and audio signal switching method according to claim 1 or 2, wherein, when the narrowband voice and audio signal is switched to the wideband voice and audio signal;

Perform weighting processing according to the set fifth weight 1 and fifth weight 2 to calculate the processed first high-frequency band signal; wherein, the fifth weight 1 is a set fixed parameter The weight value of the fifth weight two is the weight value of the first high frequency band signal;

Decreasing the fifth weight 1 with the fourth weight step as a unit, increasing the fifth weight 2 with the fourth weight step as a unit, until the fifth weight 1 is equal to zero; wherein, the fifth weight 1 and the sum of the fifth weight two is one;

Wherein, the fixed parameter is a constant greater than or equal to zero and less than the energy value of the first high frequency band signal.

10. A voice and audio signal switching device, characterized in that, comprising:

The processing module is used for weighting the first high-band signal of the current frame of speech and audio signals and the second high-band signal of the previous M frames of speech and audio signals when the speech and audio signals are switched, so as to obtain the processed first A high frequency band signal; wherein, M is greater than or equal to 1;

The first synthesizing module is configured to synthesize the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.

11. The voice and audio signal switching device according to claim 10, further comprising:

The second synthesizing module is used for synthesizing the wideband signal with the first high frequency band signal and the first low frequency band signal when the voice and audio signal is not switched.

12. The speech and audio signal switching device according to claim 10 or 11, wherein, when the broadband speech and audio signal is switched to the narrowband speech and audio signal;

The processing modules include:

A prediction module, configured to predict the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal of the current frame speech and audio signal;

The first generation module is configured to perform weight processing on the predicted envelope information and the envelope information of the previous M frames corresponding to the second high frequency band signal of the speech and audio signal of the previous M frames, so as to obtain the first high First envelope information corresponding to the frequency band signal;

A second generating module, configured to generate the processed first high frequency band signal according to the first envelope information and the predicted fine structure information.

13. The speech and audio signal switching device according to claim 12, further comprising: a classification module for performing signal classification on the first low frequency band signal of the current frame speech and audio signal;

The prediction module is further configured to predict the predicted fine structure information and the predicted envelope information according to the signal type corresponding to the first low frequency band signal.

14. The speech and audio signal switching device according to claim 12, wherein the first synthesis module comprises:

The first judging module is used to judge whether the processed first high frequency band signal needs to be attenuated according to the speech and audio signal of the current frame and the speech and audio signal of the previous frame before switching;

A third synthesis module, configured to combine the processed first high frequency signal with the first low frequency signal if the first judging module obtains that the processed first high frequency signal does not need to be attenuated signal synthesizing the broadband signal;

The second judging module is used for judging whether the attenuation factor corresponding to the processed first high-frequency band signal is greater than a given value if the first judging module obtains that the processed first high-frequency band signal needs to be attenuated the threshold;

A fourth synthesis module, configured to multiply the processed first high frequency band signal by a threshold if the attenuation factor obtained by the second judging module is not greater than the given threshold, and then combine it with the combining the first low-band signal with the wide-band signal;

The fifth synthesis module is configured to multiply the processed first high-frequency band signal by the attenuation factor if the attenuation factor is greater than the given threshold value by the second judging module, and then combine with the attenuation factor Synthesizing the broadband signal with the first low-band signal;

A first modification module, configured to modify the attenuation factor, so that the attenuation factor is reduced;

15. The speech and audio signal switching device according to claim 10 or 11, wherein when the narrowband speech and audio signal is switched to the wideband speech and audio signal;

The processing modules include:

The first calculation module is configured to perform weighting processing according to the set fourth weight one and fourth weight two, so as to calculate the processed first high-frequency band signal; wherein, the fourth weight one is The weight value of the second high frequency band signal, the fourth weight two is the weight value of the first high frequency band signal;

The second modifying module is configured to reduce the fourth weight one by the unit of the third weight step, and increase the fourth weight two by the unit of the third weight step until the fourth weight one is equal to zero; wherein , the sum of the fourth weight one and the fourth weight two is one.

16. The speech and audio signal switching device according to claim 13, wherein, when the narrowband speech and audio signal is switched to the wideband speech and audio signal; the processing module includes:

The second calculation module is configured to perform weighting processing according to the set fifth weight 1 and fifth weight 2 to calculate the processed first high-frequency band signal; wherein, the fifth weight 1 is The weight value of the fixed parameter that has been set, the fifth weight 2 is the weight value of the first high frequency band signal;

A third modifying module, configured to decrease the fifth weight 1 by the unit of the fourth weight step, increase the fifth weight 2 by the unit of the fourth weight step, until the fifth weight 1 is equal to zero; Wherein, the sum of the fifth weight 1 and the fifth weight 2 is one; wherein, the fixed parameter is a constant greater than or equal to zero and less than the energy value of the first high frequency band signal.