CN101964189B - Audio signal switching method and device - Google Patents
Audio signal switching method and device Download PDFInfo
- Publication number
- CN101964189B CN101964189B CN2010101634063A CN201010163406A CN101964189B CN 101964189 B CN101964189 B CN 101964189B CN 2010101634063 A CN2010101634063 A CN 2010101634063A CN 201010163406 A CN201010163406 A CN 201010163406A CN 101964189 B CN101964189 B CN 101964189B
- Authority
- CN
- China
- Prior art keywords
- weight
- signal
- frequency band
- speech
- audio signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Telephone Function (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuits Of Receivers In General (AREA)
Abstract
本发明提供一种语音频信号切换方法及装置。语音频信号切换方法,包括:当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号;其中,M大于等于1;将所述处理后的第一高频带信号与所述当前帧语音频信号的第一低频带信号合成宽频带信号。通过根据前M帧语音频信号的第二高频带信号对当前帧的第一高频带信号进行处理,以使前M帧语音频信号的第二高频带信号能平滑过渡到处理后的第一高频带信号,从而能够平滑的进行不同带宽的语音频信号切换,提高了用户接听语音频信号的质量。
The invention provides a voice and audio signal switching method and device. The voice and audio signal switching method includes: when the voice and audio signal is switched, weighting the first high frequency band signal of the current frame of voice and audio signal and the second high frequency band signal of the previous M frames of voice and audio signal, so as to be processed The processed first high frequency band signal; wherein, M is greater than or equal to 1; the processed first high frequency band signal and the first low frequency band signal of the current frame speech and audio signal are synthesized into a broadband signal. Process the first high-band signal of the current frame according to the second high-band signal of the speech and audio signal of the previous M frames, so that the second high-band signal of the speech and audio signal of the previous M frames can smoothly transition to the processed The first high-frequency band signal can smoothly switch voice and audio signals of different bandwidths, improving the quality of the user's listening to voice and audio signals.
Description
技术领域 technical field
本发明实施例涉及通信技术领域,尤其涉及一种语音频信号切换方法及装置。Embodiments of the present invention relate to the field of communication technologies, and in particular, to a voice and audio signal switching method and device.
背景技术 Background technique
目前,语音频信号在网络状态传输过程中,由于网络状态的不同,网络会对从编码端传输到网络的语音频信号的码流做不同码率的截断,从而解码端就会根据截断后的码流解码出不同带宽的语音频信号。At present, during the transmission of voice and audio signals in the network state, due to the different network conditions, the network will truncate the code stream of the voice and audio signal transmitted from the encoding end to the network at different bit rates, so that the decoding end will use the truncated The code stream decodes voice and audio signals with different bandwidths.
在现有技术中,由于网络中传输的语音频信号的带宽不同,在语音频信号传输过程中,存在窄频带语音频信号向宽频带语音频信号切换,以及宽频带语音频信号向窄频带语音频信号切换的现象。本发明中提到的窄频带信号为通过上采样和低通滤波,切换为只有低频带成分而高频带成分为空的宽频带信号,而宽频带语音频信号既有低频带信号成分又有高频带信号成分。In the prior art, due to the different bandwidths of voice and audio signals transmitted in the network, during the transmission of voice and audio signals, there are switching from narrowband voice and audio signals to wideband voice and audio signals, and switching from wideband voice and audio signals to narrowband voice signals. A phenomenon in which audio signals switch. The narrow-band signal mentioned in the present invention is through up-sampling and low-pass filtering, is switched to the wide-band signal that has only low-band components and the high-band components are empty, and the wide-band voice and audio signals have both low-band signal components and High frequency band signal components.
在实现本发明过程中,发明人发现现有技术中至少存在如下问题:由于窄频带语音频信号与宽频带语音频信号之间相差高频带信号中的信息,在切换不同带宽的语音频信号时,会出现音频信号能量激变的现象,从而会导致用户听觉上感到不舒服,造成用户接听音频信号的质量变差。In the process of realizing the present invention, the inventor has found that there are at least the following problems in the prior art: due to the difference in the information in the high-band signal between the narrowband speech and audio signal and the wideband speech and audio signal, when switching speech and audio signals of different bandwidths , there will be a phenomenon of sudden change in the energy of the audio signal, which will cause the user to feel uncomfortable in hearing, and cause the quality of the user to listen to the audio signal to deteriorate.
发明内容 Contents of the invention
本发明实施例提供一种语音频信号切换方法及装置,实现平滑的进行不同带宽的语音频信号切换,以提高用户接听音频信号的质量。Embodiments of the present invention provide a method and device for switching voice and audio signals, so as to realize smooth switching of voice and audio signals of different bandwidths, so as to improve the quality of audio signals listened to by users.
本发明实施例提供一种语音频信号切换方法,包括:An embodiment of the present invention provides a voice and audio signal switching method, including:
当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号;其中,M大于等于1;When the voice and audio signal is switched, the first high frequency band signal of the current frame voice and audio signal and the second high frequency band signal of the previous M frames of voice and audio signal are weighted to obtain the processed first high frequency band signal ; Among them, M is greater than or equal to 1;
将所述处理后的第一高频带信号与所述当前帧语音频信号的第一低频带信号合成宽频带信号。Combining the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.
本发明实施例提供一种音频信号切换装置,包括:An embodiment of the present invention provides an audio signal switching device, including:
处理模块,用于当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号;其中,M大于等于1;The processing module is used for weighting the first high-band signal of the current frame of speech and audio signals and the second high-band signal of the previous M frames of speech and audio signals when the speech and audio signals are switched, so as to obtain the processed first A high frequency band signal; wherein, M is greater than or equal to 1;
第一合成模块,用于将所述处理后的第一高频带信号与所述当前帧语音频信号的第一低频带信号合成宽频带信号。The first synthesizing module is configured to synthesize the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.
本发明实施例的语音频信号切换方法及装置,通过根据前M帧的语音频信号的第二高频带信号对当前帧语音频信号的第一高频带信号进行处理,以使前M帧的语音频信号的第二高频带信号能平滑过渡到处理后的第一高频带信号,并将处理后的第一高频带信号与第一低频带信号合成宽频带信号,从而在切换不同带宽的语音频信号过程中,能够平滑的进行不同带宽的语音频信号切换,减小了能量激变造成语音频信号的主观听觉质量差的影响,提高了用户接听语音频信号的质量。In the speech and audio signal switching method and device of the embodiment of the present invention, the first high frequency band signal of the speech and audio signal of the current frame is processed according to the second high frequency band signal of the speech and audio signal of the previous M frames, so that the previous M frames The second high-frequency band signal of the speech and audio signal can smoothly transition to the processed first high-frequency band signal, and the processed first high-frequency band signal and the first low-frequency band signal are synthesized into a wide-band signal, thereby switching In the process of voice and audio signals of different bandwidths, it can smoothly switch between voice and audio signals of different bandwidths, which reduces the impact of poor subjective auditory quality of voice and audio signals caused by energy shocks, and improves the quality of users listening to voice and audio signals.
附图说明 Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description These are some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained according to these drawings without any creative effort.
图1为本发明语音频信号切换方法实施例一流程图;Fig. 1 is a flow chart of Embodiment 1 of the voice and audio signal switching method of the present invention;
图2为本发明语音频信号切换方法实施例二流程图;Fig. 2 is the flow chart of Embodiment 2 of the voice and audio signal switching method of the present invention;
图3为图2中步骤201一个实施例的流程图;Fig. 3 is a flowchart of an embodiment of
图4为图3中步骤302一个实施例的流程图;Fig. 4 is a flowchart of an embodiment of step 302 in Fig. 3;
图5为图3中步骤302另一个实施例的流程图二;FIG. 5 is a second flowchart of another embodiment of step 302 in FIG. 3;
图6为图2中步骤202一个实施例的流程图;Fig. 6 is a flowchart of an embodiment of
图7为图2中步骤201另一个实施例的流程图二;FIG. 7 is a second flowchart of another embodiment of
图8为图2中步骤201另一个实施例的流程图三;FIG. 8 is a third flowchart of another embodiment of
图9为本发明语音频信号切换装置实施例一的结构示意图;FIG. 9 is a schematic structural diagram of Embodiment 1 of the voice and audio signal switching device of the present invention;
图10为本发明语音频信号切换装置实施例二的结构示意图;FIG. 10 is a schematic structural diagram of Embodiment 2 of the voice and audio signal switching device of the present invention;
图11为本发明语音频信号切换装置实施例二中的处理模块的结构示意图一;FIG. 11 is a first structural schematic diagram of the processing module in Embodiment 2 of the voice and audio signal switching device of the present invention;
图12为本发明语音频信号切换装置实施例二中的第一模块的结构示意图;12 is a schematic structural diagram of the first module in Embodiment 2 of the voice and audio signal switching device of the present invention;
图13a为本发明语音频信号切换装置实施例二中的处理模块的结构示意图二;Fig. 13a is a second structural schematic diagram of the processing module in the second embodiment of the voice and audio signal switching device of the present invention;
图13b为本发明语音频信号切换装置实施例二中的处理模块的结构示意图三。Fig. 13b is a third structural schematic diagram of the processing module in the second embodiment of the voice and audio signal switching device of the present invention.
具体实施方式 Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.
图1为本发明语音频信号切换方法实施例一流程图。如图1所示,本实施例语音频信号切换方法,当语音频信号出现切换时,对切换帧之后的每一帧采用如下方式进行处理:FIG. 1 is a flow chart of Embodiment 1 of the voice and audio signal switching method of the present invention. As shown in Figure 1, in the voice and audio signal switching method of this embodiment, when the voice and audio signal is switched, each frame after the switching frame is processed in the following manner:
步骤101、当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号;其中,M大于等于1。
步骤102、将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。Step 102: Synthesize the processed first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a wide frequency band signal.
本发明提供的实施例中的前M帧语音频信号指当前帧之前的M帧语音频信号。切换前L帧语音频信号指出现语音频信号切换时切换帧之前的L帧语音频信号。当前语音帧为宽频带信号而前一帧语音帧为窄频带信号;或者当前语音帧为窄频带信号而前一帧语音帧为宽频带信号,则语音频信号出现切换,当前语音帧为切换帧。The previous M frames of speech and audio signals in the embodiments provided by the present invention refer to M frames of speech and audio signals before the current frame. The L frames of speech and audio signals before switching refer to the L frames of speech and audio signals before the switching frame when the speech and audio signals are switched. The current speech frame is a wideband signal and the previous speech frame is a narrowband signal; or the current speech frame is a narrowband signal and the previous speech frame is a broadband signal, then the speech and audio signals are switched, and the current speech frame is a switching frame .
本发明实施例的语音频信号切换方法,通过根据前M帧语音频信号中的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理,以使的前M帧语音频信号中的第二高频带信号能平滑的过渡到处理后的第一高频带信号,从而在切换不同带宽的语音频信号过程中,使不同带宽的语音频信号的高频带信号能够平滑的过渡切换;最后,处理后的第一高频带信号与第一低频带信号合成宽频带信号,将该宽频带信号传输到用户终端,使用户享受到高质量的语音频信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换,减小了能量激变造成语音频信号的主观听觉质量差的影响,提高了用户接听语音频信号的质量。In the speech and audio signal switching method of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed according to the second high frequency band signal in the speech and audio signal of the previous M frames, so that the first M frames The second high frequency band signal in the speech and audio signal can smoothly transition to the processed first high frequency band signal, so that in the process of switching speech and audio signals of different bandwidths, the high frequency band signals of speech and audio signals of different bandwidths Smooth transition switching is possible; finally, the processed first high-frequency signal and the first low-frequency signal synthesize a wide-band signal, and transmit the wide-band signal to the user terminal, so that the user can enjoy high-quality voice and audio signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, reduces the impact of poor subjective auditory quality of speech and audio signals caused by energy shocks, and improves the quality of speech and audio signals received by users.
图2为本发明语音频信号切换方法实施例二流程图。如图2所示,本实施例语音频信号切换方法,包括:FIG. 2 is a flow chart of Embodiment 2 of the voice and audio signal switching method of the present invention. As shown in Figure 2, the voice and audio signal switching method of this embodiment includes:
步骤200、当语音频信号没有出现切换时,将当前帧语音频信号的第一高频带信号与第一低频带信号合成宽频带信号。Step 200: When the speech and audio signal does not switch, synthesize the first high frequency band signal and the first low frequency band signal of the speech and audio signal of the current frame into a broadband signal.
具体的,本实施例中的第一频带语音频信号可以是宽频带语音频信号或者是窄频带语音频信号。在语音频信号的传输过程中,当第一频带语音频信号不发生切换时,分以下两种情况进行处理:一、若第一频带语音频信号为宽频带语音频信号时,则将宽频带语音频信号中的低频带信号和高频带信号合成宽频带信号;二、若第一频带语音频信号为窄频带语音频信号时,则将窄频带语音频信号中的低频带信号和高频带信号合成宽频带信号,此时,虽然为宽频带信号,但高频带是空,没有信息。Specifically, the first frequency-band speech and audio signal in this embodiment may be a wide-band speech and audio signal or a narrow-band speech and audio signal. In the transmission process of the speech and audio signal, when the first frequency band speech and audio signal does not switch, the following two situations are processed: 1. If the first frequency band speech and audio signal is a wideband speech and audio signal, the wideband The low-band signal and the high-band signal in the voice-audio signal are synthesized into a broadband signal; two, if the first-band voice-audio signal is a narrow-band voice-audio signal, then the low-band signal and the high-frequency signal in the narrow-band voice-audio signal are The band signal is synthesized into a broadband signal. At this time, although it is a broadband signal, the high frequency band is empty and has no information.
步骤201、当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号。其中,M大于等于1。
具体的,当出现不同带宽的语音频信号进行切换时,根据前M帧语音频信号的第二高频带信号,对当前帧语音频信号的第一高频带信号进行处理,以使得前M帧语音频信号的第二高频带信号能平滑过渡到处理后的第一高频带信号,例如,当宽频带语音频信号切换到窄频带语音频信号,由于窄频带语音频信号对应的的高频带信号为空,因此为了使宽频带语音频信号平滑切换到窄频带语音频信号,需要恢复窄频带语音频信号对应的的高频带信号的成分,而当窄频带语音频信号切换到宽频带语音频信号,由于宽频带语音频信号中的高频带信号不为空,因此为了使窄频带语音频信号平滑切换到宽频带语音频信号,需要减弱切换后连续多帧宽频带语音频信号中的高频带信号的能量,使得宽频带语音频信号的高频带信号逐渐过渡到真实的高频带信号。通过步骤201对当前帧语音频信号进行处理,使得不同带宽的语音频信号中的高频带信号能够平滑的进行过渡,解决了宽频带语音频信号与窄频带语音频信号之间进行切换时,由于能量激变而造成用户的听觉不舒适,从而使用户接收到高质量的音频信号。其中,为了简化得到处理后的第一高频带信号的过程,可以将第一高频带信号与前M帧语音频信号的第二高频带信号直接进行加权处理,处理后获得的结果即为处理后的第一高频带信号。Specifically, when voice and audio signals of different bandwidths are switched, the first high frequency band signal of the voice and audio signal of the current frame is processed according to the second high frequency band signal of the voice and audio signal of the previous M frames, so that the first M The second high-band signal of the frame speech and audio signal can smoothly transition to the processed first high-band signal. For example, when the wideband speech and audio signal is switched to the narrowband speech and audio signal, due to the corresponding The high-band signal is empty, so in order to smoothly switch the wide-band speech and audio signal to the narrow-band speech and audio signal, it is necessary to restore the components of the high-band signal corresponding to the narrow-band speech and audio signal, and when the narrow-band speech and audio signal is switched to For wideband voice and audio signals, since the high frequency band signal in wideband voice and audio signals is not empty, in order to smoothly switch narrowband voice and audio signals to wideband voice and audio signals, it is necessary to weaken the continuous multi-frame wideband voice and audio after switching The energy of the high-frequency band signal in the signal makes the high-band signal of the broadband speech and audio signal gradually transition to the real high-band signal. By
步骤202、将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。Step 202: Synthesize the processed first high-frequency signal and the first low-frequency signal of the speech and audio signal of the current frame into a wide-band signal.
具体的,当前帧语音频信号通过步骤201处理后,使前M帧语音频信号的第二高频带信号能够平滑的过渡到当前帧的处理后的第一高频带信号,然后通过步骤202将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号,使用户接收到的语音频信号均为宽频带语音频信号,实现了不同带宽的语音频信号平稳的切换,有利于提高用户接听音频信号的质量。Specifically, after the current frame speech and audio signal is processed by
本发明实施例的语音频信号切换方法,通过根据前M帧语音频信号的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理,以使的前M帧语音频信号的第二高频带信号能平滑的过渡到处理后的第一高频带信号,从而在切换不同带宽的语音频信号过程中,使不同带宽的语音频信号的高频带信号能够平滑的过渡切换;最后,处理后的第一高频带信号与第一低频带信号合成宽频带信号,该宽频带信号传输到用户终端,使用户享受到高质量的语音信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换,减小了能量激变造成音频信号的主观听觉质量差的影响,提高了用户接听音频信号的质量。另外,通过在不发生不同带宽的语音频信号切换时,将当前帧语音频信号的第一高频带信号和第一低频带信号合成宽频带信号,使用户获得高质量的音频信号。In the speech and audio signal switching method of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed according to the second high frequency band signal of the speech and audio signal of the previous M frames, so that the speech of the previous M frames The second high frequency band signal of the audio signal can smoothly transition to the processed first high frequency band signal, so that the high frequency band signals of the voice and audio signals of different bandwidths can be smoothed during the process of switching voice and audio signals of different bandwidths Transition switching; Finally, the processed first high-frequency signal and the first low-frequency signal synthesize a wide-band signal, and the wide-band signal is transmitted to the user terminal, so that the user can enjoy high-quality voice signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, which reduces the impact of poor subjective auditory quality of audio signals caused by energy shocks, and improves the quality of audio signals listened to by users. In addition, by synthesizing the first high frequency band signal and the first low frequency band signal of the current frame voice and audio signal into a broadband signal when switching between voice and audio signals of different bandwidths does not occur, users can obtain high-quality audio signals.
基于上述技术方案,可选的,当宽频带语音频信号向窄频带语音频信号转换时,如图3所示,本实施例中的步骤201包括:Based on the above technical solution, optionally, when the wideband voice and audio signal is converted to the narrowband voice and audio signal, as shown in Figure 3,
步骤301、预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。Step 301. Predict predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal.
具体的,语音频信号可以分解为精细结构信息和包络信息两部分,从而可以根据精细结构信息和包络信息恢复语音频信号。在由宽频带语音频信号切换到窄频带语音频信号的过程中,由于窄频带语音频信号中只有低频带信号,其对应的高频带信号为空,为了使宽频带语音频信号能够平滑的切换到窄频带语音频信号,需要恢复出当前的窄频带语音频信号所需要的高频带信号,以实现语音频信号的平滑切换。本实施例中的步骤301将预测窄频带语音频信号中的第一高频带信号对应的预测的精细结构信息和预测的包络信息。Specifically, the speech and audio signal can be decomposed into fine structure information and envelope information, so that the speech and audio signal can be restored according to the fine structure information and the envelope information. In the process of switching from wideband speech and audio signals to narrowband speech and audio signals, since there are only low-band signals in the narrowband speech and audio signals, the corresponding high-band signals are empty. In order to make the wideband speech and audio signals smooth To switch to the narrow-band speech and audio signals, it is necessary to restore the high-band signals required by the current narrow-band speech and audio signals, so as to realize smooth switching of the speech and audio signals. Step 301 in this embodiment is to predict the predicted fine structure information and the predicted envelope information corresponding to the first high-band signal in the narrow-band speech and audio signal.
为了更加准确的预测当前帧语音频信号对应的预测的精细结构信息和预测的包络信息,步骤301还可以对当前帧语音频信号的第一低频带信号进行信号分类;再根据第一低频带信号对应的信号类型预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。例如,当前帧的窄频带语音频信号可以为谐波信号、或非谐波信号或瞬态信号等信息类型,则可以根据窄频带语音频信号对应的信息类型,得知该种类型的信号应具有的精细结构信息和包络信息,从而更加准确的预测当前帧的高频带信号对应的精细结构信息和包络信息。其中,本发明语音频信号切换方法对窄频带语音频信号的信号类型不做限制。In order to more accurately predict the predicted fine structure information and the predicted envelope information corresponding to the current frame speech and audio signal, step 301 can also perform signal classification on the first low frequency band signal of the current frame speech and audio signal; then according to the first low frequency band The signal type prediction corresponding to the signal corresponds to the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal. For example, the narrow-band speech and audio signal of the current frame can be information types such as harmonic signals, or non-harmonic signals or transient signals, and then it can be known that the signal of this type should be based on the information type corresponding to the narrow-band speech and audio signal. It has fine structure information and envelope information, so as to more accurately predict the fine structure information and envelope information corresponding to the high frequency band signal of the current frame. Wherein, the speech and audio signal switching method of the present invention does not limit the signal type of the narrowband speech and audio signal.
步骤302、将预测的包络信号和前M帧语音频信号的第二高频带信号对应的前M帧包络信息进行加权处理,以得到第一高频带信号对应的第一包络信息。Step 302: Perform weighting processing on the predicted envelope signal and the envelope information of the previous M frames corresponding to the second high-band signal of the previous M frames of speech and audio signals, so as to obtain the first envelope information corresponding to the first high-band signal .
具体的,在步骤301预测当前帧的第一高频带信号对应的预测的精细结构信息和预测的包络信息后,可以根据预测的包络信息和前M帧语音频信号的第二高频带信号对应的前M帧包络信息,生成第一高频带信号对应的第一包络信息。Specifically, after predicting the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal of the current frame in step 301, the second high-frequency The envelope information of the first M frames corresponding to the band signal is used to generate the first envelope information corresponding to the first high frequency band signal.
具体而言,步骤302中生成第一高频带信号对应的第一包络信息的过程可以通过如下两种方式实现,具体如下:Specifically, the process of generating the first envelope information corresponding to the first high-frequency band signal in step 302 can be realized in the following two ways, specifically as follows:
方式一,如图4所示,通过步骤302获得第一包络信息的一个实施例可以包括:Way 1, as shown in Figure 4, an embodiment of obtaining the first envelope information through step 302 may include:
步骤401、根据第一低频带信号和前N帧的语音频信号的低频带信号,计算第一低频带信号与前N帧的语音频信号的低频带信号之间的相关性系数;其中,N大于等于1。
具体的,对当前帧语音频信号的第一低频带信号与前N帧的语音频信号的低频带信号进行比较,以得出当前帧语音频信号的第一低频带信号与前N帧的语音频信号的低频带信号之间的相关性系数,例如,可以通过判断当前帧语音频信号的第一低频带信号中的某频段信息,与前N帧的语音频信号的低频带信号的相同频段信息的能量大小或信息类型的差异等,确定它们之间的相关性,以计算得出所要的相关性系数。其中,前N帧的语音频信号可以为窄频带语音频信号、宽频带语音频信号或者窄频带语音频信号和宽频带语音频信号组成的混合信号。Specifically, compare the first low frequency band signal of the speech and audio signal of the current frame with the low frequency band signal of the speech and audio signal of the previous N frames to obtain the first low frequency band signal of the speech and audio signal of the current frame and the speech and audio signals of the previous N frames. The correlation coefficient between the low-frequency band signals of the audio signal, for example, can be determined by judging a certain frequency band information in the first low-frequency band signal of the current frame speech and audio signal, and the same frequency band of the low-frequency band signal of the speech and audio signal of the previous N frames The energy size of the information or the difference in the type of information, etc., determine the correlation between them, and calculate the desired correlation coefficient. Wherein, the speech and audio signals of the first N frames may be narrowband speech and audio signals, wideband speech and audio signals, or a mixed signal composed of narrowband speech and audio signals and wideband speech and audio signals.
步骤402、判断相关性系数是否在给定的第一阈值范围内。
具体的,在步骤401计算出相关性系数后,判断该相关性系数是否在给定的阈值范围内。计算相关性系数的作用是为了得知当前帧语音频信号是从前N帧的语音频信号渐变过来的还是突变过来的,也就是说看它们的特性是否相同,进而判断预测当前帧语音频信号的高频带信号时前面帧的高频带信号所占的权重。例如,如果当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号能量相当,且类型相同,则说明前一帧的语音频信号与当前帧语音频信号有较高的相关性,因此,为了准确的恢复当前帧语音频信号对应的第一包络信息,恢复当前帧语音频信号对应的第一包络信息时,前一帧的语音频信号对应的高频带包络信息或过渡的包络信息占较大的权重;否则,如果当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号能量相差很大,且类型不同,则说明前一帧的语音频信号与当前帧语音频信号有较低的相关性,因此,为了准确的恢复当前帧语音频信号对应的第一包络信息,恢复当前帧语音频信号对应的第一包络信息时,前一帧的语音频信号对应的高频带包络信息或过渡的包络信息占较小的权重;Specifically, after the correlation coefficient is calculated in
步骤403、如果相关性系数不在给定的第一阈值范围内,则根据已设定好的第一权重一和第一权重二进行加权处理,以计算出第一包络信息。其中,第一权重一为前一帧语音频信号的高频带信号对应的前一帧包络信息的权重值,第一权重二为预测的包络信息的权重值。
具体的,当步骤402得出相关性系数不在给定的第一阈值范围内时,则可以得知当前帧语音频信号与前N帧语音频信号有较小的相关性,所以,前M帧的第一频带语音频信号对应的前M帧包络信息或过渡的包络信息或前一帧的语音频信号对应的高频带包络信息对第一包络信息影响较小,在恢复当前帧语音频信号对应的第一包络信息时,前M帧的第一频带语音频信号对应的前M帧包络信息或过渡的包络信息或前一帧的语音频信号对应的高频带包络信息占的权重较小。因此,根据已设定好的第一权重一和第一权重二,便可以计算出当前帧的第一包络信息。其中,第一权重一为前一帧语音频信号的高频带信号对应的包络信息的权重值,该前一帧语音频信号可以是宽频带语音频信号或者是已处理过的窄频带语音频信号,当第一次切换时,前一帧语音频信号即为宽频带语音频信号;而第一权重二为预测的包络信息的权重值。将预测的包络信息与第一权重二的乘积,加上前一帧的包络信息与第一权重一的乘积之和,所求得的加权之和为当前帧的第一包络信息。另外,以后传输的语音频信号都按此方式和权重,恢复该语音频信号对应的第一包络信息,直到语音频信号再次发生切换。Specifically, when
步骤404、如果相关性系数在第一阈值范围内,根据已设定好的第二权重一和第二权重二进行加权处理,以计算出过渡包络信息。其中,第二权重一为切换前的包络信息的权重值,第二权重二为前M帧包络信息的权重值;其中,L大于等于1。
具体的,当步骤402得出相关性系数在给定的阈值范围内时,则可以得知当前帧语音频信号与前连续N帧的语音频信号特性相似,当前帧语音频信号对应的第一包络信息受前连续N帧的语音频信号的包络信息影响较大,同时考虑到前M帧包络的真实性,因此,需要根据前M帧包络信息和切换前的包络信息求解当前帧语音频信号对应的过渡包络信息,在恢复当前帧语音频信号的第一包络信息时,让前M帧包络信息和切换前L帧的包络信息占较大的权重;再通过过渡包络信息求解第一包络信息。其中,第二权重一为切换前的包络信息的权重值,而第二权重二为前M帧包络信息的权重值。则切换前的包络信息与第二权重一的乘积,加上前M帧包络信息与第二权重二的乘积之和,所求得的加权值即为过渡包络信息。Specifically, when
步骤405、以第一权重步长为单位减小第二权重一,以第一权重步长为单位增加第二权重二。Step 405: Decrease the second weight one by the unit of the first weight step, and increase the second weight two by the unit of the first weight step.
具体的,随着语音频信号的传输,后续的窄频带语音频信号受切换前的宽频带语音频信号的影响逐渐变小,为了使计算得出的第一包络信息更加准确,需要对第二权重一和第二权重二进行适用性的调整。由于后续的音频信号受切换前L帧的宽频带语音频信号的影响逐渐变小,因此,第二权重一的数值逐渐变小,而第二权重二的数值逐渐增大,从而减弱切换前的包络信息对第一包络信息的影响。其中,步骤405对第二权重一和第二权重二进行修改可以通过如下方法:新的第二权重一等于旧的第二权重一减去第一权重步长,新的第二权重二等于旧的第二权重二加上第一权重步长;其中,第一权重步长为已设定好的值。Specifically, with the transmission of voice and audio signals, the subsequent narrowband voice and audio signals are gradually less affected by the wideband voice and audio signals before switching. In order to make the calculated first envelope information more accurate, it is necessary to The second weight one and the second weight two are adjusted for applicability. Since the follow-up audio signal is gradually reduced by the influence of the broadband speech and audio signal of L frames before the switch, the value of the second weight 1 gradually decreases, while the value of the second weight 2 gradually increases, thereby weakening the value before the switch. The influence of the envelope information on the first envelope information. Among them, step 405 can modify the second weight 1 and the second weight 2 through the following method: the new second weight 1 is equal to the old second weight 1 minus the first weight step size, and the new second weight 2 is equal to the old The second weight of 2 plus the first weight step; wherein, the first weight step is a set value.
步骤406、判断已设定好的第三权重一是否大于第一权重一。
具体的,第三权重一为过渡包络信息的权重值,通过比较第三权重一与第二权重一的大小,可以得知当前帧的第一包络信息受过渡包络信息的影响大小。其中,过渡包络信息中由前M帧包络信息和切换前的包络信息计算而来,因此,第三权重一实际上代表了第一包络信息受切换前的包络信息的影响程度。Specifically, the third weight one is a weight value of the transition envelope information, and by comparing the third weight one with the second weight one, it can be known how much the first envelope information of the current frame is affected by the transition envelope information. Among them, the transition envelope information is calculated from the envelope information of the previous M frames and the envelope information before switching, therefore, the third weight one actually represents the influence degree of the first envelope information by the envelope information before switching .
步骤407、如果第三权重一不大于第一权重一,则根据已设定好的第一权重一和第一权重二进行加权处理,以计算出所述第一包络信息。
具体的,在步骤406判断得知第三权重一小于等于第一权重一时,说明当前帧语音频信号离切换前L帧的语音频信号较远,第一包络信息主要受前M帧包络信息的影响,因此,根据已设定好的第一权重一和第一权重二,便可以计算出当前帧的第一包络信息。Specifically, when it is judged in
步骤408、如果第三权重一大于第一权重一,根据已设定好的第三权重一和第三权重二进行加权处理,以计算出第一包络信息。其中,第三权重一为过渡包络信息的权重值,第三权重二为预测的包络信息的权重值。
具体的,在步骤406判断得知第三权重一大于第一权重一时,说明当前帧语音频信号离切换前L帧语音频信号较近,第一包络信息受切换前的包络信息影响较大,因此,需要根据过渡包络信息求解当前帧的第一包络信息。其中,第三权重一为过渡包络信息的权重值,而第三权重二为预测的包络信息的权重值。则过渡包络信息与第三权重一的乘积,加上预测的包络信息与第三权重二的乘积之和所求得的加权值即为第一包络信息。Specifically, when it is judged in
步骤409、以第二权重步长为单位减小第三权重一,以第二权重步长为单位增加第三权重二,直至第三权重一等于零。Step 409: Decrease the third weight 1 by the unit of the second weight step, increase the third weight 2 by the unit of the second weight step, until the third weight 1 is equal to zero.
具体的,步骤409中修改第三权重一和第三权重二的目的,与步骤405中修改第二权重一和第二权重二的目的相同,都是为了在后续传输的语音频信号受切换前L帧语音频信号的影响逐渐变小的情况下,为了使计算得出的第一包络信息更加准确,对第三权重一和第三权重二进行适用性的调整。由于后续的音频信号受切换前L帧语音频信号的影响逐渐变小,因此,第三权重一的数值逐渐变小,而第三权重二的数值逐渐增大,从而也实现了减弱切换前的包络信息对第一包络信息的影响。其中,步骤409对第三权重一和第三权重二进行修改可以通过如下方法:新的第三权重一等于旧的第三权重一减去第二权重步长,新的第三权重二等于旧的第三权重二加上第二权重步长;其中,第二权重步长为已设定好的值。Specifically, the purpose of modifying the third weight 1 and the third weight 2 in
第一权重一和第一权重二的和为一,第二权重一和第二权重二的和为一,第三权重一和第三权重二的和为一;第三权重一的初始值大于第一权重一的初始值;第一权重一和第一权重二是固定的常数。具体的,本实施例中的权重一和权重二,实际上代表了切换前的包络信息和前M帧包络信息组成当前帧的第一包络信息时所占的百分比。对于当前帧语音频信号,离切换前L帧的语音频信号越近且相关性较大时,切换前的包络信息所占的百分比就越高,而相反的前M帧包络信息所占的百分比就越低。在当前帧语音频信号离切换前L帧的语音频信号较远时,说明网络中已经稳定的传输语音频信号,或者当前帧语音频信号与切换前L帧语音频信号相关性较低时,说明当前帧语音频信号特性已发生改变,因此,当前帧语音频信号受切换前L帧的语音频信号影响较小,切换前的包络信息所占的百分比就越低。The sum of the first weight one and the first weight two is one, the sum of the second weight one and the second weight two is one, the sum of the third weight one and the third weight two is one; the initial value of the third weight one is greater than The initial value of the first weight one; the first weight one and the first weight two are fixed constants. Specifically, weight 1 and weight 2 in this embodiment actually represent the percentages when the envelope information before switching and the envelope information of the previous M frames form the first envelope information of the current frame. For the speech and audio signal of the current frame, the closer the speech and audio signal of L frames before switching and the greater the correlation, the higher the percentage of the envelope information before switching is, while the envelope information of M frames before the opposite. the lower the percentage. When the voice and audio signal of the current frame is far from the voice and audio signal of L frames before switching, it means that the voice and audio signal has been transmitted stably in the network, or when the correlation between the voice and audio signal of the current frame and the voice and audio signal of L frames before switching is low, It shows that the characteristics of the speech and audio signal of the current frame have changed. Therefore, the speech and audio signal of the current frame is less affected by the speech and audio signal of L frames before switching, and the percentage of the envelope information before switching is lower.
另外,本实施例中的步骤404和步骤405的执行顺序可以互换,即可以先修改第二权重一和第二权重二,再根据第二权重一和第二权重二,计算出过渡包络信息。同样的,本实施例中的步骤408和步骤409的执行顺序可以互换,即可以先修改修改第三权重一和第三权重二,再根据修改第三权重一和第三权重二,计算出第一包络信息。In addition, the execution order of
方式二,如图5所示,通过步骤302获得第一包络信息的另一个实施例还可以包括:Way 2, as shown in Figure 5, another embodiment of obtaining the first envelope information through step 302 may also include:
步骤501、根据当前帧语音频信号的第一低频带信号和前一帧的语音频信号的低频信号,计算第一低频带信号与前一帧的语音频信号的低频信号之间的相关性系数。
具体的,为了更加准确的得出第一包络信息,求解当前帧语音频信号的第一低频带信号某频段和前一帧的语音频信号的低频信号相同频段的能量的关系。本实施例可以以“corr”代表相关性系数,通过当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号相同频段的能量关系,得出当前帧语音频信号的第一低频带信号与前一帧的语音频信号的低频带信号之间的相关性系数corr,能量相差越小,corr越大,否则,corr越小。具体过程可以参见步骤401中关于前N帧的语音频信号相关性计算的介绍。Specifically, in order to obtain the first envelope information more accurately, the relationship between the energy of a certain frequency band of the first low frequency band signal of the speech and audio signal of the current frame and the same frequency band of the low frequency signal of the speech and audio signal of the previous frame is solved. In this embodiment, "corr" can be used to represent the correlation coefficient, and the current frame speech and audio signal can be obtained through the energy relationship of the first low frequency band signal of the current frame speech and audio signal and the low frequency band signal of the speech and audio signal of the previous frame in the same frequency band The correlation coefficient corr between the first low-frequency band signal of and the low-frequency band signal of the speech audio signal of the previous frame, the smaller the energy difference is, the larger the corr is, otherwise, the smaller the corr is. For the specific process, please refer to the introduction about the calculation of the correlation of speech and audio signals of the previous N frames in
步骤502、判断相关性系数是否在给定的第二阈值范围内。
具体的,在步骤501计算出corr的值后,判断计算出的corr是否在给定的第二阈值范围内。例如,本实施例可以将第二阈值范围用c1~c2代表。Specifically, after the value of corr is calculated in
步骤503、如果相关性系数不在第二阈值范围内,则根据已设定好的第一权重一和第一权重二进行加权处理,以计算出第一包络信息。其中,第一权重一为前一帧的语音频信号的高频带信号对应的前一帧包络信息的权重值,第一权重二为预测的包络信息的权重值;第一权重一和第一权重二是固定的常数。
具体的,当步骤502得出corr小于c1或大于c2时,得知当前帧语音频信号对应的第一包络信息受切换前一帧语音频信号的包络信息影响较小,因此,根据已设定好的第一权重一和第一权重二,便可以计算出当前帧的第一包络信息。预测的包络信息与第一权重二的乘积,加上前一帧包络信息与第一权重一的乘积之和,所求得的加权和即为当前帧的第一包络信息。另外,以后传输的窄带语音频信号都按此方式和权重恢复该窄带语音频信号对应的第一包络信息,直到不同带宽的语音频信号再次发生切换。例如:本实施例中的第一权重一可以用a1代表、第一权重二可以用b1代表、前一帧包络信息可以用pre_fenv代表、预测的包络信息可以用fenv代表、第一包络信息可以用cur_fenv代表。则步骤503可以利用以下公式表示:cur_fenv=pre_fenv*a1+fenv*b1。Specifically, when it is obtained in
步骤504、如果相关性系数在第二阈值范围内,判断已设定好的第二权重一是否大于第一权重一。其中,第二权重一为切换前一帧语音频信号的高频带信号对应的切换前的包络信息的权重值,
具体的,如果c1<corr<c2,则通过比较第二权重一与第一权重一的大小,可以得知当前帧的第一包络信息受切换前的包络信息和前一帧包络信息的影响程度。Specifically, if c1<corr<c2, by comparing the size of the second weight one with the first weight one, it can be known that the first envelope information of the current frame is affected by the envelope information before switching and the envelope information of the previous frame degree of influence.
步骤505、如果第二权重一不大于第一权重一,则根据已设定好的第一权重一和第一权重二,计算出第一包络信息。
具体的,在步骤504判断得知第二权重一小于第一权重一时,说明当前帧语音频信号离切换前一帧的语音频信号较远,第一包络信息受切换前的包络信息的影响较小,因此,根据已设定好的第一权重一和第一权重二,便可以计算出当前帧的第一包络信息。则步骤505可以利用以下公式表示:cur_fenv=pre_fenv*a1+fenv*b1。Specifically, when it is judged in
步骤506、如果第二权重一大于第一权重一,根据第二权重一和已设定好的第二权重二进行加权处理,以计算出第一包络信息。其中,第二权重二为预测的包络信息的权重值。例如:第二权重一可以用a2代表,第二权重二可以用b2代表。
具体的,在步骤504判断得知第二权重一大于第一权重一时,说明当前帧语音频信号离切换前一帧的第一频带语音频信号较近,第一包络信息受切换前一帧语音频信号对应的切换前的包络信息影响较大。因此,根据已设定好的第二权重一和第二权重二,便可以计算出当前帧的第一包络信息。则预测的包络信息与第二权重二的乘积,加上切换前的包络信息与第二权重一的乘积之和,所求得的加权和即为当前帧的第一包络信息。其中,切换前的包络信息可以用con_fenv代表,则步骤506可以利用以下公式表示:cur_fenv=con_fenv*a2+fenv*b2。Specifically, when it is judged in
步骤507、以第二权重步长为单位减小第二权重一,以第二权重步长为单位增加第二权重二。Step 507: Decrease the second weight one by the unit of the second weight step, and increase the second weight two by the unit of the second weight step.
具体的,随着语音频信号的传输,后续的当前帧语音频信号受切换前一语音频信号的影响逐渐变小,为了使计算得出的第一包络信息更加准确,需要对第二权重一和第二权重二进行适用性的调整。由于后续的音频信号受切换前一帧语音频信号的影响逐渐变小;而靠近当前帧语音频信号的前一帧语音频信号的影响变大。因此,第二权重一的数值逐渐变小,而第二权重二的数值逐渐增大,从而减弱切换前的包络信息对第一包络信息的影响,而增强预测的包络信息对第一包络信息的影响。其中,步骤507对第二权重一和第二权重二进行修改可以通过如下方法:新的第二权重一等于旧的第二权重一减去第一权重步长,新的第二权重二等于旧的第二权重二加上第一权重步长;其中,第一权重步长为已设定好的值。Specifically, with the transmission of speech and audio signals, the subsequent speech and audio signals of the current frame are gradually less affected by the speech and audio signals before switching. In order to make the calculated first envelope information more accurate, the second weight One and the second weight two are adjusted for applicability. Since the subsequent audio signal is gradually less affected by the voice and audio signal of the previous frame after switching; and the influence of the previous frame of voice and audio signal close to the current frame of the voice and audio signal becomes greater. Therefore, the value of the second weight 1 gradually decreases, while the value of the second weight 2 gradually increases, thereby weakening the influence of the envelope information before switching on the first envelope information, and enhancing the predicted envelope information on the first envelope information. The impact of envelope information. Among them, step 507 can modify the second weight 1 and the second weight 2 through the following method: the new second weight 1 is equal to the old second weight 1 minus the first weight step size, and the new second weight 2 is equal to the old The second weight of 2 plus the first weight step; wherein, the first weight step is a set value.
其中,第一权重一和第一权重二的和为一,第二权重一和第二权重二的和为一;第二权重一的初始值大于第一权重一的初始值。Wherein, the sum of the first weight one and the first weight two is one, and the sum of the second weight one and the second weight two is one; the initial value of the second weight one is greater than the initial value of the first weight one.
步骤303、根据第一包络信息和预测的精细结构信息,生成处理后的第一高频带信号。Step 303: Generate a processed first high frequency band signal according to the first envelope information and the predicted fine structure information.
具体的,通过步骤302得出当前帧的第一包络信息后,可以根据第一包络信息和预测的精细结构信息,生成所需要的处理后的第一高频带信号,从而使第二高频带信号能平滑过渡到处理后的第一高频带信号。Specifically, after obtaining the first envelope information of the current frame through step 302, the required processed first high-frequency band signal can be generated according to the first envelope information and the predicted fine structure information, so that the second The high frequency band signal can smoothly transition to the processed first high frequency band signal.
本实施例语音频信号切换方法,在语音频信号发生从宽频带语音频信号向窄频带语音频信号切换的过程中,通过预测的精细结构信息和第一包络信息获得当前帧的处理后的第一高频带信号,从而能够使切换前的宽频带语音频信号的第二高频带信号能够平滑的过渡到窄频带语音频信号所对应的处理后的第一高频带信号,更有利于提高用户接听音频信号的质量。In the speech and audio signal switching method of this embodiment, in the process of switching the speech and audio signal from the broadband speech and audio signal to the narrowband speech and audio signal, the processed information of the current frame is obtained through the predicted fine structure information and the first envelope information. The first high-band signal, so that the second high-band signal of the wide-band speech and audio signal before switching can smoothly transition to the processed first high-band signal corresponding to the narrow-band speech and audio signal, and more It is beneficial to improve the quality of the user listening to the audio signal.
基于上述技术方案,可选的,如图6所示,本实施例中的步骤202包括:Based on the above technical solution, optionally, as shown in FIG. 6,
步骤601、根据当前帧语音频信号与切换前一帧的语音频信号,判断处理后的第一高频带信号是否需要衰减。Step 601 : According to the speech and audio signal of the current frame and the speech and audio signal of the frame before switching, determine whether the processed first high frequency band signal needs to be attenuated.
具体的,由于窄带语音频信号的第一高频带信号为空,在宽频带语音频信号切换到窄频带语音频信号的过程中,为了防止恢复出的窄频带语音频信号对应的处理后的第一高频带信号带来不好的影响,在由窄频带语音频信号扩展为宽频带信号的帧数达到给定的帧数后,将处理后的第一高频带信号的能量逐帧进行衰减,直到衰减系数达到给定的阈值。通过当前帧语音频信号与切换前一帧的语音频信号,可以得知当前帧语音频信号与切换前一帧的语音频信号之间的间隔时间,例如,可以通过计数器记录窄频带语音频信号传输的帧数,此帧数可以为已预定好的大于等于0的值。Specifically, since the first high-band signal of the narrowband speech and audio signal is empty, in the process of switching the wideband speech and audio signal to the narrowband speech and audio signal, in order to prevent the recovered narrowband speech and audio signal from processing The first high-frequency band signal has a bad influence. After the number of frames extended from the narrow-band speech and audio signal to the wide-band signal reaches a given number of frames, the energy of the processed first high-band signal is frame by frame. Decay until the decay coefficient reaches the given threshold. Through the voice and audio signal of the current frame and the voice and audio signal of the previous frame, the interval time between the voice and audio signal of the current frame and the voice and audio signal of the previous frame can be known. For example, the narrowband voice and audio signal can be recorded by the counter The number of frames to be transmitted, which can be a predetermined value greater than or equal to 0.
步骤602、如果不需要衰减,将处理后的第一高频带信号与第一低频带信号合成宽频带信号。Step 602. If no attenuation is required, synthesize the processed first high-frequency signal and the first low-frequency signal into a wide-band signal.
具体的,如果步骤601判断得出处理后的第一高频带信号无需进行衰减,则直接将处理后的第一高频带信号与第一低频带信号合成宽频带信号。Specifically, if it is determined in step 601 that the processed first high-frequency band signal does not need to be attenuated, the processed first high-frequency band signal and the first low-frequency band signal are directly synthesized into a wide-band signal.
步骤603、如果需要衰减,判断处理后的第一高频带信号对应的衰减因子是否大于阈值。Step 603. If attenuation is required, determine whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a threshold.
具体的,衰减因子的初始值为一;阈值小于一并大于等于零。如果步骤601判断得出处理后的第一高频带信号需要进行衰减,则通过步骤603判断处理后的第一高频带信号对应的衰减因子是否大于给定的阈值。Specifically, the initial value of the attenuation factor is one; the threshold value is less than one and greater than or equal to zero. If it is determined in step 601 that the processed first high frequency band signal needs to be attenuated, then in step 603 it is determined whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a given threshold.
步骤604、如果衰减因子不大于给定的阈值,则将处理后的第一高频带信号乘以阈值,然后与第一低频带信号合成宽频带信号。Step 604: If the attenuation factor is not greater than a given threshold, multiply the processed first high-frequency signal by the threshold, and then synthesize a broadband signal with the first low-frequency signal.
具体的,如果步骤603得出衰减因子的值不大于给定的阈值时,说明处理后的第一高频带信号的能量已经衰减到一定的程度,处理后的第一高频带信号已经不会带来不好的影响,以后就可以保持此衰减比例。则将处理后的第一高频带信号乘以阈值,然后与第一低频带信号合成宽频带信号。Specifically, if it is obtained in step 603 that the value of the attenuation factor is not greater than the given threshold, it means that the energy of the processed first high-frequency band signal has been attenuated to a certain extent, and the processed first high-frequency band signal has no longer It will bring bad effects, and this attenuation ratio can be maintained in the future. Then the processed first high frequency band signal is multiplied by a threshold, and then synthesized with the first low frequency band signal into a broadband signal.
步骤605、如果衰减因子大于给定的阈值,则将处理后的第一高频带信号乘以衰减因子后,再与第一低频带信号合成宽频带信号。Step 605: If the attenuation factor is greater than a given threshold, multiply the processed first high-frequency signal by the attenuation factor, and then synthesize a broadband signal with the first low-frequency signal.
具体的,如果步骤603得出衰减因子的值大于给定的阈值时,则说明在此衰减因子时,处理后的第一高频带信号还有可能引起不好的听觉影响,还需要继续衰减,直到给定的阈值。则将处理后的第一高频带信号乘以衰减因子后,再与第一低频带信号合成宽频带信号。Specifically, if step 603 shows that the value of the attenuation factor is greater than the given threshold, it means that at this attenuation factor, the processed first high-frequency band signal may cause bad auditory effects, and further attenuation is required. , up to a given threshold. After the processed first high frequency band signal is multiplied by the attenuation factor, the broadband signal is synthesized with the first low frequency band signal.
步骤606、修改衰减因子,以使衰减因子减小。Step 606. Modify the attenuation factor to reduce the attenuation factor.
具体的,随着语音频信号的传输,后续的窄带语音频信号受切换前的语音频信号的影响逐渐变小,相对应的,衰减因子也应该逐渐变小。Specifically, as the voice and audio signals are transmitted, the subsequent narrowband voice and audio signals are gradually less affected by the voice and audio signals before switching, and correspondingly, the attenuation factor should also be gradually reduced.
基于上述技术方案,可选的,当窄频带语音频信号向宽频带语音频信号切换时,如图7所示,本实施例中通过步骤201获得处理后的第一高频带信号的一个实施例包括:Based on the above technical solution, optionally, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 7, an implementation of obtaining the processed first high frequency band signal through
步骤701、根据已设定好的第四权重一和第四权重二进行加权处理,以计算出处理后的第一高频带信号。其中,第四权重一为第二高频带信号的权重值,第四权重二为当前帧语音频信号的第一高频带信号的权重值。
具体的,在由窄频带语音频信号切换到宽频带语音频信号的过程中,由于宽频带语音频信号中的高频带信号不为空,而窄频带语音频信号对应的高频带信号为空或者处理后的高频带信号,为了使窄频带语音频信号能够平滑的切换到宽频带语音频信号,需要对宽频带语音频信号中的高频带信号进行能量衰减,以实现语音频信号的平滑的进行切换。通过第二高频带信号与第四权重一的乘积,加上第一高频带信号与第四权重二的乘积之和,所求得的加权值即为处理后的第一高频带信号。Specifically, in the process of switching from the narrowband speech and audio signal to the wideband speech and audio signal, since the high frequency signal in the wideband speech and audio signal is not empty, the high frequency signal corresponding to the narrowband speech and audio signal is Empty or processed high-frequency signal, in order to make the narrow-band speech and audio signal can be smoothly switched to the wide-band speech and audio signal, it is necessary to attenuate the energy of the high-band signal in the wide-band speech and audio signal to realize speech and audio signal to switch smoothly. Through the product of the second high frequency band signal and the fourth weight one, plus the sum of the product of the first high frequency band signal and the fourth weight two, the obtained weighted value is the processed first high frequency band signal .
步骤702、以第三权重步长为单位减小第四权重一,以第三权重步长为单位增加第四权重二,直至第四权重一等于零。其中,第四权重一和第四权重二的和为一。Step 702: Decrease the fourth weight 1 by the unit of the third weight step, increase the fourth weight 2 by the unit of the third weight step, until the fourth weight 1 is equal to zero. Wherein, the sum of the fourth weight one and the fourth weight two is one.
具体的,随着语音频信号的传输,后续的宽频带语音频信号受切换前的窄频带语音频信号的影响逐渐变小。因此,第四权重一逐渐变小,而第四权重二逐渐增大,直至第四权重一变为零,而第四权重二变为一,即传输的语音频信号一直为宽频带语音频信号。Specifically, with the transmission of the voice and audio signals, the subsequent wideband voice and audio signals are gradually less affected by the narrowband voice and audio signals before switching. Therefore, the fourth weight one gradually becomes smaller, and the fourth weight two gradually increases until the fourth weight one becomes zero, and the fourth weight two becomes one, that is, the transmitted voice and audio signal is always a broadband voice and audio signal .
同样的,如图8所示,本实施例中通过步骤201获得处理后的第一高频带信号的另一个实施例还可以包括:Similarly, as shown in FIG. 8, another embodiment of obtaining the processed first high-frequency band signal through
步骤801、根据已设定好的第五权重一和第五权重二进行加权处理,以计算出处理后的第一高频带信号。其中,第五权重一为已设定好的固定参数的权重值,第五权重二为当前帧语音频信号的第一高频带信号的权重值。Step 801: Perform weighting processing according to the set fifth weight 1 and fifth weight 2 to calculate the processed first high frequency band signal. Wherein, the fifth weight 1 is the weight value of the fixed parameter that has been set, and the fifth weight 2 is the weight value of the first high frequency band signal of the speech and audio signal of the current frame.
具体的,由于窄频带语音频信号的第一高频带信号为空,因此可以设定一固定参数来代替窄频带语音频信号的高频带信号,其中,该固定参数为一个大于等于零小于第一高频带信号能量的常数。通过固定参数与第五权重一的乘积,加上第一高频带信号与第五权重二的乘积之和,所求得的加权值即为处理后的第一高频带信号。Specifically, since the first high-band signal of the narrow-band speech and audio signal is empty, a fixed parameter can be set to replace the high-band signal of the narrow-band speech and audio signal, wherein the fixed parameter is one greater than or equal to zero and less than the first A constant for high-band signal energy. The weighted value obtained by adding the product of the fixed parameter and the fifth weight one to the sum of the product of the first high frequency band signal and the fifth weight two is the processed first high frequency band signal.
步骤802、以第四权重步长为单位减小第五权重一,以第四权重步长为单位增加第五权重二,直至第五权重一等于零;;其中,第五权重一和第五权重二的和为一。Step 802: Decrease the fifth weight 1 by the unit of the fourth weight step, increase the fifth weight 2 by the unit of the fourth weight step, until the fifth weight 1 is equal to zero; wherein, the fifth weight 1 and the fifth weight The sum of two is one.
具体的,随着语音频信号的传输,后续的宽频带语音频信号受切换前的窄频带语音频信号的影响逐渐变小。因此,第五权重一逐渐变小,而第五权重二逐渐增大,直至第五权重一变为零,而第五权重二变为一,即传输的语音频信号一直为真实的宽频带语音频信号。Specifically, with the transmission of the voice and audio signals, the subsequent wideband voice and audio signals are gradually less affected by the narrowband voice and audio signals before switching. Therefore, the fifth weight 1 gradually decreases, and the fifth weight 2 gradually increases until the fifth weight 1 becomes zero, and the fifth weight 2 becomes 1, that is, the transmitted voice and audio signal is always a real broadband voice signal. audio signal.
本实施例语音频信号切换方法,在语音频信号发生从窄频带语音频信号向宽频带语音频信号切换的过程中,通过对宽频带语音频信号的高频带信号进行衰减处理得到处理后的高频带信号,从而能够使切换前的窄频带语音频信号对应的高频带信号能够平滑的过渡到宽频带语音频信号所对应的处理后的高频带信号,更有利于提高用户接听音频信号的质量。In the speech and audio signal switching method of this embodiment, in the process of switching the speech and audio signal from the narrowband speech and audio signal to the wideband speech and audio signal, the processed signal is obtained by attenuating the high frequency band signal of the wideband speech and audio signal. High-band signal, so that the high-band signal corresponding to the narrow-band voice and audio signal before switching can smoothly transition to the processed high-band signal corresponding to the wide-band voice and audio signal, which is more conducive to improving the user's ability to listen to audio The quality of the signal.
其中,本实施例中的包络信息还可以通过能够代表高频带信号的其他参数代替,例如:线性预测编码(Linear Predictive Coding,以下简称:LPC)参数,或幅度参数等。Wherein, the envelope information in this embodiment may also be replaced by other parameters that can represent high-frequency band signals, such as: Linear Predictive Coding (Linear Predictive Coding, hereinafter referred to as: LPC) parameters, or amplitude parameters, etc.
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps for realizing the above-mentioned method embodiments can be completed by hardware related to program instructions, and the aforementioned program can be stored in a computer-readable storage medium. When the program is executed, the It includes the steps of the above method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other various media that can store program codes.
图9为本发明语音频信号切换装置实施例一的结构示意图。如图9所示,本实施例音频信号切换装置,包括:处理模块91和第一合成模块92。FIG. 9 is a schematic structural diagram of Embodiment 1 of the speech and audio signal switching device of the present invention. As shown in FIG. 9 , the audio signal switching device of this embodiment includes: a
处理模块91用于当语音频信号出现切换时,将当前帧语音频信号的第一高频带信号和前M帧语音频信号的第二高频带信号进行加权处理,以得到处理后的第一高频带信号。其中,M大于等于1。The
第一合成模块92用于将处理后的第一高频带信号与当前帧语音频信号的第一低频带信号合成宽频带信号。The
本发明实施例的语音频信号切换装置,通过处理模块根据前M帧语音频信号中的第二高频带信号对当前帧语音频信号中的第一高频带信号进行处理,以使的第二高频带信号能平滑的过渡到处理后的第一高频带信号,从而在切换不同带宽的语音频信号过程中,使不同带宽的语音频信号的高频带信号能够平滑的过渡切换;最后,处理后的第一高频带信号与第一低频带信号通过第一合成模块合成宽频带信号,将该宽频带信号传输到用户终端,使用户享受到高质量的语音频信号。本实施例语音频信号切换方法能够平滑的进行不同带宽的语音频信号切换,减小了能量激变造成语音频信号的主观听觉质量差的影响,提高了用户接听语音频信号的质量。In the speech and audio signal switching device of the embodiment of the present invention, the first high frequency band signal in the speech and audio signal of the current frame is processed by the processing module according to the second high frequency band signal in the speech and audio signal of the previous M frames, so that the first The second high frequency band signal can smoothly transition to the processed first high frequency band signal, so that in the process of switching voice and audio signals of different bandwidths, the high frequency band signals of voice and audio signals of different bandwidths can be switched smoothly; Finally, the processed first high-frequency signal and the first low-frequency signal are synthesized into a wide-band signal through the first synthesis module, and the wide-band signal is transmitted to the user terminal, so that the user can enjoy high-quality voice and audio signals. The speech and audio signal switching method of this embodiment can smoothly switch speech and audio signals of different bandwidths, reduces the impact of poor subjective auditory quality of speech and audio signals caused by energy shocks, and improves the quality of speech and audio signals received by users.
图10为本发明语音频信号切换装置实施例二的结构示意图。如图10所示,本实施例音频信号切换装置基于上述音频信号切换装置实施例一,其区别在于:本实施例音频信号切换装置还包括:第二合成模块103。FIG. 10 is a schematic structural diagram of Embodiment 2 of the speech and audio signal switching device of the present invention. As shown in FIG. 10 , the audio signal switching device of this embodiment is based on the first embodiment of the audio signal switching device above, and the difference is that the audio signal switching device of this embodiment further includes: a
第二合成模块103用于当语音频信号没有出现切换时,将第一高频带信号与第一低频带信号合成宽频带信号。The
本实施例音频信号切换装置,通过设置第二合成模块,可以在不发生不同带宽的语音频信号切换的前提下,由第二合成模块将当前帧的第一频带语音频信号中的第一低频带信号和第一高频带信号合成宽频带信号,从而有利于提高用户接听语音频信号的质量。In the audio signal switching device of this embodiment, by setting the second synthesis module, the second synthesis module can combine the first low-frequency audio signal in the first frequency band audio and video signal of the current frame without switching audio and audio signals of different bandwidths. The broadband signal is synthesized from the band signal and the first high frequency band signal, thereby improving the quality of the user listening to the voice and audio signals.
基于上述技术方案,可选的,当宽频带语音频信号向窄频带语音频信号切换时,如图10和图11所示,本实施例中的处理模块101包括:Based on the above technical solution, optionally, when the wideband voice and audio signal is switched to the narrowband voice and audio signal, as shown in Figure 10 and Figure 11, the
预测模块1011用于预测第一高频带信号对应的预测的精细结构信息和预测的包络信息。The prediction module 1011 is configured to predict the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal.
第一生成模块1012用于根据预测的包络信息和前M帧语音频信号的第二高频带信号对应的前M帧包络信息进行加权处理,以得到第一高频带信号对应的第一包络信息。The first generating module 1012 is used to perform weighting processing according to the predicted envelope information and the envelope information of the previous M frames corresponding to the second high-band signal of the previous M frames of speech and audio signals, so as to obtain the first high-band signal corresponding to the first An envelope of information.
第二生成模块1013用于根据第一包络信息和预测的精细结构信息,生成处理后的第一高频带信号。The second generating module 1013 is configured to generate the processed first high frequency band signal according to the first envelope information and the predicted fine structure information.
更进一步的,本实施例音频信号切换装置可以还包括:分类模块1010,用于对当前帧语音频信号的第一低频带信号进行信号分类;而预测模块1011还用于根据第一低频带信号对应的信号类型预测当前帧语音频信号的第一低频带信号对应的预测的精细结构信息和预测的包络信息。Furthermore, the audio signal switching device in this embodiment may further include: a classification module 1010, configured to perform signal classification on the first low-frequency band signal of the current frame of speech and audio signals; and the prediction module 1011 is also configured to The corresponding signal type predicts the predicted fine structure information and the predicted envelope information corresponding to the first low frequency band signal of the speech and audio signal of the current frame.
本实施例音频信号切换装置,通过预测模块预测出第一高频带信号对应的预测的精细结构信息和预测的包络信息,从而可以通过第一生成模块和第二生成模块准确的生成处理后的第一高频带信号,从而使第一高频带信号能够更加平滑的过渡到处理后的第一高频带信号,更有利于提高用户接听语音频信号的质量。另外,通过分类模块对当前帧语音频信号的第一低频带信号进行信号分类,然后,预测模块根据信号类型得到预测的精细结构信息和预测的包络信息,从而使预测的精细结构信息和预测的包络信息更加准确,使用户接听到的语音频信号的质量更高。The audio signal switching device in this embodiment uses the prediction module to predict the predicted fine structure information and predicted envelope information corresponding to the first high-frequency band signal, so that the first generation module and the second generation module can accurately generate the processed The first high-frequency band signal, so that the first high-frequency band signal can transition to the processed first high-frequency band signal more smoothly, which is more conducive to improving the quality of the user's listening to the voice and audio signals. In addition, signal classification is performed on the first low-frequency band signal of the current frame speech and audio signal through the classification module, and then the prediction module obtains the predicted fine structure information and the predicted envelope information according to the signal type, so that the predicted fine structure information and the predicted The envelope information is more accurate, so that the quality of the voice and audio signals received by the user is higher.
基于上述技术方案,可选的,如图10和图12所示,本实施例中的第一合成模块102包括:Based on the above technical solution, optionally, as shown in Figure 10 and Figure 12, the
第一判断模块1021用于根据当前帧语音频信号与切换前一帧的语音频信号,判断处理后的第一高频带信号是否需要衰减。The
第三合成模块1022用于若第一判断模块1021得出处理后的第一高频带信号不需要衰减,将处理后的第一高频带信号与第一低频带信号合成宽频带信号。The
第二判断模块1023用于若第一判断模块1021得出处理后的第一高频带信号需要衰减,判断处理后的第一高频带信号对应的衰减因子是否大于给定的阈值。The
第四合成模块1024用于若第二判断模块1023得出衰减因子不大于给定的阈值,则将处理后的第一高频带信号乘以阈值,然后与第一低频带信号合成宽频带信号。The
第五合成模块1025用于若第二判断模块1023得出衰减因子大于给定的阈值,则将处理后的第一高频带信号乘以衰减因子后,再与第一低频带信号合成宽频带信号。The
第一修改模块1026用于修改衰减因子,以使衰减因子减小。The first modification module 1026 is used for modifying the attenuation factor, so as to reduce the attenuation factor.
其中,衰减因子的初始值为一;阈值小于一并大于等于零。Wherein, the initial value of the attenuation factor is one; the threshold value is less than one and greater than or equal to zero.
本实施例音频信号切换装置,通过对处理后的第一高频带信号进行衰减处理,可以使当前帧语音频信号经过处理后得到的宽频带信号更加准确,更有利于提高用户接听音频信号的质量。The audio signal switching device of this embodiment, by attenuating the processed first high-frequency band signal, can make the broadband signal obtained after the current frame speech and audio signal is processed more accurate, which is more conducive to improving the user's ability to listen to the audio signal. quality.
基于上述技术方案,可选的,当窄频带语音频信号向宽频带语音频信号切换时,如图10和图13a所示,本实施例中的处理模块101包括:Based on the above technical solution, optionally, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 10 and Figure 13a, the
第一计算模块1011a用于根据已设定好的第四权重一和第四权重二进行加权处理,以计算出处理后的第一高频带信号;其中,第四权重一为第二高频带信号的权重值,第四权重二为第一高频带信号的权重值;The
第二修改模块1012a用于以第三权重步长为单位减小第四权重一,以第三权重步长为单位增加第四权重二,直至第四权重一等于零;其中,第四权重一和第四权重二的和为一。The second modifying
同样的,当窄频带语音频信号向宽频带语音频信号切换时,如图10和图13b所示,本实施例中的处理模块101可以还包括:Similarly, when the narrowband speech and audio signal is switched to the wideband speech and audio signal, as shown in Figure 10 and Figure 13b, the
第二计算模块1011b用于根据已设定好的第五权重一和第五权重二进行加权处理,以计算出处理后的第一高频带信号;其中,第五权重一为已设定好的固定参数的权重值,第五权重二为第一高频带信号的权重值;The
第三修改模块1012b用于以第四权重步长为单位减小第五权重一,以第四权重步长为单位增加第五权重二,直至第五权重一等于零;;其中,第五权重一和第五权重二的和为一;其中,固定参数为一个大于等于零并小于第一高频带信号能量值的常数。The
本实施例语音频信号切换装置,在语音频信号发生从窄频带语音频信号向宽频带语音频信号切换的过程中,通过对宽频带语音频信号的高频带信号进行衰减处理得到处理后的高频带信号,从而能够使切换前的窄频带语音频信号对应的高频带信号能够平滑的过渡到宽频带语音频信号所对应的处理后的高频带信号,更有利于提高用户接听音频信号的质量。The speech and audio signal switching device of this embodiment, in the process of switching the speech and audio signal from the narrowband speech and audio signal to the wideband speech and audio signal, obtains the processed signal by attenuating the high frequency band signal of the wideband speech and audio signal High-band signal, so that the high-band signal corresponding to the narrow-band voice and audio signal before switching can smoothly transition to the processed high-band signal corresponding to the wide-band voice and audio signal, which is more conducive to improving the user's ability to listen to audio The quality of the signal.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent replacements are made to some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the various embodiments of the present invention.
Claims (16)
Priority Applications (13)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2010101634063A CN101964189B (en) | 2010-04-28 | 2010-04-28 | Audio signal switching method and device |
| EP17151713.9A EP3249648B1 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
| ES17151713T ES2718947T3 (en) | 2010-04-28 | 2011-04-28 | Procedure and apparatus for switching voice or audio signals |
| BR112012013306A BR112012013306B8 (en) | 2010-04-28 | 2011-04-28 | method and apparatus for switching speech or audio signals |
| KR1020127012328A KR101377547B1 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
| ES11774406.0T ES2635212T3 (en) | 2010-04-28 | 2011-04-28 | Procedure and device for switching audio signals |
| JP2012541316A JP5667202B2 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
| EP11774406.0A EP2485029B1 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
| AU2011247719A AU2011247719B2 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
| PCT/CN2011/073479 WO2011134415A1 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
| US13/162,309 US8214218B2 (en) | 2010-04-28 | 2011-06-16 | Method and apparatus for switching speech or audio signals |
| JP2014250689A JP6027081B2 (en) | 2010-04-28 | 2014-12-11 | Audio signal switching method and device |
| JP2016201593A JP6410777B2 (en) | 2010-04-28 | 2016-10-13 | Audio signal switching method and device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2010101634063A CN101964189B (en) | 2010-04-28 | 2010-04-28 | Audio signal switching method and device |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN101964189A CN101964189A (en) | 2011-02-02 |
| CN101964189B true CN101964189B (en) | 2012-08-08 |
Family
ID=43517042
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2010101634063A Active CN101964189B (en) | 2010-04-28 | 2010-04-28 | Audio signal switching method and device |
Country Status (8)
| Country | Link |
|---|---|
| EP (2) | EP2485029B1 (en) |
| JP (3) | JP5667202B2 (en) |
| KR (1) | KR101377547B1 (en) |
| CN (1) | CN101964189B (en) |
| AU (1) | AU2011247719B2 (en) |
| BR (1) | BR112012013306B8 (en) |
| ES (2) | ES2718947T3 (en) |
| WO (1) | WO2011134415A1 (en) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2541078B2 (en) | 1992-06-26 | 1996-10-09 | 住友金属工業株式会社 | ERW pipe defect discrimination method |
| KR101110800B1 (en) * | 2003-05-28 | 2012-07-06 | 도꾸리쯔교세이호진 상교기쥬쯔 소고겡뀨죠 | Process for producing hydroxyl group-containing compound |
| US10848118B2 (en) | 2004-08-10 | 2020-11-24 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US11431312B2 (en) | 2004-08-10 | 2022-08-30 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US10158337B2 (en) | 2004-08-10 | 2018-12-18 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US8284955B2 (en) | 2006-02-07 | 2012-10-09 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US10848867B2 (en) | 2006-02-07 | 2020-11-24 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US10701505B2 (en) | 2006-02-07 | 2020-06-30 | Bongiovi Acoustics Llc. | System, method, and apparatus for generating and digitally processing a head related audio transfer function |
| CN101964189B (en) * | 2010-04-28 | 2012-08-08 | 华为技术有限公司 | Audio signal switching method and device |
| US8000968B1 (en) | 2011-04-26 | 2011-08-16 | Huawei Technologies Co., Ltd. | Method and apparatus for switching speech or audio signals |
| CN105469805B (en) * | 2012-03-01 | 2018-01-12 | 华为技术有限公司 | A kind of voice frequency signal treating method and apparatus |
| CN105761724B (en) * | 2012-03-01 | 2021-02-09 | 华为技术有限公司 | Voice frequency signal processing method and device |
| CN103516440B (en) | 2012-06-29 | 2015-07-08 | 华为技术有限公司 | Audio signal processing method and encoding device |
| CN103971693B (en) * | 2013-01-29 | 2017-02-22 | 华为技术有限公司 | High-band signal prediction method, encoding/decoding device |
| US9883318B2 (en) | 2013-06-12 | 2018-01-30 | Bongiovi Acoustics Llc | System and method for stereo field enhancement in two-channel audio systems |
| US9397629B2 (en) * | 2013-10-22 | 2016-07-19 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US9906858B2 (en) | 2013-10-22 | 2018-02-27 | Bongiovi Acoustics Llc | System and method for digital signal processing |
| US20150170655A1 (en) * | 2013-12-15 | 2015-06-18 | Qualcomm Incorporated | Systems and methods of blind bandwidth extension |
| CN103714822B (en) * | 2013-12-27 | 2017-01-11 | 广州华多网络科技有限公司 | Sub-band coding and decoding method and device based on SILK coder decoder |
| KR101864122B1 (en) | 2014-02-20 | 2018-06-05 | 삼성전자주식회사 | Electronic apparatus and controlling method thereof |
| US10820883B2 (en) | 2014-04-16 | 2020-11-03 | Bongiovi Acoustics Llc | Noise reduction assembly for auscultation of a body |
| US10639000B2 (en) | 2014-04-16 | 2020-05-05 | Bongiovi Acoustics Llc | Device for wide-band auscultation |
| EP2980794A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder using a frequency domain processor and a time domain processor |
| KR102067044B1 (en) * | 2016-02-17 | 2020-01-17 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Post Processor, Pre Processor, Audio Encoder, Audio Decoder, and Related Methods for Enhancing Transient Processing |
| US11211043B2 (en) | 2018-04-11 | 2021-12-28 | Bongiovi Acoustics Llc | Audio enhanced hearing protection system |
| CN110556116B (en) * | 2018-05-31 | 2021-10-22 | 华为技术有限公司 | Method and apparatus for computing downmix signal and residual signal |
| US10959035B2 (en) | 2018-08-02 | 2021-03-23 | Bongiovi Acoustics Llc | System, method, and apparatus for generating and digitally processing a head related audio transfer function |
| CN112002333B (en) * | 2019-05-07 | 2023-07-18 | 海能达通信股份有限公司 | Voice synchronization method and device and communication terminal |
| CN117373465B (en) * | 2023-12-08 | 2024-04-09 | 富迪科技(南京)有限公司 | Voice frequency signal switching system |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1488137A (en) * | 2001-01-24 | 2004-04-07 | �����ɷ� | Enhanced conversion of wideband signals to narrow band signals |
| CN1942928A (en) * | 2004-04-15 | 2007-04-04 | 诺基亚公司 | Coding of audio signals |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4330689A (en) * | 1980-01-28 | 1982-05-18 | The United States Of America As Represented By The Secretary Of The Navy | Multirate digital voice communication processor |
| US4769833A (en) * | 1986-03-31 | 1988-09-06 | American Telephone And Telegraph Company | Wideband switching system |
| US5019910A (en) * | 1987-01-29 | 1991-05-28 | Norsat International Inc. | Apparatus for adapting computer for satellite communications |
| FI115329B (en) * | 2000-05-08 | 2005-04-15 | Nokia Corp | Method and arrangement for switching the source signal bandwidth in a communication connection equipped for many bandwidths |
| KR100940531B1 (en) * | 2003-07-16 | 2010-02-10 | 삼성전자주식회사 | Broadband voice signal compression and decompression device and method |
| JP2005080079A (en) * | 2003-09-02 | 2005-03-24 | Sony Corp | Audio reproduction apparatus and audio reproduction method |
| JPWO2005106848A1 (en) * | 2004-04-30 | 2007-12-13 | 松下電器産業株式会社 | Scalable decoding apparatus and enhancement layer erasure concealment method |
| JP4558734B2 (en) * | 2004-07-28 | 2010-10-06 | パナソニック株式会社 | Signal decoding device |
| US7895035B2 (en) * | 2004-09-06 | 2011-02-22 | Panasonic Corporation | Scalable decoding apparatus and method for concealing lost spectral parameters |
| JP5046654B2 (en) * | 2005-01-14 | 2012-10-10 | パナソニック株式会社 | Scalable decoding apparatus and scalable decoding method |
| US8249861B2 (en) * | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
| US8150684B2 (en) * | 2005-06-29 | 2012-04-03 | Panasonic Corporation | Scalable decoder preventing signal degradation and lost data interpolation method |
| WO2008103925A1 (en) * | 2007-02-22 | 2008-08-28 | Personics Holdings Inc. | Method and device for sound detection and audio control |
| EP2207166B1 (en) * | 2007-11-02 | 2013-06-19 | Huawei Technologies Co., Ltd. | An audio decoding method and device |
| CN101425292B (en) * | 2007-11-02 | 2013-01-02 | 华为技术有限公司 | Decoding method and device for audio signal |
| CN100585699C (en) * | 2007-11-02 | 2010-01-27 | 华为技术有限公司 | Method and device for audio decoding |
| CN101964189B (en) * | 2010-04-28 | 2012-08-08 | 华为技术有限公司 | Audio signal switching method and device |
-
2010
- 2010-04-28 CN CN2010101634063A patent/CN101964189B/en active Active
-
2011
- 2011-04-28 BR BR112012013306A patent/BR112012013306B8/en active IP Right Grant
- 2011-04-28 JP JP2012541316A patent/JP5667202B2/en active Active
- 2011-04-28 ES ES17151713T patent/ES2718947T3/en active Active
- 2011-04-28 EP EP11774406.0A patent/EP2485029B1/en active Active
- 2011-04-28 KR KR1020127012328A patent/KR101377547B1/en active Active
- 2011-04-28 AU AU2011247719A patent/AU2011247719B2/en active Active
- 2011-04-28 ES ES11774406.0T patent/ES2635212T3/en active Active
- 2011-04-28 EP EP17151713.9A patent/EP3249648B1/en active Active
- 2011-04-28 WO PCT/CN2011/073479 patent/WO2011134415A1/en not_active Ceased
-
2014
- 2014-12-11 JP JP2014250689A patent/JP6027081B2/en active Active
-
2016
- 2016-10-13 JP JP2016201593A patent/JP6410777B2/en active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1488137A (en) * | 2001-01-24 | 2004-04-07 | �����ɷ� | Enhanced conversion of wideband signals to narrow band signals |
| CN1942928A (en) * | 2004-04-15 | 2007-04-04 | 诺基亚公司 | Coding of audio signals |
Also Published As
| Publication number | Publication date |
|---|---|
| EP3249648A1 (en) | 2017-11-29 |
| ES2635212T3 (en) | 2017-10-02 |
| BR112012013306B8 (en) | 2021-02-17 |
| JP2015045888A (en) | 2015-03-12 |
| JP6027081B2 (en) | 2016-11-16 |
| KR20120074303A (en) | 2012-07-05 |
| ES2718947T3 (en) | 2019-07-05 |
| WO2011134415A1 (en) | 2011-11-03 |
| JP2013512468A (en) | 2013-04-11 |
| JP2017033015A (en) | 2017-02-09 |
| KR101377547B1 (en) | 2014-03-25 |
| EP2485029A1 (en) | 2012-08-08 |
| EP2485029A4 (en) | 2013-01-02 |
| AU2011247719B2 (en) | 2013-07-11 |
| JP5667202B2 (en) | 2015-02-12 |
| BR112012013306A2 (en) | 2016-03-01 |
| EP2485029B1 (en) | 2017-06-14 |
| EP3249648B1 (en) | 2019-01-09 |
| BR112012013306B1 (en) | 2020-11-10 |
| CN101964189A (en) | 2011-02-02 |
| JP6410777B2 (en) | 2018-10-24 |
| AU2011247719A1 (en) | 2012-06-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101964189B (en) | Audio signal switching method and device | |
| US10559313B2 (en) | Speech/audio signal processing method and apparatus | |
| US8214218B2 (en) | Method and apparatus for switching speech or audio signals | |
| KR101770237B1 (en) | Method, apparatus, and system for processing audio data | |
| JP2022548299A (en) | Audio encoding method and apparatus | |
| KR101924767B1 (en) | Voice frequency code stream decoding method and device | |
| US8311842B2 (en) | Method and apparatus for expanding bandwidth of voice signal | |
| CN102044250B (en) | Band spreading method and apparatus | |
| KR20150127041A (en) | Device and method for reducing quantization noise in a time-domain decoder | |
| CN101996640B (en) | Frequency band expansion method and device | |
| US20200227061A1 (en) | Signal codec device and method in communication system | |
| CN106486129B (en) | A kind of audio coding method and device | |
| EP2774148B1 (en) | Bandwidth extension of audio signals | |
| CN104751849B (en) | Decoding method and device of audio streams | |
| CN101751925A (en) | Tone decoding method and device | |
| JP3219467B2 (en) | Audio decoding method | |
| KR20100084632A (en) | Transmission error dissimulation in a digital signal with complexity distribution | |
| JPH0944196A (en) | Speech decoding method and apparatus |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant |
