CN115223582B - Audio noise processing method, system, electronic device and medium - Google Patents
Audio noise processing method, system, electronic device and medium Download PDFInfo
- Publication number
- CN115223582B CN115223582B CN202111540141.9A CN202111540141A CN115223582B CN 115223582 B CN115223582 B CN 115223582B CN 202111540141 A CN202111540141 A CN 202111540141A CN 115223582 B CN115223582 B CN 115223582B
- Authority
- CN
- China
- Prior art keywords
- audio signal
- noise reduction
- audio
- voice
- processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Telephone Function (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
本发明提供一种音频的噪声处理方法及系统,其中方法包括:获取麦克风端的音频信号,并确定音频信号的音源;若音频信号的音源为通话音频信号,则通过DSP降噪芯片对通话音频信号进行降噪处理,并将得到的第一降噪信号经过音频硬件抽象层输入至通话模块;若音频信号的音源为本地语音音频信号,则经过音频硬件抽象层输入至语音助理应用软件,并通过集成在语音助理应用软件中的语音降噪算法对本地语音音频信号进行降噪处理和回声消除处理;若音频信号的音源为手机互联语音音频信号,则通过硬件抽象层的降噪单元对手机互联语音音频信号进行降噪处理。本发明降低噪声处理的硬件成本及开发成本,并避免对音频硬件抽象层大幅度修改引起的质量风险。
The present invention provides an audio noise processing method and system. The method includes: obtaining the audio signal at the microphone end and determining the source of the audio signal; if the source of the audio signal is a call audio signal, the call audio signal is processed through a DSP noise reduction chip. Perform noise reduction processing, and input the obtained first noise reduction signal to the call module through the audio hardware abstraction layer; if the source of the audio signal is a local voice audio signal, then input it to the voice assistant application software through the audio hardware abstraction layer, and pass The voice noise reduction algorithm integrated in the voice assistant application software performs noise reduction processing and echo cancellation processing on the local voice audio signal; if the source of the audio signal is a mobile phone interconnection voice audio signal, the noise reduction unit of the hardware abstraction layer will perform noise reduction processing on the mobile phone interconnection. The speech audio signal undergoes noise reduction processing. The invention reduces the hardware cost and development cost of noise processing, and avoids quality risks caused by substantial modification of the audio hardware abstraction layer.
Description
技术领域Technical field
本发明涉及噪声处理技术领域,特别是涉及一种音频的噪声处理方法、系统、电子装置及介质。The present invention relates to the technical field of noise processing, and in particular to an audio noise processing method, system, electronic device and medium.
背景技术Background technique
随着车联网的发展,语音,蓝牙通话等已成为车内主流和重要的交互方式,在车载环境下,车内包含环境噪声、风噪、胎噪、播放的音频等噪声,造成车机语音和蓝牙等录入的信号较差,导致通话和识别等质量不高。车内降噪应用场景分为三类:一是用于语音识别的前端音频处理,二是用于蓝牙通话、CarPlay(由美国苹果公司发布的车载系统,将用户的iOS设备与仪表盘系统结合)、车载网联终端(Telematics BOX,TBOX)等通话功能的前端音频处理,三是用于CarPlay、Carlife(由百度公司推出的一款手机车机智能互联产品)等手机互联语音音频处理。当前业内主要采用硬降噪处理或者软降噪两种其一的处理方案进行。With the development of the Internet of Vehicles, voice, Bluetooth calls, etc. have become the mainstream and important interaction methods in the car. In the vehicle environment, the car contains environmental noise, wind noise, tire noise, played audio and other noises, resulting in vehicle voice The input signal of Bluetooth and other devices is poor, resulting in low quality of calls and recognition. In-car noise reduction application scenarios are divided into three categories: one is for front-end audio processing for speech recognition, and the other is for Bluetooth calls and CarPlay (a vehicle-mounted system released by Apple in the United States that integrates the user's iOS device with the dashboard system ), front-end audio processing for call functions such as telematics BOX, TBOX, etc. The third is used for mobile phone interconnection voice audio processing such as CarPlay and Carlife (a mobile phone and car intelligent interconnection product launched by Baidu). Currently, the industry mainly uses one of two processing solutions: hard noise reduction or soft noise reduction.
现有的车载降噪方法主要采用硬降噪处理或者软降噪两种其一的处理方案进行处理。硬降噪是指通过独立的硬件降噪模块进行降噪和消除回音,但此种方法的缺点在于:1)硬件成本高,对于录音通道等存在硬件设计难度大;2)印刷线路板(Printed CircuitBoard,PCB)占用空间大;3)刷新需结合固件更新,无法灵活调整降噪参数/算法,无法通过空中下载技术(Over-the-Air Technology,OTA)去更新,对于售后更新时的刷新难度大。软降噪是指通过软件算法进行降噪,但此种方法的缺点在于:1)成本较高,由于全部算法均集成在片上系统(System on Chip,SOC)中,导致CPU的性能消耗极大,因此需采用高性能的SOC芯片,增加成本;若CPU的性能不足时存在时延情况,降噪效果不好;2)由于所有的软降噪算法都部署在安卓系统的音频硬件抽象层(Audio Hardware Abstraction Layer,Audio-HAL),导致修改HAL层引起质量风险增加;3)由于软降噪算法校验过程复杂,增加生产复杂度,降低生产效率。Existing vehicle noise reduction methods mainly use one of two processing solutions: hard noise reduction processing or soft noise reduction processing. Hard noise reduction refers to noise reduction and echo elimination through independent hardware noise reduction modules. However, the disadvantages of this method are: 1) high hardware cost and difficulty in hardware design for recording channels; 2) printed circuit board (Printed circuit board). CircuitBoard (PCB) takes up a lot of space; 3) Refreshing needs to be combined with firmware updates, and the noise reduction parameters/algorithms cannot be flexibly adjusted. It cannot be updated through Over-the-Air Technology (OTA). It is difficult to refresh during after-sales updates. big. Soft noise reduction refers to noise reduction through software algorithms. However, the disadvantages of this method are: 1) The cost is high. Since all algorithms are integrated in the system on chip (SOC), the performance of the CPU is greatly consumed. , therefore it is necessary to use a high-performance SOC chip, which increases the cost; if there is a delay when the CPU performance is insufficient, the noise reduction effect is not good; 2) Since all soft noise reduction algorithms are deployed in the audio hardware abstraction layer of the Android system ( Audio Hardware Abstraction Layer (Audio-HAL), leading to an increase in quality risks caused by modifying the HAL layer; 3) Due to the complex verification process of the soft noise reduction algorithm, production complexity is increased and production efficiency is reduced.
发明内容Contents of the invention
为解决以上现有技术问题,本发明提供一种音频的噪声处理方法、系统、电子装置及介质,通过为不同音源的音频信号提供与之相适应的降噪处理方法,降低车载音频噪声干扰,提高车载音频质量,降低噪声处理的硬件成本及开发成本。In order to solve the above existing technical problems, the present invention provides an audio noise processing method, system, electronic device and medium, which reduces vehicle audio noise interference by providing noise reduction processing methods suitable for audio signals from different sound sources. Improve the quality of in-vehicle audio and reduce the hardware cost and development cost of noise processing.
本发明第一方面提供一种音频的噪声处理方法,包括:A first aspect of the present invention provides an audio noise processing method, including:
获取麦克风端的音频信号,并确定所述音频信号的音源;Obtain the audio signal from the microphone end and determine the source of the audio signal;
若所述音频信号的音源为通话音频信号,则通过DSP降噪芯片对所述通话音频信号进行降噪处理,并将降噪处理得到的第一降噪信号经过音频硬件抽象层输入至通话模块,以完成双向通话交互及响应;If the source of the audio signal is a call audio signal, then the DSP noise reduction chip performs noise reduction processing on the call audio signal, and the first noise reduction signal obtained by the noise reduction processing is input to the call module through the audio hardware abstraction layer. , to complete two-way call interaction and response;
若所述音频信号的音源为本地语音音频信号,则经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理;If the source of the audio signal is a local voice audio signal, it is input to the voice assistant application software through the audio hardware abstraction layer, and the local voice audio signal is processed through the voice noise reduction algorithm integrated in the voice assistant application software. Noise reduction processing and echo cancellation processing;
若所述音频信号的音源为手机互联语音音频信号,则通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理。If the source of the audio signal is a mobile phone Internet voice audio signal, the noise reduction process is performed on the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer.
进一步地,所述通过DSP降噪芯片对所述通话音频信号进行降噪处理,包括:Further, the noise reduction processing of the call audio signal through the DSP noise reduction chip includes:
通过DSP降噪芯片对所述通话音频信号进行增益,得到第一增益信号;Gain the call audio signal through a DSP noise reduction chip to obtain a first gain signal;
对所述第一增益信号进行噪声分离及滤波,得到的单声道音频信号;Perform noise separation and filtering on the first gain signal to obtain a monophonic audio signal;
对所述单声道音频信号进行重采样,使所述单声道音频信号的采样率满足所述通话模块的输入信号的采样率,并将重采样后的单声道音频信号作为第一降噪信号。The mono audio signal is resampled so that the sampling rate of the mono audio signal meets the sampling rate of the input signal of the call module, and the resampled mono audio signal is used as the first downscaled signal. Noisy signal.
进一步地,所述对所述单声道音频信号进行重采样,具体地:Further, the mono audio signal is resampled, specifically:
通过SOC芯片中内置ADSP芯片对所述单声道音频信号进行硬件重采样。The monophonic audio signal is resampled by hardware through the built-in ADSP chip in the SOC chip.
进一步地,所述经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述语音音频信号进行降噪处理和回声消除处理之前,还包括:Further, before the voice audio signal is input to the voice assistant application software through the audio hardware abstraction layer, and the voice audio signal is subjected to noise reduction processing and echo cancellation processing through the voice noise reduction algorithm integrated in the voice assistant application software, include:
获取扬声器端的音频信号,作为参考信号;Obtain the audio signal at the speaker end as a reference signal;
将所述参考信号与所述语音音频信号进行混音,得到混音音频信号;Mix the reference signal and the speech audio signal to obtain a mixed audio signal;
将所述混音音频信号输入至CPU中,并调用所述集成在语音助理应用软件中的语音降噪算法。The mixed audio signal is input into the CPU, and the voice noise reduction algorithm integrated in the voice assistant application software is called.
进一步地,所述通过DSP降噪芯片对所述通话音频信号进行降噪处理之前,还包括:Further, before performing noise reduction processing on the call audio signal through the DSP noise reduction chip, it also includes:
将音频源程序设置为通话模式,并通过SPI串口通信协议将DSP降噪芯片切换为通话通道模式。Set the audio source program to call mode, and switch the DSP noise reduction chip to call channel mode through the SPI serial communication protocol.
进一步地,所述通过集成在语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理之前,还包括:Further, before performing noise reduction processing and echo cancellation processing on the local voice audio signal through the voice noise reduction algorithm integrated in the voice assistant application software, it also includes:
将音频源程序设置为智能语音模式,并将DSP降噪芯片切换为智能语音通道模式。Set the audio source program to intelligent voice mode, and switch the DSP noise reduction chip to intelligent voice channel mode.
进一步地,所述确定所述音频信号的音源,具体地:Further, the sound source of the audio signal is determined, specifically:
通过所述音频信号的音源流通通道对所述音频信号的音源进行判断。The sound source of the audio signal is determined through the sound source circulation channel of the audio signal.
本发明第二方面提供一种音频的噪声处理系统,包括:A second aspect of the present invention provides an audio noise processing system, including:
音频信号获取及音源判断模块,音源获取麦克风端的音频信号,并确定所述音频信号的音源;Audio signal acquisition and sound source judgment module, the sound source obtains the audio signal from the microphone end, and determines the sound source of the audio signal;
通话音频信号处理模块,用于当所述音频信号的音源为通话音频信号时,通过DSP降噪芯片对所述通话音频信号进行降噪处理,并将降噪处理得到的第一降噪信号经过音频硬件抽象层输入至通话模块,以完成双向通话交互及响应;A call audio signal processing module, used to perform noise reduction processing on the call audio signal through a DSP noise reduction chip when the source of the audio signal is a call audio signal, and pass the first noise reduction signal obtained by the noise reduction processing through The audio hardware abstraction layer is input to the call module to complete two-way call interaction and response;
本地语音音频信号处理模块,用于当所述音频信号的音源为本地语音音频信号时,经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理;A local voice audio signal processing module, used when the source of the audio signal is a local voice audio signal, input to the voice assistant application software through the audio hardware abstraction layer, and use the voice noise reduction integrated in the voice assistant application software The algorithm performs noise reduction processing and echo cancellation processing on the local voice audio signal;
手机互联语音音频信号处理模块,用于当所述音频信号的音源为手机互联语音音频信号时,通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理。The mobile phone Internet voice audio signal processing module is used to perform noise reduction processing on the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer when the source of the audio signal is the mobile phone Internet voice audio signal.
本发明第三方面提供一种电子装置,包括处理器、存储器以及存储在所述存储器中且被配置为由所述处理器执行的计算机程序,所述处理器执行所述计算机程序时实现如上述第一方面中任意一项所述的音频的噪声处理方法。A third aspect of the present invention provides an electronic device, including a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor. When the processor executes the computer program, the above-mentioned steps are implemented. The audio noise processing method according to any one of the first aspects.
本发明第四方面提供一种计算机可读存储介质,所述计算机可读存储介质包括存储的计算机程序,其中,在所述计算机程序运行时控制所述计算机可读存储介质所在设备执行如上述第一方面中任意一项所述的音频的噪声处理方法。A fourth aspect of the present invention provides a computer-readable storage medium. The computer-readable storage medium includes a stored computer program, wherein when the computer program is running, the device where the computer-readable storage medium is located is controlled to execute the above-mentioned first step. The audio noise processing method described in any one of the aspects.
与现有技术相比,本发明实施例的有益效果在于:Compared with the prior art, the beneficial effects of the embodiments of the present invention are:
本发明提供一种音频的噪声处理方法及系统,其中方法包括:获取麦克风端的音频信号,并确定所述音频信号的音源;若所述音频信号的音源为通话音频信号,则通过DSP降噪芯片对所述通话音频信号进行降噪处理,并将降噪处理得到的第一降噪信号输入至经过音频硬件抽象层通话模块,以完成双向通话交互及响应;若所述音频信号的音源为本地语音音频信号,则经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理;若所述音频信号的音源为手机互联语音音频信号,则通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理。本发明根据音频信号的音源对音频信号进行分离,针对不同音源的音频信号采用不同的降噪方法,降低噪声处理的硬件成本及开发成本,并避免对音频硬件抽象层大幅度修改引起的质量风险。The present invention provides an audio noise processing method and system, wherein the method includes: obtaining the audio signal at the microphone end, and determining the source of the audio signal; if the source of the audio signal is a call audio signal, then through the DSP noise reduction chip Perform noise reduction processing on the call audio signal, and input the first noise reduction signal obtained by the noise reduction processing to the call module through the audio hardware abstraction layer to complete two-way call interaction and response; if the source of the audio signal is local The voice audio signal is input to the voice assistant application software through the audio hardware abstraction layer, and the local voice audio signal is subjected to noise reduction processing and echo cancellation processing through the voice noise reduction algorithm integrated in the voice assistant application software; if If the source of the audio signal is a mobile phone Internet voice audio signal, the noise reduction process is performed on the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer. The present invention separates the audio signal according to its source, adopts different noise reduction methods for audio signals from different sources, reduces the hardware cost and development cost of noise processing, and avoids quality risks caused by substantial modifications to the audio hardware abstraction layer. .
附图说明Description of the drawings
为了更清楚地说明本发明的技术方案,下面将对实施方式中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solution of the present invention more clearly, the drawings needed to be used in the implementation will be briefly introduced below. Obviously, the drawings in the following description are only some implementations of the present invention. For ordinary people in the art, For technical personnel, other drawings can also be obtained based on these drawings without exerting creative work.
图1是本发明实施例1提供的一种音频的噪声处理方法的流程图;Figure 1 is a flow chart of an audio noise processing method provided by Embodiment 1 of the present invention;
图2是本发明实施例2提供的一种音频的噪声处理方法的流程图;Figure 2 is a flow chart of an audio noise processing method provided by Embodiment 2 of the present invention;
图3是本发明实施例3提供的一种音频的噪声处理方法的流程图;Figure 3 is a flow chart of an audio noise processing method provided by Embodiment 3 of the present invention;
图4是本发明实施例5提供的一种音频的噪声处理系统的装置图;Figure 4 is a device diagram of an audio noise processing system provided by Embodiment 5 of the present invention;
图5是本发明某一实施例提供的一种电子设备的结构图。Figure 5 is a structural diagram of an electronic device provided by an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without making creative efforts fall within the scope of protection of the present invention.
应当理解,文中所使用的步骤编号仅是为了方便描述,不对作为对步骤执行先后顺序的限定。It should be understood that the step numbers used in the text are only for convenience of description and are not intended to limit the execution order of the steps.
应当理解,在本发明说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本发明。如在本发明说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should be understood that the terminology used in the description of the present invention is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms unless the context clearly dictates otherwise.
术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。The terms "comprises" and "comprising" indicate the presence of described features, integers, steps, operations, elements and/or components but do not exclude the presence of one or more other features, integers, steps, operations, elements, components and/or The existence or addition to its collection.
术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。The term "and/or" refers to any and all possible combinations of one or more of the associated listed items and includes such combinations.
当前业内主要采用硬降噪处理或者软降噪两种其一的处理方案进行:单一采用硬降噪,由于通话语音等对于硬件降噪模块要求较高,需要专门采用独立的降噪模块,由降噪模块进行降噪回音,但此种方案对于硬件成本上升严重,且对于印制电路板(PrintedCircuit Board,PCB)占用空间大,且每次均需更新固件无法灵活调整降噪参数/算法,特别是售后更新,刷新难度大,无法通过OTA去更新。Currently, the industry mainly adopts one of two processing solutions: hard noise reduction processing or soft noise reduction: Hard noise reduction is used alone. Due to high requirements for hardware noise reduction modules such as voice calls, an independent noise reduction module needs to be specially used. The noise reduction module reduces noise echo, but this solution seriously increases the hardware cost, takes up a lot of space on the printed circuit board (PCB), and needs to update the firmware every time, making it difficult to flexibly adjust the noise reduction parameters/algorithm. Especially after-sales updates are difficult to refresh and cannot be updated through OTA.
单一采用软降噪,由于软降噪所需的算法集成在SOC中,若通话语音采用软降噪,相比硬降噪对于CPU的性能消耗极大,若性能不足,则会存在时延问题,降低通话效果;且通话软降噪算法需部署在安卓系统的Audio-HAL,导致修改HAL层引起质量风险增加,且多路的软降噪算法校验步骤容易造成影响产线复杂度,减慢生产节拍。Soft noise reduction is used alone. Since the algorithms required for soft noise reduction are integrated in the SOC, if soft noise reduction is used for call voice, it consumes a lot of CPU performance compared to hard noise reduction. If the performance is insufficient, there will be a delay problem. , reducing the call effect; and the call soft noise reduction algorithm needs to be deployed in the Audio-HAL of the Android system, resulting in an increase in quality risks caused by modifying the HAL layer, and the multi-channel soft noise reduction algorithm verification steps can easily affect the complexity of the production line, reducing Slow production pace.
相比业内采用硬降噪或者软降噪其一的方法,本发明提出一种将硬降噪与软降噪结合的音频的噪声处理方法,根据音频信号的音源对音频信号进行分离,采用对不同音源的音频信号采用不同的降噪方法,降低车载音频噪声干扰,提高车载音频质量,降低噪声处理的硬件成本及开发成本。Compared with the industry's method of using either hard noise reduction or soft noise reduction, the present invention proposes an audio noise processing method that combines hard noise reduction and soft noise reduction. The audio signal is separated according to the source of the audio signal, and the audio signal is separated. Audio signals from different sources use different noise reduction methods to reduce vehicle audio noise interference, improve vehicle audio quality, and reduce noise processing hardware costs and development costs.
第一方面。first.
请参阅图1,本发明实施例1提供一种音频的噪声处理方法,包括步骤S100、S200、S300及S400;需要说明的是,步骤S100执行结束后,根据S100的结果,选择S200或S300或S400中的一个步骤进行执行;且步骤S200、S300及S400之间为并列关系,而非先后关系。具体地:Please refer to Figure 1. Embodiment 1 of the present invention provides an audio noise processing method, including steps S100, S200, S300 and S400; it should be noted that after the execution of step S100 is completed, according to the result of S100, select S200 or S300 or One step in S400 is executed; and the relationship between steps S200, S300 and S400 is parallel rather than sequential. specifically:
S100、获取麦克风端的音频信号,并确定所述音频信号的音源。S100. Obtain the audio signal from the microphone end and determine the source of the audio signal.
需要说明的是,所述麦克风端的音频信号为用户对麦克风端的输入信号,DSP芯片采集到麦克风端的音频信号后,由车载系统对所述音频信号的音源进行判定。It should be noted that the audio signal at the microphone end is the user's input signal to the microphone end. After the DSP chip collects the audio signal at the microphone end, the vehicle system determines the source of the audio signal.
可以理解的是,由于不同的音频信号的音频通道不同,所以确定所述音频信号的音源可以为:通过所述音频信号的音源流通通道对所述音频信号的音源进行判断。由于不同的音频信号的频域不同,所以确定所述音频信号的音源还可以为根据所述音频信号的频域对所述音频信号的音源进行判断。It can be understood that since different audio signals have different audio channels, determining the sound source of the audio signal may be: judging the sound source of the audio signal through the sound source circulation channel of the audio signal. Since different audio signals have different frequency domains, determining the sound source of the audio signal may also include determining the sound source of the audio signal based on the frequency domain of the audio signal.
优选地,所述确定所述音频信号的音源,具体地:Preferably, the sound source of the audio signal is determined, specifically:
通过所述音频信号的音源流通通道对所述音频信号的音源进行判断。The sound source of the audio signal is determined through the sound source circulation channel of the audio signal.
S200、若所述音频信号的音源为通话音频信号,则通过DSP降噪芯片对所述通话音频信号进行降噪处理,并将降噪处理得到的第一降噪信号经过音频硬件抽象层输入至通话模块,以完成双向通话交互及响应。S200. If the source of the audio signal is a call audio signal, perform noise reduction processing on the call audio signal through the DSP noise reduction chip, and input the first noise reduction signal obtained by the noise reduction processing to the audio hardware abstraction layer. Call module to complete two-way call interaction and response.
需要说明的是,所述通话音频信号包括:蓝牙通话、CarPlay电话、ECALL(车载紧急呼叫系统)、BCAll(一键电话救援,主要是道路救援)等通话音频信号。所述通过DSP降噪芯片对所述通话音频信号进行降噪处理即为对通话音频信号进行硬降噪处理。It should be noted that the call audio signals include: Bluetooth calls, CarPlay calls, ECALL (car emergency call system), BCAll (one-button phone rescue, mainly road rescue) and other call audio signals. The noise reduction processing of the call audio signal through the DSP noise reduction chip is the hard noise reduction processing of the call audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为通话音频信号后,直接由DSP芯片内部进行降噪处理,得到第一降噪信号,将所述第一降噪信号经过音频硬件抽象层传输至通话模块中,以完成双向通话交互及响应。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a call audio signal, the DSP chip directly performs noise reduction processing inside the chip to obtain the first noise reduction signal, and the first noise reduction signal is passed through the audio signal The hardware abstraction layer is transmitted to the call module to complete two-way call interaction and response.
S300、若所述音频信号的音源为本地语音音频信号,则经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理。S300. If the source of the audio signal is a local voice audio signal, it is input to the voice assistant application software through the audio hardware abstraction layer, and the local voice audio is processed through the voice noise reduction algorithm integrated in the voice assistant application software. The signal is processed for noise reduction and echo cancellation.
需要说明的是,本地语音音频信号包括:车载人工智能语音、车载语音识别助手等本地语音音频信号。所述通过集成在语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理即为对本地语音音频信号进行软降噪。It should be noted that local voice audio signals include: vehicle-mounted artificial intelligence voice, vehicle-mounted voice recognition assistant and other local voice audio signals. Performing noise reduction processing and echo cancellation processing on the local voice audio signal through the voice noise reduction algorithm integrated in the voice assistant application software is to perform soft noise reduction on the local voice audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为本地语音音频信号后,将所述本地语音音频信号传输至CPU中,经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理。It can be understood that after the DSP chip collects the audio signal at the microphone end and the vehicle system determines that it is a local voice audio signal, the local voice audio signal is transmitted to the CPU and input to the voice assistant application software through the audio hardware abstraction layer, and through The voice noise reduction algorithm integrated in the voice assistant application software performs noise reduction processing and echo cancellation processing on the local voice audio signal.
S400、若所述音频信号的音源为手机互联语音音频信号,则通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理。S400. If the source of the audio signal is a mobile phone Internet voice audio signal, perform noise reduction processing on the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer.
需要说明的是,手机互联语音音频为语音和音乐混音,手机互联语音音频信号包括:Carplay、Carlife等第三方应用程序的手机互联语音音频信号,且所述第三方应用程序未提供降噪功能。所述通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理即为对手机互联语音音频信号进行软降噪。It should be noted that the mobile phone Internet voice audio is a mix of voice and music. The mobile phone Internet voice audio signals include: mobile phone Internet voice audio signals from third-party applications such as Carplay and Carlife, and the third-party applications do not provide noise reduction functions. . The noise reduction processing of the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer is to perform soft noise reduction on the mobile phone Internet voice audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为手机互联语音音频信号后,需要在播放音乐时进行语音监听,将所述本地语音音频信号传输至CPU中,并在音频硬件抽象层通过车载SOC系统对所述手机互联语音音频信号进行降噪处理。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is the mobile phone Internet voice audio signal, it needs to perform voice monitoring when playing music, transmit the local voice audio signal to the CPU, and use it in the audio hardware The abstraction layer performs noise reduction processing on the mobile phone Internet voice audio signal through the vehicle-mounted SOC system.
本发明实施例1提供的音频的噪声处理方法,根据音频信号的音源对音频信号进行分离,针对通话音频信号采用DSP降噪芯片进行降噪,针对本地语音音频信号采用语音降噪算法进行降噪,针对手机互联语音音频信号采用硬件抽象层的降噪单元进行降噪,即对不同音源的音频信号采用不同的降噪方法,降低车载音频噪声干扰,提高车载音频质量,降低噪声处理的硬件成本及开发成本。The audio noise processing method provided in Embodiment 1 of the present invention separates the audio signal according to the source of the audio signal, uses a DSP noise reduction chip for call audio signal to reduce noise, and uses a voice noise reduction algorithm for local voice audio signal to reduce noise. , for mobile phone Internet voice audio signals, the noise reduction unit of the hardware abstraction layer is used for noise reduction, that is, different noise reduction methods are used for audio signals from different sources to reduce vehicle audio noise interference, improve vehicle audio quality, and reduce the hardware cost of noise processing. and development costs.
此外,对于通话音频信号的降噪过程不占用CPU性能,不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率;对于通话音频信号的降噪的售后更新无需更换DSP芯片,仅通过软件升级刷新进行售后更新,可随时OTA升级语音降噪算法,减少硬件刷新成本。In addition, the noise reduction process of the call audio signal does not occupy CPU performance, does not need to modify the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package and Bluetooth phone, etc., reducing the probability of call failure; for calls After-sales updates for audio signal noise reduction do not require replacement of the DSP chip. After-sales updates are only performed through software upgrades and refreshes. The voice noise reduction algorithm can be upgraded OTA at any time to reduce hardware refresh costs.
对于本地语音音频信号的降噪过程,由于每个车型由于麦克风布置位置、角度、间距、内饰等差异以及车内发动机、空调、胎噪、路噪等噪音环境不同,为保障良好的语音识别效果,语音唤醒阈值、车内混响信号模型等语音参数随着车型变化需要进行标定调整,本方案提出的语音降噪算法集成在APP层,仅修改APP层参数,可以无需对HAL层修改,即可实现各车型快速修改适用,可根据不同的车型,不同应用场景更加灵活调整语音调音参数,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,对于本地语音音频信号的降噪的售后更新并可灵活调整调音参数,无需更新固件,减少刷新成本。For the noise reduction process of local voice audio signals, since each car model has different microphone layout positions, angles, spacing, interior decorations, etc., as well as different noise environments such as engines, air conditioners, tire noise, and road noise in the car, in order to ensure good speech recognition Effect, voice parameters such as voice wake-up threshold and in-car reverberation signal model need to be calibrated and adjusted as the vehicle model changes. The voice noise reduction algorithm proposed in this solution is integrated in the APP layer. Only the APP layer parameters are modified, and there is no need to modify the HAL layer. It can be quickly modified and adapted to each car model. The voice tuning parameters can be more flexibly adjusted according to different car models and different application scenarios. It can also reduce the need for separate hardware noise reduction modules, reduce PCB occupation, reduce dependence on hardware, and greatly reduce costs. , after-sales updates for noise reduction of local voice audio signals and flexible adjustment of tuning parameters without the need to update firmware, reducing refresh costs.
对于手机互联语音音频信号的降噪过程,由于DSP无法支持同时对语音和音乐混音情况下降噪,故无法支持对Carplay、Carlife等未提供降噪功能的第三方应用程序的手机互联语音音频信号进行降噪,因此,对于手机互联语音音频信号的降噪过程由HAL层的车载SOC系统进行处理,减少PCB的占用,减少对硬件的依赖,大大降低成本。Regarding the noise reduction process of mobile phone Internet voice audio signals, since the DSP cannot support simultaneous noise reduction for voice and music mixing, it cannot support the mobile phone Internet voice audio for third-party applications such as Carplay and Carlife that do not provide noise reduction functions. The signal is denoised. Therefore, the noise reduction process of the mobile phone Internet voice audio signal is processed by the vehicle SOC system at the HAL layer, which reduces PCB occupation, reduces dependence on hardware, and greatly reduces costs.
请参阅图2,本发明实施例2提供的音频的噪声处理方法,其步骤S100-S400均与实施例1中的步骤S100-S400相同,不同之处在于:所述步骤S200中通过DSP降噪芯片对所述通话音频信号进行降噪处理,包括:Please refer to Figure 2. In the audio noise processing method provided in Embodiment 2 of the present invention, steps S100-S400 are the same as steps S100-S400 in Embodiment 1. The difference lies in that: in step S200, DSP is used to reduce noise. The chip performs noise reduction processing on the call audio signal, including:
S210、通过DSP降噪芯片对所述通话音频信号进行增益,得到第一增益信号。S210. Use the DSP noise reduction chip to gain the call audio signal to obtain a first gain signal.
需要说明的是,所述增益即为对信号进行放大倍数。It should be noted that the gain is the signal amplification factor.
S220、对所述第一增益信号进行噪声分离及滤波,得到的单声道音频信号。S220. Perform noise separation and filtering on the first gain signal to obtain a monophonic audio signal.
需要说明的是,所述分离机滤波为对信号进行去噪处理。It should be noted that the separator filtering is used to denoise the signal.
可以理解的是,DSP降噪芯片将通话音频信号进行增益,并将外部环境噪音分离及滤波出来,将噪音消除,从而提高通话质量。It is understandable that the DSP noise reduction chip gains the call audio signal, separates and filters the external environmental noise, and eliminates the noise, thereby improving the call quality.
S230对所述单声道音频信号进行重采样,使所述单声道音频信号的采样率满足所述通话模块的输入信号的采样率,并将重采样后的单声道音频信号作为第一降噪信号。S230 resamples the mono audio signal so that the sampling rate of the mono audio signal meets the sampling rate of the input signal of the call module, and uses the resampled mono audio signal as the first Noise reduction signal.
需要说明的是,重采样即为对信号再次采样,以得到满足所述通话模块的输入信号的采样率的信号。由于DSP降噪芯片输出的信号的采样率与通话模块的输入信号的采样率有差异,因此,需要对信号进行重采样。例如,将DSP降噪芯片输出的采样率为48k的信号,通过重采样使信号的采用率变为通话模块的输入信号的采样率8K或24K或16K中的一种。It should be noted that resampling means resampling the signal to obtain a signal that satisfies the sampling rate of the input signal of the call module. Since the sampling rate of the signal output by the DSP noise reduction chip is different from the sampling rate of the input signal of the call module, the signal needs to be resampled. For example, the sampling rate of the signal output by the DSP noise reduction chip is 48k, and the sampling rate of the signal is changed to one of the sampling rates of 8K, 24K or 16K of the input signal of the call module through resampling.
优选地,所述步骤S230中对所述单声道音频信号进行重采样,具体地:Preferably, the mono audio signal is resampled in step S230, specifically:
通过SOC芯片中内置ADSP芯片对所述单声道音频信号进行硬件重采样。The monophonic audio signal is resampled by hardware through the built-in ADSP chip in the SOC chip.
需要说明的是,采用SOC芯片中内置ADSP芯片进行硬件重采样后,再送到核心算法处理引擎中,而不采用Androidframework层的软件重采样实现,大大减小系统的开销,增加系统效率。It should be noted that the built-in ADSP chip in the SOC chip is used for hardware resampling and then sent to the core algorithm processing engine instead of software resampling implementation at the Android framework layer, which greatly reduces system overhead and increases system efficiency.
本发明实施例2提供的音频的噪声处理方法,通过DSP降噪芯片对所述通话音频信号进行增益、分离、滤波及重采样,使DSP输出的通话音频信号的信号可以完全输入至通话模块,以完成双向通话交互及响应。对于通话音频信号的降噪过程不占用CPU性能,不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率;对于通话音频信号的降噪的售后更新无需更换DSP芯片,仅通过软件升级刷新进行售后更新,可随时OTA升级语音降噪算法,减少硬件刷新成本。The audio noise processing method provided in Embodiment 2 of the present invention uses a DSP noise reduction chip to gain, separate, filter and resample the call audio signal, so that the call audio signal output by the DSP can be completely input to the call module. To complete two-way call interaction and response. The noise reduction process for call audio signals does not occupy CPU performance, does not require changes to the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package from Bluetooth phones, etc., reducing the probability of call failures; for call audio signals After-sales updates for noise reduction do not require replacement of the DSP chip. After-sales updates are only performed through software upgrades and refreshes. The voice noise reduction algorithm can be upgraded OTA at any time to reduce hardware refresh costs.
请参阅图3,本发明实施例3提供的音频的噪声处理方法,其步骤S100-S400均与实施例1中的步骤S100-S400相同,不同之处在于:所述步骤S300中经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述语音音频信号进行降噪处理和回声消除处理之前,还包括:Please refer to Figure 3. In the audio noise processing method provided in Embodiment 3 of the present invention, steps S100-S400 are the same as steps S100-S400 in Embodiment 1. The difference is that in step S300, audio hardware abstraction is used. layer is input to the voice assistant application software, and before the voice audio signal is subjected to noise reduction processing and echo cancellation processing through the voice noise reduction algorithm integrated in the voice assistant application software, it also includes:
S310、获取扬声器端的音频信号,作为参考信号。S310. Obtain the audio signal at the speaker end as a reference signal.
需要说明的是,DSP实时采集扬声器端的音频信号,作为参考信号。It should be noted that the DSP collects the audio signal from the speaker in real time as a reference signal.
S320、将所述参考信号与所述语音音频信号进行混音,得到混音音频信号。S320. Mix the reference signal and the speech audio signal to obtain a mixed audio signal.
需要说明的是,DSP将所述参考信号与所述语音音频信号等多路信号通过ADC(模拟数字转换器)采集和转化后,并通过时分复用(time-division multiplexing,TDM)模式进行混音,得到一路信号后传输到CPU中,实现多路信号同时传输。It should be noted that the DSP collects and converts the reference signal and the voice audio signal and other multi-channel signals through an ADC (analog-to-digital converter), and then mixes them through time-division multiplexing (TDM) mode. After receiving one signal, it transmits it to the CPU to realize simultaneous transmission of multiple signals.
S330、将所述混音音频信号输入至CPU中,并调用所述集成在语音助理应用软件中的语音降噪算法。S330: Input the mixed audio signal into the CPU, and call the voice noise reduction algorithm integrated in the voice assistant application software.
需要说明的是,调用SOC中集成在语音助理应用软件中的语音降噪算法,实现对语音音频信号的降噪和回声消除,保障良好的识别效。所述语音降噪算法的接口函数如表1:It should be noted that the voice noise reduction algorithm integrated in the voice assistant application software in the SOC is called to achieve noise reduction and echo elimination of the voice audio signal to ensure good recognition effect. The interface functions of the speech noise reduction algorithm are as shown in Table 1:
表1Table 1
可以理解的是,对语音音频信号的降噪和回声消除的语音降噪算法集成在语音助理应用软件中,可以无需HAL层修改,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,并可灵活调整调音参数,无需更新固件,减少刷新成本。It can be understood that the voice noise reduction algorithm for voice audio signal noise reduction and echo cancellation is integrated in the voice assistant application software, which does not require HAL layer modification, and can reduce the need for separate hardware noise reduction modules, reduce PCB occupation, and reduce Reliance on hardware greatly reduces costs, and tuning parameters can be flexibly adjusted without the need to update firmware, reducing refresh costs.
本发明实施例3提供的音频的噪声处理方法,对于本地语音音频信号的降噪过程,由于每个车型由于麦克风布置位置、角度、间距、内饰等差异以及车内发动机、空调、胎噪、路噪等噪音环境不同,为保障良好的语音识别效果,语音唤醒阈值、车内混响信号模型等语音参数随着车型变化需要进行标定调整,通过将多路音频进行混音后,由集成在语音助理应用软件中的语音降噪算法对语音音频信号进行降噪,由于语音降噪算法集成在APP层,仅修改APP层参数,可以无需HAL层修改,即可实现各车型快速修改适用,可根据不同的车型,不同应用场景更加灵活调整语音调音参数,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,对于本地语音音频信号的降噪的售后更新并可灵活调整调音参数,无需更新固件,减少刷新成本。The audio noise processing method provided in Embodiment 3 of the present invention, for the noise reduction process of local voice audio signals, due to the differences in the microphone arrangement position, angle, spacing, interior decoration, etc. of each vehicle model, as well as the in-car engine, air conditioner, tire noise, Noise environments such as road noise are different. In order to ensure good speech recognition effects, speech parameters such as voice wake-up threshold and in-car reverberation signal model need to be calibrated and adjusted as the car model changes. By mixing multi-channel audio, it is integrated into The voice noise reduction algorithm in the voice assistant application software de-noises the voice audio signal. Since the voice noise reduction algorithm is integrated in the APP layer, only the APP layer parameters are modified. It can be quickly modified and adapted to each car model without the need for HAL layer modification. According to different car models and different application scenarios, the voice tuning parameters can be more flexibly adjusted, and separate hardware noise reduction modules can be reduced, reducing PCB occupation, reducing dependence on hardware, and greatly reducing costs. For the noise reduction of local voice audio signals After-sales updates allow for flexible adjustment of tuning parameters without the need to update firmware, reducing refresh costs.
本发明实施例4提供的音频的噪声处理方法,其步骤S100-S400均与实施例1中的步骤S100-S400相同,不同之处在于:In the audio noise processing method provided in Embodiment 4 of the present invention, steps S100-S400 are the same as steps S100-S400 in Embodiment 1, except that:
所述步骤S200中通过DSP降噪芯片对所述通话音频信号进行降噪处理之前,还包括:Before performing noise reduction processing on the call audio signal through the DSP noise reduction chip in step S200, it also includes:
将音频源程序设置为通话模式,并通过SPI串口通信协议将DSP降噪芯片切换为通话通道模式。Set the audio source program to call mode, and switch the DSP noise reduction chip to call channel mode through the SPI serial communication protocol.
需要说明的是,SPI串口通信协议是一种同步串行接口技术,是一种高速的,全双工,同步的通信总线,具有支持全双工通信、通信简单、数据传输速率快等优点。It should be noted that the SPI serial communication protocol is a synchronous serial interface technology. It is a high-speed, full-duplex, synchronous communication bus. It has the advantages of supporting full-duplex communication, simple communication, and fast data transmission rate.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为通话音频信号后,将音频源程序设置为通话模式,并通过SPI串口消息通知DSP切换到通话通道模式。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a call audio signal, it sets the audio source program to call mode and notifies the DSP to switch to call channel mode through an SPI serial port message.
所述步骤S300中通过集成在语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理之前,还包括:In step S300, before performing noise reduction processing and echo cancellation processing on the local voice audio signal through the voice noise reduction algorithm integrated in the voice assistant application software, it also includes:
将音频源程序设置为智能语音模式,并将DSP降噪芯片切换为智能语音通道模式。Set the audio source program to intelligent voice mode, and switch the DSP noise reduction chip to intelligent voice channel mode.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为本地语音音频信号后,将音频源程序设置为智能语音模式,并通过SPI串口消息通知DSP切换到智能语音通道模式。It is understandable that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a local voice audio signal, it sets the audio source program to the intelligent voice mode and notifies the DSP to switch to the intelligent voice channel mode through the SPI serial port message.
如果采用同本地语音音频信号同样的软降噪模式对通话音频信号进行降噪,会有较多难点和问题。问题一是会导致软降噪算法复杂度和占用空间增加,消耗CPU运行性能,导致可能存在时延。问题二是通话音频信号降噪不同于语音音频信号降噪,由于语音音频信号降噪的算法可以集成在应用层语音助理APP中,而通话音频信号软降噪方案中蓝牙通话算法BTNR由于牵涉底层较深,需要在HAL层集成软降噪库,调用接口库并修改函数进行配置和创建录音线程,最后降录音送入语音助理才能实现软降噪。此种方案会导致需要深度更改安卓HAL层接口,修改风险大,容易发生问题。问题三是因为一般软降噪算法均需要进行授权激活,如果通话音频信号也采用软降噪算法会导致和本地语音音频信号软降噪一样,产线需要进行至少两次通话和语音的算法校验,增加生产产线的复杂度,减慢生产节拍,而且极易出现某些场景下未授权的问题。问题四是通话音频信号软降噪算法和本地语音音频信号软降噪算法一同需要存储在语音资源包中,当语音资源包缺失时,也会直接影响电话录音功能,耦合性较大,而电话功能本身要求等级比语音高,且语音资源包较大且由于拷贝升级等容易出现缺失,导致电话不可用概率升高。If the same soft noise reduction mode as the local voice audio signal is used to reduce noise on the call audio signal, there will be many difficulties and problems. The first problem is that the complexity and space occupied of the soft noise reduction algorithm will increase, which will consume CPU running performance and lead to possible delays. The second problem is that call audio signal noise reduction is different from voice audio signal noise reduction. Since the voice audio signal noise reduction algorithm can be integrated in the application layer voice assistant APP, the Bluetooth call algorithm BTNR in the call audio signal soft noise reduction solution involves the underlying layer. Deeper, it is necessary to integrate the soft noise reduction library in the HAL layer, call the interface library and modify the function to configure and create a recording thread, and finally reduce the recording and send it to the voice assistant to achieve soft noise reduction. This solution will require deep changes to the Android HAL layer interface, which is risky and prone to problems. The third problem is that general soft noise reduction algorithms require authorization activation. If the call audio signal also uses the soft noise reduction algorithm, it will be the same as the local voice audio signal soft noise reduction. The production line needs to perform at least two call and voice algorithm calibrations. It increases the complexity of the production line, slows down the production cycle, and is prone to unauthorized issues in certain scenarios. The fourth problem is that the call audio signal soft noise reduction algorithm and the local voice audio signal soft noise reduction algorithm need to be stored in the voice resource package. When the voice resource package is missing, it will also directly affect the phone recording function. The coupling is large, and the phone The function itself requires a higher level than voice, and the voice resource package is large and prone to defects due to copy upgrades, etc., resulting in an increased probability of phone unavailability.
故本发明实施例4提出音频的噪声处理方法,通话音频信号降噪基于DSP芯片进行硬降噪:用户对麦克风输入音频,DSP对蓝牙/CarPlay电话/ECALL/BCAll等通话音频进行采样后直接由底层DSP芯片内部进行降噪处理,降噪后的录音最后传输到蓝牙APP应用,完成通话识别功能。采用此种方案,通话音频信号降噪无需不占用CPU性能、不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率。Therefore, Embodiment 4 of the present invention proposes an audio noise processing method. The call audio signal noise reduction is based on the DSP chip for hard noise reduction: the user inputs audio to the microphone, and the DSP samples the call audio such as Bluetooth/CarPlay phone/ECALL/BCAll and then directly The underlying DSP chip performs noise reduction processing internally, and the noise-reduced recording is finally transmitted to the Bluetooth APP application to complete the call recognition function. Using this solution, call audio signal noise reduction does not require CPU performance, does not need to modify the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package from Bluetooth phones and reduces the probability of call failures.
第二方面。The second aspect.
请参阅图4,本发明实施例5提供一种音频的噪声处理系统,包括:Please refer to Figure 4. Embodiment 5 of the present invention provides an audio noise processing system, including:
音频信号获取及音源判断模块100,音源获取麦克风端的音频信号,并确定所述音频信号的音源。The audio signal acquisition and audio source determination module 100 acquires the audio signal from the microphone and determines the audio source of the audio signal.
需要说明的是,所述麦克风端的音频信号为用户对麦克风端的输入信号,DSP芯片采集到麦克风端的音频信号后,由车载系统对所述音频信号的音源进行判定。It should be noted that the audio signal at the microphone end is the user's input signal to the microphone end. After the DSP chip collects the audio signal at the microphone end, the vehicle system determines the source of the audio signal.
可以理解的是,由于不同的音频信号的音频通道不同,所以确定所述音频信号的音源可以为:通过所述音频信号的音源流通通道对所述音频信号的音源进行判断。由于不同的音频信号的频域不同,所以确定所述音频信号的音源还可以为根据所述音频信号的频域对所述音频信号的音源进行判断。It can be understood that since different audio signals have different audio channels, determining the sound source of the audio signal may be: judging the sound source of the audio signal through the sound source circulation channel of the audio signal. Since different audio signals have different frequency domains, determining the sound source of the audio signal may also include determining the sound source of the audio signal based on the frequency domain of the audio signal.
优选地,所述确定所述音频信号的音源,具体地:Preferably, the sound source of the audio signal is determined, specifically:
通过所述音频信号的音源流通通道对所述音频信号的音源进行判断。The sound source of the audio signal is determined through the sound source circulation channel of the audio signal.
通话音频信号处理模块200,用于当所述音频信号的音源为通话音频信号时,通过DSP降噪芯片对所述通话音频信号进行降噪处理,并将降噪处理得到的第一降噪信号经过音频硬件抽象层输入至通话模块,以完成双向通话交互及响应。The call audio signal processing module 200 is used to perform noise reduction processing on the call audio signal through a DSP noise reduction chip when the source of the audio signal is a call audio signal, and use the first noise reduction signal obtained by the noise reduction processing to It is input to the call module through the audio hardware abstraction layer to complete two-way call interaction and response.
需要说明的是,所述通话音频信号包括:蓝牙通话、CarPlay电话、ECALL、BCAll等通话音频信号。所述通过DSP降噪芯片对所述通话音频信号进行降噪处理即为对通话音频信号进行硬降噪处理。It should be noted that the call audio signals include: Bluetooth calls, CarPlay calls, ECALL, BCAll and other call audio signals. The noise reduction processing of the call audio signal through the DSP noise reduction chip is the hard noise reduction processing of the call audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为通话音频信号后,直接由DSP芯片内部进行降噪处理,得到第一降噪信号,将所述第一降噪信号经过音频硬件抽象层传输至通话模块中,以完成双向通话交互及响应。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a call audio signal, the DSP chip directly performs noise reduction processing inside the chip to obtain the first noise reduction signal, and the first noise reduction signal is passed through the audio signal The hardware abstraction layer is transmitted to the call module to complete two-way call interaction and response.
本地语音音频信号处理模块300,用于当所述音频信号的音源为本地语音音频信号时,经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理。The local voice audio signal processing module 300 is used to input the audio signal to the voice assistant application software through the audio hardware abstraction layer when the source of the audio signal is a local voice audio signal, and use the voice reduction function integrated in the voice assistant application software to The noise algorithm performs noise reduction processing and echo cancellation processing on the local voice audio signal.
需要说明的是,本地语音音频信号包括:车载人工智能语音、车载语音识别助手等本地语音音频信号。所述通过集成在语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理即为对本地语音音频信号进行软降噪。It should be noted that local voice audio signals include: vehicle-mounted artificial intelligence voice, vehicle-mounted voice recognition assistant and other local voice audio signals. Performing noise reduction processing and echo cancellation processing on the local voice audio signal through the voice noise reduction algorithm integrated in the voice assistant application software is to perform soft noise reduction on the local voice audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为本地语音音频信号后,将所述本地语音音频信号传输至CPU中,经过音频硬件抽象层输入至语音助理应用软件,并通过集成在所述集成在语音助理应用软件中的语音降噪算法对所述本地语音音频信号进行降噪处理和回声消除处理。It can be understood that after the DSP chip collects the audio signal at the microphone end and the vehicle system determines that it is a local voice audio signal, the local voice audio signal is transmitted to the CPU and input to the voice assistant application software through the audio hardware abstraction layer, and through The voice noise reduction algorithm integrated in the voice assistant application software performs noise reduction processing and echo cancellation processing on the local voice audio signal.
手机互联语音音频信号处理模块400,用于当所述音频信号的音源为手机互联语音音频信号时,通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理。The mobile phone Internet voice audio signal processing module 400 is used to perform noise reduction processing on the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer when the source of the audio signal is a mobile phone Internet voice audio signal.
需要说明的是,手机互联语音音频为语音和音乐混音,手机互联语音音频信号包括:carplay、Carlife等第三方应用程序的手机互联语音音频信号,且所述第三方应用程序未提供降噪功能。所述通过音频硬件抽象层的降噪单元对所述手机互联语音音频信号进行降噪处理即为对手机互联语音音频信号进行软降噪。It should be noted that the mobile phone Internet voice audio is a mix of voice and music. The mobile phone Internet voice audio signal includes: the mobile phone Internet voice audio signal of third-party applications such as carplay and Carlife, and the third-party application does not provide noise reduction function. . The noise reduction processing of the mobile phone Internet voice audio signal through the noise reduction unit of the audio hardware abstraction layer is to perform soft noise reduction on the mobile phone Internet voice audio signal.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为手机互联语音音频信号后,需要在播放音乐时进行语音监听,将所述本地语音音频信号传输至CPU中,并在HAL层通过车载SOC系统对所述手机互联语音音频信号进行降噪处理。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a mobile phone interconnected voice audio signal, it needs to perform voice monitoring when playing music, transmit the local voice audio signal to the CPU, and perform it at the HAL layer The mobile phone Internet voice audio signal is subjected to noise reduction processing through the vehicle-mounted SOC system.
本发明实施例5提供的音频的噪声处理系统,根据音频信号的音源对音频信号进行分离,针对通话音频信号采用DSP降噪芯片进行降噪,针对本地语音音频信号采用语音降噪算法进行降噪,针对手机互联语音音频信号采用硬件抽象层的降噪单元进行降噪,即对不同音源的音频信号采用不同的降噪方法,降低车载音频噪声干扰,提高车载音频质量,降低噪声处理的硬件成本及开发成本。The audio noise processing system provided in Embodiment 5 of the present invention separates the audio signal according to the source of the audio signal, uses a DSP noise reduction chip for call audio signal to reduce noise, and uses a speech noise reduction algorithm for local voice audio signal to reduce noise. , for mobile phone Internet voice audio signals, the noise reduction unit of the hardware abstraction layer is used for noise reduction, that is, different noise reduction methods are used for audio signals from different sources to reduce vehicle audio noise interference, improve vehicle audio quality, and reduce the hardware cost of noise processing. and development costs.
此外,对于通话音频信号的降噪过程不占用CPU性能,不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率;对于通话音频信号的降噪的售后更新无需更换DSP芯片,仅通过软件升级刷新进行售后更新,可随时OTA升级语音降噪算法,减少硬件刷新成本。In addition, the noise reduction process of the call audio signal does not occupy CPU performance, does not need to modify the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package and Bluetooth phone, etc., reducing the probability of call failure; for calls After-sales updates for audio signal noise reduction do not require replacement of the DSP chip. After-sales updates are only performed through software upgrades and refreshes. The voice noise reduction algorithm can be upgraded OTA at any time to reduce hardware refresh costs.
对于本地语音音频信号的降噪过程,由于每个车型由于麦克风布置位置、角度、间距、内饰等差异以及车内发动机、空调、胎噪、路噪等噪音环境不同,为保障良好的语音识别效果,语音唤醒阈值、车内混响信号模型等语音参数随着车型变化需要进行标定调整,本方案提出的语音降噪算法集成在APP层,仅修改APP层参数,可以无需HAL层修改,即可实现各车型快速修改适用,可根据不同的车型,不同应用场景更加灵活调整语音调音参数,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,对于本地语音音频信号的降噪的售后更新并可灵活调整调音参数,无需更新固件,减少刷新成本。For the noise reduction process of local voice audio signals, since each car model has different microphone layout positions, angles, spacing, interior decorations, etc., as well as different noise environments such as engines, air conditioners, tire noise, and road noise in the car, in order to ensure good speech recognition Effect, voice parameters such as voice wake-up threshold and in-vehicle reverberation signal model need to be calibrated and adjusted as the vehicle model changes. The voice noise reduction algorithm proposed in this solution is integrated in the APP layer, and only the APP layer parameters are modified, without the need for HAL layer modification, that is It can be quickly modified and adapted to each car model. The voice tuning parameters can be more flexibly adjusted according to different car models and different application scenarios. It can also reduce the need for separate hardware noise reduction modules, reduce PCB occupation, reduce dependence on hardware, and greatly reduce costs. After-sales updates for noise reduction of local voice audio signals and flexible adjustment of tuning parameters without the need to update firmware, reducing refresh costs.
对于手机互联语音音频信号的降噪过程,由于DSP无法支持同时对语音和音乐混音情况下降噪,故无法支持对Carplay、Carlife等未提供降噪功能的第三方应用程序的手机互联语音音频信号进行降噪,因此,对于手机互联语音音频信号的降噪过程由HAL层的车载SOC系统进行处理,减少PCB的占用,减少对硬件的依赖,大大降低成本。Regarding the noise reduction process of mobile phone Internet voice audio signals, since the DSP cannot support simultaneous noise reduction for voice and music mixing, it cannot support the mobile phone Internet voice audio for third-party applications such as Carplay and Carlife that do not provide noise reduction functions. The signal is denoised. Therefore, the noise reduction process of the mobile phone Internet voice audio signal is processed by the vehicle SOC system at the HAL layer, which reduces PCB occupation, reduces dependence on hardware, and greatly reduces costs.
本发明实施例6提供的音频的噪声处理系统,其模块100-400均与实施例5中的模块100-400相同,不同之处在于:所述通话音频信号处理模块200,还用于执行步骤S210-S230:The modules 100-400 of the audio noise processing system provided in Embodiment 6 of the present invention are all the same as the modules 100-400 in Embodiment 5. The difference is that the call audio signal processing module 200 is also used to perform steps S210-S230:
S210、通过DSP降噪芯片对所述通话音频信号进行增益,得到第一增益信号。S210. Use the DSP noise reduction chip to gain the call audio signal to obtain a first gain signal.
需要说明的是,所述增益即为对信号进行放大倍数。It should be noted that the gain is the signal amplification factor.
S220、对所述第一增益信号进行噪声分离及滤波,得到的单声道音频信号。S220. Perform noise separation and filtering on the first gain signal to obtain a monophonic audio signal.
需要说明的是,所述分离机滤波为对信号进行去噪处理。It should be noted that the separator filtering is used to denoise the signal.
可以理解的是,DSP降噪芯片将通话音频信号进行增益,并将外部环境噪音分离及滤波出来,将噪音消除,从而提高通话质量。It is understandable that the DSP noise reduction chip gains the call audio signal, separates and filters the external environmental noise, and eliminates the noise, thereby improving the call quality.
S230对所述单声道音频信号进行重采样,使所述单声道音频信号的采样率满足所述通话模块的输入信号的采样率,并将重采样后的单声道音频信号作为第一降噪信号。S230 resamples the mono audio signal so that the sampling rate of the mono audio signal meets the sampling rate of the input signal of the call module, and uses the resampled mono audio signal as the first Noise reduction signal.
需要说明的是,重采样即为对信号再次采样,以得到满足所述通话模块的输入信号的采样率的信号。由于DSP降噪芯片输出的信号的采样率与通话模块的输入信号的采样率有差异,因此,需要对信号进行重采样。例如,将DSP降噪芯片输出的采样率为48k的信号,通过重采样使信号的采用率变为通话模块的输入信号的采样率8K或24K或16K中的一种。It should be noted that resampling means resampling the signal to obtain a signal that satisfies the sampling rate of the input signal of the call module. Since the sampling rate of the signal output by the DSP noise reduction chip is different from the sampling rate of the input signal of the call module, the signal needs to be resampled. For example, the sampling rate of the signal output by the DSP noise reduction chip is 48k, and the sampling rate of the signal is changed to one of the sampling rates of 8K, 24K or 16K of the input signal of the call module through resampling.
优选地,所述步骤S230中对所述单声道音频信号进行重采样,具体地:Preferably, the mono audio signal is resampled in step S230, specifically:
通过SOC芯片中内置ADSP芯片对所述单声道音频信号进行硬件重采样。The monophonic audio signal is resampled by hardware through the built-in ADSP chip in the SOC chip.
需要说明的是,采用SOC芯片中内置ADSP芯片进行硬件重采样后,再送到核心算法处理引擎中,而不采用Androidframework层的软件重采样实现,大大减小系统的开销,增加系统效率。It should be noted that the built-in ADSP chip in the SOC chip is used for hardware resampling and then sent to the core algorithm processing engine instead of software resampling implementation at the Android framework layer, which greatly reduces system overhead and increases system efficiency.
本发明实施例6提供的音频的噪声处理系统,通过DSP降噪芯片对所述通话音频信号进行增益、分离、滤波及重采样,使DSP输出的通话音频信号的信号可以完全输入至通话模块,以完成双向通话交互及响应。对于通话音频信号的降噪过程不占用CPU性能,不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率;对于通话音频信号的降噪的售后更新无需更换DSP芯片,仅通过软件升级刷新进行售后更新,可随时OTA升级语音降噪算法,减少硬件刷新成本。The audio noise processing system provided in Embodiment 6 of the present invention uses a DSP noise reduction chip to gain, separate, filter and resample the call audio signal, so that the call audio signal output by the DSP can be completely input to the call module. To complete two-way call interaction and response. The noise reduction process for call audio signals does not occupy CPU performance, does not require changes to the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package from Bluetooth phones, etc., reducing the probability of call failures; for call audio signals After-sales updates for noise reduction do not require replacement of the DSP chip. After-sales updates are only performed through software upgrades and refreshes. The voice noise reduction algorithm can be upgraded OTA at any time to reduce hardware refresh costs.
本发明实施例7提供的音频的噪声处理系统,其模块100-400均与实施例5中的模块100-400相同,不同之处在于:所述本地语音音频信号处理模块300,还用于执行步骤S310-S3330:The modules 100-400 of the audio noise processing system provided in Embodiment 7 of the present invention are all the same as the modules 100-400 in Embodiment 5. The difference is that the local voice audio signal processing module 300 is also used to execute Steps S310-S3330:
S310、获取扬声器端的音频信号,作为参考信号。S310. Obtain the audio signal at the speaker end as a reference signal.
需要说明的是,DSP实时采集扬声器端的音频信号,作为参考信号。It should be noted that the DSP collects the audio signal from the speaker in real time as a reference signal.
S320、将所述参考信号与所述语音音频信号进行混音,得到混音音频信号。S320. Mix the reference signal and the speech audio signal to obtain a mixed audio signal.
需要说明的是,DSP将所述参考信号与所述语音音频信号等多路信号通过ADC采集和转化后,并通过TDM时分复用模式进行混音,得到一路信号后传输到CPU中,实现多路信号同时传输。It should be noted that the DSP collects and converts the reference signal and the voice audio signal and other multi-channel signals through the ADC, and mixes them through the TDM time division multiplexing mode. After obtaining one channel of signal, it is transmitted to the CPU to realize multiple signals. signals are transmitted simultaneously.
S330、将所述混音音频信号输入至CPU中,并调用所述集成在语音助理应用软件中的语音降噪算法。S330: Input the mixed audio signal into the CPU, and call the voice noise reduction algorithm integrated in the voice assistant application software.
需要说明的是,调用SOC中集成在语音助理应用软件中的语音降噪算法,实现对语音音频信号的降噪和回声消除,保障良好的识别效。所述语音降噪算法的接口函数如表1:It should be noted that the voice noise reduction algorithm integrated in the voice assistant application software in the SOC is called to achieve noise reduction and echo elimination of the voice audio signal to ensure good recognition effect. The interface functions of the speech noise reduction algorithm are as shown in Table 1:
表1Table 1
可以理解的是,对语音音频信号的降噪和回声消除的语音降噪算法集成在语音助理应用软件中,可以无需HAL层修改,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,并可灵活调整调音参数,无需更新固件,减少刷新成本。It can be understood that the voice noise reduction algorithm for voice audio signal noise reduction and echo cancellation is integrated in the voice assistant application software, which does not require HAL layer modification, and can reduce the need for separate hardware noise reduction modules, reduce PCB occupation, and reduce Reliance on hardware greatly reduces costs, and tuning parameters can be flexibly adjusted without the need to update firmware, reducing refresh costs.
本发明实施例7提供的音频的噪声处理系统,对于本地语音音频信号的降噪过程,由于每个车型由于麦克风布置位置、角度、间距、内饰等差异以及车内发动机、空调、胎噪、路噪等噪音环境不同,为保障良好的语音识别效果,语音唤醒阈值、车内混响信号模型等语音参数随着车型变化需要进行标定调整,通过将多路音频进行混音后,由集成在语音助理应用软件中的语音降噪算法对语音音频信号进行降噪,由于语音降噪算法集成在APP层,仅修改APP层参数,可以无需HAL层修改,即可实现各车型快速修改适用,可根据不同的车型,不同应用场景更加灵活调整语音调音参数,且可以减少单独的硬件降噪模块,减少PCB的占用,减少对硬件的依赖,大大降低成本,对于本地语音音频信号的降噪的售后更新并可灵活调整调音参数,无需更新固件,减少刷新成本。The audio noise processing system provided in Embodiment 7 of the present invention, for the noise reduction process of local voice audio signals, due to the differences in the microphone arrangement position, angle, spacing, interior decoration, etc. of each vehicle model, as well as the in-car engine, air conditioner, tire noise, Noise environments such as road noise are different. In order to ensure good speech recognition effects, speech parameters such as voice wake-up threshold and in-car reverberation signal model need to be calibrated and adjusted as the car model changes. By mixing multi-channel audio, it is integrated into The voice noise reduction algorithm in the voice assistant application software de-noises the voice audio signal. Since the voice noise reduction algorithm is integrated in the APP layer, only the APP layer parameters are modified. It can be quickly modified and adapted to each car model without the need for HAL layer modification. According to different car models and different application scenarios, the voice tuning parameters can be more flexibly adjusted, and separate hardware noise reduction modules can be reduced, reducing PCB occupation, reducing dependence on hardware, and greatly reducing costs. For the noise reduction of local voice audio signals After-sales updates allow for flexible adjustment of tuning parameters without the need to update firmware, reducing refresh costs.
本发明实施例8提供的音频的噪声处理系统,其模块100-400均与实施例5中的模块100-400相同,不同之处在于:所述通话音频信号处理模块200,还用于:The modules 100-400 of the audio noise processing system provided in Embodiment 8 of the present invention are all the same as the modules 100-400 in Embodiment 5. The difference is that the call audio signal processing module 200 is also used to:
将音频源程序设置为通话模式,并通过SPI串口通信协议将DSP降噪芯片切换为通话通道模式。Set the audio source program to call mode, and switch the DSP noise reduction chip to call channel mode through the SPI serial communication protocol.
需要说明的是,SPI串口通信协议是一种同步串行接口技术,是一种高速的,全双工,同步的通信总线,具有支持全双工通信、通信简单、数据传输速率快等优点。It should be noted that the SPI serial communication protocol is a synchronous serial interface technology. It is a high-speed, full-duplex, synchronous communication bus. It has the advantages of supporting full-duplex communication, simple communication, and fast data transmission rate.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为通话音频信号后,将音频源程序设置为通话模式,并通过SPI串口消息通知DSP切换到通话通道模式。It can be understood that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a call audio signal, it sets the audio source program to call mode and notifies the DSP to switch to call channel mode through an SPI serial port message.
所述本地语音音频信号处理模块300,还用于:The local voice audio signal processing module 300 is also used to:
将音频源程序设置为智能语音模式,并将DSP降噪芯片切换为智能语音通道模式。Set the audio source program to intelligent voice mode, and switch the DSP noise reduction chip to intelligent voice channel mode.
可以理解的是,DSP芯片采集到麦克风端的音频信号且车载系统判断为本地语音音频信号后,将音频源程序设置为智能语音模式,并通过SPI串口消息通知DSP切换到智能语音通道模式。It is understandable that after the DSP chip collects the audio signal from the microphone and the vehicle system determines that it is a local voice audio signal, it sets the audio source program to the intelligent voice mode and notifies the DSP to switch to the intelligent voice channel mode through the SPI serial port message.
如果采用同本地语音音频信号同样的软降噪模式对通话音频信号进行降噪,会有较多难点和问题。问题一是会导致软降噪算法复杂度和占用空间增加,消耗CPU运行性能,导致可能存在时延。问题二是通话音频信号降噪不同于语音音频信号降噪,由于语音音频信号降噪的算法可以集成在应用层语音助理APP中,而通话音频信号软降噪方案中蓝牙通话算法BTNR由于牵涉底层较深,需要在HAL层集成软降噪库,调用接口库并修改函数进行配置和创建录音线程,最后降录音送入语音助理才能实现软降噪。此种方案会导致需要深度更改安卓HAL层接口,修改风险大,容易发生问题。问题三是因为一般软降噪算法均需要进行授权激活,如果通话音频信号也采用软降噪算法会导致和本地语音音频信号软降噪一样,产线需要进行至少两次通话和语音的算法校验,增加生产产线的复杂度,减慢生产节拍,而且极易出现某些场景下未授权的问题。问题四是通话音频信号软降噪算法和本地语音音频信号软降噪算法一同需要存储在语音资源包中,当语音资源包缺失时,也会直接影响电话录音功能,耦合性较大,而电话功能本身要求等级比语音高,且语音资源包较大且由于拷贝升级等容易出现缺失,导致电话不可用概率升高。If the same soft noise reduction mode as the local voice audio signal is used to reduce noise on the call audio signal, there will be many difficulties and problems. The first problem is that the complexity and space occupied of the soft noise reduction algorithm will increase, which will consume CPU running performance and lead to possible delays. The second problem is that call audio signal noise reduction is different from voice audio signal noise reduction. Since the voice audio signal noise reduction algorithm can be integrated in the application layer voice assistant APP, the Bluetooth call algorithm BTNR in the call audio signal soft noise reduction solution involves the underlying layer. Deeper, it is necessary to integrate the soft noise reduction library in the HAL layer, call the interface library and modify the function to configure and create a recording thread, and finally reduce the recording and send it to the voice assistant to achieve soft noise reduction. This solution will require deep changes to the Android HAL layer interface, which is risky and prone to problems. The third problem is that general soft noise reduction algorithms require authorization activation. If the call audio signal also uses the soft noise reduction algorithm, it will be the same as the local voice audio signal soft noise reduction. The production line needs to perform at least two call and voice algorithm calibrations. It increases the complexity of the production line, slows down the production pace, and is prone to unauthorized problems in certain scenarios. The fourth problem is that the call audio signal soft noise reduction algorithm and the local voice audio signal soft noise reduction algorithm need to be stored in the voice resource package. When the voice resource package is missing, it will also directly affect the phone recording function. The coupling is large, and the phone The function itself requires a higher level than voice, and the voice resource package is large and prone to defects due to copy upgrades, etc., resulting in an increased probability of phone unavailability.
故本发明实施例8提出音频的噪声处理系统,通话音频信号降噪基于DSP芯片进行硬降噪:用户对麦克风输入音频,DSP对蓝牙/CarPlay电话/ECALL/BCAll等通话音频进行采样后直接由底层DSP芯片内部进行降噪处理,降噪后的录音最后传输到蓝牙APP应用,完成通话识别功能。采用此种方案,通话音频信号降噪无需不占用CPU性能、不需要改动HAL层代码,且不需要授权或激活影响产线,将语音资源包和蓝牙电话等解耦,减少通话故障概率。Therefore, Embodiment 8 of the present invention proposes an audio noise processing system. The call audio signal noise reduction is based on the DSP chip for hard noise reduction: the user inputs audio to the microphone, and the DSP samples the call audio such as Bluetooth/CarPlay phone/ECALL/BCAll and then directly The underlying DSP chip performs noise reduction processing internally, and the noise-reduced recording is finally transmitted to the Bluetooth APP application to complete the call recognition function. Using this solution, call audio signal noise reduction does not require CPU performance, does not need to modify the HAL layer code, and does not require authorization or activation to affect the production line. It decouples the voice resource package from Bluetooth phones and reduces the probability of call failures.
第三方面。The third aspect.
本发明提供了一种电子设备,该电子设备包括:The invention provides an electronic device, which includes:
处理器、存储器和总线;Processors, memories, and buses;
所述总线,用于连接所述处理器和所述存储器;The bus is used to connect the processor and the memory;
所述存储器,用于存储操作指令;The memory is used to store operating instructions;
所述处理器,用于通过调用所述操作指令,可执行指令使处理器执行如本申请的第一方面所示的一种音频的噪声处理方法对应的操作。The processor is configured to, by invoking the operation instructions, have executable instructions that cause the processor to perform operations corresponding to the audio noise processing method shown in the first aspect of this application.
在一个可选实施例中提供了一种电子设备,如图5所示,图5所示的电子设备5000包括:处理器5001和存储器5003。其中,处理器5001和存储器5003相连,如通过总线5002相连。可选地,电子设备5000还可以包括收发器5004。需要说明的是,实际应用中收发器5004不限于一个,该电子设备5000的结构并不构成对本申请实施例的限定。In an optional embodiment, an electronic device is provided, as shown in Figure 5. The electronic device 5000 shown in Figure 5 includes: a processor 5001 and a memory 5003. Among them, the processor 5001 and the memory 5003 are connected, such as through a bus 5002. Optionally, electronic device 5000 may also include a transceiver 5004. It should be noted that in practical applications, the number of transceivers 5004 is not limited to one, and the structure of the electronic device 5000 does not limit the embodiments of the present application.
处理器5001可以是CPU,通用处理器,DSP,ASIC,FPGA或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。处理器5001也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等。The processor 5001 may be a CPU, a general-purpose processor, a DSP, an ASIC, an FPGA, or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. It may implement or execute the various illustrative logical blocks, modules, and circuits described in connection with this disclosure. The processor 5001 may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, etc.
总线5002可包括一通路,在上述组件之间传送信息。总线5002可以是PCI总线或EISA总线等。总线5002可以分为地址总线、数据总线、控制总线等。为便于表示,图5中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。Bus 5002 may include a path that carries information between the above-mentioned components. The bus 5002 may be a PCI bus or an EISA bus, etc. The bus 5002 can be divided into an address bus, a data bus, a control bus, etc. For ease of presentation, only one thick line is used in Figure 5, but it does not mean that there is only one bus or one type of bus.
存储器5003可以是ROM或可存储静态信息和指令的其他类型的静态存储设备,RAM或者可存储信息和指令的其他类型的动态存储设备,也可以是EEPROM、CD-ROM或其他光盘存储、光碟存储(包括压缩光碟、激光碟、光碟、数字通用光碟、蓝光光碟等)、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质,但不限于此。The memory 5003 can be ROM or other types of static storage devices that can store static information and instructions, RAM or other types of dynamic storage devices that can store information and instructions, or it can be EEPROM, CD-ROM or other optical disk storage, optical disk storage (including compressed optical discs, laser discs, optical discs, digital versatile discs, Blu-ray discs, etc.), disk storage media or other magnetic storage devices, or can be used to carry or store desired program code in the form of instructions or data structures and can be used by a computer Any other medium for access, but not limited to this.
存储器5003用于存储执行本申请方案的应用程序代码,并由处理器5001来控制执行。处理器5001用于执行存储器5003中存储的应用程序代码,以实现前述任一方法实施例所示的内容。The memory 5003 is used to store application program code for executing the solution of the present application, and is controlled by the processor 5001 for execution. The processor 5001 is configured to execute the application program code stored in the memory 5003 to implement the content shown in any of the foregoing method embodiments.
其中,电子设备包括但不限于:移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。Among them, electronic devices include but are not limited to: mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PAD (tablet computers), PMP (portable multimedia players), vehicle-mounted terminals (such as vehicle-mounted navigation terminals), etc. mobile terminals such as digital TVs, desktop computers, etc.
第四方面。The fourth aspect.
本发明提供了一种计算机可读存储介质,计算机可读存储介质上存储有计算机程序,该程序被处理器执行时实现本申请第一方面所示的一种音频的噪声处理方法。The present invention provides a computer-readable storage medium. A computer program is stored on the computer-readable storage medium. When the program is executed by a processor, the audio noise processing method shown in the first aspect of the application is implemented.
本申请的又一实施例提供了一种计算机可读存储介质,该计算机可读存储介质上存储有计算机程序,当其在计算机上运行时,使得计算机可以执行前述方法实施例中相应内容。Another embodiment of the present application provides a computer-readable storage medium, which stores a computer program. When run on a computer, the computer can execute the corresponding content in the foregoing method embodiment.
Claims (9)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111540141.9A CN115223582B (en) | 2021-12-16 | 2021-12-16 | Audio noise processing method, system, electronic device and medium |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111540141.9A CN115223582B (en) | 2021-12-16 | 2021-12-16 | Audio noise processing method, system, electronic device and medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN115223582A CN115223582A (en) | 2022-10-21 |
| CN115223582B true CN115223582B (en) | 2024-01-30 |
Family
ID=83606003
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202111540141.9A Active CN115223582B (en) | 2021-12-16 | 2021-12-16 | Audio noise processing method, system, electronic device and medium |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN115223582B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115966217B (en) * | 2023-02-10 | 2025-09-16 | 思必驰科技股份有限公司 | Audio processing device and method |
| CN116142101B (en) * | 2023-02-21 | 2026-01-02 | 奇瑞汽车股份有限公司 | Vehicle entertainment system, method, vehicle and storage medium |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104575510A (en) * | 2015-02-04 | 2015-04-29 | 深圳酷派技术有限公司 | Noise reduction method, noise reduction device and terminal |
| CN110097891A (en) * | 2019-04-22 | 2019-08-06 | 广州视源电子科技股份有限公司 | Microphone signal processing method, device, equipment and storage medium |
| CN110769111A (en) * | 2019-10-28 | 2020-02-07 | 珠海格力电器股份有限公司 | Noise reduction method, system, storage medium and terminal |
| CN111683319A (en) * | 2020-06-08 | 2020-09-18 | 北京爱德发科技有限公司 | Call pickup noise reduction method, earphone and storage medium |
| CN112235674A (en) * | 2020-09-24 | 2021-01-15 | 海菲曼(天津)科技有限公司 | Active noise reduction processing method, system and chip based on noise analysis |
| CN113096677A (en) * | 2021-03-31 | 2021-07-09 | 深圳市睿耳电子有限公司 | Intelligent noise reduction method and related equipment |
| CN113163152A (en) * | 2021-03-30 | 2021-07-23 | 九音(南京)集成电路技术有限公司 | Multi-dimensional sound pickup and noise reduction method, video teleconference system and computer storage medium |
| GB202112256D0 (en) * | 2020-08-31 | 2021-10-13 | Parker Tom Richard | Systems and methods for moderating noise levels in a communication session |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6937980B2 (en) * | 2001-10-02 | 2005-08-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech recognition using microphone antenna array |
| JP2009539306A (en) * | 2006-05-31 | 2009-11-12 | アギア システムズ インコーポレーテッド | Noise reduction with mobile communication devices in non-call situations |
| FR2992459B1 (en) * | 2012-06-26 | 2014-08-15 | Parrot | METHOD FOR DEBRUCTING AN ACOUSTIC SIGNAL FOR A MULTI-MICROPHONE AUDIO DEVICE OPERATING IN A NOISE MEDIUM |
| KR102502601B1 (en) * | 2015-11-27 | 2023-02-23 | 삼성전자주식회사 | Electronic device and controlling voice signal method |
| KR101842777B1 (en) * | 2016-07-26 | 2018-03-27 | 라인 가부시키가이샤 | Method and system for audio quality enhancement |
| US10354673B2 (en) * | 2018-01-24 | 2019-07-16 | Hisense Mobile Communications Technology Co., Ltd. | Noise reduction method and electronic device |
-
2021
- 2021-12-16 CN CN202111540141.9A patent/CN115223582B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104575510A (en) * | 2015-02-04 | 2015-04-29 | 深圳酷派技术有限公司 | Noise reduction method, noise reduction device and terminal |
| CN110097891A (en) * | 2019-04-22 | 2019-08-06 | 广州视源电子科技股份有限公司 | Microphone signal processing method, device, equipment and storage medium |
| CN110769111A (en) * | 2019-10-28 | 2020-02-07 | 珠海格力电器股份有限公司 | Noise reduction method, system, storage medium and terminal |
| CN111683319A (en) * | 2020-06-08 | 2020-09-18 | 北京爱德发科技有限公司 | Call pickup noise reduction method, earphone and storage medium |
| GB202112256D0 (en) * | 2020-08-31 | 2021-10-13 | Parker Tom Richard | Systems and methods for moderating noise levels in a communication session |
| CN112235674A (en) * | 2020-09-24 | 2021-01-15 | 海菲曼(天津)科技有限公司 | Active noise reduction processing method, system and chip based on noise analysis |
| CN113163152A (en) * | 2021-03-30 | 2021-07-23 | 九音(南京)集成电路技术有限公司 | Multi-dimensional sound pickup and noise reduction method, video teleconference system and computer storage medium |
| CN113096677A (en) * | 2021-03-31 | 2021-07-09 | 深圳市睿耳电子有限公司 | Intelligent noise reduction method and related equipment |
Also Published As
| Publication number | Publication date |
|---|---|
| CN115223582A (en) | 2022-10-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7085710B1 (en) | Vehicle computer system audio entertainment system | |
| US8019454B2 (en) | Audio processing system | |
| CN115223582B (en) | Audio noise processing method, system, electronic device and medium | |
| CN114724566B (en) | Voice processing method, device, storage medium and electronic device | |
| CN113689062B (en) | Agent coordination device, agent coordination method, and recording medium having agent coordination program recorded thereon | |
| CN109273006A (en) | Sound control method, onboard system, vehicle and the storage medium of onboard system | |
| CN112542183A (en) | Audio data processing method, device, equipment and storage medium | |
| CN114420146A (en) | Audio data processing method and device, electronic equipment and storage medium | |
| CN116709112B (en) | Audio data processing method, system, data processing device and storage medium | |
| CN115079993A (en) | A cross-system audio playback control method, device, vehicle and storage medium | |
| CN113840034A (en) | Sound signal processing method and terminal device | |
| CN103533672B (en) | A wireless karaoke system and its internal data interaction method | |
| CN114882721B (en) | Vehicle navigation information playing method and device, electronic equipment and storage medium | |
| CN118828312A (en) | Method, system, device and storage medium for controlling spatial sound effects of vehicle-mounted amplifier equipment | |
| CN116471517A (en) | Audio mixing method and device for car cockpit, car and storage medium | |
| CN111666062A (en) | Audio processing device and method and vehicle-mounted information entertainment system | |
| CN115426585A (en) | Sound alarm control method and system for automobile cabin | |
| CN114827797A (en) | In-vehicle multi-audio playing method and device based on earphone and storage medium | |
| CN116841950A (en) | Audio data transmission method, device, chip and computer-readable storage medium | |
| CN115373623A (en) | Audio processing method, audio playing control device, mobile device and vehicle | |
| US20250104686A1 (en) | Information processing device | |
| CN120462296A (en) | Audio processing method and system on chip based on MTK vehicle platform | |
| CN114827834A (en) | In-vehicle multi-audio playing method and device based on earphone and storage medium | |
| CN114827835A (en) | In-vehicle multi-audio playing method and device based on earphone and storage medium | |
| CN120935494A (en) | Sound field control method, electronic equipment, vehicle, medium and product |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |