CN110444222B

CN110444222B - Voice noise reduction method based on information entropy weighting

Info

Publication number: CN110444222B
Application number: CN201910413996.1A
Authority: CN
Inventors: 杨静; 钟森; 刘辉辉; 陈福; 陈玮; 张琴; 李海军; 孔薇
Original assignee: Chengdu Aerospace Communication Equipment Co ltd
Current assignee: Chengdu Aerospace Communication Equipment Co ltd
Priority date: 2019-05-17
Filing date: 2019-05-17
Publication date: 2021-12-14
Anticipated expiration: 2039-05-17
Also published as: CN110444222A

Abstract

The invention discloses a voice noise reduction method based on information entropy weighting, which comprises the following steps: step one, demodulating a received signal and then performing framing processing; step two, calculating a power spectrum by using continuous M frames of signals; extracting components in a voice frequency range in the power spectrum and calculating the normalized power spectral density of the components; fourthly, performing spectral entropy estimation on all frequency components in the continuous M frame signals; step five, calculating a voice attribute value; and step six, weighting the voice signals by using the voice attribute values. Compared with the prior art, the invention has the following positive effects: the invention distinguishes whether the received signal is voice or noise by using the characteristic that the spectral entropy of the voice signal and the noise signal has obvious difference, and realizes the amplification of the voice signal and the suppression of the noise by weighting the received signal by using the spectral entropy value.

Description

Voice noise reduction method based on information entropy weighting

Technical Field

The invention relates to a voice noise reduction method based on information entropy weighting.

Background

During transmission of signals, noise and interference are important factors affecting the signals. In order to effectively retain the useful signal, remove the background noise and the interference generated in the transmission, and make the receiving party have good listening feeling, it is necessary to perform voice noise reduction processing on the signal at the receiving end. Conventional voice stations typically employ spectral subtraction for noise reduction: and subtracting the estimation value of the noise power spectrum from the voice power spectrum containing the noise, and inversely transforming the residual signal energy back to the time domain to obtain the voice signal after noise reduction. The method is equivalent to that certain equalization processing is carried out on the signal with noise in the frequency domain, amplitude information of specific frequency points is lost, and due to random distribution of noise, the suppression of the noise is not thorough; under the condition of low signal-to-noise ratio, the spectral subtraction method can cause excessive suppression to voice signals, and the subsequent processing and communication quality are affected.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention provides a voice noise reduction method based on information entropy weighting, which not only provides a calculation method for demodulating the spectral entropy of a voice signal, but also further provides a noise suppression method for spectral entropy weighting.

The technical scheme adopted by the invention is as follows: a voice noise reduction method based on information entropy weighting comprises the following steps:

step one, demodulating a received signal and then performing framing processing;

step two, calculating a power spectrum by using continuous M frames of signals;

extracting components in a voice frequency range in the power spectrum and calculating the normalized power spectral density of the components;

performing spectral entropy estimation on all frequency components in the continuous M frame signals;

step five, calculating a voice attribute value;

and step six, weighting the voice signals by using the voice attribute values.

Compared with the prior art, the invention has the following positive effects:

the invention distinguishes whether the received signal is voice or noise by using the characteristic that the spectral entropy of the voice signal and the noise signal has obvious difference, and realizes the amplification of the voice signal and the suppression of the noise by weighting the received signal by using the spectral entropy value.

Drawings

The invention will now be described, by way of example, with reference to the accompanying drawings, in which:

FIG. 1 is a signal processing flow for speech noise reduction based on information entropy weighting;

FIG. 2 is a simulation result of voice noise reduction using entropy weighting.

Detailed Description

The invention provides a voice noise reduction method based on spectral entropy weighting, wherein the spectral entropy refers to Shannon information entropy of a baseband signal power spectrum of a power station. The voice noise reduction method based on spectral entropy weighting distinguishes whether a received signal is voice or noise by using the characteristic that the size of the spectral entropy of a voice signal is obviously different from that of a noise signal, and realizes the amplification of the voice signal and the suppression of the noise by weighting the received signal by using the value of the spectral entropy.

The received demodulated digital signal x (n), n ═ 1,2, …, uses the way of speech frame-dividing process, the time length of each frame signal is adapted to the time length of one syllable of speech signal, the time length of one syllable is usually between 3ms and 30ms, here, the concrete speech frame length can be selected in this range according to the need. Assuming that the number of samples of one frame signal is N, the k-th frame signal can be expressed as

x_k＝[x((k-1)N+1),x((k-1)N+2),…,x(kN)]^T,k＝1,2,…

Fourier transform is carried out by utilizing signals of two adjacent frames of the k and the k +1 to obtain

Fourier transform matrix writeable

Wherein N ═ 0,1, 2.. 2N-1]^T. The power spectrum of the received signal is calculated using the fourier transform of the successive frame signals as follows:

wherein "o" denotes multiplication of corresponding elements of the vector. Since the voice frequency is rarely higher than 3.5KHz, the frequency component power spectrum lower than 3.5KHz can be selected

To calculate the spectral entropy of the speech signal portion, assuming

The sample length is L. For the influence of the signal power magnitude on the signal spectrum entropy calculation, the spectrum entropy is calculated by using a normalized power spectrum density which can be expressed as

Here, the

Is a column vector of L elements, the L-th element of which is represented as

The spectral entropy of the k frame signal is estimated as

Assuming that the spectral entropy decision threshold is η, the voice attribute variable F of the frame signal can be decided as follows_k，

F_k1 means that the k-th frame signal has a speech feature, F_k< 1 indicates that the signal of the k-th frame does not have voice feature, F_kAs the weighting coefficient of the output audio signal, the noise suppression capability of the output signal can be improved.

As shown in fig. 1, the method of the present invention comprises the steps of:

1) demodulating the received signal and then performing framing processing;

2) calculating a power spectrum using the successive M frames of signals;

3) extracting the components in the voice frequency range in the power spectrum and calculating the normalized power spectral density of the components;

4) calculating spectral entropy by using the calculated power spectral density, and taking the spectral entropy as a spectral entropy estimation result of a first frame signal in the continuous M frames of signals;

5) calculating a voice attribute value F_k；

6) Weighting the voice signal with the voice attributes.

The following are the implementation examples and simulation results of the method of the present invention:

assuming that an audio signal sampling frequency of 8kHz is adopted, a signal power spectrum is calculated by using a time length of 64ms every 8ms of a frame signal, and voice attribute values of continuous 10 frames of signals are adopted to realize the functions of noise opening and noise closing, and the noise reduction effect is shown in FIG. 2.

Claims

1. A voice noise reduction method based on information entropy weighting is characterized in that: the method comprises the following steps:

step two, calculating a power spectrum by using continuous M frames of signals;

performing spectral entropy estimation on all frequency components in the continuous M frames of signals;

step five, calculating a voice attribute value:

judging whether the requirements are met

If so, then

Otherwise, F_k1 is ═ 1; wherein,

representing the spectral entropy estimation result of the kth frame signal, wherein eta is a spectral entropy judgment threshold;

and step six, weighting the voice signals by using the voice attribute values.

2. A method of speech noise reduction based on information entropy weighting according to claim 1, wherein: step one the method for performing framing processing is as follows: assuming that the received demodulated digital signal is x (N), N is 1,2, …, selecting the duration of each frame signal in a syllable duration range of the speech signal by adopting a speech framing processing mode, assuming that the number of samples of a frame signal is N, and expressing the k-th frame signal as x (N)_k＝[x((k-1)N+1),x((k-1)N+2),…,x(kN)]^T,k＝1,2,…。

3. A method of speech noise reduction based on information entropy weighting according to claim 2, wherein: the duration of the one syllable is 3ms to 30 ms.

4. A method of speech noise reduction based on information entropy weighting according to claim 2, wherein: step two the method for calculating the power spectrum by using the continuous M frame signals comprises the following steps:

1) fourier transform is carried out by utilizing signals of two adjacent frames of the k and the k +1 to obtain

The Fourier transform matrix F in the formula is calculated according to the following formula:

wherein N ═ 0,1, 2.. 2N-1]^T；

2) The power spectrum of the received signal is calculated using the fourier transform of the successive frame signals as follows:

therein

Representing the multiplication of corresponding elements of the vector.

5. A method of speech noise reduction based on information entropy weighting according to claim 4, wherein: step three, the method for calculating the normalized power spectral density comprises the following steps: selecting a frequency component power spectrum with voice frequencies below 3.5KHz

Suppose that

The length of the sample is L, and the normalized power spectral density is calculated according to the following formula:

therein

Is a column vector of L elements, the L-th element of which is

6. A method of speech noise reduction based on information entropy weighting according to claim 5, wherein: the method for estimating the spectral entropy comprises the following steps: will be provided with

The L-th element of

According to the following formulaCalculating to obtain a spectral entropy estimation result of the kth frame signal:

7. a method of speech noise reduction based on information entropy weighting according to claim 1, wherein: step six the method for weighting the voice signal by using the voice attribute value comprises the following steps: when F is present_kWhen 1, the k frame signal has a voice feature; when F is present_k<When 1, the signal of the k frame does not have the voice feature, F_kThe noise suppression is implemented as a weighting factor of the output audio signal.