WO2021190274A1 - Method and device for determining state of echo sound field, storage medium, and terminal - Google Patents
Method and device for determining state of echo sound field, storage medium, and terminal Download PDFInfo
- Publication number
- WO2021190274A1 WO2021190274A1 PCT/CN2021/079181 CN2021079181W WO2021190274A1 WO 2021190274 A1 WO2021190274 A1 WO 2021190274A1 CN 2021079181 W CN2021079181 W CN 2021079181W WO 2021190274 A1 WO2021190274 A1 WO 2021190274A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- determined
- state
- sound field
- echo
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 81
- 230000008859 change Effects 0.000 claims abstract description 82
- 238000001514 detection method Methods 0.000 claims abstract description 38
- 238000012545 processing Methods 0.000 claims description 45
- 230000001629 suppression Effects 0.000 claims description 43
- 230000004913 activation Effects 0.000 claims description 33
- 230000003044 adaptive effect Effects 0.000 claims description 12
- 238000012805 post-processing Methods 0.000 claims description 9
- 230000009977 dual effect Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 230000008054 signal transmission Effects 0.000 description 6
- 238000004891 communication Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000003070 Statistical process control Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
- H04M9/082—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M7/00—Arrangements for interconnection between switching centres
- H04M7/006—Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
Definitions
- the present invention relates to the technical field of acoustic echo cancellation, in particular to a method and device for determining the state of an echo sound field, a storage medium, and a terminal.
- AEC Acoustic Echo Canceler
- the robustness and stability of the corresponding AEC technology are greatly challenged. For example, if the update of adaptive filtering is not controlled in dual-talk and no-speech scenarios, it will face the risk of divergence and misalignment.At the same time, when the echo path changes, if the update speed is not increased, the convergence speed will be too slow, resulting in The residual echo; similarly, in the non-linear or residual echo processing, if the single talk and the dual state are not distinguished, it will often lead to the damage of the effective speech and reduce the performance of the dual talk.
- Double Talk State in the echo sound field state is particularly important.
- Conventional Double Talk Detection (DTD) methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and Detection based on echo path.
- the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high;
- the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear
- the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes.
- the technical problem solved by the present invention is to provide a method and device for determining the state of the echo sound field, a storage medium, and a terminal, which can effectively improve the accuracy of determining the state of the echo path change.
- an embodiment of the present invention provides a method for determining the state of an echo sound field, which includes the following steps: acquiring a signal to be determined; determining the far-end signal X n (k) and the near-end signal D n ( k) and the filter coefficient W n (k); at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k), the filter update degree Cef update is determined At least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update , it is determined whether the echo sound field state of the signal to be determined is the echo path change state.
- the method for determining the echo sound field state further includes: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
- determining the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) includes: according to the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), to determine the updated Filter coefficient W n+1 (k); determine the filter update degree Cef update according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k).
- the method for determining the echo sound field state further includes: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
- the method for determining the echo sound field state further includes: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
- the method for determining the echo sound field state further includes: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
- determining the echo suppression ratio Err of the signal to be determined includes: determining the residual signal according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k). the difference signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
- k is the frequency index of the signal to be determined.
- the echo sound field state determination method further includes: determining the normalized cross-correlation values C YE and C DE ; if C DE Is greater than the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , then it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation threshold The correlation threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
- it further includes one or more of the following: if the filter update degree Cef update is greater than the preset update degree threshold Thrd update , determining that the echo sound field state of the signal to be determined is the echo path change state; If the update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
- M and L are the frequency band indexes of the signal to be determined.
- the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; where M and L are frequency band indexes of the linear region.
- the method for determining the echo sound field state further includes: adjusting the update step size ⁇ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size ⁇ n ( k) is used to indicate the update step size of the filter coefficient W n (k).
- an echo adaptive filter is used to adjust the update step size ⁇ n (k) of the signal to be determined.
- the method for determining the echo sound field state further includes: determining whether to perform non-linear processing on the signal to be determined according to the echo sound field state of the signal to be determined.
- determining whether to perform nonlinear processing on the signal to be determined includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If the echo sound field state of the signal to be determined is the echo path change state, then the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop talking to the Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, then the non-linear processing of the signal to be determined is stopped.
- a post-processing non-linear processing unit is used to perform non-linear processing on the signal to be determined.
- the method for determining the echo sound field state further includes: determining, according to the echo sound field state of the signal to be determined, to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
- determining to reduce the noise update speed or to improve the non-stationary noise suppression capability includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, reducing the signal to be determined Noise update speed; if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reduce the noise update speed of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is a remote single talk state, The non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is an echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
- a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
- the method for determining the state of the echo sound field further includes: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
- the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
- an embodiment of the present invention provides an echo sound field state determination device, which includes: an acquisition module for acquiring a signal to be determined; a signal determination module for determining the far-end signal X n ( k), the near-end signal D n (k) and the filter coefficient W n (k); the update degree determination module is used to determine at least the far-end signal X n (k), the near-end signal D n (k) and the The filter coefficient W n (k) determines the filter update degree Cef update ; the state determination module is used to determine the echo sound field state of the signal to be determined at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update Whether it is the state of echo path change.
- an embodiment of the present invention provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the method for determining the state of the echo sound field when the computer instructions are executed.
- an embodiment of the present invention provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the computer instructions when the computer instructions are run. The steps of the method for determining the state of the echo sound field.
- the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
- the signal to be determined is actually the state of the echo path change.
- the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
- the misjudgment is the dual-talk state.
- the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state, and in the subsequent steps, there is an opportunity to use more parameters to judge more echo sound field states, and more Effectively realize multi-feature detection and improve the completeness of the judgment of the echo sound field state.
- the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is easier to change than in the prior art
- the change state of the echo path is misjudged as the dual-talk state, and the solution of the embodiment of the present invention can further effectively improve the accuracy of the determination of the change state of the echo path.
- the echo sound field state of the signal to be determined is idle.
- the near-end voice activation flag DVflag is not 1, it can be considered that there is no voice at the near end, otherwise it means that the near end There is voice at the end, and the signal to be determined needs to be further judged.
- the echo sound field state of the signal to be determined is the near-end single talk state, and when the far-end voice activation flag XVflag is not 1, it can be considered that there is no signal at the far end, There is no echo signal in the near-end signal, and the current state is the near-end single-talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the signal to be determined is required.
- the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been eliminated by the adaptive filter AF, and the current state It is the far-end single-talk state, otherwise it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgments on the signal to be determined need to be made.
- the normalized cross-correlation value to make the near-end signal and the residual signal component further determines, in the filter converges, the residual data E n (k) corresponding to the de-correlated echo signal, at this time if the C DE is greater than the threshold Thrd1 coh , indicating that the near-end signal contains many components that are not related to echo, but if the filter does not converge, the residual signal will also contain a large amount of echo components. This conclusion is not valid; therefore, C YE is used to further Confirm that if C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal.
- the near-end signal contains components that are not related to echo.
- the current state is dual-talk. Status, otherwise it means that the signal component cannot be determined, and further judgment on the signal to be determined is required.
- the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and judging according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update
- It is the far-end single-talk state which can indicate that the filter is in a fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. Due to convergence or echo path change, the current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it is determined as the remote single talk state.
- the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values in the linear region; where M and L are the frequency band indexes of the linear region, and the accuracy of judgment can be improved by taking the value in the linear region .
- the update step size ⁇ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is the dual-talk state DTS, adjust ⁇ n (k) Slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, ⁇ n (k) takes the normal value without special adjustment; when the signal to be determined is In the idle state IDS or the near-end single talk state NSTS, ⁇ n (k) is taken as 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
- the degree of nonlinear processing can be reduced when the signal to be determined is in the dual-talk state, so that effective speech is not damaged, and dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS, the degree of nonlinear processing can be enhanced , To prevent the leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; when the signal to be determined is far-end No special processing is done in the single-talk state FSTS, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
- the noise update speed can be slowed down to ensure the intelligibility of the effective voice; when the signal to be determined is the far-end single-talk and echo path changes When the non-stationary noise suppression ability is improved, the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked, thereby improving the signal transmission quality .
- Figure 1 is a schematic diagram of the structure of an AEC system in the prior art
- FIG. 2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention
- FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention.
- Figure 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention.
- Fig. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention.
- a typical AEC system includes an adaptive filter AF for linear echo processing and a nonlinear part for residual echo processing.
- Fig. 1 is a schematic structural diagram of an AEC system in the prior art.
- the signal x(n) passes through the speaker (statistical process control, SPK) to obtain the signal h(n). After (MIC), the signal d(n) is output.
- Short-time Fourier transform short-time Fourier transform, or short-term Fourier transform, STFT
- STFT short-time Fourier transform
- the adaptive filter adaptive filters, AF
- the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
- residual signal E n (k) may be nonlinear input processing unit (Non-linear programming, NLP) and post-processing noise suppression unit (Noise suppression, NS).
- NLP nonlinear programming
- NS post-processing noise suppression unit
- the detection of the dual-talk state in the echo sound field state is particularly important.
- Conventional dual-talk detection methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and echo path-based detection.
- the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high;
- the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear
- the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes.
- the accuracy of determining the state of the echo sound field is low, which in turn affects the effect of echo cancellation.
- the inventors of the present invention have discovered through research that the existing methods for determining the state of the echo sound field simply divide the state of the echo sound field into a single talk state (Single Talk State, STS) and a double talk state (Double Talk State, DTS).
- STS Single Talk State
- DTS Double Talk State
- PCS path Change State
- the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
- the signal to be determined is actually the state of the echo path change.
- the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
- the misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
- FIG. 2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention.
- the method for determining the state of the echo sound field includes steps S21 to S24:
- Step S21 Obtain a signal to be determined
- Step S22 Determine the far-end signal, the near-end signal, and filter coefficients of the signal to be determined
- Step S23 Determine the filter update degree at least according to the far-end signal, the near-end signal and the filter coefficient
- Step S24 Determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree being greater than the preset update degree threshold.
- the method can be implemented in the form of a software program that runs on a processor integrated inside a chip or a chip module.
- the to-be-determined signals with different echo sound field states may include different signals, for example, may include the signal obtained after the sound emitted by the speaker of the communication terminal is picked up by the microphone of the terminal, and may also include only the remote Signal.
- the echo cancellation can be achieved more effectively.
- step S22 the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined are determined.
- conventional techniques may be used to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined. For example, short-time Fourier transform is performed on the signal d(n) and signal x(n) shown in FIG. 1 to obtain the near-end signal D n (k) and the far-end signal X n (k). Determine the filter coefficient W n (k) by an appropriate method.
- step S23 the filter update degree Cef update is determined.
- the step of determining the filter update degree Cef update may include: end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), determine the update After the filter coefficient W n+1 (k); according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k), determine the filter update degree Cef update .
- step S24 it may be determined whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
- the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is the echo path change state .
- the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
- the signal to be determined is actually the state of the echo path change.
- the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
- the misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
- the method for determining the echo sound field state may further include: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
- the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is remote single talk state.
- the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is compared with the current state.
- the solution of the embodiment of the present invention can further effectively improve the accuracy of the judgment of the change state of the echo path.
- the method for determining the echo sound field state may further include: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
- the near-end signal D n (k) is subjected to voice activation detection, and the echo sound field state of the signal to be determined is determined to be idle according to the near-end voice activation flag DVflag
- the state step can also be set to be executed after step S24.
- the embodiment of the present invention does not limit the sequence of the step of judging the near-end voice activation flag DVflag and the step S24.
- the echo sound field state of the signal to be determined is idle.
- the near-end voice activation flag DVflag is not 1, it can be considered that the near-end has no Voice, otherwise it means that there is voice at the near end, and further judgment on the to-be-determined signal is needed.
- the method for determining the echo sound field state may further include: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
- the voice activation detection is performed on the far-end signal X n (k), and the echo sound field state of the signal to be determined is judged to be close according to the far-end voice activation flag XVflag.
- the step of the single-talk state can also be set to be executed after step S24.
- the embodiment of the present invention does not limit the sequence of the step of determining the remote voice activation flag XVflag and the step S24.
- voice activation detection technology can adopt well-known technologies, such as energy detection, zero-crossing rate detection, spectral entropy detection, pitch detection, etc., which are not specifically limited in the embodiment of the present invention.
- the echo sound field state of the signal to be determined is the near-end single talk state, and it can be considered that when the far-end voice activation flag XVflag is not 1, There is no signal at the far end, no echo signal in the near-end signal, and the current state is the near-end single talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the to-be-determined signal is required.
- the method for determining the echo sound field state may further include: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
- the step of determining the echo suppression ratio Err of the signal to be determined and determining that the echo sound field state of the signal to be determined is the remote single talk state can also be set in step S24. Execute afterwards.
- the embodiment of the present invention does not limit the sequence of the step of determining the echo suppression ratio Err of the signal to be determined and the step S24.
- the step of determining the echo suppression ratio Err of the signal to be determined may include: according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) determining residual signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
- k is the frequency index of the signal to be determined.
- the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been determined by the adaptive filter. AF is eliminated, and the current state is the far-end single-talk state. Otherwise, it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgment on the signal to be determined is required.
- the threshold Thrd err reference value may be 12 to 20 dB.
- the echo sound field state determination method may further include: determining the normalized cross-correlation values C YE and C DE ; if C DE is greater than If the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation The threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
- M and L are the frequency band indexes of the signal to be determined.
- the residual signal of the near-end signal component further determined by normalizing the cross-correlation value, at the convergence of the filter, the residual data E n (k) corresponding to the echo signal is decorrelated
- C DE is greater than the threshold Thrd1 coh , it means that the near-end signal contains many components that are not related to echo.
- the filter does not converge, the residual signal will also contain a large amount of echo components, and this conclusion is not valid; C YE is used for further confirmation. If C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal.
- the near-end signal contains components that are not related to echo.
- the current state is a dual-talk state, otherwise it means that the signal component cannot be determined, and the signal to be determined needs to be further judged.
- the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; wherein M and L are frequency band indexes of the linear region.
- the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values of the linear region; where M and L are the frequency band indexes of the linear region.
- M and L as the frequency band index corresponding to the linear region, since the nonlinear distortion of the device has harmonic characteristics and is often distributed in the middle and high frequencies, the present invention gives the reference frequency range, and M corresponds to the low frequency band in 100 ⁇ In the 300Hz interval, L corresponds to the high frequency band in the 2500 ⁇ 3000Hz interval. This range is only a reference value, and the actual use is not limited by this.
- the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the echo path change state; if the filter update degree Cef update is less than or equal to the The preset update threshold Thrd update determines that the echo sound field state of the signal to be determined is the remote single talk state.
- the step of judging that the filter update degree Cef update is greater than the preset update degree threshold Thrd update may be set after judging the dual-talk state.
- the echo suppression ratio Err is the relative cancellation amount of the echo signal, which avoids the influence of the echo signal strength; the normalized cross-correlation quantities C YE and C DE are normalized and have nothing to do with the signal strength of the far and near ends.
- the linear region calculation is used to reduce the influence of device distortion; the filter update degree Cef update uses a certain degree of robustness of the AF itself to reflect the change intensity of the echo path. Therefore, the comprehensive use of these features can effectively solve the influence of uncertain factors such as echo signal strength changes, far and near-end signal strength changes, device distortion, and echo path changes on the detection accuracy.
- the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and according to the filter update degree Cef update being less than or equal to the preset update degree Threshold Thrd update , judged as the far-end single-talk state, can indicate that the filter is in the fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. The update can only be caused by non-convergence or echo path change.
- the current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it can be determined as the remote single talk state .
- the reference value of Thrd1 coh may be 0.3 to 0.5, and the reference value of Thrd2 coh may be 0.1 to 0.3.
- FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention.
- the another method for determining the state of the echo sound field may include step S301 to step S311, and each step will be described below.
- step S301 it is judged whether DVflag is equal to 1; when the judgment result is yes, step S302 can be executed; otherwise, step S303 can be executed.
- step S302 it is judged whether XVflag is equal to 1; when the judgment result is yes, step S304 can be executed; otherwise, step S305 can be executed.
- step S303 it is determined that the state of the echo sound field is the idle state (IDS).
- step S304 it is judged whether Err is greater than Thrd err ; when the judgment result is yes, step S306 can be executed; otherwise, step S307 can be executed.
- step S305 it is determined that the state of the echo sound field is the near-end single talk state (NSTS).
- NSTS near-end single talk state
- step S306 it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
- FSTS far-end single talk state
- step S307 it is judged whether C DE is greater than Thrd1 coh and C YE is less than Thrd2 coh ; when the judgment result is yes, step S308 can be executed; otherwise, step S309 can be executed.
- step S308 it is determined that the state of the echo sound field is a dual talk state (DTS).
- DTS dual talk state
- step S309 it is determined that Cef update is greater than Thrd update ; when the determination result is yes, step S310 can be executed; otherwise, step S311 can be executed.
- step S310 it is determined that the echo sound field state is the echo path change state (PCS).
- PCS echo path change state
- step S311 it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
- FSTS far-end single talk state
- sequence number of each step in this embodiment does not represent a limitation on the execution order of each step.
- the order of steps between steps S301, S302, S304, S307, and S309 is not limited.
- step S309 may be set after S307 to improve the accuracy of judging the change state of the echo path.
- the selected features and decision methods are robust against uncertain factors such as signal strength changes (far and near ends, echo signals), device distortion and echo path changes, and the combined use of multiple features Makes the detection accuracy higher and the performance more reliable.
- the method for determining the echo sound field state may further include adjusting the update step size ⁇ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size ⁇ n (k) ) Is used to indicate the update step size of the filter coefficient W n (k).
- an echo adaptive filter may be used to adjust the update step size ⁇ n (k) of the signal to be determined.
- the value of the update step ⁇ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is in the dual-talk state DTS When, adjust ⁇ n (k) to slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, ⁇ n (k) takes the normal value without special adjustment; When the signal to be determined is in the idle state IDS or the near-end single talk state NSTS, ⁇ n (k) is set to 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
- the method for determining the state of the echo sound field may further include: determining whether to perform nonlinear processing on the signal to be determined according to the state of the echo sound field of the signal to be determined.
- the step of determining whether to perform nonlinear processing on the signal to be determined may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If it is determined that the echo sound field state of the signal to be determined is the echo path change state, the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, the non-linear processing of the signal to be determined is stopped.
- a post-processing non-linear processing unit can be used to perform non-linear processing on the signal to be determined.
- the degree of non-linear processing can be reduced when the signal to be determined is in the dual-talk state, so that the effective voice is not damaged, and the dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS Enhance the degree of non-linear processing to prevent leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; When it is determined that the signal is in the far-end single talk state FSTS, no special processing is performed, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
- the method for determining the state of the echo sound field may further include: according to the state of the echo sound field of the signal to be determined, determining to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
- the step of determining to reduce the noise update speed or to improve the non-stationary noise suppression capability may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, then the standby state is reduced.
- the non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is the echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
- a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
- the noise update speed can be slowed down to ensure the intelligibility of effective speech; when the signal to be determined is the far-end single-talk state When the echo path is changed, the non-stationary noise suppression capability is improved, and the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked. Thereby improving the quality of signal transmission.
- Fig. 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention.
- the signal x(n) passes through the loudspeaker (SPK) to obtain the signal h(n), which has echo, and the voice signal (voice) and noise signal (noise) after passing through the microphone (MIC) Output signal d(n).
- SPK loudspeaker
- MIC microphone
- the short-time Fourier transform is performed on the signal d(n) and signal x(n) to obtain the near-end signal D n (k) and the far-end signal X n (k).
- the adaptive filter (AF ) can be calculated far-end signal X n (k) with the filter coefficients W n (k) the echo estimation signal Y n (k), and the near-end signal D n (k) obtained by subtracting the residual signal E n ( k).
- the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
- Further far-end signal may be X n (k), near-end signal D n (k), the echo estimation signal Y n (k), the residual signal E n (k) with the filter coefficients W n (k) back to the input sound field
- the state detection unit ESD performs signal feature calculation, and makes the echo sound field state judgment based on the calculation result, and obtains the specific echo sound field state.
- the echo state can be subdivided into five sound field states: far-end single-talk state FSTS, near-end single-talk state NSTS, dual-talk state DTS, echo path change state PCS, and IDS in idle state (ie, background noise).
- an adaptive filter AF and a post-processing non-linear processing unit (NLP) and a post-processing noise suppression unit (NS) can be set to obtain a specific sound field state through ESD, and perform corresponding processing.
- the method for determining the state of the echo sound field may further include: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
- the DTS output is maintained through the holding time Thold to protect the near-end voice to the greatest extent.
- the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
- the output of the PCS will be suspended through the start time Tstart. At this time, the state output is forced to be the remote single talk FSTS to reduce the risk of filter divergence A compromise effect with suppressing echo residue.
- the value of Thold and Tstart can be set between 20 and 100 ms.
- FIG. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention.
- the apparatus for determining the state of the echo sound field may include:
- the obtaining module 51 is used to obtain the signal to be determined
- the signal determining module 52 is configured to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined;
- the update degree determination module 53 is configured to determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);
- the state determination module 54 is configured to determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
- the foregoing device may correspond to a chip with data processing function in user equipment, such as a baseband chip; or a chip module including a chip with data processing function in user equipment, or a user equipment.
- the embodiment of the present invention also provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the foregoing method when the computer instructions are executed.
- the storage medium may be a computer-readable storage medium, for example, it may include non-volatile memory (non-volatile) or non-transitory (non-transitory) memory, and may also include optical disks, mechanical hard drives, solid state hard drives, and the like.
- An embodiment of the present invention also provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the steps of the above method when the computer instructions are executed.
- the terminal includes, but is not limited to, terminal devices such as mobile phones, computers, and tablets.
- modules/units contained in the various devices and products described in the above embodiments they may be software modules/units, hardware modules/units, or part software modules/units and part hardware modules/units.
- the various modules/units contained therein can be implemented in the form of hardware such as circuits, or at least part of the modules/units can be implemented in the form of software programs. Runs on the integrated processor inside the chip, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the chip module, the modules/units contained therein can be All are implemented by hardware such as circuits.
- Different modules/units can be located in the same component (such as a chip, circuit module, etc.) or different components of the chip module, or at least part of the modules/units can be implemented by software programs.
- the software program runs on the processor integrated inside the chip module, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the terminal, the modules contained therein
- the modules/units can all be implemented by hardware such as circuits, and different modules/units can be located in the same component (for example, chip, circuit module, etc.) or different components in the terminal, or at least part of the modules/units can be implemented in the form of software programs Implementation, the software program runs on the processor integrated inside the terminal, and the remaining (if any) part of the modules/units can be implemented by hardware such as circuits.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Telephone Function (AREA)
Abstract
Description
Claims (27)
- 一种回声声场状态确定方法,其特征在于,包括以下步骤:A method for determining the state of an echo sound field is characterized in that it comprises the following steps:获取待确定信号;Obtain the signal to be determined;确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k); Determine the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined;至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update; Determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 At least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update , it is determined whether the echo sound field state of the signal to be determined is the echo path change state.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为远端单讲状态。 Determine whether the echo sound field state of the signal to be determined is a remote single talk state at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update包括: The method for determining the state of the echo sound field according to claim 1, wherein the filter is determined based on at least the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k). Cef update includes:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k); Based on the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k);根据所述残差信号E n(k),确定更新后的滤波器系数W n+1(k); According to the residual signal E n (k), determines the filter coefficient W updated n + 1 (k);根据所述滤波器系数W n(k)以及更新后的滤波器系数W n+1(k),确定所述滤波器更新度Cef update。 Determine the filter update degree Cef update according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k).
- 根据权利要求3所述的回声声场状态确定方法,其特征在于,满足以下一项或多项:The method for determining the state of the echo sound field according to claim 3, wherein one or more of the following is satisfied:采用下述公式,确定残差信号E n(k): Using the following equation to determine the residual signal E n (k):采用下述公式,确定更新后的滤波器系数W n+1(k),其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长: The following formula is used to determine the updated filter coefficient W n+1 (k), where the update step size μ n (k) is used to indicate the update step size of the filter coefficient W n (k):采用下述公式,确定滤波器更新度Cef update: Use the following formula to determine the filter update degree Cef update :
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:对所述近端信号D n(k)进行语音激活检测,以得到近端语音激活标志DVflag; Performing voice activation detection on the near-end signal D n (k) to obtain a near-end voice activation flag DVflag;如果所述近端语音激活标志DVflag不等于1,则判断所述待确定信号的回声声场状态为空闲状态。If the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:对所述远端信号X n(k)进行语音激活检测,以得到远端语音激活标志XVflag; Perform voice activation detection on the far-end signal X n (k) to obtain a far-end voice activation flag XVflag;如果所述远端语音激活标志XVflag不等于1,则判断所述待确定信号的回声声场状态为近端单讲状态。If the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之 前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:确定所述待确定信号的回波抑制比Err;Determining the echo suppression ratio Err of the signal to be determined;如果所述回波抑制比Err大于预设回波阈值Thrd err,则判断所述待确定信号的回声声场状态为远端单讲状态。 If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
- 根据权利要求7所述的回声声场状态确定方法,其特征在于,确定所述待确定信号的回波抑制比Err包括:The method for determining an echo sound field state according to claim 7, wherein determining the echo suppression ratio Err of the signal to be determined comprises:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k); Based on the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k);根据所述近端信号D n(k)与残差信号E n(k),确定信号的回波抑制比Err。 The proximal end of the signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
- 根据权利要求8所述的回声声场状态确定方法,其特征在于,满足以下一项或多项:The method for determining the state of the echo sound field according to claim 8, wherein one or more of the following is satisfied:采用下述公式,确定残差信号E n(k): Using the following equation to determine the residual signal E n (k):采用下述公式,确定信号的回波抑制比Err:Use the following formula to determine the signal echo suppression ratio Err:其中,k为所述待确定信号的频率索引。Wherein, k is the frequency index of the signal to be determined.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:确定归一化互相关值C YE与C DE; Determine the normalized cross-correlation values C YE and C DE ;如果C DE大于第一预设互相关阈值Thrd1 coh,且C YE小于第二预设互相关阈值Thrd2 coh,则判断所述待确定信号的回声声场状态为双讲状态; If C DE is greater than the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , determining that the echo sound field state of the signal to be determined is a dual-talk state;其中,所述第一预设互相关阈值Thrd1 coh大于等于所述第二预设互相关阈值Thrd2 coh。 Wherein, the first preset cross-correlation threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
- 根据权利要求10所述的回声声场状态确定方法,其特征在于,还包括以下一项或多项:The method for determining the state of the echo sound field according to claim 10, further comprising one or more of the following:如果滤波器更新度Cef update大于预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为回声路径变化状态; If the filter update degree Cef update is greater than the preset update degree threshold Thrd update , determining that the echo sound field state of the signal to be determined is the echo path change state;如果所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为远端单讲状态。 If the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
- 根据权利要求10所述的回声声场状态确定方法,其特征在于,采用下述公式,确定归一化互相关值C YE与C DE: The method for determining the state of the echo sound field according to claim 10, wherein the following formula is used to determine the normalized cross-correlation values C YE and C DE :其中,M与L为所述待确定信号的频段索引。Wherein, M and L are the frequency band indexes of the signal to be determined.
- 根据权利要求12所述的回声声场状态确定方法,其特征在于,The method for determining the state of the echo sound field according to claim 12, wherein:所述归一化互相关值C YE与C DE为线性区归一化互相关值; The normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region;其中,M与L为线性区的频段索引。Among them, M and L are the frequency band indexes of the linear region.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,还 包括:The method for determining the state of the echo sound field according to claim 1, further comprising:根据所述待确定信号的回声声场状态,调整所述待确定信号的更新步长μ n(k); Adjusting the update step size μ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined;其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长。 Wherein, the update step μ n (k) is used to indicate the update step of the filter coefficient W n (k).
- 根据权利要求14所述的回声声场状态确定方法,其特征在于,调整更新步长μ n(k)包括以下一项或多项: The method for determining the state of the echo sound field according to claim 14, wherein the adjusting and updating step size μ n (k) includes one or more of the following:如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增加更新步长μ n(k); If it is determined that the echo sound field state of the signal to be determined is the echo path change state, increase the update step size μ n (k);如果确定所述待确定信号的回声声场状态为双讲状态,则调整μ n(k)放慢更新; If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, adjust μ n (k) to slow down the update;如果确定所述待确定信号的回声声场状态为空闲状态或近端单讲状态,则调整μ n(k)=0。 If it is determined that the echo sound field state of the signal to be determined is the idle state or the near-end single talk state, adjust μ n (k)=0.
- 根据权利要求14所述的回声声场状态确定方法,其特征在于,采用回声自适应滤波器调整所述待确定信号的更新步长μ n(k)。 The method for determining the state of the echo sound field according to claim 14, wherein an echo adaptive filter is used to adjust the update step size μ n (k) of the signal to be determined.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:根据所述待确定信号的回声声场状态,确定是否对所述待确定信号进行非线性处理。According to the echo sound field state of the signal to be determined, it is determined whether to perform nonlinear processing on the signal to be determined.
- 根据权利要求17所述的回声声场状态确定方法,其特征在于,确定是否对所述待确定信号进行非线性处理包括以下一项或多项:The method for determining the state of the echo sound field according to claim 17, wherein determining whether to perform nonlinear processing on the signal to be determined comprises one or more of the following:如果确定所述待确定信号的回声声场状态为双讲状态,则减少非线性处理程度;If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reduce the degree of non-linear processing;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增强对所述待确定信号的非线性处理;If it is determined that the echo sound field state of the signal to be determined is an echo path change state, the nonlinear processing of the signal to be determined is enhanced;如果确定所述待确定信号的回声声场状态为近端单讲状态,则停 止对所述待确定信号的非线性处理;If it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop the non-linear processing of the signal to be determined;如果确定所述待确定信号的回声声场状态为空闲状态,则停止对所述待确定信号的非线性处理。If it is determined that the echo sound field state of the signal to be determined is an idle state, the non-linear processing of the signal to be determined is stopped.
- 根据权利要求17所述的回声声场状态确定方法,其特征在于,采用后处理非线性处理单元对所述待确定信号进行非线性处理。The method for determining the state of the echo sound field according to claim 17, wherein a post-processing non-linear processing unit is used to perform non-linear processing on the signal to be determined.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:根据所述待确定信号的回声声场状态,确定降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。According to the echo sound field state of the signal to be determined, it is determined to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined.
- 根据权利要求20所述的回声声场状态确定方法,其特征在于,确定降低噪声更新速度或者提高非平稳噪声抑制能力包括以下一项或多项:The method for determining the state of the echo sound field according to claim 20, wherein the determining to reduce the noise update speed or to improve the non-stationary noise suppression capability includes one or more of the following:如果确定所述待确定信号的回声声场状态为近端单讲状态,则降低所述待确定信号的噪声更新速度;If it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, reducing the noise update speed of the signal to be determined;如果确定所述待确定信号的回声声场状态为双讲状态,则降低所述待确定信号的噪声更新速度;If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the noise update speed of the signal to be determined;如果确定所述待确定信号的回声声场状态为远端单讲状态,则提高所述待确定信号的非平稳噪声抑制能力;If it is determined that the echo sound field state of the signal to be determined is the far-end single talk state, improving the non-stationary noise suppression capability of the signal to be determined;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则提高所述待确定信号的非平稳噪声抑制能力。If it is determined that the echo sound field state of the signal to be determined is an echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
- 根据权利要求20所述的回声声场状态确定方法,其特征在于,采用后处理噪声抑制单元降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。The method for determining the state of the echo sound field according to claim 20, wherein a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
- 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:确定所述待确定信号的临时声场状态;Determining the temporary sound field state of the signal to be determined;根据所述待确定信号的回声声场状态以及临时声场状态,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出。According to the echo sound field state and the temporary sound field state of the signal to be determined, it is determined that the signal to be determined is kept in a dual-talk state output or the output of the echo path change of the signal to be determined is suspended.
- 根据权利要求23所述的回声声场状态确定方法,其特征在于,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出包括以下一项或多项:The method for determining the state of the echo sound field according to claim 23, wherein the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change of the signal to be determined includes one or more of the following:如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为远端单讲状态,则通过保持时间对所述待确定信号保持双讲状态输出;If the echo sound field state of the signal to be determined is a dual-talk state, and the temporary sound field state is a remote single-talk state, then the signal to be determined is kept in a dual-talk state output through the holding time;如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为回声路径变化状态,则通过开始时间对所述待确定信号暂缓回声路径改变的输出。If the echo sound field state of the signal to be determined is a dual-talk state, and the temporary sound field state is an echo path change state, the output of the echo path change for the signal to be determined is temporarily suspended based on the start time.
- 一种回声声场状态确定装置,其特征在于,包括:A device for determining the state of an echo sound field, characterized in that it comprises:获取模块,用于获取待确定信号;The acquisition module is used to acquire the signal to be determined;信号确定模块,用于确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k); A signal determining module for determining the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined;更新度确定模块,用于至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update; The update degree determination module is configured to determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);状态确定模块,用于至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 The state determination module is configured to determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
- 一种存储介质,其上存储有计算机指令,其特征在于,所述计算机指令运行时执行权利要求1至24任一项所述回声声场状态确定方法的步骤。A storage medium having computer instructions stored thereon, wherein the computer instructions execute the steps of the method for determining the state of the echo sound field according to any one of claims 1 to 24 when the computer instructions are run.
- 一种终端,包括存储器和处理器,所述存储器上存储有能够在所 述处理器上运行的计算机指令,其特征在于,所述处理器运行所述计算机指令时执行权利要求1至24任一项所述回声声场状态确定方法的步骤。A terminal, comprising a memory and a processor, and computer instructions that can run on the processor are stored on the memory, wherein the processor executes any one of claims 1 to 24 when the computer instructions are executed. The steps of the method for determining the state of the echo sound field described in the item.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010223647.6A CN111654585B (en) | 2020-03-26 | 2020-03-26 | Echo sound field state determination method and device, storage medium and terminal |
CN202010223647.6 | 2020-03-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021190274A1 true WO2021190274A1 (en) | 2021-09-30 |
Family
ID=72346411
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/079181 WO2021190274A1 (en) | 2020-03-26 | 2021-03-05 | Method and device for determining state of echo sound field, storage medium, and terminal |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111654585B (en) |
WO (1) | WO2021190274A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111654585B (en) * | 2020-03-26 | 2021-08-03 | 紫光展锐(重庆)科技有限公司 | Echo sound field state determination method and device, storage medium and terminal |
CN115019816B (en) * | 2021-03-03 | 2025-07-11 | 阿里巴巴创新公司 | Echo state detection method, device, computer storage medium and chip |
CN115641866A (en) * | 2022-10-13 | 2023-01-24 | 展讯通信(上海)有限公司 | Signal processing method and device, computer readable storage medium and terminal |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160006880A1 (en) * | 2014-07-02 | 2016-01-07 | Youhong Lu | Variable step size echo cancellation with accounting for instantaneous interference |
CN108986837A (en) * | 2018-09-05 | 2018-12-11 | 科大讯飞股份有限公司 | A kind of filter update method and device |
CN109348072A (en) * | 2018-08-30 | 2019-02-15 | 湖北工业大学 | A double-ended talk detection method applied to echo cancellation system |
CN109524018A (en) * | 2017-09-19 | 2019-03-26 | 华为技术有限公司 | A kind of echo processing method and equipment |
CN109712636A (en) * | 2019-03-07 | 2019-05-03 | 出门问问信息科技有限公司 | Near-end speech restorative procedure and system in a kind of echo cancellation process |
CN111654585A (en) * | 2020-03-26 | 2020-09-11 | 紫光展锐(重庆)科技有限公司 | Echo sound field state determination method and device, storage medium and terminal |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6563803B1 (en) * | 1997-11-26 | 2003-05-13 | Qualcomm Incorporated | Acoustic echo canceller |
US6434110B1 (en) * | 1998-03-20 | 2002-08-13 | Cirrus Logic, Inc. | Full-duplex speakerphone circuit including a double-talk detector |
DE19935587A1 (en) * | 1998-08-04 | 2000-02-17 | Motorola Inc | Detection of echo state in duplex transmission e.g. when using mobile phone system, is achieved by monitoring adaptive filter coefficient update, which reveals phantom signal presence on exceeding threshold |
JP3492315B2 (en) * | 2000-12-15 | 2004-02-03 | 沖電気工業株式会社 | Echo canceller with automatic volume adjustment |
JP3917116B2 (en) * | 2003-08-01 | 2007-05-23 | 日本電信電話株式会社 | Echo canceling apparatus, method, echo canceling program, and recording medium recording the program |
JP4678349B2 (en) * | 2006-08-31 | 2011-04-27 | ヤマハ株式会社 | Call determination device |
CN102739286B (en) * | 2011-04-01 | 2014-06-11 | 中国科学院声学研究所 | Echo cancellation method used in communication system |
US9088336B2 (en) * | 2012-09-06 | 2015-07-21 | Imagination Technologies Limited | Systems and methods of echo and noise cancellation in voice communication |
US9191493B2 (en) * | 2013-12-09 | 2015-11-17 | Captioncall, Llc | Methods and devices for updating an adaptive filter for echo cancellation |
CN107332591B (en) * | 2016-04-29 | 2021-01-05 | 北京紫光展锐通信技术有限公司 | Repeater and echo interference elimination method and device thereof |
CN108630219B (en) * | 2018-05-08 | 2021-05-11 | 北京小鱼在家科技有限公司 | Processing system, method and device for echo suppression audio signal feature tracking |
CN110634496B (en) * | 2019-10-22 | 2021-12-24 | 广州视源电子科技股份有限公司 | Double-talk detection method and device, computer equipment and storage medium |
CN110838300B (en) * | 2019-11-18 | 2022-03-25 | 紫光展锐(重庆)科技有限公司 | Echo cancellation processing method and processing system |
-
2020
- 2020-03-26 CN CN202010223647.6A patent/CN111654585B/en active Active
-
2021
- 2021-03-05 WO PCT/CN2021/079181 patent/WO2021190274A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160006880A1 (en) * | 2014-07-02 | 2016-01-07 | Youhong Lu | Variable step size echo cancellation with accounting for instantaneous interference |
CN109524018A (en) * | 2017-09-19 | 2019-03-26 | 华为技术有限公司 | A kind of echo processing method and equipment |
CN109348072A (en) * | 2018-08-30 | 2019-02-15 | 湖北工业大学 | A double-ended talk detection method applied to echo cancellation system |
CN108986837A (en) * | 2018-09-05 | 2018-12-11 | 科大讯飞股份有限公司 | A kind of filter update method and device |
CN109712636A (en) * | 2019-03-07 | 2019-05-03 | 出门问问信息科技有限公司 | Near-end speech restorative procedure and system in a kind of echo cancellation process |
CN111654585A (en) * | 2020-03-26 | 2020-09-11 | 紫光展锐(重庆)科技有限公司 | Echo sound field state determination method and device, storage medium and terminal |
Also Published As
Publication number | Publication date |
---|---|
CN111654585B (en) | 2021-08-03 |
CN111654585A (en) | 2020-09-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11601554B2 (en) | Detection of acoustic echo cancellation | |
WO2021190274A1 (en) | Method and device for determining state of echo sound field, storage medium, and terminal | |
US20220301577A1 (en) | Echo cancellation method and apparatus | |
US9088336B2 (en) | Systems and methods of echo and noise cancellation in voice communication | |
US6792107B2 (en) | Double-talk detector suitable for a telephone-enabled PC | |
US9516159B2 (en) | System and method of double talk detection with acoustic echo and noise control | |
US5598468A (en) | Method and apparatus for echo removal in a communication system | |
JP4282260B2 (en) | Echo canceller | |
CN103748865B (en) | Utilize the clock deskew of the acoustic echo arrester of not audible tone | |
CN109716743B (en) | Full duplex voice communication system and method | |
CN111768796A (en) | Method and device for acoustic echo cancellation and de-reverberation | |
CN102065190A (en) | Method and device for eliminating echo | |
TWI594234B (en) | A method and device for detecting near-end voice signal | |
CN111742541A (en) | Acoustic echo cancellation method and device | |
CN106571147A (en) | Method for suppressing acoustic echo of network telephone | |
CN110995951A (en) | Echo cancellation method, device and system based on double-end sounding detection | |
US8831210B2 (en) | Method and system for detection of onset of near-end signal in an echo cancellation system | |
CN111756906B (en) | Echo suppression method and device for voice signal and computer readable medium | |
CN111355855B (en) | Echo processing method, device, equipment and storage medium | |
US9083783B2 (en) | Detecting double talk in acoustic echo cancellation using zero-crossing rate | |
CN106297816B (en) | Echo cancellation nonlinear processing method and device and electronic equipment | |
EP4554101A1 (en) | Echo suppressing device, echo suppressing method, and echo suppressing program | |
CN111970410B (en) | Echo cancellation method and device, storage medium and terminal | |
CN113808609B (en) | Echo detection method and device, computer readable storage medium and terminal equipment | |
CN119091902A (en) | Echo cancellation method, electronic device, computer program product and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21774667 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202217060680 Country of ref document: IN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21774667 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21774667 Country of ref document: EP Kind code of ref document: A1 |