Disclosure of Invention
The invention aims to provide a method and a system for eliminating crosstalk of a bone conduction earphone, which have good crosstalk elimination effect.
In order to achieve the purpose, the invention provides the following scheme:
a method for eliminating crosstalk of a bone conduction headset comprises the following steps:
acquiring a bone conduction transmission matrix when a user wears a bone conduction earphone;
acquiring a stop band frequency range of the shape factor according to a frequency range of appearance of singular values of personalized bone conduction transfer functions at two sides of ears of a user;
obtaining a filtered approximate cross talk elimination matrix according to the bone conduction transfer matrix, the stop band frequency range of the shape factor and the filter order during filtering;
using the approximate cross-talk elimination matrix to eliminate cross-talk of the binaural signals input to the bone conduction earphone, so as to obtain sound signals eliminating the cross-talk; the binaural signal is a signal containing sound source position information;
playing the sound signals for eliminating the crosstalk to obtain sound signals heard by the two ears of the user;
determining a final cross talk elimination matrix according to the bone conduction transfer matrix, the approximate cross talk elimination matrix and the sound signals heard by the ears of the user;
and carrying out crosstalk elimination on the binaural signals input to the bone conduction earphone according to the final crosstalk elimination matrix.
Optionally, the obtaining of the bone conduction transmission matrix when the user wears the bone conduction headset specifically includes:
inducing a stimulation frequency otoacoustic emission signal with bone conduction sweep tones;
according to the sweep frequency stimulation signal and the stimulation frequency otoacoustic emission signal, a bone conduction transfer function from the bone conduction earphone to the inner ears on the two sides of the user is worked out;
and obtaining a bone conduction transfer matrix according to the bone conduction transfer function.
Optionally, the obtaining a filtered approximate cross talk cancellation matrix according to the bone conduction transfer matrix, the stopband frequency range of the shape factor, and the filter order during filtering specifically includes: and obtaining a filtered approximate cross talk elimination matrix by utilizing a frequency domain fast deconvolution method according to the bone conduction transfer matrix, the stopband frequency range of the shape factor and the filter order during filtering.
Optionally, the determining a final crosstalk elimination matrix according to the bone conduction transfer matrix, the approximate crosstalk elimination matrix, and the sound signals heard by the ears of the user specifically includes:
calculating a channel separation degree according to the bone conduction transfer matrix and the approximate crosstalk elimination matrix;
calculating a performance error from the sound signals heard by both ears of the user and the desired signal;
and determining a final crosstalk elimination matrix according to the channel separation degree and the performance error.
Optionally, the binaural signal is obtained by synthesizing the single-channel signal according to a head-related transfer function including sound source position information.
Optionally, the single-channel signal is a maximum length pseudorandom sequence signal of 1.5 seconds, and a sampling rate of the single-channel signal is 44.1 kHz.
A crosstalk cancellation system for a bone conduction headset, comprising:
the transmission matrix acquisition module is used for acquiring a bone conduction transmission matrix when a user wears the bone conduction earphone;
the shape factor stopband frequency range acquisition module is used for acquiring the stopband frequency range of the shape factor according to the frequency range of the appearance of the singular value of the personalized bone conduction transfer function at two sides of the ears of the user;
the crosstalk elimination matrix obtaining module is used for obtaining a filtered approximate crosstalk elimination matrix according to the bone conduction transfer matrix, the stop band frequency range of the shape factor and the filter order during filtering;
a first crosstalk elimination module for performing crosstalk elimination on a binaural signal input to the bone conduction headset by using the approximate crosstalk elimination matrix; obtaining a sound signal for eliminating crosstalk; the binaural signal is a binaural signal containing sound source position information;
the sound signal acquisition module is used for playing the sound signal for eliminating the crosstalk to obtain sound signals heard by two ears of the user;
a cross talk elimination matrix determination module for determining a final cross talk elimination matrix according to the bone conduction transfer matrix, the approximate cross talk elimination matrix and the sound signals heard by the ears of the user;
and the second crosstalk elimination module is used for performing crosstalk elimination on the binaural signals input to the bone conduction earphone according to the final crosstalk elimination matrix.
Optionally, the transfer matrix obtaining module includes:
a transmitting signal unit for inducing a stimulation frequency otoacoustic transmitting signal using bone conduction sweep tones;
the bone conduction transfer function calculation unit is used for solving a bone conduction transfer function from the bone conduction earphone to the inner ears on the two sides of the user according to the sweep frequency stimulation signal and the stimulation frequency otoacoustic emission signal;
and the bone conduction transfer matrix acquisition unit is used for obtaining a bone conduction transfer matrix according to the bone conduction transfer function.
Optionally, the cross-talk elimination matrix determining module includes:
a channel separation degree calculation unit for calculating a channel separation degree according to the bone conduction transfer matrix and the approximate cross-talk elimination matrix;
a performance error calculation unit for calculating a performance error from the sound signal heard by both ears of the user and the desired signal;
and the determining unit is used for determining a final crosstalk elimination matrix according to the channel separation degree and the performance error.
Optionally, the first crosstalk elimination module includes a synthesis unit, configured to synthesize the one-channel signal according to a head-related transfer function including sound source position information to obtain the binaural signal
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
the invention discloses a method and a system for eliminating crosstalk of a bone conduction headset, wherein the method comprises the following steps: acquiring a bone conduction transmission matrix when a user wears a bone conduction earphone; obtaining a stopband frequency range of the shape factor; obtaining a filtered approximate cross talk elimination matrix according to the bone conduction transfer matrix, the stop band frequency range of the shape factor and the filter order during filtering; using the approximate cross-talk elimination matrix to eliminate cross-talk of the binaural signals input to the bone conduction earphone; obtaining a sound signal for eliminating crosstalk; playing the sound signals for eliminating the crosstalk to obtain the sound signals heard by the two ears of the user; determining a final crosstalk elimination matrix according to the bone conduction transfer matrix, the approximate crosstalk elimination matrix and the sound signals heard by the ears of the user; and carrying out crosstalk elimination on the binaural signals input to the bone conduction earphone according to the final crosstalk elimination matrix. The invention adjusts and obtains the proper crosstalk elimination matrix according to the bone conduction transfer matrix, the approximate crosstalk elimination matrix and the sound signals heard by the ears of the user, and can obtain the optimal crosstalk elimination effect.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention aims to provide a method and a system for eliminating crosstalk of a bone conduction earphone, which have good crosstalk elimination effect.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Example 1
Fig. 1 is a flowchart of a crosstalk elimination method for a bone conduction headset according to an embodiment of the present invention. As shown in fig. 1, the method comprises:
step 101: and acquiring a bone conduction transmission matrix when the user wears the bone conduction earphone. In this embodiment, the following concrete steps are performed: and inducing a stimulation frequency otoacoustic emission signal by utilizing bone conduction frequency sweeping sound, solving a bone conduction transfer function from the bone conduction earphone to the inner ears at the two sides of the user according to the frequency sweeping stimulation signal and the stimulation frequency otoacoustic emission signal, and obtaining a bone conduction transfer matrix according to the bone conduction transfer function.
Step 102: and acquiring the stop band frequency range of the shape factor according to the frequency range of the appearance of the singular value of the personalized bone conduction transfer function at two sides of the ears of the user.
Step 103: and obtaining a filtered approximate cross talk elimination matrix according to the bone conduction transfer matrix, the stop band frequency range of the shape factor and the filter order during filtering. In this embodiment, the following concrete steps are performed: and obtaining a filtered approximate cross talk elimination matrix by utilizing a frequency domain fast deconvolution method according to the bone conduction transfer matrix, the stopband frequency range of the shape factor and the filter order during filtering.
Step 104: and performing crosstalk cancellation on the binaural signal input to the bone conduction earphone by using the approximate crosstalk cancellation matrix to obtain a sound signal with crosstalk cancelled, wherein the binaural signal is a signal containing sound source position information. In the present embodiment, the binaural signal is obtained by synthesizing a mono signal based on a Head-Related Transfer Function (HRTF) including sound source position information. The single-path signal is a maximum length pseudo-random sequence signal of 1.5 seconds, and the sampling rate of the single-path signal is 44.1 kHz.
Step 105: and playing the sound signal for eliminating the crosstalk to obtain the sound signals heard by the two ears of the user. In this embodiment, when bone conduction sound reproduction is performed in both ears, the wearing positions of the two bone conduction headphones should be the same as the head positions of the subject when measuring BCTF, and the error should not be too large. The wearing position is the mastoid position behind the ears at both sides.
Step 106: determining a final cross talk cancellation matrix from the bone conduction transfer matrix, the approximate cross talk cancellation matrix, and the sound signals heard by the ears of the user. In this embodiment, the following are specifically mentioned: and calculating a channel separation degree according to the bone conduction transfer matrix and the approximate cross talk elimination matrix, calculating a performance error according to the sound signals heard by the ears of the user and the expected signals, and determining a final cross talk elimination matrix according to the channel separation degree and the performance error.
Step 107: and carrying out crosstalk elimination on the binaural signals input to the bone conduction earphone according to the final crosstalk elimination matrix.
Example 2
Selecting a subject (24 years old) with normal hearing and no history of external ears or middle ears, firstly inducing a stimulation frequency otoacoustic emission signal by utilizing bone conduction frequency sweeping sound, subsequently extracting a pure frequency sweeping stimulation frequency otoacoustic emission signal, obtaining a bone conduction impulse response signal by performing cross-correlation operation on the bone conduction impulse response signal and the frequency sweeping stimulation signal, and estimating a bone conduction transfer function from a bone conduction earphone to the inner ears at two sides of the subject by Fourier transform so as to obtain a corresponding bone conduction transmission matrix. The subject was seated in a sound isolation booth for listening and the equipment connections are shown in figure 2.
Firstly, setting the stopband frequency range of a shape factor B (z) according to the frequency range of singular values of personalized BCTF (bone conduction transfer function) on two sides of the subject to suppress pathological phenomena, wherein the stopband frequency range of B (z) set according to the data of the subject is 0.55-7.5 kHz.
Obtaining a set of causal finite impulse response filters H (z) by a frequency domain fast deconvolution algorithm such that the product of H (z) and C (z) approximates to identity matrix I, such that the signal arriving at both ears is approximately equal to the input binaural signal XLAnd XR. Fig. 3 is a process diagram of a frequency domain fast deconvolution method according to an embodiment of the present invention, as shown in fig. 3, x (z) is an input binaural signal, h (z) is a cross talk cancellation filter, v (z) is a bone conduction earphone input signal, w (z) is an estimation signal, d (z) is a desired signal, e (z) is an error signal, a (z) is a target function, and z is a target function-mThe target is delayed by m sample points. The design of the cross-talk cancellation filter can be equivalent to the problem of minimizing a cost function, where the cost function is:
J=E+β1V=eH(z)e(z)+β1vH(z)v(z) (1)
wherein the superscript H denotes the conjugate transpose operator, eH(z) e (z) is a performance error term, β1vH(z) v (z) is a cost term, wherein1For regularizationThe parameters, representing the weights of the frequency weighting function, which are often used to limit the filter gain and the matrix reversibility of the cross-talk cancellation system, may be constant or frequency dependent. In order to facilitate the control of the gain of each frequency point, it is considered to add a shape factor b (z) related to the frequency, i.e. β1Decomposed into a regularization constant gain factor β and a shape factor b (z). From the condition of J minimization, the approximate solution of the cross-talk cancellation matrix H can be solved as:
H(z)=(CH(z)C(z)+βBH(z)B(z))-1CH(z)z-m (2)
wherein C represents a bone conduction transfer function of the subject including the transmission characteristic of the bone conduction vibrator, and z-mThe method is used for ensuring causality of the cross-talk elimination filtering, m in the embodiment is 3.125ms, beta is a regularization constant gain factor, beta is more than 0 and less than or equal to 1, and a shape factor B (Z) is a filter gain of a Z-domain filter for limiting a selected frequency band, so that a pathological phenomenon is prevented. Digital sampling is performed on the basis of the formula (2), and then the coefficients of the corresponding discrete frequency points can be obtained, as shown in the following formula:
H(z)=[CH(k)C(k)+βBH(k)B(k)]-1CH(k)exp{[-j2π(k-1)m]/N} (3)
where k is 1 … N, and N is the order of the filter.
In this embodiment, the regularization constant gain factors β are 10 respectively-2、10-4、10-6And 10-8The influence of different β on the Crosstalk Cancellation System (CCS) Performance is observed, and an optimal β value is selected by Channel Separation (CS) and Performance Error (PE). The filter order is 2048.
The performance of the cross-talk cancellation system is adjusted by the CS and PE.
CS represents the amplitude ratio between the cross-talk and direct sounds in dB, with smaller values indicating better channel separation and vice versa. The channel separation calculation formula for the left and right ears is:
wherein, the bone conduction transmission matrix C and the crosstalk elimination matrix H are respectively
PE represents the ratio of the output of the system to the desired signal in the frequency domain, with the unit being expressed in dB, with values closer to 0dB indicating smaller performance errors and vice versa. The performance error calculation formula is as follows:
wherein XWFor the signal heard by the ears of the user, XDIs the desired signal.
Fig. 4 is a block diagram of a cross talk cancellation algorithm for binaural bone conduction sound according to an embodiment of the present invention. After a cross-talk elimination matrix H is obtained through a fast deconvolution algorithm, a binaural signal X is obtained by filtering a single-path signal S through an HRTFLAnd XRWhere S is the 1.5 second MLS signal and the sample rate is 44.1 kHz. Obtaining input signals Y of the left and right bone conduction earphones after filtering through a cross-talk elimination filter HLAnd YRThen, the subject was subjected to subjective listening test as shown in FIG. 2, and Y was measuredLAnd YRRespectively to bone conduction earphones worn on the left mastoid process and the right mastoid process. Meanwhile, two indexes of CS and PE are calculated according to the expressions (4), (5) and (8), and the shape factor B (z), the regularization constant gain factor beta and the filter order N are adjusted according to the calculation result.
FIG. 5 shows the present inventionFig. 6 is a schematic diagram of the degree of separation of a left ear channel provided in the embodiment of the present invention. The curves of the two graphs are similar in trend, and it can be seen from the graphs that the smaller the regularization constant gain factor beta is, the higher the channel separation degree is, i.e., the better the crosstalk elimination effect is. This is because the smaller β is, the lower the proportion of the cost term in equation (1) is, and the performance of error approximation is improved, and the channel separation degree is also improved. When beta is less than or equal to 10-4When the value of CS is basically below-50 dB, when beta is 10-2For frequencies above 5kHz, the CS values are all above-50 dB, and the four curves all have a large valley near 2 kHz. FIG. 7 is a schematic diagram of performance error when β ≧ 10-4The PE value is closer to 0dB, namely the performance error is smaller, the fluctuation is larger in the middle-low frequency range before 4kHz, and the performance error is larger near 9-10 kHz along with the increase of the frequency; when beta is less than 10-4The performance error is large at the first 5kHz, after which the PE value is close to 0dB, probably because the stopband range of the shape factor is not set properly. According to the CS and PE indexes of the subject, the optimal regularization constant gain factor suitable for the subject can be judged to have the value range of beta larger than or equal to 10-4The actual most suitable parameter value is also selected according to the comprehensive influence effect of the parameters such as the shape factor and the filter order.
Example 3
Fig. 8 is a block diagram of a crosstalk elimination system of a bone conduction earphone according to an embodiment of the present invention, and as shown in fig. 8, the system includes:
a transmission matrix obtaining module 201, configured to obtain a bone conduction transmission matrix when a user wears the bone conduction headset.
In this embodiment, the transfer matrix obtaining module 201 includes: a transmitting signal unit for inducing a stimulation frequency otoacoustic transmitting signal using bone conduction sweep tones; the bone conduction transfer function calculation unit is used for solving a bone conduction transfer function from the bone conduction earphone to the inner ears on the two sides of the user according to the sweep frequency stimulation signal and the stimulation frequency otoacoustic emission signal; and the bone conduction transfer matrix acquisition unit is used for obtaining a bone conduction transfer matrix according to the bone conduction transfer function.
The shape factor stop band frequency range obtaining module 202 is configured to obtain a stop band frequency range of the shape factor according to a frequency range where singular values of the personalized bone conduction transfer functions on two sides of ears of the user appear.
The cross talk elimination matrix obtaining module 203 is configured to obtain a filtered approximate cross talk elimination matrix according to the bone conduction transfer matrix, the stop band frequency range of the shape factor, and the filter order during filtering.
A first crosstalk elimination module 204, configured to perform crosstalk elimination on a binaural signal input to the bone conduction earphone by using the approximate crosstalk elimination matrix, to obtain a sound signal with crosstalk eliminated; the binaural signal is a signal containing sound source position information.
In this embodiment, the first crosstalk elimination module 204 includes a synthesis unit, configured to synthesize the single-channel signal according to a head-related transfer function including sound source position information to obtain the binaural signal.
And the sound signal acquisition module 205 is configured to play the sound signal with the crosstalk eliminated, so as to obtain sound signals heard by two ears of the user.
A cross talk cancellation matrix determination module 206, configured to determine a final cross talk cancellation matrix according to the bone conduction transfer matrix, the approximate cross talk cancellation matrix, and the sound signals heard by both ears of the user.
In this embodiment, the cross-talk elimination matrix determining module 206 includes: a channel separation degree calculation unit for calculating a channel separation degree according to the bone conduction transfer matrix and the approximate cross-talk elimination matrix; a performance error calculation unit for calculating a performance error from the sound signal heard by both ears of the user and the desired signal; and the determining unit is used for determining a final crosstalk elimination matrix according to the channel separation degree and the performance error.
And a second crosstalk elimination module 207, configured to perform crosstalk elimination on the binaural signal input to the bone conduction headset according to the final crosstalk elimination matrix.
According to the embodiment of the invention, the invention discloses the following technical effects:
1. according to the invention, the cross-talk elimination in the binaural bone conduction sound reproduction is researched according to the bilateral BCTF data measured by the heads of different subjects, so that the binaural bone conduction sound reproduction effect can be better improved.
2. The invention adjusts and obtains the proper crosstalk elimination matrix according to the bone conduction transfer matrix, the approximate crosstalk elimination matrix and the sound information heard by the ears of the user, and can obtain the optimal crosstalk elimination effect.
3. The applied frequency domain fast deconvolution algorithm has the advantages of small calculation amount, simplicity, effectiveness and high practicability.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and similar parts between the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description.
The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.