Description APPARATUS AND METHOD TO CANCEL CROSSTALK AND
STEREO SOUND GENERATION SYSTEM USING THE SAME
Technical Field
[1] The present general inventive concept relates to a virtual sound system, and more particularly, to an apparatus and method to cancel crosstalk between 2-channel speakers and two ears of a listener and a stereo sound generation system using the same.
Background Art
[2] A stereo sound system disposes a sound source in a predetermined position of a virtual space through a headphone or speaker and provides a directional perception, a distance perception, and a spatial perception as though a sound is actually being heard from a place at which a virtual sound source of the sound is located. Generally, a stereo sound is implemented by a binaural synthesis filter using a head related transfer function (HRTF) that is an acoustic transfer function between sound sources and eardrums. A stereo sound using the binaural synthesis filter shows the best performance when a signal is reproduced through a headphone. However, if the signal is reproduced through two speakers, crosstalk between the two speakers and two ears of a listener occur such that a stereo perception is degraded. Accordingly, a crosstalk canceller cancels the crosstalk between both signals so that a signal reproduced through a left speaker is not heard in a right ear of the listener and a signal reproduced through a right speaker is not heard in a left ear of the listener.
[3] A technology related to this crosstalk canceller is described in U.S. Patent No.
6,668,061 Bl (filed 18 Nov. 1998, entitled, 'CROSSTALK CANCELLER ').
[4] FIG. 1 illustrates a conventional crosstalk canceller. The crosstalk canceller of FIG.
1 is called a lattice structure and includes four filters 142, 143, 144, and 145.
[5] Referring to HG. 1, a left input signal 140
is convoluted through a filter 142, and a right input signal 141
(B R) is convoluted through a filter 144. Subsequently, the two convoluted signals are added to each other by an adder 150 and reproduced as a left output signal 152
(SL) In addition, the right input signal 141
(B R)
is convoluted through a filter 145, and the left input signal 140
is convoluted through a filter 143. Subsequently, the two convoluted signals are added to each other by an adder 151 and reproduced as a right output signal 153
Disclosure of Invention
Technical Problem
[6] In the cross-talk canceling method illustrated in FlG. 1, a convolution operation is performed four times with respect to the four filters 142, 143, 144, and 145. Thus, a large amount of computation is required when the order of each filter is high. Accordingly, in the conventional crosstalk canceller, it is difficult to perform convolution in a time domain and convolution should be performed in a frequency domain. However, since convolution in the frequency domain requires a large-size memory, the size of a program must also be increased.
Technical Solution
[7] The present general inventive concept provides an apparatus and method to cancel a crosstalk phenomenon between 2-channel speakers and two ears of a listener and a stereo sound generation system using the same.
[8] Additional aspects of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.
Advantageous Effects
[9] According to embodiments of the present general inventive concept as described above, a crosstalk phenomenon between two speakers and two ears of a listener is cancelled such that a desired performance in many applied fields including a stereo sound system can be maximized. In addition, in a crosstalk canceller according to the embodiments of the present general inventive concept, a number of filters is reduced from 4 to 2 from a conventional lattice structure and a convolution is performed only twice such that an amount of computation and a size of a memory can be reduced by 50% from the conventional lattice structure.
Description of Drawings
[10] These and/or other aspects and advantages of the present general inventive concept will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
[11] FlG. 1 illustrates a conventional crosstalk canceller;
[12] FlG. 2 illustrates an apparatus to cancel a crosstalk according to an embodiment of the present general inventive concept;
[13] FlG. 3 illustrates a crosstalk phenomenon that occurs between two speakers and two ears of a listener;
[14] FlG. 4 illustrates a crosstalk canceller having a lattice structure for explaining the cancellation of the crosstalk phenomenon of FlG. 3 in more detail;
[15] FlG. 5 illustrates head related transfer function (HRTF) pairs of adjacent loud speakers;
[16] FlG. 6 illustrates an approximated asymmetrical crosstalk canceller, according to an embodiment of the present general inventive concept;
[17] FlG. 7 is a block diagram illustrating the approximated asymmetrical crosstalk canceller of FlG. 6;
[18] FlG. 8 illustrates an approximated symmetrical crosstalk canceller, according to an embodiment of the present general inventive concept; and
[19] FlG. 9 is a block diagram illustrating the approximated symmetrical crosstalk canceller of FlG. 8, according to an embodiment of the present general inventive concept.
Best Mode
[20] The foregoing and/or other aspects of the present general inventive concept are achieved by providing an apparatus to cancel a crosstalk between two speakers and two ears of a listener, the apparatus including a delay unit to delay first and second channel input signals with respective predetermined delay values, a gain unit to adjust an output gain of each of the first and second channel input signals delayed in the delay unit, a first addition unit to add the first channel input signal to the gain and delay- adjusted second channel signal, a first filter unit to adjust a frequency characteristic of a signal output from the first addition unit, a second addition unit to add the second channel input signal to the gain and delay-adjusted first channel signal, and a second filter unit to adjust a frequency characteristic of a signal output from the second addition unit.
[21] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to cancel a crosstalk between two speakers and two ears of a listener, the apparatus including first and second filter units to adjust frequency characteristics of first and second channel signals, a delay unit to delay output signals of the first and second filter units with respective predetermined delay values, a gain unit to adjust an output level of each of the signals delayed in the delay unit, a first addition unit to add an output signal of the first filter unit to a gain and delay-adjusted output signal of the second filter unit, and a second addition unit to add an output signal of the second filter unit to a gain and delay-adjusted output signal of
the first filter unit.
[22] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a crosstalk canceling apparatus, including a gain/delay processing unit to receive first and second input channel signals, to apply a first gain and a first delay to the first input channel signal, to apply a second gain and a second delay to the second input channel signal, to add the gain/delayed first channel signal to the second input channel signal to obtain a first added signal, to add the gain/delayed second channel signal to the first input channel signal to obtain a second added signal, and to output the first and second added signals, and a filter unit to perform a first convolution operation to the first added signal and a second convolution operation to the second added signal and to output the first and second convoluted signals to first and second speakers, respectively.
[23] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a crosstalk processing apparatus, including a filter unit to filter left and right channel signals associated with left and right channel speakers, respectively, and a gain/delay unit to process the filtered left channel signal by approximating a first head related transfer function for the left channel speaker and predetermining a first gain difference and a first delay difference between a right ear position and a left ear position in a sound space with respect to the left channel speaker, and to process the filtered right channel signal by approximating a second head related transfer function for the right channel speaker and predetermining a second gain difference and a second delay difference between the right ear position and the left ear position in the sound space with respect to the right channel speaker.
[24] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a stereo sound production system, including first and second speakers, and a crosstalk canceling apparatus to cancel a crosstalk between the first and second speakers and two ears of a listener. The crosstalk canceling apparatus includes a first signal processing unit to cross-mix first and second channel signals with gain and delay-adjusted first and second channel signals, and a second signal processing unit to adjust frequency characteristics of the signals mixed in the first signal processing unit and to provide the signals with the adjusted frequency characteristics to the first and second speakers.
[25] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a crosstalk canceling apparatus to generate a virtual sound without crosstalk between left and right channel speakers using the following predetermined relationship between acoustic transfer functions for each of the left and right speakers
[26]
[27] where
H1(Z) represents a first acoustic transfer function between a selected one of the left and right speakers and an ear that is closer to the selected speaker,
H2(Z) represents a second acoustic transfer function between the selected speaker and an ear that is distant from the selected speaker, α represents a gain difference between the selected speaker and the close and distant ears, and β represents a delay difference between the selected speaker and the close and distant ears.
[28] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of canceling crosstalk between two speakers and two ears of a listener, the method including inputting left and right channel signals binaural synthesized by a head related transfer function (HRTF), adjusting a gain and a delay of the left channel input signal, adjusting a gain and a delay of the right channel input signal, adding the left channel input signal to the gain and delay-adjusted right channel signal to obtain a first mixed signal, adjusting a frequency characteristic of the first mixed signal in an inverse HRTF form and outputting a result to a left speaker, adding the right channel input signal to the gain and delay-adjusted left channel signal to obtain a second mixed signal, and adjusting a frequency characteristic of the second mixed signal in an inverse HRTF form and outputting a result to a right speaker.
[29] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of generating sound at a listening point using 2-channel speakers, the method including receiving first and second channel signals corresponding to first and second speakers, respectively, approximating a second head related transfer function between the first speaker and a second ear of a listener based on a first head related transfer function between the first speaker and a first ear of the listener, a corresponding first delay value, and a corresponding first gain value, approximating a fourth head related transfer function between the second speaker and the first ear of the listener based on a third head related transfer function between the second speaker and the second ear of the listener, a corresponding second delay value, and a corresponding second gain value, and processing the first and second channel signals according to the first, approximated second, third, and approximated fourth head related transfer functions to cancel crosstalk between the first and second speakers.
[30] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a computer readable medium containing executable code to
cancel a crosstalk between two speakers and two ears of a listener, the medium including first executable code to cross-mix first and second channel signals with gain and delay-adjusted first and second channel signals, and second executable code to adjust frequency characteristics of the signals mixed in the first signal processing unit.
Mode for Invention
[31] Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The embodiments are described below in order to explain the present general inventive concept by referring to the figures.
[32] FlG. 2 illustrates an apparatus to cancel a crosstalk according to an embodiment of the present general inventive concept. The crosstalk canceling apparatus of FlG. 2 includes a first signal processing unit 210 and a second signal processing unit 220. The first signal processing unit 210 includes a first gain unit 212, a second gain unit 216, a first delay unit 214, a second delay unit 218, a first addition unit 219-1, and a second addition unit 219-2. The first signal processing unit 210 cross- mixes a left channel signal
and a right channel signal
(B R) with gain/delay-adjusted or delay/gain-adjusted left channel signal
and right channel signal
(B R)
. The second signal processing unit 220 includes a first filter unit 222 and a second filter unit 224. The second signal processing unit 220 adjusts frequency characteristics of each of the signals mixed in the first signal processing unit 210. The order of the first and second gain units 212 and 216 and the first and second delay units 214 and 218 can be changed according to the desired implementation. That is, in another embodiment of the crosstalk canceling apparatus, the first and second delay units 214 and 218 may be switched with the first and second gain units 212 and 216, respectively. [33] Referring to FlG. 2, the first gain unit 212 adjusts the gain of the left channel signal
being input with a first predetermined gain value.
[34] The second gain unit 216 adjusts the gain of the right channel signal
(B R) being input with a second predetermined gain value. [35] The first delay unit 214 delays the left channel signal
gain-adjusted in the first gain unit 212 with a first predetermined delay value. [36] The second delay unit 218 delays the right channel signal
(B R) gain-adjusted in the second gain unit 216 with a second predetermined delay value. [37] The first addition unit 219-1 adds the left channel signal
being input to the first signal processing unit 210 to the right channel signal
(B R)
, which has been gain and delay-adjusted by the second gain unit 216 and the second delay unit 218. [38] The second addition unit 219-2 adds the right channel signal
(B R) being input to the first signal processing unit 210 to the left channel signal
(BJ
, which has been gain and delay-adjusted by the first gain unit 212 and the first delay unit 214. [39] The first filter unit 222 has an inverse HRTF form of an HRTF that is an acoustic transfer function between speakers and two ears of a listener, and adjusts the frequency characteristic of a signal mixed in the first addition unit 219-1. An output signal
(SL) of the first filter unit 222 is output to a left speaker. [40] The second filter unit 224 has the inverse HRTF form of the HRTF that is the acoustic transfer function between the speakers and the two ears of the listener, and adjusts the frequency characteristic of a signal mixed in the second addition unit 219-2. An output signal
of the second filter unit 224 is output to a right speaker.
[41] The apparatus to cancel a crosstalk having an improved structure of FIG. 2 will be described with more detail with reference to FIGS. 3 through 8.
[42] Referring to FlG. 3, a crosstalk phenomenon between two speakers 310 and 320 and two ears of a listener occurs in many applied fields including a stereo sound.
[43] A crosstalk canceller cancels the crosstalk phenomenon by compensating for signals immediately before output signals are output to the two speakers 310 and 320. The crosstalk canceller is implemented as an inverse matrix of an HRTF matrix between the two speakers 310 and 320 and the two ears of the listener, as the following equation 1:
[44]
IN21(Z) N22(S)] l~N2l(z)
. . . (1) [45] where
H11(Z)1 H12(Z)1 H21(Z) 1 3 nCJ H22(Z)
, respectively, constitute the HRTF that is the acoustic transfer function between the two speakers and the two ears of the listener, as illustrated in FlG. 3. The crosstalk canceller has a secondary square matrix to generate two output signals in response to two input signals, and thus, is implemented as a structure illustrated in FlG. 4. Generally, the structure illustrated in FlG. 4 is referred to as a lattice structure. Here,
K11(Z) 1 K12(Z)1 K21(Z)1 3 nCi K22(Z)
, respectively, are elements of a secondary square matrix of equation 1. [46] Basically, a stereo speaker system is mounted on current digital media products. In portable devices such as personal multimedia players (PMP) or personal digital assistants (PDA) as well as TVs, two speakers are adjacent to each other. FlG. 5 illustrates the stereo speaker system having the two speakers 310' and 320' adjacent to each other such that sounds approximately originate from the same location. As illustrated in FlG. 5, when speakers 310' and 320' are adjacent to each other, HRTF pairs
(H1 (Z)1 H2(Z)) between the two ears of the listener in one speaker 310' have similar acoustic characteristics due to the fact that the sounds of the both speakers 310' and 320' originate from approximately the same location. The closer the two speakers 310' and 320' are to one another, the higher the correlation between
H1(Z) and
H2(Z) Here,
H1(Z) is an HRTF of an ear that is close to the speakers 310' and 320', and
H2(Z) is an HRTF of an ear that is distant from the speakers 310' and 320'. [47] Considering that the correlation between the HRTF pairs
(H1 (Z), H2(Z)) is high, an assumption of the following equation 2 can be made [48]
. . . (2) [49] That is, assuming that there is only a difference between a gain and a delay between
H1(Z) and
H2(Z)
H2(Z) can be obtained using equation 2 by adjusting the gain and the delay from
H1(Z)
. Here, a gain value (α) is a level difference between the two HRTFs, and a delay value (β) is a delay difference between the two HRTFs. The level difference (α) between the two HRTFs is obtained from the difference between maximum values of impulse responses of the two HRTFs
between the speakers 310' and 320' and the two ears of the listener, or the difference between root mean square (RMS) values. The delay difference (β) between the two HRTFs
is obtained from a time when a cross-correlation function of the impulse responses of the two HRTFs
between the speakers 310' and 320' and the two ears of the listener becomes a maximum. [50] Accordingly, when the two speakers 310 and 320 are disposed symmetrically about
the listener as illustrated in FlG. 3, the crosstalk canceller is obtained by the above equation 1. When the two speakers 310' and 320' are disposed asymmetrically about the listener as illustrated in FlG. 5, the assumption of the following equations 3 and 4 can be made based on equation 2. HRTFs
(H21(Z) 1 H12(Z)) about the ear that is distant from the speakers 310' and 320' can be obtained from H HRRTTFFss
( (HH1111((ZZ)),1 H M2222((ZZ)j)) about the ear that is close to the speakers 310' and 320' as indicated by the following equations 3 and 4:
[51]
H21(Z) ≡ aιz-*Hn{z)
. . . (3) [52]
. . (4) [53] where,
Qf1 and α2 are a level difference between two HRTFs, and β2 are a delay difference between the two HRTFs, as mentioned in equation 2.
[54] By using equations 3 and 4, equation 1 can be approximated as the following eeqquuaattiioonn 5: [55]
[56] where
[57]
11 JZ1 ,UWs(O- JY1 a<z>JΪ2l<r)
. . . (5)
[58] Equation 5 that represents the approximated crosstalk canceller can be expressed as the block diagram of FIG. 6.
[59] The block diagram of the crosstalk canceller of FIG. 6 can be expanded as the block diagram of FIG. 7. That is, the crosstalk canceller includes first and second gain units, first and second delay units, and first and second filters. As a result, while the crosstalk canceller of the lattice structure of FIG. 4 performs a convolution operation four times with respect to four filters, a crosstalk canceller of the present embodiment performs the convolution operation only twice with respect to the two filters such that the amount of computation and a size of a memory can be reduced.
[60] Additionally, when the two speakers 310 and 320 are disposed symmetrically about the listener (FIG. 3), the symmetrical crosstalk canceller can employ the same method as the asymmetrical crosstalk canceller used when the two speakers 310' and 320' are disposed asymmetrically about the listener (FIG. 5).
[61] A general crosstalk canceller becomes Hl l(z) = H22(z) and H21(z) = H12(z). Accordingly, the crosstalk canceller can be expressed as the following expression 6:
[62]
. . . (6) [63] By using an assumption of the following equation 7, equation 6 is approximated as equation 8 (below): [64]
. . . (7) [65]
. . . (8)
[66] Equation 8 represents an approximated symmetrical crosstalk canceller and can be expressed as the block diagram of FIG. 8. The approximated symmetrical crosstalk
canceller of the block diagram of FlG. 8 can be expanded as the block diagram of a symmetrical crosstalk canceller of FlG. 9.
[67] Referring to FlG. 9, the symmetrical crosstalk canceller includes a first signal processing unit 910 and a second signal processing unit 920. The first signal processing unit 910 includes first and second filter units 912 and 914 to adjust frequency characteristics of input left and right channel signals
and
(B R)
, respectively.
[68] The second signal processing unit 920 includes first and second gain units 922 and
926 to adjust gains of output signals of the first and second filter units 912 and 914, respectively, with predetermined gain values. The second signal processing unit 920 further includes first and second delay units 924 and 928 to delay the signals that are gain-adjusted in the first and second gain units 922 and 926, respectively, with predetermined delay values. A first addition unit 929-1 adds an output signal of the first filter unit 912 and a gain and delay-adjusted output signal of the second filter unit 914. A second addition unit 929-2 adds an output signal of the second filter unit 914 and a gain and delay-adjusted output signal of the first filter unit 912.
[69] The symmetrical crosstalk canceller in which the first filter unit 912 (i.e., Cl(z)) is connected to an input terminal, as illustrated in FIGS. 8 and 9, and the asymmetrical crosstalk canceller in which the first and second filter units (i.e., Cl l(z),C22(z)) are connected to an output terminal, as illustrated in FlG. 7, produce about the same result.
[70] As a result, the crosstalk canceller of the embodiments of the present general inventive concept are represented by FIGS. 7 and 9. As illustrated in FIGS. 7 and 9, in the crosstalk canceller of the present embodiment compared to the conventional lattice structure, a number of filters is reduced (from 4 to 2) such that the convolution operation is performed only twice and remaining signals can be processed using simple gain values and simple delay values. Accordingly, in the crosstalk canceller according to the embodiments of the present general inventive concept, an amount of computation required in the conventional crosstalk canceller structure can be decreased by 50%. In addition, since the number of filters is reduced, a size of a memory can be reduced.
[71] It should be understood that although the embodiments of the present general inventive concept have been described with reference to a listener and two ears of the listener, the crosstalk canceller of the embodiments of the present general inventive concept may be used to cancel crosstalk occurring about a listening point of a stereo
sound generation system and/or a virtual surround system. The listening point may refer to a position where a listener perceives optimal stereo effect, and this can be approximated using, for example, a dummy head. Thus, a listener need not necessarily be present when the crosstalk canceller and the stereo sound generation system operate.
[72] In another embodiment of the present general inventive concept, gain units, delay units, and filter units in a crosstalk canceller can be obtained directly not only from HRTFs using equations 1 through 8, but also from the conventional lattice structure. For example, as illustrated in FIG. 4, if there are four filter coefficients (Kll(z), K12(z ), K21(z), K22(z)) of a lattice structure designed in advance, the gain units, the delay units, and the filter units can be obtained from the four filter coefficients (Kl l(z), K12(z), K21(z), K22(z)). That is, referring to FIG. 7, the first filter unit becomes Kl l(z) and the second filter unit becomes K22(z). The first gain unit and the first delay unit are obtained from a time when a difference of maximum values (or RMS values) between the filter coefficients K22(z) and K21(z) and a cross-correlation function of the filter coefficients K22(z) and K21(z) becomes a maximum. The second gain unit and the second delay unit are obtained from a time when a difference of maximum values (or RMS values) between the filter coefficients Kl l(z) and K12(z) and a cross- correlation function of the filter coefficients Kl l(z) and K12(z) becomes a maximum.
[73] In another embodiment of the present general inventive concept, a widening filter based on a filter infinite impulse response filter (IIR) is designed by performing convolution of a binaural synthesis portion and a crosstalk canceller in a stereo sound generation system. When a number of virtual speakers is 2N, the binaural synthesis portion may be a square matrix of the size of 2 and the crosstalk canceller portion may also be a square matrix of the size of 2 such that the widening filter becomes a square matrix of the size of 2 which is a matrix form obtained by multiplying the two matrices corresponding to the binaural synthesis portion and the crosstalk canceller portion. Accordingly, the structure illustrated in FIGS. 7 and 9 can also be applied to the stereo sound generation apparatus to perform convolution of a square matrix structure of the size of 2 in relation to 2-channel input signals.
Industrial Applicability
[74] The present general inventive concept can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include readonly memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the Internet). The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored
and executed in a distributed fashion. Also, functional programs, codes, and code segments for accomplishing the present general inventive concept can be easily construed by programmers skilled in the art to which the present general inventive concept pertains.