CN1255255A - Echo reducing phone with state machine controlled switches - Google Patents
Echo reducing phone with state machine controlled switches Download PDFInfo
- Publication number
- CN1255255A CN1255255A CN 98804832 CN98804832A CN1255255A CN 1255255 A CN1255255 A CN 1255255A CN 98804832 CN98804832 CN 98804832 CN 98804832 A CN98804832 A CN 98804832A CN 1255255 A CN1255255 A CN 1255255A
- Authority
- CN
- China
- Prior art keywords
- state machine
- microphone
- finite state
- signal
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000005540 biological transmission Effects 0.000 claims description 17
- 238000000034 method Methods 0.000 claims description 12
- 230000007774 longterm Effects 0.000 claims description 4
- 206010019133 Hangover Diseases 0.000 claims 5
- 238000005070 sampling Methods 0.000 claims 3
- 238000001514 detection method Methods 0.000 description 3
- 238000002592 echocardiography Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Telephone Function (AREA)
- Circuit For Audible Band Transducer (AREA)
- Interconnected Communication Systems, Intercoms, And Interphones (AREA)
- Selective Calling Equipment (AREA)
Abstract
本发明的目的为减少串音引入的回声。上述如何减少串音引入回声的问题是用对麦克风与扬声器引入受状态机控制的开关解决的,该状态机取来自麦克风的信号的信号能量、来自麦克风的信号的VAD标志、到达扬声器的信号的信号能量及到达扬声器的信号的VAD标志作为输入。
The purpose of the present invention is to reduce the echo introduced by crosstalk. The above-mentioned problem of how to reduce crosstalk and introduce echo is solved by introducing a switch controlled by a state machine to the microphone and the speaker. The state machine takes the signal energy of the signal from the microphone, the VAD flag of the signal from the microphone, and the The signal energy and the VAD signature of the signal reaching the loudspeaker are taken as input.
Description
本发明一般涉及电信,更具体地涉及因特网上的话音通信的语音处理。This invention relates generally to telecommunications, and more particularly to speech processing for voice communications over the Internet.
典型的因特网电话利用带声板、麦克风及两个扬声器的PC。麦克风与扬声器通常互相靠近地放在桌上。这一配置导致在接收机端听起来象回声的可观的串音量。为了使因特网电话能使用必须抑制这一回声。A typical Internet phone utilizes a PC with a soundboard, microphone and two speakers. The microphone and speaker are usually placed close to each other on a table. This configuration results in a considerable amount of crosstalk that sounds like an echo at the receiver. This echo must be suppressed in order for Internet telephony to be usable.
在GSM中,已知用VAD(话音活动检测)来检测移动电话用户是否在讲话。利用这一信息能减少发射话音时的带宽。在按照VOX原理(话音操作的传输)的不连续语音编码中,VAD单元负责检测所接收的声音序列是否表示人的语音。VAD单元能取两种不同的状态,其中第一状态表示声音序列为人的话音而另一状态表示声音序列不是人的话音。In GSM it is known to use VAD (Voice Activity Detection) to detect whether a mobile phone user is speaking or not. Utilizing this information reduces bandwidth when transmitting voice. In discontinuous speech coding according to the VOX principle (Voice Operated Transmission), the VAD unit is responsible for detecting whether the received sound sequence represents human speech. The VAD unit can take two different states, where the first state indicates that the sound sequence is human voice and the other state indicates that the sound sequence is not human voice.
如果VAD单元检测出给定的声音序列表示人的话音,该VAD单元将发布第一状态信号到语音编码单元,后者将该声音序列编码在语音帧中。反之,如果给定的声音序列表示人的语音以外的东西,该VAD单元将发布第二状态信号给SID(静默描述符)单元。所述SID单元每N个帧发送一个SID帧。在其余N-1个可能发送帧的时机中不发送任何东西。SID帧包括发送方上的关于估计的背景噪声及估计的噪声频谱的信息。用这一过程能节省电池功率与无线电带宽。If the VAD unit detects that a given sound sequence represents human speech, the VAD unit will issue a first status signal to the speech encoding unit, which encodes the sound sequence in a speech frame. Conversely, if the given sound sequence represents something other than human speech, the VAD unit will issue a second status signal to the SID (Silence Descriptor) unit. The SID unit sends a SID frame every N frames. Nothing is sent on the remaining N-1 possible occasions to send a frame. The SID frame includes information on the sender about the estimated background noise and the estimated noise spectrum. Using this process saves battery power and radio bandwidth.
当SID单元从生成第一状态信号改变到生成第二状态信号时,便是从检测到语音到检测到非语音的时间间隔,通常施加所谓的释放延迟,在这一期间语音编码单元继续发送语音帧,似乎所接收的声音序列依然是人的语音。如果释放延迟时间之后VAD单元仍检测到非语音,便生成SID帧。这一过程的原因在于不应将人的语言中的字之间的短的停顿解释为非语音,而语音帧发生器仍应活跃。When the SID unit changes from generating the first state signal to generating the second state signal, that is the time interval from the detection of speech to the detection of non-speech, usually imposing a so-called release delay, during which time the speech encoding unit continues to send speech frame, it seems that the received sound sequence is still human speech. If the VAD unit still detects non-speech after the release delay time, a SID frame is generated. The reason for this process is that short pauses between words in human language should not be interpreted as non-speech, and the speech frame generator should still be active.
本发明公开了减少串音引入的回声的方法与装置。The invention discloses a method and a device for reducing the echo introduced by crosstalk.
因而本发明的目的为减少串音引入的回声。It is therefore an object of the present invention to reduce echoes introduced by crosstalk.
上述关于如何减少串音引入的回声问题是通过向麦克风与扬声器引入受状态机控制的开关解决的,该状态机以来自麦克风的信号的信号能量、来自麦克风的信号的VAD标志、到扬声器的信号的信号能量及到扬声器的信号的VAD标志作为输入。The above-mentioned problem of how to reduce the echo introduced by crosstalk is solved by introducing a switch to the microphone and speaker controlled by a state machine, which uses the signal energy of the signal from the microphone, the VAD flag of the signal from the microphone, the signal The signal energy and the VAD flag of the signal to the loudspeaker are taken as input.
本发明的优点之一是明显地减少了串音引入的回声而无需更多的计算能力。One of the advantages of the present invention is that crosstalk-induced echoes are significantly reduced without requiring more computing power.
从下面给出的详细描述中,对于熟悉本技术的人员其它优点将是显而易见的。Other advantages will be apparent to those skilled in the art from the detailed description given below.
从下面给出的详细描述中,本发明的进一步应用范围将是显而易见的。然而,应理解,本发明的最佳实施例只是示例性的,因为从这一详细描述中,在本发明范围内的各种改变与修正对熟悉本技术的人员是显而易见的。Further scope of applicability of the present invention will be apparent from the detailed description given below. It should be understood, however, that the preferred embodiment of the invention is illustrative only, since various changes and modifications within the scope of the invention will become apparent to those skilled in the art from this detailed description.
图1示出本发明的一个实施例的方框图。Figure 1 shows a block diagram of one embodiment of the present invention.
图2示出有限状态图。Figure 2 shows a finite state diagram.
在图1中麦克风101连接在GSM编码器102上。在信号到达GSM编码器102之前,它已按照图1中未示出的已知技术被数字化与抽样。从GSM编码器102首先通过能启动或截止传输的开关103将编码信号传输给图中未示出的接收机。从GSM编码器102将ACFE(自动校正系数)传递给VAD单元104。从GSM帧还传递长期预测器滞后值NE给VAD单元104。从VAD单元104将表示信号的能量的值PE传递给有限状态机105。VAD单元104还计算指示VAD单元104是否已检测到人的语音的标志FE。将标志FE传递给有限状态机105。如果检测到人的话音则标志FE为真。In FIG. 1 a microphone 101 is connected to a GSM encoder 102 . Before the signal reaches the GSM encoder 102, it is digitized and sampled according to known techniques not shown in FIG. The encoded signal is first transmitted from the GSM encoder 102 to a receiver (not shown in the figure) via a switch 103 which can activate or deactivate the transmission. ACFE (Auto Correction Coefficient) is passed from the GSM encoder 102 to the VAD unit 104 . The long-term predictor lag value NE is also passed to the VAD unit 104 from the GSM frame. A value PE representing the energy of the signal is passed from the VAD unit 104 to the finite state machine 105 . The VAD unit 104 also calculates a flag FE indicating whether the VAD unit 104 has detected human speech. The flag FE is passed to the finite state machine 105 . Flag FE is true if human voice is detected.
图1中还有从发送者(未示出)接收并传递给GSM解码器106的抽样的编码话音信号。从GSM解码器106首先通过能使或截止话音信号到达扬声器的开关108将解码的抽样话音信号传递给扬声器107。按照图1中未示出的已知技术,为了使扬声器能正常工作,需要D/A转换。从所接收的编码话音信号中推导出长期预测器滞后值N0并传递给VAD单元109。Also shown in FIG. 1 is a sampled encoded speech signal received from a sender (not shown) and passed to the GSM decoder 106 . From the GSM decoder 106 the decoded sampled voice signal is first delivered to the speaker 107 through a switch 108 which enables or disables the voice signal to the speaker. According to known techniques not shown in Fig. 1, a D/A conversion is required for the loudspeaker to function properly. The long-term predictor lag value N 0 is derived from the received encoded speech signal and passed to the VAD unit 109 .
由于GSM帧的解码通常不包含使用VAD单元,GSM解码器缺少用于计算ACF的必要参数。为了能计算ACF,自相关单元110接收来自GSM解码器106的数据及计算传递给VAD单元109的ACFD。自相关单元110为标准中所描述的GSM编码器的一部分。从VAD单元109将到达扬声器的话音信号中的能量的指示值PD传递给有限状态机105。从VAD单元109还将标志FD传递给所述有限状态机,指示VAD单元是否检测到人的话音。Since the decoding of GSM frames usually does not involve the use of VAD units, GSM decoders lack the necessary parameters for calculating the ACF. To be able to calculate the ACF, the autocorrelation unit 110 receives the data from the GSM decoder 106 and calculates the ACF D which is passed to the VAD unit 109 . The autocorrelation unit 110 is part of the GSM encoder described in the standard. An indication value PD of the energy in the speech signal arriving at the loudspeaker is passed from the VAD unit 109 to the finite state machine 105 . A flag FD is also passed from the VAD unit 109 to the finite state machine, indicating whether the VAD unit detected human speech or not.
有限状态机106包括根据输入到有限状态机的值设定开关103与109的功能。The finite state machine 106 includes the function of setting the switches 103 and 109 according to the values input to the finite state machine.
图2中示出图1中的有限状态机的状态与可能的转移。FIG. 2 shows the states and possible transitions of the finite state machine in FIG. 1 .
状态之间的转移是按照以下描述进行的。利用下述定义:Transitions between states are performed as described below. Use the following definitions:
·FE:编码时的VAD标志· F E : VAD flag when encoding
·FD:解码时的VAD标志F D : VAD flag when decoding
·PE:编码时的信号能量P E : signal energy during encoding
·PD:解码时的信号能量P D : signal energy when decoding
·释放延迟:从决定开关方向到进行开关的时间。这一时间必须足够长以补偿室内回声。·Release delay: the time from when the switch direction is determined to when the switch is made. This time must be long enough to compensate for room echoes.
201.FE=1AND FD=0 OR FE=1及PE>PD,释放延迟=0201. FE = 1AND F D = 0 OR FE = 1 and P E > P D , release delay = 0
202.FE=0,释放延迟=600ms202. F E = 0, release delay = 600ms
203.FD=1 AND FE=0 OR FD=1及PD>PE,释放延迟=0203. F D =1 AND FE =0 OR F D =1 and P D >P E , release delay=0
204.FD=0,释放延迟=600ms204. F D = 0, release delay = 600ms
205.FD=1 AND PD>PE,释放延迟=600ms205.F D =1 AND P D >P E , release delay=600ms
205.FE=1 AND PE>PD,释放延迟=600ms 205.FE = 1 AND P E > P D , release delay = 600ms
在状态TRANSMITTING(传输)207中,启动控制从麦克风传输话音信号的开关及截止控制传输话音信号到扬声器的开关。在状态RECEIVING(接收)208中,截止控制从麦克风传输话音信号的开关及启动控制向扬声器传输的开关。在IDLE(空闲)状态209中,两个开关都截止。In
这样描述了本发明,显而易见可用多种方式改变本发明。而这种改变不认为是偏离本发明的精神与范围,对于熟悉本技术的人员显而易见的所有这些修改都旨在包含在以下的权利要求的范围之内。The invention thus being described, it will be obvious that the invention may be varied in various ways. While such changes are not to be regarded as a departure from the spirit and scope of the invention, all such modifications as would be obvious to one skilled in the art are intended to be embraced within the scope of the following claims.
Claims (9)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE97008734 | 1997-03-11 | ||
SE9700873A SE511650C2 (en) | 1997-03-11 | 1997-03-11 | Method and apparatus for reducing echo in a telephone application |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1255255A true CN1255255A (en) | 2000-05-31 |
Family
ID=20406109
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 98804832 Pending CN1255255A (en) | 1997-03-11 | 1998-02-24 | Echo reducing phone with state machine controlled switches |
Country Status (9)
Country | Link |
---|---|
EP (1) | EP0974205A1 (en) |
JP (1) | JP2001514823A (en) |
CN (1) | CN1255255A (en) |
AU (1) | AU735505B2 (en) |
BR (1) | BR9808240A (en) |
CA (1) | CA2283590A1 (en) |
SE (1) | SE511650C2 (en) |
TW (1) | TW407435B (en) |
WO (1) | WO1998040974A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101145803B (en) * | 2007-09-06 | 2012-09-05 | 杭州华三通信技术有限公司 | A method and device for separating echo reflection |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1163787A2 (en) * | 1999-03-15 | 2001-12-19 | Vocaltec Communications Ltd. | Echo suppression device and method for performing the same |
US6754337B2 (en) * | 2002-01-25 | 2004-06-22 | Acoustic Technologies, Inc. | Telephone having four VAD circuits |
US7020257B2 (en) * | 2002-04-17 | 2006-03-28 | Texas Instruments Incorporated | Voice activity identiftication for speaker tracking in a packet based conferencing system with distributed processing |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4897832A (en) * | 1988-01-18 | 1990-01-30 | Oki Electric Industry Co., Ltd. | Digital speech interpolation system and speech detector |
GB2256351B (en) * | 1991-05-25 | 1995-07-05 | Motorola Inc | Enhancement of echo return loss |
FI110826B (en) * | 1995-06-08 | 2003-03-31 | Nokia Corp | Eliminating an acoustic echo in a digital mobile communication system |
-
1997
- 1997-03-11 SE SE9700873A patent/SE511650C2/en not_active IP Right Cessation
-
1998
- 1998-02-24 CN CN 98804832 patent/CN1255255A/en active Pending
- 1998-02-24 CA CA002283590A patent/CA2283590A1/en not_active Abandoned
- 1998-02-24 AU AU64264/98A patent/AU735505B2/en not_active Ceased
- 1998-02-24 EP EP98909895A patent/EP0974205A1/en not_active Withdrawn
- 1998-02-24 WO PCT/SE1998/000332 patent/WO1998040974A1/en not_active Application Discontinuation
- 1998-02-24 BR BR9808240-0A patent/BR9808240A/en not_active IP Right Cessation
- 1998-02-24 JP JP53949498A patent/JP2001514823A/en active Pending
- 1998-03-06 TW TW87103302A patent/TW407435B/en not_active IP Right Cessation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101145803B (en) * | 2007-09-06 | 2012-09-05 | 杭州华三通信技术有限公司 | A method and device for separating echo reflection |
Also Published As
Publication number | Publication date |
---|---|
JP2001514823A (en) | 2001-09-11 |
WO1998040974A1 (en) | 1998-09-17 |
SE511650C2 (en) | 1999-11-01 |
TW407435B (en) | 2000-10-01 |
AU6426498A (en) | 1998-09-29 |
BR9808240A (en) | 2000-05-16 |
SE9700873L (en) | 1998-09-12 |
SE9700873D0 (en) | 1997-03-11 |
EP0974205A1 (en) | 2000-01-26 |
AU735505B2 (en) | 2001-07-12 |
CA2283590A1 (en) | 1998-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1128512C (en) | Method of providing hands-free telephone operation in a portable communication device | |
EP2130203B1 (en) | Method of transmitting data in a communication system | |
EP1346553B1 (en) | Audio signal quality enhancement in a digital network | |
CN101502089B (en) | Method for carrying out an audio conference, audio conference device, and method for switching between encoders | |
US7907977B2 (en) | Echo canceller with correlation using pre-whitened data values received by downlink codec | |
GB2388001A (en) | Compensating for beamformer steering delay during handsfree speech recognition | |
US20060067512A1 (en) | Speakerphone having improved outbound audio quality | |
CN1255255A (en) | Echo reducing phone with state machine controlled switches | |
US7233893B2 (en) | Method and apparatus for transmitting wideband speech signals | |
US8509460B2 (en) | Sound mixing apparatus and method and multipoint conference server | |
JPH1032804A (en) | Video conference equipment | |
KR101493459B1 (en) | APPARATUS AND METHOD FOR IMPROVING SOUND QUALITY OF FEM RAD | |
WO2004012426A1 (en) | System and method for speakerphone operation in a communications device | |
JP2010028570A (en) | Interactive hands-free speaking speed converting speech apparatus | |
JP2974427B2 (en) | Voice communication system and voice communication device | |
JPH09200308A (en) | Communication terminal equipment | |
JPH1032638A (en) | Video conference system | |
MXPA99008026A (en) | Echo reducing phone with state machine controlled switches | |
JPS6338339A (en) | Digital voice communication system | |
Valin et al. | RFC 6366: Requirements for an Internet Audio Codec | |
JPH11331320A (en) | Interactive speech rate conversion apparatus and method | |
JP2000101683A (en) | Voice communication equipment | |
JP2009302984A (en) | Voice communication apparatus and voice communication method | |
AU2012200349A1 (en) | Method of transmitting data in a communication system | |
KR20050070338A (en) | Apparatus and its method for providing echo cancellation using delay prediction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |