JP2012105115A

JP2012105115A - Echo canceller, echo cancellation program, and telephone apparatus

Info

Publication number: JP2012105115A
Application number: JP2010252518A
Authority: JP
Inventors: Shinsuke Takada; 真資高田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2010-11-11
Filing date: 2010-11-11
Publication date: 2012-05-31

Abstract

PROBLEM TO BE SOLVED: To provide an echo canceller, echo cancellation program, and a telephone apparatus for removing an echo generated on a telephone apparatus stably and continuously.SOLUTION: An echo canceller removes an echo component from near-end input signals. The echo canceller includes: echo cancelling means for generating a signal cancelling an echo component of near-end input signals using a pseudo echo generated by pseudo echo generation means; sound switch means for attenuating the near-end input signals using an attenuator; switching means for selecting a post-cancellation signal generated by the echo cancellation means or a signal generated by the sound switch means, and sending the signal to a far-end side; call state determination means for determining the call state between a far-end speaker and a near-end speaker using the far-end input signal and the near-end input signal; and switch controlling means for controlling the switching means in response to the determination result of the call state determination means.

Description

この発明は、エコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置に関し、例えば、ハンズフリー機能に対応した電話装置に適用し得る。 The present invention relates to an echo canceller, an echo cancellation program, and a telephone device, and can be applied to, for example, a telephone device that supports a hands-free function.

近年ＶｏＩＰの普及により、通話料が安価になるとともに通信速度が高速になったことで従来はそれほど一般的ではなかった電話会議装置やテレビ電話装置が急速に普及している。 In recent years, with the spread of VoIP, call charges have become cheaper and communication speeds have become faster, so telephone conference devices and videophone devices, which have not been so common in the past, are rapidly spreading.

このような電話装置が普及するにともない、利用者が増えることで新たな技術的要求が生まれている。従来は、電話会議はあくまでも高価で特殊な利用方法であったから、そのような装置を利用できる利用者の使う装置は利用者の使い勝手を重視し、今日のような一般的な利用者が用いるよりもずっと大規模で高価な会議システムが用いられてきた。 As such telephone devices become widespread, new technical demands are generated as the number of users increases. Conventionally, since teleconference was an expensive and special method of use, devices used by users who can use such devices emphasize the convenience of users, rather than those used by general users like today. However, much larger and more expensive conference systems have been used.

これらの会議システムを高価にしてしまう原因のひとつにハンズフリー機能の実現がある。これは、非特許文献１に記載されるようにスピーカの音がマイクに再入力されてハウリングを発生してしまい、通話が不可能になってしまうのを防止する機能を導入する必要があるからである。 One of the reasons for making these conference systems expensive is the implementation of a hands-free function. This is because, as described in Non-Patent Document 1, it is necessary to introduce a function for preventing the sound of the speaker from being re-input to the microphone and generating howling and making the call impossible. It is.

また、会議電話を行いたい一方で、使い勝手よりもむしろコスト面を重視するような利用者も存在する。例えば、会議場への集合のためのコストを第一優先し、利用者の使い勝手の優先度を下げるような場合があげられる。この場合、上述のような高価な装置を用いない代わりに、ハンズフリーの状態を実現したい利用者が、会議中にずっとヘッドセット付けて会議をおこなっていた。しかし、ヘッドセットは装着すると、違和感のために疲労がつよくでたり、イヤーパッドなど耳に触れる部分のこすれによって皮膚を傷めたりするなど、利用者への相当の負担があった。 There are also users who want to make a conference call but attach importance to cost rather than usability. For example, there is a case where the cost for the meeting place is given first priority and the user's usability is lowered. In this case, instead of using an expensive apparatus as described above, a user who wants to realize a hands-free state has been holding a conference with a headset during the conference. However, wearing a headset puts a considerable burden on the user, such as fatigue due to a sense of incongruity, and damage to the skin caused by rubbing the earpad and other parts that touch the ear.

会議中の利用者は、会議中のメモを取ったり、配布された資料の一部に書き込みを加えるなどの作業をする場合もあり、両手を自由にするハンズフリー機能の必要性は自然かつ必須な機能なのであり、今後もその要求は途絶えることはない。 Users during the meeting may take notes during the meeting or write down some of the distributed materials, so the need for a hands-free function that frees both hands is natural and essential It's a feature, and the demand will never stop.

さらに、近年、働き方の多様性、効率性の観点や、各種災害時における企業の業継続の確保の観点から、職場に通わず在宅や自宅近隣のサテライトオフィスで仕事を行ういわゆるテレワークが普及し始めている。これらテレワークの普及は今後急速に広がることが考えられる。このようなテレワークの場合、会議の主な形式は直接対話よりむしろ通信機を経由した電話会議、テレビ会議がおもになることが十分予想される。 Furthermore, in recent years, so-called telework, which does not go to the workplace and works at satellite offices near homes, has become widespread from the viewpoints of diversity and efficiency of work styles and ensuring business continuity of companies in the event of various disasters. I'm starting. The spread of telework can be expected to spread rapidly in the future. In the case of such telework, it is fully expected that the main form of the conference will be telephone conference and video conference via a communication device rather than direct dialogue.

また、すでに職場同士を映像や音声で常時接続してオフィスの地理的分断感を解消するシステムも利用され始めており、ますます遠隔の利用者同士が自然なコミュニケーションを行いたいという要求が高まることが予想される。しかし、その一方で、たとえば従来技術の延長では、会議中だけヘッドセットを装着すればよかった利用者は、始終ヘッドセットを装着する必要に迫られてしまう。 In addition, systems that always connect offices with video and audio to eliminate the sense of geographical division in the office have begun to be used, and there is an increasing demand for remote users to communicate naturally. is expected. On the other hand, for example, in the extension of the prior art, a user who only needs to wear a headset only during a meeting is required to wear the headset all the time.

上述のような常時接続でハンズフリーの状態を維持するなら、利用者のすべてが常時ヘッドセットの装着を強いられるか、高価な装置の利用を強いられてしまう。このような問題への対策として従来、非特許文献２に記載されるような音声スイッチが用いられてきた。 If a hands-free state is maintained with the above-described constant connection, all users are forced to wear a headset at all times or use an expensive device. Conventionally, a voice switch as described in Non-Patent Document 2 has been used as a countermeasure to such a problem.

非特許文献２に記載された音声スイッチは、送信・受信の各信号線のパワーを比較し、パワーの大きいほうの信号を透過させ、パワーの小さい方の信号を遮断されるものであった。非特許文献２の音声スイッチを用いた方法は比較的安価に装置を作成することができ、現在も広く安価なハンズフリー装置の実現に役立っている。 The voice switch described in Non-Patent Document 2 compares the power of each signal line for transmission and reception, transmits the signal with the higher power, and blocks the signal with the lower power. The method using the voice switch of Non-Patent Document 2 can create a device at a relatively low cost, and is currently useful for realizing a wide and inexpensive hands-free device.

しかしながら、非特許文献２に記載されている音声スイッチの技術では、必然的に片方向通話の切り替えが基本であるから、検出切替えの反応時間までの間、片方の通話は遮断される。通常会話は交互になされるので、通話方向が反転するまでの反転検出切り替えの検出時間までは直前の通話方向が維持されるので、発話の最初の部分の欠損が発生してしまうことが多かった。この欠点を改善するため、特許文献１には送信状態、受信状態のほかにその中間の中間状態という状態を設け、発話の最初及び最後の欠損を軽減しようとする技術が開示されている。 However, the voice switch technology described in Non-Patent Document 2 inevitably involves switching of a one-way call, so that one of the calls is blocked until the detection switching reaction time. Since normal conversations are alternated, the previous call direction is maintained until the detection time of reversal detection switching until the call direction is reversed, so the first part of the utterance is often lost. . In order to remedy this drawback, Patent Document 1 discloses a technique for providing an intermediate state between the transmission state and the reception state in addition to the transmission state and the reception state so as to reduce the first and last missing speech.

しかしながら、この特許文献１の記載技術では、上述の中間状態で、送信経路、受信経路の両方に減衰を加えるものであった。つまり発話の最初及び最後の欠損を軽減する代償として本来透過すべき信号にも減衰を加えるので、通話音声は音量が激しく上下することになり、聞きづらさがともない利用者の疲労感が伴うものであった。 However, in the technique described in Patent Document 1, attenuation is applied to both the transmission path and the reception path in the intermediate state described above. In other words, since the signal that should be transmitted originally is attenuated as a price to reduce the first and last deficits in the speech, the volume of the call voice increases and decreases sharply, and the user feels tired with difficulty in hearing. Met.

この一方で、音声スイッチの発話の最初の部分が欠損するという欠点を解消できるエコーキャンセラ技術もすでに公知のものである。エコーキャンセラは信号処理プロセッサなどで構成される適応フィルタをもち、受信信号とマイクロホンに再入力されたエコーを用いて計算処理で擬似エコーを作成し、エコーと擬似エコーを相殺してスピーカからマイクロホンに再入力された信号を除去し、ハウリングを除去するものである。 On the other hand, an echo canceller technique that can eliminate the defect that the first part of the speech of the voice switch is lost is already known. The echo canceller has an adaptive filter composed of a signal processor, etc., creates a pseudo echo by calculation processing using the received signal and the echo re-input to the microphone, cancels the echo and the pseudo echo, and transmits from the speaker to the microphone. This removes the re-input signal and removes howling.

しかし、エコーキャンセラは適応フィルタが上記のスピーカからマイクロホンにいたるまでの音響経路（エコーパス）が固定であることを大前提にしている。したがって、従来のエコーキャンセラでは、話者が身じろぎをしたり、移動したりの原因でエコーパスが変化するとエコー打消し性能がひどく劣化し、再学習するまでエコーを消せなくなるなどの欠点があった。さらに、従来のエコーキャンセラでは、エコーと話者の音声が混在すると音声までも劣化させてしまうため、非常に高精度の通話状態判定機能を併せ持つ必要があり、通話状態判定機能によって適応フィルタのフィルタ係数更新を停止するなどの必要があった。 However, the echo canceller presupposes that the acoustic path (echo path) from the adaptive filter to the microphone is fixed. Therefore, the conventional echo canceller has a drawback in that if the echo path is changed due to the speaker's comforting or moving, the echo cancellation performance is severely deteriorated and the echo cannot be erased until re-learning. Furthermore, since the conventional echo canceller mixes echo and the voice of the speaker, even the voice is degraded. Therefore, it is necessary to have a highly accurate call state determination function. There was a need to stop the coefficient update.

さらに、従来のエコーキャンセラは、エコーと受信信号の両方が精度良く入手できて初めて擬似エコーを作成できるので、マイクロホン側（近端側）の背景雑音が大きいときは効果が大きく劣化する問題がある。 Furthermore, since the conventional echo canceller can create a pseudo echo only when both the echo and the received signal can be obtained with high accuracy, there is a problem that the effect is greatly deteriorated when the background noise on the microphone side (near end side) is large. .

上述のような問題を解決する技術として、従来特許文献２〜４の記載技術がある。 As a technique for solving the above problems, there are conventional techniques described in Patent Documents 2 to 4.

特許文献２には、音声スイッチとエコーキャンセラの両方の利点を利用しようという技術が開示されている。 Japanese Patent Application Laid-Open No. 2004-228561 discloses a technique for utilizing the advantages of both a voice switch and an echo canceller.

また、特許文献３には、エコーキャンセラと音声スイッチを併用してエコーキャンセラの性能劣化分をエコーキャンセラの出力後段に設けた普声スイッチによって補う技術が開示されている。 Patent Document 3 discloses a technique in which an echo canceller and a voice switch are used together to compensate for the performance deterioration of the echo canceller by a voice switch provided at the output stage of the echo canceller.

さらに、特許文献４には、音声スイッチだけを用いる方法ではあるが、会話音声の性質を利用して音声スイッチの切り替え特性を改良し通話の自然性を確保しようとする技術が開示されている。 Further, Patent Document 4 discloses a technique that uses only the voice switch but improves the switching characteristics of the voice switch by using the nature of the conversational voice to ensure the naturalness of the call.

特開平１１−８８４９７号公報Japanese Patent Laid-Open No. 11-88497 特開２００６−２７０１４７号公報JP 2006-270147 A 特開平７−１７０３３７号公報JP-A-7-170337 特開平１０−３０８８１６号公報Japanese Patent Laid-Open No. 10-308816

株式会社エー・アール・アイ編、「音響技術とオーディオ・ＡＶ用語と捕捉」、[Online]、INTERNET、[２０１０年１０月２６日検索]、＜ＵＲＬ：http://www.ari-web.com/sound/measurement/note.htm＞Edited by ARL, Inc., “Acoustic technology and audio / AV terminology and capture”, [Online], INTERNET, [October 26, 2010 search], <URL: http: //www.ari-web. com / sound / measurement / note.htm> ＩＴＵ（ＩＮＴＥＲＮＡＴＩＯＮＡＬＴＥＬＥＣＯＭＭＵＮＩＣＡＴＩＯＮＵＮＩＯＮ）編、「ＥＣＨＯＳＵＰＰＲＥＳＳＯＲＳＩＴＵ−ＴＲｅｃｏｍｍｅｎｄａｔｉｏｎＧ．１６４」、[Online]、INTERNET、[２０１０年１０月２６日検索]、＜ＵＲＬ：http://www.itu.int/rec/T-REC-G.164-198811-I/en＞ITU (INTERNATIONAL TELECOMMUNICATION UNION), “ECHO SUPPRESSORS ITU-T Recommendation G.164”, [Online], INTERNET, [October 26, 2010 search], <URL: http://www.itu.int/rec /T-REC-G.164-198811-I/en>

しかし、特許文献２の方法ではエコーキャンセラからみて、エコー経路上に音声スイッチを設けているのでエコーキャンセラにとってエコー経路が激しく非線形変動することになり、結局、音声スイッチと併用したエコーキャンセラの効果がなくなってしまい、双方向通話を実現するのが困難であった。 However, in the method of Patent Document 2, since the voice switch is provided on the echo path as seen from the echo canceller, the echo path is severely non-linearly changed for the echo canceller. Consequently, the effect of the echo canceller combined with the voice switch is effective. It was difficult to realize a two-way call.

また、特許文献３の記載技術では、ダブルトーク時以外ではエコーキャンセラと音声スイッチが従属接続されている。したがって、特許文献３の記載技術では、前段のエコーキャンセラにおいてエコー経路変動におけるエコー打消し量の劣化と、本当のダブルトーク音声（反対方向であるマイク側の話者音声）による打ち消し量の劣化との区別がつかないため、エコー経路の変化で発生したエコー消し残りパワーはそのままエコーキャンセラ出力パワーとみなされてしまう。結果、特許文献３の記載技術では、エコー経路の変動で発生したエコーはマイク側話者音声と区別なく、なんら抑圧処理を受けずにそのまま遠端側に送信されてしまうという問題があった。この間題点は常時接続通信における音響エコー抑圧やハウリング防止策としては、音響エコー経路が動くことは必ず発生するので通信系の安定に致命的な欠点となってしまうものであった。 In the technique described in Patent Document 3, the echo canceller and the voice switch are cascade-connected except during double talk. Therefore, in the technique described in Patent Document 3, the echo cancellation amount in the echo path variation is deteriorated in the preceding echo canceller, and the cancellation amount is deteriorated due to the real double talk voice (the microphone side speaker voice in the opposite direction). Therefore, the echo remaining power generated by the change in the echo path is regarded as the echo canceller output power as it is. As a result, the technique described in Patent Document 3 has a problem that an echo generated due to a change in the echo path is transmitted to the far end side without being subjected to any suppression processing without being distinguished from the microphone-side speaker voice. The problem is that acoustic echo path movement always occurs as a measure for suppressing acoustic echo and howling in always-on communication, which is a fatal defect in the stability of the communication system.

さらに、特許文献４の記載技術では、音声中の切り替えばたつきを軽減することは可能となるが、依然として同時通話が不可能であるという音声スイッチの宿命的な欠点を回避できるものではなかった。 Furthermore, although the technique described in Patent Document 4 can reduce switching flutter during voice, it has not been possible to avoid the fatal drawback of the voice switch that simultaneous calls are still impossible.

そこで、上述のような問題点に鑑みて、電話装置において発生するエコーを安定継続して除去することができるエコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置が望まれている。 Therefore, in view of the above-described problems, an echo canceller, an echo cancellation program, and a telephone device that can stably and continuously remove echoes generated in the telephone device are desired.

第１の本発明は、近端入力信号からエコー成分を除去するエコーキャンセラにおいて、（１）擬似エコー生成手段により生成した擬似エコーを用いて、上記近端入力信号の上記エコー成分を打消した信号を生成するエコー打消し手段と、（２）上記近端入力信号を減衰器により減衰させた信号を生成する音声スイッチ手段と、（３）上記エコー打消し手段が生成する打消し後信号、又は上記音声スイッチ手段が生成する信号のいずれかを選択して、遠端側へ送出するスイッチ手段と、（４）遠端入力信号と上記近端入力信号とを利用して、遠端話者と近端話者との間の通話状態を判定する通話状態判定手段と、（５）上記通話状態判定手段の判定結果に応じて、上記スイッチ手段を制御するスイッチ制御手段とを有することを特徴とする。 The first aspect of the present invention is an echo canceller for removing an echo component from a near-end input signal. (1) A signal obtained by canceling the echo component of the near-end input signal by using a pseudo echo generated by a pseudo-echo generating means. (2) a voice switch means for generating a signal obtained by attenuating the near-end input signal by an attenuator, and (3) a post-cancellation signal generated by the echo cancellation means, or A switch unit that selects one of the signals generated by the voice switch unit and transmits the signal to the far end side; and (4) a far-end speaker using the far-end input signal and the near-end input signal. A call state determination unit that determines a call state with a near-end speaker, and (5) a switch control unit that controls the switch unit according to a determination result of the call state determination unit. To do.

第２の本発明のエコーキャンセルプログラムは、近端入力信号からエコー成分を除去するエコーキャンセラに搭載されたコンピュータを、（１）擬似エコー生成手段により生成した擬似エコーを用いて、上記近端入力信号の上記エコー成分を打消した信号を生成するエコー打消し手段と、（２）上記近端入力信号を減衰器により減衰させた信号を生成する音声スイッチ手段と、（３）上記エコー打消し手段が生成する打消し後信号、又は上記音声スイッチ手段が生成する信号のいずれかを選択して、遠端側へ送出するスイッチ手段と、（４）遠端入力信号と上記近端入力信号とを利用して、遠端話者と近端話者との間の通話状態を判定する通話状態判定手段と、（５）上記通話状態判定手段の判定結果に応じて、上記スイッチ手段を制御するスイッチ制御手段として機能させることを特徴とする。 The echo cancellation program according to the second aspect of the present invention provides a computer installed in an echo canceller that removes an echo component from a near-end input signal by using (1) a pseudo echo generated by a pseudo-echo generating means, and Echo canceling means for generating a signal obtained by canceling the echo component of the signal; (2) voice switch means for generating a signal obtained by attenuating the near-end input signal by an attenuator; and (3) the echo canceling means. Switch means for selecting one of the post-cancellation signal generated by the voice signal or the signal generated by the voice switch means and sending it to the far end side; and (4) the far end input signal and the near end input signal. A call state determination unit that determines a call state between a far-end speaker and a near-end speaker, and (5) controls the switch unit according to a determination result of the call state determination unit. Characterized in that to function as switch control means.

第３の本発明の電話装置は、第１の本発明のエコーキャンセラを搭載したことを特徴とする。 A telephone device according to a third aspect of the present invention includes the echo canceller according to the first aspect of the present invention.

本発明によれば、電話装置において発生するエコーを安定継続して除去することができる。 According to the present invention, echo generated in the telephone device can be stably and continuously removed.

第１の実施形態に係る電話装置の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the telephone apparatus which concerns on 1st Embodiment. 第１の実施形態に係る適応フィルタの機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the adaptive filter which concerns on 1st Embodiment. 第２の実施形態に係る電話装置の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the telephone apparatus which concerns on 2nd Embodiment. 第２の実施形態に係るエコーロス推定部の計算結果の例について示したグラフである。It is the graph shown about the example of the calculation result of the echo loss estimation part which concerns on 2nd Embodiment. 第２の実施形態に係るエコーロス推定部の動作について示した説明図（１）である。It is explanatory drawing (1) shown about operation | movement of the echo loss estimation part which concerns on 2nd Embodiment. 第２の実施形態に係るエコーロス推定部の動作について示した説明図（２）である。It is explanatory drawing (2) shown about operation | movement of the echo loss estimation part which concerns on 2nd Embodiment. 第３の実施形態に係る電話装置の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the telephone apparatus which concerns on 3rd Embodiment. 第４の実施形態に係る電話装置の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the telephone apparatus which concerns on 4th Embodiment. 第４の実施形態に係る信号混合部の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the signal mixing part which concerns on 4th Embodiment.

（Ａ）第１の実施形態
以下、本発明によるエコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置の第１の実施形態を、図面を参照しながら詳述する。 (A) First Embodiment Hereinafter, an echo canceller, an echo cancellation program, and a telephone device according to a first embodiment of the present invention will be described in detail with reference to the drawings.

（Ａ−１）第１の実施形態の構成
図１は、第１の実施形態の電話装置１０の全体構成を示すブロック図である。 (A-1) Configuration of the First Embodiment FIG. 1 is a block diagram showing the overall configuration of the telephone device 10 of the first embodiment.

電話装置１は、エコーキャンセラ１０、ディジタル・アナログ変換器（以下「Ｄ／Ａ」という）２０、アナログ・ディジタル変換器（以下、「Ａ／Ｄ」という）３０、スピーカ４０、マイクロホン５０を有している。 The telephone device 1 includes an echo canceller 10, a digital / analog converter (hereinafter referred to as “D / A”) 20, an analog / digital converter (hereinafter referred to as “A / D”) 30, a speaker 40, and a microphone 50. ing.

図１では、左側が遠端側、右側が近端側を表している。又、図１においては図示を省略しているが、電話装置１には、電話発信するためのプッシュボタン等、電話装置として必要な他の構成を備えているものとする。また、図１において、電話装置１は、ハンズフリーに対応した電話装置であるものとして説明するが、電話装置１に適用する電話装置の種類は限定されないものである。 In FIG. 1, the left side represents the far end side, and the right side represents the near end side. Although not shown in FIG. 1, the telephone device 1 is assumed to have other components necessary for the telephone device, such as a push button for making a call. In FIG. 1, the telephone device 1 is described as a hands-free telephone device, but the type of telephone device applied to the telephone device 1 is not limited.

エコーキャンセラ１０は、ＣＰＵ、ＲＯＭ、ＲＡＭ、ＥＥＰＲＯＭ、ハードディスクなどのプログラムの実行構成（１台に限定されず、複数台を分散処理し得るようにしたものであっても良い。）に、実施形態のエコーキャンセルプログラム等をインストールすることにより構築しても良く、その場合でも、機能的には上述の図１のように示すことができる。例えば、電話装置１が、パソコンや携帯電話端末等の情報処理装置上で、いわゆるソフトフォンとして構築されたものである場合には、その情報処理装置に、実施形態のエコーキャンセルプログラム等をインストールすることにより、エコーキャンセラ１０を構築するようにしても良い。 The echo canceller 10 is implemented in a program execution configuration such as a CPU, a ROM, a RAM, an EEPROM, and a hard disk (not limited to one, but may be configured so that a plurality of units can be distributedly processed). 1 may be constructed by installing an echo canceling program or the like, and even in that case, it can be functionally shown as in FIG. For example, when the telephone device 1 is constructed as a so-called soft phone on an information processing device such as a personal computer or a mobile phone terminal, the echo cancellation program of the embodiment is installed in the information processing device. Thus, the echo canceller 10 may be constructed.

遠端側から電話装置１が受信した遠端入力信号ｘ（ｎ）は、受信入力端子Ｒｉｎ、受信出力端子Ｒｏｕｔを介して、Ｄ／Ａ変換器２０に入力され、Ｄ／Ａ変換器２０においてアナログ音声信号に変換される。そして、スピーカ４０は、Ｄ／Ａ変換器２０から与えられた電気信号を、音響信号に変換し、近端話者に供する。なお、ｘ（ｎ）における「ｎ」はサンプル順を表わしている。また、以下において、同様に信号を表現する際に用いる「ｎ」も同様にサンプル順を表わすものである。 The far-end input signal x (n) received by the telephone device 1 from the far-end side is input to the D / A converter 20 via the reception input terminal Rin and the reception output terminal Rout, and the D / A converter 20 Converted to an analog audio signal. And the speaker 40 converts the electric signal given from the D / A converter 20 into an acoustic signal, and provides it to a near-end speaker. Note that “n” in x (n) represents the sample order. In the following, “n” used when expressing a signal in the same manner also represents the order of samples.

近端話者が発声することにより、マイクロホン５０に音響信号が入力されると、マイクロホン５０は、その音響信号を電気信号に変換し、その電気信号は、Ａ／Ｄ変換器３０に与えられる。そして、Ａ／Ｄ変換器３０では、マイクロホン５０から与えられた電気信号（アナログ信号）が、ディジタル変換され、近端入力信号として、送信入力端子Ｓｉｎを介して、エコーキャンセラ１０に入力される。そして、エコーキャンセラ１０では、近端入力信号は、送信入力端子Ｓｉｎを介してエコーキャンセラ１０に入力され、エコーキャンセラ１０によりエコー抑制処理が行われたのち送信出力端子Ｓｏｕｔを介して、遠端側へ向けて出力される。 When an acoustic signal is input to the microphone 50 by the near-end speaker uttering, the microphone 50 converts the acoustic signal into an electrical signal, and the electrical signal is given to the A / D converter 30. In the A / D converter 30, the electrical signal (analog signal) given from the microphone 50 is digitally converted and input to the echo canceller 10 via the transmission input terminal Sin as a near-end input signal. In the echo canceller 10, the near-end input signal is input to the echo canceller 10 via the transmission input terminal Sin, and after echo suppression processing is performed by the echo canceller 10, the far-end side signal is transmitted via the transmission output terminal Sout. Is output towards

スピーカ４０から出力された音響信号は反射されたり、空間を伝播したりする経路（エコー経路）を経由して、近端話者の発声した音声と共にマイクロホン５０に入力（混入）されてしまう。 The acoustic signal output from the speaker 40 is input (mixed) into the microphone 50 together with the voice uttered by the near-end speaker via a path (echo path) that is reflected or propagates through space.

マイクロホン５０に混入されたエコーは、音響電気変換でアナログ電気信号に変換され、Ａ／Ｄ変換器３０でディジタル変換されてエコーｙ（ｎ）となり、エコーキャンセラ１０に入力される。エコーキャンセラ１０では、近端入力信号に混入されるこのエコーｙ（ｎ）の成分を、抑制する処理を行う。 The echo mixed in the microphone 50 is converted into an analog electric signal by acoustoelectric conversion, and is digitally converted by the A / D converter 30 to become an echo y (n), which is input to the echo canceller 10. The echo canceller 10 performs processing for suppressing the component of this echo y (n) mixed in the near-end input signal.

次に、エコーキャンセラ１０の内部構成について説明する。 Next, the internal configuration of the echo canceller 10 will be described.

エコーキャンセラ１０は、加算器１０１、適応フィルタ（以下、「ＡＤＦ」ともいう）１０２、双方向通話検出器（以下、「ＤＴＤ」ともいう）１０３、制御部１０４、遅延器１０５、遅延器１０６、減衰器１０７、及びスイッチ１０８を有している。 The echo canceller 10 includes an adder 101, an adaptive filter (hereinafter also referred to as “ADF”) 102, a bidirectional communication detector (hereinafter also referred to as “DTD”) 103, a control unit 104, a delay unit 105, a delay unit 106, An attenuator 107 and a switch 108 are included.

加算器１０１は、後述する処理により、適応フィルタ１０２で生成された擬似エコーｙ’（ｎ）と、Ａ／Ｄ変換器３０から出力される近端入力信号に混入されたエコーｙ（ｎ）を加算相殺し、エコーｙ（ｎ）の成分を抑制した信号ｅ（ｎ）を、双方向通話検出器１０３、遅延部１０６、及び適応フィルタ１０２に与える。 The adder 101 receives the pseudo echo y ′ (n) generated by the adaptive filter 102 and the echo y (n) mixed in the near-end input signal output from the A / D converter 30 by a process described later. The signal e (n) that has been canceled by addition and the echo y (n) component is suppressed is provided to the bidirectional communication detector 103, the delay unit 106, and the adaptive filter 102.

遅延部１０６は、加算器１０１から与えられた信号ｅ（ｎ）を所定時間保持して遅延させてから、スイッチ１０８に与えるものである。 The delay unit 106 holds and delays the signal e (n) given from the adder 101 for a predetermined time and then gives it to the switch 108.

遅延部１０５は、Ａ／Ｄ変換器３０から供給された、エコーｙ（ｎ）が混入された近端入力信号を、所定時間保持して遅延させた信号ｙｄ（ｎ）を、減衰器１０７に与える。 The delay unit 105 supplies, to the attenuator 107, a signal yd (n) that is supplied from the A / D converter 30 and delayed by holding the near-end input signal mixed with the echo y (n) for a predetermined time. give.

減衰器１０７は、遅延部１０５から与えられたｙｄ（ｎ）を、所定の減衰量分減衰させた信号ａｔｔ（ｎ）を生成して、スイッチ１０８に与える。 The attenuator 107 generates a signal att (n) obtained by attenuating yd (n) given from the delay unit 105 by a predetermined attenuation amount, and supplies the signal att (n) to the switch 108.

スイッチ１０８は、遅延部１０６から与えられる信号、又は、減衰器１０７から与えられる信号のいずれかを切替えて、遠端側（送信出力端子Ｓｏｕｔ）に向けて送出するものである。 The switch 108 switches either the signal given from the delay unit 106 or the signal given from the attenuator 107 and sends it to the far end side (transmission output terminal Sout).

スイッチ１０８では、端子ａ１が遅延部１０６の出力と接続している。また、スイッチ１０８では、端子ｂ１が、減衰器１０７の出力と接続している。さらに、スイッチ１０８では、端子ｃ１が、送信出力端子Ｓｏｕｔと接続している。すなわち、スイッチ１０８では、端子ｃ１に対する接続が、端子ａ１又は端子ｂ１のいずれかに切り替えられる。スイッチ１０８は、制御部１０４によって、上述の切替えを行う。 In the switch 108, the terminal a1 is connected to the output of the delay unit 106. In the switch 108, the terminal b1 is connected to the output of the attenuator 107. Further, in the switch 108, the terminal c1 is connected to the transmission output terminal Sout. That is, in the switch 108, the connection to the terminal c1 is switched to either the terminal a1 or the terminal b1. The switch 108 performs the above-described switching by the control unit 104.

制御部１０４は、後述する双方向通話検出器１０３から与えられる判定結果に応じて、スイッチ１０８の切替えを制御するものであるが、具体的な制御内容については、後述する動作説明において詳述する。 The control unit 104 controls switching of the switch 108 in accordance with a determination result given from a two-way call detector 103 described later. Specific control contents will be described in detail in an operation description described later. .

次に適応フィルタ１０２の詳細について説明する。 Next, details of the adaptive filter 102 will be described.

図２は、適応フィルタ１０２内部の機能的構成について示したブロック図である。 FIG. 2 is a block diagram showing a functional configuration inside the adaptive filter 102.

適応フィルタ１０２は、擬似エコー作成部２０１、係数更新量計算部２０２、及びステップゲイン計算部２０３を有している。 The adaptive filter 102 includes a pseudo echo creation unit 201, a coefficient update amount calculation unit 202, and a step gain calculation unit 203.

擬似エコー作成部２０１は、遠端入力信号ｘ（ｎ）及び係数更新量計算部２０２で求められたフィルタ係数を利用して擬似エコーｙ’（ｎ）を生成するものであり、擬似エコー作成部２０１は、保持レジスタ２０１１、積和演算部２０１２、及び係数部２０１３を有している。なお、擬似エコー作成部２０１の処理については後述する動作説明において詳述する。 The pseudo echo creation unit 201 generates a pseudo echo y ′ (n) using the far-end input signal x (n) and the filter coefficient obtained by the coefficient update amount calculation unit 202. The pseudo echo creation unit 201 201 includes a holding register 2011, a product-sum operation unit 2012, and a coefficient unit 2013. Note that the processing of the pseudo echo creating unit 201 will be described in detail in an operation description to be described later.

係数更新量計算部２０２は、擬似エコー作成部２０１（保持レジスタ２０１１）からフィードバックされる内容と、ステップゲイン計算部２０３が求めたステップゲインとを利用して、擬似エコー作成部２０１に供給するためのフィルタ係数を求める。なお、係数更新量計算部２０２の処理については後述する動作説明において詳述する。 The coefficient update amount calculation unit 202 uses the content fed back from the pseudo echo creation unit 201 (holding register 2011) and the step gain obtained by the step gain calculation unit 203 to supply to the pseudo echo creation unit 201. The filter coefficient of is obtained. Note that the processing of the coefficient update amount calculation unit 202 will be described in detail in the description of operations described later.

ステップゲイン計算部２０３は、後述する双方向通話検出器１０３による判定結果と、信号ｅ（ｎ）とを利用して、係数更新量計算部２０２に供給するステップゲインを求める。なお、擬似エコー作成部２０１の処理については後述する動作説明において詳述する。 The step gain calculation unit 203 obtains a step gain to be supplied to the coefficient update amount calculation unit 202 by using a determination result by the bidirectional call detector 103 described later and the signal e (n). Note that the processing of the pseudo echo creating unit 201 will be described in detail in an operation description to be described later.

次に、双方向通話検出器１０３について説明する。 Next, the bidirectional call detector 103 will be described.

双方向通話検出器１０３は、遠端入力信号ｘ（ｎ）と、加算器１０１からの信号ｅ（ｎ）を用いて電話装置１における通話状態（遠端話者と近端話者との間の通話状態）を判定し、その判定結果を、適応フィルタ１０２及び制御部１０４に供給する。 The two-way call detector 103 uses the far-end input signal x (n) and the signal e (n) from the adder 101 to talk on the telephone device 1 (between the far-end talker and the near-end talker). ) And supplies the determination result to the adaptive filter 102 and the control unit 104.

双方向通話検出器１０３は、遠端話者の音声だけがある状態（ｘ（ｎ）だけが有音の状態；以下「遠端シングルトーク状態」又は「ＳＴ」という）、遠端話者及び近端話者の両方が話している状態（ｘ（ｎ）及びｅ（ｎ）が有音の状態；以下「ダブルトーク状態」又は「ＤＴ」という）、近端話者だけが話している状態（ｅ（ｎ）だけが有音状態の場合；以下「近端トーク状態」又は、「ＮＴ」という）、双方とも話していない状態（ｘ（ｎ）及びｅ（ｎ）が無音状態の場合；以下「無音状態」又は「ＩＤＬ」という）のいずれかを検出する。 The two-way call detector 103 is in a state where only the voice of the far-end speaker is present (only x (n) is voiced; hereinafter referred to as “far-end single talk state” or “ST”), State where both near-end speakers are speaking (x (n) and e (n) are voiced; hereinafter referred to as “double talk state” or “DT”), state where only near-end speakers are speaking (When only e (n) is in a sound state; hereinafter referred to as “near-end talk state” or “NT”), neither is speaking (when x (n) and e (n) are in a silent state; Hereinafter, either “silent state” or “IDL” is detected.

また、双方向通話検出器１０３は、エコー経路の変動があった状態（以下、「エコー経路変動状態」又は「ＥＰＣ」という）についても検出する。 The two-way call detector 103 also detects a state in which the echo path has changed (hereinafter referred to as “echo path change state” or “EPC”).

双方向通話検出器１０３による、これらの状態の検出は、例えば、「特開２００９−３３３４４号公報」（参考文献１）の記載技術を用いることができるが、これに限定されず、その結果を正しく利用できる方法であれば他の方法を用いてもよいのは当然である。 The detection of these states by the two-way call detector 103 can use, for example, the technology described in “JP 2009-33344 A” (reference document 1), but is not limited to this. Of course, other methods may be used as long as they can be used correctly.

一般に、双方向通話検出器の送信側の入力に加算器の出力を用いると、エコー成分を大まかに除去した上で送信信号（すなわち近端話者信号）の検出が可能になるので、単純に送信経路、受信経路での信号パワーを比較するような通話状態検出方法と比較すると、劇的に通話状態検出の精度が向上することが知られている。これはマイクロホンに入力される信号がエコーも近端話者信号も共に人間の音声であることを考えれば、エコーを区別して除去したうえでの近端信号検出を行うことが有利であるのは自然なことである。 In general, if the output of the adder is used as the input on the transmission side of the two-way call detector, it becomes possible to detect the transmission signal (ie, the near-end speaker signal) after roughly removing the echo component. It is known that the accuracy of call state detection is dramatically improved as compared with a call state detection method in which signal powers in a transmission path and a reception path are compared. Considering that both the echo and the near-end speaker signal are human speech signals, it is advantageous to detect the near-end signal after distinguishing and removing the echo. It is natural.

（Ａ−２）第１の実施形態の動作
次に、以上のような構成を有する第１の実施形態の電話装置１におけるエコーキャンセラ１０の動作を説明する。 (A-2) Operation of the First Embodiment Next, the operation of the echo canceller 10 in the telephone device 1 of the first embodiment having the above configuration will be described.

（Ａ−２−１）適応フィルタ１０２の動作について
まず、受信側入力端子Ｒｉｎから入力された遠端入力信号ｘ（ｎ）は、適応フィルタ１０２の保持レジスタ２０１１に格納され、以下の（１）式に示すようなデータベクトル（以下、「受信信号ベクトル」ともいう）として扱われる。以下の（１）式において、Ｌは適応フィルタ１０２のタップ数である。（ここではではＬ＝１０２４としたが限定されないものである）。また、以下の（１）式において、ｔは行列の転置を表している。以下の（１）式は、現在の時刻ｎを開始時点とし、過去のＬサンプルまでを保持した受信信号ｘ（ｎ）データの集まりが、式（１）の受信信号ベクトルを構成していることを表している。 (A-2-1) Operation of Adaptive Filter 102 First, the far-end input signal x (n) input from the receiving side input terminal Rin is stored in the holding register 2011 of the adaptive filter 102, and the following (1) It is treated as a data vector as shown in the equation (hereinafter also referred to as “received signal vector”). In the following equation (1), L is the number of taps of the adaptive filter 102. (L = 1024 here, but is not limited). In the following formula (1), t represents transposition of a matrix. In the following equation (1), a collection of received signal x (n) data having the current time n as a starting point and holding up to past L samples constitutes a received signal vector of equation (1). Represents.

Ｘ（ｎ）＝［ｘ（ｎ）、ｘ（ｎ−１）、…、ｘ（ｎ−L＋１）］^ｔ …（１）
一方、係数部２０１３には、係数更新量計算部２０２から供給されたフィルタ係数（Ｌ個の係数値により構成されている）のデータが格納されている。フィルタ係数を構成する係数値は後述するように、例えば、ＮＬＭＳ（ＮｏｒｍａｌｉｚｅｄＬｅａｓｔＭｅａｎＳｑｕａｒｅ）アルゴリズムを用いて係数更新量計算部２０２で時々刻々と更新される。係数部２０１３のＬ個の係数値も、上記（１）式のように１つの係数ベクトルとして扱われ、以下の（２）式のように示される。 X (n) = [x (n), x (n−1),..., X (n−L + 1)] ^t (1)
On the other hand, the coefficient unit 2013 stores data of filter coefficients (consisting of L coefficient values) supplied from the coefficient update amount calculation unit 202. As will be described later, the coefficient value constituting the filter coefficient is updated from time to time by the coefficient update amount calculation unit 202 using, for example, an NLMS (Normalized Last Mean Square) algorithm. The L coefficient values of the coefficient unit 2013 are also handled as one coefficient vector as in the above expression (1), and are expressed as in the following expression (2).

Ｈ（ｎ）＝［ｈ（０）、ｈ（１），…、ｈ（Ｌ−１）］^ｔ …（２）
上記（２）式でも、Ｌはフィルタのタップ数である。積和演算部２０１２では、係数部２０１３の係数ベクトルと、保持レジスタ２０１１のデータベクトルを用い、以下の（３）式のようにベクトルの積ｙ’（ｎ）を計算するが、このｙ’（ｎ）が、擬似エコーのサンプルデータとなる。なお、ｙ’（ｎ）は１つの値を持つスカラー値データである。 H (n) = [h (0), h (1),..., H (L−1)] ^t (2)
In the above equation (2), L is the number of filter taps. The product-sum operation unit 2012 uses the coefficient vector of the coefficient unit 2013 and the data vector of the holding register 2011 to calculate a vector product y ′ (n) as in the following equation (3). This y ′ ( n) is the sample data of the pseudo echo. Note that y ′ (n) is scalar value data having one value.

ｙ’（ｎ）＝Ｈ^ｔ（ｎ）×（ｎ） …（３）
上述のように作成された擬似エコーｙ’（ｎ）は、加算器１０１に供給される。加算器１０１では、上述したようにスピーカ４０からマイクロホン５０を経由しディジタル変換されたエコーｙ（ｎ）が入力されている。 y ′ (n) = H ^t (n) × (n) (3)
The pseudo echo y ′ (n) created as described above is supplied to the adder 101. The adder 101 receives the echo y (n) digitally converted from the speaker 40 via the microphone 50 as described above.

そして、加算器１０１は、以下の（４）式のように信号を加算相殺した残差信号ｅ（ｎ）を、送信出力端子Ｓｏｕｔに出力する。 Then, the adder 101 outputs a residual signal e (n) obtained by adding and canceling signals as shown in the following equation (4) to the transmission output terminal Sout.

ｅ（ｎ）＝ｙ（ｎ）−ｙ’（ｎ） …（４）
そして、ｅ（ｎ）は、送信出力端子Ｓｏｕｔを介して、遠端側へ向けて送出される。 e (n) = y (n) −y ′ (n) (4)
Then, e (n) is sent out toward the far end via the transmission output terminal Sout.

一方、加算器１０１の出力であるｅ（ｎ）は、ステップゲイン計算部２０３を経由して係数更新量計算部２０２に供給される。そして、更新量計算部２０２では、ｅ（ｎ）と、保持レジスタ２０１１から出力される信号ｘ（ｎ）とを用いて、例えば、以下の（５）式のような公知のＮＬＭＳアルゴリズムにより、適応フィルタ１０２のフィルタ系数が更新される。

On the other hand, e (n) that is the output of the adder 101 is supplied to the coefficient update amount calculation unit 202 via the step gain calculation unit 203. Then, the update amount calculation unit 202 uses e (n) and the signal x (n) output from the holding register 2011, for example, by a known NLMS algorithm such as the following equation (5). The filter system number of the filter 102 is updated.

上記（５）式は、サンプルｅ（ｎ）、ｘ（ｎ）から、次回の擬似エコー作成に用いるための適応フィルタ１０２係数ベクトルが作成されることを示している。ここでαはステップゲインであり、０≦α≦２の定数である。αが大きいときは収束時間は短いが、定常的なエコー消去性能は劣る。αが小さいと定常的なエコー消去性能は大きいが収束時間は長くなる。 The above equation (5) indicates that an adaptive filter 102 coefficient vector to be used for the next pseudo echo creation is created from the samples e (n) and x (n). Here, α is a step gain and is a constant of 0 ≦ α ≦ 2. When α is large, the convergence time is short, but the steady echo cancellation performance is poor. If α is small, steady echo cancellation performance is large, but the convergence time is long.

ただし、ステップゲイン計算部２０３に対して、双方向通話検出器１０３から、遠端シングルトーク状態（ＳＴ）、エコー経路変動状態（ＥＰＣ）以外の信号が入力されたときは、ステップゲイン計算部２０３は、α＝０として係数更新量計算部２０２に出力し、係数の更新を停止する。 However, when a signal other than the far-end single talk state (ST) and the echo path fluctuation state (EPC) is input from the bidirectional call detector 103 to the step gain calculation unit 203, the step gain calculation unit 203 Is output to the coefficient update amount calculation unit 202 as α = 0, and the coefficient update is stopped.

ここでは、係数更新量計算部２０２において、α＝０の場合係数更新を停止するようにしたが、他にもスイッチを設けてｅ（ｎ）を遮断して係数更新を停止するようにしてもよく、係数更新停止の方法はα＝０とすることに限定されない。 Here, in the coefficient update amount calculation unit 202, the coefficient update is stopped when α = 0, but a switch may be provided to cut off e (n) and stop the coefficient update. The coefficient update stop method is not limited to α = 0.

上述のように作成した係数ベクトル（フィルタ係数）は、係数更新量計算部２０２から係数部２０１３に出力され、入力ベクトルＸ（ｎ）とともに上記の（３）式による擬似エコーの作成に用いられる。 The coefficient vector (filter coefficient) created as described above is output from the coefficient update amount calculation unit 202 to the coefficient unit 2013, and is used together with the input vector X (n) to create a pseudo echo according to the above equation (3).

時刻が進むとともに上記（５）式の計算が何回か実行されて係数更新が進行すると、上記（４）式により求められるｅ（ｎ）は、徐々に小さくなる。やがて十分にｅ（ｎ）が小さくなると、上記（５）式の係数更新での係数更新は、ほとんど発生しなくなる。適応フィルタ１０２の収束後は、以後継続的に加算器１０１で精度よくエコーを打ち消すことが可能になる。 As the time advances, the calculation of the above equation (5) is executed several times and the coefficient update proceeds, and e (n) obtained by the above equation (4) gradually decreases. When e (n) becomes sufficiently small in the future, the coefficient update in the coefficient update of the above equation (5) hardly occurs. After the convergence of the adaptive filter 102, the echo can be canceled out with high accuracy by the adder 101 thereafter.

ここでは、適応フィルタ１０２の係数更新アルゴリズムに上述のようなＮＬＭＳアルゴリズムを用いたが、エコー推定アルゴリズムはエコーを推定できればどのようなアルゴリズムでもよくとくにＮＬＭＳに限定されないのは当然である。 Here, the NLMS algorithm as described above is used as the coefficient update algorithm of the adaptive filter 102, but the echo estimation algorithm may be any algorithm as long as it can estimate the echo, and is not limited to NLMS.

（Ａ−２−２）双方向通話検出器１０３の動作について
双方向通話検出器１０３は、上述の通り、遠端からの音声信号ｘ（ｎ）と、加算器１０１からの信号ｅ（ｎ）を用いて、電話装置１について、遠端シングルトーク状態（ＳＴ）、ダブルトーク状態（ＤＴ）、近端トーク状態（ＮＴ）、無音状態（ＩＤＬ）、エコー経路変動状態（ＥＰＣ）を検出する。 (A-2-2) Operation of bidirectional call detector 103 As described above, the bidirectional call detector 103 includes the voice signal x (n) from the far end and the signal e (n) from the adder 101. Is used to detect the far-end single talk state (ST), double-talk state (DT), near-end talk state (NT), silence state (IDL), and echo path variation state (EPC).

そして、第１の実施形態では、双方向通話検出器１０３による、遠端シングルトーク状態（ＳＴ）の検出がとくに重要である。第１の実施形態では、人間の会話が主に交互におこなわれることに着目し、従来とは異なり、適応フィルタ１０２と加算器１０１で構成される擬似エコーを用いてエコーを打消す構成（以下、「エコーキャンセラ部」ともいう）と、遅延部１０５と減衰器１０７で構成される近端入力信号を減衰させる構成（以下、「音声スイッチ」ともいう）を従属接続するのではなく、両者を並列に接続している。 In the first embodiment, detection of the far-end single talk state (ST) by the two-way call detector 103 is particularly important. In the first embodiment, paying attention to the fact that human conversation is mainly performed alternately, unlike conventional ones, a configuration in which echoes are canceled using a pseudo echo composed of an adaptive filter 102 and an adder 101 (hereinafter referred to as “echo”). , And a configuration for attenuating the near-end input signal composed of the delay unit 105 and the attenuator 107 (hereinafter also referred to as “voice switch”), but not connected in cascade. Connected in parallel.

これにより、第１の実施形態では、遠端シングルトーク状態（ＳＴ）の場合には、エコーの除去に主に音声スイッチ部を利用して強力にエコーを減衰してハウリングを防止している。一方、双方向通話が必要な一瞬（ダブルトーク状態（ＤＴ）の場合）など、エコーキャンセラ部を用いて双方向性を確保する。このようにすると、音響信号の循環経路にエコーキャンセラ部が介在する時間を非常に短くできるので、たとえエコーキャンセラ部が多少不安定な動作に陥ったとしてもそのままエコーが通話経路を無限に巡回し始めてハウリングに成長するのを防止することができるのである。 As a result, in the first embodiment, in the far-end single talk state (ST), howling is prevented by strongly attenuating the echo mainly using the voice switch part for the removal of the echo. On the other hand, bidirectionality is ensured by using an echo canceller, such as for a moment when a two-way call is necessary (in the case of a double talk state (DT)). In this way, the time for the echo canceller to intervene in the circulation path of the acoustic signal can be very short, so even if the echo canceller falls into a somewhat unstable operation, the echo circulates indefinitely along the call path. You can prevent it from growing for the first time.

（Ａ−２−３）制御部１０４、スイッチ１０８、及び減衰器１０７の動作について
次に、制御部１０４、スイッチ１０８、及び減衰器１０７の動作について説明する。 (A-2-3) Operations of Control Unit 104, Switch 108, and Attenuator 107 Next, operations of the control unit 104, switch 108, and attenuator 107 will be described.

双方向通話検出器１０３は、電話装置１の通話状態の検出結果を、制御部１０４に供給する。 The two-way call detector 103 supplies a detection result of the call state of the telephone device 1 to the control unit 104.

制御部１０４は、双方向通話検出器１０３から遠端シングルトーク状態（ＳＴ）が与えられたときは、スイッチ１０８に対して、端子ｃ１を端子ｂ１の側に接続をさせるように制御信号を与える。 When the far-end single talk state (ST) is given from the two-way call detector 103, the control unit 104 gives a control signal to the switch 108 so as to connect the terminal c1 to the terminal b1 side. .

第１の実施形態では、制御部１０４は、遠端シングルトーク状態（ＳＴ）以外の場合、スイッチ１０８に対して、端子ｃ１を端子ａ１の側に接続するように制御信号を出力する。特に、制御部１０４は、遠端シングルトーク状態（ＳＴ）が終了した時点では、必ずスイッチ１０８に対して、端子ｃ１を端子ａ１の側に接続するように制御するものとする。 In the first embodiment, the control unit 104 outputs a control signal to the switch 108 so as to connect the terminal c1 to the terminal a1 side in a case other than the far-end single talk state (ST). In particular, the control unit 104 always controls the switch 108 to connect the terminal c1 to the terminal a1 side when the far-end single talk state (ST) ends.

ただし、例えば、近端側の背景ノイズが無い環境などの場合、制御部１０４によるスイッチ１０８の制御処理は、これに限定しない。このような場合、制御部１０４は、例えば、無音状態（ＩＤＬ）、エコー経路変動状態（ＥＰＣ）のときにも、スイッチ１０８に対して、端子ｃ１を端子ｂ１の側に接続させるようにしても良い。なぜなら、無音状態（ＩＤＬ）のときに、減衰器１０７を挿入しても通話行為自体は無害であるし、エコー経路変動状態（ＥＰＣ）のときには適応フィルタ１０２の追従劣化で、適応フィルタ１０２の打ち消し効果よりも減衰器１０７によるエコー抑圧が好ましい場合があるからである。 However, for example, in an environment where there is no near-end background noise, the control processing of the switch 108 by the control unit 104 is not limited to this. In such a case, for example, the control unit 104 may connect the terminal c1 to the terminal b1 side with respect to the switch 108 even in the silent state (IDL) and the echo path variation state (EPC). good. This is because the speech act itself is harmless even if the attenuator 107 is inserted in the silent state (IDL), and the adaptive filter 102 cancels out due to the follow-up deterioration of the adaptive filter 102 in the echo path fluctuation state (EPC). This is because echo suppression by the attenuator 107 may be preferable to the effect.

ただし、上述したように、遠端シングルトーク状態（ＳＴ）の時に、制御部１０４は、スイッチ１０８に対して、必ず端子ｃ１を端子ｂ１の側に接続させることが望ましい。またスイッチ１０８の初期状態としては、端子ｃ１が端子ａ１の側に接続されていることが望ましい。 However, as described above, in the far-end single talk state (ST), it is desirable that the control unit 104 always connects the terminal c1 to the terminal b1 side with respect to the switch 108. As an initial state of the switch 108, it is desirable that the terminal c1 is connected to the terminal a1 side.

すでに述べたように遅延部１０５には、信号ｙ（ｎ）が供給されていて、遅延部１０５で、双方向通話検出器１０３の状態判定に必要な時間と同じ量の遅延を加えてｙｄ（ｎ）とし、減衰器１０７に供給する。 As already described, the delay unit 105 is supplied with the signal y (n), and the delay unit 105 adds a delay of the same amount as the time required for the state determination of the bidirectional call detector 103 to add yd ( n) and supplied to the attenuator 107.

減衰器１０７ではハウリング防止に必要な減衰を加え信号ａｔｔ（ｎ）としてスイッチ１０８の端子ｂ１に出力する。 The attenuator 107 adds attenuation necessary for preventing howling and outputs the signal att (n) to the terminal b1 of the switch 108.

ここでは、減衰器１０７の減衰量を１０ｄＢとするが、これに限定されない。ただしハウリングの防止効果を得るために、少なくともエコー経路の利得を相殺する程度の減衰を設定することが望ましい。 Here, the attenuation amount of the attenuator 107 is 10 dB, but is not limited to this. However, in order to obtain a howling prevention effect, it is desirable to set an attenuation that at least cancels the gain of the echo path.

上述したように、特に、制御部１０４は遠端シングルトーク状態（ＳＴ）が終了したときにはかならず端子をａ１に接続するようにスイッチ１０８を制御する。 As described above, in particular, the control unit 104 controls the switch 108 so as to connect the terminal to a1 when the far-end single talk state (ST) ends.

ここで、遅延部１０６について述べておくと、遅延部１０６は、近端側から見て、エコーキャンセラ部の出力（加算器１０１）の後段に設けることが望ましい。また、遅延部１０５は、近端側から見て、減衰器１０７の前段に設けることが望ましい。 Here, the delay unit 106 is described. It is desirable that the delay unit 106 is provided at the subsequent stage of the output of the echo canceller unit (adder 101) as viewed from the near end side. Further, the delay unit 105 is desirably provided in front of the attenuator 107 when viewed from the near end side.

遅延部１０６を、エコーキャンセラ部の出力（加算器１０１）の後段、すなわち、双方向通話検出器１０３の入力点の後段に設けるのは、双方向通話検出器１０３の判定を遅らせないためである。また、遅延部１０５を減衰器１０７の前段に設けるのは、先行した双方向通話検出器１０３の判定結果に応じて、通話状態を反映して信号に減衰を加えるためである。 The reason why the delay unit 106 is provided after the output of the echo canceller unit (adder 101), that is, after the input point of the bidirectional call detector 103, is to prevent the determination of the bidirectional call detector 103 from being delayed. . The reason why the delay unit 105 is provided in the preceding stage of the attenuator 107 is to add attenuation to the signal reflecting the call state according to the determination result of the preceding two-way call detector 103.

（Ａ−２−４）エコーキャンセラ１０全体の動作について
ここまで詳細に説明してきたように、エコー及びハウリングを抑制するのを目的としたエコーキャンセラ１０は、人同士の会話において、基本的に音声のやり取りが交互に行われることに着目したものである。 (A-2-4) Overall Operation of the Echo Canceller 10 As described in detail so far, the echo canceller 10 for the purpose of suppressing echo and howling is basically a voice in a conversation between people. It pays attention to the fact that these exchanges are performed alternately.

従来のエコーキャンセラでは、単純に物理的に信号のパワーを検出し、基本的に信号の存在はどちらにでも均等に発生することを前提としている。そのため、従来のエコーキャンセラでは、誤検出を承知の上であっても、検出精度を非常な高感度に設定する必要があったり、または送信受信側のどちらの通話状態でもない中立の状態を設定し、中立状態の次に発生する音声の発生を常時待ち構えるかわりに、この中立状態では双方向通話の音声の減衰してハウリングを防止する代わりに音質を犠牲にするものであった。 In the conventional echo canceller, it is assumed that the signal power is simply detected physically, and basically the presence of the signal occurs evenly in either case. For this reason, the conventional echo canceller needs to set the detection accuracy to a very high sensitivity even if it is aware of the erroneous detection, or sets a neutral state that is neither the call state on the transmission / reception side. However, instead of always waiting for the voice to be generated next to the neutral state, in this neutral state, the voice of the two-way call is attenuated and the sound quality is sacrificed instead of preventing howling.

これに対し、第１の実施形態のエコーキャンセラ１０では、上述のように、人の会話は言葉のやり取りが交互に行われることが多いという基本的な人の会話の性質に着目し、これを利用しており、遠端からの信号だけがあった状態（ＳＴ）の後には、近端の話者がそれに応答する状態（ＮＴ）になるであろうことを、スイッチ１０８動作に反映し、これを交互に繰りかえすのである。 On the other hand, in the echo canceller 10 of the first embodiment, as described above, paying attention to the basic nature of human conversation, in which human conversation is often performed alternately, After the state where only the signal from the far end (ST) is in use and the near end speaker will respond (NT), the switch 108 operation is reflected. This is repeated alternately.

すなわち、遠端シングルトーク状態（ＳＴ）の後は、近端話者が遠端話者の言葉に応じるであろうから、遠端シングルトーク状態（ＳＴ）の後は必ず、制御部１０４は、スイッチ１０８の端子ｃ１を端子をａ１の側に接続させ、近端話者の音声が発生しても音切れのないエコーキャンセラ部側の信号ｅ（ｎ）を選択するように処理する。もし、このとき仮定に反して、もう一度継続して遠端話者が話し、遠端シングルトーク状態（ＳＴ）になったとしても、多少なりともエコーキャンセラ（加算器１０１及び適応フィルタ１０２）によるエコー打消しが、加算器１０１で行われるので問題は発生しない。 That is, after the far-end single talk state (ST), the near-end speaker will respond to the words of the far-end talker. The terminal c1 of the switch 108 is connected to the a1 side, and processing is performed so as to select the signal e (n) on the echo canceller unit side without sound interruption even when the near-end speaker's voice is generated. At this time, even if the far-end speaker speaks again and goes into the far-end single talk state (ST) contrary to the assumption, the echo by the echo canceller (adder 101 and adaptive filter 102) is more or less. Since the cancellation is performed by the adder 101, no problem occurs.

一方で想定するように、電話装置１で近端トーク状態（ＮＴ）が発生しても、スイッチ１０８では、エコーキャンセラ部側の信号ｅ（ｎ）が選択（遠端側へ送信）されるので問題はない。さらに、一瞬会話衝突であるダブルトーク状態（ＤＴ）が発生しても、どのみち双方向通話検出器１０３から制御部１０４に、ダブルトーク状態（ＤＴ）が入力され、スイッチ１０８では、端子ｃ１を端子ａ１の側に接続するように制御される。 On the other hand, as expected, even if the near-end talk state (NT) occurs in the telephone device 1, the switch 108 selects the signal e (n) on the echo canceller side (transmits to the far end side). No problem. Furthermore, even if a double talk state (DT) that is a momentary conversation collision occurs, the double talk state (DT) is input from the bidirectional call detector 103 to the control unit 104, and the switch 108 connects the terminal c1. Control is performed so as to connect to the terminal a1 side.

そのため、双方向通話信号のうち、遠端からの音声（遠端入力信号）は透過され、近端話者の音声信号（送話信号）として、エコーキャンセラ部側の信号（信号ｅ（ｎ））が選択される。そして、適応フィルタ１０２のフィルタ係数は、上述したようにダブルトーク状態（ＤＴ）の時には、凍結（更新停止）されるため、加算器１０１でエコーが軽減された信号が、遠端側に送信される。これにより、エコーキャンセラ１０では、遠端側へ送信する音声信号の音質はさほど劣化させないようにすることができる。 Therefore, the voice from the far end (far-end input signal) is transmitted among the two-way call signals, and the signal (signal e (n)) on the echo canceller side is used as the voice signal (transmission signal) of the near-end speaker. ) Is selected. Since the filter coefficient of the adaptive filter 102 is frozen (stops updating) in the double talk state (DT) as described above, the signal whose echo is reduced by the adder 101 is transmitted to the far end side. The Thereby, the echo canceller 10 can prevent the sound quality of the audio signal transmitted to the far end side from being deteriorated so much.

（Ａ−３）第１の実施形態の効果
（Ａ−３−１）エコーキャンセラ１０において、双方向通話検出器１０３で、遠端シングルトーク状態（ＳＴ）が検出された場合には、スイッチ１０８を制御して、音声スイッチ部（遅延部１０５及び、減衰器１０７）からの信号ａｔｔ（ｎ）を遠端側に出力する。一方、双方向通話検出器１０３で、遠端シングルトーク状態（ＳＴ）以外の状態が検出された場合には、スイッチ１０８を制御して、エコーキャンセラ部（加算器１０１、適応フィルタ１０２）からの信号ｅ（ｎ）を遠端側に送信する。 (A-3) Effect of First Embodiment (A-3-1) In the echo canceller 10, when the two-way call detector 103 detects the far-end single talk state (ST), the switch 108 And the signal att (n) from the voice switch unit (delay unit 105 and attenuator 107) is output to the far end side. On the other hand, when a state other than the far-end single talk state (ST) is detected by the two-way call detector 103, the switch 108 is controlled so that the echo canceller unit (adder 101, adaptive filter 102) The signal e (n) is transmitted to the far end side.

特に、エコーキャンセラ１０では、遠端シングルトーク状態（ＳＴ）が終了した時点では必ずスイッチ１０８の端子ｃ１を端子ａ１に接続するようにした。これにより、人同士の会話の交互のやり取りにおいて、遠端シングルトーク状態（ＳＴ）の時には、確実に通話経路に減衰が挿入されてエコー感を防止するとともに、仮に成長中の準ハウリング信号成分があっても、これを遠端シングルトーク状態（ＳＴ）のたびに減衰できる。そして、そのほかの通話状態、たとえば近端トーク状態（ＮＴ）では、エコーキャンセラ部（加算器１０１）の出力を選択するので発話の最初及び最後の途切れを防止し、安定継続的にエコーを除去することができる。 In particular, in the echo canceller 10, the terminal c1 of the switch 108 is always connected to the terminal a1 when the far-end single talk state (ST) ends. As a result, in the alternating exchange of conversations between people, in the far-end single talk state (ST), the attenuation is surely inserted into the call path to prevent the feeling of echo, and the quasi-howling signal component that is growing is assumed to be Even if it exists, it can be attenuated at each far-end single talk state (ST). In other speech states, for example, near-end talk state (NT), the output of the echo canceller (adder 101) is selected, so that the first and last interruptions in speech are prevented and echoes are stably and continuously removed. be able to.

また、エコーキャンセラ１０では、通話の最中に一瞬発生するダブルトーク状態（ＤＴ）のときにも、送話状態では適応フィルタ１０２側の打ち消し加算器１０１の出力を選択するので発話の最初及び最後の途切れや、極端な音量の上下がなく安定的で自然な通話が可能となる。 The echo canceller 10 also selects the output of the cancellation adder 101 on the adaptive filter 102 side in the transmission state even in the double talk state (DT) that occurs momentarily during a call, so the first and last utterances This makes it possible to make a stable and natural call without interruptions or extreme volume levels.

（Ａ−３−２）エコーキャンセラ１０で重要なのは、シングルトーク状態（ＳＴ）でのエコー除去の主体を、音声スイッチ部（遅延部１０５及び減衰器１０７）にして、エコーｙ（ｎ）に、一定量の減衰が確実に加わるようにすることである。なぜなら、通話を不可能にするハウリングは、通話経路のレベル配分設定が悪いときに発生する現象である。具体的には、この減少は、非特許文献１に記載されているように、音響信号の一巡経路の利得が１．０以上であるときに発生する。しかし、実際には、通信経路の巡回利得が１．０をはじめから大きく上回ることは、さほど多くはない。このようなときハウリングは、いきなり発生するのではなく、信号が何回も巡回した結果として、最終的にハウリングにいたることが多いのである。エコーキャンセラはエコー経路の変動などがあると、エコー打消し性能の劣化があったり、シングルトーク状態のときでも背景ノイズレベルまでがエコー打ち消しの限界であることが制約となり、巡回するエコーの成分を１．０以下に押さえ込めないことがある。 (A-3-2) What is important in the echo canceller 10 is that the main body of echo cancellation in the single talk state (ST) is the voice switch unit (the delay unit 105 and the attenuator 107), and the echo y (n) It is to ensure that a certain amount of attenuation is applied. This is because howling that makes a call impossible is a phenomenon that occurs when the level distribution setting of a call path is poor. Specifically, as described in Non-Patent Document 1, this decrease occurs when the gain of the round-trip path of the acoustic signal is 1.0 or more. However, in reality, it is not so much that the cyclic gain of the communication path greatly exceeds 1.0 from the beginning. In such a case, howling does not occur suddenly, but it often ends up in howling as a result of many rounds of the signal. When the echo canceller has a variation in the echo path, the echo cancellation performance is degraded, and even in the single talk state, the background noise level is the limit of echo cancellation. It may not be able to hold down to 1.0 or less.

しかし、第１の実施形態のエコーキャンセラ１０では、少なくとも遠端シングルトーク状態（ＳＴ）の検出中では、必ず強制的に通信経路上に減衰器１０７の効果を取り入れる時間が発生する。つまり、徐々に成長するハウリング原因信号、いわば「準ハウリング信号」を、遠端シングルトーク状態（ＳＴ）、のたびに成長前の状態に引き戻すことができ、ハウリングの発生頻度を下げることができる。 However, in the echo canceller 10 of the first embodiment, there is always a time for forcibly incorporating the effect of the attenuator 107 on the communication path at least during detection of the far-end single talk state (ST). In other words, the howling cause signal that gradually grows, that is, the “quasi-howling signal”, can be pulled back to the state before the growth every time the far-end single talk state (ST), and the frequency of howling can be reduced.

（Ｂ）第２の実施形態
以下、本発明によるエコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置の第２の実施形態を、図面を参照しながら詳述する。 (B) Second Embodiment Hereinafter, an echo canceller, an echo cancellation program, and a telephone device according to a second embodiment of the present invention will be described in detail with reference to the drawings.

（Ｂ−１）第２の実施形態の構成
図３は、第２の実施形態の電話装置１Ａの全体構成を示すブロック図であり、上述した図１との同一、対応部分には同一、対応符号を付して示している。 (B-1) Configuration of the Second Embodiment FIG. 3 is a block diagram showing the overall configuration of the telephone device 1A of the second embodiment, which is the same as FIG. A reference numeral is attached.

以下、第２の実施形態について、第１の実施形態との差異を説明する。 Hereinafter, the difference between the second embodiment and the first embodiment will be described.

電話装置１Ａでは、エコーキャンセラ１０が、エコーキャンセラ１０Ａに置き換わっている点で第１の実施形態と異なっている。また、エコーキャンセラ１０Ａは、減衰量計算部１０９、エコーロス推定部（以下、「ＥＬＥ」ともいう）１１０、及び減衰器１１１が追加されている点で、第１の実施形態とは異なっている。また、双方向通話検出器１０３による通話状態の判定結果が、エコーロス推定部１１０にも供給されている。 The telephone device 1A is different from the first embodiment in that the echo canceller 10 is replaced with an echo canceller 10A. The echo canceller 10A is different from the first embodiment in that an attenuation calculation unit 109, an echo loss estimation unit (hereinafter also referred to as “ELE”) 110, and an attenuator 111 are added. In addition, the determination result of the call state by the two-way call detector 103 is also supplied to the echo loss estimation unit 110.

なお、減衰量計算部１０９、エコーロス推定部１１０、及び減衰器１１１の詳細については、後述する動作説明において説明する。 Note that details of the attenuation calculation unit 109, the echo loss estimation unit 110, and the attenuator 111 will be described in the description of operations described later.

（Ｂ−２）第２の実施形態の動作
次に、以上のような構成を有する第２の実施形態の電話装置１Ａにおけるエコーキャンセラ１０Ａの動作を説明する。 (B-2) Operation | movement of 2nd Embodiment Next, operation | movement of 10 A of echo cancellers in 1 A of telephone apparatuses of 2nd Embodiment which has the above structures is demonstrated.

（Ｂ−２−１）エコーロス推定部１１０（ＥＬＥ）の動作について
エコーロス推定部１１０には遠端入力信号ｘ（ｎ）と、Ａ／Ｄ変換器３０からのエコーｙ（ｎ）の成分を含む近端入力信号が供給されている。エコーロス推定部１１０は、以下に説明する方法でエコーロス、すなわち信号が受信出力端子Ｒｏｕｔから送信入力端子Ｓｉｎにいたるエコーパスを経由したときのエコー経路減衰量（あるいは増幅量）を計算する。 (B-2-1) Operation of Echo Loss Estimator 110 (ELE) The echo loss estimator 110 includes the far-end input signal x (n) and the component of the echo y (n) from the A / D converter 30. Near-end input signal is supplied. The echo loss estimation unit 110 calculates an echo loss, that is, an echo path attenuation amount (or an amplification amount) when the signal passes through an echo path from the reception output terminal Rout to the transmission input terminal Sin by a method described below.

エコーロス推定部１１０は、双方向通話検出器１０３から遠端シングルトーク状態（ＳＴ）の判定結果が供給されているときだけ、以下のように動作する。エコーロス推定部１１０は、双方向通話検出器１０３から、その他の判定結果を入力されたときには何もしないので、ない場合と同じ状態となる
エコーロス推定部１１０は、双方向通話検出器１０３から遠端シングルトーク状態（ＳＴ）の判定結果が入力されているとき、以下の（６）式のように遠端入力信号ｘ（ｎ）のパワー平滑値Ｐ＿ａｖ（ｎ）を計算する。双方向通話検出器１０３から遠端シングルトーク状態（ＳＴ）の判定結果が入力されているときに、以下の（６）式を用いて計算することは、最もエコー経路の信号増減を正確に計算する上で望ましい。なお、以下の（６）式において、δｐａｖは平滑の度合いを決定する０＜δｐａｖ＜１．０の定数である。 The echo loss estimation unit 110 operates as follows only when the determination result of the far-end single talk state (ST) is supplied from the bidirectional call detector 103. The echo loss estimation unit 110 does nothing when other determination results are input from the two-way call detector 103, so the echo loss estimation unit 110 is in the same state as when there is no echo loss. The echo loss estimation unit 110 is far from the two-way call detector 103. When the determination result of the single talk state (ST) is input, the power smoothing value P_av (n) of the far-end input signal x (n) is calculated as in the following equation (6). When the determination result of the far-end single talk state (ST) is input from the bidirectional call detector 103, the calculation using the following equation (6) is the most accurate calculation of the signal increase / decrease in the echo path. This is desirable. In the following expression (6), δpav is a constant of 0 <δpav <1.0 that determines the degree of smoothness.

Ｐ＿ａｖ（ｎ）＝δｐａｖ×ｘ（ｎ）^２
＋（１．０−δｐａｖ）×Ｐ＿ａｖ（ｎ） …（６）
また、エコーロス推定部１１０は、以下の（７）式のように、エコーｙ（ｎ）のパワー平滑値ＥＣ＿ａｖ（ｎ）を計算する。 P_av (n) = δpav × x (n) ²
+ (1.0−δpav) × P_av (n) (6)
Further, the echo loss estimation unit 110 calculates the power smooth value EC_av (n) of the echo y (n) as in the following equation (7).

ＥＣ＿ａｖ（ｎ）＝δｐａｖ×ｙ（ｎ）^２
＋（１．０−δｐａｖ）×ＥＣ＿ａｖ（ｎ） …（７）
上記（７）式において、δｐａｖが大きければ信号の急な変化に対応するが、ノイズの影響を受けやすくなり、逆に小さければ信号の大まかな変化に対応し、ノイズの影響を受けにくくなる。ここでは、δｐａｖ＝０．０１を用いるものとするが、この値に限定されないものである。 EC_av (n) = δpav × y (n) ²
+ (1.0−δpav) × EC_av (n) (7)
In the above equation (7), if δpav is large, it corresponds to a sudden change in the signal, but it is easily affected by noise, and conversely, if it is small, it corresponds to a rough change in the signal and is hardly affected by noise. Here, δpav = 0.01 is used, but it is not limited to this value.

エコーロス推定部１１０は、所定期間Ｔ＿ＥＬＥ（例えば、１０秒の期間）で、Ｐ＿ａｖ（ｎ）、ＥＣ＿ａｖ（ｎ）のそれぞれの最大値Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）を求める。なお、Ｔ＿ＥＬＥに適用する期間の長さは限定されないものである。なお、この場合、ｋは１０秒ごとの順番を表す整数であるものとする。 The echo loss estimation unit 110 obtains the maximum values P_MAX (k) and EC_MAX (k) of P_av (n) and EC_av (n) in a predetermined period T_ELE (for example, a period of 10 seconds). Note that the length of the period applied to T_ELE is not limited. In this case, k is an integer representing the order every 10 seconds.

図４は、Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）の例について示した説明図である。 FIG. 4 is an explanatory diagram showing examples of P_MAX (k) and EC_MAX (k).

上述したように、エコーロス推定部１１０では、上記（６）式でスピーカ４０から出力される遠端入力信号ｘ（ｎ）のパワー推移、すなわちＰ＿ａｖ（ｎ）が計算される。そして、ここでは、Ｐ＿ａｖ（ｎ）をグラフで示すと、図４（ａ）のようになったものとする。そして、エコーロス推定部１１０では、Ｔ＿ＥＬＥの期間内でのＰ＿ａｖ（ｎ）の最大値Ｐ＿ＭＡＸ（ｋ）が求められる。 As described above, the echo loss estimation unit 110 calculates the power transition of the far-end input signal x (n) output from the speaker 40, that is, P_av (n) by the above equation (6). Here, it is assumed that P_av (n) is represented as a graph as shown in FIG. Then, the echo loss estimation unit 110 obtains the maximum value P_MAX (k) of P_av (n) within the period of T_ELE.

本来エコーｙ（ｎ）は、スピーカ４０から出力された遠端入力信号ｘ（ｎ）が、エコー経路を介して、マイクロホン５０に戻ってしまい、再び観測される信号である。したがって、エコー経路に背景ノイズがなければ、エコーｙ（ｎ）は、スピーカ４０から放射されて空間伝播でパワーが降下し、マイクロホン５０に入力される信号である。つまり、ＥＣ＿ａｖ（ｎ）の値は、図４（ａ）に示すように、通常はＰ＿ａｖ（ｎ）が幾分小さくなった値となる。 Originally, the echo y (n) is a signal that the far-end input signal x (n) output from the speaker 40 returns to the microphone 50 via the echo path and is observed again. Therefore, if there is no background noise in the echo path, the echo y (n) is a signal that is radiated from the speaker 40 and whose power decreases due to spatial propagation and is input to the microphone 50. That is, the value of EC_av (n) is usually a value obtained by slightly reducing P_av (n) as shown in FIG.

すなわち、エコーロス推定部１１０で求めたいエコー経路の減衰は、単純に、以下の（８）式で求められるＰ＿ａｖ（ｎ）とＥＣ＿ａｖ（ｎ）との差分ｄｉｆｆ＿ＥＬ（ｎ）、又は、以下の（９）式で求められるＰ＿ａｖ（ｎ）とＥＣ＿ａｖ（ｎ）との比ｒａｔｉｏ＿ＥＬ（ｎ）で表すことができる。 That is, the echo path attenuation desired by the echo loss estimation unit 110 is simply the difference diff_EL (n) between P_av (n) and EC_av (n) obtained by the following equation (8), or (9 ) Expression P_av (n) and EC_av (n) ratio ratio_EL (n).

ｄｉｆｆ＿ＥＬ（ｎ）＝Ｐ＿ａｖ（ｎ）−ＥＣ＿ａｖ（ｎ） …（８）
ｒａｔｉｏ＿ＥＬ（ｎ）＝Ｐ＿ａｖ（ｎ）／ＥＣ＿ａｖ（ｎ） …（９）
しかし、現実において、エコー経路は、近端話者側の電話装置１Ａのスピーカ４０やマイクロホン５０が置かれる実環境であるから、背景ノイズが皆無ということはほとんど無い。そこで、背景ノイズとして、図４（ｂ）のように、時間軸ではパワーが一様な背景ノイズ（黒塗りで表わされた成分；以下「ＢＧＮ」ともいう）がある場合を考える。すると、図４（ａ）で示したＥＣ＿ａｖ（ｎ）と背景ノイズＢＧＮ、エコーロスの推定値ＥＬＥ＿ＥＳＴ（ｋ）の関係は図４（ｃ）のようになる。つまり、図４（ｃ）からわかるように、遠端入力信号ｘ（ｎ）のパワーが小さい部分はそれに応じてエコーｙ（ｎ）のパワーも小さくなるから、エコーｙ（ｎ）は、場合によって背景ノイズＢＧＮのパワーより小さくなり、背景ノイズＢＧＮに埋もれてしまうことになる。 diff_EL (n) = P_av (n) −EC_av (n) (8)
ratio_EL (n) = P_av (n) / EC_av (n) (9)
However, in reality, since the echo path is a real environment where the speaker 40 and the microphone 50 of the telephone device 1A on the near-end speaker side are placed, there is almost no background noise. Therefore, as a background noise, as shown in FIG. 4B, consider a case where there is a background noise having a uniform power on the time axis (a component represented by black paint; hereinafter also referred to as “BGN”). Then, the relationship between EC_av (n), background noise BGN, and echo loss estimation value ELE_EST (k) shown in FIG. 4A is as shown in FIG. That is, as can be seen from FIG. 4C, since the power of the echo y (n) decreases correspondingly in the portion where the power of the far-end input signal x (n) is small, the echo y (n) It becomes smaller than the power of the background noise BGN and is buried in the background noise BGN.

このような場合、単純に上記（８）式、上記（９）式を計算しても、実際にはＥＣ＿ａｖ（ｎ）を観測すべきところ、パワーが打ち勝った背景ノイズＢＧＮのパワーを計算してしまうため、真のエコー経路のロスを求めることができないことになる。 In such a case, even if the above formulas (8) and (9) are simply calculated, the EC_av (n) should actually be observed, but the power of the background noise BGN that has overcome the power is calculated. Therefore, the loss of the true echo path cannot be obtained.

そこで、エコーロス推定部１１０では、後述するように、エコーロス推定値ＥＬＥ＿ＥＳＴ（ｋ）を求め、ハウリング防止を担う２つの減衰器１０７、１１１の動作に反映させる。 Therefore, as will be described later, the echo loss estimation unit 110 obtains an echo loss estimated value ELE_EST (k) and reflects it in the operations of the two attenuators 107 and 111 responsible for preventing howling.

エコーロス推定部１１０は、上記（７）式に示すＥＣ＿ａｖ（ｎ）の所定期間Ｔ＿ＢＧ（例えば、Ｔ＿ＥＬＥと同じ１０秒間）の最小値としてＢＧＮ’（ｋ）を計算する。ｋは、上記（７）式と同様に期間区切りの順番（フレーム）を表す整数である。なお、ここでは、ＢＧＮ’（ｋ）を求める際の期間を１０秒としたが、これに限定されない。 The echo loss estimation unit 110 calculates BGN ′ (k) as the minimum value of a predetermined period T_BG (for example, the same 10 seconds as T_ELE) of EC_av (n) shown in the above equation (7). k is an integer representing the order (frame) of period breaks as in the above equation (7). Here, the period for obtaining BGN '(k) is 10 seconds, but the present invention is not limited to this.

なお、初期状態から（例えば、エコーキャンセラ１０で、信号処理を開始してから）１０秒間が経過していないうちは所定の初期値を用いるようにしても良い。ここでは、上述の初期値として−４５ｄＢｍ０（０ｄＢｍ０は国際規格ＩＴＵ−ＴＧ．７１１の０ｄＢｍ０基準信号を用いるようにしても良い）を設定したものとして説明するが、これに限定されない。また、ＢＧＮ’（ｋ）として、初期状態から、現時点までの期間内での最小値を用いるようにしてもよい。 Note that a predetermined initial value may be used as long as 10 seconds have not elapsed from the initial state (for example, after signal processing is started by the echo canceller 10). Here, description will be made assuming that −45 dBm0 (0 dBm0 may use the 0 dBm0 reference signal of the international standard ITU-TG.711) as the initial value described above, but is not limited thereto. Further, as BGN '(k), the minimum value within the period from the initial state to the present time may be used.

図４（ｄ）では、ＥＣ＿ａｖ（ｎ）とＢＧＮ’（ｋ）関係の例について示している。また、図４（ｅ）では、図４（ｄ）のグラフにさらにＰ＿ａｖ（ｎ）を重ねて表示した例について示している。 FIG. 4D shows an example of the relationship between EC_av (n) and BGN ′ (k). FIG. 4 (e) shows an example in which P_av (n) is further superimposed on the graph of FIG. 4 (d).

図４（ｄ）では、エコー成分を表わすＥＣ＿ａｖ（ｎ）の多くの部分が、背景ノイズＢＧＮの成分に埋もれてしまい、ＢＧＮ’（ｋ）がＢＧＮのレベルと一致する様子を示している。つまり、図４（ｄ）では、ＥＣ＿ａｖ（ｎ）が本当のエコーのレベルを表せるのは、レベルとしてはＢＧＮ’（ｋ）を超えた部分だけとなっている。 FIG. 4D shows a state in which many parts of EC_av (n) representing the echo component are buried in the background noise BGN component, and BGN '(k) matches the BGN level. That is, in FIG. 4D, EC_av (n) can represent the true echo level only in the portion exceeding BGN '(k).

なかでも、最も確からしく浮き出したエコーロスを推定することが出来そうなのは、図４（ｅ）において、スピーカ４０出力側の信号（遠端入力信号ｘ（ｎ））のピーク値を示す付近、すなわちＰ＿ＭＡＸ（ｋ）を示す点と、ＥＣ＿ＭＡＸ（ｋ）との差分をあらわす部分であるのがわかる。このことは、スピーカ４０から放射された音声信号（遠端入力信号ｘ（ｎ））の包絡線の最大部分が、エコーｙ（ｎ）としても最大部分となることを考え合わせれば自然なことである。 Among them, the echo loss that is most likely to be estimated is likely to be estimated in the vicinity of the peak value of the signal on the output side of the speaker 40 (far-end input signal x (n)), that is, P_MAX in FIG. It can be seen that this is a portion representing the difference between the point indicating (k) and EC_MAX (k). This is natural considering that the maximum portion of the envelope of the audio signal radiated from the speaker 40 (far-end input signal x (n)) becomes the maximum portion even as the echo y (n). is there.

上述のように、電話装置１Ａを、例えば、ハンズフリー通話装置に適用した場合、当該ハンズフリー通話装置が置かれた音環境においては、ハウリングの発生有無は、エコーロスが増幅系かどうかに左右される。この場合、当該ハンズフリー通話装置において、そのエコーロスを推定計算するためには、当該ハンズフリー通話装置の設置された音環境において、背景ノイズＢＧＮのパワーと、エコーのパワーとの関係が重要である。 As described above, when the telephone device 1A is applied to, for example, a hands-free call device, in a sound environment where the hands-free call device is placed, whether or not howling occurs depends on whether echo loss is an amplification system. The In this case, in order to estimate and calculate the echo loss in the hands-free call device, the relationship between the power of the background noise BGN and the power of the echo is important in the sound environment where the hands-free call device is installed. .

基本的にはエコーロスは、スピーカ４０から出力する信号に係るレベルＰ＿ａｖ（ｎ）（最大値はＰ＿ＭＡＸ（ｋ））と、エコー経路を経由してマイクロホン５０に入力されるエコーに係るレベルＥＣ＿ａｖ（ｎ）（最大値はＥＣ＿ＭＡＸ（ｋ））がわかれば計算が可能である。 Basically, the echo loss includes the level P_av (n) (the maximum value is P_MAX (k)) related to the signal output from the speaker 40 and the level EC_av (n) related to the echo input to the microphone 50 via the echo path. ) (The maximum value is EC_MAX (k)) and can be calculated.

ただし、実際にはエコーレベルと紛らわしい背景ノイズＢＧＮ’（ｋ）のレベル（パワーと言い換えても同じである）の影響を考慮する必要がある。エコーロス推定部１１０は、Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）、ＢＧＮ’（ｋ）の大小関係を比較して、検出した音環境の状態に係る判定を行い、その判定結果等を、後述する減衰量計算部１０９に与える。 However, in practice, it is necessary to consider the influence of the level of the background noise BGN ′ (k) that is confusing with the echo level (the same is true for power). The echo loss estimation unit 110 compares the magnitude relationships of P_MAX (k), EC_MAX (k), and BGN ′ (k), makes a determination related to the detected sound environment state, and determines the determination result, etc., as described later. This is given to the quantity calculation unit 109.

図５、図６は、エコーロス推定部１１０による、音環境に係る判定動作について示した説明図である。 FIGS. 5 and 6 are explanatory diagrams showing the determination operation related to the sound environment by the echo loss estimation unit 110.

図５、図６では、縦軸はパワーの大きさ、横軸は、エコーロス推定部１１０で用いる音環境状態の状態種別と状態番号（以下、「ＥＰ＿ＳＴＡＴ」ともいう）を表している。 5 and 6, the vertical axis represents the power magnitude, and the horizontal axis represents the state type and state number (hereinafter also referred to as “EP_STAT”) of the sound environment state used in the echo loss estimation unit 110.

エコーロス推定部１１０では、音環境状態を、状態１（ＥＰ＿ＳＴＡＴ：１、状態種別：良好状態）、状態２（ＥＰ＿ＳＴＡＴ：２、状態種別：ノイズ状態）、状態３（ＥＰ＿ＳＴＡＴ：３、状態種別：不適状態）、状態４（ＥＰ＿ＳＴＡＴ：４、状態種別：増幅状態）のいずれかと判定する。以下、状態１〜状態４のそれぞれについて説明する。 In the echo loss estimation unit 110, the sound environment state is set to state 1 (EP_STAT: 1, state type: good state), state 2 (EP_STAT: 2, state type: noise state), state 3 (EP_STAT: 3, state type: unsuitable). State) and state 4 (EP_STAT: 4, state type: amplification state). Hereinafter, each of the states 1 to 4 will be described.

［状態１（状態種別：良好状態、状態番号（ＥＰ＿ＳＴＡＴ）：１）の場合］
エコーロス推定部１１０は、スピーカ４０から放射される信号に係るパワー最大値Ｐ＿ＭＡＸ（ｋ）と、エコーに係るパワー最大値ＥＣ＿ＭＡＸ（ｋ）、推定背景ノイズパワーＢＧＮ’（ｋ）が順に小さくなっている場合に、音環境状態を、状態１（良好状態）と判定する。なお、状態１（良好状態）における、Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）、ＢＧＮ’（ｋ）の関係は、図５（ａ）に図示されている。すなわち、状態１（良好状態）は、スピーカ出力信号よりも、エコーが小さく、さらにその下に背景ノイズがある状態といえる。 [In the case of state 1 (state type: good state, state number (EP_STAT): 1)]
In the echo loss estimation unit 110, the power maximum value P_MAX (k) related to the signal radiated from the speaker 40, the power maximum value EC_MAX (k) related to the echo, and the estimated background noise power BGN ′ (k) are sequentially decreased. In this case, the sound environment state is determined as state 1 (good state). The relationship among P_MAX (k), EC_MAX (k), and BGN ′ (k) in state 1 (good state) is illustrated in FIG. That is, state 1 (good state) can be said to be a state in which the echo is smaller than the speaker output signal and there is background noise below it.

このとき、以下の（１０）式に示すように、Ｐ＿ＭＡＸ（ｋ）とＥＣ＿ＭＡＸ（ｋ）の差ｄｉｆｆ＿ＰＯＷ（ｋ）は、エコー経路の減衰の量を正確に表している。そして、このｄｉｆｆ＿ＰＯＷ（ｋ）は、以下の（１０）式に示すように、エコーロスＡＥＣＨＯ（ｋ）を正しく表している。 At this time, as shown in the following equation (10), the difference diff_POW (k) between P_MAX (k) and EC_MAX (k) accurately represents the amount of attenuation of the echo path. The diff_POW (k) correctly represents the echo loss AECHO (k) as shown in the following equation (10).

ＡＥＣＨＯ（ｋ）＝ｄｉｆｆ＿ＰＯＷ（ｋ）
＝Ｐ＿ＭＡＸ（ｋ）−ＥＣ＿ＭＡＸ（ｋ）［ｄＢ］ …（１０）
また、上記（１０）式では、Ｐ＿ＭＡＸ（ｋ）とＥＣ＿ＭＡＸ（ｋ）の差分を用いてｄｉｆｆ＿ＰＯＷ（ｋ）を求めたが、以下の（１０）’式に示すように、線形比で求めるようにしても良い。以下の（１０）’式を用いてとしても、上記（１０）式と意味するところは同じである。ここでは、説明を簡易にするため、ＡＥＣＨＯ（ｋ）は、上記（１０）式のような差分の表現で説明するが、これに限定されないのは当然である。 AECHO (k) = diff_POW (k)
= P_MAX (k) -EC_MAX (k) [dB] (10)
In the above equation (10), diff_POW (k) is obtained by using the difference between P_MAX (k) and EC_MAX (k). However, as shown in the following equation (10) ′, it is obtained by a linear ratio. May be. Even if the following expression (10) ′ is used, the same meaning as the above expression (10) is the same. Here, in order to simplify the description, AECHO (k) will be described using a differential expression such as the above equation (10), but it is natural that the present invention is not limited to this.

ＡＥＣＨＯ（ｋ）＝ｄｉｆｆ＿ＰＯＷ（ｋ）
＝ＥＣ＿ＭＡＸ（ｋ）／Ｐ＿ＭＡＸ（ｋ） …（１０）’
すなわち、状態１（良好状態）のとき、エコー経路を通過する事で、スピーカ４０から放射された音信号は、減衰を受けることを示している。上述の通り、このようなエコー経路の系構成では、エコーは通信経路循環で発散に向かうことはないから、エコーが成長してハウリングは発生しない。 AECHO (k) = diff_POW (k)
= EC_MAX (k) / P_MAX (k) (10) '
That is, in the state 1 (good state), the sound signal radiated from the speaker 40 is attenuated by passing through the echo path. As described above, in such a system configuration of the echo path, since the echo does not go to divergence by the communication path circulation, the echo grows and no howling occurs.

そして、エコーロス推定部１１０は、状態１（良好状態）を検出すると、状態１（良好状態）を表すＥＰ＿ＳＴＡＴ＝１とｄｉｆｆ＿ＰＯＷ（ｋ）を減衰量計算部１０９に与える。 Then, when detecting the state 1 (good state), the echo loss estimation unit 110 gives EP_STAT = 1 and diff_POW (k) representing the state 1 (good state) to the attenuation amount calculation unit 109.

［状態２（状態種別：ノイズ状態、状態番号（ＥＰ＿ＳＴＡＴ）：２）の場合］
推定背景ノイズパワーＢＧＮ’（ｋ）が、エコーに係るパワー最大値ＥＣ＿ＭＡＸ（ｋ）を上回る強さの場合、エコーロス推定部１１０は、音環境状態を状態２（ノイズ状態）と判定する。なお、状態２（ノイズ状態）における、Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）、ＢＧＮ’（ｋ）の関係は、図５（ｂ）に図示されている。状態２（ノイズ状態）の場合、ＥＣ＿ＭＡＸ（ｋ）はＢＧＮ’（ｋ）に埋もれてしまい、本当のエコーロスは背景ノイズのために観測が不可能な状態である。 [In the case of state 2 (state type: noise state, state number (EP_STAT): 2)]
When the estimated background noise power BGN ′ (k) has a strength exceeding the power maximum value EC_MAX (k) related to the echo, the echo loss estimation unit 110 determines the sound environment state as state 2 (noise state). The relationship among P_MAX (k), EC_MAX (k), and BGN ′ (k) in state 2 (noise state) is illustrated in FIG. 5B. In state 2 (noise state), EC_MAX (k) is buried in BGN ′ (k), and real echo loss cannot be observed due to background noise.

しかし、その一方で、状態２（ノイズ状態）の場合には、Ｐ＿ＭＡＸ（ｋ）に比較してＥＣ＿ＭＡＸ（ｋ）が相当小さいという事実を反映しており、たまたま真のＡＥＣＨＯ（ｋ）が計算できないに過ぎず、やはりハウリングが発生しない安全な状態である。 On the other hand, however, in the case of state 2 (noise state), it reflects the fact that EC_MAX (k) is considerably smaller than P_MAX (k), and it is impossible to calculate true AECHO (k) by chance. This is a safe state where no howling occurs.

状態２（ノイズ状態）の場合、ＢＧＮ’（ｋ）の意味合いとしては、ＥＣ＿ａｖ（ｎ）の最小値を追尾したものであるから、ＥＣ＿ＭＡＸ（ｋ）とＢＧＮ’（ｋ）が同じように見えてしまっており、実際にはＥＣ＿ＭＡＸ（ｋ）＝ＢＧＮ’（ｋ）となっている。したがって、状態２（ノイズ状態）の場合に、上記（１０）式を計算することは、実際にはＰ＿ＭＡＸ（ｋ）−ＢＧＮ’（ｋ）を求めるのと同じであるが、エコーロス値としては減衰方向の値が計算される。 In the case of state 2 (noise state), the meaning of BGN ′ (k) is the result of tracking the minimum value of EC_av (n), so EC_MAX (k) and BGN ′ (k) look the same. Actually, EC_MAX (k) = BGN ′ (k). Therefore, in the case of state 2 (noise state), calculating the above equation (10) is actually the same as obtaining P_MAX (k) −BGN ′ (k), but the echo loss value is attenuated. A direction value is calculated.

エコーロス推定部１１０は、状態２（ノイズ状態）を検出すると、状態２（ノイズ状態）を表すＥＰ＿ＳＴＡＴ＝２と、ｄｉｆｆ＿ＰＯＷ（ｋ）を減衰量計算部１０９に与える。 When detecting the state 2 (noise state), the echo loss estimation unit 110 gives EP_STAT = 2 representing the state 2 (noise state) and diff_POW (k) to the attenuation amount calculation unit 109.

［状態３（状態種別：不適状態、状態番号（ＥＰ＿ＳＴＡＴ）：３）の場合］
状態２（ノイズ状態）から、さらに推定背景ノイズパワーＢＧＮ’（ｋ）が増加し、Ｐ＿ＭＡＸ（ｋ）を超えると、エコーロス推定部１１０は、音環境状態を状態３（不適状態）と判定する。なお、状態３（不適状態）における、Ｐ＿ＭＡＸ（ｋ）、ＥＣ＿ＭＡＸ（ｋ）、ＢＧＮ’（ｋ）の関係は、図５（ｃ）に図示されている。 [In the case of state 3 (state type: inappropriate state, state number (EP_STAT): 3)]
When the estimated background noise power BGN ′ (k) further increases from state 2 (noise state) and exceeds P_MAX (k), the echo loss estimation unit 110 determines the sound environment state to be state 3 (unsuitable state). The relationship among P_MAX (k), EC_MAX (k), and BGN ′ (k) in the state 3 (unsuitable state) is illustrated in FIG.

状態３（不適状態）では、Ｐ＿ＭＡＸ（ｋ）よりもマイクロホン５０に入力されるＢＧＮ’（ｋ）のほうが大きいことを意味している。状態３（不適状態）は、言い換えれば、近端話者の置かれた状態としては、周囲の強力な背景ノイズの中でスピーカ４０から放射された相手音声を懸命に聞き取るような状態のはずである。したがって、電話装置１Ａにおける通話手段として、ハンズフリー通話の利用は、本来不適な状態であるといえる。当然このような状態ではもはや正しいＡＥＣＨＯ（ｋ）の推定は不可能である。 In state 3 (unsuitable state), it means that BGN ′ (k) input to the microphone 50 is larger than P_MAX (k). State 3 (inappropriate state), in other words, as a state where the near-end speaker is placed, should be a state where the other party's voice radiated from the speaker 40 in the surrounding strong background noise can be heard hard. is there. Accordingly, it can be said that the use of hands-free calling is originally unsuitable as a calling means in the telephone device 1A. Naturally, in such a state, it is no longer possible to estimate AECHO (k) correctly.

エコーロス推定部１１０は、状態３（不適状態）の場合であっても、上記（１０）式を用いて、ＡＥＣＨＯ（ｋ）を求める計算は行うが、エコーと背景ノイズの関係は、状態２（ノイズ状態）と同様である。 The echo loss estimation unit 110 performs calculation to obtain AECHO (k) using the above equation (10) even in the state 3 (unsuitable state), but the relationship between the echo and the background noise is the state 2 ( Noise state).

そして、エコーロス推定部１１０は、状態３（不適状態）を検出すると、状態３（不適状態）を表すＥＰ＿ＳＴＡＴ＝３とｄｉｆｆ＿ＰＯＷ（ｋ）を減衰量計算部１０９に与える。ただし、状態３（不適状態）の場合、ｄｉｆｆ＿ＰＯＷ（ｋ）は減衰の量ではなく、むしろ増幅の量を表す値として計算される。すなわち、状態３（不適状態）の状態では、ハウリングが発生する危険度が高いといえる。 Then, when detecting the state 3 (unsuitable state), the echo loss estimation unit 110 gives EP_STAT = 3 and diff_POW (k) representing the state 3 (unsuitable state) to the attenuation amount calculation unit 109. However, in the case of state 3 (unsuitable state), diff_POW (k) is not calculated as an amount of attenuation but rather as a value representing the amount of amplification. That is, in the state 3 (unsuitable state), it can be said that there is a high risk of howling.

［状態４（状態種別：増幅状態、状態番号（ＥＰ＿ＳＴＡＴ）：４）の場合］
エコーロス推定部１１０は、スピーカ４０から放射される信号に係るパワー最大値Ｐ＿ＭＡＸ（ｋ）より、エコーに係るパワー最大値ＥＣ＿ＭＡＸ（ｋ）が大きい場合、すなわち、スピーカ４０から放射された信号のパワーが、エコー経路で増幅している場合に、音環境状態を、状態４（増幅状態）と判定する。 [In the case of state 4 (state type: amplification state, state number (EP_STAT): 4)]
The echo loss estimation unit 110 has a power maximum value EC_MAX (k) related to the echo larger than the power maximum value P_MAX (k) related to the signal radiated from the speaker 40, that is, the power of the signal radiated from the speaker 40 is higher. When the sound path is amplified by the echo path, the sound environment state is determined as state 4 (amplified state).

通常、空気伝播の結果で自然に音のパワーが増加することはない。このような場合の多くではマイクロホン５０からエコーロス推定部１１０に至るエコー経路のどこかに、設計者の意図しない人工的な増幅機能が混入しているのがほとんどである。この状態を放置した場合、電話装置１Ａにおいて、音声信号が巡回しつつパワー上昇をつづけ、確実にハウリングを発生する。すなわち、状態４では、電話装置１Ａにおいて、ハウリングが発生する危険度が極めて高い。 Usually, the sound power does not increase naturally as a result of air propagation. In many cases, an artificial amplification function not intended by the designer is mixed in somewhere in the echo path from the microphone 50 to the echo loss estimation unit 110. If this state is left unattended, in the telephone device 1A, the power continues to increase while the audio signal circulates, and howling is surely generated. That is, in the state 4, the risk of howling occurring in the telephone device 1A is extremely high.

なお、図６（ａ）では、ＢＧＮ’（ｋ）がＰ＿ＭＡＸ（ｋ）よりも小さい場合を「状態４」と図示し、図６（ｂ）では、ＢＧＮ’（ｋ）がＰ＿ＭＡＸ（ｋ）よりも大きい場合を「状態４’」と図示している。しかし、ＥＣ＿ＭＡＸ（ｋ）の方が、Ｐ＿ＭＡＸ（ｋ）より大きい状態においては、その区別はあまり重要でないので、以下では、上述の「状態４」及び「状態４’」を、まとめて「状態４」として説明する。 In FIG. 6A, the case where BGN ′ (k) is smaller than P_MAX (k) is illustrated as “state 4”, and in FIG. 6B, BGN ′ (k) is derived from P_MAX (k). Is also shown as “state 4 ′”. However, in the state where EC_MAX (k) is larger than P_MAX (k), the distinction is not so important. Therefore, hereinafter, “state 4” and “state 4 ′” are collectively referred to as “state 4”. ".

そして、エコーロス推定部１１０は、状態４（増幅状態）を検出すると、状態４（増幅状態）を表すＥＰ＿ＳＴＡＴ＝４と、ｄｉｆｆ＿ＰＯＷ（ｋ）を減衰量計算部１０９に与える。 Then, when detecting the state 4 (amplification state), the echo loss estimation unit 110 gives EP_STAT = 4 representing the state 4 (amplification state) and diff_POW (k) to the attenuation amount calculation unit 109.

（Ｂ−２−２）減衰量計算部１０９の動作について
減衰量計算部１０９は、エコーロス推定部１１０から与えられる音環境状態の状態番号ＥＰ＿ＳＴＡＴに応じて、減衰器１０７、１１１に設定する減衰量を決定する。 (B-2-2) Operation of attenuation amount calculation unit 109 The attenuation amount calculation unit 109 sets the attenuation amount to be set in the attenuators 107 and 111 according to the state number EP_STAT of the sound environment state given from the echo loss estimation unit 110. To decide.

以下、与えられるＥＰ＿ＳＴＡＴの値ごとに、減衰量計算部１０９の動作を説明する。 Hereinafter, the operation of the attenuation amount calculation unit 109 will be described for each given EP_STAT value.

［状態１（良好状態）、又は、状態２（ノイズ状態）の場合］
まず、エコーロス推定部１１０から与えられた判定結果が、ＥＰ＿ＳＴＡＴ＝１（状態１、良好状態）、又は、ＥＰ＿ＳＴＡＴ＝２（状態２、ノイズ状態）の場合の、減衰量計算部１０９の動作について説明する。上述の通り、状態１又は状態２の場合には、電話装置１Ａにおいてハウリングが発生する危険度は少ない。 [In the case of state 1 (good state) or state 2 (noise state)]
First, the operation of the attenuation calculation unit 109 when the determination result given from the echo loss estimation unit 110 is EP_STAT = 1 (state 1, good state) or EP_STAT = 2 (state 2, noise state) will be described. To do. As described above, in the case of the state 1 or the state 2, the risk of howling occurring in the telephone device 1A is small.

状態１又は状態２の場合、減衰量計算部１０９は、エコー感（近端話者にとってエコーによる違和感を感知する程度）の軽減のために、減衰器１０７に第１の実施形態で適用した減衰量と同様の減衰量を通知する。そして、減衰器１０７は減衰量計算部１０９から通知された減衰量の減衰を、信号ｙｄ（ｎ）に与えてａｔｔ（ｎ）を出力する。 In the case of the state 1 or the state 2, the attenuation amount calculation unit 109 performs the attenuation applied to the attenuator 107 in the first embodiment in order to reduce the feeling of echo (the degree to which the near-end speaker perceives discomfort due to the echo). The amount of attenuation similar to the amount is notified. The attenuator 107 gives the attenuation amount notified from the attenuation amount calculation unit 109 to the signal yd (n) and outputs att (n).

また、状態１又は状態２の場合、減衰量計算部１０９は、減衰器１１１には何も通知しないようにしても良いし、減衰を行わない旨を示す値（例えば、ｇａｉｎ＝０等）を通知するようにしても良い。このとき減衰器１１１は、エコーｙ（ｎ）を含む近端話者信号を、透過する（減衰を行わないようにする）。 Further, in the case of the state 1 or the state 2, the attenuation amount calculation unit 109 may not notify the attenuator 111 of anything, and may indicate a value indicating that no attenuation is performed (for example, gain = 0). You may make it notify. At this time, the attenuator 111 transmits the near-end speaker signal including the echo y (n) (does not attenuate).

［状態３（不適状態）の場合］
次に、エコーロス推定部１１０から与えられた判定結果が、ＥＰ＿ＳＴＡＴ＝３（状態３、不適状態）の場合の、減衰量計算部１０９の動作について説明する。 [In case of state 3 (unsuitable state)]
Next, the operation of the attenuation amount calculation unit 109 when the determination result given from the echo loss estimation unit 110 is EP_STAT = 3 (state 3, unsuitable state) will be described.

上述の通り、電話装置１Ａを、ハンズフリー通話機器に適用した場合、状態３（不適状態）は、利用が不適な状態である。すなわち、状態３（不適状態）の場合には、近端側の背景ノイズのために、遠端話者と近端話者の音声によるやり取りが困難になっている可能性が大きい。さらに、上述の通り、エコーロス推定部１１０において計算されたｄｉｆｆ＿ＰＯＷ（ｋ）は、ＡＥＣＨＯ（ｋ）を表しているとは限らない。 As described above, when the telephone device 1A is applied to a hands-free calling device, the state 3 (unsuitable state) is a state in which use is inappropriate. That is, in the case of the state 3 (unsuitable state), there is a high possibility that the voice communication between the far-end speaker and the near-end speaker is difficult due to background noise on the near-end side. Furthermore, as described above, diff_POW (k) calculated by the echo loss estimation unit 110 does not necessarily represent AECHO (k).

状態３（不適状態）の場合には、上述の状態４’においてＥＣ＿ＭＡＸ（ｋ）＝ＢＧＮ’（ｋ）となった特殊な場合と考えることもできる。すなわち、状態３（不適状態）の場合の減衰量計算部１０９の動作は、状態４（増幅状態）の時の動作に順ずる。状態４（増幅状態）の時の減衰量計算部１０９の動作については後述する。 The state 3 (unsuitable state) can be considered as a special case where EC_MAX (k) = BGN '(k) in the state 4' described above. That is, the operation of the attenuation amount calculation unit 109 in the state 3 (unsuitable state) is in accordance with the operation in the state 4 (amplified state). The operation of the attenuation amount calculation unit 109 in the state 4 (amplification state) will be described later.

［状態４（増幅状態）の場合］
次に、エコーロス推定部１１０から与えられた判定結果が、ＥＰ＿ＳＴＡＴ＝４（状態４、増幅状態）の場合の、減衰量計算部１０９の動作について説明する。 [In case of state 4 (amplification state)]
Next, the operation of the attenuation amount calculation unit 109 when the determination result given from the echo loss estimation unit 110 is EP_STAT = 4 (state 4, amplification state) will be described.

上述の通り、電話装置１Ａをハンズフリー通話機器に適用した場合、状態４（増幅状態）は、ハウリングが発生する危険度が極めて高い状態である。 As described above, when the telephone device 1A is applied to a hands-free calling device, the state 4 (amplified state) is a state in which the risk of occurrence of howling is extremely high.

状態４（増幅状態）の場合、減衰量計算部１０９は、減衰器１０７に第１の実施形態と同様の減衰量を通知する。また、状態４（増幅状態）の場合、減衰量計算部１０９は、減衰器１１１に対しては、減衰器１０７に通知した増幅（減衰）を打ち消す減衰量を通知する。上述したように、状態４（増幅状態）の場合、ｄｉｆｆ＿ＰＯＷ（ｋ）の値は実際には減衰の量ではなく、差分計算の順序のために、増幅の量を表す値となっている。しがたって、減衰量計算部１０９は、ｄｉｆｆ＿ＰＯＷ（ｋ）の正負を反転して、ｇａｉｎ＝−ｄｉｆｆ＿ＰＯＷ（ｋ）と変換し、減衰器１１１に通知する。さらに、減衰量計算部１０９は、減衰器１１１の減衰のマージンとして、δａｔｔ２０（たとえば、６ｄＢ）の減衰量を追加した減衰量を、減衰器１１１に通知するようにしても良い。なお、このマージンの減衰量δａｔｔ２０は６ｄＢに限定されないものである。 In the state 4 (amplification state), the attenuation calculation unit 109 notifies the attenuator 107 of the same attenuation as in the first embodiment. In the state 4 (amplification state), the attenuation amount calculation unit 109 notifies the attenuator 111 of an attenuation amount that cancels the amplification (attenuation) notified to the attenuator 107. As described above, in the case of state 4 (amplified state), the value of diff_POW (k) is not actually the amount of attenuation, but is a value representing the amount of amplification because of the order of difference calculation. Therefore, the attenuation amount calculation unit 109 inverts the sign of diff_POW (k), converts it to gain = −diff_POW (k), and notifies the attenuator 111 of the result. Furthermore, the attenuation amount calculation unit 109 may notify the attenuator 111 of an attenuation amount obtained by adding an attenuation amount of δatt20 (for example, 6 dB) as an attenuation margin of the attenuator 111. The margin attenuation δatt20 is not limited to 6 dB.

状態４（増幅状態）の場合、上述のように減衰量計算部１０９及びエコーロス推定部１１０を動作させると、減衰器１１１の減衰で、ＢＧＮ’（ｋ）のパワーは少なくともＰ＿ＭＡＸ（ｋ）−δａｔｔ２０（上述のマージンの減衰量）以下のレベルまで下がり、受信出力端子Ｒｏｕｔから、加算器１０１にいたるまでのエコー経路は必ず減衰経路となる。 In the state 4 (amplification state), when the attenuation calculation unit 109 and the echo loss estimation unit 110 are operated as described above, the power of BGN ′ (k) is at least P_MAX (k) −δatt20 due to the attenuation of the attenuator 111. The echo path from the reception output terminal Rout to the adder 101 is always an attenuation path.

また、減衰量計算部１０９から減衰器１１１に通知する減衰量に、上述のマージンの減衰量δａｔｔ２０（６ｄＢ）を加えることにより、一層送信方向の信号（近端入力信号）のレベルを、Ｐ＿ＭＡＸ（ｋ）以下に限定できる。これにより、適応フィルタ１０２が収束するまでの間、双方向通話検出器１０３がエコーｙ（ｎ）と送信話者信号を、誤判定するのを防止することができる。 Further, by adding the above-mentioned margin attenuation amount δatt20 (6 dB) to the attenuation amount notified from the attenuation amount calculation unit 109 to the attenuator 111, the level of the signal in the transmission direction (near-end input signal) is further increased to P_MAX ( k) It can be limited to the following. This can prevent the bidirectional call detector 103 from erroneously determining the echo y (n) and the transmitted speaker signal until the adaptive filter 102 converges.

すなわち、エコーキャンセラ１０Ａでは、遠端側で、極端な信号の増幅帰還がない限り、ハウリングの発生を防止できる。 That is, in the echo canceller 10A, occurrence of howling can be prevented unless there is an extreme signal amplification feedback on the far end side.

さらに、エコーキャンセラ１０Ａでは、第１の実施形態と同様に、スイッチ１０８で、音声スイッチ部側（遅延部１０５及び減衰器１０７）の経路を選択されたときの動作で、エコーｙ（ｎ）が、減衰器１０７で減衰される。したがって、予めエコー経路の様子を知ることができない場合であっても、ハウリングの発生をより確実に防止する事ができる。 Further, in the echo canceller 10A, as in the first embodiment, the echo y (n) is generated when the switch 108 selects the path on the voice switch unit side (the delay unit 105 and the attenuator 107). Attenuator 107 attenuates. Therefore, even when the state of the echo path cannot be known in advance, the occurrence of howling can be prevented more reliably.

（Ｂ−３）第２の実施形態の効果
第２の実施形態によれば、以下のような効果を奏することができる。 (B-3) Effects of Second Embodiment According to the second embodiment, the following effects can be achieved.

エコーキャンセラ１０Ａでは、エコーロス推定部１１０で、スピーカ４０を経由して出力される受信信号の最大パワー値Ｐ＿ＭＡＸ（ｋ）と、エコーの最大パワー値ＥＣ＿ＭＡＸ（ｋ）と、マイクロホン５０側の背景ノイズの推定値ＢＧＮ’（ｋ）とを計算している。 In the echo canceller 10A, the echo loss estimation unit 110 receives the maximum power value P_MAX (k) of the received signal output via the speaker 40, the maximum power value EC_MAX (k) of the echo, and the background noise on the microphone 50 side. An estimated value BGN ′ (k) is calculated.

そして、エコーロス推定部１１０では、エコー経路における信号の増減や背景ノイズといった周囲環境に応じて、減衰器１１１でマイクロホン５０の出力（近端入力信号）を減衰するようにしている。これにより、エコーキャンセラ１０Ａでは、予め知ることができないエコー経路の中に増幅の要素があったときにも、特段の熟練設計者の設定を必要とせずに、エコーの増幅分を自動的に減衰器１１１で相殺してハウリングを防止することができる。 The echo loss estimation unit 110 attenuates the output (near-end input signal) of the microphone 50 by the attenuator 111 in accordance with the surrounding environment such as signal increase / decrease and background noise in the echo path. As a result, the echo canceller 10A automatically attenuates the amplification of the echo without the need for special expert designer settings even when there is an amplification factor in the echo path that cannot be known in advance. Howling can be prevented by canceling out by the device 111.

（Ｃ）第３の実施形態
以下、本発明によるエコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置の第３の実施形態を、図面を参照しながら詳述する。 (C) Third Embodiment Hereinafter, an echo canceller, an echo cancellation program, and a telephone device according to a third embodiment of the present invention will be described in detail with reference to the drawings.

第１、第２の実施形態のエコーキャンセラ１０、１０Ａでは、上述の通り、人同士の会話のやり取りが、基本的にはフレーズが交互にやり取りされることで会話が成り立っていることに着目し、遠端からの信号があるとき（遠端入力信号ｘ（ｎ）が有音状態の場合）には、遠端側の音声（遠端入力信号ｘ（ｎ））を劣化させないように、受信側の経路の信号を透過してスピーカ４０に出力し、送信側（近端から遠端への方向の信号送信）では、音声スイッチ部により、減衰を挿入する経路を選択することで、ハウリングを確実に防止している。そして、第１、第２の実施形態のエコーキャンセラ１０、１０Ａでは、遠端からの信号が終了（遠端入力信号ｘ（ｎ）の有音状態が終了）すると同時に、自動的に送信側経路として、一旦減衰なしの経路（音声スイッチ部を経由しない経路）を選択することで、近端話者が発話しても発話の最初及び最後の欠損がおこらないようにしている。 In the echo cancellers 10 and 10A of the first and second embodiments, as described above, attention is paid to the fact that conversations between people basically consist of phrases being exchanged alternately. When there is a signal from the far end (when the far end input signal x (n) is in a voiced state), reception is performed so as not to degrade the far end side voice (far end input signal x (n)). Side signal is transmitted and output to the speaker 40, and on the transmitting side (signal transmission in the direction from the near end to the far end), the voice switch unit selects a path for inserting attenuation, thereby performing howling. It is surely prevented. In the echo cancellers 10 and 10A according to the first and second embodiments, the signal from the far end ends (the voiced state of the far end input signal x (n) ends), and at the same time, the transmission side path automatically. As described above, once a route without attenuation (a route that does not pass through the voice switch unit) is selected, even if a near-end speaker utters, the first and last loss of utterance does not occur.

そして、第１、第２の実施形態のエコーキャンセラ１０、１０Ａでは、一瞬発生する双方向通信状態（ダブルトーク状態（ＤＴ））、又は、送信状態（近端トーク状態（ＮＴ））においてのみ、エコーキャンセラ部（適応フィルタ及び加算器）の出力（信号ｅ（ｎ））を選択して遠端側へ送信している。これにより、第１、第２の実施形態のエコーキャンセラ１０、１０Ａでは、エコー経路減衰の確保よりも、発話の最初及び最後の欠損防止を優先させている。 And in the echo cancellers 10 and 10A of the first and second embodiments, only in the bi-directional communication state (double talk state (DT)) or the transmission state (near end talk state (NT)) that occurs for a moment, The output (signal e (n)) of the echo canceller (adaptive filter and adder) is selected and transmitted to the far end side. Thereby, in the echo cancellers 10 and 10A of the first and second embodiments, priority is given to prevention of loss at the beginning and end of the utterance rather than ensuring the echo path attenuation.

これは、音声スイッチ部特有の、近端話者の音声が強力な減衰を受けたり、減衰の程度が激変することで遠端話者が聞き取りにくくなって会話が破綻することを防止するものである。 This is unique to the voice switch section and prevents the voice of the near-end speaker from being strongly attenuated, or the degree of attenuation drastically changing, making it difficult for the far-end speaker to hear and disrupting the conversation. is there.

そこで、第３の実施形態では、さらに通話品質を向上するため、音声スイッチ部の利用頻度をやや下げ、よりエコーキャンセラ部の出力を利用する機会を多くして、通話品質を向上する構成となっている。 Therefore, in the third embodiment, in order to further improve the call quality, the frequency of use of the voice switch unit is slightly lowered, the opportunity to use the output of the echo canceller unit is increased, and the call quality is improved. ing.

（Ｃ−１）第３の実施形態の構成
図７は、第３の実施形態の電話装置１Ｂの全体構成を示すブロック図であり、上述した図３との同一、対応部分には同一、対応符号を付して示している。 (C-1) Configuration of the Third Embodiment FIG. 7 is a block diagram showing the overall configuration of the telephone device 1B of the third embodiment, which is the same as FIG. A reference numeral is attached.

以下、第３の実施形態について、第２の実施形態との差異を説明する。 Hereinafter, the difference between the third embodiment and the second embodiment will be described.

電話装置１Ｂでは、エコーキャンセラ１０Ａが、エコーキャンセラ１０Ｂに置き換わっている点で第２の実施形態と異なっている。 The telephone device 1B is different from the second embodiment in that an echo canceller 10A is replaced with an echo canceller 10B.

また、エコーキャンセラ１０Ｂでは、エコー減衰量計算部（以下、「ＡＣＯＭ」ともいう）１１２、ハウリング危険度判定部（以下、「ＨＷＭ判定部」ともいう）１１３、及びエコーキャンセラ効果判定部（以下、「ＥＣ判定部」ともいう）１１４が追加されている点で、第２の実施形態と異なっている。また、エコーキャンセラ１０Ｂでは、制御部１０４、スイッチ１０８、及びエコーロス推定部１１０が、制御部１０４Ｂ、スイッチ１０８Ｂ、及びエコーロス推定部１１０Ｂに置き換わっている点で、第２の実施形態と異なっている。 In the echo canceller 10B, an echo attenuation amount calculation unit (hereinafter also referred to as “ACOM”) 112, a howling risk determination unit (hereinafter also referred to as “HWM determination unit”) 113, and an echo canceller effect determination unit (hereinafter, referred to as “AWM”). The second embodiment is different from the second embodiment in that an “EC determination unit” 114 is added. The echo canceller 10B is different from the second embodiment in that the control unit 104, the switch 108, and the echo loss estimation unit 110 are replaced with the control unit 104B, the switch 108B, and the echo loss estimation unit 110B.

（Ｃ−２）第３の実施形態の動作
次に、以上のような構成を有する第３の実施形態の電話装置１Ｂにおけるエコーキャンセラ１０Ｂの動作について、第２の実施形態との差異を説明する。 (C-2) Operation of the Third Embodiment Next, the difference between the operation of the echo canceller 10B in the telephone device 1B of the third embodiment having the above configuration and the second embodiment will be described. .

まず、エコーロス推定部１１０Ｂの動作について、第２の実施形態との差異を説明する。 First, the difference between the operation of the echo loss estimation unit 110B and the second embodiment will be described.

エコーロス推定部１１０Ｂの動作は、第２の実施形態とほぼ同様であるが、推定した背景ノイズ推定値ＢＧＮ’（ｋ）をＥＣ判定部１１４にも与える点が異なっている。 The operation of the echo loss estimation unit 110B is substantially the same as that of the second embodiment, except that the estimated background noise estimated value BGN '(k) is also given to the EC determination unit 114.

次に、ＥＣ判定部１１４の動作について説明する。 Next, the operation of the EC determination unit 114 will be described.

ＥＣ判定部１１４には、加算器１０１から信号ｅ（ｎ）が供給されている。そして、ＥＣ判定部１１４は、後述するように、エコーキャンセラ部の性能が、背景ノイズに応じた適正値まで性能を発揮しているか否かを判定して、良好状態であると判定したときにはＨＷＭ判定部１１３に対し、性能良好を示すエコーキャンセル成功信号ｃｏｎｖを出力し、それ以外のときにはなにも出力しない。 The EC determination unit 114 is supplied with the signal e (n) from the adder 101. Then, as will be described later, the EC determination unit 114 determines whether or not the performance of the echo canceller unit is performing up to an appropriate value corresponding to the background noise. An echo cancellation success signal conv indicating good performance is output to the determination unit 113, and nothing is output at other times.

ＥＣ判定部１１４は、エコーロス推定部１１０Ｂからの背景ノイズ推定値ＢＧＮ’（ｋ）のほかに、加算器１０１から信号ｅ（ｎ）が供給されている。ＥＣ判定部１１４では、第２の実施形態でのエコーロス推定部１１０と同様に、ｅ（ｎ）について、双方向通話検出器１０３からの通話状態判定結果が、遠端シングルトーク状態（ＳＴ）の時だけ、ＢＧＮ’（ｋ）と同期間（例えば上述のＴ＿ＥＬＥと同期間）で、以下の（１１）式のように、ｅ（ｎ）の平滑値Ｅｒｒ＿ａｖ（ｎ）を計算する。そして、ＥＣ判定部１１４は、さらにその所定期間Ｔ＿ＥＬＥでの、Ｅｒｒ＿ａｖ（ｎ）の最大値ＥＲＲ＿ＭＡＸ（ｋ）を計算する。以下の（１１）式において、ｎ、ｋおよび、δｐａｖの意味合いは第２の実施形態と同じである。 The EC determination unit 114 is supplied with a signal e (n) from the adder 101 in addition to the background noise estimation value BGN ′ (k) from the echo loss estimation unit 110B. In the EC determination unit 114, as in the echo loss estimation unit 110 in the second embodiment, the call state determination result from the two-way call detector 103 is the far-end single talk state (ST) for e (n). Only when the smoothing value Err_av (n) of e (n) is calculated between BGN ′ (k) and the synchronization (for example, between the above-described T_ELE and the synchronization) as in the following equation (11). Then, the EC determination unit 114 further calculates the maximum value ERR_MAX (k) of Err_av (n) during the predetermined period T_ELE. In the following formula (11), the meanings of n, k, and δpav are the same as those in the second embodiment.

Ｅｒｒ＿ａｖ（ｎ）＝δｐａｖ×ｅ（ｎ）^２
＋（１．０−δｐａｖ）×Ｅｒｒ＿ａｖ（ｎ） …（１１）
そして、ＥＣ判定部１１４は、計算したＥＲＲ＿ＭＡＸ（ｋ）と、エコーロス推定部１１０Ｂから与えられたＢＧＮ’（ｋ）を、以下の（１２）式のように比較する。ここでは、以下の（１２）式において、δｅｃ＿ｍ＝２ｄＢと設定するものとして説明するが、δｅｃ＿ｍの値はこれに限定されない。 Err_av (n) = δpav × e (n) ²
+ (1.0−δpav) × Err_av (n) (11)
Then, the EC determination unit 114 compares the calculated ERR_MAX (k) and BGN ′ (k) given from the echo loss estimation unit 110B as in the following equation (12). Here, in the following equation (12), it is assumed that δec_m = 2 dB is set, but the value of δec_m is not limited to this.

ＥＲＲ＿ＭＡＸ（ｋ）≦ＢＧＮ’（ｋ）＋δｅｃ＿ｍ …（１２）
次に、上記（１２）式の意味について説明する。 ERR_MAX (k) ≦ BGN ′ (k) + δec_m (12)
Next, the meaning of the above equation (12) will be described.

一般に適応フィルタと加算器を用いたエコーキャンセラ部のエコー除去性能は、エコーと同時にマイクロホンに入力した背景ノイズパワー（レベル）までが限界となることが公知である。通常エコーキャンセラ部は、精度よくエコー打消しができるようになるためには、第１の実施形態で説明したように、適応フィルタのフィルタ係数の更新を繰りかえす必要があり、最初のうちは上記（１２）式のような関係は成立しない。したがって、双方向通話検出器１０３からの状態検出が、遠端シングルトーク状態（ＳＴ）の時に、上記（１２）式が成立するということは、適応フィルタ１０２の係数更新が進行し、加算器１０１の出力パワーが、背景ノイズパワーＢＧＮ’（ｋ）近くまで小さくなったということであり、δｅｃ＿ｍは効果確認のマージン閾値であると把握できる。 It is known that the echo cancellation performance of an echo canceller using an adaptive filter and an adder is generally limited to the background noise power (level) input to the microphone simultaneously with the echo. In order for the normal echo canceller to be able to cancel the echo with high accuracy, it is necessary to repeat the update of the filter coefficient of the adaptive filter as described in the first embodiment. The relationship as in equation 12) does not hold. Therefore, when the state detection from the two-way call detector 103 is in the far-end single talk state (ST), the above equation (12) is established. This means that the coefficient update of the adaptive filter 102 proceeds, and the adder 101 Is reduced to near the background noise power BGN ′ (k), and it can be understood that δec_m is a margin threshold value for effect confirmation.

したがって、ＥＣ判定部１１４は、上記（１２）式を満足したとき、ＨＷＭ判定部１１３に対して、エコーキャンセル成功信号ｃｏｎｖを供給し、それ以外はなにも供給しない。 Therefore, the EC determination unit 114 supplies the echo cancellation success signal conv to the HWM determination unit 113 when the above expression (12) is satisfied, and does not supply anything else.

次に、ＡＣＯＭ１１２の動作について説明する。 Next, the operation of the ACOM 112 will be described.

ＡＣＯＭ１１２には、受信入力端子Ｒｉｎからの遠端入力信号ｘ（ｎ）と、加算器１０１の出力信号ｅ（ｎ）が供給されている。そして、ＡＣＯＭ１１２は、エコーロス推定部１１０と同様に、上記（６）式によりＰ＿ａｖ（ｎ）を求め、求めたＰ＿ａｖ（ｎ）における期間Ｔ＿ＥＬＥの最大値Ｐ＿ＭＡＸ（ｋ）を求める。さらに、ＡＣＯＭ１１２は、上述のＥＣ判定部１１４と同様に、信号ｅ（ｎ）の平滑値Ｅｒｒ＿ａｖ（ｎ）を上記（１１）式で求め、さらに、求めたＥｒｒ＿ａｖ（ｎ）における所定期間Ｔ＿ＥＬＥの最大値ＥＲＲ＿ＭＡＸ（ｋ）を求める。 The ACOM 112 is supplied with the far-end input signal x (n) from the reception input terminal Rin and the output signal e (n) of the adder 101. Then, similarly to the echo loss estimation unit 110, the ACOM 112 obtains P_av (n) by the above equation (6) and obtains the maximum value P_MAX (k) of the period T_ELE in the obtained P_av (n). Further, the ACOM 112 obtains the smoothed value Err_av (n) of the signal e (n) by the above equation (11) as in the EC determination unit 114 described above, and further calculates the maximum value of the predetermined period T_ELE in the obtained Err_av (n). The value ERR_MAX (k) is obtained.

なお、ＡＣＯＭ１１２では、上述の通り、上記（６）式、及び、上記（１１）式を計算しているが、ＥＲＲ＿ＭＡＸ（ｋ）、Ｐ＿ＭＡＸ（ｋ）を計算する構成（処理）を、ＥＣ判定部１１４、ＥＬＥ２３と共通にして、計算結果だけをＥＣ判定部１１４、ＥＬＥ２３から取得するようにしてもよい。 As described above, the ACOM 112 calculates the above formulas (6) and (11), but the configuration (processing) for calculating ERR_MAX (k) and P_MAX (k) is the EC determination unit. 114 and ELE23 may be shared, and only the calculation result may be acquired from the EC determination unit 114 and ELE23.

そして、ＡＣＯＭ１１２は、エコー総合減衰量ＡＣＯＭ＿Ｅ（ｋ）を、以下の（１３）式のように計算して、ＨＷＭ判定部１１３に与える。 Then, the ACOM 112 calculates the echo total attenuation amount ACOM_E (k) as shown in the following equation (13), and gives it to the HWM determination unit 113.

ＡＣＯＭ＿Ｅ（ｋ）＝Ｐ＿ＭＡＸ（ｋ）
−ＥＲＲ＿ＭＡＸ（ｋ）[ｄＢ] …（１３）
次に、ＨＷＭ判定部１１３の動作について説明する。 ACOM_E (k) = P_MAX (k)
-ERR_MAX (k) [dB] (13)
Next, the operation of the HWM determination unit 113 will be described.

ＨＷＭ判定部１１３は、ＡＣＯＭ１１２から与えられたＡＣＯＭ＿Ｅ（ｋ）と、閾値δａｃｏｍを比較する。ここでは、δａｃｏｍは、６ｄＢとして説明するが、これに限定されないものである。 The HWM determination unit 113 compares ACOM_E (k) given from the ACOM 112 with the threshold value δacom. Here, δacom is described as 6 dB, but is not limited to this.

そして、ＨＷＭ判定部１１３は、ＡＣＯＭ＿Ｅ（ｋ）≧δａｃｏｍを満たし、かつ、ＥＣ判定部１１４からの信号ｃｏｎｖが供給されている間、ハウリング危険度小信号ＮＯ＿ＨＷ（ハウリングの危険度が小さいことを示している信号）を、制御部１０４Ｂに供給する。一方、ＨＷＭ判定部１１３は、それ以外のときは、制御部１０４Ｂに対して何も出力しない。 Then, the HWM determination unit 113 satisfies ACOM_E (k) ≧ δacom, and while the signal conv from the EC determination unit 114 is supplied, the howling risk low signal NO_HW (indicating that the howling risk is low). Is supplied to the control unit 104B. On the other hand, HWM determination unit 113 outputs nothing to control unit 104B at other times.

制御部１０４Ｂは、ＨＷＭ判定部１１３から、ＮＯ＿ＨＷが与えられているとき、スイッチ１０８Ｂに対して、端子ｃ１を端子ａ１側に接続させる制御を行う。 When NO_HW is given from the HWM determination unit 113, the control unit 104B controls the switch 108B to connect the terminal c1 to the terminal a1 side.

制御部１０４ＢはＨＷＭ判定部１１３からＮＯ＿ＨＷが与えられているときは、双方向通話検出器１０３から遠端シングルトーク状態（ＳＴ）の通知がされても、ＨＷＭ判定部１１３からの信号を優先し、スイッチ１０８Ｂの状態をそのまま（端子ｃ１を端子ａ１の側に接続させたままの状態）にする。制御部１０４Ｂは、それ以外の状態では、スイッチ１０８Ｂに対して、第１の実施形態と同様の制御を行う。 When NO_HW is given from the HWM determination unit 113, the control unit 104B gives priority to the signal from the HWM determination unit 113 even if the two-way call detector 103 is notified of the far-end single talk state (ST). Then, the state of the switch 108B is left as it is (the state in which the terminal c1 is connected to the terminal a1 side). In other states, the control unit 104B performs the same control as the first embodiment on the switch 108B.

（Ｃ−３）第３の実施形態の効果
第３の実施形態によれば、以下のような効果を奏することができる。 (C-3) Effects of Third Embodiment According to the third embodiment, the following effects can be achieved.

以上説明したように、第３の実施形態では、ＡＣＯＭ１１２、ＨＷＭ判定部１１３、及びＥＣ判定部１１４を設けて、スイッチ１０８Ｂを制御し、エコーキャンセラ部側の出力（加算器１０１の出力）を用いる機会を増やすようにした。以下、第３の実施形態の効果の詳細について説明する。 As described above, in the third embodiment, the ACOM 112, the HWM determination unit 113, and the EC determination unit 114 are provided to control the switch 108B and use the output on the echo canceller unit side (the output of the adder 101). Increased opportunities. Details of the effects of the third embodiment will be described below.

第１、第２の実施形態では双方向通話検出器１０３の状態検出の結果、遠端話者だけが話す遠端シングルトーク状態（ＳＴ）になると、スイッチ部により必ず遠端側へ送信する信号に減衰が入り、逆に遠端話者が話し終わると、減衰がなくなる（エコーキャンセラ部の出力を遠端側へ送信する）構成であった。第１、第２の実施形態では確かにハウリングを防止でき、ダブルトーク状態（ＤＴ）でも、発話の最初及び最後の欠損を防止することができる。 In the first and second embodiments, when the far-end single talk state (ST) in which only the far-end speaker speaks as a result of the state detection of the two-way call detector 103, a signal that is always transmitted to the far-end side by the switch unit When the far end speaker finishes speaking, the attenuation disappears (the output of the echo canceller is transmitted to the far end side). In the first and second embodiments, howling can be surely prevented, and even at the double talk state (DT), the first and last loss of speech can be prevented.

しかしその一方で、第３の実施形態の遠端話者からしてみれば、場合によっては、以下に説明するように、耳障りな現象を起こすことがある。すなわち、遠端話者にとっては、自分が話すタイミングに一致して近端側の背景ノイズがひどく増減するように聞こえる場合があるのである。この現象は会話中に気づくと、遠端側の利用者によっては、大きな違和感を訴える場合がある。 On the other hand, however, from the viewpoint of the far-end speaker of the third embodiment, an unpleasant phenomenon may be caused in some cases as described below. That is, the far-end speaker may sound that the background noise on the near-end side increases or decreases significantly in accordance with the timing when he / she speaks. When this phenomenon is noticed during a conversation, some users at the far end may complain of great discomfort.

第３の実施形態では、そのことに鑑みたものであり、エコーキャンセラ部側の性能が十分発揮できてハウリングを防止できるときは、エコー除去の主体をエコーキャンセラ部に担わせ、そうでないときは、第１、第２の実施形態と同様に、スイッチ部の強力な減衰器に、エコー除去の主体を担わせるようになっている。 In the third embodiment, in view of that, when the performance of the echo canceller unit side can be sufficiently exhibited and howling can be prevented, the echo canceller unit is responsible for echo cancellation, and otherwise As in the first and second embodiments, the strong attenuator of the switch unit is made to play the main role in echo cancellation.

つまり、ＥＣ判定部１１４では、背景ノイズのレベルまで、エコーキャンセラ部によるエコー打消しが成功しているかどうかを検出することにより、遠端話者が受け取るエコーを少なくできているかどうかを主に検出している。一方で、ＡＣＯＭ１１２とＨＷＭ判定部１１３で、エコーキャンセラの打消しの結果、ハウリングを起こさない状態を確保できているかどうかを主に検出するようにしている。 That is, the EC determination unit 114 mainly detects whether or not the far-end speaker has received echoes by detecting whether or not the echo cancellation by the echo canceller unit is successful up to the level of background noise. is doing. On the other hand, the ACOM 112 and the HWM determination unit 113 mainly detect whether or not a state that does not cause howling is secured as a result of cancellation of the echo canceller.

例えば、電話装置１Ｂの立ち上げ時において、エコーキャンセラ部の適応フィルタ１０２では、フィルタ係数が成長（収束）していない。この場合、エコーキャンセラ部は、ハウリングに対して効果を発揮できないが、適応フィルタ１０２の係数が成長（収束）すれば、むしろ音声スイッチ部よりも聴感が自然なままエコー除去とハウリング防止を行えるようになる。第３の実施形態では、そのような状態を自動検出し、音声スイッチ部よりもエコーキャンセラ部の使用が好ましい場合は、自動的にエコーキャンセラ部の出力を選択し、装置の内部状態が許す限り、送信信号レベルの急変がない自然な音声を遠端話者への通話信号として供することができる。 For example, when the telephone device 1B is started up, the filter coefficient does not grow (converge) in the adaptive filter 102 of the echo canceller unit. In this case, the echo canceller cannot exert an effect on howling. However, if the coefficient of the adaptive filter 102 grows (converges), the echo can be canceled and the howling can be prevented while the audibility is more natural than the voice switch. become. In the third embodiment, when such a state is automatically detected and the use of the echo canceler unit is preferable to the voice switch unit, the output of the echo canceler unit is automatically selected, as long as the internal state of the apparatus permits. Natural speech without a sudden change in the transmission signal level can be provided as a call signal to the far-end speaker.

（Ｄ）第４の実施形態
以下、本発明によるエコーキャンセラ及びエコーキャンセルプログラム、並びに電話装置の第４の実施形態を、図面を参照しながら詳述する。 (D) Fourth Embodiment Hereinafter, an echo canceller, an echo cancellation program, and a telephone device according to a fourth embodiment of the present invention will be described in detail with reference to the drawings.

第１〜第３の実施形態のエコーキャンセラ１０、１０Ａ、１０Ｂでは、主に近端側のエコー経路で発生するエコーが、遠端側に送信する信号に漏れこんでいくのを防止するのと同時に、エコーが遠端側に送信する信号に漏れこんで、遠端側と近端側を巡回しながら成長して発生するハウリングを防止するものであった。 In the echo cancellers 10, 10 </ b> A, and 10 </ b> B of the first to third embodiments, the echo generated mainly in the near-end side echo path is prevented from leaking into the signal transmitted to the far-end side. At the same time, the echo leaks into the signal transmitted to the far end side, and prevents howling which occurs while growing around the far end side and the near end side.

ここまで述べてきたように、一般的には、音声スイッチ部（遅延部及び減衰器によりエコーを抑制する構成）とエコーキャンセラ部（適応フィルタにより生成した擬似エコーを用いて加算部によりエコーを打ち消す構成）は、共にエコー信号を抑圧する目的で用いられる技術である。しかし実際には、エコー対策としては、エコーキャンセラ部のほうが音声スイッチ部よりも好まれることが多い。エコーキャンセラ部は、マイクロホンに漏れこんでくるエコーを、高度なアルゴリズムで計算によって求め、擬似エコーを作成し、加算器で反転相殺するので、通話状態の判定（例えば、双方向通話検出器１０３のような判定処理）がうまくいく場合は、エコーだけをうまく除去できて通話品質がよい。これに対して、音声スイッチ部は、送信または受信の信号線上に設けられた減衰器の減衰を挿抜して信号の音量を変化させる。すなわち、音声スイッチ部は、本発明でいう遠端シングルトーク状態（ＳＴ）のときに、送信信号線上の減衰器に大きな減衰を入れてエコー感をなくすものである。 As described so far, generally, an audio switch unit (a configuration in which echo is suppressed by a delay unit and an attenuator) and an echo canceller unit (a pseudo echo generated by an adaptive filter is used to cancel the echo by an adder unit) Configuration) is a technique used for the purpose of suppressing echo signals. However, in practice, the echo canceller is often preferred over the voice switch as an echo countermeasure. The echo canceller unit calculates the echo leaking into the microphone by calculation with an advanced algorithm, creates a pseudo echo, and reverses and cancels it with an adder. If such a determination process is successful, only the echo can be removed well and the call quality is good. On the other hand, the voice switch unit inserts and removes attenuation of an attenuator provided on the transmission or reception signal line to change the volume of the signal. That is, the voice switch unit eliminates the feeling of echo by adding a large attenuation to the attenuator on the transmission signal line in the far-end single talk state (ST) referred to in the present invention.

しかし、音声スイッチ部は、原理的に単純に信号線上に減衰を加えるのであるから、仮に通話状態の判定処理（例えば、双方向通話検出器１０３のような判定処理）でどれほど精密に遠端シングルトーク状態（ＳＴ）状態の検出に成功しても、音声スイッチ部の処理では、エコー以外の信号（近端側の背景ノイズ等）も一緒に減衰してしまう。 However, since the voice switch section simply attenuates the signal line in principle, it is assumed that how far the far end single is accurately determined by a call state determination process (for example, a determination process such as the bidirectional call detector 103). Even if the talk state (ST) state is successfully detected, signals other than echoes (such as background noise on the near end side) are also attenuated together in the processing of the voice switch unit.

さらに、第１〜第３の実施形態で説明した、エコーキャンセラ部と音声スイッチ部の切り替えにおいても、近端側の背景ノイズの状態によっては、音声スイッチ部の出力（減衰器からの出力）を、エコーキャンセラ部側の出力に切り替えるとき、加えた減衰切り替えのために、遠端側に送信する信号においては、背景ノイズの成分の大きさが急変するという現象が発生する場合がある。 Furthermore, also in the switching between the echo canceller unit and the voice switch unit described in the first to third embodiments, the output of the voice switch unit (output from the attenuator) depends on the state of background noise on the near end side. When switching to the output on the echo canceller side, a phenomenon that the magnitude of the background noise component suddenly changes may occur in the signal transmitted to the far end due to the added attenuation switching.

この現象は、遠端話者にとってみれば実に不自然なことであり、遠端話者自身が話すたびに、相手側の背景ノイズが急に断続するように聞こえる。これにより、遠端話者は、常時不自然な違和感を持ちながら、近端話者と通話することを強いられてしまう。 This phenomenon is very unnatural for the far-end speaker, and every time the far-end speaker speaks, the other party's background noise sounds like suddenly intermittent. As a result, the far-end speaker is forced to talk with the near-end speaker while always having an unnatural feeling.

この不自然感は、遠端話者と近端話者との間で、要件伝達のため一時的に利用すればよいハンズフリー電話装置や、せいぜい会議の開催中のみ使えればよい会議電話装置では許容される場合も多い。 This unnatural feeling can be caused by a hands-free telephone device that can be used temporarily to communicate requirements between the far-end speaker and the near-end speaker, or a conference phone device that can be used only during a conference. In many cases, it is acceptable.

しかし近年普及しているテレワーク環境や、ＳＯＨＯ環境ではハンズフリー通話装置を近端側及び遠端側の双方で、常時接続して利用する形態が増え、事情が異なっている。すなわち、このような新しい使い方では、上述のような発話と同時に発生する相手側背景ノイズの急変は、遠端側の話者にとって強く違和感を訴える。なぜなら、常時接続するハンズフリー通話装置の場合、いままで説明してきた「背景ノイズ」は場合によっては単なる余分なノイズではなく、相手側の状態変化や環境変化、雰囲気の手がかりになる「背景音」あるいは「背景環境音」として遠端側の話者に聞き取られている場合があるからである。 However, in a telework environment and a SOHO environment that have become widespread in recent years, the number of forms in which hands-free communication devices are always connected and used on both the near end side and the far end side is increasing, and the circumstances are different. That is, in such a new usage, the sudden change in the other party background noise that occurs simultaneously with the utterance as described above strongly appeals to the far-end speaker. This is because, in the case of hands-free communication devices that are always connected, the “background noise” that has been explained so far is not just extra noise in some cases, but “background sound” that is a clue to the other party's state change, environmental change, and atmosphere. Alternatively, it may be heard by the far-end speaker as “background environmental sound”.

一方で、エコーキャンセラ部に関しては、適応フィルタの係数更新に加算器の出力フィードバックを用いる構造上、エコーを近端背景ノイズレベルまでにしか除去できないという公知の性質がある。したがって近端側の背景ノイズ（音）が大きいときには、会話の最中に十分エコーを除去できないという問題が発生してしまう。特に、人間はノイズ交じりであっても人間の音声を聞き分けられることが多く、遠端話者の発話の最後の部分のエコーが遠端話者自身にとって耳障りになることがおおい。 On the other hand, the echo canceller has a known property that the echo can be removed only to the near-end background noise level because of the structure in which the output feedback of the adder is used to update the coefficient of the adaptive filter. Therefore, when the background noise (sound) on the near end side is large, there arises a problem that the echo cannot be sufficiently removed during the conversation. In particular, humans can often hear human voices even if they are mixed with noise, and the echo at the end of the far-end speaker's utterance is often harsh to the far-end speaker.

従来の「特開平３−２５４５３０号公報」（参考文献２）には、エコーキャンセラ部の性能が発揮できないとき、帯域制限したガウスノイズを送信代替信号とする技術が開示されているが、単なる送信信号の挿げ替えでは、上述の遠端話者が持つ違和感は、むしろ大きくなってしまうおそれがあった。 The conventional “Japanese Patent Laid-Open No. 3-254530” (reference 2) discloses a technique that uses band-limited Gaussian noise as a transmission substitute signal when the performance of the echo canceller cannot be exhibited. When the signal is replaced, the discomfort of the far-end speaker described above may rather increase.

そこで、第４の実施形態では、背景ノイズが大きいときには、確実にエコーを抑圧できる音声スイッチ部を用いつつも、背景ノイズが不自然で急激な増減を起こさないようにする。これにより、第４の実施形態では、エコーキャンセラ部と、音声スイッチ部を併用して遠端側へ送信する信号を切り替えても、遠端側へ送信する信号における背景ノイズ成分の急激な増減をなくすことができる。さらに、第４の実施形態では、第３の実施形態のように、音声スイッチ部を使わなくても、エコーキャンセラ部だけで、エコーを除去できる程度に背景ノイズが小さい場合には、エコーキャンセラ部を主体にエコーを除去し、ハウリングを防止することができる。 Therefore, in the fourth embodiment, when the background noise is large, the voice switch unit that can reliably suppress the echo is used, but the background noise is not unnatural and suddenly increases or decreases. Thus, in the fourth embodiment, even if the echo canceller unit and the voice switch unit are used together to switch the signal to be transmitted to the far end side, the background noise component in the signal to be transmitted to the far end side is rapidly increased or decreased. Can be eliminated. Further, in the fourth embodiment, as in the third embodiment, when the background noise is small enough to remove the echo by using only the echo canceller without using the voice switch, the echo canceller Echoes can be removed mainly to prevent howling.

（Ｄ−１）第４の実施形態の構成
図８は、第３の実施形態の電話装置１Ｂの全体構成を示すブロック図であり、上述した図７との同一、対応部分には同一、対応符号を付して示している。 (D-1) Configuration of the Fourth Embodiment FIG. 8 is a block diagram showing the overall configuration of the telephone device 1B according to the third embodiment, which is the same as FIG. A reference numeral is attached.

電話装置１Ｃでは、エコーキャンセラ１０Ｂが、エコーキャンセラ１０Ｃに置き換わっている点で第３の実施形態と異なっている。 The telephone device 1C is different from the third embodiment in that an echo canceller 10B is replaced with an echo canceller 10C.

また、エコーキャンセラ１０Ｃでは、減衰器１１１が省略されている点で、第３の実施形態と異なっている。 The echo canceller 10C is different from the third embodiment in that the attenuator 111 is omitted.

さらに、エコーキャンセラ１０Ｃでは、遅延部１０６、減衰器１０７、及び減衰量計算部１０９、が、遅延部１０６Ｃ、減衰器１０７Ｃ、減衰量計算部１０９Ｃに置き換わっている点で、第３の実施形態と異なっている。さらにまた、エコーキャンセラ１０Ｃでは、スイッチ１０８Ｂが、時変加重平均部１１６に置き換わっている点で、第３の実施形態と異なっている。また、エコーキャンセラ１０Ｃでは、減衰器１０７Ｃと時変加重平均部１１６との間に信号混合部１１５（以下、「Ｎ＿ＭＩＸ」ともいう）が設けられている。なお、信号混合部１１５による処理の詳細については、後述する動作説明において説明する。 Further, in the echo canceller 10C, the delay unit 106, the attenuator 107, and the attenuation amount calculation unit 109 are replaced with the delay unit 106C, the attenuator 107C, and the attenuation amount calculation unit 109C. Is different. Furthermore, the echo canceller 10C is different from the third embodiment in that the switch 108B is replaced with a time-varying weighted average unit 116. In the echo canceller 10 </ b> C, a signal mixing unit 115 (hereinafter also referred to as “N_MIX”) is provided between the attenuator 107 </ b> C and the time-varying weighted average unit 116. The details of the processing by the signal mixing unit 115 will be described in the operation description to be described later.

図９は、信号混合部１１５内部の機能的構成について示したブロック図である。 FIG. 9 is a block diagram illustrating a functional configuration inside the signal mixing unit 115.

信号混合部１１５は、スイッチ制御部３０１、スイッチ３０２、スイッチ３０３、擬似背景ノイズ保持部３０４、混合部３０５、擬似背景ノイズパワー計算部３０６、実背景ノイズパワー計算部３０７、変動分ノイズパワー計算部３０８、及び混合比計算部３０９を有している。なお、信号混合部１１５の各部の構成（処理）については後述する動作説明において詳述する。 The signal mixing unit 115 includes a switch control unit 301, a switch 302, a switch 303, a pseudo background noise holding unit 304, a mixing unit 305, a pseudo background noise power calculation unit 306, an actual background noise power calculation unit 307, and a fluctuation noise power calculation unit. 308 and a mixture ratio calculation unit 309. Note that the configuration (processing) of each part of the signal mixing unit 115 will be described in detail in the description of operations described later.

（Ｄ−２）第４の実施形態の動作
次に、以上のような構成を有する第４の実施形態の電話装置１Ｃにおけるエコーキャンセラ１０Ｃの動作について、第３の実施形態との差異を説明する。 (D-2) Operation of the Fourth Embodiment Next, the difference between the operation of the echo canceller 10C in the telephone device 1C of the fourth embodiment having the above-described configuration and the third embodiment will be described. .

まず、減衰量計算部１０９Ｃの動作について説明する。 First, the operation of the attenuation calculation unit 109C will be described.

減衰量計算部１０９Ｃは、第３の実施形態と同様に、一旦減衰量ｇａｉｎを計算する。 The attenuation amount calculation unit 109C calculates the attenuation amount once as in the third embodiment.

そして、減衰量計算部１０９Ｃは、予め定めた所望エコー減衰量ａｃｏｍ＿４（例えば、５０ｄＢ減衰とする）と、ｇａｉｎとを比較して、減衰がａｃｏｍ＿４に到達するための不足分の減衰量を追加した量を、新たな減衰量ｇａｉｎ４として、信号混合部１１５及び減衰器１０７Ｃに通知する。なお、ａｃｏｍ＿４に設定する値は、限定されないものである。 Then, the attenuation amount calculation unit 109C compares a predetermined desired echo attenuation amount acom_4 (for example, 50 dB attenuation) with gain, and adds an attenuation amount that is insufficient for the attenuation to reach acom_4. The amount is notified to the signal mixing unit 115 and the attenuator 107C as a new attenuation amount gain4. Note that the value set in acom_4 is not limited.

すなわち、減衰器１０７Ｃに適用される減衰量は、第１〜第３の実施形態で設定される減衰量よりも相当に大きいものとなる。たとえば所望エコー減衰量ａｃｏｍ＿４を、５０ｄＢ減衰とし、さらに、減衰量計算部１０９Ｃにおいて、ｇｅｉｎ＝１０ｄＢの減衰が算出された場合は、ｇａｉｎ４＝４０ｄＢ（減衰）となる。言い換えると、第４の実施形態では、スピーカ４０とマイクロホン５０の間の音響エコー経路での増減と、減衰器１０７Ｃによる減衰をあわせたときに、必ず所望エコー減衰量ａｃｏｍ＿４が達成されるべく計算された減衰量ｇａｉｎ４が、減衰器１０７Ｃに通知されることになる。 That is, the attenuation applied to the attenuator 107C is considerably larger than the attenuation set in the first to third embodiments. For example, if the desired echo attenuation amount acom_4 is set to 50 dB attenuation, and further attenuation of gain = 10 dB is calculated in the attenuation amount calculation unit 109C, gain4 = 40 dB (attenuation). In other words, in the fourth embodiment, when the increase / decrease in the acoustic echo path between the speaker 40 and the microphone 50 is combined with the attenuation by the attenuator 107C, the desired echo attenuation amount acom_4 is always calculated. The attenuation amount gain4 is notified to the attenuator 107C.

次に、信号混合部１１５の動作について説明する。 Next, the operation of the signal mixing unit 115 will be described.

信号混合部１１５には、減衰量計算部１０９Ｃからｇａｉｎ４（例えば、−４０ｄＢ）が通知されている。また、信号混合部１１５には、減衰器１０７Ｃから信号ａｔｔ（ｎ）が供給されている。さらに、信号混合部１１５には、双方向通話検出器１０３からの通話状態判定の結果が供給されている。そして、信号混合部１１５が生成する信号ｍｉｘ（ｎ）は、時変加重平均部１１６に供給されている。 The signal mixing unit 115 is notified of gain4 (for example, −40 dB) from the attenuation calculation unit 109C. The signal mixing unit 115 is supplied with the signal att (n) from the attenuator 107C. Further, the result of the call state determination from the two-way call detector 103 is supplied to the signal mixing unit 115. The signal mix (n) generated by the signal mixing unit 115 is supplied to the time-varying weighted average unit 116.

次に、上述の図９を用いて、信号混合部１１５の各部の動作を説明する。 Next, the operation of each unit of the signal mixing unit 115 will be described with reference to FIG.

スイッチ制御部３０１は、双方向通話検出器１０３から通知された通話状態の判定結果が無音状態（ＩＤＬ）であると、スイッチ３０２を閉じて、遅延部１０６Ｃの出力信号を擬似背景ノイズ保持部３０４に供給するように制御する。 When the call state determination result notified from the two-way call detector 103 is a silent state (IDL), the switch control unit 301 closes the switch 302 and sends the output signal of the delay unit 106C to the pseudo background noise holding unit 304. Control to supply.

擬似背景ノイズ保持部３０４は、リングバッファで構成されており、遅延部１０６Ｃから供給される信号について、一定期間分（たとえば、１０秒分）のデータを保持し、それを超えた場合は一番古いデータを新しいデータで上書きしていくようにしても良い。ここでは、擬似背景ノイズ保持部３０４のデータ保持期間を１０秒にしたが、この期間は限定されないものである。 The pseudo background noise holding unit 304 is configured by a ring buffer, holds data for a certain period (for example, 10 seconds) of the signal supplied from the delay unit 106C, and when the signal exceeds that, the pseudo background noise holding unit 304 Old data may be overwritten with new data. Here, the data holding period of the pseudo background noise holding unit 304 is set to 10 seconds, but this period is not limited.

スイッチ制御部３０１は、双方向通話検出器１０３から通知される通話状態の判定結果が、無音状態（ＩＤＬ）以外のときはスイッチ３０２を開放して、擬似背景ノイズ保持部３０４への信号の供給をさせない。 The switch control unit 301 opens the switch 302 and supplies a signal to the pseudo background noise holding unit 304 when the determination result of the call state notified from the two-way call detector 103 is other than the silence state (IDL). I will not let you.

したがって、擬似背景ノイズ保持部３０４に保持される信号のデータは、遠端話者も近端話者も発話していない無音状態で、マイクロホン５０に入力された信号、すなわち近端側の背景ノイズを示す信号のデータとなる。 Therefore, the signal data held in the pseudo background noise holding unit 304 is a signal input to the microphone 50 in a silent state in which neither a far-end speaker nor a near-end speaker is speaking, that is, background noise on the near-end side. This is signal data indicating.

擬似背景ノイズ保持部３０４で保持された信号のデータは、擬似背景ノイズパワー計算部３０６に供給される。 The signal data held by the pseudo background noise holding unit 304 is supplied to the pseudo background noise power calculation unit 306.

擬似背景ノイズパワー計算部３０６は、供給された信号のデータから、予め定めた期間Ｔ＿ｂｎ（例えば、１秒であるものとする）ごとのパワーｐ＿ｐａｄ＿ｎを計算して、変動分ノイズパワー計算部３０８に与える。なお、擬似背景ノイズパワー計算部３０６では、保持した信号のデータが１秒分未満のときは、保持分のパワーを計算するようにしても良い。 The pseudo background noise power calculation unit 306 calculates power p_pad_n for each predetermined period T_bn (for example, 1 second) from the supplied signal data, and supplies the fluctuation noise power calculation unit 308 to the fluctuation noise power calculation unit 308. give. The pseudo background noise power calculation unit 306 may calculate the power of the held signal when the data of the held signal is less than one second.

一方、スイッチ制御部３０１では、双方向通話検出器１０３から、通話状態の判定結果として、遠端シングルトーク状態（ＳＴ）が入力されると、スイッチ３０３を閉じて、混合部３０５に、擬似背景ノイズ保持部３０４で保持した信号のデータが供給されるようにする。なお、このスイッチ３０３は、遠端シングルトーク状態（ＳＴ）以外では、スイッチ制御部３０１の制御により開放されるものとする。 On the other hand, when the far-end single talk state (ST) is input from the bidirectional call detector 103 as the call state determination result from the switch control unit 301, the switch 303 is closed and the mixing unit 305 receives the pseudo background. The signal data held by the noise holding unit 304 is supplied. Note that the switch 303 is opened under the control of the switch control unit 301 except in the far-end single talk state (ST).

混合部３０５では、上述の通り、スイッチ３０３を経由した擬似背景ノイズ保持部３０４からの信号のデータが供給される。 As described above, the mixing unit 305 is supplied with signal data from the pseudo background noise holding unit 304 via the switch 303.

次に、実背景ノイズパワー計算部３０７の動作について説明する。 Next, the operation of the actual background noise power calculation unit 307 will be described.

実背景ノイズパワー計算部３０７は、双方向通話検出器１０３から、通話状態の判定結果として、遠端シングルトーク状態（ＳＴ）が入力されたときだけ、加算器１０１から供給される信号ｅ＿ｄ（ｎ）を用いて、実ノイズパワーｐ＿ｒｅａｌ＿ｎを、一定期間Ｔ＿ｂｎ分だけ計算し、その計算結果を、変動分ノイズパワー計算部３０８に供給する。 The actual background noise power calculation unit 307 receives the signal e_d (n) supplied from the adder 101 only when the far-end single talk state (ST) is input from the two-way call detector 103 as the determination result of the call state. ) To calculate the actual noise power p_real_n for a certain period T_bn, and supplies the calculation result to the fluctuation noise power calculation unit 308.

変動分ノイズパワー計算部３０８では、実ノイズパワーｐ＿ｒｅａｌ＿ｎと擬似背景ノイズパワーｐ＿ｐａｄ＿ｎの比率ｄｉｆ＿ｐａｄ＿ｎが計算される。詳細については後述するが、これは、擬似背景ノイズパワー計算部３０６で保持したデータを取得した時点の背景ノイズのレベルと、実際に遠端シングルトーク状態（ＳＴ）が発生しているときの背景ノイズのレベルに近いエコーキャンセラ部の加算器１０１の出力との差分を、混合部３０５の処理に反映させるためである。 The fluctuation noise power calculation unit 308 calculates a ratio dif_pad_n between the real noise power p_real_n and the pseudo background noise power p_pad_n. Although details will be described later, this is because the background noise level at the time when the data held by the pseudo background noise power calculation unit 306 is acquired and the background when the far-end single talk state (ST) actually occurs. This is because the difference from the output of the adder 101 of the echo canceller unit close to the noise level is reflected in the processing of the mixing unit 305.

混合比計算部３０９では、減衰器１０７Ｃから供給される信号と、擬似背景ノイズ保持部３０４から供給される信号の混合比を、例えば次のように計算する。 The mixing ratio calculation unit 309 calculates the mixing ratio between the signal supplied from the attenuator 107C and the signal supplied from the pseudo background noise holding unit 304 as follows, for example.

例えば、混合比計算部３０９に、減衰量計算部１０９Ｃから、ｇａｉｎ４が供給されている場合には、減衰器１０７Ｃにおいてｙｄ（ｎ）がどれだけ減衰されてａｔｔ（ｎ）が作成されたか計算できることになる。例えば、減衰量計算部１０９Ｃにおいて、ｇａｉｎ４＝４０ｄＢと算出されて、混合比計算部３０９に供給されたとき、ｇａｉｎ４を線形の比率で考えると倍率を０．０１倍にしたことに相当する。 For example, when gain 4 is supplied from the attenuation amount calculation unit 109C to the mixing ratio calculation unit 309, it is possible to calculate how much yd (n) has been attenuated in the attenuator 107C to create att (n). become. For example, in the attenuation calculation unit 109C, when gain4 = 40 dB is calculated and supplied to the mixture ratio calculation unit 309, when gain4 is considered as a linear ratio, this corresponds to a magnification of 0.01.

混合比計算部３０９では、供給されたｇａｉｎ４を、線形の比率値δｓに変換してから、擬似背景ノイズ保持部３０４の出力との混合比δｎを、以下の（１４）式により決定する。 The mixing ratio calculation unit 309 converts the supplied gain 4 into a linear ratio value δs, and then determines the mixing ratio δn with the output of the pseudo background noise holding unit 304 by the following equation (14).

δｎ＝（１．０−δｓ） …（１４）
この場合、ｇａｉｎ４は、もともと１倍であった信号をδｓ＝０．０１倍に縮小しものであり、擬似背景ノイズ保持部３０４出力信号を、０．９９倍したものと混合すれば、信号パワーはもとのままで、エコーの成分比だけが小さな信号を作成できることになる。 δn = (1.0−δs) (14)
In this case, the gain 4 is obtained by reducing the signal that was originally 1 time to δs = 0.01 times, and if the pseudo background noise holding unit 304 output signal is mixed with the signal that is multiplied by 0.99, the signal power As a result, a signal having a small echo component ratio can be created.

さらに、混合比計算部３０９には、変動分ノイズパワー計算部３０８の出力ｄｉｆ＿ｐａｄ＿ｎが供給されており、擬似背景ノイズ保持部３０４にデータを保持した時点の背景ノイズと、現在エコーキャンセラ部（加算器１０１）が出力している残差信号との、パワーの食い違いを修正する。 Further, the output ratio dif_pad_n of the fluctuation noise power calculation unit 308 is supplied to the mixture ratio calculation unit 309, and the background noise at the time when the data is held in the pseudo background noise holding unit 304 and the current echo canceller unit (adder) 101) Correct the discrepancy in power with the residual signal that is output.

すなわち、以下の（１５）式のように、混合する擬似背景ノイズの量を、遠端シングルトーク状態（ＳＴ）が発生している現時点の近端側の背景ノイズ量に即したものにする。これにより、遠端シングルトーク状態（ＳＴ）の時には、エコーキャンセラ部が、背景ノイズのレベルまで、エコーを除去していることに鑑み、混合部３０５が混合する擬似背景ノイズｂｎ（ｎ）の量を調整することができる。 That is, as shown in the following equation (15), the amount of pseudo background noise to be mixed is set in accordance with the current near-end background noise amount in which the far-end single talk state (ST) occurs. Thus, in the far-end single talk state (ST), the amount of the pseudo background noise bn (n) mixed by the mixing unit 305 in view of the fact that the echo canceller removes the echo to the background noise level. Can be adjusted.

なお、ｄｉｆ＿ｐａｄ＿ｎ＝１．０（データを保持したときと遠端シングルトーク状態（ＳＴ）が発生している現時点の加算器出力のパワーが同じ）の場合には、δｎ＝（１．０−δｓ）のままである。 In the case of dif_pad_n = 1.0 (the power of the adder output at the time when the far-end single talk state (ST) is generated is the same as when data is held), δn = (1.0−δs) ).

δｎ＝（１．０−δｓ）×ｄｉｆ＿ｐａｄ＿ｎ …（１５）
そして、混合部３０５は、以下の（１６）式により、ｍｉｘ（ｎ）を生成して、生成したｍｉｘ（ｎ）を時変加重平均部１１６に供給する。 δn = (1.0−δs) × dif_pad_n (15)
Then, the mixing unit 305 generates mix (n) according to the following equation (16), and supplies the generated mix (n) to the time-varying weighted average unit 116.

ｍｉｘ（ｎ）＝δｎ×ｂｎ（ｎ）＋δｓ×ａｔｔ（ｎ） …（１６）
最後に、時変加重平均部１１６に関して説明する。 mix (n) = δn × bn (n) + δs × att (n) (16)
Finally, the time-varying weighted average unit 116 will be described.

時変加重平均部１１６は、第１〜第３の実施形態におけるスイッチ１０８（１０８Ｂ）に相当する機能を担うものである。具体的には、時変加重平均部１１６は、遅延部１０６Ｃから端子ａ２を介して供給される信号ｅ＿ｄ（ｎ）、又は、信号混合部１１５から端子ｂ２を介して供給される信号ｍｉｘ（ｎ）のいずれかの信号を、双方向通話検出器１０３から供給される通話状態の判定結果に基づいて選択し、選択した信号を、端子ｃ２を介して遠端側（送信出力端子Ｓｏｕｔ）に送信するように状態を遷移させる。なお、図８においては、図示を省略しているが、時変加重平均部１１６には、双方向通話検出器１０３からの通話状態の判定結果が供給されている。 The time-varying weighted average unit 116 has a function corresponding to the switch 108 (108B) in the first to third embodiments. Specifically, the time-varying weighted average unit 116 receives the signal e_d (n) supplied from the delay unit 106C through the terminal a2, or the signal mix (n) supplied from the signal mixing unit 115 through the terminal b2. ) Is selected based on the determination result of the call state supplied from the bidirectional call detector 103, and the selected signal is transmitted to the far end side (transmission output terminal Sout) via the terminal c2. The state is transitioned to Although not shown in FIG. 8, the time-variant weighted average unit 116 is supplied with the determination result of the call state from the two-way call detector 103.

時変加重平均部１１６は、以下の（１７）式のように、遅延部１０６Ｃから供給される信号ｅ＿ｄ（ｎ）と、信号混合部１１５から供給される信号ｍｉｘ（ｎ）とを時間的に重み配分した信号ｓｏｕｔ（ｎ）を生成して、送信出力端子Ｓｏｕｔに出力（遠端側へ送信）する。以下の（１７）式において、δｅｃｈｏは時間的な重み配分の係数である。 The time-varying weighted average unit 116 temporally combines the signal e_d (n) supplied from the delay unit 106C and the signal mix (n) supplied from the signal mixing unit 115 as shown in the following equation (17). A weight-distributed signal sout (n) is generated and output to the transmission output terminal Sout (transmitted to the far end side). In the following equation (17), δecho is a temporal weight distribution coefficient.

時変加重平均部１１６は、遅延部１０５から端子ｂ２を選択する制御信号をうけたときは、δｅｃｈｏ＝０．０とする。その場合、以下の（１７）式の結果は、以下の（１８）式のようになり、第１〜第３の実施形態において、スイッチ１０８（１０８Ｂ）の端子ｃ１を端子ｂ１と接続した場合と同じ結果となる。 When the time-varying weighted average unit 116 receives a control signal for selecting the terminal b2 from the delay unit 105, δecho = 0.0. In that case, the result of the following equation (17) is as shown in the following equation (18). In the first to third embodiments, the terminal c1 of the switch 108 (108B) is connected to the terminal b1. The same result.

そして、時変加重平均部１１６は、遅延部１０５から端子ａ２を選択する制御信号をうけたときは、δｅｃｂｏ＝１．０とする。その場合、以下の（１７）式の結果は、以下の（１９）式のようになり、第１〜第３の実施形態において、スイッチ１０８（１０８Ｂ）の端子ｃ１を端子ａ１と接続した場合と同じ結果となる。 When the time-varying weighted average unit 116 receives a control signal for selecting the terminal a2 from the delay unit 105, δecbo = 1.0. In that case, the result of the following formula (17) is as shown in the following formula (19). In the first to third embodiments, the terminal c1 of the switch 108 (108B) is connected to the terminal a1. The same result.

ｓｏｕｔ（ｎ）＝δｅｃｈｏ×ｅ＿ｄ（ｎ）
＋（１．０−δｅｃｈｏ）×ｍｉｘ（ｎ） …（１７）
ｓｏｕｔ（ｎ）＝ｍｉｘ（ｎ） …（１８）
ｓｏｕｔ（ｎ）＝ｅ＿ｄ（ｎ） …（１９）
時変加重平均部１１６は、遠端シングルトーク状態（ＳＴ）が終了すると、かならず端子ａ２の重みが大きくなるように動作するが、δｅｃｈｏ＝０．０からδｅｃｈｏ＝１．０に移るまでに予め定めた期間たとえば０．５秒を掛けて滑らかに重みが変化するように動作するようにしても良い。 sout (n) = δecho × e_d (n)
+ (1.0−δecho) × mix (n) (17)
sout (n) = mix (n) (18)
sout (n) = e_d (n) (19)
When the far-end single talk state (ST) ends, the time-varying weighted average unit 116 operates so that the weight of the terminal a2 is always increased. However, the time-varying weighted average unit 116 is required in advance to shift from δecho = 0.0 to δecho = 1.0. The operation may be performed so that the weight changes smoothly over a predetermined period, for example, 0.5 seconds.

たとえば背景ノイズが大きいとき、前述したようにエコーキャンセラ部では、背景ノイズのレベルまでしかエコーを除去できない。そのため、急に強力な信号減衰をした信号であるｍｉｘ（ｎ）から、加算器１０１の出力信号を遅延したｅ＿ｄ（ｎ）へと遷移すると、エコーキャンセラ部の消し残りとなった、遠端話者の発話の最後の部分のエコーが際立ってしまう。そのため、上述のように時変加重平均部１１６を設けて、信号の切替えを滑らかに（徐々に）行うことにより、上述の遠端話者の発話の最後の部分のエコーが際立ってしまうような現象を防止することができる。 For example, when the background noise is large, as described above, the echo canceller can only remove the echo up to the background noise level. For this reason, when the output signal of the adder 101 is shifted to e_d (n), which is a signal abruptly subjected to strong signal attenuation, to e_d (n), the far-end talk that has remained unerased by the echo canceller unit. The echo of the last part of the person's utterance stands out. Therefore, by providing the time-varying weighted average unit 116 as described above and switching signals smoothly (gradually), the echo of the last part of the far-end speaker's utterance becomes conspicuous. The phenomenon can be prevented.

（Ｄ−３）第４の実施形態の効果
第４の実施形態によれば、以下のような効果を奏することができる。 (D-3) Effects of the fourth embodiment According to the fourth embodiment, the following effects can be achieved.

（Ｄ−３−１）エコーキャンセラ１０Ｃでは、音声スイッチ部に信号混合部１１５を設けている。信号混合部１１５では、無音状態（ＩＤＬ）のときの近端入力信号（エコーの混入していないノイズ）を擬似背景ノイズとして保持し、遠端シングルトーク状態（ＳＴ）の時に、減衰器１０７Ｃから出力されるａｔｔ（ｎ）に、保持していた擬似背景ノイズを混合した信号ｍｉｘ（ｎ）を出力する。そして、信号混合部１１５では、信号の混合を行う際の混合比を、ｍｉｘ（ｎ）のパワーが、音声スイッチ部による減衰前の近端入力信号パワーを保つような値に設定している。 (D-3-1) In the echo canceller 10C, the signal mixing unit 115 is provided in the voice switch unit. The signal mixing unit 115 holds the near-end input signal (noise not mixed with echo) in the silent state (IDL) as pseudo background noise, and from the attenuator 107C in the far-end single talk state (ST). A signal mix (n) obtained by mixing the pseudo background noise held in the output att (n) is output. In the signal mixing unit 115, the mixing ratio at the time of signal mixing is set to a value such that the power of mix (n) maintains the near-end input signal power before attenuation by the voice switch unit.

これにより、第４の実施形態では、遠端話者の聴感上、背景ノイズレベルは現在の近端話者周囲ノイズの様子を模擬しつつ、エコー信号だけを抑圧することができる。すなわち、エコーキャンセラ１０Ｃでは、遠端側へ送出する信号の品質を向上させることができる。 As a result, in the fourth embodiment, the background noise level can suppress only the echo signal while simulating the current state of the near-end speaker ambient noise in terms of the audibility of the far-end speaker. That is, the echo canceller 10C can improve the quality of the signal transmitted to the far end side.

（Ｄ−３−２）エコーキャンセラ１０Ｃでは、時変加重平均部１１６を設けて、音声スイッチ部の出力（信号混合部１１５の出力）と、エコーキャンセラ部の出力（加算器１０１の出力）とが、時間的に滑らかに切り替わるようにしている。これにより、第４の実施形態では、遠端シングルトーク状態（ＳＴ）のあと、かならずエコーキャンセラ部側に、遠端側へ送出する信号の選択を切り替えたとしても、直ちに音声スイッチ部の出力信号の減衰が０にならないので、遠端話者の発話による発話の最後の部分のエコーが遠端側に戻ってしまうことも防止できるようになる。 (D-3-2) In the echo canceller 10C, a time-varying weighted average unit 116 is provided, and an output of the voice switch unit (output of the signal mixing unit 115) and an output of the echo canceller unit (output of the adder 101) However, it is designed to switch smoothly in time. As a result, in the fourth embodiment, after the far-end single talk state (ST), even if the selection of the signal to be sent to the far-end side is always switched to the echo canceller side, the output signal of the voice switch unit is immediately Therefore, it is possible to prevent the echo of the last part of the utterance due to the utterance of the far-end speaker from returning to the far-end side.

また、エコーキャンセラ１０Ｃでは、時変加重平均部１１６を設けたことにより、音声スイッチ部とエコーキャンセラ部の出力の切り替えにともなう、遠端話者が感じる近端側の背景ノイズの急変も防止できるようになる。これにより、第４の実施形態では、通信路のなかに所望減衰を確実に挿入できるようになるので、ハウリングをおこさない自然な通話が可能なエコーキャンセラを提供することができる。 Further, in the echo canceller 10C, by providing the time-varying weighted average unit 116, it is possible to prevent a sudden change in the background noise on the near end side felt by the far end speaker due to the switching of the outputs of the voice switch unit and the echo canceller unit. It becomes like this. Accordingly, in the fourth embodiment, desired attenuation can be surely inserted into the communication path, so that it is possible to provide an echo canceller capable of a natural call without howling.

（Ｅ）他の実施形態
本発明は、上記の各実施形態に限定されるものではなく、以下に例示するような変形実施形態も挙げることができる。 (E) Other Embodiments The present invention is not limited to the above-described embodiments, and may include modified embodiments as exemplified below.

（Ｅ−１）第３の実施形態において、減衰量計算部１０９、エコーロス推定部１１０Ｂ、減衰器１１１は省略するようにしても良い。すなわち、第１の実施形態に、第３の実施形態の特徴となる構成（ＡＣＯＭ１１２、ＨＷＭ判定部１１３、ＥＣ判定部１１４等）を追加した構成としても良い。ただし、その場合、第３の実施形態でＥＣ判定部１１４がエコーロス推定部１１０Ｂから取得している内容を、ＥＣ判定部１１４で求めることができるようにする必要がある。 (E-1) In the third embodiment, the attenuation calculation unit 109, the echo loss estimation unit 110B, and the attenuator 111 may be omitted. That is, a configuration (ACOM 112, HWM determination unit 113, EC determination unit 114, etc.) that is a feature of the third embodiment may be added to the first embodiment. However, in that case, it is necessary that the EC determination unit 114 can obtain the content acquired by the EC determination unit 114 from the echo loss estimation unit 110B in the third embodiment.

（Ｅ−２）第４の実施形態において、時変加重平均部１１６を、第１〜第３の実施形態と同様のスイッチに置き換えるようにしても良い。また、逆に、第４の実施形態の時変加重平均部１１６を、第１〜第３の実施形態に適用するようにしても良い。 (E-2) In the fourth embodiment, the time-varying weighted average unit 116 may be replaced with the same switch as in the first to third embodiments. Conversely, the time-varying weighted average unit 116 of the fourth embodiment may be applied to the first to third embodiments.

さらに、第４の実施形態の信号混合部１１５を、第１〜第３の実施形態に適用するようにしても良い。ただし、その場合、第４の実施形態で、信号混合部１１５が減衰量計算部１０９Ｃから取得している内容を、信号混合部１１５で求めることができるようにする必要がある。 Furthermore, the signal mixing unit 115 of the fourth embodiment may be applied to the first to third embodiments. However, in that case, in the fourth embodiment, it is necessary that the signal mixing unit 115 can obtain the content acquired by the signal mixing unit 115 from the attenuation amount calculation unit 109C.

（Ｅ−３）第２の実施形態ではエコーロス推定部１１０は双方向通話検出器１０３から、遠端シングルトーク状態（ＳＴ）を入力されたときだけ動作するようにしたが、無音状態（ＩＤＬ）、近端トーク状態（ＮＴ）のときにはエコーが発生していないので、この状態の間にマイクロホン５０周辺の背景ノイズＢＧＮ’（ｋ）を計算するようにしてもよい。 (E-3) In the second embodiment, the echo loss estimation unit 110 operates only when the far-end single talk state (ST) is input from the two-way call detector 103. However, the silence state (IDL) Since no echo is generated in the near-end talk state (NT), the background noise BGN ′ (k) around the microphone 50 may be calculated during this state.

（Ｅ−４）第３の実施形態では受信入力（遠端入力信号ｘ（ｎ））と、加算器１０１の出力とからエコー減衰量を計算して、エコー経路の総合減衰量を計算するようにしたが、加算器１０１の前後の信号を入力してパワー消去分を計算し、これとは別にエコーロス推定部１１０Ｂで計算したエコー経路の減衰量と合算した減衰の量でＡＣＯＭ１１２全体の計算をするようにしてもよい。 (E-4) In the third embodiment, the echo attenuation amount is calculated from the received input (far-end input signal x (n)) and the output of the adder 101 to calculate the total attenuation amount of the echo path. However, the power cancellation amount is calculated by inputting the signals before and after the adder 101, and the entire ACOM 112 is calculated by the amount of attenuation combined with the attenuation amount of the echo path calculated by the echo loss estimation unit 110B. You may make it do.

（Ｅ−５）第３の実施形態でＨＷＭ判定部１１３は、「ＡＣＯＭ＿Ｅ（ｋ）≧δａｃｏｍ」の成立とエコーキャンセル成功信号ｃｏｎｖがあることの両方を満たしてハウリング危険度小信号ＮＯ＿ＨＷを出力するようにしたが、エコー感の多少を気にしなくてもよい場合は、上記判定の不等式の成立だけでハウリング危険度小信号ＮＯ＿ＨＷを出力するようにしてもよい。 (E-5) In the third embodiment, the HWM determination unit 113 outputs both the establishment of “ACOM_E (k) ≧ δacom” and the presence of the echo cancellation success signal conv, and outputs the small howling risk signal NO_HW. However, if there is no need to worry about the feeling of echo, the small howling risk signal NO_HW may be output only by the establishment of the above inequality.

（Ｅ−６）第３の実施形態で、エコー経路の状態に応じた減衰量の付加を減衰器１１１に施し、一方で通話状態に応じた減衰量の付加を減衰器１０７に分割して施すようにしたが、これらの減衰量を減衰器１０７ひとつで加えるようにしてもよい。 (E-6) In the third embodiment, addition of attenuation according to the state of the echo path is applied to the attenuator 111, while addition of attenuation according to the call state is divided and applied to the attenuator 107. However, the attenuation amount may be added by one attenuator 107.

また、第３の実施形態で、双方向通話検出器１０３の検出遅延が無視できるほどに小さいときには、同様に減衰器１１１ひとつで減衰させるようにしてもよい。 Further, in the third embodiment, when the detection delay of the bidirectional call detector 103 is small enough to be ignored, it may be attenuated by one attenuator 111 in the same manner.

（Ｅ−７）第４の実施形態において、信号混合部１１５内部に、擬似背景ノイズパワー計算部３０６、実背景ノイズパワー計算部３０７、変動分ノイズパワー計算部３０８を設けて、データ保持時点のノイズ量と、遠端シングルトーク状態（ＳＴ）発生中のノイズ量との乖離を縮小するようにした。しかし、背景ノイズの変動がさほど重大でないときは、擬似背景ノイズパワー計算部３０６、実背景ノイズパワー計算部３０７、変動分ノイズパワー計算部３０８を省略して、擬似背景ノイズｂｎ（ｎ）と減衰器１０７Ｃの出力信号ａｔｔ（ｎ）を混合するようにしてもよい。 (E-7) In the fourth embodiment, a pseudo background noise power calculation unit 306, an actual background noise power calculation unit 307, and a fluctuation noise power calculation unit 308 are provided in the signal mixing unit 115 so that data at the time of data retention is provided. The divergence between the amount of noise and the amount of noise during the far-end single talk state (ST) is reduced. However, when the fluctuation of the background noise is not so significant, the pseudo background noise power calculation unit 306, the actual background noise power calculation unit 307, and the fluctuation noise power calculation unit 308 are omitted, and the pseudo background noise bn (n) and the attenuation are reduced. The output signal att (n) of the device 107C may be mixed.

１…電話装置、２０…Ｄ／Ａ変換器、３０…Ａ／Ｄ変換器、４０…スピーカ、５０…マイクロホン、１０…エコーキャンセラ、１０１…加算器、１０２…適応フィルタ、２０１…擬似エコー作成部、２０１１…保持レジスタ、２０１２…積和演算部、２０１３…係数部、２０２…係数更新量計算部、２０３…ステップゲイン計算部、１０３…双方向通話検出器、１０４…制御部、１０５…遅延部、１０６…遅延部、１０７…減衰器、１０８…スイッチ、ａ１、ｂ１、ｃ１…端子。 DESCRIPTION OF SYMBOLS 1 ... Telephone apparatus, 20 ... D / A converter, 30 ... A / D converter, 40 ... Speaker, 50 ... Microphone, 10 ... Echo canceller, 101 ... Adder, 102 ... Adaptive filter, 201 ... Pseudo echo creation part , 2011 ... holding register, 2012 ... product-sum operation unit, 2013 ... coefficient unit, 202 ... coefficient update amount calculation unit, 203 ... step gain calculation unit, 103 ... bidirectional call detector, 104 ... control unit, 105 ... delay unit 106 delay unit 107 attenuator 108 switch a1, b1, c1 terminal.

Claims

In an echo canceller that removes echo components from the near-end input signal,
Echo canceling means for generating a signal obtained by canceling the echo component of the near-end input signal using the pseudo echo generated by the pseudo echo generating means;
Voice switch means for generating a signal obtained by attenuating the near-end input signal by an attenuator;
Switch means for selecting either the signal after cancellation generated by the echo cancellation means or the signal generated by the voice switch means and sending it to the far end side;
Call state determination means for determining a call state between a far end speaker and a near end speaker using the far end input signal and the near end input signal;
An echo canceller comprising: switch control means for controlling the switch means in accordance with a determination result of the call state determination means.

The call state determination means detects a voiced state or a silent state for each of the far-end input signal and the near-end input signal, and based on the result, the far-end speaker and the near-end speaker The echo canceller according to claim 1, wherein a call state between the two is determined.

When the call state determination unit determines that the far-end single talk state in which only the far-end speaker is speaking, the switch control unit sends a signal generated by the voice switch unit to the switch unit. The echo canceller according to claim 2, wherein the echo canceller is controlled so as to select.

When it is determined that the call state determining unit has transitioned to a state other than the far-end single talk state after the determination, the switch control unit generates the echo canceling unit for the switch unit. The echo canceller according to claim 3, wherein control is performed so as to select a signal.

Echo path characteristic calculating means for calculating an attenuation amount or an amplification amount of an echo path related to the echo component;
Power comparison means for comparing the power of the entire near-end input signal, the power of the echo component, and the power of the background noise component on the near-end side included in the near-end input signal;
A signal attenuating means for generating a signal giving a set amount of attenuation to the near-end input signal, and supplying the generated signal to the echo canceling means and the voice switch means;
Attenuation amount calculating means for obtaining an attenuation amount to be set in the signal attenuating means using the calculation result of the echo path characteristic calculating means and the comparison result of the power comparing means,
The echo canceller according to claim 1, further comprising:

In the near-end input signal and the far-end input signal, further comprising howling occurrence risk determination means for determining a risk that howling based on the echo component occurs,
The switch control means selects the signal generated by the voice switch means for the switch means when the howling occurrence risk judgment means determines that the risk of howling is greater than or equal to a predetermined value. The echo canceller according to any one of claims 1 to 5, wherein

The above-mentioned howling occurrence risk determination means is:
The far-end input signal power and the cancellation generated by the echo cancellation means while the speech state determination means determines that only the far-end speaker is speaking a far-end single talk state. Using the power of the post-signal, an echo attenuation amount calculation unit for calculating the power of the echo component remaining in the post-cancellation signal generated by the echo cancellation means;
The echo canceller according to claim 6, further comprising: a howling occurrence risk level determination unit that determines a risk level of howling using the calculation result of the echo attenuation amount calculation unit.

The above-mentioned howling occurrence risk determination means is:
The signal after cancellation generated by the echo cancellation means is compared with the power of the background noise component on the near end side, and based on the comparison result, the cancellation of the echo component by the echo cancellation means is successful. It further has an echo cancellation effect determination unit that determines whether or not
The echo canceller according to claim 7, wherein the howling occurrence risk determination unit determines a risk level of howling using the determination result of the echo cancellation effect determination unit.

The switch means includes
A time-variant weighted average unit that generates a signal weighted and distributed to the far end side of the signal generated by the voice switch unit and the signal after cancellation generated by the echo cancellation unit;
For the time-varying weighted average unit, the weighting for the post-cancellation signal generated by the echo cancellation unit and the weighting for the signal generated by the voice switch unit are updated according to the control of the switch control unit. The echo canceller according to claim 1, further comprising: a weight update unit.

The voice switch part
Pseudo background noise holding means for holding the near end input signal as a pseudo background noise signal when both the far end input signal and the near end input signal are determined to be silent in the call state determination means;
While the speech state determination means determines that only the far-end input signal is in a sound state, the signal output from the attenuator and the pseudo background noise held by the pseudo background noise holding means Signal mixing means for generating a mixed signal as a signal output by the voice switch unit;
The mixing ratio to be set in the signal mixing means is determined, and the mixing ratio is obtained by using the power of the pseudo background noise held by the pseudo background noise holding means and the power of the near-end input signal. The echo canceller according to claim 1, further comprising: a mixture ratio determining unit that determines the ratio.

11. The echo canceller according to claim 10, wherein the mixing ratio determining means determines a mixing ratio such that the power of the signal output from the signal mixing means does not deviate from the power of the near-end input signal.

A computer installed in an echo canceller that removes the echo component from the near-end input signal.
Echo canceling means for generating a signal obtained by canceling the echo component of the near-end input signal using the pseudo echo generated by the pseudo echo generating means;
Voice switch means for generating a signal obtained by attenuating the near-end input signal by an attenuator;
Switch means for selecting either the signal after cancellation generated by the echo cancellation means or the signal generated by the voice switch means and sending it to the far end side;
Call state determination means for determining a call state between a far end speaker and a near end speaker using the far end input signal and the near end input signal;
An echo canceling program that functions as a switch control unit that controls the switch unit according to a determination result of the call state determination unit.

A telephone apparatus comprising the echo canceller according to any one of claims 1 to 11.