JP5293342B2

JP5293342B2 - Voice communication apparatus, method and program

Info

Publication number: JP5293342B2
Application number: JP2009082733A
Authority: JP
Inventors: 一良秋江; 弘美青柳
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2009-03-30
Filing date: 2009-03-30
Publication date: 2013-09-18
Anticipated expiration: 2029-03-30
Also published as: JP2010239243A

Abstract

PROBLEM TO BE SOLVED: To provide a voice communication apparatus whose call quality does not degrade even when the residual echo suppression function acts excessively.

SOLUTION: The voice communication apparatus includes an echo canceller composed of an echo canceller body, which generates a pseudo echo to remove echo components contained in the transmission signal, and a residual echo component suppression means, which suppresses residual echo components contained in the transmission signal output from the echo canceller body. The apparatus further includes a background noise component extraction means for extracting background noise components from the transmission signal, and a noise superimposing means for superimposing the extracted background noise components on the transmission signal output from the echo canceller.

COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to a voice communication apparatus, method, and program, and is applicable, for example, when an echo canceller for cancelling echo arising between the transmission and reception paths is used together with a noise canceller for cancelling noise input from the transmitting microphone.

In telephone networks, adaptive echo cancellers are widely used to remove echo generated by hybrid circuits (two-wire/four-wire conversion circuits), known as line echo, and echo that leaks from the loudspeaker of a telephone terminal into its microphone, known as acoustic echo.

In general, an echo canceller uses an adaptive filter that takes as inputs the received signal from the far-end speaker and the transmission signal in which an echo of that received signal is mixed, generates a pseudo echo, and subtracts it from the transmission signal. In addition, as part of the echo canceller function, an attenuator, a center clipper, or the like is used to nonlinearly suppress the level of the echo left over after subtraction of the pseudo echo estimated by the adaptive filter (see Patent Document 1).

Separately from the echo canceller, some voice communication apparatuses have a noise canceller for removing noise picked up by the microphone.

Furthermore, some voice communication apparatuses have both an echo canceller and a noise canceller (see Patent Document 2). In such an apparatus, two arrangements are possible for where the noise canceller is inserted: the apparatus either cancels noise first and then cancels echo, or cancels echo first and then cancels noise.

In an apparatus that cancels noise first and then echo, the echo canceller takes as inputs the received signal from the far-end input (Rin) terminal and the noise canceller output for the transmission signal from the near-end input (Sin) terminal.

In an apparatus that cancels echo first and then noise, the echo canceller takes as inputs the received signal from the Rin terminal and the transmission signal from the Sin terminal, and the output signal of the echo canceller is fed to the noise canceller.

Patent Document 1: JP-A-8-32494
Patent Document 2: JP 2002-118504 A

However, using an echo canceller and a noise canceller together has the following problems.

When noise is cancelled first and echo afterwards, the noise-cancelling filter varies with time, so the influence of the noise canceller enters the echo learning and the echo cannot be learned accurately.

Conversely, when echo is cancelled first and noise afterwards, a large error remains in the echo canceller output while the echo canceller is still learning. The noise canceller misjudges this error as noise or speech, and errors arise in its state determination and parameter estimation.

The voice communication apparatus described in Patent Document 2 attempts to improve echo suppression and noise suppression by passing parameters between the echo canceller and the noise canceller. However, it cannot fully prevent mislearning by the echo canceller or cancellation errors by the noise canceller, and call quality sometimes deteriorated.

The present invention has been made in view of the above, and aims to provide a voice communication apparatus, method, and program in which call quality does not deteriorate even when the residual echo suppression function acts excessively.

The voice communication apparatus of the first aspect of the present invention comprises: (1) an echo canceller having an echo canceller body that generates a pseudo echo and removes the echo component contained in a transmission signal, and residual echo component suppression means that suppresses, according to a nonlinear suppression characteristic, the residual echo component contained in the transmission signal output from the echo canceller body; (2) background noise component extraction means that converts the transmission signal into a frequency-domain signal, extracts a background noise component, and converts the extracted background noise component back into a time-domain signal for output; and (3) noise superimposing means that multiplies the background noise component extracted by the background noise component extraction means by a first predetermined factor and superimposes it on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

In the voice communication method of the second aspect of the present invention: (1) the echo canceller body of an echo canceller generates a pseudo echo and removes the echo component contained in a transmission signal, and the residual echo component suppression means of the echo canceller suppresses, according to a nonlinear suppression characteristic, the residual echo component contained in the transmission signal output from the echo canceller body; (2) background noise component extraction means converts the transmission signal into a frequency-domain signal, extracts a background noise component, and converts the extracted background noise component back into a time-domain signal for output; and (3) noise superimposing means multiplies the background noise component extracted by the background noise component extraction means by a first predetermined factor and superimposes it on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

The voice communication program of the third aspect of the present invention causes a computer to function as: (1) an echo canceller having an echo canceller body that generates a pseudo echo and removes the echo component contained in a transmission signal, and residual echo component suppression means that suppresses, according to a nonlinear suppression characteristic, the residual echo component contained in the transmission signal output from the echo canceller body; (2) background noise component extraction means that converts the transmission signal into a frequency-domain signal, extracts a background noise component, and converts the extracted background noise component back into a time-domain signal for output; and (3) noise superimposing means that multiplies the background noise component extracted by the background noise component extraction means by a first predetermined factor and superimposes it on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

According to the present invention, a background noise component can be superimposed during periods in which the residual echo suppression function acts excessively, so that deterioration of call quality during those periods can be prevented.

FIG. 1 is a block diagram showing the configuration of the voice communication apparatus according to the first embodiment.
FIG. 2 is a block diagram showing a configuration example of the noise extraction unit in the first embodiment.
FIG. 3 is a signal waveform diagram of each part before and after the far-end single-talk state under a high level of background noise in the first embodiment.
FIG. 4 is a block diagram showing the configuration of the voice communication apparatus according to the second embodiment.

(A) First Embodiment
Hereinafter, a first embodiment of the voice communication apparatus, method, and program according to the present invention will be described in detail with reference to the drawings.

(A-1) Configuration of the First Embodiment
FIG. 1 is a block diagram showing the configuration of the voice communication apparatus according to the first embodiment. Even when the voice communication apparatus of the first embodiment is, for example, a softphone realized by a CPU and a program executed by the CPU (the voice communication program of the first embodiment), it can be represented functionally by the block diagram of FIG. 1.

In FIG. 1, the voice communication apparatus 1 of the first embodiment has an echo canceller 4 for cancelling acoustic echo that leaks from the loudspeaker 2 of a telephone terminal into the microphone 3. Although the echo canceller 4 shown in FIG. 1 cancels acoustic echo, it may instead cancel line echo generated by a hybrid circuit in the telephone network.

The echo canceller 4 of the first embodiment has an adaptive filter 101, a subtractor 102, and an attenuator 103.

The adaptive filter 101 estimates the impulse response of the echo path from the loudspeaker 2 to the microphone 3 and generates a pseudo echo.

The subtractor 102 subtracts the pseudo echo generated by the adaptive filter 101 from the transmission signal captured by the microphone 3, thereby removing the echo component from the transmission signal that the voice communication apparatus 1 sends.
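The interaction of the adaptive filter 101 and the subtractor 102 can be illustrated with a minimal sketch. The description does not state which adaptation algorithm the adaptive filter uses; the normalized LMS (NLMS) update below, and the names `nlms_echo_cancel`, `taps`, `mu`, and `eps`, are assumptions chosen only for illustration.

```python
def nlms_echo_cancel(far_end, mic, taps=8, mu=0.5, eps=1e-8):
    """Sketch of adaptive filter 101 plus subtractor 102.

    far_end: received-signal samples driving the loudspeaker.
    mic:     transmission-signal samples captured by the microphone.
    NLMS is an assumed algorithm; the source names no specific one.
    Returns (echo-cancelled samples, final filter coefficients).
    """
    w = [0.0] * taps        # estimated echo-path impulse response
    history = [0.0] * taps  # most recent far-end samples, newest first
    out = []
    for x, d in zip(far_end, mic):
        history = [x] + history[:-1]
        # Adaptive filter 101: pseudo echo from the current estimate.
        pseudo_echo = sum(wi * hi for wi, hi in zip(w, history))
        # Subtractor 102: remove the pseudo echo from the microphone signal.
        e = d - pseudo_echo
        # NLMS coefficient update, normalized by the input power.
        norm = sum(h * h for h in history) + eps
        w = [wi + mu * e * hi / norm for wi, hi in zip(w, history)]
        out.append(e)
    return out, w
```

With a noise-free microphone signal the coefficients approach the true echo path; with strong background noise they converge less well, which is the situation the attenuator 103 is meant to cover.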

The attenuator 103 attenuates the level of the echo component that could not be fully cancelled in the transmission signal output from the subtractor 102. It operates in the far-end single-talk state, that is, when no near-end speaker's voice is being input to the microphone 3 while the far-end speaker's voice signal is being received and output from the loudspeaker 2.

Although FIG. 1 uses the attenuator 103 to attenuate the level of the echo component that could not be fully cancelled, another configuration may be used for this purpose. For example, a center clipper can be used in place of the attenuator 103.
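The two residual-echo suppressors named here can be sketched as follows. This is only an illustration: the 6 dB default, the hard on/off gating by a single-talk flag, and the zeroing form of the center clipper are assumptions, and the function names are hypothetical.

```python
def attenuate(sample, far_end_single_talk, loss_db=6.0):
    """Attenuator sketch: apply a fixed loss only during far-end single talk.

    loss_db is an assumed setting, not a value prescribed by the source.
    """
    if not far_end_single_talk:
        return sample
    return sample * 10.0 ** (-loss_db / 20.0)


def center_clip(sample, threshold):
    """Center clipper sketch: zero any sample whose magnitude is at or
    below the threshold and pass larger samples unchanged."""
    return 0.0 if abs(sample) <= threshold else sample
```

Both act on everything in the signal, not only on echo, so low-level background noise is suppressed together with the residual echo.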

When the level of the background noise picked up by the microphone 3 is high, the attenuator 103 removes not only the residual echo component but also the background noise. As a result, the far-end party hears the sound as being cut off, and the call feels unnatural. Also, in intervals with no voice signal from either the far-end or the near-end speaker, if the background noise picked up by the microphone 3 is at a level at which the suppression operation of the attenuator 103 does not act, the background noise is output from the attenuator 103.

In the first embodiment, the near-end speaker can select how background noise is processed. For example, the near-end speaker asks the far-end speaker for their preference and selects the processing accordingly.

In FIG. 1, the voice communication apparatus 1 of the first embodiment has, as its configuration for processing background noise, a noise extraction unit 104, a level variable unit 105, an adder 106, a level variable unit 107, a subtractor 108, and switches 109 and 110. Here, the noise extraction unit 104, the level variable unit 107, and the subtractor 108 constitute a noise canceller.

The noise extraction unit 104 receives the signal captured by the microphone 3 and generates (extracts) a noise (background noise) component from it. The noise-component extraction structure of an existing noise canceller can be used as the noise extraction unit 104. FIG. 2 is a block diagram showing a configuration example of the noise extraction unit 104. Although FIG. 2 shows an example of a frequency-domain method using the FFT (Fast Fourier Transform), the unit is not limited to this method.

In FIG. 2, the noise extraction unit 104 has a window function processing unit 111, an FFT processing unit 112, a suppressor 113, an IFFT processing unit 114, and a window function overlap unit 115.

The window function processing unit 111 receives the signal captured by the microphone 3 and applies a window function to cut out the analysis-interval data for FFT processing.

The FFT processing unit 112 receives the time-domain data output from the window function processing unit 111 and converts it by FFT into frequency-domain data with as many points as the FFT size.

The suppressor 113 receives the frequency-domain data output from the FFT processing unit 112, suppresses only the speech component while leaving the noise component unsuppressed, and outputs frequency-domain data containing only the noise component.

The IFFT processing unit 114 receives the frequency-domain data reduced to the noise component by the suppressor 113 and converts it back into time-domain data by inverse FFT.

The window function overlap unit 115 joins the cut-out data segments into continuous data and outputs it.
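The chain of units 111 to 115 can be sketched in a few lines. The description does not say how the suppressor 113 separates speech from noise, so the rule used here (limit each frequency bin to a supplied per-bin noise-floor magnitude) is an assumption, as are the periodic Hann window, the 50% overlap, the tiny frame size, and the naive DFT standing in for the FFT. All names are hypothetical.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive DFT used in place of an FFT to keep the sketch dependency-free."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out


def extract_noise(signal, noise_floor, frame=8):
    """Return a time-domain estimate of the noise component of `signal`.

    noise_floor: assumed per-bin magnitude ceiling (length `frame`).
    """
    hop = frame // 2
    # Periodic Hann window: overlapping copies at 50% hop sum to 1.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame) for i in range(frame)]
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - frame + 1, hop):
        seg = [signal[start + i] * window[i] for i in range(frame)]  # unit 111
        spec = dft(seg)                                              # unit 112
        kept = []
        for k, v in enumerate(spec):                                 # unit 113
            mag = abs(v)
            gain = 1.0 if mag <= noise_floor[k] else noise_floor[k] / mag
            kept.append(v * gain)
        rec = dft(kept, inverse=True)                                # unit 114
        for i in range(frame):                                       # unit 115
            out[start + i] += rec[i].real
    return out
```

Bins whose magnitude stays under the floor pass unchanged, while louder bins (assumed to carry speech) are scaled down to the floor. Samples in the first and last half frame are covered by only one window and are therefore tapered.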

The switches 109 and 110 are turned on alternatively and complementarily in response to operation by the user (near-end speaker).

The switch 109 is turned on, for example, when noise is to be added to the signal portions nonlinearly attenuated by the attenuator 103 so as to eliminate the unnatural feel that signal dropouts give to the call. When the switch 109 is on, the level variable unit 105 and the adder 106 are active.

When the switch 109 is on, the level variable unit 105 adjusts the level of the noise component extracted by the noise extraction unit 104 and supplies it to the adder 106. Its variable gain changes in response to operation by the user (near-end speaker).

The adder 106 adds the noise component output from the level variable unit 105 to the signal output from the attenuator 103. When the switch 110 is on, the adder 106 passes the signal output from the attenuator 103 unchanged.

The switch 110 is turned on, for example, when the attenuator 103 is not attenuating (that is, outside the far-end single-talk state) and the apparatus is to operate as a noise canceller that subtracts noise, so as to eliminate the sound quality degradation caused by noise.

When the switch 110 is on, the level variable unit 107 adjusts the level of the noise component extracted by the noise extraction unit 104 and supplies it to the subtractor 108. Its variable gain changes in response to operation by the user (near-end speaker).

The subtractor 108 subtracts the noise component output from the level variable unit 107 from the signal output from the attenuator 103. When the switch 109 is on, the subtractor 108 passes the signal output from the adder 106 unchanged.
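The signal path through the adder 106 and subtractor 108 reduces to a choice between adding and subtracting a scaled copy of the extracted noise. The sketch below models the two switches as a single `mode` argument; that argument, the function name, and the default gains are assumptions made for illustration.

```python
def process_noise(att_out, noise, mode, add_gain=1.0, sub_gain=1.0):
    """Combine the attenuator output with the extracted noise.

    mode "superimpose": switch 109 on, level variable unit 105 and adder 106 active.
    mode "cancel":      switch 110 on, level variable unit 107 and subtractor 108 active.
    The gains stand for the user-adjustable settings of units 105 and 107.
    """
    if mode == "superimpose":
        return [s + add_gain * n for s, n in zip(att_out, noise)]
    if mode == "cancel":
        return [s - sub_gain * n for s, n in zip(att_out, noise)]
    raise ValueError("mode must be 'superimpose' or 'cancel'")
```

In the superimpose mode a fully muted attenuator output (all zeros) comes out as the scaled noise alone, which is what fills the dropouts heard at the far end.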

(A-2) Operation of the First Embodiment
Next, the operation of the voice communication apparatus 1 according to the first embodiment will be described, first for when the switch 109 is on and then for when the switch 110 is on.

When the background noise level is high and the attenuation by the attenuator 103 heavily suppresses even the background noise, so that the signal sent to the far-end speaker has dropouts and the far-end speaker finds the call unnatural, the user (near-end speaker) turns on the switch 109.

In this case, the noise component extracted by the noise extraction unit 104 from the signal captured by the microphone 3 is supplied through the switch 109 to the level variable unit 105, where its level is adjusted. The adjusted noise component is added by the adder 106 to the signal output from the attenuator 103. Because of this addition, even if the signal output from the attenuator 103 has dropouts, they are filled in, and the unnatural feel caused by signal dropouts is eliminated.

FIG. 3 shows the signal waveforms at each part before and after the far-end single-talk state when the background noise level is high. As shown in FIG. 3(A), the signal captured by the microphone 3 is a superposition of an echo component and background noise. Even after echo removal using the pseudo echo, if the background noise level is high and, for example, the filter coefficients of the adaptive filter 101 converge poorly, the subtractor 102 outputs a signal in which noise is superimposed on the residual echo, as shown in FIG. 3(B). Owing to the nonlinear suppression characteristic of the attenuator 103, the signal is heavily suppressed in intervals where the residual echo is large, so that, as shown in FIG. 3(C), the output signal of the attenuator 103 contains neither noise nor any useful signal component in some intervals. When the noise extracted by the noise extraction unit 104 and level-adjusted by the level variable unit 105 is added to this output signal by the adder 106, the voice communication apparatus 1 transmits to the far end a signal that, as shown in FIG. 3(D), contains essentially only level-adjusted noise.

When the operation of the attenuator 103 does not cause dropouts in the background noise, the user (near-end speaker) turns on the switch 110 so that the noise canceller functions.

When neither the far-end speaker's nor the near-end speaker's voice signal is present, or when only the far-end speaker's voice signal is present, the output signal of the attenuator 103 contains essentially only background noise.

With the switch 110 on, the noise component extracted by the noise extraction unit 104 from the signal captured by the microphone 3 is supplied through the switch 110 to the level variable unit 107, where its level is adjusted. The adjusted noise component is subtracted by the subtractor 108 from the signal output from the attenuator 103, so that the voice communication apparatus 1 transmits to the far end a signal from which the noise has been removed.

The signal waveforms at each part during operation as a noise canceller are the same as those of a conventional noise canceller and are therefore not illustrated.

(A-3) Effects of the First Embodiment
According to the first embodiment, even when the echo canceller has an auxiliary unit that suppresses residual echo, such as an attenuator or a center clipper, calls with few perceived dropouts are possible while a large amount of echo cancellation is maintained.

In addition, depending on the user's selection, operation that gives priority to noise removal can be performed instead of dropout compensation.

(B) Second Embodiment
Next, a second embodiment of the voice communication apparatus, method, and program according to the present invention will be described in detail with reference to the drawings.

FIG. 4 is a block diagram showing the configuration of the voice communication apparatus according to the second embodiment. Parts identical or corresponding to those in FIG. 1 of the first embodiment bear identical or corresponding reference numerals.

In the first embodiment the switches 109 and 110 are operated manually by the user (near-end speaker), whereas in the voice communication apparatus 1A of the second embodiment a switch control circuit 120 controls the on/off states of the switches 109 and 110.

The switch control circuit 120 turns on the switch 109, for example, (1) for a predetermined time after the adaptive filter 101 starts its adaptive operation (or until it is regarded as having converged), (2) when the average input level to the attenuator 103 exceeds a predetermined threshold in the single-talk state in which only the received signal is present, or (3) when the attenuation amount set in the attenuator 103 is large (for example, 6 dB). In all other cases it turns on the switch 110.
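The decision made by the switch control circuit 120 can be written as a small function. This sketch reads conditions (1) to (3) as alternatives, any one of which selects the switch 109; the argument names, the returned mode strings, and the use of 6 dB as the boundary for a "large" attenuation are assumptions.

```python
def select_switch(adapting, far_end_single_talk, attenuator_input_level,
                  level_threshold, attenuation_db, large_attenuation_db=6.0):
    """Return "superimpose" (switch 109 on) or "cancel" (switch 110 on).

    adapting: True while the adaptive filter is within its initial
              adaptation period or not yet regarded as converged.
    """
    cond1 = adapting
    cond2 = far_end_single_talk and attenuator_input_level > level_threshold
    cond3 = attenuation_db >= large_attenuation_db
    return "superimpose" if (cond1 or cond2 or cond3) else "cancel"
```

For example, once the filter has converged and a mild 3 dB attenuation is set, the function selects the noise canceller unless the attenuator input is loud during far-end single talk.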

Apart from this control of the switches 109 and 110, the configuration and operation of the second embodiment are the same as those of the first embodiment.

The second embodiment also allows calls with few perceived dropouts while a large amount of echo cancellation is maintained, even when the echo canceller has an auxiliary unit that suppresses residual echo, and it likewise allows operation that gives priority to noise removal instead of dropout compensation.

In addition, according to the second embodiment, these operation modes can be set automatically.

(C) Other Embodiments
Various modifications have been mentioned in the description of the above embodiments. Further modifications such as the following are also possible.

In the above embodiments, the voice communication apparatus has a noise canceller configuration in addition to the configuration that compensates for the dropouts caused by the auxiliary unit that suppresses residual echo. The apparatus may, however, be configured without the noise canceller configuration.

In the above embodiments, switching between the dropout-compensating configuration and the noise canceller configuration is instantaneous, but it may be gradual. For example, the gain of the level variable unit 105 or 107 that has been active may be gradually decreased while the gain of the other level variable unit 107 or 105, which has been inactive, is gradually increased.
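The gradual switching described above amounts to ramping one gain down while the other ramps up. A minimal sketch follows, assuming a linear ramp over a fixed number of steps; the ramp shape, the step count, and the function name are not specified in the description and are illustrative only.

```python
def crossfade(outgoing_gain, incoming_gain, steps):
    """Return per-step (outgoing, incoming) gain pairs for a gradual switch.

    outgoing_gain: current gain of the level variable unit being disabled.
    incoming_gain: target gain of the level variable unit being enabled.
    A linear ramp is assumed; any monotonic ramp would serve.
    """
    pairs = []
    for i in range(1, steps + 1):
        frac = i / steps
        pairs.append((outgoing_gain * (1.0 - frac), incoming_gain * frac))
    return pairs
```

Calling `crossfade(1.0, 0.5, 4)` yields four pairs that end at `(0.0, 0.5)`, at which point only the newly enabled path remains active.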

さらに、上記各実施形態では、スイッチ１０９及び１１０が、択一的かつ相補的にオン動作するものを示したが、同時にオンするものでなければ良い。例えば、第２の実施形態の変形例として、スイッチ１０９をオンしない期間に常にスイッチ１１０をオンするのに代え、スイッチ１０９をオンしない期間で、しかも、受話音声がなく、ノイズパワーが所定の閾値以上のときだけスイッチ１１０をオンするようにしても良い。 Further, in each of the above embodiments, the switches 109 and 110 are alternatively and complementarily turned on. However, the switches 109 and 110 may not be turned on at the same time. For example, as a modification of the second embodiment, instead of always turning on the switch 110 during the period when the switch 109 is not turned on, there is no received voice and the noise power is a predetermined threshold value during the period when the switch 109 is not turned on. The switch 110 may be turned on only at the above time.

The apparatus to which the telephone communication apparatus of the above embodiments is applied is not limited to a telephone terminal and may be, for example, a PBX.

Description of Symbols: 1, 1A … voice communication apparatus; 4 … echo canceller; 104 … noise extraction unit; 105, 107 … level variable device; 106 … adder; 108 … subtractor; 109, 110 … switch; 120 … switch control circuit.

Claims

1. A voice communication apparatus comprising:
an echo canceller having an echo canceller body that generates a pseudo echo and removes an echo component contained in a transmission signal, and residual echo component suppression means that suppresses, according to a nonlinear suppression characteristic, a residual echo component contained in the transmission signal output from the echo canceller body;
background noise component extraction means that converts the transmission signal into a frequency-domain signal, extracts a background noise component, and returns the extracted background noise component to a time-domain signal; and
noise superimposing means that superimposes the background noise component extracted by the background noise component extraction means, scaled by a first predetermined multiple, on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

2. The voice communication apparatus according to claim 1, further comprising: noise removal means that removes, from the transmission signal output from the echo canceller, the background noise component extracted by the background noise component extraction means, scaled by a second predetermined multiple; and noise processing selection means that enables at most one of the noise superimposing means and the noise removal means.

3. A voice communication method, wherein:
an echo canceller body of an echo canceller generates a pseudo echo to remove an echo component contained in a transmission signal, and residual echo component suppression means of the echo canceller suppresses, according to a nonlinear suppression characteristic, a residual echo component contained in the transmission signal output from the echo canceller body;
background noise component extraction means converts the transmission signal into a frequency-domain signal, extracts a background noise component, returns the extracted background noise component to a time-domain signal, and outputs it; and
noise superimposing means superimposes the background noise component extracted by the background noise component extraction means, scaled by a first predetermined multiple, on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

4. The voice communication method according to claim 3, wherein:
noise removal means removes, from the transmission signal output from the echo canceller, the background noise component extracted by the background noise component extraction means, scaled by a second predetermined multiple; and
noise processing selection means enables at most one of the noise superimposing means and the noise removal means.

5. A voice communication program that causes a computer to function as:
an echo canceller having an echo canceller body that generates a pseudo echo and removes an echo component contained in a transmission signal, and residual echo component suppression means that suppresses, according to a nonlinear suppression characteristic, a residual echo component contained in the transmission signal output from the echo canceller body;
background noise component extraction means that converts the transmission signal into a frequency-domain signal, extracts a background noise component, and returns the extracted background noise component to a time-domain signal; and
noise superimposing means that superimposes the background noise component extracted by the background noise component extraction means, scaled by a first predetermined multiple, on the transmission signal output from the echo canceller, thereby compensating the transmission signal during periods of excessive suppression by the residual echo component suppression means.

6. The voice communication program according to claim 5, further causing the computer to function as:
noise removal means that removes, from the transmission signal output from the echo canceller, the background noise component extracted by the background noise component extraction means, scaled by a second predetermined multiple; and
noise processing selection means that enables at most one of the noise superimposing means and the noise removal means.