JP5051235B2

JP5051235B2 - Echo suppression system, echo suppression method, echo suppression program, echo suppression device, and sound output device

Info

Publication number: JP5051235B2
Application number: JP2009536910A
Authority: JP
Inventors: 直司松尾; 太介伊藤
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2007-10-12
Filing date: 2007-12-10
Publication date: 2012-10-17
Anticipated expiration: 2027-12-10
Also published as: JPWO2009047871A1

Description

本発明は、音信号に基づいて音を出力する音出力装置と、音が入力される音入力装置と、該音入力装置が入力した音から、前記音出力装置が出力した音に基づくエコーを抑圧するエコー抑圧装置とを備えるエコー抑圧システム、該エコー抑圧システムに適用されるエコー抑圧方法、前記エコー抑圧システムにて用いられるエコー抑圧装置を実現するためのエコー抑圧プログラム、前記エコー抑圧システムにて用いられるエコー抑圧装置及び音出力装置に関する。 The present invention relates to a sound output device that outputs sound based on a sound signal, a sound input device to which sound is input, and an echo based on the sound output by the sound output device from the sound input by the sound input device. In an echo suppression system comprising an echo suppression device for suppression, an echo suppression method applied to the echo suppression system, an echo suppression program for realizing an echo suppression device used in the echo suppression system, and the echo suppression system about the echo suppressor and a sound output equipment used.

例えば、従来のカーナビゲーションシステムにおける音声認識では、話者が発声する前に発話スイッチを押下することにより、カーナビゲーションシステムを消音し、カーナビゲーションシステムのスピーカから出力された音が、エコーとしてマイクロホンに回り込み、音声の誤認識の要因となることを防止している。これに対し、出力音を消音することなく音声認識による操作を行いたいという使用者からの要望がある。この様な要望に応えるためにはスピーカからマイクロホンに回り込む音を抑圧するエコー抑圧方法が必要となる。また車両内の音響、即ちカーオーディオシステムでは、マルチチャネルに対応すべく複数のスピーカが用いられているため、マルチチャネルのカーオーディオシステムから音声認識等の用途に用いられるマイクロホンに回り込むエコーに対応するエコー抑圧方法が要求される。 For example, in speech recognition in a conventional car navigation system, the car navigation system is muted by pressing the utterance switch before the speaker speaks, and the sound output from the speaker of the car navigation system is echoed to the microphone. This prevents wraparound and misrecognition of voice. On the other hand, there is a request from the user to perform an operation by voice recognition without muting the output sound. In order to meet such a demand, an echo suppression method for suppressing the sound that circulates from the speaker to the microphone is required. In addition, in a car audio system, that is, in a car audio system, a plurality of speakers are used to support multi-channel, so that it corresponds to an echo that circulates from a multi-channel car audio system to a microphone used for voice recognition or the like. An echo suppression method is required.

そこでモノラルチャネルのオーディオシステムに対応するエコー抑圧方法をマルチチャネルのオーディオシステムに適用したエコー抑圧装置が提案されている。図１は、従来の第１のエコー抑圧装置の構成を示す模式図である。図１中１０００は、エコー抑圧装置であり、エコー抑圧装置１０００は、マルチチャネルオーディオ２０００から出力された複数チャネルの音信号を、複数のスピーカ２００１，２００１，…へ出力し、各スピーカ２００１，２００１，…は、夫々の音信号に基づく出力を行う。そしてエコー抑圧装置１０００は、マイクロホン２００２にて受音した音から、出力された複数チャネルの音に基づくエコーを除去する。エコーの除去は、受音した音に基づく観測音信号ｙ（ｔ）から、出力する複数チャネル（ｎ）の音に基づく参照音信号ｘ１（ｔ），…，ｘｎ（ｔ）を、夫々のチャネルの参照音信号ｘ１（ｔ），…，ｘｎ（ｔ）に対応した複数の抑圧機構（エコーキャンセラ）１００１−１，…，１００１−ｎにて夫々に対応するエコーを抑圧することにより行う。 Therefore, an echo suppression apparatus has been proposed in which an echo suppression method corresponding to a monaural channel audio system is applied to a multi-channel audio system. FIG. 1 is a schematic diagram showing a configuration of a conventional first echo suppressor. In FIG. 1, reference numeral 1000 denotes an echo suppression apparatus. The echo suppression apparatus 1000 outputs a plurality of channels of sound signals output from the multichannel audio 2000 to a plurality of speakers 2001, 2001,. ,... Output based on the respective sound signals. Then, the echo suppression apparatus 1000 removes echoes based on the output sound of a plurality of channels from the sound received by the microphone 2002. The echo is removed from the observed sound signal y (t) based on the received sound by using the reference sound signals x1 (t),..., Xn (t) based on the sounds of the plurality of channels (n) to be output on the respective channels. , Xn (t) corresponding to the reference sound signals x1 (t),..., Xn (t) are suppressed by suppressing the corresponding echoes by a plurality of suppression mechanisms (echo cancellers) 1001-1,.

またモノラルチャネルのオーディオシステムに対応するエコー抑圧方法をマルチチャネルのオーディオシステムに適用したエコー抑圧装置は、図１以外の形態についても提案されている。図２は、従来の第２のエコー抑圧装置の構成を示す模式図である。第２のエコー抑圧装置１０００は、複数チャネルの音に基づく参照音信号ｘ１（ｔ），…，ｘｎ（ｔ）を加算する加算機構１００２にて加算参照音信号ｘ（ｔ）を生成し、一の抑圧機構１００１にて、観測音信号ｙ（ｔ）から、加算参照音信号ｘ（ｔ）に基づいてエコーを抑圧する。 An echo suppressor that applies an echo suppression method corresponding to a monaural channel audio system to a multi-channel audio system has also been proposed for forms other than FIG. FIG. 2 is a schematic diagram showing a configuration of a conventional second echo suppression device. The second echo suppressor 1000 generates an added reference sound signal x (t) by an adding mechanism 1002 that adds reference sound signals x1 (t),..., Xn (t) based on sounds of a plurality of channels. The suppression mechanism 1001 suppresses echoes from the observed sound signal y (t) based on the added reference sound signal x (t).

図３は、従来のエコー抑圧装置が備える抑圧機構１００１の機能構成を示す機能ブロック図である。抑圧機構１００１は、話者が発声しているダブルトークの状態及び発声していないシングルトークの状態を検出する検出部１００１０と、学習同定法を用いた適応処理により、エコーの推定に要するフィルタ係数を更新するフィルタ係数更新部１００１１と、フィルタ係数を用いた数百次の積和演算により、参照音信号ｘ（ｔ）から、エコー信号ｘ'（ｔ）を推定する線形ＦＩＲフィルタ部１００１２と、観測音信号ｙ（ｔ）からエコー信号ｘ'（ｔ）を減算することによりエコーの抑圧を行った抑圧結果ｒ（ｔ）を出力する減算部１００１３とを備えている。検出部１００１０は、抑圧結果ｒ（ｔ）の強度変化に基づいてシングルトークの状態とダブルトークの状態とを検出し、ダブルトークの状態時には、フィルタ係数更新部１００1１のフィルタ係数の更新を停止させる。フィルタ係数更新部１００１１は、抑圧結果ｒ（ｔ）に基づいてフィルタ係数を算出する。 FIG. 3 is a functional block diagram showing a functional configuration of a suppression mechanism 1001 provided in a conventional echo suppression device. The suppression mechanism 1001 includes a detection unit 10010 that detects a state of double talk that the speaker is uttering and a state of single talk that is not uttered, and a filter coefficient required for echo estimation by adaptive processing using a learning identification method. A filter coefficient updating unit 10011 for updating the signal, a linear FIR filter unit 10012 for estimating the echo signal x ′ (t) from the reference sound signal x (t) by a several hundredth-order product-sum operation using the filter coefficient, A subtractor 10013 that outputs a suppression result r (t) obtained by performing echo suppression by subtracting the echo signal x ′ (t) from the observation sound signal y (t). The detection unit 10010 detects a single talk state and a double talk state based on the intensity change of the suppression result r (t), and stops the update of the filter coefficient of the filter coefficient update unit 10011 in the double talk state. . The filter coefficient update unit 10011 calculates a filter coefficient based on the suppression result r (t).

なお図１に示したエコー抑圧装置１０００では、図３に示す抑圧機構１００１を参照音信号ｘ１（ｔ），…，ｘｎ（ｔ）毎に備えている。この様なエコー抑圧方法は、例えば特許文献１に開示されている。
特開２００２−２３７７６９号公報 The echo suppression apparatus 1000 shown in FIG. 1 includes the suppression mechanism 1001 shown in FIG. 3 for each reference sound signal x1 (t),..., Xn (t). Such an echo suppression method is disclosed in Patent Document 1, for example.
JP 2002-237769 A

しかしながら図１に示した様に各チャネルに対応する抑圧機構を要する方法では、コストの増加、装置の大型化等の問題がある。特に設置スペースの制約が大きいカーナビゲーションシステムに適用する場合、大型化の問題は深刻である。 However, the method requiring a suppression mechanism corresponding to each channel as shown in FIG. 1 has problems such as an increase in cost and an increase in the size of the apparatus. In particular, when applied to a car navigation system having a large installation space restriction, the problem of enlargement is serious.

さらに図２に示した様に参照音信号を加算してモノラルチャネルの加算参照音信号とした場合には、抑圧できていない残留エコーが大きくなるという問題が生じる。これはマルチチャネルオーディオ２０００による音楽等の音の出力では、各スピーカの再生音及び強度が独立して変化するため、複数経路のエコーを一の適応処理にて学習すると、推定精度が低下するためである。 Further, as shown in FIG. 2, when the reference sound signals are added to obtain a mono channel addition reference sound signal, there arises a problem that the residual echo that cannot be suppressed increases. This is because in the output of sound such as music by the multi-channel audio 2000, the reproduction sound and intensity of each speaker change independently, and therefore, if the echoes of a plurality of paths are learned by one adaptive process, the estimation accuracy is reduced. It is.

本発明は斯かる事情に鑑みてなされたものであり、複数チャネルの音信号に対して異なる周波数帯域の成分を通過させ、また夫々異なる周波数帯域の成分の複数の音信号を加算した参照音信号及び入力された音を変換した観測音信号に基づいて、観測音信号に含まれるエコーを抑圧することにより、一つの抑圧機構で複数チャネルに対応するエコーの抑圧を行うことができるので、コストの増加及び装置の大型化を抑制することが可能であり、また観測音信号の任意の周波数には、一のチャネルの音信号に基づく一の経路のエコーのみが含まれているため、エコーの推定精度を上げることができ、残留エコーを抑制することが可能なエコー抑圧システム、該エコー抑圧システムに適用されるエコー抑圧方法、前記エコー抑圧システムにて用いられるエコー抑圧装置を実現するためのエコー抑圧プログラム、前記エコー抑圧システムにて用いられるエコー抑圧装置及び音出力装置の提供を目的とする。 The present invention has been made in view of such circumstances, and a reference sound signal in which components of different frequency bands are passed through a sound signal of a plurality of channels and a plurality of sound signals of components of different frequency bands are added. In addition, by suppressing the echo included in the observation sound signal based on the observation sound signal obtained by converting the input sound, it is possible to suppress the echo corresponding to a plurality of channels with a single suppression mechanism. It is possible to suppress an increase in the size and size of the apparatus, and since an arbitrary frequency of the observation sound signal includes only one path echo based on the sound signal of one channel, it is possible to estimate the echo. Echo suppression system capable of increasing accuracy and suppressing residual echo, echo suppression method applied to the echo suppression system, used in the echo suppression system Echo suppressing program for realizing the echo suppressor which is directed to the provision of the echo suppressor and a sound output equipment used in the echo suppressing system.

第１のエコー抑圧システムは、音信号に基づいて音を出力する音出力装置と、入力された音を音信号に変換する音入力装置と、該音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置とを備えるエコー抑圧システムにおいて、前記音出力装置は、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過部と、該通過部を通過した複数の音信号に基づく音を夫々出力する複数の音出力部とを有し、前記通過部を通過した複数の音信号を加算して参照音信号を生成する加算部を有し、前記エコー抑圧装置は、前記音入力装置から音信号を観測音信号として入力される入力部と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正部とを有することを要件とする。 A first echo suppression system includes: a sound output device that outputs sound based on a sound signal; a sound input device that converts an input sound into a sound signal; and a sound input from the sound input device, In an echo suppression system comprising an echo suppression device that suppresses echo based on sound output from an output device, the sound output device includes a passing unit that allows a plurality of sound signals to pass through components in different frequency bands, and A plurality of sound output units that output sounds based on the plurality of sound signals that have passed through the passage unit, respectively, and an addition unit that generates a reference sound signal by adding the plurality of sound signals that have passed through the passage unit. The echo suppression device suppresses an echo included in the observation sound signal based on the input unit that receives the sound signal from the sound input device as an observation sound signal, and the observation sound signal and the reference sound signal. The above view It is a requirement to have a correcting unit for correcting a sound signal.

第２のエコー抑圧システムは、第１のエコー抑圧システムにおいて、前記通過部は、複数の音信号を夫々周波数軸上の成分に変換する第１変換部と、周波数軸上の成分に変換した複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる帯域通過フィルタ部と、夫々の周波数帯域の成分を通過させた周波数軸上の成分に変換されている複数の音信号を、夫々時間軸上の音信号に変換する第２変換部とを有することを要件とする。 The second echo suppression system is the first echo suppression system, wherein the passing unit includes a first conversion unit that converts a plurality of sound signals into components on the frequency axis, and a plurality of components that are converted into components on the frequency axis. A plurality of sound signals converted to components on the frequency axis through which the components of the different frequency bands are passed and the components on the frequency axis through which the components of the respective frequency bands are passed. It is a requirement to have a second conversion unit for converting into the above sound signal.

第３のエコー抑圧システムは、第２のエコー抑圧システムにおいて、前記通過部が夫々異なる周波数帯域の成分を通過させる複数の音信号は、第１音信号、第２音信号、第１音信号及び／又は第２音信号を所定の加工方法で加工した加工音信号であり、前記帯域通過フィルタ部は、音信号毎にフィルタ係数が設定されてあり、第１音信号に対して第１フィルタ係数、第２音信号に対して第２フィルタ係数、並びに加工音信号に対して加工方法に応じた第１フィルタ係数及び／又は第２フィルタ係数に基づく加工フィルタ係数が夫々設定されていることを要件とする。 The third echo suppression system is the second echo suppression system, wherein the plurality of sound signals through which the passing section passes components of different frequency bands are a first sound signal, a second sound signal, a first sound signal, and And / or a processed sound signal obtained by processing the second sound signal by a predetermined processing method, wherein the band-pass filter unit has a filter coefficient set for each sound signal, and the first filter coefficient for the first sound signal The second filter coefficient is set for the second sound signal, and the first filter coefficient and / or the processing filter coefficient based on the second filter coefficient corresponding to the processing method is set for the processed sound signal. And

第４のエコー抑圧システムは、第３のエコー抑圧システムにおいて、前記加工音信号は、前記第１音信号及び前記第２音信号の和である和音信号、前記第１音信号を所定時間遅延させた遅延第１音信号、並びに前記第２音信号を所定時間遅延させた遅延第２音信号であり、前記加工フィルタ係数は、和音信号に対して第１音信号及び第２音信号の和に基づくフィルタ係数、遅延第１音信号に対して第１フィルタ係数に基づくフィルタ係数、並びに遅延第２音信号に対して第２フィルタ係数に基づくフィルタ係数が夫々設定されていることを要件とする。 The fourth echo suppression system is the third echo suppression system, wherein the processed sound signal delays the chord signal, which is the sum of the first sound signal and the second sound signal, and the first sound signal by a predetermined time. The delayed first sound signal and the delayed second sound signal obtained by delaying the second sound signal for a predetermined time, and the processing filter coefficient is the sum of the first sound signal and the second sound signal with respect to the chord signal. It is a requirement that a filter coefficient based on the first filter coefficient is set for the delayed first sound signal, and a filter coefficient based on the second filter coefficient is set for the delayed second sound signal.

第５のエコー抑圧システムは、第１乃至第４のいずれかのエコー抑圧システムにおいて、前記通過部は、等間隔に分割した周波数帯域の成分を通過させる様にしてあることを要件とする。 According to a fifth echo suppression system, in any one of the first to fourth echo suppression systems, the passage section is configured to pass components in a frequency band divided at equal intervals.

第６のエコー抑圧システムは、第１乃至第４のいずれかのエコー抑圧システムにおいて、前記通過部は、対数値が等間隔となるように分割した周波数帯域の成分を通過させる様にしてあることを要件とする。 In a sixth echo suppression system, in any one of the first to fourth echo suppression systems, the passing unit allows a frequency band component divided so that logarithmic values are equally spaced to pass. Is a requirement.

第７のエコー抑圧システムは、第１乃至第６のいずれかのエコー抑圧システムにおいて、前記音出力装置は、予め設定されている除去帯域に該当する周波数帯域の成分を、何れの音信号に対しても通過させない様にしてあり、前記補正部は、前記参照音信号を、周波数毎に設定されたフィルタ係数にてフィルタリングすることにより、前記観測音信号の補正に要する補正量を導出する補正用フィルタ部と、前記補正後の観測音信号に基づいて、前記補正用フィルタのフィルタ係数の算出及び更新を行う係数更新部と、前記観測音信号の除去帯域の成分に基づいて、前記係数更新部による更新の可否を判定する更新可否判定部とを有することを要件とする。 According to a seventh echo suppression system, in any one of the first to sixth echo suppression systems, the sound output device applies a frequency band component corresponding to a preset removal band to any sound signal. The correction unit performs correction for deriving a correction amount required for correcting the observed sound signal by filtering the reference sound signal with a filter coefficient set for each frequency. A filter unit; a coefficient update unit that calculates and updates a filter coefficient of the correction filter based on the corrected observation sound signal; and the coefficient update unit based on a component of the removal band of the observation sound signal It is a requirement to have an update availability determination unit that determines whether update is possible.

第８のエコー抑圧システムは、第１乃至第７のいずれかのエコー抑圧システムにおいて、前記音出力装置は、更に、周波数帯域毎に、複数の音信号の振幅の大きさを所定の方法で比較し、振幅の大きさが最大である一の音信号の振幅と、他の全ての音信号の振幅との比較結果を示す値が所定値以上か否かを判定する比較部を更に備え、前記通過部は、前記比較部が所定値以上と判定した周波数帯域に対しては、前記一の音信号のみを通過させる様にしてあることを要件とする。 An eighth echo suppression system according to any one of the first to seventh echo suppression systems, wherein the sound output device further compares amplitudes of a plurality of sound signals for each frequency band by a predetermined method. A comparator that determines whether or not a value indicating a comparison result between the amplitude of one sound signal having the maximum amplitude and the amplitudes of all other sound signals is equal to or greater than a predetermined value; The passage section is required to pass only the one sound signal for the frequency band determined by the comparison section to be equal to or greater than a predetermined value.

第９のエコー抑圧システムは、第１乃至第７のいずれかのエコー抑圧システムにおいて、前記音出力装置は、更に、周波数帯域毎に、複数の音信号の振幅の大きさを所定の方法で比較し、振幅の大きさが最大である一の音信号の振幅と、他の全ての音信号の振幅との比較結果を示す値が所定値以上か否かを判定する比較部を更に備え、前記通過部は、前記比較部が所定値以上と判定した周波数帯域に対しては、全ての音信号を通過させる様にしてあることを要件とする。 A ninth echo suppression system according to any one of the first to seventh echo suppression systems, wherein the sound output device further compares amplitudes of a plurality of sound signals for each frequency band by a predetermined method. A comparator that determines whether or not a value indicating a comparison result between the amplitude of one sound signal having the maximum amplitude and the amplitudes of all other sound signals is equal to or greater than a predetermined value; The passage section is required to pass all sound signals for the frequency band determined by the comparison section to be equal to or greater than a predetermined value.

第１０のエコー抑圧システムは、第１乃至第９のいずれかのエコー抑圧システムにおいて、前記通過部は、通過させる音信号毎の周波数帯域の成分を、経時的に変更する様にしてあることを要件とする。 According to a tenth echo suppression system, in any one of the first to ninth echo suppression systems, the passing section changes a frequency band component for each sound signal to be passed with time. As a requirement.

第１１のエコー抑圧方法は、音信号に基づいて音を出力する音出力装置と、音を入力される音入力装置と、該音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置とを用いたエコー抑圧方法において、前記音出力装置により、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過手順と、夫々異なる周波数帯域の成分が通過した複数の音信号に基づく音を夫々出力する音出力手順と、前記音入力装置により、入力された音を音信号に変換する音入力手順と、前記エコー抑圧装置により、夫々異なる周波数帯域の成分が通過した複数の音信号を加算して参照音信号を生成する加算手順と、前記音入力装置から音信号を観測音信号として入力される入力手順と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正手順とを含むことを要件とする。 An eleventh echo suppression method includes: a sound output device that outputs sound based on a sound signal; a sound input device that receives sound; and a sound input device that outputs the sound from the sound output device. In an echo suppression method using an echo suppressor that suppresses echo based on sound, the sound output device causes a plurality of sound signals to pass through components in different frequency bands, and in each of the different frequency bands. The sound output procedure for outputting sounds based on a plurality of sound signals through which the component has passed, the sound input procedure for converting the input sound into a sound signal by the sound input device, and the frequency different by the echo suppression device, respectively. An addition procedure for generating a reference sound signal by adding a plurality of sound signals through which band components have passed, an input procedure for inputting a sound signal from the sound input device as an observation sound signal, and the observation sound signal Based on the fine the reference sound signal, it is required for the containing and correction procedure for correcting the observation sound signal so as to suppress the echo contained in the observed sound signal.

第１２のエコー抑圧プログラムは、音信号に基づいて音を出力する音出力装置、及び入力された音に基づいて音信号を生成する音入力装置と連携し、前記音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧させる手順をエコー抑圧装置に実行させるエコー抑圧プログラムにおいて、前記エコー抑圧装置に、前記音出力装置から出力された夫々異なる周波数帯域の成分を通過させた複数の音信号を加算して生成した参照音信号と、前記音入力装置にて生成された音信号である観測音信号とに基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正手順を実行させることを要件とする。 The twelfth echo suppression program cooperates with a sound output device that outputs a sound based on a sound signal and a sound input device that generates a sound signal based on an input sound, and the sound input by the sound input device In an echo suppression program for causing an echo suppression apparatus to execute a procedure for suppressing an echo based on the sound output from the sound output apparatus, the echo suppression apparatus includes components of different frequency bands output from the sound output apparatus. Based on a reference sound signal generated by adding a plurality of sound signals passed through and an observation sound signal which is a sound signal generated by the sound input device, echoes contained in the observation sound signal are suppressed. Therefore, it is necessary to execute a correction procedure for correcting the observation sound signal.

第１３のエコー抑圧装置は、音信号に基づいて音を出力する音出力装置、及び入力された音に基づいて音信号を生成する音入力装置と連携し、前記音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置において、前記音出力装置から出力された夫々異なる周波数帯域の成分を通過させた複数の音信号を加算して参照音信号を生成する加算部と、前記音入力装置から音信号を観測音信号として入力される入力部と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正部とを有することを要件とする。 The thirteenth echo suppressor cooperates with a sound output device that outputs sound based on the sound signal and a sound input device that generates sound signal based on the input sound, and the sound input by the sound input device In the echo suppressor for suppressing echo based on the sound output from the sound output device, a reference sound signal is obtained by adding a plurality of sound signals that have passed through components of different frequency bands output from the sound output device. Based on the observation sound signal and the reference sound signal, and an echo contained in the observation sound signal is suppressed, based on the addition part that generates the sound signal from the sound input device as an observation sound signal Therefore, it is necessary to have a correction unit that corrects the observation sound signal.

第１４の音出力装置は、複数の音信号に基づく音を出力する複数の音出力部を備えており、入力された音に基づいて音信号を生成する音入力装置及び該音入力装置が入力された音から前記音出力部から出力した音に基づくエコーを抑圧するエコー抑圧装置と連携する音出力装置において、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過部を有し、前記複数の音出力部は、前記通過部を通過した複数の音信号に基づく音を夫々出力する様に構成してあることを要件とする。 The fourteenth sound output device includes a plurality of sound output units that output sound based on a plurality of sound signals, and the sound input device that generates a sound signal based on the input sound and the sound input device input A sound output device that cooperates with an echo suppression device that suppresses echo based on the sound output from the sound output portion from the generated sound, and has a passing portion that allows components of different frequency bands to pass through for each of the plurality of sound signals It is a requirement that the plurality of sound output units are configured to output sounds based on the plurality of sound signals that have passed through the passage unit.

第１５のオーディオシステムは、音信号に基づいて音を出力する音出力装置と、入力された音を音信号に変換する音入力装置と、該音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置と、前記音出力装置、音入力装置及びエコー抑圧装置の少なくとも一を制御する制御部を有する制御装置とを備えたオーディオシステムにおいて、前記音出力装置は、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過部と、該通過部を通過した複数の音信号に基づく音を夫々出力する複数の音出力部とを有し、前記通過部を通過した複数の音信号を加算して参照音信号を生成する加算部を有し、前記エコー抑圧装置は、前記音入力装置から音信号を観測音信号として入力される入力部と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正部と、該補正部が補正した観測音信号を前記制御装置へ出力する出力部とを有し、前記制御装置は、前記出力された補正後の観測音信号を入力される入力部と、該入力部から入力された補正後の観測音信号に基づいて音声認識処理を行う認識部とを有し、前記制御部は、認識部による認識結果に基づき制御を行うことを要件とする。 A fifteenth audio system includes: a sound output device that outputs sound based on a sound signal; a sound input device that converts an input sound into a sound signal; and the sound output from the sound input to the sound input device An audio system comprising: an echo suppression device that suppresses echo based on sound output from a device; and a control device that has a control unit that controls at least one of the sound output device, the sound input device, and the echo suppression device. The output device has a passage unit that allows components of different frequency bands to pass through a plurality of sound signals, and a plurality of sound output units that respectively output sounds based on the plurality of sound signals that have passed through the passage unit. An adder that generates a reference sound signal by adding a plurality of sound signals that have passed through the passage, and the echo suppression device is an input unit that receives the sound signal from the sound input device as an observation sound signal A correction unit that corrects the observation sound signal to suppress an echo included in the observation sound signal based on the observation sound signal and the reference sound signal, and the control unit that corrects the observation sound signal corrected by the correction unit. And an output unit that outputs the corrected observation sound signal that has been output to the input unit, and an audio signal based on the corrected observation sound signal that has been input from the input unit. A recognition unit that performs a recognition process, and the control unit performs control based on a recognition result of the recognition unit.

第１６のナビゲーションシステムは、音信号に基づいて音を出力する音出力装置と、入力された音を音信号に変換する音入力装置と、該音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置と、ナビゲーション処理を行う制御部を有するナビゲーション装置とを備えるナビゲーションシステムにおいて、前記音出力装置は、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過部と、該通過部を通過した複数の音信号に基づく音を夫々出力する複数の音出力部とを有し、前記通過部を通過した複数の音信号を加算して参照音信号を生成する加算部を有し、前記エコー抑圧装置は、前記音入力装置から音信号を観測音信号として入力される入力部と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正部と、該補正した観測音信号を前記ナビゲーション装置へ出力する出力部とを有し、前記ナビゲーション装置は、前記出力された補正後の観測音信号を入力される入力部と、該入力部から入力された補正後の観測音信号に基づいて音声認識処理を行う認識部とを有し、前記制御部は、前記認識部による認識結果に基づいてナビゲーション処理を行うことを要件とする。 A sixteenth navigation system includes a sound output device that outputs a sound based on a sound signal, a sound input device that converts an input sound into a sound signal, and the sound output from the sound input to the sound input device. In a navigation system including an echo suppression device that suppresses echo based on sound output from a device and a navigation device having a control unit that performs navigation processing, the sound output device has different frequency bands for a plurality of sound signals, respectively. And a plurality of sound output units for outputting sounds based on a plurality of sound signals that have passed through the passage unit, and adding a plurality of sound signals that have passed through the passage unit. An adder for generating a reference sound signal, wherein the echo suppressor includes an input unit that receives a sound signal from the sound input device as an observation sound signal, the observation sound signal, and the observation sound signal. A correction unit that corrects the observation sound signal to suppress an echo included in the observation sound signal based on the illuminating signal, and an output unit that outputs the corrected observation sound signal to the navigation device; The navigation device includes an input unit that receives the output of the corrected observation sound signal that has been output, and a recognition unit that performs speech recognition processing based on the corrected observation sound signal that has been input from the input unit. The control unit is required to perform navigation processing based on the recognition result by the recognition unit.

第１７の移動体は、音信号に基づいて音を出力する音出力装置と、入力された音を音信号に変換する音入力装置と、該音入力装置が入力された音から、前記音出力装置から出力した音に基づくエコーを抑圧するエコー抑圧装置と、ナビゲーション処理を行う制御部を有するナビゲーション装置とを備える移動体において、前記音出力装置は、複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる通過部と、該通過部を通過した複数の音信号に基づく音を夫々出力する複数の音出力部とを有し、前記通過部を通過した複数の音信号を加算して参照音信号を生成する加算部を有し、前記エコー抑圧装置は、前記音入力装置から音信号を観測音信号として入力される入力部と、前記観測音信号及び前記参照音信号に基づいて、前記観測音信号に含まれるエコーを抑圧すべく前記観測音信号を補正する補正部と、該補正した観測音信号を前記ナビゲーション装置へ出力する出力部とを有し、前記ナビゲーション装置は、前記出力された補正後の観測音信号を入力される入力部と、該入力部から入力された補正後の観測音信号に基づいて音声認識処理を行う認識部とを有し、前記制御部は、前記認識部による認識結果に基づいてナビゲーション処理を行うことを要件とする。 The seventeenth moving body includes a sound output device that outputs a sound based on a sound signal, a sound input device that converts an input sound into a sound signal, and the sound output from the sound input to the sound input device. In a moving body including an echo suppression device that suppresses echo based on sound output from a device and a navigation device having a control unit that performs navigation processing, the sound output device has different frequency bands for a plurality of sound signals, respectively. And a plurality of sound output units for outputting sounds based on a plurality of sound signals that have passed through the passage unit, and adding a plurality of sound signals that have passed through the passage unit. The echo suppression device includes an adder that generates a reference sound signal, and the echo suppression device is based on an input unit that receives a sound signal from the sound input device as an observation sound signal, and the observation sound signal and the reference sound signal. The observed sound A correction unit that corrects the observation sound signal so as to suppress the echo included in the signal, and an output unit that outputs the corrected observation sound signal to the navigation device, and the navigation device outputs the correction An input unit to which a later observation sound signal is input; and a recognition unit that performs speech recognition processing based on the corrected observation sound signal input from the input unit; and the control unit is configured by the recognition unit It is a requirement to perform navigation processing based on the recognition result.

第１、第２、第５及び第６のエコー抑圧システム、第１１のエコー抑圧方法、第１２のエコー抑圧プログラム、第１３のエコー抑圧装置、第１４の音出力装置、第１５のオーディオシステム、第１６のナビゲーションシステム並びに第１７の移動体では、周波数毎に音信号が異なることから、観測音信号の任意の周波数に対し、一の音信号に基づく一の経路のエコーのみが含まれているため、エコーの推定精度を上げることができるので、残留エコーを抑制することが可能である。しかも複数の音信号の夫々に対してエコーを抑圧するための補正部を設ける必要がなく、一の補正部にて複数の音信号によるエコーを抑圧することができるので、補正部の増設に伴うコストの増加及び装置の大型化を防止することが可能である。 First, second, fifth and sixth echo suppression systems, eleventh echo suppression method, twelfth echo suppression program, thirteenth echo suppression device, fourteenth sound output device, fifteenth audio system, In the sixteenth navigation system and the seventeenth moving body, since the sound signal is different for each frequency, only one path echo based on one sound signal is included for an arbitrary frequency of the observed sound signal. Therefore, since the estimation accuracy of echo can be increased, residual echo can be suppressed. In addition, it is not necessary to provide a correction unit for suppressing echoes for each of a plurality of sound signals, and one correction unit can suppress echoes caused by a plurality of sound signals. It is possible to prevent an increase in cost and an increase in size of the apparatus.

第３及び第４のエコー抑圧システムでは、例えば第１音信号及び第２音信号の和に基づく和音信号、第１音信号を所定時間遅延させた遅延第１音信号、第２音信号を遅延させた遅延第２音信号等の加工音信号を用いて擬似５チャネルステレオを実現する場合に、加工音信号に対しては、第１音信号に用いる第１フィルタ係数及び第２音信号に用いる第２フィルタ係数を転用して周波数帯域の割り当てを行うことにより、二種類のフィルタで三種類以上の音信号に対する処理を行うことができるので、音出力装置のリソースを効率的に利用することが可能となる。 In the third and fourth echo suppression systems, for example, a chord signal based on the sum of the first sound signal and the second sound signal, a delayed first sound signal obtained by delaying the first sound signal for a predetermined time, and a second sound signal are delayed. When a pseudo 5-channel stereo is realized using the processed sound signal such as the delayed second sound signal, the processed sound signal is used for the first filter coefficient and the second sound signal used for the first sound signal. By diverting the second filter coefficient and allocating the frequency band, it is possible to process three or more types of sound signals with two types of filters, so that the resources of the sound output device can be used efficiently. It becomes possible.

第７のエコー抑圧システムでは、例えば人の声に対応する周波数帯を除去帯域として設定し、観測音信号の除去帯域の成分に基づいて、話者が発声しているダブルトークの状態であるか発声していないシングルトークの状態であるかを検出し、例えばダブルトークの状態であると判定した場合に、フィルタ係数の更新を停止させることにより、音声が含まれている区間の認識精度を向上させ、音声認識処理の精度を向上させることが可能である。 In the seventh echo suppression system, for example, a frequency band corresponding to a human voice is set as a removal band, and based on a component of the removal band of the observation sound signal, whether the speaker is in a double talk state. Detecting whether it is in a single talk state that is not speaking, for example, if it is determined that it is in a double talk state, the update of the filter coefficient is stopped, thereby improving the recognition accuracy of the section containing speech Thus, it is possible to improve the accuracy of the voice recognition process.

第８及び第９のエコー抑圧システムでは、一の音信号の振幅の大きさが他の全ての音信号の振幅の大きさに対して、所定の条件以上大きい場合に、一の音信号が通過部で遮断されることを防止するので、複数チャネルで違和感のない音を出力させることが可能である。 In the eighth and ninth echo suppression systems, one sound signal passes when the amplitude of one sound signal is greater than the amplitude of all other sound signals by a predetermined condition or more. Therefore, it is possible to output a sound with no sense of incongruity in a plurality of channels.

第１０のエコー抑圧システムでは、夫々の音信号で出力に偏りがある場合でも、特定の周波数帯域において、特定の音信号が遮断され続けることがないので、複数チャネルで違和感のない音を出力させることが可能である。 In the tenth echo suppression system, even when there is a bias in the output of each sound signal, the specific sound signal does not continue to be blocked in a specific frequency band, so that a sound with no sense of incongruity is output in a plurality of channels. It is possible.

本願は、マルチチャネルオーディオ等の複数の音信号に対し、夫々異なる周波数帯域の成分を通過させ、夫々異なる周波数帯域の成分が通過した複数の音信号に基づく音をスピーカ等の複数の音出力部から夫々出力する。マイク等の音入力部を有する音入力装置は、入力された音を音信号に変換する。そしてエコー抑圧装置は、夫々異なる周波数帯域の成分を有する複数の音信号を加算した参照音信号と、音入力装置から入力された音信号である観測音信号とに基づいて、観測音信号に含まれるエコーを抑圧すべく観測音信号を補正するという技術を開示する。 The present application relates to a plurality of sound output units such as a speaker and the like based on a plurality of sound signals through which components of different frequency bands pass through a plurality of sound signals such as multi-channel audio. Respectively. A sound input device having a sound input unit such as a microphone converts an input sound into a sound signal. The echo suppressor is included in the observed sound signal based on a reference sound signal obtained by adding a plurality of sound signals having components of different frequency bands, and an observed sound signal that is a sound signal input from the sound input device. Disclosed is a technique for correcting an observed sound signal so as to suppress echoes generated.

この構成により、本願では、周波数毎に割り当てられている音信号が異なることから、観測音信号の任意の周波数に対し、一の音信号に基づく一の経路のエコーのみが含まれているため、出力した音の任意の周波数による影響は、入力した音の同じ周波数に出現するという線形性を利用して、エコーの推定精度を上げることができるので、残留エコーを抑制することが可能である等、優れた効果を奏する。しかも本発明では、複数の音信号の夫々に対してエコーを抑圧するための補正部を設ける必要がなく、一の補正部にて複数の音信号によるエコーを抑圧することができるので、補正部の増設に伴うコストの増加及び装置の大型化を防止することが可能である等、優れた効果を奏する。 In this application, since the sound signal assigned for each frequency is different in this application, for any frequency of the observed sound signal, only the echo of one path based on one sound signal is included. Since the influence of an arbitrary frequency of the output sound appears at the same frequency of the input sound, it is possible to improve the estimation accuracy of the echo, so that the residual echo can be suppressed, etc. Has an excellent effect. In addition, in the present invention, it is not necessary to provide a correction unit for suppressing the echo for each of the plurality of sound signals, and the correction unit can suppress the echo due to the plurality of sound signals with one correction unit. As a result, it is possible to prevent an increase in cost and an increase in the size of the apparatus due to the increase in the number of devices.

また本願は、複数の音信号として、第１音信号、第２音信号、第１音信号及び／又は第２音信号を所定の加工方法で加工した加工音信号を用いる場合、例えば第１音信号及び第２音信号が、２チャネルのステレオ信号である場合で、第１音信号及び第２音信号の和に基づく和音信号、第１音信号を所定時間遅延させた遅延第１音信号、第２音信号を遅延させた遅延第２音信号等の加工音信号を用いて擬似５チャネルステレオを実現するときに、加工音信号に対しては、第１音信号に用いる第１フィルタ係数及び第２音信号に用いる第２フィルタ係数を転用して周波数帯域の割り当てを行う技術を開示する。 In the present application, when a processed sound signal obtained by processing the first sound signal, the second sound signal, the first sound signal and / or the second sound signal by a predetermined processing method is used as the plurality of sound signals, for example, the first sound signal When the signal and the second sound signal are two-channel stereo signals, a chord signal based on the sum of the first sound signal and the second sound signal, a delayed first sound signal obtained by delaying the first sound signal for a predetermined time, When the pseudo 5-channel stereo is realized using the processed sound signal such as the delayed second sound signal obtained by delaying the second sound signal, the first filter coefficient used for the first sound signal and the processed sound signal A technique for allocating a frequency band by diverting a second filter coefficient used for a second sound signal is disclosed.

この構成により、本願では、二種類のフィルタで三種類以上の音信号に対する処理を行うことができるので、音出力装置のリソースを効率的に利用することが可能となる等、優れた効果を奏する。 With this configuration, in the present application, since it is possible to perform processing on three or more types of sound signals with two types of filters, it is possible to effectively use the resources of the sound output device, and thus have excellent effects. .

さらに本願は、音出力装置において、例えば人の声に対応する周波数帯を除去帯域として予め設定しておき、除去帯域に該当する周波数帯域の成分を、何れの音信号に対しても通過させないようにする。そしてエコー抑圧装置では、観測音信号の除去帯域の成分に基づいて、話者が発声しているダブルトークの状態であるか発声していないシングルトークの状態であるかを検出し、シングルトークの状態であると判定した場合、周波数毎に設定されたフィルタ係数にてフィルタリングすることにより、観測音信号の補正に要する補正量を導出する補正用フィルタ部のフィルタ係数の算出及び更新を行い、ダブルトークの状態であると判定した場合、フィルタ係数の更新を停止させる技術を開示する。 Further, in the present invention, in the sound output device, for example, a frequency band corresponding to a human voice is set as a removal band in advance, and a component of the frequency band corresponding to the removal band is not allowed to pass through any sound signal. To. Then, the echo suppressor detects whether the speaker is uttering a double talk state or not singing a single talk state based on the component of the observed sound signal removal band, When it is determined that the state is in a state, the filter coefficient of the correction filter unit for deriving the correction amount necessary for correcting the observation sound signal is calculated and updated by filtering with the filter coefficient set for each frequency, and doubled. Disclosed is a technique for stopping the update of the filter coefficient when it is determined that the state is a talk state.

この構成により、本願では、音声が含まれている区間の認識精度を向上させ、音声認識精度を向上させることが可能である等、優れた効果を奏する。 With this configuration, in the present application, there are excellent effects such as improving the recognition accuracy of the section including the speech and improving the speech recognition accuracy.

さらに本願は、複数の音信号の振幅の大きさを所定の方法で比較して、振幅の大きさが最大である一の音信号の振幅と、他の全ての音信号の振幅との比較結果を示す値が所定値以上であると判定した周波数帯域に対しては、全ての音信号又は大きさが最大である一の音信号のみを通過させる技術を開示する。 Further, the present application compares the amplitudes of a plurality of sound signals by a predetermined method, and compares the amplitude of one sound signal having the maximum amplitude with the amplitudes of all other sound signals. For a frequency band in which the value indicating the value is determined to be greater than or equal to a predetermined value, a technique is disclosed in which all sound signals or only one sound signal having the maximum magnitude is passed.

この構成により、本願では、例えばオーケストラにおけるソロ演奏の様に、一の音信号の振幅が他の音信号に対して、所定値以上である場合に、一の音信号が通過部で遮断されることを防止するので、複数チャネルで違和感のない音を出力させることが可能である等、優れた効果を奏する。 With this configuration, in the present application, for example, in the case of solo performance in an orchestra, when the amplitude of one sound signal is greater than or equal to a predetermined value relative to other sound signals, the one sound signal is blocked at the passage portion. As a result, it is possible to output an uncomfortable sound on a plurality of channels, and the effects are excellent.

さらに本願は、通過させる音信号毎の周波数帯域の成分を、経時的に変更する技術を開示する。 Furthermore, the present application discloses a technique for changing a frequency band component for each sound signal to be passed with time.

この構成により、本願では、夫々の音信号で出力に偏りがある場合でも、特定の周波数帯域において、特定の音信号が遮断され続けることがないので、複数チャネルで違和感のない音を出力させることが可能である等、優れた効果を奏する。 With this configuration, in the present application, even when there is a bias in the output of each sound signal, since the specific sound signal is not continuously blocked in a specific frequency band, it is possible to output a sound that does not feel uncomfortable in multiple channels. It is possible to achieve an excellent effect.

従来の第１のエコー抑圧装置の構成を示す模式図である。It is a schematic diagram which shows the structure of the conventional 1st echo suppression apparatus. 従来の第２のエコー抑圧装置の構成を示す模式図である。It is a schematic diagram which shows the structure of the conventional 2nd echo suppression apparatus. 従来のエコー抑圧装置が備える抑圧機構の機能構成を示す機能ブロック図である。It is a functional block diagram which shows the function structure of the suppression mechanism with which the conventional echo suppression apparatus is provided. 本発明の実施の形態１に係るエコー抑圧システムの構成例を模式的に示すブロック図である。It is a block diagram which shows typically the structural example of the echo suppression system which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音出力装置が備える通過機構の通過フィルタ部のフィルタ係数を示すグラフである。It is a graph which shows the filter coefficient of the passage filter part of the passage mechanism with which the sound output device concerning Embodiment 1 of the present invention is provided. 本発明の実施の形態１に係るエコー抑圧装置が備える抑圧機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the suppression mechanism with which the echo suppression apparatus which concerns on Embodiment 1 of this invention is provided. 本発明の実施の形態１に係る音処理装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the sound processing apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音出力装置の音出力処理の一例を示すフローチャートである。It is a flowchart which shows an example of the sound output process of the sound output device which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る音入力装置の音入力処理の一例を示すフローチャートである。It is a flowchart which shows an example of the sound input process of the sound input device which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係るエコー抑圧装置のエコー抑圧処理の一例を示すフローチャートである。It is a flowchart which shows an example of the echo suppression process of the echo suppression apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係るエコー抑圧装置のフィルタ係数更新処理の一例を示すフローチャートである。It is a flowchart which shows an example of the filter coefficient update process of the echo suppression apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態２に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る音出力装置が備える通過機構の通過フィルタのフィルタ係数を示すグラフである。It is a graph which shows the filter coefficient of the passage filter of the passage mechanism with which the sound output device concerning Embodiment 2 of the present invention is provided. 本発明の実施の形態３に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る音出力装置が備える通過機構の除去部のフィルタ係数を示すグラフである。It is a graph which shows the filter coefficient of the removal part of the passage mechanism with which the sound output device which concerns on Embodiment 3 of this invention is provided. 本発明の実施の形態３に係るエコー抑圧装置が備える抑圧機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the suppression mechanism with which the echo suppression apparatus which concerns on Embodiment 3 of this invention is provided. 本発明の実施の形態３に係るエコー抑圧装置が備える抑圧機構の除去帯域通過フィルタ部のフィルタ係数を示すグラフである。It is a graph which shows the filter coefficient of the removal band pass filter part of the suppression mechanism with which the echo suppression apparatus which concerns on Embodiment 3 of this invention is provided. 本発明の実施の形態３に係る音出力装置の音出力処理の一例を示すフローチャートである。It is a flowchart which shows an example of the sound output process of the sound output device which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係るエコー抑圧装置のフィルタ係数更新処理の一例を示すフローチャートである。It is a flowchart which shows an example of the filter coefficient update process of the echo suppression apparatus which concerns on Embodiment 3 of this invention. 本発明の実施の形態４に係るエコー抑圧システムの構成例を模式的に示すブロック図である。It is a block diagram which shows typically the structural example of the echo suppression system which concerns on Embodiment 4 of this invention. 本発明の実施の形態５に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 5 of this invention. 本発明の実施の形態５に係る音出力装置の音出力処理の一例を示すフローチャートである。It is a flowchart which shows an example of the sound output process of the sound output device which concerns on Embodiment 5 of this invention. 本発明の実施の形態５に係る音出力装置が備える通過機構の処理の例を概念的に示す説明図である。It is explanatory drawing which shows notionally the example of the process of the passage mechanism with which the sound output device which concerns on Embodiment 5 of this invention is provided. 本発明の実施の形態６に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 6 of this invention. 本発明の実施の形態６に係る音出力装置の音出力処理の一例を示すフローチャートである。It is a flowchart which shows an example of the sound output process of the sound output device which concerns on Embodiment 6 of this invention. 本発明の実施の形態７に係る音出力装置の通過機構の構成例を示す機能ブロック図である。It is a functional block diagram which shows the structural example of the passage mechanism of the sound output device which concerns on Embodiment 7 of this invention. 本発明の実施の形態７に係る音出力装置の係数変更処理の一例を示すフローチャートである。It is a flowchart which shows an example of the coefficient change process of the sound output device which concerns on Embodiment 7 of this invention. 本発明の実施の形態７に係る音出力装置が備える通過機構の各帯域通過フィルタ部のフィルタ係数の経時変化の例を示す説明図である。It is explanatory drawing which shows the example of a time-dependent change of the filter coefficient of each band pass filter part of the pass mechanism with which the sound output device which concerns on Embodiment 7 of this invention is provided.

Explanation of symbols

１エコー抑圧装置
１０第１入力機構
１１第２入力機構
１２抑圧機構
１２０加算部
１２１補正部
１２１０線形ＦＩＲフィルタ部
１２１１減算部
１２１２フィルタ係数更新部
１２１３検出部
１２１４除去帯域通過フィルタ部
１３出力機構
２音出力装置
２０音出力機構
２１音信号生成機構
２２通過機構
２２０ＦＦＴ変換部
２２１帯域通過フィルタ部
２２２ＩＦＦＴ変換部
２２３除去部
２２４比較部
２２５係数部
２３操作機構
２４切替機構
３音入力装置
３０音入力機構
４音処理装置
４０入力機構
４１認識機構
４２制御機構
５移動体
１００エコー抑圧プログラムDESCRIPTION OF SYMBOLS 1 Echo suppression apparatus 10 1st input mechanism 11 2nd input mechanism 12 Suppression mechanism 120 Adder 121 Correction | amendment part 1210 Linear FIR filter part 1211 Subtraction part 1212 Filter coefficient update part 1213 Detection part 1214 Removal band pass filter part 13 Output mechanism 2 Sound Output device 20 Sound output mechanism 21 Sound signal generation mechanism 22 Passing mechanism 220 FFT conversion unit 221 Band pass filter unit 222 IFFT conversion unit 223 Removal unit 224 Comparison unit 225 Coefficient unit 23 Operation mechanism 24 Switching mechanism 3 Sound input device 30 Sound input mechanism 4 sound processing device 40 input mechanism 41 recognition mechanism 42 control mechanism 5 moving body 100 echo suppression program

以下、本発明をその実施の形態を示す図面に基づいて詳述する。 Hereinafter, the present invention will be described in detail with reference to the drawings illustrating embodiments thereof.

実施の形態１．
図４は、本発明の実施の形態１に係るエコー抑圧システムの構成例を模式的に示すブロック図である。図４中１は、カーナビゲーションシステム等のシステムに適用されるエコー抑圧システムにて用いられるエコー抑圧装置であり、エコー抑圧装置１は、複数チャネルの音を出力するスピーカ等の複数の音出力機構２０，２０，…を有するマルチチャネルオーディオ等の音出力装置２、コンデンサマイク等のマイクロホンを用いた音入力機構３０を有する音入力装置３、及び音声認識システム等の音処理装置４と連携して動作する。そしてエコー抑圧装置１、音出力装置２、音入力装置３及び音処理装置４は、車両等の移動体５に搭載されている。なお本発明のエコー抑圧システムは、図４に例示したカーナビゲーションシステムに限らず、オーディオシステム、テレビ会議システム等の様々なシステムに適用することが可能である。Embodiment 1 FIG.
FIG. 4 is a block diagram schematically showing a configuration example of the echo suppression system according to Embodiment 1 of the present invention. In FIG. 4, reference numeral 1 denotes an echo suppression device used in an echo suppression system applied to a system such as a car navigation system. The echo suppression device 1 includes a plurality of sound output mechanisms such as speakers that output a plurality of channels. In cooperation with a sound output device 2 such as multi-channel audio having 20, 20,..., A sound input device 3 having a sound input mechanism 30 using a microphone such as a condenser microphone, and a sound processing device 4 such as a speech recognition system. Operate. The echo suppression device 1, the sound output device 2, the sound input device 3, and the sound processing device 4 are mounted on a moving body 5 such as a vehicle. The echo suppression system of the present invention is not limited to the car navigation system illustrated in FIG. 4 but can be applied to various systems such as an audio system and a video conference system.

音出力装置２は、例えば音データを記録する音楽ＣＤ(Compact Disc)、ＤＶＤ(Digital Versatile Disc)等の記録媒体から音データを読み取り、読み取った音データに基づいて複数チャネル分の音信号を生成し、生成した複数の音信号をマルチチャネルオーディオ信号として出力する音信号生成機構２１と、複数チャネル分の音信号に対し、夫々異なる周波数帯域の成分を通過させるＤＳＰ(Digital Signal Processor)にて構成された通過機構２２とを備え、通過機構２２を通過した複数チャネル分の音信号は、音出力機構２０，２０，…から複数チャネルの音として出力される。また音出力装置２は、通過機構２２を通過した複数チャネル分の音信号をエコー抑圧装置１へ送信する。なお音信号生成機構２１から出力される音信号は、アナログ信号であり、図示しないＡ／Ｄ(Analog to Digital )変換器にてデジタル信号に変換後、通過機構２２へ出力される。また通過機構２２からはデジタル信号である音信号を出力し、図示しないＤ／Ａ(Digital to Analog )変換器にてアナログ信号に変換後、音出力機構２０，２０，…及びエコー抑圧装置１へ出力する。音出力機構２０，２０，…は、夫々チャネルが割り当てられており、通過機構２２を通過した各音信号はチャネルが対応している音出力機構２０，２０，…へ夫々出力される。なお本発明のエコー抑圧システムは、音出力機構２０，２０，…にデジタル信号である音信号に基づいて音を出力する機能を持たせ、各機構及び装置間の入出力を全てデジタル信号で行う等、適宜設計することが可能である。 The sound output device 2 reads sound data from a recording medium such as a music CD (Compact Disc) or DVD (Digital Versatile Disc) for recording sound data, and generates sound signals for a plurality of channels based on the read sound data. The sound signal generating mechanism 21 outputs a plurality of generated sound signals as a multi-channel audio signal, and a DSP (Digital Signal Processor) that allows components of different frequency bands to pass through the sound signals for a plurality of channels. The sound signals for a plurality of channels that have passed through the passage mechanism 22 are output from the sound output mechanisms 20, 20,. The sound output device 2 transmits sound signals for a plurality of channels that have passed through the passage mechanism 22 to the echo suppression device 1. Note that the sound signal output from the sound signal generation mechanism 21 is an analog signal, and is converted to a digital signal by an A / D (Analog to Digital) converter (not shown) and then output to the passage mechanism 22. Further, a sound signal which is a digital signal is output from the passage mechanism 22, converted into an analog signal by a D / A (Digital to Analog) converter (not shown), and then to the sound output mechanism 20, 20... And the echo suppressor 1. Output. The sound output mechanisms 20, 20,... Are each assigned a channel, and each sound signal that has passed through the passage mechanism 22 is output to the sound output mechanisms 20, 20,. In the echo suppression system of the present invention, the sound output mechanisms 20, 20,... Have a function of outputting sound based on the sound signal which is a digital signal, and all inputs and outputs between the mechanisms and the apparatus are performed by digital signals. It is possible to design appropriately.

音入力装置３は、音入力機構３０により、入力された音に基づいてアナログ信号である音信号を生成し、生成した音信号をゲインアンプ等の図示しない増幅器にて増幅し、増幅した音信号を図示しないＡ／Ｄ変換器にて８０００Ｈｚ、１２０００Ｈｚ等のサンプリング周波数でサンプリングしてデジタル信号に変換し、デジタル信号に変換した音信号をエコー抑圧装置１へ出力する。 The sound input device 3 uses the sound input mechanism 30 to generate a sound signal that is an analog signal based on the input sound, amplifies the generated sound signal with an amplifier (not shown) such as a gain amplifier, and the amplified sound signal Is sampled at a sampling frequency such as 8000 Hz and 12000 Hz by an A / D converter (not shown) and converted into a digital signal, and the sound signal converted into the digital signal is output to the echo suppressor 1.

エコー抑圧装置１は、音出力装置２から出力された複数チャネル分の音信号を入力される第１入力機構１０と、音入力装置３から出力された音信号を観測音信号として入力される第２入力機構１１と、観測音信号に含まれるエコー成分を抑圧するＤＳＰ等の抑圧機構１２と、抑圧機構１２にてエコー成分を抑圧した観測音信号を音処理装置４へ出力する出力機構１３とを備えている。 The echo suppression apparatus 1 includes a first input mechanism 10 to which sound signals for a plurality of channels output from the sound output apparatus 2 are input, and a sound signal output from the sound input apparatus 3 as an observation sound signal. A two-input mechanism 11, a suppression mechanism 12 such as a DSP that suppresses an echo component included in the observation sound signal, and an output mechanism 13 that outputs the observation sound signal in which the echo component is suppressed by the suppression mechanism 12 to the sound processing device 4. It has.

抑圧機構１２には、エコー抑圧プログラム１００及びデータ等のファームウェアが組み込まれており、ファームウェアとして組み込まれた本発明のエコー抑圧プログラム１００を実行することにより、第１入力機構１０にて入力された複数チャネル分の音信号を加算して参照音信号を生成する加算部１２０、観測音信号及び参照音信号に基づいて、観測音信号に含まれるエコーを抑圧すべく観測音信号を補正する補正部１２１等の各種プログラムモジュールを生成する。そして生成した各種プログラムモジュールを実行することにより、エコー抑圧装置１が備える抑圧機構１２は、音出力機構２０，２０，…から夫々出力された音に基づくエコーを、音入力機構３０に入力された音から除去するエコーキャンセラとして機能する。 The suppression mechanism 12 incorporates an echo suppression program 100 and firmware such as data. By executing the echo suppression program 100 of the present invention that is incorporated as firmware, a plurality of inputs input by the first input mechanism 10 are performed. An adder 120 that adds the sound signals for the channels to generate a reference sound signal, and a correction unit 121 that corrects the observation sound signal to suppress echoes included in the observation sound signal based on the observation sound signal and the reference sound signal. Various program modules such as are generated. Then, by executing the generated various program modules, the suppression mechanism 12 included in the echo suppression device 1 inputs echoes based on sounds output from the sound output mechanisms 20, 20,... To the sound input mechanism 30. It functions as an echo canceller that removes sound.

また第１入力機構１０は、音出力装置２から入力されるアナログ信号である複数チャネル分の音信号をデジタル信号に変換するＡ／Ｄ変換器を備えている。但し、第１入力機構１０からデジタル信号として音信号が入力される場合、Ａ／Ｄ変換器は不要となる。また音入力装置３が備える増幅器及びＡ／Ｄ変換器の機能を第２入力機構１１に備えさせる様にしても良い。 Further, the first input mechanism 10 includes an A / D converter that converts sound signals for a plurality of channels, which are analog signals input from the sound output device 2, into digital signals. However, when a sound signal is input as a digital signal from the first input mechanism 10, an A / D converter is not necessary. Further, the functions of the amplifier and the A / D converter included in the sound input device 3 may be provided in the second input mechanism 11.

図４に示した構成例は、本発明の無限にある形態の一を例示したに過ぎず、必要に応じて適宜ハードウェア及びソフトウェア構成を変更することも可能である。例えば各プログラムモジュールは、ＶＬＳＩ等の演算回路を用いたハードウェアとして構成することも可能であり、更には加算部１２０の機能を音出力装置２に備えさせる等、適宜、実装形態を変更することができる。また加算部１２０を、抑圧機構１２外のハードウェアとして構成する場合、アナログ信号をミキシングするミキシング回路を用いた加算部１２０を用いて構成するようにしても良く、その場合、音出力装置２から入力されたアナログ信号である複数チャネルの音信号は、加算部１２０にてミキシング（加算）された後、図示しないＡ／Ｄ変換器によりデジタル信号に変換される。 The configuration example shown in FIG. 4 is merely an example of an infinite form of the present invention, and the hardware and software configurations can be changed as necessary. For example, each program module can also be configured as hardware using an arithmetic circuit such as a VLSI, and the implementation form can be changed as appropriate, for example, by providing the sound output device 2 with the function of the adder 120. Can do. Further, when the adder 120 is configured as hardware outside the suppression mechanism 12, the adder 120 may be configured using the adder 120 using a mixing circuit that mixes analog signals. The sound signals of a plurality of channels that are input analog signals are mixed (added) by the adding unit 120 and then converted into digital signals by an A / D converter (not shown).

図４において、１ｃｈ，…，ｎｃｈ（ｎは自然数）は、複数のチャネルを示しており、ｘ１（ｔ），…，ｘｎ（ｔ）は、音出力機構２０から通過機構２２へ出力される１チャネルからｎチャネルまでの音信号を示している。なお変数ｔは、アナログ信号である音信号を８０００Ｈｚ、１２０００Ｈｚ等のサンプリング周波数でサンプリングしてデジタル信号に変換した際の各サンプルを特定するサンプル番号である。またｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）は、通過機構２２を通過した１チャネルからｎチャネルまでの音信号を示しており、通過機構２２を通過した音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）を加算することにより、参照音信号ｘ＿ｆ（ｔ）が生成される。またｙ（ｔ）は、観測音信号を示しており、ｒ（ｔ）は、抑圧機構１２にて観測音信号ｙ（ｔ）のエコー成分が抑圧された抑圧結果を示す音信号であり、抑圧結果ｒ（ｔ）は、音処理装置４へ出力され、音処理装置４にて音声認識等の処理がなされる。 4, 1ch,..., Nch (n is a natural number) indicate a plurality of channels, and x1 (t),..., Xn (t) are output from the sound output mechanism 20 to the passage mechanism 22. The sound signals from channel to n channel are shown. The variable t is a sample number for identifying each sample when a sound signal, which is an analog signal, is sampled at a sampling frequency such as 8000 Hz or 12000 Hz and converted into a digital signal. Further, x1_f (t),..., Xn_f (t) indicate sound signals from the 1 channel to the n channel that have passed through the passing mechanism 22, and the sound signals x1_f (t),. By adding (t), the reference sound signal x_f (t) is generated. Further, y (t) indicates an observation sound signal, and r (t) is a sound signal indicating a suppression result in which the echo component of the observation sound signal y (t) is suppressed by the suppression mechanism 12. The result r (t) is output to the sound processing device 4, and the sound processing device 4 performs processing such as speech recognition.

図５は、本発明の実施の形態１に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。ＤＳＰを用いた通過機構２２は、入力された複数チャネルの音信号をＦＦＴ（高速フーリエ変換:Fast Fourier Transformation）処理にて夫々周波数軸上の成分の音信号に変換するＦＦＴ変換部２２０，２２０，…、周波数軸上の成分に変換した複数の音信号に対し、夫々異なる周波数帯域の成分を通過させる複数の帯域通過フィルタ部２２１，２２１，…、夫々の周波数帯域の成分を通過させた周波数軸上の成分に変換されている複数の音信号を、ＩＦＦＴ（逆フーリエ変換）処理にて夫々時間軸上の音信号に変換する複数のＩＦＦＴ変換部２２２，２２２，…等の各種プログラムモジュールを実行する。ＦＦＴ変換部２２０は、音信号の変換に際し、例えば５１２サンプル分の信号を１フレームとしたフレーム単位の音信号を生成する。なお各フレームは、１２８〜２５６サンプル分程度ずつオーバーラップしており、各フレームに対しては、ハミング窓、ハニング窓等の窓関数、高域強調フィルタによるフィルタリング等の音声認識の分野で一般的なフレーム処理が施される。そしてＦＦＴ変換部２２０は、生成したフレーム単位の音信号に対してＦＦＴ処理を行う。帯域通過フィルタ部２２１，２２１，…は、夫々チャネルに対応しており、周波数軸上の成分に変換された各音信号は、チャネルが対応している帯域通過フィルタ部２２１，２２１，…を通過する。また各帯域通過フィルタ部２２１，２２１，…は、周波数毎の透過度を示した夫々異なるフィルタ係数が予め設定されており、設定されているフィルタ係数に基づいて音信号に対するフィルタリングを行う。 FIG. 5 is a functional block diagram showing a configuration example of the passage mechanism 22 of the sound output device 2 according to Embodiment 1 of the present invention. The passing mechanism 22 using the DSP converts the input sound signals of a plurality of channels into sound signals of components on the frequency axis by FFT (Fast Fourier Transformation) processing, respectively. ..., a plurality of band-pass filter sections 221, 221 that allow components of different frequency bands to pass through a plurality of sound signals converted into components on the frequency axis, and frequency axes that pass components of the respective frequency bands Various program modules such as a plurality of IFFT converters 222, 222,... That convert a plurality of sound signals converted into the above components into sound signals on the time axis by IFFT (inverse Fourier transform) processing are executed. To do. When converting the sound signal, the FFT conversion unit 220 generates a sound signal in units of frames with, for example, a signal of 512 samples as one frame. Each frame overlaps by about 128 to 256 samples, and each frame is generally used in the field of speech recognition such as a Hamming window, a window function such as a Hanning window, and filtering using a high frequency enhancement filter. Frame processing is performed. Then, the FFT conversion unit 220 performs FFT processing on the generated sound signal in units of frames. The band pass filter units 221, 221,... Each correspond to a channel, and each sound signal converted into a component on the frequency axis passes through the band pass filter units 221, 221,. To do. Further, each of the band-pass filter units 221, 221... Is preset with different filter coefficients indicating the transmissivity for each frequency, and performs filtering on the sound signal based on the set filter coefficients.

図５において、ｘ１（ｔ），…，ｘｎ（ｔ）は、ＦＦＴ変換部２２０，２２０，…に入力された複数のチャネルの音信号であり、ＦＦＴ変換部２２０，２２０，…は、複数チャネルの音信号ｘ１（ｔ），…，ｘｎ（ｔ）を周波数軸上の成分に変換した複数の音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）を帯域通過フィルタ部２２１，２２１，…へ出力する。なお変数ｆは、周波数を示す。また帯域通過フィルタ部２２１，２２１，…は、夫々の周波数帯域の成分を通過させた複数の音信号Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）をＩＦＦＴ変換部２２２，２２２，…へ出力する。さらにＩＦＦＴ変換部２２２，２２２，…は、夫々の周波数帯域の成分を通過させた複数の音信号Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）をＩＦＦＴ処理にて夫々時間軸上の音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）に変換する。 5, x1 (t),..., Xn (t) are sound signals of a plurality of channels input to the FFT converters 220, 220,..., And the FFT converters 220, 220,. , Xn (t) are converted to components on the frequency axis and a plurality of sound signals X1 (f),..., Xn (f) are output to the bandpass filter units 221, 221. To do. The variable f indicates a frequency. Further, the band pass filter units 221, 221,... Output a plurality of sound signals X1_f (f),..., Xn_f (f) through which the components of the respective frequency bands are passed to the IFFT conversion units 222, 222,. Further, the IFFT converters 222, 222,... Respectively convert a plurality of sound signals X1_f (f),..., Xn_f (f) through which the components of the respective frequency bands have passed into sound signals x1_f ( t),..., xn_f (t).

図６は、本発明の実施の形態１に係る音出力装置２が備える通過機構２２の帯域通過フィルタ部２２１のフィルタ係数を示すグラフである。図６（ａ），（ｂ），（ｃ）は、夫々第１チャネル１ｃｈ、第２チャネル２ｃｈ、及び第ｎチャネルｎｃｈの音信号Ｘ１（ｆ）、Ｘ２（ｆ）、及びＸｎ（ｆ）に対する係数フィルタＣ１（ｆ）、Ｃ２（ｆ）、及びＣｎ（ｆ）を示しており、横軸を周波数ｆとし、縦軸をフィルタ係数として、その関係を示したグラフである。フィルタ係数とは、音信号に乗じられる周波数毎に設定された係数であり、各帯域通過フィルタ部２２１，２２１，…とは、夫々設定されているフィルタ係数を周波数軸上の成分に変換された音信号に乗じる処理を行うプログラムモジュールである。帯域通過フィルタ部２２１の処理により、フィルタ係数が１．０である周波数帯の成分は、帯域通過フィルタ部２２１を通過することになるが、フィルタ係数が０．０である周波数帯の成分は、帯域通過フィルタ部２２１の通過の際に振幅が０となり、帯域通過フィルタ部２２１で除去されることになる。図６（ａ），（ｂ），（ｃ）に示す様に、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数は、夫々異なる周波数帯域の成分を通過させる様に設定されている。 FIG. 6 is a graph showing the filter coefficient of the band pass filter unit 221 of the pass mechanism 22 provided in the sound output device 2 according to Embodiment 1 of the present invention. FIGS. 6A, 6B, and 6C show the sound signals X1 (f), X2 (f), and Xn (f) of the first channel 1ch, the second channel 2ch, and the nth channel nch, respectively. The coefficient filters C1 (f), C2 (f), and Cn (f) are shown, and the horizontal axis is the frequency f and the vertical axis is the filter coefficient. The filter coefficient is a coefficient set for each frequency multiplied by the sound signal, and each of the band pass filter units 221, 221,... Has converted the set filter coefficient into a component on the frequency axis. It is a program module that performs processing for multiplying sound signals. By the processing of the band pass filter unit 221, the component of the frequency band whose filter coefficient is 1.0 passes through the band pass filter unit 221, but the component of the frequency band whose filter coefficient is 0.0 is When passing through the band pass filter unit 221, the amplitude becomes 0 and is removed by the band pass filter unit 221. As shown in FIGS. 6A, 6B, and 6C, the filter coefficients of the band pass filter units 221, 221,... Are set so as to pass components in different frequency bands.

各帯域通過フィルタ部２２１，２２１，…では、周波数帯域を等間隔に分割し、分割した夫々の周波数帯域の成分が、いずれかの帯域通過フィルタ部２２１のみを通過する様にフィルタ係数が設定されている。図６の様なフィルタ係数を設定することにより、各帯域通過フィルタ部２２１，２２１，…は、櫛形フィルタとして機能する。なお帯域通過フィルタ部２２１，２２１，…のフィルタ係数の設定に際し、周波数の対数値が等間隔となる様に分割し、分割した夫々の周波数帯域の成分が、いずれかの帯域通過フィルタ部２２１のみを通過する様にフィルタ係数を設定して櫛形フィルタを形成する様にしても良く、対数値を用いて周波数帯域を分割することにより、聴取時に違和感の少ない音を出力させることが可能となる。 In each of the band pass filter units 221, 221,..., The frequency band is divided at equal intervals, and the filter coefficients are set so that the components of the divided frequency bands pass through only one of the band pass filter units 221. ing. By setting the filter coefficients as shown in FIG. 6, each bandpass filter unit 221, 221... Functions as a comb filter. When setting the filter coefficients of the band pass filter units 221, 221,..., It is divided so that the logarithmic values of the frequencies are equally spaced, and each of the divided frequency band components is only one of the band pass filter units 221. A comb filter may be formed by setting a filter coefficient so as to pass, and by dividing a frequency band using a logarithmic value, it is possible to output a sound with a less uncomfortable feeling during listening.

図７は、本発明の実施の形態１に係るエコー抑圧装置１が備える抑圧機構１２の構成例を示す機能ブロック図である。ＤＳＰを用いた抑圧機構１２は、前述した様に加算部１２０、補正部１２１等のプログラムモジュールを実行する。 FIG. 7 is a functional block diagram showing a configuration example of the suppression mechanism 12 included in the echo suppression device 1 according to Embodiment 1 of the present invention. The suppression mechanism 12 using a DSP executes program modules such as the addition unit 120 and the correction unit 121 as described above.

さらに補正部１２１は、エコーの推定に要する周波数毎に設定されているフィルタ係数を用いた数百次の積和演算にて、参照音信号ｘ＿ｆ（ｔ）をフィルタリングすることにより、観測音信号ｙ（ｔ）の補正に要する補正量として、エコー信号ｘ'（ｔ）を導出する線形ＦＩＲフィルタ（補正用フィルタ）部１２１０と、観測音信号ｙ（ｔ）からエコー信号ｘ'（ｔ）を減算することにより、観測音信号ｙ（ｔ）を補正し、エコー成分を抑圧した観測音信号を抑圧結果ｒ（ｔ）として出力する減算部１２１１と、補正後の観測音信号、即ち抑圧結果ｒ（ｔ）に基づいて、学習同定法を用いた適応処理により、線形ＦＩＲフィルタ部１２１０のフィルタ係数の算出及び更新を行うフィルタ係数更新部１２１２と、抑圧結果ｒ（ｔ）に基づいて、話者が発声しているダブルトークの状態か発声していないシングルトークの状態かを検出する検出部１２１３とをサブモジュールとして実行する。そして検出部１２１３は、抑圧結果ｒ（ｔ）の強度変化に基づいてシングルトークの状態とダブルトークの状態とを検出し、ダブルトークの状態時には、フィルタ係数更新部１２１２によるフィルタ係数の算出及び更新を停止させる。なお補正部１２１の処理に際し、必要に応じて、参照音信号ｘ＿ｆ（ｔ）及び観測音信号ｙ（ｔ）に対してフレーム化及び周波数軸上の成分へ変換するＦＦＴ処理が行われた上でエコー抑圧処理がなされ、更に時間軸上の成分へ変換するＩＦＦＴ処理が行われた抑圧結果ｒ（ｔ）が出力される様に構成することも可能である。 Further, the correction unit 121 filters the reference sound signal x_f (t) by a several hundredth-order product-sum operation using a filter coefficient set for each frequency required for echo estimation, so that the observed sound signal y As a correction amount required for correcting (t), a linear FIR filter (correction filter) unit 1210 for deriving an echo signal x ′ (t), and subtracting the echo signal x ′ (t) from the observed sound signal y (t) Thus, the observation sound signal y (t) is corrected and the observation sound signal in which the echo component is suppressed is output as the suppression result r (t), and the corrected observation sound signal, that is, the suppression result r (t t), the filter coefficient updating unit 1212 that calculates and updates the filter coefficient of the linear FIR filter unit 1210 by adaptive processing using the learning identification method, and the speech based on the suppression result r (t). There executes a detection unit 1213 for detecting whether a state of single-talk not speaking or the state of double-talk which is uttered as a submodule. Then, the detection unit 1213 detects the single talk state and the double talk state based on the intensity change of the suppression result r (t). When the double talk state is detected, the filter coefficient update unit 1212 calculates and updates the filter coefficient. Stop. In the processing of the correction unit 121, the reference sound signal x_f (t) and the observation sound signal y (t) are subjected to framing and FFT processing for converting into components on the frequency axis as necessary. It is also possible to configure so that a suppression result r (t) that has been subjected to echo suppression processing and further subjected to IFFT processing for conversion to a component on the time axis is output.

図８は、本発明の実施の形態１に係る音処理装置４の構成例を示すブロック図である。音処理装置４は、エコー抑圧装置１の出力機構１３から出力された抑圧結果ｒ（ｔ）を入力される入力機構４０と、入力機構４０から入力された抑圧結果ｒ（ｔ）に基づいて音声認識処理を行う認識機構４１と、認識機構４１による認識結果に基づいてナビゲーション処理の制御を行う制御機構４２とを備えている。制御機構４２による制御とは、話者が発声した音声から認識した命令に基づく移動体５の目的地の入力、走行経路の導出、走行経路の表示等のカーナビゲーションシステム本来の処理である。なお本発明のシステムをオーディオシステムに適用する場合、音処理装置４は、エコー抑圧装置１、音出力装置２及び音入力装置３を制御する制御装置として機能し、話者が発生した音声による命令、例えば音源切替、再生開始、音声認識終了等の命令に基づいて、制御機構４２は、エコー抑圧装置１、音出力装置２及び音入力装置３を制御する。 FIG. 8 is a block diagram showing a configuration example of the sound processing device 4 according to Embodiment 1 of the present invention. The sound processing device 4 receives the suppression result r (t) output from the output mechanism 13 of the echo suppression device 1 and the voice based on the suppression result r (t) input from the input mechanism 40. A recognition mechanism 41 that performs recognition processing and a control mechanism 42 that controls navigation processing based on the recognition result of the recognition mechanism 41 are provided. The control by the control mechanism 42 is processing inherent in the car navigation system such as input of a destination of the moving body 5, derivation of a travel route, and display of a travel route based on a command recognized from a voice uttered by a speaker. When the system of the present invention is applied to an audio system, the sound processing device 4 functions as a control device that controls the echo suppression device 1, the sound output device 2, and the sound input device 3, and a voice command generated by a speaker. For example, the control mechanism 42 controls the echo suppression device 1, the sound output device 2, and the sound input device 3 based on commands such as sound source switching, reproduction start, and voice recognition end.

次に本発明の実施の形態１に係るエコー抑圧システムが備える各装置の処理について説明する。図９は、本発明の実施の形態１に係る音出力装置２の音出力処理の一例を示すフローチャートである。音出力装置２は、音信号生成機構２１により、複数チャネル分の音信号ｘ１（ｔ），…，ｘｎ（ｔ）を生成し（Ｓ１０１）、生成した音信号ｘ１（ｔ），…，ｘｎ（ｔ）を通過機構２２へ出力する。 Next, processing of each device provided in the echo suppression system according to Embodiment 1 of the present invention will be described. FIG. 9 is a flowchart showing an example of sound output processing of the sound output device 2 according to Embodiment 1 of the present invention. The sound output device 2 generates sound signals x1 (t),..., Xn (t) for a plurality of channels by the sound signal generation mechanism 21 (S101), and the generated sound signals x1 (t),. t) is output to the passing mechanism 22.

音出力装置２の通過機構２２は、入力された複数チャネルの音信号ｘ１（ｔ），…，ｘｎ（ｔ）をデジタル信号に変換し、フレーム化して、各ＦＦＴ変換部２２０，２２０，…により、複数の音信号をＦＦＴ処理にて夫々周波数軸上の成分の音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）に変換し（Ｓ１０２）、周波数軸上の成分に変換した各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）を、夫々帯域通過フィルタ部２２１，２２１，…へ渡す。ステップＳ１０２において、周波数軸上の成分に変換する方法としては、必ずしもＦＦＴを用いる必要はなく、ＤＣＴ（離散コサイン変換：Discrete Cosine Transform ）等の他の変換方法を用いてもよい。 The passage mechanism 22 of the sound output device 2 converts the input sound signals x1 (t),..., Xn (t) of a plurality of channels into digital signals, framing them, and using the FFT converters 220, 220,. The sound signals are converted into sound signals X1 (f),..., Xn (f) on the frequency axis by FFT processing (S102), and the sound signals X1 ( f),..., Xn (f) are passed to the band-pass filter units 221, 221. In step S102, as a method of converting to a component on the frequency axis, it is not always necessary to use FFT, and other conversion methods such as DCT (Discrete Cosine Transform) may be used.

音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…により、周波数軸上の成分に変換された複数の音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）に対し、夫々異なる周波数帯域の成分Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）を通過させ（Ｓ１０３）、ＩＦＦＴ変換部２２２，２２２，…により、ＩＦＦＴ処理にて夫々時間軸上の音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）に変換し（Ｓ１０４）、アナログ信号に変換して、音出力機構２０，２０，…へ出力し、またエコー抑圧装置１へ出力する（Ｓ１０５）。 The pass mechanism 22 of the sound output device 2 respectively receives a plurality of sound signals X1 (f),..., Xn (f) converted into components on the frequency axis by the band pass filter units 221, 221. The components X1_f (f),..., Xn_f (f) of different frequency bands are passed (S103), and the IFFT converters 222, 222,. , Xn_f (t) (S104), converted into an analog signal, output to the sound output mechanism 20, 20,..., And output to the echo suppressor 1 (S105).

音出力装置２の音出力機構２０，２０，…は、入力された音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）に基づいて音を出力する（Ｓ１０６）。 The sound output mechanisms 20, 20,... Of the sound output device 2 output sounds based on the input sound signals x1_f (t),..., Xn_f (t) (S106).

図１０は、本発明の実施の形態１に係る音入力装置３の音入力処理の一例を示すフローチャートである。音入力装置３は、音入力機構３０により、音を入力し（Ｓ２０１）、入力された音に基づいて音信号を生成し、生成した音信号を観測音信号ｙ（ｔ）としてエコー抑圧装置１へ出力する（Ｓ２０２）。ステップＳ２０１において、音入力装置３は、音出力装置２が音を出力する音場からの音が入力される。従って音入力装置３が入力される音には、音出力装置２から出力された音が混入する可能性がある。 FIG. 10 is a flowchart showing an example of sound input processing of the sound input device 3 according to Embodiment 1 of the present invention. The sound input device 3 inputs a sound by the sound input mechanism 30 (S201), generates a sound signal based on the input sound, and uses the generated sound signal as an observed sound signal y (t) as an echo suppression device 1 (S202). In step S201, the sound input device 3 receives sound from a sound field from which the sound output device 2 outputs sound. Therefore, the sound input from the sound input device 3 may be mixed with the sound output from the sound output device 2.

図１１は、本発明の実施の形態１に係るエコー抑圧装置１のエコー抑圧処理の一例を示すフローチャートである。エコー抑圧装置１は、第１入力機構１０により、音出力装置２から出力された複数チャネル分の音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）を入力し（Ｓ３０１）、入力された音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）をデジタル信号に変換して抑圧機構１２へ渡し、また第２入力機構１１により、音入力装置３から出力された観測音信号ｙ（ｔ）を入力し（Ｓ３０２）、入力された観測音信号ｙ（ｔ）をデジタル信号に変換して抑圧機構１２へ渡す。なおステップＳ３０１及びステップＳ３０２の処理は、実質的に並行して行われる。 FIG. 11 is a flowchart showing an example of echo suppression processing of the echo suppression apparatus 1 according to Embodiment 1 of the present invention. The echo suppression device 1 inputs sound signals x1_f (t),..., Xn_f (t) for a plurality of channels output from the sound output device 2 by the first input mechanism 10 (S301), and the input sound signals x1_f (t),..., xn_f (t) are converted into digital signals and transferred to the suppression mechanism 12, and the observation sound signal y (t) output from the sound input device 3 is input by the second input mechanism 11. (S302), the input observation sound signal y (t) is converted into a digital signal and passed to the suppression mechanism 12. Note that the processes in steps S301 and S302 are performed substantially in parallel.

エコー抑圧装置１の抑圧機構１２は、加算部１２０により、音出力装置２から出力された複数チャネル分の音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）を加算して参照音信号ｘ＿ｆ（ｘ）を生成し（Ｓ３０３）、生成した参照音信号ｘ＿ｆ（ｘ）を補正部１２１へ渡す。 The suppression mechanism 12 of the echo suppressor 1 adds the sound signals x1_f (t),..., Xn_f (t) for a plurality of channels output from the sound output device 2 by the adding unit 120 to add the reference sound signal x_f (x ) Is generated (S303), and the generated reference sound signal x_f (x) is passed to the correction unit 121.

エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、線形ＦＩＲフィルタ部１２１０にて、設定されているフィルタ係数を用いた積和演算により、参照音信号ｘ＿ｆ（ｘ）をフィルタリングすることで、観測音信号ｙ（ｔ）の補正に要する補正量としてエコー信号ｘ'（ｔ）を導出し（Ｓ３０４）、導出した補正量であるエコー信号ｘ'（ｔ）を減算部１２１１へ渡す。 The suppression mechanism 12 of the echo suppression apparatus 1 filters the reference sound signal x_f (x) by the product-sum operation using the set filter coefficient in the linear FIR filter unit 1210 as processing of the correction unit 121. Thus, the echo signal x ′ (t) is derived as a correction amount required for correcting the observed sound signal y (t) (S304), and the echo signal x ′ (t), which is the derived correction amount, is passed to the subtracting unit 1211.

エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、減算部１２１１により、観測音信号ｙ（ｔ）からエコー信号ｘ'（ｔ）を減算することで観測音信号ｙ（ｔ）を補正し（Ｓ３０５）、補正した観測音信号ｙ（ｔ）である抑圧結果ｒ（ｔ）をアナログ信号に変換し、変換した抑圧結果ｒ（ｔ）を音処理装置４へ出力する（Ｓ３０６）。そして音処理装置４では、入力された抑圧結果ｒ（ｔ）に基づいて音声認識等の処理を実行する。 The suppression mechanism 12 of the echo suppressor 1 subtracts the echo signal x ′ (t) from the observation sound signal y (t) by the subtraction unit 1211 as the processing of the correction unit 121, thereby obtaining the observation sound signal y (t). It correct | amends (S305), converts the suppression result r (t) which is the corrected observation sound signal y (t) into an analog signal, and outputs the converted suppression result r (t) to the sound processor 4 (S306). The sound processing device 4 executes processing such as speech recognition based on the input suppression result r (t).

図１２は、本発明の実施の形態１に係るエコー抑圧装置１のフィルタ係数更新処理の一例を示すフローチャートである。エコー抑圧装置１では、図１１を用いたエコー抑圧処理と並行して、エコー信号の導出に用いるフィルタ係数の更新処理を実行する。エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、検出部１２１３により、減算部１２１１から出力される抑圧結果ｒ（ｔ）に基づいて、話者が発声しているダブルトークの状態か発声していないシングルトークの状態かを検出し（Ｓ４０１）、検出した結果を示す検出結果をフィルタ係数更新部１２１２へ渡す。 FIG. 12 is a flowchart showing an example of the filter coefficient update process of the echo suppression apparatus 1 according to Embodiment 1 of the present invention. In the echo suppression apparatus 1, in parallel with the echo suppression process using FIG. 11, the update process of the filter coefficient used for the derivation of the echo signal is executed. The suppression mechanism 12 of the echo suppressor 1 is a double-talk state in which the speaker is speaking based on the suppression result r (t) output from the subtraction unit 1211 by the detection unit 1213 as the processing of the correction unit 121. It is detected whether the single talk is not uttered (S401), and the detection result indicating the detected result is passed to the filter coefficient updating unit 1212.

エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、フィルタ係数更新部１２１２により、検出部１２１３から受け付けた検出結果に基づいて、フィルタ係数の算出及び更新の要否を判定する（Ｓ４０２）。ステップＳ４０２では、検出結果がシングルトークを示す場合、フィルタ係数の算出及び更新を実行し、ダブルトークを示す場合、フィルタ係数の算出及び更新を停止する。 The suppression mechanism 12 of the echo suppression apparatus 1 determines whether or not it is necessary to calculate and update the filter coefficient based on the detection result received from the detection unit 1213 by the filter coefficient update unit 1212 as processing of the correction unit 121 (S402). ). In step S402, when the detection result indicates single talk, calculation and update of the filter coefficient are executed, and when double detection is indicated, calculation and update of the filter coefficient are stopped.

ステップＳ４０２において、フィルタ係数の算出及び更新を要すると判定した場合（Ｓ４０２：ＹＥＳ）、エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、フィルタ係数更新部１２１２により、減算部１２１１から出力される抑圧結果ｒ（ｔ）に基づいて、学習同定法を用いた適応処理にてフィルタ係数を算出し（Ｓ４０３）、線形ＦＩＲフィルタ部１２１０にて用いられているフィルタ係数を、ステップＳ４０３にて算出したフィルタ係数に更新し（Ｓ４０４）、ステップＳ４０１へ戻り、以降の処理を繰り返す。 If it is determined in step S402 that the filter coefficient needs to be calculated and updated (S402: YES), the suppression mechanism 12 of the echo suppression device 1 performs processing of the correction unit 121 by the filter coefficient update unit 1212 from the subtraction unit 1211. Based on the output suppression result r (t), filter coefficients are calculated by adaptive processing using a learning identification method (S403), and the filter coefficients used in the linear FIR filter unit 1210 are set in step S403. (S404), the process returns to step S401, and the subsequent processing is repeated.

ステップＳ４０２において、フィルタ係数の算出及び更新を要しないと判定した場合（Ｓ４０２：ＮＯ）、エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、フィルタ係数更新部１２１２によるフィルタ係数の算出及び更新を行わず、ステップＳ４０１へ戻り、以降の処理を繰り返す。 When it is determined in step S402 that calculation and update of the filter coefficient are not required (S402: NO), the suppression mechanism 12 of the echo suppression device 1 calculates the filter coefficient by the filter coefficient update unit 1212 as processing of the correction unit 121. And without updating, it returns to step S401 and repeats the subsequent processing.

音出力装置２から音場に対して複数チャネルの音が出力されている場合、当該音場から入力された音信号には音出力装置２から出力された音に基づくエコーが含まれる。当該音場が音に対する線形性を保っている場合、音出力装置２から出力された音は、その周波数を維持して音入力装置３に入力される。また参照音信号ｘ＿ｆ（ｔ）は、夫々異なる周波数帯域が割り当てられた複数チャネルの音信号ｘ１＿ｆ（ｔ），…，ｘｎ＿ｆ（ｔ）を加算した信号であるから、任意の周波数ｆにおいて、参照音信号ｘ＿ｆ（ｔ）及び観測音信号ｙ（ｔ）の関係は、１チャネルの音を出力する場合の関係と等価であると見なすことができる。従って本願のエコー抑圧システムでは、複数チャネルの音信号を出力する場合であっても、１チャネルの音信号を出力する場合と同等の精度でエコーを抑圧することが可能である。 When sounds of a plurality of channels are output from the sound output device 2 to the sound field, the sound signal input from the sound field includes an echo based on the sound output from the sound output device 2. When the sound field maintains linearity with respect to sound, the sound output from the sound output device 2 is input to the sound input device 3 while maintaining its frequency. In addition, the reference sound signal x_f (t) is a signal obtained by adding the sound signals x1_f (t),..., Xn_f (t) of a plurality of channels to which different frequency bands are assigned, and therefore the reference sound at any frequency f The relationship between the signal x_f (t) and the observation sound signal y (t) can be regarded as equivalent to the relationship in the case of outputting a sound of one channel. Therefore, in the echo suppression system of the present application, it is possible to suppress echoes with the same accuracy as when outputting a sound signal of one channel even when outputting a sound signal of a plurality of channels.

実施の形態２．
実施の形態２は、実施の形態１において、原音信号を所定の方法で加工した加工音信号を用いて実現される複数チャネルの音信号に対し、効率的なフィルタリングを実現する形態である。ここでは実施の形態２として、Ｌ（Ｌｅｆｔ）チャネルの第１音信号、Ｒ（Ｒｉｇｈｔ）チャネルの第２音信号、第１音信号及び第２音信号の和であるＣ（Ｃｅｎｔｅｒ）チャネルの和音信号、第１音信号を所定時間遅延させたｓＬ（ＳｕｒｒｏｕｎｄＬｅｆｔ）チャネルの遅延第１音信号、及び第２音信号を所定時間遅延させたｓＲ（ＳｕｒｒｏｕｎｄＲｉｇｈｔ）チャネルの遅延第２音信号による擬似５チャネルのステレオシステムに適用する例について説明する。擬似５チャネルにおいて、第１音信号及び第２音信号は、原音信号であり、和音信号、遅延第１音信号及び遅延第２音信号は、原音信号を加算又は遅延により加工した加工音信号である。但し、本発明の実施の形態２に係るエコー抑圧システムは、必ずしも擬似５チャネルに限らず、第１音信号、第２音信号、遅延第１音信号及び遅延第２音信号からなる擬似４チャネルのステレオシステムに適用する形態、和音信号を遅延させた遅延和音信号を擬似信号として用いる形態等、様々な形態に展開することが可能である。Embodiment 2. FIG.
The second embodiment is a form in which efficient filtering is realized for the sound signals of a plurality of channels realized by using the processed sound signal obtained by processing the original sound signal by a predetermined method in the first embodiment. Here, as the second embodiment, the first sound signal of the L (Left) channel, the second sound signal of the R (Right) channel, the chord of the C (Center) channel that is the sum of the first sound signal and the second sound signal. Signal, a delayed first sound signal of an sL (Surround Left) channel obtained by delaying the first sound signal by a predetermined time, and a pseudo second sound signal of an sR (Surround Right) channel obtained by delaying the second sound signal by a predetermined time. An example applied to a 5-channel stereo system will be described. In the pseudo 5 channel, the first sound signal and the second sound signal are original sound signals, and the chord signal, the delayed first sound signal and the delayed second sound signal are processed sound signals processed by adding or delaying the original sound signals. is there. However, the echo suppression system according to the second embodiment of the present invention is not necessarily limited to the pseudo 5 channel, but the pseudo 4 channel including the first sound signal, the second sound signal, the delayed first sound signal, and the delayed second sound signal. The present invention can be developed in various forms such as a form applied to the stereo system and a form using a delayed chord signal obtained by delaying the chord signal as a pseudo signal.

以降の説明において、実施の形態１と同様の構成については、実施の形態１と同様の符号を付すものとし、その詳細な説明を省略する。実施の形態２におけるエコー抑圧システムの構成例は、図４を用いて示した実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 In the following description, components similar to those in the first embodiment are denoted by the same reference numerals as those in the first embodiment, and detailed description thereof is omitted. Since the configuration example of the echo suppression system in the second embodiment is the same as that in the first embodiment shown in FIG. 4, the first embodiment is referred to and the description thereof is omitted.

図１３は、本発明の実施の形態２に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。実施の形態２に係る音出力装置２の通過機構２２の構成は、実質的に実施の形態１と同様である。なお図１３において、ｘＬ（ｔ），ｘＲ（ｔ），ｘＣ（ｔ），ｘｓＬ（ｔ），ｘｓＲ（ｔ）は、ＦＦＴ変換部２２０，２２０，…に入力された第１音信号、第２音信号、和音信号、遅延第１音信号及び遅延第２音信号を示す。またこれらの音信号ｘＬ（ｔ），ｘＲ（ｔ），ｘＣ（ｔ），ｘｓＬ（ｔ），ｘｓＲ（ｔ）をＦＦＴ変換部２２０，２２０，…により、周波数軸上の成分に変換した音信号がＸＬ（ｆ），ＸＲ（ｆ），ＸＣ（ｆ），ＸｓＬ（ｆ），ＸｓＲ（ｆ）である。さらに周波数軸上の成分に変換したこれらの音信号ＸＬ（ｆ），ＸＲ（ｆ），ＸＣ（ｆ），ＸｓＬ（ｆ），ＸｓＲ（ｆ）が夫々帯域通過フィルタ部２２１，２２１，…を通過した音信号がＸＬ＿ｆ（ｆ），ＸＲ＿ｆ（ｆ），ＸＣ＿ｆ（ｆ），ＸｓＬ＿ｆ（ｆ），ＸｓＲ＿ｆ（ｆ）である。そしてこれらの音信号ＸＬ＿ｆ（ｆ），ＸＲ＿ｆ（ｆ），ＸＣ＿ｆ（ｆ），ＸｓＬ＿ｆ（ｆ），ＸｓＲ＿ｆ（ｆ）をＩＦＦＴ変換部２２２，２２２，…にて変換した時間軸上の音信号がｘＬ＿ｆ（ｔ），ｘＲ＿ｆ（ｔ），ｘＣ＿ｆ（ｔ），ｘｓＬ＿ｆ（ｔ），ｘｓＲ＿ｆ（ｔ）である。 FIG. 13 is a functional block diagram showing a configuration example of the passing mechanism 22 of the sound output device 2 according to Embodiment 2 of the present invention. The configuration of the passage mechanism 22 of the sound output device 2 according to the second embodiment is substantially the same as that of the first embodiment. In FIG. 13, xL (t), xR (t), xC (t), xsL (t), xsR (t) are the first sound signal input to the FFT converters 220, 220,. A sound signal, a chord signal, a delayed first sound signal, and a delayed second sound signal are shown. These sound signals xL (t), xR (t), xC (t), xsL (t), xsR (t) are converted into components on the frequency axis by the FFT converters 220, 220,. Are XL (f), XR (f), XC (f), XsL (f), and XsR (f). Further, these sound signals XL (f), XR (f), XC (f), XsL (f), XsR (f) converted into components on the frequency axis pass through the band-pass filter units 221, 221. The sound signals obtained are XL_f (f), XR_f (f), XC_f (f), XsL_f (f), and XsR_f (f). The sound signals on the time axis obtained by converting these sound signals XL_f (f), XR_f (f), XC_f (f), XsL_f (f), XsR_f (f) by the IFFT converters 222, 222,. (T), xR_f (t), xC_f (t), xsL_f (t), xsR_f (t).

図１４は、本発明の実施の形態２に係る音出力装置２が備える通過機構２２の帯域通過フィルタ部２２１のフィルタ係数を示すグラフである。図１４（ａ），（ｂ），（ｃ），（ｄ），（ｅ）は、夫々Ｌチャネルの第１音信号ＸＬ（ｆ）、Ｒチャネルの第２音信号ＸＲ（ｆ）、Ｃチャネルの和音信号ＸＣ（ｆ）、ｓＬチャネルの遅延第１音信号ＸｓＬ（ｆ）及びｓＲチャネルの遅延第２音信号ＸｓＲ（ｆ）に対する係数フィルタＣＬ（ｆ）、ＣＲ（ｆ）、ＣＣ（ｆ）、ＣｓＬ（ｆ）及びＣｓＲ（ｆ）を示しており、横軸を周波数ｆとし、縦軸をフィルタ係数として、その関係を示したグラフである。 FIG. 14 is a graph showing filter coefficients of the band pass filter unit 221 of the pass mechanism 22 provided in the sound output device 2 according to Embodiment 2 of the present invention. FIGS. 14 (a), (b), (c), (d), and (e) show the L channel first sound signal XL (f), the R channel second sound signal XR (f), and the C channel, respectively. Coefficient filters CL (f), CR (f), CC (f) for the chord signal XC (f), the delayed first sound signal XsL (f) of the sL channel and the delayed second sound signal XsR (f) of the sR channel , CsL (f) and CsR (f), the horizontal axis is the frequency f, and the vertical axis is the filter coefficient.

図１４（ａ）に示した第１音信号ＸＬ（ｆ）に対する係数フィルタＣＬ（ｆ）及び図１４（ｂ）に示した第２音信号ＸＲ（ｆ）に対する係数フィルタＣＲ（ｆ）は、夫々異なる周波数帯域の成分を通過させるように設定されたフィルタである。和音信号ＸＣ（ｆ）は、第１音信号ＸＬ（ｆ）及び第２音信号ＸＲ（ｆ）を加算した音信号であるので、和音信号ＸＣ（ｆ）に対しては、第１音信号ＸＬ（ｆ）に対する係数フィルタＣＬ（ｆ）及び第２音信号ＸＲ（ｆ）に対する係数フィルタＣＲ（ｆ）の和である図１４（ｃ）に示した係数フィルタＣＣ（ｆ）が用いられる。実施の形態２として示す例では、係数フィルタＣＣ（ｆ）は、図１４（ｃ）に示す様にフィルタ処理の対象となる全周波数帯域の成分を通過させるフィルタであり、周波数に関わらずフィルタ係数が１．０となる。また遅延第１音信号ＸｓＬ（ｆ）は、第１音信号ＸＬ（ｆ）を遅延した音信号であるので、遅延第１音信号ＸｓＬ（ｆ）に対しては、第１音信号ＸＬ（ｆ）に対する係数フィルタＣＬ（ｆ）を転用（複写）した図１４（ｄ）に示す係数フィルタＣｓＬ（ｆ）が用いられる。従って第１音信号ＸＬ（ｆ）に対する係数フィルタＣＬ（ｆ）と、遅延第１音信号ＸｓＬ（ｆ）に対する係数フィルタＣｓＬ（ｆ）とは、同一のフィルタである。さらに遅延第２音信号ＸｓＲ（ｆ）は、第２音信号ＸＲ（ｆ）を遅延した音信号であるので、遅延第２音信号ＸｓＲ（ｆ）に対しては、第２音信号ＸＲ（ｆ）に対する係数フィルタＣＲ（ｆ）を転用（複写）した図１４（ｅ）に示す係数フィルタＣｓＲ（ｆ）が用いられる。従って第２音信号ＸＲ（ｆ）に対する係数フィルタＣＲ（ｆ）と、遅延第２音信号ＸｓＲ（ｆ）に対する係数フィルタＣｓＲ（ｆ）とは、同一のフィルタ係数を持つフィルタである。 The coefficient filter CL (f) for the first sound signal XL (f) shown in FIG. 14A and the coefficient filter CR (f) for the second sound signal XR (f) shown in FIG. It is a filter set to pass components of different frequency bands. Since the chord signal XC (f) is a sound signal obtained by adding the first sound signal XL (f) and the second sound signal XR (f), the chord signal XC (f) has a first sound signal XL. The coefficient filter CC (f) shown in FIG. 14C, which is the sum of the coefficient filter CL (f) for (f) and the coefficient filter CR (f) for the second sound signal XR (f), is used. In the example shown as the second embodiment, the coefficient filter CC (f) is a filter that passes the components of the entire frequency band to be filtered as shown in FIG. 14C, and the filter coefficient regardless of the frequency. Becomes 1.0. Since the delayed first sound signal XsL (f) is a sound signal obtained by delaying the first sound signal XL (f), the first sound signal XL (f The coefficient filter CsL (f) shown in FIG. 14 (d) obtained by diverting (copying) the coefficient filter CL (f) to () is used. Therefore, the coefficient filter CL (f) for the first sound signal XL (f) and the coefficient filter CsL (f) for the delayed first sound signal XsL (f) are the same filter. Further, since the delayed second sound signal XsR (f) is a sound signal obtained by delaying the second sound signal XR (f), the second sound signal XR (f) ) Is used (copied) as a coefficient filter CsR (f) shown in FIG. 14E. Accordingly, the coefficient filter CR (f) for the second sound signal XR (f) and the coefficient filter CsR (f) for the delayed second sound signal XsR (f) are filters having the same filter coefficient.

この様に実施の形態２において、帯域通過フィルタ部２２１，２２１，…は、５チャネル分の信号を処理するため、５チャネル分の信号を格納するメモリ及びフィルタ処理を行うプログラムモジュールにて構成される。夫々のプログラムモジュールにて用いられるフィルタ係数として、第１音信号ＸＬ（ｆ）及び第２音信号ＸＲ（ｆ）に対しては夫々独自に設定されたフィルタ係数が用いられる。また和音信号ＸＣ（ｆ）に対しては第１音信号ＸＬ（ｆ）のフィルタ係数及び第２音信号ＸＲ（ｆ）のフィルタ係数の和として算出されるフィルタ係数が用いられる。さらに遅延第１音信号ＸｓＬ（ｆ）に対しては、第１音信号ＸＬ（ｆ）のフィルタ係数を複写したフィルタ係数が用いられ、遅延第２音信号ＸｓＲ（ｆ）に対しては、第２音信号ＸＲ（ｆ）のフィルタ係数を複写したフィルタ係数が用いられる。なお各チャネルのフィルタ処理を行うプログラムモジュールは、一のプログラムモジュールを時分割で使用し、信号毎にフィルタ係数を設定し直すようにしても良い。 As described above, in the second embodiment, the bandpass filter units 221, 221... Are configured by a memory for storing signals for five channels and a program module for performing filter processing in order to process signals for five channels. The As the filter coefficients used in the respective program modules, filter coefficients uniquely set for the first sound signal XL (f) and the second sound signal XR (f) are used. For the chord signal XC (f), a filter coefficient calculated as the sum of the filter coefficient of the first sound signal XL (f) and the filter coefficient of the second sound signal XR (f) is used. Further, for the delayed first sound signal XsL (f), a filter coefficient obtained by copying the filter coefficient of the first sound signal XL (f) is used, and for the delayed second sound signal XsR (f), the first A filter coefficient obtained by copying the filter coefficient of the two-tone signal XR (f) is used. The program module that performs the filtering process for each channel may use one program module in a time-sharing manner and reset the filter coefficient for each signal.

その他の構成及び処理は、実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 Since other configurations and processes are the same as those in the first embodiment, the first embodiment will be referred to and description thereof will be omitted.

実施の形態３．
実施の形態３は、実施の形態１において、音信号生成機構が生成した複数チャネル分の音信号に対し、人が発声する音声に対応する周波数帯域等の予め設定されている除去帯域に該当する周波数帯域の成分を、何れの音信号に対しても通過させないようにし、除去帯域に該当する周波数帯域の成分に基づいてダブルトーク及びシングルトークを検出する形態である。Embodiment 3 FIG.
The third embodiment corresponds to a preset removal band such as a frequency band corresponding to a voice uttered by a person with respect to sound signals for a plurality of channels generated by the sound signal generation mechanism in the first embodiment. In this configuration, the frequency band component is not allowed to pass through any sound signal, and double talk and single talk are detected based on the frequency band component corresponding to the removal band.

以降の説明において、実施の形態１と同様の構成については、実施の形態１と同様の符号を付すものとし、その詳細な説明を省略する。実施の形態３におけるエコー抑圧システムの構成例は、図４を用いて示した実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 In the following description, components similar to those in the first embodiment are denoted by the same reference numerals as those in the first embodiment, and detailed description thereof is omitted. Since the configuration example of the echo suppression system in the third embodiment is the same as that in the first embodiment shown in FIG. 4, the first embodiment is referred to and the description thereof is omitted.

図１５は、本発明の実施の形態３に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。実施の形態３に係る通過機構２２は、予め設定されている除去帯域ｆ＿ｃｕｔに該当する周波数帯域の成分を除去する除去部２２３，２２３，…を備えている。除去部２２３，２２３，…は、帯域通過フィルタ部２２１，２２１，…を通過した周波数軸上の成分に変換されている複数の音信号Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）から、除去帯域ｆ＿ｃｕｔに該当する周波数帯域の成分を除去し、除去帯域ｆ＿ｃｕｔの成分を除去した信号Ｘ１＿ｆ＿ｃ（ｆ），…，Ｘｎ＿ｆ＿ｃ（ｆ）をＩＦＦＴ変換部２２２，２２２，…へ出力する。ＩＦＦＴ変換部２２２，２２２，…は、除去部２２３，２２３，…を通過させた複数の音信号Ｘ１＿ｆ＿ｃ（ｆ），…，Ｘｎ＿ｆ＿ｃ（ｆ）をＩＦＦＴ処理にて夫々時間軸上の音信号ｘ１＿ｆ＿ｃ（ｔ），…，ｘｎ＿ｆ＿ｃ（ｔ）に変換する。そしてＩＦＦＴ変換部２２２，２２２，…は、各音信号ｘ１＿ｆ＿ｃ（ｔ），…，ｘｎ＿ｆ＿ｃ（ｔ）を、チャネルが対応している音出力機構２０，２０，…及びエコー抑圧装置１へ出力する。 FIG. 15 is a functional block diagram showing a configuration example of the passage mechanism 22 of the sound output device 2 according to Embodiment 3 of the present invention. The passing mechanism 22 according to Embodiment 3 includes removal units 223, 223,... That remove components in a frequency band corresponding to a preset removal band f_cut. The removal units 223, 223,... Are removed from a plurality of sound signals X1_f (f),..., Xn_f (f) converted into components on the frequency axis that have passed through the bandpass filter units 221, 221. The frequency band component corresponding to f_cut is removed, and signals X1_f_c (f),..., Xn_f_c (f) from which the removed band f_cut components are removed are output to IFFT converters 222, 222,. .., Xn_f_c (f) are passed through the removal units 223, 223,... To the sound signal x1_f_c (on the time axis by IFFT processing, respectively. t),..., xn_f_c (t). The IFFT converters 222, 222,... Output the sound signals x1_f_c (t),..., Xn_f_c (t) to the sound output mechanisms 20, 20,.

図１６は、本発明の実施の形態３に係る音出力装置２が備える通過機構２２の除去部２２３のフィルタ係数を示すグラフである。図１６は、除去用係数フィルタＣＣＵＴ（ｆ）を示しており、横軸を周波数とし、縦軸をフィルタ係数として、その関係を示したグラフである。図１６に示すように除去用係数フィルタＣＣＵＴ（ｆ）は、人の発声に対応する除去帯域ｆ＿ｃｕｔのフィルタ係数が０．０であり、その他の帯域については１．０となっている。この様にして実施の形態３に係る音出力装置２は、除去帯域ｆ＿ｃｕｔを設定することで、出力する全てのチャネルの音から除去帯域ｆ＿ｃｕｔに対応する周波数成分を除去することになる。なお除去部２２３を設けるのではなく、帯域通過フィルタ部２２１，２２１，…が、除去帯域ｆ＿ｃｕｔに該当する周波数の成分を遮断する様に各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を設定するようにしても良い。 FIG. 16 is a graph showing filter coefficients of the removal unit 223 of the passage mechanism 22 included in the sound output device 2 according to Embodiment 3 of the present invention. FIG. 16 shows a removal coefficient filter CCUT (f), which is a graph showing the relationship with the horizontal axis representing frequency and the vertical axis representing filter coefficient. As shown in FIG. 16, in the removal coefficient filter CCUT (f), the filter coefficient of the removal band f_cut corresponding to human speech is 0.0, and 1.0 for other bands. In this way, the sound output device 2 according to Embodiment 3 sets the removal band f_cut, thereby removing the frequency component corresponding to the removal band f_cut from the sound of all the channels to be output. Instead of providing the removal unit 223, the band-pass filter units 221, 221,... Set the filter coefficients of the band-pass filter units 221, 221,... So that the frequency components corresponding to the removal band f_cut are blocked. You may make it do.

図１７は、本発明の実施の形態３に係るエコー抑圧装置１が備える抑圧機構１２の構成例を示す機能ブロック図である。実施の形態３に係る抑圧機構１２の補正部１２１は、観測音信号ｙ（ｔ）から除去帯域ｆ＿ｃｕｔに対応する周波数成分のみを通過させる除去帯域通過フィルタ部１２１４を、サブモジュールとして実行する。そして検出部１２１３は、除去帯域通過フィルタ部１２１４を通過した検出用観測音信号ｙ＿ｐ（ｔ）に基づいてシングルトークの状態かダブルトークの状態かを検出する。音出力装置２が複数チャネルの音を出力する音場が、音に対する線形性を保っている場合、観測音信号ｙ（ｔ）の除去帯域ｆ＿ｃｕｔには、出力した音に基づくエコー成分が含まれていない筈である。従って除去帯域ｆ＿ｃｕｔの成分のみを通過させた検出用観測音信号ｙ＿ｐ（ｔ）に基づいて、シングルトークの状態とダブルトークの状態とを検出する場合、実施の形態１に示した様に抑圧結果ｒ（ｔ）から検出する場合と比べて、高精度に状態の検出を行うことが可能である。 FIG. 17 is a functional block diagram showing a configuration example of the suppression mechanism 12 included in the echo suppression apparatus 1 according to Embodiment 3 of the present invention. The correction unit 121 of the suppression mechanism 12 according to Embodiment 3 executes, as a submodule, a removal band pass filter unit 1214 that passes only a frequency component corresponding to the removal band f_cut from the observed sound signal y (t). The detection unit 1213 detects whether the signal is in a single talk state or a double talk state based on the detection sound signal y_p (t) for detection that has passed through the removal band pass filter unit 1214. When the sound field in which the sound output device 2 outputs sound of a plurality of channels maintains linearity with respect to the sound, the removal band f_cut of the observed sound signal y (t) includes an echo component based on the output sound. It is not a habit. Therefore, when the single talk state and the double talk state are detected based on the detection sound signal y_p (t) for detection in which only the component of the removal band f_cut is passed, the suppression result as shown in the first embodiment. Compared to the case of detecting from r (t), it is possible to detect the state with higher accuracy.

図１８は、本発明の実施の形態３に係るエコー抑圧装置１が備える抑圧機構１２の除去帯域通過フィルタ部１２１４のフィルタ係数を示すグラフである。図１８は、除去帯域通過フィルタ部１２１４に設定された除去帯域通過係数フィルタＣＰＡＳＳ（ｆ）を示しており、横軸を周波数とし、縦軸をフィルタ係数として、その関係を示したグラフである。図１８に示すように除去帯域通過係数フィルタＣＰＡＳＳ（ｆ）は、除去帯域ｆ＿ｃｕｔのフィルタ係数が１．０であり、その他の帯域については０．０となっている。この様にして実施の形態３に係る除去帯域通過フィルタ部１２１４は、観測音信号ｙ（ｔ）から除去帯域ｆ＿ｃｕｔに対応する周波数成分のみを通過させることになる。 FIG. 18 is a graph showing filter coefficients of the removal band-pass filter unit 1214 of the suppression mechanism 12 included in the echo suppression apparatus 1 according to Embodiment 3 of the present invention. FIG. 18 shows the removal band pass coefficient filter CPASS (f) set in the removal band pass filter unit 1214, and is a graph showing the relationship with the horizontal axis representing frequency and the vertical axis representing filter coefficient. As shown in FIG. 18, in the removal band pass coefficient filter CPASS (f), the filter coefficient of the removal band f_cut is 1.0, and 0.0 for the other bands. In this way, the removal band pass filter unit 1214 according to Embodiment 3 passes only the frequency component corresponding to the removal band f_cut from the observation sound signal y (t).

次に本発明の実施の形態３に係るエコー抑圧システムが備える各装置の処理について説明する。図１９は、本発明の実施の形態３に係る音出力装置２の音出力処理の一例を示すフローチャートである。実施の形態３では、実施の形態１に係る音出力処理のフローチャートに示したステップＳ１０１〜Ｓ１０３の処理を実行後、通過機構２２の除去部２２３により、除去帯域ｆ＿ｃｕｔに対応する周波数成分を除去する処理を実行する。 Next, processing of each device provided in the echo suppression system according to Embodiment 3 of the present invention will be described. FIG. 19 is a flowchart showing an example of sound output processing of the sound output device 2 according to Embodiment 3 of the present invention. In the third embodiment, the frequency components corresponding to the removal band f_cut are removed by the removal unit 223 of the passing mechanism 22 after performing the processing of steps S101 to S103 shown in the flowchart of the sound output processing according to the first embodiment. Execute the process.

音出力装置２は、実施の形態１にて示したステップＳ１０１〜Ｓ１０３の処理を実行する。そして音出力装置２の通過機構２２は、除去部２２３により、帯域通過フィルタ部２２１，２２１，…を通過した周波数軸上の成分に変換されている複数の音信号Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）から、除去帯域ｆ＿ｃｕｔに該当する周波数帯域の成分を除去し（Ｓ５０１）、除去帯域ｆ＿ｃｕｔの成分を除去した信号Ｘ１＿ｆ＿ｃ（ｆ），…，Ｘｎ＿ｆ＿ｃ（ｆ）をＩＦＦＴ変換部２２２，２２２，…へ出力する。そして音出力装置２は、実施の形態１の音出力処理にて示したステップＳ１０４以降の処理を実行する。 The sound output device 2 executes the processes of steps S101 to S103 shown in the first embodiment. Then, the passing mechanism 22 of the sound output device 2 includes a plurality of sound signals X1_f (f),..., Xn_f converted by the removing unit 223 into components on the frequency axis that have passed through the bandpass filter units 221, 221. From (f), the components of the frequency band corresponding to the removal band f_cut are removed (S501), and the signals X1_f_c (f),. Output to…. Then, the sound output device 2 executes the processes after step S104 shown in the sound output process of the first embodiment.

実施の形態３に係る音入力装置３の音入力処理及びエコー抑圧装置１のエコー抑圧処理は、実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。ただしエコー抑圧処理において、抑圧機構１２の加算部１２０は、除去帯域ｆ＿ｃｕｔの成分が除去された複数チャネル分の音信号ｘ１＿ｆ＿ｃ（ｔ），…，ｘｎ＿ｆ＿ｃ（ｔ）を加算して参照音信号ｘ＿ｆ'（ｘ）を生成する。 Since the sound input process of the sound input device 3 and the echo suppression process of the echo suppression device 1 according to the third embodiment are the same as those in the first embodiment, the first embodiment is referred to and the description thereof is omitted. . However, in the echo suppression processing, the addition unit 120 of the suppression mechanism 12 adds the sound signals x1_f_c (t),..., Xn_f_c (t) for a plurality of channels from which the components of the removal band f_cut are removed to add the reference sound signal x_f ′ (X) is generated.

図２０は、本発明の実施の形態３に係るエコー抑圧装置１のフィルタ係数更新処理の一例を示すフローチャートである。エコー抑圧装置１の抑圧機構１２は、補正部１２１の処理として、検出部１２１３により、除去帯域通過フィルタ部１２１４を通過した検出用観測音信号ｙ＿ｐ（ｔ）に基づいて、話者が発声しているダブルトークの状態か発声していないシングルトークの状態かを検出し（Ｓ６０１）、検出した結果を示す検出結果をフィルタ係数更新部１２１２へ渡す。そしてエコー抑圧装置１は、実施の形態１のフィルタ係数更新処理にて示したステップＳ４０２以降の処理を実行する。 FIG. 20 is a flowchart showing an example of filter coefficient update processing of the echo suppression apparatus 1 according to Embodiment 3 of the present invention. In the suppression mechanism 12 of the echo suppression device 1, as processing of the correction unit 121, a speaker speaks based on the detection sound signal y_p (t) for detection that has passed through the removal band pass filter unit 1214 by the detection unit 1213. It is detected whether it is a double talk state or a single talk state that is not uttered (S601), and a detection result indicating the detection result is passed to the filter coefficient updating unit 1212. Then, the echo suppression apparatus 1 executes the processes after step S402 shown in the filter coefficient update process of the first embodiment.

実施の形態４．
実施の形態４は、実施の形態１において、カーナビゲーションシステムの発話スイッチの押下等の所定の操作を行った場合に限り、本発明のエコー抑圧方法を実行する形態である。なお以降の説明において、実施の形態１と同様の構成については、実施の形態１と同様の符号を付すものとし、その詳細な説明を省略する。Embodiment 4 FIG.
The fourth embodiment is a form in which the echo suppression method of the present invention is executed only when a predetermined operation such as pressing a speech switch of the car navigation system is performed in the first embodiment. In the following description, components similar to those in the first embodiment are denoted by the same reference numerals as those in the first embodiment, and detailed description thereof is omitted.

図２１は、本発明の実施の形態４に係るエコー抑圧システムの構成例を模式的に示すブロック図である。実施の形態４に係る音出力装置２は、発話スイッチとして機能する操作機構２３と、操作機構２３の操作に基づき信号経路の切替処理を行うスイッチ等の切替機構２４，２４，…を備えている。切替機構２４，２４，…は、音信号生成機構２１から通過機構２２へ複数チャネルの音信号を伝送する各信号線上に夫々配設されており、音信号生成機構２１から出力された複数チャネルの音信号を、通過機構２２又は複数の音出力機構２０，２０，…へ出力する。 FIG. 21 is a block diagram schematically showing a configuration example of an echo suppression system according to Embodiment 4 of the present invention. The sound output device 2 according to the fourth embodiment includes an operation mechanism 23 that functions as an utterance switch, and switching mechanisms 24, 24,... Such as a switch that performs a signal path switching process based on the operation of the operation mechanism 23. . The switching mechanisms 24, 24,... Are respectively disposed on the signal lines for transmitting a plurality of channels of sound signals from the sound signal generation mechanism 21 to the passage mechanism 22, and the plurality of channels output from the sound signal generation mechanism 21. The sound signal is output to the passage mechanism 22 or the plurality of sound output mechanisms 20, 20,.

話者が操作機構２３に対する押下等の操作を行った場合、操作機構２３は、切替機構２４，２４，…へ操作信号を出力し、切替機構２４，２４，…は、操作信号を入力中又は操作信号の入力から一定時間の間、音信号生成機構２１から出力された複数チャネルの音信号を通過機構２２へ出力するように信号経路の切替処理を行う。複数チャネルの音信号が通過機構２２へ出力されている間、本発明のエコー抑圧システムは、実施の形態１にて説明した各種処理を行う。 When the speaker performs an operation such as pressing the operation mechanism 23, the operation mechanism 23 outputs an operation signal to the switching mechanisms 24, 24,..., And the switching mechanisms 24, 24,. A signal path switching process is performed so that a plurality of channels of sound signals output from the sound signal generating mechanism 21 are output to the passing mechanism 22 for a predetermined time from the input of the operation signal. While the sound signals of a plurality of channels are being output to the passing mechanism 22, the echo suppression system of the present invention performs various processes described in the first embodiment.

話者が操作機構２３に対する操作を行っていない場合、又は操作から一定時間が経過した場合、切替機構２４，２４，…は、音信号生成機構２１から出力された複数チャネルの音信号を音出力機構２０，２０，…へ出力する様に信号経路を設定する。 When the speaker does not operate the operation mechanism 23, or when a certain time has elapsed since the operation, the switching mechanisms 24, 24, ... output sound signals of a plurality of channels output from the sound signal generation mechanism 21 as sound outputs. A signal path is set so as to output to the mechanisms 20, 20,.

この様に実施の形態４では、話者が操作を行った場合にのみ本発明のエコー抑圧方法を実行し、操作を行っていない通常時においては、音信号生成機構２１から出力された複数チャネルの音信号をそのまま音出力機構２０，２０，…へ出力することにより、音信号の周波数成分の選択通過が行われないため、出力する音の音質を維持することができる。 As described above, in the fourth embodiment, the echo suppression method of the present invention is executed only when the speaker performs an operation, and in a normal time when the operation is not performed, a plurality of channels output from the sound signal generating mechanism 21 are output. Are output as they are to the sound output mechanisms 20, 20,..., So that the frequency components of the sound signals are not selectively passed, so that the sound quality of the output sound can be maintained.

実施の形態５．
実施の形態５は、実施の形態１において、音信号間の相関に基づいて通過機構による処理を動的に変更する形態である。Embodiment 5 FIG.
The fifth embodiment is a form in which the processing by the passage mechanism is dynamically changed based on the correlation between sound signals in the first embodiment.

以降の説明において、実施の形態１と同様の構成については、実施の形態１と同様の符号を付すものとし、その詳細な説明を省略する。実施の形態５におけるエコー抑圧システムの構成例は、図４を用いて示した実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 In the following description, components similar to those in the first embodiment are denoted by the same reference numerals as those in the first embodiment, and detailed description thereof is omitted. Since the configuration example of the echo suppression system in the fifth embodiment is the same as that in the first embodiment shown in FIG. 4, the first embodiment is referred to and the description thereof is omitted.

図２２は、本発明の実施の形態５に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。実施の形態５に係る通過機構２２は、フレーム単位で周波数軸上の成分に変換されている夫々のチャネルの音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）の振幅を比較し、比較した振幅の相関に基づいて各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を動的に変更させる比較部２２４を備えている。なお各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）の振幅を比較するのではなく、振幅の二乗であるパワーを振幅の代替値として比較する様にしても良い。 FIG. 22 is a functional block diagram showing a configuration example of the passing mechanism 22 of the sound output device 2 according to Embodiment 5 of the present invention. The passing mechanism 22 according to the fifth embodiment compares the amplitudes of the sound signals X1 (f),..., Xn (f) of the respective channels converted into components on the frequency axis in units of frames, and compares the compared amplitudes. Is provided with a comparison unit 224 that dynamically changes the filter coefficients of the bandpass filter units 221, 221. Instead of comparing the amplitudes of the sound signals X1 (f),..., Xn (f), the power that is the square of the amplitude may be compared as an alternative value of the amplitude.

比較部２２４は、所定数のフレーム単位又は所定の時間間隔で、周波数帯域毎に各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関を導出する。相関の導出は、夫々の音信号の振幅の大きさを比較することにより行われる。そして比較により他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在すると判定した場合、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱いと判断し、またその場合に最も振幅が大きい一の音信号を導出する。一の音信号と他の音信号との振幅の比較は、例えば他の音信号の振幅に対する一の音信号の振幅の比、一の音信号の振幅から他の音信号の振幅を減じた差、振幅の絶対値の比或いは差、振幅の二乗であるパワーの比或いは差等の所定の方法で比較した比較結果が、予め設定されている所定値以上であるか否かを判定することにより行われる。比較に用いる所定値は、一の音信号の振幅に対して他の全ての音信号の振幅が無視できる程度に小さいと言える条件を満たす様に設定されている。 The comparison unit 224 derives a correlation between the sound signals X1 (f),..., Xn (f) for each frequency band at a predetermined number of frames or at predetermined time intervals. The correlation is derived by comparing the amplitudes of the sound signals. If it is determined by comparison that there is one sound signal having an amplitude greater than a predetermined value relative to the amplitude of all other sound signals, the correlation between the sound signals X1 (f),..., Xn (f). Is determined to be weak, and in that case, one sound signal having the largest amplitude is derived. The comparison of the amplitude of one sound signal with another sound signal is, for example, the ratio of the amplitude of one sound signal to the amplitude of the other sound signal, or the difference obtained by subtracting the amplitude of the other sound signal from the amplitude of the one sound signal. By determining whether or not the comparison result compared by a predetermined method such as a ratio or difference of amplitude absolute values or a power ratio or difference which is the square of amplitude is greater than or equal to a predetermined value set in advance Done. The predetermined value used for the comparison is set so as to satisfy a condition that it can be said that the amplitude of all other sound signals is small enough to be ignored with respect to the amplitude of one sound signal.

例えば周波数ｆにおける第ｎチャネルｎｃｈの振幅を、ｎｃｈ（ｆ）で表し、所定値をαとすると、下記の式（１）が成立する場合、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱く、最も振幅が大きい音信号は第ｎチャネルｎｃｈの音信号Ｘｎ（ｆ）であると判定する。 For example, when the amplitude of the n-th channel nch at the frequency f is represented by nch (f) and the predetermined value is α, each sound signal X1 (f),..., Xn (f ) Is weak and the sound signal having the largest amplitude is determined to be the sound signal Xn (f) of the nth channel nch.

ｎｃｈ（ｆ）／１ｃｈ（ｆ）≧α，ｎｃｈ（ｆ）／２ｃｈ（ｆ）≧α，… 式（１） nch (f) / 1ch (f) ≧ α, nch (f) / 2ch (f) ≧ α, Formula (1)

比較部２２４は、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱いと判定した周波数ｆを示す偏向周波数ｆｄにおける各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を導出する。比較部２２４は、振幅が最大であるチャネルの音信号に係る帯域通過フィルタ部２２１のフィルタ係数として１．０を導出し、他のチャネルの音信号に係る帯域通過フィルタ部２２１のフィルタ係数として０．０を導出する。そして比較部２２４は偏向周波数ｆｄにおける各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を、夫々対応する帯域通過フィルタ部２２１，２２１，…へ渡す。 The comparison unit 224 derives filter coefficients of the band-pass filter units 221, 221,... At the deflection frequency fd indicating the frequency f determined that the correlation between the sound signals X1 (f),..., Xn (f) is weak. To do. The comparison unit 224 derives 1.0 as the filter coefficient of the band pass filter unit 221 related to the sound signal of the channel having the maximum amplitude, and 0 as the filter coefficient of the band pass filter unit 221 related to the sound signal of the other channel. .0 is derived. The comparison unit 224 passes the filter coefficients of the band pass filter units 221, 221,... At the deflection frequency fd to the corresponding band pass filter units 221, 221,.

帯域通過フィルタ部２２１，２２１，…では、偏向周波数ｆｄにおけるフィルタ係数を受け付けた場合、偏向周波数ｆｄにおけるフィルタ係数を、受け付けたフィルタ係数に変更する。即ち振幅が最大である音信号のみを通過させる様に設定する。 When the band pass filter units 221, 221,... Accept filter coefficients at the deflection frequency fd, the filter coefficients at the deflection frequency fd are changed to the accepted filter coefficients. That is, it is set so that only the sound signal having the maximum amplitude is passed.

比較部２２４の比較により、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在しないと判定した場合、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が強いと判断する。相関が強いと判断した場合、比較部２２４は、偏向周波数ｆｄ及びフィルタ係数の導出を行わない。従って帯域通過フィルタ部２２１，２２１，…では、前述した実施の形態１と同様の処理を実行することになる。 When it is determined by comparison of the comparison unit 224 that there is no one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals, the sound signals X1 (f),. It is judged that the correlation between f) is strong. When determining that the correlation is strong, the comparison unit 224 does not derive the deflection frequency fd and the filter coefficient. Therefore, the band pass filter units 221, 221,... Execute the same processing as in the first embodiment.

各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が低く、一の音信号に係る振幅が他の音信号に係る振幅に対して大きくなる状況は、例えば複数チャネルのオーケストラ演奏を再生する場合における、いずれかのチャネルに出力が偏ったソロ演奏時に発生する。この様な状況において、ソロ演奏を行う楽器に係る音信号が帯域通過フィルタ部２２１，２２１，…で抑制された場合、聴取者は出力される音に違和感を覚えることになる。実施の形態５では、この様な状況において、ソロ演奏を行う楽器に係る音信号に基づく音を、出力する音として選択することになるので、違和感の少ない音を出力させることが可能となる。 A situation in which the correlation between the sound signals X1 (f),..., Xn (f) is low and the amplitude related to one sound signal is larger than the amplitude related to the other sound signal is, for example, a multi-channel orchestra performance. Occurs during a solo performance in which the output is biased to one of the channels during playback. In such a situation, if the sound signal related to the instrument performing the solo performance is suppressed by the band-pass filter units 221, 221,..., The listener will feel uncomfortable with the output sound. In the fifth embodiment, in such a situation, the sound based on the sound signal related to the musical instrument performing the solo performance is selected as the output sound, so that it is possible to output a sound with less sense of incongruity.

次に本発明の実施の形態５に係るエコー抑圧システムが備える各装置の処理について説明する。図２３は、本発明の実施の形態５に係る音出力装置２の音出力処理の一例を示すフローチャートである。実施の形態５では、実施の形態１に係る音出力処理のステップＳ１０１〜Ｓ１０２の処理を実行する。そして音出力装置２の通過機構２２は、比較部２２４により、周波数帯域毎に各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）の振幅の大きさを比較し、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱く、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在するか否かを判定する（Ｓ７０１）。 Next, processing of each device provided in the echo suppression system according to Embodiment 5 of the present invention will be described. FIG. 23 is a flowchart showing an example of sound output processing of the sound output device 2 according to Embodiment 5 of the present invention. In the fifth embodiment, the processes of steps S101 to S102 of the sound output process according to the first embodiment are executed. Then, the passing mechanism 22 of the sound output device 2 uses the comparison unit 224 to compare the amplitudes of the sound signals X1 (f),..., Xn (f) for each frequency band, and each sound signal X1 (f). ,..., Xn (f) are weak, and it is determined whether or not there is one sound signal having an amplitude greater than a predetermined value with respect to the amplitude of all other sound signals (S701).

ステップＳ７０１において、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱く、他の全ての音信号の振幅に対して所定値以上大きい振幅を有する一の音信号が存在すると判定した場合（Ｓ７０１：ＹＥＳ）、音出力装置２の通過機構２２は、比較部２２４により、当該周波数ｆ、即ち偏向周波数ｆｄにおいて、最も振幅が大きい一の音信号を特定し、特定した音信号に基づいて各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を導出し（Ｓ７０２）、導出したフィルタ係数を夫々対応する帯域通過フィルタ部２２１，２２１，…へ偏向周波数ｆｄにおけるフィルタ係数として渡す。ステップＳ７０２では、最も振幅が大きい一の音信号に係る帯域通過フィルタ部２２１のフィルタ係数が１．０として導出され、他の音信号に係る帯域通過フィルタ部２２１，２２１，…のフィルタ係数が０．０として導出される。 In step S701, it is determined that the correlation between the sound signals X1 (f),..., Xn (f) is weak, and there is one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals. In the case (S701: YES), the passage mechanism 22 of the sound output device 2 uses the comparison unit 224 to identify one sound signal having the largest amplitude at the frequency f, that is, the deflection frequency fd, and use the identified sound signal. Are derived as the filter coefficients at the deflection frequency fd to the corresponding band-pass filter sections 221, 221,..., Respectively. In step S702, the filter coefficient of the band-pass filter unit 221 relating to one sound signal having the largest amplitude is derived as 1.0, and the filter coefficient of the band-pass filter units 221, 221,. Derived as .0.

音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…の偏向周波数ｆｄにおけるフィルタ係数の設定を、夫々受け付けたフィルタ係数に変更する（Ｓ７０３）。そして音出力装置２は、実施の形態１の音出力処理にて示したステップＳ１０３以降の処理を実行する。 The passing mechanism 22 of the sound output device 2 changes the setting of the filter coefficient at the deflection frequency fd of each of the band-pass filter units 221, 221... To the accepted filter coefficient (S703). Then, the sound output device 2 executes the processes after step S103 shown in the sound output process of the first embodiment.

ステップＳ７０１において、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が強く、他の全ての音信号の振幅に対して所定値以上大きい振幅を有する一の音信号が存在しないと判定した場合（Ｓ７０１：ＮＯ）、音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数の設定の変更を解除する（Ｓ７０４）。ステップＳ７０４の変更の解除とは、ステップＳ７０３でフィルタ係数の設定が変更されていた場合に、その変更された設定を解除して、元のフィルタ係数の設定に戻す処理である。従ってフィルタ係数が変更されていない場合、実質的な処理は行われない。但し、フィルタ係数の設定が変更されていたとしても、当該周波数において、一の音信号の成分のみを通過させるという本来の処理は行われているため、必ずしも元のフィルタ係数の設定に戻す必要はない。そして音出力装置２は、実施の形態１の音出力処理にて示したステップＳ１０３以降の処理を実行する。 In step S701, if there is a strong correlation between the sound signals X1 (f),..., Xn (f) and there is no sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals. When it determines (S701: NO), the passage mechanism 22 of the sound output device 2 cancels the change of the filter coefficient setting of each bandpass filter unit 221,221,... (S704). Canceling the change in step S704 is a process of canceling the changed setting and returning it to the original filter coefficient setting when the filter coefficient setting has been changed in step S703. Therefore, when the filter coefficient is not changed, no substantial processing is performed. However, even if the setting of the filter coefficient is changed, the original process of passing only the component of one sound signal at that frequency is performed, so it is not always necessary to return to the original setting of the filter coefficient. Absent. Then, the sound output device 2 executes the processes after step S103 shown in the sound output process of the first embodiment.

図２４は、本発明の実施の形態５に係る音出力装置２が備える通過機構２２の処理の例を概念的に示す説明図である。図２４は、ある時点における周波数ｆに対する振幅の大きさを音信号に係るチャネル毎に示したグラフであり、図２４（ａ）は、第１チャネルｃｈ１（ｆ）、図２４（ｂ）は、第２チャネルｃｈ２（ｆ）、そして図２４（ｃ）は、第ｎチャネルｃｈｎ（ｆ）を夫々示している。周波数ｆ１では、各チャネル間の相関が高いため、通過機構２２では、予め設定されているフィルタ係数に基づいて処理が行われる。周波数ｆ２では、相関が低く、第１チャネルｃｈ１（ｆ）に係る音信号の振幅が大きいため、第１チャネルｃｈ１（ｆ）に係る帯域通過フィルタ部２２１のフィルタ係数は１．０となり、他のチャネルに係る帯域通過フィルタ部２２１，２２１，…のフィルタ係数は０．０となる。また周波数ｆ３では、相関が低く、第２チャネルｃｈ２（ｆ）に係る音信号の振幅が大きいため、第２チャネルｃｈ２（ｆ）に係る帯域通過フィルタ部２２１のフィルタ係数は１．０となり、他のチャネルに係る帯域通過フィルタ部２２１，２２１，…のフィルタ係数は０．０となる。さらに周波数ｆ４では、相関が低く、第ｎチャネルｃｈｎ（ｆ）に係る音信号の振幅が大きいため、第ｎチャネルｃｈｎ（ｆ）に係る帯域通過フィルタ部２２１のフィルタ係数は１．０となり、他のチャネルに係る帯域通過フィルタ部２２１，２２１，…のフィルタ係数は０．０となる。 FIG. 24 is an explanatory diagram conceptually showing an example of processing of the passage mechanism 22 provided in the sound output device 2 according to Embodiment 5 of the present invention. FIG. 24 is a graph showing the magnitude of the amplitude with respect to the frequency f at a certain point of time for each channel related to the sound signal. FIG. 24A shows the first channel ch1 (f), and FIG. The second channel ch2 (f) and FIG. 24C show the nth channel chn (f), respectively. Since the correlation between the channels is high at the frequency f1, the passage mechanism 22 performs processing based on a preset filter coefficient. At the frequency f2, since the correlation is low and the amplitude of the sound signal related to the first channel ch1 (f) is large, the filter coefficient of the band pass filter unit 221 related to the first channel ch1 (f) is 1.0. The filter coefficients of the band pass filter units 221, 221 and so on relating to the channel are 0.0. Further, at the frequency f3, since the correlation is low and the amplitude of the sound signal related to the second channel ch2 (f) is large, the filter coefficient of the band pass filter unit 221 related to the second channel ch2 (f) is 1.0. The filter coefficients of the band-pass filter units 221, 221. Furthermore, at the frequency f4, since the correlation is low and the amplitude of the sound signal related to the n-th channel chn (f) is large, the filter coefficient of the band-pass filter unit 221 related to the n-th channel chn (f) is 1.0. The filter coefficients of the band-pass filter units 221, 221.

実施の形態６．
実施の形態６は、実施の形態５において、フィルタ係数の設定方法を変更する形態である。Embodiment 6 FIG.
The sixth embodiment is a form in which the filter coefficient setting method is changed in the fifth embodiment.

以降の説明において、実施の形態５又は実施の形態５の元となる実施の形態１と同様の構成については、実施の形態５又は実施の形態１を参照するものとし、その説明を省略する。実施の形態６におけるエコー抑圧システムの構成例は、図４を用いて示した実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 In the following description, the fifth embodiment or the first embodiment is referred to for the same configuration as the first embodiment that is the basis of the fifth embodiment or the fifth embodiment, and the description thereof is omitted. Since the configuration example of the echo suppression system in the sixth embodiment is the same as that in the first embodiment shown in FIG. 4, the first embodiment is referred to and the description thereof is omitted.

図２５は、本発明の実施の形態６に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。通過機構２２が備える比較部２２４は、実施の形態５と同様に、所定数のフレーム単位又は所定の時間間隔で、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在するか否かを判定し、存在すると判定した場合、当該周波数、即ち偏向周波数ｆｄを各帯域通過フィルタ部２２１，２２１，…へ渡す。 FIG. 25 is a functional block diagram showing a configuration example of the passage mechanism 22 of the sound output device 2 according to Embodiment 6 of the present invention. As in the fifth embodiment, the comparison unit 224 included in the passing mechanism 22 has an amplitude larger than a predetermined value by a predetermined number of frames or a predetermined time interval. , And if it is determined that the sound signal is present, the frequency, that is, the deflection frequency fd is passed to each of the bandpass filter units 221, 221,.

帯域通過フィルタ部２２１，２２１，…では、偏向周波数ｆｄを受け付けた場合、受け付けた偏向周波数ｆｄにおけるフィルタ係数を１．０に変更する。この様に全ての帯域通過フィルタ部２２１，２２１，…のフィルタ係数を１．０に設定することにより、全ての音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）を通過させることになる。一の音信号の振幅に対して、他の全ての音信号の振幅は、無視できる程度に小さいため、全ての音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）を通過させた場合でも、実質的に実施の形態５と同様の効果が得られる。 When the deflection frequency fd is received, the band pass filter units 221, 221,... Change the filter coefficient at the received deflection frequency fd to 1.0. In this way, by setting the filter coefficients of all the band pass filter units 221, 221... To 1.0, all the sound signals X1 (f),. Since the amplitudes of all other sound signals are negligibly small relative to the amplitude of one sound signal, even if all the sound signals X1 (f),..., Xn (f) are passed through, Thus, the same effect as in the fifth embodiment can be obtained.

比較部２２４の比較により、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在しないと判定した場合、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が強いと判断する。相関が強いと判断した場合、比較部２２４は、偏向周波数ｆｄの導出を行わない。従って帯域通過フィルタ部２２１，２２１，…では、前述した実施の形態１と同様の処理を実行することになる。なお比較部２２４から帯域通過フィルタ部２２１，２２１，…へ、偏向周波数ｆｄのみを渡すのではなく、偏向周波数ｆｄにおける各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を、夫々対応する帯域通過フィルタ部２２１，２２１，…へ渡す様にしても良い。この場合、比較部２２４から帯域通過フィルタ部２２１，２２１，…へ渡すフィルタ係数は、全て１．０である。 When it is determined by comparison of the comparison unit 224 that there is no one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals, the sound signals X1 (f),. It is judged that the correlation between f) is strong. When it is determined that the correlation is strong, the comparison unit 224 does not derive the deflection frequency fd. Therefore, the band pass filter units 221, 221,... Execute the same processing as in the first embodiment. In addition, not only the deflection frequency fd is passed from the comparison unit 224 to the bandpass filter units 221, 221,..., But the filter coefficients of the bandpass filter units 221, 221,. You may make it pass to filter part 221,221, .... In this case, all the filter coefficients passed from the comparison unit 224 to the band pass filter units 221, 221... Are 1.0.

次に本発明の実施の形態６に係るエコー抑圧システムが備える各装置の処理について説明する。図２６は、本発明の実施の形態６に係る音出力装置２の音出力処理の一例を示すフローチャートである。実施の形態６では、実施の形態１に係る音出力処理のステップＳ１０１〜Ｓ１０２の処理を実行する。そして音出力装置２の通過機構２２は、比較部２２４により、周波数帯域毎に各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）の振幅の大きさを比較し、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱く、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在するか否かを判定する（Ｓ８０１）。 Next, processing of each device provided in the echo suppression system according to Embodiment 6 of the present invention will be described. FIG. 26 is a flowchart showing an example of sound output processing of the sound output device 2 according to Embodiment 6 of the present invention. In the sixth embodiment, the processes of steps S101 to S102 of the sound output process according to the first embodiment are executed. Then, the passing mechanism 22 of the sound output device 2 uses the comparison unit 224 to compare the amplitudes of the sound signals X1 (f),..., Xn (f) for each frequency band, and each sound signal X1 (f). ,..., Xn (f) are weak, and it is determined whether or not there is one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals (S801).

ステップＳ８０１において、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が弱く、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在すると判定した場合（Ｓ８０１：ＹＥＳ）、音出力装置２の通過機構２２は、比較部２２４から各帯域通過フィルタ部２２１，２２１，…へ、偏向周波数ｆｄを渡す。 In step S801, if the correlation between the sound signals X1 (f),..., Xn (f) is weak and there is one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals. If it is determined (S801: YES), the pass mechanism 22 of the sound output device 2 passes the deflection frequency fd from the comparison unit 224 to each of the band pass filter units 221, 221,.

音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…の偏向周波数ｆｄにおけるフィルタ係数の設定を夫々１．０に変更する（Ｓ８０２）。そして音出力装置２は、実施の形態１の音出力処理にて示したステップＳ１０３以降の処理を実行する。 The pass mechanism 22 of the sound output device 2 changes the setting of the filter coefficient at the deflection frequency fd of each of the band pass filter units 221, 221, ... to 1.0 (S802). Then, the sound output device 2 executes the processes after step S103 shown in the sound output process of the first embodiment.

ステップＳ８０１において、各音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）間の相関が強く、他の全ての音信号の振幅に対して、所定値以上大きい振幅を有する一の音信号が存在しないと判定した場合（Ｓ８０１：ＮＯ）、音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数の設定の変更を解除する（Ｓ８０３）。ステップＳ８０３の変更の解除とは、ステップＳ８０２でフィルタ係数の設定が変更されていた場合に、その変更された設定を解除して、元のフィルタ係数の設定に戻す処理である。従ってフィルタ係数が変更されていない場合、実質的な処理は行われない。そして音出力装置２は、実施の形態１の音出力処理にて示したステップＳ１０３以降の処理を実行する。 In step S801, the correlation between the sound signals X1 (f),..., Xn (f) is strong, and there is no one sound signal having an amplitude greater than a predetermined value with respect to the amplitudes of all other sound signals. (S801: NO), the pass mechanism 22 of the sound output device 2 cancels the change of the filter coefficient setting of each of the band pass filter units 221, 221, ... (S803). Canceling the change in step S803 is a process of canceling the changed setting and returning it to the original filter coefficient setting when the filter coefficient setting has been changed in step S802. Therefore, when the filter coefficient is not changed, no substantial processing is performed. Then, the sound output device 2 executes the processes after step S103 shown in the sound output process of the first embodiment.

ステップＳ８０１〜Ｓ８０３の処理後に実行されるステップＳ１０３の処理として、音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…により、周波数軸上の成分に変換された複数の音信号Ｘ１（ｆ），…，Ｘｎ（ｆ）に対し、夫々異なる周波数帯域の成分Ｘ１＿ｆ（ｆ），…，Ｘｎ＿ｆ（ｆ）を通過させる。なお各周波数ｆにおいて、通過する音信号は一つであるが、偏向周波数ｆｄの成分については、全ての音信号Ｘ１（ｆｄ），…，Ｘｎ（ｆｄ）が通過する。 As a process of step S103 executed after the processes of steps S801 to S803, the pass mechanism 22 of the sound output device 2 uses a plurality of sounds converted into components on the frequency axis by the band pass filter units 221, 221. The components X1_f (f),..., Xn_f (f) in different frequency bands are passed through the signals X1 (f),. Note that at each frequency f, there is only one sound signal passing through, but all the sound signals X1 (fd),..., Xn (fd) pass through the component of the deflection frequency fd.

実施の形態７．
実施の形態７は、実施の形態１において、各帯域通過フィルタ部のフィルタ係数を経時的に変化させる形態である。Embodiment 7 FIG.
The seventh embodiment is a form in which the filter coefficient of each bandpass filter unit is changed over time in the first embodiment.

以降の説明において、実施の形態１と同様の構成については、実施の形態１と同様の符号を付すものとし、その詳細な説明を省略する。実施の形態７におけるエコー抑圧システムの構成例は、図４を用いて示した実施の形態１と同様であるので、実施の形態１を参照するものとし、その説明を省略する。 In the following description, components similar to those in the first embodiment are denoted by the same reference numerals as those in the first embodiment, and detailed description thereof is omitted. Since the configuration example of the echo suppression system in the seventh embodiment is the same as that in the first embodiment shown in FIG. 4, the first embodiment is referred to and the description thereof is omitted.

図２７は、本発明の実施の形態７に係る音出力装置２の通過機構２２の構成例を示す機能ブロック図である。実施の形態７に係る通過機構２２は、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を変更する係数部２２５を備えている。 FIG. 27 is a functional block diagram showing a configuration example of the passing mechanism 22 of the sound output device 2 according to Embodiment 7 of the present invention. The pass mechanism 22 according to the seventh embodiment includes a coefficient unit 225 that changes the filter coefficient of each bandpass filter unit 221, 221.

係数部２２５は、時刻を取得する図示しない時計回路を有し、所定の時間間隔で、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を導出する。係数部２２５は、一の帯域通過フィルタ部２２１のフィルタ係数として１．０を導出し、他の帯域通過フィルタ部２２１，２２１，…のフィルタ係数として０．０を導出する。なおフィルタ係数が１．０となる帯域通過フィルタ部２２１は毎回変化する。係数部２２５は、例えば予め時間に対応付けてフィルタ係数を記録したテーブルの使用、フィルタ係数を出力する所定の関数の使用等の様々な方法によりフィルタ係数を導出する。そして導出した各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を、夫々対応する帯域通過フィルタ部２２１，２２１，…へ渡す。 The coefficient unit 225 has a clock circuit (not shown) that acquires time, and derives filter coefficients of the band-pass filter units 221, 221,... At a predetermined time interval. The coefficient unit 225 derives 1.0 as the filter coefficient of one band pass filter unit 221, and derives 0.0 as the filter coefficient of the other band pass filter units 221, 221,. Note that the band-pass filter unit 221 having a filter coefficient of 1.0 changes every time. The coefficient unit 225 derives the filter coefficient by various methods such as using a table in which the filter coefficient is recorded in advance in association with time and using a predetermined function that outputs the filter coefficient. Then, the derived filter coefficients of the band pass filter units 221, 221,... Are passed to the corresponding band pass filter units 221, 221,.

帯域通過フィルタ部２２１，２２１，…では、夫々フィルタ係数を受け付けたフィルタ係数に変更する。係数部２２５は、所定の時間間隔で毎回変更されるフィルタ係数を導出することにより、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数は、所定の時間間隔で、即ち経時的に変更されることになる。またこれにより通過機構２２は、通過させる音信号毎の周波数帯域の成分を、経時的に変更することになる。なお時計回路が示す時刻に基づいてフィルタ係数を変更するのではなく、所定のフレーム数毎にフィルタ係数を変更することで、経時的に変更する様にしても良い。 In the band pass filter units 221, 221 and so on, the filter coefficients are changed to the accepted filter coefficients. The coefficient unit 225 derives filter coefficients that are changed every time at a predetermined time interval, so that the filter coefficients of the bandpass filter units 221, 221,... Are changed at a predetermined time interval, that is, with time. It will be. Further, the passing mechanism 22 changes the frequency band component for each sound signal to be passed with time. Instead of changing the filter coefficient based on the time indicated by the clock circuit, the filter coefficient may be changed over time by changing the filter coefficient every predetermined number of frames.

次に本発明の実施の形態７に係るエコー抑圧システムが備える各装置の処理について説明する。図２８は、本発明の実施の形態７に係る音出力装置２の係数変更処理の一例を示すフローチャートである。音出力装置２の通過機構２２は、係数部２２５により、時計回路を参照し、予め設定されている所定時間が経過したか否かを判定する（Ｓ９０１）。 Next, processing of each device provided in the echo suppression system according to Embodiment 7 of the present invention will be described. FIG. 28 is a flowchart showing an example of coefficient change processing of the sound output device 2 according to Embodiment 7 of the present invention. The passage mechanism 22 of the sound output device 2 uses the coefficient unit 225 to refer to the clock circuit and determine whether a predetermined time set in advance has elapsed (S901).

ステップＳ９０１において、所定時間が経過したと判定した場合（Ｓ９０１：ＹＥＳ）、音出力装置２の通過機構２２は、係数部２２５により、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数を導出し（Ｓ９０２）、導出したフィルタ係数を夫々対応する帯域通過フィルタ部２２１，２２１，…へ渡す。ステップＳ９０１において、所定時間が経過していないと判定した場合（Ｓ９０１：ＮＯ）、ステップＳ９０１の処理を繰り返す。 If it is determined in step S901 that the predetermined time has elapsed (S901: YES), the pass mechanism 22 of the sound output device 2 derives the filter coefficients of the band pass filter units 221, 221. (S902), the derived filter coefficients are passed to the corresponding band-pass filter units 221, 221. If it is determined in step S901 that the predetermined time has not elapsed (S901: NO), the process of step S901 is repeated.

音出力装置２の通過機構２２は、各帯域通過フィルタ部２２１，２２１，…のフィルタ係数の設定を夫々変更する（Ｓ９０３）。係数変更処理により、通過機構２２は、通過させる音信号毎の周波数帯域の成分を、経時的に変更することになる。 The pass mechanism 22 of the sound output device 2 changes the setting of the filter coefficient of each band pass filter unit 221, 221... (S903). By the coefficient changing process, the passing mechanism 22 changes the frequency band component for each sound signal to be passed with time.

図２９は、本発明の実施の形態７に係る音出力装置２が備える通過機構２２の各帯域通過フィルタ部２２１，２２１，…のフィルタ係数の経時変化の例を示す説明図である。図２９（ａ），（ｂ），（ｃ）は、夫々第１チャネル１ｃｈ、第２チャネル２ｃｈ、及び第ｎチャネルｎｃｈの音信号Ｘ１（ｆ）、Ｘ２（ｆ）、及びＸｎ（ｆ）に対する係数フィルタＣ１（ｆ）、Ｃ２（ｆ）、及びＣｎ（ｆ）を示しており、横軸を周波数ｆとし、縦軸をフィルタ係数として、その関係を示したグラフである。そして図２９（ｄ），（ｅ），（ｆ）は、夫々図２９（ａ），（ｂ），（ｃ）に示す状態から所定時間経過後の第１チャネル１ｃｈ、第２チャネル２ｃｈ、及び第ｎチャネルｎｃｈの音信号Ｘ１（ｆ）、Ｘ２（ｆ）、及びＸｎ（ｆ）に対する係数フィルタＣ１（ｆ）、Ｃ２（ｆ）、及びＣｎ（ｆ）を示している。図２９（ａ），（ｂ），（ｃ）と、図２９（ｄ），（ｅ），（ｆ）とを比較すると明らかな様に、各チャネルの係数フィルタの値、即ちフィルタ係数が経時的に変更されている。そして通過する音信号毎の周波数帯域の成分が、経時的に変化することになる。 FIG. 29 is an explanatory diagram illustrating an example of a change over time of the filter coefficients of the band pass filter units 221, 221... Of the pass mechanism 22 included in the sound output device 2 according to Embodiment 7 of the invention. FIGS. 29A, 29B, and 29C show the sound signals X1 (f), X2 (f), and Xn (f) of the first channel 1ch, the second channel 2ch, and the nth channel nch, respectively. The coefficient filters C1 (f), C2 (f), and Cn (f) are shown, and the horizontal axis is the frequency f and the vertical axis is the filter coefficient. 29 (d), (e), and (f) show the first channel 1ch, the second channel 2ch, and the second channel after a predetermined time has elapsed from the states shown in FIGS. 29 (a), (b), and (c), respectively. The coefficient filters C1 (f), C2 (f), and Cn (f) for the sound signals X1 (f), X2 (f), and Xn (f) of the n-th channel nch are shown. As is clear from comparison of FIGS. 29A, 29B and 29C with FIGS. 29D, 29E and 29F, the value of the coefficient filter of each channel, that is, the filter coefficient is a function of time. Has been changed. And the component of the frequency band for every sound signal to pass changes with time.

前記実施の形態１乃至７は、本発明の無限にある実施の形態の一部を例示したに過ぎず、各種ハードウェア及びソフトフェア等の構成は、適宜設定することが可能であり、また例示した基本的な処理以外にも様々な処理を組み合わせることが可能である。例えば本発明のエコー抑圧システムを、ナビゲーションシステム、テレビ会議システム以外の音声、音響に係る様々なシステムに適用することが可能であり、更にはエコー抑圧装置、音出力装置、音入力装置及び音処理装置の全て又は適宜選択された二或いは三の装置を一の装置として構成する様にしても良い。また音出力装置を、音信号を生成する音信号生成装置と、音出力装置とに分割する等、一の装置を複数の装置として構成する様にしても良い。さらに前記実施の形態１乃至７では、フィルタ係数を０．０又は１．０として櫛形フィルタを形成する形態を示したが、０．０以上１．０以下の値をも取る様にして、台形型フィルタ、三角形型フィルタ等の様々な特性のフィルタを形成させることが可能である。そして前記実施の形態１乃至７は夫々独立して実現されるのではなく、適宜組み合わせることも可能である。 The first to seventh embodiments are merely examples of an infinite embodiment of the present invention, and various hardware and software configurations can be set as appropriate. Various processes can be combined in addition to the basic processes. For example, the echo suppression system of the present invention can be applied to various systems related to voice and sound other than navigation systems and video conference systems, and further, echo suppression devices, sound output devices, sound input devices, and sound processing All of the devices or two or three devices appropriately selected may be configured as one device. Also, one device may be configured as a plurality of devices, such as dividing the sound output device into a sound signal generating device that generates a sound signal and a sound output device. Further, in the first to seventh embodiments, the form in which the comb filter is formed by setting the filter coefficient to 0.0 or 1.0 has been shown. It is possible to form filters having various characteristics such as a mold filter and a triangular filter. The first to seventh embodiments are not realized independently, but can be appropriately combined.

Claims

A sound output device that outputs a sound based on a sound signal, a sound input device that converts an input sound into a sound signal, and a sound that is input from the sound input device based on a sound that is output from the sound output device In an echo suppression system comprising an echo suppression device for suppressing echo,
The sound output device is
For a plurality of sound signals, passing parts that pass components in different frequency bands, and
A plurality of sound output sections for outputting sounds based on the plurality of sound signals that have passed through the passage section,
An adder that generates a reference sound signal by adding a plurality of sound signals that have passed through the passage;
The echo suppressor is
An input unit for inputting a sound signal from the sound input device as an observation sound signal;
An echo suppression system, comprising: a correction unit that corrects the observation sound signal so as to suppress an echo included in the observation sound signal based on the observation sound signal and the reference sound signal.

The passage part is
A first converter for converting a plurality of sound signals into components on the frequency axis,
Band pass filter units that pass components of different frequency bands for a plurality of sound signals converted into components on the frequency axis,
2. A second conversion unit that converts a plurality of sound signals converted into components on a frequency axis through which components of respective frequency bands are passed, into sound signals on the time axis, respectively. 2. The echo suppression system according to 1.

The plurality of sound signals through which the passing sections pass components of different frequency bands are processed sound obtained by processing the first sound signal, the second sound signal, the first sound signal and / or the second sound signal by a predetermined processing method. Signal,
The band pass filter unit has a filter coefficient set for each sound signal,
Processing based on the first filter coefficient for the first sound signal, the second filter coefficient for the second sound signal, and the first filter coefficient and / or the second filter coefficient corresponding to the processing method for the processed sound signal The echo suppression system according to claim 2, wherein filter coefficients are respectively set.

The sound output device is
The frequency band component corresponding to the preset removal band is not allowed to pass through any sound signal,
The correction unit is
A filter unit for correction for deriving a correction amount required for correcting the observation sound signal by filtering the reference sound signal with a filter coefficient set for each frequency;
A coefficient updating unit for calculating and updating the filter coefficient of the correction filter based on the corrected observation sound signal;
The echo according to any one of claims 1 to 3 , further comprising: an update availability determination unit that determines whether the coefficient update unit can update based on a component of a removal band of the observation sound signal. Suppression system.

The sound output device further includes:
For each frequency band, the amplitudes of multiple sound signals are compared using a predetermined method, and the result of comparison between the amplitude of one sound signal with the maximum amplitude and the amplitudes of all other sound signals A comparison unit for determining whether or not the value indicating is equal to or greater than a predetermined value;
The said passage part is made to pass only said one sound signal with respect to the frequency band which the said comparison part determined to be more than predetermined value. The any one of Claim 1 thru | or 4 characterized by the above-mentioned. The echo suppression system described in 1.

The sound output device further includes:
For each frequency band, the amplitudes of multiple sound signals are compared using a predetermined method, and the result of comparison between the amplitude of one sound signal with the maximum amplitude and the amplitudes of all other sound signals A comparison unit for determining whether or not the value indicating is equal to or greater than a predetermined value;
The said passage part is made to pass all the sound signals with respect to the frequency band which the said comparison part determined to be more than predetermined value. The Claim 1 thru | or 5 characterized by the above-mentioned. Echo suppression system.

A sound output device that outputs sound based on a sound signal, a sound input device that receives sound, and an echo that suppresses echo based on the sound output from the sound output device from the sound input by the sound input device In an echo suppression method using a suppressor,
By the sound output device,
A procedure for passing components of different frequency bands for a plurality of sound signals,
A sound output procedure for outputting sounds based on a plurality of sound signals through which components of different frequency bands have passed,
With the sound input device,
A sound input procedure for converting the input sound into a sound signal;
By the echo suppression device,
An addition procedure for generating a reference sound signal by adding a plurality of sound signals that have passed through components of different frequency bands;
An input procedure in which a sound signal is input as an observation sound signal from the sound input device;
An echo suppression method, comprising: a correction procedure for correcting the observation sound signal so as to suppress an echo included in the observation sound signal based on the observation sound signal and the reference sound signal.

In cooperation with a sound output device that outputs a sound based on a sound signal and a sound input device that generates a sound signal based on an input sound, the sound input device outputs from the input sound. In an echo suppression program for causing an echo suppressor to execute a procedure for suppressing echo based on the sound
In the echo suppressor,
A reference sound signal generated by adding a plurality of sound signals that have passed through components of different frequency bands output from the sound output device, and an observation sound signal that is a sound signal generated by the sound input device; And executing a correction procedure for correcting the observation sound signal so as to suppress the echo contained in the observation sound signal.

In cooperation with a sound output device that outputs a sound based on a sound signal and a sound input device that generates a sound signal based on an input sound, the sound input device outputs from the input sound. In an echo suppressor that suppresses echo based on the generated sound,
An adder that generates a reference sound signal by adding a plurality of sound signals that have passed through components of different frequency bands output from the sound output device;
An input unit for inputting a sound signal from the sound input device as an observation sound signal;
An echo suppression apparatus comprising: a correction unit that corrects the observation sound signal so as to suppress an echo included in the observation sound signal based on the observation sound signal and the reference sound signal.

A sound input device that includes a plurality of sound output units that output a sound based on a plurality of sound signals, generates a sound signal based on the input sound, and the sound output unit from the sound input to the sound input device In a sound output device that cooperates with an echo suppressor that suppresses echo based on sound output from
For a plurality of sound signals, passing parts that pass components in different frequency bands, and
An adder that generates a reference sound signal by adding a plurality of sound signals that have passed through the passage;
With
Wherein the plurality of sound output section, Ri said sound based on the plurality of sound signals passed through the passing section configured so as to respectively output Thea,
The addition unit, a sound output device, wherein the generated reference sound signal that Ru Thea configured so as to be output to the echo suppressor.