JP2012244336A

JP2012244336A - Audio signal processing device, audio signal processing method and acoustic reproduction device

Info

Publication number: JP2012244336A
Application number: JP2011111338A
Authority: JP
Inventors: Shiro Suzuki; 志朗鈴木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2011-05-18
Filing date: 2011-05-18
Publication date: 2012-12-10

Abstract

PROBLEM TO BE SOLVED: To obtain good listening quality for a reflected sound of a sound which was output from a speaker and then thrown back to the listener upon reflection at a surrounding object.SOLUTION: A cancellation audio signal is generated based on an input audio signal Di. Then, the cancellation audio signal is added to the input audio signal to obtain an output audio signal Di". The cancellation audio signal is used to cancel, among reflected sounds reaching a listening point after being output from a speaker, a prescribed second reflected sound (sound to be a sub) which reaches the listening point after a first reflected sound (sound to be the main) has arrived. Since the output audio signal is supplied to the speaker, the reflected sounds reaching a listening point after being output from the speaker become to include a reflected sound which cancels the second reflected sound. As a result, the second reflected sound is canceled at the listening point. The speaker is installed face down on the rear side of the cabinet of a display apparatus such as a television receiver, etc.

Description

本技術は、音声信号処理装置、音声信号処理方法および音響再生装置に関する。特に、本技術は、スピーカから出力される音が周囲の物体に反射した後に到達する音、つまり反射音を受聴する音響再生装置に適用し得る音声信号処理装置等に関する。 The present technology relates to an audio signal processing device, an audio signal processing method, and an acoustic reproduction device. In particular, the present technology relates to a sound signal processing device and the like that can be applied to a sound that is received after a sound output from a speaker is reflected by a surrounding object, that is, a reflected sound.

近年、プラズマ、液晶などの表示デバイスが広く一般化し、テレビ受信機は薄型化、大型化の一途を辿っている。このようなテレビ受信機においては、薄型化によってスピーカサイズは大きな制約を受け、また、画面の大型化、狭縁化によって、画面両端へのスピーカ設置が困難になってきている。その結果、テレビ受信機に設置されるスピーカに関しては、サイズが小さく、かつ設置位置が背面に移設され、さらには下向きに設置されることが多くなっている（例えば、特許文献１参照）。 In recent years, display devices such as plasma and liquid crystal have been widely used, and television receivers are becoming thinner and larger. In such a television receiver, the speaker size is greatly restricted due to the reduction in thickness, and the installation of speakers at both ends of the screen has become difficult due to the enlargement and narrowing of the screen. As a result, the speaker installed in the television receiver is often small in size, moved to the rear side, and further installed downward (see, for example, Patent Document 1).

そのため、使用者に届く音はとても「良い音」と呼べるものではなくなってきている。なお、ここで言う「良い音」とは、個々人の主観的なものではなく、物理的に理想的な特性、つまりフラットな周波数特性を持つスピーカがあったとして、これを受聴点に向けて正面に配置して視聴者が受聴する場合を想定する。つまり、再生されるべき記録された音の周波数特性が、できる限りそのままに受聴できる場合を「良い音」が実現できている、とする。 Therefore, the sound that reaches the user is no longer a very good sound. Note that “good sound” here is not an individual's subjective sound, but a speaker with a physically ideal characteristic, that is, a flat frequency characteristic. It is assumed that the viewer listens to the music by placing them on the screen. That is, it is assumed that a “good sound” is realized when the frequency characteristics of the recorded sound to be reproduced can be received as much as possible.

さて、使用者からみてスピーカが下向きになっただけで、大抵の人は音が良くきこえないことは容易に想像がつく。その原因の客観的な一例を挙げる。すなわち、音の低域成分は指向性が鋭くないため、スピーカが下向きでも使用者に届きやすいので、比較的聞こえやすい。しかし、高域成分になればなるほど指向性が強くなるため、使用者には聞こえづらくなる。このため、音としては高域がないようないわゆる「こもった音」になる。この対策として、高域を持ち上げる信号処理を施すことで周波数特性をフラットに近づけて十分な高域再生を実現し、「こもった音」を解消することで「良い音」を実現するための努力がなされている。 Now, it is easy to imagine that most people will not be able to hear sound well just because the speaker is facing downward. Here is an objective example of the cause. That is, since the directivity of the low frequency component of the sound is not sharp, it is easy to reach the user even if the speaker is facing downward, so that it is relatively easy to hear. However, the higher the component, the stronger the directivity, making it difficult for the user to hear. For this reason, the sound is a so-called “muffled sound” with no high range. As measures against this, efforts are made to achieve a “good sound” by eliminating the “muffled sound” by implementing signal processing that raises the high range to achieve a sufficiently high frequency reproduction by making the frequency characteristics close to flat. Has been made.

特開２０１０−２３９２４９号公報JP 2010-239249 A

しかし、信号処理で周波数特性を調整するといっても、設計段階では、テレビが使用者環境でどのように設置されるのかは不明である。例えば、テレビ受信機の筐体にスピーカを下向きに取り付けた場合、使用者の受聴点においてどのような周波数特性を持つか、設計者には把握できない。この場合、テレビ受信機が低いテレビ台に置かれるのか、あるいは高さが１ｍ位もあるテレビスタンドに設置されるのかによって、周波数特性が大きく異なってくるからである。 However, even if the frequency characteristics are adjusted by signal processing, it is unclear at the design stage how the television is installed in the user environment. For example, when a speaker is mounted downward on a housing of a television receiver, the designer cannot grasp what frequency characteristics are present at the user's listening point. In this case, the frequency characteristics vary greatly depending on whether the television receiver is placed on a low television stand or on a television stand having a height of about 1 m.

結局、設計段階では、信号処理でどの程度高域を上げるべきかなど、周波数特性をフラット化するための逆フィルタの特性をはっきりと決めることができない。そのため、設計者は使用者の環境を「推測」し、あるいは特定の環境を想定した「仮定」で、信号処理の内容を決めなくてはならない。したがって、その信号処理機能をテレビ受信機に実装しても、使用者がその恩恵を受けることができるかは曖昧といわざるを得ない。 After all, at the design stage, it is impossible to clearly determine the characteristics of the inverse filter for flattening the frequency characteristics, such as how much high frequency should be increased in signal processing. For this reason, the designer must “guess” the user's environment or decide the contents of the signal processing based on “assuming” assuming a specific environment. Therefore, even if the signal processing function is implemented in the television receiver, it is unclear whether the user can receive the benefit.

また、大局的に低域、高域といった２点の周波数特性を調整しただけではフラット化は困難であり、「良い音」が実現できるわけではない。低域、高域で大雑把にでも周波数特性を調整すれば、多少なりとも音質は改善する。しかし、それでも「音声が聞き取りにくい」、「楽器ごとのバランスがおかしい」といった、細かい音質問題は容易に改善しないことが多い。 Further, flat adjustment is difficult only by adjusting the frequency characteristics of two points such as a low range and a high range as a whole, and a “good sound” cannot be realized. If the frequency characteristics are adjusted roughly even in the low and high frequencies, the sound quality will improve somewhat. However, there are many cases where minor sound quality problems such as “difficult to hear sound” and “balance of each instrument” are not easily improved.

本技術の目的は、スピーカから出力される音が周囲の物体に反射した後に使用者に到達する反射音を良好に受聴可能とすることにある。 An object of the present technology is to make it possible to satisfactorily listen to a reflected sound that reaches a user after sound output from a speaker is reflected by a surrounding object.

本技術の概念は、
スピーカから出力されて受聴点に到達する反射音のうち、第１の反射音が到達した後に到達する所定の第２の反射音をキャンセルするためのキャンセル用音声信号を、入力音声信号に基づいて生成するキャンセル用音声信号生成部と、
上記入力音声信号に、上記キャンセル用音声信号生成部で生成されたキャンセル用音声信号を加算して出力音声信号を得る音声信号加算部と
を備える音声信号処理装置にある。 The concept of this technology is
Based on the input audio signal, a canceling audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound among the reflected sounds that are output from the speaker and reaches the listening point An audio signal generation unit for cancellation to be generated;
An audio signal processing apparatus comprising: an audio signal adding unit that adds the cancellation audio signal generated by the cancellation audio signal generation unit to the input audio signal to obtain an output audio signal.

本技術において、キャンセル用音声信号生成部により、キャンセル用音声信号が生成される。そして、音声信号加算部により、このキャンセル用音声信号が入力音声信号に加算されて出力音声信号が得られる。キャンセル用音声信号は、スピーカから出力されて受聴点に到達する反射音のうち、第１の反射音が到達した後に到達する所定の第２の反射音をキャンセルするためのものである。スピーカは、例えばテレビ受信機等の表示機器の筐体の背面側に下向きに取り付けられている。この場合、使用者は、スピーカから出力されて受聴点に到達する反射音を受聴する。この反射音は、スピーカから出力される音が周囲の物体に反射した後に到達する音である。 In the present technology, a cancellation audio signal is generated by the cancellation audio signal generation unit. The audio signal adding unit adds the cancel audio signal to the input audio signal to obtain an output audio signal. The cancellation audio signal is for canceling a predetermined second reflected sound that arrives after the first reflected sound is reached among the reflected sounds that are output from the speaker and reach the listening point. The speaker is attached downward on the back side of the housing of a display device such as a television receiver. In this case, the user listens to the reflected sound that is output from the speaker and reaches the listening point. This reflected sound is a sound that arrives after the sound output from the speaker is reflected by surrounding objects.

このように本技術においては、出力音声信号は、入力音声信号に、キャンセル用音声信号が加算されたものとなる。そのため、この出力音声信号をスピーカに供給することで、スピーカから出力して受聴点に到達する反射音に、上述の第２の反射音をキャンセルする反射音が含まれるようになり、結果的に受聴点において第２の反射音がキャンセルされる。したがって、使用者は、受聴点において、第１の反射音を、上述の第２の反射音に影響されることなく、良好に受聴可能となる。 As described above, in the present technology, the output audio signal is obtained by adding the cancellation audio signal to the input audio signal. Therefore, by supplying the output audio signal to the speaker, the reflected sound that is output from the speaker and reaches the listening point includes the reflected sound that cancels the second reflected sound, and as a result, The second reflected sound is canceled at the listening point. Therefore, the user can listen to the first reflected sound satisfactorily at the listening point without being affected by the second reflected sound.

なお、本技術において、例えば、キャンセル用音声信号生成部は、入力音声信号を、受聴点に到達する第１の反射音と第２の反射音との時間差に対応する時間だけ遅延させる遅延部と、この遅延部の出力信号のゲインを反射量に応じて調整してキャンセル用音声信号を得るゲイン調整部とを有する、ようにされてもよい。 In the present technology, for example, the cancellation audio signal generation unit includes a delay unit that delays the input audio signal by a time corresponding to a time difference between the first reflected sound and the second reflected sound that reaches the listening point. And a gain adjusting unit that adjusts the gain of the output signal of the delay unit according to the amount of reflection to obtain a canceling audio signal.

そして、本技術において、例えば、遅延部における遅延量を調整するための情報としてスピーカの設置位置から壁までの距離の情報をユーザが入力するためのユーザインタフェースと、このユーザインタフェースで入力される距離の情報に基づいて、遅延部における遅延量を制御する制御部とをさらに備える、ようにされてもよい。この場合、ユーザインタフェースは、距離の情報をユーザが入力するためのＧＵＩ表示を行う表示部と、表示部に表示されたＧＵＩ表示を用いて距離の情報をユーザが入力するためのユーザ入力部とを有する、ようにされてもよい。このようにユーザインタフェースが備えられることで、実際のスピーカの設置位置から壁までの距離に応じて、遅延部における遅延量を最適に調整することが可能となる。 In the present technology, for example, a user interface for a user to input information about a distance from a speaker installation position to a wall as information for adjusting a delay amount in the delay unit, and a distance input through the user interface And a control unit that controls a delay amount in the delay unit based on the information. In this case, the user interface includes a display unit that performs GUI display for the user to input distance information, and a user input unit for the user to input distance information using the GUI display displayed on the display unit. It may be made to have. By providing the user interface in this way, the delay amount in the delay unit can be optimally adjusted according to the distance from the actual speaker installation position to the wall.

また、本技術において、例えば、スピーカの設定位置から壁までの距離の情報を得る距離センサと、この距離センサで得られる距離の情報に基づいて、遅延部における遅延量を制御する制御部とをさらに備える、ようにされてもよい。この場合、実際のスピーカの設置位置から壁までの距離の情報が距離センサにより得られるので、使用者の手間を必要とすることなく、遅延部における遅延量を最適に調整することが可能となる。 Further, in the present technology, for example, a distance sensor that obtains information on a distance from a speaker setting position to a wall, and a control unit that controls a delay amount in the delay unit based on the distance information obtained by the distance sensor. Further, it may be provided. In this case, since the distance sensor can obtain information on the distance from the actual speaker installation position to the wall, it is possible to optimally adjust the delay amount in the delay unit without requiring the user. .

また、本技術において、例えば、ゲイン調整部におけるゲインを調整するための情報として反射量の情報をユーザが入力するためのユーザインタフェースと、このユーザインタフェースで入力される反射量の情報に基づいて、ゲイン調整部におけるゲインを制御する制御部とをさらに備える、ようにされてもよい。この場合、ユーザインタフェースは、反射量の情報をユーザが入力するためのＧＵＩ表示を行う表示部と、表示部に表示されたＧＵＩ表示を用いて反射量の情報をユーザが入力するためのユーザ入力部とを有する、ようにされてもよい。このようにユーザインタフェースが備えられることで、ゲイン調整部におけるゲインを最適に調整することが可能となる。 Further, in the present technology, for example, based on a user interface for the user to input information on the reflection amount as information for adjusting the gain in the gain adjustment unit, and information on the reflection amount input on the user interface, And a controller that controls the gain in the gain adjusting unit. In this case, the user interface includes a display unit that performs GUI display for the user to input information on the amount of reflection, and a user input for the user to input information about the amount of reflection using the GUI display displayed on the display unit. It may be made to have a part. By providing the user interface in this way, it is possible to optimally adjust the gain in the gain adjustment unit.

本技術によれば、スピーカから出力される音が周囲の物体に反射した後に到達する反射音を良好に受聴できるようになる。 According to the present technology, it is possible to satisfactorily listen to the reflected sound that arrives after the sound output from the speaker is reflected on a surrounding object.

スピーカＳＰがテレビ受信機（ＴＶ）の筐体の前面側に配置されている例を示す図である。It is a figure which shows the example by which the speaker SP is arrange | positioned at the front side of the housing | casing of a television receiver (TV). スピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置されている例を示す図である。It is a figure which shows the example by which the speaker SP is arrange | positioned downward on the back side of the housing | casing of a television receiver (TV). 受聴点Ｐに届く反射音の周波数特性を説明するための図である。It is a figure for demonstrating the frequency characteristic of the reflected sound which reaches the listening point. 音響再生装置の一例を示すブロック図である。It is a block diagram which shows an example of a sound reproducing device. スピーカＳＰがテレビ受信機（ＴＶ）の筐体の前面側に配置され、さらに、音の反射を考慮した一例を示す図である。It is a figure which shows an example in which speaker SP is arrange | positioned at the front side of the housing | casing of a television receiver (TV), and also considered reflection of sound. スピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置され、受聴するための主とする音に関する反射が考慮され、さらに主とする音以外の反射をも考慮した一例を示す図である。An example is shown in which the speaker SP is disposed downward on the rear side of the housing of the television receiver (TV), taking into account reflections relating to the main sound for listening, and also taking into account reflections other than the main sound. FIG. 主とする音と副とする音が混じり合った際の、周波数特性のシミュレーション結果のグラフの一例を示す図である。It is a figure which shows an example of the graph of the simulation result of a frequency characteristic when the main sound and the sub sound are mixed. 主とする音と副とする音が混じり合った際の、周波数特性のシミュレーション結果のグラフの他の例を示す図である。It is a figure which shows the other example of the graph of the simulation result of a frequency characteristic when the main sound and the sub sound are mixed. 本技術による課題解決方法の一例を示す図である。It is a figure which shows an example of the problem-solving method by this technique. 第１の実施の形態としての音響再生装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the sound reproduction apparatus as 1st Embodiment. 本技術における音響再生装置の効果を説明するための図である。It is a figure for demonstrating the effect of the sound reproduction apparatus in this technique. テレビ受信機（ＴＶ）を、台の受聴点Ｐよりに設置した場合を示す図である。It is a figure which shows the case where a television receiver (TV) is installed from the listening point P of a stand. テレビ受信機（ＴＶ）を、台の壁よりに設置した場合を示す図である。It is a figure which shows the case where a television receiver (TV) is installed from the wall of a stand. 第２の実施の形態としての音響再生装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the sound reproduction apparatus as 2nd Embodiment. 表示部に、距離情報をユーザが入力するためのＧＵＩ表示が行われることを示す図である。It is a figure which shows that the GUI display for a user to input distance information is performed on a display part. テレビ受信機（ＴＶ）の筐体の背面側に距離センサを設置することを説明するための図である。It is a figure for demonstrating installing a distance sensor in the back side of the housing | casing of a television receiver (TV). 第３の実施の形態としての音響再生装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the sound reproduction apparatus as 3rd Embodiment. 表示部に、反射量情報をユーザが入力するためのＧＵＩ表示が行われることを示す図である。It is a figure which shows that the GUI display for a user to input reflection amount information is performed on a display part.

以下、発明を実施するための形態（以下、「実施の形態」とする）について説明する。なお、説明は以下の順序で行う。
１．第１の実施の形態
２．第２の実施の形態
３．第３の実施の形態
４．変形例 Hereinafter, modes for carrying out the invention (hereinafter referred to as “embodiments”) will be described. The description will be given in the following order.
1. 1. First embodiment 2. Second embodiment 3. Third embodiment Modified example

＜１．第１の実施の形態＞
最初に、本技術の原理について説明する。
［スピーカが前面側にある場合］
図１は、スピーカＳＰがテレビ受信機（ＴＶ）の筐体の前面側に配置されている例を示している。この場合、使用者は、主に、テレビ受信機の正面側に位置して視聴する。そのため、スピーカＳＰから出力された音Ｄｏが直接に受聴点Ｐに音Ｄｐとして届く。 <1. First Embodiment>
First, the principle of the present technology will be described.
[When the speaker is on the front side]
FIG. 1 shows an example in which the speaker SP is arranged on the front side of the casing of a television receiver (TV). In this case, the user mainly views and watches on the front side of the television receiver. Therefore, the sound Do output from the speaker SP reaches the listening point P directly as the sound Dp.

この音Ｄｐは、本来であれば空間伝播時になんらかの減衰や周波数特性の変化を受ける可能性がある。しかし、ここでは、説明の簡単化のため、空間伝播時の減衰や周波数特性の変化は非常に小さく、無視できるものとして説明する。ただし、伝播経路長が異なる場合には、時間遅れは発生するものと仮定する。 This sound Dp may be subject to some attenuation or change in frequency characteristics during propagation through space. However, here, for the sake of simplification of explanation, it is assumed that attenuation and frequency characteristic change during spatial propagation are very small and can be ignored. However, it is assumed that a time delay occurs when the propagation path lengths are different.

［スピーカが背面側にある場合］
図２は、スピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置されている例を示している。この場合、使用者は、主に、テレビ受信機の正面側に位置して視聴する。この場合、スピーカＳＰから出力された音Ｄｏは、テレビ受信機を載せた台のＳ点でまず反射し、その後、受聴点Ｐに音Ｄｐ′として届く。本技術では、この反射に着目しており、反射による減衰はあるものとする。 [When the speaker is on the back side]
FIG. 2 shows an example in which the speaker SP is disposed downward on the back side of the housing of the television receiver (TV). In this case, the user mainly views and watches on the front side of the television receiver. In this case, the sound Do output from the speaker SP is first reflected at the point S on the stage on which the television receiver is mounted, and then reaches the listening point P as the sound Dp ′. The present technology focuses on this reflection, and it is assumed that there is attenuation due to the reflection.

一般的には、物体による音の反射率は、全ての周波数成分で均一ではない。そのため、反射音はＳ点において周波数特性の変化を受けたのち、Ｐ点に音Ｄｐ′として届く。この際、たとえば高域が大きく減衰することなどにより、Ｐ点に届くのは「こもった音」になる場合がある。その結果、音楽においてきらびやかさがなくなってつまらなくなってしまう、あるいは音声では明瞭度が下がって聞き取りにくくなってしまう、といった問題がある。 In general, the reflectance of sound by an object is not uniform for all frequency components. For this reason, the reflected sound reaches the point P as the sound Dp ′ after undergoing a change in frequency characteristics at the point S. At this time, for example, it may be a “muffled sound” that reaches the point P due to a large attenuation in the high frequency range. As a result, there is a problem in that the music is not gorgeous and boring, or the voice becomes less clear and difficult to hear.

［理想周波数特性Ｄと、実際の周波数特性Ｄ′の違い］
図３（ａ）は、一例として、図１のＰ点に届く音Ｄｐの周波数特性ｆ（Ｄｐ）と、図２のＰ点に届く音Ｄｐ′の周波数特性ｆ（Ｄｐ′）の比較を示している。ここでは、説明の簡単化のため、ｆ（Ｄｐ）を基準として、これを平坦化して評価するものとする。ｆ（Ｄｐ′）はＳ点における反射により周波数特性の変化を受け、例えば、図示のように、全体的に大きさＧが小さくなったり、山谷ができたりする。 [Difference between ideal frequency characteristic D and actual frequency characteristic D ']
FIG. 3A shows, as an example, a comparison between the frequency characteristic f (Dp) of the sound Dp reaching the point P in FIG. 1 and the frequency characteristic f (Dp ′) of the sound Dp ′ reaching the point P in FIG. ing. Here, for simplification of description, it is assumed that the evaluation is performed by flattening this with reference to f (Dp). f (Dp ′) is subjected to a change in frequency characteristics due to reflection at the point S, and for example, as shown in the figure, the size G is reduced as a whole, or a valley is formed.

全体的に大きさが小さくなるのは直感的に理解できるが、これは物体における反射率が１００％でないために生じる現象である。また、周波数によって減衰率が異なるのは、反射を起こす物体表面の材質や内部損失率が周波数によって均一でないために生じる現象である。その結果、ｆ（Ｄｐ′）は、ｆ（Ｄｐ）に比べ、高域が減衰していわゆる「こもった音」になる。このような現象は当該業者間ではよく知られている。そのため、これ以上の説明は省略する。 Although it can be intuitively understood that the size is reduced as a whole, this is a phenomenon that occurs because the reflectance of an object is not 100%. Further, the attenuation rate varies depending on the frequency, which is a phenomenon that occurs because the material of the object surface that causes reflection and the internal loss rate are not uniform depending on the frequency. As a result, f (Dp ′) becomes a so-called “muffled sound” as the high frequency is attenuated compared to f (Dp). Such a phenomenon is well known among the traders. Therefore, further explanation is omitted.

［ｆ（Ｄｐ′）を理想特性ｆ（Ｄｐ）に近づける信号処理を実施してｆ（Ｄｐ″）に調整］
このように、同じスピーカＳＰから同じ音を出したとしても、スピーカＳＰの配置によってＰ点では大きな音の違いが出ることは従来からよく知られている。そこで、ｆ（Ｄｐ′）の音のきらびやかさや明瞭度を回復させるために、図３（ｂ）に示すように、Ｄｐ′の周波数特性ｆ（Ｄｐ′）を、ｆ（Ｄｐ″）のように調整する技術が一般的に用いられてきた。具体的には、アナログ信号処理、あるいはデジタル信号処理によってフィルタ処理を行い、ｆ（Ｄｐ′）をｆ（Ｄｐ″）のように調整する。 [Signal processing for bringing f (Dp ′) closer to the ideal characteristic f (Dp) and adjusting it to f (Dp ″)]
Thus, even if the same sound is output from the same speaker SP, it has been well known that a large difference in sound appears at the point P depending on the arrangement of the speakers SP. Therefore, in order to restore the glitteringness and clarity of the sound of f (Dp ′), as shown in FIG. 3B, the frequency characteristic f (Dp ′) of Dp ′ is expressed as f (Dp ″). A technique for adjusting is generally used, specifically, filtering is performed by analog signal processing or digital signal processing to adjust f (Dp ′) as f (Dp ″).

アナログ、デジタルに関わらず、用いられる信号処理は、ある周波数領域ごとに大きさを調整するものである。現在はデジタル信号処理が広く用いられており、例えばＦＩＲ（Finite Impulse Response）フィルタやＩＩＲ（InfiniteImpulse Response）フィルタを使うことで、このような調整は容易に実現可能となっている。 Regardless of analog or digital, the signal processing used is to adjust the size for each frequency region. At present, digital signal processing is widely used. Such adjustment can be easily realized by using, for example, a FIR (Finite Impulse Response) filter or an IIR (Infinite Impulse Response) filter.

ただし、ｆ（Ｄｐ′）を完全にｆ（Ｄｐ）の周波数特性に戻すことは困難である。その理由は、ＦＩＲ／ＩＩＲでも、製品においてはそのリソース（演算量、ＲＯＭ／ＲＡＭ容量）に制限があるためであり、例えば、いくら調整したとしても、ｆ（Ｄｐ″）のように若干のズレは残ったままとなる。ｆ（Ｄｐ）に近づけようとすればするほど、リソースは巨大化し、コストや使用電力量の増加を招き、製品の価格を上昇させてしまう要因となる。よって、使用者におけるメリット、デメリットと、設計者側のメリット、デメリットの適切なバランスを取ることが常に求められることとなる。 However, it is difficult to completely return f (Dp ′) to the frequency characteristic of f (Dp). The reason is that even in FIR / IIR, there are limitations on the resources (computation amount, ROM / RAM capacity) in the product. For example, no matter how much the adjustment is made, a slight deviation such as f (Dp ″) will occur. The more you try to get closer to f (Dp), the larger the resources become, causing cost and power consumption to increase, causing the price of the product to rise. There is always a need to properly balance the advantages and disadvantages of the designer with the advantages and disadvantages of the designer.

図４（ａ）には、図３（ａ）のｆ（Ｄｐ）を実現する音響再生装置の一例を示している。入力音声信号Ｄｉはアンプにて電力増幅されたのちスピーカＳＰに送られ、このスピーカＳＰから音Ｄｏとして再生され、Ｐ点に音Ｄｐとして到達する。ここでは、説明の都合上、このときのＰ点での再生音Ｄｐの周波数特性と、入力音声信号Ｄｉの周波数特性は等しいと定義する。 FIG. 4A shows an example of a sound reproducing device that realizes f (Dp) in FIG. The input audio signal Di is amplified by an amplifier and then sent to the speaker SP. The input signal is reproduced as a sound Do from the speaker SP and reaches the point P as a sound Dp. Here, for convenience of explanation, it is defined that the frequency characteristic of the reproduced sound Dp at the point P at this time is equal to the frequency characteristic of the input audio signal Di.

図４（ｂ）には、図３（ａ）のｆ（Ｄｐ′）を実現する音響再生装置の一例を示している。入力音声信号Ｄｉの特性は、図４（ａ）と同じであり、これがアンプにて電力増幅されたのちスピーカＳＰに送られ、このスピーカＳＰから音Ｄｏとして再生される。しかし、この音Ｄｏは、Ｐ点に届くまでに、図２および図３（ａ）で説明したように、Ｓ点での反射により周波数特性に変化が生じ、ＤｐではなくＤｐ′となる。 FIG. 4B shows an example of a sound reproducing device that realizes f (Dp ′) in FIG. The characteristics of the input audio signal Di are the same as those in FIG. 4A. After the power of the input audio signal Di is amplified by the amplifier, it is sent to the speaker SP, and is reproduced from the speaker SP as sound Do. However, as described with reference to FIGS. 2 and 3A, the sound Do changes to the frequency characteristic due to reflection at the point S and reaches Dp ′ instead of Dp before reaching the point P.

図４（ｃ）には、図３（ｂ）のｆ（Ｄｐ″）を実現する音響再生装置の一例を示している。入力音声信号Ｄｉの特性は、図４（ａ）、図４（ｂ）と同じである。しかし、アンプで電力増幅される前に、フィルタにて周波数調整が実施されてＤｉ′に変化させられた後、アンプに伝送される。このように周波数調整された音声信号は、アンプで電力増幅された後、スピーカから音Ｄｏ′として再生され、Ｐ点に音Ｄｐ″として到達する。 4 (c) shows an example of a sound reproducing device that realizes f (Dp ″) in FIG. 3 (b). The characteristics of the input audio signal Di are shown in FIGS. 4 (a) and 4 (b). However, before the power is amplified by the amplifier, the frequency is adjusted by the filter, changed to Di ′, and transmitted to the amplifier. Is amplified by an amplifier, reproduced from the speaker as sound Do ′, and reaches the point P as sound Dp ″.

周波数調整の内容は、図３（ｂ）の説明で詳細を示したのでここでは省略する。その結果、Ｐ点におけるＤｐ″の周波数特性ｆ（Ｄｐ″）を、Ｄｐの周波数特性ｆ（Ｄｐ）に近い形に調整することが可能となり、本来再生されるべきである周波数特性に近い音質を実現できることとなる。この調整方法を実施するためには、周波数特性が変化した原因は必要とされず、ｆ（Ｄｐ）とｆ（Ｄｐ″）ができるだけ一致するように、結果だけをみてフィルタの周波数調整を設定すればよい。 The details of the frequency adjustment have been described in detail with reference to FIG. As a result, it is possible to adjust the frequency characteristic f (Dp ″) of Dp ″ at the point P to a form close to the frequency characteristic f (Dp) of Dp, and to achieve a sound quality close to the frequency characteristic that should be reproduced originally. It can be realized. In order to carry out this adjustment method, the cause of the change in the frequency characteristics is not required, and the frequency adjustment of the filter is set only by looking at the result so that f (Dp) and f (Dp ″) match as much as possible. That's fine.

［スピーカが前面側にある場合＋反射を考慮］
上述の図１にはスピーカＳＰがテレビ受信機（ＴＶ）の筐体の前面側に配置されている例を示したが、音の反射という現象が考慮されていない。図５は、スピーカＳＰがテレビ受信機（ＴＶ）の筐体の前面側に配置され、さらに、音の反射を考慮した一例を示している。 [When the speaker is on the front side + Reflection is considered]
FIG. 1 described above shows an example in which the speaker SP is disposed on the front side of the casing of the television receiver (TV), but the phenomenon of sound reflection is not taken into consideration. FIG. 5 shows an example in which the speaker SP is disposed on the front side of the casing of the television receiver (TV) and further reflection of sound is taken into consideration.

この図５では、図１と同様に、Ｐ点に届く受聴のための主とする音は、スピーカＳＰから届く直接音Ｄｐであるが、それ以外に、例えば、天井のＳ１点からの反射音Ｒｐ１や、床のＳ２点からの反射音Ｒｐ２が存在するものとする。現実的には反射は至るところからあるが、説明の簡単化のため、反射率の大きそうな、上述の２点に絞って説明を続ける。ここで、Ｄｐを「主とする音」と呼ぶことに対し、Ｒｐ１とＲｐ２は本来不要なものであるため「副とする音」と呼ぶこととする。この場合、Ｐ点に、Ｄｐ以外にもＲｐ１やＲｐ２が届くことになるため、スピーカＳＰからどんなに理想的な周波数特性で音を出したとしても、Ｐ点に届く際にはなんらかの周波数特性の変化を受けることとなる。 In FIG. 5, as in FIG. 1, the main sound for listening to the point P is the direct sound Dp that reaches from the speaker SP. In addition to this, for example, the reflected sound from the point S1 on the ceiling It is assumed that there is Rp1 and reflected sound Rp2 from point S2 on the floor. In reality, reflections are everywhere, but for the sake of simplicity of explanation, the explanation will be continued by focusing on the above-mentioned two points that are likely to have a high reflectance. Here, Dp is referred to as “main sound”, whereas Rp1 and Rp2 are originally unnecessary, and thus referred to as “sub-sound”. In this case, since Rp1 and Rp2 other than Dp reach the point P, no matter how ideal the frequency characteristic is emitted from the speaker SP, when the sound reaches the point P, some change in the frequency characteristic occurs. Will receive.

［スピーカが背面側にある場合＋反射を考慮］
上述の図２にはスピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置されている例を示した。この例では、受聴するための主とする音に関する反射は考慮されているが、図１と同様に、主とする音以外の反射音は考慮されていない。図６は、スピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置され、受聴するための主とする音に関する反射が考慮され、さらに主とする音以外の反射をも考慮した一例を示している。 [When the speaker is on the back side + Reflection is considered]
FIG. 2 described above shows an example in which the speaker SP is disposed downward on the back side of the casing of the television receiver (TV). In this example, the reflection related to the main sound for listening is taken into consideration, but the reflected sound other than the main sound is not taken into account as in FIG. In FIG. 6, the speaker SP is disposed on the back side of the television receiver (TV) housing in a downward direction, taking into account reflections relating to the main sound for listening, and also taking into account reflections other than the main sound. An example is shown.

この図６では、図２と同様に、受聴のための主とする音は、スピーカＳＰから出た音ＤｏがＳ１点で反射してＰ点に届くＤｐである。それに加え、例えば同じスピーカＳＰから出た音ＤｏはＳ２点で反射し、さらに壁のＳ３点で反射したのち、ＲｐとしてＰ点に到達する。ここでも、実際には反射は至るところからあるが、説明の簡単化のため、スピーカＳＰに対しＳ１点とは対称位置にあるＳ２点からの反射のみがある、として説明を続ける。上述の図５と同様に、Ｄｐを「主とする音」と呼ぶことに対し、Ｒｐを「副とする音」と呼ぶこととする。この場合、Ｐ点ではＤｐ以外にもＲｐが届くことになるため、やはりスピーカＳＰからどんなに理想的な周波数特性で音を出したとしても、Ｐ点に届く際にはなんらかの周波数特性の変化を受けることとなる。 In FIG. 6, as in FIG. 2, the main sound for listening is Dp that the sound Do emitted from the speaker SP reflects at the point S <b> 1 and reaches the point P. In addition, for example, the sound Do emitted from the same speaker SP is reflected at the point S2 and further reflected at the point S3 on the wall, and then reaches the point P as Rp. Here, in practice, the reflection is everywhere, but for the sake of simplicity, the description will be continued assuming that there is only reflection from the point S2 that is symmetrical to the point S1 with respect to the speaker SP. Similarly to FIG. 5 described above, Dp is referred to as “main sound”, while Rp is referred to as “sub sound”. In this case, since Rp reaches other than Dp at point P, no matter how ideal the frequency characteristic is emitted from the speaker SP, when the point P is reached, some frequency characteristic change is received. It will be.

［スピーカが前面側にあり、反射を考慮した場合の周波数特性］
図７は、図５における、主とする音と副とする音が混じり合った際の、周波数特性のシミュレーション結果のグラフを示す。このシミュレーションでは、音源としては特性を評価しやすい白色雑音を１０秒間再生するものとし、この１０秒間の全体的な周波数特性を調べるものとした。なお、図４（ａ）に示したように、ここでも再生装置の周波数特性は平坦であることとする。 [Frequency characteristics when the speaker is on the front side and reflection is considered]
FIG. 7 shows a graph of the simulation result of the frequency characteristics when the main sound and the sub sound are mixed in FIG. In this simulation, white noise whose characteristics are easily evaluated is reproduced as a sound source for 10 seconds, and the overall frequency characteristics for 10 seconds are examined. As shown in FIG. 4A, the frequency characteristic of the playback device is also flat here.

まず、スピーカＳＰからＰ点までの直接の伝達特性をｔｆ０とおく。説明の簡単化のため、この伝達特性は大きさ及び周波数特性の変化をもたらさないものと仮定する。ｔｆ０が持つ時間遅れ量ｘを括弧内に反映させるとすると、Ｄｐは、以下の（１）式で表すことができる。なお、演算子「＊」は、伝達特性を反映させるものとして、以下使用する。
Ｄｐ＝Ｄｏ（０）＊ｔｆ０＝Ｄｏ（ｘ）・・・（１） First, let tf0 be a direct transfer characteristic from the speaker SP to the point P. For simplicity of explanation, it is assumed that this transfer characteristic does not cause a change in magnitude and frequency characteristic. If the time delay amount x possessed by tf0 is reflected in parentheses, Dp can be expressed by the following equation (1). The operator “*” is used below to reflect the transfer characteristics.
Dp = Do (0) * tf0 = Do (x) (1)

Ｄｏ（０）はスピーカＳＰから出たばかりの音である。この括弧内の「０」を基準の時間とすると、伝達特性ｔｆ０によって周波数特性の変化は生じないものの、スピーカＳＰからＰ点までの距離に応じた時間遅れ量ｘが生じるので、ＤｐはＤｏ（ｘ）となる。なお、この括弧内の数字は、Ｄｏ（０）の「０」を基準とした時間遅れ量を示しており、デジタルデータなどのサンプルを特定するためのインデックスではない。図７（ａ）は、Ｄｏ（ｘ）の周波数特性を示しており、白色雑音の平坦な特性が見て取れる。 Do (0) is a sound just emitted from the speaker SP. When “0” in the parentheses is a reference time, although the frequency characteristic does not change due to the transfer characteristic tf0, a time delay amount x corresponding to the distance from the speaker SP to the point P is generated, so Dp is Do ( x). The numbers in parentheses indicate the amount of time delay based on “0” of Do (0), and are not indexes for specifying samples such as digital data. FIG. 7A shows the frequency characteristic of Do (x), and a flat characteristic of white noise can be seen.

次に、スピーカＳＰからＳ１点を経由しＰ点までの伝達特性をｔｆ１とおく。説明の簡単化のため、この伝達特性は周波数特性の変化をもたらさないものの、Ｓ１点での反射の影響で大きさは１／１０になるものと仮定する。ｔｆ１が持つ時間遅れ量をｘ＋ｙとおくと、Ｐ点に到達するＲｐ１は、以下の（２）式で表すことができる。なお、ｔｆ１の時間遅れ量がｘ＋ｙとなるのは、ｔｆ０に比べｔｆ１の経路が長い分、ｔｆ０よりもｙ分だけ増加するためである。
Ｒｐ１＝Ｄｏ（０）＊ｔｆ１
＝Ｄｏ（ｘ＋ｙ）＊１／１０・・・（２） Next, the transfer characteristic from the speaker SP to the point P via the point S1 is set to tf1. For simplicity of explanation, it is assumed that this transfer characteristic does not cause a change in frequency characteristic, but the size becomes 1/10 due to the influence of reflection at the point S1. When the time delay amount of tf1 is x + y, Rp1 reaching the point P can be expressed by the following equation (2). The time delay amount of tf1 is x + y because the path of tf1 is longer than tf0 and increases by y from tf0.
Rp1 = Do (0) * tf1
= Do (x + y) * 1/10 (2)

同様に、スピーカＳＰからＳ２点を経由しＰ点までの伝達特性をｔｆ２とおき、反射による大きさが１／１０になるものとし、時間遅れ量をｘ＋ｚとすると、Ｐ点に到達するＲｐ１は、以下の（３）式で表すことができる。
Ｒｐ２＝Ｄｏ（０）＊ｔｆ２
＝Ｄｏ（ｘ＋ｚ）＊１／１０・・・（３） Similarly, assuming that the transfer characteristic from the speaker SP to the point P via the point S2 is tf2, the magnitude due to reflection is 1/10, and the amount of time delay is x + z, Rp1 reaching the point P is Can be expressed by the following equation (3).
Rp2 = Do (0) * tf2
= Do (x + z) * 1/10 (3)

また、説明の簡単化のため、ｚ＝ｙと仮定すると、Ｐ点に到達する全ての音Ｐａｌｌは、以下の（４）式で表すことができる。
Ｐａｌｌ＝Ｄｐ＋Ｒｐ１＋Ｒｐ２
＝Ｄｏ（ｘ）＋Ｄｏ（ｘ＋ｙ）＊１／５・・・（４） For simplicity of explanation, assuming that z = y, all sounds Pall reaching the point P can be expressed by the following equation (4).
Pall = Dp + Rp1 + Rp2
= Do (x) + Do (x + y) * 1/5 (4)

つまり、本来のＤｏに加え、不要な成分がＤｏの１／５の大きさで混じり合っていることになる。さらに、この不要な成分は、Ｄｏ（ｘ）に比べｙ分だけ遅れている。この時間遅れ量は経路の長さによってまちまちである。例えば、部屋のサイズ感から考えて、説明の簡単化のために、余分な経路の長さＬを３．４ｍとすると、音速を３４０ｍ／ｓとすれば、その時間遅れ量は１０ｍｓｅｃとなる。図７（ｂ）は、このようにＲｐ１とＲｐ２を定義した場合の、ＤｐとＲｐ１およびＲｐ２が混じり合った際の周波数特性を示している。 That is, in addition to the original Do, unnecessary components are mixed with each other at 1/5 the size of Do. Furthermore, this unnecessary component is delayed by y as compared with Do (x). This amount of time delay varies depending on the length of the route. For example, considering the size of the room, for the sake of simplicity of explanation, if the length L of the extra path is 3.4 m, the amount of time delay is 10 msec if the speed of sound is 340 m / s. FIG. 7B shows frequency characteristics when Dp, Rp1, and Rp2 are mixed when Rp1 and Rp2 are defined in this way.

この図７（ｂ）に示す周波数特性においては、Ｄｏのみの場合の図７（ａ）に示す周波数特性と比べると、一定の細かい間隔で±２ｄＢ程度の山谷が発生していることがわかる。この現象はＤｐとＲｐ１およびＲｐ２の干渉により現実的によく起きていることである。しかし、実際には、Ｒｐ１とＲｐ２の大きさがもっと小さいことが多いため、図７（ａ）の周波数特性と、図７（ｂ）の周波数特性の違いは、それほど大きなものとはならない場合が殆どである。そのため、音質に与える影響は少なく、問題になることはまれである。 In the frequency characteristic shown in FIG. 7B, it can be seen that peaks and valleys of about ± 2 dB are generated at a constant fine interval as compared with the frequency characteristic shown in FIG. This phenomenon is frequently caused by interference between Dp and Rp1 and Rp2. However, in reality, since the sizes of Rp1 and Rp2 are often smaller, the difference between the frequency characteristics in FIG. 7A and the frequency characteristics in FIG. 7B may not be so large. It is almost. Therefore, it has little influence on sound quality and rarely becomes a problem.

［スピーカが背面にあり反射を考慮した場合の周波数特性］
図８は、図６における、主とする音と副とする音が混じり合った際の、周波数特性のシミュレーション結果のグラフを示す。このシミュレーションも、図７と同様に、音源としては特性を評価しやすい白色雑音を約１０秒間再生するものとし、この１０秒間の全体的な周波数特性を調べるものとする。 [Frequency characteristics when the speaker is on the back and reflection is considered]
FIG. 8 is a graph showing a simulation result of frequency characteristics when the main sound and the sub sound in FIG. 6 are mixed. In this simulation, as in FIG. 7, it is assumed that white noise whose characteristics are easy to evaluate is reproduced as a sound source for about 10 seconds, and the overall frequency characteristics for 10 seconds are examined.

まず、スピーカＳＰからＳ１点を経由してＰ点までの伝達特性をｔｆ１とおく。説明の簡単化のため、この伝達特性は周波数特性の変化をもたらさないものの反射の影響で大きさが１／２になると仮定する。ｔｆ１が持つ時間遅れ量ｘを括弧内に反映させるとすると、Ｄｐは、以下の（５）式で表すことができる。
Ｄｐ＝Ｄｏ（０）＊ｔｆ１＝Ｄｏ（ｘ）＊１／２・・・（５） First, let tf1 be the transfer characteristic from the speaker SP to point P via point S1. For simplification of explanation, it is assumed that this transfer characteristic does not cause a change in frequency characteristic, but its size is halved due to the influence of reflection. If the time delay amount x possessed by tf1 is reflected in parentheses, Dp can be expressed by the following equation (5).
Dp = Do (0) * tf1 = Do (x) * 1/2 (5)

Ｄｏ（０）はスピーカＳＰから出たばかりの音である。この括弧内の「０」を基準の時間とすると、伝達特性ｔｆ１によって周波数特性の変化は生じないものの、大きさの変化とスピーカからＰ点までの距離に応じた時間遅れ量ｘが生じるので、ＤｐはＤｏ（ｘ）＊１／２と表現される。 Do (0) is a sound just emitted from the speaker SP. Assuming that “0” in the parentheses is a reference time, although the frequency characteristic does not change due to the transfer characteristic tf1, a time delay amount x corresponding to the change in size and the distance from the speaker to the point P occurs. Dp is expressed as Do (x) * 1/2.

図８（ａ）は、Ｄｏ（ｘ）の周波数特性を示しており、白色雑音の平坦な特性が見て取れる。ただし、現実的には周波数ごとに反射率が異なることがあり、図３（ａ），（ｂ）で示したように、その周波数特性は平坦にならないことがある。つまり、図７（ａ）と図８（ａ）は一致しないことがある。しかし、本技術では、次に述べる反射時における干渉問題の改善が主眼であるので、上述したように伝達特性ｔｆ１による周波数特性変化がないもの同士で効果を比較しても良い。 FIG. 8A shows the frequency characteristic of Do (x), and the flat characteristic of white noise can be seen. However, in reality, the reflectance may be different for each frequency, and as shown in FIGS. 3A and 3B, the frequency characteristics may not be flat. That is, FIG. 7A and FIG. 8A may not match. However, since the present technology mainly focuses on the improvement of the interference problem at the time of reflection described below, the effects may be compared between those having no frequency characteristic change due to the transfer characteristic tf1 as described above.

次に、スピーカＳＰからＳ２点までの伝達特性をｔｆ２、さらにＳ２点からＳ３点を経由してＰ点までの伝達特性をｔｆ３とおく。説明の簡単化のため、この伝達特性は周波数特性の変化をもたらさないものの、Ｓ２点、Ｓ３点では反射の影響で大きさはそれぞれ１／２になるものと仮定する。説明の簡単化のため、ｔｆ２およびｔｆ３が持つ時間遅れ量を合わせてｘ＋ｙとおくと、Ｐ点に到達するＲｐは、以下の（６）式で表すことができる。なお、時間遅れ量がｘ＋ｙとなるのは、ｔｆ１に比べｔｆ２とｔｆ３の経路が長い分、ｔｆ１よりもｙ分だけ増加するためである。
Ｒｐ＝Ｄｏ（０）＊ｔｆ２＊ｔｆ３
＝Ｄｏ（ｘ＋ｙ）＊１／２＊１／２
＝Ｄｏ（ｘ＋ｙ）＊１／４・・・（６） Next, let tf2 be the transfer characteristic from the speaker SP to the point S2, and let tf3 be the transfer characteristic from the point S2 to the point P through the point S3. For simplification of explanation, it is assumed that this transfer characteristic does not cause a change in frequency characteristic, but the size is halved at the points S2 and S3 due to the influence of reflection. For simplification of description, if the time delay amounts of tf2 and tf3 are combined and set to x + y, Rp reaching the point P can be expressed by the following equation (6). Note that the amount of time delay is x + y because the path of tf2 and tf3 is longer than tf1 and increases by y from tf1.
Rp = Do (0) * tf2 * tf3
= Do (x + y) * 1/2 * 1/2
= Do (x + y) * 1/4 (6)

よって、Ｐ点に到達する全ての音Ｐａｌｌは、以下の（７）式で表すことができる。
Ｐａｌｌ＝Ｄｐ＋Ｒｐ
＝Ｄｏ（ｘ）＊１／２＋Ｄｏ（ｘ＋ｙ）＊１／４・・・（７） Therefore, all the sounds Pall reaching the point P can be expressed by the following equation (7).
Pall = Dp + Rp
= Do (x) * 1/2 + Do (x + y) * 1/4 (7)

つまり、本来のＤｏに加え、本来不要な成分がＤｏの１／２の大きさで混じり合っていることになる。さらに、この本来不要な成分は、Ｄｏ（ｘ）に比べｙ分だけ遅れている。この時間遅れ量は経路の長さによってまちまちである。テレビ受信機（ＴＶ）と壁の距離感から考えて、説明の簡単化のために、余分な経路の長さＬを３４ｃｍとすると、音速を３４０ｍ／ｓとすれば、その時間遅れ量は１ｍｓｅｃとなる。図８（ｂ）は、このようにＲｐを定義した場合の、ＤｐとＲｐが混じり合った際の周波数特性を示している。 That is, in addition to the original Do, originally unnecessary components are mixed in a size that is ½ of Do. Furthermore, this originally unnecessary component is delayed by y as compared with Do (x). This amount of time delay varies depending on the length of the route. Considering the distance between the TV receiver (TV) and the wall, for simplicity of explanation, if the length L of the extra path is 34 cm, the time delay amount is 1 msec if the speed of sound is 340 m / s. It becomes. FIG. 8B shows the frequency characteristics when Dp and Rp are mixed when Rp is defined in this way.

この図８（ｂ）に示す周波数特性においては、Ｄｏのみの場合の図８（ａ）に示す周波数特性と比べると、一定の間隔で±６ｄＢ程度の山谷が発生していることがわかる。これは、そもそもＤｐ，Ｒｐともに反射成分であるため、主とする音であるＤｐ自体が小さくなり、さらに反射成分であるＲｐの大きさに近く、しかもＲｐの時間遅れ量も比較的に短いために生じる現象である。よって、図７（ｂ）に比べると、特定の周波数においてはＤｐとＲｐが強めあい、また別の周波数においてはＤｐとＲｐが弱めあうという干渉作用が顕著にでて、違和感を生じることがある。 In the frequency characteristic shown in FIG. 8B, it can be seen that peaks and valleys of about ± 6 dB are generated at regular intervals as compared with the frequency characteristic shown in FIG. This is because both Dp and Rp are reflection components in the first place, the main sound Dp itself is small, and it is close to the magnitude of the reflection component Rp, and the time delay amount of Rp is relatively short. It is a phenomenon that occurs. Therefore, compared with FIG. 7B, the interference effect that Dp and Rp are strengthened at a specific frequency and Dp and Rp are weakened at another frequency is remarkable, which may cause a sense of discomfort. .

この干渉作用による山谷の間隔は、ＤｐとＲｐの時間遅れ量ｙが大きいほど間隔が短くなるものであり、図７（ｂ）と比べ、図８（ｂ）では山谷の間隔が長く、かつ山谷が大きいことがわかる。この現象は、単に２つの音を加算することを考えるとよくわかる。例えば、時間遅れのない同一の２つの信号を加算すれば、信号の大きさは２倍になる。次に、この信号がデジタル信号であるとし、片方の信号のみ１サンプルだけ遅らせて加算すると、低域では位相差が小さいため大きさはほぼ２倍近くになるが、高域になるにつれ大きさは小さくなっていく。 As the time delay amount y between Dp and Rp is larger, the interval between the peaks and valleys due to this interference action is shorter. In FIG. 8B, the interval between the peaks and valleys is longer than that in FIG. It can be seen that is large. This phenomenon is well understood by simply considering adding two sounds. For example, if the same two signals without time delay are added, the magnitude of the signal is doubled. Next, assuming that this signal is a digital signal, if only one of the signals is delayed and added, the phase difference is small in the low range, and the magnitude is almost doubled, but the magnitude increases as the high range is reached. Is getting smaller.

これは、ナイキスト周波数ギリギリの信号を１サンプルずらして加算すれば、完全に符号の異なる信号同士を加算するわけであるから、大きさは０になる、というローパス特性を有することから直感的に理解できる。時間遅れを２サンプル、３サンプルと遅らせていくと、単なるローパス特性ではなく、櫛歯状のフィルタ特性を有するようになり、時間遅れ量が大きいほど、櫛歯は細かくなってゆく。 This is intuitively understood because it has a low-pass characteristic that the magnitude is 0 because signals with completely different signs are added if the signals of the Nyquist frequency limit are shifted by one sample and added. it can. When the time delay is delayed by 2 samples and 3 samples, the filter has a comb-like filter characteristic rather than a simple low-pass characteristic, and the comb teeth become finer as the time delay amount increases.

さて、上述の違和感の原因としては、例えば音声の特徴である山の部分（フォルマント）と、干渉によって発生した谷が一致して、山を潰してしまうことが挙げられる。図７（ｂ）のように、干渉による山谷が細かく、音声の特徴の山や谷の１つ１つに干渉の山谷が複数個分入ったりしても、干渉の影響が山や谷ごとに平均化されるため音声の特徴は大きく崩れない。しかし、図８（ｂ）のように、干渉による山谷の間隔が大きいと、音声の特徴である特定の山が大きく潰れ、逆に谷の部分が不用に増強され、音声の特徴が大きく崩されてしまう。 As a cause of the above-mentioned uncomfortable feeling, for example, a mountain portion (formant) that is a feature of voice coincides with a valley generated by interference, and the mountain is crushed. As shown in FIG. 7B, even if the peaks and valleys due to interference are fine and there are multiple peaks and valleys of interference in each of the peaks and valleys of the voice feature, the influence of the interference is different for each peak or valley. Since it is averaged, the characteristics of the voice are not greatly changed. However, as shown in FIG. 8B, when the interval between the peaks and valleys due to the interference is large, a specific peak that is a feature of the voice is largely crushed, and conversely, the valley portion is unnecessarily enhanced, and the feature of the voice is greatly destroyed. End up.

このように音声の特徴が崩されてしまうと、「もぞもぞしゃべっているような感じ」や「しゃべっている内容が聞き取りにくい」といった音質劣化が生じる。これは、いわゆるフォルマントの構造が崩されるためである。フォルマントを取り除いて再生すると母音が認識でなくなることは良く知られた事実である。また、音楽においては、山に当たった特定の楽器のみが大きく聞こえ、谷に当たった別の楽器については小さく聞こえるようになり、楽曲全体のバランスがおかしくなる、といった音質劣化を生じる。 If the characteristics of the voice are disrupted in this way, sound quality degradation such as “feeling like talking” or “difficult to hear what is being spoken” occurs. This is because the so-called formant structure is destroyed. It is a well-known fact that vowels become unrecognized when formants are removed and played back. In music, only a specific musical instrument hitting a mountain can be heard loudly, and another musical instrument hitting a valley can be heard small, resulting in a sound quality deterioration such that the balance of the entire music becomes strange.

上述の問題は、周波数特性が単に低域が強いとか高域が弱いといった大局的な従来の視点からみたものではなく、周波数特性の微細構造に着目した、新しい問題の捉え方といえる。このような微細構造を放置したまま、低域や高域を大局的に調整しても、音質劣化は改善しないことがある。 The above-mentioned problem is not seen from a global conventional viewpoint that the frequency characteristic is simply strong in the low range or weak in the high range, but can be said to be a new way of grasping the problem focusing on the fine structure of the frequency characteristic. Even if such a fine structure is left untouched, the sound quality deterioration may not be improved even if the low and high frequencies are adjusted globally.

［本技術による問題解決方法］
図９は、上述の問題に対する本技術による解決方法の一例を示している。まず、図６に示したように、スピーカＳＰがテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置されているものとする。そして、Ｐ点にはスピーカＳＰから出た音がＳ１点で反射した主とする音Ｄｐと、スピーカＳＰから出た音がＳ２点およびＳ３点にて反射した副とする音Ｒｐの２つが届くこととする。ここで、図８で説明したように、ＤｐとＲｐの干渉によって、聴感上の違和感が発生しているとする。 [Problem solving method using this technology]
FIG. 9 shows an example of a solution according to the present technology for the above problem. First, as shown in FIG. 6, it is assumed that the speaker SP is disposed downward on the back side of the casing of the television receiver (TV). Then, the main sound Dp that the sound emitted from the speaker SP is reflected at the point S1 and the subordinate sound Rp that the sound emitted from the speaker SP is reflected at the points S2 and S3 reach the point P. I will do it. Here, as described with reference to FIG. 8, it is assumed that an uncomfortable feeling in hearing occurs due to the interference between Dp and Rp.

違和感の主な原因は本来不要なＲｐがＤｐに対して悪影響を及ぼすためである。そのため、本技術では、図に示すようにＲｐをキャンセルするための音Ｃｐを生成するキャンセル信号Ｃｉを予め生成して入力音声信号Ｄｉに重畳する。そして、スピーカＳＰから合成音Ｄｏ″（＝Ｄｏ＋Ｃｏ）として再生することで、Ｐ点におけるＲｐをキャンセルする。この処理を数式で表現すると、以下のようになる。 The main reason for the uncomfortable feeling is that Rp which is originally unnecessary has an adverse effect on Dp. Therefore, in the present technology, as shown in the figure, a cancel signal Ci for generating a sound Cp for canceling Rp is generated in advance and superimposed on the input audio signal Di. And by reproducing | regenerating as synthetic | combination sound Do "(= Do + Co) from the speaker SP, Rp in P point is canceled. When this process is expressed by numerical formula, it will become as follows.

まず、スピーカＳＰから発生させる音Ｄｏ″を、以下の（８）式のように定義する。この（８）式において、Ｃｏは、新たに加えられてスピーカＳＰから発生するキャンセル音である。また、ｙは、図６、図８で示した、スピーカＳＰからＳ１点を経由してＰ点に到達するまでの時間遅れと、スピーカＳＰからＳ２点およびＳ３点を経由してＰ点に到達するまでの時間遅れの差分である。
Ｄｏ″（０）＝Ｄｏ（０）＋Ｃｏ（ｙ）・・・（８） First, the sound Do ″ generated from the speaker SP is defined as the following equation (8). In this equation (8), Co is a canceling sound newly added and generated from the speaker SP. , Y reaches the point P from the speaker SP via the points S2 and S3, and the time delay from the speaker SP to the point P via the point S1, as shown in FIGS. It is the difference of the time delay until.
Do ″ (0) = Do (0) + Co (y) (8)

次に、Ｄｐを、以下の（９）式のように表現する。
Ｄｐ＝Ｄｏ″（０）＊ｔｆ１
＝Ｄｏ（ｘ）＊１／２＋Ｃｏ（ｘ＋ｙ）＊１／２・・・（９） Next, Dp is expressed as the following equation (9).
Dp = Do ″ (0) * tf1
= Do (x) * 1/2 + Co (x + y) * 1/2 (9)

さらに、Ｒｐを、以下の（１０）式のように表現する。
Ｒｐ＝Ｄｏ″（０）＊ｔｆ２＊ｔｆ３
＝Ｄｏ（ｘ＋ｙ）＊１／４＋Ｃｏ（ｘ＋２ｙ）＊１／４・・・（１０） Further, Rp is expressed as the following equation (10).
Rp = Do ″ (0) * tf2 * tf3
= Do (x + y) * 1/4 + Co (x + 2y) * 1/4 (10)

よって、Ｐ点に到達する全ての音Ｐａｌｌは、以下の（１１）式のように表すことができる。
Ｐａｌｌ＝Ｄｐ＋Ｒｐ
＝Ｄｏ（ｘ）＊１／２＋Ｃｏ（ｘ＋ｙ）＊１／２
＋Ｄｏ（ｘ＋ｙ）＊１／４＋Ｃｏ（ｘ＋２ｙ）＊１／４
・・・（１１） Therefore, all the sounds Pall reaching the point P can be expressed as the following equation (11).
Pall = Dp + Rp
= Do (x) * 1/2 + Co (x + y) * 1/2
+ Do (x + y) * 1/4 + Co (x + 2y) * 1/4
(11)

ここで、Ｃｏ（ｘ＋ｙ）＝ −Ｄｏ（ｘ＋ｙ）＊１／２、Ｃｏ（ｘ＋２ｙ）＝ −Ｄｏ（ｘ＋２ｙ）＊１／２とおくと、中間の２項がキャンセルされ、（１１）式は、以下の（１２）式のようになる。
Ｐａｌｌ＝Ｄｏ（ｘ）＊１／２− Ｄｏ（ｘ＋ｙ）＊１／４
＋Ｄｏ（ｘ＋ｙ）＊１／４− Ｄｏ（ｘ＋２ｙ）＊１／８
＝Ｄｏ（ｘ）＊１／２ −Ｄｏ（ｘ＋２ｙ）＊１／８
・・・（１２） Here, if Co (x + y) = − Do (x + y) * 1/2, Co (x + 2y) = − Do (x + 2y) * 1/2, the two intermediate terms are canceled, and the equation (11) is The following equation (12) is obtained.
Pall = Do (x) * 1 / 2-Do (x + y) * 1/4
+ Do (x + y) * 1 / 4-Do (x + 2y) * 1/8
= Do (x) * 1 / 2-Do (x + 2y) * 1/8
(12)

つまり、スピーカＳＰから、Ｄｏに加え、Ｄｏを１／２の大きさにして時間遅れ量ｙを与えたのち位相を反転した音を出せば、中間の２項がキャンセルされることとなる。この結果、従来は主とする音に対して１／２の大きさを持つ副とする音がｙ時間だけ遅れて加算されていたものが、本技術を用いることで主とする音の１／４の大きさを持つ副とする音が２ｙ時間だけ遅れて減算されるようになる。 In other words, if the speaker SP produces a sound with the phase reversed after the Do is made 1/2 and the time delay amount y is given in addition to Do, the two intermediate terms are canceled. As a result, in the past, a sub sound having a magnitude of 1/2 with respect to the main sound was added with a delay of y time. Sub sound having a magnitude of 4 is subtracted with a delay of 2y hours.

この場合、スピーカＳＰからの出力音Ｄｏ″は、以下の（１３）式で表される。このようにＤｏ″を設定すれば、Ｐ点にてＲｐを１／２の大きさに低減することが可能となる。
Ｄｏ″（０）＝Ｄｏ（０）＋Ｃｏ（ｙ）
＝Ｄｏ（０） − Ｄｏ（ｙ）＊１／２・・・（１３） In this case, the output sound Do ″ from the speaker SP is expressed by the following equation (13). If Do ″ is set in this way, Rp is reduced to ½ at the point P. Is possible.
Do ″ (0) = Do (0) + Co (y)
= Do (0)-Do (y) * 1/2 (13)

［反射率を一般化してαとした場合の式］
なお、上述は、（５）式に示したように、反射率（反射量）を１／２とした場合である。反射率を一般化して、１／αとおくと、（５）式は、以下の（１４）式のように変形される。
Ｄｐ＝Ｄｏ（０）＊ｔｆ１＝Ｄｏ（ｘ）＊１／α ・・・（１４） [Expression when the reflectance is generalized to α]
The above is a case where the reflectance (reflection amount) is halved as shown in the equation (5). If the reflectance is generalized to 1 / α, the equation (5) is transformed into the following equation (14).
Dp = Do (0) * tf1 = Do (x) * 1 / α (14)

さらに、（６）式は、以下の（１５）式のように変形される。
Ｒｐ＝Ｄｏ（０）＊ｔｆ２＊ｔｆ３
＝Ｄｏ（ｘ＋ｙ）＊１／α ＊１／α
＝Ｄｏ（ｘ＋ｙ）＊（１／α）^２・・・（１５） Furthermore, the equation (6) is transformed into the following equation (15).
Rp = Do (0) * tf2 * tf3
= Do (x + y) * 1 / α * 1 / α
= Do (x + y) * (1 / α) ² (15)

よって、Ｐ点に到達する全ての音Ｐａｌｌは、以下の（１６）式で表すことができる。つまり、本来のＤｏに加え、本来不要な成分がＤｏの１／α倍の大きさで混じり合っていることになる。
Ｐａｌｌ＝Ｄｐ＋Ｒｐ
＝Ｄｏ（ｘ）＊１／α ＋Ｄｏ（ｘ＋ｙ）＊（１／α）^２
・・・（１６） Therefore, all the sounds Pall reaching the point P can be expressed by the following equation (16). That is, in addition to the original Do, originally unnecessary components are mixed in a size 1 / α times Do.
Pall = Dp + Rp
= Do (x) * 1 / α + Do (x + y) * (1 / α) ²
... (16)

ここで、図９で示したように、ＲｐをキャンセルさせるためにスピーカＳＰから発生させる音Ｄｏ″を、以下の（１７）式のように定義する。この（１７）式において、Ｃｏは、新たに加えられてスピーカＳＰから発生するキャンセル音である。また、ｙは、図６、図８で示した、スピーカＳＰからＳ１点を経由してＰ点に到達するまでの時間遅れと、スピーカＳＰからＳ２点およびＳ３点を経由してＰ点に到達するまでの時間遅れの差分である。
Ｄｏ″（０）＝Ｄｏ（０）＋Ｃｏ（ｙ）・・・（１７） Here, as shown in FIG. 9, the sound Do ″ generated from the speaker SP to cancel Rp is defined as the following equation (17). In this equation (17), Co is a new one. 6 is a canceling sound generated from the speaker SP, and y is the time delay from the speaker SP to the point P via the point S1 as shown in FIGS. Is the difference in time delay until the point P is reached via the points S2 and S3.
Do ″ (0) = Do (0) + Co (y) (17)

次に、Ｄｐを、以下の（１８）式のように表現する。
Ｄｐ＝Ｄｏ″（０）＊ｔｆ１
＝Ｄｏ（ｘ）＊１／α ＋Ｃｏ（ｘ＋ｙ）＊１／α
・・・（１８） Next, Dp is expressed as the following equation (18).
Dp = Do ″ (0) * tf1
= Do (x) * 1 / α + Co (x + y) * 1 / α
... (18)

さらに、Ｒｐを、以下の（１９）式のように表現する。
Ｒｐ＝Ｄｏ″（０）＊ｔｆ２＊ｔｆ３
＝Ｄｏ（ｘ＋ｙ）＊（１／α）^２
＋Ｃｏ（ｘ＋２ｙ）＊（１／α）^２・・・（１９） Further, Rp is expressed as the following equation (19).
Rp = Do ″ (0) * tf2 * tf3
= Do (x + y) * (1 / α) ²
+ Co (x + 2y) * (1 / α) ² (19)

よって、Ｐ点に到達する全ての音Ｐａｌｌは、以下の（２０）式のように表すことができる。
Ｐａｌｌ＝Ｄｐ＋Ｒｐ
＝Ｄｏ（ｘ）＊１／α ＋Ｃｏ（ｘ＋ｙ）＊１／α
＋Ｄｏ（ｘ＋ｙ）＊（１／α）^２
＋Ｃｏ（ｘ＋２ｙ）＊（１／α）^２・・・（２０） Therefore, all the sounds Pall reaching the point P can be expressed as the following equation (20).
Pall = Dp + Rp
= Do (x) * 1 / α + Co (x + y) * 1 / α
+ Do (x + y) * (1 / α) ²
+ Co (x + 2y) * (1 / α) ² (20)

ここで、Ｃｏ（ｘ＋ｙ）＝ −Ｄｏ（ｘ＋ｙ）＊１／α、Ｃｏ（ｘ＋２ｙ）＝ −Ｄｏ（ｘ＋２ｙ）＊１／αとおくと、中間の２項がキャンセルされ、（２０）式は、以下の（２１）式のようになる。
Ｐａｌｌ＝Ｄｏ（ｘ）＊１／α− Ｄｏ（ｘ＋ｙ）＊（１／α）^２
＋Ｄｏ（ｘ＋ｙ）＊（１／α）^２− Ｄｏ（ｘ＋２ｙ）＊（１／α）^３
＝Ｄｏ（ｘ）＊１／α − Ｄｏ（ｘ＋２ｙ）＊（１／α）^３
・・・（２１） Here, if Co (x + y) = − Do (x + y) * 1 / α and Co (x + 2y) = − Do (x + 2y) * 1 / α, the intermediate two terms are canceled, and the equation (20) is The following equation (21) is obtained.
Pall = Do (x) * 1 / α−Do (x + y) * (1 / α) ²
+ Do (x + y) * (1 / α) ² −Do (x + 2y) * (1 / α) ³
= Do (x) * 1 / α-Do (x + 2y) * (1 / α) ³
... (21)

つまり、スピーカＳＰから、Ｄｏに加え、Ｄｏを１／α倍の大きさにして時間遅れ量ｙを与えたのち位相を反転した音を出せば、中間の２項がキャンセルされることとなる。この結果、従来は主たる音に対して１／α倍の大きさを持つ副たる音がｙ時間だけ遅れて加算されていたものが、本技術を用いることで主たる音の（１／α）^２倍の大きさを持つ副たる音が２ｙ時間だけ遅れて減算されるようになる。 In other words, if the speaker SP outputs a sound in which the phase is reversed after setting Do to 1 / α times in addition to Do and then giving the time delay amount y, the two intermediate terms are canceled. As a result, in the past, a sub sound having a magnitude of 1 / α times that of the main sound was added with a delay of y hours, but the (1 / α) ² of the main sound is obtained by using this technology. A secondary sound having a double magnitude is subtracted with a delay of 2y hours.

この場合、スピーカＳＰからの出力音Ｄｏ″は、以下の（２２）式で表される。このようにＤｏ″を設定すれば、Ｐ点にてＲｐを１／α倍の大きさに低減することが可能となる。つまり、α＞１という条件を満たせば、反射音の影響を低減できる、ということになる。
Ｄｏ″（０）＝Ｄｏ（０）＋Ｃｏ（ｙ）
＝Ｄｏ（０） − Ｄｏ（ｙ）＊１／α ・・・（２２） In this case, the output sound Do ″ from the speaker SP is expressed by the following equation (22). If Do ″ is set in this way, Rp is reduced to 1 / α times the magnitude at the point P. It becomes possible. That is, if the condition of α> 1 is satisfied, the influence of reflected sound can be reduced.
Do ″ (0) = Do (0) + Co (y)
= Do (0) −Do (y) * 1 / α (22)

［音響再生装置の構成例］
図１０は、第１の実施の形態としての音響再生装置１００の構成例を示している。この音響再生装置１００は、デジタルシグナルプロセッサ（ＤＳＰ：Digital Signal Processor）１０１と、アンプ１０２と、スピーカ１０３を有している。デジタルシグナルプロセッサ１０１は、音声信号処理部を構成している。また、アンプ１０２は、音声信号増幅部を構成している。 [Configuration example of sound reproduction device]
FIG. 10 shows a configuration example of the sound reproducing device 100 as the first embodiment. The sound reproducing device 100 includes a digital signal processor (DSP) 101, an amplifier 102, and a speaker 103. The digital signal processor 101 constitutes an audio signal processing unit. The amplifier 102 constitutes an audio signal amplification unit.

デジタルシグナルプロセッサ１０１は、入力音声信号Ｄｉを処理して、出力音声信号Ｄｉ″を得る。このデジタルシグナルプロセッサ１０１は、遅延部１１１、ゲイン調整部１１２および加算部１１３により構成されている。遅延部１１１およびゲイン調整部１１２は、キャンセル用音声信号を生成するキャンセル用音声信号生成部を構成している。加算部１１３は、入力音声信号Ｄｉにキャンセル用音声信号を加算して、出力音声信号Ｄｉ″を得る。 The digital signal processor 101 processes the input audio signal Di to obtain an output audio signal Di ″. The digital signal processor 101 includes a delay unit 111, a gain adjustment unit 112, and an addition unit 113. The delay unit. 111 and the gain adjusting unit 112 constitute a canceling audio signal generating unit that generates a canceling audio signal, and an adding unit 113 adds the canceling audio signal to the input audio signal Di and outputs the output audio signal Di. ″ Get.

キャンセル用音声信号は、スピーカ１０３から出力されて受聴点Ｐに到達する反射音のうち、第１の反射音（主とする音）が到達した後に到達する所定の第２の反射音（副とする音）をキャンセルするための音声信号である。すなわち、遅延部１１１は、入力音声信号Ｄｉを、受聴点Ｐに到達する第１の反射音と第２の反射音との時間差に対応する時間だけ遅延させる。また、ゲイン調整部１１２は、遅延部１１１の出力信号のゲインを反射量に応じて調整してキャンセル用音声信号を得る。ここで、第１の反射音は図９のＤｐに同等し、第２の反射音は図９のＲｐに相当する。 The cancellation audio signal is a predetermined second reflected sound (secondary sound) that arrives after the first reflected sound (main sound) reaches among the reflected sounds that are output from the speaker 103 and reach the listening point P. Is a sound signal for canceling. That is, the delay unit 111 delays the input audio signal Di by a time corresponding to the time difference between the first reflected sound and the second reflected sound that reaches the listening point P. In addition, the gain adjustment unit 112 adjusts the gain of the output signal of the delay unit 111 according to the amount of reflection to obtain a cancellation audio signal. Here, the first reflected sound is equivalent to Dp in FIG. 9, and the second reflected sound is equivalent to Rp in FIG.

アンプ１０２は、デジタルシグナルプロセッサ１０１で得られた出力音声信号Ｄｉ″を増幅してスピーカ１０３に供給する。スピーカ１０３は、電子機器、例えばテレビ受信機（ＴＶ）の筐体の背面側に下向きで配置されている（図９のスピーカＳＰ参照）。そのため、使用者は、このスピーカ１０３から出力されて受聴点Ｐに到達する反射音を受聴する。なお、アンプ１０２はアナログである場合が多い。アンプ１０２がアナログである場合、Ｄｉ″はデジタル信号からアナログ信号に変換されてからアンプ１０２に供給される必要がある。しかし、昨今はデジタルアンプも一般化しているし、説明の簡単化のために、アンプ１０２にはデジタル信号をそのまま伝送可能ということにする。スピーカ１０３は、アンプ１０２から供給される音声信号による音声を出力する The amplifier 102 amplifies the output audio signal Di ″ obtained by the digital signal processor 101 and supplies it to the speaker 103. The speaker 103 faces downward on the back side of the casing of an electronic device, for example, a television receiver (TV). Therefore, the user listens to the reflected sound that is output from the speaker 103 and reaches the listening point P. The amplifier 102 is often analog. When the amplifier 102 is analog, Di ″ needs to be supplied to the amplifier 102 after being converted from a digital signal to an analog signal. However, recently, digital amplifiers are also common, and for simplicity of explanation, it is assumed that digital signals can be transmitted to the amplifier 102 as they are. The speaker 103 outputs sound based on the sound signal supplied from the amplifier 102.

図１０に示す音響再生装置１００の動作を説明する。入力音声信号Ｄｉは、デジタルシグナルプロセッサ１０１に供給される。すなわち、この音声信号Ｄｉは、加算部１１３に供給されると共に、遅延部１１１に供給される。遅延部１１１では、入力音声信号Ｄｉが遅延される。ここで、入力音声信号をＤｉ（ｔ）とするとき、遅延部１１１では入力音声信号Ｄｉがｎサンプル分保持され、遅延音声信号Ｄｉ（ｔ＋ｎ）が出力される。 The operation of the sound reproducing device 100 shown in FIG. 10 will be described. The input audio signal Di is supplied to the digital signal processor 101. That is, the audio signal Di is supplied to the adding unit 113 and also supplied to the delay unit 111. In the delay unit 111, the input audio signal Di is delayed. Here, when the input audio signal is Di (t), the delay unit 111 holds the input audio signal Di for n samples and outputs the delayed audio signal Di (t + n).

次に、この遅延音声信号Ｄｉ（ｔ＋ｎ）は、ゲイン調整部１１２に供給される。このゲイン調整部１１２では、ゲインＧｉが調整され、ゲイン調整された音声信号Ｄｉ（ｔ＋ｎ）＊Ｇｉが得られる。この音声信号Ｄｉ（ｔ＋ｎ）＊Ｇｉは、加算部１１３に供給される。そして、加算部１１３では、出力音声信号Ｄｉ″として、Ｄｉ（ｔ）＋Ｄｉ（ｔ＋ｎ）＊Ｇｉが得られる。この出力音声信号Ｄｉ″は、アンプ１０２で電力増幅されたのちスピーカ１０３に送られ、このスピーカ１０３からは、音Ｄｏ″が出力される。 Next, the delayed audio signal Di (t + n) is supplied to the gain adjustment unit 112. In the gain adjusting unit 112, the gain Gi is adjusted, and a gain-adjusted audio signal Di (t + n) * Gi is obtained. The audio signal Di (t + n) * Gi is supplied to the adder 113. Then, the adder 113 obtains Di (t) + Di (t + n) * Gi as the output audio signal Di ″. The output audio signal Di ″ is amplified by the amplifier 102 and then sent to the speaker 103. The speaker 103 outputs a sound Do ″.

ここで、以下の（２３）式が成り立っている。
Ｄｉ″ ＝Ｄｉ（ｔ）＋Ｄｉ（ｔ＋ｎ）＊Ｇｉ・・・（２３） Here, the following equation (23) is established.
Di ″ = Di (t) + Di (t + n) * Gi (23)

ｔ＝０、ｎ＝ｙ、さらに、Ｇｉ＝ − １／２とすれば、（２３）式は、以下の（２４）式となる。この（２４）式のＤｉ″を、図９で得られたＤｏ″（（１３）式参照）と比較すれば、これが正にＤｏ″を出力するための入力信号Ｃｉであることがわかる。
Ｄｉ″ ＝Ｄｉ（０） −Ｄｉ（ｙ）＊１／２・・・（２４） If t = 0, n = y, and Gi = −1 / 2, the equation (23) becomes the following equation (24). Comparing Di ″ in the equation (24) with Do ″ (see equation (13)) obtained in FIG. 9, it can be seen that this is the input signal Ci for outputting Do ″ exactly.
Di ″ = Di (0) −Di (y) * 1/2 (24)

上述したように、図１０に示す音響再生装置１００においては、スピーカ１０３（スピーカＳＰ）から出力される音が周囲の物体に反射した後に受聴点Ｐに到達する反射音Ｄｐを良好に受聴可能となる。また、図１０に示す音響再生装置１００においては、従来の信号処理技術と比べて平易、かつ低コストにも関わらず、効果的な音質調整を実現し、音質的に不利なスピーカ設置位置であっても、「良い音」を楽しめるようになる。 As described above, in the sound reproducing device 100 shown in FIG. 10, it is possible to satisfactorily listen to the reflected sound Dp that reaches the listening point P after the sound output from the speaker 103 (speaker SP) is reflected by a surrounding object. Become. In addition, the sound reproducing device 100 shown in FIG. 10 achieves effective sound quality adjustment and is a disadvantageous sound quality speaker installation position, despite being simpler and lower in cost than the conventional signal processing technology. However, you can enjoy “good sound”.

図１１は、図１０に示す音響再生装置１００の効果を示している。図１１（ａ）は、本技術の適用前のＰａｌｌの周波数特性を示している。図１１（ｂ）は、本技術の適用後のＰａｌｌの周波数特性を示している。干渉によって生じる山谷の大きさが適用の前後で±６ｄＢから±３ｄＢ程に減少していることがわかる。これにより、干渉によって生じる山谷が音声の特徴を崩すことを効果的に低減させることが可能であり、またその山谷の間隔も密にすることで影響が平均化されやすくなる。そのため、音質劣化を効果的に抑制することが可能となる。 FIG. 11 shows the effect of the sound reproducing device 100 shown in FIG. FIG. 11A shows the frequency characteristics of Pall before application of the present technology. FIG. 11B shows the frequency characteristics of Pall after application of the present technology. It can be seen that the size of the peaks and valleys caused by the interference decreases from ± 6 dB to ± 3 dB before and after application. As a result, it is possible to effectively reduce the peaks and valleys caused by the interference from destroying the voice characteristics, and the influences are easily averaged by making the intervals between the peaks and valleys close. Therefore, it is possible to effectively suppress sound quality deterioration.

＜２．第２の実施の形態＞
［遅延量の調整］
図８、図１１において、本技術の有効性を説明した。しかし、例えば、テレビ受信機（ＴＶ）を、台の上のどこにおくかで主とする音Ｄｐと副とする音Ｒｐの関係は変化する。例えば、図１２は、テレビ受信機（ＴＶ）を台のＰ点よりに設置した場合を示している。この場合、Ｒｐは、図９のＲｐに比べて、Ｄｐを基準とすると、相対的により遅く届く。図１３は、テレビ受信機（ＴＶ）を台の壁よりに設置した場合を示している。この場合、Ｒｐは、図９のＲｐに比べて、Ｄｐを基準とすると、相対的により速く届く。 <2. Second Embodiment>
[Adjust delay amount]
The effectiveness of the present technology has been described with reference to FIGS. However, for example, the relationship between the main sound Dp and the subordinate sound Rp changes depending on where the television receiver (TV) is placed on the stand. For example, FIG. 12 shows a case where a television receiver (TV) is installed from the point P of the stand. In this case, Rp arrives relatively later with respect to Dp as compared to Rp in FIG. FIG. 13 shows a case where a television receiver (TV) is installed on the base wall. In this case, Rp reaches faster relative to Rp in FIG. 9 when Dp is used as a reference.

つまり、図９の状態で効果が得られていても、テレビ受信機（ＴＶ）の設置位置を変更すると効果が得られなくなってしまう場合がある。これは、以下のことからも言える。上述したように、スピーカＳＰから発生させる音Ｄｏ″を、以下の（２５）式のように定義した。なお、この（２５）式は、上述の（８）式と同じものである。
Ｄｏ″（０）＝Ｄｏ（０）＋Ｃｏ（ｙ）・・・（２５） That is, even if the effect is obtained in the state of FIG. 9, the effect may not be obtained if the installation position of the television receiver (TV) is changed. This can also be said from the following. As described above, the sound Do ″ generated from the speaker SP is defined as the following equation (25). The equation (25) is the same as the above equation (8).
Do ″ (0) = Do (0) + Co (y) (25)

この（２５）式において、ｙ、つまり、Ｄｐを基準とするＲｐの時間遅れ量ｙが適切でないと、以下の（２６）式の中間の２項のｙが一致しなくなることから、ＤｐとＲｐの干渉を抑制できなくなるためである。なお、この（２６）式は、上述の（１２）式に含まれている。
Ｐａｌｌ＝Ｄｏ（ｘ）＊１／２− Ｄｏ（ｘ＋ｙ）＊１／４
＋Ｄｏ（ｘ＋ｙ）＊１／４ −Ｄｏ（ｘ＋２ｙ）＊１／８
・・・（２６） In this equation (25), if y, that is, the time delay amount y of Rp with reference to Dp is not appropriate, y in the middle two terms of the following equation (26) will not match, so Dp and Rp This is because the interference cannot be suppressed. The equation (26) is included in the above equation (12).
Pall = Do (x) * 1 / 2-Do (x + y) * 1/4
+ Do (x + y) * 1 / 4-Do (x + 2y) * 1/8
... (26)

［音響再生装置の構成例］
図１４は、第２の実施の形態としての音響再生装置１００Ａの構成例を示している。この図１４において、図１０と対応する部分には、同一符号を付し、その詳細説明は省略する。この音響再生装置１００Ａは、デジタルシグナルプロセッサ１０１と、アンプ１０２と、スピーカ１０３と、制御部１０４と、ユーザ入力部１０５と、表示部１０６を有している。ユーザ入力部１０５および表示部１０６は、ユーザインタフェースを構成している。 [Configuration example of sound reproduction device]
FIG. 14 shows a configuration example of a sound reproducing device 100A as the second embodiment. In FIG. 14, parts corresponding to those in FIG. 10 are denoted by the same reference numerals, and detailed description thereof is omitted. The sound reproducing device 100A includes a digital signal processor 101, an amplifier 102, a speaker 103, a control unit 104, a user input unit 105, and a display unit 106. The user input unit 105 and the display unit 106 constitute a user interface.

制御部１０４は、ユーザ入力部１０５から使用者（ユーザ）が入力するスピーカ１０３から壁までの距離の情報に基づいて、遅延部１１１における遅延量を制御する。この制御部１０４には、ユーザ入力部１０５の他に、表示部１０６が接続されている。ユーザ入力部１０５は、例えば、テレビ受信機（ＴＶ）の筐体に配置された操作ボタン、操作摘み、さらには、リモートコントロール装置等である。表示部１０６は、液晶表示素子などで構成されるが、テレビ受信機（ＴＶ）の画像表示部を兼用することもできる。 The control unit 104 controls the delay amount in the delay unit 111 based on the distance information from the speaker 103 to the wall input by the user (user) from the user input unit 105. In addition to the user input unit 105, a display unit 106 is connected to the control unit 104. The user input unit 105 is, for example, an operation button, an operation knob, a remote control device, or the like disposed on a housing of a television receiver (TV). The display unit 106 includes a liquid crystal display element or the like, but can also serve as an image display unit of a television receiver (TV).

使用者（ユーザ）がユーザ入力部１０５から上述の距離情報を入力する際には、表示部１０６に、図１５に示すように、距離の情報をユーザが入力するためのＧＵＩ（Graphical User Interface）表示が行われる。使用者（ユーザ）は、このＧＵＩ表示を用いて、スピーカ１０３から壁までの距離の情報を入力する。このようにユーザ入力部１０５から距離の情報が入力されることで、制御部１０４により、遅延部１１１における遅延量は、スピーカ１０３から壁までの実際の距離に応じた値に調整される。 When the user (user) inputs the above-described distance information from the user input unit 105, a GUI (Graphical User Interface) for the user to input distance information on the display unit 106 as shown in FIG. Display is performed. The user (user) inputs information on the distance from the speaker 103 to the wall using this GUI display. Thus, by inputting the distance information from the user input unit 105, the control unit 104 adjusts the delay amount in the delay unit 111 to a value corresponding to the actual distance from the speaker 103 to the wall.

上述したように、（２６）式の中間２項がキャンセルしなくなる理由は、それぞれのｙが一致しないためである。これは、上述の（２５）式におけるｙを調整することで解決可能な問題である。図１４に示す音響再生装置１００Ａにおいては、使用者（ユーザ）がユーザ入力部１０５により、スピーカ１０３から壁までの距離の情報を入力して、ｙを適切に調整でき、上述の問題を解決できる。 As described above, the reason why the middle two terms in the equation (26) are not canceled is that the respective y's do not match. This is a problem that can be solved by adjusting y in the above equation (25). In the sound reproducing device 100A shown in FIG. 14, the user (user) can input information on the distance from the speaker 103 to the wall by the user input unit 105 to appropriately adjust y, and the above-described problem can be solved. .

例えば、テレビ受信機（ＴＶ）が相対的にＰ点に近い場合には、壁までの距離が長くなるため、使用者（ユーザ）は、背面距離を長く設定する。逆に、テレビ受信機（ＴＶ）が相対的にＰ点から遠い場合には、壁までの距離が短くなるため、使用者（ユーザ）は、背面距離を短く設定する。 For example, when the television receiver (TV) is relatively close to the point P, the distance to the wall becomes long, so the user (user) sets the back distance long. Conversely, when the television receiver (TV) is relatively far from the point P, the distance to the wall is shortened, so the user (user) sets the back distance short.

背面距離を長くするということは、Ｄｐに比べＲｐの遅れが大きくなることである。そのため、この場合、制御部１０４は、ｙを増やす方向に調整を行う。逆に、背面距離を短くするということは、ＤｐとＲｐの遅れが少なくなることである。そのため、この場合、制御部１０４は、ｙを減らす方向に調整を行う。このように、使用者（ユーザ）がユーザ入力部１０５、例えばリモートコントロール装置等を操作して背面距離を変えることで、テレビ受信機（ＴＶ）の様々な設置位置に応じて適切な調整が可能となる。 Increasing the back surface distance means that the delay of Rp becomes larger than Dp. Therefore, in this case, the control unit 104 adjusts in a direction to increase y. Conversely, reducing the back distance means that the delay between Dp and Rp is reduced. Therefore, in this case, the control unit 104 performs adjustment in a direction to reduce y. As described above, the user (user) operates the user input unit 105, for example, a remote control device to change the back surface distance, so that appropriate adjustment can be made according to various installation positions of the television receiver (TV). It becomes.

＜３．第３の実施の形態＞
［距離センサ］
図１４に示す音響再生装置１００Ａにおいては、使用者（ユーザ）がユーザ入力部１０５により、スピーカ１０３から壁までの距離の情報を入力可能としている。しかし、この距離の情報を、例えば、図１６に示すように、テレビ受信機（ＴＶ）の筐体の背面側に距離センサ１０７を設置し、この距離センサ１０７から得ることも考えられる。このように距離センサ１０７を設けることで、使用者（ユーザ）がユーザ入力部１０５から距離情報を入力する手間を回避できる。 <3. Third Embodiment>
[Distance sensor]
In the sound reproducing device 100 </ b> A shown in FIG. 14, a user (user) can input information on the distance from the speaker 103 to the wall by the user input unit 105. However, it is also conceivable that distance information 107 is obtained from the distance sensor 107 by installing a distance sensor 107 on the back side of the housing of the television receiver (TV) as shown in FIG. By providing the distance sensor 107 in this way, it is possible to avoid the trouble of the user (user) inputting distance information from the user input unit 105.

［音響再生装置の構成例］
図１７は、第３の実施の形態としての音響再生装置１００Ｂの構成例を示している。この図１７において、図１４と対応する部分には、同一符号を付し、その詳細説明は省略する。この音響再生装置１００Ｂは、デジタルシグナルプロセッサ１０１と、アンプ１０２と、スピーカ１０３と、制御部１０４Ｂと、ユーザ入力部１０５と、表示部１０６と、距離センサ１０７を有している。この距離センサ１０７は、例えば、赤外線距離センサ等である。 [Configuration example of sound reproduction device]
FIG. 17 shows a configuration example of a sound reproduction device 100B as the third embodiment. In FIG. 17, portions corresponding to those in FIG. 14 are denoted by the same reference numerals, and detailed description thereof is omitted. The sound reproducing device 100B includes a digital signal processor 101, an amplifier 102, a speaker 103, a control unit 104B, a user input unit 105, a display unit 106, and a distance sensor 107. The distance sensor 107 is, for example, an infrared distance sensor.

制御部１０４Ｂは、距離センサ１０７で得られるスピーカ１０３から壁までの距離の情報に基づいて、遅延部１１１における遅延量を制御する。これにより、遅延部１１１における遅延量は、スピーカ１０３から壁までの実際の距離に応じた値とされる。 The control unit 104B controls the delay amount in the delay unit 111 based on the distance information from the speaker 103 to the wall obtained by the distance sensor 107. Thereby, the delay amount in the delay unit 111 is set to a value according to the actual distance from the speaker 103 to the wall.

また、制御部１０４Ｂは、ユーザ入力部１０５から使用者（ユーザ）が入力する反射量の情報に基づいて、ゲイン調整部１１２におけるゲインを制御する。この制御部１０４Ｂには、ユーザ入力部１０５の他に、表示部１０６が接続されている。ユーザ入力部１０５は、例えば、テレビ受信機（ＴＶ）の筐体に配置された操作ボタン、操作摘み、さらには、リモートコントロール装置等である。表示部１０６は、液晶表示素子などで構成されるが、テレビ受信機（ＴＶ）の画像表示部を兼用することもできる。 The control unit 104 </ b> B controls the gain in the gain adjustment unit 112 based on the reflection amount information input by the user (user) from the user input unit 105. In addition to the user input unit 105, a display unit 106 is connected to the control unit 104B. The user input unit 105 is, for example, an operation button, an operation knob, a remote control device, or the like disposed on a housing of a television receiver (TV). The display unit 106 includes a liquid crystal display element or the like, but can also serve as an image display unit of a television receiver (TV).

使用者（ユーザ）がユーザ入力部１０５から上述の反射量情報を入力する際には、表示部１０６に、図１８に示すように、反射量の情報をユーザが入力するためのＧＵＩ（Graphical User Interface）表示が行われる。使用者（ユーザ）は、このＧＵＩ表示を用いて、反射量の情報を入力する。このようにユーザ入力部１０５から反射量の情報が入力されることで、制御部１０４Ｂにより、ゲイン調整部１１２におけるゲインは、入力された反射量の情報に応じた値に調整される。 When the user (user) inputs the above-described reflection amount information from the user input unit 105, a GUI (Graphical User) for the user to input the reflection amount information to the display unit 106 as shown in FIG. Interface) display is performed. A user (user) inputs information on the amount of reflection using the GUI display. Thus, by inputting the reflection amount information from the user input unit 105, the gain in the gain adjustment unit 112 is adjusted to a value according to the input reflection amount information by the control unit 104B.

＜４．変形例＞
なお、上述実施の形態においては、スピーカ１０３（スピーカＳＰ）が、テレビ受信機（ＴＶ）の筐体の背面側に下向きで設置される場合を例にとって説明した。しかし、本技術は、スピーカが同様の状態に設置されるその他の電子機器、例えばフォトフレーム等にも同様に適用できる。 <4. Modification>
In the above-described embodiment, the case where the speaker 103 (speaker SP) is installed downward on the back side of the housing of the television receiver (TV) has been described as an example. However, the present technology can be similarly applied to other electronic devices in which the speaker is installed in the same state, such as a photo frame.

また、上述実施の形態においては、キャンセル用音声信号を生成する系を１系統だけ備えている（図１０、図１４、図１７参照）。遅延量およびゲインを異にする複数の反射音をキャンセルするために、キャンセル用音声信号を生成する系を複数系統備える構成も考えられる。 In the above-described embodiment, only one system for generating a canceling audio signal is provided (see FIGS. 10, 14, and 17). In order to cancel a plurality of reflected sounds having different delay amounts and gains, a configuration including a plurality of systems for generating a canceling audio signal is also conceivable.

また、本技術は、以下のような構成を取ることもできる。
（１）スピーカから出力されて受聴点に到達する反射音のうち、第１の反射音が到達した後に到達する所定の第２の反射音をキャンセルするためのキャンセル用音声信号を、入力音声信号に基づいて生成するキャンセル用音声信号生成部と、
上記入力音声信号に、上記キャンセル用音声信号生成部で生成されたキャンセル用音声信号を加算して出力音声信号を得る音声信号加算部と
を備える音声信号処理装置。
（２）上記キャンセル用音声信号生成部は、上記入力音声信号を、上記受聴点に到達する上記第１の反射音と上記第２の反射音との時間差に対応する時間だけ遅延させる遅延部と、
上記遅延部の出力信号のゲインを反射量に応じて調整して上記キャンセル用音声信号を得るゲイン調整部とを有する
前記（１）に記載の音声信号処理装置。
（３）上記遅延部における遅延量を調整するための情報として上記スピーカの設置位置から壁までの距離の情報をユーザが入力するためのユーザインタフェースと、
上記ユーザインタフェースで入力される上記距離の情報に基づいて、上記遅延部における遅延量を制御する制御部とをさらに備える
前記（２）に記載の音声信号処理装置。
（４）上記ユーザインタフェースは、
上記距離の情報をユーザが入力するためのＧＵＩ表示を行う表示部と、
上記表示部に表示されたＧＵＩ表示を用いて上記距離の情報をユーザが入力するためのユーザ入力部とを有する
前記（３）に記載の音声信号処理装置。
（５）上記スピーカの設定位置から壁までの距離の情報を得る距離センサと、
上記距離センサで得られる上記距離の情報に基づいて、上記遅延部における遅延量を制御する制御部とをさらに備える
前記（２）に記載の音声信号処理装置。
（６）上記ゲイン調整部におけるゲインを調整するための情報として反射量の情報をユーザが入力するためのユーザインタフェースと、
上記ユーザインタフェースで入力される上記反射量の情報に基づいて、上記ゲイン調整部におけるゲインを制御する制御部とをさらに備える
前記（１）から（５）のいずれかに記載の音声信号処理装置。
（７）上記ユーザインタフェースは、
上記反射量の情報をユーザが入力するためのＧＵＩ表示を行う表示部と、
上記表示部に表示されたＧＵＩ表示を用いて上記反射量の情報をユーザが入力するためのユーザ入力部とを有する
前記（６）に記載の音声信号処理装置。
（８）上記スピーカは、表示機器の筐体の背面側に下向きに取り付けられている
前記（１）から（７）のいずれかに記載の音声信号処理装置。
（９）スピーカから出力されて受聴点に到達する反射音のうち、第１の反射音が到達した後に到達する所定の第２の反射音をキャンセルするためのキャンセル用音声信号を、入力音声信号に基づいて生成するステップと、
上記生成されたキャンセル用音声信号を上記入力音声信号に加算して出力音声信号を得るステップと
を備える音声信号処理方法。
（１０）表示機器の筐体の背面側に下向きに取り付けられたスピーカと、
入力音声信号を処理して出力音声信号を得る音声信号処理部と、
上記音声信号処理部で得られた出力音声信号を増幅して上記スピーカに供給する音声信号増幅部とを備え、
上記音声信号処理部は、
上記スピーカから出力されて受聴点に到達する反射音のうち、第１の反射音が到達した後に到達する所定の第２の反射音をキャンセルするためのキャンセル用音声信号を、上記入力音声信号に基づいて生成するキャンセル用音声信号生成部と、
上記入力音声信号に、上記キャンセル用音声信号生成部で生成されたキャンセル用音声信号を加算して上記出力音声信号を得る音声信号加算部とを有する
音響再生装置。
（１１）上記表示機器は、テレビ受信機である
前記（１０）に記載の音響再生装置。 Moreover, this technique can also take the following structures.
(1) Among the reflected sounds that are output from the speaker and reach the listening point, a cancellation audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound arrives is designated as an input audio signal. An audio signal generation unit for cancellation generated based on
An audio signal processing device comprising: an audio signal adding unit that adds the cancellation audio signal generated by the cancellation audio signal generation unit to the input audio signal to obtain an output audio signal.
(2) The cancellation audio signal generation unit delays the input audio signal by a time corresponding to a time difference between the first reflected sound and the second reflected sound that reaches the listening point; ,
The audio signal processing device according to (1), further including: a gain adjusting unit that adjusts a gain of the output signal of the delay unit according to a reflection amount to obtain the canceling audio signal.
(3) a user interface for the user to input information on the distance from the speaker installation position to the wall as information for adjusting the delay amount in the delay unit;
The audio signal processing device according to (2), further including: a control unit that controls a delay amount in the delay unit based on the distance information input through the user interface.
(4) The user interface is
A display unit for displaying a GUI for the user to input the distance information;
The audio signal processing device according to (3), further including: a user input unit for a user to input the distance information using a GUI display displayed on the display unit.
(5) a distance sensor that obtains information on the distance from the set position of the speaker to the wall;
The audio signal processing device according to (2), further comprising: a control unit that controls a delay amount in the delay unit based on the distance information obtained by the distance sensor.
(6) a user interface for the user to input information on the reflection amount as information for adjusting the gain in the gain adjustment unit;
The audio signal processing device according to any one of (1) to (5), further including: a control unit that controls gain in the gain adjustment unit based on information on the reflection amount input through the user interface.
(7) The user interface is
A display unit that performs GUI display for the user to input information on the reflection amount;
The audio signal processing device according to (6), further including: a user input unit for a user to input the information on the reflection amount using a GUI display displayed on the display unit.
(8) The audio signal processing device according to any one of (1) to (7), wherein the speaker is attached downward on a back side of a housing of the display device.
(9) Among the reflected sounds that are output from the speaker and reach the listening point, a canceling audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound arrives is designated as an input audio signal. Generating based on:
An audio signal processing method comprising: adding the generated cancellation audio signal to the input audio signal to obtain an output audio signal.
(10) a speaker attached downward on the back side of the housing of the display device;
An audio signal processor that processes an input audio signal to obtain an output audio signal;
An audio signal amplifier that amplifies the output audio signal obtained by the audio signal processor and supplies the amplified audio signal to the speaker;
The audio signal processing unit is
Among the reflected sounds that are output from the speaker and reach the listening point, a cancellation audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound arrives is used as the input audio signal. An audio signal generation unit for cancellation generated based on
An audio reproduction device comprising: an audio signal adding unit that adds the cancellation audio signal generated by the cancellation audio signal generation unit to the input audio signal to obtain the output audio signal.
(11) The sound reproduction device according to (10), wherein the display device is a television receiver.

１００，１００Ａ，１００Ｂ・・・音響再生装置
１０１・・・デジタルシグナルプロセッサ
１０２・・・アンプ
１０３・・・スピーカ
１０４，１０４Ｂ・・・制御部
１０５・・・ユーザ入力部
１０６・・・表示部
１０７・・・距離センサ
１１１・・・遅延部
１１２・・・ゲイン調整部
１１３・・・加算部 DESCRIPTION OF SYMBOLS 100,100A, 100B ... Sound reproduction apparatus 101 ... Digital signal processor 102 ... Amplifier 103 ... Speaker 104, 104B ... Control part 105 ... User input part 106 ... Display part 107 ... Distance sensor 111 ... Delay unit 112 ... Gain adjustment unit 113 ... Addition unit

Claims

Based on the input audio signal, a canceling audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound among the reflected sounds that are output from the speaker and reaches the listening point An audio signal generation unit for cancellation to be generated;
An audio signal processing device comprising: an audio signal adding unit that adds the cancellation audio signal generated by the cancellation audio signal generation unit to the input audio signal to obtain an output audio signal.

The cancellation audio signal generation unit delays the input audio signal by a time corresponding to the time difference between the first reflected sound and the second reflected sound that reaches the listening point;
The audio signal processing device according to claim 1, further comprising: a gain adjusting unit that adjusts a gain of the output signal of the delay unit according to a reflection amount to obtain the canceling audio signal.

A user interface for the user to input information about the distance from the speaker installation position to the wall as information for adjusting the amount of delay in the delay unit;
The audio signal processing apparatus according to claim 2, further comprising: a control unit that controls a delay amount in the delay unit based on the distance information input through the user interface.

The above user interface is
A display unit for displaying a GUI for the user to input the distance information;
The audio signal processing apparatus according to claim 3, further comprising: a user input unit for a user to input the distance information using a GUI display displayed on the display unit.

A distance sensor for obtaining information about the distance from the set position of the speaker to the wall;
The audio signal processing device according to claim 2, further comprising: a control unit that controls a delay amount in the delay unit based on the distance information obtained by the distance sensor.

A user interface for the user to input information on the amount of reflection as information for adjusting the gain in the gain adjustment unit;
The audio signal processing apparatus according to claim 2, further comprising: a control unit that controls a gain in the gain adjustment unit based on information on the reflection amount input through the user interface.

The above user interface is
A display unit that performs GUI display for the user to input information on the reflection amount;
The audio signal processing device according to claim 6, further comprising: a user input unit for a user to input information on the reflection amount using a GUI display displayed on the display unit.

The audio signal processing device according to claim 1, wherein the speaker is attached downward on a back side of a housing of the display device.

Based on the input audio signal, a canceling audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound among the reflected sounds that are output from the speaker and reaches the listening point Generating step;
An audio signal processing method comprising: adding the generated cancellation audio signal to the input audio signal to obtain an output audio signal.

A speaker mounted downward on the back side of the housing of the display device;
An audio signal processor that processes an input audio signal to obtain an output audio signal;
An audio signal amplifier that amplifies the output audio signal obtained by the audio signal processor and supplies the amplified audio signal to the speaker;
The audio signal processing unit is
Among the reflected sounds that are output from the speaker and reach the listening point, a cancellation audio signal for canceling a predetermined second reflected sound that arrives after the first reflected sound arrives is used as the input audio signal. An audio signal generation unit for cancellation generated based on
An audio reproduction device comprising: an audio signal adding unit that adds the cancellation audio signal generated by the cancellation audio signal generation unit to the input audio signal to obtain the output audio signal.

The sound reproduction apparatus according to claim 10, wherein the display device is a television receiver.