JP2006279555A

JP2006279555A - Signal regeneration apparatus and method

Info

Publication number: JP2006279555A
Application number: JP2005095713A
Authority: JP
Inventors: Masami Miura; 雅美三浦; Susumu Yabe; 進矢部; Katsuaki Yamashita; 功誠山下; Toshiro Terauchi; 俊郎寺内; Yoichiro Sako; 曜一郎佐古
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2005-03-29
Filing date: 2005-03-29
Publication date: 2006-10-12

Abstract

PROBLEM TO BE SOLVED: To provide a signal reproduction apparatus and method capable of localizing a clear virtual sound image at an optimal position in a reproduced sound field.

SOLUTION: In a signal reproduction apparatus 1, a sound image information extraction processor 24 separates the sound of each sound source contained in audio data. A sound source signal calculation processor 26 calculates a control parameter for moving a sound image located near the listening point to a position at which the listener's distance perception is sensitive, and a control parameter for relocating a remotely located sound image to a farther position. A specific sound source signal is then resynthesized at the virtual sound source position calculated by a virtual sound source position calculation processor 25, which heightens the sense of presence of the virtual sound field.

COPYRIGHT: (C)2007, JPO&INPIT

Description

The present invention relates to a signal reproduction apparatus and a signal reproduction method, and more particularly to a signal reproduction apparatus and method that localize a sound image at a virtual sound source position by means of a multi-channel system and sound field synthesis technology.

With the development of forms of information delivery, such as large-capacity storage media and content distribution by download or streaming over a network, and with advances in playback device performance, video and music content can now be enjoyed at home with high image and sound quality.

In playback devices for video data, music data, and the like, presence and sound quality are qualities that users can judge relatively easily. For example, when a user listens to an orchestral piece, it is preferable that the position of each instrument be clearly perceptible in the virtual sound field, evoking the image of a real orchestra performing before the listener's eyes. Users tend to want more realistic sound reproduction, and how faithfully the sound field at the time of recording is reproduced on playback shapes the user's impression of the device's characteristics, so faithful reproduction has become important.

For example, in two-channel stereo, the balance between the channels of a two-channel stereo signal consisting of an L signal and an R signal is adjusted so that the sound image of the reproduced sound field is localized at an optimal location as a virtual sound image, and the signal is output from two speakers. In two-channel stereo, however, the virtual sound image is not sharp, and its localization position changes when the listening position moves away from the center between the left and right speakers. Therefore, so that a sharper sound image is localized at least at the center between the two left and right speakers, that is, directly in front of the listener, and so that the sound image stays at the center even if the listening position shifts to the left or right, there exist a three-channel stereo system, in which a center speaker is added between the left and right speakers, and a 5.1-channel stereo system, in which rear speakers are further added.

Meanwhile, conventional two-channel stereo signals must also be reproducible by multi-channel stereo devices such as three-channel and 5.1-channel devices. To that end, a technique has been disclosed that converts a two-channel stereo signal into a multi-channel signal by generating a center channel signal and rear L and rear R channel signals from the two-channel stereo signal (see Patent Document 1).

Other techniques for reproducing a more realistic sound field have also been disclosed: a technique that reproduces a 5.1-channel stereo signal by synthesizing point sources and plane waves using sound field synthesis (see Non-Patent Document 1), and a technique that moves a sound image to an arbitrary position using the virtual sound source position control of sound field synthesis (see Non-Patent Document 2). In addition, there is the general harmonic signal analysis technique, which extracts a specific signal from a mixed acoustic signal (see Non-Patent Document 3).

Patent Document 1: Published Japanese Translation of PCT Application No. 2003-516069
Non-Patent Document 1: "Spatial Sound in the Age of Fast Convolution Technologies", ICA2004, pp. I-515 to I-518
Non-Patent Document 2: "Monitoring Distance Effect with Wave Field Synthesis", Proc. of the 6th Int. Conference on Digital Audio Effects (DAFX-03), London, UK, September 8-11, 2003
Non-Patent Document 3: "Separation of Acoustic Signals by General Harmonic Analysis", Proceedings of the Acoustical Society of Japan, pp. 607-608, September 1996

However, although synthesizing a center channel signal from a two-channel stereo signal is simple, it does not yield a sharp sound image. In addition, extracting the sound source signals contained in a conventional two-channel stereo signal with the general harmonic signal analysis technique described in Non-Patent Document 3, and creating an independent signal channel for each sound source, requires a large amount of signal processing. It may also be necessary to iterate: verify the result of the signal processing on the generated channels, change the processing conditions, and run the signal processing again. It has therefore been difficult to perform general harmonic signal analysis on a two-channel stereo signal in real time, resynthesize the detection results into a multi-channel stereo signal, and reproduce that signal in real time.

The present invention has been proposed in view of the conventional circumstances described above, and its object is to provide a signal reproduction apparatus and method capable of localizing a clear virtual sound image at an optimal position in a reproduced sound field.

To achieve the above object, a signal reproduction apparatus according to the present invention is a signal reproduction apparatus that reproduces multi-channel data in which audio signals are multiplexed, and comprises: multi-channel data acquisition means for acquiring the multi-channel data; signal analysis means for extracting sound source information and sound image position information of a specific sound image from the multi-channel data; sound source signal calculation means for calculating a sound source signal from the sound source information; and sound field synthesis means for changing the sound image information of the extracted specific sound image and performing sound field synthesis that places the sound source signal of the changed specific sound image at an arbitrary virtual sound source position. The apparatus thereby localizes a clear virtual sound image at an optimal position in the reproduced sound field.

Here, the signal reproduction apparatus according to the present invention comprises virtual sound source position calculation means for calculating the virtual sound source position of the specific sound image in accordance with the extracted sound source information and sound image position information of the specific sound image. In this case, the sound field synthesis means places the sound source signal calculated by the sound source signal calculation means at the virtual sound source position calculated by the virtual sound source position calculation means.

To achieve the above object, a signal reproduction method according to the present invention is a signal reproduction method for reproducing multi-channel data in which audio signals are multiplexed, and comprises: a multi-channel data acquisition step of acquiring the multi-channel data; a signal analysis step of extracting sound source information and sound image position information of a specific sound image from the multi-channel data; a sound source signal calculation step of calculating a sound source signal from the sound source information; and a sound field synthesis step of changing the sound image information of the extracted specific sound image and performing sound field synthesis that places the sound source signal of the changed specific sound image at an arbitrary virtual sound source position.

According to the signal reproduction apparatus and signal reproduction method of the present invention, sound field synthesis that localizes a clear virtual sound image at an optimal position in the reproduced sound field becomes possible. In addition, because the sound source information and sound image position information of a specific sound image are separated from the multi-channel data in which audio signals are multiplexed and made available separately, the processing load of sound field analysis and sound field synthesis can be reduced.

The signal reproduction apparatus shown as a specific example of the present invention is a playback apparatus that, for music data created in a multi-channel format and containing information on a plurality of sound sources, uses sound source signal extraction, such as general harmonic signal analysis, together with sound field synthesis to extract a specific sound source signal from the audio signal and localize the sound image of that sound source signal at an optimal virtual sound source position.

In the present invention, data in which audio signals are multiplexed means audio data containing a plurality of sound sources, such as music data performed on a plurality of instruments or music data containing both an instrumental performance and a singing voice. It includes audio data as defined by linear PCM (Linear Pulse Code Modulation), Dolby Digital, DTS (Digital Theater Systems), the DVD-Audio format, and the DVD-Video format. However, audio data produced by synthesizing (multiplexing) multi-part data created in advance for each channel (each sound source), such as data conforming to the MIDI (Musical Instrument Digital Interface) standard, is not covered by the present invention or its specific examples.

A signal reproduction apparatus 1, shown as a first specific example of the present invention, is described in detail below with reference to the drawings. The signal reproduction apparatus 1 is a playback apparatus that reproduces content data, including audio data, input from a recording medium.

The signal reproduction apparatus 1 shown in FIG. 1 comprises an optical disc playback unit 11, which rotates an optical disc and reads the recorded data, and a signal separation circuit 12, which separates the read data into compressed audio data, video data, subtitle data, other data, and so on. It also comprises an audio decoder 13, which decodes the compressed audio data; an audio signal processing circuit 14, which reproduces the decoded audio data and performs sound field synthesis according to the position information of the virtual sound source; a subtitle decoder 15, which decodes the compressed subtitle data; and a subtitle reproduction circuit 16, which reproduces the decoded subtitles. It further comprises a video decoder 17, which decodes the compressed video data; a video signal reproduction circuit 18, which reproduces the decoded video data; a subtitle synthesis circuit 19, which combines the subtitles with the picture in synchronization with the video signal; and a video signal output circuit 20, which outputs the subtitled video signal to the outside.

Note that as many signal lines as there are channels are provided for the output from the audio decoder 13 and the corresponding inputs to the sound source signal calculation processing unit 26 and the audio signal reproduction processing unit 23, for the output from the sound field synthesis processing unit 27 and the corresponding input to the multi-channel amplifier 21, and for the input to and output from the audio signal output circuit 22.

Formats such as MPEG1, MPEG2, AVI, WMV, and WMA are applicable to the audio and video data. In the example described below, a DVD (Digital Versatile Disc) is used as the optical disc. In this case, the video data is MPEG2, standardized by MPEG (Moving Picture Experts Group), and the audio data is linear PCM, Dolby Digital, DTS, SDDS, or the like.

The audio signal processing circuit 14 comprises an audio signal reproduction processing unit 23, which reproduces the audio data decoded by the audio decoder 13; a sound image information extraction processing unit 24, which extracts sound image information from the playback signal reproduced by the optical disc playback unit 11; a virtual sound source position calculation processing unit 25, which calculates virtual sound source positions; a sound source signal calculation processing unit 26, which calculates sound source signals from the sound image information; and a sound field synthesis processing unit 27, which synthesizes a sound field based on the sound source signals and the virtual sound source positions.

The audio signal reproduction processing unit 23 reproduces the audio data decoded by the audio decoder 13 and sends it to the multi-channel amplifier 21.

The sound image information extraction processing unit 24 extracts sound image information and sound image position information from the data read by the optical disc playback unit 11, and sends the extracted information to the virtual sound source position calculation processing unit 25 and the sound source signal calculation processing unit 26. Examples of sound source information include the instruments and voices that make up the audio data. Using general harmonic signal analysis, the sound image information extraction processing unit 24 extracts, from an audio waveform in which sounds from several kinds of sound sources are mixed, the frequency components whose temporal variation is correlated with that of a main sound source. For example, it can separate the frequency components that make up a target female vocal signal, or the frequency components of a sound source signal to be localized at the center position.

The virtual sound source position calculation processing unit 25 calculates the sound source position at which the sound field is to be synthesized, in accordance with the sound image position information extracted by the sound image information extraction processing unit 24. Here, the sound source position at which the sound field is to be synthesized is a virtual sound source position that takes into account the accuracy of the downstream sound field synthesis processing unit 27 and the speaker arrangement. For example, for a sound source signal whose sound image is localized at the center position, the virtual sound source position calculation processing unit 25 calculates or changes the virtual sound source distance from the listening point to this virtual sound source according to the accuracy of the sound field synthesis processing unit 27 and the speaker arrangement.

The sound source signal calculation processing unit 26 gives the audio data that the sound image information extraction processing unit 24 has separated per sound source by general harmonic signal analysis the control parameters for localizing the sound image of a given sound source at the sound source position calculated by the virtual sound source position calculation processing unit 25. FIGS. 2 to 5 show waveforms of frequency components separated from the audio data.

Based on the analysis results of the sound image information extraction processing unit 24 and the virtual sound source position calculation processing unit 25, the sound source signal calculation processing unit 26 separates, from audio data such as two-channel or multi-channel audio data, the sound source signal whose sound image is localized at a specific position and the sound source signals that differ from part to part. For example, it calculates the female vocal sound source signal from the frequency components of the target female vocal signal that the sound image information extraction processing unit 24 separated by general harmonic signal analysis. It can likewise separate the frequency components of a sound source signal to be localized at the center position. The sound source signal calculation processing unit 26 sends the calculated sound source signals to the sound field synthesis processing unit 27.

The sound source signal calculation processing unit 26 also calculates control parameters for changing the localization position of the separated frequency components in the spread (horizontal) direction or in the depth direction.

As an example of changing a control parameter that relocates a sound image in the spread (horizontal) direction, consider the direction of the virtual sound source as seen from the observation point. If the sound field synthesis processing unit 27, described later, can synthesize a sound field only to an accuracy of about 10° steps, the sound source signal calculation processing unit 26 reconverts a parameter that would change the sound image position in 8° steps into one that uses 10° steps. Likewise, if constraints on the speaker arrangement allow sound images to be placed only within a 40° range in the lateral direction, a parameter that would place the sound image at 50° is reconverted to 40°.
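
The azimuth adjustment described above can be sketched as a snap-then-clamp step. This is only an illustrative sketch, not the patented implementation: the function name `fit_azimuth`, the default values, and the reading of the 40° limit as a symmetric ±40° range about the front direction are assumptions made for the example.

```python
import math


def fit_azimuth(requested_deg, step_deg=10.0, max_abs_deg=40.0):
    """Fit a requested sound image azimuth to what the renderer can produce.

    step_deg    : assumed angular resolution of the sound field synthesis stage
    max_abs_deg : assumed lateral limit imposed by the speaker arrangement
                  (interpreted here as +/- max_abs_deg about the front)
    """
    # Snap to the nearest multiple of the renderer's step (ties round upward).
    steps = math.floor(requested_deg / step_deg + 0.5)
    snapped = steps * step_deg
    # Clamp to the range the speaker arrangement can reach.
    return max(-max_abs_deg, min(max_abs_deg, snapped))


if __name__ == "__main__":
    for angle in (8, 16, 50, -8):
        print(angle, "->", fit_azimuth(angle))
```

With these assumed defaults, a request for 8° is moved to the 10° grid point and a request for 50° is limited to 40°, matching the two cases in the text.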

The same applies to sound image position distance data, that is, the distance between the observation point and the virtual sound source. It is generally known that the accuracy of human distance perception declines for sound sources more than 3 meters away. As an example of changing a control parameter that relocates a sound image in the depth direction, the sound source signal calculation processing unit 26 therefore converts a parameter that would produce a sound image distance of about 5 meters into one that localizes the sound image at about 3 meters, and reconverts a parameter that would produce a sound image distance of about 8 meters into a sound source distance of 10 meters.
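
The depth adjustment can be sketched in the same spirit. The text gives only two data points (about 5 m becomes about 3 m, about 8 m becomes 10 m) and the 3 m perception limit; the decision boundary between "pull closer" and "push farther" is not stated, so the `split_m` value below is purely hypothetical, as are the function and parameter names.

```python
def remap_distance(distance_m, sensitive_limit_m=3.0, split_m=6.5, far_m=10.0):
    """Remap a sound image distance so that depth differences stay audible.

    sensitive_limit_m : distance up to which distance perception is accurate
                        (3 m in the text)
    split_m           : hypothetical boundary between 'near' and 'far' images;
                        not given in the source
    far_m             : distance to which far images are pushed (10 m in the text)
    """
    if distance_m <= sensitive_limit_m:
        # Already inside the range where listeners judge distance well.
        return distance_m
    if distance_m < split_m:
        # Moderately distant image: pull it in to the edge of the sensitive range.
        return sensitive_limit_m
    # Distant image: push it out so that it reads clearly as far away.
    return max(distance_m, far_m)


if __name__ == "__main__":
    for d in (2.0, 5.0, 8.0, 12.0):
        print(d, "->", remap_distance(d))
```

The effect is to widen the gap between near and far images, which is one plausible reading of why the text moves 5 m inward and 8 m outward.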

The sound source signal calculation processing unit 26 can also create a new sound source signal based on a separated sound source. For example, it calculates parameters for localizing the sound source signal of a specific sound source separated by the sound image information extraction processing unit 24 at a different position, or control parameters for producing a sound source signal whose frequency differs slightly from that of the specific sound source. This makes it possible to generate the frequency components of another sound source that plays in unison with a given sound source, or that harmonizes with it.
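
One way to picture the derived sound source is as a copy of the separated partial frequencies shifted by a musical interval. The sketch below assumes equal-tempered intervals expressed in cents; the source does not say how the frequency offset is chosen, and the names `shift_frequency` and `derive_voice` are hypothetical.

```python
def shift_frequency(freq_hz, cents):
    """Shift a frequency by an interval in cents (1200 cents = one octave)."""
    return freq_hz * 2.0 ** (cents / 1200.0)


def derive_voice(partials_hz, cents):
    """Derive the partials of a new voice from those of a separated source.

    A few cents of detuning gives a unison-like doubling; a larger interval
    (for example 400 cents, an equal-tempered major third) gives a harmony part.
    """
    return [shift_frequency(f, cents) for f in partials_hz]


if __name__ == "__main__":
    vocal_partials = [220.0, 440.0, 660.0]  # made-up example values
    print(derive_voice(vocal_partials, 8))    # slightly detuned unison
    print(derive_voice(vocal_partials, 400))  # harmony a major third above
```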

The sound field synthesis processing unit 27 performs sound field synthesis that places the sound source signal calculated by the sound source signal calculation processing unit 26 at the virtual sound source position calculated by the virtual sound source position calculation processing unit 25. In doing so, it relocates the extracted sound source signal to the virtual sound source position using one or a combination of pitch change, timing change, and an envelope generator. The sound field synthesis processing unit 27 also reproduces the intensity stereo signal from which the center-localized sound source signal has been removed as ordinary intensity stereo, and performs sound field synthesis that places the separated sound source signal back at the center position.

When a planar array speaker is used with the audio signal output circuit 22 as an example of the audio signal output means, the sound field synthesis processing unit 27 calculates the digital filter coefficients for driving the array speaker so that it outputs a sound wave spreading concentrically from the center position, and convolves them with the target vocal signal.
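
As a rough illustration of array driving filters, the sketch below uses a plain delay-and-attenuate model of a virtual point source: each speaker gets a single-tap FIR filter whose delay and gain follow its distance from the virtual source. This is a simplification chosen for the example, not the filter design used in the patent; the speaker layout, the speed of sound, and all function names are assumptions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def point_source_filters(speaker_xy, source_xy, sample_rate):
    """Return one single-tap FIR filter per speaker for a virtual point source.

    Each speaker is delayed by its extra path length from the source and
    attenuated by 1/r, both relative to the nearest speaker.
    """
    dists = [math.dist(p, source_xy) for p in speaker_xy]
    r_min = min(dists)
    filters = []
    for r in dists:
        delay = round((r - r_min) / SPEED_OF_SOUND * sample_rate)
        taps = [0.0] * (delay + 1)
        taps[delay] = r_min / r  # relative 1/r spreading loss
        filters.append(taps)
    return filters


def convolve(signal, taps):
    """Direct-form convolution of a signal with FIR taps."""
    out = [0.0] * (len(signal) + len(taps) - 1)
    for i, s in enumerate(signal):
        for j, t in enumerate(taps):
            out[i + j] += s * t
    return out


if __name__ == "__main__":
    speakers = [(-1.0, 0.0), (0.0, 0.0), (1.0, 0.0)]  # hypothetical 3-element row
    source = (0.0, -1.0)                              # 1 m behind the center speaker
    filters = point_source_filters(speakers, source, 48000)
    vocal = [1.0, 0.5, 0.25]                          # stand-in for the vocal signal
    feeds = [convolve(vocal, taps) for taps in filters]
    print([len(f) for f in feeds])
```

The center speaker fires first and loudest and the outer speakers follow later and more quietly, which approximates a wavefront spreading out from the center.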

FIGS. 2 to 5 show the frequency components separated by the sound image information extraction processing unit 24. In each figure, the horizontal axis represents time and the vertical axis represents signal level. Because the signal levels of frequency component f1 (FIG. 2) and frequency component f2 (FIG. 3) vary over time in a strongly correlated way, the sound field synthesis processing unit 27 localizes f1 and f2 as the same sound source. Likewise, it localizes frequency component f3 (FIG. 4) and frequency component f4 (FIG. 5) as the same sound source.
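
The grouping of frequency components by correlated level variation can be sketched with a Pearson correlation between level envelopes. The correlation measure, the 0.9 threshold, the greedy grouping rule, and the envelope values are all assumptions for illustration; the source states only that strongly correlated components are treated as one sound source.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant envelope carries no variation to correlate
    return cov / math.sqrt(var_a * var_b)


def group_components(envelopes, threshold=0.9):
    """Greedily group components whose envelopes correlate with a group's first member."""
    groups = []
    for name, env in envelopes.items():
        for group in groups:
            if pearson(envelopes[group[0]], env) >= threshold:
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


if __name__ == "__main__":
    # Made-up level envelopes over time for four frequency components.
    envelopes = {
        "f1": [0, 1, 2, 1, 0, 0],
        "f2": [0, 2, 4, 2, 0, 0],
        "f3": [1, 0, 0, 0, 1, 2],
        "f4": [2, 0, 0, 0, 2, 4],
    }
    print(group_components(envelopes))
```

With these example envelopes, f1 and f2 end up in one group and f3 and f4 in another, mirroring the pairing described for FIGS. 2 to 5.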

For the frequency components f1 to f4 shown in FIGS. 2 to 5, which were separated from the original audio signal by general harmonic signal analysis, the sound source signal calculation processing unit 26 treats components whose temporal variations are strongly correlated as the same sound source and gives them control parameters for localizing them at the same position in the reproduction space. The sound field synthesis processing unit 27 superimposes the given control parameters on the extracted frequency components, for example f1 and f2, and resynthesizes them. The extracted frequency components are thus given the parameters for localizing them as the same sound source, and are resynthesized as the frequency components of a given instrument.

The multi-channel amplifier 21 that follows amplifies the multi-channel signal for the array speaker, and also amplifies the sound source signals of the audio signal that were not resynthesized, for reproduction in two channels or 5.1 channels. For example, when the vocal signal has been separated for sound field resynthesis as described above, the multi-channel amplifier 21 amplifies the vocal signal in multiple channels for the array speaker, and amplifies the audio signal remaining after the vocal component was separated in two channels, an L channel signal and an R channel signal.

The operation by which the signal reproduction apparatus 1 configured as described above reproduces content data read from an optical disc is described next.

The optical disc playback unit 11 rotates the optical disc and reads the data recorded on the DVD. The signal separation circuit 12 separates the data read by the optical disc playback unit 11 into compressed audio data, compressed video data, subtitle data, other data, and so on. The compressed audio data is decoded by the audio decoder 13 and then sent to the audio signal reproduction processing unit 23 of the audio signal processing circuit 14. The audio signal reproduction processing unit 23 reproduces the decoded audio data and sends it to the multi-channel amplifier 21.

The data read by the optical disc playback unit 11 is also sent to the sound image information extraction processing unit 24 of the audio signal processing circuit 14. Using general harmonic signal analysis, the sound image information extraction processing unit 24 extracts sound image information and sound image position information for each sound source from the playback signal, and sends them to the virtual sound source position calculation processing unit 25 and the sound source signal calculation processing unit 26. The virtual sound source position calculation processing unit 25 calculates the sound source position at which the sound field is to be synthesized, in accordance with the extracted sound image position information. The sound source signal calculation processing unit 26 gives the audio data separated per sound source by the sound image information extraction processing unit 24 the control parameters for localizing the sound image of a given sound source at the sound source position calculated by the virtual sound source position calculation processing unit 25.

Based on the sound source signal and control parameters calculated by the sound source signal calculation processing unit 26, the sound field synthesis processing unit 27 performs sound field synthesis that places the sound source at the virtual sound source position calculated by the virtual sound source position calculation processing unit 25. In doing so, the sound field synthesis processing unit 27 relocates the extracted sound source signal to the virtual sound source position by one of, or a combination of, pitch change, timing change, and an envelope generator. The audio signal whose sound field has been re-synthesized by the sound field synthesis processing unit 27 is sent to the multi-channel amplifier 21. In the multi-channel amplifier 21, when a specific audio signal such as a vocal signal component has been separated, the separated vocal signal is amplified over multiple channels, and the audio signal remaining after the vocal signal component has been separated is amplified over two channels, as an L-channel signal and an R-channel signal.
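The three operations named above (pitch change, timing change, envelope generator) can be pictured with the following minimal sketch. It is an illustration under stated assumptions, not the processing actually used by the sound field synthesis processing unit 27: the function names, the linear fade shape, and the naive resampling used for the pitch change are all hypothetical choices made for this example.

```python
def apply_timing(samples, delay_samples):
    """Timing change: delay the signal by prepending zeros (output is longer)."""
    return [0.0] * delay_samples + list(samples)


def apply_envelope(samples, attack, release):
    """Envelope generator: linear fade-in over `attack` samples and
    linear fade-out over `release` samples (hypothetical envelope shape)."""
    n = len(samples)
    out = []
    for i, s in enumerate(samples):
        gain = 1.0
        if attack > 0 and i < attack:
            gain = min(gain, i / attack)
        if release > 0 and i >= n - release:
            gain = min(gain, (n - 1 - i) / release)
        out.append(s * gain)
    return out


def change_pitch(samples, ratio):
    """Pitch change by naive resampling with linear interpolation.
    ratio > 1 raises the pitch and shortens the signal; this is a
    stand-in for whatever pitch-change method a real device would use."""
    n_out = int(len(samples) / ratio)
    out = []
    for k in range(n_out):
        pos = k * ratio
        i = int(pos)
        frac = pos - i
        nxt = samples[i + 1] if i + 1 < len(samples) else samples[i]
        out.append(samples[i] * (1.0 - frac) + nxt * frac)
    return out
```

The operations are kept as separate functions so that, as in the description, one of them or any combination can be applied to an extracted sound source signal.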

Meanwhile, the video data and the subtitle data are sent to the video decoder 17 and the subtitle decoder 15, respectively, and are reproduced by the subsequent video signal reproduction circuit 18 and subtitle reproduction circuit 16. In the subtitle synthesis circuit 19, the subtitles are combined with the picture in synchronization with the video signal. The audio signal with the re-synthesized sound field is output from the audio signal output circuit 22, and the video signal with the combined subtitles is output from the video signal output circuit 20; the two are synchronized with each other and output to an external speaker system, display device, and the like.

Thus, in the signal reproduction device 1, the sound image information extraction processing unit 24 separates and calculates the musical sound of each sound source contained in the audio data. The sound source signal calculation processing unit 26 calculates parameters that move a sound source near the listening point to a position where the listener's distance perception is sensitive, and that relocate a sound image localized far away to a still more distant position. A specific sound source signal is then re-synthesized at the virtual sound source position calculated by the virtual sound source position calculation processing unit 25, and the reproduced virtual sound field is reconstructed, which heightens the viewer's sense of presence. The signal reproduction device 1 described above realizes the basic configuration of the present invention.
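The relocation rule described above (near sources moved to a distance at which distance perception is sensitive, far sources pushed farther away) can be sketched as follows. The description gives no numerical values, so every threshold and scale factor here (`near_limit`, `sensitive`, `far_limit`, `far_scale`) is a hypothetical placeholder, as is the choice to keep the direction of the source unchanged with the listening point at the origin.

```python
import math


def relocate(pos, near_limit=1.5, sensitive=1.0, far_limit=5.0, far_scale=1.5):
    """Return a new (x, y) position for a sound source, listener at the origin.

    All numeric defaults are illustrative assumptions, in metres:
    - sources closer than `near_limit` are placed at distance `sensitive`
    - sources farther than `far_limit` are moved `far_scale` times farther
    - sources in between are left where they are
    """
    x, y = pos
    d = math.hypot(x, y)
    if d == 0.0:
        return pos  # direction is undefined at the listening point itself
    if d <= near_limit:
        new_d = sensitive
    elif d >= far_limit:
        new_d = d * far_scale
    else:
        new_d = d
    k = new_d / d  # scale along the same direction
    return (x * k, y * k)
```

For example, under these assumed defaults a source at (0.0, 0.5) is moved to a distance of 1.0 in the same direction, while a source at (6.0, 0.0) is moved out to a distance of 9.0.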

Other application examples of the present invention are described below. FIG. 6 shows a signal reproduction device 2 as a second specific example of the present invention. The signal reproduction device 2 shown in FIG. 6 is characterized in that the sound source information and sound image position information for each sound image are prepared as metadata for each content item, such as a piece of music or a movie, independently of the multi-channel audio data. To associate it with the content, this metadata is stored in advance in a predetermined area in correspondence with an identification code that identifies the content. Here, the identification code is information unique to each content item, generated from part of the content data, such as the audio data, using a predetermined rule.

To this end, the signal reproduction device 2 comprises an identification code generation unit 28 and a search processing unit 29. The identification code generation unit 28 generates the identification code needed to read out the sound source information and sound image position information, which are separated from the audio data and stored in a predetermined area for each content item, by the same procedure that was used when the identification code was stored on the optical disc. The search processing unit 29 searches the optical disc for the metadata corresponding to the generated identification code. In the signal reproduction device 2 shown in FIG. 6, components having the same functions as those of the signal reproduction device 1 shown in FIG. 1 are given the same reference numerals, and their detailed description is omitted.

The identification code generation unit 28 generates, from part of the multi-channel data, the identification code needed to read out the sound source information and sound image position information that are separated from the audio data and stored as metadata in a predetermined area. Examples of the identification code include data recorded in the TOC, such as the number of tracks and the playing time of each track, combinations of such data, and data obtained by applying a predetermined encoding to the multi-channel data itself.
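As a rough illustration of generating an identification code from TOC data, the sketch below combines the number of tracks and the playing time of each track into a string and hashes it. The description only says that a predetermined rule is applied; the use of SHA-1, the string layout, the truncation to 16 hexadecimal characters, and the function name `make_identification_code` are assumptions made for this example.

```python
import hashlib


def make_identification_code(track_durations_s):
    """Derive a content identifier from TOC-like data.

    `track_durations_s` is a list of per-track playing times in seconds.
    The rule used here (track count + durations, hashed with SHA-1) is a
    hypothetical stand-in for the 'predetermined rule' in the description.
    """
    count = len(track_durations_s)
    payload = str(count) + ":" + ",".join(str(int(t)) for t in track_durations_s)
    return hashlib.sha1(payload.encode("ascii")).hexdigest()[:16]
```

Because the same rule is applied both when the metadata is stored and when the disc is played back, the same TOC data always yields the same code, which is what allows the code to serve as a lookup key.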

The search processing unit 29 searches the multiplexed data for the identification code of the piece of music to be played, in accordance with the content-specifying identification code generated by the identification code generation unit 28.

The operation by which the signal reproduction device 2 having the above configuration reproduces content data read from an optical disc is described next.

The data read by the optical disc playback unit 11 is separated by the signal separation circuit 12 into compressed audio data, compressed video data, subtitle data, and other data. The compressed audio data is decoded by the audio decoder 13 and then sent to the audio signal reproduction processing unit 23 of the audio signal processing circuit 14. The audio signal reproduction processing unit 23 reproduces the decoded audio data and sends it to the multi-channel amplifier 21.

The identification code generation unit 28 of the audio signal processing circuit 14 generates an identification code from the data read by the optical disc playback unit 11, using the same rule as when the code was created. The search processing unit 29 then searches the multiplexed data for the sound source information and sound image position information of the piece of music to be played, in accordance with the generated identification code.
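The search step can be pictured as a keyed lookup over the metadata records carried in the multiplexed data. The sketch below assumes, purely for illustration, that each record is a dictionary with an `id_code` field and a `sources` field holding per-source position information; the actual record layout is not given in the description.

```python
def find_metadata(records, id_code):
    """Return the per-source metadata whose identification code matches,
    or None when the multiplexed data holds no entry for this content.
    The record layout ('id_code', 'sources') is a hypothetical example."""
    for record in records:
        if record["id_code"] == id_code:
            return record["sources"]
    return None


# Example records: sound source information and sound image positions per content.
records = [
    {"id_code": "a1b2", "sources": [{"name": "vocal", "position": (0.0, 2.0)}]},
    {"id_code": "c3d4", "sources": [{"name": "piano", "position": (-1.5, 3.0)}]},
]
```

With such a lookup, the metadata is obtained by comparison alone, without running a per-source analysis of the audio signal.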

In the signal reproduction device 2, the sound image information extraction processing unit 24 extracts sound image information and sound image position information for each sound source from the metadata found by the search processing unit 29, and sends the extracted information to the virtual sound source position calculation processing unit 25 and the sound source signal calculation processing unit 26. The virtual sound source position calculation processing unit 25 calculates, from the sound image position information extracted by the sound image information extraction processing unit 24, the sound source position at which the sound field is to be synthesized. The sound source signal calculation processing unit 26 assigns, to the audio data separated for each sound source by the sound image information extraction processing unit 24, control parameters for localizing the sound image of a given sound source at the sound source position calculated by the virtual sound source position calculation processing unit 25.

Based on the sound source signal and control parameters calculated by the sound source signal calculation processing unit 26, the sound field synthesis processing unit 27 performs sound field synthesis that places the sound source at the virtual sound source position calculated by the virtual sound source position calculation processing unit 25. In doing so, the sound field synthesis processing unit 27 relocates the extracted sound source signal to the virtual sound source position by one of, or a combination of, pitch change, timing change, and an envelope generator. The audio signal whose sound field has been re-synthesized by the sound field synthesis processing unit 27 is sent to the multi-channel amplifier 21, where the separated specific sound source signal is amplified over multiple channels and the audio signal remaining after the specific sound source signal component has been separated is amplified over two channels. In the subtitle synthesis circuit 19, the subtitles are combined with the picture in synchronization with the video signal. The audio signal with the re-synthesized sound field is output from the audio signal output circuit 22, and the video signal with the combined subtitles is output from the video signal output circuit 20; the two are synchronized with each other and output to an external speaker system, display device, and the like.

As described above, in the signal reproduction device 2 the sound source information and sound image position information for each sound image are prepared as metadata independent of the multi-channel audio data, and the metadata is stored in advance in a predetermined area in correspondence with the identification code. This reduces the amount of computation required for sound source extraction processing such as general harmonic signal analysis. In addition, the sound image information extraction processing unit 24 separates the signal of each sound source, and the sound source signal calculation processing unit 26 calculates parameters that move a sound source near the listening point to a position where the listener's distance perception is sensitive and that relocate a sound image localized far away to a still more distant position. A specific sound source signal is then re-synthesized at the virtual sound source position calculated by the virtual sound source position calculation processing unit 25, and the reproduced virtual sound field is reconstructed, which heightens the viewer's sense of presence.

Next, FIG. 7 shows a signal reproduction device 3 as a third specific example of the present invention. In FIG. 7, components having the same functions as those of the signal reproduction devices shown in FIGS. 1 and 6 are given the same reference numerals, and their detailed description is omitted. The signal reproduction device 3 shown in FIG. 7 covers the case where the multi-channel audio data described above is delivered over a network rather than provided as so-called packaged media such as an optical disc. Furthermore, in the signal reproduction device 3, the sound source information and sound image position information for each sound image are prepared as metadata independently of the multi-channel audio data, and the multi-channel data and the metadata are mixed and provided over the network.

To this end, in place of the optical disc playback unit 11, the signal reproduction device 3 comprises a network interface (hereinafter, network I/F) 31 that connects to a network such as a wireless or wired local area network, an original network, or the Internet, and a reception buffer 32 that temporarily stores content data, such as audio data, sent over the network. General-purpose protocols such as TCP/IP can be used as the network communication protocol.

The signal reproduction device 3 also comprises an identification code generation unit 33 and a search processing unit 34. The identification code generation unit 33 generates the identification code needed to read out the metadata of the sound source information and sound image position information created for each content item, by the same procedure as that used when the identification code of the content is transmitted. The search processing unit 34 searches the data received from the network for the metadata corresponding to the generated identification code.

In the signal reproduction device 3 shown in FIG. 7, the audio data transmitted over the network may be stream data that permits real-time reproduction, or data transferred in bulk, as in a so-called download. The signal reproduction device 3 may also include an identification code input unit 35 so that the user can enter the identification code directly.

The operation by which the signal reproduction device 3 having the above configuration reproduces content data received over the network is described next.

The data received by the network I/F 31 is temporarily stored in the reception buffer 32 and is separated by the signal separation circuit 12 into compressed audio data, compressed video data, subtitle data, and other data. The compressed audio data is decoded by the audio decoder 13 and then sent to the audio signal reproduction processing unit 23 of the audio signal processing circuit 14. The audio signal reproduction processing unit 23 reproduces the decoded audio data and sends it to the multi-channel amplifier 21.
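The buffering and separation steps can be modelled as a bounded first-in, first-out queue followed by routing on a stream type. The sketch below is only a conceptual model under assumptions: the class `ReceiveBuffer`, the function `separate`, the `(kind, payload)` packet shape, and the stream labels are hypothetical and do not reflect any particular container or transport format.

```python
from collections import deque


class ReceiveBuffer:
    """Bounded FIFO that temporarily holds received packets."""

    def __init__(self, capacity):
        self._queue = deque()
        self._capacity = capacity

    def push(self, packet):
        """Store a packet; return False if the buffer is full."""
        if len(self._queue) >= self._capacity:
            return False
        self._queue.append(packet)
        return True

    def pop(self):
        """Return the oldest packet, or None when the buffer is empty."""
        return self._queue.popleft() if self._queue else None


def separate(buffer):
    """Drain the buffer and route each (kind, payload) packet by stream type.
    Unknown kinds are collected under 'other'."""
    streams = {"audio": [], "video": [], "subtitle": [], "other": []}
    while True:
        packet = buffer.pop()
        if packet is None:
            break
        kind, payload = packet
        streams.get(kind, streams["other"]).append(payload)
    return streams
```

A bounded queue is used here because a playback device has finite memory; how a real device reacts to a full buffer (dropping, flow control) is outside the scope of this sketch.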

The identification code generation unit 33 of the audio signal processing circuit 14 generates an identification code from the data received by the network I/F 31, using the same rule as when the code was created. The search processing unit 34 then searches the multiplexed data for the metadata of the piece of music to be played, in accordance with the generated identification code.

In the signal reproduction device 3, the sound image information extraction processing unit 24 extracts sound image information and sound image position information for each sound source from the metadata found by the search processing unit 34, and sends the extracted information to the virtual sound source position calculation processing unit 25 and the sound source signal calculation processing unit 26. The virtual sound source position calculation processing unit 25 calculates, from the sound image position information extracted by the sound image information extraction processing unit 24, the sound source position at which the sound field is to be synthesized. The sound source signal calculation processing unit 26 assigns, to the audio data separated for each sound source by the sound image information extraction processing unit 24, control parameters for localizing the sound image of a given sound source at the sound source position calculated by the virtual sound source position calculation processing unit 25.

Based on the sound source signal and control parameters calculated by the sound source signal calculation processing unit 26, the sound field synthesis processing unit 27 performs sound field synthesis that relocates the extracted sound source signal to the virtual sound source position. The audio signal whose sound field has been re-synthesized by the sound field synthesis processing unit 27 is sent to the multi-channel amplifier 21, where the separated specific sound source signal is amplified over multiple channels and the audio signal remaining after the specific sound source signal component has been separated is amplified over two channels. In the subtitle synthesis circuit 19, the subtitles are combined with the picture in synchronization with the video signal. The audio signal with the re-synthesized sound field is output from the audio signal output circuit 22, and the video signal with the combined subtitles is output from the video signal output circuit 20; the two are synchronized with each other and output to an external speaker system, display device, and the like.

As described above, in the signal reproduction device 3 the sound source information and sound image position information for each sound image are prepared as metadata independent of the multi-channel audio data, and the metadata is stored in advance in a predetermined area in correspondence with the identification code. This reduces the amount of computation required for sound source extraction processing such as general harmonic signal analysis. As a result, the signal reproduction device 3 can carry out, in real time and even for content data such as audio data transmitted over a network, the signal processing that separates the signal of each sound source, calculates parameters that move a sound source near the listening point to a position where the listener's distance perception is sensitive and relocate a sound image localized far away to a still more distant position, re-synthesizes a specific sound source signal at the virtual sound source position, and reconstructs the reproduced virtual sound field.

Next, FIG. 8 shows a signal reproduction device 4 as a fourth specific example of the present invention. The signal reproduction device 4 shown in FIG. 8 is the same as the preceding examples in that the sound source information and sound image position information for each sound image are prepared as metadata independently of the multi-channel audio data. It is characterized in that the audio data is supplied from an optical disc, while the sound source information and sound image position information are sent to the signal reproduction device 4 over a network.

To this end, the signal reproduction device 4 comprises an identification code generation unit 36, a network interface (hereinafter, network I/F) 37, and a search processing unit 38. The identification code generation unit 36 generates the identification code needed to read out the metadata of the sound source information and sound image position information created for each content item, by the same procedure as that used when the identification code of the content is transmitted. The network I/F 37 connects to a network such as a wireless or wired local area network, an original network, or the Internet. The search processing unit 38 searches the data received from the network for the metadata corresponding to the identification code generated by the identification code generation unit 36. Although not shown in FIG. 8, the signal reproduction device 4 also includes a reception buffer that temporarily stores content data, such as audio data, sent over the network. General-purpose protocols such as TCP/IP can be used as the network communication protocol. In the signal reproduction device 4 shown in FIG. 8, components having the same functions as those of the signal reproduction devices shown in FIGS. 1, 6, and 7 are given the same reference numerals, and their detailed description is omitted.

The operation by which the signal reproduction device 4 having the above configuration reproduces content data read from an optical disc is described next.

The data read by the optical disc playback unit 11 is separated by the signal separation circuit 12 into compressed audio data, compressed video data, subtitle data, and other data. The compressed audio data is decoded by the audio decoder 13 and then sent to the audio signal reproduction processing unit 23 of the audio signal processing circuit 14. The audio signal reproduction processing unit 23 reproduces the decoded audio data and sends it to the multi-channel amplifier 21.

The identification code generation unit 36 of the audio signal processing circuit 14 sends the identification code received by the network I/F 37 to the search processing unit 38. The search processing unit 38 searches the multiplexed data for the metadata of the piece of music to be played, in accordance with the generated identification code.


In the signal reproduction device 4, the sound image information extraction processing unit 24 extracts sound image information and sound image position information for each sound source from the metadata found by the search processing unit 38, and sends the extracted information to the virtual sound source position calculation processing unit 25 and the sound source signal calculation processing unit 26. The virtual sound source position calculation processing unit 25 calculates, from the sound image position information extracted by the sound image information extraction processing unit 24, the sound source position at which the sound field is to be synthesized. The sound source signal calculation processing unit 26 assigns, to the audio data separated for each sound source by the sound image information extraction processing unit 24, control parameters for localizing the sound image of a given sound source at the sound source position calculated by the virtual sound source position calculation processing unit 25.

As described above, in the signal reproduction device 4 the sound source information and sound image position information for each sound image are prepared as metadata independent of the multi-channel audio data and are received from the network, while the audio data is provided on an optical disc. This reduces the amount of computation required for sound source extraction processing such as general harmonic signal analysis. As a result, the signal reproduction device 4 can carry out, in real time and even for content data such as audio data transmitted over a network, the signal processing that separates the signal of each sound source, calculates parameters that move a sound source near the listening point to a position where the listener's distance perception is sensitive and relocate a sound image localized far away to a still more distant position, re-synthesizes a specific sound source signal at the virtual sound source position, and reconstructs the reproduced virtual sound field.

In addition, with the signal reproduction device 4, a provider of content data such as audio data can, for example, offer on a Web page metadata describing the sound source information and sound image position information for content released in the past on CD, DVD, and the like. The signal processing that reconstructs the virtual sound field can then be applied to such legacy content as well, heightening the viewer's sense of presence.

A specific example of a speaker system that can be used as the output destination of the multi-channel amplifier 21 and the audio signal output circuit 22 of the signal reproduction devices shown in FIGS. 1, 6, 7, and 8 is described next.

The speaker system 50 shown in FIG. 9 comprises a multi-channel acoustic amplifier circuit 51 for wavefront reproduction, a multi-channel acoustic amplifier circuit 52 for 5.1 channels, wavefront reproduction speakers 531, 532, ..., 53n, and a 5.1-channel speaker system made up of six speakers: a front speaker 541 facing the listener, a right front speaker 542, a left front speaker 543, a right rear speaker 544, a left rear speaker 545, and a subwoofer speaker 546 for low-frequency output.

The speaker system 50 is configured to include the multi-channel amplifier 21 and the audio signal output circuit 22 of the signal reproduction devices shown in FIGS. 1, 6, 7, and 8; the multi-channel amplifier 21 corresponds to the multi-channel acoustic amplifier circuit 51 for wavefront reproduction and the acoustic amplifier circuit 52 for 5.1 channels. Accordingly, the sound field synthesis processing unit 27 sends to the amplifier circuits 51 and 52 an audio signal on which sound field synthesis has been performed, based on the sound source signal and control parameters calculated by the sound source signal calculation processing unit 26, in order to relocate the sound source to the virtual sound source position calculated by the virtual sound source position calculation processing unit 25.

The audio data is amplified channel by channel by the multi-channel acoustic amplifier circuit 51 for wavefront reproduction and the acoustic amplifier circuit 52 for 5.1 channels, and is output from the speakers serving as audio output devices. In the case of the 5.1-channel surround system, the sound is output from six speakers: the front, right front, left front, right rear, and left rear speakers relative to the listener, and the subwoofer speaker for low-frequency output. The wavefront reproduction speakers 531 to 53n each output sound with a different wavefront, and wavefront synthesis is performed to localize a sound image at an arbitrary position.
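One simple way to picture how an array of speakers can localize a sound image at a chosen position is a delay-and-gain model: each speaker replays the source signal with a delay proportional to its distance from the virtual source and a gain that falls off with that distance. The sketch below is a simplified illustration of this idea, not the driving function of the device described here; the function name, the 1/distance gain law, and the normalization to the nearest speaker are assumptions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at about 20 degrees C


def driving_parameters(speaker_positions, source_position):
    """Return a (delay_seconds, gain) pair for each speaker.

    Positions are (x, y) tuples in metres. Delays and gains are expressed
    relative to the speaker nearest the virtual source, which gets
    delay 0.0 and gain 1.0. This is a simplified point-source model.
    """
    sx, sy = source_position
    dists = [math.hypot(x - sx, y - sy) for x, y in speaker_positions]
    d_min = min(dists)
    params = []
    for d in dists:
        delay = (d - d_min) / SPEED_OF_SOUND
        gain = d_min / d if d > 0.0 else 1.0
        params.append((delay, gain))
    return params
```

For a line of three speakers at x = -1, 0, 1 and a virtual source 1 m behind the centre speaker, the centre speaker fires first at full level and the two outer speakers fire about 1.2 ms later at a gain of about 0.71, so the combined wavefront appears to spread from the virtual source position.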

For example, by using this planar array speaker 50 in combination with a projection projector 60, the user can obtain a sound field with a greater sense of presence. Image light emitted from the projection projector 60 is projected onto a screen 61 placed in front of the sound wavefront reproduction speakers 53, on the listener side. When the synthesized acoustic signal produced by the sound field synthesis processing unit 27 is output from each speaker of the speaker system arranged behind the screen 61, the sound field re-synthesized by the signal reproduction apparatus described above is constructed.

FIG. 1 is a block diagram illustrating a signal reproduction apparatus shown as a first specific example of the present invention.
FIG. 2 is a waveform diagram showing the intensity over time of the frequency component f1 separated by the sound image information extraction circuit of the signal reproduction apparatus.
FIG. 3 is a waveform diagram showing the intensity over time of the frequency component f2 separated by the sound image information extraction circuit of the signal reproduction apparatus.
FIG. 4 is a waveform diagram showing the intensity over time of the frequency component f3 separated by the sound image information extraction circuit of the signal reproduction apparatus.
FIG. 5 is a waveform diagram showing the intensity over time of the frequency component f4 separated by the sound image information extraction circuit of the signal reproduction apparatus.
FIG. 6 is a block diagram illustrating a signal reproduction apparatus shown as a second specific example of the present invention.
FIG. 7 is a block diagram illustrating a signal reproduction apparatus shown as a third specific example of the present invention.
FIG. 8 is a block diagram illustrating a signal reproduction apparatus shown as a fourth specific example of the present invention.
FIG. 9 is a block diagram illustrating an example of a speaker system capable of reproducing the acoustic signal synthesized by the signal reproduction apparatus shown as a specific example of the present invention.

Explanation of symbols

1, 2, 3, 4: signal reproduction apparatus; 11: optical disc playback unit; 12: signal separation circuit; 13: audio decoder; 14: audio signal processing circuit; 15: subtitle decoder; 16: subtitle playback circuit; 17: video decoder; 18: video signal playback circuit; 19: subtitle synthesis circuit; 20: video signal output circuit; 21: multi-channel amplifier; 22: audio signal output circuit; 23: audio signal reproduction processing unit; 24: sound image information extraction processing unit; 25: virtual sound source position calculation processing unit; 26: sound source signal calculation processing unit; 27: sound field synthesis processing unit; 28, 33, 36: identification code generation unit; 29, 34, 38: search processing unit; 31, 37: network I/F; 32: reception buffer; 35: identification code input unit

Claims

A signal reproduction apparatus for reproducing multi-channel data in which audio signals are multiplexed, comprising:
multi-channel data acquisition means for acquiring the multi-channel data;
signal analysis means for extracting sound source information and sound image position information of a specific sound image from the multi-channel data;
sound source signal calculation means for calculating a sound source signal from the sound source information; and
sound field synthesis means for performing sound field synthesis in which the sound source information of the extracted specific sound image is changed and the sound source signal of the changed specific sound image is arranged at an arbitrary virtual sound source position.

The signal reproduction apparatus according to claim 1, further comprising virtual sound source position calculation means for calculating a virtual sound source position of the specific sound image according to the extracted sound source information and sound image position information of the specific sound image, wherein the sound field synthesis means arranges the sound source signal calculated by the sound source signal calculation means at the virtual sound source position.

The signal reproduction apparatus, wherein the sound source signal calculation means calculates a new sound source signal based on the sound source information, and the sound field synthesis means arranges the new sound source signal at an arbitrary virtual sound source position.

The signal reproduction apparatus according to claim 1, wherein the sound source signal calculation means calculates a sound source signal in which the sound source signal of the specific sound image is rearranged in the depth direction.

The signal reproduction apparatus according to claim 1, wherein the multi-channel data is an intensity stereo signal, the sound source signal calculation means separates, from the intensity stereo signal, only a sound source signal localized at a central position based on an analysis result of the signal analysis means, and the sound field synthesis means reproduces, as normal intensity stereo, the intensity stereo signal from which the sound source signal localized at the central position has been removed, and rearranges the separated sound source signal at the central position.

The signal reproduction apparatus according to claim 1, wherein the sound source information and sound image position information for each sound image are prepared independently of the multi-channel data.

The signal reproduction apparatus according to claim 6, wherein the sound source information and sound image position information are stored in a predetermined area of a recording medium in association with data identification information that is unique to each content and is generated from a part of the multi-channel data, the apparatus further comprising: data identification information generation means for generating, from a part of the multi-channel data, data identification information for specifying the multi-channel data; and search means for searching the area for the sound source information and sound image position information corresponding to the generated data identification information.

The signal reproduction apparatus according to claim 6, wherein the multi-channel data, and the sound source information and sound image position information for each sound image prepared independently of the multi-channel data, are provided via a network.

The signal reproduction apparatus according to claim 6, wherein the multi-channel data is provided stored on a recording medium, and the sound source information and sound image position information for each sound image are provided via a network.

A signal reproduction method for reproducing multi-channel data in which audio signals are multiplexed, comprising:
a multi-channel data acquisition step of acquiring the multi-channel data;
a signal analysis step of extracting sound source information and sound image position information of a specific sound image from the multi-channel data;
a sound source signal calculation step of calculating a sound source signal from the sound source information; and
a sound field synthesis step of performing sound field synthesis in which the sound image information of the extracted specific sound image is changed and the sound source signal of the changed specific sound image is arranged at an arbitrary virtual sound source position.

The signal reproduction method according to claim 10, further comprising a virtual sound source position calculation step of calculating a virtual sound source position of the specific sound image according to the extracted sound image information and sound image position information of the specific sound image, wherein, in the sound field synthesis step, the sound source signal calculated in the sound source signal calculation step is arranged at the virtual sound source position.