CN103155030B - 用于处理多声道音频信号的方法及设备 - Google Patents
用于处理多声道音频信号的方法及设备 Download PDFInfo
- Publication number
- CN103155030B CN103155030B CN201180034344.9A CN201180034344A CN103155030B CN 103155030 B CN103155030 B CN 103155030B CN 201180034344 A CN201180034344 A CN 201180034344A CN 103155030 B CN103155030 B CN 103155030B
- Authority
- CN
- China
- Prior art keywords
- audio channel
- audio
- channel signals
- parameters
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 122
- 238000012545 processing Methods 0.000 title claims abstract description 73
- 238000000034 method Methods 0.000 title claims abstract description 57
- 230000002123 temporal effect Effects 0.000 claims description 36
- 238000005314 correlation function Methods 0.000 claims description 27
- 230000006870 function Effects 0.000 claims description 25
- 238000005259 measurement Methods 0.000 claims description 3
- 230000002596 correlated effect Effects 0.000 claims 1
- 230000000875 corresponding effect Effects 0.000 claims 1
- 239000000872 buffer Substances 0.000 description 32
- 238000004422 calculation algorithm Methods 0.000 description 22
- 238000007726 management method Methods 0.000 description 19
- 238000004364 calculation method Methods 0.000 description 16
- 238000011524 similarity measure Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 11
- 238000000605 extraction Methods 0.000 description 11
- 230000007246 mechanism Effects 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 230000009977 dual effect Effects 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 230000008447 perception Effects 0.000 description 6
- 230000006399 behavior Effects 0.000 description 4
- 230000003139 buffering effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 230000001934 delay Effects 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 230000007704 transition Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 239000002131 composite material Substances 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000001788 irregular Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
- G10L21/055—Time compression or expansion for synchronising with other signals, e.g. video signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
Abstract
Description
Claims (14)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2011/077198 WO2012167479A1 (en) | 2011-07-15 | 2011-07-15 | Method and apparatus for processing a multi-channel audio signal |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103155030A CN103155030A (zh) | 2013-06-12 |
CN103155030B true CN103155030B (zh) | 2015-07-08 |
Family
ID=47295369
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201180034344.9A Active CN103155030B (zh) | 2011-07-15 | 2011-07-15 | 用于处理多声道音频信号的方法及设备 |
Country Status (5)
Country | Link |
---|---|
US (1) | US9406302B2 (zh) |
EP (1) | EP2710592B1 (zh) |
JP (1) | JP5734517B2 (zh) |
CN (1) | CN103155030B (zh) |
WO (1) | WO2012167479A1 (zh) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI470974B (zh) * | 2013-01-10 | 2015-01-21 | Univ Nat Taiwan | 多媒體資料傳輸速率調節方法及網路電話語音資料傳輸速率調節方法 |
WO2014170530A1 (en) * | 2013-04-15 | 2014-10-23 | Nokia Corporation | Multiple channel audio signal encoder mode determiner |
US9712266B2 (en) * | 2013-05-21 | 2017-07-18 | Apple Inc. | Synchronization of multi-channel audio communicated over bluetooth low energy |
WO2014202647A1 (en) | 2013-06-21 | 2014-12-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Jitter buffer control, audio decoder, method and computer program |
SG11201510501YA (en) | 2013-06-21 | 2016-01-28 | Fraunhofer Ges Forschung | Time scaler, audio decoder, method and a computer program using a quality control |
CN104282309A (zh) | 2013-07-05 | 2015-01-14 | 杜比实验室特许公司 | 丢包掩蔽装置和方法以及音频处理系统 |
US9942119B2 (en) | 2013-09-19 | 2018-04-10 | Binauric SE | Adaptive jitter buffer |
PL3503097T3 (pl) * | 2016-01-22 | 2024-03-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Urządzenie oraz sposób do enkodowania lub dekodowania sygnału wielokanałowego z wykorzystaniem ponownego próbkowania w dziedzinie widmowej |
EP3246923A1 (en) | 2016-05-20 | 2017-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a multichannel audio signal |
US10706859B2 (en) * | 2017-06-02 | 2020-07-07 | Apple Inc. | Transport of audio between devices using a sparse stream |
CN108600936B (zh) * | 2018-04-19 | 2020-01-03 | 北京微播视界科技有限公司 | 多声道音频处理方法、装置、计算机可读存储介质和终端 |
EP3719799A1 (en) * | 2019-04-04 | 2020-10-07 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation |
CN110501674A (zh) * | 2019-08-20 | 2019-11-26 | 长安大学 | 一种基于半监督学习的声信号非视距识别方法 |
CN110808054B (zh) * | 2019-11-04 | 2022-05-06 | 思必驰科技股份有限公司 | 多路音频的压缩与解压缩方法及系统 |
CN111415675B (zh) * | 2020-02-14 | 2023-09-12 | 北京声智科技有限公司 | 音频信号处理方法、装置、设备及存储介质 |
JP2023515886A (ja) | 2020-03-02 | 2023-04-14 | マジック リープ, インコーポレイテッド | 没入型のオーディオプラットフォーム |
EP4475122A1 (en) * | 2023-06-06 | 2024-12-11 | Nokia Technologies Oy | Adapting spatial audio parameters for jitter buffer management |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1926824A (zh) * | 2004-05-26 | 2007-03-07 | 日本电信电话株式会社 | 声音分组再现方法、声音分组再现装置、声音分组再现程序、记录介质 |
CN101379556A (zh) * | 2006-02-07 | 2009-03-04 | 诺基亚公司 | 控制音频信号的时间缩放 |
CN102084418A (zh) * | 2008-07-01 | 2011-06-01 | 诺基亚公司 | 用于调整多通道音频信号的空间线索信息的设备和方法 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050137729A1 (en) * | 2003-12-18 | 2005-06-23 | Atsuhiro Sakurai | Time-scale modification stereo audio signals |
ES2335221T3 (es) * | 2004-01-28 | 2010-03-23 | Koninklijke Philips Electronics N.V. | Procedimiento y aparato para ajustar la escala de tiempo en una señal. |
JP4550652B2 (ja) * | 2005-04-14 | 2010-09-22 | 株式会社東芝 | 音響信号処理装置、音響信号処理プログラム及び音響信号処理方法 |
US7957960B2 (en) * | 2005-10-20 | 2011-06-07 | Broadcom Corporation | Audio time scale modification using decimation-based synchronized overlap-add algorithm |
US7647229B2 (en) * | 2006-10-18 | 2010-01-12 | Nokia Corporation | Time scaling of multi-channel audio signals |
JP4940888B2 (ja) | 2006-10-23 | 2012-05-30 | ソニー株式会社 | オーディオ信号伸張圧縮装置及び方法 |
JP2010017216A (ja) | 2008-07-08 | 2010-01-28 | Ge Medical Systems Global Technology Co Llc | 音声データ処理装置,音声データ処理方法、および、イメージング装置 |
CN102157152B (zh) * | 2010-02-12 | 2014-04-30 | 华为技术有限公司 | 立体声编码的方法、装置 |
-
2011
- 2011-07-15 WO PCT/CN2011/077198 patent/WO2012167479A1/en active Application Filing
- 2011-07-15 EP EP11867249.2A patent/EP2710592B1/en not_active Not-in-force
- 2011-07-15 JP JP2014519373A patent/JP5734517B2/ja not_active Expired - Fee Related
- 2011-07-15 CN CN201180034344.9A patent/CN103155030B/zh active Active
-
2013
- 2013-12-31 US US14/144,874 patent/US9406302B2/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1926824A (zh) * | 2004-05-26 | 2007-03-07 | 日本电信电话株式会社 | 声音分组再现方法、声音分组再现装置、声音分组再现程序、记录介质 |
CN101379556A (zh) * | 2006-02-07 | 2009-03-04 | 诺基亚公司 | 控制音频信号的时间缩放 |
CN102084418A (zh) * | 2008-07-01 | 2011-06-01 | 诺基亚公司 | 用于调整多通道音频信号的空间线索信息的设备和方法 |
Also Published As
Publication number | Publication date |
---|---|
EP2710592B1 (en) | 2017-11-22 |
JP2014518407A (ja) | 2014-07-28 |
CN103155030A (zh) | 2013-06-12 |
JP5734517B2 (ja) | 2015-06-17 |
WO2012167479A1 (en) | 2012-12-13 |
EP2710592A1 (en) | 2014-03-26 |
US20140140516A1 (en) | 2014-05-22 |
EP2710592A4 (en) | 2014-04-16 |
US9406302B2 (en) | 2016-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103155030B (zh) | 用于处理多声道音频信号的方法及设备 | |
US7394833B2 (en) | Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification | |
AU2006228821B2 (en) | Device and method for producing a data flow and for producing a multi-channel representation | |
RU2491658C2 (ru) | Синтезатор аудиосигнала и кодирующее устройство аудиосигнала | |
US11170791B2 (en) | Systems and methods for implementing efficient cross-fading between compressed audio streams | |
US8504378B2 (en) | Stereo acoustic signal encoding apparatus, stereo acoustic signal decoding apparatus, and methods for the same | |
KR101680953B1 (ko) | 인지 오디오 코덱들에서의 고조파 신호들에 대한 위상 코히어런스 제어 | |
US7734473B2 (en) | Method and apparatus for time scaling of a signal | |
KR20100086000A (ko) | 오디오 신호 처리 방법 및 장치 | |
JP2016539357A (ja) | 音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法 | |
US8996389B2 (en) | Artifact reduction in time compression | |
KR101411197B1 (ko) | 패킷 스트림 내의 지터 보상 방법 | |
US20140214412A1 (en) | Apparatus and method for processing voice signal | |
US11961538B2 (en) | Systems and methods for implementing efficient cross-fading between compressed audio streams | |
TW202445562A (zh) | 使用時長調整的音頻處理器、音頻處理系統、音頻解碼器、用於提供處理後音頻訊號表示的方法以及電腦程式 | |
CN119012079A (zh) | 一种基于dsp运算的多声道音频的播放控制方法 | |
Liu | Time scale modification of digital audio signals and its applications | |
KR20190013756A (ko) | 다중채널 오디오 신호를 처리하는 장치 및 방법 | |
WO2009047675A2 (en) | Encoding and decoding of an audio signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210507 Address after: Unit 3401, unit a, building 6, Shenye Zhongcheng, No. 8089, Hongli West Road, Donghai community, Xiangmihu street, Futian District, Shenzhen, Guangdong 518040 Patentee after: Honor Device Co.,Ltd. Address before: 518129 Bantian HUAWEI headquarters office building, Longgang District, Guangdong, Shenzhen Patentee before: HUAWEI TECHNOLOGIES Co.,Ltd. |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: Unit 3401, unit a, building 6, Shenye Zhongcheng, No. 8089, Hongli West Road, Donghai community, Xiangmihu street, Futian District, Shenzhen, Guangdong 518040 Patentee after: Honor Terminal Co.,Ltd. Country or region after: China Address before: 3401, unit a, building 6, Shenye Zhongcheng, No. 8089, Hongli West Road, Donghai community, Xiangmihu street, Futian District, Shenzhen, Guangdong Patentee before: Honor Device Co.,Ltd. Country or region before: China |