KR100614496B1 - 가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 - Google Patents
가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 Download PDFInfo
- Publication number
- KR100614496B1 KR100614496B1 KR1020030080225A KR20030080225A KR100614496B1 KR 100614496 B1 KR100614496 B1 KR 100614496B1 KR 1020030080225 A KR1020030080225 A KR 1020030080225A KR 20030080225 A KR20030080225 A KR 20030080225A KR 100614496 B1 KR100614496 B1 KR 100614496B1
- Authority
- KR
- South Korea
- Prior art keywords
- encoding
- signal
- bit rate
- audio
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000005236 sound signal Effects 0.000 claims abstract description 37
- 230000015556 catabolic process Effects 0.000 abstract description 3
- 238000006731 degradation reaction Methods 0.000 abstract description 3
- 239000013598 vector Substances 0.000 description 12
- 238000013139 quantization Methods 0.000 description 7
- 230000003044 adaptive effect Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 230000005540 biological transmission Effects 0.000 description 4
- 238000010295 mobile communication Methods 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 101000622137 Homo sapiens P-selectin Proteins 0.000 description 1
- 102100023472 P-selectin Human genes 0.000 description 1
- 101000873420 Simian virus 40 SV40 early leader protein Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
표준 | 압축 방식 | 속도 | MOS | 응용 |
G.711 | PCM | 64 Kbps | 4.1 | 전화국간 디지털 전송 |
G.721 | ADPCM | 32 Kbps | 3.85 | 가정 또는 기업의 CODEC |
G.722 | SB-ADPCM | 64 Kbps | (오디오 신호) | 멀티미디어 음성회의. AM 방송 품질 |
G.728 | LD-CELP | 16 Kbps | 3.61 | 디지털 이동통신, ISDN, FR망 음성용 |
G.729 | CS-ACELP | 8 Kbps | 3.92 | H.323, H.320 영상회의 단말 이동통신, FR망 음성용 |
G.723.1 | MP-MLQ | 6.3 Kbps | 3.9 | 이동통신, H.324 등 영상회의 단말 VOIP 포럼 추천 |
ACELP | 5.3 Kbps | 3.65 |
Claims (8)
- 가변 비트율(variable bit rate)의 광대역 음성 및 오디오 부호화(wideband speech and audio coding) 장치에 있어서,코덱으로 입력되는 신호를 음성이나 오디오 신호로 각각 분류하는 음성 및 오디오 분류 수단;상기 분류된 입력 신호가 음성 신호인 경우, 협대역 부호화를 수행하는 협대역 부호화 수단;상기 분류된 입력 신호가 오디오 신호인 경우, 저대역 신호 및 고대역 신호의 부호화를 위해 저대역과 고대역에 부호화 비트를 할당하는 비트율 조정 수단;상기 저대역에 할당되는 부호화 비트를 일부 줄이고, 줄인 만큼 상기 고대역에 부호화 비트를 추가 할당하여 부호화를 수행하는 광대역 부호화 수단을 포함하는 광대역 음성 및 오디오 부호화 장치.
- 제1항에 있어서,상기 비트율 조정 수단은 낮은 비트율의 입력 오디오 신호에 대해 상기 저대역과 고대역의 비트율을 조정하는 것을 특징으로 하는 광대역 음성 및 오디오 부호화 장치.
- 삭제
- 가변 비트율의 광대역 음성 및 오디오 부호화 방법에 있어서,ⅰ) 코덱으로 입력되는 신호를 판별하여 음성이나 오디오 신호로 각각 분류하는 단계;ⅱ) 상기 분류된 입력 신호가 음성 신호인 경우, 저대역에만 비트를 할당하고 부호화를 수행하는 단계;iii) 상기 분류된 입력 신호가 오디오 신호인 경우, 저대역 신호 및 고대역 신호의 부호화를 위해 저대역과 고대역에 부호화 비트를 할당하는 단계;iv) 상기 저대역에 할당되는 부호화 비트를 일부 줄이고, 줄인 만큼 상기 고대역에 부호화 비트를 추가 할당하여 부호화를 수행하는 단계를 포함하는 광대역 음성 및 오디오 부호화 방법.
- 제4항에 있어서,상기 ⅱ) 단계의 부호화는 음성-기반(speech-oriented) 협대역 부호화인 것을 특징으로 하는 광대역 음성 및 오디오 부호화 방법.
- 제4항에 있어서,상기 ⅳ) 단계의 부호화는 오디오-기반(audio-oriented) 광대역 부호화인 것을 특징으로 하는 광대역 음성 및 오디오 부호화 방법.
- 삭제
- 가변 비트율의 광대역 음성 및 오디오 부호화를 수행하는 프로그램이 저장된 기록매체에 있어서,ⅰ) 코덱으로 입력되는 신호를 판별하여 음성이나 오디오 신호로 각각 분류하는 기능;ⅱ) 상기 분류된 입력 신호가 음성 신호인 경우, 저대역에만 비트를 할당하고 부호화를 수행하는 기능;iii) 상기 분류된 입력 신호가 오디오 신호인 경우, 저대역 신호 및 고대역 신호의 부호화를 위해 저대역과 고대역에 부호화 비트를 할당하는 기능;iv) 상기 저대역에 할당되는 부호화 비트를 일부 줄이고, 줄인 만큼 상기 고대역에 부호화 비트를 추가 할당하여 부호화를 수행하는 기능을 구현하는 프로그램이 저장된 기록매체.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020030080225A KR100614496B1 (ko) | 2003-11-13 | 2003-11-13 | 가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 |
US10/967,045 US7634402B2 (en) | 2003-11-13 | 2004-10-14 | Apparatus for coding of variable bitrate wideband speech and audio signals, and a method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020030080225A KR100614496B1 (ko) | 2003-11-13 | 2003-11-13 | 가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20050046204A KR20050046204A (ko) | 2005-05-18 |
KR100614496B1 true KR100614496B1 (ko) | 2006-08-22 |
Family
ID=34567721
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020030080225A Expired - Fee Related KR100614496B1 (ko) | 2003-11-13 | 2003-11-13 | 가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 |
Country Status (2)
Country | Link |
---|---|
US (1) | US7634402B2 (ko) |
KR (1) | KR100614496B1 (ko) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8903720B2 (en) | 2008-07-14 | 2014-12-02 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US8959015B2 (en) | 2008-07-14 | 2015-02-17 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
KR101717256B1 (ko) * | 2016-08-30 | 2017-03-27 | (주)아이엠피 | 보이스와 오디오의 적응적 네트워크 밸런싱 기반의 광역 전관방송을 위한 음향 송출 장치 |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7619995B1 (en) * | 2003-07-18 | 2009-11-17 | Nortel Networks Limited | Transcoders and mixers for voice-over-IP conferencing |
KR100754389B1 (ko) * | 2005-09-29 | 2007-08-31 | 삼성전자주식회사 | 음성 및 오디오 신호 부호화 장치 및 방법 |
WO2007073260A1 (en) * | 2005-12-22 | 2007-06-28 | Infineon Technologies Ag | Method and arrangement for narrowband compatible wideband communication in a dect system |
US20080300866A1 (en) * | 2006-05-31 | 2008-12-04 | Motorola, Inc. | Method and system for creation and use of a wideband vocoder database for bandwidth extension of voice |
KR100883656B1 (ko) * | 2006-12-28 | 2009-02-18 | 삼성전자주식회사 | 오디오 신호의 분류 방법 및 장치와 이를 이용한 오디오신호의 부호화/복호화 방법 및 장치 |
US20090099851A1 (en) * | 2007-10-11 | 2009-04-16 | Broadcom Corporation | Adaptive bit pool allocation in sub-band coding |
US8566107B2 (en) * | 2007-10-15 | 2013-10-22 | Lg Electronics Inc. | Multi-mode method and an apparatus for processing a signal |
US20090259469A1 (en) * | 2008-04-14 | 2009-10-15 | Motorola, Inc. | Method and apparatus for speech recognition |
US8447617B2 (en) * | 2009-12-21 | 2013-05-21 | Mindspeed Technologies, Inc. | Method and system for speech bandwidth extension |
US20110280398A1 (en) * | 2010-05-17 | 2011-11-17 | Anatoly Fradis | Secured content distribution system |
EP2590164B1 (en) * | 2010-07-01 | 2016-12-21 | LG Electronics Inc. | Audio signal processing |
US8964966B2 (en) * | 2010-09-15 | 2015-02-24 | Avaya Inc. | Multi-microphone system to support bandpass filtering for analog-to-digital conversions at different data rates |
EP2660811B1 (en) | 2011-02-16 | 2017-03-29 | Nippon Telegraph And Telephone Corporation | Encoding method, decoding method, encoder, decoder, program and recording medium |
CN103035248B (zh) | 2011-10-08 | 2015-01-21 | 华为技术有限公司 | 音频信号编码方法和装置 |
US9742780B2 (en) * | 2015-02-06 | 2017-08-22 | Microsoft Technology Licensing, Llc | Audio based discovery and connection to a service controller |
US9660999B2 (en) | 2015-02-06 | 2017-05-23 | Microsoft Technology Licensing, Llc | Discovery and connection to a service controller |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5109417A (en) * | 1989-01-27 | 1992-04-28 | Dolby Laboratories Licensing Corporation | Low bit rate transform coder, decoder, and encoder/decoder for high-quality audio |
US5752225A (en) * | 1989-01-27 | 1998-05-12 | Dolby Laboratories Licensing Corporation | Method and apparatus for split-band encoding and split-band decoding of audio information using adaptive bit allocation to adjacent subbands |
US5778335A (en) * | 1996-02-26 | 1998-07-07 | The Regents Of The University Of California | Method and apparatus for efficient multiband celp wideband speech and music coding and decoding |
JP2002016925A (ja) | 2000-04-27 | 2002-01-18 | Canon Inc | 符号化装置及び符号化方法 |
JP3467469B2 (ja) | 2000-10-31 | 2003-11-17 | Necエレクトロニクス株式会社 | 音声復号装置および音声復号プログラムを記録した記録媒体 |
CA2430923C (en) | 2001-11-14 | 2012-01-03 | Matsushita Electric Industrial Co., Ltd. | Encoding device, decoding device, and system thereof |
US7333475B2 (en) * | 2002-09-27 | 2008-02-19 | Broadcom Corporation | Switchboard for multiple data rate communication system |
-
2003
- 2003-11-13 KR KR1020030080225A patent/KR100614496B1/ko not_active Expired - Fee Related
-
2004
- 2004-10-14 US US10/967,045 patent/US7634402B2/en not_active Expired - Fee Related
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8903720B2 (en) | 2008-07-14 | 2014-12-02 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US8959015B2 (en) | 2008-07-14 | 2015-02-17 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US9818411B2 (en) | 2008-07-14 | 2017-11-14 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US10403293B2 (en) | 2008-07-14 | 2019-09-03 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US10714103B2 (en) | 2008-07-14 | 2020-07-14 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US11705137B2 (en) | 2008-07-14 | 2023-07-18 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
US12205599B2 (en) | 2008-07-14 | 2025-01-21 | Electronics And Telecommunications Research Institute | Apparatus for encoding and decoding of integrated speech and audio |
KR101717256B1 (ko) * | 2016-08-30 | 2017-03-27 | (주)아이엠피 | 보이스와 오디오의 적응적 네트워크 밸런싱 기반의 광역 전관방송을 위한 음향 송출 장치 |
Also Published As
Publication number | Publication date |
---|---|
US20050108009A1 (en) | 2005-05-19 |
KR20050046204A (ko) | 2005-05-18 |
US7634402B2 (en) | 2009-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100614496B1 (ko) | 가변 비트율의 광대역 음성 및 오디오 부호화 장치 및방법 | |
US5778335A (en) | Method and apparatus for efficient multiband celp wideband speech and music coding and decoding | |
JP4444749B2 (ja) | 減少レート、可変レートの音声分析合成を実行する方法及び装置 | |
Gersho | Advances in speech and audio compression | |
CN1703737B (zh) | 在自适应多速率宽带(amr-wb)和多模式可变比特率宽带(vmr-wb)编解码器之间互操作的方法 | |
KR100732659B1 (ko) | 가변 비트 레이트 광대역 스피치 음성 코딩시의 이득양자화를 위한 방법 및 장치 | |
JP5543405B2 (ja) | フレームエラーに対する感度を低減する符号化体系パターンを使用する予測音声コーダ | |
AU2003281378B2 (en) | Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for CDMA wireless systems | |
KR20010093208A (ko) | 주기적 음성 코딩 | |
KR20010093210A (ko) | 가변 속도 음성 코딩 | |
JP2004517348A (ja) | 非音声のスピーチの高性能の低ビット速度コード化方法および装置 | |
JP4874464B2 (ja) | 遷移音声フレームのマルチパルス補間的符号化 | |
US6434519B1 (en) | Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder | |
JP3396480B2 (ja) | 多重モード音声コーダのためのエラー保護 | |
Vaseghi | Finite state CELP for variable rate speech coding | |
EP1397655A1 (en) | Method and device for coding speech in analysis-by-synthesis speech coders | |
Iao | Mixed wideband speech and music coding using a speech/music discriminator | |
Jbira et al. | Multi-layer scalable LPC audio format | |
Woodard et al. | A Range of Low and High Delay CELP Speech Codecs between 8 and 4 kbits/s | |
Paksoy | Variable rate speech coding with phonetic classification | |
JPH07239699A (ja) | 音声符号化方法およびこの方法を用いた音声符号化装置 | |
Farrugia | Combined speech and audio coding with bit rate and bandwidth scalability | |
De Iacovo et al. | A Two-Band CELP Audio Coder at 16 kbit/s and Its Evaluation | |
HK1130558B (en) | Method and device for cdma wireless systems | |
JPS6019520B2 (ja) | 音声処理装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PA0109 | Patent application |
Patent event code: PA01091R01D Comment text: Patent Application Patent event date: 20031113 |
|
A201 | Request for examination | ||
PA0201 | Request for examination |
Patent event code: PA02012R01D Patent event date: 20040615 Comment text: Request for Examination of Application Patent event code: PA02011R01I Patent event date: 20031113 Comment text: Patent Application |
|
PG1501 | Laying open of application | ||
E902 | Notification of reason for refusal | ||
PE0902 | Notice of grounds for rejection |
Comment text: Notification of reason for refusal Patent event date: 20060228 Patent event code: PE09021S01D |
|
E701 | Decision to grant or registration of patent right | ||
PE0701 | Decision of registration |
Patent event code: PE07011S01D Comment text: Decision to Grant Registration Patent event date: 20060810 |
|
GRNT | Written decision to grant | ||
PR0701 | Registration of establishment |
Comment text: Registration of Establishment Patent event date: 20060814 Patent event code: PR07011E01D |
|
PR1002 | Payment of registration fee |
Payment date: 20060816 End annual number: 3 Start annual number: 1 |
|
PG1601 | Publication of registration | ||
PR1001 | Payment of annual fee |
Payment date: 20090727 Start annual number: 4 End annual number: 4 |
|
PR1001 | Payment of annual fee |
Payment date: 20100802 Start annual number: 5 End annual number: 5 |
|
PR1001 | Payment of annual fee |
Payment date: 20110729 Start annual number: 6 End annual number: 6 |
|
FPAY | Annual fee payment |
Payment date: 20120730 Year of fee payment: 7 |
|
PR1001 | Payment of annual fee |
Payment date: 20120730 Start annual number: 7 End annual number: 7 |
|
FPAY | Annual fee payment |
Payment date: 20130729 Year of fee payment: 8 |
|
PR1001 | Payment of annual fee |
Payment date: 20130729 Start annual number: 8 End annual number: 8 |
|
FPAY | Annual fee payment |
Payment date: 20140728 Year of fee payment: 9 |
|
PR1001 | Payment of annual fee |
Payment date: 20140728 Start annual number: 9 End annual number: 9 |
|
LAPS | Lapse due to unpaid annual fee | ||
PC1903 | Unpaid annual fee |
Termination category: Default of registration fee Termination date: 20160709 |