DE60012760D1 - Multimodaler sprachkodierer - Google Patents
Multimodaler sprachkodiererInfo
- Publication number
- DE60012760D1 DE60012760D1 DE60012760T DE60012760T DE60012760D1 DE 60012760 D1 DE60012760 D1 DE 60012760D1 DE 60012760 T DE60012760 T DE 60012760T DE 60012760 T DE60012760 T DE 60012760T DE 60012760 D1 DE60012760 D1 DE 60012760D1
- Authority
- DE
- Germany
- Prior art keywords
- rate
- speech
- compression system
- rate codec
- codec
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000006835 compression Effects 0.000 abstract 3
- 238000007906 compression Methods 0.000 abstract 3
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Lubricants (AREA)
- Ink Jet (AREA)
- Graft Or Block Polymers (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US574396 | 1995-12-18 | ||
US15532199P | 1999-09-22 | 1999-09-22 | |
US155321P | 1999-09-22 | ||
US09/574,396 US6782360B1 (en) | 1999-09-22 | 2000-05-19 | Gain quantization for a CELP speech coder |
PCT/US2000/025182 WO2001022402A1 (en) | 1999-09-22 | 2000-09-15 | Multimode speech encoder |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60012760D1 true DE60012760D1 (de) | 2004-09-09 |
DE60012760T2 DE60012760T2 (de) | 2005-08-04 |
Family
ID=26852220
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60012760T Expired - Lifetime DE60012760T2 (de) | 1999-09-22 | 2000-09-15 | Multimodaler sprachkodierer |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP1214706B9 (de) |
JP (2) | JP4176349B2 (de) |
KR (1) | KR100488080B1 (de) |
CN (1) | CN1245706C (de) |
AT (1) | ATE272885T1 (de) |
AU (1) | AU7486200A (de) |
BR (1) | BRPI0014212B1 (de) |
DE (1) | DE60012760T2 (de) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100463418B1 (ko) * | 2002-11-11 | 2004-12-23 | 한국전자통신연구원 | Celp 음성 부호화기에서 사용되는 가변적인 고정코드북 검색방법 및 장치 |
FR2867649A1 (fr) * | 2003-12-10 | 2005-09-16 | France Telecom | Procede de codage multiple optimise |
WO2006098274A1 (ja) * | 2005-03-14 | 2006-09-21 | Matsushita Electric Industrial Co., Ltd. | スケーラブル復号化装置およびスケーラブル復号化方法 |
US7177804B2 (en) * | 2005-05-31 | 2007-02-13 | Microsoft Corporation | Sub-band voice codec with multi-stage codebooks and redundant coding |
CN101371296B (zh) * | 2006-01-18 | 2012-08-29 | Lg电子株式会社 | 用于编码和解码信号的设备和方法 |
US8451915B2 (en) | 2007-03-21 | 2013-05-28 | Samsung Electronics Co., Ltd. | Efficient uplink feedback in a wireless communication system |
KR20100006492A (ko) * | 2008-07-09 | 2010-01-19 | 삼성전자주식회사 | 부호화 방식 결정 방법 및 장치 |
KR101604774B1 (ko) | 2008-07-10 | 2016-03-18 | 보이세지 코포레이션 | 멀티-레퍼런스 lpc 필터 양자화 및 역 양자화 장치 및 방법 |
KR101170466B1 (ko) | 2008-07-29 | 2012-08-03 | 한국전자통신연구원 | Mdct 영역에서의 후처리 방법, 및 장치 |
JP2010122617A (ja) | 2008-11-21 | 2010-06-03 | Yamaha Corp | ノイズゲート、及び収音装置 |
JP2010160496A (ja) * | 2010-02-15 | 2010-07-22 | Toshiba Corp | 信号処理装置および信号処理方法 |
US9047875B2 (en) * | 2010-07-19 | 2015-06-02 | Futurewei Technologies, Inc. | Spectrum flatness control for bandwidth extension |
US9626982B2 (en) | 2011-02-15 | 2017-04-18 | Voiceage Corporation | Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec |
EP3686888B1 (de) * | 2011-02-15 | 2025-04-02 | VoiceAge EVS LLC | Vorrichtung und verfahren zur quantisierung der verstärkung von adaptiven und festen beiträgen der anregung in einem celp-koder-dekoder |
US9026434B2 (en) * | 2011-04-11 | 2015-05-05 | Samsung Electronic Co., Ltd. | Frame erasure concealment for a multi rate speech and audio codec |
US9336789B2 (en) * | 2013-02-21 | 2016-05-10 | Qualcomm Incorporated | Systems and methods for determining an interpolation factor set for synthesizing a speech signal |
CN104517612B (zh) * | 2013-09-30 | 2018-10-12 | 上海爱聊信息科技有限公司 | 基于amr-nb语音信号的可变码率编码器和解码器及其编码和解码方法 |
JP5981408B2 (ja) * | 2013-10-29 | 2016-08-31 | 株式会社Nttドコモ | 音声信号処理装置、音声信号処理方法、及び音声信号処理プログラム |
KR102392003B1 (ko) | 2014-03-28 | 2022-04-28 | 삼성전자주식회사 | 선형예측계수 양자화방법 및 장치와 역양자화 방법 및 장치 |
CN112927703A (zh) | 2014-05-07 | 2021-06-08 | 三星电子株式会社 | 对线性预测系数量化的方法和装置及解量化的方法和装置 |
SG11201609926YA (en) * | 2014-07-28 | 2016-12-29 | Ericsson Telefon Ab L M | Pyramid vector quantizer shape search |
US10109284B2 (en) * | 2016-02-12 | 2018-10-23 | Qualcomm Incorporated | Inter-channel encoding and decoding of multiple high-band audio signals |
US10373630B2 (en) * | 2017-03-31 | 2019-08-06 | Intel Corporation | Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices |
EP3692521B1 (de) * | 2017-10-06 | 2022-06-01 | Sony Europe B.V. | Audiodatei einhüllende mittels effektiver leistung in sequenzen von unter-fenstern . |
CN108122552B (zh) * | 2017-12-15 | 2021-10-15 | 上海智臻智能网络科技股份有限公司 | 语音情绪识别方法和装置 |
WO2021029642A1 (en) * | 2019-08-13 | 2021-02-18 | Samsung Electronics Co., Ltd. | System and method for recognizing user's speech |
CN113593521B (zh) * | 2021-07-29 | 2022-09-20 | 北京三快在线科技有限公司 | 语音合成方法、装置、设备及可读存储介质 |
CN118430508B (zh) * | 2024-05-29 | 2024-09-17 | 中国矿业大学 | 基于神经音频编解码器的语音合成方法 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3353852B2 (ja) * | 1994-02-15 | 2002-12-03 | 日本電信電話株式会社 | 音声の符号化方法 |
US5701390A (en) * | 1995-02-22 | 1997-12-23 | Digital Voice Systems, Inc. | Synthesis of MBE-based coded speech using regenerated phase information |
-
2000
- 2000-09-12 AU AU74862/00A patent/AU7486200A/en not_active Abandoned
- 2000-09-15 JP JP2001525686A patent/JP4176349B2/ja not_active Expired - Fee Related
- 2000-09-15 KR KR10-2002-7003768A patent/KR100488080B1/ko not_active Expired - Lifetime
- 2000-09-15 DE DE60012760T patent/DE60012760T2/de not_active Expired - Lifetime
- 2000-09-15 BR BRPI0014212A patent/BRPI0014212B1/pt not_active IP Right Cessation
- 2000-09-15 AT AT00963447T patent/ATE272885T1/de not_active IP Right Cessation
- 2000-09-15 EP EP00963447A patent/EP1214706B9/de not_active Expired - Lifetime
- 2000-09-15 CN CNB008159408A patent/CN1245706C/zh not_active Expired - Fee Related
-
2005
- 2005-07-11 JP JP2005202337A patent/JP2005338872A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
AU7486200A (en) | 2001-04-24 |
KR100488080B1 (ko) | 2005-05-06 |
JP4176349B2 (ja) | 2008-11-05 |
BR0014212A (pt) | 2003-06-10 |
JP2005338872A (ja) | 2005-12-08 |
EP1214706A1 (de) | 2002-06-19 |
EP1214706B1 (de) | 2004-08-04 |
CN1451155A (zh) | 2003-10-22 |
JP2003513296A (ja) | 2003-04-08 |
DE60012760T2 (de) | 2005-08-04 |
ATE272885T1 (de) | 2004-08-15 |
EP1214706B9 (de) | 2005-01-05 |
BRPI0014212B1 (pt) | 2016-07-26 |
CN1245706C (zh) | 2006-03-15 |
KR20020033819A (ko) | 2002-05-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60012760D1 (de) | Multimodaler sprachkodierer | |
AU2001287969A1 (en) | Codebook structure and search for speech coding | |
US7596486B2 (en) | Encoding an audio signal using different audio coder modes | |
AU2003278014A8 (en) | Methods for interoperation between adaptive multi-rate wideband (amr-wb) and multi-mode variable bit-rate wideband (wmr-wb) speech codecs | |
CA2096991A1 (en) | Celp-based speech compressor | |
CA2306098A1 (en) | Multimode speech coding apparatus and decoding apparatus | |
BR0304540A (pt) | Métodos para codificar um sinal de áudio, e para decodificar um sinal de áudio codificado, codificador para codificar um sinal de áudio, aparelho para fornecer um sinal de áudio, sinal de áudio codificado, meio de armazenagem, e, decodificador para decodificar um sinal de áudio codificado | |
DK1222659T3 (da) | LPC-harmonisk talekoder med superramme-struktur | |
HK1048187A1 (en) | Variable bit-rate celp coding of speech with phonetic classification. | |
CN101141644B (zh) | 编码集成系统和方法与解码集成系统和方法 | |
DE60027140D1 (de) | Sprachsynthetisierer auf der basis von sprachkodierung mit veränderlicher bit-rate | |
Choudhary et al. | Study and performance of amr codecs for gsm | |
WO2002023533A3 (en) | System for improved use of pitch enhancement with subcodebooks | |
Wang et al. | Transcoding Scheme between AMR-WB and VMR-WB | |
PL1756806T3 (pl) | Sposób kwantyzacji kodera mowy o bardzo małej przepływności | |
BRPI0520115A2 (pt) | métodos para codificar e para decodificar sinais de áudio e codificador e decodificador para sinais de áudio | |
Srinonchat et al. | New Bit Rate CELP coder for Speaker Dependent Coding System | |
Ozawa et al. | M-LCELP speech coding at bit-rates below 4kbps | |
Xu et al. | A novel transcoding algorithm between 3GPP AMR-NB (7.95 kbit/s) and ITU-t g. 729a (8kbit/s) | |
Shikui et al. | Speech transcoding from AMR to G. 729 in excitation domain |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |