DE60012760D1 - Multimode speech encoder - Google Patents

Multimode speech encoder

Info

Publication number
DE60012760D1
Authority
DE
Germany
Prior art keywords
rate
speech
compression system
rate codec
codec
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60012760T
Other languages
English (en)
Other versions
DE60012760T2 (de)
Inventor
Yang Gao
Adil Benyassine
Jes Thyssen
Eyal Shlomot
Huan-Yu Su
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Conexant Systems LLC
Original Assignee
Conexant Systems LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/574,396 (US6782360B1)
Application filed by Conexant Systems LLC
Publication of DE60012760D1
Application granted
Publication of DE60012760T2
Anticipated expiration
Expired - Lifetime (current legal status)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03G CONTROL OF AMPLIFICATION
    • H03G3/00 Gain control in amplifiers or frequency changers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Lubricants (AREA)
  • Ink Jet (AREA)
  • Graft Or Block Polymers (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
DE60012760T 1999-09-22 2000-09-15 Multimode speech encoder Expired - Lifetime DE60012760T2 (de)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US574396 1995-12-18
US15532199P 1999-09-22 1999-09-22
US155321P 1999-09-22
US09/574,396 US6782360B1 (en) 1999-09-22 2000-05-19 Gain quantization for a CELP speech coder
PCT/US2000/025182 WO2001022402A1 (en) 1999-09-22 2000-09-15 Multimode speech encoder

Publications (2)

Publication Number Publication Date
DE60012760D1 (de) 2004-09-09
DE60012760T2 (de) 2005-08-04

Family

ID=26852220

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
DE60012760T Expired - Lifetime DE60012760T2 (de) 1999-09-22 2000-09-15 Multimode speech encoder

Country Status (8)

Country Link
EP (1) EP1214706B9 (de)
JP (2) JP4176349B2 (de)
KR (1) KR100488080B1 (de)
CN (1) CN1245706C (de)
AT (1) ATE272885T1 (de)
AU (1) AU7486200A (de)
BR (1) BRPI0014212B1 (de)
DE (1) DE60012760T2 (de)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100463418B1 (ko) * 2002-11-11 2004-12-23 한국전자통신연구원 Variable fixed-codebook search method and apparatus for use in a CELP speech coder
FR2867649A1 (fr) * 2003-12-10 2005-09-16 France Telecom Optimized multiple coding method
WO2006098274A1 (ja) * 2005-03-14 2006-09-21 Matsushita Electric Industrial Co., Ltd. Scalable decoding apparatus and scalable decoding method
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding
CN101371296B (zh) * 2006-01-18 2012-08-29 Lg电子株式会社 Apparatus and method for encoding and decoding a signal
US8451915B2 (en) 2007-03-21 2013-05-28 Samsung Electronics Co., Ltd. Efficient uplink feedback in a wireless communication system
KR20100006492A (ko) * 2008-07-09 2010-01-19 삼성전자주식회사 Method and apparatus for determining an encoding scheme
KR101604774B1 (ko) 2008-07-10 2016-03-18 보이세지 코포레이션 Multi-reference LPC filter quantization and inverse quantization device and method
KR101170466B1 (ko) 2008-07-29 2012-08-03 한국전자통신연구원 Post-processing method and apparatus in the MDCT domain
JP2010122617A (ja) 2008-11-21 2010-06-03 Yamaha Corp Noise gate and sound pickup device
JP2010160496A (ja) * 2010-02-15 2010-07-22 Toshiba Corp Signal processing apparatus and signal processing method
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
US9626982B2 (en) 2011-02-15 2017-04-18 Voiceage Corporation Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
EP3686888B1 (de) * 2011-02-15 2025-04-02 VoiceAge EVS LLC Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
US9026434B2 (en) * 2011-04-11 2015-05-05 Samsung Electronic Co., Ltd. Frame erasure concealment for a multi rate speech and audio codec
US9336789B2 (en) * 2013-02-21 2016-05-10 Qualcomm Incorporated Systems and methods for determining an interpolation factor set for synthesizing a speech signal
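CN104517612B (zh) * 2013-09-30 2018-10-12 上海爱聊信息科技有限公司 Variable bit-rate encoder and decoder based on AMR-NB speech signals, and encoding and decoding methods thereof
JP5981408B2 (ja) * 2013-10-29 2016-08-31 株式会社Nttドコモ Audio signal processing device, audio signal processing method, and audio signal processing program
KR102392003B1 (ko) 2014-03-28 2022-04-28 삼성전자주식회사 Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for inverse quantization
CN112927703A (zh) 2014-05-07 2021-06-08 三星电子株式会社 Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for dequantization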
SG11201609926YA (en) * 2014-07-28 2016-12-29 Ericsson Telefon Ab L M Pyramid vector quantizer shape search
US10109284B2 (en) * 2016-02-12 2018-10-23 Qualcomm Incorporated Inter-channel encoding and decoding of multiple high-band audio signals
US10373630B2 (en) * 2017-03-31 2019-08-06 Intel Corporation Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
EP3692521B1 (de) * 2017-10-06 2022-06-01 Sony Europe B.V. Audio file envelope based on RMS power in sequences of sub-windows
CN108122552B (zh) * 2017-12-15 2021-10-15 上海智臻智能网络科技股份有限公司 Speech emotion recognition method and apparatus
WO2021029642A1 (en) * 2019-08-13 2021-02-18 Samsung Electronics Co., Ltd. System and method for recognizing user's speech
CN113593521B (zh) * 2021-07-29 2022-09-20 北京三快在线科技有限公司 Speech synthesis method, apparatus, device, and readable storage medium
CN118430508B (zh) * 2024-05-29 2024-09-17 中国矿业大学 Speech synthesis method based on a neural audio codec

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3353852B2 (ja) * 1994-02-15 2002-12-03 日本電信電話株式会社 Speech coding method
US5701390A (en) * 1995-02-22 1997-12-23 Digital Voice Systems, Inc. Synthesis of MBE-based coded speech using regenerated phase information

Also Published As

Publication number Publication date
AU7486200A (en) 2001-04-24
KR100488080B1 (ko) 2005-05-06
JP4176349B2 (ja) 2008-11-05
BR0014212A (pt) 2003-06-10
JP2005338872A (ja) 2005-12-08
EP1214706A1 (de) 2002-06-19
EP1214706B1 (de) 2004-08-04
CN1451155A (zh) 2003-10-22
JP2003513296A (ja) 2003-04-08
DE60012760T2 (de) 2005-08-04
ATE272885T1 (de) 2004-08-15
EP1214706B9 (de) 2005-01-05
BRPI0014212B1 (pt) 2016-07-26
CN1245706C (zh) 2006-03-15
KR20020033819A (ko) 2002-05-07

Similar Documents

Publication Publication Date Title
DE60012760D1 (de) Multimode speech encoder
AU2001287969A1 (en) Codebook structure and search for speech coding
US7596486B2 (en) Encoding an audio signal using different audio coder modes
AU2003278014A8 (en) Methods for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) speech codecs
CA2096991A1 (en) CELP-based speech compressor
CA2306098A1 (en) Multimode speech coding apparatus and decoding apparatus
BR0304540A (pt) Methods for encoding an audio signal and for decoding an encoded audio signal, encoder for encoding an audio signal, apparatus for supplying an audio signal, encoded audio signal, storage medium, and decoder for decoding an encoded audio signal
DK1222659T3 (da) LPC-harmonic speech coder with superframe structure
HK1048187A1 (en) Variable bit-rate CELP coding of speech with phonetic classification
CN101141644B (zh) Integrated encoding system and method, and integrated decoding system and method
DE60027140D1 (de) Speech synthesizer based on variable bit-rate speech coding
Choudhary et al. Study and performance of AMR codecs for GSM
WO2002023533A3 (en) System for improved use of pitch enhancement with subcodebooks
Wang et al. Transcoding Scheme between AMR-WB and VMR-WB
PL1756806T3 (pl) Quantization method for a very low bit-rate speech coder
BRPI0520115A2 (pt) Methods for encoding and decoding audio signals, and encoder and decoder for audio signals
Srinonchat et al. New Bit Rate CELP coder for Speaker Dependent Coding System
Ozawa et al. M-LCELP speech coding at bit-rates below 4 kbps
Xu et al. A novel transcoding algorithm between 3GPP AMR-NB (7.95 kbit/s) and ITU-T G.729A (8 kbit/s)
Shikui et al. Speech transcoding from AMR to G.729 in excitation domain

Legal Events

Date Code Title Description
8364 No opposition during term of opposition