DE60125542D1 - System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen - Google Patents

System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen

Info

Publication number: DE60125542D1
Authority: DE; Germany
Prior art keywords: engine; results; voice recognition; variety; engines
Prior art date: 2000-07-18
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE60125542T

Other languages

English (en)

Other versions

DE60125542T2 (de

Inventor

Harinath Garudadri

Puig Oses

Ning Bi

Yingyong Qi

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Qualcomm Inc

Original Assignee

Qualcomm Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-07-18

Filing date

2001-07-17

Publication date

2007-02-08

2001-07-17 Application filed by Qualcomm Inc filed Critical Qualcomm Inc

2007-02-08 Application granted granted Critical

2007-02-08 Publication of DE60125542D1 publication Critical patent/DE60125542D1/de

2007-10-11 Publication of DE60125542T2 publication Critical patent/DE60125542T2/de

2021-07-18 Anticipated expiration legal-status Critical

Status Expired - Lifetime legal-status Critical Current

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems

Landscapes

Engineering & Computer Science (AREA)
Acoustics & Sound (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Multimedia (AREA)
Telephonic Communication Services (AREA)
Selective Calling Equipment (AREA)
Circuit For Audible Band Transducer (AREA)
Machine Translation (AREA)
Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

DE60125542T 2000-07-18 2001-07-17 System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen Expired - Lifetime DE60125542T2 (de)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US618177		2000-07-18
US09/618,177 US6671669B1 (en)	2000-07-18	2000-07-18	combined engine system and method for voice recognition
PCT/US2001/022761 WO2002007148A1 (en)	2000-07-18	2001-07-17	System and method for voice recognition with a plurality of voice recognition engines

Publications (2)

Publication Number	Publication Date
DE60125542D1 true DE60125542D1 (de)	2007-02-08
DE60125542T2 DE60125542T2 (de)	2007-10-11

Family

ID=24476623

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE60125542T Expired - Lifetime DE60125542T2 (de)	2000-07-18	2001-07-17	System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen

Country Status (10)

Country	Link
US (1)	US6671669B1 (de)
EP (1)	EP1301922B1 (de)
CN (1)	CN1188831C (de)
AT (1)	ATE349751T1 (de)
AU (1)	AU2001275991A1 (de)
DE (1)	DE60125542T2 (de)
ES (1)	ES2278763T3 (de)
HK (1)	HK1057816A1 (de)
TW (1)	TWI253056B (de)
WO (1)	WO2002007148A1 (de)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
AU2018351031B2 (en) *	2017-10-20	2021-03-11	Please Hold (Uk) Limited	Audio signal

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7003463B1 (en)	1998-10-02	2006-02-21	International Business Machines Corporation	System and method for providing network coordinated conversational services
US6754629B1 (en) *	2000-09-08	2004-06-22	Qualcomm Incorporated	System and method for automatic voice recognition using mapping
US20030004720A1 (en) *	2001-01-30	2003-01-02	Harinath Garudadri	System and method for computing and transmitting parameters in a distributed voice recognition system
US20020143540A1 (en) *	2001-03-28	2002-10-03	Narendranath Malayath	Voice recognition system using implicit speaker adaptation
US7941313B2 (en)	2001-05-17	2011-05-10	Qualcomm Incorporated	System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
US7203643B2 (en) *	2001-06-14	2007-04-10	Qualcomm Incorporated	Method and apparatus for transmitting speech activity in distributed voice recognition systems
US7366673B2 (en) *	2001-06-15	2008-04-29	International Business Machines Corporation	Selective enablement of speech recognition grammars
TW541517B (en) *	2001-12-25	2003-07-11	Univ Nat Cheng Kung	Speech recognition system
US6996526B2 (en) *	2002-01-02	2006-02-07	International Business Machines Corporation	Method and apparatus for transcribing speech when a plurality of speakers are participating
US7203652B1 (en) *	2002-02-21	2007-04-10	Nuance Communications	Method and system for improving robustness in a speech system
JP4304952B2 (ja) *	2002-10-07	2009-07-29	三菱電機株式会社	車載制御装置、並びにその操作説明方法をコンピュータに実行させるプログラム
US20040138885A1 (en) *	2003-01-09	2004-07-15	Xiaofan Lin	Commercial automatic speech recognition engine combinations
EP1603116A1 (de) *	2003-02-19	2005-12-07	Matsushita Electric Industrial Co., Ltd.	Spracherkennungsanordnung und -verfahren
US7523097B1 (en) *	2004-01-13	2009-04-21	Juniper Networks, Inc.	Restoration of archived configurations for a network device
KR100693284B1 (ko) *	2005-04-14	2007-03-13	학교법인 포항공과대학교	음성 인식 장치
CN1963918A (zh) *	2005-11-11	2007-05-16	株式会社东芝	说话人模板的压缩、合并装置和方法，以及说话人认证
US7970613B2 (en)	2005-11-12	2011-06-28	Sony Computer Entertainment Inc.	Method and system for Gaussian probability data bit reduction and computation
US7778831B2 (en)	2006-02-21	2010-08-17	Sony Computer Entertainment Inc.	Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8010358B2 (en)	2006-02-21	2011-08-30	Sony Computer Entertainment Inc.	Voice recognition with parallel gender and age normalization
US8532984B2 (en)	2006-07-31	2013-09-10	Qualcomm Incorporated	Systems, methods, and apparatus for wideband encoding and decoding of active frames
GB0616070D0 (en) *	2006-08-12	2006-09-20	Ibm	Speech Recognition Feedback
US8239190B2 (en)	2006-08-22	2012-08-07	Qualcomm Incorporated	Time-warping frames of wideband vocoder
US7813922B2 (en) *	2007-01-30	2010-10-12	Nokia Corporation	Audio quantization
US8494847B2 (en) *	2007-02-28	2013-07-23	Nec Corporation	Weighting factor learning system and audio recognition system
US7904410B1 (en) *	2007-04-18	2011-03-08	The Mathworks, Inc.	Constrained dynamic time warping
US8352265B1 (en)	2007-12-24	2013-01-08	Edward Lin	Hardware implemented backend search engine for a high-rate speech recognition system
US8639510B1 (en)	2007-12-24	2014-01-28	Kai Yu	Acoustic scoring unit implemented on a single FPGA or ASIC
US8463610B1 (en)	2008-01-18	2013-06-11	Patrick J. Bourke	Hardware-implemented scalable modular engine for low-power speech recognition
WO2010019831A1 (en) *	2008-08-14	2010-02-18	21Ct, Inc.	Hidden markov model for speech processing with training method
US8442833B2 (en)	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en)	2009-02-17	2014-07-22	Sony Computer Entertainment Inc.	Multiple language voice recognition
US8442829B2 (en)	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US8417526B2 (en) *	2009-03-13	2013-04-09	Adacel, Inc.	Speech recognition learning system and method
US9026444B2 (en)	2009-09-16	2015-05-05	At&T Intellectual Property I, L.P.	System and method for personalization of acoustic models for automatic speech recognition
US8812321B2 (en) *	2010-09-30	2014-08-19	At&T Intellectual Property I, L.P.	System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning
US20120168331A1 (en) *	2010-12-30	2012-07-05	Safecode Drug Technologies Corp.	Voice template protector for administering medicine
US10032455B2 (en)	2011-01-07	2018-07-24	Nuance Communications, Inc.	Configurable speech recognition system using a pronunciation alignment between multiple recognizers
US9153235B2 (en)	2012-04-09	2015-10-06	Sony Computer Entertainment Inc.	Text dependent speaker recognition with long-term feature based on functional data analysis
CN104769668B (zh)	2012-10-04	2018-10-30	纽昂斯通讯公司	改进的用于asr的混合控制器
US9240184B1 (en) *	2012-11-15	2016-01-19	Google Inc.	Frame-level combination of deep neural network and gaussian mixture models
CN105027198B (zh) *	2013-02-25	2018-11-20	三菱电机株式会社	语音识别系统以及语音识别装置
CN104143330A (zh) *	2013-05-07	2014-11-12	佳能株式会社	语音识别方法和语音识别系统
US20140337030A1 (en) *	2013-05-07	2014-11-13	Qualcomm Incorporated	Adaptive audio frame processing for keyword detection
US9225879B2 (en) *	2013-12-27	2015-12-29	TCL Research America Inc.	Method and apparatus for video sequential alignment
PH12017500352B1 (en) *	2014-08-28	2022-07-06	Nokia Technologies Oy	Audio parameter quantization
CN104616653B (zh) *	2015-01-23	2018-02-23	北京云知声信息技术有限公司	唤醒词匹配方法、装置以及语音唤醒方法、装置
US10134425B1 (en) *	2015-06-29	2018-11-20	Amazon Technologies, Inc.	Direction-based speech endpointing
US10536464B2 (en) *	2016-06-22	2020-01-14	Intel Corporation	Secure and smart login engine
US10971157B2 (en)	2017-01-11	2021-04-06	Nuance Communications, Inc.	Methods and apparatus for hybrid speech recognition processing
US10607601B2 (en) *	2017-05-11	2020-03-31	International Business Machines Corporation	Speech recognition by selecting and refining hot words
CN109285548A (zh) *	2017-07-19	2019-01-29	阿里巴巴集团控股有限公司	信息处理方法、系统、电子设备、和计算机存储介质
GB2566759B8 (en)	2017-10-20	2021-12-08	Please Hold Uk Ltd	Encoding identifiers to produce audio identifiers from a plurality of audio bitstreams
US12131228B2 (en)	2019-04-02	2024-10-29	International Business Machines Corporation	Method for accessing data records of a master data management system
CN111128154B (zh) *	2019-12-03	2022-06-03	杭州蓦然认知科技有限公司	一种聚合形成交互引擎簇的方法及装置
CN111694331B (zh) *	2020-05-11	2021-11-02	杭州睿疆科技有限公司	生产工艺参数调整的系统、方法和计算机设备
US11664033B2 (en) *	2020-06-15	2023-05-30	Samsung Electronics Co., Ltd.	Electronic apparatus and controlling method thereof
US11996087B2 (en)	2021-04-30	2024-05-28	Comcast Cable Communications, Llc	Method and apparatus for intelligent voice recognition
CN115376513B (zh) *	2022-10-19	2023-05-12	广州小鹏汽车科技有限公司	语音交互方法、服务器及计算机可读存储介质

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4587670A (en) *	1982-10-15	1986-05-06	At&T Bell Laboratories	Hidden Markov model speech recognition arrangement
US4783804A (en) *	1985-03-21	1988-11-08	American Telephone And Telegraph Company, At&T Bell Laboratories	Hidden Markov model speech recognition arrangement
US4852180A (en) *	1987-04-03	1989-07-25	American Telephone And Telegraph Company, At&T Bell Laboratories	Speech recognition by acoustic/phonetic system and technique
US5167004A (en) *	1991-02-28	1992-11-24	Texas Instruments Incorporated	Temporal decorrelation method for robust speaker verification
AU671952B2 (en) *	1991-06-11	1996-09-19	Qualcomm Incorporated	Variable rate vocoder
US5222190A (en) *	1991-06-11	1993-06-22	Texas Instruments Incorporated	Apparatus and method for identifying a speech pattern
US5450522A (en) *	1991-08-19	1995-09-12	U S West Advanced Technologies, Inc.	Auditory model for parametrization of speech
CA2126380C (en) *	1993-07-22	1998-07-07	Wu Chou	Minimum error rate training of combined string models
US5839103A (en) *	1995-06-07	1998-11-17	Rutgers, The State University Of New Jersey	Speaker verification system using decision fusion logic
US5754978A (en) *	1995-10-27	1998-05-19	Speech Systems Of Colorado, Inc.	Speech recognition system
US5819220A (en) *	1996-09-30	1998-10-06	Hewlett-Packard Company	Web triggered word set boosting for speech interfaces to the world wide web
US6003002A (en) *	1997-01-02	1999-12-14	Texas Instruments Incorporated	Method and system of adapting speech recognition models to speaker environment
US5893059A (en) *	1997-04-17	1999-04-06	Nynex Science And Technology, Inc.	Speech recoginition methods and apparatus
US6014624A (en) *	1997-04-18	2000-01-11	Nynex Science And Technology, Inc.	Method and apparatus for transitioning from one voice recognition system to another
US6526380B1 (en)	1999-03-26	2003-02-25	Koninklijke Philips Electronics N.V.	Speech recognition system having parallel large vocabulary recognition engines

2000
- 2000-07-18 US US09/618,177 patent/US6671669B1/en not_active Expired - Lifetime
2001
- 2001-07-17 AU AU2001275991A patent/AU2001275991A1/en not_active Abandoned
- 2001-07-17 EP EP01953554A patent/EP1301922B1/de not_active Expired - Lifetime
- 2001-07-17 ES ES01953554T patent/ES2278763T3/es not_active Expired - Lifetime
- 2001-07-17 DE DE60125542T patent/DE60125542T2/de not_active Expired - Lifetime
- 2001-07-17 WO PCT/US2001/022761 patent/WO2002007148A1/en active IP Right Grant
- 2001-07-17 AT AT01953554T patent/ATE349751T1/de not_active IP Right Cessation
- 2001-07-17 CN CNB018145922A patent/CN1188831C/zh not_active Expired - Fee Related
- 2001-07-18 TW TW090117578A patent/TWI253056B/zh not_active IP Right Cessation
2004
- 2004-01-30 HK HK04100626A patent/HK1057816A1/xx not_active IP Right Cessation

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
AU2018351031B2 (en) *	2017-10-20	2021-03-11	Please Hold (Uk) Limited	Audio signal

Also Published As

Publication number	Publication date
DE60125542T2 (de)	2007-10-11
TWI253056B (en)	2006-04-11
ES2278763T3 (es)	2007-08-16
US6671669B1 (en)	2003-12-30
CN1188831C (zh)	2005-02-09
AU2001275991A1 (en)	2002-01-30
CN1454380A (zh)	2003-11-05
HK1057816A1 (en)	2004-04-16
ATE349751T1 (de)	2007-01-15
EP1301922A1 (de)	2003-04-16
EP1301922B1 (de)	2006-12-27
WO2002007148A1 (en)	2002-01-24

Publication	Publication Date	Title
DE60125542D1 (de)	2007-02-08	System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen
ATE344959T1 (de)	2006-11-15	Kombination von digitaler zeitverschiebung und hmm in sprecherabhängiger- und sprecherunabhängiger weise für die spracherkennung
DE69811921D1 (de)	2003-04-10	Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung
HK1062738A1 (en)	2004-11-19	Apparation and method for performing voice recognition using acoustic feature vector modification
WO2004100638A3 (en)	2006-05-04	Source-dependent text-to-speech system
DE602004015973D1 (de)	2008-10-02	Spracherkennungssystem und verfahren auf phonetischer basis
DE69822179D1 (de)	2004-04-08	Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung
ATE297588T1 (de)	2005-06-15	Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
DE60033132D1 (de)	2007-03-15	Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern
ATE335195T1 (de)	2006-08-15	Hintergrundlernen von sprecherstimmen
ATE482447T1 (de)	2010-10-15	Registrierung für spracherkennungssystem
EP1022722A3 (de)	2000-08-16	Sprecheradaptation auf der Basis von Stimm-Eigenvektoren
ATE312398T1 (de)	2005-12-15	Sprecheranpassung für die spracherkennung
ATE531033T1 (de)	2011-11-15	System und verfahren zur verteilung einer spracherkennungsgrammatik
ATE336059T1 (de)	2006-09-15	Verfahren und system zur erzeugung von behaglichkeitsrauschen bei der sprachkommunikation
ATE253763T1 (de)	2003-11-15	Verfahren zur spracherkennung
MX9505299A (es)	1997-01-31	Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
DE69630999T2 (de)	2004-10-21	Verfahren zur verringerung von datenbankanforderungen für ein spracherkennungssystem
DE60228716D1 (de)	2008-10-16	Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
ATE342566T1 (de)	2006-11-15	Verfahren zur spracheingabe eines zielortes mit hilfe eines definierten eingabedialogs in ein zielführungssystem
DE60004331D1 (de)	2003-09-11	Sprecher-erkennung
ATE366912T1 (de)	2007-08-15	Verfahren und vorrichtung zur sprachausgabe, datenträger mit sprachdaten
ES2179624T3 (es)	2003-01-16	Procedimiento y dispositivo para aumentar la probabilidad de reconocimiento de los sistemas de reconocimiento de voz.
ATE297047T1 (de)	2005-06-15	Sprachgeführtes gerätesteuerungsverfahren mit einer optimierung für einen benutzer
ATE348384T1 (de)	2007-01-15	Verfahren zur spracherkennung und spracherkennungssystem

Legal Events

Date	Code	Title	Description
2008-01-17	8364	No opposition during term of opposition

DE60125542D1 - System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen - Google Patents

Info

Links

Classifications

Landscapes

Applications Claiming Priority (3)

Publications (2)

Family

ID=24476623

Family Applications (1)

Country Status (10)

Cited By (1)

Families Citing this family (58)

Family Cites Families (15)

Cited By (1)

Also Published As

Similar Documents

Legal Events