DE60125542D1 - System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen - Google Patents
System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungenInfo
- Publication number
- DE60125542D1 DE60125542D1 DE60125542T DE60125542T DE60125542D1 DE 60125542 D1 DE60125542 D1 DE 60125542D1 DE 60125542 T DE60125542 T DE 60125542T DE 60125542 T DE60125542 T DE 60125542T DE 60125542 D1 DE60125542 D1 DE 60125542D1
- Authority
- DE
- Germany
- Prior art keywords
- engine
- results
- voice recognition
- variety
- engines
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000001419 dependent effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Selective Calling Equipment (AREA)
- Circuit For Audible Band Transducer (AREA)
- Machine Translation (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US618177 | 2000-07-18 | ||
US09/618,177 US6671669B1 (en) | 2000-07-18 | 2000-07-18 | combined engine system and method for voice recognition |
PCT/US2001/022761 WO2002007148A1 (en) | 2000-07-18 | 2001-07-17 | System and method for voice recognition with a plurality of voice recognition engines |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60125542D1 true DE60125542D1 (de) | 2007-02-08 |
DE60125542T2 DE60125542T2 (de) | 2007-10-11 |
Family
ID=24476623
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60125542T Expired - Lifetime DE60125542T2 (de) | 2000-07-18 | 2001-07-17 | System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen |
Country Status (10)
Country | Link |
---|---|
US (1) | US6671669B1 (de) |
EP (1) | EP1301922B1 (de) |
CN (1) | CN1188831C (de) |
AT (1) | ATE349751T1 (de) |
AU (1) | AU2001275991A1 (de) |
DE (1) | DE60125542T2 (de) |
ES (1) | ES2278763T3 (de) |
HK (1) | HK1057816A1 (de) |
TW (1) | TWI253056B (de) |
WO (1) | WO2002007148A1 (de) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2018351031B2 (en) * | 2017-10-20 | 2021-03-11 | Please Hold (Uk) Limited | Audio signal |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7003463B1 (en) | 1998-10-02 | 2006-02-21 | International Business Machines Corporation | System and method for providing network coordinated conversational services |
US6754629B1 (en) * | 2000-09-08 | 2004-06-22 | Qualcomm Incorporated | System and method for automatic voice recognition using mapping |
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system |
US20020143540A1 (en) * | 2001-03-28 | 2002-10-03 | Narendranath Malayath | Voice recognition system using implicit speaker adaptation |
US7941313B2 (en) | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
US7203643B2 (en) * | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems |
US7366673B2 (en) * | 2001-06-15 | 2008-04-29 | International Business Machines Corporation | Selective enablement of speech recognition grammars |
TW541517B (en) * | 2001-12-25 | 2003-07-11 | Univ Nat Cheng Kung | Speech recognition system |
US6996526B2 (en) * | 2002-01-02 | 2006-02-07 | International Business Machines Corporation | Method and apparatus for transcribing speech when a plurality of speakers are participating |
US7203652B1 (en) * | 2002-02-21 | 2007-04-10 | Nuance Communications | Method and system for improving robustness in a speech system |
JP4304952B2 (ja) * | 2002-10-07 | 2009-07-29 | 三菱電機株式会社 | 車載制御装置、並びにその操作説明方法をコンピュータに実行させるプログラム |
US20040138885A1 (en) * | 2003-01-09 | 2004-07-15 | Xiaofan Lin | Commercial automatic speech recognition engine combinations |
EP1603116A1 (de) * | 2003-02-19 | 2005-12-07 | Matsushita Electric Industrial Co., Ltd. | Spracherkennungsanordnung und -verfahren |
US7523097B1 (en) * | 2004-01-13 | 2009-04-21 | Juniper Networks, Inc. | Restoration of archived configurations for a network device |
KR100693284B1 (ko) * | 2005-04-14 | 2007-03-13 | 학교법인 포항공과대학교 | 음성 인식 장치 |
CN1963918A (zh) * | 2005-11-11 | 2007-05-16 | 株式会社东芝 | 说话人模板的压缩、合并装置和方法,以及说话人认证 |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US7778831B2 (en) | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US8532984B2 (en) | 2006-07-31 | 2013-09-10 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
GB0616070D0 (en) * | 2006-08-12 | 2006-09-20 | Ibm | Speech Recognition Feedback |
US8239190B2 (en) | 2006-08-22 | 2012-08-07 | Qualcomm Incorporated | Time-warping frames of wideband vocoder |
US7813922B2 (en) * | 2007-01-30 | 2010-10-12 | Nokia Corporation | Audio quantization |
US8494847B2 (en) * | 2007-02-28 | 2013-07-23 | Nec Corporation | Weighting factor learning system and audio recognition system |
US7904410B1 (en) * | 2007-04-18 | 2011-03-08 | The Mathworks, Inc. | Constrained dynamic time warping |
US8352265B1 (en) | 2007-12-24 | 2013-01-08 | Edward Lin | Hardware implemented backend search engine for a high-rate speech recognition system |
US8639510B1 (en) | 2007-12-24 | 2014-01-28 | Kai Yu | Acoustic scoring unit implemented on a single FPGA or ASIC |
US8463610B1 (en) | 2008-01-18 | 2013-06-11 | Patrick J. Bourke | Hardware-implemented scalable modular engine for low-power speech recognition |
WO2010019831A1 (en) * | 2008-08-14 | 2010-02-18 | 21Ct, Inc. | Hidden markov model for speech processing with training method |
US8442833B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8417526B2 (en) * | 2009-03-13 | 2013-04-09 | Adacel, Inc. | Speech recognition learning system and method |
US9026444B2 (en) | 2009-09-16 | 2015-05-05 | At&T Intellectual Property I, L.P. | System and method for personalization of acoustic models for automatic speech recognition |
US8812321B2 (en) * | 2010-09-30 | 2014-08-19 | At&T Intellectual Property I, L.P. | System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning |
US20120168331A1 (en) * | 2010-12-30 | 2012-07-05 | Safecode Drug Technologies Corp. | Voice template protector for administering medicine |
US10032455B2 (en) | 2011-01-07 | 2018-07-24 | Nuance Communications, Inc. | Configurable speech recognition system using a pronunciation alignment between multiple recognizers |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
CN104769668B (zh) | 2012-10-04 | 2018-10-30 | 纽昂斯通讯公司 | 改进的用于asr的混合控制器 |
US9240184B1 (en) * | 2012-11-15 | 2016-01-19 | Google Inc. | Frame-level combination of deep neural network and gaussian mixture models |
CN105027198B (zh) * | 2013-02-25 | 2018-11-20 | 三菱电机株式会社 | 语音识别系统以及语音识别装置 |
CN104143330A (zh) * | 2013-05-07 | 2014-11-12 | 佳能株式会社 | 语音识别方法和语音识别系统 |
US20140337030A1 (en) * | 2013-05-07 | 2014-11-13 | Qualcomm Incorporated | Adaptive audio frame processing for keyword detection |
US9225879B2 (en) * | 2013-12-27 | 2015-12-29 | TCL Research America Inc. | Method and apparatus for video sequential alignment |
PH12017500352B1 (en) * | 2014-08-28 | 2022-07-06 | Nokia Technologies Oy | Audio parameter quantization |
CN104616653B (zh) * | 2015-01-23 | 2018-02-23 | 北京云知声信息技术有限公司 | 唤醒词匹配方法、装置以及语音唤醒方法、装置 |
US10134425B1 (en) * | 2015-06-29 | 2018-11-20 | Amazon Technologies, Inc. | Direction-based speech endpointing |
US10536464B2 (en) * | 2016-06-22 | 2020-01-14 | Intel Corporation | Secure and smart login engine |
US10971157B2 (en) | 2017-01-11 | 2021-04-06 | Nuance Communications, Inc. | Methods and apparatus for hybrid speech recognition processing |
US10607601B2 (en) * | 2017-05-11 | 2020-03-31 | International Business Machines Corporation | Speech recognition by selecting and refining hot words |
CN109285548A (zh) * | 2017-07-19 | 2019-01-29 | 阿里巴巴集团控股有限公司 | 信息处理方法、系统、电子设备、和计算机存储介质 |
GB2566759B8 (en) | 2017-10-20 | 2021-12-08 | Please Hold Uk Ltd | Encoding identifiers to produce audio identifiers from a plurality of audio bitstreams |
US12131228B2 (en) | 2019-04-02 | 2024-10-29 | International Business Machines Corporation | Method for accessing data records of a master data management system |
CN111128154B (zh) * | 2019-12-03 | 2022-06-03 | 杭州蓦然认知科技有限公司 | 一种聚合形成交互引擎簇的方法及装置 |
CN111694331B (zh) * | 2020-05-11 | 2021-11-02 | 杭州睿疆科技有限公司 | 生产工艺参数调整的系统、方法和计算机设备 |
US11664033B2 (en) * | 2020-06-15 | 2023-05-30 | Samsung Electronics Co., Ltd. | Electronic apparatus and controlling method thereof |
US11996087B2 (en) | 2021-04-30 | 2024-05-28 | Comcast Cable Communications, Llc | Method and apparatus for intelligent voice recognition |
CN115376513B (zh) * | 2022-10-19 | 2023-05-12 | 广州小鹏汽车科技有限公司 | 语音交互方法、服务器及计算机可读存储介质 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4587670A (en) * | 1982-10-15 | 1986-05-06 | At&T Bell Laboratories | Hidden Markov model speech recognition arrangement |
US4783804A (en) * | 1985-03-21 | 1988-11-08 | American Telephone And Telegraph Company, At&T Bell Laboratories | Hidden Markov model speech recognition arrangement |
US4852180A (en) * | 1987-04-03 | 1989-07-25 | American Telephone And Telegraph Company, At&T Bell Laboratories | Speech recognition by acoustic/phonetic system and technique |
US5167004A (en) * | 1991-02-28 | 1992-11-24 | Texas Instruments Incorporated | Temporal decorrelation method for robust speaker verification |
AU671952B2 (en) * | 1991-06-11 | 1996-09-19 | Qualcomm Incorporated | Variable rate vocoder |
US5222190A (en) * | 1991-06-11 | 1993-06-22 | Texas Instruments Incorporated | Apparatus and method for identifying a speech pattern |
US5450522A (en) * | 1991-08-19 | 1995-09-12 | U S West Advanced Technologies, Inc. | Auditory model for parametrization of speech |
CA2126380C (en) * | 1993-07-22 | 1998-07-07 | Wu Chou | Minimum error rate training of combined string models |
US5839103A (en) * | 1995-06-07 | 1998-11-17 | Rutgers, The State University Of New Jersey | Speaker verification system using decision fusion logic |
US5754978A (en) * | 1995-10-27 | 1998-05-19 | Speech Systems Of Colorado, Inc. | Speech recognition system |
US5819220A (en) * | 1996-09-30 | 1998-10-06 | Hewlett-Packard Company | Web triggered word set boosting for speech interfaces to the world wide web |
US6003002A (en) * | 1997-01-02 | 1999-12-14 | Texas Instruments Incorporated | Method and system of adapting speech recognition models to speaker environment |
US5893059A (en) * | 1997-04-17 | 1999-04-06 | Nynex Science And Technology, Inc. | Speech recoginition methods and apparatus |
US6014624A (en) * | 1997-04-18 | 2000-01-11 | Nynex Science And Technology, Inc. | Method and apparatus for transitioning from one voice recognition system to another |
US6526380B1 (en) | 1999-03-26 | 2003-02-25 | Koninklijke Philips Electronics N.V. | Speech recognition system having parallel large vocabulary recognition engines |
-
2000
- 2000-07-18 US US09/618,177 patent/US6671669B1/en not_active Expired - Lifetime
-
2001
- 2001-07-17 AU AU2001275991A patent/AU2001275991A1/en not_active Abandoned
- 2001-07-17 EP EP01953554A patent/EP1301922B1/de not_active Expired - Lifetime
- 2001-07-17 ES ES01953554T patent/ES2278763T3/es not_active Expired - Lifetime
- 2001-07-17 DE DE60125542T patent/DE60125542T2/de not_active Expired - Lifetime
- 2001-07-17 WO PCT/US2001/022761 patent/WO2002007148A1/en active IP Right Grant
- 2001-07-17 AT AT01953554T patent/ATE349751T1/de not_active IP Right Cessation
- 2001-07-17 CN CNB018145922A patent/CN1188831C/zh not_active Expired - Fee Related
- 2001-07-18 TW TW090117578A patent/TWI253056B/zh not_active IP Right Cessation
-
2004
- 2004-01-30 HK HK04100626A patent/HK1057816A1/xx not_active IP Right Cessation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2018351031B2 (en) * | 2017-10-20 | 2021-03-11 | Please Hold (Uk) Limited | Audio signal |
Also Published As
Publication number | Publication date |
---|---|
DE60125542T2 (de) | 2007-10-11 |
TWI253056B (en) | 2006-04-11 |
ES2278763T3 (es) | 2007-08-16 |
US6671669B1 (en) | 2003-12-30 |
CN1188831C (zh) | 2005-02-09 |
AU2001275991A1 (en) | 2002-01-30 |
CN1454380A (zh) | 2003-11-05 |
HK1057816A1 (en) | 2004-04-16 |
ATE349751T1 (de) | 2007-01-15 |
EP1301922A1 (de) | 2003-04-16 |
EP1301922B1 (de) | 2006-12-27 |
WO2002007148A1 (en) | 2002-01-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60125542D1 (de) | System und verfahren zur spracherkennung mit einer vielzahl von spracherkennungsvorrichtungen | |
ATE344959T1 (de) | Kombination von digitaler zeitverschiebung und hmm in sprecherabhängiger- und sprecherunabhängiger weise für die spracherkennung | |
DE69811921D1 (de) | Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung | |
HK1062738A1 (en) | Apparation and method for performing voice recognition using acoustic feature vector modification | |
WO2004100638A3 (en) | Source-dependent text-to-speech system | |
DE602004015973D1 (de) | Spracherkennungssystem und verfahren auf phonetischer basis | |
DE69822179D1 (de) | Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung | |
ATE297588T1 (de) | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung | |
DE60033132D1 (de) | Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern | |
ATE335195T1 (de) | Hintergrundlernen von sprecherstimmen | |
ATE482447T1 (de) | Registrierung für spracherkennungssystem | |
EP1022722A3 (de) | Sprecheradaptation auf der Basis von Stimm-Eigenvektoren | |
ATE312398T1 (de) | Sprecheranpassung für die spracherkennung | |
ATE531033T1 (de) | System und verfahren zur verteilung einer spracherkennungsgrammatik | |
ATE336059T1 (de) | Verfahren und system zur erzeugung von behaglichkeitsrauschen bei der sprachkommunikation | |
ATE253763T1 (de) | Verfahren zur spracherkennung | |
MX9505299A (es) | Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion. | |
DE69630999T2 (de) | Verfahren zur verringerung von datenbankanforderungen für ein spracherkennungssystem | |
DE60228716D1 (de) | Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text | |
ATE342566T1 (de) | Verfahren zur spracheingabe eines zielortes mit hilfe eines definierten eingabedialogs in ein zielführungssystem | |
DE60004331D1 (de) | Sprecher-erkennung | |
ATE366912T1 (de) | Verfahren und vorrichtung zur sprachausgabe, datenträger mit sprachdaten | |
ES2179624T3 (es) | Procedimiento y dispositivo para aumentar la probabilidad de reconocimiento de los sistemas de reconocimiento de voz. | |
ATE297047T1 (de) | Sprachgeführtes gerätesteuerungsverfahren mit einer optimierung für einen benutzer | |
ATE348384T1 (de) | Verfahren zur spracherkennung und spracherkennungssystem |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |