DE69827586D1 - Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung - Google Patents
Technik zur Adaptation von Hidden Markov Modellen für die SpracherkennungInfo
- Publication number
- DE69827586D1 DE69827586D1 DE69827586T DE69827586T DE69827586D1 DE 69827586 D1 DE69827586 D1 DE 69827586D1 DE 69827586 T DE69827586 T DE 69827586T DE 69827586 T DE69827586 T DE 69827586T DE 69827586 D1 DE69827586 D1 DE 69827586D1
- Authority
- DE
- Germany
- Prior art keywords
- adaptation
- technology
- speech recognition
- hidden markov
- markov models
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000006978 adaptation Effects 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US6782297P | 1997-12-05 | 1997-12-05 | |
US67822P | 1997-12-05 | ||
US149782 | 1998-09-08 | ||
US09/149,782 US6151574A (en) | 1997-12-05 | 1998-09-08 | Technique for adaptation of hidden markov models for speech recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69827586D1 true DE69827586D1 (de) | 2004-12-23 |
DE69827586T2 DE69827586T2 (de) | 2005-12-01 |
Family
ID=26748302
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69827586T Expired - Fee Related DE69827586T2 (de) | 1997-12-05 | 1998-11-24 | Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung |
Country Status (4)
Country | Link |
---|---|
US (1) | US6151574A (de) |
EP (1) | EP0921519B1 (de) |
JP (1) | JP3742236B2 (de) |
DE (1) | DE69827586T2 (de) |
Families Citing this family (61)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE258332T1 (de) * | 1998-11-25 | 2004-02-15 | Entropic Ltd | Netzwerk- und sprachmodelle zur verwendung in einem spracherkennungssystem |
US6678658B1 (en) * | 1999-07-09 | 2004-01-13 | The Regents Of The University Of California | Speech processing using conditional observable maximum likelihood continuity mapping |
KR100307623B1 (ko) * | 1999-10-21 | 2001-11-02 | 윤종용 | 엠.에이.피 화자 적응 조건에서 파라미터의 분별적 추정 방법 및 장치 및 이를 각각 포함한 음성 인식 방법 및 장치 |
US6539351B1 (en) * | 2000-02-04 | 2003-03-25 | International Business Machines Corporation | High dimensional acoustic modeling via mixtures of compound gaussians with linear transforms |
US6470314B1 (en) * | 2000-04-06 | 2002-10-22 | International Business Machines Corporation | Method and apparatus for rapid adapt via cumulative distribution function matching for continuous speech |
US6751590B1 (en) * | 2000-06-13 | 2004-06-15 | International Business Machines Corporation | Method and apparatus for performing pattern-specific maximum likelihood transformations for speaker recognition |
US7216077B1 (en) * | 2000-09-26 | 2007-05-08 | International Business Machines Corporation | Lattice-based unsupervised maximum likelihood linear regression for speaker adaptation |
US20030182290A1 (en) * | 2000-10-20 | 2003-09-25 | Parker Denise S. | Integrated life planning method and systems and products for implementation |
US6845357B2 (en) * | 2001-07-24 | 2005-01-18 | Honeywell International Inc. | Pattern recognition using an observable operator model |
US6788243B2 (en) | 2001-09-06 | 2004-09-07 | Minister Of National Defence Of Her Majestry's Canadian Government The Secretary Of State For Defence | Hidden Markov modeling for radar electronic warfare |
US7203635B2 (en) * | 2002-06-27 | 2007-04-10 | Microsoft Corporation | Layered models for context awareness |
US20050021337A1 (en) * | 2003-07-23 | 2005-01-27 | Tae-Hee Kwon | HMM modification method |
US7580570B2 (en) * | 2003-12-09 | 2009-08-25 | Microsoft Corporation | Accuracy model for recognition signal processing engines |
US7467086B2 (en) * | 2004-12-16 | 2008-12-16 | Sony Corporation | Methodology for generating enhanced demiphone acoustic models for speech recognition |
US7949533B2 (en) * | 2005-02-04 | 2011-05-24 | Vococollect, Inc. | Methods and systems for assessing and improving the performance of a speech recognition system |
US7827032B2 (en) | 2005-02-04 | 2010-11-02 | Vocollect, Inc. | Methods and systems for adapting a model for a speech recognition system |
US7865362B2 (en) | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
US8200495B2 (en) | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
US7895039B2 (en) * | 2005-02-04 | 2011-02-22 | Vocollect, Inc. | Methods and systems for optimizing model adaptation for a speech recognition system |
US20070088552A1 (en) * | 2005-10-17 | 2007-04-19 | Nokia Corporation | Method and a device for speech recognition |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
CN101390156B (zh) * | 2006-02-27 | 2011-12-07 | 日本电气株式会社 | 标准模式适应装置、标准模式适应方法 |
US20080059190A1 (en) * | 2006-08-22 | 2008-03-06 | Microsoft Corporation | Speech unit selection using HMM acoustic models |
US8234116B2 (en) * | 2006-08-22 | 2012-07-31 | Microsoft Corporation | Calculating cost measures between HMM acoustic models |
JP4427530B2 (ja) * | 2006-09-21 | 2010-03-10 | 株式会社東芝 | 音声認識装置、プログラムおよび音声認識方法 |
US20080243503A1 (en) * | 2007-03-30 | 2008-10-02 | Microsoft Corporation | Minimum divergence based discriminative training for pattern recognition |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US8335381B2 (en) | 2008-09-18 | 2012-12-18 | Xerox Corporation | Handwritten word spotter using synthesized typed queries |
US8442833B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442829B2 (en) | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
TWI396184B (zh) * | 2009-09-17 | 2013-05-11 | Tze Fen Li | 一種語音辨認所有語言及用語音輸入單字的方法 |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
KR20120045582A (ko) * | 2010-10-29 | 2012-05-09 | 한국전자통신연구원 | 음향 모델 생성 장치 및 방법 |
US20120116764A1 (en) * | 2010-11-09 | 2012-05-10 | Tze Fen Li | Speech recognition method on sentences in all languages |
US8478711B2 (en) | 2011-02-18 | 2013-07-02 | Larus Technologies Corporation | System and method for data fusion with adaptive learning |
US8914290B2 (en) | 2011-05-20 | 2014-12-16 | Vocollect, Inc. | Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9978395B2 (en) | 2013-03-15 | 2018-05-22 | Vocollect, Inc. | Method and system for mitigating delay in receiving audio stream during production of sound from audio stream |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US10140981B1 (en) * | 2014-06-10 | 2018-11-27 | Amazon Technologies, Inc. | Dynamic arc weights in speech recognition models |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10255907B2 (en) * | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US10152968B1 (en) * | 2015-06-26 | 2018-12-11 | Iconics, Inc. | Systems and methods for speech-based monitoring and/or control of automation devices |
US10714121B2 (en) | 2016-07-27 | 2020-07-14 | Vocollect, Inc. | Distinguishing user speech from background speech in speech-dense environments |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
CN108647788B (zh) * | 2018-05-14 | 2021-03-19 | 暨南大学 | 一种联想式知识库的自动改进方法 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
JP2522154B2 (ja) * | 1993-06-03 | 1996-08-07 | 日本電気株式会社 | 音声認識システム |
US5794197A (en) * | 1994-01-21 | 1998-08-11 | Micrsoft Corporation | Senone tree representation and evaluation |
EP0788649B1 (de) * | 1995-08-28 | 2001-06-13 | Koninklijke Philips Electronics N.V. | Verfahren und system zur mustererkennung mittels baumstrukturierten wahrscheinlichkeitsdichten |
EP0788648B1 (de) * | 1995-08-28 | 2000-08-16 | Koninklijke Philips Electronics N.V. | Verfahren und system zur mustererkennung mittels dynamischer erzeugung einer untermenge von referenzvektoren |
JP3092491B2 (ja) * | 1995-08-30 | 2000-09-25 | 日本電気株式会社 | 記述長最小基準を用いたパターン適応化方式 |
US5657424A (en) * | 1995-10-31 | 1997-08-12 | Dictaphone Corporation | Isolated word recognition using decision tree classifiers and time-indexed feature vectors |
US5787394A (en) * | 1995-12-13 | 1998-07-28 | International Business Machines Corporation | State-dependent speaker clustering for speaker adaptation |
GB9602691D0 (en) * | 1996-02-09 | 1996-04-10 | Canon Kk | Word model generation |
US5960395A (en) * | 1996-02-09 | 1999-09-28 | Canon Kabushiki Kaisha | Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming |
US5737487A (en) * | 1996-02-13 | 1998-04-07 | Apple Computer, Inc. | Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition |
US5797123A (en) * | 1996-10-01 | 1998-08-18 | Lucent Technologies Inc. | Method of key-phase detection and verification for flexible speech understanding |
US5983180A (en) * | 1997-10-23 | 1999-11-09 | Softsound Limited | Recognition of sequential data using finite state sequence models organized in a tree structure |
-
1998
- 1998-09-08 US US09/149,782 patent/US6151574A/en not_active Expired - Fee Related
- 1998-11-24 DE DE69827586T patent/DE69827586T2/de not_active Expired - Fee Related
- 1998-11-24 EP EP98309595A patent/EP0921519B1/de not_active Expired - Lifetime
- 1998-12-04 JP JP34499898A patent/JP3742236B2/ja not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP0921519A3 (de) | 2000-04-12 |
EP0921519A2 (de) | 1999-06-09 |
EP0921519B1 (de) | 2004-11-17 |
DE69827586T2 (de) | 2005-12-01 |
JP3742236B2 (ja) | 2006-02-01 |
JPH11242495A (ja) | 1999-09-07 |
US6151574A (en) | 2000-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69827586D1 (de) | Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung | |
DE69827988D1 (de) | Sprachmodelle für die Spracherkennung | |
DE69917112D1 (de) | Erweiterung des Wortschatzes eines Client-Server-Spracherkennungssystems | |
DE69831114D1 (de) | Integration mehrfacher Modelle für die Spracherkennung in verschiedenen Umgebungen | |
DE69829235D1 (de) | Registrierung für die Spracherkennung | |
NO974097D0 (no) | Talegjenkjenning | |
DE69635325D1 (de) | Verbesserungen zur Spracherkennung | |
DE60229095D1 (de) | Ausprachen in mehreren Sprachen zur Spracherkennung | |
DK0789901T3 (da) | Talegenkendelse | |
DE69632517D1 (de) | Erkennung kontinuierlicher Sprache | |
DE69618503D1 (de) | Spracherkennung für Tonsprachen | |
IL146985A0 (en) | Automatic dynamic speech recognition vocabulary based on external sources of information | |
DE60115738D1 (de) | Sprachmodelle für die Spracherkennung | |
GB2333877B (en) | Method of evaluating an utterance in a speech recognition system | |
DE69421354D1 (de) | Datenkompression für die Spracherkennung | |
FI954573A0 (fi) | Asiayhteydessä olevan puheen tunnistus | |
DE69634784D1 (de) | Unterscheidende Verifizierung von Äusserungen für die Erkennung zusammenhängender Ziffern | |
DE60020773D1 (de) | Graphische Benutzeroberfläche und Verfahren zur Änderung von Aussprachen in Sprachsynthese und -Erkennungssystemen | |
DE59809609D1 (de) | Verfahren zur Spracherkennung mit Sprachmodellanpassung | |
DE59801560D1 (de) | Verfahren zur Spracherkennung mit Sprachmodellanpassung | |
DE69819951D1 (de) | Spracherkenner mit Rauschadaptierung | |
AU2001279172A1 (en) | Computer-implemented speech recognition system training | |
AU4845800A (en) | Automated language assessment using speech recognition modeling | |
DE59607861D1 (de) | Spracherkennungssystem | |
DE59912819D1 (de) | Spracherkennungsverfahren mit Konfidenzmassbewertung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8339 | Ceased/non-payment of the annual fee |