DE69827586D1 - Technique for adaptation of hidden Markov models for speech recognition (Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung)

Info

Publication number: DE69827586D1
Authority: DE; Germany
Prior art keywords: adaptation; technology; speech recognition; hidden markov; markov models
Prior art date: 1997-12-05
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Fee Related

Application number

DE69827586T

Other languages

English (en)

Other versions

DE69827586T2 (de)

Inventor

Lee Chin-Hui

Shinoda Koichi

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Nokia of America Corp

Original Assignee

Lucent Technologies Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1997-12-05

Filing date

1998-11-24

Publication date

2004-12-23

1998-11-24 Application filed by Lucent Technologies Inc

2004-12-23 Publication of DE69827586D1

2005-12-01 Application granted

2005-12-01 Publication of DE69827586T2

2018-11-25 Anticipated expiration

Status: Expired - Fee Related

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Artificial Intelligence (AREA)
Probability & Statistics with Applications (AREA)
Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Image Analysis (AREA)

DE69827586T (priority date 1997-12-05, filing date 1998-11-24): Technique for adaptation of hidden Markov models for speech recognition. Status: Expired - Fee Related. Published as DE69827586T2 (de).

Applications Claiming Priority (4)

Application Number	Priority Date	Filing Date	Title
US6782297P	1997-12-05	1997-12-05
US67822P		1997-12-05
US149782		1998-09-08
US09/149,782 US6151574A (en)	1997-12-05	1998-09-08	Technique for adaptation of hidden markov models for speech recognition

Publications (2)

Publication Number	Publication Date
DE69827586D1 (de)	2004-12-23
DE69827586T2 (de)	2005-12-01

Family

ID=26748302

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
DE69827586T, Expired - Fee Related, DE69827586T2 (de)	1997-12-05	1998-11-24	Technique for adaptation of hidden Markov models for speech recognition

Country Status (4)

Country	Link
US (1)	US6151574A (de)
EP (1)	EP0921519B1 (de)
JP (1)	JP3742236B2 (de)
DE (1)	DE69827586T2 (de)

Families Citing this family (61)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
ATE258332T1 (de) *	1998-11-25	2004-02-15	Entropic Ltd	Network and language models for use in a speech recognition system
US6678658B1 (en) *	1999-07-09	2004-01-13	The Regents Of The University Of California	Speech processing using conditional observable maximum likelihood continuity mapping
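KR100307623B1 (ko) *	1999-10-21	2001-11-02	윤종용	Method and apparatus for discriminative estimation of parameters under MAP speaker adaptation conditions, and speech recognition method and apparatus each including the same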
US6539351B1 (en) *	2000-02-04	2003-03-25	International Business Machines Corporation	High dimensional acoustic modeling via mixtures of compound gaussians with linear transforms
US6470314B1 (en) *	2000-04-06	2002-10-22	International Business Machines Corporation	Method and apparatus for rapid adapt via cumulative distribution function matching for continuous speech
US6751590B1 (en) *	2000-06-13	2004-06-15	International Business Machines Corporation	Method and apparatus for performing pattern-specific maximum likelihood transformations for speaker recognition
US7216077B1 (en) *	2000-09-26	2007-05-08	International Business Machines Corporation	Lattice-based unsupervised maximum likelihood linear regression for speaker adaptation
US20030182290A1 (en) *	2000-10-20	2003-09-25	Parker Denise S.	Integrated life planning method and systems and products for implementation
US6845357B2 (en) *	2001-07-24	2005-01-18	Honeywell International Inc.	Pattern recognition using an observable operator model
US6788243B2 (en)	2001-09-06	2004-09-07	Minister Of National Defence Of Her Majesty's Canadian Government The Secretary Of State For Defence	Hidden Markov modeling for radar electronic warfare
US7203635B2 (en) *	2002-06-27	2007-04-10	Microsoft Corporation	Layered models for context awareness
US20050021337A1 (en) *	2003-07-23	2005-01-27	Tae-Hee Kwon	HMM modification method
US7580570B2 (en) *	2003-12-09	2009-08-25	Microsoft Corporation	Accuracy model for recognition signal processing engines
US7467086B2 (en) *	2004-12-16	2008-12-16	Sony Corporation	Methodology for generating enhanced demiphone acoustic models for speech recognition
US7949533B2 (en) *	2005-02-04	2011-05-24	Vocollect, Inc.	Methods and systems for assessing and improving the performance of a speech recognition system
US7827032B2 (en)	2005-02-04	2010-11-02	Vocollect, Inc.	Methods and systems for adapting a model for a speech recognition system
US7865362B2 (en)	2005-02-04	2011-01-04	Vocollect, Inc.	Method and system for considering information about an expected response when performing speech recognition
US8200495B2 (en)	2005-02-04	2012-06-12	Vocollect, Inc.	Methods and systems for considering information about an expected response when performing speech recognition
US7895039B2 (en) *	2005-02-04	2011-02-22	Vocollect, Inc.	Methods and systems for optimizing model adaptation for a speech recognition system
US20070088552A1 (en) *	2005-10-17	2007-04-19	Nokia Corporation	Method and a device for speech recognition
US7970613B2 (en)	2005-11-12	2011-06-28	Sony Computer Entertainment Inc.	Method and system for Gaussian probability data bit reduction and computation
US8010358B2 (en) *	2006-02-21	2011-08-30	Sony Computer Entertainment Inc.	Voice recognition with parallel gender and age normalization
US7778831B2 (en) *	2006-02-21	2010-08-17	Sony Computer Entertainment Inc.	Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
CN101390156B (zh) *	2006-02-27	2011-12-07	日本电气株式会社	Standard pattern adaptation device and standard pattern adaptation method
US20080059190A1 (en) *	2006-08-22	2008-03-06	Microsoft Corporation	Speech unit selection using HMM acoustic models
US8234116B2 (en) *	2006-08-22	2012-07-31	Microsoft Corporation	Calculating cost measures between HMM acoustic models
JP4427530B2 (ja) *	2006-09-21	2010-03-10	株式会社東芝	Speech recognition apparatus, program, and speech recognition method
US20080243503A1 (en) *	2007-03-30	2008-10-02	Microsoft Corporation	Minimum divergence based discriminative training for pattern recognition
US8996376B2 (en)	2008-04-05	2015-03-31	Apple Inc.	Intelligent text-to-speech conversion
US8335381B2 (en)	2008-09-18	2012-12-18	Xerox Corporation	Handwritten word spotter using synthesized typed queries
US8442833B2 (en)	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en)	2009-02-17	2014-07-22	Sony Computer Entertainment Inc.	Multiple language voice recognition
US8442829B2 (en)	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US10241752B2 (en)	2011-09-30	2019-03-26	Apple Inc.	Interface for a virtual digital assistant
TWI396184B (zh) *	2009-09-17	2013-05-11	Tze Fen Li	A method for speech recognition of all languages and for inputting words by voice
US8682667B2 (en)	2010-02-25	2014-03-25	Apple Inc.	User profiling for selecting user specific voice input processing information
KR20120045582A (ko) *	2010-10-29	2012-05-09	한국전자통신연구원	Apparatus and method for generating an acoustic model
US20120116764A1 (en) *	2010-11-09	2012-05-10	Tze Fen Li	Speech recognition method on sentences in all languages
US8478711B2 (en)	2011-02-18	2013-07-02	Larus Technologies Corporation	System and method for data fusion with adaptive learning
US8914290B2 (en)	2011-05-20	2014-12-16	Vocollect, Inc.	Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9153235B2 (en)	2012-04-09	2015-10-06	Sony Computer Entertainment Inc.	Text dependent speaker recognition with long-term feature based on functional data analysis
US9721563B2 (en)	2012-06-08	2017-08-01	Apple Inc.	Name recognition system
US9978395B2 (en)	2013-03-15	2018-05-22	Vocollect, Inc.	Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
WO2014197334A2 (en)	2013-06-07	2014-12-11	Apple Inc.	System and method for user-specified pronunciation of words for speech synthesis and recognition
US10140981B1 (en) *	2014-06-10	2018-11-27	Amazon Technologies, Inc.	Dynamic arc weights in speech recognition models
US9338493B2 (en)	2014-06-30	2016-05-10	Apple Inc.	Intelligent automated assistant for TV user interactions
US9668121B2 (en)	2014-09-30	2017-05-30	Apple Inc.	Social reminders
US10567477B2 (en)	2015-03-08	2020-02-18	Apple Inc.	Virtual assistant continuity
US9578173B2 (en)	2015-06-05	2017-02-21	Apple Inc.	Virtual assistant aided communication with 3rd party service in a communication session
US10255907B2 (en) *	2015-06-07	2019-04-09	Apple Inc.	Automatic accent detection using acoustic models
US10152968B1 (en) *	2015-06-26	2018-12-11	Iconics, Inc.	Systems and methods for speech-based monitoring and/or control of automation devices
US10714121B2 (en)	2016-07-27	2020-07-14	Vocollect, Inc.	Distinguishing user speech from background speech in speech-dense environments
US10043516B2 (en)	2016-09-23	2018-08-07	Apple Inc.	Intelligent automated assistant
US10593346B2 (en)	2016-12-22	2020-03-17	Apple Inc.	Rank-reduced token representation for automatic speech recognition
DK201770439A1 (en)	2017-05-11	2018-12-13	Apple Inc.	Offline personal assistant
DK179496B1 (en)	2017-05-12	2019-01-15	Apple Inc.	USER-SPECIFIC Acoustic Models
DK179745B1 (en)	2017-05-12	2019-05-01	Apple Inc.	SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK201770431A1 (en)	2017-05-15	2018-12-20	Apple Inc.	Optimizing dialogue policy decisions for digital assistants using implicit feedback
DK201770432A1 (en)	2017-05-15	2018-12-21	Apple Inc.	Hierarchical belief states for digital assistants
DK179560B1 (en)	2017-05-16	2019-02-18	Apple Inc.	FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
CN108647788B (zh) *	2018-05-14	2021-03-19	暨南大学	A method for automatically improving an associative knowledge base

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5027406A (en) *	1988-12-06	1991-06-25	Dragon Systems, Inc.	Method for interactive speech recognition and training
JP2522154B2 (ja) *	1993-06-03	1996-08-07	日本電気株式会社	Speech recognition system
US5794197A (en) *	1994-01-21	1998-08-11	Microsoft Corporation	Senone tree representation and evaluation
EP0788649B1 (de) *	1995-08-28	2001-06-13	Koninklijke Philips Electronics N.V.	Method and system for pattern recognition using tree-structured probability densities
EP0788648B1 (de) *	1995-08-28	2000-08-16	Koninklijke Philips Electronics N.V.	Method and system for pattern recognition using dynamic generation of a subset of reference vectors
JP3092491B2 (ja) *	1995-08-30	2000-09-25	日本電気株式会社	Pattern adaptation method using the minimum description length criterion
US5657424A (en) *	1995-10-31	1997-08-12	Dictaphone Corporation	Isolated word recognition using decision tree classifiers and time-indexed feature vectors
US5787394A (en) *	1995-12-13	1998-07-28	International Business Machines Corporation	State-dependent speaker clustering for speaker adaptation
GB9602691D0 (en) *	1996-02-09	1996-04-10	Canon Kk	Word model generation
US5960395A (en) *	1996-02-09	1999-09-28	Canon Kabushiki Kaisha	Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming
US5737487A (en) *	1996-02-13	1998-04-07	Apple Computer, Inc.	Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition
US5797123A (en) *	1996-10-01	1998-08-18	Lucent Technologies Inc.	Method of key-phrase detection and verification for flexible speech understanding
US5983180A (en) *	1997-10-23	1999-11-09	Softsound Limited	Recognition of sequential data using finite state sequence models organized in a tree structure

1998
- 1998-09-08 US: application US09/149,782, patent US6151574A (en), status: Expired - Fee Related
- 1998-11-24 DE: application DE69827586T, patent DE69827586T2 (de), status: Expired - Fee Related
- 1998-11-24 EP: application EP98309595A, patent EP0921519B1 (de), status: Expired - Lifetime
- 1998-12-04 JP: application JP34499898A, patent JP3742236B2 (ja), status: Expired - Fee Related

Also Published As

Publication number	Publication date
EP0921519A3 (de)	2000-04-12
EP0921519A2 (de)	1999-06-09
EP0921519B1 (de)	2004-11-17
DE69827586T2 (de)	2005-12-01
JP3742236B2 (ja)	2006-02-01
JPH11242495A (ja)	1999-09-07
US6151574A (en)	2000-11-21

Similar Documents

Publication	Publication Date	Title
DE69827586D1 (de)	2004-12-23	Technique for adaptation of hidden Markov models for speech recognition
DE69827988D1 (de)	2005-01-13	Language models for speech recognition
DE69917112D1 (de)	2004-06-17	Extending the vocabulary of a client-server speech recognition system
DE69831114D1 (de)	2005-09-15	Integration of multiple models for speech recognition in various environments
DE69829235D1 (de)	2005-04-14	Registration for speech recognition
NO974097D0 (no)	1997-09-05	Speech recognition
DE69635325D1 (de)	2005-12-01	Improvements in speech recognition
DE60229095D1 (de)	2008-11-13	Pronunciations in multiple languages for speech recognition
DK0789901T3 (da)	2000-06-19	Speech recognition
DE69632517D1 (de)	2004-06-24	Continuous speech recognition
DE69618503D1 (de)	2002-02-21	Speech recognition for tonal languages
IL146985A0 (en)	2002-08-14	Automatic dynamic speech recognition vocabulary based on external sources of information
DE60115738D1 (de)	2006-01-19	Language models for speech recognition
GB2333877B (en)	2001-08-08	Method of evaluating an utterance in a speech recognition system
DE69421354D1 (de)	1999-12-02	Data compression for speech recognition
FI954573A0 (fi)	1995-09-27	Recognition of speech in context
DE69634784D1 (de)	2005-06-30	Discriminative utterance verification for connected digit recognition
DE60020773D1 (de)	2005-07-21	Graphical user interface and method for modifying pronunciations in speech synthesis and recognition systems
DE59809609D1 (de)	2003-10-23	Method for speech recognition with language model adaptation
DE59801560D1 (de)	2001-10-31	Method for speech recognition with language model adaptation
DE69819951D1 (de)	2004-01-08	Speech recognizer with noise adaptation
AU2001279172A1 (en)	2002-03-22	Computer-implemented speech recognition system training
AU4845800A (en)	2000-12-05	Automated language assessment using speech recognition modeling
DE59607861D1 (de)	2001-11-15	Speech recognition system
DE59912819D1 (de)	2005-12-29	Speech recognition method with confidence measure evaluation

Legal Events

Date	Code	Title	Description
2006-01-05	8364	No opposition during term of opposition
2008-09-11	8339	Ceased/non-payment of the annual fee