DE60033106D1 - Korrektur der Betriebsartfehler, Steuerung oder Diktieren, in die Spracherkennung - Google Patents

Korrektur der Betriebsartfehler, Steuerung oder Diktieren, in die Spracherkennung

Info

Publication number: DE60033106D1
Authority: DE; Germany
Prior art keywords: dictation; correction; control; operating mode; speech recognition
Prior art date: 1999-10-19
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE60033106T

Other languages

English (en)

Other versions

DE60033106T2 (de

Inventor

Jeffrey C Reynar

Erik Rucker

Paul Kyong Hwan Kim

David Allen Caulton

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Microsoft Corp

Original Assignee

Microsoft Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1999-10-19

Filing date

2000-10-13

Publication date

2007-03-15

2000-10-13 Application filed by Microsoft Corp filed Critical Microsoft Corp

2007-03-15 Application granted granted Critical

2007-03-15 Publication of DE60033106D1 publication Critical patent/DE60033106D1/de

2007-06-14 Publication of DE60033106T2 publication Critical patent/DE60033106T2/de

2020-10-14 Anticipated expiration legal-status Critical

Status Expired - Lifetime legal-status Critical Current

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Document Processing Apparatus (AREA)
Machine Translation (AREA)

DE60033106T 1999-10-19 2000-10-13 Korrektur der Betriebsartfehler, Steuerung oder Diktieren, in die Spracherkennung Expired - Lifetime DE60033106T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US420863		1999-10-19
US09/420,863 US6581033B1 (en)	1999-10-19	1999-10-19	System and method for correction of speech recognition mode errors

Publications (2)

Publication Number	Publication Date
DE60033106D1 true DE60033106D1 (de)	2007-03-15
DE60033106T2 DE60033106T2 (de)	2007-06-14

Family

ID=23668144

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE60033106T Expired - Lifetime DE60033106T2 (de)	1999-10-19	2000-10-13	Korrektur der Betriebsartfehler, Steuerung oder Diktieren, in die Spracherkennung

Country Status (5)

Country	Link
US (1)	US6581033B1 (de)
EP (1)	EP1094445B1 (de)
JP (1)	JP2001184086A (de)
CN (1)	CN1229772C (de)
DE (1)	DE60033106T2 (de)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US6514201B1 (en) *	1999-01-29	2003-02-04	Acuson Corporation	Voice-enhanced diagnostic medical ultrasound system and review station
JP3476007B2 (ja) *	1999-09-10	2003-12-10	インターナショナル・ビジネス・マシーンズ・コーポレーション	認識単語登録方法、音声認識方法、音声認識装置、認識単語登録のためのソフトウエア・プロダクトを格納した記憶媒体、音声認識のためのソフトウエア・プロダクトを格納した記憶媒体
US7109970B1 (en)	2000-07-01	2006-09-19	Miller Stephen S	Apparatus for remotely controlling computers and other electronic appliances/devices using a combination of voice commands and finger movements
US7035805B1 (en) *	2000-07-14	2006-04-25	Miller Stephen S	Switching the modes of operation for voice-recognition applications
US7451085B2 (en)	2000-10-13	2008-11-11	At&T Intellectual Property Ii, L.P.	System and method for providing a compensated speech recognition model for speech recognition
DE10120513C1 (de)	2001-04-26	2003-01-09	Siemens Ag	Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache
US7809574B2 (en) *	2001-09-05	2010-10-05	Voice Signal Technologies Inc.	Word recognition using choice lists
US7313526B2 (en)	2001-09-05	2007-12-25	Voice Signal Technologies, Inc.	Speech recognition using selectable recognition modes
US7444286B2 (en)	2001-09-05	2008-10-28	Roth Daniel L	Speech recognition using re-utterance recognition
US7526431B2 (en) *	2001-09-05	2009-04-28	Voice Signal Technologies, Inc.	Speech recognition using ambiguous or phone key spelling and/or filtering
US7467089B2 (en) *	2001-09-05	2008-12-16	Roth Daniel L	Combined speech and handwriting recognition
US7505911B2 (en) *	2001-09-05	2009-03-17	Roth Daniel L	Combined speech recognition and sound recording
WO2004023455A2 (en) *	2002-09-06	2004-03-18	Voice Signal Technologies, Inc.	Methods, systems, and programming for performing speech recognition
US7386454B2 (en) *	2002-07-31	2008-06-10	International Business Machines Corporation	Natural error handling in speech recognition
JP2005536795A (ja)	2002-08-20	2005-12-02	コーニンクレッカ　フィリップス　エレクトロニクス　エヌ　ヴィ	ジョブをルーティングする方法
US7634720B2 (en) *	2003-10-24	2009-12-15	Microsoft Corporation	System and method for providing context to an input method
US7580837B2 (en)	2004-08-12	2009-08-25	At&T Intellectual Property I, L.P.	System and method for targeted tuning module of a speech recognition system
US8725505B2 (en)	2004-10-22	2014-05-13	Microsoft Corporation	Verb error recovery in speech recognition
US7242751B2 (en)	2004-12-06	2007-07-10	Sbc Knowledge Ventures, L.P.	System and method for speech recognition-enabled automatic call routing
US7751551B2 (en)	2005-01-10	2010-07-06	At&T Intellectual Property I, L.P.	System and method for speech-enabled call routing
US7627096B2 (en) *	2005-01-14	2009-12-01	At&T Intellectual Property I, L.P.	System and method for independently recognizing and selecting actions and objects in a speech recognition system
JP4734155B2 (ja) *	2006-03-24	2011-07-27	株式会社東芝	音声認識装置、音声認識方法および音声認識プログラム
US20070265831A1 (en) *	2006-05-09	2007-11-15	Itai Dinur	System-Level Correction Service
CA2662564C (en)	2006-11-22	2011-06-28	Multimodal Technologies, Inc.	Recognition of speech in editable audio streams
JP4867654B2 (ja) *	2006-12-28	2012-02-01	日産自動車株式会社	音声認識装置、および音声認識方法
US8909528B2 (en) *	2007-05-09	2014-12-09	Nuance Communications, Inc.	Method and system for prompt construction for selection from a list of acoustically confusable items in spoken dialog systems
US8010465B2 (en) *	2008-02-26	2011-08-30	Microsoft Corporation	Predicting candidates using input scopes
US20090228273A1 (en) *	2008-03-05	2009-09-10	Microsoft Corporation	Handwriting-based user interface for correction of speech recognition errors
US20100138221A1 (en) *	2008-12-02	2010-06-03	Boys Donald R	Dedicated hardware/software voice-to-text system
US11416214B2 (en)	2009-12-23	2022-08-16	Google Llc	Multi-modal input on an electronic device
EP4318463A3 (de) *	2009-12-23	2024-02-28	Google LLC	Multimodale eingabe in eine elektronische vorrichtung
US8494852B2 (en)	2010-01-05	2013-07-23	Google Inc.	Word-level correction of speech input
US8352245B1 (en)	2010-12-30	2013-01-08	Google Inc.	Adjusting language models
US8296142B2 (en)	2011-01-21	2012-10-23	Google Inc.	Speech recognition using dock context
CN102956231B (zh) *	2011-08-23	2014-12-31	上海交通大学	基于半自动校正的语音关键信息记录装置及方法
CN103207769B (zh) *	2012-01-16	2016-10-05	联想(北京)有限公司	语音修正的方法及用户设备
US9361883B2 (en) *	2012-05-01	2016-06-07	Microsoft Technology Licensing, Llc	Dictation with incremental recognition of speech
KR20130135410A (ko) *	2012-05-31	2013-12-11	삼성전자주식회사	음성 인식 기능을 제공하는 방법 및 그 전자 장치
KR20140014510A (ko) *	2012-07-24	2014-02-06	삼성전자주식회사	음성 인식에 의하여 형성된 문자의 편집 방법 및 그 단말
US9111546B2 (en) *	2013-03-06	2015-08-18	Nuance Communications, Inc.	Speech recognition and interpretation system
CN104345880B (zh) *	2013-08-08	2017-12-26	联想(北京)有限公司	一种信息处理的方法及电子设备
US9842592B2 (en)	2014-02-12	2017-12-12	Google Inc.	Language models using non-linguistic context
US9412365B2 (en)	2014-03-24	2016-08-09	Google Inc.	Enhanced maximum entropy models
US9953646B2 (en)	2014-09-02	2018-04-24	Belleau Technologies	Method and system for dynamic speech recognition and tracking of prewritten script
US9922098B2 (en)	2014-11-06	2018-03-20	Microsoft Technology Licensing, Llc	Context-based search and relevancy generation
US10235130B2 (en) *	2014-11-06	2019-03-19	Microsoft Technology Licensing, Llc	Intent driven command processing
US10572810B2 (en)	2015-01-07	2020-02-25	Microsoft Technology Licensing, Llc	Managing user interaction for input understanding determinations
US10134394B2 (en)	2015-03-20	2018-11-20	Google Llc	Speech recognition using log-linear model
CN104822093B (zh)	2015-04-13	2017-12-19	腾讯科技（北京）有限公司	弹幕发布方法和装置
EP3089159B1 (de)	2015-04-28	2019-08-28	Google LLC	Korrekturspracherkennung mittels selektivem re-speak
US10249297B2 (en) *	2015-07-13	2019-04-02	Microsoft Technology Licensing, Llc	Propagating conversational alternatives using delayed hypothesis binding
US10409550B2 (en) *	2016-03-04	2019-09-10	Ricoh Company, Ltd.	Voice control of interactive whiteboard appliances
JP6675078B2 (ja) *	2016-03-15	2020-04-01	パナソニックＩｐマネジメント株式会社	誤認識訂正方法、誤認識訂正装置及び誤認識訂正プログラム
US9978367B2 (en)	2016-03-16	2018-05-22	Google Llc	Determining dialog states for language models
EP3477634B1 (de) *	2016-06-23	2020-09-16	Sony Corporation	Informationsverarbeitungsvorrichtung und informationsverarbeitungsverfahren
US10832664B2 (en)	2016-08-19	2020-11-10	Google Llc	Automated speech recognition using language models that selectively use domain-specific model components
US10446137B2 (en)	2016-09-07	2019-10-15	Microsoft Technology Licensing, Llc	Ambiguity resolving conversational understanding system
US10311860B2 (en)	2017-02-14	2019-06-04	Google Llc	Language model biasing system
WO2018208491A1 (en) *	2017-05-09	2018-11-15	Apple Inc.	User interface for correcting recognition errors
CN109410925A (zh) *	2018-08-30	2019-03-01	安徽声讯信息技术有限公司	一种基于多服务器解析传输的语音校验系统及方法
CN109766130A (zh) *	2018-12-15	2019-05-17	深圳壹账通智能科技有限公司	终端命令校正方法、装置、计算机设备及存储介质
CN109637541B (zh) *	2018-12-29	2021-08-17	联想(北京)有限公司	语音转换文字的方法和电子设备
CN111078098B (zh) *	2019-05-10	2021-11-05	广东小天才科技有限公司	一种听写控制方法及装置
US11605378B2 (en) *	2019-07-01	2023-03-14	Lg Electronics Inc.	Intelligent gateway device and system including the same
US11508361B2 (en) *	2020-06-01	2022-11-22	Amazon Technologies, Inc.	Sentiment aware voice user interface
US11947783B2 (en) *	2021-01-25	2024-04-02	Google Llc	Undoing application operation(s) via user interaction(s) with an automated assistant
CN113591441A (zh) *	2021-07-30	2021-11-02	交互未来(北京)科技有限公司	语音编辑方法及装置、存储介质及电子设备
US20240086637A1 (en) *	2022-09-08	2024-03-14	Tencent America LLC	Efficient hybrid text normalization

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JPS61175849A (ja) *	1985-01-31	1986-08-07	Canon Inc	文字処理装置
US5231670A (en)	1987-06-01	1993-07-27	Kurzweil Applied Intelligence, Inc.	Voice controlled system and method for generating text from a voice controlled input
US6073097A (en)	1992-11-13	2000-06-06	Dragon Systems, Inc.	Speech recognition system which selects one of a plurality of vocabulary models
JP3586777B2 (ja) *	1994-08-17	2004-11-10	富士通株式会社	音声入力装置
JP2690027B2 (ja) *	1994-10-05	1997-12-10	株式会社エイ・ティ・アール音声翻訳通信研究所	パターン認識方法及び装置
US5799279A (en)	1995-11-13	1998-08-25	Dragon Systems, Inc.	Continuous speech recognition of text and commands
US5794189A (en) *	1995-11-13	1998-08-11	Dragon Systems, Inc.	Continuous speech recognition
DE19635754A1 (de)	1996-09-03	1998-03-05	Siemens Ag	Sprachverarbeitungssystem und Verfahren zur Sprachverarbeitung
GB2303955B (en) *	1996-09-24	1997-05-14	Allvoice Computing Plc	Data processing method and apparatus
US5857099A (en) *	1996-09-27	1999-01-05	Allvoice Computing Plc	Speech-to-text dictation system with audio message capability
US5909667A (en) *	1997-03-05	1999-06-01	International Business Machines Corporation	Method and apparatus for fast voice selection of error words in dictated text
CA2321299A1 (en)	1998-03-09	1999-09-16	Lernout & Hauspie Speech Products N.V.	Apparatus and method for simultaneous multimode dictation
JP2000076241A (ja) *	1998-09-03	2000-03-14	Canon Inc	音声認識装置及び音声入力方法
US6314397B1 (en) *	1999-04-13	2001-11-06	International Business Machines Corp.	Method and apparatus for propagating corrections in speech recognition software

1999
- 1999-10-19 US US09/420,863 patent/US6581033B1/en not_active Expired - Lifetime
2000
- 2000-10-13 EP EP00309029A patent/EP1094445B1/de not_active Expired - Lifetime
- 2000-10-13 DE DE60033106T patent/DE60033106T2/de not_active Expired - Lifetime
- 2000-10-19 JP JP2000319866A patent/JP2001184086A/ja active Pending
- 2000-10-19 CN CNB001301950A patent/CN1229772C/zh not_active Expired - Fee Related

Also Published As

Publication number	Publication date
CN1293427A (zh)	2001-05-02
JP2001184086A (ja)	2001-07-06
US6581033B1 (en)	2003-06-17
DE60033106T2 (de)	2007-06-14
EP1094445B1 (de)	2006-02-15
EP1094445A3 (de)	2001-09-12
CN1229772C (zh)	2005-11-30
EP1094445A2 (de)	2001-04-25

Publication	Publication Date	Title
DE60033106D1 (de)	2007-03-15	Korrektur der Betriebsartfehler, Steuerung oder Diktieren, in die Spracherkennung
AU2003293646A1 (en)	2004-07-14	Sensor based speech recognizer selection, adaptation and combination
AU2003295682A1 (en)	2004-06-15	Multilingual speech recognition
EP1221694A4 (de)	2005-06-22	Sprachkodierer/dekodierer
FI19992351L (fi)	2001-04-30	Puheentunnistus
GB0210874D0 (en)	2002-06-19	Speech system barge-in control
DE60227991D1 (de)	2008-09-11	Gteil sowie hilfsmechanismus für den drehvorgang
HK1048187A1 (en)	2003-03-21	Variable bit-rate celp coding of speech with phonetic classification.
DE60229095D1 (de)	2008-11-13	Ausprachen in mehreren Sprachen zur Spracherkennung
DE60334102D1 (de)	2010-10-21	Ausgangsregler
AU2001291307A1 (en)	2002-04-29	Structured speech recognition
DE60109105D1 (de)	2005-04-07	Hierarchisierte Wörterbücher für die Spracherkennung
DE60126882D1 (de)	2007-04-12	Hierarchisierte Wörterbücher für die Spracherkennung
AU2003219758A1 (en)	2003-09-04	Collapsible metal truss
EP1251489A3 (de)	2004-03-31	Training von Parametern eines Spracherkennungssystems zur Erkennung von Aussprachevarianten
DE60044154D1 (de)	2010-05-20	Sprachdekodierung
DE60032068D1 (de)	2007-01-11	Sprachdekodierung
GB0204474D0 (en)	2002-04-10	Speech recognition system
ATA6432000A (de)	2001-05-15	Fenster, insbesondere dachfenster
DE60209706D1 (de)	2006-05-04	Spracherkennungsverfahren
DE10196506T1 (de)	2003-07-31	Dünnfilmstrukturkörper und Herstellungsverfahren dafür, sowie Beschleunigungssensor und Herstellungsverfahren dafür
DE60028310D1 (de)	2006-07-06	Sprachdekodierung
DE50211897D1 (de)	2008-04-24	3H-NAPHTHOÄ2,1-bÜ-PYRAN-DERIVATE SOWIE DEREN VERWENDUNG
ITPI20020063A1 (it)	2004-05-05	Bottale per le lavorazioni delle pelli a struttura perfezionata
AU2003251553A8 (en)	2003-12-31	Extensible structured controlled vocabularies

Legal Events

Date	Code	Title
2007-04-19	8332	No legal effect for de
2007-04-26	8370	Indication related to discontinuation of the patent is to be deleted
2007-10-25	8364	No opposition during term of opposition