
DE60033106D1 - Correction of operating mode errors, control or dictation, in speech recognition - Google Patents

Correction of operating mode errors, control or dictation, in speech recognition

Info

Publication number
DE60033106D1
Authority
DE
Germany
Prior art keywords
dictation
correction
control
operating mode
speech recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60033106T
Other languages
English (en)
Other versions
DE60033106T2 (de)
Inventor
Jeffrey C Reynar
Erik Rucker
Paul Kyong Hwan Kim
David Allen Caulton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Application granted
Publication of DE60033106D1
Publication of DE60033106T2
Anticipated expiration
Expired - Lifetime (current)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
DE60033106T 1999-10-19 2000-10-13 Correction of operating mode errors, control or dictation, in speech recognition Expired - Lifetime DE60033106T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US420863 1999-10-19
US09/420,863 US6581033B1 (en) 1999-10-19 1999-10-19 System and method for correction of speech recognition mode errors

Publications (2)

Publication Number Publication Date
DE60033106D1 (de) 2007-03-15
DE60033106T2 (de) 2007-06-14

Family

ID=23668144

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60033106T Expired - Lifetime DE60033106T2 (de) 1999-10-19 2000-10-13 Correction of operating mode errors, control or dictation, in speech recognition

Country Status (5)

Country Link
US (1) US6581033B1 (de)
EP (1) EP1094445B1 (de)
JP (1) JP2001184086A (de)
CN (1) CN1229772C (de)
DE (1) DE60033106T2 (de)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6514201B1 (en) * 1999-01-29 2003-02-04 Acuson Corporation Voice-enhanced diagnostic medical ultrasound system and review station
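JP3476007B2 (ja) * 1999-09-10 2003-12-10 インターナショナル・ビジネス・マシーンズ・コーポレーション Recognized-word registration method, speech recognition method, speech recognition apparatus, storage medium storing a software product for recognized-word registration, and storage medium storing a software product for speech recognition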
US7109970B1 (en) 2000-07-01 2006-09-19 Miller Stephen S Apparatus for remotely controlling computers and other electronic appliances/devices using a combination of voice commands and finger movements
US7035805B1 (en) * 2000-07-14 2006-04-25 Miller Stephen S Switching the modes of operation for voice-recognition applications
US7451085B2 (en) 2000-10-13 2008-11-11 At&T Intellectual Property Ii, L.P. System and method for providing a compensated speech recognition model for speech recognition
DE10120513C1 (de) 2001-04-26 2003-01-09 Siemens Ag Method for determining a sequence of sound modules for synthesizing a speech signal of a tonal language
US7809574B2 (en) * 2001-09-05 2010-10-05 Voice Signal Technologies Inc. Word recognition using choice lists
US7313526B2 (en) 2001-09-05 2007-12-25 Voice Signal Technologies, Inc. Speech recognition using selectable recognition modes
US7444286B2 (en) 2001-09-05 2008-10-28 Roth Daniel L Speech recognition using re-utterance recognition
US7526431B2 (en) * 2001-09-05 2009-04-28 Voice Signal Technologies, Inc. Speech recognition using ambiguous or phone key spelling and/or filtering
US7467089B2 (en) * 2001-09-05 2008-12-16 Roth Daniel L Combined speech and handwriting recognition
US7505911B2 (en) * 2001-09-05 2009-03-17 Roth Daniel L Combined speech recognition and sound recording
WO2004023455A2 (en) * 2002-09-06 2004-03-18 Voice Signal Technologies, Inc. Methods, systems, and programming for performing speech recognition
US7386454B2 (en) * 2002-07-31 2008-06-10 International Business Machines Corporation Natural error handling in speech recognition
JP2005536795A (ja) 2002-08-20 2005-12-02 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Method for routing jobs
US7634720B2 (en) * 2003-10-24 2009-12-15 Microsoft Corporation System and method for providing context to an input method
US7580837B2 (en) 2004-08-12 2009-08-25 At&T Intellectual Property I, L.P. System and method for targeted tuning module of a speech recognition system
US8725505B2 (en) 2004-10-22 2014-05-13 Microsoft Corporation Verb error recovery in speech recognition
US7242751B2 (en) 2004-12-06 2007-07-10 Sbc Knowledge Ventures, L.P. System and method for speech recognition-enabled automatic call routing
US7751551B2 (en) 2005-01-10 2010-07-06 At&T Intellectual Property I, L.P. System and method for speech-enabled call routing
US7627096B2 (en) * 2005-01-14 2009-12-01 At&T Intellectual Property I, L.P. System and method for independently recognizing and selecting actions and objects in a speech recognition system
JP4734155B2 (ja) * 2006-03-24 2011-07-27 株式会社東芝 Speech recognition apparatus, speech recognition method, and speech recognition program
US20070265831A1 (en) * 2006-05-09 2007-11-15 Itai Dinur System-Level Correction Service
CA2662564C (en) 2006-11-22 2011-06-28 Multimodal Technologies, Inc. Recognition of speech in editable audio streams
JP4867654B2 (ja) * 2006-12-28 2012-02-01 日産自動車株式会社 Speech recognition apparatus and speech recognition method
US8909528B2 (en) * 2007-05-09 2014-12-09 Nuance Communications, Inc. Method and system for prompt construction for selection from a list of acoustically confusable items in spoken dialog systems
US8010465B2 (en) * 2008-02-26 2011-08-30 Microsoft Corporation Predicting candidates using input scopes
US20090228273A1 (en) * 2008-03-05 2009-09-10 Microsoft Corporation Handwriting-based user interface for correction of speech recognition errors
US20100138221A1 (en) * 2008-12-02 2010-06-03 Boys Donald R Dedicated hardware/software voice-to-text system
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
EP4318463A3 (de) * 2009-12-23 2024-02-28 Google LLC Multi-modal input on an electronic device
US8494852B2 (en) 2010-01-05 2013-07-23 Google Inc. Word-level correction of speech input
US8352245B1 (en) 2010-12-30 2013-01-08 Google Inc. Adjusting language models
US8296142B2 (en) 2011-01-21 2012-10-23 Google Inc. Speech recognition using dock context
CN102956231B (zh) * 2011-08-23 2014-12-31 上海交通大学 Device and method for recording key speech information based on semi-automatic correction
CN103207769B (zh) * 2012-01-16 2016-10-05 联想(北京)有限公司 Speech correction method and user equipment
US9361883B2 (en) * 2012-05-01 2016-06-07 Microsoft Technology Licensing, Llc Dictation with incremental recognition of speech
KR20130135410A (ko) * 2012-05-31 2013-12-11 삼성전자주식회사 Method for providing a speech recognition function and electronic device thereof
KR20140014510A (ko) * 2012-07-24 2014-02-06 삼성전자주식회사 Method for editing text formed by speech recognition and terminal thereof
US9111546B2 (en) * 2013-03-06 2015-08-18 Nuance Communications, Inc. Speech recognition and interpretation system
CN104345880B (zh) * 2013-08-08 2017-12-26 联想(北京)有限公司 Information processing method and electronic device
US9842592B2 (en) 2014-02-12 2017-12-12 Google Inc. Language models using non-linguistic context
US9412365B2 (en) 2014-03-24 2016-08-09 Google Inc. Enhanced maximum entropy models
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
US9922098B2 (en) 2014-11-06 2018-03-20 Microsoft Technology Licensing, Llc Context-based search and relevancy generation
US10235130B2 (en) * 2014-11-06 2019-03-19 Microsoft Technology Licensing, Llc Intent driven command processing
US10572810B2 (en) 2015-01-07 2020-02-25 Microsoft Technology Licensing, Llc Managing user interaction for input understanding determinations
US10134394B2 (en) 2015-03-20 2018-11-20 Google Llc Speech recognition using log-linear model
CN104822093B (zh) 2015-04-13 2017-12-19 腾讯科技(北京)有限公司 Bullet-screen comment publishing method and apparatus
EP3089159B1 (de) 2015-04-28 2019-08-28 Google LLC Correcting speech recognition using selective re-speak
US10249297B2 (en) * 2015-07-13 2019-04-02 Microsoft Technology Licensing, Llc Propagating conversational alternatives using delayed hypothesis binding
US10409550B2 (en) * 2016-03-04 2019-09-10 Ricoh Company, Ltd. Voice control of interactive whiteboard appliances
JP6675078B2 (ja) * 2016-03-15 2020-04-01 パナソニックIpマネジメント株式会社 Misrecognition correction method, misrecognition correction apparatus, and misrecognition correction program
US9978367B2 (en) 2016-03-16 2018-05-22 Google Llc Determining dialog states for language models
EP3477634B1 (de) * 2016-06-23 2020-09-16 Sony Corporation Information processing device and information processing method
US10832664B2 (en) 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US10446137B2 (en) 2016-09-07 2019-10-15 Microsoft Technology Licensing, Llc Ambiguity resolving conversational understanding system
US10311860B2 (en) 2017-02-14 2019-06-04 Google Llc Language model biasing system
WO2018208491A1 (en) * 2017-05-09 2018-11-15 Apple Inc. User interface for correcting recognition errors
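CN109410925A (zh) * 2018-08-30 2019-03-01 安徽声讯信息技术有限公司 Speech verification system and method based on multi-server parsing and transmission
CN109766130A (zh) * 2018-12-15 2019-05-17 深圳壹账通智能科技有限公司 Terminal command correction method and apparatus, computer device, and storage medium
CN109637541B (zh) * 2018-12-29 2021-08-17 联想(北京)有限公司 Method for converting speech to text and electronic device
CN111078098B (zh) * 2019-05-10 2021-11-05 广东小天才科技有限公司 Dictation control method and apparatus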
US11605378B2 (en) * 2019-07-01 2023-03-14 Lg Electronics Inc. Intelligent gateway device and system including the same
US11508361B2 (en) * 2020-06-01 2022-11-22 Amazon Technologies, Inc. Sentiment aware voice user interface
US11947783B2 (en) * 2021-01-25 2024-04-02 Google Llc Undoing application operation(s) via user interaction(s) with an automated assistant
CN113591441A (zh) * 2021-07-30 2021-11-02 交互未来(北京)科技有限公司 Speech editing method and apparatus, storage medium, and electronic device
US20240086637A1 (en) * 2022-09-08 2024-03-14 Tencent America LLC Efficient hybrid text normalization

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61175849A (ja) * 1985-01-31 1986-08-07 Canon Inc Character processing apparatus
US5231670A (en) 1987-06-01 1993-07-27 Kurzweil Applied Intelligence, Inc. Voice controlled system and method for generating text from a voice controlled input
US6073097A (en) 1992-11-13 2000-06-06 Dragon Systems, Inc. Speech recognition system which selects one of a plurality of vocabulary models
JP3586777B2 (ja) * 1994-08-17 2004-11-10 富士通株式会社 Voice input device
JP2690027B2 (ja) * 1994-10-05 1997-12-10 株式会社エイ・ティ・アール音声翻訳通信研究所 Pattern recognition method and apparatus
US5799279A (en) 1995-11-13 1998-08-25 Dragon Systems, Inc. Continuous speech recognition of text and commands
US5794189A (en) * 1995-11-13 1998-08-11 Dragon Systems, Inc. Continuous speech recognition
DE19635754A1 (de) 1996-09-03 1998-03-05 Siemens Ag Speech processing system and method for speech processing
GB2303955B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US5857099A (en) * 1996-09-27 1999-01-05 Allvoice Computing Plc Speech-to-text dictation system with audio message capability
US5909667A (en) * 1997-03-05 1999-06-01 International Business Machines Corporation Method and apparatus for fast voice selection of error words in dictated text
CA2321299A1 (en) 1998-03-09 1999-09-16 Lernout & Hauspie Speech Products N.V. Apparatus and method for simultaneous multimode dictation
JP2000076241A (ja) * 1998-09-03 2000-03-14 Canon Inc Speech recognition apparatus and speech input method
US6314397B1 (en) * 1999-04-13 2001-11-06 International Business Machines Corp. Method and apparatus for propagating corrections in speech recognition software

Also Published As

Publication number Publication date
CN1293427A (zh) 2001-05-02
JP2001184086A (ja) 2001-07-06
US6581033B1 (en) 2003-06-17
DE60033106T2 (de) 2007-06-14
EP1094445B1 (de) 2006-02-15
EP1094445A3 (de) 2001-09-12
CN1229772C (zh) 2005-11-30
EP1094445A2 (de) 2001-04-25

Similar Documents

Publication Publication Date Title
DE60033106D1 (de) Correction of operating mode errors, control or dictation, in speech recognition
AU2003293646A1 (en) Sensor based speech recognizer selection, adaptation and combination
AU2003295682A1 (en) Multilingual speech recognition
EP1221694A4 (de) Speech coder/decoder
FI19992351L (fi) Speech recognition
GB0210874D0 (en) Speech system barge-in control
DE60227991D1 (de) Gteil sowie hilfsmechanismus für den drehvorgang
HK1048187A1 (en) Variable bit-rate celp coding of speech with phonetic classification.
DE60229095D1 (de) Pronunciations in multiple languages for speech recognition
DE60334102D1 (de) Output regulator
AU2001291307A1 (en) Structured speech recognition
DE60109105D1 (de) Hierarchical dictionaries for speech recognition
DE60126882D1 (de) Hierarchical dictionaries for speech recognition
AU2003219758A1 (en) Collapsible metal truss
EP1251489A3 (de) Training of parameters of a speech recognition system for recognizing pronunciation variants
DE60044154D1 (de) Speech decoding
DE60032068D1 (de) Speech decoding
GB0204474D0 (en) Speech recognition system
ATA6432000A (de) Window, in particular roof window
DE60209706D1 (de) Speech recognition method
DE10196506T1 (de) Thin-film structural body and manufacturing method therefor, and acceleration sensor and manufacturing method therefor
DE60028310D1 (de) Speech decoding
DE50211897D1 (de) 3H-naphtho[2,1-b]pyran derivatives and their use
ITPI20020063A1 (it) Drum for processing hides with improved structure
AU2003251553A8 (en) Extensible structured controlled vocabularies

Legal Events

Date Code Title Description
8332 No legal effect for de
8370 Indication related to discontinuation of the patent is to be deleted
8364 No opposition during term of opposition