ATE532171T1 - Method and system for generating vocabulary entries from acoustic data
Method and system for generating vocabulary entries from acoustic data
Info
- Publication number
- ATE532171T1
- Authority
- AT
- Austria
- Prior art keywords
- vocabulary
- acoustic data
- entries
- vocabulary entries
- vocabulary entry
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP08159166 | 2008-06-27 | ||
PCT/IB2009/052572 WO2009156903A2 (en) | 2008-06-27 | 2009-06-17 | Method and device for generating vocabulary entry from acoustic data |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE532171T1 (de) | 2011-11-15 |
Family
ID=41279403
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT09769712T ATE532171T1 (de) | 2008-06-27 | 2009-06-17 | Method and system for generating vocabulary entries from acoustic data |
Country Status (5)
Country | Link |
---|---|
US (1) | US8751230B2 (de) |
EP (1) | EP2308042B1 (de) |
CN (1) | CN102077275B (de) |
AT (1) | ATE532171T1 (de) |
WO (1) | WO2009156903A2 (de) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9659559B2 (en) * | 2009-06-25 | 2017-05-23 | Adacel Systems, Inc. | Phonetic distance measurement system and related methods |
US20110004473A1 (en) * | 2009-07-06 | 2011-01-06 | Nice Systems Ltd. | Apparatus and method for enhanced speech recognition |
US20120179694A1 (en) * | 2009-09-28 | 2012-07-12 | International Business Machines Corporation | Method and system for enhancing a search request |
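CN103578467B (zh) * | 2013-10-18 | 2017-01-18 | 威盛电子股份有限公司 | Method for establishing an acoustic model, speech recognition method, and electronic device thereof |
CN103578465B (zh) * | 2013-10-18 | 2016-08-17 | 威盛电子股份有限公司 | Speech recognition method and electronic device |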
US20160062979A1 (en) * | 2014-08-27 | 2016-03-03 | Google Inc. | Word classification based on phonetic features |
WO2016039751A1 (en) * | 2014-09-11 | 2016-03-17 | Nuance Communications, Inc. | Method for scoring in an automatic speech recognition system |
US10002543B2 (en) * | 2014-11-04 | 2018-06-19 | Knotbird LLC | System and methods for transforming language into interactive elements |
GB2533370A (en) | 2014-12-18 | 2016-06-22 | IBM | Orthographic error correction using phonetic transcription |
US20170068868A1 (en) * | 2015-09-09 | 2017-03-09 | Google Inc. | Enhancing handwriting recognition using pre-filter classification |
US10387543B2 (en) * | 2015-10-15 | 2019-08-20 | Vkidz, Inc. | Phoneme-to-grapheme mapping systems and methods |
US10102189B2 (en) * | 2015-12-21 | 2018-10-16 | Verisign, Inc. | Construction of a phonetic representation of a generated string of characters |
US9910836B2 (en) * | 2015-12-21 | 2018-03-06 | Verisign, Inc. | Construction of phonetic representation of a string of characters |
US9947311B2 (en) | 2015-12-21 | 2018-04-17 | Verisign, Inc. | Systems and methods for automatic phonetization of domain names |
US10102203B2 (en) * | 2015-12-21 | 2018-10-16 | Verisign, Inc. | Method for writing a foreign language in a pseudo language phonetically resembling native language of the speaker |
US10229672B1 (en) | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification |
US10902043B2 (en) * | 2016-01-03 | 2021-01-26 | Gracenote, Inc. | Responding to remote media classification queries using classifier models and context parameters |
CN110889987A (zh) * | 2019-12-16 | 2020-03-17 | 安徽必果科技有限公司 | Intelligent review method for correcting spoken English |
CN112487797B (zh) * | 2020-11-26 | 2024-04-05 | 北京有竹居网络技术有限公司 | Data generation method and apparatus, readable medium, and electronic device |
US20240127801A1 (en) * | 2022-10-13 | 2024-04-18 | International Business Machines Corporation | Domain adaptive speech recognition using artificial intelligence |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6101468A (en) * | 1992-11-13 | 2000-08-08 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems |
DE69514382T2 (de) * | 1994-11-01 | 2001-08-23 | British Telecommunications P.L.C., London | Speech recognition |
US6044343A (en) * | 1997-06-27 | 2000-03-28 | Advanced Micro Devices, Inc. | Adaptive speech recognition with selective input data to a speech classifier |
US6021384A (en) * | 1997-10-29 | 2000-02-01 | At&T Corp. | Automatic generation of superwords |
DE60026637T2 (de) * | 1999-06-30 | 2006-10-05 | International Business Machines Corp. | Method for extending the vocabulary of a speech recognition system |
EP1330817B1 (de) * | 2000-11-03 | 2005-07-20 | VoiceCom solutions GmbH | Robust speech recognition with database organization |
US7103533B2 (en) * | 2001-02-21 | 2006-09-05 | International Business Machines Corporation | Method for preserving contextual accuracy in an extendible speech recognition language model |
US7181398B2 (en) * | 2002-03-27 | 2007-02-20 | Hewlett-Packard Development Company, L.P. | Vocabulary independent speech recognition system and method using subword units |
US7389228B2 (en) * | 2002-12-16 | 2008-06-17 | International Business Machines Corporation | Speaker adaptation of vocabulary for speech recognition |
US7698136B1 (en) * | 2003-01-28 | 2010-04-13 | Voxify, Inc. | Methods and apparatus for flexible speech recognition |
US7676364B2 (en) | 2004-03-25 | 2010-03-09 | Ashwin Rao | System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode |
US20070124147A1 (en) * | 2005-11-30 | 2007-05-31 | International Business Machines Corporation | Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems |
2009
- 2009-06-17: EP application EP09769712A, publication EP2308042B1 (de), status: Active
- 2009-06-17: WO application PCT/IB2009/052572, publication WO2009156903A2 (en), status: Application Filing
- 2009-06-17: US application US12/997,898, publication US8751230B2 (en), status: Active
- 2009-06-17: CN application CN2009801245465A, publication CN102077275B (zh), status: Expired - Fee Related
- 2009-06-17: AT application AT09769712T, publication ATE532171T1 (de), status: Active
Also Published As
Publication number | Publication date |
---|---|
US8751230B2 (en) | 2014-06-10 |
WO2009156903A3 (en) | 2010-02-18 |
US20110093259A1 (en) | 2011-04-21 |
CN102077275B (zh) | 2012-08-29 |
CN102077275A (zh) | 2011-05-25 |
EP2308042A2 (de) | 2011-04-13 |
WO2009156903A2 (en) | 2009-12-30 |
EP2308042B1 (de) | 2011-11-02 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ATE532171T1 (de) | Method and system for generating vocabulary entries from acoustic data | |
ATE404967T1 (de) | Text-to-speech system and method, computer program therefor | |
WO2008087934A1 (ja) | Extended recognition dictionary learning device and speech recognition system | |
DE602008003781D1 (de) | System and method for hybrid speech synthesis | |
EP4300824A3 (de) | Apparatus and method for generating time-domain audio samples | |
ATE514162T1 (de) | Dynamic generation of contexts for speech recognition | |
DE60111329D1 (de) | Adaptation of the phonetic context to improve speech recognition | |
ATE499645T1 (de) | System and method for generating a feedback signal in response to an input signal to an electronic device | |
ATE417346T1 (de) | Speech recognition and correction system, correction device, and method for creating a lexicon of alternatives | |
WO2009025356A1 (ja) | Speech recognition device and speech recognition method | |
WO2011133766A3 (en) | Methods and systems for training dictation-based speech-to-text systems using recorded samples | |
ATE431610T1 (de) | Sound masking system and method for generating masking sound | |
PH12014500482A1 (en) | Systems and methods for language learning | |
ATE522857T1 (de) | Method, device, server, system, and computer program product for use with predictive text input | |
EP3091535A3 (de) | Multimodal input to an electronic device | |
WO2012064408A3 (en) | Method for tone/intonation recognition using auditory attention cues | |
WO2014182453A3 (en) | Method and apparatus for training a voice recognition model database | |
ATE523878T1 (de) | Recovery of hidden data embedded in an audio signal and device for data hiding in the compressed domain | |
DE602006021741D1 (de) | Multi-sensory speech enhancement using a speech state model | |
EP1908053A4 (de) | Speech analysis system | |
WO2012057562A3 (ko) | Emotional speech synthesis apparatus and method | |
SG10201803756YA (en) | A membrane filtration module | |
DE602005024497D1 (de) | Hidden conditional random field models for phonetic classification and speech recognition | |
DE602005009091D1 (de) | Generating a speech recognition grammar for alphanumeric expressions | |
GB2545096A (en) | Biometric-music interaction methods and systems |