ATE532171T1 - Method and system for generating vocabulary entries from acoustic data

Info

Publication number
ATE532171T1
Authority
AT
Austria
Prior art keywords
vocabulary
acoustic data
entries
vocabulary entries
vocabulary entry
Prior art date
Application number
AT09769712T
Other languages
English (en)
Inventor
Zsolt Saffer
Original Assignee
Koninkl Philips Electronics Nv
Priority date
2008-06-27
Filing date
2009-06-17
Publication date
2011-11-15
Application filed by Koninkl Philips Electronics Nv
Application granted
Publication of ATE532171T1

Classifications

    • G PHYSICS
      • G10 MUSICAL INSTRUMENTS; ACOUSTICS
        • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
          • G10L15/00 Speech recognition
            • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
            • G10L15/08 Speech classification or search
              • G10L15/16 Speech classification or search using artificial neural networks
              • G10L15/18 Speech classification or search using natural language modelling
                • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
                  • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
EP08159166 | 2008-06-27 | |
PCT/IB2009/052572 (WO2009156903A2, en) | 2008-06-27 | 2009-06-17 | Method and device for generating vocabulary entry from acoustic data

Publications (1)

Publication Number | Publication Date
ATE532171T1 (de) | 2011-11-15

Family

ID=41279403

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
AT09769712T (ATE532171T1, de) | Method and system for generating vocabulary entries from acoustic data | 2008-06-27 | 2009-06-17

Country Status (5)

Country | Link
US (1) | US8751230B2 (de)
EP (1) | EP2308042B1 (de)
CN (1) | CN102077275B (de)
AT (1) | ATE532171T1 (de)
WO (1) | WO2009156903A2 (de)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US9659559B2 (en) * | 2009-06-25 | 2017-05-23 | Adacel Systems, Inc. | Phonetic distance measurement system and related methods
US20110004473A1 (en) * | 2009-07-06 | 2011-01-06 | Nice Systems Ltd. | Apparatus and method for enhanced speech recognition
US20120179694A1 (en) * | 2009-09-28 | 2012-07-12 | International Business Machines Corporation | Method and system for enhancing a search request
CN103578467B (zh) * | 2013-10-18 | 2017-01-18 | 威盛电子股份有限公司 | Method for establishing an acoustic model, speech recognition method, and electronic device thereof
CN103578465B (zh) * | 2013-10-18 | 2016-08-17 | 威盛电子股份有限公司 | Speech recognition method and electronic device
US20160062979A1 (en) * | 2014-08-27 | 2016-03-03 | Google Inc. | Word classification based on phonetic features
WO2016039751A1 (en) * | 2014-09-11 | 2016-03-17 | Nuance Communications, Inc. | Method for scoring in an automatic speech recognition system
US10002543B2 (en) * | 2014-11-04 | 2018-06-19 | Knotbird LLC | System and methods for transforming language into interactive elements
GB2533370A (en) | 2014-12-18 | 2016-06-22 | Ibm | Orthographic error correction using phonetic transcription
US20170068868A1 (en) * | 2015-09-09 | 2017-03-09 | Google Inc. | Enhancing handwriting recognition using pre-filter classification
US10387543B2 (en) * | 2015-10-15 | 2019-08-20 | Vkidz, Inc. | Phoneme-to-grapheme mapping systems and methods
US10102189B2 (en) * | 2015-12-21 | 2018-10-16 | Verisign, Inc. | Construction of a phonetic representation of a generated string of characters
US9910836B2 (en) * | 2015-12-21 | 2018-03-06 | Verisign, Inc. | Construction of phonetic representation of a string of characters
US9947311B2 (en) | 2015-12-21 | 2018-04-17 | Verisign, Inc. | Systems and methods for automatic phonetization of domain names
US10102203B2 (en) * | 2015-12-21 | 2018-10-16 | Verisign, Inc. | Method for writing a foreign language in a pseudo language phonetically resembling native language of the speaker
US10229672B1 (en) | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification
US10902043B2 (en) * | 2016-01-03 | 2021-01-26 | Gracenote, Inc. | Responding to remote media classification queries using classifier models and context parameters
CN110889987A (zh) * | 2019-12-16 | 2020-03-17 | 安徽必果科技有限公司 | An intelligent review method for spoken English correction
CN112487797B (zh) * | 2020-11-26 | 2024-04-05 | 北京有竹居网络技术有限公司 | Data generation method, apparatus, readable medium, and electronic device
US20240127801A1 (en) * | 2022-10-13 | 2024-04-18 | International Business Machines Corporation | Domain adaptive speech recognition using artificial intelligence

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6101468A (en) * | 1992-11-13 | 2000-08-08 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems
DE69514382T2 (de) * | 1994-11-01 | 2001-08-23 | British Telecommunications P.L.C., London | Speech recognition
US6044343A (en) * | 1997-06-27 | 2000-03-28 | Advanced Micro Devices, Inc. | Adaptive speech recognition with selective input data to a speech classifier
US6021384A (en) * | 1997-10-29 | 2000-02-01 | At&T Corp. | Automatic generation of superwords
DE60026637T2 (de) * | 1999-06-30 | 2006-10-05 | International Business Machines Corp. | Method for extending the vocabulary of a speech recognition system
EP1330817B1 (de) * | 2000-11-03 | 2005-07-20 | VoiceCom solutions GmbH | Robust speech recognition with database organization
US7103533B2 (en) * | 2001-02-21 | 2006-09-05 | International Business Machines Corporation | Method for preserving contextual accuracy in an extendible speech recognition language model
US7181398B2 (en) * | 2002-03-27 | 2007-02-20 | Hewlett-Packard Development Company, L.P. | Vocabulary independent speech recognition system and method using subword units
US7389228B2 (en) * | 2002-12-16 | 2008-06-17 | International Business Machines Corporation | Speaker adaptation of vocabulary for speech recognition
US7698136B1 (en) * | 2003-01-28 | 2010-04-13 | Voxify, Inc. | Methods and apparatus for flexible speech recognition
US7676364B2 (en) | 2004-03-25 | 2010-03-09 | Ashwin Rao | System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode
US20070124147A1 (en) * | 2005-11-30 | 2007-05-31 | International Business Machines Corporation | Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems

Also Published As

Publication number | Publication date
US8751230B2 (en) | 2014-06-10
WO2009156903A3 (en) | 2010-02-18
US20110093259A1 (en) | 2011-04-21
CN102077275B (zh) | 2012-08-29
CN102077275A (zh) | 2011-05-25
EP2308042A2 (de) | 2011-04-13
WO2009156903A2 (en) | 2009-12-30
EP2308042B1 (de) | 2011-11-02

Similar Documents

Publication | Publication Date | Title
ATE532171T1 (de) | | Method and system for generating vocabulary entries from acoustic data
ATE404967T1 (de) | | Text-to-speech system and method, and computer program therefor
WO2008087934A1 (ja) | | Extended recognition dictionary learning device and speech recognition system
DE602008003781D1 (de) | | System and method for hybrid speech synthesis
EP4300824A3 (de) | | Device and method for generating time-domain audio patterns
ATE514162T1 (de) | | Dynamic generation of contexts for speech recognition
DE60111329D1 (de) | | Adapting the phonetic context to improve speech recognition
ATE499645T1 (de) | | System and method for generating a feedback signal in response to an input signal to an electronic device
ATE417346T1 (de) | | Speech recognition and correction system, correction device, and method for creating a lexicon of alternatives
WO2009025356A1 (ja) | | Speech recognition device and speech recognition method
WO2011133766A3 (en) | | Methods and systems for training dictation-based speech-to-text systems using recorded samples
ATE431610T1 (de) | | Sound masking system and method for generating masking sound
PH12014500482A1 (en) | | Systems and methods for language learning
ATE522857T1 (de) | | Method, device, server, system, and computer program product for use with predictive text input
EP3091535A3 (de) | | Multimodal input to an electronic device
WO2012064408A3 (en) | | Method for tone/intonation recognition using auditory attention cues
WO2014182453A3 (en) | | Method and apparatus for training a voice recognition model database
ATE523878T1 (de) | | Recovery of hidden data embedded in an audio signal and device for data hiding in the compressed domain
DE602006021741D1 (de) | | Multi-sensory speech enhancement using a speech state model
EP1908053A4 (de) | | Speech analysis system
WO2012057562A3 (ko) | | Emotional speech synthesis apparatus and method thereof
SG10201803756YA (en) | | A membrane filtration module
DE602005024497D1 (de) | | Hidden conditional random field models for phonetic classification and speech recognition
DE602005009091D1 (de) | | Generating a speech recognition grammar for alphanumeric expressions
GB2545096A (en) | | Biometric-music interaction methods and systems