Seppi et al., 2010 - Google Patents

Data pruning for template-based automatic speech recognition.

Document ID: 10039016653301662384
Author: Seppi D; Van Compernolle D
Publication year: 2010
Publication venue: INTERSPEECH

Snippet

In this paper we describe and analyze a data pruning method in combination with template-based automatic speech recognition. We demonstrate the positive effects of polishing the template database by minimizing the word error rate scores. Data pruning allowed to …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Jiang et al.	2006	Large margin hidden Markov models for speech recognition
US8301450B2 (en)	2012-10-30	Apparatus, method, and medium for dialogue speech recognition using topic domain detection
Imseng et al.	2014	Using out-of-language data to improve an under-resourced speech recognizer
Wang et al.	2013	Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
WO2003090203A2 (en)	2003-10-30	Pattern matching for large vocabulary speech recognition with packed distribution and localized trellis access
Cui et al.	2012	Multi-view and multi-objective semi-supervised learning for HMM-based automatic speech recognition
Abdou et al.	2004	Beam search pruning in speech recognition using a posterior probability-based confidence measure
Bolanos	2012	The Bavieca open-source speech recognition toolkit
Gales et al.	2009	Support vector machines for noise robust ASR.
US7565290B2 (en)	2009-07-21	Speech recognition method and apparatus
CN100431003C (en)	2008-11-05	A Speech Decoding Method Based on Confusion Network
Seppi et al.	2010	Data pruning for template-based automatic speech recognition.
JP5079760B2 (en)	2012-11-21	Acoustic model parameter learning device, acoustic model parameter learning method, acoustic model parameter learning program
Demuynck et al.	1996	Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch
Li et al.	2006	Solving large margin estimation of HMMs via semidefinite programming.
KR101971696B1 (en)	2019-04-25	Apparatus and method for creating optimum acoustic model
Liu et al.	2007	Automatic model complexity control using marginalized discriminative growth functions
Vaněk et al.	2009	Discriminative training of gender-dependent acoustic models
US20250259621A1 (en)	2025-08-14	Techniques for utterance grouping and for improved training of machine learning models using grouped utterance data
Pylkkönen	2009	Investigations on discriminative training in large scale acoustic model estimation.
JP4705535B2 (en)	2011-06-22	Acoustic model creation device, speech recognition device, and acoustic model creation program
Pironkov et al.	2016	I-vector estimation as auxiliary task for multi-task learning based acoustic modeling for automatic speech recognition
De Wachter et al.	2007	Evaluating acoustic distance measures for template based recognition
Gales et al.	2008	Discriminative classifiers with generative kernels for noise robust speech recognition
Chen et al.	2010	Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling.