Seppi et al., 2010 - Google Patents
Data pruning for template-based automatic speech recognition.Seppi et al., 2010
View PDF- Document ID
- 10039016653301662384
- Author
- Seppi D
- Van Compernolle D
- Publication year
- Publication venue
- INTERSPEECH
External Links
Snippet
In this paper we describe and analyze a data pruning method in combination with template- based automatic speech recognition. We demonstrate the positive effects of polishing the template database by minimizing the word error rate scores. Data pruning allowed to …
- 230000000694 effects 0 abstract description 4
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Jiang et al. | Large margin hidden Markov models for speech recognition | |
| US8301450B2 (en) | Apparatus, method, and medium for dialogue speech recognition using topic domain detection | |
| Imseng et al. | Using out-of-language data to improve an under-resourced speech recognizer | |
| Wang et al. | Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection | |
| WO2003090203A2 (en) | Pattern matching for large vocabulary speech recognition with packed distribution and localized trellis access | |
| Cui et al. | Multi-view and multi-objective semi-supervised learning for hmm-based automatic speech recognition | |
| Abdou et al. | Beam search pruning in speech recognition using a posterior probability-based confidence measure | |
| Bolanos | The bavieca open-source speech recognition toolkit | |
| Gales et al. | Support vector machines for noise robust ASR. | |
| US7565290B2 (en) | Speech recognition method and apparatus | |
| CN100431003C (en) | A Speech Decoding Method Based on Confusion Network | |
| Seppi et al. | Data pruning for template-based automatic speech recognition. | |
| JP5079760B2 (en) | Acoustic model parameter learning device, acoustic model parameter learning method, acoustic model parameter learning program | |
| Demuynck et al. | Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch | |
| Li et al. | Solving large margin estimation of HMMS via semidefinite programming. | |
| KR101971696B1 (en) | Apparatus and method for creating optimum acoustic model | |
| Liu et al. | Automatic model complexity control using marginalized discriminative growth functions | |
| Vaněk et al. | Discriminative training of gender-dependent acoustic models | |
| US20250259621A1 (en) | Techniques for utterance grouping and for improved training of machine learning models using grouped utterance data | |
| Pylkkönen | Investigations on discriminative training in large scale acoustic model estimation. | |
| JP4705535B2 (en) | Acoustic model creation device, speech recognition device, and acoustic model creation program | |
| Pironkov et al. | I-vector estimation as auxiliary task for multi-task learning based acoustic modeling for automatic speech recognition | |
| De Wachter et al. | Evaluating acoustic distance measures for template based recognition | |
| Gales et al. | Discriminative classifiers with generative kernels for noise robust speech recognition | |
| Chen et al. | Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling. |