[go: up one dir, main page]

Seppi et al., 2010 - Google Patents

Data pruning for template-based automatic speech recognition.

Seppi et al., 2010

View PDF
Document ID
10039016653301662384
Author
Seppi D
Van Compernolle D
Publication year
Publication venue
INTERSPEECH

External Links

Snippet

In this paper we describe and analyze a data pruning method in combination with template- based automatic speech recognition. We demonstrate the positive effects of polishing the template database by minimizing the word error rate scores. Data pruning allowed to …
Continue reading at www.isca-archive.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/085Methods for reducing search complexity, pruning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6268Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass

Similar Documents

Publication Publication Date Title
Jiang et al. Large margin hidden Markov models for speech recognition
US8301450B2 (en) Apparatus, method, and medium for dialogue speech recognition using topic domain detection
Imseng et al. Using out-of-language data to improve an under-resourced speech recognizer
Wang et al. Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
WO2003090203A2 (en) Pattern matching for large vocabulary speech recognition with packed distribution and localized trellis access
Cui et al. Multi-view and multi-objective semi-supervised learning for hmm-based automatic speech recognition
Abdou et al. Beam search pruning in speech recognition using a posterior probability-based confidence measure
Bolanos The bavieca open-source speech recognition toolkit
Gales et al. Support vector machines for noise robust ASR.
US7565290B2 (en) Speech recognition method and apparatus
CN100431003C (en) A Speech Decoding Method Based on Confusion Network
Seppi et al. Data pruning for template-based automatic speech recognition.
JP5079760B2 (en) Acoustic model parameter learning device, acoustic model parameter learning method, acoustic model parameter learning program
Demuynck et al. Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch
Li et al. Solving large margin estimation of HMMS via semidefinite programming.
KR101971696B1 (en) Apparatus and method for creating optimum acoustic model
Liu et al. Automatic model complexity control using marginalized discriminative growth functions
Vaněk et al. Discriminative training of gender-dependent acoustic models
US20250259621A1 (en) Techniques for utterance grouping and for improved training of machine learning models using grouped utterance data
Pylkkönen Investigations on discriminative training in large scale acoustic model estimation.
JP4705535B2 (en) Acoustic model creation device, speech recognition device, and acoustic model creation program
Pironkov et al. I-vector estimation as auxiliary task for multi-task learning based acoustic modeling for automatic speech recognition
De Wachter et al. Evaluating acoustic distance measures for template based recognition
Gales et al. Discriminative classifiers with generative kernels for noise robust speech recognition
Chen et al. Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling.