[go: up one dir, main page]

Lieberman et al., 2005 - Google Patents

How to wreck a nice beach you sing calm incense

Lieberman et al., 2005

View PDF
Document ID
7452544371064042928
Author
Lieberman H
Faaborg A
Daher W
Espinosa J
Publication year
Publication venue
Proceedings of the 10th international conference on Intelligent user interfaces

External Links

Snippet

A principal problem in speech recognition is distinguishing between words and phrases that sound similar but have different meanings. Speech recognition programs produce a list of weighted candidate hypotheses for a given audio segment, and choose the" best" …
Continue reading at www.academia.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems

Similar Documents

Publication Publication Date Title
Lieberman et al. How to wreck a nice beach you sing calm incense
US9911413B1 (en) Neural latent variable model for spoken language understanding
Tur et al. Spoken language understanding: Systems for extracting semantic information from speech
Lee et al. Spoken content retrieval—beyond cascading speech recognition with text retrieval
Sethy et al. Building topic specific language models from webdata using competitive models.
Gupta et al. The AT&T spoken language understanding system
He et al. A data-driven spoken language understanding system
CN109637537B (en) Method for automatically acquiring annotated data to optimize user-defined awakening model
Shriberg et al. Direct modeling of prosody: An overview of applications in automatic speech processing
Kupiec Probabilistic models of short and long distance word dependencies in running text
Kumar et al. A knowledge graph based speech interface for question answering systems
CN106570180A (en) Artificial intelligence based voice searching method and device
WO2002054385A1 (en) Computer-implemented dynamic language model generation method and system
Carvalho et al. A critical survey on the use of fuzzy sets in speech and natural language processing
Furui Recent progress in corpus-based spontaneous speech recognition
KR20170090127A (en) Apparatus for comprehending speech
Hong et al. Automatically extracting word relationships as templates for pun generation
Desot et al. End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting
Deekshitha et al. Multilingual spoken term detection: a review
KR102418260B1 (en) Method for analyzing customer consultation record
Nguyen et al. Are word boundaries useful for unsupervised language learning?
US20100204982A1 (en) System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems
Jeong et al. Multi-domain spoken language understanding with transfer learning
Masumura et al. Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition.
Trancoso et al. The Impact of Language Technologies in the Legal Domain