Lieberman et al., 2005 - Google Patents

How to wreck a nice beach you sing calm incense

Lieberman et al., 2005

Document ID: 7452544371064042928
Author: Lieberman H; Faaborg A; Daher W; Espinosa J
Publication year: 2005
Publication venue: Proceedings of the 10th international conference on Intelligent user interfaces

External Links

Cited by

Snippet

A principal problem in speech recognition is distinguishing between words and phrases that sound similar but have different meanings. Speech recognition programs produce a list of weighted candidate hypotheses for a given audio segment, and choose the" best" …

Continue reading at www.academia.edu (PDF) (other versions)

238000000034 method 0 abstract description 6

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Lieberman et al.	2005	How to wreck a nice beach you sing calm incense
US9911413B1 (en)	2018-03-06	Neural latent variable model for spoken language understanding
Tur et al.	2011	Spoken language understanding: Systems for extracting semantic information from speech
Lee et al.	2015	Spoken content retrieval—beyond cascading speech recognition with text retrieval
Sethy et al.	2005	Building topic specific language models from webdata using competitive models.
Gupta et al.	2005	The AT&T spoken language understanding system
He et al.	2003	A data-driven spoken language understanding system
CN109637537B (en)	2020-06-30	Method for automatically acquiring annotated data to optimize user-defined awakening model
Shriberg et al.	2004	Direct modeling of prosody: An overview of applications in automatic speech processing
Kupiec	1989	Probabilistic models of short and long distance word dependencies in running text
Kumar et al.	2017	A knowledge graph based speech interface for question answering systems
CN106570180A (en)	2017-04-19	Artificial intelligence based voice searching method and device
WO2002054385A1 (en)	2002-07-11	Computer-implemented dynamic language model generation method and system
Carvalho et al.	2012	A critical survey on the use of fuzzy sets in speech and natural language processing
Furui	2005	Recent progress in corpus-based spontaneous speech recognition
KR20170090127A (en)	2017-08-07	Apparatus for comprehending speech
Hong et al.	2009	Automatically extracting word relationships as templates for pun generation
Desot et al.	2022	End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting
Deekshitha et al.	2020	Multilingual spoken term detection: a review
KR102418260B1 (en)	2022-07-06	Method for analyzing customer consultation record
Nguyen et al.	2022	Are word boundaries useful for unsupervised language learning?
US20100204982A1 (en)	2010-08-12	System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems
Jeong et al.	2009	Multi-domain spoken language understanding with transfer learning
Masumura et al.	2011	Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition.
Trancoso et al.	2023	The Impact of Language Technologies in the Legal Domain