Lieberman et al., 2005 - Google Patents
How to wreck a nice beach you sing calm incenseLieberman et al., 2005
View PDF- Document ID
- 7452544371064042928
- Author
- Lieberman H
- Faaborg A
- Daher W
- Espinosa J
- Publication year
- Publication venue
- Proceedings of the 10th international conference on Intelligent user interfaces
External Links
Snippet
A principal problem in speech recognition is distinguishing between words and phrases that sound similar but have different meanings. Speech recognition programs produce a list of weighted candidate hypotheses for a given audio segment, and choose the" best" …
- 238000000034 method 0 abstract description 6
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Lieberman et al. | How to wreck a nice beach you sing calm incense | |
US9911413B1 (en) | Neural latent variable model for spoken language understanding | |
Tur et al. | Spoken language understanding: Systems for extracting semantic information from speech | |
Lee et al. | Spoken content retrieval—beyond cascading speech recognition with text retrieval | |
Sethy et al. | Building topic specific language models from webdata using competitive models. | |
Gupta et al. | The AT&T spoken language understanding system | |
He et al. | A data-driven spoken language understanding system | |
CN109637537B (en) | Method for automatically acquiring annotated data to optimize user-defined awakening model | |
Shriberg et al. | Direct modeling of prosody: An overview of applications in automatic speech processing | |
Kupiec | Probabilistic models of short and long distance word dependencies in running text | |
Kumar et al. | A knowledge graph based speech interface for question answering systems | |
CN106570180A (en) | Artificial intelligence based voice searching method and device | |
WO2002054385A1 (en) | Computer-implemented dynamic language model generation method and system | |
Carvalho et al. | A critical survey on the use of fuzzy sets in speech and natural language processing | |
Furui | Recent progress in corpus-based spontaneous speech recognition | |
KR20170090127A (en) | Apparatus for comprehending speech | |
Hong et al. | Automatically extracting word relationships as templates for pun generation | |
Desot et al. | End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting | |
Deekshitha et al. | Multilingual spoken term detection: a review | |
KR102418260B1 (en) | Method for analyzing customer consultation record | |
Nguyen et al. | Are word boundaries useful for unsupervised language learning? | |
US20100204982A1 (en) | System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems | |
Jeong et al. | Multi-domain spoken language understanding with transfer learning | |
Masumura et al. | Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition. | |
Trancoso et al. | The Impact of Language Technologies in the Legal Domain |