Mendels et al., 2015 - Google Patents

Improving speech recognition and keyword search for low resource languages using web data

Document ID: 11540504972847053855
Author: Mendels G; Cooper E; Soto V; Hirschberg J; Gales M; Knill K; Ragni A; Wang H
Publication year: 2015
Publication venue: INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association

Snippet

We describe the use of text data scraped from the web to augment language models for Automatic Speech Recognition and Keyword Search for Low Resource Languages. We scrape text from multiple genres including blogs, online news, translated TED talks, and …

Continue reading at eprints.whiterose.ac.uk (PDF) (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/2775—Phrasal analysis, e.g. finite state techniques, chunking
- G06F17/278—Named entity recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30796—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using original textual content or text extracted from visual content or transcript of audio data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker

Similar Documents

Publication	Publication Date	Title
Mendels et al.	2015	Improving speech recognition and keyword search for low resource languages using web data
Chelba et al.	2008	Retrieval and browsing of spoken content
Mamou et al.	2013	System combination and score normalization for spoken term detection
US8356032B2 (en)	2013-01-15	Method, medium, and system retrieving a media file based on extracted partial keyword
Chen et al.	2002	Unknown word extraction for Chinese documents
Heck et al.	2013	Leveraging knowledge graphs for web-scale unsupervised semantic parsing
JP5241840B2 (en)	2013-07-17	Computer-implemented method and information retrieval system for indexing and retrieving documents in a database
US8126897B2 (en)	2012-02-28	Unified inverted index for video passage retrieval
CA2454506A1 (en)	2003-02-06	Speech input search system
JP2004005600A (en)	2004-01-08	Method and system for indexing and retrieving document stored in database
JP2004133880A (en)	2004-04-30	Method for constructing dynamic vocabulary for speech recognizer used in database for indexed document
Parlak et al.	2011	Performance analysis and improvement of Turkish broadcast news retrieval
Ng et al.	2000	Experiments in spoken document retrieval using phoneme n-grams
Pan et al.	2009	Performance analysis for lattice-based speech indexing approaches using words and subword units
Audhkhasi et al.	2007	Keyword search using modified minimum edit distance measure
Roy et al.	2021	An unsupervised normalization algorithm for noisy text: a case study for information retrieval and stance detection
Lin et al.	2009	Improved speech summarization with multiple-hypothesis representations and Kullback-Leibler divergence measures
Huang et al.	2011	Speech Indexing Using Semantic Context Inference
Mamou et al.	2008	Combination of multiple speech transcription methods for vocabulary independent search
Besacier et al.	2014	Word confidence estimation for speech translation
JP2011128903A (en)	2011-06-30	Sequence signal retrieval device and sequence signal retrieval method
Hsieh et al.	2006	Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)
Turunen et al.	2008	Speech retrieval from unsegmented Finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval
Can et al.	2009	Web derived pronunciations for spoken term detection
Chien et al.	2000	A spoken‐access approach for Chinese text and speech information retrieval