Ngan et al., 1997 - Google Patents

Issues in generating pronunciation dictionaries for voice interfaces to spatial databases

Ngan et al., 1997

Document ID: 5253611815196503142
Author: Ngan J; Picone J
Publication year: 1997
Publication venue: Proceedings IEEE SOUTHEASTCON'97.'Engineering the New Century'

External Links

Cited by

Snippet

ln speech recognition research, increasing emphasis has been placed on generating pronunciation dictionaries for spontaneous human-computer interactions. We present a review of our strategies for developing lexicons for three distinct voice interfaces:(1) …

Continue reading at isip.piconepress.com (PDF) (other versions)

230000015572 biosynthetic process 0 abstract description 6

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules

Similar Documents

Publication	Publication Date	Title
EP0262938B1 (en)	1993-12-15	Language translation system
Bulyko et al.	2002	A bootstrapping approach to automating prosodic annotation for limited-domain synthesis
US8069045B2 (en)	2011-11-29	Hierarchical approach for the statistical vowelization of Arabic text
Dutoit	1997	High-quality text-to-speech synthesis: An overview
US5781884A (en)	1998-07-14	Grapheme-to-phoneme conversion of digit strings using weighted finite state transducers to apply grammar to powers of a number basis
US6363342B2 (en)	2002-03-26	System for developing word-pronunciation pairs
US5384701A (en)	1995-01-24	Language translation system
Watts	2013	Unsupervised learning for text-to-speech synthesis
WO2000038083A1 (en)	2000-06-29	Method and apparatus for performing full bi-directional translation between a source language and a linked alternative language
Macchi	1998	Issues in text-to-speech synthesis
Dutoit	1999	A short introduction to text-to-speech synthesis
Kishore et al.	2003	Experiments with unit selection speech databases for Indian languages
Kim et al.	2002	Morpheme-based grapheme to phoneme conversion using phonetic patterns and morphophonemic connectivity information
Allen	2003	Reading machines for the blind: The technical problems and the methods adopted for their solution
Ngan et al.	1997	Issues in generating pronunciation dictionaries for voice interfaces to spatial databases
Chen et al.	1996	A Mandarin Text-to-Speech System
Xydas et al.	2004	Modeling prosodic structures in linguistically enriched environments
Möbius	1999	The Bell Labs German text-to-speech system
Maddieson	1991	Investigating linguistic universals
Veaux et al.	2008	IrcamCorpusTools: an extensible platform for speech corpora exploitation
Williams	1987	Word stress assignment in a text-to-speech synthesis system for British English
Draxler	1995	Introduction to the Verbmobil-PhonDat database of spoken German
Wang	1998	Statistical analysis of mandarin acoustic units and automatic extraction of phonetically rich sentences based upon a very large chinese text corpus
JPH03245192A (en)	1991-10-31	Method for determining pronunciation of foreign language word
Laws	1998	A bilingual speech interface for New Zealand English to Māori