Spiegel et al., 1997 - Google Patents
Applying speech synthesis to user interfacesSpiegel et al., 1997
- Document ID
- 9951908836425554005
- Author
- Spiegel M
- Streeter L
- Publication year
- Publication venue
- Handbook of human-computer interaction
External Links
Snippet
Publisher Summary This chapter describes speech interfaces in terms of the enhancing properties of the acoustic medium such as the ability to perform tasks requiring divided attention; speech interfaces are also described in terms of shortcomings of speech such as …
- 230000002194 synthesizing 0 title abstract description 131
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zue et al. | Conversational interfaces: Advances and challenges | |
| Schultz et al. | Multilingual speech processing | |
| Syrdal et al. | Automatic ToBI prediction and alignment to speed manual labeling of prosody | |
| Scarborough | Coarticulation and the structure of the lexicon | |
| Gibbon et al. | Spoken language system and corpus design | |
| Alghamdi et al. | Saudi accented Arabic voice bank | |
| Delgado et al. | Spoken, multilingual and multimodal dialogue systems: development and assessment | |
| Fellbaum et al. | Principles of electronic speech processing with applications for people with disabilities | |
| Badino et al. | Language independent phoneme mapping for foreign TTS. | |
| Kiesling et al. | The variation in conversation (ViC) project: Creation of the Buckeye Corpus of Conversational Speech | |
| Ronzhin et al. | Russian voice interface | |
| Vonessen et al. | Comparing perception of L1 and L2 English by human listeners and machines: Effect of interlocutor adaptations | |
| Campbell | Evaluation of speech synthesis: from reading machines to talking machines | |
| Tomokiyo | Recognizing non-native speech: characterizing and adapting to non-native usage in LVCSR | |
| Bell et al. | Child and adult speaker adaptation during error resolution in a publicly available spoken dialogue system. | |
| Spiegel et al. | Applying speech synthesis to user interfaces | |
| Trouvain et al. | Speech synthesis: text-to-speech conversion and artificial voices | |
| Mihajlik et al. | Is spoken Hungarian low-resource?: A quantitative survey of Hungarian speech data sets | |
| Levow | Adaptations in spoken corrections: Implications for models of conversational speech | |
| Zhao | Speech-recognition technology in health care and special-needs assistance [Life Sciences] | |
| Sefara et al. | The development of local synthetic voices for an automatic pronunciation assistant | |
| Shahid et al. | Subjective testing of urdu text-to-speech (tts) system | |
| Marasek et al. | Multi-level annotation in SpeeCon Polish speech database | |
| Amazouz | Linguistic and phonetic investigations of French-Algerian Arabic code-switching: Large corpus studies using automatic speech processing | |
| Syrdal et al. | Text-to-speech systems |