Towards efficient human machine speech communication: The speech graffiti project
Tomko et al., 2005
- Document ID
- 7398295645899791651
- Author
- Tomko S
- Harris T
- Toth A
- Sanders J
- Rudnicky A
- Rosenfeld R
- Publication year
- 2005
- Publication venue
- ACM Transactions on Speech and Language Processing (TSLP)
Snippet
This research investigates the design and performance of the Speech Graffiti interface for spoken interaction with simple machines. Speech Graffiti is a standardized interface designed to address issues inherent in the current state-of-the-art in spoken dialog systems …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN110730953B (en) | Method and system for customizing interactive dialogue application based on content provided by creator | |
KR101042119B1 (en) | Voice understanding system, and computer readable recording media | |
Cohen et al. | Voice user interface design | |
Yankelovich | How do users know what to say? | |
US8645122B1 (en) | Method of handling frequently asked questions in a natural language dialog service | |
Klemmer et al. | Suede: a wizard of oz prototyping tool for speech user interfaces | |
JP4768969B2 (en) | Understanding synchronization semantic objects for advanced interactive interfaces | |
US7869998B1 (en) | Voice-enabled dialog system | |
McTear et al. | Voice application development for Android | |
Tomko et al. | Towards efficient human machine speech communication: The speech graffiti project | |
Ramakrishnan et al. | Mixed-initiative interaction = mixed computation | |
Abbott et al. | Voice enabling Web applications: VoiceXML and beyond | |
Skidmore | Incremental disfluency detection for spoken learner English | |
McTear | Rule-based dialogue systems: Architecture, methods, and tools | |
McGraw | Crowd-supervised training of spoken language systems | |
Di Fabbrizio et al. | AT&T help desk | |
Biermann et al. | A voice- and touch-driven natural language editor and its performance | |
US12243517B1 (en) | Utterance endpointing in task-oriented conversational systems | |
Karat et al. | Speech user interface evolution | |
Karat et al. | Speech and language interfaces, applications, and technologies | |
Tijerina | Talk Code-y To Me: An analysis of speech to text systems for consideration of use in writing software | |
Chuu | LIESHOU: A Mandarin conversational task agent for the Galaxy-II architecture | |
Rupitz et al. | Development of an Amazon Alexa App for a University Online Search | |
Miyazaki | Discussion board system with modality variation: From multi-modality to user freedom | |
Damper et al. | Experiences of usability evaluation of the IMAGINE speech-based interaction system |