Kamm, 1995 - Google Patents
User interfaces for voice applications.Kamm, 1995
View PDF- Document ID
- 4255455341810048549
- Author
- Kamm C
- Publication year
- Publication venue
- Proceedings of the National Academy of Sciences
External Links
Snippet
This paper discusses some of the aspects of task requirements, user expectations, and technological capabilities that influence the design of a voice interface and then identifies several components of user interfaces that are particularly critical in successful voice …
- 230000003993 interaction 0 description 47
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kamm | User interfaces for voice applications. | |
US7139706B2 (en) | System and method of developing automatic speech recognition vocabulary for voice activated services | |
US6173266B1 (en) | System and method for developing interactive speech applications | |
US8457973B2 (en) | Menu hierarchy skipping dialog for directed dialog speech recognition | |
Hone et al. | Designing habitable dialogues for speech-based interaction with computers | |
Marx et al. | Putting people first: Specifying proper names in speech interfaces | |
Kamm et al. | Design issues for interfaces using voice input | |
Lai et al. | Conversational speech interfaces and technologies | |
Lamel | Spoken language dialog system development and evaluation at LIMSI | |
Schnelle-Walka | A pattern language for error management in voice user interfaces | |
Hayes et al. | An anatomy of graceful interaction in spoken and written man-machine communication | |
Lehtinen et al. | IDAS: Interactive directory assistance service | |
López-Cózar et al. | Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control. | |
Williams | Dialogue Management in a mixed-initiative, cooperative, spoken language system | |
Kloosterman | Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors | |
Wilpon | Voice-processing technologies--their application in telecommunications. | |
Wattenbarger et al. | Serving Customers With Automatic Speech Recognition—Human‐Factors Issues | |
Sharman | Speech interfaces for computer systems: Problems and potential | |
Thymé-Gobbel et al. | Choosing Strategies to Recover from Miscommunication | |
Stewart et al. | Transition relevance place: a proposal for adaptive user interface in natural language dialog management systems | |
Alvarez-Cercadillo et al. | The natural language processing module for a voice assisted operator at Telefonica I+ D | |
Larson | Voice user interface design for novice and experienced users | |
Schmandt | Putting People First: Specifying Proper Names in Speech Interfaces | |
Epstein et al. | Data mining to support human-machine dialogue for autonomous agents | |
Treumuth | A Framework for Asynchronous Dialogue Systems |