Crook et al., 2010 - Google Patents
Handling user interruptions in an embodied conversational agentCrook et al., 2010
View PDF- Document ID
- 16547788767991618677
- Author
- Crook N
- Smith C
- Cavazza M
- Pulman S
- Moore R
- Boye J
- Publication year
- Publication venue
- Proceedings of the AAMAS International Workshop on Interacting with ECAs as Virtual Characters
External Links
Snippet
We present a mechanism for handling “barge-in” interruptions from a user who is engaged in a'social'conversation with an Embodied Conversational Agent (ECA). The ECA is designed to recognise and be empathetic to the emotional state of the user. Occasionally …
- 230000002996 emotional 0 abstract description 16
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Taking into account non-speech caracteristics
- G10L2015/228—Taking into account non-speech caracteristics of application context
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10269374B2 (en) | Rating speech effectiveness based on speaking mode | |
Cassell et al. | Turn taking versus discourse structure | |
US7580908B1 (en) | System and method providing utility-based decision making about clarification dialog given communicative uncertainty | |
EP2849177B1 (en) | System and method of text zoning | |
Raux et al. | Optimizing the turn-taking behavior of task-oriented spoken dialog systems | |
McTear et al. | Conversational interfaces: Past and present | |
WO2020227557A1 (en) | Method, system and apparatus for understanding and generating human conversational cues | |
Alonso-Martín et al. | Integration of a voice recognition system in a social robot | |
Wilks et al. | Some background on dialogue management and conversational speech for dialogue systems | |
JP2017016566A (en) | Information processing device, information processing method and program | |
Torres et al. | Modeling gaze behavior as a function of discourse structure | |
WO2018163646A1 (en) | Dialogue method, dialogue system, dialogue device, and program | |
van Turnhout et al. | Identifying the intended addressee in mixed human-human and human-computer interaction from non-verbal features | |
KR20230007502A (en) | Hotword-free preemption of automated assistant response presentations | |
JP6712754B2 (en) | Discourse function estimating device and computer program therefor | |
WO2017200079A1 (en) | Dialog method, dialog system, dialog device, and program | |
Raux | Flexible turn-taking for spoken dialog systems | |
Crook et al. | Handling user interruptions in an embodied conversational agent | |
Paetzel-Prüsmann et al. | Improving a Robot's Turn-Taking Behavior in Dynamic Multiparty Interactions | |
KR20100081534A (en) | Multilingual dialogue system and method thereof | |
Chowdhury et al. | A deep learning approach to modeling competitiveness in spoken conversations | |
Jacoby et al. | Human latency conversational turns for spoken avatar systems | |
Chowdhury et al. | The role of speakers and context in classifying competition in overlapping speech | |
KR20210051523A (en) | Dialogue system by automatic domain classfication | |
Huang et al. | Making virtual conversational agent aware of the addressee of users' utterances in multi-user conversation using nonverbal information |