Multimodal human-machine interaction for service robots in home-care environments
Goetze et al., 2012
- Document ID
- 16802571530479362824
- Author
- Goetze S
- Fischer S
- Moritz N
- Appell J
- Wallhoff F
- Publication year
- 2012
- Publication venue
- Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments
Snippet
This contribution focuses on multimodal interaction techniques for a mobile communication and assistance system on a robot platform. The system comprises acoustic, visual and haptic input modalities. Feedback is given to the user by a graphical user interface and a …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| US11017779B2 (en) | System and method for speech understanding via integrated audio and visual based speech recognition | |
| KR102599607B1 (en) | Dynamic and/or context-specific hot words to invoke automated assistant | |
| US11200902B2 (en) | System and method for disambiguating a source of sound based on detected lip movement | |
| KR101336641B1 (en) | Emotional Sympathy Robot Service System and Method of the Same | |
| US20190371318A1 (en) | System and method for adaptive detection of spoken language via multiple speech models | |
| Tsiourti et al. | A virtual assistive companion for older adults: design implications for a real-world application | |
| JP2019164345A (en) | System for processing sound data, user terminal and method for controlling the system | |
| JP6392374B2 (en) | Head mounted display system and method for operating head mounted display device | |
| JP6291303B2 (en) | Communication support robot system | |
| KR20210137118A (en) | Systems and methods for context-rich attentional memory networks with global and local encoding for dialogue break detection | |
| Dhanjal et al. | Tools and techniques of assistive technology for hearing impaired people | |
| Karpov et al. | A universal assistive technology with multimodal input and multimedia output interfaces | |
| KR20200143764A (en) | Emotional Sympathy Service System and Method of the Same | |
| JPWO2018105373A1 (en) | Information processing apparatus, information processing method, and information processing system | |
| Goetze et al. | Multimodal human-machine interaction for service robots in home-care environments | |
| KR20210100831A (en) | System and method for providing sign language translation service based on artificial intelligence | |
| US20250149052A1 (en) | Method for providing an artificial intelligence system with reduction of background noise | |
| JP2021051693A (en) | Utterance system, utterance recommendation device, utterance recommendation program, and utterance recommendation method | |
| Stavropoulou et al. | Voice user interfaces for service robots: Design principles and methodology | |
| JPWO2017200077A1 (en) | Dialogue method, dialogue system, dialogue apparatus, and program | |
| US11935449B2 (en) | Information processing apparatus and information processing method | |
| Sansen et al. | vAssist: building the personal assistant for dependent people: Helping dependent people to cope with technology through speech interaction | |
| Oviatt et al. | Multimodal interfaces for cell phones and mobile technology | |
| US20250322837A1 (en) | Background noise filtering system | |
| Popescu et al. | A platform that aims to help people to learn how to interact with robotic platforms |