Moritz et al., 2011 - Google Patents
Ambient voice control for a personal activity and household assistantMoritz et al., 2011
View PDF- Document ID
- 15939910307125502074
- Author
- Moritz N
- Goetze S
- Appell J
- Publication year
- Publication venue
- Ambient Assisted Living: 4. AAL-Kongress 2011, Berlin, Germany, January 25–26, 2011
External Links
Snippet
Technologies for ambient assisted living (AAL) are used to increase the quality of life of older or impaired persons. This contribution discusses the utilization of automatic speech recognition (ASR) as a natural interface for control of assistive technologies in everyday life …
- 230000000694 effects 0 title description 13
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zmolikova et al. | Neural target speech extraction: An overview | |
KR102694487B1 (en) | Systems and methods supporting selective listening | |
US10957337B2 (en) | Multi-microphone speech separation | |
Wang et al. | Sequential multi-frame neural beamforming for speech separation and enhancement | |
Wölfel et al. | Distant speech recognition | |
Nakadai et al. | Design and Implementation of Robot Audition System'HARK'—Open Source Software for Listening to Three Simultaneous Speakers | |
US20230164509A1 (en) | System and method for headphone equalization and room adjustment for binaural playback in augmented reality | |
Valin et al. | Robust recognition of simultaneous speech by a mobile robot | |
Ravanelli et al. | The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments | |
Potamitis et al. | An integrated system for smart-home control of appliances based on remote speech interaction. | |
Richard et al. | Audio signal processing in the 21st century: The important outcomes of the past 25 years | |
Ravanelli et al. | On the selection of the impulse responses for distant-speech recognition based on contaminated speech training. | |
Matassoni et al. | The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones. | |
Moritz et al. | Ambient voice control for a personal activity and household assistant | |
Abel et al. | Novel two-stage audiovisual speech filtering in noisy environments | |
Gogate et al. | Av speech enhancement challenge using a real noisy corpus | |
Nakadai et al. | A robot referee for rock-paper-scissors sound games | |
JPWO2022023417A5 (en) | ||
Okuno et al. | Robot audition: Missing feature theory approach and active audition | |
Giannakopoulos et al. | A practical, real-time speech-driven home automation front-end | |
Nishimura et al. | Speech recognition for a humanoid with motor noise utilizing missing feature theory | |
Takiguchi et al. | Human-robot interface using system request utterance detection based on acoustic features | |
Xu et al. | Personalized dereverberation of speech | |
JP7690138B2 (en) | A microphone array-invariant, streaming, multi-channel, neural enhancement front-end for automatic speech recognition | |
Martínez-Colón et al. | Evaluation of a multi-speaker system for socially assistive HRI in real scenarios |