Moritz et al., 2011 - Google Patents

Ambient voice control for a personal activity and household assistant

Moritz et al., 2011

Document ID: 15939910307125502074
Author: Moritz N; Goetze S; Appell J
Publication year: 2011
Publication venue: Ambient Assisted Living: 4. AAL-Kongress 2011, Berlin, Germany, January 25–26, 2011

External Links

Cited by

Snippet

Technologies for ambient assisted living (AAL) are used to increase the quality of life of older or impaired persons. This contribution discusses the utilization of automatic speech recognition (ASR) as a natural interface for control of assistive technologies in everyday life …

Continue reading at www.researchgate.net (PDF) (other versions)

230000000694 effects 0 title description 13

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition

Similar Documents

Publication	Publication Date	Title
Zmolikova et al.	2023	Neural target speech extraction: An overview
KR102694487B1 (en)	2024-08-13	Systems and methods supporting selective listening
US10957337B2 (en)	2021-03-23	Multi-microphone speech separation
Wang et al.	2021	Sequential multi-frame neural beamforming for speech separation and enhancement
Wölfel et al.	2009	Distant speech recognition
Nakadai et al.	2010	Design and Implementation of Robot Audition System'HARK'—Open Source Software for Listening to Three Simultaneous Speakers
US20230164509A1 (en)	2023-05-25	System and method for headphone equalization and room adjustment for binaural playback in augmented reality
Valin et al.	2007	Robust recognition of simultaneous speech by a mobile robot
Ravanelli et al.	2015	The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Potamitis et al.	2003	An integrated system for smart-home control of appliances based on remote speech interaction.
Richard et al.	2023	Audio signal processing in the 21st century: The important outcomes of the past 25 years
Ravanelli et al.	2014	On the selection of the impulse responses for distant-speech recognition based on contaminated speech training.
Matassoni et al.	2014	The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones.
Moritz et al.	2011	Ambient voice control for a personal activity and household assistant
Abel et al.	2014	Novel two-stage audiovisual speech filtering in noisy environments
Gogate et al.	2019	Av speech enhancement challenge using a real noisy corpus
Nakadai et al.	2008	A robot referee for rock-paper-scissors sound games
JPWO2022023417A5 (en)	2024-12-16
Okuno et al.	2011	Robot audition: Missing feature theory approach and active audition
Giannakopoulos et al.	2005	A practical, real-time speech-driven home automation front-end
Nishimura et al.	2006	Speech recognition for a humanoid with motor noise utilizing missing feature theory
Takiguchi et al.	2008	Human-robot interface using system request utterance detection based on acoustic features
Xu et al.	2023	Personalized dereverberation of speech
JP7690138B2 (en)	2025-06-09	A microphone array-invariant, streaming, multi-channel, neural enhancement front-end for automatic speech recognition
Martínez-Colón et al.	2020	Evaluation of a multi-speaker system for socially assistive HRI in real scenarios