Haeb-Umbach et al., 2005 - Google Patents

Speech processing in the networked home environment-a view on the amigo project.

Haeb-Umbach et al., 2005

Document ID: 7362859658348806634
Author: Haeb-Umbach R; Kladis B; Schmalenstroeer J
Publication year: 2005
Publication venue: INTERSPEECH

External Links

Cited by

Snippet

Full interoperability of networked devices in the home has been kind of an elusive concept for quite some years. Amigo, an Integrated Project within the EU 6-th framework program, tries to make home networking a reality by addressing two key issues: First, it brings together …

Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

238000011161 development 0 abstract description 3

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Similar Documents

Publication	Publication Date	Title
JP6752870B2 (en)	2020-09-09	Methods and systems for controlling artificial intelligence devices using multiple wake words
US12170087B1 (en)	2024-12-17	Altering audio to improve automatic speech recognition
CN107112014B (en)	2021-01-05	Application focus in speech-based systems
KR102543693B1 (en)	2023-06-16	Electronic device and operating method thereof
CN108351872B (en)	2021-09-28	Method and system for responding to user speech
US9672812B1 (en)	2017-06-06	Qualifying trigger expressions in speech-based systems
US9087520B1 (en)	2015-07-21	Altering audio based on non-speech commands
KR102209092B1 (en)	2021-01-28	Method and system for controlling artificial intelligence device using plurality wake up word
JP2018190413A (en)	2018-11-29	Method and system for processing user command to adjust and provide operation of device and content provision range by grasping presentation method of user speech
KR102550030B1 (en)	2023-07-03	Adjustment of audio devices
CN105793923A (en)	2016-07-20	Local and remote speech processing
KR20050055776A (en)	2005-06-13	Controlling an apparatus based on speech
JP6920398B2 (en)	2021-08-18	Continuous conversation function in artificial intelligence equipment
CN109144458B (en)	2021-04-27	Electronic device for performing operation corresponding to voice input
US12020707B2 (en)	2024-06-25	Response orchestrator for natural language interface
CN113808611A (en)	2021-12-17	Audio playback method, device, computer-readable storage medium, and electronic device
JP2016206646A (en)	2016-12-08	Voice reproduction method, voice interactive device, and voice interactive program
US9805721B1 (en)	2017-10-31	Signaling voice-controlled devices
US20220261218A1 (en)	2022-08-18	Electronic device including speaker and microphone and method for operating the same
US12114075B1 (en)	2024-10-08	Object selection in computer vision
Haeb-Umbach et al.	2005	Speech processing in the networked home environment-a view on the amigo project.
US12190877B1 (en)	2025-01-07	Device arbitration for speech processing
Panek et al.	2015	Challenges in adopting speech control for assistive robots
US20240395257A1 (en)	2024-11-28	Concurrency rules for network microphone devices having multiple voice assistant services
US12198690B1 (en)	2025-01-14	Voice-based content attribution for speech processing applications