[go: up one dir, main page]

Haeb-Umbach et al., 2005 - Google Patents

Speech processing in the networked home environment-a view on the amigo project.

Haeb-Umbach et al., 2005

View PDF
Document ID
7362859658348806634
Author
Haeb-Umbach R
Kladis B
Schmalenstroeer J
Publication year
Publication venue
INTERSPEECH

External Links

Snippet

Full interoperability of networked devices in the home has been kind of an elusive concept for quite some years. Amigo, an Integrated Project within the EU 6-th framework program, tries to make home networking a reality by addressing two key issues: First, it brings together …
Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Similar Documents

Publication Publication Date Title
JP6752870B2 (en) Methods and systems for controlling artificial intelligence devices using multiple wake words
US12170087B1 (en) Altering audio to improve automatic speech recognition
CN107112014B (en) Application focus in speech-based systems
KR102543693B1 (en) Electronic device and operating method thereof
CN108351872B (en) Method and system for responding to user speech
US9672812B1 (en) Qualifying trigger expressions in speech-based systems
US9087520B1 (en) Altering audio based on non-speech commands
KR102209092B1 (en) Method and system for controlling artificial intelligence device using plurality wake up word
JP2018190413A (en) Method and system for processing user command to adjust and provide operation of device and content provision range by grasping presentation method of user speech
KR102550030B1 (en) Adjustment of audio devices
CN105793923A (en) Local and remote speech processing
KR20050055776A (en) Controlling an apparatus based on speech
JP6920398B2 (en) Continuous conversation function in artificial intelligence equipment
CN109144458B (en) Electronic device for performing operation corresponding to voice input
US12020707B2 (en) Response orchestrator for natural language interface
CN113808611A (en) Audio playback method, device, computer-readable storage medium, and electronic device
JP2016206646A (en) Voice reproduction method, voice interactive device, and voice interactive program
US9805721B1 (en) Signaling voice-controlled devices
US20220261218A1 (en) Electronic device including speaker and microphone and method for operating the same
US12114075B1 (en) Object selection in computer vision
Haeb-Umbach et al. Speech processing in the networked home environment-a view on the amigo project.
US12190877B1 (en) Device arbitration for speech processing
Panek et al. Challenges in adopting speech control for assistive robots
US20240395257A1 (en) Concurrency rules for network microphone devices having multiple voice assistant services
US12198690B1 (en) Voice-based content attribution for speech processing applications