Speech processing in the networked home environment - a view on the Amigo project. Haeb-Umbach et al., 2005
- Document ID
- 7362859658348806634
- Author
- Haeb-Umbach R
- Kladis B
- Schmalenstroeer J
- Publication year
- 2005
- Publication venue
- INTERSPEECH
Snippet
Full interoperability of networked devices in the home has been an elusive concept for quite some years. Amigo, an Integrated Project within the EU 6th Framework Programme, tries to make home networking a reality by addressing two key issues: First, it brings together …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
Similar Documents
Publication | Title | Publication Date |
---|---|---|
JP6752870B2 (en) | Methods and systems for controlling artificial intelligence devices using multiple wake words | |
US12170087B1 (en) | Altering audio to improve automatic speech recognition | |
CN107112014B (en) | Application focus in speech-based systems | |
KR102543693B1 (en) | Electronic device and operating method thereof | |
CN108351872B (en) | Method and system for responding to user speech | |
US9672812B1 (en) | Qualifying trigger expressions in speech-based systems | |
US9087520B1 (en) | Altering audio based on non-speech commands | |
KR102209092B1 (en) | Method and system for controlling artificial intelligence device using plurality wake up word | |
JP2018190413A (en) | Method and system for processing user command to adjust and provide operation of device and content provision range by grasping presentation method of user speech | |
KR102550030B1 (en) | Adjustment of audio devices | |
CN105793923A (en) | Local and remote speech processing | |
KR20050055776A (en) | Controlling an apparatus based on speech | |
JP6920398B2 (en) | Continuous conversation function in artificial intelligence equipment | |
CN109144458B (en) | Electronic device for performing operation corresponding to voice input | |
US12020707B2 (en) | Response orchestrator for natural language interface | |
CN113808611A (en) | Audio playback method, device, computer-readable storage medium, and electronic device | |
JP2016206646A (en) | Voice reproduction method, voice interactive device, and voice interactive program | |
US9805721B1 (en) | Signaling voice-controlled devices | |
US20220261218A1 (en) | Electronic device including speaker and microphone and method for operating the same | |
US12114075B1 (en) | Object selection in computer vision | |
Haeb-Umbach et al. | Speech processing in the networked home environment - a view on the Amigo project. | |
US12190877B1 (en) | Device arbitration for speech processing | |
Panek et al. | Challenges in adopting speech control for assistive robots | |
US20240395257A1 (en) | Concurrency rules for network microphone devices having multiple voice assistant services | |
US12198690B1 (en) | Voice-based content attribution for speech processing applications |