A Generic Multimodal Architecture for Integrating Voice and Ink XML Formats (Trabelsi, 2004)
- Document ID: 7609093377739192922
- Author: Trabelsi Z
- Publication year: 2004
- Publication venue: International Arab Journal of Information Technology
Snippet
The acceptance of a standard VoiceXML format has facilitated the development of voice applications, and we anticipate a similar facilitation of pen application development upon the acceptance of a standard InkXML format. In this paper we present a multimodal interface …
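As a rough illustration of the kind of markup the snippet refers to, the sketch below (not taken from the paper) builds a minimal VoiceXML 2.0 style dialog using Python's xml.etree.ElementTree. The form id, field name, and prompt text are invented placeholders; no InkXML counterpart is shown because, as the abstract notes, a standard InkXML format had not yet been adopted.

```python
# Minimal sketch (illustrative, not from the paper): a VoiceXML 2.0 style
# dialog document. Element names follow the W3C VoiceXML vocabulary
# (vxml, form, field, prompt); the form id, field name, and prompt text
# are hypothetical placeholders.
import xml.etree.ElementTree as ET

vxml = ET.Element("vxml", version="2.0", xmlns="http://www.w3.org/2001/vxml")
form = ET.SubElement(vxml, "form", id="ask_city")
field = ET.SubElement(form, "field", name="city")
prompt = ET.SubElement(field, "prompt")
prompt.text = "Which city would you like the weather for?"

# Serialize the dialog markup that a VoiceXML interpreter would execute.
print(ET.tostring(vxml, encoding="unicode"))
```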
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
        - G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
        - G06F3/16—Sound input; Sound output
          - G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/20—Handling natural language data
          - G06F17/27—Automatic analysis, e.g. parsing
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L13/00—Speech synthesis; Text to speech systems
      - G10L15/00—Speech recognition
        - G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
          - G10L15/065—Adaptation
            - G10L15/07—Adaptation to the speaker
        - G10L15/08—Speech classification or search
          - G10L15/18—Speech classification or search using natural language modelling
            - G10L15/1822—Parsing for meaning understanding
          - G10L2015/088—Word spotting
        - G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
          - G10L2015/226—Taking into account non-speech characteristics
            - G10L2015/228—Taking into account non-speech characteristics of application context
        - G10L15/26—Speech to text systems
          - G10L15/265—Speech recognisers specially adapted for particular applications
        - G10L15/28—Constructional details of speech recognition systems
          - G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
      - G10L17/00—Speaker identification or verification
      - G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
        - G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
Similar Documents
Publication | Title |
---|---|
Oviatt et al. | Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions |
US6415256B1 (en) | Integrated handwriting and speed recognition systems |
CA2397703C (en) | Systems and methods for abstracting portions of information that is represented with finite-state devices |
US6917920B1 (en) | Speech translation device and computer readable medium |
RU2352979C2 | Synchronous comprehension of semantic objects for highly active interface |
US8898202B2 (en) | Systems and methods for generating markup-language based expressions from multi-modal and unimodal inputs |
Roy et al. | Towards situated speech understanding: Visual context priming of language models |
US20100241431A1 | System and Method for Multi-Modal Input Synchronization and Disambiguation |
US9093072B2 (en) | Speech and gesture recognition enhancement |
JP2004355630A (en) | Semantic object synchronous understanding implemented with speech application language tag |
WO2004036939A1 (en) | Portable digital mobile communication apparatus, method for controlling speech and system |
Delgado et al. | Spoken, multilingual and multimodal dialogue systems: development and assessment |
Fellbaum et al. | Principles of electronic speech processing with applications for people with disabilities |
Pieraccini | AI assistants |
Trabelsi et al. | A voice and ink XML multimodal architecture for mobile e-commerce systems |
Furui et al. | Ubiquitous speech processing |
Gilbert et al. | Intelligent virtual agents for contact center automation |
Trabelsi et al. | Multimodal integration of voice and ink for pervasive computing |
Trabelsi | A Generic Multimodal Architecture for Integrating Voice and Ink XML Formats |
Trabelsi et al. | Multimodal integration of Voice and Ink XML formats |
Schuller et al. | Speech communication and multimodal interfaces |
Breen et al. | Voice in the user interface |
JP2007212658A (en) | Character input device |
Li et al. | Design and Implementation of a Voice Interactive Tool to Facilitate Web Collaboration |
Deng et al. | Speech and language processing for multimodal human-computer interaction |