Di Fabbrizio et al., 1999 - Google Patents

Extending a standard-based IP and computer telephony platform to support multi-modal services

Document ID: 13699564803214153036
Author: Di Fabbrizio G; Kamm C; Ruscitti P; Narayanan S; Buntschuh B; Abella A; Hubbell J; Wright J
Publication year: 1999
Publication venue: Workshop on Interactive Dialogue in Multi-modal Systems

Snippet

Despite recent advances in Computer Telephony (CT) and IP Telephony (IPT) standards at defining flexible architectures to support new technologies, the current CT paradigm does not adequately support the requirements of advanced spoken dialogue systems. This paper …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services, time announcement
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4938—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services, time announcement
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4936—Speech interaction details
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/42204—Arrangements at the exchange for service or number selection by voice
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output

Similar Documents

Publication	Publication Date	Title
US7739117B2 (en)	2010-06-15	Method and system for voice-enabled autofill
US8706500B2 (en)	2014-04-22	Establishing a multimodal personality for a multimodal application
RU2349969C2 (en)	2009-03-20	Synchronous understanding of semantic objects realised by means of tags of speech application
RU2352979C2 (en)	2009-04-20	Synchronous comprehension of semantic objects for highly active interface
US8374874B2 (en)	2013-02-12	Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction
US9292183B2 (en)	2016-03-22	Establishing a preferred mode of interaction between a user and a multimodal application
US8069047B2 (en)	2011-11-29	Dynamically defining a VoiceXML grammar in an X+V page of a multimodal application
US8781840B2 (en)	2014-07-15	Retrieval and presentation of network service results for mobile device using a multimodal browser
US8909532B2 (en)	2014-12-09	Supporting multi-lingual user interaction with a multimodal application
KR100561228B1 (en)	2006-03-15	Method for converting VoiceXML document to XML Plus Voice document and multi-modal service system using the same
US8086463B2 (en)	2011-12-27	Dynamically generating a vocal help prompt in a multimodal application
US8843376B2 (en)	2014-09-23	Speech-enabled web content searching using a multimodal browser
US20020169604A1 (en)	2002-11-14	System, method and computer program product for genre-based grammars and acoustic models in a speech recognition framework
US20080208586A1 (en)	2008-08-28	Enabling Natural Language Understanding In An X+V Page Of A Multimodal Application
US20080228495A1 (en)	2008-09-18	Enabling Dynamic VoiceXML In An X+V Page Of A Multimodal Application
US20080208594A1 (en)	2008-08-28	Effecting Functions On A Multimodal Telephony Device
EP1215656B1 (en)	2005-06-15	Idiom handling in voice service systems
US20090271199A1 (en)	2009-10-29	Records Disambiguation In A Multimodal Application Operating On A Multimodal Device
US20030055651A1 (en)	2003-03-20	System, method and computer program product for extended element types to enhance operational characteristics in a voice portal
Di Fabbrizio et al.	1999	Extending a standard-based IP and computer telephony platform to support multi-modal services
Rouillard	2007	Web services and speech-based applications around VoiceXML.
US6662157B1 (en)	2003-12-09	Speech recognition system for database access through the use of data domain overloading of grammars
Di Fabbrizio et al.	2000	Unifying conversational multimedia interfaces for accessing network services across communication devices
Narayanan et al.	2000	Effects of dialog initiative and multi-modal presentation strategies on large directory information access.
Rouillard	2006	Web services and speech-based applications