US20050229185A1 - Method and system for navigating applications - Google Patents
Method and system for navigating applications Download PDFInfo
- Publication number
- US20050229185A1 US20050229185A1 US10/783,832 US78383204A US2005229185A1 US 20050229185 A1 US20050229185 A1 US 20050229185A1 US 78383204 A US78383204 A US 78383204A US 2005229185 A1 US2005229185 A1 US 2005229185A1
- Authority
- US
- United States
- Prior art keywords
- grammar
- application
- set forth
- applications
- entry point
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 39
- 239000002131 composite material Substances 0.000 claims abstract description 34
- 238000004891 communication Methods 0.000 claims abstract description 27
- 238000012545 processing Methods 0.000 claims abstract description 15
- 239000000470 constituent Substances 0.000 claims description 24
- 230000004044 response Effects 0.000 claims description 12
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 238000013507 mapping Methods 0.000 claims 1
- 230000001413 cellular effect Effects 0.000 description 8
- 230000006870 function Effects 0.000 description 6
- 230000005236 sound signal Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000013515 script Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B66—HOISTING; LIFTING; HAULING
- B66B—ELEVATORS; ESCALATORS OR MOVING WALKWAYS
- B66B5/00—Applications of checking, fault-correcting, or safety devices in elevators
- B66B5/0006—Monitoring devices or performance analysers
- B66B5/0018—Devices monitoring the operating condition of the elevator system
- B66B5/0031—Devices monitoring the operating condition of the elevator system for safety reasons
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B66—HOISTING; LIFTING; HAULING
- B66B—ELEVATORS; ESCALATORS OR MOVING WALKWAYS
- B66B5/00—Applications of checking, fault-correcting, or safety devices in elevators
- B66B5/0006—Monitoring devices or performance analysers
- B66B5/0012—Devices monitoring the users of the elevator system
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4938—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Definitions
- the present invention relates generally to accessing applications over communication systems and, more particularly, to navigating between applications.
- cellular service providers have expanded the services they provide from basic telephony to include access to a wide range of other applications. For example, voice mail and conference calling applications may be provided to subscribers.
- content-driven applications such as scheduling, news, sports, weather, and financial applications, may also be made available to subscribers. Such applications typically allow the subscriber to access personal or third party content, such as appointments, news stories, sports scores, weather forecasts, and stock prices.
- some or all of the available applications may be location-based or may utilize location information in their operation to increase the ease of use.
- techniques for accessing these applications such as a list of telephone or access numbers or an audio or video application menu, may be provided by the service provider to facilitate navigation among the available applications.
- a signal processor may be configured to receive a token selected based upon a composite grammar.
- the token corresponds to an entry point for one of a plurality of applications.
- the signal processor may also be configured to access the respective application at the entry point.
- the communications system may include a telephony server configured to receive a modulated signal correlative to an audio command and to analyze the modulated signal to identify a constituent of a composite grammar.
- the telephony server may also be configured to select a token corresponding to the constituent.
- the communications system may also include a browser module configured to acquire the token and to access an entry point for one of a plurality of applications based upon the token.
- a method for accessing an application may include the act of processing a signal to identify an audio code as a constituent of a composite grammar.
- the method may include the act of accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- a tangible computer-readable medium may include programming instructions stored on the computer-readable medium for processing a signal to identify an audio code as a constituent of a composite grammar.
- the medium may also include programming instructions stored on the computer-readable medium for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- a method for manufacturing a tangible computer medium may include the act of storing, on a computer-readable medium, programming instructions for identifying an audio code as a constituent of a composite grammar.
- the method may also include the act of storing, on the computer readable medium, programming instructions for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- a method for manufacturing a telephony system may include the act of providing at least one signal processing device programmed to identify an audio code as a constituent of a composite grammar.
- the signal processing device may be programmed to access an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- FIG. 1 discloses an exemplary embodiment of a communications system configured to access two or more applications in accordance with the present invention
- FIG. 2 discloses an exemplary grammar overview in accordance with the present invention.
- FIG. 3 illustrates a flow chart depicting navigation between two exemplary applications in accordance with the present invention.
- the various applications offered by a service provider may be constructed and/or supported by multiple third party vendors.
- the third party vendors may actively compete with one another or may simply be unaware or indifferent to other applications.
- applications provided by the third party vendors are typically insular, with each application containing a stand-alone user interface.
- a subscriber wishing to navigate between applications must typically complete the transaction with a first application and exit the first application. The subscriber may then initiate a second application, complete the transaction with the second application, and exit the second application before initiating a third application, and so forth.
- subscribers typically must navigate through the various layers or scripts of an application to reach the function or content they seek. For example, to access a weather forecast for a city, a subscriber may have to exit from a first application, initiate the weather application, respond to a prompt to name a city, respond to a prompt to state the desired information, i.e., tomorrow's forecast, and then exit the weather application before returning to the first application.
- the subscriber therefore, may be subjected to the tedium of exiting and initiating applications more frequently than desired and of repeatedly navigating through the initial layers of each application to reach the desired function or content. Therefore, it may be desirable to provide subscribers not only with a variety of independent applications, but also with an interface that would allow subscribers to more easily navigate between applications and/or to more easily access specific data from an application.
- the techniques disclosed herein provide a unified interface for use with multiple independent applications. Specifically, the techniques disclosed provide for the use of a common listing or vocabulary of audio codes or signals (such as spoken words or phrases or DTMF tones) that generally equate to acceptable responses for a supported bundle of applications, allowing the various applications to be navigated collectively instead of individually.
- a unified interface may allow a subscriber to move freely between applications or to access content from an application directly, without having to traverse a maze of preliminary menus and options.
- the present techniques provide for the automatic generation of an application to provide a service (herein referred to as the Main Menu) that provides descriptions of and access to a suite of bundled application.
- each application has an associated grammar that includes, at least, the various grammars associated with the documents comprising the application.
- a grammar may be thought of as a list of allowable responses or inputs for the respective construct associated with the grammar, i.e., a document or application.
- a document of an application may accept as inputs those responses defined in its associated document grammar. Therefore, in the context of VoiceXML or a similar programming language, the present technique might be implemented by establishing a root or system level grammar, which may include the allowable inputs for each application, including the Main Menu, accessible from the respective root or system level.
- root grammar described provides a global vocabulary or navigational grammar for the supported applications
- a composite grammar may be employed which, depending on the circumstances, may include only the root grammar (such as when no applications are currently accessed, i.e., at the Main Menu) or may include the root grammar as well as the application grammar for a currently accessed application.
- the composite grammar may be implemented as a single grammar, i.e. a single grammar including both the root grammar and the application grammar of the currently accessed application.
- the composite grammar may be implemented as two separate grammars, i.e., the root grammar and current application grammar, which may be simultaneously accessed and/or functionally treated as a single grammar. Where discrepancies or duplications exist between the root grammar and the current application grammar, priority rules, such as giving precedence to the current application in the case of common or duplicative commands, may be employed.
- the composite grammar provides continuity with the application grammar while maintaining access to the root grammar, thereby providing easy access to other applications and to the Main Menu.
- a service provider may offer different packages or bundles of applications to subscribers. Different root grammars and Main Menus applications, as described above, may therefore be associated with each bundle of applications.
- the generation and updating of the various root grammars and Main Menu applications may be performed on a processor-based system, such as a unified interface server configured to query the respective application servers.
- the unified interface server may construct suitable root grammars and/or Main Menu applications by querying the appropriate application servers.
- the present techniques generally encompasses the use of a vocabulary that includes the accepted audio codes for a plurality of applications and the use of such a vocabulary to navigate between and within the various applications. In this way, the present techniques provide for navigating easily and quickly between otherwise independent applications.
- the depicted communications system 10 includes components supporting telephonic communication between parties as well as access to applications that may be provided or supported by various parties.
- the communication system 10 includes wireless support for wireless devices, such as one or more cellular telephones 12 , PDA devices, and other devices capable of audio communication.
- Wireless devices such as the cellular telephone 12 , may convert audio signals, such as speech and/or DTMF tones, to an initial modulated signal, which may be transmitted as an electromagnetic signal over an air interface.
- the signal is received by a base transceiver station, such as a cell tower 16 and associated antenna 18 .
- the cell tower 16 relays the modulated signal, which may comprise the initial modulated signal or an amplified or otherwise processed version of the initial modulated signal to a mobile switching center (MSC) 20 .
- MSC mobile switching center
- the mobile switching center 20 is the switch that serves the wireless system. It performs the function of switching calls to the appropriate destination and maintaining the connection. Indeed, a primary function of the mobile switching center 20 is to provide a voice path connection between a mobile telephone and another telephone, such as another mobile telephone or a land-line telephone.
- a typical mobile switching center 20 includes a number of devices that control switching functions, call processing, channel assignments, data interfaces, tracking, paging, call hand-off, billing, and user data bases.
- the mobile switching center 20 may transmit a modulated signal, which may comprise the relayed modulated signal or an amplified or otherwise processed version of the relayed modulated signal, to a telephony server 22 maintained by the service provider.
- the modulated signal may be transmitted from the mobile switching center 20 to the telephony server 22 via a physical line, such as a fiber optic cable or copper wire, or via a wireless transmission.
- the modulated signal may be transmitted over a T1 line using a T1 telephony standard.
- Modulated audio signals may also be sent to the telephony server 22 from a Public Switched Telephone Network (PSTN) 24 connected to a land-line phone 26 or other telephonic device.
- PSTN Public Switched Telephone Network
- the modulated audio signal may originate from a computer 28 connected to a network, such as the Internet, and employing a suitable communication protocol, such as Voice over IP (VOIP).
- VOIP Voice over IP
- the telephony server 22 receives the modulated audio signal, different operations may be performed based upon whether the received signal represents an attempt to place a phone call or a request to access an available application.
- the modulated signal may include audio codes, such as a word, a phrase, and/or DTMF tones, which may be recognized as an attempt to access an application or menu of applications.
- Recognition of the audio code may be accomplished by pattern recognition routines employing various statistical modeling techniques, such as Hidden Markov Models (HMM) or neural nets.
- HMM Hidden Markov Models
- Recognition that the received modulated signal represents an attempt to access an application may result in one or more suitable tokens being sent to a browser module 30 , and ultimately to an application server, such as one of application servers 32 , 34 , 36 .
- the respective application may transmit a data file to the browser module 30 for subsequent transmission to the originating device, such as cellular phone 12 , land-line phone 26 , or computer 28 .
- the format of the data file may correspond to the requested data.
- a voice mail application may respond to a token or token combination by transmitting an audio file corresponding to a requested voice mail to the subscriber.
- applications such as text messaging or e-mail may respond by transmitting data files corresponding to one or more text messages.
- Other applications such as web access or photograph album applications, may respond by transmitting data files corresponding to multi-media or video files.
- a suitable data file may be returned to the subscriber based on the application, the data requested, and the nature of the originating device. The subscriber may then request additional information from the application if desired.
- the token or tokens sent to the browser module 30 may be determined based upon a general vocabulary, such as a composite grammar in VoiceXML implementations, which relates recognized patterns to a respective token or tokens.
- the recognized pattern may correspond to one or more DTMF tones, one or more spoken words (such as “delete” or “temperature”), or a spoken phrase (such as “will it freeze tonight?”).
- more than one recognized pattern may correspond to the same token.
- the recognized patterns for the words “delete,” “erase,” and “remove” may all invoke the same token, and thereby the same response from the accessed application.
- the browser module 30 may receive the token or tokens and may direct and regulate interaction with the respective applications on the application servers 32 , 34 , 36 based on the token or tokens received. For example, the browser module 30 may receive one or more tokens associated with a data inquiry of an application, such as tokens corresponding to a request for the price of stock. If another application is presently active, the browser module 30 may, unseen to the user, properly exit the active application, such as by sending a suitable token, and provide the tokens to the finance application. The particular document or routine of interest in the finance application may be directly accessed by the tokens to elicit a data file containing the desired data. Alternatively, the browser module 30 may navigate preliminary documents or menus, unseen to the subscriber, to reach the document or routine associated from which the desired data may be elicited.
- tokens associated with a data inquiry of an application such as tokens corresponding to a request for the price of stock.
- the browser module 30 may, unseen to the user, properly exit the active application
- the browser module 30 may continue to direct subsequent tokens intended for the finance application, such as for additional stock quotes, to the finance application. Once the browser module 30 receives a token corresponding to a different application, the browser module may properly exit the finance application and initiate an interaction with the requested application for the requested data. Even though some audio codes may be common to more than one application (such as the spoken words “delete” or “next”) additional tokens (such as “delete” and “e-mail” or “next,” “voice,” and “mail”) in the processing string may be used by the browser module 30 to determine what application is being addressed.
- the browser module 30 may take into account whether a token (such as “next”) makes sense in the context of the currently accessed application, such as a voice mail application, in determining whether to continue communicating with an application. For example, in the context of a VoiceXML application, if the token for “next” is part of the application grammar for the current application, the browser module 30 may address that token to the current application. In this manner, communication with an application may be maintained until a new application is unequivocally addressed, such as by a token which is not in the application grammar of the current application.
- a token such as “next”
- the token or tokens to be transmitted to the browser module may be determined by reference to a common vocabulary, typically disposed on the telephony server 22 , which equates recognized patterns with appropriate tokens.
- a common vocabulary may be a composite grammar for use with applications written in VoiceXML.
- the common vocabulary may be generated, in part or in whole, by a unified interface server 37 , or other processor-based system, which may communicate with the browser module 30 to coordinate the generation and update of root grammars and/or Main Menu applications throughout the system 10 .
- the composite grammar provides a reference for equating a recognized pattern corresponding to a spoken word, spoken phrase, or DTMF tone to a semantic interpretation, i.e., a token, which may be employed in the present technique as described herein.
- a semantic interpretation i.e., a token
- the semantic interpretation may be a simple value (such as a string), a flat set of attribute-value pairs (such as a day, month, and year), or a nested object.
- the composite grammar provides a mechanism for translating a recognized word, phrase or tone into an input expected by at least one supported application.
- the application, or a document of the application may provide a desired output, such as the next voice mail message, a stock quote, a sports score, and so forth.
- the root grammar 40 may be associated with a plurality of applications and an automatically generated Main Menu application, each having a respective application grammar 48 , 50 , 52 , 53 .
- Each application grammar in turn encompasses the respective document grammars for the documents comprising that application.
- the first application grammar 48 may include at least the grammars for the documents of the first application, i.e., document ( 1 a ) grammar 54 and document ( 1 b ) grammar 56 , as depicted in FIG. 2 .
- the second application grammar 50 may include at least the document ( 2 a ) grammar 58
- the third application grammar 52 may include at least the document ( 3 a ) grammar 60 , the document ( 3 b ) grammar 62 , and the document ( 3 c ) grammar 64
- the Main Menu application grammar 53 may include document grammars related to help, such as document ( 4 a ) grammar 66 , and to tutorials, such as document ( 4 b ) grammar 68 .
- the browser module 30 may generate the root grammar 40 .
- the browser module 30 may query the respective applications to elicit the respective application grammars 48 , 50 , 52 , from which the root grammar 40 may be generated.
- the browser module 30 may, in turn, publish the root grammar 40 to a platform upon which it may be queried, such as the unified interface server 37 .
- the service provider may examine the application grammars 48 , 50 , 52 to determine the components of the root grammars and submit them to the browser module 30 for publication into the root grammar 40 .
- the application providers themselves may submit the grammars elements that they deem to be applicable to be published in the root grammar 40 .
- the root grammar 40 of the present technique may include the respective application grammars 48 , 50 , 52 . Therefore, words, phrases, or tones recognized as being a constituents of the root grammar 40 may be used to select a token or token string corresponding to a respective application and document referenced by the root grammar 40 . The token or token string may, in turn, be used to access the appropriate level or document of the application directly, without having to navigate through intervening layers or documents of the application. In addition, as noted above, the root grammar may access the Main Menu application created by the grouping of the applications into a bundle.
- VoiceXML is one language that may be used to implement speech centric applications
- other standardized languages and/or proprietary languages may also be employed.
- a speech centric application recognizes words, phrases, or tones having corresponding tokens, i.e., possesses a grammar
- the present technique is applicable.
- the present technique may be useful for navigating between multiple applications where each application possesses multiple entry points, i.e., levels, documents, or sub-routines, that may be directly accessed by the proper token or tokens.
- a number of applications 70 such as the depicted voice mail application 72 and weather application 74 may be available to a subscriber.
- a root grammar 40 may be employed, allowing the subscriber to freely navigate between applications 70 . For example, the subscriber may initially wish to get a list of service that are available in the bundle by saying “what can I do” thereby eliciting the help tutorial from the Main Menu.
- the subscriber may wish to bypass the Main Menu as well as application preliminary menus.
- the user may verbally state “Miami, weather for next Friday” to elicit responsive data from the play weather document 76 of the weather application 74 . Because the statement by the subscriber contains words that are constituents of the document grammar of the play weather document 74 , otherwise intervening steps, such as the elicitation of a city at decision block 78 , may be bypassed.
- the user may wish to check the second queued message in the voice mail application 72 .
- the user may verbally state “next message” to access the next message document 82 , thereby eliciting the second message in his voice mail queue.
- the statements of the subscriber provide sufficient information, i.e., generate the necessary tokens, to directly access application entry points, such as documents in the case of VoiceXML applications, which might not ordinarily be accessed in this manner.
- application entry points such as documents in the case of VoiceXML applications, which might not ordinarily be accessed in this manner.
- entry points with respect to applications generally refers to documents, sub-routines, levels or other programming constructs that may be accessed with suitable inputs, such as tokens or other semantic interpretations, to elicit a desired response, such as a data file.
- a composite grammar 40 (consisting of the root grammar as well as the application grammar of a currently accessed application, if applicable) such as at the telephony server 22 , allows a browser module 30 to navigate between applications and between documents of applications without forcing the user to consciously exit applications or linearly navigate intervening application levels, i.e., documents.
- either or both of the telephony server 22 and the browser module 30 may be based on a signal processing unit capable of implementing some or all of the techniques described herein, such as through software, hardware, of any suitable combination thereof.
- the telephony server 22 and/or the browser module 30 may be a general purpose device, such as a general purpose computer or server, with the appropriate software programming to carry out these techniques.
- the telephony server 22 and/or the browser module 30 may use special purpose processors, hardware, and/or software to carry out these techniques.
- Example of such special purpose processors and hardware include digital signal processors, RISC processors, or application specific integrated circuits, which may be specifically adapted to perform the present techniques.
- the functions of the browser module 30 and the telephony server 22 may be combined on a single processor-based system if so desired.
- the telephony server 22 and browser module 30 may be deployed on separate general purpose computers within a telephony complex, such as an ANYPATH® telephony complex.
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Mobile Radio Communication Systems (AREA)
- Machine Translation (AREA)
Abstract
A root grammar is provided to facilitate navigation between speech centric applications in a communications network. The root grammar may be constructed from the application grammars of the supported applications. During recognition processing, the root grammar may be combined with a current application's active grammar to produce a composite grammar. Audio communications recognized as part of the composite grammar may be used to service the current application, to navigate between applications, and/or to access different parts or documents of applications directly without navigating through intervening levels. In particular, from a user's perspective, another application or document referenced by the root grammar may be accessed directly, as opposed to employing a lengthier navigation process.
Description
- 1. Field of the Invention
- The present invention relates generally to accessing applications over communication systems and, more particularly, to navigating between applications.
- 2. Description of the Related Art
- This section is intended to introduce the reader to various aspects of art that may be related to various aspects of the present invention, which are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present invention. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
- Over the past several decades, communications systems, including wire line and wireless communications systems, have steadily evolved. One example of this evolution is the continuing adoption and growing prevalence of cellular telephone systems. As one might expect, the limited number of potential cellular service subscribers has lead to competition between the various cellular service providers to protect and grow their subscriber bases.
- In response to this competition for subscribers, cellular service providers have expanded the services they provide from basic telephony to include access to a wide range of other applications. For example, voice mail and conference calling applications may be provided to subscribers. Similarly, content-driven applications, such as scheduling, news, sports, weather, and financial applications, may also be made available to subscribers. Such applications typically allow the subscriber to access personal or third party content, such as appointments, news stories, sports scores, weather forecasts, and stock prices. In addition, some or all of the available applications may be location-based or may utilize location information in their operation to increase the ease of use. As may be appreciated, techniques for accessing these applications, such as a list of telephone or access numbers or an audio or video application menu, may be provided by the service provider to facilitate navigation among the available applications.
- Certain aspects commensurate in scope with the originally claimed invention are set forth below. It should be understood that these aspects are presented merely to provide the reader with a brief summary of certain forms the invention might take and that these aspects are not intended to limit the scope of the invention. Indeed, the invention may encompass a variety of aspects that may not be set forth below.
- In accordance with one aspect of the present invention, there is provided a signal processor. The signal processor may be configured to receive a token selected based upon a composite grammar. The token corresponds to an entry point for one of a plurality of applications. The signal processor may also be configured to access the respective application at the entry point.
- In accordance with another aspect of the present invention, there is provided a communications system. The communications system may include a telephony server configured to receive a modulated signal correlative to an audio command and to analyze the modulated signal to identify a constituent of a composite grammar. The telephony server may also be configured to select a token corresponding to the constituent. The communications system may also include a browser module configured to acquire the token and to access an entry point for one of a plurality of applications based upon the token.
- In accordance with still another aspect of the present invention, there is provided a method for accessing an application. The method may include the act of processing a signal to identify an audio code as a constituent of a composite grammar. In addition, the method may include the act of accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- In accordance with a further aspect of the present invention, there is provided a tangible computer-readable medium. The medium may include programming instructions stored on the computer-readable medium for processing a signal to identify an audio code as a constituent of a composite grammar. The medium may also include programming instructions stored on the computer-readable medium for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- In accordance with another aspect of the present invention, there is provided a method for manufacturing a tangible computer medium. The method may include the act of storing, on a computer-readable medium, programming instructions for identifying an audio code as a constituent of a composite grammar. The method may also include the act of storing, on the computer readable medium, programming instructions for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- In accordance with an additional aspect of the present invention, there is provided a method for manufacturing a telephony system. The method may include the act of providing at least one signal processing device programmed to identify an audio code as a constituent of a composite grammar. In addition, the signal processing device may be programmed to access an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
- Advantages of the invention may become apparent upon reading the following detailed description and upon reference to the drawings in which:
-
FIG. 1 discloses an exemplary embodiment of a communications system configured to access two or more applications in accordance with the present invention; -
FIG. 2 discloses an exemplary grammar overview in accordance with the present invention; and -
FIG. 3 illustrates a flow chart depicting navigation between two exemplary applications in accordance with the present invention. - One or more specific embodiments of the present invention will be described below. In an effort to provide a concise description of these embodiments, not all features of an actual implementation are described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such a development effort might be complex and time consuming, but would nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.
- As competition for customers in the cellular telephone industry has increased, service providers have responded by providing additional services, such as access to a variety of applications, including voice mail, news, and weather applications. Such applications may be accessible from a single point, such as a dial tone or mailbox. To improve the accessibility and convenience of the various applications further, various speech centric interfaces may be employed, allowing the subscriber to interact with an application using voice commands or responses. The advantages of speech centric interfaces may be mitigated, however, by the lack of a common user interface between the applications.
- In particular, the various applications offered by a service provider may be constructed and/or supported by multiple third party vendors. The third party vendors may actively compete with one another or may simply be unaware or indifferent to other applications. Regardless, applications provided by the third party vendors are typically insular, with each application containing a stand-alone user interface.
- As a result, a subscriber wishing to navigate between applications must typically complete the transaction with a first application and exit the first application. The subscriber may then initiate a second application, complete the transaction with the second application, and exit the second application before initiating a third application, and so forth.
- Furthermore, subscribers typically must navigate through the various layers or scripts of an application to reach the function or content they seek. For example, to access a weather forecast for a city, a subscriber may have to exit from a first application, initiate the weather application, respond to a prompt to name a city, respond to a prompt to state the desired information, i.e., tomorrow's forecast, and then exit the weather application before returning to the first application. The subscriber, therefore, may be subjected to the tedium of exiting and initiating applications more frequently than desired and of repeatedly navigating through the initial layers of each application to reach the desired function or content. Therefore, it may be desirable to provide subscribers not only with a variety of independent applications, but also with an interface that would allow subscribers to more easily navigate between applications and/or to more easily access specific data from an application.
- The techniques disclosed herein provide a unified interface for use with multiple independent applications. Specifically, the techniques disclosed provide for the use of a common listing or vocabulary of audio codes or signals (such as spoken words or phrases or DTMF tones) that generally equate to acceptable responses for a supported bundle of applications, allowing the various applications to be navigated collectively instead of individually. In particular, a unified interface may allow a subscriber to move freely between applications or to access content from an application directly, without having to traverse a maze of preliminary menus and options. In addition, the present techniques provide for the automatic generation of an application to provide a service (herein referred to as the Main Menu) that provides descriptions of and access to a suite of bundled application.
- In the context of applications written in VoiceXML, each application has an associated grammar that includes, at least, the various grammars associated with the documents comprising the application. In the context of programming languages such as VoiceXML, a grammar may be thought of as a list of allowable responses or inputs for the respective construct associated with the grammar, i.e., a document or application. In other words, a document of an application may accept as inputs those responses defined in its associated document grammar. Therefore, in the context of VoiceXML or a similar programming language, the present technique might be implemented by establishing a root or system level grammar, which may include the allowable inputs for each application, including the Main Menu, accessible from the respective root or system level.
- While the root grammar described provides a global vocabulary or navigational grammar for the supported applications, in practice it may be desirable to supplement such a root grammar based on current circumstances, such as when the subscriber is accessing an application. For example a composite grammar may be employed which, depending on the circumstances, may include only the root grammar (such as when no applications are currently accessed, i.e., at the Main Menu) or may include the root grammar as well as the application grammar for a currently accessed application. The composite grammar may be implemented as a single grammar, i.e. a single grammar including both the root grammar and the application grammar of the currently accessed application. Alternatively, the composite grammar may be implemented as two separate grammars, i.e., the root grammar and current application grammar, which may be simultaneously accessed and/or functionally treated as a single grammar. Where discrepancies or duplications exist between the root grammar and the current application grammar, priority rules, such as giving precedence to the current application in the case of common or duplicative commands, may be employed. The composite grammar provides continuity with the application grammar while maintaining access to the root grammar, thereby providing easy access to other applications and to the Main Menu.
- Furthermore, as will be appreciated by those of ordinary skill in the art, a service provider may offer different packages or bundles of applications to subscribers. Different root grammars and Main Menus applications, as described above, may therefore be associated with each bundle of applications. The generation and updating of the various root grammars and Main Menu applications may be performed on a processor-based system, such as a unified interface server configured to query the respective application servers. In this way, the unified interface server may construct suitable root grammars and/or Main Menu applications by querying the appropriate application servers.
- While the use of a root grammar in a Voice XML context is one possible implementation of the present technique, one of ordinary skill in the art will appreciate that other implementations are possible. Indeed, the present techniques generally encompasses the use of a vocabulary that includes the accepted audio codes for a plurality of applications and the use of such a vocabulary to navigate between and within the various applications. In this way, the present techniques provide for navigating easily and quickly between otherwise independent applications.
- Turning now to the drawings, and referring initially to
FIG. 1 , an example of a communications system, indicated generally byreference numeral 10, is provided. The depictedcommunications system 10 includes components supporting telephonic communication between parties as well as access to applications that may be provided or supported by various parties. For example, thecommunication system 10 includes wireless support for wireless devices, such as one or morecellular telephones 12, PDA devices, and other devices capable of audio communication. Wireless devices, such as thecellular telephone 12, may convert audio signals, such as speech and/or DTMF tones, to an initial modulated signal, which may be transmitted as an electromagnetic signal over an air interface. The signal is received by a base transceiver station, such as acell tower 16 and associatedantenna 18. Thecell tower 16 relays the modulated signal, which may comprise the initial modulated signal or an amplified or otherwise processed version of the initial modulated signal to a mobile switching center (MSC) 20. - The
mobile switching center 20 is the switch that serves the wireless system. It performs the function of switching calls to the appropriate destination and maintaining the connection. Indeed, a primary function of themobile switching center 20 is to provide a voice path connection between a mobile telephone and another telephone, such as another mobile telephone or a land-line telephone. A typicalmobile switching center 20 includes a number of devices that control switching functions, call processing, channel assignments, data interfaces, tracking, paging, call hand-off, billing, and user data bases. - As part of its operation, the
mobile switching center 20 may transmit a modulated signal, which may comprise the relayed modulated signal or an amplified or otherwise processed version of the relayed modulated signal, to atelephony server 22 maintained by the service provider. The modulated signal may be transmitted from themobile switching center 20 to thetelephony server 22 via a physical line, such as a fiber optic cable or copper wire, or via a wireless transmission. For example, the modulated signal may be transmitted over a T1 line using a T1 telephony standard. - Modulated audio signals may also be sent to the
telephony server 22 from a Public Switched Telephone Network (PSTN) 24 connected to a land-line phone 26 or other telephonic device. Similarly, the modulated audio signal may originate from acomputer 28 connected to a network, such as the Internet, and employing a suitable communication protocol, such as Voice over IP (VOIP). - Once the
telephony server 22 receives the modulated audio signal, different operations may be performed based upon whether the received signal represents an attempt to place a phone call or a request to access an available application. For example, the modulated signal may include audio codes, such as a word, a phrase, and/or DTMF tones, which may be recognized as an attempt to access an application or menu of applications. Recognition of the audio code may be accomplished by pattern recognition routines employing various statistical modeling techniques, such as Hidden Markov Models (HMM) or neural nets. - Recognition that the received modulated signal represents an attempt to access an application may result in one or more suitable tokens being sent to a
browser module 30, and ultimately to an application server, such as one ofapplication servers browser module 30 for subsequent transmission to the originating device, such ascellular phone 12, land-line phone 26, orcomputer 28. The format of the data file may correspond to the requested data. For example, a voice mail application may respond to a token or token combination by transmitting an audio file corresponding to a requested voice mail to the subscriber. Similarly, applications such as text messaging or e-mail may respond by transmitting data files corresponding to one or more text messages. Other applications, such as web access or photograph album applications, may respond by transmitting data files corresponding to multi-media or video files. In other words, a suitable data file may be returned to the subscriber based on the application, the data requested, and the nature of the originating device. The subscriber may then request additional information from the application if desired. - The token or tokens sent to the
browser module 30 may be determined based upon a general vocabulary, such as a composite grammar in VoiceXML implementations, which relates recognized patterns to a respective token or tokens. The recognized pattern may correspond to one or more DTMF tones, one or more spoken words (such as “delete” or “temperature”), or a spoken phrase (such as “will it freeze tonight?”). Furthermore, more than one recognized pattern may correspond to the same token. For example, the recognized patterns for the words “delete,” “erase,” and “remove” may all invoke the same token, and thereby the same response from the accessed application. - In general, the
browser module 30 may receive the token or tokens and may direct and regulate interaction with the respective applications on theapplication servers browser module 30 may receive one or more tokens associated with a data inquiry of an application, such as tokens corresponding to a request for the price of stock. If another application is presently active, thebrowser module 30 may, unseen to the user, properly exit the active application, such as by sending a suitable token, and provide the tokens to the finance application. The particular document or routine of interest in the finance application may be directly accessed by the tokens to elicit a data file containing the desired data. Alternatively, thebrowser module 30 may navigate preliminary documents or menus, unseen to the subscriber, to reach the document or routine associated from which the desired data may be elicited. - The
browser module 30 may continue to direct subsequent tokens intended for the finance application, such as for additional stock quotes, to the finance application. Once thebrowser module 30 receives a token corresponding to a different application, the browser module may properly exit the finance application and initiate an interaction with the requested application for the requested data. Even though some audio codes may be common to more than one application (such as the spoken words “delete” or “next”) additional tokens (such as “delete” and “e-mail” or “next,” “voice,” and “mail”) in the processing string may be used by thebrowser module 30 to determine what application is being addressed. Furthermore, thebrowser module 30 may take into account whether a token (such as “next”) makes sense in the context of the currently accessed application, such as a voice mail application, in determining whether to continue communicating with an application. For example, in the context of a VoiceXML application, if the token for “next” is part of the application grammar for the current application, thebrowser module 30 may address that token to the current application. In this manner, communication with an application may be maintained until a new application is unequivocally addressed, such as by a token which is not in the application grammar of the current application. - As noted above, the token or tokens to be transmitted to the browser module may be determined by reference to a common vocabulary, typically disposed on the
telephony server 22, which equates recognized patterns with appropriate tokens. As discussed above, one example of such a common vocabulary may be a composite grammar for use with applications written in VoiceXML. The common vocabulary may be generated, in part or in whole, by aunified interface server 37, or other processor-based system, which may communicate with thebrowser module 30 to coordinate the generation and update of root grammars and/or Main Menu applications throughout thesystem 10. - The composite grammar provides a reference for equating a recognized pattern corresponding to a spoken word, spoken phrase, or DTMF tone to a semantic interpretation, i.e., a token, which may be employed in the present technique as described herein. As one of ordinary skill in the art will appreciate, the semantic interpretation may be a simple value (such as a string), a flat set of attribute-value pairs (such as a day, month, and year), or a nested object. In this manner, the composite grammar provides a mechanism for translating a recognized word, phrase or tone into an input expected by at least one supported application. In response to the token the application, or a document of the application, may provide a desired output, such as the next voice mail message, a stock quote, a sports score, and so forth.
- By means of example, and referring now to
FIG. 2 , a graphic representation of aroot grammar 40 and its relation to associated application and document grammars is provided. As depicted inFIG. 2 , theroot grammar 40 may be associated with a plurality of applications and an automatically generated Main Menu application, each having arespective application grammar first application grammar 48 may include at least the grammars for the documents of the first application, i.e., document (1 a)grammar 54 and document (1 b)grammar 56, as depicted inFIG. 2 . Similarly, thesecond application grammar 50 may include at least the document (2 a) grammar 58, and thethird application grammar 52 may include at least the document (3 a)grammar 60, the document (3 b)grammar 62, and the document (3 c)grammar 64. In addition, the MainMenu application grammar 53 may include document grammars related to help, such as document (4 a)grammar 66, and to tutorials, such as document (4 b)grammar 68. - In practice, the
browser module 30 may generate theroot grammar 40. For example, thebrowser module 30 may query the respective applications to elicit therespective application grammars root grammar 40 may be generated. Thebrowser module 30 may, in turn, publish theroot grammar 40 to a platform upon which it may be queried, such as theunified interface server 37. Alternatively, the service provider may examine theapplication grammars browser module 30 for publication into theroot grammar 40. In addition, the application providers themselves may submit the grammars elements that they deem to be applicable to be published in theroot grammar 40. - As described above, the
root grammar 40 of the present technique may include therespective application grammars root grammar 40 may be used to select a token or token string corresponding to a respective application and document referenced by theroot grammar 40. The token or token string may, in turn, be used to access the appropriate level or document of the application directly, without having to navigate through intervening layers or documents of the application. In addition, as noted above, the root grammar may access the Main Menu application created by the grouping of the applications into a bundle. - Though VoiceXML is one language that may be used to implement speech centric applications, other standardized languages and/or proprietary languages may also be employed. To the extent that a speech centric application recognizes words, phrases, or tones having corresponding tokens, i.e., possesses a grammar, the present technique is applicable. In particular, the present technique may be useful for navigating between multiple applications where each application possesses multiple entry points, i.e., levels, documents, or sub-routines, that may be directly accessed by the proper token or tokens.
- Referring now to
FIG. 3 , an example of an interaction between a user and two applications using the present technique is provided. As depicted inFIG. 3 , a number ofapplications 70, such as the depictedvoice mail application 72 andweather application 74 may be available to a subscriber. In VoiceXML implementations, aroot grammar 40 may be employed, allowing the subscriber to freely navigate betweenapplications 70. For example, the subscriber may initially wish to get a list of service that are available in the bundle by saying “what can I do” thereby eliciting the help tutorial from the Main Menu. - Alternatively, the subscriber may wish to bypass the Main Menu as well as application preliminary menus. For example, the user may verbally state “Miami, weather for next Friday” to elicit responsive data from the
play weather document 76 of theweather application 74. Because the statement by the subscriber contains words that are constituents of the document grammar of theplay weather document 74, otherwise intervening steps, such as the elicitation of a city atdecision block 78, may be bypassed. - Subsequent to checking the weather, the user may wish to check the second queued message in the
voice mail application 72. The user may verbally state “next message” to access thenext message document 82, thereby eliciting the second message in his voice mail queue. - As set forth in these examples, the statements of the subscriber provide sufficient information, i.e., generate the necessary tokens, to directly access application entry points, such as documents in the case of VoiceXML applications, which might not ordinarily be accessed in this manner. As used herein, the term entry points with respect to applications generally refers to documents, sub-routines, levels or other programming constructs that may be accessed with suitable inputs, such as tokens or other semantic interpretations, to elicit a desired response, such as a data file. In addition, as set forth in the preceding example, the implementation of a composite grammar 40 (consisting of the root grammar as well as the application grammar of a currently accessed application, if applicable) such as at the
telephony server 22, allows abrowser module 30 to navigate between applications and between documents of applications without forcing the user to consciously exit applications or linearly navigate intervening application levels, i.e., documents. - With regard to implementation of the present techniques, either or both of the
telephony server 22 and thebrowser module 30 may be based on a signal processing unit capable of implementing some or all of the techniques described herein, such as through software, hardware, of any suitable combination thereof. For example, thetelephony server 22 and/or thebrowser module 30 may be a general purpose device, such as a general purpose computer or server, with the appropriate software programming to carry out these techniques. Alternately, thetelephony server 22 and/or thebrowser module 30 may use special purpose processors, hardware, and/or software to carry out these techniques. Example of such special purpose processors and hardware include digital signal processors, RISC processors, or application specific integrated circuits, which may be specifically adapted to perform the present techniques. Furthermore, the functions of thebrowser module 30 and thetelephony server 22 may be combined on a single processor-based system if so desired. In one implementation, thetelephony server 22 andbrowser module 30 may be deployed on separate general purpose computers within a telephony complex, such as an ANYPATH® telephony complex. - While the invention may be susceptible to various modifications and alternative forms, specific embodiments have been shown by way of example in the drawings and have been described in detail herein. However, it should be understood that the invention is not intended to be limited to the particular forms disclosed. Rather, the invention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention as defined by the following appended claims.
Claims (27)
1. A signal processor configured to receive a token selected based upon a composite grammar, wherein the token corresponds to an entry point for one of a plurality of applications, and configured to access the respective application at the entry point.
2. The signal processor, as set forth in claim 1 , wherein the signal processor is configured to exit a previous application without receiving an exit instruction from a subscriber.
3. The signal processor, as set forth in claim 1 , wherein the signal processor is configured to receive a responsive data file from a level of the respective application corresponding to the entry point and configured to transmit the data file to a telephony server.
4. The signal processor, as set forth in claim 1 , comprising:
a telephony server configured to receive a modulated signal correlative to an audio command, to analyze the modulated signal to identify a constituent of a root grammar, to select the token corresponding to the constituent, and to transmit the token to the signal processor.
5. A communications system, comprising:
a telephony server configured to receive a modulated signal correlative to an audio command, to analyze the modulated signal to identify a constituent of a composite grammar, and to select a token corresponding to the constituent; and
a browser module configured to acquire the token and to access an entry point for one of a plurality of applications based upon the token.
6. The communications system, as set forth in claim 5 , comprising:
a plurality of application servers, wherein each application server is configured to execute at least one of the plurality of applications, wherein each application comprises at least one entry point which may be accessed by a corresponding token.
7. The communications system, as set forth in claim 5 , wherein the browser module is configured to receive a responsive data file from a level of the respective application corresponding to the entry point and configured to transmit the data file to the telephony server.
8. The communications system, as set forth in claim 7 , wherein the responsive data file comprises at least one of an audio file, a text file, a video file, and a multimedia file.
9. The communications system, as set forth in claim 5 , comprising:
a mobile switching center configured to transmit the modulated signal to the telephony server.
10. The communications system, as set forth in claim 9 , comprising:
at least one cell tower configured to generate an initial modulated signal in response to electromagnetic waves received via at least one antenna and to transmit the initial modulated signal to the mobile switching center.
11. The communications system, as set forth in claim 5 , comprising:
a public switched telephone network configured to transmit the modulated signal to the telephony server.
12. The communications system, as set forth in claim 5 , wherein the composite grammar comprises a VoiceXML grammar.
13. The communications system, as set forth in claim 5 , wherein the root grammar comprises at least two of a voice mail application grammar, a help application grammar, a conference call application grammar, a news application grammar, a weather application grammar, a financial application grammar, a scheduling application grammar, a mapping application grammar, and a database application grammar.
14. The communications system, as set forth in claim 5 , comprising a unified interface server configured to generate at least one root grammar included within the composite grammar.
15. The communications system, as set forth in claim 14 , wherein the unified interface server is further configured to generate one or more main menu applications associated with the plurality of applications.
16. A method for accessing an application, the method comprising the acts of:
processing a signal to identify an audio code as a constituent of a composite grammar; and
accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
17. The method, as set forth in claim 16 , comprising the acts of:
sending a data file to a user, wherein the data file is generated in response to accessing the entry point.
18. The method, as set forth in claim 16 , wherein accessing the entry point comprises transmitting an indicator to the respective application that the audio code was identified in the processed signal.
19. A tangible computer-readable medium, comprising:
programming instructions stored on the computer-readable medium for processing a signal to identify an audio code as a constituent of a composite grammar; and
programming instructions stored on the computer-readable medium for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
20. The tangible computer-readable medium, as set forth in claim 19 , comprising:
programming instructions stored on the computer-readable medium for receiving a data file from the entry point in response to accessing the entry point.
21. The tangible computer-readable medium, as set forth in claim 20 , comprising:
programming instructions stored on the computer-readable medium for sending the data file to a telephony server.
22. The tangible computer-readable medium, as set forth in claim 19 , wherein the programming instructions for accessing the entry point transmit a token to the respective application that the audio code was identified.
23. The tangible computer-readable medium, as set forth in claim 19 , wherein the composite grammar comprises a VoiceXML grammar.
24. A method for manufacturing a tangible computer medium, the method comprising the acts of:
storing programming instructions for identifying an audio code as a constituent of a composite grammar on a computer-readable medium; and
storing programming instructions for accessing an entry point of one of the plurality of applications based upon the constituent of the composite grammar on the computer-readable medium.
25. A method for manufacturing telephony system, the method comprising the act of:
providing at least one signal processing device programmed to identifying an audio code as a constituent of a composite grammar and programmed to access an entry point of one of the plurality of applications based upon the constituent of the composite grammar.
26. The method, as set forth in claim 25 , wherein providing the at least one signal processing device comprises obtaining at least one signal processing device.
27. The method, as set forth in claim 25 , wherein providing the at least one signal processing device comprises building at least one signal processing device.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/783,832 US20050229185A1 (en) | 2004-02-20 | 2004-02-20 | Method and system for navigating applications |
EP05250717A EP1566954A3 (en) | 2004-02-20 | 2005-02-08 | Method and system for navigating applications |
KR1020050011680A KR20060041889A (en) | 2004-02-20 | 2005-02-11 | Method and system for navigating applications |
JP2005041474A JP2005237009A (en) | 2004-02-20 | 2005-02-18 | Method and system for moving among applications |
CN2005100093844A CN1658635A (en) | 2004-02-20 | 2005-02-18 | Method and system for navigating applications |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/783,832 US20050229185A1 (en) | 2004-02-20 | 2004-02-20 | Method and system for navigating applications |
Publications (1)
Publication Number | Publication Date |
---|---|
US20050229185A1 true US20050229185A1 (en) | 2005-10-13 |
Family
ID=34711886
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/783,832 Abandoned US20050229185A1 (en) | 2004-02-20 | 2004-02-20 | Method and system for navigating applications |
Country Status (5)
Country | Link |
---|---|
US (1) | US20050229185A1 (en) |
EP (1) | EP1566954A3 (en) |
JP (1) | JP2005237009A (en) |
KR (1) | KR20060041889A (en) |
CN (1) | CN1658635A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070160184A1 (en) * | 2003-10-06 | 2007-07-12 | Utbk, Inc. | Systems and methods to connect people in a marketplace environment |
CN107817995A (en) * | 2016-09-12 | 2018-03-20 | 华为技术有限公司 | A kind of silent method, apparatus and terminal device for starting application in backstage |
US20190147850A1 (en) * | 2016-10-14 | 2019-05-16 | Soundhound, Inc. | Integration of third party virtual assistants |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8503423B2 (en) | 2007-02-02 | 2013-08-06 | Interdigital Technology Corporation | Method and apparatus for versatile MAC multiplexing in evolved HSPA |
US9933914B2 (en) | 2009-07-06 | 2018-04-03 | Nokia Technologies Oy | Method and apparatus of associating application state information with content and actions |
EP2355537B1 (en) * | 2010-01-25 | 2014-11-19 | BlackBerry Limited | Error correction for DTMF corruption on uplink |
US8233951B2 (en) | 2010-01-25 | 2012-07-31 | Research In Motion Limited | Error correction for DTMF corruption on uplink |
JPWO2014024751A1 (en) * | 2012-08-10 | 2016-07-25 | エイディシーテクノロジー株式会社 | Voice response device |
US9582498B2 (en) * | 2014-09-12 | 2017-02-28 | Microsoft Technology Licensing, Llc | Actions on digital document elements from voice |
JP6904788B2 (en) * | 2017-05-25 | 2021-07-21 | キヤノン株式会社 | Image processing equipment, image processing methods, and programs |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6125347A (en) * | 1993-09-29 | 2000-09-26 | L&H Applications Usa, Inc. | System for controlling multiple user application programs by spoken input |
US20020002453A1 (en) * | 2000-06-30 | 2002-01-03 | Mihal Lazaridis | System and method for implementing a natural language user interface |
US6374226B1 (en) * | 1999-08-06 | 2002-04-16 | Sun Microsystems, Inc. | System and method for interfacing speech recognition grammars to individual components of a computer program |
US20020133354A1 (en) * | 2001-01-12 | 2002-09-19 | International Business Machines Corporation | System and method for determining utterance context in a multi-context speech application |
US20020184023A1 (en) * | 2001-05-30 | 2002-12-05 | Senis Busayapongchai | Multi-context conversational environment system and method |
US20030182131A1 (en) * | 2002-03-25 | 2003-09-25 | Arnold James F. | Method and apparatus for providing speech-driven routing between spoken language applications |
US20040153322A1 (en) * | 2003-01-31 | 2004-08-05 | Comverse, Inc. | Menu-based, speech actuated system with speak-ahead capability |
US20040260562A1 (en) * | 2003-01-30 | 2004-12-23 | Toshihiro Kujirai | Speech interaction type arrangements |
US7158936B2 (en) * | 2001-11-01 | 2007-01-02 | Comverse, Inc. | Method and system for providing a voice application bookmark |
US7203645B2 (en) * | 2001-04-27 | 2007-04-10 | Intel Corporation | Speech recognition system loading different recognition engines for different applications |
US7401024B2 (en) * | 2003-12-02 | 2008-07-15 | International Business Machines Corporation | Automatic and usability-optimized aggregation of voice portlets into a speech portal menu |
US7418382B1 (en) * | 1998-10-02 | 2008-08-26 | International Business Machines Corporation | Structure skeletons for efficient voice navigation through generic hierarchical objects |
-
2004
- 2004-02-20 US US10/783,832 patent/US20050229185A1/en not_active Abandoned
-
2005
- 2005-02-08 EP EP05250717A patent/EP1566954A3/en not_active Ceased
- 2005-02-11 KR KR1020050011680A patent/KR20060041889A/en not_active Withdrawn
- 2005-02-18 CN CN2005100093844A patent/CN1658635A/en active Pending
- 2005-02-18 JP JP2005041474A patent/JP2005237009A/en not_active Abandoned
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6125347A (en) * | 1993-09-29 | 2000-09-26 | L&H Applications Usa, Inc. | System for controlling multiple user application programs by spoken input |
US7418382B1 (en) * | 1998-10-02 | 2008-08-26 | International Business Machines Corporation | Structure skeletons for efficient voice navigation through generic hierarchical objects |
US6374226B1 (en) * | 1999-08-06 | 2002-04-16 | Sun Microsystems, Inc. | System and method for interfacing speech recognition grammars to individual components of a computer program |
US20020002453A1 (en) * | 2000-06-30 | 2002-01-03 | Mihal Lazaridis | System and method for implementing a natural language user interface |
US20020133354A1 (en) * | 2001-01-12 | 2002-09-19 | International Business Machines Corporation | System and method for determining utterance context in a multi-context speech application |
US7203645B2 (en) * | 2001-04-27 | 2007-04-10 | Intel Corporation | Speech recognition system loading different recognition engines for different applications |
US6944594B2 (en) * | 2001-05-30 | 2005-09-13 | Bellsouth Intellectual Property Corporation | Multi-context conversational environment system and method |
US20020184023A1 (en) * | 2001-05-30 | 2002-12-05 | Senis Busayapongchai | Multi-context conversational environment system and method |
US7158936B2 (en) * | 2001-11-01 | 2007-01-02 | Comverse, Inc. | Method and system for providing a voice application bookmark |
US20030182131A1 (en) * | 2002-03-25 | 2003-09-25 | Arnold James F. | Method and apparatus for providing speech-driven routing between spoken language applications |
US20040260562A1 (en) * | 2003-01-30 | 2004-12-23 | Toshihiro Kujirai | Speech interaction type arrangements |
US20040153322A1 (en) * | 2003-01-31 | 2004-08-05 | Comverse, Inc. | Menu-based, speech actuated system with speak-ahead capability |
US7401024B2 (en) * | 2003-12-02 | 2008-07-15 | International Business Machines Corporation | Automatic and usability-optimized aggregation of voice portlets into a speech portal menu |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070160184A1 (en) * | 2003-10-06 | 2007-07-12 | Utbk, Inc. | Systems and methods to connect people in a marketplace environment |
US8180676B2 (en) * | 2003-10-06 | 2012-05-15 | Utbk, Inc. | Systems and methods to connect people in a marketplace environment |
US20130262240A1 (en) * | 2003-10-06 | 2013-10-03 | Ingenio Llc | System and methods to connect people in a marketplace environment |
US9639863B2 (en) * | 2003-10-06 | 2017-05-02 | Yellowpages.Com Llc | System and methods to connect people in a marketplace environment |
US10102550B2 (en) | 2003-10-06 | 2018-10-16 | Yellowpages.Com Llc | Systems and methods to connect people in a marketplace environment |
CN107817995A (en) * | 2016-09-12 | 2018-03-20 | 华为技术有限公司 | A kind of silent method, apparatus and terminal device for starting application in backstage |
US10901779B2 (en) | 2016-09-12 | 2021-01-26 | Huawei Technologies Co., Ltd. | Method and apparatus for silently starting application in background and terminal device |
US20190147850A1 (en) * | 2016-10-14 | 2019-05-16 | Soundhound, Inc. | Integration of third party virtual assistants |
US10783872B2 (en) * | 2016-10-14 | 2020-09-22 | Soundhound, Inc. | Integration of third party virtual assistants |
Also Published As
Publication number | Publication date |
---|---|
CN1658635A (en) | 2005-08-24 |
EP1566954A3 (en) | 2005-08-31 |
KR20060041889A (en) | 2006-05-12 |
JP2005237009A (en) | 2005-09-02 |
EP1566954A2 (en) | 2005-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7242752B2 (en) | Behavioral adaptation engine for discerning behavioral characteristics of callers interacting with an VXML-compliant voice application | |
US9202247B2 (en) | System and method utilizing voice search to locate a product in stores from a phone | |
US7286985B2 (en) | Method and apparatus for preprocessing text-to-speech files in a voice XML application distribution system using industry specific, social and regional expression rules | |
US9350862B2 (en) | System and method for processing speech | |
US7609829B2 (en) | Multi-platform capable inference engine and universal grammar language adapter for intelligent voice application execution | |
US9088652B2 (en) | System and method for speech-enabled call routing | |
US7050976B1 (en) | Method and system for use of navigation history in a voice command platform | |
EP1579428B1 (en) | Method and apparatus for selective distributed speech recognition | |
US7167830B2 (en) | Multimodal information services | |
US6985865B1 (en) | Method and system for enhanced response to voice commands in a voice command platform | |
US7590542B2 (en) | Method of generating test scripts using a voice-capable markup language | |
US8886540B2 (en) | Using speech recognition results based on an unstructured language model in a mobile communication facility application | |
US7260530B2 (en) | Enhanced go-back feature system and method for use in a voice portal | |
US20050043953A1 (en) | Dynamic creation of a conversational system from dialogue objects | |
US7436939B1 (en) | Method and system for consolidated message notification in a voice command platform | |
US7672436B1 (en) | Voice rendering of E-mail with tags for improved user experience | |
US20110106527A1 (en) | Method and Apparatus for Adapting a Voice Extensible Markup Language-enabled Voice System for Natural Speech Recognition and System Response | |
US20040153322A1 (en) | Menu-based, speech actuated system with speak-ahead capability | |
US7450698B2 (en) | System and method of utilizing a hybrid semantic model for speech recognition | |
US7627096B2 (en) | System and method for independently recognizing and selecting actions and objects in a speech recognition system | |
US20090030691A1 (en) | Using an unstructured language model associated with an application of a mobile communication facility | |
US20050229185A1 (en) | Method and system for navigating applications | |
US20030055651A1 (en) | System, method and computer program product for extended element types to enhance operational characteristics in a voice portal | |
US20050114139A1 (en) | Method of operating a speech dialog system | |
US7451086B2 (en) | Method and apparatus for voice recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: LUCENT TECHNOLOGIES, INC., NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:STOOPS, DANIEL STEWART;WEBB, JEFFREY J.;REEL/FRAME:015012/0955 Effective date: 20040220 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |