[go: up one dir, main page]

US20210295421A1 - Voice-based order processing - Google Patents

Voice-based order processing Download PDF

Info

Publication number
US20210295421A1
US20210295421A1 US17/341,691 US202117341691A US2021295421A1 US 20210295421 A1 US20210295421 A1 US 20210295421A1 US 202117341691 A US202117341691 A US 202117341691A US 2021295421 A1 US2021295421 A1 US 2021295421A1
Authority
US
United States
Prior art keywords
user
order
speech
terminal
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US17/341,691
Inventor
Wayne Lawrence Buckholdt
Andrew Robert Hamilton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NCR Voyix Corp
Original Assignee
NCR Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NCR Corp filed Critical NCR Corp
Priority to US17/341,691 priority Critical patent/US20210295421A1/en
Publication of US20210295421A1 publication Critical patent/US20210295421A1/en
Assigned to NCR CORPORATION reassignment NCR CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BUCKHOLDT, WAYNE LAWRENCE, Hamilton, Andrew Robert
Assigned to BANK OF AMERICA, N.A., AS ADMINISTRATIVE AGENT reassignment BANK OF AMERICA, N.A., AS ADMINISTRATIVE AGENT SECURITY INTEREST Assignors: NCR VOYIX CORPORATION
Assigned to NCR VOYIX CORPORATION reassignment NCR VOYIX CORPORATION CHANGE OF NAME Assignors: NCR CORPORATION
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0641Electronic shopping [e-shopping] utilising user interfaces specially adapted for shopping
    • G06K9/00624
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Definitions

  • a drive thru ordering service is expected by consumers. Consumers have become accustomed to eating in their vehicles or picking up food to take home or to their offices.
  • a drive thru usually requires at least one and in some cases two employees that are not available to prepare food orders and that are dedicated to taking drive thru orders, which these employees then enter into a terminal for order fulfillment processing by other employees. This adds to the number of employees needed to staff the restaurant and accordingly adds to the retailer's overall employee-associated expenses. With wage rates spiking, many fast food restaurants are finding it particularly difficult to stay in business and/or make a reasonable profit on their investment.
  • a method for voice-based processing is presented. More particularly, a voice dialogue is established with a user for an order at a drive thru terminal. Speech provided by user is translated into text commands associated with an order interface using a lexicon associated with a type of restaurant. The text commands are processed through the order interface. Speech feedback and guidance are provided at the drive thru terminal during the order. Finally, the order is completed at the drive thru terminal for subsequent order fulfillment processing.
  • FIG. 1 is a diagram of a system for voice-based order processing, according to an example embodiment.
  • FIG. 2 is a diagram of a method for voice-based order processing, according to an example embodiment.
  • FIG. 3 is a diagram of another method for voice-based order processing, according to an example embodiment.
  • FIG. 4 is a diagram of another system for voice-based order processing, according to an example embodiment.
  • FIG. 1 is a diagram of a system 100 for voice-based order processing.
  • the various components are illustrated and the arrangement of the components is presented for purposes of illustration only. It is to be noted that other arrangements with more or less components are possible without departing from the coded scan-based item processing teachings presented herein and below.
  • the techniques, methods, and system presented herein and below for voice-based order processing can be implemented in whole or in part in one, all, or some combination of the components shown with the system 100 .
  • the techniques and methods are programmed as executable instructions in memory and/or non-transitory computer-readable storage media and processed on one or more processors associated with the various components.
  • the system 100 includes a server/Point-Of-Sale (POS) terminal 110 , a drive thru terminal/digital sign 120 , optionally a voice-enabled network device 130 , and a vehicle 140 that optionally includes a mobile device 150 associated with a user in the vehicle 140 .
  • POS Point-Of-Sale
  • the server/POS terminal 100 includes at least one processor, memory, and non-transitory computer-readable storage medium.
  • the non-transitory computer-readable storage medium includes executable instructions for a transaction manager 111 , a voice interface 113 , and a drive thru/display manager 114 .
  • the non-transitory computer-readable storage medium also includes order images for items on a menu.
  • the executable instructions are executed by the processor from the non-transitory computer-readable storage medium to perform the processing discussed below for the transaction manager 111 , the voice interface 113 , and the drive thru terminal/display 114 .
  • the drive thru terminal/display 120 includes at least one processor, memory, and a non-transitory computer-readable storage medium having executable instructions representing a drive thru voice interface 128 and a server/POS terminal agent 129 .
  • the drive thru terminal/display 120 also includes a display 121 , a microphone 122 , a speaker 120 , a camera/sensor 124 , optionally a card reader 120 , optionally a currency dispenser/accepter (D/A) 126 , and optionally a wireless transceiver 127 .
  • the processor executes the executable instructions from the non-transitory computer-readable storage medium to perform the processing discussed below with respect to the drive thru voice interface 128 and the server/POS terminal agent 129 .
  • the system 100 includes a network-enabled network device 130 that includes a processor, memory, and non-transitory computer-readable storage medium that includes a POS terminal service 131 .
  • the system 100 includes a mobile device 150 carried or operated by a user in the vehicle 140 .
  • the mobile device 150 includes executable instructions, memory, and a non-transitory computer-readable storage medium having executable instructions representing a mobile application 151 .
  • the processor executes the executable instructions to perform the processing discussed below with respect to the mobile application 151 .
  • a user in the vehicle while in front of the drive thru terminal/display 120 uses spoken natural language words and phrases to place an order from a menu associated with a restaurant/fast food establishment.
  • the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and initiates the drive thru voice interface 128 .
  • the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and processes the server/POS terminal agent 129 through a wired or wireless connection 115 to the server/POS terminal 110 .
  • the voice interface 113 is then initiated on the server/POS terminal 110 providing audio feedback to the speaker 123 and receive speech input from the user through the microphone 122 .
  • the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and processes the drive thru voice interface 128 to redirect speech input from the user received over the microphone 122 to the voice-enabled network device 135 over a wired or wireless connection 135 .
  • the voice-enabled network device 135 initiates the POS terminal service 131 for a dialogue with the user during the ordering and audio/speech-based feedback produced by the POS terminal service 131 is routed over connection 135 to be played over the speaker 123 .
  • the drive thru voice interface 128 , the voice interface 113 , and/or the POS terminal service 131 is initiated based on a plurality of predefined wake-up words or phrases, such as: “Hello,” “I would like,” “can I have,” “can I get,” and others that are typically used when starting an order.
  • the wake-up words or phrases may be presented on the display 121 for the user to see when at the drive thru terminal/display 120 .
  • An automated voice response from the drive thru voice interface 128 or the voice interface 113 may include: “Please tell me what you would like to order,” “How can I serve you today,” “Ready to take/receive your order,” etc.
  • the drive thru voice interface 128 , the voice interface 113 , and/or the POS terminal service 131 can be initiated for an order based on automatic discovery of the vehicle 140 at the drive thru terminal/display 120 (through image processing) of images captured by the camera 124 ; can be initiated for an order based on automatic detection of the vehicle 140 through a sensor 124 (such as a light-based or motion-based sensor 124 ) that identifies the vehicle 140 at the drive thru terminal/display 120 ; and/or can be initiated for an order based on listening over the microphone 122 for predefined wake-up words or phrases.
  • a sensor 124 such as a light-based or motion-based sensor 124
  • the voice-interface 128 , the voice interface 113 and/or the POS terminal service 131 are automatically initiated for taking an interactive speech-based order with the user based on a machine-learning algorithm that is trained on images to determine when the vehicle 140 is present at the drive thru terminal/display 120 .
  • the voice interface 113 Once the drive thru voice interface 128 , the voice interface 113 , and/or the POS terminal service 131 are initiated (either through automated vehicle 140 discovery or predefined wake up words or phrases).
  • An interactive speech dialogue is initiated and the transaction manager 111 provides a transaction identifier for an order transaction.
  • An initial greeting may be played over the speaker 123 to the customer and/or reproduced as text presented on the display.
  • the transaction identifier starts a session or an interactive ordering dialogue for the user to communicate through speech an order and user-specific order details.
  • the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 provides a real-time speech-based interface for the user to verbally communicate an order.
  • a dialogue is created during which speech from the user received over the microphone is converted to text (speech to text).
  • the text is recognized and processed as commands and selections received from the user by the transaction manager 111 .
  • Output text that confirms order selections and order details for an order produced by the transaction manager 111 is processed by the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 to convey speech feedback for the order or speech instructions/guidance, which is played over the speaker 123 (text to speech).
  • the feedback and instructions may also be concurrently presented on the display 121 by the server/POS terminal agent 129 .
  • the lexicon of words and phrases are recognized and processed for speech-to-text, transaction processing of an order, and text-to-speech feedback are predefined and constrained to the menu and menu options available at the restaurant. This substantially improves accuracy.
  • the predefined lexicon can include order-specific words and phrases as well, such words may include commands for: “want,” “like,” “make,” “add,” “order,” “buy,” “purchase,” “cancel,” “delete,” “remove,” “modify,” “change,” etc.
  • the commands are identified as processing actions that are to be processed by the transaction manager 111 for a given order/transaction/session/dialogue.
  • the supported nouns associated with commands vary based on the restaurant (fast food) type and the food menu of available food items.
  • the nouns identity menu items associated with a given command may include nouns for: hamburger, cheeseburger, lettuce, tomato, onion, pickle, ketchup, mustard, mayo, fries, onion rings, soda (such as Coca-Cola®, Pepsi®, Sprite®, root beer, etc.), water, ice tea, coffee, etc.
  • the adjectives can also be customized for the specific restaurant and its menu and menu options, such adjectives may include: small, medium, regular, large, extra-large, double, triple, Super-Size®, etc.
  • the adjectives define characteristics of a given noun or set of nouns (such as a prepackaged meal).
  • Prepositions may include: with, and without for including or not including something with a noun (menu item).
  • the exclamations used for confirming an order is correct through speech feedback may include: yes, no, or variants such as, yea, nah, yep, nope, wrong, right, correct, incorrect.
  • the supported lexicon is specific to a given restaurant type and provided through an added skill to an existing voice-enabled network service as a specialized skill represented by the POS terminal service 131 .
  • the existing voice-enabled network service can include by way of example, Siri®, Amazon Echo/Alexa®, Google Home®, etc.
  • the voice-enabled network device 130 is a modified version of one of these consumer-voice services that includes the specialized POS terminal service 131 as an enhanced skill for the existing voice-enabled network service.
  • the POS terminal service 131 is coded for the specialized lexicon/dictionary and commands as discussed above and operates through the voice-enabled network device 130 .
  • Words communicated through speech are received through the microphone 122 and translated to text by the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 . Sentences then become text words that are recognized as parts of speech for the commands, the adjectives, the noun menu items, and prepositions, and exclamations.
  • the components of the sentences are formatted as commands provided to an Application Programming Interface (API) of the transaction manager 111 , the transaction manager 121 is processed as a component of the restaurant's POS terminal ordering system/interfaces.
  • API Application Programming Interface
  • Output produced by the transaction manager 111 is translated to speech and played over the speaker 123 and/or presented on the display 121 during the order by the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 .
  • Any visual presented feedback or guidance during the transaction/dialogue is processed by the server POS terminal agent 129 to render such visual information on the display 121 .
  • the speech-based feedback and guidance during the transaction/dialogue is processed by the voice interface 113 the drive thru voice interface 128 , and/or the POS terminal service 131 and played as audio over the speaker 123 .
  • the order/transaction/dialogue is a real-time dialogue and session for a transaction and the transaction is for placing an order with the restaurant.
  • the user provides natural language voice instructions and receives natural language speech feedback, speech confirmation, and speech-based help instructions or guidance as needed during the session for the transaction.
  • the server/POS terminal agent 129 utilizes a collection of menu item images 113 that map to menu items for the menu of the restaurant. So, when a user orders through speech an item and image of that item is retrieved by the server/POS terminal agent 129 and presented on the display 121 . The user can visually see what the user had communicated through speech as the items are ordered through speech by the user. This may be particularly helpful when a user sees that an image for an item ordered is not something that the user actually wanted (in other words the user may have said something different from what the user actually wanted). This allows for speech-based revisions and changes during the order. In addition, text associated with the order may be presented with the corresponding images.
  • the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 may detect the language being spoken by the user and a dialect of a specific language.
  • the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 , and the server/POS terminal agent 129 then configure its speech and text provided as spoken and visual feedback and guidance for the detected language and dialect. So, any supported spoken language can be automatically recognized and used throughout the order/session/dialogue/transaction.
  • the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 may also upon completion of an order (with an exclamation being verified from the user for the order) provide speech instructions for the user to pay for the order.
  • the order may be paid for at the drive thru terminal/display 120 or at a different station.
  • the drive thru terminal/display 120 includes a card reader 125 and/or a currency dispenser/accepter 126 .
  • Other forms of payment may be used as well, such as Apple Pay®, NCR's JetPay®, Venmo®, digital wallets, etc. Control is passed to the transaction manager 121 and these peripherals for confirming payment was received from the user either through a credit/debit card or through currency.
  • the server/POS terminal agent 129 may initiate a voice call to a remote human ordering agent. This call can be initiated through cellular or through Voice over Internet Protocol (VoIP) from the drive thru terminal/display 120 to the remote order taker or a bank of order takers to select.
  • VoIP Voice over Internet Protocol
  • the server/POS terminal agent 129 may then be remote controlled through an API by the order taker during the order, and selections or details for the order provided by the server/POS terminal agent 129 to the transaction manager 111 for processing with the local restaurant's ordering system/interface.
  • the call request is routed over connection 115 to the server/POS terminal 110 and the server/POS terminal 110 makes the call and provides the connection between the order taker and the user through the drive thru terminal/display 120 utilizing the display 121 and the microphone 122 .
  • the server/POS terminal agent 129 interacts with the wireless transceiver 127 to establish a connection to the mobile device 150 and communicate with the mobile application 151 .
  • the user may conduct the voice-based order dialogue with the drive thru terminal/display 120 using the microphone of the mobile device 150 while remaining in the vehicle 140 with the windows of the vehicle 140 remaining up.
  • the voice-based feedback and guidance for the order/dialogue/transaction can be played over a speaker of the mobile device 150 , over speaker 123 of the drive thru terminal/display 120 or over both the mobile device's speaker and the drive thru terminal/display's speaker. This may be particularly advantageous when there is inclement weather at the location associated with the drive thru terminal/display 120 .
  • spoken user voice during the dialogue is provided over a wireless connection 145 (such as Bluetooth®) and feedback from the voice interface 113 , the drive thru voice interface 128 , and/or the POS terminal service 131 routed over the wireless connection 145 and played over a speaker that is integrated into the mobile device.
  • the mobile application 151 may be used to display the visual feedback and guidance provided over the wireless connection 145 on a display of the mobile device 150 .
  • the mobile application 151 may provide selectable and changeable options for the order that the user can make through an interface of the mobile application 151 . So, the input microphone can be microphone 122 or a mobile device microphone, while the output speaker can be speaker 123 or a mobile device speaker.
  • a text-based menu interface may be interacted with by the user through a mobile application user interface on the mobile device 150 .
  • connection 135 also provides the speech-to-text translations made by the POS terminal service 131 to the drive thru voice interface 128 .
  • the drive thru voice interface 128 then uses the text transactions to translate into transaction manager 111 commands and provides to the server/POS terminal agent 129 .
  • the terminal agent 129 interacts with the transaction manager 111 through an API to have the commands for the order and the order details recorded for the transaction in the ordering system/interface.
  • Output received from the transaction manager 111 is received by the terminal agent 129 and provided to the drive thru voice interface 128 , the voice feedback is forwarded to the POS terminal service 131 and played over speaker 123 to the user.
  • the terminal agent 129 also presents the feedback in visual format on the display 121 , the visual format may include the images for items of the menu that were ordered (such as an image of a cheese burger, etc.) and text formatted feedback.
  • the mobile application 151 makes a wireless connection 146 to the POS terminal service 131 and provides voice-based ordering through the microphone of the mobile device 150 . Speech and voiced-based feedback is communicated back to the speakers of the mobile device 150 and/or also the speaker 123 .
  • the POS terminal service 131 provides the voice input to the terminal agent 129 and plays the feedback for the order over the mobile device's speakers using connection 146 .
  • the POS terminal service 131 interacts with the drive thru voice interface 128 over a different connection 135 to permit the terminal agent 129 and the transaction manager 111 to process the order for the transaction in the manners discussed above.
  • the mobile application 151 reports its current geographical location and the POS terminal service 131 looks up an identifier for the terminal/display 120 based on the reported geographical location of the mobile device 150 . This permits the POS terminal service 131 to make connection 135 with the appropriate terminal/display 120 that the vehicle 140 is located at.
  • the voice-enabled network device 130 resides on server 110 or is connected to POS terminal 110 rather than connected to the drive thru terminal/display 120 .
  • the voice interface 113 manages voice to text and text to voice translations for interaction with the terminal agent 129 and/or transaction manager 111 .
  • the drive thru terminal/display 120 is a thin client, which includes just the display 121 , the microphone 122 , the speaker 123 , optionally the camera/sensor 124 , optionally the card reader 125 , optionally the currency dispenser/accepter 126 , optionally the wireless transceiver 127 , and the terminal agent 129 .
  • the terminal agent 129 provides speech received over 115 to the server/POS terminal 110 and all processing for the transaction/order/dialogue is processed on the server/POS terminal 110 with the drive through display manager 114 displaying the menu item images 112 on display 121 and the voice interface 113 playing the speech feedback over speaker 123 .
  • the system 100 provides a variety of configurations for integrated a separate intelligent voice service with an existing ordering system/interface for purposes of allowing natural language based ordering at the drive thru terminal/display 120 .
  • the system 100 includes a variety of embodiments and additional order processing.
  • the system 100 is customized for a specific restaurant type and specific menu of such a restaurant type, which allows for more accurate speech-to-text and text-to-speech processing. This is done with a restricted lexicon of words and phrases to identify user order commands, menu items (nouns), adjectives affecting the menu items, prepositions affecting menu items, and exclamations that confirm or do not confirm a given user order.
  • the server/POS terminal 110 is a server 110 .
  • the server 110 is a collection of servers that logically cooperate in a cloud processing environment.
  • the server 110 is connected over a Local Area Network (LAN) via 115 to the terminal/display 120 .
  • the server 110 is connected over a Wide Area Network (WAN) through 115 to the terminal/display 120 .
  • LAN Local Area Network
  • WAN Wide Area Network
  • the server/POS terminal 110 is a POS terminal 110 .
  • the POS terminal 110 is a transaction terminal located within the restaurant.
  • the POS terminal 110 can be a Self-Service Terminal (SST) 110 located within the restaurant and remote from the terminal/display 120 , and the SST 100 also permits other customers or users that come within the restaurant to concurrently place self-orders on touchscreen interface of the SST 100 .
  • the POS terminal 110 is a cashier assisted transaction terminal within the restaurant that concurrently takes and processes orders for customers/users that come inside to eat at the restaurant.
  • the POS terminal 110 is connected to the terminal/display 120 over wireless connection 115 .
  • the POS terminal is connected to the terminal/display over a wired connection 115 .
  • a separate SST includes the card reader 125 and the currency dispenser/accepter 126 .
  • the user engages in a completed voice-based dialogue order at the terminal/display 120 and pulls the vehicle ahead to the SST where the user is instructed to pay for the order to complete the transaction.
  • the user engages in a completed voice-based dialogue order at the terminal/display 120 and pulls up to a window where the user pays for the order to complete the transaction.
  • the user engages in a completed voice dialogue and pays for the order to complete the transaction at the terminal/display 120 utilizing card reader 125 and/or the currency dispenser/accepter 126 .
  • the user then pulls ahead and picks up the order at a window from a human agent or the user pulls ahead to a designated window or food storage bin that automatically unlocks when the order is completed.
  • Any automated window/bin may not include a human agent; albeit a human agent may fulfill the order and pace it in the located bin or inside the locked window.
  • the order is fulfilled by automated robots and placed in a locked bin or inside the locked window.
  • FIG. 2 is a diagram of a method 200 for voice-based order processing, according to an example embodiment.
  • the software module(s) that implements the method 200 is referred to as a “voice-order service.”
  • the voice-order service is implemented as executable instructions programmed and residing within memory and/or a non-transitory computer-readable (processor-readable) storage medium and executed by one or more processors of a device.
  • the processor(s) of the device that executes the transaction manager are specifically configured and programmed to process the voice-order service.
  • the voice-order service has access to one or more networks during its processing.
  • the networks can be wired, wireless, or a combination of wired and wireless.
  • the device that executes the voice-order service is the drive thru terminal/display 120 .
  • the device that executes the voice-order service the server 110
  • the device that executes the voice-order service is the POS terminal 110 .
  • the device that executes the voice-order service is the voice-enabled network device 130 .
  • a plurality of devices execute portions of the voice-order service, the devices include all of some combination of: the server/POS terminal 110 , the drive thru terminal/display 120 , the voice-enabled network device 130 , and/or the mobile device 150 .
  • the voice-order service is all or some combination of: the transaction manager 111 , voice interface 113 , drive thru terminal/display manager 114 , drive thru voice interface 128 , server/POS terminal agent 129 , POS terminal service 131 , and/or mobile application 151 .
  • the voice-order service initiates, at a drive thru terminal, an interactive voice dialogue/session with a user for an order.
  • the user is situated within a vehicle in front of the drive thru terminal.
  • the drive thru terminal is the terminal/display 120 .
  • the voice-order service automatically detects a vehicle with includes the user in front of the drive thru terminal.
  • the voice-order service processes images taken in a front of the drive thru terminal with a machine-learning algorithm to identify a presence of the vehicle.
  • the voice order service identifies the vehicle through a motion or light sensor that detects the vehicle in front of the drive thru terminal.
  • the voice-order service identifies a wake-up word or phrase received over a microphone of the drive thru terminal that is specific to ordering.
  • the voice-order service translates speech provided by the user into text commands associated with an order interface using a lexicon associated with a type of restaurant.
  • the voice-order service requests clarification to a portion of the speech through a speech-clarification request played over the speaker to the user.
  • the voice-order service summarizes the speech once translated and playing a summary of the speech over the speaker to the user.
  • the voice-order service processes text commands through the order interface.
  • the voice-order service receives order pricing and order details from the order interface as text information.
  • the voice-order service provides, at the drive thru terminal, speech feedback and guidance to the user during the order over the speaker.
  • the voice-order service translates the text information into the speech feedback that is played over the speaker.
  • the voice-order service simultaneously presents the text information on a display of the drive thru terminal as the speech feedback is played over the speaker to the user.
  • the voice-order service presents images associated with the order items of the order details on the display with the text information. Each image is a customized picture or illustration of a given order item.
  • the voice-order service completes, at the drive thru terminal, the order for subsequent order fulfillment processing.
  • the voice-order service processes the speech and the speech feedback through a voice-enabled network device interfaced to the drive thru terminal using an enabled skill on the voice-enabled network device for the type of restaurant.
  • the voice-enabled network device is an enhanced version of Siri®, Echo®, or Google Home® having the novel and enabled skill.
  • FIG. 3 is a diagram of another method 300 for voice-based order processing, according to an example embodiment.
  • the software module(s) that implement the method 300 is referred to herein as a “drive thru voice-enabled service.”
  • the drive thru voice-enabled service is implemented as executable instructions and programmed within memory and/or a non-transitory computer-readable (processor-readable) storage medium that executes on one or more processors of a device.
  • the processors of the device are specifically configured to execute the drive thru voice-enabled service.
  • the drive thru voice-enabled service has access one or more networks; the networks are wireless.
  • the drive thru voice-enabled service is all or some combination of: transaction manager 111 , voice interface 113 , drive thru terminal/display manager 114 , drive thru voice interface 128 , server/POS terminal agent 129 , POS terminal service 131 , and/or the mobile application 151 .
  • the device that executes the drive thru voice-enabled service is server 110 .
  • the device that executes the drive thru voice-enabled service is the POS terminal 110 .
  • the device that executes the drive thru voice-enabled service is the drive thru terminal/display 120 .
  • the device that executes the drive thru voice-enabled service is a combination of all of the devices 110 , 120 , 130 and/or 150 .
  • the drive thru voice-enabled service automatically initiates a voice dialogue for a transaction associated with an order of a user in a vehicle at a drive thru terminal based on detection of an event.
  • the drive thru voice-enabled service detects the event as an indication of a vehicle that includes the user being in front of the drive thru terminal using a camera or a motion/light sensor.
  • the drive thru voice-enabled service detects the event as a predefined wake-up word that initiates the order when detected over the microphone.
  • the drive thru voice-enabled service translates speech received over a microphone from a user into text commands, text order items, and text order customizations.
  • the drive thru voice-enabled service formats the text commands, text order items, and text order customizations into an order interface actions.
  • the drive thru voice-enabled service provides the order interface actions to a transaction manager using an Application Programming Interface (API).
  • API Application Programming Interface
  • the drive thru voice-enabled service obtains a transaction number for the transaction and the order from the transaction manager.
  • the drive thru voice-enabled service receives order pricing and order details over the API from the transaction manager.
  • the drive thru voice-enabled service converts the order pricing and the order details into speech feedback.
  • the drive thru voice-enabled service plays the speech feedback over a speaker to the user.
  • the drive thru voice-enabled service obtains the order pricing and the order details on a display of the drive thru terminal.
  • the drive thru voice-enabled service iterates back to 320 until speech confirmation is received from the user indicating the voice dialogue has concluded.
  • the drive thru voice-enabled service provides instructions to the user for making payment to conclude the transaction.
  • the drive thru voice-enabled service receives a credit card or currency by the drive thru terminal from the user to complete the transaction. It is to be noted that other non-card based forms of payment may also be processed, such as but not limited to: Apple Pay®, NCR JetPay®, Venmo®, digital wallets, virtual currency from a virtual currency wallet, etc.
  • FIG. 4 is a diagram of a system 400 for voice-based order processing, according to an example embodiment. Some components of the system 400 are programmed and reside within memory and/or a non-transitory computer-readable medium and execute on one or more processors of devices of the system 400 .
  • the system 400 also has access and can communicate over one or more networks; and the networks are wireless, wired, or a combination of wired and wireless.
  • the system 400 is configured and programed to perform the processing discussed above with the FIGS. 1-3 .
  • the system 400 includes a display 401 , a microphone 402 , a speaker 403 , a processor 404 , and non-transitory computer-readable storage medium 405 having executable instructions representing a voice-based ordering service 406 .
  • the processor is included within one of: a server 110 , a POS terminal 110 , a SST 110 , terminal/display 120 , or voice-enabled network device 130 .
  • the voice-based ordering service 406 is all or some combination of one or more of: the transaction manager 111 , the voice interface 113 , the drive-thru terminal display manager 114 , the drive thru voice interface 128 , the terminal agent 129 , the POS terminal service 131 , the mobile application 151 , the method 200 , and/or the method 300 .
  • the processor 404 loads the executable instructions for the voice-based ordering service 406 from the non-transitory computer-readable storage medium 405 and executes the voice-based ordering service 406 this causes the processor 404 to: engage a user while in a vehicle at a drive thru terminal 120 in an interactive voice dialogue for taking an order at a restaurant; translate speech received over the microphone 402 during the interactive voice dialogue into text actions; process the text actions with an order interface; receive order pricing and order details from the order interface responsive to processing the text actions; communicate through speech feedback the order pricing and order details over the speaker 403 ; display the order pricing and order details on the display 401 ; confirm through speech interaction that the order is complete; and request a payment method from the user to complete the order for order fulfillment processing.
  • the voice-based ordering system 406 is further configured when executed by the processor to cause the processor 404 to: display unique images representing order items on the display 401 during the interactive voice dialogue.
  • modules may be illustrated as separate modules, but may be implemented as homogenous code, as individual components, some, but not all of these modules may be combined, or the functions may be implemented in software structured in any other convenient manner.
  • the software modules are illustrated as executing on one piece of hardware, the software may be distributed over multiple processors of a single device, or in any other convenient manner.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • User Interface Of Digital Computer (AREA)
  • Cash Registers Or Receiving Machines (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A vehicle is detected at a drive thru terminal and an automated voice-based dialogue is established with a user in the vehicle and the terminal. An order is placed for a transaction using natural-language speech. The automated voice-based dialogue identifies commands, nouns associated with menu items, adjectives associated with modifying the menu items, prepositions associated with modifying the menu items, and exclamations associated with confirming the order and associated menu items. The automated speech is identified and processed based on a specific lexicon associated with a specific type of restaurant. The speech is translated into actions processed through an ordering system/interface as text-based instructions. Feedback for the order and guidance during the order are provided to the user through: speech played over a speaker associated with the terminal, text displayed on a display, and/or images associated with the ordered items.

Description

    BACKGROUND
  • Network-based shopping has steadily increased in recent years within the retail industry. Brick and mortar stores have suffered substantially as a result of this trend because of rent, costs, inventory of product, and staffing associated with maintaining the physical stores and loss of sales to online competition. One area where consumers are less likely to engage in shopping over the Internet is fast food, restaurants, and grocery stores. This is for a variety of reasons, such as consumers want these products at the time that they shop for those products and want their food while they are traveling, such that online ordering makes little sense.
  • Employee expense has become an issue for many retailers. This is particularly true in the fast food and restaurant industries. Many state and city governments have mandated minimum wage rates that are particular difficult for the industry to absorb. Customers are unwilling to pay significantly higher prices that would be necessary, in many cases, to cover the new wage rates. As a result, some retailers have been forced to reduce employee hours and/or close their businesses.
  • In the fast food industry, a drive thru ordering service is expected by consumers. Consumers have become accustomed to eating in their vehicles or picking up food to take home or to their offices. However, a drive thru usually requires at least one and in some cases two employees that are not available to prepare food orders and that are dedicated to taking drive thru orders, which these employees then enter into a terminal for order fulfillment processing by other employees. This adds to the number of employees needed to staff the restaurant and accordingly adds to the retailer's overall employee-associated expenses. With wage rates spiking, many fast food restaurants are finding it particularly difficult to stay in business and/or make a reasonable profit on their investment.
  • SUMMARY
  • In various embodiments, methods and a system for voice-based order processing are presented.
  • According to an embodiment, a method for voice-based processing is presented. More particularly, a voice dialogue is established with a user for an order at a drive thru terminal. Speech provided by user is translated into text commands associated with an order interface using a lexicon associated with a type of restaurant. The text commands are processed through the order interface. Speech feedback and guidance are provided at the drive thru terminal during the order. Finally, the order is completed at the drive thru terminal for subsequent order fulfillment processing.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a diagram of a system for voice-based order processing, according to an example embodiment.
  • FIG. 2 is a diagram of a method for voice-based order processing, according to an example embodiment.
  • FIG. 3 is a diagram of another method for voice-based order processing, according to an example embodiment.
  • FIG. 4 is a diagram of another system for voice-based order processing, according to an example embodiment.
  • DETAILED DESCRIPTION
  • FIG. 1 is a diagram of a system 100 for voice-based order processing. The various components are illustrated and the arrangement of the components is presented for purposes of illustration only. It is to be noted that other arrangements with more or less components are possible without departing from the coded scan-based item processing teachings presented herein and below.
  • The techniques, methods, and system presented herein and below for voice-based order processing can be implemented in whole or in part in one, all, or some combination of the components shown with the system 100. The techniques and methods are programmed as executable instructions in memory and/or non-transitory computer-readable storage media and processed on one or more processors associated with the various components.
  • As used herein the terms “customer,” “consumer,” and “user” may be used synonymously and interchangeably.
  • The system 100 includes a server/Point-Of-Sale (POS) terminal 110, a drive thru terminal/digital sign 120, optionally a voice-enabled network device 130, and a vehicle 140 that optionally includes a mobile device 150 associated with a user in the vehicle 140.
  • The server/POS terminal 100 includes at least one processor, memory, and non-transitory computer-readable storage medium. The non-transitory computer-readable storage medium includes executable instructions for a transaction manager 111, a voice interface 113, and a drive thru/display manager 114. The non-transitory computer-readable storage medium also includes order images for items on a menu. The executable instructions are executed by the processor from the non-transitory computer-readable storage medium to perform the processing discussed below for the transaction manager 111, the voice interface 113, and the drive thru terminal/display 114.
  • The drive thru terminal/display 120 includes at least one processor, memory, and a non-transitory computer-readable storage medium having executable instructions representing a drive thru voice interface 128 and a server/POS terminal agent 129. The drive thru terminal/display 120 also includes a display 121, a microphone 122, a speaker 120, a camera/sensor 124, optionally a card reader 120, optionally a currency dispenser/accepter (D/A) 126, and optionally a wireless transceiver 127. The processor executes the executable instructions from the non-transitory computer-readable storage medium to perform the processing discussed below with respect to the drive thru voice interface 128 and the server/POS terminal agent 129.
  • In an embodiment, the system 100 includes a network-enabled network device 130 that includes a processor, memory, and non-transitory computer-readable storage medium that includes a POS terminal service 131.
  • In an embodiment, the system 100 includes a mobile device 150 carried or operated by a user in the vehicle 140. The mobile device 150 includes executable instructions, memory, and a non-transitory computer-readable storage medium having executable instructions representing a mobile application 151. The processor executes the executable instructions to perform the processing discussed below with respect to the mobile application 151.
  • During operation of the system 100, a user in the vehicle while in front of the drive thru terminal/display 120 uses spoken natural language words and phrases to place an order from a menu associated with a restaurant/fast food establishment.
  • In an embodiment, the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and initiates the drive thru voice interface 128.
  • In an embodiment, the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and processes the server/POS terminal agent 129 through a wired or wireless connection 115 to the server/POS terminal 110. The voice interface 113 is then initiated on the server/POS terminal 110 providing audio feedback to the speaker 123 and receive speech input from the user through the microphone 122.
  • In an embodiment, the camera/sensor 124 detects a vehicle in front of the drive thru terminal/display 120 and processes the drive thru voice interface 128 to redirect speech input from the user received over the microphone 122 to the voice-enabled network device 135 over a wired or wireless connection 135. The voice-enabled network device 135 initiates the POS terminal service 131 for a dialogue with the user during the ordering and audio/speech-based feedback produced by the POS terminal service 131 is routed over connection 135 to be played over the speaker 123.
  • In an embodiment, the drive thru voice interface 128, the voice interface 113, and/or the POS terminal service 131 is initiated based on a plurality of predefined wake-up words or phrases, such as: “Hello,” “I would like,” “can I have,” “can I get,” and others that are typically used when starting an order. In an embodiment, the wake-up words or phrases may be presented on the display 121 for the user to see when at the drive thru terminal/display 120. An automated voice response from the drive thru voice interface 128 or the voice interface 113 may include: “Please tell me what you would like to order,” “How can I serve you today,” “Ready to take/receive your order,” etc.
  • So, the drive thru voice interface 128, the voice interface 113, and/or the POS terminal service 131: can be initiated for an order based on automatic discovery of the vehicle 140 at the drive thru terminal/display 120 (through image processing) of images captured by the camera 124; can be initiated for an order based on automatic detection of the vehicle 140 through a sensor 124 (such as a light-based or motion-based sensor 124) that identifies the vehicle 140 at the drive thru terminal/display 120; and/or can be initiated for an order based on listening over the microphone 122 for predefined wake-up words or phrases.
  • In an embodiment, the voice-interface 128, the voice interface 113 and/or the POS terminal service 131 are automatically initiated for taking an interactive speech-based order with the user based on a machine-learning algorithm that is trained on images to determine when the vehicle 140 is present at the drive thru terminal/display 120.
  • Once the drive thru voice interface 128, the voice interface 113, and/or the POS terminal service 131 are initiated (either through automated vehicle 140 discovery or predefined wake up words or phrases). An interactive speech dialogue is initiated and the transaction manager 111 provides a transaction identifier for an order transaction. An initial greeting may be played over the speaker 123 to the customer and/or reproduced as text presented on the display. The transaction identifier starts a session or an interactive ordering dialogue for the user to communicate through speech an order and user-specific order details.
  • The voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131 provides a real-time speech-based interface for the user to verbally communicate an order. A dialogue is created during which speech from the user received over the microphone is converted to text (speech to text). The text is recognized and processed as commands and selections received from the user by the transaction manager 111. Output text that confirms order selections and order details for an order produced by the transaction manager 111 is processed by the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131 to convey speech feedback for the order or speech instructions/guidance, which is played over the speaker 123 (text to speech). The feedback and instructions may also be concurrently presented on the display 121 by the server/POS terminal agent 129.
  • The lexicon of words and phrases are recognized and processed for speech-to-text, transaction processing of an order, and text-to-speech feedback are predefined and constrained to the menu and menu options available at the restaurant. This substantially improves accuracy. The predefined lexicon can include order-specific words and phrases as well, such words may include commands for: “want,” “like,” “make,” “add,” “order,” “buy,” “purchase,” “cancel,” “delete,” “remove,” “modify,” “change,” etc. The commands are identified as processing actions that are to be processed by the transaction manager 111 for a given order/transaction/session/dialogue. The supported nouns associated with commands vary based on the restaurant (fast food) type and the food menu of available food items. The nouns identity menu items associated with a given command. For example, a hamburger fast food restaurant may include nouns for: hamburger, cheeseburger, lettuce, tomato, onion, pickle, ketchup, mustard, mayo, fries, onion rings, soda (such as Coca-Cola®, Pepsi®, Sprite®, root beer, etc.), water, ice tea, coffee, etc. The adjectives can also be customized for the specific restaurant and its menu and menu options, such adjectives may include: small, medium, regular, large, extra-large, double, triple, Super-Size®, etc. The adjectives define characteristics of a given noun or set of nouns (such as a prepackaged meal). Prepositions may include: with, and without for including or not including something with a noun (menu item). The exclamations used for confirming an order is correct through speech feedback may include: yes, no, or variants such as, yea, nah, yep, nope, wrong, right, correct, incorrect.
  • In an embodiment, the supported lexicon is specific to a given restaurant type and provided through an added skill to an existing voice-enabled network service as a specialized skill represented by the POS terminal service 131. The existing voice-enabled network service can include by way of example, Siri®, Amazon Echo/Alexa®, Google Home®, etc. In this embodiment, the voice-enabled network device 130 is a modified version of one of these consumer-voice services that includes the specialized POS terminal service 131 as an enhanced skill for the existing voice-enabled network service. The POS terminal service 131 is coded for the specialized lexicon/dictionary and commands as discussed above and operates through the voice-enabled network device 130.
  • Words communicated through speech are received through the microphone 122 and translated to text by the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131. Sentences then become text words that are recognized as parts of speech for the commands, the adjectives, the noun menu items, and prepositions, and exclamations. The components of the sentences are formatted as commands provided to an Application Programming Interface (API) of the transaction manager 111, the transaction manager 121 is processed as a component of the restaurant's POS terminal ordering system/interfaces. Output produced by the transaction manager 111 is translated to speech and played over the speaker 123 and/or presented on the display 121 during the order by the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131. Any visual presented feedback or guidance during the transaction/dialogue is processed by the server POS terminal agent 129 to render such visual information on the display 121. The speech-based feedback and guidance during the transaction/dialogue is processed by the voice interface 113 the drive thru voice interface 128, and/or the POS terminal service 131 and played as audio over the speaker 123.
  • The order/transaction/dialogue is a real-time dialogue and session for a transaction and the transaction is for placing an order with the restaurant. The user provides natural language voice instructions and receives natural language speech feedback, speech confirmation, and speech-based help instructions or guidance as needed during the session for the transaction.
  • In an embodiment, the server/POS terminal agent 129 utilizes a collection of menu item images 113 that map to menu items for the menu of the restaurant. So, when a user orders through speech an item and image of that item is retrieved by the server/POS terminal agent 129 and presented on the display 121. The user can visually see what the user had communicated through speech as the items are ordered through speech by the user. This may be particularly helpful when a user sees that an image for an item ordered is not something that the user actually wanted (in other words the user may have said something different from what the user actually wanted). This allows for speech-based revisions and changes during the order. In addition, text associated with the order may be presented with the corresponding images.
  • In an embodiment, when the user initially starts an order with a wake up word, the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131 may detect the language being spoken by the user and a dialect of a specific language. The voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131, and the server/POS terminal agent 129 then configure its speech and text provided as spoken and visual feedback and guidance for the detected language and dialect. So, any supported spoken language can be automatically recognized and used throughout the order/session/dialogue/transaction.
  • In an embodiment, if the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131 may also upon completion of an order (with an exclamation being verified from the user for the order) provide speech instructions for the user to pay for the order. The order may be paid for at the drive thru terminal/display 120 or at a different station. In an embodiment, where the order is paid for at the drive thru terminal/display 120, the drive thru terminal/display 120 includes a card reader 125 and/or a currency dispenser/accepter 126. Other forms of payment may be used as well, such as Apple Pay®, NCR's JetPay®, Venmo®, digital wallets, etc. Control is passed to the transaction manager 121 and these peripherals for confirming payment was received from the user either through a credit/debit card or through currency.
  • In an embodiment, when the voice interface 113, the drive thru voice interface 128, and/or the POS terminal 131 is unable to understand or translated the spoken words of the user during the order. The server/POS terminal agent 129 may initiate a voice call to a remote human ordering agent. This call can be initiated through cellular or through Voice over Internet Protocol (VoIP) from the drive thru terminal/display 120 to the remote order taker or a bank of order takers to select. The server/POS terminal agent 129 may then be remote controlled through an API by the order taker during the order, and selections or details for the order provided by the server/POS terminal agent 129 to the transaction manager 111 for processing with the local restaurant's ordering system/interface. In an embodiment, the call request is routed over connection 115 to the server/POS terminal 110 and the server/POS terminal 110 makes the call and provides the connection between the order taker and the user through the drive thru terminal/display 120 utilizing the display 121 and the microphone 122.
  • In an embodiment, the server/POS terminal agent 129 interacts with the wireless transceiver 127 to establish a connection to the mobile device 150 and communicate with the mobile application 151. In this embodiment, the user may conduct the voice-based order dialogue with the drive thru terminal/display 120 using the microphone of the mobile device 150 while remaining in the vehicle 140 with the windows of the vehicle 140 remaining up. Here, the voice-based feedback and guidance for the order/dialogue/transaction can be played over a speaker of the mobile device 150, over speaker 123 of the drive thru terminal/display 120 or over both the mobile device's speaker and the drive thru terminal/display's speaker. This may be particularly advantageous when there is inclement weather at the location associated with the drive thru terminal/display 120. Here, spoken user voice during the dialogue is provided over a wireless connection 145 (such as Bluetooth®) and feedback from the voice interface 113, the drive thru voice interface 128, and/or the POS terminal service 131 routed over the wireless connection 145 and played over a speaker that is integrated into the mobile device. Additionally, in this embodiment, the mobile application 151 may be used to display the visual feedback and guidance provided over the wireless connection 145 on a display of the mobile device 150. In some cases, the mobile application 151 may provide selectable and changeable options for the order that the user can make through an interface of the mobile application 151. So, the input microphone can be microphone 122 or a mobile device microphone, while the output speaker can be speaker 123 or a mobile device speaker. Furthermore, a text-based menu interface may be interacted with by the user through a mobile application user interface on the mobile device 150.
  • In an embodiment, the microphone 122 input source feed is routed over connection 135 to the voice-enabled network device 130 and the speaker output source feed of the voice-enabled network device 130 is routed over 135 to the speaker 123. Connection 135 also provides the speech-to-text translations made by the POS terminal service 131 to the drive thru voice interface 128. The drive thru voice interface 128 then uses the text transactions to translate into transaction manager 111 commands and provides to the server/POS terminal agent 129. The terminal agent 129 interacts with the transaction manager 111 through an API to have the commands for the order and the order details recorded for the transaction in the ordering system/interface. Output received from the transaction manager 111 is received by the terminal agent 129 and provided to the drive thru voice interface 128, the voice feedback is forwarded to the POS terminal service 131 and played over speaker 123 to the user. The terminal agent 129 also presents the feedback in visual format on the display 121, the visual format may include the images for items of the menu that were ordered (such as an image of a cheese burger, etc.) and text formatted feedback.
  • In an embodiment, the mobile application 151 makes a wireless connection 146 to the POS terminal service 131 and provides voice-based ordering through the microphone of the mobile device 150. Speech and voiced-based feedback is communicated back to the speakers of the mobile device 150 and/or also the speaker 123. Here, the POS terminal service 131 provides the voice input to the terminal agent 129 and plays the feedback for the order over the mobile device's speakers using connection 146. Separately, the POS terminal service 131 interacts with the drive thru voice interface 128 over a different connection 135 to permit the terminal agent 129 and the transaction manager 111 to process the order for the transaction in the manners discussed above. In this embodiment, the mobile application 151 reports its current geographical location and the POS terminal service 131 looks up an identifier for the terminal/display 120 based on the reported geographical location of the mobile device 150. This permits the POS terminal service 131 to make connection 135 with the appropriate terminal/display 120 that the vehicle 140 is located at.
  • In an embodiment, the voice-enabled network device 130 resides on server 110 or is connected to POS terminal 110 rather than connected to the drive thru terminal/display 120. In this embodiment the voice interface 113 manages voice to text and text to voice translations for interaction with the terminal agent 129 and/or transaction manager 111.
  • In an embodiment, the drive thru terminal/display 120 is a thin client, which includes just the display 121, the microphone 122, the speaker 123, optionally the camera/sensor 124, optionally the card reader 125, optionally the currency dispenser/accepter 126, optionally the wireless transceiver 127, and the terminal agent 129. In this embodiment, the terminal agent 129 provides speech received over 115 to the server/POS terminal 110 and all processing for the transaction/order/dialogue is processed on the server/POS terminal 110 with the drive through display manager 114 displaying the menu item images 112 on display 121 and the voice interface 113 playing the speech feedback over speaker 123.
  • The system 100 provides a variety of configurations for integrated a separate intelligent voice service with an existing ordering system/interface for purposes of allowing natural language based ordering at the drive thru terminal/display 120. The system 100 includes a variety of embodiments and additional order processing. The system 100 is customized for a specific restaurant type and specific menu of such a restaurant type, which allows for more accurate speech-to-text and text-to-speech processing. This is done with a restricted lexicon of words and phrases to identify user order commands, menu items (nouns), adjectives affecting the menu items, prepositions affecting menu items, and exclamations that confirm or do not confirm a given user order.
  • In an embodiment, the server/POS terminal 110 is a server 110. In an embodiment, the server 110 is a collection of servers that logically cooperate in a cloud processing environment. In an embodiment, the server 110 is connected over a Local Area Network (LAN) via 115 to the terminal/display 120. In an embodiment, the server 110 is connected over a Wide Area Network (WAN) through 115 to the terminal/display 120.
  • In an embodiment the server/POS terminal 110 is a POS terminal 110. The POS terminal 110 is a transaction terminal located within the restaurant. In an embodiment, the POS terminal 110 can be a Self-Service Terminal (SST) 110 located within the restaurant and remote from the terminal/display 120, and the SST 100 also permits other customers or users that come within the restaurant to concurrently place self-orders on touchscreen interface of the SST 100. In an embodiment, the POS terminal 110 is a cashier assisted transaction terminal within the restaurant that concurrently takes and processes orders for customers/users that come inside to eat at the restaurant. In an embodiment, the POS terminal 110 is connected to the terminal/display 120 over wireless connection 115. In an embodiment, the POS terminal is connected to the terminal/display over a wired connection 115.
  • In an embodiment, a separate SST includes the card reader 125 and the currency dispenser/accepter 126. The user engages in a completed voice-based dialogue order at the terminal/display 120 and pulls the vehicle ahead to the SST where the user is instructed to pay for the order to complete the transaction.
  • In an embodiment, the user engages in a completed voice-based dialogue order at the terminal/display 120 and pulls up to a window where the user pays for the order to complete the transaction.
  • In an embodiment, the user engages in a completed voice dialogue and pays for the order to complete the transaction at the terminal/display 120 utilizing card reader 125 and/or the currency dispenser/accepter 126. The user then pulls ahead and picks up the order at a window from a human agent or the user pulls ahead to a designated window or food storage bin that automatically unlocks when the order is completed. Any automated window/bin may not include a human agent; albeit a human agent may fulfill the order and pace it in the located bin or inside the locked window. In an embodiment, the order is fulfilled by automated robots and placed in a locked bin or inside the locked window.
  • The embodiments discussed above and other embodiments will not be discussed with reference to the FIGS. 2-4 below.
  • FIG. 2 is a diagram of a method 200 for voice-based order processing, according to an example embodiment. The software module(s) that implements the method 200 is referred to as a “voice-order service.” The voice-order service is implemented as executable instructions programmed and residing within memory and/or a non-transitory computer-readable (processor-readable) storage medium and executed by one or more processors of a device. The processor(s) of the device that executes the transaction manager are specifically configured and programmed to process the voice-order service. The voice-order service has access to one or more networks during its processing. The networks can be wired, wireless, or a combination of wired and wireless.
  • In an embodiment, the device that executes the voice-order service is the drive thru terminal/display 120.
  • In an embodiment, the device that executes the voice-order service the server 110
  • In an embodiment, the device that executes the voice-order service is the POS terminal 110.
  • In an embodiment, the device that executes the voice-order service is the voice-enabled network device 130.
  • In an embodiment, a plurality of devices execute portions of the voice-order service, the devices include all of some combination of: the server/POS terminal 110, the drive thru terminal/display 120, the voice-enabled network device 130, and/or the mobile device 150.
  • In an embodiment, the voice-order service is all or some combination of: the transaction manager 111, voice interface 113, drive thru terminal/display manager 114, drive thru voice interface 128, server/POS terminal agent 129, POS terminal service 131, and/or mobile application 151.
  • At 210, the voice-order service initiates, at a drive thru terminal, an interactive voice dialogue/session with a user for an order. The user is situated within a vehicle in front of the drive thru terminal. In an embodiment, the drive thru terminal is the terminal/display 120.
  • In an embodiment, at 211, the voice-order service automatically detects a vehicle with includes the user in front of the drive thru terminal.
  • In an embodiment of 211 at 212, the voice-order service processes images taken in a front of the drive thru terminal with a machine-learning algorithm to identify a presence of the vehicle.
  • In an embodiment of 212 at 213, the voice order service identifies the vehicle through a motion or light sensor that detects the vehicle in front of the drive thru terminal.
  • In an embodiment, at 214, the voice-order service identifies a wake-up word or phrase received over a microphone of the drive thru terminal that is specific to ordering.
  • At 220, the voice-order service translates speech provided by the user into text commands associated with an order interface using a lexicon associated with a type of restaurant.
  • In an embodiment, at 221, the voice-order service requests clarification to a portion of the speech through a speech-clarification request played over the speaker to the user.
  • In an embodiment, at 222, the voice-order service summarizes the speech once translated and playing a summary of the speech over the speaker to the user.
  • At 230, the voice-order service processes text commands through the order interface.
  • In an embodiment, at 231, the voice-order service receives order pricing and order details from the order interface as text information.
  • At 240, the voice-order service provides, at the drive thru terminal, speech feedback and guidance to the user during the order over the speaker.
  • In an embodiment of 231 and 240, at 241, the voice-order service translates the text information into the speech feedback that is played over the speaker.
  • In an embodiment of 241 at 242, the voice-order service simultaneously presents the text information on a display of the drive thru terminal as the speech feedback is played over the speaker to the user.
  • In an embodiment of 242 at 243, the voice-order service presents images associated with the order items of the order details on the display with the text information. Each image is a customized picture or illustration of a given order item.
  • At 250, the voice-order service completes, at the drive thru terminal, the order for subsequent order fulfillment processing.
  • In an embodiment, at 260, the voice-order service processes the speech and the speech feedback through a voice-enabled network device interfaced to the drive thru terminal using an enabled skill on the voice-enabled network device for the type of restaurant. In an embodiment, the voice-enabled network device is an enhanced version of Siri®, Echo®, or Google Home® having the novel and enabled skill.
  • FIG. 3 is a diagram of another method 300 for voice-based order processing, according to an example embodiment. The software module(s) that implement the method 300 is referred to herein as a “drive thru voice-enabled service.” The drive thru voice-enabled service is implemented as executable instructions and programmed within memory and/or a non-transitory computer-readable (processor-readable) storage medium that executes on one or more processors of a device. The processors of the device are specifically configured to execute the drive thru voice-enabled service. The drive thru voice-enabled service has access one or more networks; the networks are wireless.
  • In an embodiment, the drive thru voice-enabled service is all or some combination of: transaction manager 111, voice interface 113, drive thru terminal/display manager 114, drive thru voice interface 128, server/POS terminal agent 129, POS terminal service 131, and/or the mobile application 151.
  • In an embodiment, the device that executes the drive thru voice-enabled service is server 110.
  • In an embodiment, the device that executes the drive thru voice-enabled service is the POS terminal 110.
  • In an embodiment, the device that executes the drive thru voice-enabled service is the drive thru terminal/display 120.
  • In an embodiment, the device that executes the drive thru voice-enabled service is a combination of all of the devices 110, 120, 130 and/or 150.
  • At 310, the drive thru voice-enabled service automatically initiates a voice dialogue for a transaction associated with an order of a user in a vehicle at a drive thru terminal based on detection of an event.
  • In an embodiment, at 311, the drive thru voice-enabled service detects the event as an indication of a vehicle that includes the user being in front of the drive thru terminal using a camera or a motion/light sensor.
  • In an embodiment, at 312, the drive thru voice-enabled service detects the event as a predefined wake-up word that initiates the order when detected over the microphone.
  • At 320, the drive thru voice-enabled service translates speech received over a microphone from a user into text commands, text order items, and text order customizations.
  • At 330, the drive thru voice-enabled service formats the text commands, text order items, and text order customizations into an order interface actions.
  • At 340, the drive thru voice-enabled service provides the order interface actions to a transaction manager using an Application Programming Interface (API).
  • In an embodiment, at 341, the drive thru voice-enabled service obtains a transaction number for the transaction and the order from the transaction manager.
  • At 350, the drive thru voice-enabled service receives order pricing and order details over the API from the transaction manager.
  • At 360, the drive thru voice-enabled service converts the order pricing and the order details into speech feedback.
  • At 370, the drive thru voice-enabled service plays the speech feedback over a speaker to the user.
  • In an embodiment, at 371, the drive thru voice-enabled service obtains the order pricing and the order details on a display of the drive thru terminal.
  • At 380, the drive thru voice-enabled service iterates back to 320 until speech confirmation is received from the user indicating the voice dialogue has concluded.
  • At 390, the drive thru voice-enabled service provides instructions to the user for making payment to conclude the transaction.
  • In an embodiment, at 391, the drive thru voice-enabled service receives a credit card or currency by the drive thru terminal from the user to complete the transaction. It is to be noted that other non-card based forms of payment may also be processed, such as but not limited to: Apple Pay®, NCR JetPay®, Venmo®, digital wallets, virtual currency from a virtual currency wallet, etc.
  • FIG. 4 is a diagram of a system 400 for voice-based order processing, according to an example embodiment. Some components of the system 400 are programmed and reside within memory and/or a non-transitory computer-readable medium and execute on one or more processors of devices of the system 400. The system 400 also has access and can communicate over one or more networks; and the networks are wireless, wired, or a combination of wired and wireless.
  • The system 400 is configured and programed to perform the processing discussed above with the FIGS. 1-3.
  • The system 400 includes a display 401, a microphone 402, a speaker 403, a processor 404, and non-transitory computer-readable storage medium 405 having executable instructions representing a voice-based ordering service 406.
  • In an embodiment, the processor is included within one of: a server 110, a POS terminal 110, a SST 110, terminal/display 120, or voice-enabled network device 130.
  • In an embodiment, the voice-based ordering service 406 is all or some combination of one or more of: the transaction manager 111, the voice interface 113, the drive-thru terminal display manager 114, the drive thru voice interface 128, the terminal agent 129, the POS terminal service 131, the mobile application 151, the method 200, and/or the method 300.
  • When the processor 404 loads the executable instructions for the voice-based ordering service 406 from the non-transitory computer-readable storage medium 405 and executes the voice-based ordering service 406 this causes the processor 404 to: engage a user while in a vehicle at a drive thru terminal 120 in an interactive voice dialogue for taking an order at a restaurant; translate speech received over the microphone 402 during the interactive voice dialogue into text actions; process the text actions with an order interface; receive order pricing and order details from the order interface responsive to processing the text actions; communicate through speech feedback the order pricing and order details over the speaker 403; display the order pricing and order details on the display 401; confirm through speech interaction that the order is complete; and request a payment method from the user to complete the order for order fulfillment processing.
  • In an embodiment the voice-based ordering system 406 is further configured when executed by the processor to cause the processor 404 to: display unique images representing order items on the display 401 during the interactive voice dialogue.
  • It should be appreciated that where software is described in a particular form (such as a component or module) this is merely to aid understanding and is not intended to limit how software that implements those functions may be architected or structured. For example, modules may be illustrated as separate modules, but may be implemented as homogenous code, as individual components, some, but not all of these modules may be combined, or the functions may be implemented in software structured in any other convenient manner.
  • Furthermore, although the software modules are illustrated as executing on one piece of hardware, the software may be distributed over multiple processors of a single device, or in any other convenient manner.
  • The above description is illustrative, and not restrictive. Many other embodiments will be apparent to those of skill in the art upon reviewing the above description. The scope of embodiments should therefore be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled.
  • In the foregoing description of the embodiments, various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting that the claimed embodiments have more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Description of the Embodiments, with each claim standing on its own as a separate exemplary embodiment.

Claims (21)

1. (canceled)
2. A method, comprising:
detecting a user in a vehicle at a drive thru terminal by image processing, by sensor-raised events, or by user-provided speech to a microphone;
greeting the user by automated speech played over a speaker and reproduced as text presented on a display of the drive thru terminal;
engaging the user in a natural language dialogue to start a session for a transaction with the user using a lexicon for automated speech processing that is specific to a menu and menu options associated with the drive thru terminal;
receiving an order from the user during the session based on the dialogue;
providing the order to order fulfillment processing for fulfillment; and
providing instructions to the user during the dialogue using the automated speech processing for the user to provide a payment to complete the transaction, the session, and the dialogue.
3. The method of claim 2, wherein detecting further includes identifying a wake-up word or a wake-up phrase in the user-provided speech to the microphone.
4. The method of claim 2, wherein detecting further includes detecting a vehicle adjacent to or in front of the drive thru terminal in an image captured by a camera associated with the drive thru terminal.
5. The method of claim 2, wherein engaging further includes translating the user-provided speech during the dialogue into feedback text and presenting the feedback text on the display to confirm with the user the user-provided speech provided as responses during the dialogue.
6. The method of claim 5, wherein translating further includes translating automated speech for the automated speech processing generated during the dialogue into automated text and presenting the automated text on the display to ensure the user hears the automated speech and can also read the automated speech during the dialogue.
7. The method of claim 2, wherein receiving further include obtaining menu item images as spoken by the user during the dialogue and presenting the menu item images on the display as visual feedback to the user of a given menu item ordered with the order by the user.
8. The method of claim 2, wherein receiving further includes initiating a remote voice call to an agent when the user-provided speech is unable to be translated for completing the order during the dialogue.
9. The method of claim 8, wherein initiating further includes receiving the order details for the order from the agent.
10. The method of claim 2 further comprising, receiving the user-provided speech during the dialogue from a mobile device operated by the user.
11. The method of claim 10 further comprising, providing the automated speech generated during the dialogue to a speaker of the mobile.
12. The method of claim 2 further comprising, processing the method on a server remotely located from the drive thru terminal or processing the method on a Point-Of-Sale (POS) terminal located at a same establishment that the drive thru terminal is located.
13. A method, comprising:
automatically greeting a user detected in a vehicle adjacent to a drive thru terminal with a spoken automatically generated greeting;
obtaining a transaction identifier for a transaction from a Point-Of-Sale (POS) terminal;
configuring automated speech processing based on a lexicon associated with a menu and menu items available from the drive thru terminal;
initiating a session with the user using the transaction identifier;
engaging the user during the session in a natural language dialogue to receive order details from the user for an order the user-provided speech and automated generated speech using the automated speech processing;
instructing the user during the dialogue on where and how a payment for the order can be supplied to complete the order; and
submitting the order details with the transaction identifier to the POS terminal for order fulfillment and completion of the transaction.
14. The method of claim 13, wherein engaging further includes obtaining menu item images as spoken by the user during the dialogue from the POS terminal and displaying the menu item images as visual feedback to the user on a display of the drive thru terminal.
15. The method of claim 13, wherein engaging further includes translating the user-provided speech to user-identified text, translating the automated generated speech to feedback text, and presenting the user-identified text and the feedback text on a display of the drive thru terminal during the session as visual feedback to the user.
16. The method of claim 13, wherein instructing further includes using the automated generated speech to instruct the user to pull a vehicle of the user ahead to a next location where the payment for the order will be collected from the user.
17. The method of claim 13, wherein instructing further includes activating a card reader at the drive thru terminal to receive a payment card for the payment.
18. The method of claim 13, wherein instructing further includes instructing the user to pull ahead to a next location to receive items associated with the order after processing the payment.
19. The method of claim 13, wherein instructing further includes instructing the user to pull ahead to a next location to receive the items associated with the order from a locked bin that automatically unlocks when the items are placed in the bin after the order is fulfilled for collection by the user and after processing the payment.
20. A system, comprising:
a drive thru terminal comprising a display, a microphone, and a camera;
a Point-Of-Sale (POS) terminal interfaced to an order fulfillment system;
a server comprising a processor and a non-transitory computer-readable storage medium;
the non-transitory computer-readable storage medium comprises executable instruction;
the executable instructions when executed by the processor from the non-transitory computer-readable storage medium cause the processor to perform operations comprising:
automatically initiating a voice-based dialogue with a user when a vehicle of the user is detected adjacent to the drive thru terminal based on images provided by the camera or when the user speaks a wake-up word or phrase into the microphone;
generating automated speech responses to user-provided speech during the voice-based dialogue;
obtaining order details for an order being placed by the user based on the user-provided speech during the voice-based dialogue;
providing the order details to the POS terminal for order fulfillment; and
instructing the user on how and where to provide a payment to pay for the order and complete the order.
21. The system of claim 20, wherein the server is a cloud processing environment and the drive thru terminal is a thin client device.
US17/341,691 2019-03-28 2021-06-08 Voice-based order processing Pending US20210295421A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/341,691 US20210295421A1 (en) 2019-03-28 2021-06-08 Voice-based order processing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/368,772 US11132740B2 (en) 2019-03-28 2019-03-28 Voice-based order processing
US17/341,691 US20210295421A1 (en) 2019-03-28 2021-06-08 Voice-based order processing

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US16/368,772 Continuation US11132740B2 (en) 2019-03-28 2019-03-28 Voice-based order processing

Publications (1)

Publication Number Publication Date
US20210295421A1 true US20210295421A1 (en) 2021-09-23

Family

ID=72605894

Family Applications (2)

Application Number Title Priority Date Filing Date
US16/368,772 Active US11132740B2 (en) 2019-03-28 2019-03-28 Voice-based order processing
US17/341,691 Pending US20210295421A1 (en) 2019-03-28 2021-06-08 Voice-based order processing

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US16/368,772 Active US11132740B2 (en) 2019-03-28 2019-03-28 Voice-based order processing

Country Status (1)

Country Link
US (2) US11132740B2 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11741529B2 (en) 2019-02-26 2023-08-29 Xenial, Inc. System for eatery ordering with mobile interface and point-of-sale terminal
WO2022026037A1 (en) * 2020-07-31 2022-02-03 Xenial, Inc. System and method for drive through order processing
US11676592B2 (en) * 2020-11-25 2023-06-13 Ncr Corporation Voice-based menu personalization
US11694039B1 (en) 2021-01-22 2023-07-04 Walgreen Co. Intelligent automated order-based customer dialogue system
US11862157B2 (en) 2021-02-24 2024-01-02 Conversenow Ai Automated ordering system
US11354760B1 (en) 2021-02-24 2022-06-07 Conversenowai Order post to enable parallelized order taking using artificial intelligence engine(s)
US11810550B2 (en) 2021-02-24 2023-11-07 Conversenowai Determining order preferences and item suggestions
US11348160B1 (en) 2021-02-24 2022-05-31 Conversenowai Determining order preferences and item suggestions
US11514894B2 (en) 2021-02-24 2022-11-29 Conversenowai Adaptively modifying dialog output by an artificial intelligence engine during a conversation with a customer based on changing the customer's negative emotional state to a positive one
US11355122B1 (en) 2021-02-24 2022-06-07 Conversenowai Using machine learning to correct the output of an automatic speech recognition system
US12456455B2 (en) * 2021-10-13 2025-10-28 Ncr Voyix Corporation Natural-language based order processing
EP4681154A1 (en) 2023-03-15 2026-01-21 Xenial, Inc. Drive through system including vision system and transaction system integration
US12381673B1 (en) * 2023-07-31 2025-08-05 Xenial, Inc. Drive through audio communication system with multi-lane support
WO2025122185A1 (en) 2023-12-05 2025-06-12 Xenial, Inc. Drive through system with traffic management
US20250245761A1 (en) * 2024-01-30 2025-07-31 International Business Machines Corporation Artificial Intelligence for Vehicular Drive-Through Based Exchanges

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030018531A1 (en) * 2000-09-08 2003-01-23 Mahaffy Kevin E. Point-of-sale commercial transaction processing system using artificial intelligence assisted by human intervention
US20040158494A1 (en) * 2003-02-05 2004-08-12 Suthar Yogin P. Restaurant automation system
US20070022018A1 (en) * 2003-04-23 2007-01-25 Sbc Knowledge Venture, L.P. Wireless electronic drive-thru system and method
US20180253805A1 (en) * 2016-10-31 2018-09-06 Kevin Kelly Drive-thru / point-of-sale automated transaction technologies and apparatus
US20180276602A1 (en) * 2017-03-09 2018-09-27 Smartmart, Inc. Automated Retail Facility
US20190108566A1 (en) * 2017-10-05 2019-04-11 Novo Labs, Inc. Contextual Restaurant Ordering System
US20190171711A1 (en) * 2017-03-29 2019-06-06 Valyant Al, Inc. Artificially intelligent order processing system
US20190259232A1 (en) * 2016-08-05 2019-08-22 Gopal Nandakumar Locker Adaption System and Related Method for Consumer In-Door, Out-Door and Curbside Goods Delivery and Pickup Services and for Merchant Store Pickup Services
US20230368274A1 (en) * 2019-02-26 2023-11-16 Xenial, Inc. System for eatery ordering with mobile interface and point-of-sale terminal

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001155259A (en) * 1999-11-30 2001-06-08 Matsushita Electric Ind Co Ltd Drive-through system
US20040199425A1 (en) * 2003-03-14 2004-10-07 Van Luchene Andrew S. Method and apparatus for motion-controlled communication of offers
US20060218039A1 (en) * 2005-02-25 2006-09-28 Johnson Neldon P Enhanced fast food restaurant and method of operation
US20070007331A1 (en) * 2005-07-06 2007-01-11 Verety Llc Order processing apparatus and method
US20080222004A1 (en) * 2007-03-06 2008-09-11 Verety, Llc Order Entry Graphical User Interface
US20080218313A1 (en) * 2007-03-09 2008-09-11 D Hont Loek Rfid-based system and method for drive-through ordering
US20150310365A1 (en) * 2014-04-25 2015-10-29 Xerox Corporation System and method for video-based detection of goods received event in a vehicular drive-thru
CA3052661A1 (en) * 2018-08-21 2020-02-21 Bite Inc. Retail ordering system with facial recognition

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030018531A1 (en) * 2000-09-08 2003-01-23 Mahaffy Kevin E. Point-of-sale commercial transaction processing system using artificial intelligence assisted by human intervention
US20040158494A1 (en) * 2003-02-05 2004-08-12 Suthar Yogin P. Restaurant automation system
US20070022018A1 (en) * 2003-04-23 2007-01-25 Sbc Knowledge Venture, L.P. Wireless electronic drive-thru system and method
US20190259232A1 (en) * 2016-08-05 2019-08-22 Gopal Nandakumar Locker Adaption System and Related Method for Consumer In-Door, Out-Door and Curbside Goods Delivery and Pickup Services and for Merchant Store Pickup Services
US20180253805A1 (en) * 2016-10-31 2018-09-06 Kevin Kelly Drive-thru / point-of-sale automated transaction technologies and apparatus
US20180276602A1 (en) * 2017-03-09 2018-09-27 Smartmart, Inc. Automated Retail Facility
US20190171711A1 (en) * 2017-03-29 2019-06-06 Valyant Al, Inc. Artificially intelligent order processing system
US20190108566A1 (en) * 2017-10-05 2019-04-11 Novo Labs, Inc. Contextual Restaurant Ordering System
US20230368274A1 (en) * 2019-02-26 2023-11-16 Xenial, Inc. System for eatery ordering with mobile interface and point-of-sale terminal

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Masood, Adnan, et al, "Cognitive Computing Recipes," 2019, "8-8. Fast Food Drive-thru Automation Problem Solution" (Year: 2019) *
Masood, Adnan, et al, "Cognitive Computing Recipes," 2019, "8-8. Fast Food Drive-thru Automation ProblemnSolution" (Year: 2019) *

Also Published As

Publication number Publication date
US11132740B2 (en) 2021-09-28
US20200311804A1 (en) 2020-10-01

Similar Documents

Publication Publication Date Title
US11132740B2 (en) Voice-based order processing
US12346945B2 (en) System for virtual agents to help customers and businesses
US11755836B1 (en) Artificially intelligent order processing system
US12451133B2 (en) Voice-based menu personalization
US8060412B2 (en) System and method for obtaining merchandise information
US20030018531A1 (en) Point-of-sale commercial transaction processing system using artificial intelligence assisted by human intervention
US20150113006A1 (en) Systems, devices and methods for list display and management
CN107230142A (en) Method and device, method of commerce and system based on speech production order
US12456455B2 (en) Natural-language based order processing
US20070143127A1 (en) Virtual host
US20180089654A1 (en) Point-Of-Sale (POS) Bot
US11628574B2 (en) System and method for robotic, compact, mobile beverage station
US20230214592A1 (en) Methods and apparatus to generate textual data using machine learning processes
WO2017223298A2 (en) Speech-based storage tracking
US11238859B2 (en) Voice-based transaction processing with location-based activation
US20250095039A1 (en) Systems and methods for autonomous conversational ordering and booking using ai technologies
US20190378505A1 (en) Interactive voice-activated bot with visual cue
US20240404430A1 (en) Method of transmission of sign language for customer use with a business
US9576316B2 (en) Dynamic menus
US11315071B1 (en) Speech-based storage tracking
Sadi et al. Development of a Voice Controlled Web based E-Commerce
US20230123813A1 (en) System and method of voice-based ordering and delivery thereof
JP2025099405A (en) Shopping support system, sales data processing apparatus, and shopping support program
CN121391425A (en) A method and device for online shopping

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

AS Assignment

Owner name: NCR CORPORATION, GEORGIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BUCKHOLDT, WAYNE LAWRENCE;HAMILTON, ANDREW ROBERT;REEL/FRAME:060855/0965

Effective date: 20190328

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

AS Assignment

Owner name: BANK OF AMERICA, N.A., AS ADMINISTRATIVE AGENT, NORTH CAROLINA

Free format text: SECURITY INTEREST;ASSIGNOR:NCR VOYIX CORPORATION;REEL/FRAME:065346/0168

Effective date: 20231016

AS Assignment

Owner name: NCR VOYIX CORPORATION, GEORGIA

Free format text: CHANGE OF NAME;ASSIGNOR:NCR CORPORATION;REEL/FRAME:065532/0893

Effective date: 20231013

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION COUNTED, NOT YET MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED