[go: up one dir, main page]

CN100353417C - Method and apparatus for providing text message - Google Patents

Method and apparatus for providing text message Download PDF

Info

Publication number
CN100353417C
CN100353417C CNB031249639A CN03124963A CN100353417C CN 100353417 C CN100353417 C CN 100353417C CN B031249639 A CNB031249639 A CN B031249639A CN 03124963 A CN03124963 A CN 03124963A CN 100353417 C CN100353417 C CN 100353417C
Authority
CN
China
Prior art keywords
message
template
utterance
templates
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB031249639A
Other languages
Chinese (zh)
Other versions
CN1601548A (en
Inventor
张亚昕
何昕
任晓林
孙放
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Motorola Mobility LLC
Google Technology Holdings LLC
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc filed Critical Motorola Inc
Priority to CNB031249639A priority Critical patent/CN100353417C/en
Priority to RU2006113581/09A priority patent/RU2320082C2/en
Priority to KR1020067005735A priority patent/KR100759728B1/en
Priority to PCT/US2004/030553 priority patent/WO2005031995A1/en
Priority to EP04784421A priority patent/EP1665561A4/en
Publication of CN1601548A publication Critical patent/CN1601548A/en
Application granted granted Critical
Publication of CN100353417C publication Critical patent/CN100353417C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72436User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. short messaging services [SMS] or e-mails
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/271Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A method and apparatus for providing a text message includes an input of an electronic device (100) receiving an utterance (step 210). Speech recognition is then performed on the utterance (step 230) guided by a message template set by the user, where the message template is stored in a memory (155) associated with the electronic device (100). Speech recognition is defined by matching one of the utterances to a template to create a matching template. A text message is then provided through the matching template (step 235).

Description

用于提供文本消息的方法和装置Method and apparatus for providing text messages

技术领域technical field

本发明主要涉及一种利用话音来提供文本消息的方法和装置。本发明对于(但不排它的)利用话音输入来提供文本消息的方式尤其有用处,其中话音输入是在带有有限内存和计算能力的便携式电子设备上进行处理的。The present invention generally relates to a method and apparatus for providing text messages using voice. The present invention is particularly useful, but not exclusively, for providing text messages using voice input that is processed on portable electronic devices with limited memory and computing power.

背景技术Background technique

通常采用短消息业务(SMS)格式的短文本消息在无线通信领域内应用非常广泛。每个月有几十亿的短文本消息被发送,通常从一部移动电话发向另一部移动电话。这种文本消息的流行有很多原因。消息一般只需花费一分钟移动电话呼叫费用的一小部分且无需发送或接收占线音。因此,发送用户可以在方便的时候才建立和发送消息,而接收用户则可以在方便的时候才接收并读取消息。Short text messages, usually in Short Message Service (SMS) format, are widely used in the field of wireless communications. Billions of short text messages are sent every month, often from one mobile phone to another. There are many reasons for the popularity of this text message. Messages typically cost a fraction of the cost of a minute mobile phone call and require no busy tone to be sent or received. Therefore, the sending user can create and send the message when it is convenient, and the receiving user can receive and read the message when it is convenient.

通常通过在移动电话的键盘内键入字符的方式来建立文本消息。但是,使用这种小型的非标准键盘来编写消息可能很不方便,且通常比采用原尺寸的标准键盘的方式需要更多的时间。但是,在移动电话上连接原尺寸的标准键盘当然是不切实际的。因此,需要采用更有效的方法来编写短文本消息。Text messages are typically created by typing characters into the keypad of the mobile phone. However, composing a message using such a small, non-standard keyboard can be inconvenient and often requires more time than with a full-sized standard keyboard. However, it is of course impractical to attach a full-sized standard keyboard to a mobile phone. Therefore, there is a need for a more efficient method of composing short text messages.

进而,虽然现存多种公知的语音识别系统,但大多数都不适用于诸如移动电话的便携式电子设备。这是因为现有技术的语音识别系统通常需要比便携式电子设备中所能提供的更大的处理功率和内存。Furthermore, although there are various known speech recognition systems, most are not suitable for portable electronic devices such as mobile phones. This is because prior art speech recognition systems typically require more processing power and memory than can be provided in portable electronic devices.

现有技术的封闭式词汇语音识别系统和方法采用预先设定的、固定的词汇表。在使用中,固定词汇表可能很大但并不完全,因此,例如,它可能不包括人的姓氏或许多地名。相反,开放式词汇语音识别系统和方法使用可变的词汇表,用户可以向其中添加新的单词和词组或者删除单词和词组。但是,现有的开放式词汇语音识别系统和方法需要相对较高的计算开销,这对于诸如数字个人助理(PDA)、无线电话和其它便携式设备的便携式电子设备来说,可能承受不了。Closed-vocabulary speech recognition systems and methods in the prior art use a pre-set, fixed vocabulary. In use, a fixed vocabulary may be large but not complete, so that, for example, it may not include a person's last name or many place names. In contrast, open vocabulary speech recognition systems and methods use a variable vocabulary to which a user can add new words and phrases or delete words and phrases. However, existing open-vocabulary speech recognition systems and methods require relatively high computational overhead, which may be prohibitive for portable electronic devices such as digital personal assistants (PDAs), wireless phones, and other portable devices.

在本说明书以及权利要求中,术语“包含”或者类似术语希望表示非排它性的包含,以使包含一组组件的方法或装置并不是仅仅只包括这些组件,而是可以很好地包括其它未列出的组件。In this specification and claims, the term "comprises" or similar terms is intended to indicate a non-exclusive inclusion, so that a method or apparatus comprising a group of components does not only include these components, but may well include other components. Components not listed.

发明内容Contents of the invention

根据本发明的一个方面,提供了一种用于提供文本消息的方法。该方法包括在电子设备的输入端接收话语的步骤。然后,在由用户设定的消息模板所引导的话语上进行语音识别,该消息模板存储在与电子设备相关联的存储器中,其中通过对该话语和模板之一进行匹配的方式定义语音识别,以建立一个匹配模板。然后,通过该匹配模板提供文本消息。According to one aspect of the present invention, a method for providing a text message is provided. The method includes the step of receiving an utterance at an input of an electronic device. performing speech recognition on the utterance guided by a message template set by the user, the message template being stored in a memory associated with the electronic device, wherein the speech recognition is defined by matching the utterance to one of the templates, to create a matching template. Then, provide a text message through this matching template.

至少一个消息模板可以包含固定的语言成分。At least one message template may contain fixed language components.

至少一个消息模板可以包含可变的语言成分。At least one message template may contain variable language components.

至少一个消息模板可以同时包含固定和可变的语言成分。At least one message template can contain both fixed and variable language components.

文本消息可以是SMS消息。Text messages may be SMS messages.

上述方法也可以包括通过接收由电子设备的键盘键入的字符来编辑由用户设定的消息模板的步骤。The above method may also include the step of editing the message template set by the user by receiving characters typed by the keyboard of the electronic device.

文本消息的一部分可以是话语的转录(transcription)。A portion of a text message may be a transcription of an utterance.

文本消息可以全部是话语的转录。Text messages may all be transcriptions of utterances.

根据本发明的另一方面,提供了一种用于提供文本消息的电子设备。该电子设备包括一个用于接收话语的麦克风;用于存储消息模板的非易失性存储器;用于执行由该消息模板引导的话语的语音识别的处理器,其中处理器对话语和模板之一进行匹配以建立一个匹配模板,并通过该匹配模板提供文本消息。According to another aspect of the present invention, an electronic device for providing text messages is provided. The electronic device includes a microphone for receiving an utterance; a non-volatile memory for storing a message template; a processor for performing speech recognition of an utterance guided by the message template, wherein the processor interprets one of the utterance and the template Matches are made to build a match template through which the text message is served.

对于该电子设备,消息模板还可以包括固定和/或可变的语言成分。For the electronic device, the message template may also include fixed and/or variable language components.

对于该电子设备,文本消息可以是SMS消息。For the electronic device, the text message may be an SMS message.

该电子设备可以包括一个键盘用于编辑消息模板。The electronic device may include a keypad for editing message templates.

该电子设备可以对话语和多个模板进行匹配以计算每个模板的似然值。The electronic device can match the utterance to the plurality of templates to calculate a likelihood for each template.

附图说明Description of drawings

为了使本发明易于理解并付诸实现,现在将参考附图,为所示出的优选实施例提供参考。在附图中:In order that the present invention may be readily understood and carried into practice, reference will now be made to the preferred embodiments shown, with reference to the accompanying drawings. In the attached picture:

图1是根据本发明的无线电话的示意性框图。Figure 1 is a schematic block diagram of a radiotelephone according to the present invention.

图2是示出根据本发明的、用来提供、编辑和发送文本消息的方法的流程图。FIG. 2 is a flowchart illustrating a method for providing, editing and sending a text message according to the present invention.

图3是示出根据本发明的、用来提供备选消息模板列表的方法的流程图。FIG. 3 is a flowchart illustrating a method for providing a list of alternative message templates according to the present invention.

图4是示出根据本发明的、使用户可以编辑现有消息模板和在静态可编程存储器中保存新模板的方法的流程图。4 is a flowchart illustrating a method of enabling a user to edit existing message templates and save new templates in static programmable memory in accordance with the present invention.

具体实施方式Detailed ways

参考图1,示出了无线电话100,它包括连接到处理器110并与其通信的射频通信单元105。以显示屏115形式呈现的输入/输出(I/O)接口、键盘120、麦克风190以及扬声器195也被连接到处理器110并与其通信。Referring to FIG. 1, a radiotelephone 100 is shown which includes a radio frequency communication unit 105 connected to and in communication with a processor 110. An input/output (I/O) interface in the form of a display screen 115, a keyboard 120, a microphone 190, and a speaker 195 are also connected to and in communication with the processor 110.

处理器110包括编码器/解码器125以及与之相关联的存储数据的只读存储器(ROM)130,用于对可能由无线电话100发送或接收的话音或其它信号进行编码和解码。处理器110还包括微处理器135、随机存取存储器(RAM)150、静态可编程存储器155和可移动SIM模块160,其中微处理器135通过公共数据和地址总线140连接至编码器/解码器125及其相关联的字符只读存储器(ROM)145。静态可编程存储器155和SIM模块160都可以存储(在其它消息中)选中的呼入文本消息、电话本数据库以及呼出文本消息的模板(将在下文中详细描述)。微处理器135具有用于连接至键盘120、显示屏115和提示模块165的端口,其中提示模块165通常包含扬声器、振动马达和相应驱动器。字符只读存储器145存储用于对可由通信单元105接收且在键盘120上输入的文本消息进行解码或编码的代码。Processor 110 includes encoder/decoder 125 and associated read-only memory (ROM) 130 storing data for encoding and decoding voice or other signals that may be transmitted or received by radiotelephone 100 . Processor 110 also includes microprocessor 135, random access memory (RAM) 150, static programmable memory 155, and removable SIM module 160, wherein microprocessor 135 is connected to encoder/decoder via common data and address bus 140 125 and its associated character read only memory (ROM) 145. Both static programmable memory 155 and SIM module 160 can store (among other messages) selected templates for incoming text messages, phonebook databases, and outgoing text messages (described in more detail below). Microprocessor 135 has ports for connection to keyboard 120, display screen 115, and prompt module 165, wherein prompt module 165 typically contains a speaker, vibration motor and corresponding drivers. Character ROM 145 stores codes for decoding or encoding text messages that may be received by communication unit 105 and entered on keyboard 120 .

射频通信单元105是一个组合式接收器和发送器,其具有共用天线170。通信单元105具有经射频放大器180连接至天线170的收发器175。收发器175还连接至组合式调制器/解调器185,调制器/解调器1 85把通信单元2连接至处理器110。The RF communication unit 105 is a combined receiver and transmitter with a common antenna 170 . The communication unit 105 has a transceiver 175 connected to an antenna 170 via a radio frequency amplifier 180 . The transceiver 175 is also connected to a combined modulator/demodulator 185, which connects the communication unit 2 to the processor 110.

参考图2,该流程图示出了包括方法200的本发明的一个实施例,方法200通过采用无线电话100来提供、编辑和发送文本消息。在开始步骤205中调用方法200。在步骤210中,在诸如麦克风190的电话100的输入设备上接收话语。然后,处理器110在步骤215中对该话语波形进行采样和数字化,接着在步骤220中对其进行分段,然后在步骤225中对其进行处理以提供表示该波形的特征向量。应当注意,步骤215、步骤220和步骤225在本领域内是公知的,因此无需对此进行详细说明。Referring to FIG. 2 , a flowchart illustrates an embodiment of the present invention including a method 200 for providing, editing and sending text messages using the wireless telephone 100 . Method 200 is called in start step 205 . In step 210 utterance is received on an input device of phone 100 such as microphone 190 . Processor 110 then samples and digitizes the utterance waveform in step 215, then segments it in step 220, and then processes it in step 225 to provide feature vectors representing the waveform. It should be noted that step 215, step 220, and step 225 are well known in the art, and thus a detailed description thereof is not required.

接下来,在步骤230中,对在步骤225中产生的特征向量进行语音识别。由存储在设备100的静态可编程存储器155中的用户设定的消息模板引导该语音识别。消息模板在下文中将有详细说明。然后,方法200在步骤235中为用户提供文本消息。该消息可通过使用诸如设备100的显示屏115或扬声器195的I/O接口之一来向用户提供。Next, in step 230 , speech recognition is performed on the feature vector generated in step 225 . The speech recognition is guided by user-set message templates stored in the static programmable memory 155 of the device 100 . Message templates are described in detail below. The method 200 then provides a text message to the user in step 235 . The message may be provided to the user by using one of the I/O interfaces, such as display 115 or speaker 195 of device 100 .

向用户提供消息之后,接着用户在步骤240中能够决定是否编辑该消息。如果用户决定不对该消息进行编辑,则该消息在步骤245中以诸如SMS的消息格式发送出去。但是,如果用户在步骤240决定编辑该消息,则消息在步骤245中送出之前,先在步骤250中进行编辑。在本发明的各种实施例中,用户可以以几种不同的方式编辑消息,包括通过扬声器195的语音式的编辑或通过键盘120的键入式的编辑。然后,在步骤255结束方法200。After providing the message to the user, the user can then decide in step 240 whether to edit the message. If the user decides not to edit the message, the message is sent in step 245 in a message format such as SMS. However, if the user decides to edit the message at step 240, the message is edited at step 250 before it is sent at step 245. In various embodiments of the invention, the user may edit the message in several different ways, including voice-based editing through the speaker 195 or typing-in editing through the keyboard 120 . Method 200 then ends at step 255 .

在本发明的另外一种实施例中,在上述的语音识别步骤230之后,提供文本消息的步骤235可以包括向电话100的用户提供备选消息模板列表,用户可从中挑选出最适合于所希望的文本消息的模板。In another embodiment of the present invention, after the speech recognition step 230 described above, the step 235 of providing a text message may include providing the user of the phone 100 with a list of alternative message templates from which the user can select the most suitable template for the desired message. template for your text message.

图3是示出用来向用户提供这种备选模板列表的方法300的流程图。当用户向键盘120或麦克风190输入命令时,在开始步骤305中调用方法300。首先,方法300包括处理器110在步骤310中从可用消息模板列表中选出一个消息模板。然后,在步骤315中,把这个选中的模板和在方法200的步骤225中提供的特征向量进行比较。接着,处理器110在步骤320中计算似然值,估计选中的模板的各个方面和输入话语的特征向量之间的匹配度。FIG. 3 is a flowchart illustrating a method 300 for providing such a list of alternative templates to a user. Method 300 is invoked in start step 305 when a user enters a command into keyboard 120 or microphone 190 . First, the method 300 includes the processor 110 selecting a message template from a list of available message templates in step 310 . Then, in step 315 , this selected template is compared to the feature vector provided in step 225 of method 200 . Next, the processor 110 calculates a likelihood value in step 320 , estimating the matching degree between various aspects of the selected template and the feature vector of the input utterance.

然后,处理器110在步骤325中判断这个似然值是否高于一个已设的门限。这个门限可以由处理器110自动算出,或者由电话100的用户预先设定。如果选中的模板的似然值低于已设的门限,模板在步骤330中被拒绝。但是,如果选中模板的似然值高于已设的门限,那么在步骤335中,则认为该模板与输入话语之间有合理匹配度,并将其添加到备选模板列表中。无论选中的模板是否被拒绝或是添加到备选模板列表中,方法300都继续进行到步骤340,在该步骤中,处理器110判断所有可用的模板是否都已经评估过。如果并非所有可用的模板都被评估过,则方法300在步骤345挑选下一个消息模板并返回到步骤315,在步骤315中,将下一个模板与输入话语的特征向量进行比较。如果在步骤340中已评估过所有模板,则方法300继续进行到步骤350并向用户提供所有备选模板的列表。备选模板可以通过使用诸如设备100的显示屏115或扬声器195的I/O接口之一来向用户提供。然后,在步骤355结束方法300。Then, the processor 110 determines in step 325 whether the likelihood value is higher than a preset threshold. This threshold can be calculated automatically by the processor 110 or preset by the user of the phone 100 . If the likelihood value of the selected template is lower than the preset threshold, the template is rejected in step 330 . However, if the likelihood value of the selected template is higher than the preset threshold, then in step 335, it is considered that the template has a reasonable matching degree with the input utterance, and it is added to the list of candidate templates. Regardless of whether the selected template is rejected or added to the candidate template list, method 300 proceeds to step 340 where processor 110 determines whether all available templates have been evaluated. If not all available templates have been evaluated, method 300 picks the next message template at step 345 and returns to step 315 where the next template is compared to the feature vector of the input utterance. If all templates have been evaluated in step 340, method 300 proceeds to step 350 and provides the user with a list of all candidate templates. Alternate templates may be presented to the user by using one of the I/O interfaces such as display 115 or speaker 195 of device 100 . Method 300 then ends at step 355 .

根据本发明的一个实施例,电话100的用户不限定于只使用由设备100的制造商提供的模板。相反,设备100的用户可以编辑存储在静态可编程存储器155中的现有模板,以建立自己个性化的消息模板。According to one embodiment of the invention, the user of the phone 100 is not limited to using only templates provided by the manufacturer of the device 100 . Instead, a user of device 100 may edit existing templates stored in static programmable memory 155 to create his own personalized message templates.

参考图4,示出了使用户可以编辑现有模板并在静态可编程存储器155中保存新模板的方法400。当用户向键盘120或麦克风190输入命令时,在开始步骤405中调用方法400。在步骤410中,通过诸如显示屏115或扬声器195的I/O接口向设备100的用户提供现有模板的列表。接着,用户在步骤415中通过使用诸如麦克风190或键盘120的I/O接口来选择所需的消息模板。接下来,在步骤420中,用户再次通过使用诸如麦克风190或键盘120的I/O接口来编辑模板。最后,在步骤425中,用户在静态可编程存储器155中保存已编辑好的模板。然后,在步骤430结束方法400。Referring to FIG. 4 , a method 400 of enabling a user to edit existing templates and save new templates in static programmable memory 155 is shown. Method 400 is invoked in start step 405 when a user enters a command into keyboard 120 or microphone 190 . In step 410 , a user of device 100 is provided with a list of existing templates through an I/O interface such as display screen 115 or speaker 195 . Next, the user selects a desired message template by using an I/O interface such as microphone 190 or keyboard 120 in step 415 . Next, in step 420 , the user edits the template again by using an I/O interface such as the microphone 190 or the keyboard 120 . Finally, in step 425 , the user saves the edited template in the static programmable memory 155 . Then, the method 400 ends at step 430 .

编辑消息模板的其它方法也属于本发明的范围内,其中包括通过使用诸如USB线缆的通信信道把电话100连接至主机,随后把已编辑好的模板下载或补充(flash)至静态可编程存储器155。Other methods of editing message templates are also within the scope of the present invention, including by connecting the phone 100 to a host using a communication channel such as a USB cable, and subsequently downloading or flashing the edited template to static programmable memory 155.

本发明的方法可进一步包括包含固定和可变语言成分的消息模板。固定语言成分在用户选中一个模板并发送消息时不会发生改变。而可变语言成分可以在不同的消息中(from message to message)由用户修改。固定和可变语言成分的使用大大提高了电话100的有限处理功率和内存的使用效率(leverage)。The method of the present invention may further comprise message templates comprising fixed and variable language components. Fixed language components do not change when a user selects a template and sends a message. Variable language components can be modified by the user from message to message. The use of fixed and variable speech components greatly improves the leverage of the phone 100's limited processing power and memory.

例如,与见面请求相关的短文本消息的特定模板可能如下所示:“ Meet me at $PLACE  at $TIME.”。在这个句子中,固定语言成分由下划线标出,而可变语言成分则大写且以“$”开头。那么,模板的不同用户就可以编辑诸如$PLACE等变量以适应其特定的环境。例如,大学生可能把可变的$PLACE设定为:$PLACE=sp|library(图书馆)|dormitory(宿舍)|cafeteria(自助餐厅)等等而律师可能把可变的$PLACE设定为:$PLACE=sp|office(办公室)|courthouse(法院)|home(家)等等For example, a specific template for a short text message related to a meet and greet request might look like this: " Meet me at $PLACE at $TIME.". In this sentence, fixed language components are underlined, while variable language components are capitalized and start with "$". Different users of the template can then edit variables such as $PLACE to suit their particular environment. For example, a college student might set the variable $PLACE to: $PLACE = sp|library(library)|dormitory(dormitory)|cafeteria(cafeteria), etc. and a lawyer might set the variable $PLACE to: $PLACE=sp|office (office)|courthouse (court)|home (home) etc.

在上面的句子中,“sp”指停顿或无话音,而“|”指逻辑算子“OR”。In the above sentence, "sp" refers to pause or silence, and "|" refers to the logical operator "OR".

在本发明中可能用到的消息模板的另一示例是:“ Happy$FESTIVAL.”。这个句子中的可变语言成分$FESTIVAL可能会被用户编辑成:Another example of a message template that may be used in the present invention is: " Happy $FESTIVAL.". The variable linguistic component $FESTIVAL in this sentence might be edited by the user to:

$FESTIVAL=sp|birthday(生日)|new year(新年)|thanksgiving(感恩节)等等$FESTIVAL=sp|birthday(birthday)|new year(New Year)|thanksgiving(Thanksgiving) etc.

通过使用开放式的词汇语音识别,电话100能够识别出由用户输入的已编辑的可变语言成分。由于可变语言成分由变量的离散集组成,语音识别处理的开销和内存需求被缩减到最小。因此上述方法特别适用于只具有有限的处理和内存资源的设备,诸如移动电话。Using open-lexical speech recognition, the phone 100 is able to recognize edited variable language components entered by the user. Since the variable linguistic components consist of discrete sets of variables, the overhead and memory requirements of speech recognition processing are minimized. The method described above is therefore particularly suitable for devices with only limited processing and memory resources, such as mobile phones.

对包括固定和可变语言成分的模板的使用在几个方面提高了语音识别系统的效率。其一,由于在静态可编程存储器155中存储的模板的数量相对于与自然语言句子结构相关联的句子排列的几乎无限的数量来说是不多的,所以通常可以快速而有效地识别出特定模板中的固定语言成分。其二,由于消息模板中的可变语言成分的句中位置能够自动识别出可能答复语的离散集,所以也可以高效地识别出可变语言成分。例如,以上述的“ Happy$FESTIVAL”消息模板为例,固定语言成分“ Happy”可作为一个信号,使处理器110知道:麦克风190接收到的随后的话音输入将是可变语言成分“$FESTIVAL”。The use of templates comprising fixed and variable linguistic components increases the efficiency of speech recognition systems in several ways. First, since the number of templates stored in static programmable memory 155 is modest relative to the virtually infinite number of sentence permutations associated with natural language sentence structures, specific templates can often be identified quickly and efficiently. Fixed language components in templates. Second, since the sentence positions of the variable language components in the message template can automatically identify the discrete set of possible replies, the variable language components can also be efficiently identified. For example, taking the above-mentioned " Happy $FESTIVAL" message template as an example, the fixed language component " Happy " can be used as a signal to make the processor 110 know that the subsequent voice input received by the microphone 190 will be the variable language component "$FESTIVAL ".

虽然本发明的上述实施例是通过无线电话100来说明的,但是本发明的方法和装置也可以包括诸如数字个人助理(PDA)之类的可提供文本消息的其它电子设备。Although the above-described embodiment of the present invention is illustrated in terms of a wireless telephone 100, the methods and apparatus of the present invention may also include other electronic devices that can provide text messaging, such as a digital personal assistant (PDA).

因此,本发明简化了从便携式电子设备上提供和发送文本消息所需的步骤。文本消息可通过话音输入而不是从小型键盘键入字符的方式来提供。此外,本发明可以包括开放式词汇语音识别,以避免现有技术的封闭式词汇语音识别对内存的高度需求。开放式词汇语音识别采用独立于扬声器的子词(sub-word)声学模型,设计成涵盖一种语言的所有声音现象(occurrence)或音素(phoneme)。因此,用户不限定于只使用预先设定的词汇,也能够编辑如上所述的可变语言成分,以包括诸如人名和地名等词典里查不到的单词。因此本发明提供的文本消息可以高度地个性化。Thus, the present invention simplifies the steps required to provide and send text messages from a portable electronic device. Text messages may be provided by voice input rather than typing characters from a keypad. Furthermore, the present invention may include open-vocabulary speech recognition to avoid the high memory requirements of the prior art closed-vocabulary speech recognition. Open-vocabulary speech recognition employs speaker-independent sub-word acoustic models designed to cover all occurrences or phonemes of a language. Thus, the user is not limited to using only the pre-set vocabulary, but can also edit the variable language components as described above to include words that cannot be found in dictionaries, such as names of people and places. Thus the text messages provided by the present invention can be highly personalized.

上述的详细说明只提供了优选的示范性实施例,且不视作对本发明的范围、适用性或配置的限制。更合适的说,该优选的示范性实施例的详细说明给本领域的普通技术人员提供了实现本发明的优选的示范性实施例的指导性说明。应当理解,在不背离所附权利要求中所要求的本发明的精神和范围的前提下,可以对组件的功能和排列做出多种的改变。The foregoing detailed description provides preferred exemplary embodiments only, and is not to be considered as limiting the scope, applicability or configuration of the invention. Rather, the detailed description of the preferred exemplary embodiment provides those skilled in the art with an enabling description for implementing the preferred exemplary embodiment of the invention. It being understood that various changes may be made in the function and arrangement of components without departing from the spirit and scope of the invention as claimed in the appended claims.

Claims (9)

1.一种用来提供文本消息的方法,所述方法包括如下步骤:1. A method for providing text messages, said method comprising the steps of: 在电子设备的输入端接收话语;receiving speech at an input to an electronic device; 在由用户设定的消息模板所引导的所述话语上执行语音识别,所述消息模板存储在与所述电子设备相关联的存储器中,其中通过对所述话语和所述消息模板之一进行匹配的方式定义语音识别,以建立一个匹配模板;以及Speech recognition is performed on said utterance guided by a user-configured message template stored in a memory associated with said electronic device, wherein said utterance is performed on one of said utterance and said message template Matching means defining speech recognition to build a matching template; and 通过所述匹配模板提供文本消息,Provide a text message via said matching template, 其中,所述消息模板中至少有一个同时包含固定和可变的语言成分。Wherein, at least one of the message templates contains both fixed and variable language components. 2.如权利要求1所述的方法,其中所述文本消息是SMS消息。2. The method of claim 1, wherein the text message is an SMS message. 3.如权利要求1所述的方法,其进一步包括通过接收从所述电子设备的键盘键入的字符来编辑所述由用户设定的消息模板的步骤。3. The method of claim 1, further comprising the step of editing the message template set by the user by receiving characters typed from a keyboard of the electronic device. 4.如权利要求1所述的方法,其中所述文本消息的一部分是所述话语的转录。4. The method of claim 1, wherein the portion of the text message is a transcription of the utterance. 5.如权利要求1所述的方法,其中所述文本消息全部是所述话语的转录。5. The method of claim 1, wherein the text messages are all transcriptions of the utterances. 6.一种用于提供文本消息的电子设备,所述设备包括:6. An electronic device for providing text messages, said device comprising: 麦克风,用于接收话语;a microphone for receiving speech; 非易失性存储器,用于存储消息模板;以及non-volatile memory for storing message templates; and 处理器,用于进行由所述消息模板引导的所述话语的语音识别,所述处理器对所述话语和所述消息模板之一进行匹配以建立一个匹配模板,并通过所述匹配模板提供文本消息,a processor, configured to perform speech recognition of the utterance guided by the message template, the processor matches the utterance with one of the message templates to establish a matching template, and provides text message, 其中所述消息模板中至少有一个同时包含固定和可变的语言成分。Wherein at least one of the message templates contains both fixed and variable language components. 7.如权利要求6所述的设备,其中所述文本消息是SMS消息。7. The device of claim 6, wherein the text message is an SMS message. 8.如权利要求6所述的设备,其进一步包括一个用于编辑所述消息模板的键盘。8. The device of claim 6, further comprising a keyboard for editing said message template. 9.如权利要求6所述的设备,其中所述处理器对所述话语和多个所述消息模板进行匹配并计算每个所述消息模板的似然值。9. The device of claim 6, wherein said processor matches said utterance to a plurality of said message templates and calculates a likelihood value for each of said message templates.
CNB031249639A 2003-09-23 2003-09-23 Method and apparatus for providing text message Expired - Fee Related CN100353417C (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CNB031249639A CN100353417C (en) 2003-09-23 2003-09-23 Method and apparatus for providing text message
RU2006113581/09A RU2320082C2 (en) 2003-09-23 2004-09-17 Method and device for providing a text message
KR1020067005735A KR100759728B1 (en) 2003-09-23 2004-09-17 Method and apparatus for providing a text message
PCT/US2004/030553 WO2005031995A1 (en) 2003-09-23 2004-09-17 Method and apparatus for providing a text message
EP04784421A EP1665561A4 (en) 2003-09-23 2004-09-17 Method and apparatus for providing a text message

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB031249639A CN100353417C (en) 2003-09-23 2003-09-23 Method and apparatus for providing text message

Publications (2)

Publication Number Publication Date
CN1601548A CN1601548A (en) 2005-03-30
CN100353417C true CN100353417C (en) 2007-12-05

Family

ID=34383973

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB031249639A Expired - Fee Related CN100353417C (en) 2003-09-23 2003-09-23 Method and apparatus for providing text message

Country Status (5)

Country Link
EP (1) EP1665561A4 (en)
KR (1) KR100759728B1 (en)
CN (1) CN100353417C (en)
RU (1) RU2320082C2 (en)
WO (1) WO2005031995A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE60236627D1 (en) * 2002-03-27 2010-07-15 Nokia Corp Determination of a Distortion Measure in Speech Recognition
KR100805252B1 (en) 2005-06-27 2008-02-21 서울통신기술 주식회사 Call processing method of IP terminal and apparatus therefor
DE102007061156A1 (en) * 2007-12-17 2009-08-06 Vodafone Holding Gmbh Message transmission in telecommunication networks
KR101597286B1 (en) 2009-05-07 2016-02-25 삼성전자주식회사 Apparatus for generating avatar image message and method thereof
CN102263851A (en) * 2010-05-31 2011-11-30 北京迅捷英翔网络科技有限公司 Message conversion method
CN103366741B (en) * 2012-03-31 2019-05-17 上海果壳电子有限公司 Voice input error correction method and system
WO2014209157A1 (en) * 2013-06-27 2014-12-31 Obschestvo S Ogranichennoy Otvetstvennostiyu "Speaktoit" Generating dialog recommendations for chat information systems
US9185211B2 (en) 2013-11-08 2015-11-10 Sorenson Communications, Inc. Apparatuses and methods for operating a communication system in one of a tone mode and a text mode
US9473627B2 (en) 2013-11-08 2016-10-18 Sorenson Communications, Inc. Video endpoints and related methods for transmitting stored text to other video endpoints
KR101894928B1 (en) 2017-02-14 2018-09-05 (주)스톤아이 Bonus calculating apparatus using number of visit and method thereof
US11924149B2 (en) * 2020-10-15 2024-03-05 Google Llc Composition of complex content via user interaction with an automated assistant

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6526292B1 (en) * 1999-03-26 2003-02-25 Ericsson Inc. System and method for creating a digit string for use by a portable phone
CN1419686A (en) * 2000-10-30 2003-05-21 皇家菲利浦电子有限公司 User interface/entertainment equipment of imitating human interaction and loading relative external database using relative data
US20030139922A1 (en) * 2001-12-12 2003-07-24 Gerhard Hoffmann Speech recognition system and method for operating same

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4525793A (en) * 1982-01-07 1985-06-25 General Electric Company Voice-responsive mobile status unit
EP1172994B1 (en) * 1994-10-25 2008-07-30 Cisco Technology, Inc. Voice-operated services
JP3533051B2 (en) * 1996-08-21 2004-05-31 パイオニア株式会社 Telephone with automatic voice response function
US6173316B1 (en) * 1998-04-08 2001-01-09 Geoworks Corporation Wireless communication device with markup language based man-machine interface
RU13455U1 (en) * 1999-09-30 2000-04-10 Бурин Андрей Михайлович DEVICE FOR SENDING TEXT OF ELECTRONIC MAIL FROM ELECTRONIC MAILBOX TO A CELL PHONE OF A SUBSCRIBER AND FOR TRANSFER OF TEXT MESSAGES FROM A CELL PHONE OF A SUBSCRIBER TO AN ELECTRONIC ELECTRONIC
DE19959903A1 (en) * 1999-12-07 2001-06-13 Bruno Jentner Module for supporting text messaging communications in mobile radio networks uses text-to-speech converter for speech output, speech-to-text converter for speech input and detection
US6625474B1 (en) * 2000-04-11 2003-09-23 Motorola, Inc. Method and apparatus for audio signal based answer call message generation
KR20020028501A (en) * 2000-10-10 2002-04-17 김철권 Method for conversion between sound data and text data in network and apparatus thereof
WO2002077975A1 (en) * 2001-03-27 2002-10-03 Koninklijke Philips Electronics N.V. Method to select and send text messages with a mobile
US6895257B2 (en) * 2002-02-18 2005-05-17 Matsushita Electric Industrial Co., Ltd. Personalized agent for portable devices and cellular phone
US7072684B2 (en) * 2002-09-27 2006-07-04 International Business Machines Corporation Method, apparatus and computer program product for transcribing a telephone communication
US20040176139A1 (en) * 2003-02-19 2004-09-09 Motorola, Inc. Method and wireless communication device using voice recognition for entering text characters

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6526292B1 (en) * 1999-03-26 2003-02-25 Ericsson Inc. System and method for creating a digit string for use by a portable phone
CN1419686A (en) * 2000-10-30 2003-05-21 皇家菲利浦电子有限公司 User interface/entertainment equipment of imitating human interaction and loading relative external database using relative data
US20030139922A1 (en) * 2001-12-12 2003-07-24 Gerhard Hoffmann Speech recognition system and method for operating same

Also Published As

Publication number Publication date
RU2320082C2 (en) 2008-03-20
CN1601548A (en) 2005-03-30
KR100759728B1 (en) 2007-09-20
EP1665561A1 (en) 2006-06-07
WO2005031995A1 (en) 2005-04-07
EP1665561A4 (en) 2011-03-23
RU2006113581A (en) 2007-10-27
KR20060054469A (en) 2006-05-22

Similar Documents

Publication Publication Date Title
US6393403B1 (en) Mobile communication devices having speech recognition functionality
US6694295B2 (en) Method and a device for recognizing speech
EP1171870B1 (en) Spoken user interface for speech-enabled devices
US8577681B2 (en) Pronunciation discovery for spoken words
EP1600018B1 (en) Multimedia and text messaging with speech-to-text assistance
US7689417B2 (en) Method, system and apparatus for improved voice recognition
US6393304B1 (en) Method for supporting numeric voice dialing
US20030120493A1 (en) Method and system for updating and customizing recognition vocabulary
US6526292B1 (en) System and method for creating a digit string for use by a portable phone
CN111325039B (en) Language translation method, system, program and handheld terminal based on real-time call
JP2008529101A (en) Method and apparatus for automatically expanding the speech vocabulary of a mobile communication device
CN100353417C (en) Method and apparatus for providing text message
US8374862B2 (en) Method, software and device for uniquely identifying a desired contact in a contacts database based on a single utterance
EP1632934B1 (en) Baseband modem and method for speech recognition and mobile communication terminal using the same
AU760377B2 (en) A method and a system for voice dialling
US20050154587A1 (en) Voice enabled phone book interface for speaker dependent name recognition and phone number categorization
US20050131685A1 (en) Installing language modules in a mobile communication device
JP2005520194A (en) Generating text messages
CN111274828B (en) Language translation method, system, computer program and handheld terminal based on message leaving
JP4070963B2 (en) Mobile communication equipment
JP2003333203A (en) Speech synthesis system, server device, information processing method, recording medium, and program
GB2406471A (en) Mobile phone with speech-to-text conversion system
KR100817284B1 (en) Apparatus and method for providing sound effect of mobile communication terminal
KR20060063420A (en) Voice recognition method in portable terminal and portable terminal equipped with the same
EP1895748A1 (en) Method, software and device for uniquely identifying a desired contact in a contacts database based on a single utterance

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MOTOROLA MOBILE CO., LTD.

Free format text: FORMER OWNER: MOTOROLA INC.

Effective date: 20110110

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20110110

Address after: Illinois State

Patentee after: MOTOROLA MOBILITY, Inc.

Address before: Illinois, USA

Patentee before: Motorola, Inc.

C41 Transfer of patent application or patent right or utility model
C56 Change in the name or address of the patentee
CP01 Change in the name or title of a patent holder

Address after: Illinois State

Patentee after: MOTOROLA MOBILITY LLC

Address before: Illinois State

Patentee before: MOTOROLA MOBILITY, Inc.

TR01 Transfer of patent right

Effective date of registration: 20160523

Address after: California, USA

Patentee after: Google Technology Holdings LLC

Address before: Illinois State

Patentee before: MOTOROLA MOBILITY LLC

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20071205

Termination date: 20150923

EXPY Termination of patent right or utility model