
CN107705788A - Voice instruction verification method and intelligent terminal - Google Patents

Voice instruction verification method and intelligent terminal

Info

Publication number
CN107705788A
Authority
CN
China
Prior art keywords
voice
intelligent terminal
instruction
verification information
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710909063.2A
Other languages
Chinese (zh)
Inventor
王文斌
李承敏
包振毅
周金虎
禹杰
李亚军
林虎
安丽朋
黄俊岚
叶巧莉
余登武
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Wind Communication Technologies Co Ltd
Original Assignee
Shanghai Wind Communication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Wind Communication Technologies Co Ltd
Priority to CN201710909063.2A
Publication of CN107705788A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1822 Parsing for meaning understanding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The embodiments of the present invention relate to the field of artificial intelligence and disclose a voice instruction verification method and an intelligent terminal. The verification method is applied to a first intelligent terminal and includes: after receiving a voice instruction issued by a user to instruct an operation of a second intelligent terminal, making a voice response to the voice instruction. The voice response carries verification information that only devices can recognize, and the second intelligent terminal uses this verification information to detect whether it has recognized the voice instruction correctly. The second intelligent terminal can thus effectively check its recognition of the voice instruction and correctly execute the operation that the instruction indicates.

Description

Voice instruction checking method and intelligent terminal
Technical Field
The embodiment of the invention relates to the field of artificial intelligence, in particular to a voice instruction verification method and an intelligent terminal.
Background
Artificial Intelligence (AI) is a new technical science that studies and develops theories, methods, techniques, and application systems for simulating, extending, and expanding human intelligence. It attempts to understand the essence of intelligence and to produce a new kind of intelligent machine that can react in a manner similar to human intelligence; research in the field includes robotics, language recognition, image recognition, natural language processing, and expert systems. Since the birth of artificial intelligence, its theories and technologies have matured steadily and its application fields have expanded continuously, so it can be expected that the technology products brought by artificial intelligence in the future will be 'containers' of human intelligence. Against this background, people's demand for smart devices keeps growing, smart devices are gradually becoming smaller and more functionally integrated, and various wearable smart devices, such as smart watches and smart glasses, have begun to enter daily life. People's daily lives are becoming increasingly inseparable from smart devices.
However, the inventors found that at least the following problem exists in the prior art: as the number of smart devices increases and they play a growing role in daily life, smart devices often make errors when recognizing voice instructions issued by users, so that the operations indicated by the voice instructions are executed incorrectly.
Disclosure of Invention
The embodiment of the invention aims to provide a voice instruction checking method and an intelligent terminal, so that the intelligent terminal can effectively detect whether the recognition of a voice instruction is correct or not, and accordingly, the operation indicated by the voice instruction is correctly executed.
In order to solve the above technical problem, an embodiment of the present invention provides a method for verifying a voice instruction, which is applied to a first intelligent terminal, and includes:
after a voice instruction issued by a user to instruct an operation of a second intelligent terminal is received, making a voice response to the voice instruction, wherein the voice response carries verification information that only devices can recognize, and the second intelligent terminal uses the verification information to detect whether it has recognized the voice instruction correctly.
The embodiment of the invention also provides a voice instruction checking method, which is applied to a second intelligent terminal and comprises the following steps:
after receiving a voice instruction which is sent by a user and used for indicating the operation of the second intelligent terminal, recognizing the voice instruction;
after receiving a voice response made by the first intelligent terminal to the voice instruction, detecting whether the voice instruction has been recognized correctly according to the verification information carried in the voice response, which only devices can recognize.
The embodiment of the invention also provides an intelligent terminal, which comprises: at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the above voice instruction verification method.
Compared with the prior art, in the embodiments of the present invention, after a voice instruction issued by the user to instruct an operation of the second intelligent terminal is received, a voice response is made to the voice instruction. The voice response made by the first intelligent terminal carries verification information that only devices can recognize, and the second intelligent terminal uses this verification information to detect whether it has recognized the voice instruction correctly. Because the verification information is carried in the voice response in the manner of a hidden code that only devices can recognize, the second intelligent terminal can effectively detect, from the received verification information, whether its recognition of the voice instruction is correct, and thus correctly execute the operation indicated by the voice instruction.
In addition, before the voice response is made to the voice instruction, the method further includes: performing semantic recognition on the voice instruction; and, according to the recognized operation that the user instructs the second intelligent terminal to perform, generating a coded instruction for controlling the second intelligent terminal to perform the operation, and using the coded instruction as the verification information. Using the coded instruction as the verification information makes the verification information more targeted: it is another expression of the operation indicated by the voice instruction, one that only devices can recognize and that does not affect what a person hears. This helps the second intelligent terminal effectively detect whether it has recognized the voice instruction correctly while also improving the user experience.
In addition, the verification information is carried in the voice response as follows: the verification information is converted into ultrasonic or infrasonic waves according to a preset coding rule, and the converted ultrasonic or infrasonic waves are superimposed onto the voice response. Converting the verification information into sound waves carried in the voice response lets the second intelligent terminal detect the verification information reliably and thereby check whether it has recognized the voice instruction correctly. The second intelligent terminal can obtain the verification information from the voice response without affecting the voice response heard by the user, which improves the user experience.
In addition, when verification error information fed back by the second intelligent terminal is received, prompt information is sent to the user. The verification error information indicates that the second intelligent terminal has recognized the voice instruction incorrectly, and the prompt information asks the user to issue the instruction again. Sending this prompt lets the user reissue the instruction promptly, so that the operation the user wants is carried out.
Drawings
One or more embodiments are illustrated by way of example in the accompanying drawings, in which like reference numerals refer to similar elements; the figures are not to scale unless otherwise specified.
FIG. 1 is a flow chart illustrating a method for verifying a voice command according to a first embodiment of the present invention;
FIG. 2 is a flow chart illustrating a method for verifying a voice command according to a third embodiment of the present invention;
FIG. 3 is a schematic structural diagram of an intelligent terminal according to a fourth embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions, and advantages of the embodiments of the present invention clearer, the embodiments are described in detail below with reference to the accompanying drawings. Those of ordinary skill in the art will appreciate that numerous technical details are set forth in the embodiments to help the reader better understand the present application. However, the technical solutions claimed in the present application can be implemented even without these technical details and with various changes and modifications based on the following embodiments.
The first embodiment of the invention relates to a voice instruction verification method applied to a first intelligent terminal. The core of this embodiment is as follows: after a voice instruction issued by a user to instruct an operation of a second intelligent terminal is received, a voice response is made to the voice instruction. The voice response carries verification information that only devices can recognize, and the second intelligent terminal uses the verification information to detect whether it has recognized the voice instruction correctly, so that the operation indicated by the voice instruction is executed correctly. Implementation details of the verification method of this embodiment are described below; these details are provided only to aid understanding and are not required for implementing the solution.
In this embodiment, a method for verifying a voice instruction is applied to a first intelligent terminal, and a specific flow is shown in fig. 1, where the method includes:
Step 101: receiving a voice instruction issued by the user to instruct an operation of the second intelligent terminal.
Specifically, the first intelligent terminal receives a voice instruction issued by the user to instruct an operation of the second intelligent terminal. The first intelligent terminal may be a smartphone, a tablet computer, a smart wearable device, or the like, and the second intelligent terminal may be a smart device such as a smart door or window, a smart desk lamp, or a smart refrigerator. In this embodiment, a smartphone is used as an example of the first intelligent terminal and a smart desk lamp as an example of the second intelligent terminal, but practical applications are not limited to these. For example, the user's mobile phone receives a voice instruction issued by the user to instruct the smart desk lamp to turn on; that is, the mobile phone receives the user's "turn on the lamp" voice instruction.
Step 102: performing semantic recognition on the voice instruction.
Specifically, the first intelligent terminal performs semantic recognition on the voice instruction; that is, it must understand the specific meaning of the received instruction, which it can do based on the rules and corpus stored in the first intelligent terminal. For example, the received instruction can be segmented into several related semantic components using a semantic segmentation rule, and the internal semantic representation corresponding to each component can then be found according to a semantic conversion rule (such as keyword matching). The first intelligent terminal establishes the logical relations among the semantic components by testing the relations between them, forming a semantic recognition of the full text; this processing is carried out mainly by looking up a semantic dictionary. In addition, while performing semantic recognition, the first intelligent terminal may store each received instruction together with its recognized semantics in a one-to-one correspondence. When the first intelligent terminal later receives an instruction, it can first try to match it against the stored instructions; if the instruction matches a stored one exactly, the corresponding semantics can be obtained directly, which improves the efficiency of semantic recognition.
Furthermore, semantic recognition of the received voice instruction may draw on recent results in artificial intelligence, namely multi-path dynamic selection and automatic knowledge-base reorganization, to convert the different instructions received by the first intelligent terminal on different planes. In this way the grammatical, semantic, and common-sense planes can be linked into an organic whole, retaining the advantages of traditional systems while giving the system a self-growing capability. Of course, semantic recognition may also be based on statistics, artificial neural networks, and the like; any method of performing semantic recognition on the received instruction falls within the protection scope of this embodiment and is not described in detail here.
Step 103: a voice response is made to the voice command, the voice response carrying verification information for device identification only.
Specifically, the first intelligent terminal makes a voice response to the voice command, and the voice response carries verification information only used for equipment identification. Before making a voice response to the voice command, the first intelligent terminal can generate a coding command for controlling the second intelligent terminal to execute the operation according to the operation of the second intelligent terminal indicated by the identified user; and taking the coding instruction as check information. The person skilled in the art can implement by computer program, according to the operation of the second intelligent terminal indicated by the identified user, to generate coded instructions for controlling the second intelligent terminal to execute the operation, and therefore, the detailed description is omitted here. The coding instruction for controlling the second intelligent terminal to execute the operation is used as the verification information, so that the verification information is more pertinent, namely the verification information is another expression mode of the operation indicated by the voice instruction, and whether the voice instruction is correctly identified by the second intelligent terminal is effectively detected.
In practical applications, the first intelligent terminal may also compute a check code of the coded instruction according to a preset check algorithm and use the check code as the verification information. The preset check algorithm may be parity check, Hamming check, or cyclic redundancy check.
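As a rough illustration of two of the check algorithms named above, the sketch below computes an even-parity bit and a CRC-32 value for a coded instruction held as a bit string. The bit-string representation, the function names, and the choice of CRC-32 over the ASCII bytes are assumptions for the example; the patent does not specify a particular parity convention or CRC polynomial.

```python
import binascii


def parity_bit(bits: str) -> str:
    """Even-parity bit: '1' if the count of 1s is odd, otherwise '0'."""
    return str(bits.count("1") % 2)


def crc32_check(bits: str) -> int:
    """CRC-32 over the ASCII bytes of the bit string (an illustrative choice)."""
    return binascii.crc32(bits.encode("ascii"))


coded_instruction = "1010"                   # example value used in the text
check_info = parity_bit(coded_instruction)   # check code used as verification information
```

A single parity bit only detects an odd number of flipped bits, whereas a cyclic redundancy check detects many more error patterns at the cost of a longer check code.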
In the embodiment of the present invention, the verification information can be carried in the voice response as follows: the verification information is converted into ultrasonic or infrasonic waves according to a preset coding rule, and the converted ultrasonic or infrasonic waves are superimposed onto the voice response.
For example, the coded instruction generated by the first intelligent terminal for controlling the second intelligent terminal to perform the operation indicated by the voice instruction may be expressed in binary as 1010. Suppose the coded instruction generated by the user's smartphone for controlling the smart desk lamp to perform the "turn on" operation is 1010, and 1010 is used as the verification information. The verification information is converted into infrasonic waves according to a preset coding rule, which a person skilled in the art can set according to actual needs. For example, the preset coding rule may be: "1" corresponds to a frequency of 19 Hz, and "0" corresponds to a frequency of 17 Hz. Under this rule the verification information, i.e. the coded instruction 1010, is converted into infrasonic waves with frequencies of 19 Hz, 17 Hz, 19 Hz, and 17 Hz in sequence. The converted infrasonic waves are then superimposed onto the voice response, which is a piece of voice information; for example, if the smartphone's voice response to the "turn on the lamp" instruction is "good", the infrasonic waves are superimposed onto the voice response "good". From the "good" response the user can confirm that the mobile phone has successfully recognized the voice instruction. Because the verification information superimposed on the voice response is infrasonic, with a frequency below 20 Hz and therefore outside the range the human ear can perceive, the user cannot perceive the verification information, and only devices can recognize it. In this way, verification information that only devices can recognize is carried in the voice response.
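The coding rule in this example (19 Hz for "1", 17 Hz for "0") can be sketched as a simple frequency-keyed tone generator whose output is added to the voice samples. The sample rate, the duration of each bit, the amplitude, and the silent placeholder standing in for the spoken response are all assumptions for illustration; the patent gives only the two frequencies.

```python
import math

SAMPLE_RATE = 8000               # samples per second (assumed)
BIT_DURATION = 0.5               # seconds per bit (assumed; not given in the text)
FREQ = {"1": 19.0, "0": 17.0}    # coding rule from the example, in Hz


def encode_bits(bits: str, amplitude: float = 0.2) -> list:
    """Convert each bit into a sine tone at the frequency the rule assigns to it."""
    n = int(SAMPLE_RATE * BIT_DURATION)
    samples = []
    for bit in bits:
        f = FREQ[bit]
        samples.extend(amplitude * math.sin(2 * math.pi * f * i / SAMPLE_RATE)
                       for i in range(n))
    return samples


def superimpose(voice: list, tone: list) -> list:
    """Add the tone to the voice sample by sample, padding the shorter with silence."""
    length = max(len(voice), len(tone))
    voice = voice + [0.0] * (length - len(voice))
    tone = tone + [0.0] * (length - len(tone))
    return [v + t for v, t in zip(voice, tone)]


voice_response = [0.0] * (2 * SAMPLE_RATE)   # silent placeholder for the spoken reply
carrier = encode_bits("1010")                # tones at 19, 17, 19, 17 Hz in sequence
mixed = superimpose(voice_response, carrier)
```

In a real device the speaker and microphone would also need to reproduce and capture such low frequencies, which this sketch does not model.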
It should be noted that this embodiment takes conversion of the verification information into infrasonic waves as an example; in practical applications the verification information may also be converted into ultrasonic waves in a similar way, which is not described again here. Likewise, the coded instruction is expressed in binary in this embodiment, but in practical applications it may be expressed in other common number systems such as decimal or hexadecimal.
In this way, the verification information is converted into sound waves carried in the voice response, so that the second intelligent terminal can reliably detect, based on the verification information in the voice response, whether it has recognized the voice instruction correctly.
It should also be noted that when the first intelligent terminal receives verification error information fed back by the second intelligent terminal, it sends prompt information to the user. The verification error information indicates that the second intelligent terminal has recognized the voice instruction incorrectly, and the prompt information asks the user to issue the instruction again. For example, the smart desk lamp may recognize the user's voice instruction incorrectly, causing a verification error, and then feed the verification error information back to the user's mobile phone; on receiving it, the smartphone sends prompt information to the user.
Compared with the prior art, in this embodiment, after a voice instruction issued by the user to instruct an operation of the second intelligent terminal is received, a voice response is made to the voice instruction. The voice response made by the first intelligent terminal carries verification information that only devices can recognize, in the manner of a hidden code. The verification information is converted into ultrasonic or infrasonic waves according to a preset coding rule, and the converted waves are superimposed onto the voice response. Since only devices can recognize the verification information carried in the voice response, the second intelligent terminal can effectively detect, from the recognized verification information, whether it has recognized the voice instruction correctly, and thus correctly execute the operation indicated by the voice instruction. When the second intelligent terminal has recognized the voice instruction incorrectly, prompt information is sent to the user so that the user can reissue the instruction promptly and the desired operation is carried out.
The second embodiment of the invention relates to a voice instruction verification method applied to a first intelligent terminal. The second embodiment is substantially the same as the first, and the main difference is as follows. In the first embodiment, the verification information is converted into ultrasonic or infrasonic waves according to a preset coding rule, and the converted waves are superimposed onto the voice response, so that verification information that only devices can recognize is carried in the voice response. In the present embodiment, the verification information is converted into sound waves according to a preset coding rule, and the sound waves are divided into L sub-sound waves; one sub-sound wave is inserted into the voice response in sequence every first preset duration. The duration of each sub-sound wave is less than 1/N of the first preset duration, where N is greater than a preset threshold, so that verification information that only devices can recognize is carried in the voice response.
In the second embodiment, carrying device-only verification information in the voice response can be understood as follows. The coded instruction generated by the first intelligent terminal for controlling the second intelligent terminal to perform the operation indicated by the voice instruction may be expressed in binary as 1010. Suppose the coded instruction generated by the user's smartphone for controlling the smart desk lamp to perform the "turn on" operation is 1010, and 1010 is used as the verification information. The verification information is converted into sound waves according to a preset coding rule; the sound waves may be ultrasonic waves, infrasonic waves, or sound waves within the range of human hearing, and the coding rule can be set by a person skilled in the art according to actual needs. For example, the preset coding rule may be: "1" corresponds to a frequency of 500 Hz, and "0" corresponds to a frequency of 700 Hz. Under this rule the verification information, i.e. the coded instruction 1010, is converted into sound waves with frequencies of 500 Hz, 700 Hz, 500 Hz, and 700 Hz in sequence. The sound waves are divided into L sub-sound waves; for example, they may be divided into 5 sub-sound waves, each lasting 0.1 millisecond. One sub-sound wave is inserted into the voice response in sequence every first preset duration, where the duration of each sub-sound wave is less than 1/N of the first preset duration and N is greater than a preset threshold. If the duration of a sub-sound wave is much shorter than the first preset duration, a person cannot tell the voice response with the sub-sound waves inserted from the voice response without them, and only devices can recognize the inserted sub-sound waves.
Thus, N may be set to 1000: if the duration of a sub-sound wave is less than 1/1000 of the first preset duration, the sub-sound wave can be considered much shorter than the first preset duration, and only devices can recognize the sub-sound waves inserted in the voice response.
The voice response is a piece of voice information; for example, the smartphone's voice response to the "turn on the lamp" instruction is "good", lasting 2 seconds. The first preset duration may be set by a person skilled in the art according to actual requirements. For example, to insert 5 sub-sound waves in sequence into a voice response lasting 2 seconds, the first preset duration may be set to 0.4 second, so that one sub-sound wave is inserted every 0.4 second. Since each sub-sound wave lasts 0.1 millisecond, which is far shorter than the first preset duration of 0.4 second, the inserted sub-sound waves are too brief for a person to notice, while a smart device can readily detect them. In this way, verification information that only devices can recognize is carried in the voice response.
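The timing constraint in this example can be checked with a short sketch that computes where each sub-sound wave would be placed. Placing the k-th sub-sound wave at the start of the k-th interval, and deriving the first preset duration as the response length divided by L, are assumptions for illustration, as are the function and parameter names.

```python
def insertion_schedule(response_s, sub_wave_count, sub_wave_s, n_threshold=1000):
    """Return (start, end) times in seconds for each inserted sub-sound wave."""
    interval_s = response_s / sub_wave_count     # first preset duration
    # Constraint from the text: sub-wave duration < first preset duration / N.
    if not sub_wave_s < interval_s / n_threshold:
        raise ValueError("sub-sound wave is too long relative to the interval")
    return [(k * interval_s, k * interval_s + sub_wave_s)
            for k in range(sub_wave_count)]


# Values from the example: 2 s response, L = 5 sub-sound waves of 0.1 ms each.
slots = insertion_schedule(response_s=2.0, sub_wave_count=5, sub_wave_s=0.0001)
```

With these values the interval is 0.4 second and the limit is 0.4 ms, so a 0.1 ms sub-sound wave satisfies the condition with N = 1000, whereas a 1 ms sub-sound wave would be rejected.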
It should be noted that this embodiment takes conversion of the verification information into sound waves within the range of human hearing as an example; in practical applications the verification information may also be converted into ultrasonic or infrasonic waves in a similar way, which is not described again here. Likewise, the coded instruction is expressed in binary in this embodiment, but in practical applications it may be expressed in other common number systems such as decimal or hexadecimal.
In this way, the verification information is converted into sound waves carried in the voice response, so that the second intelligent terminal can reliably detect, based on the verification information in the voice response, whether it has recognized the voice instruction correctly.
Compared with the prior art, in this embodiment, after a voice instruction issued by the user to instruct an operation of the second intelligent terminal is received, a voice response is made to the voice instruction. The voice response made by the first intelligent terminal carries verification information that only devices can recognize, in the manner of a hidden code. The verification information is converted into sound waves according to a preset coding rule, the sound waves are divided into L sub-sound waves, and one sub-sound wave is inserted into the voice response in sequence every first preset duration. Because the verification information is added to the voice response in this way, only devices can recognize it, and the second intelligent terminal can effectively detect, from the recognized verification information, whether it has recognized the voice instruction correctly, and thus correctly execute the operation indicated by the voice instruction. When the second intelligent terminal has recognized the voice instruction incorrectly, prompt information is sent to the user so that the user can reissue the instruction promptly and the desired operation is carried out.
A third embodiment of the invention relates to a voice instruction verification method applied to a second intelligent terminal. The third embodiment is substantially the same as the first embodiment; the main difference is as follows. In the first embodiment, the verification method is applied to a first intelligent terminal: after receiving a voice instruction issued by the user to instruct an operation of the second intelligent terminal, the first intelligent terminal makes a voice response to the voice instruction, the voice response carries verification information intended only for device recognition, and the second intelligent terminal uses the verification information to detect whether it has recognized the voice instruction correctly. In this embodiment, the verification method is applied to the second intelligent terminal: after receiving the voice instruction issued by the user to instruct its operation, the second intelligent terminal recognizes the voice instruction; then, after receiving the voice response made by the first intelligent terminal to the voice instruction, it detects whether its recognition is correct according to the verification information, intended only for device recognition, that is carried in the voice response.
In this embodiment, the voice instruction verification method is applied to the second intelligent terminal. The specific flow is shown in Fig. 2 and includes:
Step 201: Receive a voice instruction issued by the user to instruct an operation of the second intelligent terminal.
Specifically, the second intelligent terminal receives a voice instruction issued by the user to instruct an operation of the second intelligent terminal. The second intelligent terminal may be a smart device such as an intelligent door or window, an intelligent desk lamp, or an intelligent refrigerator. This embodiment takes an intelligent desk lamp as the second intelligent terminal, but practical applications are not limited to this. For example, when the user issues a voice instruction telling the intelligent desk lamp to turn on, the user's mobile phone receives that instruction, and the intelligent desk lamp also receives the same "turn on the lamp" voice instruction.
Step 202: Perform semantic recognition on the voice instruction.
Specifically, the second intelligent terminal performs semantic recognition on the voice instruction. The semantic recognition performed by the second intelligent terminal in step 202 is substantially the same as that performed by the first intelligent terminal in step 102 of the first embodiment, so it is not described again here to avoid repetition.
Step 203: Receive the voice response made by the first intelligent terminal to the voice instruction.
Specifically, the second intelligent terminal receives the voice response made by the first intelligent terminal to the voice instruction. Continuing the example of the first embodiment, with a smartphone as the first intelligent terminal and an intelligent desk lamp as the second, the desk lamp receives the voice response that the phone makes to the "turn on the lamp" voice instruction, and that voice response carries verification information intended only for device recognition.
Step 204: Detect whether the voice instruction has been recognized correctly. If so, go to step 205; if not, go to step 206.
Specifically, the second intelligent terminal detects whether it has recognized the voice instruction correctly according to the verification information, intended only for device recognition, that is carried in the voice response. If the verification information is the coding instruction of step 103, then after recognizing the voice instruction issued by the user, the second intelligent terminal can generate, from the recognized operation that the user instructed, a coding instruction for controlling the second intelligent terminal to execute that operation. The second intelligent terminal compares the coding instruction it generated with the coding instruction recovered from the verification information carried in the first intelligent terminal's voice response. If the two coding instructions are the same, the second intelligent terminal's recognition of the voice instruction is considered correct, and the flow proceeds to step 205 to execute the voice instruction. If they differ, the recognition is considered incorrect, and the flow proceeds to step 206.
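The comparison in step 204 can be sketched as below. The code table, its operations, and its bit patterns are hypothetical; the sketch only assumes that both terminals share the same preset mapping from operations to coding instructions.

```python
# Hypothetical mapping from recognized operations to coding instructions.
# The operations and code values are assumptions for illustration only.
CODE_TABLE = {
    "turn on the lamp": "10100001",
    "turn off the lamp": "10100000",
}


def encode_operation(operation: str) -> str:
    """Generate the coding instruction for a locally recognized operation."""
    return CODE_TABLE[operation]


def recognition_is_correct(local_operation: str, received_code: str) -> bool:
    """Compare the locally generated code with the one decoded from the reply."""
    return encode_operation(local_operation) == received_code
```

If the desk lamp misheard the instruction as "turn off the lamp" while the phone's reply carries the code for "turn on the lamp", the two codes differ and the check fails, which is the case that leads to step 206.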
It should be noted that if the verification information of the first intelligent terminal is a check code calculated according to a preset check algorithm, the second intelligent terminal must calculate its check code with the same check algorithm when detecting whether the voice instruction has been recognized correctly.
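The requirement that both sides use the same check algorithm can be illustrated with two of the algorithms named in the claims, parity check and cyclic redundancy check. This is a sketch under assumptions: the patent does not specify a CRC polynomial or code width, so Python's standard `zlib.crc32` stands in here, and the function names are hypothetical.

```python
import zlib


def parity_bit(code: bytes) -> int:
    """Even-parity bit: 1 if the number of set bits is odd, else 0."""
    ones = sum(bin(byte).count("1") for byte in code)
    return ones % 2


def crc_check_code(code: bytes) -> int:
    """CRC-32 of the coding instruction, as computed by zlib (a stand-in)."""
    return zlib.crc32(code)


def verify(local_code: bytes, received_check: int, algorithm) -> bool:
    """Recompute the check code locally with the agreed algorithm and compare."""
    return algorithm(local_code) == received_check
```

The second terminal would call `verify` with the coding instruction it generated itself and the check code decoded from the voice response. A check code computed with one algorithm is meaningless to a receiver that applies a different one, which is why the algorithm has to be preset on both terminals.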
Step 205: Execute the voice instruction.
For example, when the intelligent desk lamp detects that it has correctly recognized the "turn on the lamp" voice instruction, it executes the voice instruction, that is, it turns on the lamp.
Step 206: Feed back verification error information to the first intelligent terminal.
For example, if the intelligent desk lamp has recognized the "turn on the lamp" voice instruction incorrectly, it feeds back verification error information to the user's smartphone, indicating that the desk lamp recognized the voice instruction incorrectly. The phone can then prompt the user to issue the voice instruction again, so that the user's instruction is executed promptly.
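The branch between steps 205 and 206 can be summarized in a short sketch. The names `Outcome` and `dispatch` and the callback-based design are assumptions for illustration; how the error is actually fed back to the first intelligent terminal is not specified in this section.

```python
from enum import Enum
from typing import Callable


class Outcome(Enum):
    EXECUTED = "executed"
    ERROR_REPORTED = "error reported"


def dispatch(recognition_correct: bool,
             execute: Callable[[], None],
             report_error: Callable[[], None]) -> Outcome:
    """Act on the result of step 204."""
    if recognition_correct:
        execute()            # step 205: carry out the voice instruction
        return Outcome.EXECUTED
    report_error()           # step 206: feed back verification error information
    return Outcome.ERROR_REPORTED
```

Passing the two actions in as callbacks keeps the decision logic independent of the device: a desk lamp would supply a function that switches the lamp on, while another terminal would supply its own operation.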
Compared with the prior art, in this embodiment the second intelligent terminal recognizes the voice instruction after receiving the voice instruction issued by the user to instruct its operation; then, after receiving the voice response made by the first intelligent terminal to the voice instruction, it detects whether its recognition is correct according to the verification information, intended only for device recognition, that is carried in the voice response. Because the verification information is in effect carried in the voice response as a hidden code that only a device can recognize, the second intelligent terminal can effectively detect from the received verification information whether its recognition of the voice instruction is correct, and thus correctly execute the operation indicated by the voice instruction. When the recognition is detected to be incorrect, verification error information is fed back to the first intelligent terminal, so that the user can be reminded promptly to reissue the instruction and the operation the user wants is carried out.
The steps of the above methods are divided only for clarity of description. In implementation, steps may be combined into a single step, or a step may be split into several steps; as long as the same logical relationship is preserved, such variants fall within the protection scope of this patent. Adding insignificant modifications to the algorithm or flow, or introducing insignificant design changes, without changing the core design of the algorithm and flow, also falls within the protection scope of this patent.
A fourth embodiment of the present invention relates to an intelligent terminal, whose schematic structure is shown in Fig. 3. The intelligent terminal includes: at least one processor 301; and a memory 302 communicatively coupled to the at least one processor 301. The memory 302 stores instructions executable by the at least one processor 301, and the instructions are executed by the at least one processor 301 so that the at least one processor 301 can perform the voice instruction verification method of the above method embodiments.
The memory 302 and the processor 301 are connected by a bus. The bus may comprise any number of interconnected buses and bridges, which link together various circuits of the one or more processors 301 and the memory 302. The bus may also connect various other circuits, such as peripherals, voltage regulators, and power management circuits; these are well known in the art and are therefore not described further here. A bus interface provides an interface between the bus and a transceiver. The transceiver may be a single element or a plurality of elements, such as a plurality of receivers and transmitters, providing a means for communicating with various other apparatus over a transmission medium. Data processed by the processor 301 is transmitted over a wireless medium via an antenna; the antenna also receives data and passes it to the processor 301.
The processor 301 is responsible for managing the bus and for general processing, and may also provide various functions including timing, peripheral interfaces, voltage regulation, power management, and other control functions. The memory 302 may be used to store data used by the processor 301 in performing operations.
It will be understood by those of ordinary skill in the art that the foregoing embodiments are specific examples of carrying out the invention, and that, in practical applications, various changes in form and detail may be made to them without departing from the spirit and scope of the invention.

Claims (10)

1. A voice instruction verification method, applied to a first intelligent terminal, characterized by comprising:
after receiving a voice instruction issued by a user to instruct an operation of a second intelligent terminal, making a voice response to the voice instruction, wherein the voice response carries verification information intended only for device recognition, and the verification information is used by the second intelligent terminal to detect whether it has recognized the voice instruction correctly.
2. The method for verifying the voice command of claim 1, further comprising, before said making the voice response to the voice command:
performing semantic recognition on the voice instruction;
generating a coding instruction for controlling the second intelligent terminal to execute the operation according to the identified operation of the second intelligent terminal indicated by the user;
and taking the coding instruction as the verification information.
3. The method for verifying the voice command of claim 1, further comprising, before said making the voice response to the voice command:
performing semantic recognition on the voice instruction;
generating a coding instruction for controlling the second intelligent terminal to execute the operation according to the identified operation of the second intelligent terminal indicated by the user;
calculating a check code of the coding instruction according to a preset check algorithm;
and taking the check code as the verification information.
4. The method for verifying the voice command according to claim 3, wherein the check algorithm is any one of the following:
parity check, Hamming check, cyclic redundancy check.
5. The method of claim 1, wherein the verification information is carried in the voice reply by:
converting the verification information into ultrasonic waves or infrasonic waves according to a preset coding rule;
and superposing the converted ultrasonic waves or infrasonic waves into the voice response.
6. The method of claim 1, wherein the verification information is carried in the voice reply by:
converting the verification information into sound waves according to a preset coding rule, and dividing the sound waves into L sub-sound waves;
inserting the sub-sound waves into the voice response in sequence at intervals of a first preset duration, wherein the duration of each sub-sound wave is less than 1/N of the first preset duration, and N is greater than a preset threshold.
7. The method for verifying the voice command according to claim 1, further comprising:
sending prompt information to a user when the verification error information fed back by the second intelligent terminal is received; the verification error information is used for indicating that the second intelligent terminal incorrectly identifies the voice instruction, and the prompt information is used for requiring the user to send an instruction again.
8. A voice instruction verification method, applied to a second intelligent terminal, characterized by comprising:
after receiving a voice instruction issued by a user to instruct an operation of the second intelligent terminal, recognizing the voice instruction;
after receiving a voice response made by a first intelligent terminal to the voice instruction, detecting whether the voice instruction has been recognized correctly according to verification information that is carried in the voice response and intended only for device recognition.
9. The method for verifying the voice command according to claim 8, wherein after detecting whether the recognition of the voice command is correct, the method further comprises:
executing the voice instruction if the voice instruction is correctly identified;
and if the voice instruction is incorrectly recognized, feeding back verification error information to the first intelligent terminal, wherein the verification error information is used for indicating that the voice instruction is incorrectly recognized by the second intelligent terminal.
10. An intelligent terminal, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a method of verifying a voice instruction according to any one of claims 1 to 9.
CN201710909063.2A 2017-09-29 2017-09-29 The method of calibration and intelligent terminal of a kind of phonetic order Pending CN107705788A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710909063.2A CN107705788A (en) 2017-09-29 2017-09-29 The method of calibration and intelligent terminal of a kind of phonetic order

Publications (1)

Publication Number Publication Date
CN107705788A true CN107705788A (en) 2018-02-16

Family

ID=61175798

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710909063.2A Pending CN107705788A (en) 2017-09-29 2017-09-29 The method of calibration and intelligent terminal of a kind of phonetic order

Country Status (1)

Country Link
CN (1) CN107705788A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102820972A (en) * 2012-08-17 2012-12-12 刘欣 Authentication method based on sound wave
CN103152106A (en) * 2013-03-13 2013-06-12 荆效民 Ultrasonic information pushing method and system based on audio
CN103941686A (en) * 2014-04-14 2014-07-23 美的集团股份有限公司 Voice control method and system
CN104200806A (en) * 2014-09-05 2014-12-10 马金保 Intelligent household linkage control method and system
CN106572241A (en) * 2016-10-21 2017-04-19 北京巡声巡影科技服务有限公司 Method and device for displaying information
EP3160151A1 (en) * 2014-06-17 2017-04-26 LG Electronics Inc. Video display device and operation method therefor
CN106788779A (en) * 2016-11-09 2017-05-31 上海斐讯数据通信技术有限公司 Beacon frame information method, wireless aps and terminal in wireless aps
CN106782523A (en) * 2016-11-16 2017-05-31 上海卓易云汇智能技术有限公司 The control method and device of phonetic function

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109448734A (en) * 2018-09-20 2019-03-08 李庆湧 Unlocking terminal equipment and application starting method and device based on vocal print
CN109760055A (en) * 2019-01-30 2019-05-17 盈科视控(北京)科技有限公司 A kind of method and system that the figure for robot communicates
CN109877829A (en) * 2019-01-30 2019-06-14 盈科视控(北京)科技有限公司 A kind of figure communication dispatch method and system for robot
CN111312230A (en) * 2019-11-27 2020-06-19 南京创维信息技术研究院有限公司 Voice interaction monitoring method and device for voice dialogue platform
CN111312230B (en) * 2019-11-27 2023-10-20 南京创维信息技术研究院有限公司 Voice interaction monitoring method and device for voice conversation platform
CN112349279A (en) * 2020-06-16 2021-02-09 深圳Tcl新技术有限公司 Remote semantic recognition method, device, equipment and computer readable storage medium
CN112349279B (en) * 2020-06-16 2024-08-20 深圳Tcl新技术有限公司 Remote semantic recognition method, device, equipment and computer readable storage medium
CN112116803A (en) * 2020-09-04 2020-12-22 北京小米移动软件有限公司 Information processing method and device, electronic device and storage medium
CN112634899A (en) * 2021-01-31 2021-04-09 成都市玄上科技有限公司 Method for interactive control by using sound signal
CN113611305A (en) * 2021-09-06 2021-11-05 云知声(上海)智能科技有限公司 Voice control method, system, equipment and medium in autonomous learning home scene
CN114121002A (en) * 2021-11-15 2022-03-01 歌尔微电子股份有限公司 Electronic device, interactive module and control method and control device thereof
CN114743542A (en) * 2022-04-29 2022-07-12 青岛海尔科技有限公司 Voice processing method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180216