WO2020082710A1

WO2020082710A1 - Voice interaction control method, apparatus and system for bluetooth speaker

Info

Publication number: WO2020082710A1
Application number: PCT/CN2019/084835
Authority: WO
Inventors: 林敏洁; 吴海全; 姜德军; 张恩勤; 曹磊; 师瑞文
Original assignee: 深圳市冠旭电子股份有限公司
Priority date: 2018-10-22
Filing date: 2019-04-28
Publication date: 2020-04-30
Also published as: CN111081238A; CN111081238B

Abstract

A voice interaction control method for a Bluetooth speaker (11), comprising: establishing voice channel connection with a terminal device (12), and collecting voice information (S201); sending the voice information to the terminal device (12) by means of a voice channel so that the terminal device (12) uploads the voice information to a cloud (13), and maintaining the voice channel connection when the cloud (13) returns a voice analysis result to the terminal device (12) (S202); receiving the voice analysis result sent by the terminal device (12) by means of the voice channel, and playing back the voice analysis result (S203). The method improves a response speed of voice interaction for the Bluetooth speaker, and reduces the delay of the establishment of the voice channel after the voice information is collected completely. Also provided are a voice interaction control apparatus and system for the Bluetooth speaker.

Description

Voice interactive control method, device and system of bluetooth speaker

Technical field

The invention belongs to the technical field of voice interactive control, and particularly relates to a voice interactive control method, device and system of a Bluetooth speaker.

Background technique

When the Bluetooth speaker is ready to play audio data, it needs to establish a synchronous directional SCO connection or an advanced audio transmission A2DP connection with the terminal device. The terminal device transmits the audio data to the Bluetooth speaker through this link method. After decoding, amplification and other steps, the voice is played Out; for example, when the mobile phone is talking on the phone, the voice in the phone is transmitted through the synchronous SCO connection, and when the mobile phone is playing music, the music is played through the advanced audio transmission A2DP connection.

technical problem

At present, when the Bluetooth speaker interacts with the user, the terminal device establishes a connection with the Bluetooth speaker, receives voice data, and disconnects after the voice data is received. After the terminal device returns the voice analysis result in the cloud, it establishes a connection with the Bluetooth speaker again. The connection is established successfully, and the voice analysis result is transmitted to the Bluetooth speaker for voice playback; it takes a certain time for the terminal device to establish a connection with the Bluetooth speaker. After waiting for the voice analysis in the cloud, it must continue to wait for the terminal device to establish a connection with the Bluetooth speaker Time, resulting in delays in Bluetooth speaker playback during voice interaction, slow response speed.

Technical solution

In view of this, the embodiments of the present invention provide a voice interaction control method, device, and system to solve the problems of slow response speed and delay in voice playback when interacting with Bluetooth speakers in the prior art.

The first aspect of the embodiments of the present invention provides a voice interaction control method, which is applied to a Bluetooth speaker and includes:

Establish a voice channel connection with terminal equipment and collect voice information;

Sending the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud, and maintains the voice channel connection when the cloud returns the voice analysis result to the terminal device;

Receiving the voice analysis result sent by the terminal device through the voice channel, and playing the voice analysis result.

The second aspect of the embodiments of the present invention provides another voice interaction control method, including:

Establish a voice channel connection with a Bluetooth speaker, and receive voice information sent by the Bluetooth speaker through the voice channel;

Upload the voice information to the cloud, so that the cloud can parse the voice information, and maintain the voice channel connection with the Bluetooth speaker when obtaining the voice analysis result;

Sending the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.

A third aspect of the embodiments of the present invention provides a voice interaction control device, including:

Voice collection module, used to collect voice information;

The first channel establishment module is used to establish a voice channel connection with the terminal device and maintain the voice channel connection when the voice analysis result is returned to the terminal device;

A first voice sending module, configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis, and returns a voice analysis result;

The voice playback module is configured to receive the voice analysis result sent by the terminal device through the voice channel and play the voice analysis result.

A fourth aspect of the embodiments of the present invention provides a terminal device, including:

The second channel establishment module is used to establish a voice channel connection with a Bluetooth speaker;

A data receiving module, configured to receive voice information sent by a Bluetooth speaker through the voice channel;

The second data sending module is used to upload the voice information to the cloud, so that the cloud can parse the voice information to obtain the voice analysis result;

The third data sending module is configured to send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.

A fifth aspect of the embodiments of the present invention provides a voice interactive control system, including:

The Bluetooth speaker is used to establish a voice channel connection with the terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive the voice analysis result sent by the terminal device through the voice channel, and play the Voice analysis results;

The terminal device is used to establish a voice channel connection with the Bluetooth speaker and receive the voice information sent by the Bluetooth speaker through the voice channel; maintain the connection with the voice channel of the Bluetooth speaker and upload the voice information to the cloud to make the cloud Analyze the voice information to obtain a voice analysis result; send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result;

The cloud is used to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.

A fourth aspect of the embodiments of the present invention provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, implements the steps of the voice interaction control method.

Beneficial effect

Compared with the prior art, the beneficial effects of the embodiments of the present invention are: through the embodiment of the present invention, when the voice interaction is performed through the Bluetooth speaker, the connection between the Bluetooth speaker and the terminal device's voice channel is established, and the terminal device is uploaded to the cloud During the voice analysis and when the voice analysis result is returned to the terminal device, the voice channel connection is maintained. The Bluetooth speaker directly receives the voice analysis result sent by the terminal device through the voice channel and plays the voice analysis result to improve the response speed of the Bluetooth speaker voice interaction and reduce The delay after collecting voice information.

BRIEF DESCRIPTION

In order to more clearly explain the technical solutions in the embodiments of the present invention, the following will briefly introduce the drawings required in the embodiments or the description of the prior art. Obviously, the drawings in the following description are only for the invention. In some embodiments, for those of ordinary skill in the art, without paying creative labor, other drawings may be obtained based on these drawings.

1 is a schematic diagram of a system application scenario of voice interactive control provided by an embodiment of the present invention;

2 is a schematic diagram of an implementation process of a voice interaction control method provided by an embodiment of the present invention;

3 is a schematic diagram of an implementation process of another voice interaction control method provided by an embodiment of the present invention;

4 is a schematic diagram of an interaction process of a voice interaction control method provided by an embodiment of the present invention;

5 is a schematic diagram of a voice interaction control device provided by an embodiment of the present invention;

6 is a schematic diagram of a terminal device provided by an embodiment of the present invention.

Embodiments of the invention

In the following description, for the purpose of illustration rather than limitation, specific details such as specific system structures and technologies are proposed to thoroughly understand the embodiments of the present invention. However, those skilled in the art should understand that the present invention can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary details.

It should be understood that when used in this specification and the appended claims, the term "comprising" indicates the presence of described features, integers, steps, operations, elements, and / or components, but does not exclude one or more other features , Wholes, steps, operations, elements, components and / or their existence or addition.

It should also be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to limit the invention. As used in this specification and the appended claims, unless the context clearly indicates otherwise, the singular forms "a", "an", and "the" are intended to include the plural form.

It should also be further understood that the term "and / or" used in the description of the present invention and the appended claims refers to any and all possible combinations of one or more of the associated listed items and includes these combinations .

In order to explain the technical solution described in the present invention, the following will be described through specific embodiments.

FIG. 1 shows a schematic diagram of an application scenario of a system for voice interaction control provided by an embodiment of the present invention. For ease of explanation, only parts related to this embodiment are shown.

Referring to FIG. 1, the system collects voice information from the Bluetooth speaker 11 and sends the voice information to the terminal device 12 through the established voice channel. The terminal device 12 uploads the voice information to the cloud 13 and the cloud 13 performs voice analysis; Upload voice information to the cloud 13 and the cloud 13 during the voice analysis process, keep the voice channel connected, the terminal device 12 receives the voice analysis result returned by the cloud 13, and sends it to the Bluetooth speaker 11 through the voice channel to be played by the Bluetooth speaker 11 Voice analysis results.

The voice interaction control method in the system scenario shown in FIG. 1 is described in detail below:

FIG. 2 shows a schematic flowchart of an implementation process of a voice interaction control method provided by an embodiment of the present invention. In this embodiment, the execution subject of the process is the Bluetooth speaker 11 shown in FIG. 1, and the execution subject of the method implementation process may also be other Bluetooth devices that implement voice information network interaction, such as Bluetooth headsets, car Bluetooth devices, etc. The details are as follows:

Step S201: Establish a voice channel connection with the terminal device and collect voice information.

In the embodiment of the present invention, the terminal device may be any device that can realize Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, and a desktop computer; the Bluetooth speaker collects voice information through a microphone, and the Bluetooth speaker may have a built-in microphone array to perform Long-distance pickup; the Bluetooth speakers include but are not limited to: ordinary single-tube Bluetooth speakers, outdoor single-tube Bluetooth speakers, home-type dual-tube Bluetooth speakers, outdoor sports Bluetooth speakers or large multi-tube home Bluetooth speakers, all of which can be collected Voice information; when a Bluetooth speaker performs voice information transmission, a voice channel connection needs to be established with a terminal device, and the voice channel may be a synchronous directional SCO connection or an advanced audio transmission model A2DP connection.

Step S202: Send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud, and maintains the voice when the cloud returns the voice analysis result to the terminal device Channel connection.

In the embodiment of the present invention, the Bluetooth speaker sends the collected voice information to a terminal device through a voice channel. The terminal device may be any device that can achieve Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, and a desktop computer. ; The terminal device uploads the voice data to the cloud through the Internet or Internet of Things for voice analysis, including voice recognition and feature extraction, generates the corresponding voice analysis result, and maintains the Bluetooth speaker and the terminal device when the voice analysis result is returned to the terminal device Voice channel connection.

Further, after establishing a voice channel connection with the terminal device and collecting voice information, the method further includes:

A1. Send the voice information to the terminal device through the voice channel, and disconnect the voice channel connection after the terminal device uploads the voice information to the cloud;

A2. Before the cloud returns the voice analysis result to the terminal device, establish a voice channel connection with the terminal device again.

In the embodiment of the present invention, when the Bluetooth speaker ends collecting voice information and sends the voice information to the terminal device through the voice channel, and the terminal device uploads the voice information to the cloud, the Bluetooth speaker will disconnect the voice channel connection with the terminal device; Before waiting for the cloud to parse the voice information and return the voice analysis result, the Bluetooth speaker and the terminal device establish a voice channel connection again; the time required to wait for the cloud to return is in seconds, which can be 1 second, 2 seconds, etc., to establish a voice channel connection The time required is in the order of 100 milliseconds, and can be 0.3 seconds or 0.4 seconds, etc., so that the establishment of a voice channel connection can be completed before waiting for or returning the voice analysis result from the cloud.

Wherein, the voice channel connection may be a synchronous SCO connection.

The voice information is sent to the terminal device through the voice channel, and the voice channel connection established with the terminal device is maintained until the terminal device uploads the voice information to the cloud and the cloud returns the voice analysis result.

In the embodiment of the present invention, after the voice channel connection between the Bluetooth speaker and the terminal device is established, the established voice channel connection is no longer disconnected. When the terminal device uploads the voice information to the cloud, the cloud analyzes the voice information and returns from the cloud Keep the voice channel connected until the result of voice analysis.

Step S203: Receive the voice analysis result sent by the terminal device through the voice channel, and play the voice analysis result.

In the embodiment of the present invention, the voice channel may be an established SCO-oriented connection channel, or an advanced audio transmission model A2DP connection channel; the terminal device may be a mobile phone, a computer, and other networked devices that can support Bluetooth connection. ; The Bluetooth speaker can receive the voice analysis result sent by the terminal device through the voice channel, and it is played from the speaker of the Bluetooth speaker after decoding, amplification, etc. For example: the collected voice information is: "How is the weather today", and returned by the speaker : "Today the weather is fine, the temperature is 21 degrees, and the northerly wind is 2-3."

FIG. 3 shows a schematic diagram of an implementation process of another voice interaction control method provided by an embodiment of the present invention; in this embodiment, the execution subject of the process is the terminal device 12 shown in FIG. It can also be other networked devices that support Bluetooth connection, such as mobile phones, computers, tablets, etc., as detailed below:

Step S301: Establish a voice channel connection with a Bluetooth speaker, and receive voice information sent by the Bluetooth speaker through the voice channel.

In the embodiment of the present invention, when receiving a piece of data through a Bluetooth speaker or playing a piece of audio data through a Bluetooth speaker, a voice channel connection needs to be established with the Bluetooth speaker speaker, and the voice channel may be a synchronous directional SCO connection or an advanced Audio transmission model A2DP connection; the Bluetooth speaker can be a built-in microphone array for remote pickup; the Bluetooth speakers include but are not limited to: ordinary single-tube Bluetooth speaker, outdoor single-tube Bluetooth speaker, home-type dual-barrel Bluetooth Speakers, outdoor sports Bluetooth speakers or large multi-barrel home Bluetooth speakers.

In addition, the Bluetooth speaker may also be other devices that collect voice information, and may be any Bluetooth device that supports voice channel connection and implements voice information network interaction, such as a headset and a car Bluetooth.

Step S302: Upload the voice information to the cloud, so that the cloud parses the voice information, and maintains the voice channel connection with the Bluetooth speaker when obtaining the voice analysis result.

In the embodiment of the present invention, the terminal device may be any device that can achieve Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, and a desktop computer; the terminal device uploads voice information to the cloud or server, and the cloud or The server analyzes the voice, including voice recognition and feature extraction, and generates the corresponding voice analysis result; when the cloud returns the voice analysis result, the voice channel remains connected, and it is not necessary to connect the terminal device and the Bluetooth speaker again. To avoid the delay caused by establishing the connection again, the voice result is directly sent to the Bluetooth speaker through the voice channel.

Further, after establishing a voice channel connection with a Bluetooth speaker and receiving voice information sent by the Bluetooth speaker through the voice channel, the method further includes:

B1. Upload the voice information to the cloud, and disconnect the voice channel connection with the Bluetooth speaker after uploading to the cloud;

B2. Before analyzing the voice information in the cloud and returning the voice analysis result, establish a voice channel connection with the Bluetooth speaker again and receive the voice analysis result sent by the cloud.

In the embodiment of the present invention, when the terminal device receives the voice information and uploads the voice information to the cloud, the voice channel connection with the Bluetooth speaker will be disconnected; before waiting for the cloud to parse the voice information and return the voice analysis result, the terminal device Establish a voice channel connection with the Bluetooth speaker again; the time required to wait for the cloud to return is in the order of seconds, which can be 1 second, 2 seconds, etc. The time required to establish the voice channel connection is in the order of 100 milliseconds, which can be 0.3 seconds or 0.4 Seconds, etc., so that the establishment of the voice channel connection can be completed before waiting for the cloud to return the voice analysis result or during the process of returning the voice analysis result.

Wherein, the voice channel connection may be a synchronous SCO connection or an advanced audio transmission model A2DP connection.

Receive the voice information sent by the Bluetooth speaker through the voice channel, and maintain the voice channel connection established with the Bluetooth speaker before uploading the voice information to the cloud and returning the voice analysis result from the cloud.

In the embodiment of the present invention, after the terminal device and the Bluetooth speaker establish a voice channel connection, the established voice channel connection is no longer disconnected, and the terminal device uploads the voice information to the cloud, the cloud analyzes the voice information, and the cloud returns Keep the voice channel connected until the result of voice analysis.

In addition, receiving the voice information sent by the Bluetooth speaker can establish a synchronous directional SCO channel, and send the voice analysis results to the Bluetooth speaker, or through the synchronous directional SCO channel, because the two times are the voice transmitted through the established synchronous SCO connection Data, there is no need to perform conversion and establishment of other types of voice channels, and the SCO connection has been kept synchronously oriented in the middle, which has no other impact on the terminal device or the Bluetooth speaker.

Step S303: Send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.

Through the embodiment of the present invention, when the voice interaction of the Bluetooth speaker is established, the connection between the Bluetooth speaker and the terminal device's voice channel is established, and when the terminal device is uploaded to the cloud for voice analysis, and the voice analysis result is returned to the terminal device, the voice is maintained Channel connection, the Bluetooth speaker directly receives the voice analysis result sent by the terminal device through the voice channel, and plays the voice analysis result, improves the response speed of the Bluetooth speaker voice interaction, and reduces the delay after collecting the voice information; through the embodiment of the present invention After collecting the voice information, the time to reply the voice analysis result is about 20% faster than the traditional solution, which significantly improves the user's perception; in addition, it further fully optimizes the synchronous directional SCO connection to avoid unnecessary establishment The process of orientating the SCO connection synchronously reduces the time required for voice interaction and improves the user experience.

FIG. 4 shows a schematic diagram of an interaction process of a voice interaction control method provided by an embodiment of the present invention. For ease of explanation, only parts related to the embodiment of the present invention are shown; execution subjects participating in the interaction process include a Bluetooth speaker and a terminal device In the cloud, the implementation principle of the interaction process is consistent with the implementation principle of each execution subject side described in FIGS. 2 and 3. Therefore, the interaction process is only briefly described, and not repeated:

1. Establish a voice channel with a Bluetooth speaker;

2. Voice information is collected by Bluetooth speakers;

3. Send the voice information to the terminal device through the voice channel;

4. Upload voice information to the cloud;

5. The cloud performs voice analysis on the voice information to obtain the voice analysis result;

6. Return the voice analysis result to the terminal device;

7. The terminal device sends the voice analysis result to the Bluetooth speaker through the voice channel;

8. The voice analysis result is played by the Bluetooth speaker.

It should be noted that, those skilled in the art, within the technical scope disclosed by the present invention, other sorting schemes that can be easily thought of should also fall within the protection scope of the present invention, which will not be repeated here.

It should be understood that the size of the sequence numbers of the steps in the above embodiments does not mean the order of execution, and the execution order of each process should be determined by its function and inherent logic, and should not constitute any limitation on the implementation process of the embodiments of the present invention.

FIG. 5 shows a schematic diagram of a voice interaction control device provided by an embodiment of the present invention. For ease of description, only parts related to the embodiment of the present invention are shown.

The voice interactive control device includes:

Voice collection module 51, used to collect voice information;

The first channel establishment module 52 is used to establish a voice channel connection with the terminal device and maintain the voice channel connection when the voice analysis result is returned to the terminal device;

The first voice sending module 53 is configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis, and returns a voice analysis result;

The voice playback module 54 is configured to receive the voice analysis result sent by the terminal device through the voice channel and play the voice analysis result.

Further, an embodiment of the present invention also provides a terminal device, including:

The second channel establishment module 61 is used to establish a voice channel connection with a Bluetooth speaker;

The data receiving module 62 is used to receive the voice information sent by the Bluetooth speaker through the voice channel;

The second data sending module 63 is used to upload the voice information to the cloud, so that the cloud can parse the voice information to obtain a voice analysis result;

The third data sending module 64 is configured to send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.

Further, an embodiment of the present invention also provides a voice interaction system, including:

Those skilled in the art can clearly understand that, for convenience and conciseness of description, only the above-mentioned division of each functional module is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated by different functional units and modules as needed That is, the internal structure of the mobile terminal is divided into different functional units or modules to complete all or part of the functions described above. The functional modules in the embodiments may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above integrated units may be implemented in the form of hardware , Can also be implemented in the form of software functional units. In addition, the specific names of the functional modules are only for the purpose of distinguishing each other, and are not used to limit the protection scope of the present application. For the specific working process of the module in the above mobile terminal, reference may be made to the corresponding process in the foregoing method embodiments, which will not be repeated here.

Those skilled in the art can clearly understand that, for convenience and conciseness of description, only the above-mentioned division of each functional unit and module is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated by different functional units, Module completion means that the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiment may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above integrated unit may use hardware It can also be implemented in the form of software functional units. In addition, the specific names of each functional unit and module are only for the purpose of distinguishing each other, and are not intended to limit the protection scope of the present invention. For the specific working processes of the units and modules in the above system, reference may be made to the corresponding processes in the foregoing method embodiments, which will not be repeated here.

In the above embodiments, the description of each embodiment has its own emphasis. For a part that is not detailed or recorded in an embodiment, you can refer to the related descriptions of other embodiments.

Those of ordinary skill in the art may realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed in hardware or software depends on the specific application of the technical solution and design constraints. Professional technicians can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of the present invention.

In the embodiments provided by the present invention, it should be understood that the disclosed device / terminal device and method may be implemented in other ways. For example, the device / terminal device embodiments described above are only schematic. For example, the division of the module or unit is only a logical function division. In actual implementation, there may be other division modes, such as multiple units Or components can be combined or integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or software function unit.

If the integrated module / unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the present invention can implement all or part of the processes in the methods of the above embodiments, and can also be completed by a computer program instructing relevant hardware. The computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, the steps of the foregoing method embodiments may be implemented. Wherein, the computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a mobile hard disk, a magnetic disk, an optical disc, a computer memory, a read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electrical carrier signals, telecommunication signals, and software distribution media. It should be noted that the content contained in the computer-readable medium can be appropriately increased or decreased according to the requirements of legislation and patent practice in jurisdictions. For example, in some jurisdictions, according to legislation and patent practice, computer-readable media Excluded are electrical carrier signals and telecommunications signals.

The above-mentioned embodiments are only used to illustrate the technical solutions of the present invention, not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still implement the foregoing The technical solutions described in the examples are modified, or some of the technical features are equivalently replaced; and these modifications or replacements do not deviate from the spirit and scope of the technical solutions of the embodiments of the present invention, and should be included in Within the protection scope of the present invention.

Claims

A voice interactive control method, applied to Bluetooth speakers, characterized by including:

Establish a voice channel connection with terminal equipment and collect voice information;

Sending the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud, and maintains the voice channel connection when the cloud returns the voice analysis result to the terminal device;

Receiving the voice analysis result sent by the terminal device through the voice channel, and playing the voice analysis result.
The voice interactive control method according to claim 1, wherein after establishing a voice channel connection with the terminal device and collecting voice information, the method further comprises:

Sending the voice information to a terminal device through the voice channel, and disconnecting the voice channel connection after the terminal device uploads the voice information to the cloud;

Before the cloud returns the voice analysis result to the terminal device, establish a voice channel connection with the terminal device again.
The voice interactive control method according to claim 1, wherein after establishing a voice channel connection with the terminal device and collecting voice information, the method further comprises:

Sending the voice information to a terminal device through the voice channel, and maintaining the voice channel connection established with the terminal device before the terminal device uploads the voice information to the cloud and returns the voice analysis result from the cloud .
A voice interactive control method, characterized in that it includes:

Establish a voice channel connection with a Bluetooth speaker, and receive voice information sent by the Bluetooth speaker through the voice channel;

Upload the voice information to the cloud, so that the cloud can parse the voice information, and maintain the voice channel connection with the Bluetooth speaker when obtaining the voice analysis result;

Sending the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
The voice interactive control method according to claim 4, wherein after establishing a voice channel connection with the Bluetooth speaker and receiving voice information sent by the Bluetooth speaker through the voice channel, the method further comprises:

Upload the voice information to the cloud, and disconnect the voice channel connection with the Bluetooth speaker after uploading to the cloud;

Before the cloud parses the voice information and returns the voice analysis result, establish a voice channel connection with the Bluetooth speaker again and receive the voice analysis result sent by the cloud.
The voice interactive control method according to claim 4, wherein after establishing a voice channel connection with the Bluetooth speaker and receiving voice information sent by the Bluetooth speaker through the voice channel, the method further comprises:

Receive the voice information sent by the Bluetooth speaker through the voice channel, and maintain the voice channel connection established with the Bluetooth speaker before uploading the voice information to the cloud and returning the voice analysis result from the cloud.
A voice interactive control device, characterized in that it includes:

Voice collection module, used to collect voice information;

The first channel establishment module is used to establish a voice channel connection with the terminal device and maintain the voice channel connection when the voice analysis result is returned to the terminal device;

A first voice sending module, configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis, and returns a voice analysis result;

The voice playback module is configured to receive the voice analysis result sent by the terminal device through the voice channel and play the voice analysis result.
A terminal device is characterized by comprising:

The second channel establishment module is used to establish a voice channel connection with a Bluetooth speaker;

A data receiving module, configured to receive voice information sent by a Bluetooth speaker through the voice channel;

The second data sending module is used to upload the voice information to the cloud, so that the cloud can parse the voice information to obtain the voice analysis result;

The third data sending module is configured to send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
A voice interactive control system, characterized by including:

The Bluetooth speaker is used to establish a voice channel connection with the terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive the voice analysis result sent by the terminal device through the voice channel, and play the Voice analysis results;

The terminal device is used to establish a voice channel connection with the Bluetooth speaker and receive the voice information sent by the Bluetooth speaker through the voice channel; maintain the connection with the voice channel of the Bluetooth speaker and upload the voice information to the cloud to make the cloud Analyze the voice information to obtain a voice analysis result; send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result;

The cloud is used to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.
A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 6 are implemented.