WO2020082710A1 - Voice interaction control method, apparatus and system for bluetooth speaker - Google Patents
- Publication number
- WO2020082710A1 (PCT/CN2019/084835)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- terminal device
- analysis result
- bluetooth speaker
- cloud
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/06—Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/80—Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W76/00—Connection management
- H04W76/10—Connection setup
- H04W76/14—Direct-mode setup
Definitions
- The invention belongs to the technical field of voice interaction control, and in particular relates to a voice interaction control method, device and system for a Bluetooth speaker.
- When a Bluetooth speaker is ready to play audio data, it needs to establish a Synchronous Connection-Oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection with the terminal device.
- The terminal device transmits the audio data to the Bluetooth speaker over this link.
- After decoding, amplification and other steps, the audio is played. For example, during a phone call on a mobile phone the call audio is carried over the SCO link, and when the mobile phone plays music the music is carried over the A2DP connection.
- At present, when a Bluetooth speaker performs voice interaction with a user, the terminal device establishes a connection with the Bluetooth speaker, receives the voice data, and disconnects once the voice data has been received. After the cloud returns the voice analysis result to the terminal device, the terminal device establishes a connection with the Bluetooth speaker again and, once the connection is established, transmits the voice analysis result to the Bluetooth speaker for playback. Establishing a connection between the terminal device and the Bluetooth speaker takes time, so after waiting for the cloud to finish analyzing the voice, the user must also wait for the connection to be re-established. This delays playback on the Bluetooth speaker during voice interaction and makes the response slow.
- The embodiments of the present invention provide a voice interaction control method, device, and system to solve the prior-art problems of slow response and delayed voice playback when interacting with a Bluetooth speaker.
- the first aspect of the embodiments of the present invention provides a voice interaction control method, which is applied to a Bluetooth speaker and includes:
- the second aspect of the embodiments of the present invention provides another voice interaction control method, including:
- a third aspect of the embodiments of the present invention provides a voice interaction control device, including:
- A voice collection module, used to collect voice information;
- the first channel establishment module is used to establish a voice channel connection with the terminal device and maintain the voice channel connection when the voice analysis result is returned to the terminal device;
- a first voice sending module configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis, and returns a voice analysis result;
- the voice playback module is configured to receive the voice analysis result sent by the terminal device through the voice channel and play the voice analysis result.
- a fourth aspect of the embodiments of the present invention provides a terminal device, including:
- the second channel establishment module is used to establish a voice channel connection with a Bluetooth speaker
- a data receiving module configured to receive voice information sent by a Bluetooth speaker through the voice channel
- the second data sending module is used to upload the voice information to the cloud, so that the cloud can parse the voice information to obtain the voice analysis result;
- the third data sending module is configured to send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
- a fifth aspect of the embodiments of the present invention provides a voice interactive control system, including:
- The Bluetooth speaker is used to establish a voice channel connection with the terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive the voice analysis result sent by the terminal device through the voice channel, and play the voice analysis result;
- The terminal device is used to establish a voice channel connection with the Bluetooth speaker and receive the voice information sent by the Bluetooth speaker through the voice channel; maintain the voice channel connection with the Bluetooth speaker and upload the voice information to the cloud, so that the cloud analyzes the voice information to obtain a voice analysis result; and send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result;
- the cloud is used to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.
- A further aspect of the embodiments of the present invention provides a computer-readable storage medium that stores a computer program which, when executed by a processor, implements the steps of the voice interaction control method.
- The beneficial effects of the embodiments of the present invention are as follows: when voice interaction is performed through the Bluetooth speaker, a voice channel connection is established between the Bluetooth speaker and the terminal device, and the voice channel connection is maintained while the terminal device uploads the voice information to the cloud for voice analysis and while the voice analysis result is returned to the terminal device.
- The Bluetooth speaker directly receives the voice analysis result sent by the terminal device through the voice channel and plays it, which improves the response speed of Bluetooth speaker voice interaction and reduces the delay after the voice information is collected.
- FIG. 1 is a schematic diagram of a system application scenario of voice interactive control provided by an embodiment of the present invention
- FIG. 2 is a schematic diagram of an implementation process of a voice interaction control method provided by an embodiment of the present invention
- FIG. 3 is a schematic diagram of an implementation process of another voice interaction control method provided by an embodiment of the present invention.
- FIG. 4 is a schematic diagram of an interaction process of a voice interaction control method provided by an embodiment of the present invention.
- FIG. 5 is a schematic diagram of a voice interaction control device provided by an embodiment of the present invention.
- FIG. 6 is a schematic diagram of a terminal device provided by an embodiment of the present invention.
- FIG. 1 shows a schematic diagram of an application scenario of a system for voice interaction control provided by an embodiment of the present invention. For ease of explanation, only parts related to this embodiment are shown.
- The system collects voice information at the Bluetooth speaker 11 and sends the voice information to the terminal device 12 through the established voice channel.
- The terminal device 12 uploads the voice information to the cloud 13, and the cloud 13 performs voice analysis. The voice channel is kept connected while the voice information is uploaded to the cloud 13 and while the cloud 13 analyzes it. The terminal device 12 receives the voice analysis result returned by the cloud 13 and sends it through the voice channel to the Bluetooth speaker 11, which plays the voice analysis result.
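The flow between the Bluetooth speaker 11, the terminal device 12 and the cloud 13 can be illustrated with a minimal sketch. Everything named here (`VoiceChannel`, `cloud_analyze`, `interact`) is hypothetical and only models the idea; it does not use a real Bluetooth stack, and the cloud is replaced by a placeholder function. The point it shows is that one channel setup serves both directions of the exchange.

```python
from dataclasses import dataclass


@dataclass
class VoiceChannel:
    """Hypothetical stand-in for an SCO or A2DP link; counts how often it is set up."""
    setups: int = 0
    connected: bool = False

    def connect(self) -> None:
        # Setting up an already connected channel is a no-op.
        if not self.connected:
            self.setups += 1
            self.connected = True

    def send(self, payload: str) -> str:
        if not self.connected:
            raise RuntimeError("voice channel is not connected")
        return payload


def cloud_analyze(voice: str) -> str:
    """Placeholder for the cloud 13; a real service would run speech recognition."""
    return f"analysis of: {voice}"


def interact(channel: VoiceChannel, voice: str) -> str:
    channel.connect()                    # speaker 11 and terminal 12 set up the channel
    at_terminal = channel.send(voice)    # speaker -> terminal over the channel
    result = cloud_analyze(at_terminal)  # terminal -> cloud -> terminal, channel stays up
    return channel.send(result)          # terminal -> speaker over the same channel


channel = VoiceChannel()
played = interact(channel, "How is the weather today")
print(played, "| channel setups:", channel.setups)
```

Because the channel is never torn down inside `interact`, the setup counter stays at one for the whole interaction.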
- FIG. 2 shows a schematic flowchart of an implementation process of a voice interaction control method provided by an embodiment of the present invention.
- the execution subject of the process is the Bluetooth speaker 11 shown in FIG. 1, and the execution subject of the method implementation process may also be other Bluetooth devices that implement voice information network interaction, such as Bluetooth headsets, car Bluetooth devices, etc.
- the details are as follows:
- Step S201 Establish a voice channel connection with the terminal device and collect voice information.
- the terminal device may be any device that can realize Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, and a desktop computer;
- The Bluetooth speaker collects voice information through a microphone, and may have a built-in microphone array for long-distance pickup;
- The Bluetooth speakers include but are not limited to: ordinary single-tube Bluetooth speakers, outdoor single-tube Bluetooth speakers, home dual-tube Bluetooth speakers, outdoor sports Bluetooth speakers and large multi-tube home Bluetooth speakers, all of which can collect voice information. When a Bluetooth speaker transmits voice information, a voice channel connection needs to be established with a terminal device, and the voice channel may be a Synchronous Connection-Oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection.
- Step S202 Send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud, and maintain the voice channel connection while the cloud returns the voice analysis result to the terminal device.
- The Bluetooth speaker sends the collected voice information to the terminal device through the voice channel.
- The terminal device may be any device that supports a Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, or a desktop computer;
- The terminal device uploads the voice data to the cloud over the Internet or the Internet of Things for voice analysis, including voice recognition and feature extraction, which generates the corresponding voice analysis result. The voice channel connection between the Bluetooth speaker and the terminal device is maintained while the voice analysis result is returned to the terminal device.
- the method further includes:
- A1 Send the voice information to the terminal device through the voice channel, and disconnect the voice channel connection after the terminal device uploads the voice information to the cloud;
- When the Bluetooth speaker has finished collecting voice information and has sent it to the terminal device through the voice channel, and the terminal device has uploaded the voice information to the cloud, the Bluetooth speaker disconnects the voice channel connection with the terminal device. While waiting for the cloud to parse the voice information, and before the cloud returns the voice analysis result, the Bluetooth speaker and the terminal device establish a voice channel connection again. Waiting for the cloud to return takes on the order of seconds (for example 1 second or 2 seconds), whereas establishing a voice channel connection takes on the order of 100 milliseconds (for example 0.3 seconds or 0.4 seconds), so the voice channel connection can be re-established while waiting for the cloud or while the voice analysis result is being returned.
- The voice channel connection may be an SCO link.
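The timing argument can be made concrete with a small worked sketch. It compares three strategies: reconnecting only after the cloud result arrives (the prior art), reconnecting while the cloud is still working (variant A1), and never disconnecting. The function name `reply_delay` and the strategy labels are hypothetical, and the model is deliberately simplified: it ignores upload and playback time and assumes that in A1 the reconnect starts as soon as the upload ends. The sample durations are the example values given in the text.

```python
def reply_delay(cloud_s: float, setup_s: float, strategy: str) -> float:
    """Seconds from end of upload until the result can start flowing to the speaker."""
    if strategy == "reconnect_after_result":  # prior art: wait for the cloud, then reconnect
        return cloud_s + setup_s
    if strategy == "reconnect_during_wait":   # variant A1: reconnect overlaps the cloud wait
        return max(cloud_s, setup_s)
    if strategy == "keep_connected":          # channel is never dropped
        return cloud_s
    raise ValueError(f"unknown strategy: {strategy}")


for cloud_s, setup_s in [(1.0, 0.3), (2.0, 0.4)]:
    for strategy in ("reconnect_after_result", "reconnect_during_wait", "keep_connected"):
        delay = reply_delay(cloud_s, setup_s, strategy)
        print(f"cloud={cloud_s}s setup={setup_s}s {strategy}: {delay:.1f}s")
```

Under these assumptions the setup time only adds to the delay when it is paid after the cloud has answered. As long as the cloud wait is longer than the setup time, overlapping the reconnect hides it completely.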
- the method further includes:
- The voice information is sent to the terminal device through the voice channel, and the voice channel connection established with the terminal device is maintained until the terminal device has uploaded the voice information to the cloud and the cloud has returned the voice analysis result.
- That is, once established, the voice channel connection is not disconnected.
- It stays connected while the terminal device uploads the voice information, while the cloud analyzes it, and until the cloud returns the voice analysis result.
- Step S203 Receive the voice analysis result sent by the terminal device through the voice channel, and play the voice analysis result.
- The voice channel may be an established SCO link or an A2DP connection;
- The terminal device may be a mobile phone, a computer, or another networked device that supports a Bluetooth connection.
- The Bluetooth speaker receives the voice analysis result sent by the terminal device through the voice channel and, after decoding, amplification and similar steps, plays it through its loudspeaker.
- For example, if the collected voice information is "How is the weather today", the speaker plays back: "Today the weather is fine, the temperature is 21 degrees, and the northerly wind is 2-3."
- FIG. 3 shows a schematic diagram of an implementation process of another voice interaction control method provided by an embodiment of the present invention
- the execution subject of the process is the terminal device 12 shown in FIG.
- It can also be other networked devices that support Bluetooth connection, such as mobile phones, computers, tablets, etc., as detailed below:
- Step S301 Establish a voice channel connection with a Bluetooth speaker, and receive voice information sent by the Bluetooth speaker through the voice channel.
- When a piece of data is received through a Bluetooth speaker, or a piece of audio data is played through a Bluetooth speaker, a voice channel connection needs to be established with the Bluetooth speaker, and the voice channel may be an SCO link or an A2DP connection;
- The Bluetooth speaker may have a built-in microphone array for long-distance pickup;
- The Bluetooth speakers include but are not limited to: ordinary single-tube Bluetooth speakers, outdoor single-tube Bluetooth speakers, home dual-tube Bluetooth speakers, outdoor sports Bluetooth speakers and large multi-tube home Bluetooth speakers.
- The Bluetooth speaker may also be replaced by another device that collects voice information, namely any Bluetooth device that supports a voice channel connection and implements network interaction of voice information, such as a headset or an in-car Bluetooth device.
- Step S302 Upload the voice information to the cloud, so that the cloud parses the voice information, and maintain the voice channel connection with the Bluetooth speaker while the voice analysis result is obtained.
- The terminal device may be any device that supports a Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, or a desktop computer. The terminal device uploads the voice information to the cloud or a server, and the cloud or server analyzes the voice, including voice recognition and feature extraction, and generates the corresponding voice analysis result. When the cloud returns the voice analysis result, the voice channel is still connected, so the terminal device and the Bluetooth speaker do not need to connect again. This avoids the delay caused by re-establishing the connection, and the voice analysis result is sent directly to the Bluetooth speaker through the voice channel.
- the method further includes:
- After the terminal device receives the voice information and uploads it to the cloud, the voice channel connection with the Bluetooth speaker is disconnected. While waiting for the cloud to parse the voice information, and before the cloud returns the voice analysis result, the terminal device establishes a voice channel connection with the Bluetooth speaker again. Waiting for the cloud to return takes on the order of seconds, for example 1 second or 2 seconds.
- Establishing the voice channel connection takes on the order of 100 milliseconds, for example 0.3 seconds or 0.4 seconds, so the voice channel connection can be re-established while waiting for the cloud to return the voice analysis result or while the result is being returned.
- The voice channel connection may be an SCO link or an A2DP connection.
- the method further includes:
- Once established, the voice channel connection is not disconnected: it is kept connected while the terminal device uploads the voice information to the cloud, while the cloud analyzes the voice information, and until the cloud returns the voice analysis result.
- An SCO channel can be established to receive the voice information sent by the Bluetooth speaker, and the voice analysis result can be sent to the Bluetooth speaker over the same SCO channel. Because both transfers carry voice data over the established SCO link, no other type of voice channel needs to be converted to or established, and keeping the SCO link up in between has no other impact on the terminal device or the Bluetooth speaker.
- Step S303 Send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
- The voice channel may be an established SCO link or an A2DP connection;
- The terminal device may be a mobile phone, a computer, or another networked device that supports a Bluetooth connection.
- The Bluetooth speaker receives the voice analysis result sent by the terminal device through the voice channel and, after decoding, amplification and similar steps, plays it through its loudspeaker.
- For example, if the collected voice information is "How is the weather today", the speaker plays back: "Today the weather is fine, the temperature is 21 degrees, and the northerly wind is 2-3."
- The Bluetooth speaker directly receives the voice analysis result sent by the terminal device through the voice channel and plays it, which improves the response speed of Bluetooth speaker voice interaction and reduces the delay after the voice information is collected;
- The voice analysis result is returned about 20% faster than with the traditional solution, which noticeably improves the user's perception. In addition, use of the SCO link is further optimized: unnecessary SCO connection setup is avoided,
- which reduces the time required for voice interaction and improves the user experience.
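The terminal-side steps S301 to S303 can be sketched as a single asynchronous session in which the channel is held open across the wait for the cloud. This is only an illustration under stated assumptions: `voice_channel`, `cloud_parse` and `terminal_session` are hypothetical names, the channel is a logging stub instead of a real SCO or A2DP link, and the cloud round trip is simulated with a short sleep.

```python
import asyncio
from contextlib import asynccontextmanager

events: list[str] = []


@asynccontextmanager
async def voice_channel():
    """Hypothetical channel to the speaker; only logs setup and teardown."""
    events.append("channel_up")
    try:
        yield
    finally:
        events.append("channel_down")


async def cloud_parse(voice: str) -> str:
    """Placeholder for the cloud round trip of step S302."""
    await asyncio.sleep(0.01)
    return f"result for: {voice}"


async def terminal_session(voice_from_speaker: str) -> str:
    async with voice_channel():                         # S301: channel established
        events.append("received_voice")
        result = await cloud_parse(voice_from_speaker)  # S302: channel held while waiting
        events.append("sent_result")                    # S303: same channel reused
        return result


result = asyncio.run(terminal_session("How is the weather today"))
print(events)
```

Scoping the channel with a context manager makes the "maintain the connection" requirement structural: the teardown can only happen after the result has been handed back to the speaker.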
- FIG. 4 shows a schematic diagram of an interaction process of a voice interaction control method provided by an embodiment of the present invention.
- execution subjects participating in the interaction process include a Bluetooth speaker and a terminal device
- the implementation principle of the interaction process is consistent with the implementation principle of each execution subject side described in FIGS. 2 and 3. Therefore, the interaction process is only briefly described, and not repeated:
- the cloud performs voice analysis on the voice information to obtain the voice analysis result
- the terminal device sends the voice analysis result to the Bluetooth speaker through the voice channel
- the voice analysis result is played by the Bluetooth speaker.
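The interaction sequence can also be written down as an event trace and checked mechanically. The sketch below is an assumption-laden illustration: the event names are invented for this example, the full trace is assembled from the steps described for FIGS. 2 and 3, and `channel_held` is a hypothetical helper that tests whether the channel stays up from the first connect until playback.

```python
def channel_held(trace: list[str]) -> bool:
    """True if no disconnect occurs between the first connect and the playback."""
    if "connect" not in trace or "play" not in trace:
        return False
    start, end = trace.index("connect"), trace.index("play")
    return start < end and "disconnect" not in trace[start:end]


KEEP_ALIVE_TRACE = [
    "connect",        # speaker and terminal establish the voice channel
    "collect",        # speaker collects voice information
    "send_voice",     # speaker -> terminal over the voice channel
    "upload",         # terminal -> cloud
    "analyze",        # cloud produces the voice analysis result
    "return_result",  # cloud -> terminal
    "send_result",    # terminal -> speaker over the same channel
    "play",           # speaker plays the result
]

PRIOR_ART_TRACE = [
    "connect", "collect", "send_voice", "disconnect",
    "upload", "analyze", "return_result",
    "connect", "send_result", "play",
]

print(channel_held(KEEP_ALIVE_TRACE), channel_held(PRIOR_ART_TRACE))
```

A check of this kind could be run against logs from either device to confirm that an implementation does not drop the channel in the middle of an interaction.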
- FIG. 5 shows a schematic diagram of a voice interaction control device provided by an embodiment of the present invention. For ease of description, only parts related to the embodiment of the present invention are shown.
- the voice interactive control device includes:
- The voice collection module 51 is used to collect voice information;
- the first channel establishment module 52 is used to establish a voice channel connection with the terminal device and maintain the voice channel connection when the voice analysis result is returned to the terminal device;
- the first voice sending module 53 is configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis, and returns a voice analysis result;
- the voice playback module 54 is configured to receive the voice analysis result sent by the terminal device through the voice channel and play the voice analysis result.
- an embodiment of the present invention also provides a terminal device, including:
- the second channel establishment module 61 is used to establish a voice channel connection with a Bluetooth speaker
- the data receiving module 62 is used to receive the voice information sent by the Bluetooth speaker through the voice channel;
- the second data sending module 63 is used to upload the voice information to the cloud, so that the cloud can parse the voice information to obtain a voice analysis result;
- the third data sending module 64 is configured to send the voice analysis result to a Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
- an embodiment of the present invention also provides a voice interaction system, including:
- The Bluetooth speaker is used to establish a voice channel connection with the terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive the voice analysis result sent by the terminal device through the voice channel, and play the voice analysis result;
- The terminal device is used to establish a voice channel connection with the Bluetooth speaker and receive the voice information sent by the Bluetooth speaker through the voice channel; maintain the voice channel connection with the Bluetooth speaker and upload the voice information to the cloud, so that the cloud analyzes the voice information to obtain a voice analysis result; and send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result;
- the cloud is used to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.
- The division into the functional modules above is used only as an example for illustration.
- The above functions can be allocated to different functional units and modules as needed; that is, the internal structure of the mobile terminal is divided into different functional units or modules to complete all or part of the functions described above.
- The functional modules in the embodiments may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- The above integrated units may be implemented in the form of hardware, or in the form of software functional units.
- The specific names of the functional modules are only for distinguishing them from each other and are not used to limit the protection scope of the present application.
- For the specific working process of the modules in the above mobile terminal, reference may be made to the corresponding process in the foregoing method embodiments, which is not repeated here.
- The division into the functional units and modules above is used only as an example for illustration.
- The above functions can be allocated to different functional units and modules as needed; that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above.
- The functional units and modules in the embodiment may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- The above integrated unit may be implemented in the form of hardware, or in the form of software functional units.
- The specific names of the functional units and modules are only for distinguishing them from each other and are not intended to limit the protection scope of the present invention.
- the disclosed device / terminal device and method may be implemented in other ways.
- the device / terminal device embodiments described above are only schematic.
- the division of the module or unit is only a logical function division.
- the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
- each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- the above integrated unit can be implemented in the form of hardware or software function unit.
- If the integrated module / unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium.
- All or part of the processes in the methods of the above embodiments of the present invention may also be completed by a computer program instructing the relevant hardware.
- the computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, the steps of the foregoing method embodiments may be implemented.
- the computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form, etc.
- the computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a mobile hard disk, a magnetic disk, an optical disc, a computer memory, a read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electrical carrier signals, telecommunication signals, and software distribution media.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
Abstract
A voice interaction control method for a Bluetooth speaker (11), comprising: establishing voice channel connection with a terminal device (12), and collecting voice information (S201); sending the voice information to the terminal device (12) by means of a voice channel so that the terminal device (12) uploads the voice information to a cloud (13), and maintaining the voice channel connection when the cloud (13) returns a voice analysis result to the terminal device (12) (S202); receiving the voice analysis result sent by the terminal device (12) by means of the voice channel, and playing back the voice analysis result (S203). The method improves a response speed of voice interaction for the Bluetooth speaker, and reduces the delay of the establishment of the voice channel after the voice information is collected completely. Also provided are a voice interaction control apparatus and system for the Bluetooth speaker.
Description
The present invention belongs to the technical field of voice interaction control, and in particular relates to a voice interaction control method, apparatus, and system for a Bluetooth speaker.
Before a Bluetooth speaker can play audio data, it must establish either a synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection with a terminal device. The terminal device transmits the audio data to the Bluetooth speaker over that link, and the speaker plays the audio after steps such as decoding and amplification. For example, during a phone call on a mobile phone the call audio is carried over an SCO link, whereas music playback is carried over an A2DP connection.
At present, when a Bluetooth speaker carries out voice interaction with a user, the terminal device establishes a connection with the Bluetooth speaker, receives the voice data, and disconnects once the voice data has been received. After the terminal device obtains the voice analysis result from the cloud, it establishes a connection with the Bluetooth speaker again and, once the connection succeeds, transmits the voice analysis result to the Bluetooth speaker for playback. Establishing a connection between the terminal device and the Bluetooth speaker takes a certain amount of time, so after waiting for the cloud to finish analyzing the voice, the user must also wait for the connection to be established. This delays playback on the Bluetooth speaker during voice interaction and makes the response slow.
In view of this, embodiments of the present invention provide a voice interaction control method, apparatus, and system to solve the prior-art problems of slow response and delayed voice playback during voice interaction with a Bluetooth speaker.
A first aspect of the embodiments of the present invention provides a voice interaction control method, applied to a Bluetooth speaker, including:
establishing a voice channel connection with a terminal device, and collecting voice information;
sending the voice information to the terminal device over the voice channel, so that the terminal device uploads the voice information to the cloud, and maintaining the voice channel connection when the cloud returns a voice analysis result to the terminal device;
receiving, over the voice channel, the voice analysis result sent by the terminal device, and playing the voice analysis result.
A second aspect of the embodiments of the present invention provides another voice interaction control method, including:
establishing a voice channel connection with a Bluetooth speaker, and receiving, over the voice channel, voice information sent by the Bluetooth speaker;
uploading the voice information to the cloud, so that the cloud parses the voice information, and maintaining the voice channel connection with the Bluetooth speaker when the voice analysis result is obtained;
sending the voice analysis result to the Bluetooth speaker over the voice channel, so that the Bluetooth speaker plays the voice analysis result.
A third aspect of the embodiments of the present invention provides a voice interaction control apparatus, including:
a voice collection module, configured to collect voice information;
a first channel establishment module, configured to establish a voice channel connection with a terminal device and to maintain the voice channel connection when the voice analysis result is returned to the terminal device;
a first voice sending module, configured to send the voice information to the terminal device over the voice channel, so that the terminal device uploads the voice information to the cloud for parsing and the voice analysis result is returned;
a voice playback module, configured to receive, over the voice channel, the voice analysis result sent by the terminal device and to play the voice analysis result.
A fourth aspect of the embodiments of the present invention provides a terminal device, including:
a second channel establishment module, configured to establish a voice channel connection with a Bluetooth speaker;
a data receiving module, configured to receive, over the voice channel, voice information sent by the Bluetooth speaker;
a second data sending module, configured to upload the voice information to the cloud, so that the cloud parses the voice information and obtains a voice analysis result;
a third data sending module, configured to send the voice analysis result to the Bluetooth speaker over the voice channel, so that the Bluetooth speaker plays the voice analysis result.
A fifth aspect of the embodiments of the present invention provides a voice interaction control system, including:
a Bluetooth speaker, configured to establish a voice channel connection with a terminal device, collect voice information, send the voice information to the terminal device over the voice channel, receive over the voice channel the voice analysis result sent by the terminal device, and play the voice analysis result;
a terminal device, configured to establish a voice channel connection with the Bluetooth speaker and receive, over the voice channel, the voice information sent by the Bluetooth speaker; to maintain the voice channel connection with the Bluetooth speaker while uploading the voice information to the cloud, so that the cloud parses the voice information and obtains a voice analysis result; and to send the voice analysis result to the Bluetooth speaker over the voice channel, so that the Bluetooth speaker plays the voice analysis result;
a cloud, configured to receive the voice information sent by the terminal device, parse the voice information, and return the voice analysis result to the terminal device.
A sixth aspect of the embodiments of the present invention provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the voice interaction control method described above.
Compared with the prior art, the embodiments of the present invention have the following beneficial effects. During voice interaction through a Bluetooth speaker, a voice channel connection is established between the Bluetooth speaker and the terminal device, and that connection is maintained both while the terminal device uploads the voice information to the cloud for parsing and when the voice analysis result is returned to the terminal device. The Bluetooth speaker then receives the voice analysis result from the terminal device directly over the voice channel and plays it. This improves the response speed of voice interaction with the Bluetooth speaker and reduces the delay after the voice information has been collected.
To explain the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are merely some embodiments of the present invention; those of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a schematic diagram of an application scenario of a voice interaction control system provided by an embodiment of the present invention;
FIG. 2 is a schematic flowchart of a voice interaction control method provided by an embodiment of the present invention;
FIG. 3 is a schematic flowchart of another voice interaction control method provided by an embodiment of the present invention;
FIG. 4 is a schematic diagram of the interaction flow of a voice interaction control method provided by an embodiment of the present invention;
FIG. 5 is a schematic diagram of a voice interaction control apparatus provided by an embodiment of the present invention;
FIG. 6 is a schematic diagram of a terminal device provided by an embodiment of the present invention.
In the following description, specific details such as particular system structures and technologies are set forth for the purpose of illustration rather than limitation, in order to provide a thorough understanding of the embodiments of the present invention. However, it should be clear to those skilled in the art that the present invention can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, apparatuses, circuits, and methods are omitted so that unnecessary detail does not obscure the description of the present invention.
It should be understood that, when used in this specification and the appended claims, the term "comprising" indicates the presence of the described features, integers, steps, operations, elements, and/or components, but does not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or collections thereof.
It should also be understood that the terminology used in this specification is for the purpose of describing particular embodiments only and is not intended to limit the present invention. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes these combinations.
To illustrate the technical solutions of the present invention, specific embodiments are described below.
FIG. 1 shows a schematic diagram of an application scenario of the voice interaction control system provided by an embodiment of the present invention. For ease of description, only the parts related to this embodiment are shown.
Referring to FIG. 1, in this system the Bluetooth speaker 11 collects voice information and sends it to the terminal device 12 over the established voice channel. The terminal device 12 uploads the voice information to the cloud 13, which performs the voice parsing. While the terminal device 12 uploads the voice information to the cloud 13 and while the cloud 13 parses it, the voice channel connection is kept open. The terminal device 12 receives the voice analysis result returned by the cloud 13 and sends it over the voice channel to the Bluetooth speaker 11, which plays the voice analysis result.
The voice interaction control method in the system scenario shown in FIG. 1 is described in detail below:
FIG. 2 shows a schematic flowchart of the voice interaction control method provided by an embodiment of the present invention. In this embodiment, the method is executed by the Bluetooth speaker 11 shown in FIG. 1. It may also be executed by other Bluetooth devices that carry out networked voice interaction, such as Bluetooth headsets and in-vehicle Bluetooth devices. The details are as follows:
Step S201: establish a voice channel connection with the terminal device, and collect voice information.
In the embodiment of the present invention, the terminal device may be any device capable of a Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, or a desktop computer. The Bluetooth speaker collects voice information through a microphone and may have a built-in microphone array for far-field pickup. The Bluetooth speaker includes but is not limited to an ordinary single-tube Bluetooth speaker, an outdoor single-tube Bluetooth speaker, a home dual-tube Bluetooth speaker, an outdoor sports Bluetooth speaker, or a large multi-tube home Bluetooth speaker, any of which can collect voice information. To transmit voice information, the Bluetooth speaker needs to establish a voice channel connection with the terminal device; the voice channel may be a synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection.
Step S202: send the voice information to the terminal device over the voice channel, so that the terminal device uploads the voice information to the cloud, and maintain the voice channel connection when the cloud returns the voice analysis result to the terminal device.
In the embodiment of the present invention, the Bluetooth speaker sends the collected voice information to the terminal device over the voice channel. The terminal device may be any device capable of a Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, or a desktop computer. The terminal device uploads the voice data to the cloud through the Internet or the Internet of Things for voice parsing, which includes speech recognition and feature extraction and produces the corresponding voice analysis result. The voice channel connection between the Bluetooth speaker and the terminal device is maintained when the voice analysis result is returned to the terminal device.
Further, after establishing the voice channel connection with the terminal device and collecting the voice information, the method also includes:
A1. sending the voice information to the terminal device over the voice channel, and disconnecting the voice channel connection after the terminal device has finished uploading the voice information to the cloud;
A2. establishing a voice channel connection with the terminal device again before the cloud returns the voice analysis result to the terminal device.
In the embodiment of the present invention, once the Bluetooth speaker has finished collecting the voice information and has sent it to the terminal device over the voice channel, and the terminal device has uploaded it to the cloud, the Bluetooth speaker disconnects the voice channel connection with the terminal device. While waiting for the cloud to parse the voice information, and before the voice analysis result is returned, the Bluetooth speaker and the terminal device establish the voice channel connection again. Waiting for the cloud to respond takes on the order of seconds, for example 1 second or 2 seconds, whereas establishing the voice channel connection takes on the order of hundreds of milliseconds, for example 0.3 seconds or 0.4 seconds. The voice channel connection can therefore be re-established before the cloud returns the voice analysis result, or while the result is being returned.
The voice channel connection may be a synchronous connection-oriented (SCO) link.
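The timing argument for steps A1 and A2 can be checked with a few lines of Python. This is a minimal sketch rather than anything specified in the source: the function name `extra_wait_after_cloud_reply` is hypothetical, and it assumes that reconnection starts as soon as the upload finishes, so that it overlaps with the wait for the cloud. The sample durations are the illustrative values quoted in the paragraph above.

```python
def extra_wait_after_cloud_reply(cloud_wait_s: float, setup_s: float) -> float:
    """Time still needed to finish reconnecting once the cloud reply arrives.

    Assumption (not from the source): reconnection (step A2) starts when the
    upload ends, so it runs concurrently with the wait for the cloud.
    """
    return max(0.0, setup_s - cloud_wait_s)


# Durations mentioned in the description: a cloud wait of 1 s or 2 s and
# a channel setup time of 0.3 s or 0.4 s.
SAMPLES = [(1.0, 0.3), (1.0, 0.4), (2.0, 0.3), (2.0, 0.4)]
EXTRA_WAITS = [extra_wait_after_cloud_reply(c, s) for c, s in SAMPLES]
```

Under that assumption, every sampled combination leaves no residual wait, because the setup time is shorter than the cloud wait. A residual wait would appear only if the cloud replied faster than the channel could be set up.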
Further, after establishing the voice channel connection with the terminal device and collecting the voice information, the method also includes:
sending the voice information to the terminal device over the voice channel, and maintaining the voice channel connection established with the terminal device until the terminal device has uploaded the voice information to the cloud and the cloud has returned the voice analysis result.
In the embodiment of the present invention, once the voice channel connection between the Bluetooth speaker and the terminal device has been established, it is not disconnected. The voice channel connection is maintained throughout the upload of the voice information to the cloud by the terminal device, the parsing of the voice information by the cloud, and the period before the cloud returns the voice analysis result.
Step S203: receive, over the voice channel, the voice analysis result sent by the terminal device, and play the voice analysis result.
In the embodiment of the present invention, the voice channel may be an established synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection. The terminal device may be a networked device that supports Bluetooth connections, such as a mobile phone or a computer. The Bluetooth speaker receives the voice analysis result sent by the terminal device over the voice channel and, after steps such as decoding and amplification, plays it through its loudspeaker. For example, if the collected voice information is "What's the weather like today?", the speaker replies: "Today is sunny, the temperature is 21 degrees, with a northerly wind of force 2 to 3."
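Steps S201 to S203 on the speaker side can be sketched as follows. This is an illustrative in-memory model under stated assumptions, not a real Bluetooth implementation: `FakeVoiceChannel`, `speaker_interaction`, and the `record` and `play` callbacks are hypothetical names, and the `terminal` callable stands in for the whole terminal-plus-cloud round trip.

```python
from typing import Callable, Optional


class FakeVoiceChannel:
    """In-memory stand-in for an SCO or A2DP link (not a real Bluetooth API)."""

    def __init__(self, terminal: Callable[[bytes], bytes]) -> None:
        self._terminal = terminal          # hypothetical terminal + cloud round trip
        self._reply: Optional[bytes] = None
        self.connected = False
        self.connect_count = 0             # how many times the link was set up

    def connect(self) -> None:
        if not self.connected:
            self.connected = True
            self.connect_count += 1

    def send(self, voice: bytes) -> None:
        self._require_connected()
        self._reply = self._terminal(voice)

    def receive(self) -> bytes:
        self._require_connected()
        if self._reply is None:
            raise RuntimeError("no voice analysis result available")
        reply, self._reply = self._reply, None
        return reply

    def _require_connected(self) -> None:
        if not self.connected:
            raise RuntimeError("voice channel is not connected")


def speaker_interaction(
    channel: FakeVoiceChannel,
    record: Callable[[], bytes],
    play: Callable[[bytes], None],
) -> None:
    channel.connect()            # S201: establish the voice channel
    voice = record()             # S201: collect voice information
    channel.send(voice)          # S202: send; the channel is not torn down
    result = channel.receive()   # S203: receive the result over the same channel
    play(result)                 # S203: play it back
```

The point of the sketch is that `connect` is called once per interaction: the result comes back over the channel that carried the request, so no second setup is needed.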
FIG. 3 shows a schematic flowchart of another voice interaction control method provided by an embodiment of the present invention. In this embodiment, the method is executed by the terminal device 12 shown in FIG. 1. It may also be executed by other networked devices that support Bluetooth connections, such as mobile phones, computers, and tablets. The details are as follows:
Step S301: establish a voice channel connection with the Bluetooth speaker, and receive, over the voice channel, the voice information sent by the Bluetooth speaker.
In the embodiment of the present invention, whenever data is to be received from the Bluetooth speaker or audio data is to be played through the Bluetooth speaker, a voice channel connection with the Bluetooth speaker must be established. The voice channel may be a synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection. The Bluetooth speaker may have a built-in microphone array for far-field pickup, and includes but is not limited to an ordinary single-tube Bluetooth speaker, an outdoor single-tube Bluetooth speaker, a home dual-tube Bluetooth speaker, an outdoor sports Bluetooth speaker, or a large multi-tube home Bluetooth speaker.
In addition, the Bluetooth speaker may be replaced by another device that collects voice information, namely any Bluetooth device that supports a voice channel connection and carries out networked voice interaction, such as a headset or an in-vehicle Bluetooth device.
Step S302: upload the voice information to the cloud, so that the cloud parses the voice information, and maintain the voice channel connection with the Bluetooth speaker when the voice analysis result is obtained.
In the embodiment of the present invention, the terminal device may be any device capable of a Bluetooth connection, such as a mobile phone, a notebook, a palmtop computer, or a desktop computer. The terminal device uploads the voice information to the cloud or a server, which parses the user's speech, including speech recognition and feature extraction, and generates the corresponding voice analysis result. When the cloud returns the voice analysis result, the voice channel is still connected, so the terminal device does not need to connect to the Bluetooth speaker again. This avoids the delay of re-establishing the connection, and the voice analysis result is sent directly to the Bluetooth speaker over the voice channel.
Further, after establishing the voice channel connection with the Bluetooth speaker and receiving, over the voice channel, the voice information sent by the Bluetooth speaker, the method also includes:
B1. uploading the voice information to the cloud, and disconnecting the voice channel connection with the Bluetooth speaker after the upload to the cloud has finished;
B2. establishing a voice channel connection with the Bluetooth speaker again before the cloud has parsed the voice information and returned the voice analysis result, and receiving the voice analysis result sent by the cloud.
In the embodiment of the present invention, after the terminal device has received the voice information and uploaded it to the cloud, it disconnects the voice channel connection with the Bluetooth speaker. While waiting for the cloud to parse the voice information, and before the voice analysis result is returned, the terminal device and the Bluetooth speaker establish the voice channel connection again. Waiting for the cloud to respond takes on the order of seconds, for example 1 second or 2 seconds, whereas establishing the voice channel connection takes on the order of hundreds of milliseconds, for example 0.3 seconds or 0.4 seconds. The voice channel connection can therefore be re-established before the cloud returns the voice analysis result, or while the result is being returned.
The voice channel connection may be a synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection.
Further, after establishing the voice channel connection with the Bluetooth speaker and receiving, over the voice channel, the voice information sent by the Bluetooth speaker, the method also includes:
receiving, over the voice channel, the voice information sent by the Bluetooth speaker, and maintaining the voice channel connection established with the Bluetooth speaker until the voice information has been uploaded to the cloud and the cloud has returned the voice analysis result.
In the embodiment of the present invention, once the voice channel connection between the terminal device and the Bluetooth speaker has been established, it is not disconnected. The voice channel connection is maintained throughout the upload of the voice information to the cloud by the terminal device, the parsing of the voice information by the cloud, and the period before the cloud returns the voice analysis result.
In addition, an SCO channel may be established to receive the voice information sent by the Bluetooth speaker, and the same SCO channel may be used to send the voice analysis result to the Bluetooth speaker. Because the voice data is carried over the established SCO link in both directions, no other type of voice channel needs to be switched to or established, and keeping the SCO link open in between has no other effect on either the terminal device or the Bluetooth speaker.
Step S303: send the voice analysis result to the Bluetooth speaker over the voice channel, so that the Bluetooth speaker plays the voice analysis result.
In the embodiment of the present invention, the voice channel may be an established synchronous connection-oriented (SCO) link or an Advanced Audio Distribution Profile (A2DP) connection. The terminal device may be a networked device that supports Bluetooth connections, such as a mobile phone or a computer. The Bluetooth speaker receives the voice analysis result sent by the terminal device over the voice channel and, after steps such as decoding and amplification, plays it through its loudspeaker. For example, if the collected voice information is "What's the weather like today?", the speaker replies: "Today is sunny, the temperature is 21 degrees, with a northerly wind of force 2 to 3."
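Steps S301 to S303 on the terminal-device side can be sketched in the same spirit. This is a hedged illustration only: `FakeSpeakerLink`, `terminal_interaction`, and the `upload_to_cloud` callback are hypothetical names, and the link is an in-memory stand-in rather than a real SCO or A2DP API.

```python
from typing import Callable, List


class FakeSpeakerLink:
    """In-memory stand-in for the voice channel as seen by the terminal device."""

    def __init__(self, captured_voice: bytes) -> None:
        self._captured_voice = captured_voice   # what the speaker recorded
        self.is_open = False
        self.open_count = 0                      # number of channel setups
        self.sent_to_speaker: List[bytes] = []

    def open(self) -> None:
        if not self.is_open:
            self.is_open = True
            self.open_count += 1

    def receive_voice(self) -> bytes:
        assert self.is_open, "voice channel must be open"
        return self._captured_voice

    def send_result(self, result: bytes) -> None:
        assert self.is_open, "voice channel must be open"
        self.sent_to_speaker.append(result)


def terminal_interaction(
    link: FakeSpeakerLink,
    upload_to_cloud: Callable[[bytes], bytes],
) -> bytes:
    link.open()                        # S301: establish the voice channel
    voice = link.receive_voice()       # S301: receive the voice information
    result = upload_to_cloud(voice)    # S302: the link stays open while waiting
    link.send_result(result)           # S303: send the result over the same link
    return result
```

Because the link is never closed between S301 and S303, `send_result` succeeds without a second `open`, which is the delay the embodiment aims to avoid.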
With the embodiments of the present invention, during voice interaction through a Bluetooth speaker, a voice channel connection is established between the Bluetooth speaker and the terminal device, and that connection is maintained both while the terminal device uploads the voice information to the cloud for parsing and when the voice analysis result is returned to the terminal device. The Bluetooth speaker receives the voice analysis result from the terminal device directly over the voice channel and plays it, which improves the response speed of voice interaction with the Bluetooth speaker and reduces the delay after the voice information has been collected. With the embodiments of the present invention, the time taken to reply with the voice analysis result after the voice information has been collected is approximately 20% shorter than with the conventional scheme, a clearly perceptible improvement for the user. In addition, use of the SCO link is further optimized by avoiding unnecessary SCO link setup, which reduces the time required for voice interaction and improves the user experience.
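A small calculation shows how a saving of roughly this size can arise. This is a sketch under stated assumptions, not a reproduction of the authors' measurement: the function names are hypothetical, the 1.4 s cloud wait is an assumed value within the "seconds" range mentioned earlier, and 0.35 s lies between the 0.3 s and 0.4 s setup times quoted in the description.

```python
def post_capture_latency(cloud_s: float, setup_s: float, keep_channel: bool) -> float:
    """Delay between the end of voice capture and the start of playback.

    Simplified model (an assumption): only the cloud wait and the channel
    setup contribute. Keeping the channel open removes the setup term.
    """
    return cloud_s if keep_channel else cloud_s + setup_s


def relative_saving(cloud_s: float, setup_s: float) -> float:
    """Fraction of the conventional latency saved by keeping the channel open."""
    old = post_capture_latency(cloud_s, setup_s, keep_channel=False)
    new = post_capture_latency(cloud_s, setup_s, keep_channel=True)
    return (old - new) / old


# Illustrative values only: 1.4 s cloud wait (assumed), 0.35 s channel setup.
SAVING = relative_saving(1.4, 0.35)
```

With these assumed inputs the conventional path takes 1.75 s and the kept-open path 1.4 s, a saving of about one fifth. The real figure depends on the actual cloud and link timings.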
FIG. 4 shows a schematic diagram of the interaction flow of the voice interaction control method provided by an embodiment of the present invention. For ease of description, only the parts related to the embodiment are shown. The participants in this interaction flow are the Bluetooth speaker, the terminal device, and the cloud. The flow works on the same principles as described for each participant with reference to FIG. 2 and FIG. 3, so it is only outlined here:
1. a voice channel is established with the Bluetooth speaker;
2. the Bluetooth speaker collects voice information;
3. the voice information is sent to the terminal device over the voice channel;
4. the voice information is uploaded to the cloud;
5. the cloud parses the voice information to obtain the voice analysis result;
6. the voice analysis result is returned to the terminal device;
7. the terminal device sends the voice analysis result to the Bluetooth speaker over the voice channel;
8. the Bluetooth speaker plays the voice analysis result.
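The eight steps can be traced end to end with a toy simulation. This is a sketch only: `run_interaction` and `cloud_parse` are hypothetical names, the cloud reply is canned, and the assignment of step 1 to the terminal device is an assumption read from the wording of that step.

```python
from typing import List, Tuple

Trace = List[Tuple[int, str, str]]   # (step number, actor, action)


def cloud_parse(voice: str) -> str:
    """Hypothetical cloud service returning a canned reply for the demo."""
    return f"reply to: {voice}"


def run_interaction(utterance: str) -> Trace:
    """Walk through steps 1 to 8 with the voice channel kept open throughout."""
    trace: Trace = []

    channel_open = True              # step 1: channel set up once
    trace.append((1, "terminal", "establish voice channel with speaker"))
    voice = utterance
    trace.append((2, "speaker", "collect voice information"))
    trace.append((3, "speaker", "send voice information over voice channel"))
    trace.append((4, "terminal", "upload voice information to cloud"))
    result = cloud_parse(voice)
    trace.append((5, "cloud", "parse voice information"))
    trace.append((6, "cloud", "return voice analysis result to terminal"))
    if not channel_open:             # never true here: the channel is not torn down
        raise RuntimeError("voice channel would have to be re-established")
    trace.append((7, "terminal", "send result over voice channel"))
    trace.append((8, "speaker", f"play: {result}"))
    return trace
```

In this model no step between 1 and 8 closes the channel, so step 7 proceeds without a new setup.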
It should be noted that other orderings that those skilled in the art can readily conceive within the technical scope disclosed by the present invention shall also fall within the protection scope of the present invention, and are not described one by one here.
It should be understood that the sequence numbers of the steps in the above embodiments do not imply an order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation of the embodiments of the present invention.
FIG. 5 shows a schematic diagram of the voice interaction control apparatus provided by an embodiment of the present invention. For ease of description, only the parts related to the embodiment of the present invention are shown.
The voice interaction control apparatus includes:
a voice collection module 51, configured to collect voice information;
a first channel establishment module 52, configured to establish a voice channel connection with the terminal device, and to maintain the voice channel connection while the voice analysis result is returned to the terminal device;
a first voice sending module 53, configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis and a voice analysis result is returned;
a voice playback module 54, configured to receive, through the voice channel, the voice analysis result sent by the terminal device, and to play the voice analysis result.
Further, an embodiment of the present invention also provides a terminal device, including:
a second channel establishment module 61, configured to establish a voice channel connection with the Bluetooth speaker;
a data receiving module 62, configured to receive, through the voice channel, the voice information sent by the Bluetooth speaker;
a second data sending module 63, configured to upload the voice information to the cloud, so that the cloud analyzes the voice information and obtains a voice analysis result;
a third data sending module 64, configured to send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
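One way to picture the four terminal-side modules is as methods on a single object whose cloud upload is asynchronous while the voice channel stays open. The sketch below is only an illustration under assumptions: `FakeVoiceChannel` and `fake_cloud_parse` are hypothetical stand-ins rather than a real Bluetooth stack or cloud service, and mapping modules 61 to 64 onto methods is one possible decomposition, not the one the patent prescribes.

```python
import asyncio


class FakeVoiceChannel:
    """Hypothetical in-memory channel; records what was sent to the speaker."""

    def __init__(self):
        self.is_open = False
        self.sent = []

    def open(self):
        self.is_open = True

    def send(self, data: bytes) -> None:
        if not self.is_open:
            raise ConnectionError("voice channel is closed")
        self.sent.append(data)


async def fake_cloud_parse(voice: bytes) -> bytes:
    """Stand-in for the cloud round trip; yields control like real network I/O."""
    await asyncio.sleep(0)
    return b"reply:" + voice


class TerminalDevice:
    def __init__(self, channel, cloud_parse):
        self.channel = channel
        self.cloud_parse = cloud_parse

    def establish_channel(self) -> None:              # module 61
        self.channel.open()

    def receive_voice(self, voice: bytes) -> bytes:   # module 62
        if not self.channel.is_open:
            raise ConnectionError("voice channel is closed")
        return voice

    async def upload_to_cloud(self, voice: bytes) -> bytes:  # module 63
        return await self.cloud_parse(voice)

    def send_result(self, result: bytes) -> None:     # module 64
        self.channel.send(result)

    async def handle_utterance(self, voice: bytes) -> bytes:
        self.establish_channel()
        received = self.receive_voice(voice)
        # The channel is deliberately left open while waiting for the cloud.
        result = await self.upload_to_cloud(received)
        self.send_result(result)
        return result


channel = FakeVoiceChannel()
terminal = TerminalDevice(channel, fake_cloud_parse)
reply = asyncio.run(terminal.handle_utterance(b"hi"))
```

The point of the design is that `send_result` never has to reopen the channel: it was opened once in `establish_channel` and is still open after the awaited cloud call returns.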
Further, an embodiment of the present invention also provides a voice interaction system, including:
a Bluetooth speaker, configured to establish a voice channel connection with the terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive through the voice channel the voice analysis result sent by the terminal device, and play the voice analysis result;
a terminal device, configured to establish a voice channel connection with the Bluetooth speaker and receive, through the voice channel, the voice information sent by the Bluetooth speaker; to maintain the voice channel connection with the Bluetooth speaker and upload the voice information to the cloud, so that the cloud analyzes the voice information and obtains a voice analysis result; and to send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result;
a cloud, configured to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.
Those skilled in the art can clearly understand that, for convenience and brevity of description, the above division into functional modules is used only as an example. In practical applications, the above functions may be allocated to different functional units or modules as required; that is, the internal structure of the mobile terminal may be divided into different functional units or modules to complete all or part of the functions described above. The functional modules in the embodiments may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit. In addition, the specific names of the functional modules are only intended to distinguish them from one another and are not used to limit the protection scope of the present application. For the specific working process of the modules in the above mobile terminal, reference may be made to the corresponding process in the foregoing method embodiments, which is not repeated here.
Those skilled in the art can likewise clearly understand that, for convenience and brevity of description, the above division into functional units and modules is used only as an example. In practical applications, the above functions may be allocated to different functional units or modules as required; that is, the internal structure of the apparatus may be divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit. In addition, the specific names of the functional units and modules are only intended to distinguish them from one another and are not used to limit the protection scope of the present invention. For the specific working process of the units and modules in the above system, reference may be made to the corresponding process in the foregoing method embodiments, which is not repeated here.
In the above embodiments, the description of each embodiment has its own emphasis. For parts that are not detailed or recorded in one embodiment, reference may be made to the related descriptions of the other embodiments.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented by electronic hardware, or by a combination of computer software and electronic hardware. Whether these functions are executed in hardware or in software depends on the specific application and the design constraints of the technical solution. Skilled practitioners may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.
In the embodiments provided by the present invention, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other ways. For example, the apparatus/terminal device embodiments described above are merely illustrative. The division into modules or units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. Furthermore, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, all or part of the processes in the methods of the above embodiments may also be completed by a computer program instructing the relevant hardware. The computer program may be stored in a computer-readable storage medium, and when executed by a processor it implements the steps of the foregoing method embodiments. The computer program includes computer program code, which may be in source code form, object code form, an executable file, or some intermediate form. The computer-readable medium may include: any entity or apparatus capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disc, a computer memory, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), an electrical carrier signal, a telecommunication signal, a software distribution medium, and the like. It should be noted that the content covered by the computer-readable medium may be increased or decreased as appropriate according to the requirements of legislation and patent practice in a given jurisdiction. For example, in some jurisdictions, according to legislation and patent practice, computer-readable media do not include electrical carrier signals and telecommunication signals.
The above embodiments are only intended to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments, or make equivalent replacements of some of the technical features. Such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all fall within the protection scope of the present invention.
Claims (10)
- A voice interaction control method, applied to a Bluetooth speaker, characterized by comprising: establishing a voice channel connection with a terminal device and collecting voice information; sending the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud, and maintaining the voice channel connection while the cloud returns a voice analysis result to the terminal device; and receiving, through the voice channel, the voice analysis result sent by the terminal device, and playing the voice analysis result.
- The voice interaction control method according to claim 1, characterized in that, after establishing the voice channel connection with the terminal device and collecting the voice information, the method further comprises: sending the voice information to the terminal device through the voice channel, and disconnecting the voice channel connection after the terminal device has finished uploading the voice information to the cloud; and establishing a voice channel connection with the terminal device again before the cloud returns the voice analysis result to the terminal device.
- The voice interaction control method according to claim 1, characterized in that, after establishing the voice channel connection with the terminal device and collecting the voice information, the method further comprises: sending the voice information to the terminal device through the voice channel, and maintaining the voice channel connection established with the terminal device until the terminal device has uploaded the voice information to the cloud and the cloud has returned the voice analysis result.
- A voice interaction control method, characterized by comprising: establishing a voice channel connection with a Bluetooth speaker, and receiving, through the voice channel, voice information sent by the Bluetooth speaker; uploading the voice information to the cloud, so that the cloud analyzes the voice information, and maintaining the voice channel connection with the Bluetooth speaker while the voice analysis result is obtained; and sending the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
- The voice interaction control method according to claim 4, characterized in that, after establishing the voice channel connection with the Bluetooth speaker and receiving, through the voice channel, the voice information sent by the Bluetooth speaker, the method further comprises: uploading the voice information to the cloud, and disconnecting the voice channel connection with the Bluetooth speaker after the upload to the cloud has finished; and, before the cloud analyzes the voice information and returns the voice analysis result, establishing a voice channel connection with the Bluetooth speaker again and receiving the voice analysis result sent by the cloud.
- The voice interaction control method according to claim 4, characterized in that, after establishing the voice channel connection with the Bluetooth speaker and receiving, through the voice channel, the voice information sent by the Bluetooth speaker, the method further comprises: receiving, through the voice channel, the voice information sent by the Bluetooth speaker, and maintaining the voice channel connection established with the Bluetooth speaker until the voice information has been uploaded to the cloud and the cloud has returned the voice analysis result.
- A voice interaction control apparatus, characterized by comprising: a voice collection module, configured to collect voice information; a first channel establishment module, configured to establish a voice channel connection with a terminal device, and to maintain the voice channel connection while the voice analysis result is returned to the terminal device; a first voice sending module, configured to send the voice information to the terminal device through the voice channel, so that the terminal device uploads the voice information to the cloud for analysis and a voice analysis result is returned; and a voice playback module, configured to receive, through the voice channel, the voice analysis result sent by the terminal device, and to play the voice analysis result.
- A terminal device, characterized by comprising: a second channel establishment module, configured to establish a voice channel connection with a Bluetooth speaker; a data receiving module, configured to receive, through the voice channel, voice information sent by the Bluetooth speaker; a second data sending module, configured to upload the voice information to the cloud, so that the cloud analyzes the voice information and obtains a voice analysis result; and a third data sending module, configured to send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result.
- A voice interaction control system, characterized by comprising: a Bluetooth speaker, configured to establish a voice channel connection with a terminal device, collect voice information, send the voice information to the terminal device through the voice channel, receive through the voice channel the voice analysis result sent by the terminal device, and play the voice analysis result; a terminal device, configured to establish a voice channel connection with the Bluetooth speaker and receive, through the voice channel, the voice information sent by the Bluetooth speaker, to maintain the voice channel connection with the Bluetooth speaker and upload the voice information to the cloud, so that the cloud analyzes the voice information and obtains a voice analysis result, and to send the voice analysis result to the Bluetooth speaker through the voice channel, so that the Bluetooth speaker plays the voice analysis result; and a cloud, configured to receive the voice information sent by the terminal device, analyze the voice information, and return the voice analysis result to the terminal device.
- A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 6 are implemented.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811231063.2A CN111081238B (en) | 2018-10-22 | 2018-10-22 | Bluetooth sound box voice interaction control method, device and system |
CN201811231063.2 | 2018-10-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020082710A1 true WO2020082710A1 (en) | 2020-04-30 |
Family
ID=70308349
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/084835 WO2020082710A1 (en) | 2018-10-22 | 2019-04-28 | Voice interaction control method, apparatus and system for bluetooth speaker |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111081238B (en) |
WO (1) | WO2020082710A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113709558A (en) * | 2021-10-09 | 2021-11-26 | 立讯电子科技(昆山)有限公司 | Multimedia processing method and multimedia interaction system |
CN113852937A (en) * | 2020-06-28 | 2021-12-28 | 阿里巴巴集团控股有限公司 | Method and device for establishing Bluetooth connection and vehicle-mounted equipment |
CN114253148A (en) * | 2021-12-09 | 2022-03-29 | 英华达(上海)科技有限公司 | Intelligent device control method, gateway device and intelligent device control system |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120253823A1 (en) * | 2004-09-10 | 2012-10-04 | Thomas Barton Schalk | Hybrid Dialog Speech Recognition for In-Vehicle Automated Interaction and In-Vehicle Interfaces Requiring Minimal Driver Processing |
CN103095813A (en) * | 2012-12-31 | 2013-05-08 | 威盛电子股份有限公司 | Voice interaction system, mobile terminal device and voice communication method |
CN104202488A (en) * | 2014-09-29 | 2014-12-10 | 广东欧珀移动通信有限公司 | Method for conducting voice interaction through Bluetooth headset and mobile terminal, and system |
US20150331666A1 (en) * | 2014-05-15 | 2015-11-19 | Tyco Safety Products Canada Ltd. | System and Method for Processing Control Commands in a Voice Interactive System |
CN106878850A (en) * | 2017-03-13 | 2017-06-20 | 歌尔股份有限公司 | Method, system and the wireless headset of interactive voice are realized using wireless headset |
CN107277272A (en) * | 2017-07-25 | 2017-10-20 | 深圳市芯中芯科技有限公司 | A kind of bluetooth equipment voice interactive method and system based on software APP |
CN108428452A (en) * | 2018-03-14 | 2018-08-21 | 百度在线网络技术(北京)有限公司 | Terminal support and far field voice interactive system |
CN108492823A (en) * | 2018-03-07 | 2018-09-04 | 广东思派康电子科技有限公司 | Voice song-requesting interaction system and voice song-requesting interaction method |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2787790B1 (en) * | 2012-11-16 | 2017-07-26 | Huawei Device Co., Ltd. | Method, mobile terminal and system for establishing bluetooth connection |
CN105161111B (en) * | 2015-08-25 | 2017-09-26 | 百度在线网络技术(北京)有限公司 | Audio recognition method and device based on bluetooth connection |
- 2018-10-22: CN application CN201811231063.2A, patent CN111081238B, status: Active
- 2019-04-28: WO application PCT/CN2019/084835, publication WO2020082710A1, status: Application Filing
Non-Patent Citations (1)
Title |
---|
陈飞 (CHEN, FEI): "低功耗智能蓝牙云交互耳机的设计与实现 (Design and Implementation of a Low-Power Cloud Interactive Headphone Based on Bluetooth)", 电脑知识与技术 (COMPUTER KNOWLEDGE AND TECHNOLOGY), vol. 14, no. 28, 8 October 2018 (2018-10-08), ISSN: 1009-3044 * |
Also Published As
Publication number | Publication date |
---|---|
CN111081238A (en) | 2020-04-28 |
CN111081238B (en) | 2022-09-23 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 19876504; Country of ref document: EP; Kind code of ref document: A1 |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | 122 | Ep: pct application non-entry in european phase | Ref document number: 19876504; Country of ref document: EP; Kind code of ref document: A1 |