WO2020062861A1

WO2020062861A1 - Voice playback control method and device for bluetooth speaker

Info

Publication number: WO2020062861A1
Application number: PCT/CN2019/084833
Authority: WO
Inventors: 祁学文; 吴海全; 迟欣; 张恩勤; 曹磊; 师瑞文
Original assignee: 深圳市冠旭电子股份有限公司
Priority date: 2018-09-28
Filing date: 2019-04-28
Publication date: 2020-04-02
Also published as: CN110971744A; CN110971744B

Abstract

The present application relates to the technical field of Bluetooth speaker control and provides a playback control method and device for a Bluetooth speaker, the method comprising: collecting voice data, and transmitting the voice data to a mobile terminal, uploading the voice data to a server by means of the mobile terminal for carrying out voice recognition; receiving a message transmitted by the mobile terminal indicating that the uploading of the voice data has been completed, and establishing a first voice channel with the mobile terminal before the server sends the result of the voice recognition as a feedback to the mobile terminal; receiving the result of the voice recognition transmitted by the mobile terminal by means of the first voice channel and playing the result of the voice recognition, wherein the result of the voice recognition is the result sent by the server to the mobile terminal as a feedback. The present invention can establish a voice playback channel with a mobile terminal in the process of voice recognition, and can play the voice directly without requiring to establish a connection after receiving the result of voice recognition as a feedback, thus reducing the delay during the voice interaction with a Bluetooth speaker and increasing the response speed.

Description

Method and device for controlling voice playback of Bluetooth speaker

Technical field

The invention belongs to the technical field of Bluetooth speaker control, and particularly relates to a method and a device for controlling voice playback of a Bluetooth speaker.

Background technique

Currently, wireless speakers are becoming more and more popular. Bluetooth speakers with voice wake-up function, which can support both recording and playback, are widely used. The mobile phone establishes a connection with the Bluetooth speaker, transmits the voice data recorded by the Bluetooth speaker to the mobile phone, and interacts with the server through the mobile phone application App. The server performs voice recognition and returns the result to the mobile phone, and then transmits the mobile phone app to the Bluetooth speaker for playback . During the playback of Bluetooth speakers, A2DP (Advanced Audio Distribution Profile (Bluetooth Audio Transmission Protocol) connection, and when the mobile phone receives the server's feedback results, the A2DP connection needs to be established only when the feedback results are played to the Bluetooth speakers through A2DP; therefore, there is a problem with After the server feedback result is obtained, the Bluetooth A2DP connection is delayed when voice is played by the Bluetooth speaker, and the response speed of the speaker is slow during the voice interaction process, which reduces the user experience effect.

technical problem

In view of this, embodiments of the present invention provide a method and a device for controlling voice playback of a Bluetooth speaker, so as to solve the problems of connection delay and slow response of the speaker during the voice interaction in the prior art.

Technical solutions

A first aspect of the embodiments of the present invention provides a method for controlling voice playback of a Bluetooth speaker, including:

Collect voice data and send the voice data to a mobile terminal, and the voice data is uploaded to the server via the mobile terminal for voice recognition;

Receiving a message of completion of uploading voice data sent by a mobile terminal, and establishing a first voice path with the mobile terminal before the server feeds back a voice recognition result to the mobile terminal;

Receiving a voice recognition result sent by a mobile terminal via the first voice path, and playing the voice recognition result; wherein the voice recognition result is a result fed back by the server to the mobile terminal.

A second aspect of the embodiments of the present invention provides a method for controlling voice playback of a Bluetooth speaker, including:

Receiving voice data sent by a Bluetooth speaker, and uploading the voice data to a server for voice recognition;

Send the message that the voice data upload is completed to the Bluetooth speaker, and establish a first voice path with the Bluetooth speaker before receiving the voice recognition result fed back by the server;

Receiving a voice recognition result fed back by the server, and sending the voice recognition result to a Bluetooth speaker via the first voice path for voice playback.

A third aspect of the embodiments of the present invention provides a method for controlling voice playback of a Bluetooth speaker, including:

The Bluetooth speaker sends voice data to the mobile terminal;

The mobile terminal uploads the voice data to a server;

The Bluetooth speaker receives a message that the upload of the voice data is completed by the mobile terminal;

The Bluetooth speaker establishes a first voice path with the mobile terminal, while the server performs voice recognition;

The mobile terminal receives the speech recognition result;

The mobile terminal sends the voice recognition result to a Bluetooth speaker through a first voice path, and the Bluetooth speaker performs voice playback.

A fourth aspect of the embodiments of the present invention provides a Bluetooth speaker voice playback control device, including:

A first voice data processing module, configured to collect voice data and send the voice data to a mobile terminal, where the voice data is uploaded to a server via the mobile terminal for voice recognition;

A first connection establishing module, configured to receive a voice data upload end message sent by a mobile terminal, and establish a first voice path with the mobile terminal before the server feeds back a speech recognition result to the mobile terminal;

The voice playback module is configured to receive a voice recognition result sent by the mobile terminal via the first voice path, and play the voice recognition result; wherein the voice recognition result is a result fed back from the server to the mobile terminal.

A fifth aspect of the embodiments of the present invention provides a mobile terminal, including:

A second voice data processing module, configured to receive voice data sent by the Bluetooth speaker end, and upload the voice data to a server for voice recognition;

A second connection establishing module, configured to send a message that voice data uploading is completed to the Bluetooth speaker, and establish a first voice path with the Bluetooth speaker before receiving the voice recognition result fed back by the server;

The voice recognition result processing module is configured to receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker via the first voice path for voice playback.

A sixth aspect of the embodiments of the present invention provides a Bluetooth speaker voice playback control system, including a Bluetooth speaker, a mobile terminal, and a server.

The Bluetooth speaker is used to collect voice data and send the voice data to the mobile terminal through the second voice path;

The mobile terminal is configured to receive the voice data and upload the voice data to the server, and feedback a message that the voice data upload is completed to the Bluetooth speaker;

A server for receiving and recognizing the voice data and feeding back a voice recognition result corresponding to the voice data;

The Bluetooth speaker and the mobile terminal are respectively used to establish a first voice path before the server feeds back the voice recognition result;

The mobile terminal is further configured to receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker via the first voice path.

The Bluetooth speaker is also used to receive the voice recognition results sent by the mobile terminal and perform voice playback.

A seventh aspect of the embodiments of the present invention provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements the steps of the foregoing method.

Beneficial effect

Compared with the prior art, the embodiment of the present invention has the beneficial effect that the embodiment of the present invention can establish a voice playback path between the Bluetooth speaker and the mobile terminal before the server feedbacks the voice recognition result, and upon receiving the feedback from the server to the mobile terminal, When the speech recognition result is used, it is not necessary to connect the channels and directly play the voice, which reduces the delay of the voice interaction of the Bluetooth speaker and improves the response speed of the voice interaction.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings used in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings in the following description are only the present invention. For some embodiments, for those of ordinary skill in the art, other drawings can be obtained according to these drawings without paying creative labor.

FIG. 1 is a schematic diagram of a system scenario applicable to a method for controlling voice playback of a Bluetooth speaker according to Embodiment 1 of the present invention; FIG.

FIG. 2 is a schematic flowchart of implementing a method for controlling voice playback of a Bluetooth speaker according to a second embodiment of the present invention; FIG.

3 is a schematic flowchart of a method for controlling a Bluetooth speaker voice playback method provided by a mobile terminal according to a third embodiment of the present invention;

4 is a schematic diagram of an interaction flow of a method for controlling voice playback of a Bluetooth speaker according to a fourth embodiment of the present invention;

FIG. 5 is an exemplary diagram of a Bluetooth speaker voice playback control device provided by Embodiment 5 of the present invention.

Embodiments of the invention

In the following description, for the purpose of illustration rather than limitation, specific details such as a specific system structure and technology are provided in order to thoroughly understand the embodiments of the present invention. However, it should be clear to a person skilled in the art that the present invention can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary details.

It should be understood that when used in this specification and the appended claims, the term "comprising" indicates the presence of described features, integers, steps, operations, elements and / or components, but does not exclude one or more other features , The whole, steps, operations, elements, components, and / or their presence or addition.

It should also be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to limit the invention. As used in the description of the invention and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms unless the context clearly indicates otherwise.

It should be further understood that the term "and / or" used in the present description and the appended claims refers to any combination of one or more of the listed items and all possible combinations, and includes these combinations .

In order to explain the technical solution of the present invention, the following description is made through specific embodiments.

Example one

FIG. 1 is a schematic diagram of a system scenario applicable to a method for controlling voice playback of a Bluetooth speaker according to an embodiment of the present invention. For convenience of explanation, only parts related to this embodiment are shown.

Referring to FIG. 1, the system collects voice data from the Bluetooth speaker 11 and transmits it to the mobile terminal 12. The mobile terminal 12 uploads the voice data to the server 13 and performs voice recognition by the server 13. The server 13 feeds back the voice recognition results to the mobile terminal. Before 12, the Bluetooth speaker 11 and the mobile terminal 12 established a voice playback channel. When the Bluetooth speaker 11 received the voice recognition result fed back from the server 13 to the mobile terminal 12, it did not need to connect the channel and directly played the voice. The delay of voice interaction of the Bluetooth speaker is reduced, and the response speed of the voice interaction is improved.

The method for controlling voice playback of the Bluetooth speaker in the system scenario shown in FIG. 1 is described in detail below.

Example two

FIG. 2 is a schematic flowchart of a method for controlling a voice playback of a Bluetooth speaker according to an embodiment of the present invention. In this embodiment, the execution subject of this process is the Bluetooth speaker 11 shown in FIG. 1, which is detailed as follows:

Step S201: Collect voice data and send the voice data to a mobile terminal, and the voice data is uploaded to the server via the mobile terminal for voice recognition.

In the embodiment of the present invention, the Bluetooth speaker has a built-in microphone array and can also be used for long-distance pickup. The Bluetooth speakers include, but are not limited to: ordinary monocular Bluetooth speakers, outdoor monophonic Bluetooth speakers, home-style dual-tube Bluetooth speakers, Outdoor sports Bluetooth speakers or large multi-cylinder home Bluetooth speakers can collect voice data and transmit the voice data to the mobile terminal through the established Bluetooth protocol.

The voice data is transmitted by a mobile terminal to a server or the cloud for voice recognition through the network, and the voice recognition is to transform unstructured voice data information into a structured index through voice recognition to implement audio or recording Data information mining and retrieval; including signal processing and feature extraction of speech information, decoding of acoustic models and language models, and finally generating speech recognition results.

Further, the step of collecting voice data and transmitting the voice data to a mobile terminal, where the voice data is used for uploading to the server for voice recognition via the mobile terminal, includes:

A1. Generate a wake-up event and send the wake-up event to the mobile terminal.

In this embodiment, the wake-up event may be a voice wake-up event; the Bluetooth speaker has a built-in microphone array, which can collect voice data in real time, and the voice data can be used as a match with the wake-up keyword or as a voice recognition Voice data source. When the microphone array of the Bluetooth speaker is always in a low-power operation state, only the data is collected and the wake-up word is matched, the voice data can be continuously recorded; when the recorded voice data is matched by the wake-up algorithm to the wake-up keyword, the Bluetooth speaker triggers an interrupt. And the mobile terminal is notified of the voice wake-up event through the protocol stack.

A2: After sending the wake-up event is completed, establish a second voice path with the mobile terminal.

In this embodiment, the second voice path may be a voice data path or a synchronous SCO connection path; after the speaker terminal sends a wake-up event to the mobile terminal, a synchronous SCO connection is established with the mobile terminal. Because it is synchronized with the mobile terminal for the SCO connection, the microphone array on the Bluetooth speaker side is turned on to receive voice data.

A3. The voice data is sent to the mobile terminal via the second voice path.

In this embodiment, after the second voice path connection is established, the Bluetooth speaker receives the input voice data, and sends the voice data to the mobile terminal through the second voice path, thereby transmitting the voice data during the voice interaction process.

Step S202: Receive a voice data upload end message sent by the mobile terminal, and establish a first voice path with the mobile terminal before the server feeds back the voice recognition result to the mobile terminal.

In the embodiment of the present invention, the first voice path is a voice playback path, which may be a Bluetooth audio transmission protocol connection established between the Bluetooth speaker and the mobile terminal; before the server returns the voice recognition result or the voice data information is uploaded to the server, After receiving the message that the upload of the voice data transmitted by the mobile terminal is completed, the Bluetooth speaker end establishes a voice playback channel with the mobile terminal. Because voice recognition takes time, voice feedback arrives at the mobile terminal through the network. After the voice data is uploaded to the server, the voice recognition channel is established before waiting for voice feedback; the voice playback channel connection will be established and voice recognition will be established. The processes are performed simultaneously in different child threads.

Further, before receiving the message that the upload of the voice data transmitted by the mobile terminal is completed, and before the server feeds back the voice recognition result to the mobile terminal, establishing a first voice path with the mobile terminal includes:

When the server starts speech recognition, a Bluetooth audio transmission protocol connection is established with the mobile terminal.

In this embodiment, the Bluetooth audio transmission protocol established between the Bluetooth speaker and the mobile terminal may be a Bluetooth audio transmission protocol A2DP connection or a synchronous SCO-oriented connection; the synchronous SCO-oriented connection is bidirectional and can collect voice data, Voice data can also be played; the Bluetooth audio transmission protocol A2DP connection can support mono or stereo high-quality audio data transmission, and has a higher sampling rate.

Step S203: Receive a voice recognition result sent by the mobile terminal via the first voice path, and play the voice recognition result; wherein the voice recognition result is a result fed back from the server to the mobile terminal.

In the embodiment of the present invention, the first voice path is a voice playback path. Since the voice playback path between the Bluetooth speaker and the mobile terminal has been established, the Bluetooth speaker directly sends an air packet after receiving the voice recognition result fed back by the server. It receives the voice recognition result sent by the mobile terminal and plays the voice after receiving the air packet data to realize the rapid response of the voice interaction process and reduce the delay of the voice interaction of the Bluetooth speaker.

According to the embodiment of the present invention, when the Bluetooth speaker voice interaction is performed, after the voice data is entered and uploaded to the server, the establishment of the voice playback path with the mobile terminal is started, so that the establishment of the voice playback path and the voice recognition and voice feedback Synchronized execution in different sub-threads. After the voice feedback is over, since the voice playback channel has been established, the voice playback is directly performed, which improves the response speed and reduces the interaction delay.

Example three

FIG. 3 is a schematic flowchart of a method for controlling voice playback of a Bluetooth speaker according to an embodiment of the present invention. In this embodiment, the execution subject of this process is the mobile terminal 12 shown in FIG. 1. The mobile terminal may be a mobile phone, a computer, or a tablet with a Bluetooth connection function, which is not specifically limited herein, and is described in detail below. :

Step S301: Receive voice data sent by a Bluetooth speaker, and upload the voice data to a server for voice recognition.

In the embodiment of the present invention, the mobile terminal performs voice pickup through the Bluetooth speaker end, and after receiving the input voice data, establishes a connection with an independent server or the cloud, and uploads the received voice data to the independent server or the cloud, and the independent server Or cloud for voice recognition of voice data.

Further, the step of receiving voice data transmitted by the Bluetooth speaker end and uploading the voice data to a server for voice recognition includes:

B1. Receive the wake-up event sent by the Bluetooth speaker.

In this embodiment, the wake-up event may be a voice wake-up event; when the voice data entered on the Bluetooth speaker end is matched with the wake-up keyword by the wake-up algorithm, the Bluetooth speaker triggers an interrupt, and the mobile terminal receives the Bluetooth speaker through the protocol line After the voice wake-up event is received, the mobile terminal responds to the wake-up event and performs a voice pickup process from the Bluetooth speaker end.

B2. Establish a second voice path with the Bluetooth speaker according to the wake-up event.

In this embodiment, the second voice path may be a voice data path for transmitting voice data; the voice data path may also be a synchronously-oriented SCO connection path; after the mobile terminal receives a voice wake-up event, then Immediately establish a connection with the voice data transmission path of the Bluetooth speaker, and specifically establish a synchronous SCO-oriented connection. The synchronous SCO-oriented connection is bidirectional, which is mainly used for synchronous voice transmission and uses reserved time slots to transmit data packets. Can transmit voice or data.

B3. Receive voice data of the Bluetooth speaker through the second voice path; wherein the first voice path is established after the second voice path is established.

In this embodiment, the second voice channel may be a voice data channel, and specifically may be a synchronous SCO-oriented connection channel; since the mobile terminal and the Bluetooth speaker end maintain a synchronous-oriented connection, it is preferentially opened when the voice data channel is established. The microphone array on the Bluetooth speaker side, the mobile terminal picks up the voice from the Bluetooth speaker side through the voice data channel, and obtains the voice data through the voice data channel.

In step S302, a message that voice data uploading is completed is sent to the Bluetooth speaker, and a first voice path is established with the Bluetooth speaker before receiving the voice recognition result fed back by the server.

In the embodiment of the present invention, the first voice path may be a voice playback path; it may be a Bluetooth audio transmission protocol established by a Bluetooth speaker and a mobile terminal; specifically, it may be a Bluetooth audio transmission protocol A2DP connection, or it may be a synchronous SCO connection. . After the mobile terminal uploads the voice data to the cloud or a stand-alone server, it sends a message that the upload is complete to the Bluetooth speaker, and establishes a voice playback channel with the Bluetooth speaker before receiving the server's feedback of the speech recognition result, or after sending the message that the upload is complete.

It should be noted that, while establishing a voice playback channel with a Bluetooth speaker, the cloud or an independent server performs voice recognition on the voice data and feedbacks the voice recognition results to the mobile terminal, that is, the establishment of the voice playback channel and the voice recognition and voice feedback. Simultaneously execute in different threads. When the mobile terminal receives the speech recognition result, the speech playback path has been established.

Step S303: Receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker for voice playback via the first voice path.

In the embodiment of the present invention, the first voice path may be a voice playback path; since a voice playback path has been established with a Bluetooth speaker, after receiving the voice recognition result fed back by the server, the voice is sent directly in the form of an air packet. The recognition result is transmitted to the Bluetooth speaker, and the voice data of the air packet is played to realize the rapid response of the voice interaction process and reduce the delay of the voice interaction of the Bluetooth speaker.

According to the embodiment of the present invention, the mobile terminal performs voice pickup through the Bluetooth speaker, and uploads the voice data to the server for voice recognition, and completes the establishment of the voice playback channel with the Bluetooth speaker before the upload is completed and the voice recognition is received. After the voice recognition result, the voice recognition result is directly transmitted to the Bluetooth speaker through the established voice playback channel for voice playback, which reduces the delay of the Bluetooth connection and improves the response rate of voice interaction.

Embodiment 4

FIG. 4 shows a schematic diagram of an interaction process of a method for controlling voice playback of a Bluetooth speaker according to an embodiment of the present invention. The execution subject participating in the interaction process includes a Bluetooth speaker and a mobile terminal. The implementation principle of the interaction process is as described in FIGS. 2 to 3 The implementation principle of each execution subject side is the same, so this interaction process is only briefly described, without repeating:

1. The Bluetooth speaker sends voice data to the mobile terminal;

2. The mobile terminal uploads the voice data to the server;

3. The Bluetooth speaker receives a message that the upload of the voice data sent by the mobile terminal is completed;

4. The Bluetooth speaker establishes the first voice path with the mobile terminal, and the server performs voice recognition at the same time;

5. The mobile terminal receives the voice recognition result;

6. The mobile terminal sends the voice recognition result to the Bluetooth speaker via the first voice path, and the Bluetooth speaker performs voice playback.

Further, the method for controlling voice playback of a Bluetooth speaker further includes:

The Bluetooth speaker sends a wake event to the mobile terminal;

According to the wake-up event, the Bluetooth speaker establishes a second voice path with the mobile terminal;

The Bluetooth speaker sends voice data to the mobile terminal via a second voice path; wherein the first voice path is established after the second voice path is established.

Further, the Bluetooth speaker establishes a first voice path with the mobile terminal, and the server performs voice recognition, including:

When the server performs voice recognition, the Bluetooth speaker establishes a Bluetooth audio transmission protocol connection with the mobile terminal.

It should be noted that other sorting schemes that can be easily conceived by those skilled in the art within the technical scope disclosed in the present invention should also fall within the protection scope of the present invention, and are not described in detail here.

It should be understood that the size of the sequence numbers of the steps in the above embodiments does not mean the order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present invention.

Example 5

FIG. 5 shows an example diagram of a Bluetooth speaker voice playback control device according to an embodiment of the present invention. For ease of description, only parts related to the embodiment of the present invention are shown.

The Bluetooth speaker voice playback control device includes:

A first voice data processing module 51, configured to collect voice data and send the voice data to a mobile terminal, where the voice data is used for uploading to the server for voice recognition via the mobile terminal;

A first connection establishing module 52, configured to receive a voice data upload end message sent by the mobile terminal, and establish a voice playback channel with the mobile terminal before the server feeds back the voice recognition result to the mobile terminal;

The voice playback module 53 is configured to receive a voice recognition result sent by a mobile terminal via the first voice path, and play the voice recognition result; wherein the voice recognition result is a result fed back from the server to the mobile terminal.

Further, the Bluetooth speaker voice playback control device further includes:

A wake-up module for generating a wake-up event and sending the wake-up event to a mobile terminal;

A second voice path establishment module is configured to establish a second voice path with the mobile terminal after the sending of the wake-up event is completed.

Further, an embodiment of the present invention further provides a mobile terminal, including:

A second connection establishing module, configured to send a message that voice data uploading is completed to the Bluetooth speaker end, and establish a first voice path with the Bluetooth speaker before receiving the voice recognition result fed back by the server;

The voice recognition result processing module is configured to receive the voice recognition result fed back by the server, and send the voice recognition result to the Bluetooth speaker end via the first voice path for voice playback.

Further, an embodiment of the present invention further provides a Bluetooth speaker voice playback control system, including a Bluetooth speaker, a mobile terminal, and a server;

The mobile terminal is further configured to receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker via the first voice path;

An embodiment of the present invention further provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements steps of a Bluetooth speaker voice playback control method.

Those skilled in the art can clearly understand that, for the convenience and brevity of the description, only the above-mentioned division of functional units and modules is used as an example. In practical applications, the above functions can be allocated by different functional units according to needs. Module completion, that is, dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit. The integrated unit may be hardware. It can be implemented in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing from each other, and are not used to limit the protection scope of the present invention. For specific working processes of the units and modules in the foregoing system, reference may be made to corresponding processes in the foregoing method embodiments, and details are not described herein again.

In the above embodiments, the description of each embodiment has its own emphasis. For a part that is not detailed or recorded in an embodiment, reference may be made to related descriptions of other embodiments.

Those of ordinary skill in the art may realize that the units and algorithm steps of each example described in connection with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. A person skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of the present invention.

In the embodiments provided by the present invention, it should be understood that the disclosed apparatus / terminal device and method may be implemented in other ways. For example, the device / terminal device embodiments described above are only schematic. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be another division manner, such as multiple units. Or components can be combined or integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, which may be electrical, mechanical or other forms.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objective of the solution of this embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or in the form of software functional unit.

When the integrated module / unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on such an understanding, the present invention implements all or part of the processes in the methods of the above embodiments, and may also be completed by a computer program instructing related hardware. The computer program may be stored in a computer-readable storage medium. The computer When the program is executed by a processor, the steps of the foregoing method embodiments can be implemented. The computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a mobile hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electric carrier signals, telecommunication signals, and software distribution media. It should be noted that the content contained in the computer-readable medium can be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdictions. For example, in some jurisdictions, the computer-readable medium Excludes electric carrier signals and telecommunication signals.

The above-mentioned embodiments are only used to illustrate the technical solutions of the present invention, but not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still implement the foregoing implementations. The technical solutions described in the examples are modified, or some of the technical features are equivalently replaced; and these modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the present invention, and should be included in Within the scope of the present invention.

Claims

A method for controlling voice playback of a Bluetooth speaker, comprising:

Collect voice data and send the voice data to a mobile terminal, and the voice data is uploaded to the server via the mobile terminal for voice recognition;

Receiving a message of completion of uploading voice data sent by a mobile terminal, and establishing a first voice path with the mobile terminal before the server feeds back a voice recognition result to the mobile terminal;

Receiving a voice recognition result sent by a mobile terminal via the first voice path, and playing the voice recognition result; wherein the voice recognition result is a result fed back by the server to the mobile terminal.
The method for controlling voice playback of a Bluetooth speaker according to claim 1, wherein the voice data is collected and transmitted to the mobile terminal, and the voice data is used for uploading to the server for voice recognition through the mobile terminal. ,include:

Generate a wake-up event and send the wake-up event to the mobile terminal;

Establishing a second voice path with the mobile terminal after the wake-up event is sent;

The voice data is sent to a mobile terminal via the second voice path.
The method for controlling voice playback of a Bluetooth speaker according to claim 1, wherein before the server feeds back the voice recognition result to the mobile terminal, establishing a first voice path with the mobile terminal comprises:

When the server performs voice recognition, a Bluetooth audio transmission protocol connection is established with the mobile terminal.
A method for controlling voice playback of a Bluetooth speaker, comprising:

Receiving voice data sent by a Bluetooth speaker, and uploading the voice data to a server for voice recognition;

Send the message that the voice data upload is completed to the Bluetooth speaker, and establish a first voice path with the Bluetooth speaker before receiving the voice recognition result fed back by the server;

Receiving a voice recognition result fed back by the server, and sending the voice recognition result to a Bluetooth speaker via the first voice path for voice playback.
The method for controlling voice playback of a Bluetooth speaker according to claim 4, before receiving voice data sent by the Bluetooth speaker and uploading the voice data to a server for voice recognition, comprising:

Receive the wake-up event sent by the Bluetooth speaker;

Establishing a second voice path with the Bluetooth speaker according to the wake-up event;

Receive voice data of a Bluetooth speaker through the second voice path; wherein the first voice path is established after the second voice path is established.
A method for controlling voice playback of a Bluetooth speaker, comprising:

The Bluetooth speaker sends voice data to the mobile terminal;

The mobile terminal uploads the voice data to a server;

The Bluetooth speaker receives a message that the upload of the voice data is completed by the mobile terminal;

The Bluetooth speaker establishes a first voice path with the mobile terminal, while the server performs voice recognition;

The mobile terminal receives the speech recognition result;

The mobile terminal sends the voice recognition result to a Bluetooth speaker through a first voice path, and the Bluetooth speaker performs voice playback.
The method for controlling voice playback of a Bluetooth speaker according to claim 6, further comprising:

The Bluetooth speaker sends a wake event to the mobile terminal;

According to the wake-up event, the Bluetooth speaker establishes a second voice path with the mobile terminal;

The Bluetooth speaker sends voice data to the mobile terminal via a second voice path; wherein the first voice path is established after the second voice path is established.
The method for controlling voice playback of a Bluetooth speaker according to claim 6, wherein the Bluetooth speaker establishes a first voice path with the mobile terminal and the server performs voice recognition, comprising:

When the server performs voice recognition, the Bluetooth speaker establishes a Bluetooth audio transmission protocol connection with the mobile terminal.
A Bluetooth speaker voice playback control device is characterized in that it includes:

A first voice data processing module, configured to collect voice data and send the voice data to a mobile terminal, where the voice data is uploaded to a server via the mobile terminal for voice recognition;

A first connection establishing module, configured to receive a voice data upload end message sent by a mobile terminal, and establish a first voice path with the mobile terminal before the server feeds back a speech recognition result to the mobile terminal;

The voice playback module receives a voice recognition result sent by a mobile terminal via the first voice path, and plays the voice recognition result; wherein the voice recognition result is a result fed back from the server to the mobile terminal.
The Bluetooth speaker voice playback control device according to claim 9, further comprising:

A wake-up module for generating a wake-up event and sending the wake-up event to a mobile terminal;

A second voice path establishment module is configured to establish a second voice path with the mobile terminal after the sending of the wake-up event is completed.
A mobile terminal, comprising:

A second voice data processing module, configured to receive voice data sent by the Bluetooth speaker end, and upload the voice data to a server for voice recognition;

A second connection establishing module, configured to send a message that voice data uploading is completed to the Bluetooth speaker, and establish a first voice path with the Bluetooth speaker before receiving the voice recognition result fed back by the server;

The voice recognition result processing module is configured to receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker via the first voice path for voice playback.
A Bluetooth speaker voice playback control system, comprising a Bluetooth speaker, a mobile terminal, and a server.

The Bluetooth speaker is used to collect voice data and send the voice data to the mobile terminal through the second voice path;

The mobile terminal is configured to receive the voice data and upload the voice data to the server, and feedback a message that the voice data upload is completed to the Bluetooth speaker;

A server for receiving and recognizing the voice data and feeding back a voice recognition result corresponding to the voice data;

The Bluetooth speaker and the mobile terminal are respectively used to establish a first voice path before the server feeds back the voice recognition result;

The mobile terminal is further configured to receive a voice recognition result fed back by the server, and send the voice recognition result to a Bluetooth speaker via the first voice path;

The Bluetooth speaker is also used to receive the voice recognition results sent by the mobile terminal and perform voice playback.
A computer-readable storage medium storing a computer program, wherein when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 8 are implemented.