WO2019178739A1

WO2019178739A1 - Speaker, intelligent terminal, and speaker and intelligent terminal-based interactive control method

Info

Publication number: WO2019178739A1
Application number: PCT/CN2018/079603
Authority: WO
Inventors: 夏新元
Original assignee: 深圳市柔宇科技有限公司
Priority date: 2018-03-20
Filing date: 2018-03-20
Publication date: 2019-09-26
Also published as: CN111819867A

Abstract

Disclosed in the present technical solution is a speaker and intelligent terminal-based interactive control method, comprising: a speaker establishes a connection to an intelligent terminal; a microphone of the speaker collects sound information; a processor of the speaker converts the sound information into a digital signal and simultaneously generates a trigger signal; an output unit of the speaker transmits the digital signal and the trigger signal to the intelligent terminal; and the processor of the intelligent terminal is triggered after receiving the trigger signal, and then processes the digital signal and performs an operation corresponding to the digital signal. Also disclosed in the present technical solution are the speaker and the intelligent terminal.

Description

Speaker, intelligent terminal and interactive control method based on speaker and intelligent terminal

Technical field

The invention relates to the field of intelligent terminals, in particular to a speaker, an intelligent terminal and an interactive control method based on a speaker and an intelligent terminal.

Background technique

With the development of technology, the types and functions of electronic products are more and more, people's interaction requirements between various electronic products are getting higher and higher; traditional electronic products have single functions and cannot be controlled interactively, which can no longer satisfy people's Claim.

Summary of the invention

The embodiment of the invention discloses a speaker, an intelligent terminal and an interactive control method based on the speaker and the intelligent terminal, so that the speaker and the intelligent terminal can perform interactive control, realize more functions, and greatly improve the convenience of life.

An interactive control method based on a speaker and an intelligent terminal, comprising: establishing a connection between a speaker and a smart terminal; collecting sound information by a microphone of the speaker; and converting the sound information into a digital signal by the processor of the speaker simultaneously Generating a trigger signal; the output unit of the speaker transmits the digital signal and the trigger signal to the smart terminal; and the processor of the smart terminal is triggered after receiving the trigger signal, and then The digital signal is processed and an operation corresponding to the digital signal is performed.

A speaker comprising: a microphone for collecting sound information; a speaker processor for converting the sound information into a digital signal and simultaneously generating a trigger signal; and a speaker output unit for using the digital signal and The trigger signal is transmitted to an intelligent terminal, and the trigger signal is used to trigger the smart terminal.

An intelligent terminal, comprising: a terminal processor, configured to be triggered after receiving a trigger signal sent by a speaker, and configured to send a digital signal to the speaker for processing and perform an operation corresponding to the digital signal; and output And means for transmitting an audio to the speaker in response to an operation performed by the terminal processor.

The technical solution connects the intelligent terminal to a speaker with a microphone and a microphone, so that the speaker and the intelligent terminal can interact and realize more functions, thereby greatly improving the convenience of life.

DRAWINGS

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings to be used in the embodiments will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without paying any creative work.

FIG. 1 is a flowchart of an interaction control method based on a speaker and an intelligent terminal according to a first embodiment of the present technical solution.

2 is a schematic block diagram of an interactive control device based on a speaker and an intelligent terminal according to a second embodiment of the present technical solution.

3 is a schematic structural view of a sound box in the interactive control device of FIG. 2.

FIG. 4 is a schematic structural diagram of an intelligent terminal in the interaction control apparatus in FIG. 2.

FIG. 5 is a schematic structural diagram of an interactive control device obtained by mounting the smart terminal of FIG. 4 on the speaker of FIG. 3. FIG.

FIG. 6 is still another schematic structural diagram of an interaction control apparatus according to an embodiment of the present disclosure.

FIG. 7 is a flowchart of a speaker-based control method according to a third embodiment of the present technical solution.

FIG. 8 is a flowchart of a method for controlling an intelligent terminal according to a fourth embodiment of the present technology.

detailed description

The technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. . All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

Please refer to FIG. 1 , which is a flowchart of an interaction control method based on a speaker and an intelligent terminal according to a first embodiment of the present technical solution. It should be noted that the interaction control method of the embodiment of the present technical solution is not limited to the steps and the sequence in the flowchart shown in FIG. 1 . The steps in the illustrated flow diagrams can be added, removed, or changed in order, depending on the requirements. The smart terminal according to the embodiment of the present invention may be a smart device such as a tablet computer, a mobile phone, a remote controller, an electronic reader, a personal computer (PC), a notebook computer, an in-vehicle device, a network television, a wearable device, or the like.

As shown in FIG. 1, the interactive control method 100 includes the following steps:

S101, a speaker establishes a connection with a smart terminal.

The speaker can be wiredly connected to at least one intelligent terminal through a metal touch pad, an audio jack or a USB interface, or can be a short-range wireless communication NFC connection mode, a Bluetooth communication connection mode, a wireless network communication connection mode or an infrared communication connection mode. A wireless connection is established with at least one smart terminal. Of course, other connection manners may also be adopted, which is not limited to this implementation.

S102. The microphone of the speaker collects sound information.

In this embodiment, the sound information is monitored and collected in real time through the microphone array of the speaker. The sound information includes voice information of the user.

S103. The processor of the speaker converts the sound information into a digital signal and simultaneously generates a trigger signal.

In an optional embodiment, after the speaker receives the sound information through the microphone array, the processor of the speaker may intercept the digital signal according to a preset rule, thereby acquiring the complete voice sent by the user. The preset rule may be whether the interrupt duration of the detected voice signal reaches a preset threshold. For example, when it is detected that the interruption time of the voice signal reaches 0.75 s, the processor of the speaker determines that the user stops talking and intercepts the voice signal before the current time. Each segment of the voice signal corresponds to a trigger signal.

S104. The output unit of the speaker transmits the digital signal and the trigger signal to the smart terminal.

The trigger signal is used to trigger the smart terminal.

It should be noted that, in this embodiment, the sound box does not recognize the collected sound but collects it all, and the screening and recognition of the sound are completed by the smart terminal. In other embodiments, the processor of the speaker can also perform preliminary screening or identification of the collected sound before transmitting to the smart terminal.

S105. The processor of the smart terminal is triggered after receiving the trigger signal, and then processes the digital signal and performs an operation corresponding to the digital signal.

That is to say, even if the smart terminal in the embodiment of the present technical solution is in the standby state, after the sound is collected by the speaker, the smart terminal can be triggered by the trigger signal, and the smart terminal that is locked need not be manually turned on.

The processing of the digital signal by the processor of the smart terminal and performing an operation includes: performing noise reduction processing and recognition on the digital signal, and obtaining a recognition result; and then performing a correspondence according to the recognition result. The operation of identifying the result.

Since the ambient noise is present in the radio environment, and the ambient noise affects the accuracy of subsequent speech recognition and semantic analysis, the processor of the intelligent terminal needs to perform noise reduction processing on the received digital signal before identifying. .

The process of recognition includes speech recognition and semantic recognition. In this embodiment, after the noise reduction process, the processor of the smart terminal may perform voice recognition on the digital signal through a recognition model (including an acoustic model, a language model, a pronunciation dictionary, and the like) stored in a memory of the smart terminal. Then, the speech recognition result is semantically analyzed to obtain the recognition result. The smart terminal can continuously optimize the recognition model through artificial intelligence or machine deep learning when accessing the network, and gradually improve the accuracy of the voice recognition.

In other embodiments, the output unit of the smart terminal may also send the digital signal after the noise reduction processing to the server, and the server performs voice recognition on the digital signal by using a recognition model (including an acoustic model, a language model, a pronunciation dictionary, etc.). And performing semantic analysis on the speech recognition result to obtain the recognition result, and feeding the recognition result to the smart terminal.

In some optional embodiments, when the recognition result indicates that the specified audio and video content is played, the processor of the smart terminal downloads or calls up the corresponding audio and video data and controls to play the audio and video data; when the recognition result indicates the play pair When the navigation from the start point to the specified end point is specified, the processor of the smart terminal enables the navigation software and obtains a navigation result and controls to play the navigation result; when the recognition result indicates the broadcast designation information, the processor of the smart terminal controls to play Specifying information; when the semantic analysis result indicates that the smart terminal controls the connected smart home device, the processor of the smart terminal controls to turn on and operate the smart home device; when the semantic analysis result indicates that the local function (such as the broadcast listening function) is enabled, The processor control of the smart terminal enables the corresponding local function. The embodiments of the present invention are not limited in the above manner.

In some optional embodiments, when the operation performed by the processor of the smart terminal includes playing a graphic, a video, or the like, the interactive control method further includes the steps of:

S106. The display unit of the smart terminal displays graphics and video in response to an operation performed by a processor of the smart terminal.

In some optional embodiments, the display unit of the smart terminal displays video data in the audio and video data, or displays the graphic in the navigation result, or displays the graphic in the local application.

In some optional embodiments, when the operation performed by the processor of the smart terminal includes playing audio or the like, the interaction control method further includes the steps of:

S107. The output unit of the smart terminal sends audio to the speaker, and the speaker of the speaker plays the audio.

In some optional embodiments, the loudspeaker of the speaker plays audio data in the audio and video data, or plays the audio in the navigation result, or plays the audio in the local application, or broadcasts the specified information. Wait.

In some optional embodiments, the audio is received by an input unit of the speaker, after which the processor of the speaker controls the loudspeaker of the speaker to play the audio.

In some optional embodiments, when the audio is played through a loudspeaker of the speaker, the interactive control method further includes the steps of:

S108. The smart terminal controls the speaker to play the audio through a control interface.

In some optional embodiments, the control interface of the smart terminal may be displayed on a display screen of the smart terminal; the control interface of the smart terminal may also be a touch control interface, and the user may operate the control interface by touch .

In some optional embodiments, controlling the playing of the speaker comprises: adjusting a volume of the speaker, fast forward playback or fast reverse playback, pause playback, and other gesture recall functions.

Please refer to FIG. 2 , which is a schematic diagram of a module of an interactive control device based on a speaker and an intelligent terminal according to a second embodiment of the present technology. The interactive control device 10 includes a speaker 11 and a smart terminal 12.

The speaker 11 is connected to at least one of the smart terminals 12 to enable communication; wherein the speaker 11 can be wired to at least one of the smart terminals 12 through a metal touch pad, an audio jack or a USB interface. The wireless communication NFC connection mode, the Bluetooth communication connection mode, the wireless network communication connection mode, or the infrared communication connection mode may be used to establish a wireless connection with at least one intelligent terminal. Of course, other connection modes may also be adopted, which is not limited to this implementation.

The speaker 11 includes a microphone 111, a speaker processor 112, a speaker output unit 113, a speaker input unit 114, and a loudspeaker 115.

The microphone 111 is used to collect sound information. In this embodiment, the microphone 111 includes a microphone array; the microphone 111 is used to monitor and collect sound information in real time. The sound information includes voice information of the user.

The speaker processor 1124 is configured to convert the sound information into a digital signal and simultaneously generate a trigger signal.

In an optional embodiment, after the speaker receives the sound information through the microphone array, the speaker processor 124 is further configured to intercept the digital signal according to a preset rule, thereby acquiring the complete voice sent by the user. The preset rule may be whether the interrupt duration of the detected voice signal reaches a preset threshold. For example, when it is detected that the interruption time of the voice signal reaches 0.75 s, the processor of the speaker determines that the user stops talking and intercepts the voice signal before the current time. Each segment of the voice signal corresponds to a trigger signal.

In an alternative embodiment, the speaker 11 may also include one or more programs, wherein the one or more programs are stored in a memory and configured to be executed by the speaker processor 124, The program includes instructions for performing the steps of converting sound information collected by the microphone 111 into a digital signal and simultaneously generating a trigger signal. In an optional embodiment, the program further includes instructions for performing the step of: intercepting the digital signal according to a preset rule to obtain a complete voice sent by the user.

The speaker output unit 113 is configured to transmit the digital signal and the trigger signal to the smart terminal 12 . The trigger signal is used to trigger the smart terminal.

The speaker input unit 114 is configured to receive audio sent by the smart terminal.

The loudspeaker 115 is configured to play audio sent by the smart terminal.

In an alternative embodiment, the speaker processor 124 is further configured to control the loudspeaker 115 to play audio.

The smart terminal 12 includes a terminal processor 121, a display unit 122, an output unit 123, and a control interface 124.

The terminal processor 121 is configured to be triggered after receiving the trigger signal sent by the speaker 11, and configured to process the received digital signal sent by the speaker 11 and execute a corresponding number Signal operation.

The processing of the digital signal and performing an operation includes: performing noise reduction processing and recognition on the digital signal, and obtaining a recognition result; and then performing an operation corresponding to the recognition result according to the recognition result.

Because there is environmental noise in the radio environment, and the ambient noise will affect the accuracy of subsequent speech recognition and semantic analysis, the received digital signal must be denoised first and then identified.

The recognition process includes speech recognition and semantic recognition. In this embodiment, after the noise reduction process, the terminal processor 121 may be configured to perform voice recognition on the digital signal by using a recognition model (including an acoustic model, a language model, a pronunciation dictionary, and the like) stored in a memory of the smart terminal. Then, it is used to perform semantic analysis on the speech recognition result to obtain the recognition result. The smart terminal 12 can be used to continuously optimize the recognition model through artificial intelligence or machine deep learning when accessing the network, and gradually improve the accuracy of the voice recognition.

In some optional embodiments, when the recognition result indicates that the specified audio and video content is played, the terminal processor 121 is configured to download or call up the corresponding audio and video data and control to play the audio and video data; when the recognition result indicates the playback When navigating the specified starting point to the specified end point, the terminal processor 121 is configured to enable the navigation software and obtain a navigation result and control to play the navigation result; when the recognition result indicates the broadcast designation information, the terminal processor 121 uses Controlling the playing of the specified information; when the semantic analysis result indicates that the intelligent terminal controls the connected smart home device, the terminal processor 121 is configured to control to turn on and operate the smart home device; when the semantic analysis result indicates that the local function is enabled (eg, broadcast listening) The terminal processor 121 is configured to control the activation of the corresponding local function. It can be understood that the embodiments of the present invention are not limited in the above manner.

In an optional embodiment, the smart terminal 12 may further include one or more programs, wherein the one or more programs are stored in a memory and configured to be executed by the terminal processor 121 The program includes instructions for performing the steps of: being triggered by a trigger signal; processing a digital signal and performing an operation corresponding to the digital signal. In an optional embodiment, processing a digital signal and performing an operation corresponding to the digital signal includes: performing noise reduction processing and recognition on the digital signal, and obtaining a recognition result; and then according to the recognition result An operation corresponding to the recognition result is performed.

The display unit 122 of the smart terminal 12 is configured to display graphics and video in response to operations performed by the terminal processor 121. The display unit 122 may include a display screen of the smart terminal 12. In some optional embodiments, the display unit 122 of the smart terminal is configured to display video data in audio and video data, or display graphics in the navigation result, or display graphics in a local application, and the like.

The output unit 123 of the smart terminal 12 is configured to send an audio to the speaker input unit 114 of the speaker 11 in response to an operation performed by the terminal processor 121.

In other embodiments, the output unit 123 of the smart terminal 12 can also be used to send the digital signal after the noise reduction process to a server. The server may perform speech recognition on the digital signal by identifying a model (including an acoustic model, a language model, a pronunciation dictionary, etc.), and then performing semantic analysis on the speech recognition result to obtain the recognition result, and feedback the recognition result. To the smart terminal 12.

The control interface 124 of the smart terminal 12 is used to control the speaker 11 to play the audio.

In some optional embodiments, the control interface 124 of the smart terminal 12 may be displayed on a display screen of the smart terminal; the control interface 124 of the smart terminal 12 may also be a touch control interface, and the user may operate through a touch The control interface 124 of the smart terminal 12 can also be located on a touch screen.

In some optional embodiments, controlling the playing of the speaker comprises: adjusting the volume of the speaker, fast forward playback or fast reverse playback, pause playback, and other gesture recall functions.

Please refer to FIG. 3 together. FIG. 3 is a schematic structural diagram of the speaker 11 in the interactive control device 10.

The speaker 11 is formed with a receiving structure 115 for receiving at least one smart terminal 12 .

It can be understood that the speaker 11 is provided with a microphone 111, a speaker processor (not shown), a speaker output unit (not shown), a loudspeaker 114, and the like.

In an optional embodiment, as shown in FIG. 3, the receiving structure 115 is formed with at least one set of metal touch pads 116. When the at least one smart terminal 12 is received in the receiving structure 115, the The speaker 11 can be electrically connected to the at least one smart terminal 12 through the metal touch pad 116 to perform a wired connection.

In an optional embodiment, the metal touch pad 116 is a magnetic contact pad, and can be sucked with the at least one smart terminal 12 to facilitate a fixed connection between the at least one smart terminal 12 and the speaker 11 .

In other embodiments, the metal touch pad 116 may also be disposed at other positions of the speaker 11, and is not limited to the embodiment.

In other embodiments, the metal touch pad 116 can also be an audio jack or a USB interface or the like.

In other embodiments, the metal touch pad 116 can also be replaced by a wireless communication NFC module built in the speaker 11 , a Bluetooth communication module, a wireless network communication module, or an infrared communication module.

In an optional embodiment, the receiving structure 115 may be a stepped surface formed on the speaker 11, or may be a groove or the like formed on the speaker 11.

In an alternative embodiment, as shown in FIG. 3, the speaker 11 includes a main body portion 117 and an extending portion 118 extending axially from the main body portion 117. The main body portion 117 and the extending portion 118 are substantially Cylindrical, and the diameter of the extension portion 118 is smaller than the diameter of the main body portion 117, so that the top surface of the main body portion 117 is exposed to the extension portion 118 to form an annular top surface 1171, the ring The top surface 1171 is substantially perpendicularly connected to the outer side surface 1181 of the extending portion 118 to form the receiving structure 115. The smart terminal 12 can be sleeved on the extending portion 118 to be received in the receiving portion. Structure 115.

Further, as shown in FIG. 3, the outer side surface 1181 of the extending portion 118 is formed with at least one set of metal touch pads 116. When the at least one smart terminal 12 is sleeved on the extending portion 118, the sound box is 11 may be wired to the at least one smart terminal 12 through the metal touch pad 116.

In an alternative embodiment, as shown in FIG. 3, a set of metal touch pads 116 is formed on the receiving structure 115.

In an alternative embodiment, as shown in FIG. 3, the microphone 111 and the loudspeaker 114 are disposed on the main body portion 117.

In other embodiments, the shape and structure of the speaker 11 may be other shapes and structures, and are not limited to the embodiment; for example, the main body portion and the extending portion 118 may also have a polygonal column shape. Rectangular, hemispherical, etc.

Referring to FIG. 4, FIG. 4 is a schematic structural diagram of the smart terminal 12 in the interaction control apparatus 10.

In this embodiment, the smart terminal 12 is a flexible smart terminal, such as a flexible mobile phone or a telephone watch, and can be bent and sleeved on the extending portion 118 to be received in the speaker 11 . Structure 115.

In an optional embodiment, a contact pad (not shown) corresponding to the metal touch pad 116 may be formed on the at least one smart terminal 12.

It can be understood that, as shown in FIG. 4, the display unit 122 of the smart terminal 12 includes a display screen 1221.

Please refer to FIG. 5 . FIG. 5 is a schematic structural diagram of the interaction control device 10 obtained by the smart terminal 12 being received in the receiving structure 115 of the speaker 11 .

As shown in FIG. 5, the number of the at least one smart terminal 12 is one; correspondingly, the receiving structure 115 is formed with a set of metal touch pads 116 (refer to FIG. 3).

The smart terminal 12 is sleeved on the extending portion 118, so that the smart terminal 12 is at least partially received in the receiving structure 115, and the speaker 11 passes through the outer side surface 1181 of the extending portion 118. The metal touch pad 116 is in contact with the smart terminal 12 to make a wired connection.

Please refer to FIG. 6 . FIG. 6 is still another schematic structural diagram of the interaction control device 10 obtained by the smart terminal 12 being received in the receiving structure 115 of the speaker 11 .

As shown in FIG. 6 , the number of the at least one smart terminal 12 is three; it can be understood that three sets of metal touch pads 116 are formed on the receiving structure 115 .

The three smart terminals 12 are sleeved on the extending portion 118, so that the upper smart terminals 12 are at least partially received in the receiving structure 115, and the speaker 11 passes through the outer side of the extending portion 118. The three sets of the metal touch pads 116 on the 1181 are respectively in contact with the upper smart terminal 12 for wired connection, so that the interaction of the three smart terminals 12 with the speaker 11 can be realized.

The third embodiment of the technical solution also provides a speaker-based control method. Please refer to FIG. 7 , which is a flowchart of a speaker-based control method according to a third embodiment of the present technology. It should be noted that the speaker-based control method of the embodiment of the present technical solution is not limited to the steps and the sequence in the flowchart shown in FIG. 7. The steps in the illustrated flow diagrams can be added, removed, or changed in order, depending on the requirements. The smart terminal according to the embodiment of the present invention may be a smart device such as a tablet computer, a mobile phone, a remote controller, an electronic reader, a personal computer (PC), a notebook computer, an in-vehicle device, a network television, a wearable device, or the like.

As shown in FIG. 7, the speaker-based control method 700 includes the following steps:

S701, a speaker establishes a connection with a smart terminal.

The speaker can be wired to at least one smart terminal through a metal touch pad, an audio jack or a USB interface, or can be a short-range wireless communication NFC connection method, a Bluetooth communication connection method, a wireless network communication connection method or an infrared communication connection method and at least A smart terminal establishes a wireless connection, and of course, other connection methods may also be used, which is not limited to this implementation.

S702, the microphone of the speaker collects sound information.

S703. The processor of the speaker converts the sound information into a digital signal and simultaneously generates a trigger signal.

S704. The output unit of the speaker transmits the digital signal and the trigger signal to the smart terminal.

The trigger signal is used to trigger the smart terminal.

S705. The speaker receives audio sent by the smart terminal, and the speaker of the speaker plays the audio.

In some alternative embodiments, when the audio is played through the loudspeaker of the speaker, it is also played under the control of a control interface of the smart terminal.

The control of a control interface of the smart terminal includes: adjusting the volume of the speaker, fast forward play or fast reverse play, pause play, and other gesture callout functions.

Please refer to FIG. 8 , which is a flowchart of a method for controlling an intelligent terminal according to a fourth embodiment of the present technology. It should be noted that the smart terminal-based control method according to the embodiment of the present technical solution is not limited to the steps and the sequence in the flowchart shown in FIG. 8. The steps in the illustrated flow diagrams can be added, removed, or changed in order, depending on the requirements. The smart terminal according to the embodiment of the present invention may be a smart device such as a tablet computer, a mobile phone, a remote controller, an electronic reader, a personal computer (PC), a notebook computer, an in-vehicle device, a network television, a wearable device, or the like.

As shown in FIG. 8, the intelligent terminal-based control method 800 includes the following steps:

S801. The smart terminal establishes a connection with a speaker.

S802. The processor of the smart terminal is triggered after receiving a trigger signal sent by the speaker, and then processes a digital signal sent by the speaker and performs an operation corresponding to the digital signal.

That is to say, even if the smart terminal in the embodiment of the present technical solution is in the standby state, after the sound is collected by the speaker, the smart terminal can be triggered by the trigger signal, and the smart terminal does not need to be manually turned on.

The recognition process includes speech recognition and semantic recognition. In this embodiment, after the noise reduction process, the processor of the smart terminal may perform voice recognition on the digital signal through a recognition model (including an acoustic model, a language model, a pronunciation dictionary, and the like) stored in a memory of the smart terminal. Then, the speech recognition result is semantically analyzed to obtain the recognition result. The smart terminal can continuously optimize the recognition model through artificial intelligence or machine deep learning when accessing the network, and gradually improve the accuracy of the voice recognition.

In other embodiments, the output unit of the smart terminal may also send the digital signal after the noise reduction processing to the server, and the server performs voice on the digital signal by identifying the model (including an acoustic model, a language model, a pronunciation dictionary, etc.). Identifying, then performing semantic analysis on the speech recognition result, obtaining the recognition result, and feeding back the recognition result to the smart terminal.

In some optional embodiments, when the recognition result indicates that the specified audio and video content is played, the processor of the smart terminal downloads or calls up the corresponding audio and video data and controls to play the audio and video data; when the recognition result indicates the play pair When the navigation from the start point to the specified end point is specified, the processor of the smart terminal enables the navigation software and obtains a navigation result and controls to play the navigation result; when the recognition result indicates the broadcast designation information, the processor of the smart terminal controls to play Specifying information; when the semantic analysis result indicates that the smart terminal controls the connected smart home device, the processor of the smart terminal controls to turn on and operate the smart home device; when the semantic analysis result indicates that the local function (such as the broadcast listening function) is enabled, The processor control of the smart terminal enables the corresponding local function. It can be understood that the embodiments of the present invention are not limited in the above manner.

S803. The display unit of the smart terminal displays graphics and video in response to an operation performed by a processor of the smart terminal.

S804. The output unit of the smart terminal sends audio to the speaker.

In some optional embodiments, the control method further includes the steps of:

S805. The smart terminal controls the speaker to play the audio through a control interface.

A person skilled in the art can understand that all or part of the steps of the foregoing embodiments can be completed by a program to instruct related hardware, and the program can be stored in a computer readable memory, and the memory can include: a flash drive , read-only memory (English: Read-Only Memory, referred to as: ROM), random accessor (English: Random Access Memory, referred to as: RAM), disk or CD.

The technical solution connects the smart terminal to a speaker with a microphone and a microphone, so that the speaker and the smart terminal can interact, can trigger the smart terminal through the speaker, and can control the speaker to play through the smart terminal; and, the mobile phone becomes The smart center of the home can realize real-time dialogue and operation; compared with the traditional smart speaker, because the smart terminal has a voice processing module, the speaker of the technical solution can be disposed without the independent voice processing module, but by the connected intelligent terminal. Voice processing; and the display screen and touch screen of the smart terminal can be used as the display screen and touch screen of the speaker, which increases the operation mode of the smart terminal.

The above is a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It is the scope of protection of the present invention.

Claims

An interactive control method based on a speaker and an intelligent terminal, comprising:

A speaker establishes a connection with a smart terminal;

The microphone of the speaker collects sound information;

The processor of the speaker converts the sound information into a digital signal and simultaneously generates a trigger signal;

An output unit of the speaker transmits the digital signal and the trigger signal to the smart terminal; and

The processor of the intelligent terminal is triggered after receiving the trigger signal, and then processes the digital signal and performs an operation corresponding to the digital signal.
The interactive control method based on a speaker and an intelligent terminal according to claim 1, wherein after the speaker receives the sound information through the microphone, the processor of the speaker intercepts the digital signal according to a preset rule. The output unit of the speaker transmits the intercepted digital signal to the smart terminal.
The interaction control method based on a speaker and an intelligent terminal according to claim 1, wherein the processor of the intelligent terminal processes the digital signal and performs an operation comprising: performing noise reduction processing on the digital signal And identifying, and obtaining a recognition result; performing an operation corresponding to the recognition result according to the recognition result.
The method for controlling an interaction based on a speaker and an intelligent terminal according to claim 3, wherein the process of identifying comprises voice recognition and semantic recognition, and the processor of the smart terminal is stored in a memory of the smart terminal. The recognition model performs speech recognition on the digital signal, and then performs semantic analysis on the speech recognition result to obtain the recognition result.
The interactive control method based on a speaker and an intelligent terminal according to claim 3, wherein the process of identifying comprises speech recognition and semantic recognition, and the output unit of the intelligent terminal sends the digital signal after the noise reduction process to The server performs voice recognition on the digital signal by the server through the recognition model, performs semantic analysis on the voice recognition result, obtains the recognition result, and feeds the recognition result to the smart terminal.
The interactive control method based on a speaker and an intelligent terminal according to claim 1, further comprising the step of: displaying, by the display unit of the intelligent terminal, graphics and video in response to an operation performed by the terminal processor.
The interactive control method based on a speaker and an intelligent terminal according to claim 1, further comprising the step of: the output unit of the smart terminal transmitting an audio to the speaker in response to an operation performed by the terminal processor, The loudspeaker of the speaker plays the audio.
The interactive control method based on a speaker and an intelligent terminal according to claim 7, further comprising the step of: the intelligent terminal controlling the speaker to play the audio through a control interface.
A speaker that includes:

a microphone for collecting sound information;

a speaker processor for converting the sound information into a digital signal and simultaneously generating a trigger signal;

And a speaker output unit, configured to transmit the digital signal and the trigger signal to an intelligent terminal, where the trigger signal is used to trigger the smart terminal.
The speaker according to claim 9, wherein the speaker processor is further configured to intercept the digital signal according to a preset rule to obtain a complete voice sent by the user.
A speaker according to claim 9, further comprising a loudspeaker for playing audio transmitted by said smart terminal.
The speaker according to claim 9, wherein the speaker is formed with a receiving structure, and the receiving structure is configured to receive at least one of the smart terminals.
The speaker according to claim 12, wherein the receiving structure is formed with at least one set of metal touch pads, and the at least one set of metal touch pads is configured to receive at least one of the smart terminals in the receiving structure The speaker is electrically connected to at least one of the smart terminals.
The speaker according to claim 13, wherein the speaker comprises a main body portion and an extending portion extending axially from the main body portion, the main body portion and the extending portion are both cylindrical and the extending The diameter of the portion is smaller than the diameter of the body portion, such that the top surface of the body portion is exposed to the extension portion to form an annular top surface that is perpendicular to the outer side surface of the extension portion Connecting to form the receiving structure together; the at least one set of metal touch pads is a magnetic contact pad, and the at least one set of metal touch pads is formed on an outer side of the extension.
An intelligent terminal comprising:

a terminal processor, configured to be triggered after receiving a trigger signal from a speaker, and configured to send a digital signal to the speaker for processing and perform an operation corresponding to the digital signal; and

And an output unit, configured to send an audio to the speaker in response to an operation performed by the terminal processor.
The intelligent terminal according to claim 15, wherein the processing of the digital signal for the speaker and the operation of the digital signal comprises: performing noise reduction processing and recognition on the digital signal, and obtaining a Identifying a result; performing an operation corresponding to the recognition result according to the recognition result.
The intelligent terminal according to claim 16, wherein the process of identifying comprises voice recognition and semantic recognition, and the terminal processor is configured to perform voice on a digital signal through a recognition model stored in a memory of the smart terminal. Identifying, performing semantic analysis on the speech recognition result to obtain the recognition result.
The intelligent terminal according to claim 15, further comprising a display unit configured to display the graphic and the video in response to the operation performed by the terminal processor.
The intelligent terminal of claim 15, further comprising a control interface for controlling the speaker to play the audio.
The intelligent terminal according to claim 15, wherein the smart terminal is a flexible smart terminal.