CN114553623A - Voice interaction control method and voice interaction system of electrical equipment - Google Patents

Voice interaction control method and voice interaction system of electrical equipment Download PDF

Info

Publication number
CN114553623A
CN114553623A CN202210118112.1A CN202210118112A CN114553623A CN 114553623 A CN114553623 A CN 114553623A CN 202210118112 A CN202210118112 A CN 202210118112A CN 114553623 A CN114553623 A CN 114553623A
Authority
CN
China
Prior art keywords
voice
communication device
controller
electrical equipment
central controller
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210118112.1A
Other languages
Chinese (zh)
Inventor
李保水
郑文成
刘健军
王子
梁博
陶梦春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gree Electric Appliances Inc of Zhuhai
Original Assignee
Gree Electric Appliances Inc of Zhuhai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gree Electric Appliances Inc of Zhuhai filed Critical Gree Electric Appliances Inc of Zhuhai
Priority to CN202210118112.1A priority Critical patent/CN114553623A/en
Publication of CN114553623A publication Critical patent/CN114553623A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/283Processing of data at an internetworking point of a home automation network
    • H04L12/2834Switching of information between an external network and a home network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/283Processing of data at an internetworking point of a home automation network
    • H04L12/2836Protocol conversion between an external network and a home network
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L2012/284Home automation networks characterised by the type of medium used
    • H04L2012/2841Wireless
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Abstract

The application relates to a voice interaction control method and a voice interaction system of electrical equipment, which comprise a controller and a communication module connected with the controller, wherein the communication module comprises a first communication device, a second communication device and a dispatching central controller, the first communication device is connected with the dispatching central controller, the second communication device is connected with the dispatching central controller, and the dispatching central controller is connected with the controller. The communication module with the dispatching central control is added in the electrical equipment to control the first communication device to communicate with the voice terminal equipment to carry out near-field acquisition of voice data, and then the voice data is transmitted to the second communication device to carry out two-way communication with the voice server, so that the analysis of the voice data and the issuing of a recognition result are completed, and the near-field voice control of the electrical equipment is realized. The voice acquisition and recognition module is not required to be added in the electrical equipment, the cost is reduced, the controller function of the electrical equipment is not required to be added, the universal rate of the application of the voice interaction control function is improved, and a good control effect is realized by a near-field voice acquisition mode.

Description

Voice interaction control method and voice interaction system of electrical equipment
Technical Field
The present application relates to the field of voice interaction control technologies, and in particular, to a voice interaction control method and a voice interaction system for an electrical device.
Background
With the rapid development of the internet of things and the artificial intelligence technology, more and more innovative functions are integrated into family life, for example, a user terminal and intelligent household electrical equipment adopt human-computer interaction to realize control and the like. The intelligent household electrical equipment is controlled through voice interaction, so that convenience of the intelligent household electrical equipment gradually becomes an important interaction mode, and the intelligent household electrical equipment is applied more in actual household electrical equipment products.
At present, voice interaction control schemes adopted by intelligent household electrical equipment comprise a far-field voice scheme and a near-field voice scheme. In a far-field voice scheme, a microphone assembly for voice acquisition and a front-end algorithm module need to be matched in electrical equipment, so that the cost is high, the voice interaction control effect is poor when the environmental noise is high and the distance from the equipment is long, and the application is limited. The existing near-field voice scheme also has the problem of higher cost caused by the fact that a voice module needs to be matched in the intelligent household equipment.
Disclosure of Invention
Therefore, it is necessary to provide a voice interaction control method and a voice interaction system for electrical equipment, aiming at the problems of high cost and poor control effect of the voice interaction control scheme of the existing intelligent home equipment.
An electric appliance device comprises a controller and a communication module connected with the controller, wherein the communication module comprises a first communication device, a second communication device and a dispatching central controller, the first communication device is connected with the dispatching central controller, the second communication device is connected with the dispatching central controller, and the dispatching central controller is connected with the controller;
the first communication device is used for receiving a request voice packet transmitted by voice terminal equipment and sending the request voice packet to the dispatching central controller;
the dispatching central controller is used for carrying out protocol conversion on the received request voice packet and then sending the request voice packet to the second communication device;
the second communication device is used for sending the request voice packet after protocol conversion to a voice server and receiving a recognition result returned by the voice server;
the dispatching central controller is also used for sending the identification result to the controller;
and the controller is used for generating a control instruction according to the identification result and correspondingly controlling the electrical equipment to execute corresponding operation.
In one embodiment, the dispatch central controller is further configured to pair with the voice terminal device through the first communication device according to preset pairing information.
In one embodiment, the first communication device is a Wifi-P2P communication device.
In one embodiment, the central scheduling controller is further configured to send a preset authentication message to the voice server through the second communication device for access authentication.
In one embodiment, the preset authentication information at least includes one of authentication information given by the voice server, a MAC address of the electrical device, and an UTC time for access authentication.
In one embodiment, the second communication device is a WebSocket communication device.
In one embodiment, the communication module further includes a third communication device, and the controller is connected to the dispatching center through the third communication device.
In one embodiment, the electrical equipment further comprises a voice broadcasting device, and the voice broadcasting device is connected with the dispatching central controller through the third communication device.
In one embodiment, a voice interaction control method is provided, which includes:
receiving a request voice packet transmitted by voice terminal equipment through a first communication device;
carrying out protocol conversion on the received request voice packet, sending the request voice packet to a voice server through a second communication device, and receiving an identification result returned by the voice server according to the request voice packet through the second communication device;
and sending the identification result to a controller so that the controller generates a control instruction according to the identification result and controls and executes corresponding operation according to the control instruction.
In one embodiment, a voice interaction system is provided, which includes a voice terminal device, a voice server, and the above electrical device, where the voice terminal device is connected to the electrical device, and the electrical device is connected to the voice server.
According to the voice interaction control method and the voice interaction system of the electrical equipment, the communication module with the dispatching central control is added in the electrical equipment, the first communication device is controlled to communicate with the voice terminal equipment to carry out near-field acquisition of voice data, then the voice data is transmitted to the second communication device to carry out two-way communication with the voice server, analysis and recognition results of the voice data are sent, and near-field voice control of the electrical equipment is achieved. The voice acquisition and recognition module is not required to be added in the electrical equipment, the cost is reduced, the controller function of the electrical equipment is not required to be added, the universal rate of the application of the voice interaction control function is improved, and a good control effect is realized by a near-field voice acquisition mode.
Drawings
FIG. 1 is a system block diagram of an electrical device in one embodiment;
FIG. 2 is a flow chart of a voice interaction control method according to an embodiment;
FIG. 3 is a system block diagram of a voice interaction system in one embodiment;
FIG. 4 is a flow chart of data transmission of the voice interactive system according to an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used herein in the description of the present application is for the purpose of describing particular embodiments only and is not intended to be limiting of the application.
It will be understood that, as used herein, the terms "first," "second," and the like may be used herein to describe various elements, but these elements are not limited by these terms. These terms are only used to distinguish one element from another. For example, a first resistance may be referred to as a second resistance, and similarly, a second resistance may be referred to as a first resistance, without departing from the scope of the present application. The first resistance and the second resistance are both resistances, but they are not the same resistance.
It is to be understood that "connection" in the following embodiments is to be understood as "electrical connection", "communication connection", and the like if the connected circuits, modules, units, and the like have communication of electrical signals or data with each other.
As used herein, the singular forms "a", "an" and "the" may include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises/comprising," "includes" or "including," etc., specify the presence of stated features, integers, steps, operations, components, parts, or combinations thereof, but do not preclude the presence or addition of one or more other features, integers, steps, operations, components, parts, or combinations thereof.
With the rapid development of the internet of things and the artificial intelligence technology, more and more innovative functions are integrated into family life, for example, intelligent home equipment is controlled through voice interaction, so that convenience gradually becomes an important interaction mode, and the intelligent home equipment is applied more in actual home products. At present, voice interaction control schemes adopted by intelligent household electrical equipment comprise a far-field voice scheme and a near-field voice scheme. In a far-field voice scheme, a microphone assembly for voice acquisition and a front-end algorithm module need to be matched in electrical equipment, so that the cost is high, the voice interaction control effect is poor when the environmental noise is high and the distance from the equipment is long, and the application is limited. The existing near-field voice scheme also has the problem of higher cost caused by the fact that a voice module needs to be matched in the intelligent household equipment.
Therefore, the present application provides a voice interaction control method and a voice interaction system for an electrical apparatus, which can be applied to an electrical apparatus such as an air conditioner, a washing machine, a dust collector, an electric cooker, and a television set, which performs a unique function and operation according to a user's operation. The near-field voice data is collected through the voice terminal device, the Wifi-P2P communication technology is adopted to transmit the near-field voice data to the electric appliance device, the electric appliance device communicates with the voice server, voice recognition related processing tasks are transmitted to the voice server to be completed, and the electric appliance device does not need to be matched with modules such as voice collection and front-end algorithms, so that the cost is reduced.
In one embodiment, as shown in fig. 1, an electrical appliance 100 is provided, which includes a controller 110 and a communication module 120 connected to the controller 110, wherein the communication module 120 includes a first communication device 121, a second communication device 122 and a dispatching central controller 123, the first communication device 121 is connected to the dispatching central controller 123, the second communication device 122 is connected to the dispatching central controller 123, and the dispatching central controller 123 is connected to the controller 110; the first communication device 121 is configured to receive a request voice packet transmitted by a voice terminal device and send the request voice packet to the dispatching central controller 123; the dispatching central controller 123 is configured to perform protocol conversion on the received request voice packet and send the request voice packet to the second communication device 122; the second communication device 122 is configured to send the request voice packet after performing the protocol conversion to the voice server, and receive a recognition result returned by the voice server; the scheduling central controller 123 is further configured to send the identification result to the controller 110; the controller 110 is configured to generate a control command according to the recognition result to control the electrical device to perform a corresponding operation.
The voice terminal device is a voice data acquisition terminal for realizing near-field voice control of the electrical equipment, and can be a remote controller corresponding to the electrical equipment, and can also be a mobile terminal such as a mobile phone, a tablet personal computer and a wearable device. The voice terminal equipment is provided with a near-field voice acquisition device such as a microphone, and then preprocessing such as filtering and analog-to-digital conversion is carried out on the acquired user voice signals to obtain audio signals. And then, encoding and sub-packaging the preprocessed data by adopting a Wifi module, transmitting the streaming voice data packet to electrical equipment in a closed loop mode through Wifi-P2P communication, and assisting a retransmission mechanism to ensure stable and reliable voice data transmission.
The voice server is a voice data processing end for realizing near-field voice control of the electrical equipment, and is provided with corresponding algorithms such as voice recognition, natural language processing and the like, so that the received voice data can be converted into text data, a voice command is obtained by judging according to the text data, and a corresponding signal generated by the judged voice command is returned to the electrical equipment end, so that the near-field voice control of the electrical equipment is realized. It can be understood that the voice server may be implemented by a single server, or may be a group of servers communicating with each other, and respectively implement the functions of voice recognition, text conversion, natural language processing, voice instruction generation, and the like.
The controller 110 of the electrical apparatus 100 is configured to implement control of the basic function module of the electrical apparatus 100 body, and meanwhile, the controller 110 further receives a recognition result returned by the communication module 120 according to the near-field voice control, and implements control of the basic function module of the electrical apparatus 100 body according to the recognition result. The specific hardware of the Controller 110 is not unique, and may be a Programmable Logic Controller (PLC), a Micro Control Unit (MCU), a Field Programmable Gate Array (FPGA), a Complex Programmable Logic Device (CPLD), or the like, and is implemented according to the hardware selected in the actual electrical Device 100 without this limitation. It can be understood that the voice data processing and the communication connection in the near-field voice control process of the electrical apparatus in the present application can be completed through the communication module 120 alone, without increasing the control work of the controller 110 in the electrical apparatus 100, and have a higher apparatus general application rate.
Specifically, the communication module 120 receives a request voice packet of the voice terminal device, performs protocol conversion, then sends the request voice packet to the voice server, receives an identification result returned by the voice server, and finally sends the identification result to the controller 110 to implement near-field voice control. The first communication device 121 receives a request voice packet transmitted by a voice terminal device and sends the request voice packet to the dispatching central controller 123, the dispatching central controller 123 performs protocol conversion on the received request voice packet and sends the request voice packet to the second communication device 122, the second communication device 122 sends the request voice packet after the protocol conversion to a voice server and receives a recognition result returned by the voice server, the dispatching central controller 123 sends the recognition result to the controller 110, and the controller 110 generates a control instruction according to the recognition result to correspondingly control an electrical device to execute corresponding operation to realize near-field voice control.
Further, in order to improve the transmission efficiency of the voice data, the voice terminal device encapsulates the compressed and encoded voice data into a plurality of request voice packets, and transmits the request voice packets to the scheduling central controller 123 through the first communication device 121 by means of packet streaming. Meanwhile, in order to guarantee the validity of the voice data, closed-loop transmission is adopted for the request voice packet containing valid information. It can be understood that after the voice request packet is sent by the voice terminal device, the next transmission operation must be performed until the response message returned by the central control unit 123 is scheduled. For example, when the request voice packet is transmitted in sequence according to the order of the head request voice packet, the middle request voice packet, and the tail request voice packet, the head request voice packet is transmitted to the dispatching central controller 123 through the first communication device 121, and then the middle request voice packet is transmitted to the dispatching central controller 123 through the first communication device 121 only after the head response information returned by the dispatching central controller 123 according to the head request voice packet is received; similarly, the end request voice packet is transmitted to the dispatching central controller 123 via the first communication device 121 only after receiving the intermediate response information returned by the dispatching central controller 123 according to the intermediate request voice packet. Synchronously, when the response information returned by the dispatching central controller 123 according to the request voice packet is not received, the voice terminal device further performs a corresponding retransmission mechanism to ensure stable and reliable transmission of the data to the dispatching central controller 123 via the first communication device 121.
In one embodiment, the first communication device 121 is a Wifi-P2P communication device. Specifically, the Wifi-P2P (peer-to-peer) communication mode is a peer-to-peer communication technology, and even in an environment without a conventional Wifi network or Wifi Access Point, a tcp/ip link can be directly established between two devices without the participation of an AP (Wireless Access Point). Also called Wi-Fi Direct, compared with wireless communication modules such as Bluetooth and ZigBee, the wireless communication module has the advantages that the transmission speed is faster and can reach 1M/s, and the transmission bandwidth is larger. The Wifi-P2P communication device is used as the first communication device 121 to realize communication with the voice terminal device, so that the phenomena of low transmission efficiency, packet loss or data stream error and the like can be effectively avoided.
Further, the dispatch center controller 123, after receiving the voice request packet, sends the voice request packet to the voice server via the second communication device 122 for voice recognition. When the electrical apparatus 100 is connected to the voice server, after the request voice packet is transmitted, the identification result returned by the voice server according to the request voice packet needs to be received, so that bidirectional communication needs to be performed, and the first communication device 121 cannot be used to perform communication with the voice server. Thus, in one embodiment, the second communication device 122 is a WebSocket communication device. The WebSocket is a protocol for performing full-duplex communication on a single TCP connection, so that data exchange between a client and a server is simpler, and the server is allowed to actively push data to the client. In the WebSocket API, the browser and the server only need to complete one handshake, and persistent connection can be directly established between the browser and the server, and bidirectional data transmission is carried out. Namely, the electrical equipment terminal can request the information of the voice server terminal, and the voice server terminal can also actively push the information to the electrical equipment terminal. On one hand, the voice data of the voice terminal equipment can be uploaded to the voice server through the WebSocket, and the voice server identifies and issues an identification result to the electric equipment terminal. On the other hand, when the control function of the voice data is to realize the timed reminding task, the voice server can actively push the message to the electrical equipment for processing at the timed arrival time, so that the timely arrival of the message is ensured, the situation that whether the message wastes resources or not is avoided when the electrical equipment continuously polls the voice server, and the electrical equipment is reasonably scheduled and processed under the limited resources is ensured.
It can be understood that, since the first communication device 121 and the second communication device 122 use different communication methods, the dispatch central controller 123 further performs protocol conversion before sending the request voice packet received through the first communication device 121 to the voice server through the second communication device 122. The protocol conversion process can be determined according to the actual communication mode. The protocol conversion may be realized by adding a protocol converter between the dispatching central controller 123 and the first communication device 121 and the second communication device 122, or may be realized by a computer program stored in the dispatching central controller 123. In addition, in order to request high-speed transmission of a voice packet, the voice packet is compression-encoded, and in addition to protocol conversion, audio encoding and decoding are required. For example, in this embodiment, the voice data is compressed and encoded by using an AMR (Adaptive Multi-Rate) coding format to obtain the compressed and encoded voice data, and then, an audio coding and decoding process for the AMR coding format needs to be correspondingly added.
Further, after the voice server performs voice recognition on the request voice packet, the corresponding recognition result is returned to the dispatching central controller 123 through the second communication device 122. The central dispatching controller 123 sends the recognition result to the controller 110, and the controller 110 generates a control instruction according to the recognition result to correspondingly control the electrical equipment to execute corresponding operations to realize near-field voice control. The content of the request voice packet is not limited, and may be a voice signal input by a user to control the operation state of the electric appliance, a voice signal input by the user to obtain related information, or a voice signal input by the user to set an event of the electric appliance. For example, "turn up the temperature by 10 degrees", "turn on", "broadcast the weather of today" or "turn on at 10 o' clock", etc. by voice input. Then, after the voice server performs voice recognition on the requested voice packet and returns a recognition result, and the dispatching central controller 123 sends the recognition result to the controller 110, the controller 110 implements corresponding operations on each functional module of the corresponding control electrical equipment.
In addition, when two or more voice terminal devices are connected to the communication module 120 of the electrical apparatus 100 and simultaneously send a request voice packet to the scheduling central controller 123 through the first communication device 121, the scheduling central controller 123 detects the feedback transmission of multiple data streams. In this case, the communication module 120 will execute with the rule of responding to the last requested voice packet. It is to be understood that the request voice packets each include request time information, and the last request voice packet is a starting request voice packet whose request time information is closest to the current time. Then, when the dispatch central controller 123 receives a request voice packet from another voice terminal device while processing the request voice packet from one voice terminal device, it will respond to the last transmitted request voice packet.
According to the electrical equipment, the communication module with the dispatching center control is additionally arranged in the electrical equipment, the first communication device is controlled to communicate with the voice terminal equipment to carry out near-field acquisition of voice data, and then the voice data is transmitted to the second communication device to carry out two-way communication with the voice server, so that analysis and recognition results of the voice data are issued, and near-field voice control of the electrical equipment is realized. The voice acquisition and recognition module is not required to be added in the electrical equipment, the cost is reduced, the controller function of the electrical equipment is not required to be added, the universal rate of the application of the voice interaction control function is improved, and a good control effect is realized by a near-field voice acquisition mode.
In one embodiment, the dispatch central controller is further configured to pair with the voice terminal device through the first communication device according to preset pairing information. Specifically, the voice terminal device and the dispatching central controller communicate with each other through Wifi-P2P, and one-to-one connection is needed. The central controller for dispatching sends out a broadcast signal carrying preset pairing information through the first communication device. The voice terminal equipment can send out a request signal corresponding to preset pairing information according to the broadcast signal to realize pairing with the dispatching central controller. It is understood that the preset pairing information is a key, and the preset pairing information is not unique in form, and may be a number, a letter, or a symbol, as long as pairing between the two can be achieved.
In one embodiment, the dispatch central controller is further configured to send a preset authentication message to the voice server via the second communication device for access authentication.
Specifically, before connection with the voice server is performed, access authentication needs to be performed according to authentication information issued by the voice server. In an embodiment, the preset authentication information at least includes one of authentication information provided by the voice server, a Media Access Control Address (MAC) Address of the electrical device, and Universal Time Coordinated (UTC) Time for Access authentication. The verification information given by the voice server is appid and appkey issued when the central control unit is dispatched to install the application software corresponding to the voice server. The MAC address of the electric appliance may be a MAC address of the scheduling central controller, and may further include an MID address of the scheduling central controller. The preset verification information is signature information generated according to the parameters, and when the central controller carries the signature information to access the voice server, the voice server performs content verification and timeliness judgment on the signature information. It can be understood that the content verification is whether the appid, appkey, MAC address, and the like in the signature information are accurate. And the timeliness is judged by comparing the UTC time in the signature information with the UTC time of the voice server, and when the time difference between the UTC time and the UTC time is greater than a preset time difference threshold value, the request of the dispatching central controller for accessing the voice server is not responded. The value of the preset time difference threshold is not fixed, and can be determined according to the actual response time in communication with the voice server, for example, set to 5 minutes in this embodiment. In the embodiment, the timeliness judgment is carried out on the signature information, so that repeated requests of request information intercepted by other people can be prevented, and the reliability of communication is ensured.
In one embodiment, the dispatch central controller is further configured to encrypt the received request voice packet and send the encrypted request voice packet to the second communication device. Specifically, the Encryption mode for the request voice packet is not exclusive, and may be aes (advanced Encryption standard), MD5, or other modes recognized by those skilled in the art. During the transmission process of the voice data, the encrypted voice data is transmitted, so that the reliability of the voice data transmission during the voice interaction control process is guaranteed to the maximum extent.
In one embodiment, as shown in fig. 1, the communication module 120 further includes a third communication device 124, and the controller 110 is connected to the dispatching center 123 through the third communication device 124.
Specifically, after the voice server performs voice recognition on the request voice packet, the corresponding recognition result is returned to the dispatching central controller 123 through the second communication device 122. The central dispatching controller 123 sends the recognition result to the controller 110 through the third communication device 124, and the controller 110 generates a control instruction according to the recognition result to correspondingly control the electrical equipment to execute corresponding operations, so as to implement near-field voice control. The third communication device 124 is a serial module, which can be connected to and communicate with the controller 110 inside the electrical equipment.
In one embodiment, as shown in fig. 1, the electrical apparatus 100 further includes a voice broadcasting device 130, and the voice broadcasting device 130 is connected to the dispatch center controller 123 through a third communication device 124.
Specifically, controller 123 obtains the recognition result and sends for controller 110 through third communication device 124 in the dispatch, and when controller 110 accomplished corresponding control execution corresponding operation, also feeds back this recognition result to voice broadcast device 130 through the serial module, and voice broadcast device 130 realizes the report of recognition result. Wherein, the process of realizing reporting can be that voice broadcast device 130 is in the storage have the voice broadcast database of predetermineeing, predetermine the voice broadcast database including the corresponding relation between the voice broadcast information of discernment result and operation commonly used, can follow through the discernment result and broadcast the voice broadcast information that finds corresponding in the voice broadcast database of predetermineeing and report. In addition, the process of realizing the broadcast may also be that the voice server has generated corresponding voice broadcast information, and synchronously issues the voice broadcast information to the dispatching central controller 123 in the recognition result, and the dispatching central controller 123 feeds the voice broadcast information in the recognition result back to the voice broadcast device 130 through the serial port module, and the voice broadcast device 130 realizes the broadcast of the recognition result. In this embodiment, the voice broadcasting device 130 is implemented by a speaker unit.
In one embodiment, as shown in fig. 2, there is provided a voice interaction control method, including:
step 202: and receiving a request voice packet transmitted by the voice terminal equipment through the first communication device.
Specifically, the voice terminal device is a voice data acquisition terminal for realizing near-field voice control of the electrical equipment, and the acquired user voice signal is subjected to preprocessing such as filtering and analog-to-digital conversion to obtain an audio signal. And then, a Wifi module is adopted to encode and sub-package compress the preprocessed data, and finally, voice data after compression and encoding is packaged into a plurality of request voice packets through Wifi-P2P communication and is sent to a dispatching central controller through a first communication device in a sub-package streaming transmission mode. Meanwhile, in order to guarantee the validity of the voice data, closed-loop transmission is adopted for the request voice packet containing valid information. And synchronously, when the response information returned by the dispatching central controller according to the request voice packet is not received, the voice terminal equipment also carries out a corresponding retransmission mechanism so as to ensure that the data is stably and reliably transmitted to the dispatching central controller through the first communication device. The first communication device adopts a Wifi-P2P communication device with high transmission speed and large bandwidth, so that the phenomena of low transmission efficiency, packet loss or data stream errors and the like can be effectively avoided.
Step 204: and carrying out protocol conversion on the received request voice packet, sending the request voice packet to the voice server through the second communication device, and receiving an identification result returned by the voice server according to the request voice packet through the second communication device.
Specifically, after receiving the request voice packet, the scheduling central controller sends the request voice packet to the voice server through the second communication device for voice recognition. When the electrical equipment is in butt joint with the voice server, after the request voice packet is transmitted, the identification result returned by the voice server according to the request voice packet needs to be received, bidirectional communication needs to be carried out, and the first communication device cannot be continuously adopted for communication with the voice server. Therefore, the second communication device is a WebSocket communication device.
Step 206: and sending the identification result to the controller so that the controller generates a control instruction according to the identification result and controls and executes corresponding operation according to the control instruction.
Specifically, the content of the request voice packet is not limited, and may be a voice signal input by the user to control the operation state of the electric appliance, a voice signal input by the user to obtain related information, or a voice signal input by the user to set an event of the electric appliance. For example, "turn up the temperature by 10 degrees", "turn on", "broadcast the weather of today" or "turn on at 10 o' clock", etc. by voice input. Then, after the voice server performs voice recognition on the request voice packet and returns a recognition result, and the central dispatching controller sends the recognition result to the controller, the controller correspondingly controls each functional module of the electrical equipment to realize corresponding operation.
It can be understood that the embodiment of the voice interaction control method is implemented based on the embodiment of the one or more electrical devices, and therefore, for specific limitations of the voice interaction control method, reference may be made to the above limitations on the electrical devices, and details are not described herein again.
In one embodiment, as shown in fig. 3, a voice interaction system is provided, which includes a voice terminal device, a voice server, and the above-mentioned electrical device, where the voice terminal device is connected to the electrical device, and the electrical device is connected to the voice server.
Specifically, the voice terminal device is a voice data acquisition terminal for realizing near-field voice control of the electrical equipment, and may be a remote controller corresponding to the electrical equipment, or may be a mobile terminal such as a mobile phone, a tablet computer, and a wearable device, and the voice data acquisition is realized through a virtual application program carried on the mobile terminal. The voice terminal equipment is provided with a near-field voice acquisition device such as a microphone, and then preprocessing such as filtering and analog-to-digital conversion is carried out on the acquired user voice signals to obtain audio signals. And then, encoding and sub-packaging the preprocessed data by adopting a Wifi module, transmitting the streaming voice data packet to electrical equipment in a closed loop mode through Wifi-P2P communication, and assisting a retransmission mechanism to ensure stable and reliable voice data transmission.
The voice server is a voice data processing end for realizing near-field voice control of the electrical equipment, and is provided with corresponding algorithms such as voice recognition, natural language processing and the like, so that the received voice data can be converted into text data, a voice command is obtained by judging according to the text data, and a corresponding signal generated by the judged voice command is returned to the electrical equipment end, so that the near-field voice control of the electrical equipment is realized. It can be understood that the voice server may be implemented by a single server, or may be a group of servers communicating with each other, and respectively implement the functions of voice recognition, text conversion, natural language processing, voice instruction generation, and the like.
Specifically, the electrical equipment receives a request voice packet of the voice terminal equipment through a communication module of the electrical equipment, performs protocol conversion, then sends the request voice packet to the voice server, receives a recognition result returned by the voice server, and finally sends the recognition result to a main control unit of the electrical equipment to realize near-field voice control. As shown in fig. 3, in this embodiment, the communication module of the electrical device is a wifi module unit, and specifically includes functional modules such as P2P communication, WebSocket communication, audio encoding and decoding, serial port management, and scheduling central control.
Fig. 4 shows a flow chart of data transmission of a voice interaction system, a remote controller or an applet (voice terminal device) collects voice and audio information, compresses audio data and transmits the data to an electrical device end through P2P communication, a WiFi module (communication module) of the electrical device end performs central control scheduling processing, transmits the voice data to a voice server through Websocket communication, performs related processing such as voice recognition on the voice server end, and feeds back an instruction to the device WiFi module end for processing, the WiFi module end feeds back processing information to a main control unit (controller) through a serial port to execute corresponding operations, and feeds back a speaker unit (voice broadcasting device) to realize device control broadcasting, thereby realizing near-field voice control of devices.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. An electric appliance device is characterized by comprising a controller and a communication module connected with the controller, wherein the communication module comprises a first communication device, a second communication device and a dispatching central controller, the first communication device is connected with the dispatching central controller, the second communication device is connected with the dispatching central controller, and the dispatching central controller is connected with the controller;
the first communication device is used for receiving a request voice packet transmitted by voice terminal equipment and sending the request voice packet to the dispatching central controller;
the dispatching central controller is used for carrying out protocol conversion on the received request voice packet and then sending the request voice packet to the second communication device;
the second communication device is used for sending the request voice packet after protocol conversion to a voice server and receiving a recognition result returned by the voice server;
the dispatching central controller is also used for sending the identification result to the controller;
and the controller is used for generating a control instruction according to the identification result and correspondingly controlling the electric equipment to execute corresponding operation.
2. The electrical equipment according to claim 1, wherein the dispatch central controller is further configured to pair with the voice terminal device through the first communication device according to preset pairing information.
3. The electrical equipment as recited in claim 2, wherein the first communication device is a Wifi-P2P communication device.
4. The electrical equipment of claim 1, wherein the dispatch central controller is further configured to send a preset authentication message to the voice server via the second communication device for access authentication.
5. The electric appliance according to claim 4, wherein the preset authentication information includes at least one of authentication information given by the voice server, a MAC address of the electric appliance, and UTC time of access authentication.
6. The electrical equipment of claim 4, wherein the second communication device is a WebSocket communication device.
7. The electrical equipment of claim 1, wherein the communication module further comprises a third communication device, and the controller is connected to the dispatch center controller through the third communication device.
8. The electrical equipment of claim 7, further comprising a voice broadcast device, wherein the voice broadcast device is connected to the dispatch center controller through the third communication device.
9. A voice interaction control method is characterized by comprising the following steps:
receiving a request voice packet transmitted by voice terminal equipment through a first communication device;
carrying out protocol conversion on the received request voice packet, sending the request voice packet to a voice server through a second communication device, and receiving an identification result returned by the voice server according to the request voice packet through the second communication device;
and sending the identification result to a controller so that the controller generates a control instruction according to the identification result and controls and executes corresponding operation according to the control instruction.
10. A voice interaction system, comprising a voice terminal device, a voice server and the electrical device of any one of claims 1 to 8, wherein the voice terminal device is connected to the electrical device, and the electrical device is connected to the voice server.
CN202210118112.1A 2022-02-08 2022-02-08 Voice interaction control method and voice interaction system of electrical equipment Pending CN114553623A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210118112.1A CN114553623A (en) 2022-02-08 2022-02-08 Voice interaction control method and voice interaction system of electrical equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210118112.1A CN114553623A (en) 2022-02-08 2022-02-08 Voice interaction control method and voice interaction system of electrical equipment

Publications (1)

Publication Number Publication Date
CN114553623A true CN114553623A (en) 2022-05-27

Family

ID=81672882

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210118112.1A Pending CN114553623A (en) 2022-02-08 2022-02-08 Voice interaction control method and voice interaction system of electrical equipment

Country Status (1)

Country Link
CN (1) CN114553623A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN202663484U (en) * 2012-05-22 2013-01-09 青岛海尔电子有限公司 Voice-controlled television system and television
CN107086037A (en) * 2017-03-17 2017-08-22 上海庆科信息技术有限公司 A kind of voice interactive method of embedded device, device and embedded device
CN107481722A (en) * 2017-08-21 2017-12-15 无锡小天鹅股份有限公司 Sound control method, device for clothing processing and server
EP3392878A1 (en) * 2017-04-21 2018-10-24 LG Electronics Inc. Voice recognition apparatus and voice recognition method
CN109147784A (en) * 2018-09-10 2019-01-04 百度在线网络技术(北京)有限公司 Voice interactive method, equipment and storage medium
CN109285546A (en) * 2018-11-14 2019-01-29 珠海格力电器股份有限公司 Control system, method, storage medium and the electronic device of terminal
CN209266015U (en) * 2018-11-14 2019-08-16 珠海格力电器股份有限公司 The control system of terminal
CN113225838A (en) * 2020-01-21 2021-08-06 青岛海信移动通信技术股份有限公司 Microphone control method, audio receiving equipment and audio collecting equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN202663484U (en) * 2012-05-22 2013-01-09 青岛海尔电子有限公司 Voice-controlled television system and television
CN107086037A (en) * 2017-03-17 2017-08-22 上海庆科信息技术有限公司 A kind of voice interactive method of embedded device, device and embedded device
EP3392878A1 (en) * 2017-04-21 2018-10-24 LG Electronics Inc. Voice recognition apparatus and voice recognition method
CN107481722A (en) * 2017-08-21 2017-12-15 无锡小天鹅股份有限公司 Sound control method, device for clothing processing and server
CN109147784A (en) * 2018-09-10 2019-01-04 百度在线网络技术(北京)有限公司 Voice interactive method, equipment and storage medium
CN109285546A (en) * 2018-11-14 2019-01-29 珠海格力电器股份有限公司 Control system, method, storage medium and the electronic device of terminal
CN209266015U (en) * 2018-11-14 2019-08-16 珠海格力电器股份有限公司 The control system of terminal
CN113225838A (en) * 2020-01-21 2021-08-06 青岛海信移动通信技术股份有限公司 Microphone control method, audio receiving equipment and audio collecting equipment

Similar Documents

Publication Publication Date Title
CN107800621B (en) Novel intelligent home gateway supporting multiple access protocols
AU2021266359A1 (en) Edge computing-based internet of things proxy apparatus and data decision-making method
CN110266794A (en) Method for subscribing and system are acted on behalf of based on MQTT server
CN207558417U (en) Intelligent home control system
CN102130802A (en) Internet of things home appliance management system and management method
CN102739864B (en) Realize method and the wireless telecommunications system of TV wireless earphone function
CN101449525A (en) Method for controlling energy consumption of sensor network nodes and device thereof
CN104301012A (en) Interaction method, terminal and system realized by NFC technique
CN104735274A (en) Learning type universal infrared remote control method based on cloud platform and smart phone
CN109347861B (en) Communication method of real-time dynamic monitoring system
CN103475713A (en) Method and system for remotely monitoring smart home based on fusion communication technology
CN106878933A (en) A kind of bluetooth equipment network-building method and system
CN106603579B (en) The tele-control system and method and its wireless terminal of a kind of wireless terminal
CN102547243A (en) Audio and video remote monitoring method and system based on 3G (the 3rd generation telecommunication) network
CN113938858B (en) Communication method of V2X protocol stack
CN112511388A (en) Method for home Internet of things and Internet of things terminal
CN108430091B (en) Efficient wifi network distribution method and device for smart home
CN109639703A (en) A kind of radio station remote control method and system based on B/S framework
CN102868769A (en) Real-time bi-directional data transmission system of Internet of things and Internet
CN113286002A (en) Power data transmission method based on multi-priority queue
CN106713411B (en) Communication device and method based on Ethernet virtual serial technology and LTE wireless network
CN102611759A (en) Internet of things (IOT) household appliance management system and IOT household appliance management method
CN213279693U (en) Internet of things terminal
KR20020096172A (en) Message transmission method and system capable of transmitting differential data
CN106686011B (en) Multi-view-field interaction method and device based on IMS communication

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20220527

RJ01 Rejection of invention patent application after publication