WO2020083038A1 - Methods and apparatuses for collecting, learning, controlling, and issuing control instructions of a multimedia device, and storage medium

Info

Publication number
WO2020083038A1
WO2020083038A1 PCT/CN2019/110569 CN2019110569W WO2020083038A1 WO 2020083038 A1 WO2020083038 A1 WO 2020083038A1 CN 2019110569 W CN2019110569 W CN 2019110569W WO 2020083038 A1 WO2020083038 A1 WO 2020083038A1
Authority
WO
WIPO (PCT)
Prior art keywords
message
instruction
content
multimedia device
address information
Prior art date
Application number
PCT/CN2019/110569
Other languages
English (en)
French (fr)
Inventor
陶伟成
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司
Publication of WO2020083038A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42221 Transmission circuitry, e.g. infrared [IR] or radio frequency [RF]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G PHYSICS
    • G08 SIGNALLING
    • G08C TRANSMISSION SYSTEMS FOR MEASURED VALUES, CONTROL OR SIMILAR SIGNALS
    • G08C17/00 Arrangements for transmitting signals characterised by the use of a wireless electrical link
    • G08C17/02 Arrangements for transmitting signals characterised by the use of a wireless electrical link using a radio link
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/633 Control signals issued by server directed to the network components or client
    • H04N21/6332 Control signals issued by server directed to the network components or client directed to client
    • G PHYSICS
    • G08 SIGNALLING
    • G08C TRANSMISSION SYSTEMS FOR MEASURED VALUES, CONTROL OR SIMILAR SIGNALS
    • G08C2201/00 Transmission systems of control signals via wireless link
    • G08C2201/30 User interface
    • G08C2201/31 Voice input
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Definitions

  • The present application relates to the field of multimedia device control, and in particular to a method and apparatus for collecting control instructions of a multimedia device, a method and apparatus for learning control instructions of a multimedia device, a method and apparatus for controlling a multimedia device, a method and apparatus for issuing control instructions of a multimedia device, and a non-transitory computer-readable storage medium.
  • Tmall Magic Box is a network set-top box. Through the "Tmall Magic Box", users can watch movies and TV, play various types of games, shop online, and pay utility bills on the TV.
  • This multimedia device moves human-computer interaction from the traditional computer and mobile terminal into the home environment, realizing the concept of the future interconnected digital home.
  • "Tmall Genie" is a new type of artificial intelligence device that can "understand" voice commands and provides functions such as smart home control, voice shopping, mobile phone recharge, takeaway ordering, and audio and music playback, bringing a new human-machine interaction experience.
  • Users can also use "Tmall Genie" to control multimedia devices (for example, "Tmall Magic Box").
  • an intelligent control device (for example, "Tmall Genie")
  • the control device can control the multimedia device.
  • Both the multimedia device and the control device need to establish a connection with a network device (for example, a router) before the control device can establish a connection with the multimedia device.
  • The user must perform at least two manual operations (connecting the multimedia device to the network device, and connecting the control device to the network device) to complete the pairing. Moreover, if the multimedia device is not turned on, the pairing cannot be completed, so the multimedia device cannot be controlled by the control device.
  • In addition, if many devices are connected to the router, the control device often fails to discover the multimedia device, so no connection can be established. Furthermore, because device discovery usually relies on protocols such as UPnP/UDP, some routers refuse to forward such data packets, which also causes the connection to fail.
  • The present application provides a method and apparatus for collecting control instructions of a multimedia device, a method and apparatus for learning control instructions of a multimedia device, a method and apparatus for controlling a multimedia device, a method and apparatus for issuing control instructions of a multimedia device, and a non-transitory computer-readable storage medium.
  • a method for collecting control instructions of a multimedia device including:
  • a method for learning control instructions of a multimedia device including:
  • mapping relationship table represents the mapping relationship between the wireless control instruction for the multimedia device and a preset user voice instruction.
  • a multimedia device control method including:
  • a method for issuing a control instruction of a multimedia device including:
  • mapping relationship table represents the mapping relationship between the preset message content of the wireless control instruction for the multimedia device and the preset user voice instruction.
  • an apparatus for collecting control instructions of a multimedia device including:
  • a monitoring unit configured to monitor wireless control instructions directed to the multimedia device;
  • an extracting unit configured to extract the message content of the wireless control instruction;
  • a sending unit configured to send the message content to the server.
  • a server for learning control instructions of a multimedia device including:
  • a receiving unit configured to receive the message content of the wireless control instruction for the multimedia device;
  • an establishing unit configured to establish an instruction mapping relationship table for the multimedia device according to the message content, wherein the mapping relationship table represents the mapping relationship between the wireless control instruction for the multimedia device and a preset user voice instruction.
  • an apparatus for controlling a multimedia device including:
  • a receiving unit configured to receive a user's voice instruction and to receive message content matching the voice instruction from the server;
  • a sending unit configured to send the voice instruction to the server, and to send the message content to the multimedia device as a control instruction.
  • a server for issuing control instructions of a multimedia device including:
  • a receiving unit configured to receive a voice instruction from the control device;
  • a searching unit configured to search a preset instruction mapping relationship table for message content matching the voice instruction, where the mapping relationship table represents the mapping relationship between the preset message content of the wireless control instruction for the multimedia device and the preset user voice instruction;
  • a sending unit configured to send the found message content to the control device.
  • an apparatus including:
  • a memory configured to store one or more programs;
  • when the one or more programs are executed by the processor, the processor is caused to implement any one of the methods described above.
  • a non-transitory computer-readable storage medium on which a computer program is stored, which, when executed by a processor, causes the processor to implement any one of the methods described above.
  • FIG. 1 shows a flowchart of a method for collecting control instructions of a multimedia device according to an embodiment of the present application
  • FIG. 2 shows an example of a message of the wireless control instruction according to this embodiment
  • FIG. 3 shows a flowchart of monitoring wireless control commands for a multimedia device according to an embodiment of the present application
  • FIG. 4 shows a flowchart of a method for learning a control instruction of a multimedia device according to an embodiment of the present application
  • FIG. 5 shows a flowchart of establishing an instruction mapping relationship table for multimedia devices according to message content according to an embodiment of the present application
  • FIG. 6 shows a flowchart of a control method of a multimedia device according to an embodiment of the present application
  • FIG. 7 shows a flowchart of a method for issuing a control instruction of a multimedia device according to an embodiment of the present application
  • FIG. 8 shows a flowchart of searching for the content of a message matching a voice command in a preset command mapping table according to an embodiment of the present application
  • FIG. 10 shows a schematic diagram of an apparatus for collecting control instructions of a multimedia device according to an embodiment of the present application
  • FIG. 11 shows a schematic diagram of a server for learning control instructions of a multimedia device according to an embodiment of the present application
  • FIG. 12 shows a schematic diagram of a server for learning control instructions of a multimedia device according to another embodiment of the present application.
  • FIG. 13 shows a schematic diagram of an establishing unit according to an embodiment of the present application.
  • FIG. 14 shows a schematic diagram of an apparatus for controlling a multimedia device according to an embodiment of the present application.
  • FIG. 15 shows a schematic diagram of a server for issuing control instructions of a multimedia device according to an embodiment of the present application.
  • The control commands issued by the wireless remote control device of the multimedia device can be used to learn those commands and to establish a mapping relationship between them and the user's voice commands.
  • The control command corresponding to a voice command can then be looked up according to the mapping relationship, and the control device sends that control command to the multimedia device, thereby controlling the multimedia device.
  • When a user issues a voice command, the effect is the same as using a wireless remote control device to control the multimedia device.
  • FIG. 1 shows a flowchart of a method for collecting control instructions of a multimedia device according to an embodiment of the present application.
  • the method 100 may include steps S110 to S130.
  • In step S110, the wireless control command for the multimedia device is monitored.
  • the learning process of the control commands of the multimedia device (for example, "Tmall Magic Box")
  • a wireless remote control device (for example, a Bluetooth remote control)
  • The user can manually press a key (for example, the power-on key) on the wireless remote control device to issue a wireless control command to control the multimedia device.
  • The wireless control command may be received by a control device (for example, "Tmall Genie").
  • In step S120, the message content of the wireless control instruction is extracted.
  • The wireless control command is usually a message in a defined format, and the message content can be extracted by the control device.
  • FIG. 2 shows an example of a message of the wireless control instruction according to this embodiment.
  • the message may include a header and a payload.
  • PDU type (PDU Type)
  • length (Length)
  • In the payload, some core data of the message is recorded, for example, the controller address (Controler address), target address (Dest address), command sequence (Index), etc.
  • The controller address of the message (that is, the source address of the wireless control command) is AdvA.
  • The target address (that is, the address of the device that the wireless control command intends to control) and the command sequence have been omitted and are indicated by *.
  • In step S120, all of the message content in the wireless control instruction may be extracted, or only the important part may be extracted for subsequent learning operations.
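The extraction in step S120 can be illustrated with a minimal Python sketch. The byte layout below is purely an assumption made for illustration: the source names the header fields (PDU Type, Length) and the payload fields (controller address, target address, command sequence) but does not give their sizes or offsets, and the names `MessageContent`, `extract_message_content`, and `important_fields` are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical layout, assumed for illustration only:
#   byte 0       PDU Type
#   byte 1       Length of the payload in bytes
#   bytes 2-7    controller (source) address
#   bytes 8-13   target address
#   bytes 14-    command sequence (Index)
ADDRESS_SIZE = 6


@dataclass(frozen=True)
class MessageContent:
    pdu_type: int
    controller_address: bytes
    target_address: bytes
    command_sequence: bytes


def extract_message_content(raw: bytes) -> MessageContent:
    """Split a raw message into header and payload fields (step S120)."""
    if len(raw) < 2:
        raise ValueError("message is shorter than its header")
    pdu_type, length = raw[0], raw[1]
    payload = raw[2:2 + length]
    if len(payload) != length or length < 2 * ADDRESS_SIZE:
        raise ValueError("payload is truncated")
    return MessageContent(
        pdu_type=pdu_type,
        controller_address=payload[:ADDRESS_SIZE],
        target_address=payload[ADDRESS_SIZE:2 * ADDRESS_SIZE],
        command_sequence=payload[2 * ADDRESS_SIZE:],
    )


def important_fields(content: MessageContent) -> dict:
    """Keep only part of the content for later learning (partial extraction)."""
    return {
        "target_address": content.target_address.hex(),
        "command_sequence": content.command_sequence.hex(),
    }
```

Either the full `MessageContent` or only the reduced dictionary could be uploaded, which mirrors the choice between extracting all of the message content and extracting only the important part.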
  • In step S130, the extracted message content is sent to the server.
  • In this way, extraction of the message of the wireless control instruction of the multimedia device is completed, and the control instruction can then be learned by the server.
  • On the one hand, the above process of collecting control instructions can be completed while the user operates the wireless remote control device of the multimedia device in daily use, without extra time and effort. On the other hand, the user can also deliberately operate the wireless remote control device of the multimedia device in order to collect control instructions, and the collection can be completed without turning on the multimedia device.
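The overall flow of method 100 (steps S110 to S130) can be sketched as a small loop. This is only an illustration under stated assumptions: the radio monitoring and the server upload are replaced by stand-ins, and the names `collect_control_instructions` and `fake_extract` are hypothetical rather than taken from the application.

```python
from typing import Callable, Iterable, List


def collect_control_instructions(
    monitored: Iterable[bytes],
    extract: Callable[[bytes], dict],
    send_to_server: Callable[[dict], None],
) -> int:
    """Run steps S110 to S130 for every monitored wireless control instruction."""
    sent = 0
    for raw_instruction in monitored:        # S110: an instruction was observed
        content = extract(raw_instruction)   # S120: extract the message content
        send_to_server(content)              # S130: upload it for later learning
        sent += 1
    return sent


# Stand-ins so the sketch runs without radio hardware or a real server.
def fake_extract(raw: bytes) -> dict:
    return {"raw_hex": raw.hex()}


uploaded: List[dict] = []
count = collect_control_instructions(
    monitored=[b"\x01\x02", b"\x03"],
    extract=fake_extract,
    send_to_server=uploaded.append,
)
```

Because the loop only consumes whatever instructions are observed, it covers both cases described above: passive collection during daily use and deliberate key presses made only for collection.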
  • FIG. 3 shows a flowchart of monitoring wireless control commands for a multimedia device according to an embodiment of the present application.
  • the above step S110 may include sub-steps S111 and S112.
  • In sub-step S111, the Bluetooth low energy monitoring mode is turned on.
  • In sub-step S112, the wireless control instruction is monitored in the Bluetooth low energy monitoring mode.
  • Bluetooth Low Energy (BLE) mode is based on Bluetooth low energy technology, a low-cost, short-range, interoperable, and robust wireless technology.
  • Monitoring the wireless control commands in Bluetooth low energy mode reduces the energy consumption of the collection process. This matters especially in the first case above, where control commands are collected during the user's daily operation of the wireless remote control device of the multimedia device: the entire collection process may take a long time, and monitoring in Bluetooth low energy mode saves considerable energy.
  • the source address information, target address information, and / or operation intent data of the wireless control instructions may be extracted from the monitored wireless control instructions.
  • the source address information indicates which device the wireless control command comes from
  • the target address information indicates which device the wireless control command intends to control
  • the operation intention data indicates what kind of operation needs to be performed on the controlled multimedia device.
  • the source address information in the monitored wireless control instruction may also be replaced with preset local address information. Therefore, the content of the message sent to the server will display the local address information instead of the address of the wireless remote control device that actually issued the wireless control command. In this way, the server can identify which control device the message content came from, and establish a dedicated mapping table for it.
  • the operation intention data in the wireless control instruction may include voice data.
  • the wireless remote control device not only supports the user's key control commands, but also supports the user's voice control commands.
  • the user can not only control the multimedia device by pressing the button of the wireless remote control device, but also control the multimedia device by issuing a voice command to the wireless remote control device.
  • the operation intention data in the monitored wireless control command may be voice data.
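The source-address replacement described in this embodiment, and the reason given for it (letting the server tell which control device the message content came from), can be sketched as follows. The dictionary keys and the function names `replace_source_address` and `group_by_control_device` are assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def replace_source_address(message_content: dict, local_address: str) -> dict:
    """Return a copy whose source address is the control device's own address."""
    rewritten = dict(message_content)  # leave the monitored content untouched
    rewritten["source_address"] = local_address
    return rewritten


def group_by_control_device(messages: Iterable[dict]) -> Dict[str, List[dict]]:
    """Server-side view: one bucket of message content per control device."""
    buckets: Dict[str, List[dict]] = defaultdict(list)
    for message in messages:
        buckets[message["source_address"]].append(message)
    return dict(buckets)


monitored = {
    "source_address": "remote-control",  # address of the wireless remote
    "target_address": "multimedia-device",
    "operation_intent": "power_on",
}
uploaded = replace_source_address(monitored, local_address="control-device-1")
buckets = group_by_control_device([uploaded])
```

Keeping the original dictionary unchanged is a design choice of this sketch; it means the address of the remote control stays available locally if it is needed later.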
  • FIG. 4 shows a flowchart of a method for learning a control instruction of a multimedia device according to an embodiment of the present application.
  • the method 200 may include steps S210 and S220.
  • In step S210, the message content of the wireless control instruction for the multimedia device is received.
  • After the control device detects the wireless control instruction, it extracts the message content of the wireless control instruction and sends it out.
  • In step S210, this message content is received.
  • In step S220, an instruction mapping relationship table for the multimedia device is established according to the message content.
  • The instruction mapping relationship table represents the mapping relationship between wireless control instructions for multimedia devices and preset user voice instructions.
  • The received message content may include operation intent data, which represents the operation that the wireless control instruction intends to perform on the multimedia device, for example, shutting down or increasing the volume.
  • A user voice instruction pool is preset, which records various user voice instructions and their corresponding operation intents. Using the operation intent as a link, a "message - operation intent - user voice instruction" mapping relationship can therefore be established from the received message content and the preset user voice instructions, thereby achieving control instruction learning.
  • the content of the wireless control instruction issued by the control device may be the entire message of the instruction, or may be part of the content. Therefore, if the message content received in step S210 is the entire message of the wireless control instruction, the method 200 may further include extracting source address information, target address information, and / or operation intent data in the message content. Among them, the source address information indicates which device the wireless control command comes from, the target address information indicates which device the wireless control command intends to control, and the operation intention data indicates what kind of operation needs to be performed on the controlled multimedia device.
  • the method 200 may further include: receiving control device address information, and replacing source address information in the message content with control device address information.
  • the wireless control commands monitored by the control device are usually issued by the wireless remote control device of the multimedia device. Therefore, the source address information in the message content of the wireless control instruction records the address of the wireless remote control device.
  • The purpose of learning the wireless control commands is to enable the control device to use the learned results to control the multimedia device. Therefore, in the method 200, the address information of the control device needs to be known, and the source address in the message is replaced with the address of the control device. In this way, the mapping relationship table corresponding to the control device can be found during the control device's subsequent control process.
  • FIG. 5 shows a flowchart of establishing an instruction mapping relationship table for multimedia devices according to message content according to an embodiment of the present application.
  • the above step S220 may include sub-steps S221 and S222.
  • In sub-step S221, the operation intention of the wireless control instruction is determined according to the message content. Since the message content contains operation intent data, the operation intention of the wireless control instruction can be determined from it.
  • In sub-step S222, according to the determined operation intention, a user voice instruction matching the message content is searched for among the preset user voice instructions.
  • The user voice instruction matching the message content (that is, having the same or a corresponding operation intent) can be found according to the operation intention determined from the message content, thereby establishing the instruction mapping relationship table.
  • The method 100 may further include: adding a preset tag to message content for which no matching user voice instruction is found among the preset user voice instructions.
  • Not every piece of received message content has a matching user voice instruction. For example, there may be no user voice instruction whose operation intention is the same as or corresponds to that of the message content.
  • Preset tags are added to such message content so that users or developers can review it and take follow-up improvement measures accordingly.
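Sub-steps S221 and S222, together with the tagging of unmatched message content, can be sketched in Python. This is a minimal sketch under assumptions: the voice instruction pool is modelled as a dictionary from voice instruction to operation intent, the message content is assumed to carry an `operation_intent` key, and `UNMATCHED_TAG`, `determine_intent`, and `build_mapping_table` are hypothetical names.

```python
from typing import Dict, List, Tuple

UNMATCHED_TAG = "UNMATCHED"  # hypothetical preset tag


def determine_intent(message_content: dict) -> str:
    """S221: read the operation intent carried by the message content."""
    return message_content["operation_intent"]


def build_mapping_table(
    messages: List[dict],
    voice_instruction_pool: Dict[str, str],
) -> Tuple[Dict[str, dict], List[dict]]:
    """S222: link each message to voice instructions via the operation intent."""
    # Invert the pool so that every intent lists its voice instructions.
    intent_to_voice: Dict[str, List[str]] = {}
    for voice, intent in voice_instruction_pool.items():
        intent_to_voice.setdefault(intent, []).append(voice)

    table: Dict[str, dict] = {}
    unmatched: List[dict] = []
    for message in messages:
        intent = determine_intent(message)
        voices = intent_to_voice.get(intent)
        if not voices:
            # No preset voice instruction shares this intent: tag for review.
            unmatched.append({**message, "tag": UNMATCHED_TAG})
            continue
        for voice in voices:
            table[voice] = {"operation_intent": intent, "message": message}
    return table, unmatched


pool = {"turn on": "power_on", "power on": "power_on", "volume up": "volume_up"}
received = [
    {"operation_intent": "power_on", "command_sequence": "01"},
    {"operation_intent": "mute", "command_sequence": "09"},
]
table, unmatched = build_mapping_table(received, pool)
```

Each table entry keeps the message, the intent, and the voice instruction together, which corresponds to the "message - operation intent - user voice instruction" relationship described earlier.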
  • FIG. 6 shows a flowchart of a multimedia device control method according to an embodiment of the present application.
  • the method 300 may include steps S310 to S330.
  • In step S310, the user's voice instruction is received and sent to the server.
  • A voice command can be issued; for example, the user says "turn on".
  • The voice command can be received and sent to the server side.
  • In step S320, the message content matching the voice instruction is received from the server. The server holds a learned instruction mapping relationship table, which records the mapping relationship among the message, the operation intention, and the user's voice command. The message content received from the server may therefore be the message content that the server found in the instruction mapping relationship table as matching the voice instruction.
  • The message content may include source address information, target address information, and operation intent data. The source address information indicates which device the message content comes from, the target address information indicates which device the message content intends to control, and the operation intention data indicates what kind of operation needs to be performed on the controlled multimedia device.
  • In step S330, the message content is sent to the multimedia device as a control instruction.
  • Because the instruction information (for example, the instruction sequence) in the message content is the instruction information of the wireless remote control device of the multimedia device, the multimedia device performs the corresponding operation according to that instruction information when it receives the message content. The user can therefore control the multimedia device by issuing voice commands, with the same control effect as using the multimedia device's wireless remote control device.
  • the content of the message may be sent in the Bluetooth low energy mode.
  • the Bluetooth Low Energy (BLE) mode is based on Bluetooth low energy technology and is a low-cost, short-range, and interoperable robust wireless technology. Sending the message content in Bluetooth low energy mode can reduce the energy consumption of the message sending process.
  • the method 300 may further include: replacing source address information in the message content with preset address information.
  • the preset address information may be the address of the issuing end of the wireless control command monitored during the collection and learning of the control command.
  • The source address information in the message content has been modified to the address of the control device (i.e., the device that collects wireless control instructions, for example, Tmall Genie).
  • the source address information in the message content can be changed back to the address of the wireless remote control device that originally issued the wireless control command, so that the message content is the same as that sent by the wireless remote control device.
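The control-device side of method 300 (steps S310 to S330, plus the optional restoration of the source address) can be sketched as below. The server query and the Bluetooth low energy transmission are replaced by plain callables, and the class name `ControlDevice` and the dictionary keys are assumptions made for illustration only.

```python
from typing import Callable, Optional


class ControlDevice:
    """Hypothetical control device that relays voice instructions (method 300)."""

    def __init__(
        self,
        query_server: Callable[[str], Optional[dict]],
        send_to_multimedia_device: Callable[[dict], None],
        remote_control_address: str,
    ) -> None:
        self._query_server = query_server
        self._send = send_to_multimedia_device
        self._remote_control_address = remote_control_address

    def handle_voice_instruction(self, voice_instruction: str) -> Optional[dict]:
        # S310 and S320: forward the voice instruction, receive matching content.
        content = self._query_server(voice_instruction)
        if content is None:
            return None  # the server found no matching message content
        # Optional step: restore the remote control's address so the message
        # looks the same as one sent by the remote control itself.
        outgoing = dict(content)
        outgoing["source_address"] = self._remote_control_address
        # S330: send the message content to the multimedia device.
        self._send(outgoing)
        return outgoing


# Stand-ins for the server and the radio link.
learned = {
    "turn on": {
        "source_address": "control-device",
        "target_address": "multimedia-device",
        "command_sequence": "01",
    }
}
sent = []
device = ControlDevice(learned.get, sent.append, remote_control_address="remote")
result = device.handle_voice_instruction("turn on")
```

In this sketch the address is restored on the control device; the description also allows the server to perform the same replacement before delivering the message content.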
  • In step S410, a voice instruction is received from the control device.
  • The voice command may be the voice command issued by the user in step S310; after collecting it, the control device sends it to the server.
  • In step S420, message content matching the voice instruction is searched for in the preset instruction mapping relationship table.
  • The mapping relationship table represents the mapping relationship between the preset message content of the wireless control instruction for the multimedia device and the preset user voice instruction. According to the above method 200, the control instruction learning process yields an instruction mapping relationship table that records the mapping relationship among the message, the operation intention, and the user's voice instruction. Therefore, after the voice instruction is received, the message content matching it can be found in the mapping relationship table.
  • In step S430, the found message content is sent to the control device. Therefore, when the user intends to control the multimedia device by issuing a voice command, the control device can forward the voice command to the server side, the server side sends the message content corresponding to the voice command to the control device, and the control device uses that message to control the multimedia device.
  • the content of the message searched in step S420 may include source address information, target address information, and operation intent data.
  • the source address information indicates which device the message content comes from
  • the destination address information indicates which device the message content intends to control
  • the operation intention data indicates what kind of operation needs to be performed on the controlled multimedia device.
  • the method 400 may further include: replacing source address information in the found message content with preset address information.
  • the preset address information may be the address of the issuing end of the wireless control command monitored during the collection and learning of the control command.
  • the source address information in the message content has been modified to the address of the control device (that is, the device that collects wireless control instructions, such as Tmall Genie).
  • The source address information in the message content can be changed back to the address of the wireless remote control device that originally issued the wireless control command, so that the message content is the same as that sent by the wireless remote control device.
  • FIG. 8 shows a flowchart of searching for the content of a message matching a voice instruction in a preset instruction mapping relationship table according to an embodiment of the present application.
  • the above step S420 may include sub-steps S421 and S422.
  • In sub-step S421, the operation intention of the voice instruction is determined.
  • A voice command pool is preset, in which the operation intentions corresponding to different voice commands are recorded. Therefore, the preset voice command pool can be used to determine the operation intention of the received voice command.
  • In sub-step S422, the message content that matches the operation intention is searched for in the preset instruction mapping relationship table.
  • the mapping relationship between the message and the operation intention is recorded in the instruction mapping relationship table. Therefore, the corresponding message can be searched in the mapping relationship table according to the operation intention determined in sub-step S421.
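The two-stage lookup of sub-steps S421 and S422 can be sketched as follows. Both tables are modelled as plain dictionaries, and the text normalisation (trimming and lower-casing the recognised voice instruction) is an assumption of this sketch, not something the application specifies; the function name `find_message_content` is hypothetical.

```python
from typing import Dict, Optional


def find_message_content(
    voice_instruction: str,
    voice_command_pool: Dict[str, str],
    mapping_table: Dict[str, dict],
) -> Optional[dict]:
    """Return the message content matching a voice instruction, if any."""
    # S421: determine the operation intent from the preset voice command pool.
    intent = voice_command_pool.get(voice_instruction.strip().lower())
    if intent is None:
        return None
    # S422: look up the message content recorded for that operation intent.
    return mapping_table.get(intent)


voice_command_pool = {"turn on": "power_on", "volume up": "volume_up"}
mapping_table = {
    "power_on": {"target_address": "multimedia-device", "command_sequence": "01"},
}
found = find_message_content(" Turn On ", voice_command_pool, mapping_table)
missing = find_message_content("volume up", voice_command_pool, mapping_table)
```

Here `missing` is `None` because the intent is known but no message has been learned for it yet, which is distinct from a voice instruction that is absent from the pool altogether, although both cases return the same value in this sketch.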
  • FIG. 9 shows a schematic diagram of actual application scenarios of the embodiments of the present application.
  • in the learning phase, the Tmall Genie 501 (i.e., an example of a control device) can actively listen for control instructions issued by the Bluetooth remote control 502 (i.e., an example of a wireless remote control device), where the Bluetooth remote control 502 is a dedicated remote control for the Tmall Magic Box 503 (i.e., an example of a multimedia device).
  • the user can press the "power on" button to control the Tmall Magic Box 503 to turn on.
  • the Bluetooth remote control 502 will issue a control instruction (in the form of a message) corresponding to the power-on operation, and the Tmall Genie 501 will detect the control instruction. Subsequently, the Tmall Genie 501 may replace the source address recorded in the control instruction message with the address of the Tmall Genie 501 itself, and then upload the message (or part of the content of the message) to the cloud 504 (i.e., the server).
  • the cloud 504 may determine the operation intention of the message as "power on" according to the instruction sequence field (for example, the Index field shown in FIG. 2) in the message.
  • a voice instruction pool is preset in the cloud 504, and the voice instruction pool records the correspondence between the user's voice instructions and operation intentions.
  • the cloud 504 can use the operation intention as the link to establish a mapping relationship between the received message and the user's voice instruction (i.e., the instruction mapping relationship table), thereby completing the learning process.
  • the user 505 may issue a voice instruction to the Tmall Genie 501 to control the Tmall Magic Box 503, for example, "turn on".
  • the Tmall Genie 501 forwards it to the cloud 504.
  • the cloud 504 searches for a message matching the voice instruction in the instruction mapping relationship table established in the above learning phase, and delivers the found message to the Tmall Genie 501.
  • the instruction sequence field in the message records the instruction sequence of the power-on operation.
  • the Tmall Genie 501 can first modify the source address information in the message to the address of the Bluetooth remote control 502. Subsequently, the Tmall Genie 501 may send the message to the Tmall Magic Box 503. Since the instruction sequence field of the message records the instruction sequence of the power-on operation, the Tmall Magic Box 503 can be turned on under the control of the message, so that the voice of the user 505 controls the Tmall Magic Box 503.
  • FIG. 10 shows a schematic diagram of an apparatus for collecting control instructions of a multimedia device according to an embodiment of the present application.
  • the device 600 may include a monitoring unit 610, an extraction unit 620, and a sending unit 630.
  • the monitoring unit 610 is used to monitor wireless control instructions for the multimedia device.
  • the extraction unit 620 is used to extract the message content from the wireless control instruction.
  • the sending unit 630 sends the message content to the server.
  • the monitoring unit 610 is further configured to enable a Bluetooth low energy monitoring mode and monitor the wireless control command in the Bluetooth low energy monitoring mode.
  • the extraction unit 620 is further configured to extract source address information, target address information, and / or operation intention data of the wireless control instruction from the wireless control instruction.
  • the extraction unit 620 is further configured to replace the source address information in the wireless control instruction with preset local address information.
  • the operation intention data includes voice data.
  • FIG. 11 shows a schematic diagram of a server for learning control instructions of a multimedia device according to an embodiment of the present application.
  • the server 700 may include a receiving unit 710 and an establishing unit 720.
  • the receiving unit 710 is configured to receive the message content of the wireless control instruction for the multimedia device.
  • the establishing unit 720 creates an instruction mapping relationship table for the multimedia device according to the message content, where the mapping relationship table represents the mapping relationship between the wireless control instruction for the multimedia device and a preset user voice instruction.
  • FIG. 12 shows a schematic diagram of a server for learning control instructions of a multimedia device according to another embodiment of the present application.
  • in addition to the receiving unit 710 and the establishing unit 720, the server 700 further includes an extraction unit 730.
  • the extraction unit 730 is used to extract source address information, target address information and / or operation intention data from the message content.
  • the receiving unit 710 is also used to receive control device address information.
  • the extraction unit 730 is also used to replace the source address information in the message content with the control device address information.
  • the establishing unit 720 includes a determination subunit 721 and a search subunit 722.
  • the determination subunit 721 determines the operation intention of the wireless control instruction according to the message content.
  • the search subunit 722 searches the preset user voice instructions, according to the operation intention, for the user voice instruction matching the message content.
  • the server 700 further includes an adding unit (not shown in the figure).
  • the adding unit adds a preset label to message content for which no matching user voice instruction exists among the preset user voice instructions.
  • the device 800 may include a receiving unit 810 and a sending unit 820.
  • the receiving unit 810 is configured to receive a user's voice instruction and receive message content matching the voice instruction from the server.
  • the sending unit 820 is configured to send the voice instruction to the server, and send the message content to the multimedia device as a control instruction.
  • the sending unit 820 is configured to send the message content in Bluetooth low energy mode.
  • the message content includes source address information, target address information, and operation intent data.
  • the device 800 further includes a replacement unit (not shown in the figure).
  • the replacement unit is used to replace the source address information in the message content with preset address information.
  • the server 900 may include a receiving unit 910, a searching unit 920, and a sending unit 930.
  • the receiving unit 910 receives voice instructions from the control device.
  • the searching unit 920 searches for message content matching the voice instruction in a preset instruction mapping relationship table, where the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions.
  • the sending unit 930 sends the found message content to the control device.
  • the message content includes source address information, target address information, and operation intent data.
  • the server 900 further includes a replacement unit (not shown in the figure).
  • the replacement unit replaces the source address information in the found message content with preset address information.
  • the searching unit 920 is configured to determine the operation intention of the voice instruction, and search for a message content matching the operation intention in a preset instruction mapping relationship table.
  • the technical solution of the present application may be implemented as a system, a method, or a computer program product. Therefore, the present application can take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, microcode, etc.) or an embodiment combining software and hardware, which can generally be called a "circuit", "module" or "system".
  • the present application may be expressed in the form of a computer program product embedded in any tangible expression medium, the tangible expression medium having computer usable program code embedded in the medium.
  • These computer program instructions may also be stored in a computer-readable medium that can direct a computer or other programmable data processing device to function in a particular manner, so that the instructions stored in the computer-readable medium produce instruction means that implement the functions / actions specified in one or more blocks of the flowchart and / or block diagram.
  • Computer program instructions can also be loaded onto a computer or other programmable data processing device to cause a series of operating steps to be performed on the computer or other programmable device to produce a computer-implemented process, so that the instructions executed on the computer or other programmable device provide a process for implementing the functions / actions specified in one or more blocks of the flowchart and / or block diagram.
  • each block in the flowchart or block diagram may represent a module, section, or part of code that includes one or more executable instructions for implementing a specific logical function.
  • the functions noted in the block may occur out of the order noted in the figures. For example, depending on the functionality involved, two blocks shown in succession may actually be executed at about the same time, or these blocks are sometimes executed in the reverse order.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Selective Calling Equipment (AREA)
  • Telephonic Communication Services (AREA)

Abstract

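The present application provides a method and apparatus for collecting control instructions of a multimedia device, a method and apparatus for learning control instructions of a multimedia device, a method and apparatus for controlling a multimedia device, a method and apparatus for issuing control instructions of a multimedia device, and a non-transitory computer-readable storage medium. The method for collecting control instructions of a multimedia device includes: monitoring a wireless control instruction for the multimedia device; extracting the message content from the wireless control instruction; and sending the message content to a server.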

Description

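Methods and apparatuses for collecting, learning, controlling with, and issuing control instructions of a multimedia device, and storage medium
This application claims priority to Chinese patent application No. 2018112310577, filed on October 22, 2018 and entitled "Methods and apparatuses for collecting, learning, controlling with, and issuing control instructions of a multimedia device, and storage medium", the entire contents of which are incorporated herein by reference.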
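Technical Field
The present application relates to the field of multimedia device control, and in particular to a method and apparatus for collecting control instructions of a multimedia device, a method and apparatus for learning control instructions of a multimedia device, a method and apparatus for controlling a multimedia device, a method and apparatus for issuing control instructions of a multimedia device, and a non-transitory computer-readable storage medium.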
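Background
At present, multimedia devices are used more and more in everyday work, study, and life, and with the development of technologies such as the Internet, mobile Internet, and artificial intelligence, their fields of application keep broadening.
For example, "Tmall Magic Box" is a network set-top box. With it, a user can watch films and television, play various types of games, shop online, and pay water, electricity, and gas bills on a television. Such multimedia devices move human-computer interaction from traditional computers and mobile terminals into the home environment, realizing the concept of the future connected digital home.
Multimedia devices such as "Tmall Magic Box" are usually equipped with a dedicated wireless remote control device for operating them. However, a new type of intelligent control device that can be used to control multimedia devices has now emerged.
For example, "Tmall Genie" is a new type of artificial intelligence device that can "understand" voice instructions and supports functions such as smart home control, voice shopping, mobile phone top-up, food delivery ordering, and audio and music playback, bringing a new human-computer interaction experience. In addition, a user can use "Tmall Genie" to control a multimedia device (for example, "Tmall Magic Box"). Before use, however, the intelligent control device (for example, "Tmall Genie") must be paired with the multimedia device, and only after pairing is complete can the control device control the multimedia device. In the pairing operation, both the multimedia device and the control device must first establish a connection with a network device (for example, a router), and only then can the control device establish a connection with the multimedia device. That is, the user has to perform at least two manual operations (connecting the multimedia device to the network device, and connecting the control device to the network device) to complete pairing. Moreover, if the multimedia device is not powered on, pairing cannot be completed, and the control device therefore cannot control it.
In addition, if many devices are connected to the router, the control device often fails to discover the multimedia device, so no connection can be established. Furthermore, because device discovery usually relies on protocols such as UPNP/UDP, some routers block the forwarding of such packets, which also causes connection establishment to fail.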
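Summary
The present application provides a method and apparatus for collecting control instructions of a multimedia device, a method and apparatus for learning control instructions of a multimedia device, a method and apparatus for controlling a multimedia device, a method and apparatus for issuing control instructions of a multimedia device, and a non-transitory computer-readable storage medium.
According to a first aspect of the present application, a method for collecting control instructions of a multimedia device is provided, including:
monitoring a wireless control instruction for the multimedia device;
extracting the message content from the wireless control instruction;
sending the message content to a server.
According to a second aspect of the present application, a method for learning control instructions of a multimedia device is provided, including:
receiving the message content of a wireless control instruction for the multimedia device;
establishing an instruction mapping relationship table for the multimedia device according to the message content, where the mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions.
According to a third aspect of the present application, a method for controlling a multimedia device is provided, including:
receiving a user's voice instruction and sending the voice instruction to a server;
receiving, from the server, message content matching the voice instruction;
sending the message content to the multimedia device as a control instruction.
According to a fourth aspect of the present application, a method for issuing control instructions of a multimedia device is provided, including:
receiving a voice instruction from a control device;
searching a preset instruction mapping relationship table for message content matching the voice instruction, where the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions;
sending the found message content to the control device.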
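According to a fifth aspect of the present application, an apparatus for collecting control instructions of a multimedia device is provided, including:
a monitoring unit, configured to monitor a wireless control instruction for the multimedia device;
an extraction unit, configured to extract the message content from the wireless control instruction;
a sending unit, which sends the message content to a server.
According to a sixth aspect of the present application, a server for learning control instructions of a multimedia device is provided, including:
a receiving unit, configured to receive the message content of a wireless control instruction for the multimedia device;
an establishing unit, which establishes an instruction mapping relationship table for the multimedia device according to the message content, where the mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions.
According to a seventh aspect of the present application, an apparatus for controlling a multimedia device is provided, including:
a receiving unit, configured to receive a user's voice instruction and to receive, from a server, message content matching the voice instruction;
a sending unit, configured to send the voice instruction to the server and to send the message content to the multimedia device as a control instruction.
According to an eighth aspect of the present application, a server for issuing control instructions of a multimedia device is provided, including:
a receiving unit, which receives a voice instruction from a control device;
a searching unit, which searches a preset instruction mapping relationship table for message content matching the voice instruction, where the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions;
a sending unit, which sends the found message content to the control device.
According to a ninth aspect of the present application, an apparatus is provided, including:
a processor;
a memory, configured to store one or more programs;
where the one or more programs, when executed by the processor, cause the processor to implement any one of the methods described above.
According to a tenth aspect of the present application, a non-transitory computer-readable storage medium is provided, on which a computer program is stored, where the computer program, when executed by a processor, causes the processor to implement any one of the methods described above.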
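Brief Description of the Drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be considered a limitation of the present application. Throughout the drawings, the same reference symbols denote the same components. In the drawings:
FIG. 1 shows a flowchart of a method for collecting control instructions of a multimedia device according to an embodiment of the present application;
FIG. 2 shows an example of a message of a wireless control instruction according to this embodiment;
FIG. 3 shows a flowchart of monitoring a wireless control instruction for a multimedia device according to an embodiment of the present application;
FIG. 4 shows a flowchart of a method for learning control instructions of a multimedia device according to an embodiment of the present application;
FIG. 5 shows a flowchart of establishing an instruction mapping relationship table for a multimedia device according to message content, according to an embodiment of the present application;
FIG. 6 shows a flowchart of a method for controlling a multimedia device according to an embodiment of the present application;
FIG. 7 shows a flowchart of a method for issuing control instructions of a multimedia device according to an embodiment of the present application;
FIG. 8 shows a flowchart of searching a preset instruction mapping relationship table for message content matching a voice instruction, according to an embodiment of the present application;
FIG. 9 shows a schematic diagram of a practical application scenario of the embodiments of the present application;
FIG. 10 shows a schematic diagram of an apparatus for collecting control instructions of a multimedia device according to an embodiment of the present application;
FIG. 11 shows a schematic diagram of a server for learning control instructions of a multimedia device according to an embodiment of the present application;
FIG. 12 shows a schematic diagram of a server for learning control instructions of a multimedia device according to another embodiment of the present application;
FIG. 13 shows a schematic diagram of an establishing unit according to an embodiment of the present application;
FIG. 14 shows a schematic diagram of an apparatus for controlling a multimedia device according to an embodiment of the present application;
FIG. 15 shows a schematic diagram of a server for issuing control instructions of a multimedia device according to an embodiment of the present application.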
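Detailed Description
Embodiments of the present application are described in detail below with reference to the drawings. Note that the following description is merely exemplary and is not intended to limit the present application. In addition, in the following description, the same reference numerals denote the same or similar components in different drawings. Different features in the different embodiments described below may be combined with one another to form other embodiments within the scope of the present application.
To solve the problems arising in the prior art, according to the present application, the control instructions issued by the wireless remote control device of a multimedia device can be learned, and a mapping relationship between those control instructions and user voice instructions can be established. In actual control, when the user issues a voice instruction to the control device, the control instruction corresponding to that voice instruction can be looked up according to the mapping relationship, and the control device issues that control instruction to the multimedia device, thereby controlling the multimedia device. When the user issues a voice instruction, the effect is thus the same as controlling the multimedia device with the wireless remote control device. Several embodiments of the present application are explained in detail below.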
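FIG. 1 shows a flowchart of a method for collecting control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 1, the method 100 may include steps S110 to S130. In step S110, a wireless control instruction for the multimedia device is monitored. In the process of learning the control instructions of a multimedia device (for example, "Tmall Magic Box"), the wireless control instructions for the multimedia device must first be collected. Usually, the wireless remote control device of the multimedia device (for example, a Bluetooth remote control) issues the wireless control instructions. For example, the user can manually press a button on the wireless remote control device (for example, the power button) to issue a wireless control instruction that controls the multimedia device. At this point, the control device (for example, "Tmall Genie") can receive that wireless control instruction.
In step S120, the message content is extracted from the wireless control instruction. A wireless control instruction is usually a message with a certain format, and the control device can extract its content. FIG. 2 shows an example of a message of a wireless control instruction according to this embodiment. As shown in FIG. 2, the message may include a header part (header) and a body part (payload). The header records basic information about the message, for example, the PDU type (PDU Type) and the length (Length). The body records the core data of the message, for example, the controller address (Controler address), the target address (Dest address), and the instruction sequence (Index). In the example shown in FIG. 2, the controller address of the message (i.e., the source address of the wireless control instruction) is AdvA, while the target address (i.e., the address of the device that the wireless control instruction is intended to control) and the instruction sequence are both omitted and represented by asterisks. In step S120, therefore, either the entire message content of the wireless control instruction or only some of its important content can be extracted for use in the subsequent learning operation.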
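The header/payload split described for FIG. 2 can be illustrated with a small parser. This is only a sketch: the byte layout (a 2-byte header whose first byte carries the PDU type in its low four bits and whose second byte is the payload length, followed by two 6-byte addresses and the instruction sequence) is an assumption made for illustration, and the names `ControlMessage`, `parse_message`, and `ADDR_LEN` are hypothetical. The patent does not specify field widths.

```python
from dataclasses import dataclass

ADDR_LEN = 6  # assumed address width in bytes (hypothetical, not from the patent)


@dataclass(frozen=True)
class ControlMessage:
    pdu_type: int               # header: PDU Type
    length: int                 # header: Length of the payload
    controller_address: bytes   # payload: source of the instruction
    dest_address: bytes         # payload: device to be controlled
    index: bytes                # payload: instruction sequence


def parse_message(raw: bytes) -> ControlMessage:
    """Split a raw message into header and payload fields (assumed layout)."""
    if len(raw) < 2:
        raise ValueError("message shorter than the assumed 2-byte header")
    pdu_type = raw[0] & 0x0F
    length = raw[1]
    payload = raw[2:2 + length]
    if len(payload) != length or length < 2 * ADDR_LEN:
        raise ValueError("payload does not match the Length field")
    return ControlMessage(
        pdu_type=pdu_type,
        length=length,
        controller_address=payload[:ADDR_LEN],
        dest_address=payload[ADDR_LEN:2 * ADDR_LEN],
        index=payload[2 * ADDR_LEN:],
    )
```

Extracting "only some of the important content" then amounts to keeping a subset of these fields, for example the two addresses and `index`.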
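Subsequently, in step S130, the extracted message content is sent to the server. This completes the extraction of the message of the wireless control instruction for the multimedia device, after which the server can learn the control instruction. On the one hand, the collection process described above can be completed while the user operates the wireless remote control device of the multimedia device in everyday use, without spending extra time or effort; on the other hand, the user can also deliberately operate the wireless remote control device in order to collect control instructions, in which case the collection can be completed even without turning on the multimedia device.
FIG. 3 shows a flowchart of monitoring a wireless control instruction for a multimedia device according to an embodiment of the present application. As shown in FIG. 3, step S110 above may include sub-steps S111 and S112. In sub-step S111, a Bluetooth low energy monitoring mode is enabled. Subsequently, in sub-step S112, the wireless control instruction is monitored in the Bluetooth low energy monitoring mode. The Bluetooth low energy (BLE) mode is based on Bluetooth low energy technology, a low-cost, short-range, interoperable, and robust wireless technology. Monitoring wireless control instructions in Bluetooth low energy mode reduces the energy consumed by the collection process. In particular, in the first case described above, where control instructions are collected during the user's everyday operation of the wireless remote control device, the whole collection process may take a long time, and monitoring in Bluetooth low energy mode saves a considerable amount of energy.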
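According to an embodiment of the present application, when the message content is extracted from the wireless control instruction in step S120 above, only some of the important content of the message may be extracted for use in the subsequent learning operation. For example, the source address information, the target address information, and/or the operation intention data of the wireless control instruction can be extracted from the monitored wireless control instruction. The source address information indicates which device the wireless control instruction comes from, the target address information indicates which device the wireless control instruction is intended to control, and the operation intention data indicates what operation is to be performed on the controlled multimedia device.
In addition, in step S120 above, the source address information in the monitored wireless control instruction can be replaced with preset local address information. The message content sent to the server then shows the local address information rather than the address of the wireless remote control device that actually issued the wireless control instruction. In this way, the server can identify which control device the message content comes from and establish a dedicated mapping relationship table for it.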
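A minimal sketch of this partial extraction and source address replacement follows, assuming the monitored instruction has already been decoded into a dictionary. The key names (`source_address`, `target_address`, `intent_data`) and the function name `extract_content` are hypothetical and chosen only for illustration.

```python
# Fields kept for the learning operation (hypothetical key names).
KEPT_FIELDS = ("source_address", "target_address", "intent_data")


def extract_content(message: dict, local_address: str) -> dict:
    """Keep only the important fields and stamp the control device's own address.

    The original message is left unmodified; a new dictionary is returned.
    """
    content = {key: message[key] for key in KEPT_FIELDS if key in message}
    # Replace the remote control's address with the preset local address so the
    # server can tell which control device uploaded this content.
    content["source_address"] = local_address
    return content
```

Returning a copy rather than editing in place keeps the remote control's original address available on the control device, which is useful if it has to be restored later before the message is replayed.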
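According to an embodiment of the present application, the operation intention data in the wireless control instruction may include voice data. In some cases, the wireless remote control device supports not only the user's button control instructions but also the user's voice control instructions. That is, the user can control the multimedia device not only by pressing buttons on the wireless remote control device but also by issuing voice instructions to it. In this case, the operation intention data in the monitored wireless control instruction can be voice data.
FIG. 4 shows a flowchart of a method for learning control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 4, the method 200 may include steps S210 and S220. In step S210, the message content of a wireless control instruction for the multimedia device is received. As described above, when the control device detects a wireless control instruction, it extracts and sends out the message content of that instruction; step S210 receives this message content.
Subsequently, in step S220, an instruction mapping relationship table for the multimedia device is established according to the message content. The instruction mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions. The received message content may contain operation intention data, which represents the operation that the wireless control instruction intends to perform on the multimedia device, for example, powering off or raising the volume. In this embodiment, a user voice instruction pool is preset, which records a variety of user voice instructions and their corresponding operation intentions. With the operation intention as the link, a "message - operation intention - user voice instruction" mapping relationship can thus be established from the received message content and the preset user voice instructions, thereby learning the control instructions of the multimedia device.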
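As described above, the message content of the wireless control instruction sent by the control device may be the entire message of the instruction or only part of it. Therefore, if the message content received in step S210 is the entire message of the wireless control instruction, the method 200 may further include extracting the source address information, the target address information, and/or the operation intention data from the message content. The source address information indicates which device the wireless control instruction comes from, the target address information indicates which device the wireless control instruction is intended to control, and the operation intention data indicates what operation is to be performed on the controlled multimedia device.
In addition, the method 200 may further include: receiving control device address information, and replacing the source address information in the message content with the control device address information. As described above, the wireless control instruction detected by the control device is usually issued by the wireless remote control device of the multimedia device. The source address information in the message content of the wireless control instruction therefore records the address of that wireless remote control device. According to the present application, the purpose of learning the wireless control instructions is to enable the control device to subsequently use the learning result to control the multimedia device. The method 200 therefore needs to know the address information of the control device and replace the source address in the message with the address of the control device. Only then can the mapping relationship table corresponding to that control device be found during the control device's subsequent control process.
FIG. 5 shows a flowchart of establishing an instruction mapping relationship table for a multimedia device according to message content, according to an embodiment of the present application. As shown in FIG. 5, step S220 above may include sub-steps S221 and S222. In sub-step S221, the operation intention of the wireless control instruction is determined according to the message content. Because the message content contains operation intention data, the operation intention of the wireless control instruction can be determined from it. Subsequently, in sub-step S222, the user voice instruction matching the message content is searched for among the preset user voice instructions according to the determined operation intention. Because the preset user voice instruction pool records a variety of user voice instructions and their corresponding operation intentions, the user voice instruction matching the message content (having the same or a corresponding operation intention) can be found according to the operation intention determined from the message content, thereby establishing the instruction mapping relationship table.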
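Sub-steps S221 and S222 can be sketched as a small table-building routine. Everything concrete here is an assumption for illustration: the pool contents, the `INDEX_TO_INTENT` lookup used to derive an intention from the instruction sequence, and the dictionary shape of a message content are hypothetical, since the patent does not define how intentions are encoded.

```python
from collections import defaultdict

# Hypothetical preset user voice instruction pool: utterance -> operation intention.
VOICE_POOL = {
    "turn on": "power_on",
    "power on": "power_on",
    "volume up": "volume_up",
}

# Hypothetical decoding of the instruction sequence into an operation intention.
INDEX_TO_INTENT = {0x01: "power_on", 0x02: "volume_up", 0x03: "mute"}


def build_mapping_table(contents: list) -> dict:
    """Map each preset utterance to the message content sharing its intention."""
    utterances_by_intent = defaultdict(list)
    for utterance, intent in VOICE_POOL.items():
        utterances_by_intent[intent].append(utterance)

    table = {}
    for content in contents:
        intent = INDEX_TO_INTENT.get(content["index"])      # sub-step S221
        for utterance in utterances_by_intent.get(intent, []):  # sub-step S222
            table[utterance] = content
    return table
```

In this sketch a message whose intention has no utterance in the pool (here, `mute`) simply produces no table entry.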
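According to an embodiment, the method 200 may further include: adding a preset label to message content for which no matching user voice instruction exists among the preset user voice instructions. In sub-step S222 above, not every received message content has a matching user voice instruction. For example, there may be no user voice instruction whose operation intention is the same as or corresponds to that of the message content. A preset label is therefore added to such message content so that users or developers can review it and propose follow-up improvements accordingly.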
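The labelling of unmatched message content might look like the following sketch. The label value, the `intent` key, and the function name `tag_unmatched` are hypothetical; the patent only states that a preset label is added.

```python
UNMATCHED_TAG = "no_matching_voice_instruction"  # hypothetical preset label


def tag_unmatched(contents: list, voice_intents: set) -> list:
    """Return copies of the contents, labelling those with no matching utterance.

    `voice_intents` is the set of operation intentions covered by the preset
    user voice instructions.
    """
    tagged = []
    for content in contents:
        entry = dict(content)
        if entry.get("intent") not in voice_intents:
            entry["tag"] = UNMATCHED_TAG
        tagged.append(entry)
    return tagged
```

A developer could then filter on `tag` to see which remote control buttons still lack a voice equivalent.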
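FIG. 6 shows a flowchart of a method for controlling a multimedia device according to an embodiment of the present application. As shown in FIG. 6, the method 300 may include steps S310 to S330. In step S310, a user's voice instruction is received and sent to the server. When the user wishes to control the multimedia device, the user can issue a voice instruction, for example, by saying "turn on". The control device of the multimedia device can receive this voice instruction and send it to the server.
In step S320, message content matching the voice instruction is received from the server. Because the server holds an instruction mapping relationship table that has already been learned and that records the mapping relationship among messages, operation intentions, and user voice instructions, the message content received from the server can be the message content that the server found in the instruction mapping relationship table as matching the voice instruction. According to an embodiment, the message content may include source address information, target address information, and operation intention data. The source address information indicates which device the message content comes from, the target address information indicates which device the message content is intended to control, and the operation intention data indicates what operation is to be performed on the controlled multimedia device.
Subsequently, in step S330, the message content is sent to the multimedia device as a control instruction. Because the instruction information in the message content (for example, the instruction sequence) is the instruction information of the multimedia device's wireless remote control device, the multimedia device performs the corresponding operation according to that instruction information when it receives the message content. The user can thus control the multimedia device by issuing voice instructions, with the same effect as using the multimedia device's wireless remote control device.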
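According to an embodiment of the present application, in step S330, the message content can be sent in Bluetooth low energy mode. As described above, the Bluetooth low energy (BLE) mode is based on Bluetooth low energy technology, a low-cost, short-range, interoperable, and robust wireless technology. Sending the message content in Bluetooth low energy mode reduces the energy consumed by message transmission.
According to an embodiment of the present application, the method 300 may further include: replacing the source address information in the message content with preset address information. Here, the preset address information can be the address of the sender of the wireless control instruction that was detected during the collection and learning of the control instructions. Because the source address information in the message content may have been changed during learning to the address of the control device (i.e., the device that collects wireless control instructions, for example, Tmall Genie), in this step, before the message content is sent to the multimedia device, the source address information in the message content can be changed back to the address of the wireless remote control device that originally issued the wireless control instruction, so that the message content is identical to that issued by the wireless remote control device.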
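Steps S310 to S330, together with the optional source address replacement, can be sketched as a single function on the control device. The server round trip and the radio transmission are passed in as callables because the patent does not specify either interface; `query_server`, `send_to_device`, and the `source_address` key are hypothetical names.

```python
from typing import Callable, Optional


def control_multimedia_device(
    voice_instruction: str,
    query_server: Callable[[str], Optional[dict]],
    send_to_device: Callable[[dict], None],
    remote_address: str,
) -> bool:
    """Forward a voice instruction, then replay the matching message content.

    Returns True if a message was sent to the multimedia device.
    """
    # S310 + S320: send the voice instruction and receive the matching content.
    content = query_server(voice_instruction)
    if content is None:
        return False
    # Restore the remote control's address so the message looks exactly like
    # the one the remote control originally issued.
    outgoing = dict(content)
    outgoing["source_address"] = remote_address
    # S330: send the content to the multimedia device as a control instruction.
    send_to_device(outgoing)
    return True
```

In a real control device `send_to_device` would hand the message to a Bluetooth low energy transmitter; in the sketch it can be any function, which also makes the flow easy to test.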
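FIG. 7 shows a flowchart of a method for issuing control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 7, the method 400 may include steps S410 to S430. In step S410, a voice instruction is received from the control device. This voice instruction can be the voice instruction issued by the user in step S310 above, which the control device sends to the server after capturing it.
In step S420, a preset instruction mapping relationship table is searched for message content matching the voice instruction. The mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions. According to the method 200 above, the learning process for control instructions yields an instruction mapping relationship table that records the mapping relationship among messages, operation intentions, and user voice instructions. After a voice instruction is received, the message content matching it can therefore be found in that mapping relationship table.
Subsequently, in step S430, the found message content is sent to the control device. Thus, when the user issues a voice instruction intended to control the multimedia device, the control device can forward the voice instruction to the server; the server finds the message content corresponding to the voice instruction and sends it to the control device, and the control device uses that message to control the multimedia device.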
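According to an embodiment of the present application, the message content searched for in step S420 may include source address information, target address information, and operation intention data. The source address information indicates which device the message content comes from, the target address information indicates which device the message content is intended to control, and the operation intention data indicates what operation is to be performed on the controlled multimedia device.
According to an embodiment of the present application, the method 400 may further include: replacing the source address information in the found message content with preset address information. Here, the preset address information can be the address of the sender of the wireless control instruction that was detected during the collection and learning of the control instructions. Because the source address information in the message content may have been changed during learning to the address of the control device (i.e., the device that collects wireless control instructions, for example, Tmall Genie), in this step, before the found message content is sent to the control device, the source address information in the message content can be changed back to the address of the wireless remote control device that originally issued the wireless control instruction, so that the message content is identical to that issued by the wireless remote control device.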
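FIG. 8 shows a flowchart of searching a preset instruction mapping relationship table for message content matching a voice instruction, according to an embodiment of the present application. As shown in FIG. 8, step S420 above may include sub-steps S421 and S422. In sub-step S421, the operation intention of the voice instruction is determined. A voice instruction pool is preset on the server, recording the operation intentions corresponding to different voice instructions. The preset voice instruction pool can therefore be used to determine the operation intention of the received voice instruction.
Subsequently, in sub-step S422, the preset instruction mapping relationship table is searched for message content matching that operation intention. As described above, the instruction mapping relationship table records the mapping relationship between messages and operation intentions. The corresponding message can therefore be looked up in the mapping relationship table according to the operation intention determined in sub-step S421.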
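The two-stage lookup of sub-steps S421 and S422 can be sketched as two dictionary lookups. Representing the voice instruction as already-recognized text, and the table as a mapping from operation intention to message content, are simplifying assumptions; `find_message` and the sample data are hypothetical.

```python
from typing import Optional


def find_message(voice_instruction: str, voice_pool: dict, mapping_table: dict) -> Optional[dict]:
    """Resolve a voice instruction to message content via its operation intention."""
    # S421: determine the operation intention from the preset voice instruction pool.
    intent = voice_pool.get(voice_instruction.strip().lower())
    if intent is None:
        return None
    # S422: look up the message content recorded for that intention.
    return mapping_table.get(intent)


# Hypothetical sample data for illustration.
voice_pool = {"turn on": "power_on", "volume up": "volume_up"}
mapping_table = {"power_on": {"target_address": "BOX", "index": 0x01}}
```

With this data, `find_message("Turn on", voice_pool, mapping_table)` returns the power-on message, while an instruction that is unknown or has no learned message returns `None`.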
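The methods for collecting, learning, controlling with, and issuing control instructions of a multimedia device according to several embodiments of the present application have been described in detail above. The embodiments are described below by way of example in a practical application scenario.
FIG. 9 shows a schematic diagram of a practical application scenario of the embodiments of the present application. As shown in FIG. 9, in the learning phase, the Tmall Genie 501 (i.e., an example of a control device) can actively listen for control instructions issued by the Bluetooth remote control 502 (i.e., an example of a wireless remote control device), where the Bluetooth remote control 502 is the dedicated remote control of the Tmall Magic Box 503 (i.e., an example of a multimedia device). During everyday operation of the Bluetooth remote control 502, the user can, for example, press the "power on" button to turn on the Tmall Magic Box 503. The Bluetooth remote control 502 then issues a control instruction (in the form of a message) corresponding to the power-on operation, and the Tmall Genie 501 detects that control instruction. Subsequently, the Tmall Genie 501 can replace the source address recorded in the control instruction message with its own address and upload the message (or part of the content of the message) to the cloud 504 (i.e., the server). After receiving the message, the cloud 504 can determine from the instruction sequence field in the message (for example, the Index field shown in FIG. 2) that the operation intention of the message is "power on". A voice instruction pool is preset in the cloud 504, recording the correspondence between the user's voice instructions and operation intentions. With the operation intention as the link, the cloud 504 can establish the mapping relationship between the received message and the user's voice instructions (i.e., the instruction mapping relationship table), thereby completing the learning process.
In the control phase, the user 505 can issue to the Tmall Genie 501 a voice instruction intended to control the Tmall Magic Box 503, for example, "turn on". After receiving the voice instruction, the Tmall Genie 501 forwards it to the cloud 504. After receiving the voice instruction, the cloud 504 searches the instruction mapping relationship table established in the learning phase for the message matching the voice instruction and delivers the found message to the Tmall Genie 501. The instruction sequence field of that message records the instruction sequence of the power-on operation. After receiving the message, the Tmall Genie 501 can first change the source address information in the message to the address of the Bluetooth remote control 502. Subsequently, the Tmall Genie 501 can send the message to the Tmall Magic Box 503. Because the instruction sequence field of the message records the instruction sequence of the power-on operation, the Tmall Magic Box 503 turns on under the control of the message, so that the voice of the user 505 controls the Tmall Magic Box 503.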
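The learning and control phases of this scenario can be simulated end to end in a few lines. This is a toy model under stated assumptions: the classes `Cloud` and `Speaker`, the dictionary message shape, and the `index_to_intent` decoding are all hypothetical, and no real Bluetooth or network traffic is involved.

```python
class Cloud:
    """Stands in for the cloud 504: learns messages and answers lookups."""

    def __init__(self, voice_pool: dict, index_to_intent: dict):
        self.voice_pool = voice_pool            # utterance -> operation intention
        self.index_to_intent = index_to_intent  # instruction sequence -> intention
        self.table = {}                         # operation intention -> message

    def learn(self, message: dict) -> None:
        intent = self.index_to_intent.get(message["index"])
        if intent is not None:
            self.table[intent] = message

    def lookup(self, utterance: str):
        return self.table.get(self.voice_pool.get(utterance))


class Speaker:
    """Stands in for the control device 501."""

    def __init__(self, address: str, cloud: Cloud):
        self.address = address
        self.cloud = cloud
        self.remote_address = None

    def on_overheard(self, message: dict) -> None:
        # Learning phase: remember the remote's address, upload with our own.
        self.remote_address = message["source"]
        self.cloud.learn({**message, "source": self.address})

    def on_voice(self, utterance: str):
        # Control phase: fetch the message and restore the remote's address.
        message = self.cloud.lookup(utterance)
        if message is None:
            return None
        return {**message, "source": self.remote_address}
```

The dictionary returned by `on_voice` is what the control device would transmit to the multimedia device; it equals the message the remote control originally issued, which is the property the scenario relies on.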
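FIG. 10 shows a schematic diagram of an apparatus for collecting control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 10, the apparatus 600 may include a monitoring unit 610, an extraction unit 620, and a sending unit 630. The monitoring unit 610 is configured to monitor a wireless control instruction for the multimedia device. The extraction unit 620 is configured to extract the message content from the wireless control instruction. The sending unit 630 sends the message content to the server.
According to an embodiment, the monitoring unit 610 is further configured to enable a Bluetooth low energy monitoring mode and to monitor the wireless control instruction in the Bluetooth low energy monitoring mode.
According to an embodiment, the extraction unit 620 is further configured to extract, from the wireless control instruction, the source address information, the target address information, and/or the operation intention data of the wireless control instruction.
According to an embodiment, the extraction unit 620 is further configured to replace the source address information in the wireless control instruction with preset local address information.
According to an embodiment, the operation intention data includes voice data.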
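FIG. 11 shows a schematic diagram of a server for learning control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 11, the server 700 may include a receiving unit 710 and an establishing unit 720. The receiving unit 710 is configured to receive the message content of a wireless control instruction for the multimedia device. The establishing unit 720 establishes an instruction mapping relationship table for the multimedia device according to the message content, where the mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions.
FIG. 12 shows a schematic diagram of a server for learning control instructions of a multimedia device according to another embodiment of the present application. As shown in FIG. 12, in addition to the receiving unit 710 and the establishing unit 720, the server 700 further includes an extraction unit 730. The extraction unit 730 is configured to extract the source address information, the target address information, and/or the operation intention data from the message content.
According to an embodiment, the receiving unit 710 is further configured to receive control device address information. The extraction unit 730 is further configured to replace the source address information in the message content with the control device address information.
FIG. 13 shows a schematic diagram of an establishing unit according to an embodiment of the present application. As shown in FIG. 13, the establishing unit 720 includes a determination subunit 721 and a search subunit 722. The determination subunit 721 determines the operation intention of the wireless control instruction according to the message content. The search subunit 722 searches the preset user voice instructions, according to the operation intention, for the user voice instruction matching the message content.
According to an embodiment, the server 700 further includes an adding unit (not shown in the figure). The adding unit adds a preset label to message content for which no matching user voice instruction exists among the preset user voice instructions.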
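FIG. 14 shows a schematic diagram of an apparatus for controlling a multimedia device according to an embodiment of the present application. As shown in FIG. 14, the apparatus 800 may include a receiving unit 810 and a sending unit 820. The receiving unit 810 is configured to receive a user's voice instruction and to receive, from the server, message content matching the voice instruction. The sending unit 820 is configured to send the voice instruction to the server and to send the message content to the multimedia device as a control instruction.
According to an embodiment, the sending unit 820 is configured to send the message content in Bluetooth low energy mode.
According to an embodiment, the message content includes source address information, target address information, and operation intention data.
According to an embodiment, the apparatus 800 further includes a replacement unit (not shown in the figure). The replacement unit is configured to replace the source address information in the message content with preset address information.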
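FIG. 15 shows a schematic diagram of a server for issuing control instructions of a multimedia device according to an embodiment of the present application. As shown in FIG. 15, the server 900 may include a receiving unit 910, a searching unit 920, and a sending unit 930. The receiving unit 910 receives a voice instruction from the control device. The searching unit 920 searches a preset instruction mapping relationship table for message content matching the voice instruction, where the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions. The sending unit 930 sends the found message content to the control device.
According to an embodiment, the message content includes source address information, target address information, and operation intention data.
According to an embodiment, the server 900 further includes a replacement unit (not shown in the figure). The replacement unit replaces the source address information in the found message content with preset address information.
According to an embodiment, the searching unit 920 is configured to determine the operation intention of the voice instruction and to search the preset instruction mapping relationship table for message content matching the operation intention.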
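Those skilled in the art will understand that the technical solution of the present application may be implemented as a system, a method, or a computer program product. The present application can therefore take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, microcode, etc.), or an embodiment combining software and hardware, all of which can generally be called a "circuit", "module", or "system". In addition, the present application can take the form of a computer program product embodied in any tangible medium of expression having computer-usable program code embodied in the medium.
The present application is described with reference to flowcharts and/or block diagrams of methods, apparatuses (systems), and computer program products according to embodiments of the present application. It will be understood that each block of the flowcharts and/or block diagrams, and combinations of blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing apparatus, so that the instructions executed by the processor of the computer or other programmable data processing apparatus create means for implementing the functions/actions specified in one or more blocks of the flowcharts and/or block diagrams.
These computer program instructions can also be stored in a computer-readable medium that can direct a computer or other programmable data processing apparatus to function in a particular manner, so that the instructions stored in the computer-readable medium produce instruction means that implement the functions/actions specified in one or more blocks of the flowcharts and/or block diagrams.
The computer program instructions can also be loaded onto a computer or other programmable data processing apparatus to cause a series of operating steps to be performed on the computer or other programmable apparatus to produce a computer-implemented process, so that the instructions executed on the computer or other programmable apparatus provide a process for implementing the functions/actions specified in one or more blocks of the flowcharts and/or block diagrams.
The flowcharts and block diagrams in the drawings illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to several embodiments of the present application. In this regard, each block in a flowchart or block diagram can represent a module, a segment, or a portion of code that includes one or more executable instructions for implementing a specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the drawings. For example, depending on the functionality involved, two blocks shown in succession may in fact be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order. It should also be noted that each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented by special-purpose hardware-based systems that perform the specified functions or actions, or by combinations of special-purpose hardware and computer instructions.
Although the above description includes many specific arrangements and parameters, it should be noted that these specific arrangements and parameters serve only to illustrate one embodiment of the present application and should not be taken as limiting its scope. Those skilled in the art will understand that various modifications, additions, and substitutions can be made without departing from the scope and spirit of the present application. The scope of the present application should therefore be interpreted on the basis of the claims.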

Claims (24)

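  1. A method for collecting control instructions of a multimedia device, including:
    monitoring a wireless control instruction for the multimedia device;
    extracting the message content from the wireless control instruction;
    sending the message content to a server.
  2. The method of claim 1, wherein monitoring the wireless control instruction for the multimedia device includes:
    enabling a Bluetooth low energy monitoring mode;
    monitoring the wireless control instruction in the Bluetooth low energy monitoring mode.
  3. The method of claim 1, wherein extracting the message content from the wireless control instruction includes:
    extracting source address information of the wireless control instruction from the wireless control instruction;
    extracting target address information of the wireless control instruction from the wireless control instruction; and/or
    extracting operation intention data of the wireless control instruction from the wireless control instruction.
  4. The method of claim 3, wherein extracting the message content from the wireless control instruction further includes:
    replacing the source address information in the wireless control instruction with preset local address information.
  5. The method of claim 3, wherein the operation intention data includes voice data.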
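  6. A method for learning control instructions of a multimedia device, including:
    receiving the message content of a wireless control instruction for the multimedia device;
    establishing an instruction mapping relationship table for the multimedia device according to the message content, wherein the mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions.
  7. The method of claim 6, further including:
    extracting source address information from the message content;
    extracting target address information from the message content; and/or
    extracting operation intention data from the message content.
  8. The method of claim 7, further including:
    receiving control device address information;
    replacing the source address information in the message content with the control device address information.
  9. The method of claim 6, wherein establishing the instruction mapping relationship table for the multimedia device according to the message content includes:
    determining the operation intention of the wireless control instruction according to the message content;
    searching the preset user voice instructions, according to the operation intention, for the user voice instruction matching the message content.
  10. The method of claim 9, further including:
    adding a preset label to message content for which no matching user voice instruction exists among the preset user voice instructions.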
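  11. A method for controlling a multimedia device, including:
    receiving a user's voice instruction and sending the voice instruction to a server;
    receiving, from the server, message content matching the voice instruction;
    sending the message content to the multimedia device as a control instruction.
  12. The method of claim 11, wherein sending the message content to the multimedia device as a control instruction includes:
    sending the message content in Bluetooth low energy mode.
  13. The method of claim 11, wherein the message content includes source address information, target address information, and operation intention data.
  14. The method of claim 13, further including:
    replacing the source address information in the message content with preset address information.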
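  15. A method for issuing control instructions of a multimedia device, including:
    receiving a voice instruction from a control device;
    searching a preset instruction mapping relationship table for message content matching the voice instruction, wherein the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions;
    sending the found message content to the control device.
  16. The method of claim 15, wherein the message content includes source address information, target address information, and operation intention data.
  17. The method of claim 16, further including:
    replacing the source address information in the found message content with preset address information.
  18. The method of claim 15, wherein searching the preset instruction mapping relationship table for message content matching the voice instruction includes:
    determining the operation intention of the voice instruction;
    searching the preset instruction mapping relationship table for message content matching the operation intention.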
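  19. An apparatus for collecting control instructions of a multimedia device, including:
    a monitoring unit, configured to monitor a wireless control instruction for the multimedia device;
    an extraction unit, configured to extract the message content from the wireless control instruction;
    a sending unit, which sends the message content to a server.
  20. A server for learning control instructions of a multimedia device, including:
    a receiving unit, configured to receive the message content of a wireless control instruction for the multimedia device;
    an establishing unit, which establishes an instruction mapping relationship table for the multimedia device according to the message content, wherein the mapping relationship table represents the mapping relationship between wireless control instructions for the multimedia device and preset user voice instructions.
  21. An apparatus for controlling a multimedia device, including:
    a receiving unit, configured to receive a user's voice instruction and to receive, from a server, message content matching the voice instruction;
    a sending unit, configured to send the voice instruction to the server and to send the message content to the multimedia device as a control instruction.
  22. A server for issuing control instructions of a multimedia device, including:
    a receiving unit, which receives a voice instruction from a control device;
    a searching unit, which searches a preset instruction mapping relationship table for message content matching the voice instruction, wherein the mapping relationship table represents the mapping relationship between preset message content of wireless control instructions for the multimedia device and preset user voice instructions;
    a sending unit, which sends the found message content to the control device.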
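  23. An apparatus, including:
    a processor;
    a memory, configured to store one or more programs;
    wherein the one or more programs, when executed by the processor, cause the processor to implement the method of any one of claims 1-18.
  24. A non-transitory computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, causes the processor to implement the method of any one of claims 1-18.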
PCT/CN2019/110569 2018-10-22 2019-10-11 多媒体设备的控制指令的采集、学习、控制、下发方法和装置以及存储介质 WO2020083038A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811231057.7 2018-10-22
CN201811231057.7A CN111083543B (zh) 2018-10-22 2018-10-22 多媒体设备的控制指令的采集、学习、控制、下发方法和装置以及存储介质

Publications (1)

Publication Number Publication Date
WO2020083038A1 true WO2020083038A1 (zh) 2020-04-30

Family

ID=70309822

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/110569 WO2020083038A1 (zh) 2018-10-22 2019-10-11 多媒体设备的控制指令的采集、学习、控制、下发方法和装置以及存储介质

Country Status (3)

Country Link
CN (1) CN111083543B (zh)
TW (1) TW202016736A (zh)
WO (1) WO2020083038A1 (zh)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201177871Y (zh) * 2007-11-26 2009-01-07 厉天福 一种手持语音遥控装置
CN201927172U (zh) * 2010-12-21 2011-08-10 上海盛淘智能科技有限公司 学习型语音控制红外遥控器
CN103197571A (zh) * 2013-03-15 2013-07-10 张春鹏 一种控制方法及装置、系统
CN203385140U (zh) * 2012-10-19 2014-01-08 北京时代凌宇科技有限公司 空调控制系统
CN103899963A (zh) * 2014-04-30 2014-07-02 习小猛 多功能一体化智能电灯
CN107393290A (zh) * 2016-05-17 2017-11-24 上海后界信息科技发展有限公司 一种基于云计算语音识别控制红外设备的系统及方法
CN108154668A (zh) * 2017-12-28 2018-06-12 江苏惠通集团有限责任公司 遥控装置、遥控方法及遥控系统
CN207882679U (zh) * 2017-08-18 2018-09-18 安徽鑫美芝光电科技有限公司 一种智能手表

Also Published As

Publication number Publication date
TW202016736A (zh) 2020-05-01
CN111083543A (zh) 2020-04-28
CN111083543B (zh) 2021-09-28

Similar Documents

Publication Publication Date Title
US10123066B2 (en) Media playback method, apparatus, and system
CN105308673B (zh) 用于管理hdmi源的输出的方法、系统和介质
US8782150B2 (en) Method and apparatus for enabling device communication and control using XMPP
US10419822B2 (en) Method, device, and system for switching at a mobile terminal of a smart television and acquiring information at a television terminal
WO2016197866A1 (zh) 网络唤醒方法、远程服务器和网络交换设备
US8954641B2 (en) Method and apparatus for establishing communication
EP3139376B1 (en) Voice recognition method and device
TWI512489B (zh) Multi-screen interactive method, center equipment, terminal equipment and systems
US10944829B2 (en) Methods, systems, and devices for multiplexing service information from sensor data
US20200014556A1 (en) Methods, systems, and devices for managing client devices using a virtual anchor manager
US10938595B2 (en) Device control system, device control method, and non-transitory computer readable storage medium
WO2016165584A1 (zh) 一种终端之间的通信方法和装置
WO2021129262A1 (zh) 用于主动发起对话的服务端处理方法及服务器、能够主动发起对话的语音交互系统
US10591999B2 (en) Hand gesture recognition method, device, system, and computer storage medium
CN104635501A (zh) 智能家居控制方法和系统
WO2017096851A1 (zh) 一种推送视频文件的方法、系统和服务器
CN113741762A (zh) 一种多媒体播放方法、装置、电子设备和存储介质
CN104683865A (zh) 一种arc通道设置方法及设备
US20200213653A1 (en) Automatic input selection
CN114067798A (zh) 一种服务器、智能设备及智能语音控制方法
WO2017059768A1 (zh) 物联网中媒体数据分享的方法、系统、终端和存储介质
CN115473876A (zh) 实时流媒体数据的传输方法、装置、系统和存储介质
CN109787900B (zh) 传输方法、装置、设备和机器可读介质
WO2016062191A1 (zh) 信息发布方法、信息接收方法、装置及信息共享系统
WO2020083038A1 (zh) 多媒体设备的控制指令的采集、学习、控制、下发方法和装置以及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19875994

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19875994

Country of ref document: EP

Kind code of ref document: A1