CN112735406A - Device control method and apparatus, storage medium, and electronic apparatus - Google Patents

Device control method and apparatus, storage medium, and electronic apparatus Download PDF

Info

Publication number
CN112735406A
CN112735406A CN202011548326.XA CN202011548326A CN112735406A CN 112735406 A CN112735406 A CN 112735406A CN 202011548326 A CN202011548326 A CN 202011548326A CN 112735406 A CN112735406 A CN 112735406A
Authority
CN
China
Prior art keywords
target
voice
function
equipment
semantics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011548326.XA
Other languages
Chinese (zh)
Other versions
CN112735406B (en
Inventor
关晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Original Assignee
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Technology Co Ltd, Haier Smart Home Co Ltd filed Critical Qingdao Haier Technology Co Ltd
Priority to CN202011548326.XA priority Critical patent/CN112735406B/en
Publication of CN112735406A publication Critical patent/CN112735406A/en
Application granted granted Critical
Publication of CN112735406B publication Critical patent/CN112735406B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Selective Calling Equipment (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a method and a device for controlling equipment, a storage medium and an electronic device, wherein the method comprises the following steps: acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function; acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment; the target equipment is controlled to execute the target function, namely, the equipment functions corresponding to different semantics of different equipment are confirmed by establishing the equipment function set, so that the problems that in the related technology, the semantics and the functions of each equipment need to be periodically tested around the equipment function and the like are solved, the voice control debugging of the equipment is greatly facilitated, the research and development period of the voice control of the equipment is shortened, and the error rate of the voice control is reduced.

Description

Device control method and apparatus, storage medium, and electronic apparatus
Technical Field
The present invention relates to the field of communications, and in particular, to a method and an apparatus for controlling a device, a storage medium, and an electronic apparatus.
Background
In recent years, with the development of intelligent household appliances, the number of intelligent devices is increased, and the voice control of the devices is more and more common. And performing voice control test on the functions of the equipment of the model one by one, and verifying whether the equipment is controllable. Where establishing semantics and device instruction relationships is labor intensive and error prone. Occupying a large amount of research and development testing time. In addition, each time a product is accessed, a corresponding relation is established around the functions and semantics of the product, the configuration development and test time period is long, and errors are easy to occur in manual configuration.
Aiming at the problems that in the related art, each device needs to perform semantic and function periodic tests around the functions of the device, and the like, an effective solution is not provided yet.
Disclosure of Invention
The embodiment of the invention provides a device control method and device, a storage medium and an electronic device, which are used for at least solving the problems that in the related art, each device needs to perform semantic and function periodic test around the function of the device and the like.
According to an aspect of an embodiment of the present invention, there is provided a control method of a device, including: acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function; acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment; controlling the target device to execute the target function.
In one exemplary embodiment, obtaining target semantics of a target object to speech input by a speech device comprises: collecting voice which is sent by a target object and used for controlling the target device through the voice device; carrying out voice recognition conversion on the voice to obtain a voice text; and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
In an exemplary embodiment, determining a target function corresponding to the target semantic in the device function set under the target device type includes: acquiring a device function set of the target device; searching a preset semantic identifier corresponding to the target semantic from the device function set of the target device; and determining the function corresponding to the preset semantic identifier as the target function.
In one exemplary embodiment, after controlling the target device to execute the target function, the method further includes: acquiring an operation result of the target equipment; and sending the operation result to a voice device, and performing voice feedback on the target object through the voice device.
In an exemplary embodiment, before obtaining the plurality of devices and the device functionality of the plurality of devices, the method further comprises: determining a standard format of the device and a function set of the device; and storing the obtained multiple devices and the device function sets of the multiple devices according to the standard format.
In an exemplary embodiment, extracting target semantics in the speech text according to a preset corpus generalization recognition condition includes: acquiring a voice rule of a target object in voice equipment, wherein the voice rule is used for indicating the characteristics of voice of the target object; determining a target recognition condition from a preset corpus generalization recognition condition according to the voice rule; and extracting target semantics in the voice text according to the target recognition conditions.
According to another aspect of the embodiments of the present invention, there is also provided a control apparatus of a device, including: the device comprises an acquisition module, a storage module and a processing module, wherein the acquisition module is used for acquiring multiple devices and device function sets of the multiple devices, and the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function; the determining module is used for acquiring target semantics of voice input by a voice device through a target object, determining target devices corresponding to the target semantics, and determining target functions corresponding to the target semantics in a device function set under the type of the target devices; and the control module is used for controlling the target equipment to execute the target function.
In an exemplary embodiment, the determining module is further configured to collect, by the speech device, a speech that is emitted by a target object and is used for controlling the target device; carrying out voice recognition conversion on the voice to obtain a voice text; and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
In an exemplary embodiment, the determining module is further configured to obtain a device function set of the target device; searching a preset semantic identifier corresponding to the target semantic from the device function set of the target device; and determining the function corresponding to the preset semantic identifier as the target function.
In an exemplary embodiment, the apparatus further includes: the feedback module is used for acquiring the operation result of the target equipment; and sending the operation result to a voice device, and performing voice feedback on the target object through the voice device.
In an exemplary embodiment, the apparatus further includes: the storage module is used for determining the standard format of the equipment and the function set of the equipment; and storing the obtained multiple devices and the device function sets of the multiple devices according to the standard format.
In an exemplary embodiment, the determining module is further configured to obtain a voice rule of a target object in a voice device, where the voice rule is used to indicate a feature of a spoken voice of the target object; determining a target recognition condition from a preset corpus generalization recognition condition according to the voice rule; and extracting target semantics in the voice text according to the target recognition conditions.
According to still another aspect of the embodiments of the present invention, there is also provided a computer-readable storage medium having a computer program stored therein, wherein the computer program is configured to execute the control method of the above apparatus when running.
According to another aspect of the embodiments of the present invention, there is also provided an electronic apparatus, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the control method of the above apparatus through the computer program.
In the embodiment of the invention, multiple devices and device function sets of the multiple devices are obtained, wherein the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function; acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment; the target equipment is controlled to execute the target function, namely, the equipment functions corresponding to different semantics of different equipment are confirmed by establishing the equipment function set, so that the problems that in the related technology, the semantics and the functions of each equipment need to be periodically tested around the equipment function and the like are solved, the voice control debugging of the equipment is greatly facilitated, the manual configuration is avoided, the research and development period of the voice control of the equipment is shortened, the error rate of the voice control is reduced, and the experience feeling of a user is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
fig. 1 is a block diagram of a hardware configuration of a voice platform of a control method of an apparatus according to an embodiment of the present invention;
fig. 2 is a flowchart of a control method of an apparatus according to an embodiment of the present invention;
FIG. 3 is a diagram of a device establishing a primary function to semantic association according to an alternate embodiment of the present invention;
FIG. 4 is a schematic diagram of a device selecting a function from a standard function library in accordance with an alternative embodiment of the present invention;
FIG. 5 is a schematic diagram of a development flow of a device in a development phase according to an alternative embodiment of the invention;
FIG. 6 is a schematic diagram of a device functionality set according to an alternative embodiment of the present invention;
FIG. 7 is a flow diagram of a device voice control according to an alternative embodiment of the present invention;
fig. 8 is a block diagram of a control device of an apparatus according to an embodiment of the present invention.
Detailed Description
Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings in conjunction with the embodiments.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order.
The method provided by the embodiment of the application can be executed in a computer terminal, a voice platform or a similar computing device. Taking the example of operating on a voice platform, fig. 1 is a block diagram of a hardware structure of a voice platform of a device control method according to an embodiment of the present invention. As shown in fig. 1, the voice platform may include one or more (only one shown in fig. 1) processors 102 (the processors 102 may include, but are not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, which may also include a transmission device 106 for communication functions and an input-output device 108 in an exemplary embodiment. It will be understood by those skilled in the art that the structure shown in fig. 1 is only an illustration, and is not intended to limit the structure of the voice platform. For example, the voice platform may also include more or fewer components than shown in FIG. 1, or have a different configuration with equivalent functionality to that shown in FIG. 1 or with more functionality than that shown in FIG. 1.
The memory 104 may be used for storing computer programs, for example, software programs and modules of application software, such as computer programs corresponding to the control method of the device in the embodiment of the present invention, and the processor 102 executes various functional applications and data processing by running the computer programs stored in the memory 104, so as to implement the method described above. The memory 104 may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memory located remotely from the processor 102, which may be connected to the voice platform over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission device 106 is used for receiving or transmitting data via a network. Specific examples of the network described above may include a wireless network provided by a communication provider of a voice platform. In one example, the transmission device 106 includes a Network adapter (NIC), which can be connected to other Network devices through a base station so as to communicate with the internet. In one example, the transmission device 106 may be a Radio Frequency (RF) module, which is used for communicating with the internet in a wireless manner.
In the present embodiment, a control method of a device is provided, and fig. 2 is a flowchart of the control method of the device according to the embodiment of the present invention, where the flowchart includes the following steps:
step S202, acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions of the same device and preset semantic identifiers of each function;
step S204, acquiring target semantics of voice input by a voice device from a target object, determining a target device corresponding to the target semantics, and determining a target function corresponding to the target semantics in a device function set under the type of the target device;
step S206, controlling the target device to execute the target function.
Through the steps, multiple devices and device function sets of the multiple devices are obtained, wherein the device function sets are used for indicating all functions of the same device and preset semantic identifiers of each function; acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment; the target equipment is controlled to execute the target function, namely, the equipment functions corresponding to different semantics of different equipment are confirmed by establishing the equipment function set, so that the problems that in the related technology, the semantics and the functions of each equipment need to be periodically tested around the equipment function and the like are solved, the voice control debugging of the equipment is greatly facilitated, the manual configuration is avoided, the research and development period of the voice control of the equipment is shortened, the error rate of the voice control is reduced, and the experience feeling of a user is improved.
In one exemplary embodiment, obtaining target semantics of a target object to speech input by a speech device comprises: collecting voice which is sent by a target object and used for controlling the target device through the voice device; carrying out voice recognition conversion on the voice to obtain a voice text; and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
It can be understood that, in order to facilitate the recognition of the semantics in the voice sent by the target object by the voice device, after the voice of the control device is acquired, the voice is correspondingly converted into a voice text, and then the target semantics in the voice text is extracted according to the preset corpus generalization recognition condition.
In an exemplary embodiment, determining a target function corresponding to the target semantic in the device function set under the target device type includes: acquiring a device function set of the target device; searching a preset semantic identifier corresponding to the target semantic from the device function set of the target device; and determining the function corresponding to the preset semantic identifier as the target function.
Namely, a device function set is built by using a standard function modeling and taking the device type as a dimension, each function corresponds to a semantic identifier ID, a preset semantic identifier corresponding to a target semantic is searched for in the built device function set according to the target semantic in the voice, and the corresponding target function to be executed is determined through the semantic identifier.
In one exemplary embodiment, after controlling the target device to execute the target function, the method further includes: acquiring an operation result of the target equipment; and sending the operation result to a voice device, and performing voice feedback on the target object through the voice device.
Because the target object is controlled by the voice, in order to enable the target object to quickly determine the control result of the voice, after the target device executes the target function corresponding to the voice, the operation result is fed back to the voice device, so that the voice device can generate corresponding feedback voice according to the operation result and perform voice broadcast to the target object.
In an exemplary embodiment, before obtaining the plurality of devices and the device functionality of the plurality of devices, the method further comprises: determining a standard format of the device and a function set of the device; and storing the obtained multiple devices and the device function sets of the multiple devices according to the standard format.
That is to say, in order to ensure the accuracy of the correspondence between the devices and the device function sets, the multiple devices and the device function sets of the multiple devices are uniformly stored after being in one-to-one correspondence according to the standard format, and it should be noted that the standard format is an optimal storage format predetermined according to the type of the function set, and may also be a standard format corresponding to an optimal search mode determined according to the personalized semantics of the target object.
In an exemplary embodiment, extracting target semantics in the speech text according to a preset corpus generalization recognition condition includes: acquiring a voice rule of a target object in voice equipment, wherein the voice rule is used for indicating the characteristics of voice of the target object; determining a target recognition condition from a preset corpus generalization recognition condition according to the voice rule; and extracting target semantics in the voice text according to the target recognition conditions.
In order to better understand the technical solutions of the embodiments and the alternative embodiments of the present invention, the following explains the flow of the control method of the above-described device with reference to examples, but is not limited to the technical solutions of the embodiments of the present invention.
As an alternative embodiment, the device accesses the voice assistant, and supports the voice control flow as follows:
step one, according to the type of the device and the corresponding relationship between the function configuration function and the semantic meaning of the device, each device needs to establish a primary association relationship between the function and the semantic meaning, and the function identifiers corresponding to the same function are different, as shown in fig. 3.
Step two, testing the voice control equipment in an acceptance environment;
and step three, the voice control function of the equipment is on line after the verification is reliable.
In an alternative embodiment of the present invention, a development-free voice control system for an intelligent device is provided, where the development-free voice control system for an intelligent device includes: basic data development, equipment development and a voice control system, wherein the basic data development is the basis of function definition and development on a standard open platform; the voice control system comprises an NLP (Natural Language Processing, NLP for short), an AI control system, a domain model, a device control service and a gateway 5 large system, wherein the voice control system carries out semantic and function identification through function configuration data in a consumer device development system, the information conversion of the function identification, a communication protocol and the like finally realizes a complete voice control closed loop which passes through the voice control device and returns a control result, further, the device function standard and the semantic standard are unified through establishing the association relationship between the functions of the device standard model and the semantics, the AI platform carries out one-time semantic development on the semantic identification generated by the functions of the standard model, all devices defined by the standard function model can be supported, independent voice control debugging is not needed, manual configuration is avoided, the research and development period is shortened, and the error rate is reduced.
Optionally, the development-free voice control system for intelligent devices in the optional embodiment of the present invention is based on a standard modeling system, and the identifiers of the unified device function models, that is, the identifiers of the same function under the same type of devices are the same, for example: the system identification mark of the temperature change chamber display temperature function of all refrigerators is 'vtRoomTemperature', and the corresponding semantic mark is '11 e166e3cd8111eabe5d801844e4b1 cf'. And in the equipment development stage, the standard model is used for defining the equipment functions and carrying out bottom plate development. The artificial intelligence department develops the voice and generalizes the linguistic data.
Fig. 4 is a schematic diagram of an apparatus according to an alternative embodiment of the present invention selecting a function from a standard function library, so as to ensure that the same function identifier is the same.
Optionally, as shown in fig. 5, in the development stage, the device functions are selected from the standard model library according to the function design, and the backplane is developed according to the protocol, so that the voice control function does not need to be separately developed, and the device can be directly tested. Corresponding semantics are developed according to the standard model function, the semantics are merged into a standard modeling system, and all intelligent equipment using the standard model definition function can be directly accessed into a voice assistant without development and test, so that the intelligent sound box control is supported. The development workload of the equipment for accessing the voice is avoided.
Alternatively, as shown in fig. 6, the basic data development system is a standard functional modeling system, i.e., a standard semantic development system. The standard functional modeling takes the type of equipment as a dimension to build an equipment function set, each function corresponds to a semantic ID, and an NLP system generalizes linguistic data according to functional semantics.
Optionally, the device development defines device functions according to standard models of all devices on the platform, and the development bottom plate can ensure that the devices with the same type correspond to the same function and the system recognizable identifiers with the same function are the same, so that a foundation is laid for standardized voice control.
For example, a user sends an instruction for controlling equipment through an intelligent sound box or a voice assistant, an NLP system converts voice into Chinese characters through an ASR (Automatic Speech Recognition) technology and analyzes the intention of the user, i.e., semantics, the NLP system determines the equipment to be controlled by the user and the instruction and sends the information to an equipment control service of the AI through multiple conversations, the AI equipment control service sends an equipment ID, i.e., control information, to a domain model service of a cloud platform, the domain model provides a mapping relation between the equipment instruction and the semantics through an equipment development platform, a function model document of equipment model dimensions converts the semantics into a function identifier which can be recognized by the equipment, and the instruction is issued to the equipment through the equipment control service. The device receives the instruction and then uploads the execution result to the cloud platform gateway, the gateway feeds the execution result back to the domain model, the domain model then sends the device execution result and error code information to the AI device control service, the AI device control service analyzes the execution result, produces a response file, sends the file to the NLP, converts the characters into voice and feeds the voice back to the user through the sound box or the voice assistant to form a control closed loop, and fig. 7 is a schematic flow diagram of device voice control according to an optional embodiment of the invention.
Through the embodiment, the device side is free from development and testing through semantic development and one-time semantic development, and voice control can be supported. Greatly shortens the development period of the equipment and saves the labor and time cost.
Through the above description of the embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but the former is a better implementation mode in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
In this embodiment, a control device of an apparatus is further provided, where the control device is used to implement the foregoing embodiments and preferred embodiments, and details are not repeated for what has been described. As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function. Although the means described in the embodiments below are preferably implemented in software, an implementation in hardware, or a combination of software and hardware is also possible and contemplated.
Fig. 8 is a block diagram of a control apparatus of a device according to an embodiment of the present invention, as shown in fig. 8, the apparatus including:
an obtaining module 82, configured to obtain multiple devices and device function sets of the multiple devices, where the device function set is used to indicate all functions of the same device and preset semantic identifiers of each function;
a determining module 84, configured to obtain a target semantic of a voice input by a target object to a voice device, determine a target device corresponding to the target semantic, and determine a target function corresponding to the target semantic in a device function set of the target device type;
a control module 86, configured to control the target device to execute the target function.
According to the technical scheme, multiple devices and device function sets of the multiple devices are obtained, wherein the device function sets are used for indicating all functions of the same device and preset semantic identifiers of each function; acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment; the target equipment is controlled to execute the target function, namely, the equipment functions corresponding to different semantics of different equipment are confirmed by establishing the equipment function set, so that the problems that in the related technology, the semantics and the functions of each equipment need to be periodically tested around the equipment function and the like are solved, the voice control debugging of the equipment is greatly facilitated, the manual configuration is avoided, the research and development period of the voice control of the equipment is shortened, the error rate of the voice control is reduced, and the experience feeling of a user is improved.
In an exemplary embodiment, the determining module is further configured to collect, by the speech device, a speech that is emitted by a target object and is used for controlling the target device; carrying out voice recognition conversion on the voice to obtain a voice text; and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
It can be understood that, in order to facilitate the recognition of the semantics in the voice sent by the target object by the voice device, after the voice of the control device is acquired, the voice is correspondingly converted into a voice text, and then the target semantics in the voice text is extracted according to the preset corpus generalization recognition condition.
In an exemplary embodiment, the determining module is further configured to obtain a device function set of the target device; searching a preset semantic identifier corresponding to the target semantic from the device function set of the target device; and determining the function corresponding to the preset semantic identifier as the target function.
Namely, a device function set is built by using a standard function modeling and taking the device type as a dimension, each function corresponds to a semantic identifier ID, a preset semantic identifier corresponding to a target semantic is searched for in the built device function set according to the target semantic in the voice, and the corresponding target function to be executed is determined through the semantic identifier.
In an exemplary embodiment, the apparatus further includes: the feedback module is used for acquiring the operation result of the target equipment; and sending the operation result to a voice device, and performing voice feedback on the target object through the voice device. Because the target object is controlled by the voice, in order to enable the target object to quickly determine the control result of the voice, after the target device executes the target function corresponding to the voice, the operation result is fed back to the voice device, so that the voice device can generate corresponding feedback voice according to the operation result and perform voice broadcast to the target object.
In an exemplary embodiment, the apparatus further includes: the storage module is used for determining the standard format of the equipment and the function set of the equipment; and storing the obtained multiple devices and the device function sets of the multiple devices according to the standard format. That is to say, in order to ensure the accuracy of the correspondence between the devices and the device function sets, the multiple devices and the device function sets of the multiple devices are uniformly stored after being in one-to-one correspondence according to the standard format, and it should be noted that the standard format is an optimal storage format predetermined according to the type of the function set, and may also be a standard format corresponding to an optimal search mode determined according to the personalized semantics of the target object.
In an exemplary embodiment, the determining module is further configured to obtain a voice rule of a target object in a voice device, where the voice rule is used to indicate a feature of a spoken voice of the target object; determining a target recognition condition from a preset corpus generalization recognition condition according to the voice rule; and extracting target semantics in the voice text according to the target recognition conditions.
It should be noted that, the above modules may be implemented by software or hardware, and for the latter, the following may be implemented, but not limited to: the modules are all positioned in the same processor; alternatively, the modules are respectively located in different processors in any combination.
Embodiments of the present invention also provide a storage medium having a computer program stored therein, wherein the computer program is arranged to perform the steps of any of the above method embodiments when executed.
Alternatively, in the present embodiment, the storage medium may be configured to store a computer program for executing the steps of:
s1, acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions of the same device and preset semantic identifiers of each function;
s2, acquiring target semantics of voice input by a voice device through a target object, determining a target device corresponding to the target semantics, and determining a target function corresponding to the target semantics in a device function set under the type of the target device;
and S3, controlling the target device to execute the target function.
Embodiments of the present invention also provide a computer-readable storage medium having a computer program stored thereon, wherein the computer program is arranged to perform the steps of any of the above-mentioned method embodiments when executed.
In an exemplary embodiment, the computer-readable storage medium may include, but is not limited to: various media capable of storing computer programs, such as a usb disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk.
Embodiments of the present invention also provide an electronic device comprising a memory having a computer program stored therein and a processor arranged to run the computer program to perform the steps of any of the above method embodiments.
In an exemplary embodiment, the electronic apparatus may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor, and the input/output device is connected to the processor.
In an exemplary embodiment, the processor may be configured to execute the following steps by a computer program:
s1, acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions of the same device and preset semantic identifiers of each function;
s2, acquiring target semantics of voice input by a voice device through a target object, determining a target device corresponding to the target semantics, and determining a target function corresponding to the target semantics in a device function set under the type of the target device;
and S3, controlling the target device to execute the target function.
It will be apparent to those skilled in the art that the various modules or steps of the invention described above may be implemented using a general purpose computing device, they may be centralized on a single computing device or distributed across a network of computing devices, and they may be implemented using program code executable by the computing devices, such that they may be stored in a memory device and executed by the computing device, and in some cases, the steps shown or described may be performed in an order different than that described herein, or they may be separately fabricated into various integrated circuit modules, or multiple ones of them may be fabricated into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A method of controlling a device, comprising:
acquiring multiple devices and device function sets of the multiple devices, wherein the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function;
acquiring target semantics of voice input by a target object to voice equipment, determining target equipment corresponding to the target semantics, and determining target functions corresponding to the target semantics in an equipment function set under the type of the target equipment;
controlling the target device to execute the target function.
2. The method of claim 1, wherein obtaining target semantics of a target object for speech input by a speech device comprises:
collecting voice which is sent by a target object and used for controlling the target device through the voice device;
carrying out voice recognition conversion on the voice to obtain a voice text;
and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
3. The method of claim 1, wherein determining the target function corresponding to the target semantic in the device function set of the target device type comprises:
acquiring a device function set of the target device;
searching a preset semantic identifier corresponding to the target semantic from the device function set of the target device;
and determining the function corresponding to the preset semantic identifier as the target function.
4. The method of claim 1, wherein after controlling the target device to perform the target function, the method further comprises:
acquiring an operation result of the target equipment;
and sending the operation result to a voice device, and performing voice feedback on the target object through the voice device.
5. The method of claim 1, wherein prior to obtaining the plurality of devices and the device functionality of the plurality of devices, the method further comprises:
determining a standard format of the device and a function set of the device;
and storing the obtained multiple devices and the device function sets of the multiple devices according to the standard format.
6. The method according to claim 2, wherein extracting the target semantics from the speech text according to a preset corpus generalization recognition condition comprises:
acquiring a voice rule of a target object in voice equipment, wherein the voice rule is used for indicating the characteristics of voice of the target object;
determining a target recognition condition from a preset corpus generalization recognition condition according to the voice rule;
and extracting target semantics in the voice text according to the target recognition conditions.
7. A control apparatus of a device, characterized by comprising:
the device comprises an acquisition module, a storage module and a processing module, wherein the acquisition module is used for acquiring multiple devices and device function sets of the multiple devices, and the device function sets are used for indicating all functions under the same device and preset semantic identifiers of each function;
the determining module is used for acquiring target semantics of voice input by a voice device through a target object, determining target devices corresponding to the target semantics, and determining target functions corresponding to the target semantics in a device function set under the type of the target devices;
and the control module is used for controlling the target equipment to execute the target function.
8. The apparatus according to claim 7, wherein the determining module is further configured to collect, by the speech device, speech uttered by a target object for controlling the target device; carrying out voice recognition conversion on the voice to obtain a voice text; and extracting target semantics in the voice text according to a preset corpus generalization recognition condition.
9. A computer-readable storage medium, comprising a stored program, wherein the program is operable to perform the method of any one of claims 1 to 6.
10. An electronic device comprising a memory and a processor, characterized in that the memory has stored therein a computer program, the processor being arranged to execute the method of any of claims 1 to 6 by means of the computer program.
CN202011548326.XA 2020-12-23 2020-12-23 Device control method and apparatus, storage medium, and electronic apparatus Active CN112735406B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011548326.XA CN112735406B (en) 2020-12-23 2020-12-23 Device control method and apparatus, storage medium, and electronic apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011548326.XA CN112735406B (en) 2020-12-23 2020-12-23 Device control method and apparatus, storage medium, and electronic apparatus

Publications (2)

Publication Number Publication Date
CN112735406A true CN112735406A (en) 2021-04-30
CN112735406B CN112735406B (en) 2022-10-28

Family

ID=75605286

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011548326.XA Active CN112735406B (en) 2020-12-23 2020-12-23 Device control method and apparatus, storage medium, and electronic apparatus

Country Status (1)

Country Link
CN (1) CN112735406B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113434167A (en) * 2021-06-15 2021-09-24 青岛海尔科技有限公司 Function upgrading method and device, storage medium and electronic device
CN115146650A (en) * 2022-06-27 2022-10-04 西安羚控电子科技有限公司 Test process creating method and system based on semantic recognition

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105446146A (en) * 2015-11-19 2016-03-30 深圳创想未来机器人有限公司 Intelligent terminal control method based on semantic analysis, system and intelligent terminal
US20180121783A1 (en) * 2016-10-31 2018-05-03 Baidu Online Network Technology (Beijing) Co., Ltd. Semantic analysis method and apparatus based on artificial intelligence
CN108735214A (en) * 2018-05-30 2018-11-02 出门问问信息科技有限公司 The sound control method and device of equipment
CN108965081A (en) * 2018-09-06 2018-12-07 珠海格力电器股份有限公司 The method and device of voice control device
CN109347708A (en) * 2018-10-15 2019-02-15 珠海格力电器股份有限公司 A kind of audio recognition method, device, household appliance, Cloud Server and medium
CN110875040A (en) * 2018-08-29 2020-03-10 九阳股份有限公司 Household appliance control method and system based on product skills
CN111324626A (en) * 2020-01-21 2020-06-23 苏州思必驰信息科技有限公司 Search method and device based on voice recognition, computer equipment and storage medium
CN211791560U (en) * 2020-05-09 2020-10-27 宁波奥克斯电气股份有限公司 Distributed voice control system
CN111883112A (en) * 2020-07-27 2020-11-03 中国平安人寿保险股份有限公司 Semantic recognition method and device based on multi-mode identification and computer equipment
CN111884887A (en) * 2020-07-01 2020-11-03 海尔优家智能科技(北京)有限公司 Voice interaction method and device, storage medium and electronic device

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105446146A (en) * 2015-11-19 2016-03-30 深圳创想未来机器人有限公司 Intelligent terminal control method based on semantic analysis, system and intelligent terminal
US20180121783A1 (en) * 2016-10-31 2018-05-03 Baidu Online Network Technology (Beijing) Co., Ltd. Semantic analysis method and apparatus based on artificial intelligence
CN108735214A (en) * 2018-05-30 2018-11-02 出门问问信息科技有限公司 The sound control method and device of equipment
CN110875040A (en) * 2018-08-29 2020-03-10 九阳股份有限公司 Household appliance control method and system based on product skills
CN108965081A (en) * 2018-09-06 2018-12-07 珠海格力电器股份有限公司 The method and device of voice control device
CN109347708A (en) * 2018-10-15 2019-02-15 珠海格力电器股份有限公司 A kind of audio recognition method, device, household appliance, Cloud Server and medium
CN111324626A (en) * 2020-01-21 2020-06-23 苏州思必驰信息科技有限公司 Search method and device based on voice recognition, computer equipment and storage medium
CN211791560U (en) * 2020-05-09 2020-10-27 宁波奥克斯电气股份有限公司 Distributed voice control system
CN111884887A (en) * 2020-07-01 2020-11-03 海尔优家智能科技(北京)有限公司 Voice interaction method and device, storage medium and electronic device
CN111883112A (en) * 2020-07-27 2020-11-03 中国平安人寿保险股份有限公司 Semantic recognition method and device based on multi-mode identification and computer equipment

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113434167A (en) * 2021-06-15 2021-09-24 青岛海尔科技有限公司 Function upgrading method and device, storage medium and electronic device
CN113434167B (en) * 2021-06-15 2023-06-16 青岛海尔科技有限公司 Function upgrading method and device, storage medium and electronic device
CN115146650A (en) * 2022-06-27 2022-10-04 西安羚控电子科技有限公司 Test process creating method and system based on semantic recognition

Also Published As

Publication number Publication date
CN112735406B (en) 2022-10-28

Similar Documents

Publication Publication Date Title
CN112735406B (en) Device control method and apparatus, storage medium, and electronic apparatus
CN107589952A (en) Intelligent upgrade method, device and system
CN104185868A (en) Voice authentication and speech recognition system and method
CN108733639A (en) A kind of configuration parameter regulation means, device, terminal device and storage medium
CN112084315A (en) Question-answer interaction method, device, storage medium and equipment
CN112116910A (en) Voice instruction recognition method and device, storage medium and electronic device
CN107911251A (en) A kind of method for configuring network equipment, device and medium
CN113921004A (en) Intelligent device control method and device, storage medium and electronic device
CN113407157A (en) Object model determination method and device, storage medium and electronic device
CN110364155A (en) Voice control error-reporting method, electric appliance and computer readable storage medium
US11985029B2 (en) Pico-base station configuration method and apparatus, storage medium and electronic apparatus
CN114238269A (en) Database parameter adjusting method and device, electronic equipment and storage medium
CN110531632A (en) Control method and system
CN112418735A (en) Power grid AI scheduling system and method based on graph calculation
CN115620710A (en) Speech recognition method, speech recognition device, storage medium and electronic device
CN113595811B (en) Equipment performance testing method and device, storage medium and electronic device
CN114550719A (en) Method and device for recognizing voice control instruction and storage medium
CN110647314B (en) Skill generation method and device and electronic equipment
CN113835939A (en) Method and device for processing fault information, storage medium and electronic device
CN112698948A (en) Method and device for acquiring product resources, storage medium and electronic device
CN115599881A (en) Workflow creation method, system, electronic device, and computer-readable storage medium
CN108737538B (en) Background data processing method and device
CN113723800B (en) Risk identification model training method and device, and risk identification method and device
CN111309754B (en) Service data access method, device, storage medium and electronic equipment
CN114999494A (en) Voice interaction function testing method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant