WO2020006993A1 - Intelligent household electrical appliance control method and intelligent household electrical appliance control device - Google Patents

Intelligent household electrical appliance control method and intelligent household electrical appliance control device Download PDF

Info

Publication number
WO2020006993A1
WO2020006993A1 PCT/CN2018/122256 CN2018122256W WO2020006993A1 WO 2020006993 A1 WO2020006993 A1 WO 2020006993A1 CN 2018122256 W CN2018122256 W CN 2018122256W WO 2020006993 A1 WO2020006993 A1 WO 2020006993A1
Authority
WO
WIPO (PCT)
Prior art keywords
comfort
evaluation result
control action
parameter information
smart home
Prior art date
Application number
PCT/CN2018/122256
Other languages
French (fr)
Chinese (zh)
Inventor
杨赛赛
陈翀
万会
宋德超
连圆圆
秦萍
冯德兵
Original Assignee
珠海格力电器股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 珠海格力电器股份有限公司 filed Critical 珠海格力电器股份有限公司
Publication of WO2020006993A1 publication Critical patent/WO2020006993A1/en

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B15/00Systems controlled by a computer
    • G05B15/02Systems controlled by a computer electric
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/20Pc systems
    • G05B2219/26Pc applications
    • G05B2219/2642Domotique, domestic, home control, automation, smart house

Definitions

  • the present application relates to the field of smart home appliance control, and in particular, to a smart home appliance control method and a smart home appliance control device.
  • Smart appliances can improve the comfort of home life. Taking air conditioners as an example, it can provide users with a comfortable ambient temperature environment.
  • the control method of the air conditioner is that the user sets the operating temperature, and the air conditioner performs feedback adjustment according to the ambient temperature of the room in which the ambient temperature of the room is maintained at the set temperature of the air conditioner.
  • comfort control users usually set the air conditioner to achieve comfort control by relying on their somatosensory experience.
  • the air conditioner operation state set by the user each time may not be the most comfortable operation state, which results in a poor user experience.
  • the present application discloses a smart home appliance control method and a smart home appliance control device.
  • a smart home appliance control method includes:
  • a control action corresponding to the parameter information is obtained through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
  • the operation is controlled according to the control action.
  • the acquiring parameter information includes:
  • the acquiring environmental parameter information includes:
  • the preset model further includes a state transition model
  • the obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
  • a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  • the state transition model includes one or more of a state comparison table, a neural network model, and a preset logic rule.
  • the reinforcement learning model can be adjusted according to the comfort evaluation results, including:
  • the probability that the reinforcement learning model outputs the control action can be adjusted according to the comfort evaluation result.
  • it further includes:
  • the obtaining the comfort evaluation result after running according to the control action includes:
  • the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action
  • the second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action
  • the comfort evaluation result is obtained.
  • the comfort evaluation algorithm sets the same or different weights corresponding to each state parameter.
  • the obtaining the comfort evaluation result after running according to the control action includes:
  • a comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  • the comfort evaluation result includes a positive evaluation result or a negative evaluation result
  • the updating the reinforcement learning model according to the comfort evaluation result includes:
  • a smart home appliance control device includes:
  • a first obtaining module configured to obtain parameter information
  • a second acquisition module based on the parameter information, obtaining a control action corresponding to the parameter information through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
  • a control module configured to control operation according to the control action.
  • the first obtaining module is specifically configured to:
  • the preset model further includes a state transition model
  • the obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
  • a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  • the smart home appliance control device further includes:
  • An evaluation module configured to obtain the comfort evaluation result after controlling operation according to the control action
  • An update module is configured to update the reinforcement learning model according to the comfort evaluation result.
  • the evaluation module is specifically configured to:
  • the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action
  • the second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action
  • the comfort evaluation result is obtained.
  • the evaluation module is further specifically configured to:
  • a comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  • the update module is specifically configured to:
  • the comfort evaluation result includes a positive evaluation result or a negative evaluation result
  • This application is based on the obtained parameter information, and obtains the control action corresponding to the parameter information through a preset model.
  • the preset model includes a reinforcement learning model.
  • the reinforcement learning model can be adjusted according to the comfort evaluation result.
  • FIG. 1 is a schematic flowchart of a smart home appliance control method disclosed by an embodiment of the present application
  • FIG. 2 is a schematic structural diagram of a state transition model disclosed by an embodiment of the present application.
  • FIG. 3 is a schematic flowchart of a smart home appliance control method disclosed by another embodiment of the present application.
  • FIG. 4 is a schematic flowchart of a smart home appliance control method disclosed by another embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of a smart home appliance control device disclosed by an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a smart home appliance control device disclosed by another embodiment of the present application.
  • FIG. 1 is a schematic flowchart of a smart home appliance control method according to an embodiment of the present application. As shown in FIG. 1, the smart home appliance control method includes the following steps:
  • Step S101 Acquire parameter information.
  • the parameter information is related to the comfort of the smart home appliance control action, and the parameter information may be environmental parameter information.
  • the environmental parameter information may be environmental parameter information collected and / or configured by the smart home appliance itself.
  • Smart appliances can obtain environmental parameter information through themselves.
  • smart home appliances use their own configured sensors to collect environmental parameter information, such as indoor temperature, humidity, and particle information; for example, room information configured in smart home appliances, such as room size, orientation, and lighting.
  • the environmental parameter information may also be collected and / or configured for external devices of the smart home appliance.
  • Smart appliances can receive environmental parameter information from external devices, such as smart appliances receiving local weather information, such as local temperature, humidity, rain, and snow, sent by cloud servers through the network.
  • the smart home appliance may also be associated with other smart home appliances and sensors, and receive environmental parameter information collected by other smart home appliances and sensors.
  • it is associated with other smart home appliances and receives temperature and humidity information collected by other smart home appliances.
  • it is associated with door and window sensors.
  • the door and window sensors obtain the door and window switch status information.
  • the smart home appliances receive the switch status information sent by the door and window sensors. It is the smart home appliance that receives the information sent by the control center of the smart home system, such as room information configured in the control center of the smart home system.
  • the above-mentioned environmental parameter information may be obtained by the smart home appliance from itself or by other appliances.
  • the environmental parameter information may include at least one of the following: local weather information, such as temperature, humidity, rain, snow, etc .; room information of the room where the smart home appliance is located , Such as space size, orientation, lighting conditions, etc .; information about other devices in the room where the smart appliance is located, such as door and window status information, such as doors or windows open or closed.
  • local weather information such as temperature, humidity, rain, snow, etc .
  • room information of the room where the smart home appliance is located such as space size, orientation, lighting conditions, etc .
  • information about other devices in the room where the smart appliance is located such as door and window status information, such as doors or windows open or closed.
  • parameter information may also be parameter information of the smart home appliance, such as the running time information of the smart home appliance.
  • the diversified parameter information is comprehensively used to control smart home appliances.
  • Smart home appliances are controlled by responding to the diversified parameter information, instead of being operated by users themselves, which can improve the user experience.
  • the smart home appliance includes, but is not limited to, a smart air conditioner.
  • a smart air conditioner by responding to diverse parameter information for control, instead of the user's own operation, the user experience can be improved, and energy saving purposes can also be achieved in some cases, such as the local hot weather suddenly cools down and becomes cold, but the user is unknown At this time, the intelligent air conditioner controls the operation according to the local weather at this time, which can achieve energy saving.
  • Step S102 Based on the parameter information, a control action corresponding to the parameter information is obtained through a preset model.
  • the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result.
  • control action can be generated by the reinforcement learning model, and the reinforcement learning model can be adjusted according to the comfort evaluation result, so that the control action generated by the adjustment meets the user's comfort experience.
  • the preset model may further include a state transition model. Accordingly, obtaining the control action corresponding to the parameter information through the preset model based on the parameter information includes:
  • a control action is generated by the reinforcement learning model, which is used to characterize the correspondence between the state parameter and the control action.
  • FIG. 2 is a schematic structural diagram of a state transition model provided by an embodiment of the present application.
  • the input of the state transition model 20 is parameter information and the output is state parameters.
  • the parameter information includes door and window closing conditions 201 and weather environment conditions. 202 (including temperature, humidity, rain, snow, etc.), room information 203 (including space size, orientation, lighting, etc.), the state parameters are preset and fixed, as shown in FIG. 2, taking three state parameters as examples.
  • the specific status parameters can be set according to the actual situation.
  • the status parameters can include temperature, humidity, and lighting, and can be three types of statuses determined by comprehensively determining a variety of parameter information.
  • the state transition model 20 may be an artificially set logic rule, a state comparison table, a neural network structure, or a mixture of the three.
  • the output is a simplified mapping of the input information, and the specific output parameter type depends on the actual control target.
  • the establishment of this process requires a large amount of actual case data extraction or training. For example, according to the actual case data extraction, it can be determined that the corresponding state parameter B when the parameter information is A. Therefore, when the parameter information is obtained as A, the corresponding state parameter can be obtained as B according to the state transition model.
  • the complex parameter information is converted into state parameters with a mapping relationship.
  • the state parameters can form a summary of the complex parameter information, simplifying data processing, and avoiding Reinforcement learning models face the processing pressure when faced with complex and numerous parameter information.
  • the input of the reinforcement learning model is the state parameter
  • the output is the control action.
  • the state parameter is, for example, temperature drop
  • the control action is, for example, increasing the temperature.
  • the output probability of the output parameters of the reinforcement learning model can be adjusted according to the comfort evaluation result, so each output control action of the reinforcement model can be the control action with the best comfort evaluation result.
  • the reinforcement learning model can be adjusted according to the comfort evaluation result, including: the probability that the reinforcement learning model outputs the control action can be adjusted according to the comfort evaluation result.
  • the output probability of the control action can be adjusted according to the comfort evaluation result, so that the generated control action can be most suitable for the user's comfort experience.
  • Step S103 Control operation according to the control action.
  • the reinforcement learning model can be obtained by continuously updating the comfort evaluation result of the control action, because the control action of the reinforcement learning model can be adjusted according to the comfort evaluation result, corresponding to a specific state parameter, According to the reinforcement learning model, the control action with the best evaluation result corresponding to the state parameter is output to control the operation of the smart home appliance, so that the control action performed by the smart home appliance can achieve comfort control, thereby improving the user experience.
  • FIG. 3 is a schematic flowchart of a smart home appliance control method according to another embodiment of the present application. As shown in FIG. 3, the smart home appliance control method further includes the following steps:
  • Step S104 Obtain the comfort evaluation result after the control operation is performed according to the control action.
  • the comfort evaluation result reflects the degree of comfort experience given to the user after the control action is performed. It can be understood that after the control action has a new comfort evaluation result, the occurrence of the control action in the future operation of the smart home appliance will be Adjust based on the new comfort evaluation results.
  • the comfort evaluation result may be a feedback result of the user, and / or the comfort evaluation result may also be calculated according to a preset algorithm.
  • the acquiring the comfort evaluation result after controlling the operation of the smart home appliance according to the control action includes:
  • a comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  • the user can intuitively evaluate the comfort experience of the environment as the result of the comfort evaluation of the air conditioner control action.
  • the acquiring the comfort evaluation result after running according to the control action includes:
  • the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action
  • the second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action
  • the comfort evaluation result is obtained.
  • the air conditioner Before the air conditioner performs a new control action, its state parameter corresponds to the first comfort value. After the air conditioner performs a new control action, its new state parameter corresponds to the second comfort value. The second comfort value is compared numerically. If the second comfort value is greater than the first comfort value, the comfort evaluation result is a positive evaluation result, and the comfort evaluation of the control action is better; if the second comfort value is less than the first comfort value, A comfort value is a negative evaluation result, and the comfort evaluation of the control action is poor.
  • the comfort value corresponding to the state parameter is obtained through a preset comfort evaluation algorithm.
  • the comfort evaluation algorithm may be a comparison table of the state parameter and the comfort value state, or may be a formula or the like.
  • the same or different weights can be set for each state parameter in the comfort evaluation algorithm, and the state parameters are quantified by the weight ratio to obtain the corresponding comfort value.
  • Step S105 Update the reinforcement learning model according to the comfort evaluation result.
  • the comfort evaluation result includes a positive evaluation result or a negative evaluation result
  • the updating the reinforcement learning model according to the comfort evaluation result includes:
  • the comfort evaluation result is a positive evaluation result
  • the control action is performed after the air conditioner.
  • the probability of occurrence is increased; if the comfort evaluation result is a negative evaluation result, it indicates that the indoor environment after the air conditioner performs the control action is less comfortable for the user.
  • the control action In the subsequent operation of the air conditioner, the probability of occurrence is reduced. It can be understood that in the actual operation of the air conditioner, the above process is repeated many times. If a control action is repeatedly evaluated multiple times, it means that the indoor environment after the execution of the control action gives the user a very good comfort experience. The probability of performing this control action in the future is also very high, and then the control action of the air conditioner can be adjusted in the direction of optimal comfort, thereby achieving comfort control to improve the operation of the air conditioner and improving the user experience.
  • FIG. 4 is a schematic flowchart of a smart home appliance control method according to another embodiment of the present application. As shown in FIG. 4, the smart home appliance control method includes the following steps:
  • Obtaining parameter information includes: obtaining environmental parameter information, and / or obtaining own parameter information of the smart home appliance.
  • a control action corresponding to the parameter information is obtained through a preset model.
  • the preset model includes a state transition model and a reinforcement learning model, wherein the reinforcement learning model can be based on a comfort evaluation result. Make adjustments
  • a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  • the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action
  • the second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action
  • the comfort evaluation result is obtained.
  • the comfort evaluation result includes a positive evaluation result or a negative evaluation result
  • updating the reinforcement learning model according to the comfort evaluation result includes: if the comfort evaluation result is a positive evaluation result, increasing Increase the output probability of the control action; or reduce the output probability of the control action if the comfort evaluation result is a negative evaluation result.
  • step S24 acquiring the comfort evaluation result after the control operation according to the control action may further include:
  • a comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  • the evaluation by the user feedback is used as the evaluation of the control action.
  • the above embodiments of the smart home appliance control method are not limited to being applied to the smart air conditioner embodiment, and can also be applied to other smart home appliances, such as smart air purifiers.
  • FIG. 5 is a schematic structural diagram of a smart home appliance control device according to an embodiment of the present application. As shown in FIG. 5, the smart home appliance control device 5 includes:
  • a first acquiring module 51 configured to acquire parameter information
  • the second obtaining module 52 obtains a control action corresponding to the parameter information through a preset model based on the parameter information.
  • the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result. ;
  • the control module 53 is configured to control operation according to the control action.
  • the smart home appliance control device 5 obtains parameter information through the first obtaining module 51, and the second obtaining module 52 obtains control actions of the smart home appliance based on the parameter information.
  • the reinforcement learning model can be based on The comfort evaluation result is adjusted to realize that the control action generated according to the reinforcement learning model is the control action with the best comfort evaluation result. Therefore, the control module 53 controls the operation of the smart home appliance according to the control action to suit the user ’s comfort experience.
  • the first obtaining module 51 is specifically configured to:
  • the parameter information is related to the comfort of the smart home appliance control action, and the parameter information may be environmental parameter information.
  • the environmental parameter information may be environmental parameter information collected and / or configured by the smart home appliance itself.
  • Smart appliances can obtain environmental parameter information from themselves.
  • smart home appliances use their own configured sensors to collect environmental parameter information, such as indoor temperature, humidity, and particle information; for example, room information configured in smart home appliances, such as room size, orientation, and lighting.
  • the environmental parameter information may also be collected and / or configured for external devices of the smart home appliance.
  • Smart appliances can receive environmental parameter information from external devices, such as smart appliances receiving local weather information, such as local temperature, humidity, rain, and snow, sent by cloud servers through the network.
  • the smart home appliance may also be associated with other smart home appliances and sensors, and receive environmental parameter information collected by other smart home appliances and sensors.
  • it is associated with other smart home appliances and receives temperature and humidity information collected by other smart home appliances.
  • it is associated with door and window sensors.
  • the door and window sensors obtain the door and window switch status information.
  • the smart home appliances receive the switch status information sent by the door and window sensors. It is the smart home appliance that receives the information sent by the control center of the smart home system, such as room information configured in the control center of the smart home system.
  • the above-mentioned environmental parameter information may be obtained by the smart home appliance from itself or by other appliances.
  • the environmental parameter information may include at least one of the following: local weather information, such as temperature, humidity, rain, snow, etc .; room information of the room where the smart home appliance is located , Such as space size, orientation, lighting conditions, etc .; information about other devices in the room where the smart appliance is located, such as door and window status information, such as doors or windows open or closed.
  • local weather information such as temperature, humidity, rain, snow, etc .
  • room information of the room where the smart home appliance is located such as space size, orientation, lighting conditions, etc .
  • information about other devices in the room where the smart appliance is located such as door and window status information, such as doors or windows open or closed.
  • parameter information may also be parameter information of the smart home appliance, such as the running time information of the smart home appliance.
  • the diversified parameter information is comprehensively used to control smart home appliances.
  • Smart home appliances are controlled by responding to the diversified parameter information, instead of being operated by users themselves, which can improve the user experience.
  • the smart home appliance includes, but is not limited to, a smart air conditioner.
  • a smart air conditioner by responding to diverse parameter information for control, instead of the user's own operation, the user experience can be improved, and energy saving purposes can also be achieved in some cases, such as the local hot weather suddenly cools down and becomes cold, but the user is unknown At this time, the intelligent air conditioner controls the operation according to the local weather at this time, which can achieve energy saving.
  • the preset model further includes a state transition model
  • the obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
  • a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  • FIG. 2 is a schematic structural diagram of a state transition model provided by an embodiment of the present application.
  • the input of the state transition model 20 is parameter information and the output is state parameters.
  • the parameter information includes door and window closing conditions 201 and weather environment conditions. 202 (including temperature, humidity, rain, snow, etc.), room information 203 (including space size, orientation, lighting, etc.), the state parameters are preset and fixed, as shown in FIG. 2, taking three state parameters as examples.
  • the specific status parameters can be set according to the actual situation.
  • the status parameters can include temperature, humidity, and lighting, and can be three types of statuses determined by comprehensively determining a variety of parameter information.
  • the state transition model 20 may be an artificially set logic rule, a state comparison table, a neural network structure, or a mixture of the three.
  • the output is a simplified mapping of the input information, and the specific output parameter type depends on the actual control target.
  • the establishment of this process requires a large amount of actual case data extraction or training. For example, according to the actual case data extraction, it can be determined that the corresponding state parameter B when the parameter information is A. Therefore, when the parameter information is obtained as A, the corresponding state parameter can be obtained as B according to the state transition model.
  • the complex parameter information is converted into state parameters with a mapping relationship.
  • the state parameters can form a summary of the complex parameter information, simplify the data processing, and avoid the reinforcement learning model. Processing pressure in the face of complex and numerous parameter information.
  • the input of the reinforcement learning model is the state parameter
  • the output is the control action.
  • the state parameter is, for example, temperature drop
  • the control action is, for example, increasing the temperature.
  • the output probability of the output parameters of the reinforcement learning model can be adjusted according to the comfort evaluation result, so each output control action of the reinforcement model can be the control action with the best comfort evaluation result.
  • FIG. 6 is a schematic structural diagram of a smart home appliance control device according to another embodiment of the present application. As shown in FIG. 6, the smart home appliance control device 5 further includes:
  • An evaluation module 54 configured to obtain the comfort evaluation result after the control operation is performed according to the control action
  • An update module 55 is configured to update the reinforcement learning model according to the comfort evaluation result.
  • the evaluation result of the control action is obtained through the evaluation module 54.
  • the appearance of the control action in the subsequent operation of the smart home appliance can be performed based on the new comfort evaluation result. Adjustment.
  • the update module 55 After the reinforcement learning model is updated according to the comfort evaluation result, the control action generated by the reinforcement learning model will be adjusted accordingly, so as to achieve the control action with the best comfort evaluation result.
  • the evaluation module 54 is specifically configured to:
  • the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action
  • the second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action
  • the comfort evaluation result is obtained.
  • the air conditioner Before the air conditioner performs a new control action, its state parameter corresponds to the first comfort value. After the air conditioner performs a new control action, its new state parameter corresponds to the second comfort value. The second comfort value is compared numerically. If the second comfort value is greater than the first comfort value, the comfort evaluation result is a positive evaluation result, and the comfort evaluation of the control action is better; if the second comfort value is less than the first comfort value, A comfort value is a negative evaluation result, and the comfort evaluation of the control action is poor.
  • the comfort value corresponding to the state parameter is obtained through a preset comfort evaluation algorithm.
  • the comfort evaluation algorithm may be a comparison table between the state parameter and the comfort value state, or a formula or the like.
  • the same or different weights can be set for each state parameter in the comfort evaluation algorithm, and the state parameters are quantified by the weight ratio to obtain the corresponding comfort value.
  • the evaluation module 54 is further specifically configured to:
  • a comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  • the user can intuitively evaluate the comfort experience of the environment as the result of the comfort evaluation of the air conditioner control action.
  • the update module 55 is specifically configured to:
  • the comfort evaluation result includes a positive evaluation result or a negative evaluation result
  • the comfort evaluation result is a positive evaluation result
  • the control action is performed after the air conditioner.
  • the probability of occurrence is increased; if the comfort evaluation result is a negative evaluation result, it indicates that the indoor environment after the air conditioner performs the control action is less comfortable for the user.
  • the control action In the subsequent operation of the air conditioner, the probability of occurrence is reduced. It can be understood that in the actual operation of the air conditioner, the above process is repeated many times. If a control action is repeatedly evaluated multiple times, it means that the indoor environment after the execution of the control action is really very comfortable for the user. The probability of performing this control action in the future is also very high, and then the control action of the air conditioner can be adjusted in the direction of optimal comfort, thereby improving the comfort control of the air conditioner operation and improving the user experience.
  • the above-mentioned application embodiments of the smart home appliance control device 5 include, but are not limited to, the embodiments of smart air conditioners, and can also be applied to other smart home appliances, such as smart air purifiers.
  • Any process or method description in a flowchart or otherwise described herein can be understood as representing a module, fragment, or portion of code that includes one or more executable instructions for implementing a particular logical function or step of a process
  • the scope of the preferred embodiments of the present application includes additional implementations, in which the functions may be performed out of the order shown or discussed, including performing functions in a substantially simultaneous manner or in the reverse order according to the functions involved, which should It is understood by those skilled in the art to which the embodiments of the present application pertain.
  • each part of the application may be implemented by hardware, software, firmware, or a combination thereof.
  • multiple steps or methods may be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system.
  • a suitable instruction execution system For example, if implemented in hardware, as in another embodiment, it may be implemented using any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gate circuits, programmable gate arrays (PGA), field programmable gate arrays (FPGA), etc.
  • a person of ordinary skill in the art can understand that all or part of the steps carried by the methods in the foregoing embodiments may be implemented by a program instructing related hardware.
  • the program may be stored in a computer-readable storage medium.
  • the program is When executed, one or a combination of the steps of the method embodiment is included.
  • each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist separately physically, or two or more units may be integrated into one module.
  • the above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.
  • the aforementioned storage medium may be a read-only memory, a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Air Conditioning Control Device (AREA)
  • Selective Calling Equipment (AREA)

Abstract

Disclosed are an intelligent household electrical appliance control method and an intelligent household electrical appliance control device (5), which belong to the field of intelligent household electrical appliance control. The intelligent household electrical appliance control method comprises: obtaining parameter information (S101); obtaining, on the basis of the parameter information, a control action corresponding to the parameter information by means of a preset model, the preset model including an intensive learning model, and the intensive learning model can be adjusted according to a comfort evaluation result (S102); and controlling operation according to the control action (S103). On the basis of the obtained parameter information, control actions with good evaluation results are output to control operation of the intelligent household electrical appliances; the control actions for implementing executions of the intelligent household electrical appliances can satisfy comfort demands of a user, so that the comfort control is achieved and the user experience is improved.

Description

智能家电控制方法及智能家电控制装置Intelligent household appliance control method and intelligent household appliance control device
相关申请Related applications
本申请要求2018年07月06日申请的,申请号为201810734605.1,名称为“一种智能家电控制方法及智能家电控制装置”的中国专利申请的优先权,在此将其全文引入作为参考。This application claims priority from a Chinese patent application filed on July 6, 2018, with application number 201810734605.1, entitled "A Smart Home Appliance Control Method and Smart Home Appliance Control Device", which is incorporated herein by reference in its entirety.
技术领域Technical field
本申请涉及智能家电控制领域,具体涉及一种智能家电控制方法及智能家电控制装置。The present application relates to the field of smart home appliance control, and in particular, to a smart home appliance control method and a smart home appliance control device.
背景技术Background technique
智能家电可以提升家居生活的舒适性,以空调为例,可以提供用户舒适的环境温度环境。Smart appliances can improve the comfort of home life. Taking air conditioners as an example, it can provide users with a comfortable ambient temperature environment.
目前,空调的控制方法是用户设定运行温度,空调根据所在房间的环境温度进行反馈调节,以使房间的环境温度保持在空调的设定温度。在舒适性控制方面,用户通常依靠自己的体感感受对空调进行设定,以实现舒适性控制。该情形下,存在的问题是,用户每次设定的空调运行状态可能并不是最舒适的运行状态,由此导致用户体验也不佳。At present, the control method of the air conditioner is that the user sets the operating temperature, and the air conditioner performs feedback adjustment according to the ambient temperature of the room in which the ambient temperature of the room is maintained at the set temperature of the air conditioner. In terms of comfort control, users usually set the air conditioner to achieve comfort control by relying on their somatosensory experience. In this case, there is a problem that the air conditioner operation state set by the user each time may not be the most comfortable operation state, which results in a poor user experience.
因而,在智能家电运行舒适性控制和用户体验方面,依然存在提升改善的需求。Therefore, in terms of the comfort control and user experience of smart home appliances, there is still a need for improvement.
发明内容Summary of the invention
为至少在一定程度上克服相关技术中存在的问题,本申请公开了一种智能家电控制方法及智能家电控制装置。In order to overcome the problems in the related technology at least to a certain extent, the present application discloses a smart home appliance control method and a smart home appliance control device.
为实现以上目的,本申请采用如下技术方案:In order to achieve the above purpose, this application uses the following technical solutions:
一种智能家电控制方法,包括:A smart home appliance control method includes:
获取参数信息;Get parameter information;
基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整;Based on the parameter information, a control action corresponding to the parameter information is obtained through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
根据所述控制动作控制运行。The operation is controlled according to the control action.
优选的,所述获取参数信息包括:Preferably, the acquiring parameter information includes:
获取环境参数信息,和/或,Obtain environmental parameter information, and / or,
获取智能家电自身参数信息。Get the parameter information of the smart home appliance.
优选的,所述获取环境参数信息包括:Preferably, the acquiring environmental parameter information includes:
获取智能家电自身采集和/或配置的环境参数信息;和/或Obtain the environmental parameter information collected and / or configured by the smart appliance itself; and / or
获取智能家电的外界设备采集和/或配置的环境参数信息。Obtain environmental parameter information collected and / or configured by external devices of smart appliances.
优选的,所述预设模型还包括状态转换模型;Preferably, the preset model further includes a state transition model;
所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:The obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
优选的,所述状态转换模型包括状态对照表、神经网络模型和预设逻辑规则中的一种或者多种。Preferably, the state transition model includes one or more of a state comparison table, a neural network model, and a preset logic rule.
优选的,所述强化学习模型能够依据舒适性评价结果进行调整,包括:Preferably, the reinforcement learning model can be adjusted according to the comfort evaluation results, including:
所述强化学习模型输出所述控制动作的概率能够依据舒适性评价结果进行调整。The probability that the reinforcement learning model outputs the control action can be adjusted according to the comfort evaluation result.
优选的,还包括:Preferably, it further includes:
获取根据所述控制动作控制运行后的所述舒适性评价结果;Acquiring the comfort evaluation result after controlling operation according to the control action;
根据所述舒适性评价结果更新所述强化学习模型。Updating the reinforcement learning model according to the comfort evaluation result.
优选的,所述获取根据所述控制动作运行后的所述舒适性评价结果,包括:Preferably, the obtaining the comfort evaluation result after running according to the control action includes:
获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
优选的,所述舒适性评价算法中对应各个状态参数设置相同或不同的权重。Preferably, the comfort evaluation algorithm sets the same or different weights corresponding to each state parameter.
优选的,所述获取根据所述控制动作运行后的所述舒适性评价结果,包括:Preferably, the obtaining the comfort evaluation result after running according to the control action includes:
获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
优选的,所述舒适性评价结果包括正向评价结果或者负向评价结果,所述根据所述舒适性评价结果更新所述强化学习模型,包括:Preferably, the comfort evaluation result includes a positive evaluation result or a negative evaluation result, and the updating the reinforcement learning model according to the comfort evaluation result includes:
如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
一种智能家电控制装置,包括:A smart home appliance control device includes:
第一获取模块,用于获取参数信息;A first obtaining module, configured to obtain parameter information;
第二获取模块,基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整;A second acquisition module, based on the parameter information, obtaining a control action corresponding to the parameter information through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
控制模块,用于根据所述控制动作控制运行。A control module, configured to control operation according to the control action.
优选的,所述第一获取模块具体用于:Preferably, the first obtaining module is specifically configured to:
获取环境参数信息,和/或,Obtain environmental parameter information, and / or,
获取智能家电自身参数信息。Get the parameter information of the smart home appliance.
优选的,所述第二获取模块中,所述预设模型还包括状态转换模型;Preferably, in the second acquisition module, the preset model further includes a state transition model;
所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:The obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
优选的,所述智能家电控制装置还包括:Preferably, the smart home appliance control device further includes:
评价模块,用于获取根据所述控制动作控制运行后的所述舒适性评价结果;An evaluation module, configured to obtain the comfort evaluation result after controlling operation according to the control action;
更新模块,用于根据所述舒适性评价结果更新所述强化学习模型。An update module is configured to update the reinforcement learning model according to the comfort evaluation result.
优选的,所述评价模块具体用于:Preferably, the evaluation module is specifically configured to:
获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
优选的,所述评价模块还具体用于:Preferably, the evaluation module is further specifically configured to:
获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
优选的,所述更新模块具体用于:Preferably, the update module is specifically configured to:
所述舒适性评价结果包括正向评价结果或者负向评价结果;The comfort evaluation result includes a positive evaluation result or a negative evaluation result;
如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
本申请采用以上技术方案,至少具备以下有益效果:This application adopts the above technical solutions and has at least the following beneficial effects:
本申请基于获取的参数信息,通过预设模型获取与参数信息对应的控制动作,预设模型包括强化学习模型,强化学习模型能够依据舒适性评价结果进行调整,通过输出评价结果好的控制动作,以控制智能家电运行,实现智能家电执行的控制动作能够满足用户的舒适性需求,以此实现智能家电的舒适性控制,提升用户体验。This application is based on the obtained parameter information, and obtains the control action corresponding to the parameter information through a preset model. The preset model includes a reinforcement learning model. The reinforcement learning model can be adjusted according to the comfort evaluation result. By controlling the operation of smart home appliances, the control actions performed by the smart home appliances can meet the comfort needs of users, thereby achieving the comfort control of smart home appliances and improving the user experience.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。It should be understood that the above general description and the following detailed description are merely exemplary and explanatory, and should not limit the present application.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据公开的附图获得其他的附图。In order to explain the technical solutions in the embodiments of the present application or the prior art more clearly, the drawings used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, the drawings in the following description are merely It is an embodiment of the present application. For those of ordinary skill in the art, other drawings can be obtained according to the disclosed drawings without paying creative labor.
图1为本申请一个实施例公开的智能家电控制方法的流程示意图;1 is a schematic flowchart of a smart home appliance control method disclosed by an embodiment of the present application;
图2为本申请一个实施例公开的状态转换模型的结构示意图;2 is a schematic structural diagram of a state transition model disclosed by an embodiment of the present application;
图3为本申请另一个实施例公开的智能家电控制方法的流程示意图;3 is a schematic flowchart of a smart home appliance control method disclosed by another embodiment of the present application;
图4为本申请另一个实施例公开的智能家电控制方法的流程示意图;4 is a schematic flowchart of a smart home appliance control method disclosed by another embodiment of the present application;
图5为本申请一个实施例公开的智能家电控制装置的结构示意图;5 is a schematic structural diagram of a smart home appliance control device disclosed by an embodiment of the present application;
图6为本申请另一个实施例公开的智能家电控制装置的结构示意图。FIG. 6 is a schematic structural diagram of a smart home appliance control device disclosed by another embodiment of the present application.
具体实施方式detailed description
为使本申请的目的、技术方案和优点更加清楚,下面将对本申请的技术方案进行详细的描述。显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动的前提下所得到的所有其它实施方式,都属于本申请所保护的范围。In order to make the purpose, technical solution, and advantages of the present application clearer, the technical solution of the present application will be described in detail below. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all the embodiments. Based on the embodiments in the present application, all other implementations obtained by a person of ordinary skill in the art without making creative efforts fall within the protection scope of the present application.
图1为本申请一个实施例提供的智能家电控制方法的流程示意图,如图1所示,该智能家电控制方法包括如下步骤:FIG. 1 is a schematic flowchart of a smart home appliance control method according to an embodiment of the present application. As shown in FIG. 1, the smart home appliance control method includes the following steps:
步骤S101,获取参数信息。Step S101: Acquire parameter information.
可以理解的是,参数信息与智能家电控制动作的舒适性相关,参数信息可以是环境参数信息,在一个实施例中,所述环境参数信息可以为智能家电自身采集和/或配置的环境参数信息,智能家电可以通过自身获取到环境参数信息。比如智能家电利用自身配置的传感器采集的环境参数信息,如室内的温度、湿度、颗粒信息;比如配置在智能家电中的房间 信息,如房间的空间大小、朝向、采光等信息。或者,所述环境参数信息也可以为智能家电的外界设备采集和/或配置的。智能家电可以接收外界设备发送的环境参数信息,比如智能家电通过网络接收云服务器发送的当地天气信息,如当地的温度、湿度、雨雪等信息。或者,智能家电也可以与其他智能家电、传感器进行关联,接收其他智能家电和传感器采集的环境参数信息。比如与其他智能家电关联,接收其他智能家电采集的温度、湿度等信息;比如与门窗传感器进行关联,门窗传感器获得门窗的开关状态信息,然后智能家电接收门窗传感器发送的开关状态信息;比如也可以是智能家电接收智能家居系统的控制中心发送的信息,比如配置在智能家居系统的控制中心的房间信息。It can be understood that the parameter information is related to the comfort of the smart home appliance control action, and the parameter information may be environmental parameter information. In one embodiment, the environmental parameter information may be environmental parameter information collected and / or configured by the smart home appliance itself. , Smart appliances can obtain environmental parameter information through themselves. For example, smart home appliances use their own configured sensors to collect environmental parameter information, such as indoor temperature, humidity, and particle information; for example, room information configured in smart home appliances, such as room size, orientation, and lighting. Alternatively, the environmental parameter information may also be collected and / or configured for external devices of the smart home appliance. Smart appliances can receive environmental parameter information from external devices, such as smart appliances receiving local weather information, such as local temperature, humidity, rain, and snow, sent by cloud servers through the network. Alternatively, the smart home appliance may also be associated with other smart home appliances and sensors, and receive environmental parameter information collected by other smart home appliances and sensors. For example, it is associated with other smart home appliances and receives temperature and humidity information collected by other smart home appliances. For example, it is associated with door and window sensors. The door and window sensors obtain the door and window switch status information. Then the smart home appliances receive the switch status information sent by the door and window sensors. It is the smart home appliance that receives the information sent by the control center of the smart home system, such as room information configured in the control center of the smart home system.
上述的环境参数信息从获取角度来讲,可以是智能家电从自身获取的,也可以是智能家电从其他设备获取的。From the perspective of acquisition, the above-mentioned environmental parameter information may be obtained by the smart home appliance from itself or by other appliances.
从具体信息角度来讲,在一个具体的实施例中,环境参数信息可以包括以下项中的至少一项:当地天气信息,比如温度、湿度、雨雪等;所述智能家电所在房间的房间信息,比如空间大小、朝向、采光情况等;所述智能家电所在房间的其他设备的信息,比如门窗状态信息,比如门窗打开或关闭。From the perspective of specific information, in a specific embodiment, the environmental parameter information may include at least one of the following: local weather information, such as temperature, humidity, rain, snow, etc .; room information of the room where the smart home appliance is located , Such as space size, orientation, lighting conditions, etc .; information about other devices in the room where the smart appliance is located, such as door and window status information, such as doors or windows open or closed.
另外,参数信息也可以是智能家电自身参数信息,比如智能家电的运行时长信息等。In addition, the parameter information may also be parameter information of the smart home appliance, such as the running time information of the smart home appliance.
通过上述实施例,可以实现参数信息获得的多样化,多样化的参数信息综合用于控制智能家电,智能家电通过响应多样化的参数信息进行控制,代替用户亲自操作,可以提升用户体验。Through the above embodiments, diversification of parameter information can be achieved, and the diversified parameter information is comprehensively used to control smart home appliances. Smart home appliances are controlled by responding to the diversified parameter information, instead of being operated by users themselves, which can improve the user experience.
在一个具体的实施例中,所述智能家电包括但不限于智能空调。以智能空调为例,通过响应多样化的参数信息进行控制,代替用户亲自操作,可以提升用户体验,也可以在一些情况下实现节能目的,比如当地炎热天气突然降温变凉,而用户未知的情况下,此时智能空调根据当地天气控制运行,可以实现节能。In a specific embodiment, the smart home appliance includes, but is not limited to, a smart air conditioner. Taking smart air conditioners as an example, by responding to diverse parameter information for control, instead of the user's own operation, the user experience can be improved, and energy saving purposes can also be achieved in some cases, such as the local hot weather suddenly cools down and becomes cold, but the user is unknown At this time, the intelligent air conditioner controls the operation according to the local weather at this time, which can achieve energy saving.
步骤S102,基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整。Step S102: Based on the parameter information, a control action corresponding to the parameter information is obtained through a preset model. The preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result.
上述方案中,可通过强化学习模型产生控制动作,强化学习模型能够依据舒适性评价结果进行调整,使调整产生的控制动作满足用户的舒适性体验。In the above solution, the control action can be generated by the reinforcement learning model, and the reinforcement learning model can be adjusted according to the comfort evaluation result, so that the control action generated by the adjustment meets the user's comfort experience.
一些实施例中,预设模型还可以包括状态转换模型,相应地,所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:In some embodiments, the preset model may further include a state transition model. Accordingly, obtaining the control action corresponding to the parameter information through the preset model based on the parameter information includes:
基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表 征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, which is used to characterize the correspondence between the state parameter and the control action.
图2为本申请一个实施例提供的状态转换模型的结构示意图,如图2所示,状态转换模型20的输入为参数信息,输出为状态参数,比如参数信息包括门窗关闭情况201、天气环境情况202(包括温度、湿度、雨雪等)、房间信息203(包括空间大小、朝向、采光等),状态参数是预设的固定几种,如图2所示,以三种状态参数为例。具体的状态参数可以根据实际情况设定,比如,以智能家电是空调为例,状态参数可以包括温度、湿度和采光,可以是多种参数信息综合确定得到的三种状态。FIG. 2 is a schematic structural diagram of a state transition model provided by an embodiment of the present application. As shown in FIG. 2, the input of the state transition model 20 is parameter information and the output is state parameters. For example, the parameter information includes door and window closing conditions 201 and weather environment conditions. 202 (including temperature, humidity, rain, snow, etc.), room information 203 (including space size, orientation, lighting, etc.), the state parameters are preset and fixed, as shown in FIG. 2, taking three state parameters as examples. The specific status parameters can be set according to the actual situation. For example, taking smart appliances as air conditioners as an example, the status parameters can include temperature, humidity, and lighting, and can be three types of statuses determined by comprehensively determining a variety of parameter information.
状态转换模型20可以是人为设定的逻辑规则、状态对照表、神经网络结构或者三者的混合。输出为对输入信息的简化映射,具体输出参数类型根据实际控制目标而定。一般该过程的确立需要大量的实际案例数据提炼或训练得到。比如,根据实际案例数据提炼可以确定出参数信息为A时对应状态参数B。因此,当获取到参数信息为A时,根据状态转换模型可以获取对应的状态参数为B。The state transition model 20 may be an artificially set logic rule, a state comparison table, a neural network structure, or a mixture of the three. The output is a simplified mapping of the input information, and the specific output parameter type depends on the actual control target. Generally, the establishment of this process requires a large amount of actual case data extraction or training. For example, according to the actual case data extraction, it can be determined that the corresponding state parameter B when the parameter information is A. Therefore, when the parameter information is obtained as A, the corresponding state parameter can be obtained as B according to the state transition model.
通过状态转换,能较好地适用于参数信息复杂众多的情况,将复杂的参数信息转换成具有映射关系的状态参数,状态参数可形成对复杂参数信息的总结,实现数据处理简单化,可以避免强化学习模型面对复杂众多参数信息时的处理压力。Through state conversion, it can be better applied to the situation where the parameter information is complicated. The complex parameter information is converted into state parameters with a mapping relationship. The state parameters can form a summary of the complex parameter information, simplifying data processing, and avoiding Reinforcement learning models face the processing pressure when faced with complex and numerous parameter information.
强化学习模型的输入是状态参数,输出是控制动作。以空调为例,状态参数比如是温度下降,控制动作比如是调高温度。强化学习模型的输出参数的输出概率能够依据舒适性评价结果进行调整,因而强化模型每次输出控制动作可以是舒适性评价结果最好的控制动作。The input of the reinforcement learning model is the state parameter, and the output is the control action. Take the air conditioner as an example, the state parameter is, for example, temperature drop, and the control action is, for example, increasing the temperature. The output probability of the output parameters of the reinforcement learning model can be adjusted according to the comfort evaluation result, so each output control action of the reinforcement model can be the control action with the best comfort evaluation result.
在一个实施例中,所述强化学习模型能够依据舒适性评价结果进行调整,包括:所述强化学习模型输出所述控制动作的概率能够依据舒适性评价结果进行调整。In one embodiment, the reinforcement learning model can be adjusted according to the comfort evaluation result, including: the probability that the reinforcement learning model outputs the control action can be adjusted according to the comfort evaluation result.
可以理解的是,可以根据舒适性评价结果调整控制动作的输出概率,使产生的控制动作可最大概率地适于用户的舒适性体验。It can be understood that the output probability of the control action can be adjusted according to the comfort evaluation result, so that the generated control action can be most suitable for the user's comfort experience.
步骤S103,根据所述控制动作控制运行。Step S103: Control operation according to the control action.
可以理解的是,强化学习模型可以是依据对控制动作的舒适性评价结果进行不断循环更新得到的,因强化学习模型的控制动作能够依据舒适性评价结果进行调整,对应某个具体的状态参数,依据强化学习模型,将与状态参数对应的评价结果最好的控制动作输出,进行控制智能家电运行,使得智能家电执行的控制动作能够实现舒适性控制,进而提升用户体验。It can be understood that the reinforcement learning model can be obtained by continuously updating the comfort evaluation result of the control action, because the control action of the reinforcement learning model can be adjusted according to the comfort evaluation result, corresponding to a specific state parameter, According to the reinforcement learning model, the control action with the best evaluation result corresponding to the state parameter is output to control the operation of the smart home appliance, so that the control action performed by the smart home appliance can achieve comfort control, thereby improving the user experience.
图3为本申请另一个实施例提供的智能家电控制方法的流程示意图,如图3所示,该智能家电控制方法还包括如下步骤:FIG. 3 is a schematic flowchart of a smart home appliance control method according to another embodiment of the present application. As shown in FIG. 3, the smart home appliance control method further includes the following steps:
步骤S104,获取根据所述控制动作控制运行后的所述舒适性评价结果。Step S104: Obtain the comfort evaluation result after the control operation is performed according to the control action.
舒适性评价结果,其反映控制动作执行后给予用户的舒适性体验程度,可以理解的是,控制动作有了新的舒适性评价结果后,该控制动作在智能家电以后运行中的出现情况,会根据该新的舒适性评价结果进行调整。The comfort evaluation result reflects the degree of comfort experience given to the user after the control action is performed. It can be understood that after the control action has a new comfort evaluation result, the occurrence of the control action in the future operation of the smart home appliance will be Adjust based on the new comfort evaluation results.
舒适性评价结果可以为用户的反馈结果,和/或,舒适性评价结果也可以根据预设的算法计算得到。The comfort evaluation result may be a feedback result of the user, and / or the comfort evaluation result may also be calculated according to a preset algorithm.
比如,在一个实施例中,所述获取根据所述控制动作控制智能家电运行后的所述舒适性评价结果,包括:For example, in one embodiment, the acquiring the comfort evaluation result after controlling the operation of the smart home appliance according to the control action includes:
获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
以空调为例,在空调执行控制动作后,用户可以对环境进行直观地舒适性体验评价,以此作为空调控制动作的舒适性评价结果。Taking the air conditioner as an example, after the air conditioner performs a control action, the user can intuitively evaluate the comfort experience of the environment as the result of the comfort evaluation of the air conditioner control action.
又如,在另一个实施例中,所述获取根据所述控制动作运行后的所述舒适性评价结果,包括:For another example, in another embodiment, the acquiring the comfort evaluation result after running according to the control action includes:
获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
以空调为例,空调执行新的控制动作前,其状态参数对应第一舒适性数值,空调执行新的控制动作后,其新的状态参数对应第二舒适性数值,将第一舒适性数值和第二舒适性数值进行数值比较,若第二舒适性数值大于第一舒适性数值,则舒适性评价结果是正向评价结果,该控制动作的舒适性评价较好;若第二舒适性数值小于第一舒适性数值,则舒适性评价结果是负向评价结果,该控制动作的舒适性评价较差。Take the air conditioner as an example. Before the air conditioner performs a new control action, its state parameter corresponds to the first comfort value. After the air conditioner performs a new control action, its new state parameter corresponds to the second comfort value. The second comfort value is compared numerically. If the second comfort value is greater than the first comfort value, the comfort evaluation result is a positive evaluation result, and the comfort evaluation of the control action is better; if the second comfort value is less than the first comfort value, A comfort value is a negative evaluation result, and the comfort evaluation of the control action is poor.
上述方案中,通过预设的舒适性评价算法得到状态参数对应的舒适性数值,舒适性评价算法可以为状态参数与舒适性数值状态的对照表,也可以为公式等。舒适性评价算法中对应各个状态参数可设置相同或不同的权重,通过权重比例对状态参数进行量化以得到对应的舒适性数值。In the above solution, the comfort value corresponding to the state parameter is obtained through a preset comfort evaluation algorithm. The comfort evaluation algorithm may be a comparison table of the state parameter and the comfort value state, or may be a formula or the like. The same or different weights can be set for each state parameter in the comfort evaluation algorithm, and the state parameters are quantified by the weight ratio to obtain the corresponding comfort value.
步骤S105,根据所述舒适性评价结果更新所述强化学习模型。Step S105: Update the reinforcement learning model according to the comfort evaluation result.
可以理解的是,根据舒适性评价结果更新强化学习模型后,强化学习模型产生控制动作的也会相应地调整,可以实现强化学习模型输出评价结果最好的控制动作。It can be understood that after the reinforcement learning model is updated according to the comfort evaluation result, the control action generated by the reinforcement learning model will be adjusted accordingly, and the control action with the best evaluation result output by the reinforcement learning model can be realized.
在一个实施例中,所述舒适性评价结果包括正向评价结果或者负向评价结果,所述根 据所述舒适性评价结果更新所述强化学习模型,包括:In one embodiment, the comfort evaluation result includes a positive evaluation result or a negative evaluation result, and the updating the reinforcement learning model according to the comfort evaluation result includes:
如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
以空调为例,若所述舒适性评价结果为正向评价结果,则表明空调执行该控制动作后的室内环境对用户来说舒适性增加,根据该正向评价,该控制动作在空调以后的运行中,出现的概率得到增加;若所述舒适性评价结果为负向评价结果,则表明空调执行该控制动作后的室内环境对用户来说舒适性降低,根据该负向评价,该控制动作在空调以后的运行中,出现的概率降低。可以理解的是,在空调的实际运行中,以上过程经过多次重复,如果一个控制动作得到多次正向评价,说明该控制动作执行后的室内环境给用户的舒适性体验确实非常好,空调以后执行该控制动作的概率也非常高,随之而然地可实现空调的控制动作会朝着舒适性最优的方向调整,从而实现改善空调运行的舒适性控制,也提升了用户体验。Taking the air conditioner as an example, if the comfort evaluation result is a positive evaluation result, it indicates that the indoor environment after the air conditioner performs the control action increases comfort for the user. According to the positive evaluation, the control action is performed after the air conditioner. During operation, the probability of occurrence is increased; if the comfort evaluation result is a negative evaluation result, it indicates that the indoor environment after the air conditioner performs the control action is less comfortable for the user. According to the negative evaluation, the control action In the subsequent operation of the air conditioner, the probability of occurrence is reduced. It can be understood that in the actual operation of the air conditioner, the above process is repeated many times. If a control action is repeatedly evaluated multiple times, it means that the indoor environment after the execution of the control action gives the user a very good comfort experience. The probability of performing this control action in the future is also very high, and then the control action of the air conditioner can be adjusted in the direction of optimal comfort, thereby achieving comfort control to improve the operation of the air conditioner and improving the user experience.
图4为本申请另一个实施例提供的智能家电控制方法的流程示意图,如图4所示,该智能家电控制方法包括如下步骤:FIG. 4 is a schematic flowchart of a smart home appliance control method according to another embodiment of the present application. As shown in FIG. 4, the smart home appliance control method includes the following steps:
S21、获取参数信息,包括:获取环境参数信息,和/或,获取智能家电自身参数信息。S21. Obtaining parameter information includes: obtaining environmental parameter information, and / or obtaining own parameter information of the smart home appliance.
S22、基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括状态转换模型和强化学习模型,其中,所述强化学习模型能够依据舒适性评价结果进行调整;S22. Based on the parameter information, a control action corresponding to the parameter information is obtained through a preset model. The preset model includes a state transition model and a reinforcement learning model, wherein the reinforcement learning model can be based on a comfort evaluation result. Make adjustments
通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining a state parameter corresponding to the parameter information through the state transition model, where the state transition model is used to represent a correspondence between the parameter information and the state parameter;
基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
S23、根据所述控制动作控制运行。S23. Control operation according to the control action.
S24、获取根据所述控制动作控制运行后的所述舒适性评价结果,包括:S24. Obtaining the comfort evaluation result after the control operation according to the control action includes:
获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
S25、所述舒适性评价结果包括正向评价结果或者负向评价结果,根据所述舒适性评价结果更新所述强化学习模型,包括:如果所述舒适性评价结果为正向评价结果,则增大 所述控制动作的输出概率;或者,如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。S25. The comfort evaluation result includes a positive evaluation result or a negative evaluation result, and updating the reinforcement learning model according to the comfort evaluation result includes: if the comfort evaluation result is a positive evaluation result, increasing Increase the output probability of the control action; or reduce the output probability of the control action if the comfort evaluation result is a negative evaluation result.
可以理解是,步骤S24,获取根据所述控制动作控制运行后的所述舒适性评价结果,还可以包括:It can be understood that, in step S24, acquiring the comfort evaluation result after the control operation according to the control action may further include:
获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。将所述用户反馈的评价作为对所述控制动作的评价。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action. The evaluation by the user feedback is used as the evaluation of the control action.
可以理解的是,智能家电的实际运行中,以上步骤过程经过多次重复,可实现智能家电的控制动作会朝着用户体验最优的方向调整,从而改善智能家电运行的舒适性控制,使智能家电控制更加精准,进而提升用户体验。It can be understood that in the actual operation of smart home appliances, the above steps are repeated many times, and the control actions of the smart home appliances can be adjusted in the direction of the optimal user experience, thereby improving the comfort control of smart home appliance operation and making smart Appliance control is more precise, which improves user experience.
需要指出的是,上述智能家电控制方法的实施例的包括并不限于应用于智能空调实施例,还可以应用于其他智能家电,如智能空气净化器等。It should be pointed out that the above embodiments of the smart home appliance control method are not limited to being applied to the smart air conditioner embodiment, and can also be applied to other smart home appliances, such as smart air purifiers.
图5为本申请一个实施例提供的智能家电控制装置的结构示意图,如图5所示,该智能家电控制装置5包括:FIG. 5 is a schematic structural diagram of a smart home appliance control device according to an embodiment of the present application. As shown in FIG. 5, the smart home appliance control device 5 includes:
第一获取模块51,用于获取参数信息;A first acquiring module 51, configured to acquire parameter information;
第二获取模块52,基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整;The second obtaining module 52 obtains a control action corresponding to the parameter information through a preset model based on the parameter information. The preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result. ;
控制模块53,用于根据所述控制动作控制运行。The control module 53 is configured to control operation according to the control action.
可以理解的是,上述智能家电控制装置5,通过第一获取模块51获取参数信息,第二获取模块52基于参数信息获取智能家电的控制动作,第二获取模块52中,因强化学习模型能够依据舒适性评价结果进行调整,可实现根据强化学习模型产生的控制动作是舒适性评价结果最好的控制动作,因而控制模块53根据控制动作控制智能家电的运行适于用户舒适性体验。It can be understood that the smart home appliance control device 5 obtains parameter information through the first obtaining module 51, and the second obtaining module 52 obtains control actions of the smart home appliance based on the parameter information. In the second obtaining module 52, the reinforcement learning model can be based on The comfort evaluation result is adjusted to realize that the control action generated according to the reinforcement learning model is the control action with the best comfort evaluation result. Therefore, the control module 53 controls the operation of the smart home appliance according to the control action to suit the user ’s comfort experience.
在一个实施例中,所述第一获取模块51具体用于:In one embodiment, the first obtaining module 51 is specifically configured to:
获取环境参数信息,和/或,Obtain environmental parameter information, and / or,
获取智能家电自身参数信息。Get the parameter information of the smart home appliance.
可以理解的是,参数信息与智能家电控制动作的舒适性相关,参数信息可以是环境参数信息,在一个实施例中,所述环境参数信息可以为智能家电自身采集和/或配置的环境参数信息,智能家电可以从自身获取到环境参数信息。比如智能家电利用自身配置的传感器采集的环境参数信息,如室内的温度、湿度、颗粒信息;比如配置在智能家电中的房间信息,如房间的空间大小、朝向、采光等信息。或者,所述环境参数信息也可以为智能家电 的外界设备采集和/或配置的。智能家电可以接收外界设备发送的环境参数信息,比如智能家电通过网络接收云服务器发送的当地天气信息,如当地的温度、湿度、雨雪等信息。或者,智能家电也可以与其他智能家电、传感器进行关联,接收其他智能家电和传感器采集的环境参数信息。比如与其他智能家电关联,接收其他智能家电采集的温度、湿度等信息;比如与门窗传感器进行关联,门窗传感器获得门窗的开关状态信息,然后智能家电接收门窗传感器发送的开关状态信息;比如也可以是智能家电接收智能家居系统的控制中心发送的信息,比如配置在智能家居系统的控制中心的房间信息。It can be understood that the parameter information is related to the comfort of the smart home appliance control action, and the parameter information may be environmental parameter information. In one embodiment, the environmental parameter information may be environmental parameter information collected and / or configured by the smart home appliance itself. , Smart appliances can obtain environmental parameter information from themselves. For example, smart home appliances use their own configured sensors to collect environmental parameter information, such as indoor temperature, humidity, and particle information; for example, room information configured in smart home appliances, such as room size, orientation, and lighting. Alternatively, the environmental parameter information may also be collected and / or configured for external devices of the smart home appliance. Smart appliances can receive environmental parameter information from external devices, such as smart appliances receiving local weather information, such as local temperature, humidity, rain, and snow, sent by cloud servers through the network. Alternatively, the smart home appliance may also be associated with other smart home appliances and sensors, and receive environmental parameter information collected by other smart home appliances and sensors. For example, it is associated with other smart home appliances and receives temperature and humidity information collected by other smart home appliances. For example, it is associated with door and window sensors. The door and window sensors obtain the door and window switch status information. Then the smart home appliances receive the switch status information sent by the door and window sensors. It is the smart home appliance that receives the information sent by the control center of the smart home system, such as room information configured in the control center of the smart home system.
上述的环境参数信息从获取角度来讲,可以是智能家电从自身获取的,也可以是智能家电从其他设备获取的。From the perspective of acquisition, the above-mentioned environmental parameter information may be obtained by the smart home appliance from itself or by other appliances.
从具体信息角度来讲,在一个具体的实施例中,环境参数信息可以包括以下项中的至少一项:当地天气信息,比如温度、湿度、雨雪等;所述智能家电所在房间的房间信息,比如空间大小、朝向、采光情况等;所述智能家电所在房间的其他设备的信息,比如门窗状态信息,比如门窗打开或关闭。From the perspective of specific information, in a specific embodiment, the environmental parameter information may include at least one of the following: local weather information, such as temperature, humidity, rain, snow, etc .; room information of the room where the smart home appliance is located , Such as space size, orientation, lighting conditions, etc .; information about other devices in the room where the smart appliance is located, such as door and window status information, such as doors or windows open or closed.
另外,参数信息也可以是智能家电自身参数信息,比如智能家电的运行时长信息等。In addition, the parameter information may also be parameter information of the smart home appliance, such as the running time information of the smart home appliance.
通过上述实施例,可以实现参数信息获得的多样化,多样化的参数信息综合用于控制智能家电,智能家电通过响应多样化的参数信息进行控制,代替用户亲自操作,可以提升用户体验。Through the above embodiments, diversification of parameter information can be achieved, and the diversified parameter information is comprehensively used to control smart home appliances. Smart home appliances are controlled by responding to the diversified parameter information, instead of being operated by users themselves, which can improve the user experience.
在一个具体的实施例中,所述智能家电包括但不限于智能空调。以智能空调为例,通过响应多样化的参数信息进行控制,代替用户亲自操作,可以提升用户体验,也可以在一些情况下实现节能目的,比如当地炎热天气突然降温变凉,而用户未知的情况下,此时智能空调根据当地天气控制运行,可以实现节能。In a specific embodiment, the smart home appliance includes, but is not limited to, a smart air conditioner. Taking smart air conditioners as an example, by responding to diverse parameter information for control, instead of the user's own operation, the user experience can be improved, and energy saving purposes can also be achieved in some cases, such as the local hot weather suddenly cools down and becomes cold, but the user is unknown At this time, the intelligent air conditioner controls the operation according to the local weather at this time, which can achieve energy saving.
在一个实施例中,所述第二获取模块52中,In one embodiment, in the second obtaining module 52,
优选的,所述第二获取模块52中,所述预设模型还包括状态转换模型;Preferably, in the second acquisition module 52, the preset model further includes a state transition model;
所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:The obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
图2为本申请一个实施例提供的状态转换模型的结构示意图,如图2所示,状态转换模型20的输入为参数信息,输出为状态参数,比如参数信息包括门窗关闭情况201、天气环境情况202(包括温度、湿度、雨雪等)、房间信息203(包括空间大小、朝向、采光等), 状态参数是预设的固定几种,如图2所示,以三种状态参数为例。具体的状态参数可以根据实际情况设定,比如,以智能家电是空调为例,状态参数可以包括温度、湿度和采光,可以是多种参数信息综合确定得到的三种状态。FIG. 2 is a schematic structural diagram of a state transition model provided by an embodiment of the present application. As shown in FIG. 2, the input of the state transition model 20 is parameter information and the output is state parameters. For example, the parameter information includes door and window closing conditions 201 and weather environment conditions. 202 (including temperature, humidity, rain, snow, etc.), room information 203 (including space size, orientation, lighting, etc.), the state parameters are preset and fixed, as shown in FIG. 2, taking three state parameters as examples. The specific status parameters can be set according to the actual situation. For example, taking smart appliances as air conditioners as an example, the status parameters can include temperature, humidity, and lighting, and can be three types of statuses determined by comprehensively determining a variety of parameter information.
状态转换模型20可以是人为设定的逻辑规则、状态对照表、神经网络结构或者三者的混合。输出为对输入信息的简化映射,具体输出参数类型根据实际控制目标而定。一般该过程的确立需要大量的实际案例数据提炼或训练得到。比如,根据实际案例数据提炼可以确定出参数信息为A时对应状态参数B。因此,当获取到参数信息为A时,根据状态转换模型可以获取对应的状态参数为B。The state transition model 20 may be an artificially set logic rule, a state comparison table, a neural network structure, or a mixture of the three. The output is a simplified mapping of the input information, and the specific output parameter type depends on the actual control target. Generally, the establishment of this process requires a large amount of actual case data extraction or training. For example, according to the actual case data extraction, it can be determined that the corresponding state parameter B when the parameter information is A. Therefore, when the parameter information is obtained as A, the corresponding state parameter can be obtained as B according to the state transition model.
通过状态转换,可以适用于参数信息复杂众多的情况,将复杂的参数信息转换成具有映射关系的状态参数,状态参数可形成对复杂参数信息的总结,实现数据处理简单化,可以避免强化学习模型面对复杂众多参数信息时的处理压力。Through state conversion, it can be applied to the situation where the parameter information is complicated. The complex parameter information is converted into state parameters with a mapping relationship. The state parameters can form a summary of the complex parameter information, simplify the data processing, and avoid the reinforcement learning model. Processing pressure in the face of complex and numerous parameter information.
强化学习模型的输入是状态参数,输出是控制动作。以空调为例,状态参数比如是温度下降,控制动作比如是调高温度。强化学习模型的输出参数的输出概率能够依据舒适性评价结果进行调整,因而强化模型每次输出控制动作可以是舒适性评价结果最好的控制动作。The input of the reinforcement learning model is the state parameter, and the output is the control action. Take the air conditioner as an example, the state parameter is, for example, temperature drop, and the control action is, for example, increasing the temperature. The output probability of the output parameters of the reinforcement learning model can be adjusted according to the comfort evaluation result, so each output control action of the reinforcement model can be the control action with the best comfort evaluation result.
图6为本申请另一个实施例提供的智能家电控制装置的结构示意图,如图6所示,该智能家电控制装置5还包括:FIG. 6 is a schematic structural diagram of a smart home appliance control device according to another embodiment of the present application. As shown in FIG. 6, the smart home appliance control device 5 further includes:
评价模块54,用于获取根据所述控制动作控制运行后的所述舒适性评价结果;An evaluation module 54 configured to obtain the comfort evaluation result after the control operation is performed according to the control action;
更新模块55,用于根据所述舒适性评价结果更新所述强化学习模型。An update module 55 is configured to update the reinforcement learning model according to the comfort evaluation result.
可以理解的是,通过评价模块54获取控制动作的评价结果,控制动作有新的舒适性评价结果后,对于该控制动作在智能家电以后运行中的出现情况,可以根据新的舒适性评价结果进行调整。通过更新模块55,根据舒适性评价结果更新强化学习模型后,强化学习模型产生控制动作的也会相应地调整,实现产生舒适性评价结果最好的控制动作。通过上述各模块,在智能家电的实际运行中,以上过程经过多次重复,可实现智能家电的控制动作会朝着用户舒适性体验最优的方向调整,从而改善智能家电运行的舒适性控制,使智能家电控制更加精准,进而提升用户体验。It can be understood that the evaluation result of the control action is obtained through the evaluation module 54. After the control action has a new comfort evaluation result, the appearance of the control action in the subsequent operation of the smart home appliance can be performed based on the new comfort evaluation result. Adjustment. Through the update module 55, after the reinforcement learning model is updated according to the comfort evaluation result, the control action generated by the reinforcement learning model will be adjusted accordingly, so as to achieve the control action with the best comfort evaluation result. Through the above modules, in the actual operation of the smart home appliance, the above process is repeated many times, and the control action of the smart home appliance can be adjusted in the direction of optimal user comfort experience, thereby improving the comfort control of the operation of the smart home appliance. Make smart home appliances more precise and improve user experience.
在一个实施例中,所述评价模块54具体用于:In one embodiment, the evaluation module 54 is specifically configured to:
获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
以空调为例,空调执行新的控制动作前,其状态参数对应第一舒适性数值,空调执行新的控制动作后,其新的状态参数对应第二舒适性数值,将第一舒适性数值和第二舒适性数值进行数值比较,若第二舒适性数值大于第一舒适性数值,则舒适性评价结果是正向评价结果,该控制动作的舒适性评价较好;若第二舒适性数值小于第一舒适性数值,则舒适性评价结果是负向评价结果,该控制动作的舒适性评价较差。Take the air conditioner as an example. Before the air conditioner performs a new control action, its state parameter corresponds to the first comfort value. After the air conditioner performs a new control action, its new state parameter corresponds to the second comfort value. The second comfort value is compared numerically. If the second comfort value is greater than the first comfort value, the comfort evaluation result is a positive evaluation result, and the comfort evaluation of the control action is better; if the second comfort value is less than the first comfort value, A comfort value is a negative evaluation result, and the comfort evaluation of the control action is poor.
上述方案中,通过预设的舒适性评价算法得到状态参数对应的舒适性数值,舒适性评价算法可以为状态参数与舒适性数值状态的对照表,也可以为公式等方式。舒适性评价算法中对应各个状态参数可设置相同或不同的权重,通过权重比例对状态参数进行量化以得到对应的舒适性数值。In the above solution, the comfort value corresponding to the state parameter is obtained through a preset comfort evaluation algorithm. The comfort evaluation algorithm may be a comparison table between the state parameter and the comfort value state, or a formula or the like. The same or different weights can be set for each state parameter in the comfort evaluation algorithm, and the state parameters are quantified by the weight ratio to obtain the corresponding comfort value.
在一个实施例中,所述评价模块54还具体用于:In one embodiment, the evaluation module 54 is further specifically configured to:
获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
以空调为例,在空调执行控制动作后,用户可以对环境进行直观地舒适性体验评价,以此作为空调控制动作的舒适性评价结果。Taking the air conditioner as an example, after the air conditioner performs a control action, the user can intuitively evaluate the comfort experience of the environment as the result of the comfort evaluation of the air conditioner control action.
在一个实施例中,所述更新模块55具体用于:In one embodiment, the update module 55 is specifically configured to:
所述舒适性评价结果包括正向评价结果或者负向评价结果;The comfort evaluation result includes a positive evaluation result or a negative evaluation result;
如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
以空调为例,若所述舒适性评价结果为正向评价结果,则表明空调执行该控制动作后的室内环境对用户来说舒适性增加,根据该正向评价,该控制动作在空调以后的运行中,出现的概率得到增加;若所述舒适性评价结果为负向评价结果,则表明空调执行该控制动作后的室内环境对用户来说舒适性降低,根据该负向评价,该控制动作在空调以后的运行中,出现的概率降低。可以理解的是,在空调的实际运行中,以上过程经过多次重复,如果一个控制动作得到多次正向评价,说明该控制动作执行后的室内环境给用户的舒适性体验确实非常好,空调以后执行该控制动作的概率也非常高,随之而然地可实现空调的控制动作会朝着舒适性最优的方向调整,从而改善空调运行的舒适性控制,也提升了用户体验。Taking the air conditioner as an example, if the comfort evaluation result is a positive evaluation result, it indicates that the indoor environment after the air conditioner performs the control action increases comfort for the user. According to the positive evaluation, the control action is performed after the air conditioner. During operation, the probability of occurrence is increased; if the comfort evaluation result is a negative evaluation result, it indicates that the indoor environment after the air conditioner performs the control action is less comfortable for the user. According to the negative evaluation, the control action In the subsequent operation of the air conditioner, the probability of occurrence is reduced. It can be understood that in the actual operation of the air conditioner, the above process is repeated many times. If a control action is repeatedly evaluated multiple times, it means that the indoor environment after the execution of the control action is really very comfortable for the user. The probability of performing this control action in the future is also very high, and then the control action of the air conditioner can be adjusted in the direction of optimal comfort, thereby improving the comfort control of the air conditioner operation and improving the user experience.
需要指出的是,上述智能家电控制装置5的应用实施例的包括但并不限于应用于智能空调实施例,还可以应用于其他智能家电,如智能空气净化器等。It should be noted that the above-mentioned application embodiments of the smart home appliance control device 5 include, but are not limited to, the embodiments of smart air conditioners, and can also be applied to other smart home appliances, such as smart air purifiers.
可以理解的是,上述各实施例中相同或相似部分可以相互参考,在一些实施例中未详细说明的内容可以参见其他实施例中相同或相似的内容。It can be understood that the same or similar parts in the above embodiments can be referred to each other. For the content that is not described in detail in some embodiments, refer to the same or similar content in other embodiments.
需要说明的是,在本申请的描述中,术语“第一”、“第二”等仅用于描述目的,而不能 理解为指示或暗示相对重要性。此外,在本申请的描述中,除非另有说明,“多个”的含义是指至少两个。It should be noted that, in the description of the present application, the terms "first", "second" and the like are only used for descriptive purposes, and cannot be understood to indicate or imply relative importance. In addition, in the description of this application, unless otherwise stated, the meaning of "a plurality" means at least two.
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in a flowchart or otherwise described herein can be understood as representing a module, fragment, or portion of code that includes one or more executable instructions for implementing a particular logical function or step of a process And, the scope of the preferred embodiments of the present application includes additional implementations, in which the functions may be performed out of the order shown or discussed, including performing functions in a substantially simultaneous manner or in the reverse order according to the functions involved, which should It is understood by those skilled in the art to which the embodiments of the present application pertain.
应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that each part of the application may be implemented by hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it may be implemented using any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gate circuits, programmable gate arrays (PGA), field programmable gate arrays (FPGA), etc.
本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。A person of ordinary skill in the art can understand that all or part of the steps carried by the methods in the foregoing embodiments may be implemented by a program instructing related hardware. The program may be stored in a computer-readable storage medium. The program is When executed, one or a combination of the steps of the method embodiment is included.
此外,在本申请各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist separately physically, or two or more units may be integrated into one module. The above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.
上述提到的存储介质可以是只读存储器,磁盘或光盘等。The aforementioned storage medium may be a read-only memory, a magnetic disk, or an optical disk.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, the description with reference to the terms “one embodiment”, “some embodiments”, “examples”, “specific examples”, or “some examples” and the like means specific features described in conjunction with the embodiments or examples , Structure, materials, or features are included in at least one embodiment or example of the present application. In this specification, the schematic expressions of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
尽管上面已经示出和描述了本申请的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present application have been shown and described above, it can be understood that the above embodiments are exemplary and should not be construed as limitations on the present application. Those skilled in the art can interpret the above within the scope of the present application. Embodiments are subject to change, modification, substitution, and modification.

Claims (18)

  1. 一种智能家电控制方法,其特征在于,包括:A method for controlling a smart home appliance, comprising:
    获取参数信息;Get parameter information;
    基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整;Based on the parameter information, a control action corresponding to the parameter information is obtained through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
    根据所述控制动作控制运行。The operation is controlled according to the control action.
  2. 根据权利要求1所述的智能家电控制方法,其特征在于,所述获取参数信息包括:The smart home appliance control method according to claim 1, wherein the acquiring parameter information comprises:
    获取环境参数信息,和/或,Obtain environmental parameter information, and / or,
    获取智能家电自身参数信息。Get the parameter information of the smart home appliance.
  3. 根据权利要求2所述的智能家电控制方法,其特征在于,所述获取环境参数信息包括:The method for controlling a smart home appliance according to claim 2, wherein the acquiring environmental parameter information comprises:
    获取智能家电自身采集和/或配置的环境参数信息;和/或Obtain the environmental parameter information collected and / or configured by the smart appliance itself; and / or
    获取智能家电的外界设备采集和/或配置的环境参数信息。Obtain environmental parameter information collected and / or configured by external devices of smart appliances.
  4. 权利要求1至3任一项所述的智能家电控制方法,其特征在于,所述预设模型还包括状态转换模型;The smart home appliance control method according to any one of claims 1 to 3, wherein the preset model further includes a state transition model;
    所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:The obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
    基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
    基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  5. 根据权利要求4所述的智能家电控制方法,其特征在于,The smart home appliance control method according to claim 4, wherein:
    所述状态转换模型包括状态对照表、神经网络模型和预设逻辑规则中的一种或者多种。The state transition model includes one or more of a state comparison table, a neural network model, and a preset logic rule.
  6. 根据权利要求4所述的智能家电控制方法,其特征在于,所述强化学习模型能够依据舒适性评价结果进行调整,包括:The smart home appliance control method according to claim 4, wherein the reinforcement learning model can be adjusted according to a comfort evaluation result, comprising:
    所述强化学习模型输出所述控制动作的概率能够依据舒适性评价结果进行调整。The probability that the reinforcement learning model outputs the control action can be adjusted according to the comfort evaluation result.
  7. 权利要求4所述的智能家电控制方法,其特征在于,还包括:The smart home appliance control method according to claim 4, further comprising:
    获取根据所述控制动作控制运行后的所述舒适性评价结果;Acquiring the comfort evaluation result after controlling operation according to the control action;
    根据所述舒适性评价结果更新所述强化学习模型。Updating the reinforcement learning model according to the comfort evaluation result.
  8. 权利要求7所述的智能家电控制方法,其特征在于,所述获取根据所述控制动作运 行后的所述舒适性评价结果,包括:The method for controlling a smart home appliance according to claim 7, wherein the obtaining the comfort evaluation result after running according to the control action comprises:
    获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
    根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
    根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
  9. 根据权利要求8所述的智能家电控制方法,其特征在于,所述舒适性评价算法中对应各个状态参数设置相同或不同的权重。The smart home appliance control method according to claim 8, wherein the comfort evaluation algorithm sets the same or different weights corresponding to each state parameter.
  10. 根据权利要求7或8所述的智能家电控制方法,其特征在于,所述获取根据所述控制动作运行后的所述舒适性评价结果,包括:The method for controlling a smart home appliance according to claim 7 or 8, wherein the acquiring the comfort evaluation result after running according to the control action comprises:
    获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  11. 根据权利要求7或8所述的智能家电控制方法,其特征在于,所述舒适性评价结果包括正向评价结果或者负向评价结果,所述根据所述舒适性评价结果更新所述强化学习模型,包括:The smart home appliance control method according to claim 7 or 8, wherein the comfort evaluation result comprises a positive evaluation result or a negative evaluation result, and the reinforcement learning model is updated according to the comfort evaluation result ,include:
    如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
    如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
  12. 一种智能家电控制装置,其特征在于,包括:A smart home appliance control device is characterized in that it includes:
    第一获取模块,用于获取参数信息;A first obtaining module, configured to obtain parameter information;
    第二获取模块,基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,所述预设模型包括强化学习模型,所述强化学习模型能够依据舒适性评价结果进行调整;A second acquisition module, based on the parameter information, obtaining a control action corresponding to the parameter information through a preset model, the preset model includes a reinforcement learning model, and the reinforcement learning model can be adjusted according to a comfort evaluation result;
    控制模块,用于根据所述控制动作控制运行。A control module, configured to control operation according to the control action.
  13. 根据权利要求12所述的智能家电控制装置,其特征在于,所述第一获取模块具体用于:The smart home appliance control device according to claim 12, wherein the first acquisition module is specifically configured to:
    获取环境参数信息,和/或,Obtain environmental parameter information, and / or,
    获取智能家电自身参数信息。Get the parameter information of the smart home appliance.
  14. 根据权利要求12所述的智能家电控制装置,其特征在于,所述第二获取模块中,所述预设模型还包括状态转换模型;The smart home appliance control device according to claim 12, wherein in the second acquisition module, the preset model further comprises a state transition model;
    所述基于所述参数信息,通过预设模型获取与所述参数信息对应的控制动作,包括:The obtaining a control action corresponding to the parameter information based on the parameter information through a preset model includes:
    基于所述参数信息,通过所述状态转换模型,获取与所述参数信息对应的状态参数,所述状态转换模型用于表征参数信息与状态参数之间的对应关系;Obtaining state parameters corresponding to the parameter information through the state transition model based on the parameter information, and the state transition model is used to represent a correspondence between the parameter information and the state parameters;
    基于所述状态参数,通过所述强化学习模型产生控制动作,所述强化学习模型用于表征状态参数与控制动作之间的对应关系。Based on the state parameter, a control action is generated by the reinforcement learning model, and the reinforcement learning model is used to represent a correspondence between the state parameter and the control action.
  15. 根据权利要求12至14任一项所述的智能家电控制装置,其特征在于,所述智能家电控制装置还包括:The smart home appliance control device according to any one of claims 12 to 14, wherein the smart home appliance control device further comprises:
    评价模块,用于获取根据所述控制动作控制运行后的所述舒适性评价结果;An evaluation module, configured to obtain the comfort evaluation result after controlling operation according to the control action;
    更新模块,用于根据所述舒适性评价结果更新所述强化学习模型。An update module is configured to update the reinforcement learning model according to the comfort evaluation result.
  16. 根据权利要求15所述的智能家电控制装置,其特征在于,所述评价模块具体用于:The smart home appliance control device according to claim 15, wherein the evaluation module is specifically configured to:
    获取根据所述控制动作执行相应操作前后的状态参数;Obtaining state parameters before and after performing a corresponding operation according to the control action;
    根据预设的舒适性评价算法,计算第一舒适性数值和第二舒适性数值,其中,所述第一舒适性数值为根据所述控制动作执行相应操作前的状态参数对应的舒适性数值,所述第二舒适性数值为根据所述控制动作执行相应操作后的状态参数对应的舒适性数值;Calculating a first comfort value and a second comfort value according to a preset comfort evaluation algorithm, wherein the first comfort value is a comfort value corresponding to a state parameter before performing a corresponding operation according to the control action, The second comfort value is a comfort value corresponding to a state parameter after performing a corresponding operation according to the control action;
    根据所述第一舒适性数值和所述第二舒适性数值,得到所述舒适性评价结果。According to the first comfort value and the second comfort value, the comfort evaluation result is obtained.
  17. 根据权利要求15所述的智能家电控制装置,其特征在于,所述评价模块还具体用于:The smart home appliance control device according to claim 15, wherein the evaluation module is further configured to:
    获取根据所述控制动作执行相应的操作后,用户反馈的舒适性评价结果。A comfort evaluation result fed back by a user is obtained after a corresponding operation is performed according to the control action.
  18. 根据权利要求15所述的智能家电控制装置,其特征在于,The control device for a smart home appliance according to claim 15, wherein:
    所述更新模块具体用于:The update module is specifically configured to:
    所述舒适性评价结果包括正向评价结果或者负向评价结果;The comfort evaluation result includes a positive evaluation result or a negative evaluation result;
    如果所述舒适性评价结果为正向评价结果,则增大所述控制动作的输出概率;或者,If the comfort evaluation result is a positive evaluation result, increasing the output probability of the control action; or,
    如果所述舒适性评价结果为负向评价结果,则减小所述控制动作的输出概率。If the comfort evaluation result is a negative evaluation result, the output probability of the control action is reduced.
PCT/CN2018/122256 2018-07-06 2018-12-20 Intelligent household electrical appliance control method and intelligent household electrical appliance control device WO2020006993A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810734605.1A CN110687802A (en) 2018-07-06 2018-07-06 Intelligent household electrical appliance control method and intelligent household electrical appliance control device
CN201810734605.1 2018-07-06

Publications (1)

Publication Number Publication Date
WO2020006993A1 true WO2020006993A1 (en) 2020-01-09

Family

ID=69060037

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/122256 WO2020006993A1 (en) 2018-07-06 2018-12-20 Intelligent household electrical appliance control method and intelligent household electrical appliance control device

Country Status (2)

Country Link
CN (1) CN110687802A (en)
WO (1) WO2020006993A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113495488A (en) * 2020-04-07 2021-10-12 佛山市顺德区美的洗涤电器制造有限公司 Control method of household appliance, household appliance and computer readable storage medium

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113375300A (en) * 2020-03-09 2021-09-10 青岛海尔空调器有限总公司 Intelligent control method and intelligent control equipment of air conditioner
CN111338227B (en) * 2020-05-18 2020-12-01 南京三满互联网络科技有限公司 Electronic appliance control method and control device based on reinforcement learning and storage medium
CN111913400B (en) * 2020-07-28 2024-04-30 深圳Tcl新技术有限公司 Information fusion method, device and computer readable storage medium
CN114675552A (en) * 2022-03-10 2022-06-28 深圳亿思腾达集成股份有限公司 Intelligent home management method and system based on deep learning algorithm

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104597761A (en) * 2014-12-31 2015-05-06 珠海格力电器股份有限公司 Control method and device of intelligent household appliances
CN105068515A (en) * 2015-07-16 2015-11-18 华南理工大学 Intelligent household equipment voice control method based on self-learning algorithm
JP2016023915A (en) * 2014-07-24 2016-02-08 パナソニックIpマネジメント株式会社 Environment control device, program
CN105371425A (en) * 2015-10-12 2016-03-02 美的集团股份有限公司 Air conditioner
CN106369739A (en) * 2016-08-23 2017-02-01 海信(山东)空调有限公司 Air conditioner control method, air conditioner controller and air conditioner system
CN106549833A (en) * 2015-09-21 2017-03-29 阿里巴巴集团控股有限公司 A kind of control method and device of intelligent home device
CN107797459A (en) * 2017-09-15 2018-03-13 珠海格力电器股份有限公司 Control method, device, storage medium and the processor of terminal device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102213966A (en) * 2010-04-09 2011-10-12 宁波中科集成电路设计中心有限公司 Wireless intelligent measurement and control system of greenhouse
US10354345B2 (en) * 2012-01-23 2019-07-16 Whisker Labs, Inc. Optimizing and controlling the energy consumption of a building
CN103186677A (en) * 2013-04-15 2013-07-03 北京百纳威尔科技有限公司 Information display method and information display device
CN106598058A (en) * 2016-12-20 2017-04-26 华北理工大学 Intrinsically motivated extreme learning machine autonomous development system and operating method thereof
CN106842925B (en) * 2017-01-20 2019-10-11 清华大学 A kind of locomotive smart steering method and system based on deeply study

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016023915A (en) * 2014-07-24 2016-02-08 パナソニックIpマネジメント株式会社 Environment control device, program
CN104597761A (en) * 2014-12-31 2015-05-06 珠海格力电器股份有限公司 Control method and device of intelligent household appliances
CN105068515A (en) * 2015-07-16 2015-11-18 华南理工大学 Intelligent household equipment voice control method based on self-learning algorithm
CN106549833A (en) * 2015-09-21 2017-03-29 阿里巴巴集团控股有限公司 A kind of control method and device of intelligent home device
CN105371425A (en) * 2015-10-12 2016-03-02 美的集团股份有限公司 Air conditioner
CN106369739A (en) * 2016-08-23 2017-02-01 海信(山东)空调有限公司 Air conditioner control method, air conditioner controller and air conditioner system
CN107797459A (en) * 2017-09-15 2018-03-13 珠海格力电器股份有限公司 Control method, device, storage medium and the processor of terminal device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113495488A (en) * 2020-04-07 2021-10-12 佛山市顺德区美的洗涤电器制造有限公司 Control method of household appliance, household appliance and computer readable storage medium
WO2021204117A1 (en) * 2020-04-07 2021-10-14 佛山市顺德区美的洗涤电器制造有限公司 Household appliance control method, household appliance, and computer-readable storage medium

Also Published As

Publication number Publication date
CN110687802A (en) 2020-01-14

Similar Documents

Publication Publication Date Title
WO2020006993A1 (en) Intelligent household electrical appliance control method and intelligent household electrical appliance control device
WO2021000790A1 (en) Smart home control method and smart home control apparatus
WO2020006982A1 (en) Method for controlling cooperation between home appliances
CN105700389B (en) Intelligent home natural language control method
KR102393418B1 (en) Data learning server and method for generating and using thereof
CN105068515B (en) A kind of intelligent home device sound control method based on self-learning algorithm
CN110578994B (en) Operation method and device
WO2016180163A1 (en) Household control and adjustment method and system
CN109269036B (en) Cloud control method of multi-split air conditioner and multi-split air conditioner system
CN105652677B (en) A kind of intelligent home furnishing control method based on user behavior analysis, device and system
WO2022267671A1 (en) Air conditioner operation mode pushing method and apparatus, and air conditioner
CN104991537B (en) Control method of intelligent equipment
CN110966733A (en) Intelligent control method and system for indoor environment and storage medium
CN105652826A (en) Intelligent household control method, controller, mobile terminal and system thereof
CN111338227B (en) Electronic appliance control method and control device based on reinforcement learning and storage medium
CN107966960A (en) The control method and device of a kind of intelligent appliance
CN109595756A (en) The control method and air-conditioning of air-conditioning
WO2022242265A1 (en) Method and apparatus for controlling display temperature of air conditioner, and air conditioner
Qin et al. Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning
CN111158255A (en) Method, system and related medium for configuring intelligent device
JP2020517856A (en) Windlessness control method, device and computer-readable storage medium
CN110018641A (en) A kind of long-range jump point control method and system
CN112327648B (en) Control method, apparatus and computer-readable storage medium for home appliances
WO2024001196A1 (en) Household appliance control method and apparatus, storage medium, and electronic apparatus
JP7305041B2 (en) Information processing equipment and air conditioning system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18925437

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18925437

Country of ref document: EP

Kind code of ref document: A1