CN113706029A - Self-adaptive management method, device and system for household energy - Google Patents

Self-adaptive management method, device and system for household energy Download PDF

Info

Publication number
CN113706029A
CN113706029A CN202111013941.5A CN202111013941A CN113706029A CN 113706029 A CN113706029 A CN 113706029A CN 202111013941 A CN202111013941 A CN 202111013941A CN 113706029 A CN113706029 A CN 113706029A
Authority
CN
China
Prior art keywords
decision
model
meta
environment
management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111013941.5A
Other languages
Chinese (zh)
Inventor
招景明
潘峰
杨雨瑶
马键
吴敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Power Grid Co Ltd
Measurement Center of Guangdong Power Grid Co Ltd
Metrology Center of Guangdong Power Grid Co Ltd
Original Assignee
Guangdong Power Grid Co Ltd
Measurement Center of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Power Grid Co Ltd, Measurement Center of Guangdong Power Grid Co Ltd filed Critical Guangdong Power Grid Co Ltd
Priority to CN202111013941.5A priority Critical patent/CN113706029A/en
Publication of CN113706029A publication Critical patent/CN113706029A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Evolutionary Computation (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a self-adaptive management method, a device and a system for household energy. The adaptive management device comprises a data acquisition unit, an adaptive improvement unit and a decision management unit. The self-adaptive management system comprises a self-adaptive management module, an environment monitoring module and a cloud server. The self-adaptive management method, the device and the system improve the self-adaptability of the family energy management to different environments by training a first decision meta-model in advance, carrying out self-adaptive improvement on the first decision meta-model according to a reinforcement learning algorithm and a current environment data set to obtain a second decision meta-model and carrying out energy decision management on the current family environment according to the second decision meta-model.

Description

Self-adaptive management method, device and system for household energy
Technical Field
The invention relates to the field of self-adaptive management of household energy, in particular to a self-adaptive management method, a device and a system of household energy.
Background
The household energy management system is an intelligent decision-making system which performs optimization management on various energy consumption devices in a household, thereby realizing energy conservation and cost reduction and improving the comfort level of a user. The household energy management system works in a household environment, wherein the environment comprises household equipment managed by the household energy management system, the indoor and outdoor space environment of the household and user behavior influence, and different household environments can be generated by different equipment parameters and different probability distributions of random quantity. The existing household energy management optimization decision-making method generally has the problem of environmental dependence. That is, both modeling the environment and predicting the random quantities are environment-specific, even a data-driven reinforcement learning decision model can only make good decisions under a certain probability distribution of the environmental parameters when it is trained. Under the background of continuous development of smart homes, the development space of a home energy management system is larger and larger, the feasibility and the effectiveness of floor implementation are improved continuously, and the home energy management system is expected to become a core component of a new-generation smart home system.
In the prior art, the most common method is an integer programming method based on mathematical modeling, namely, a mathematical model of the working characteristics of the household electrical appliance is established firstly, the switching state of the mathematical model is expressed by integer variables, future environmental parameters are predicted firstly, and then the integer programming method such as Branch-and-Cut is adopted to solve by taking the minimum deviation of the integral household energy cost and the comfort degree of a user as an optimization target. On the basis, some researches consider the environment randomness, and the randomness is processed by adopting a random planning method such as a scene method and opportunity constraint. In addition, there are some studies on a reinforcement learning method based on data driving.
However, the prior art still has the following disadvantages: the self-adaptability of the household energy management system is poor, and the built-in optimization decision strategy of a manufacturer can hardly make good decisions in different environments in specific users, so that the operation effect of the household energy management system is seriously influenced; in addition, when the living environment or the behavior habit of the user changes, the home energy management system is difficult to perform decision self-adaptation, so that the decision capability is poor.
Therefore, there is a need for a method, apparatus and system for adaptive management of home energy, which can solve the above-mentioned problems in the prior art.
Disclosure of Invention
In view of the above technical problems, an object of the present invention is to provide a method, an apparatus and a system for adaptive management of home energy, so as to improve the adaptivity of home energy management.
The invention provides a self-adaptive management method of household energy, which comprises the following steps: acquiring a current environment data set;
according to the current environment data set and a preset reinforcement learning algorithm, performing self-adaptive improvement on a preset first decision-making meta-model so as to obtain a second decision-making meta-model; the first decision meta-model is a model trained in advance in a cloud server; and performing energy decision management on the current home environment according to the second decision meta-model.
In one embodiment, before the acquiring the current environment data set, the method further includes: and performing energy decision management on the current home environment through the first decision meta-model.
In one embodiment, after performing energy decision management on the current home environment according to the second decision meta-model, the method further includes: judging whether the current home environment changes; if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model; if so, performing self-adaptive improvement on a preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model; and performing energy decision management on the current family environment according to the third decision meta-model.
In one embodiment, the first decision meta-model is a model trained in advance in a cloud server, and specifically includes: establishing an environment model training set according to a plurality of preset parameter probability distributions; and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
In an embodiment, the adaptively improving the first decision meta-model according to the current environment data set and a preset reinforcement learning algorithm to obtain a second decision meta-model specifically includes: training the first decision-making meta-model according to the current environment data set by a random gradient descent method, and acquiring a convergence error of the trained first decision-making meta-model; judging whether the convergence error is smaller than a preset error threshold value or not; if the first decision meta-model is smaller than the second decision meta-model, finishing the training, and outputting the trained first decision meta-model as a second decision meta-model; and if not, continuing to train the first decision meta-model according to the current environment data set by a random gradient descent method.
The invention also provides a self-adaptive management device of the family energy, which comprises a data acquisition unit, an adaptive improvement unit and a decision management unit, wherein the data acquisition unit is used for acquiring the current environment data set; the adaptive improvement unit is used for carrying out adaptive improvement on a preset first decision-making meta-model according to the current environment data set and a preset reinforcement learning algorithm so as to obtain a second decision-making meta-model; the first decision meta-model is a model trained in advance in a cloud server; and the decision management unit is used for performing energy decision management on the current family environment according to the second decision meta-model.
In one embodiment, the adaptation-improving unit is further configured to: judging whether the current family environment change degree is higher than a preset environment change threshold value or not; if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model; and if so, carrying out self-adaptive improvement on the preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model.
The invention also provides a self-adaptive management system for the household energy, which comprises a self-adaptive management module, an environment monitoring module and a cloud server, wherein the environment monitoring module and the cloud server are respectively connected to the self-adaptive management module in a communication manner; wherein the adaptive management module is used for executing the adaptive management method of the household energy; the environment monitoring module is used for acquiring and storing a current environment data set and judging whether the current family environment change degree is higher than a preset environment change threshold value or not according to the current environment data set and a preset environment monitoring method; the cloud server is used for establishing an environment model training set according to a plurality of preset parameter probability distributions; and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
In one embodiment, the adaptive management system further comprises an analysis optimization module for: recording the management behavior of the self-adaptive management module; analyzing the energy consumption behavior habit of the current family and sending an analysis report to the current family according to the management behavior, the current environment data set and a preset energy consumption optimization strategy library; the analysis report includes a behavioral optimization suggestion.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
the invention provides a self-adaptive management method, a device and a system for household energy, which are characterized in that a first decision-making meta-model is trained in advance, the first decision-making meta-model is subjected to self-adaptive improvement according to a reinforcement learning algorithm and a current environment data set to obtain a second decision-making meta-model, and energy decision management is performed on the current household environment according to the second decision-making meta-model.
Further, according to the adaptive management method, device and system for the household energy, provided by the invention, when the change degree of the current household environment is judged to be higher than the preset environment change threshold value, the changed environment data set is collected again, the preset first decision-making meta-model is subjected to adaptive improvement again according to the changed environment data set and the reinforcement learning method, so that a third decision-making meta-model is obtained, and then the energy decision-making management is performed on the current household environment according to the third decision-making meta-model, so that the accuracy of the adaptive management of the household energy is improved.
Drawings
The invention will be further described with reference to the accompanying drawings, in which:
fig. 1 shows a flow chart of an embodiment of a method for adaptive management of home energy according to the invention;
fig. 2 shows a flow chart of another embodiment of a method for adaptive management of home energy according to the present invention;
fig. 3 shows a block diagram of an embodiment of an apparatus for adaptive management of home energy according to the present invention;
fig. 4 shows a block diagram of an embodiment of an adaptive management system for home energy according to the invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Detailed description of the preferred embodiment
The embodiment of the invention firstly describes a self-adaptive management method of household energy. Fig. 1 shows a flow chart of an embodiment of a method for adaptive management of home energy according to the present invention. As shown in fig. 1, the method comprises the steps of:
and S1, acquiring the current environment data set.
In order to adapt to the currently deployed home environment, the current environment data set needs to be acquired so as to perform adaptive improvement on the preset first decision meta-model subsequently.
And S2, carrying out self-adaptive improvement on a preset first decision-making meta-model according to the current environment data set and a preset reinforcement learning algorithm so as to obtain a second decision-making meta-model.
In one embodiment, a preset first decision meta-model is adaptively improved according to the current environment data set and a preset reinforcement learning algorithm, so as to obtain a second decision meta-model, specifically:
a: using a first decision meta-model
Figure BDA0003239241000000051
Performing K Episodes in a real environment and calculating a loss function
Figure BDA0003239241000000052
B: using gradient descent pair parameters
Figure BDA0003239241000000061
Performing one-time updating to obtain
Figure BDA0003239241000000062
C: judgment of
Figure BDA0003239241000000063
And if the value is less than epsilon, continuing the next step, otherwise, turning to the step A.
D: the trained first decision meta-model
Figure BDA0003239241000000064
The output is a second decision meta-model.
Each Episode is defined as an operation period or characteristic operation duration of specific controlled equipment in the household energy system; ε is the convergence tolerance error constant; α is a learning rate constant.
And S3, performing energy decision management on the current family environment according to the second decision meta-model.
Through step S2, the first decision meta-model originally set in factory is already adaptively trained to sufficiently adapt to the current home environment, so that energy decision management can be performed on the current home environment through the second decision meta-model (the adaptively trained first decision meta-model).
The invention provides a self-adaptive management method of household energy, which improves the self-adaptability of household energy management to different environments by training a first decision-making meta-model in advance, carrying out self-adaptive improvement on the first decision-making meta-model according to a reinforcement learning algorithm and a current environment data group to obtain a second decision-making meta-model and carrying out energy decision-making management on the current household environment according to the second decision-making meta-model.
Detailed description of the invention
Furthermore, the embodiment of the invention also describes a method for adaptively managing the household energy. Fig. 2 shows a flow chart of another embodiment of a method for adaptive management of home energy according to the present invention.
As shown in fig. 2, the method comprises the steps of:
a1: and performing energy decision management on the current home environment through the first decision meta-model.
The first decision meta-model is a model trained in advance at a cloud server. Specifically, in one embodiment, the first decision meta-model training process is as follows: establishing an environment model training set according to a plurality of preset parameter probability distributions; and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
Further, in an embodiment, the first decision meta-model training process specifically includes the following steps:
and a101, establishing an environment model training set according to a plurality of preset parameter probability distributions.
a102, establishing a preset reinforcement learning decision model fθAnd initializes the reinforcement learning parameter θ.
a103, sampling N environments from the environment model training set, and recording as e1,e2,…,eNThe loss function corresponding to each environment is L1,L2,…,LN
and a104, making i equal to 0.
a105, judging whether i is smaller than N, if so, changing i to i +1, and continuing the next step; if not, the step a109 is executed.
a106 adopting a strategy fθIn the environment eiPerforms K Episodes and calculates the loss function Li(fθ)。
a107, updating theta once by adopting gradient descent to obtain
Figure BDA0003239241000000071
a108, adopting the updated strategy fθiIn the environment eiPerforms K Episodes and calculates the loss function Li(fθi)。
a109, updating the element parameter theta,
Figure BDA0003239241000000072
a110 judging
Figure BDA0003239241000000073
And if not, skipping to execute the step a 104.
Wherein, the environment model training set comprises a plurality of environments, and each environment comprises a controlled electrical appliance model and a working environmentA model and a user behavior model; α is a learning rate constant, θiThe parameters of the decision model obtained by training in the environment i are obtained; ε is the convergence tolerance error constant, where β is the learning rate constant of the meta-learning; each Episode is defined as the operating duration of one operating cycle or characteristic of a particular controlled device.
A2: and acquiring a current environment data set.
In order to adapt to the currently deployed home environment, the current environment data set needs to be acquired so as to perform adaptive improvement on the preset first decision meta-model subsequently.
A3: and performing self-adaptive improvement on a preset first decision-making meta-model according to the current environment data set and a preset reinforcement learning algorithm so as to obtain a second decision-making meta-model.
In one embodiment, a preset first decision meta-model is adaptively improved according to the current environment data set and a preset reinforcement learning algorithm, so as to obtain a second decision meta-model, specifically:
a: using a first decision meta-model
Figure BDA0003239241000000081
Performing K Episodes in a real environment and calculating a loss function
Figure BDA0003239241000000082
B: using gradient descent pair parameters
Figure BDA0003239241000000083
Performing one-time updating to obtain
Figure BDA0003239241000000084
C: judgment of
Figure BDA0003239241000000085
And if the value is less than epsilon, continuing the next step, otherwise, turning to the step A.
D: the trained first decision meta-model
Figure BDA0003239241000000086
The output is a second decision meta-model.
Each Episode is defined as an operation period or characteristic operation duration of specific controlled equipment in the household energy system; ε is the convergence tolerance error constant; α is a learning rate constant.
A4: and performing energy decision management on the current home environment according to the second decision meta-model.
Through step S2, the first decision meta-model originally set in factory is already adaptively trained to sufficiently adapt to the current home environment, so that energy decision management can be performed on the current home environment through the second decision meta-model (the adaptively trained first decision meta-model).
A5: and judging whether the current family environment change degree is higher than a preset environment change threshold value.
When the environment changes greatly, the second decision meta-model may no longer be suitable for a new environment, so that, while performing energy decision management on the current home environment according to the second decision meta-model, it should be monitored in real time whether the current home environment changes, and corresponding measures should be taken according to the monitoring result.
A61: and if not, continuing to perform energy decision management on the current family environment according to the second decision meta-model.
A62: if so, performing self-adaptive improvement on a preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model; and performing energy decision management on the current family environment according to the third decision meta-model.
When the change degree of the current home environment is higher than a preset environment change threshold value, theoretically, the second decision-making meta-model is not suitable for the changed home environment any more, so that a changed environment data set needs to be collected again, the preset first decision-making meta-model is subjected to adaptive improvement again according to the changed environment data set and a preset reinforcement learning algorithm, a third decision-making meta-model is obtained, and then energy decision management is performed on the current home environment according to the third decision-making meta-model.
By adding the judgment and correction steps (A5-A62), the embodiment of the invention can directly re-collect the changed environment data set and re-perform model adaptive training when the current family environment has great change, thereby improving the speed and efficiency of the re-adaptation on the basis of rapid deployment and adaptation.
The embodiment of the invention provides a self-adaptive management method of household energy, which is characterized in that a first decision-making meta-model is trained in advance through a reinforcement learning algorithm, the first decision-making meta-model is subjected to self-adaptive improvement according to the reinforcement learning algorithm and a current environment data group to obtain a second decision-making meta-model, and energy decision-making management is carried out on a current household environment according to the second decision-making meta-model, so that the self-adaptive management method improves the self-adaptability of household energy management to different environments; further, the adaptive management method for the family energy provided by the invention also acquires the changed environment data set again when the change degree of the current family environment is judged to be higher than the preset environment change threshold value, performs adaptive improvement on the preset first decision-making meta-model again according to the changed environment data set and the reinforcement learning method, so as to obtain a third decision-making meta-model, and performs energy decision-making management on the current family environment according to the third decision-making meta-model, so as to improve the accuracy of the adaptive management of the family energy.
Detailed description of the preferred embodiment
Besides the method, the embodiment of the invention also describes a device for adaptively managing the household energy. Fig. 3 shows a block diagram of an embodiment of an apparatus for adaptive management of home energy according to the present invention.
As shown in fig. 3, the adaptive management apparatus includes a data acquisition unit 11, an adaptation improvement unit 12, and a decision management unit 13.
The data acquisition unit 11 is used for acquiring the current environment data set.
The adaptive improvement unit 12 is configured to perform adaptive improvement on a preset first decision meta-model according to the current environment data set and a preset reinforcement learning algorithm, so as to obtain a second decision meta-model. The first decision meta-model is a model trained in advance at a cloud server.
The decision management unit 13 is configured to perform energy decision management on the current home environment according to the second decision meta-model. In one embodiment, the decision management unit 13 is further configured to perform energy decision management on the current home environment through the first decision meta-model before acquiring the current environment data set. In an embodiment, the decision management unit 13 is further configured to perform energy decision management on the current home environment according to the third decision meta-model.
In one embodiment, the adaptation-improving unit 12 is further configured to: judging whether the current family environment change degree is higher than a preset environment change threshold value or not; if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model; and if so, carrying out self-adaptive improvement on the preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model.
When self-adaptive management of family energy is needed, firstly, a decision management unit 13 performs energy decision management on the current family environment through a first decision meta-model trained in advance in a cloud server, so as to generate a current environment data set, and a data acquisition unit 11 acquires the current environment data set; then, the adaptive improvement unit 12 performs adaptive improvement on a preset first decision meta-model according to the current environment data set and a preset reinforcement learning algorithm, so as to obtain a second decision meta-model; finally, the decision management unit 13 performs energy decision management on the current home environment according to the second decision meta-model; when the decision management unit performs energy decision management on the current home environment according to the second decision meta-model, the adaptive improvement unit 12 judges whether the change degree of the current home environment is higher than a preset environment change threshold value in real time; if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model; if so, performing adaptive improvement on a preset first decision meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision meta-model, and enabling a decision management unit 13 to perform energy decision management on the current home environment according to the third decision meta-model.
The invention provides a self-adaptive management device for household energy, which is characterized in that a first decision-making meta-model is trained in advance, the first decision-making meta-model is subjected to self-adaptive improvement according to a reinforcement learning algorithm and a current environment data group to obtain a second decision-making meta-model, and energy decision-making management is performed on a current household environment according to the second decision-making meta-model, so that the self-adaptive management device improves the self-adaptability of household energy management to different environments; furthermore, the adaptive management device for the family energy provided by the invention also acquires the changed environment data set again when the change degree of the current family environment is judged to be higher than the preset environment change threshold value, performs adaptive improvement on the preset first decision-making meta-model again according to the changed environment data set and the reinforcement learning method, so as to obtain a third decision-making meta-model, and performs energy decision-making management on the current family environment according to the third decision-making meta-model, so as to improve the accuracy of the adaptive management of the family energy.
Detailed description of the invention
In addition to the above method and apparatus, the present invention also describes an adaptive management system for home energy. Fig. 4 shows a block diagram of an embodiment of an adaptive management system for home energy according to the invention.
As shown in the figure, the adaptive management system comprises an adaptive management module 1, an environment monitoring module 2 and a cloud server 3, wherein the environment monitoring module 2 and the cloud server 3 are respectively connected to the adaptive management module 1 in a communication manner.
The adaptive management module 1 is configured to execute the adaptive management method for home energy as described above, so that the adaptive management system can be quickly adaptively deployed in different home environments and is adaptively deployed and managed again when the home environment is changed halfway, thereby implementing adaptive management of home energy.
The environment monitoring module 2 is used for collecting and storing a current environment data set, and judging whether the current family environment change degree is higher than a preset environment change threshold value according to the current environment data set and a preset environment monitoring method.
The cloud server 3 is used for establishing an environment model training set according to a plurality of preset parameter probability distributions; and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
In one embodiment, the adaptive management system further comprises an analysis optimization module for: recording the management behavior of the self-adaptive management module; and analyzing the energy consumption behavior habit of the current family and sending an analysis report to the current family according to the management behavior, the current environment data set and a preset energy consumption optimization strategy library. In one embodiment, the analysis report includes a behavior optimization suggestion.
The invention provides a self-adaptive management system for household energy, which is characterized in that a first decision-making meta-model is trained in advance, the first decision-making meta-model is subjected to self-adaptive improvement according to a reinforcement learning algorithm and a current environment data group to obtain a second decision-making meta-model, and energy decision-making management is carried out on the current household environment according to the second decision-making meta-model, so that the self-adaptive management system improves the self-adaptability of household energy management to different environments; furthermore, the adaptive management system for the family energy provided by the invention also acquires the changed environment data set again when the change degree of the current family environment is judged to be higher than the preset environment change threshold value, performs adaptive improvement on the preset first decision-making meta-model again according to the changed environment data set and the reinforcement learning method, so as to obtain a third decision-making meta-model, and performs energy decision-making management on the current family environment according to the third decision-making meta-model, so as to improve the accuracy of the adaptive management of the family energy.
The above-mentioned embodiments are provided to further explain the objects, technical solutions and advantages of the present invention in detail, and it should be understood that the above-mentioned embodiments are only examples of the present invention and are not intended to limit the scope of the present invention. It should be understood that any modifications, equivalents, improvements and the like, which come within the spirit and principle of the invention, may occur to those skilled in the art and are intended to be included within the scope of the invention.

Claims (9)

1. A method for adaptive management of home energy, comprising:
acquiring a current environment data set;
according to the current environment data set and a preset reinforcement learning algorithm, performing self-adaptive improvement on a preset first decision-making meta-model so as to obtain a second decision-making meta-model; the first decision meta-model is a model trained in advance in a cloud server;
and performing energy decision management on the current home environment according to the second decision meta-model.
2. The adaptive management method of home energy according to claim 1, further comprising, before said obtaining the current set of environmental data:
and performing energy decision management on the current home environment through the first decision meta-model.
3. The adaptive home energy management method of claim 2, further comprising, after performing energy decision management on the current home environment according to the second decision meta-model:
judging whether the current family environment change degree is higher than a preset environment change threshold value or not;
if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model;
if so, performing self-adaptive improvement on a preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model; and performing energy decision management on the current family environment according to the third decision meta-model.
4. The adaptive management method for home energy according to claim 3, wherein the first decision meta-model is a model trained in advance at a cloud server, and specifically comprises:
establishing an environment model training set according to a plurality of preset parameter probability distributions;
and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
5. The adaptive management method for home energy according to any one of claims 1 to 4, wherein the adaptive improvement is performed on the first decision meta-model according to the current environment data set and a preset reinforcement learning algorithm, so as to obtain a second decision meta-model, specifically:
training the first decision-making meta-model according to the current environment data set by a random gradient descent method, and acquiring a convergence error of the trained first decision-making meta-model;
judging whether the convergence error is smaller than a preset error threshold value or not;
if the first decision meta-model is smaller than the second decision meta-model, finishing the training, and outputting the trained first decision meta-model as a second decision meta-model;
and if not, continuing to train the first decision meta-model according to the current environment data set by a random gradient descent method.
6. An adaptive management device for home energy, comprising a data acquisition unit, an adaptation improvement unit, and a decision management unit, wherein,
the data acquisition unit is used for acquiring a current environment data set;
the adaptive improvement unit is used for carrying out adaptive improvement on a preset first decision-making meta-model according to the current environment data set and a preset reinforcement learning algorithm so as to obtain a second decision-making meta-model; the first decision meta-model is a model trained in advance in a cloud server;
and the decision management unit is used for performing energy decision management on the current family environment according to the second decision meta-model.
7. The adaptive management device of home energy according to claim 6, characterized in that the adaptation improvement unit is further configured to:
judging whether the current family environment change degree is higher than a preset environment change threshold value or not;
if not, continuing to perform energy decision management on the current home environment according to the second decision meta-model;
and if so, carrying out self-adaptive improvement on the preset first decision-making meta-model according to the changed environment data set and a preset reinforcement learning algorithm so as to obtain a third decision-making meta-model.
8. The self-adaptive management system for the household energy is characterized by comprising a self-adaptive management module, an environment monitoring module and a cloud server, wherein the environment monitoring module and the cloud server are respectively connected to the self-adaptive management module in a communication mode; wherein the content of the first and second substances,
the adaptive management module is configured to perform an adaptive management method of home energy according to any one of claims 1-5;
the environment monitoring module is used for acquiring and storing a current environment data set and judging whether the current family environment change degree is higher than a preset environment change threshold value or not according to the current environment data set and a preset environment monitoring method;
the cloud server is used for establishing an environment model training set according to a plurality of preset parameter probability distributions; and training a preset reinforcement learning decision model according to the environmental model training group by a meta-learning method, thereby obtaining a first decision meta-model.
9. The adaptive management system of home energy as claimed in claim 8, further comprising an analysis optimization module for:
recording the management behavior of the self-adaptive management module;
analyzing the energy consumption behavior habit of the current family and sending an analysis report to the current family according to the management behavior, the current environment data set and a preset energy consumption optimization strategy library; the analysis report includes a behavioral optimization suggestion.
CN202111013941.5A 2021-08-31 2021-08-31 Self-adaptive management method, device and system for household energy Pending CN113706029A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111013941.5A CN113706029A (en) 2021-08-31 2021-08-31 Self-adaptive management method, device and system for household energy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111013941.5A CN113706029A (en) 2021-08-31 2021-08-31 Self-adaptive management method, device and system for household energy

Publications (1)

Publication Number Publication Date
CN113706029A true CN113706029A (en) 2021-11-26

Family

ID=78658127

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111013941.5A Pending CN113706029A (en) 2021-08-31 2021-08-31 Self-adaptive management method, device and system for household energy

Country Status (1)

Country Link
CN (1) CN113706029A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106322656A (en) * 2016-08-23 2017-01-11 海信(山东)空调有限公司 Air conditioner control method, server and air conditioner system
CN112161322A (en) * 2020-09-24 2021-01-01 深圳市合信达控制系统有限公司 Heating equipment and control method thereof
CN112198853A (en) * 2020-10-10 2021-01-08 珠海格力电器股份有限公司 Control method and device of intelligent household equipment
CN112611080A (en) * 2020-12-10 2021-04-06 浙江大学 Intelligent air conditioner control system and method based on federal learning
CN113044064A (en) * 2021-04-01 2021-06-29 南京大学 Vehicle self-adaptive automatic driving decision method and system based on meta reinforcement learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106322656A (en) * 2016-08-23 2017-01-11 海信(山东)空调有限公司 Air conditioner control method, server and air conditioner system
CN112161322A (en) * 2020-09-24 2021-01-01 深圳市合信达控制系统有限公司 Heating equipment and control method thereof
CN112198853A (en) * 2020-10-10 2021-01-08 珠海格力电器股份有限公司 Control method and device of intelligent household equipment
CN112611080A (en) * 2020-12-10 2021-04-06 浙江大学 Intelligent air conditioner control system and method based on federal learning
CN113044064A (en) * 2021-04-01 2021-06-29 南京大学 Vehicle self-adaptive automatic driving decision method and system based on meta reinforcement learning

Similar Documents

Publication Publication Date Title
CN110647900B (en) Intelligent safety situation prediction method, device and system based on deep neural network
CN107690176B (en) Network selection method based on Q learning algorithm
CN112272074B (en) Information transmission rate control method and system based on neural network
CN113485144A (en) Intelligent household control method and system based on Internet of things
CN112329997A (en) Power demand load prediction method and system, electronic device, and storage medium
US20230216737A1 (en) Network performance assessment
CN113706029A (en) Self-adaptive management method, device and system for household energy
CN112312299A (en) Service unloading method, device and system
CN104778495A (en) Bayesian network optimization method based on particle swarm algorithm
CN110705756A (en) Electric power energy consumption optimization control method based on input convex neural network
CN110262275A (en) A kind of smart home system and its control method
Singh et al. Grey Wolf Optimization Based CNN-LSTM Network for the Prediction of Energy Consumption in Smart Home Environment
Verma et al. Making smart homes smarter: optimizing energy consumption with human in the loop
CN107942693A (en) Control method, apparatus, storage medium and the computer equipment of home appliance
CN110987062A (en) Intelligent metering box early warning method and system based on data fusion
CN112926777A (en) Day-ahead load prediction method and system considering commercial building energy consumption behaviors
CN111047465A (en) Power grid friendly load response method based on electric power big data
KR102542488B1 (en) Method and Apparatus for Predicting Appliance Power Usage State Using Non Intrusive Load Monitoring Based on Long-Short Term Memory
Kang et al. Multi-objective evolutionary optimization of 3D differentiated sensor network deployment
CN114460855B (en) AIOT module based on intelligence house
KR101330526B1 (en) Method, Apparatus and Recording Medium for Migration of Prediction Systems
CN117371681A (en) Power resource scheduling method, device, equipment and storage medium
CN117348431A (en) Device control method, device and computer readable medium
CN117277346A (en) Energy storage frequency modulation method, device and equipment based on multi-agent system
bin Dahari et al. Energy Efficiency of IoT-Edge Computing in Smart Grid Using Deep Reinforcement Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination