CN117033071A

CN117033071A - Disaster recovery data high-value quick and orderly recovery method based on file heat and multi-objective optimization

Info

Publication number: CN117033071A
Application number: CN202310855044.1A
Authority: CN
Inventors: 孙中伟; 刘希; 丁添; 刘朋矩; 周振宇
Original assignee: North China Electric Power University
Current assignee: North China Electric Power University
Priority date: 2023-07-12
Filing date: 2023-07-12
Publication date: 2023-11-10

Abstract

The invention relates to a high-value quick and orderly recovery method of disaster recovery data based on file heat and multi-objective optimization. Compared with the prior art, the method and the system can realize the rapid and orderly recovery of a large number of disaster recovery data of the power metering system, improve the throughput of the system in the process of recovering the disaster recovery data, reduce the response time of user access, improve the data service capability of the system and ensure the maximization of the service value of the power metering disaster recovery data.

Description

Disaster recovery data high-value quick and orderly recovery method based on file heat and multi-objective optimization

Technical Field

The invention belongs to the technical field of electric power metering data, and particularly relates to a high-value quick and orderly recovery method of disaster recovery data based on file heat and multi-objective optimization

The background technology is as follows:

under the big data background, along with the large-scale access of novel power equipment of huge amount to the distribution network, electric energy metering device in the distribution network can produce a large amount of multisource data information, and various electric power metering information is complicated, and a large amount of valuable data resources can't exert its effect for the usability of data drops by a wide margin, is difficult to provide effective support for power system's development.

The disaster recovery technology is an important way for improving the availability of data, and by adding copies of the same data and dispersedly placing the copies in the cluster nodes, the number of users served at the same time can be increased, the speed of data access is increased by parallel reading, and the availability of the electric power metering data is obviously improved.

However, the existing disaster recovery technology is difficult to formulate an effective strategy in the data recovery process, and the disaster recovery data between departments is always lack of effective integration due to the isolated micro-grid and different acquisition standards. Therefore, how to recover the disaster recovery data of power metering with high value and fast order is a problem to be solved.

The traditional data disaster recovery scheme generally adopts a nearby principle, the priority of the data is not divided, the scheme can complete timely recovery of disaster recovery data when the server load rate is low, but under the condition of high load rate, serious loss is easily caused by timely recovery of critical data, the high-value rapid and orderly recovery of disaster recovery data can be realized by adopting a multi-objective optimization algorithm, and the rapid and efficient convergence of performance is achieved by formulating and solving an optimization objective, so that compared with the traditional method, the method has stronger adaptability to scenes.

The invention comprises the following steps:

therefore, the invention aims to provide a high-value quick and orderly recovery method for disaster recovery data based on file heat and multi-objective optimization so as to realize efficient recovery of power metering disaster recovery data.

In order to achieve the above purpose, the present invention provides the following technical solutions:

according to the method, aiming at a power metering data disaster recovery scene, data recovery priority is formulated by calculating file heat, a disaster recovery scheme is designed by adopting a multi-objective optimization algorithm, recovery cost is considered while the problem of quick recovery of disaster recovery data is solved, and the method is suitable for the service requirements of a complex environment of an actual disaster recovery system. The scheme specifically comprises the following steps:

s1: acquiring state information of power metering data, and establishing a disaster recovery data backup metadata attribute group;

s2: calculating the heat value of the disaster recovery data file, and designing a recovery priority ordering method;

s3: designing a disaster recovery data recovery scheme based on a multi-objective optimization algorithm;

s4: the specific scheme flow is briefly described;

in step S1, a metadata attribute set of the disaster recovery data file copy is constructed by counting information such as the disaster recovery data access frequency, the data copy size, the access time, etc.

Further, in the step S2, the heat value of the data file is calculated according to the meta attribute group, and the priority of disaster recovery data is ordered based on the descending method of the heat value.

Further, in the step S3, a disaster recovery data recovery method based on a multi-objective optimization algorithm is designed, the disaster recovery data recovery strategy is dynamically prepared according to the average load rate of the server cluster, and when the load rate is smaller than a preset threshold, recovery tasks of the data copies are sequentially completed according to the file heat value; when the load rate is larger than a preset threshold, a data recovery scheme is formulated by taking minimized load rate, service transmission delay and service data loss as optimization targets.

Further, in the step S4, the specific scheme flow includes four parts, namely node status determination, data priority ordering, disaster recovery policy formulation, and data recovery. In the process of disaster recovery strategy formulation, a Chebyshev weight aggregation method is adopted to decompose a multi-objective optimization problem into a plurality of objective sub-problems, the solution process is fast in convergence, the calculation complexity is low, and high-value, rapid and orderly recovery of disaster recovery data can be ensured.

Compared with the prior art, the invention has the following advantages:

1) According to the invention, the metadata attribute group of the disaster recovery data file copy is constructed, the power metering disaster recovery data is analyzed, the file heat value is calculated according to the indexes such as the file size, the read-write rate and the like, and the disaster recovery data recovery priority is formulated according to the obtained result, so that the data with high value in the power system can be recovered preferentially. And the characteristics of the heat value of the data copy are further considered, the data is subjected to grading treatment, the management efficiency of disaster recovery data is effectively improved, and a support is provided for high-value rapid and orderly recovery of the disaster recovery data.

2) The invention provides a disaster recovery data high-value rapid and orderly recovery method based on multi-objective optimization, which takes minimized load rate, service transmission delay and service data loss as optimization objectives and can formulate a disaster recovery data recovery scheme comprehensively considering multiple factors. Further considering the average load rate of the server cluster, and formulating different disaster recovery strategies under the condition of different load rates, the throughput of the system in the process of recovering disaster recovery data can be improved, the response time of user access is reduced, the data service capacity of the system is improved, and the maximization of the service value of the power metering disaster recovery data is realized.

Description of the drawings:

FIG. 1 is a flow chart of a method for high-value and rapid and orderly recovery of disaster recovery data based on multi-objective optimization according to an embodiment of the present invention;

Detailed Description

The invention will now be described in detail with reference to the drawings and specific examples. The embodiment is implemented on the premise of the technical scheme of the invention, and detailed implementation modes and specific operation processes are given. It should be noted that, without conflict, the embodiments of the present invention and features of the embodiments may be combined with each other.

Fig. 1 is a flow chart of a method for high-value and rapid and orderly recovery of disaster recovery data based on multi-objective optimization according to the present embodiment, which specifically includes the following steps:

1) Node state determination

The invention adopts the heartbeat communication mode to monitor the state of the server node, and judges that the server fails and needs disaster recovery data when the heartbeat of the main server node cannot be monitored for a long time.

2) Disaster recovery data recovery priority determining method

The invention provides a disaster recovery data recovery priority determining method based on file heat. The method further carries out priority formulation by constructing metadata attribute groups of disaster recovery data file copies, and ensures orderly recovery of disaster recovery data, and is specifically introduced as follows:

metadata group construction: metadata property group for constructing disaster tolerant data file copy expressed as

G＝{S,U ⁿ ,T _l ,T _n ,H ^* ,F}

Wherein S represents a size set of files, U ⁿ For a set of number of users accessed over a period of time, T _l For the last accessed time set of the file, T _n H is the current time of the system ^* For the set of the heat value of the file in the previous stage, the initial heat value of the file is 0,F, which is the set of the read-write rate of the disaster recovery data file, and the higher the read-write rate is, the more times of accessing the file is indicated, and the larger the corresponding heat value of the file is.

And (5) calculating file heat: and according to the metadata attribute group of the disaster recovery data files, counting the information such as the access frequency, the data volume, the number of access users and the like of each disaster recovery data file in the current system time period, so as to calculate the heat value of the file and evaluate the recovery priority of the file. In the invention, the calculation formula of the file heat is as follows

Wherein, C is the file heat value evaluation index.

Grading disaster recovery: considering that the difference between the heat value of the data copies stored in different servers is possibly smaller, the data recovery needs to be carried out simultaneously under the condition of higher system load rate, so that the data recovery efficiency is further improved, the safety of the system is ensured, the disaster recovery data to be recovered can be subjected to grading treatment at the moment, and the data priority is divided according to grading results. By normalizing the heat valueThe data can be divided into key data to be recovered,Important data to be recovered and general data to be recovered. Wherein critical data to be restored->Important data to be recoveredGeneral data to be restored->

3) Disaster recovery strategy formulation

And calculating the average load rate of the current server cluster. If the average load rate is lower, the concurrent access amount of the system is low, the disaster recovery pressure is low, and the recovery task of the data copy can be completed in sequence according to the file heat value; when the average load rate is larger than a preset threshold omega, a data recovery scheme is formulated according to the disaster recovery data grading result. The calculation formula of the average load rate is as follows

Wherein M is the number of servers in the cluster, S _i Is the size of a single data copy, S _s The threshold ω=0.6 is preset for the maximum storage amount of the server.

4) Specific scheme flow

The disaster recovery data high-value rapid and orderly recovery scheme flow based on the multi-objective optimization method comprises an initialization stage, a genetic evolution stage and a termination stage. The specific steps are as follows:

an initialization stage: initializing the population size as K, setting an optimization target as minimizing the Load rate Load, the transmission Delay and the service data Loss of the server, and restricting the storage margin of the server as A. The overall optimization function is expressed as

Genetic evolution stage: the invention firstly solves the problem of multiple optimization targetsDecomposing into K single-objective optimization sub-questions, and distributing initial weight vector to each sub-questionThe vector dimension corresponds to the number of optimization objective functions, and then each sub-problem is subjected to cross mutation in the neighbor individuals to generate new server individuals.

Because the servers are in discrete distribution in the three-dimensional space, after a new generation individual is obtained, the Euclidean distance between the position of the new generation individual and the position of each real server is calculated, and the real server closest to the new generation individual is used as the new individual to continue iteration.

Aggregating all new individuals by using Chebyshev weight aggregation method, updating parent population according to aggregation result, and the aggregation formula is that

Wherein x is population individual, z ^* Is a set of reference solutions.

Termination phase: and when the maximum iteration times are reached, the loop is exited, a disaster recovery data recovery server node list is output, and a recovery task of the disaster recovery data is executed.

Although specific implementations of the invention and the accompanying drawings are disclosed for illustrative purposes only, and are presented to aid in understanding the invention and its implementation, it will be appreciated by those skilled in the art that: various alternatives, variations and modifications are possible without departing from the spirit and scope of the invention and the appended claims. Therefore, the present invention should not be limited to the preferred embodiments and the disclosure of the drawings, but the scope of the invention is defined by the appended claims.

Claims

1. The high-value quick and orderly recovery method for disaster recovery data based on file heat and multi-objective optimization is characterized in that the method aims at a disaster recovery scene of electric power metering data, the data recovery priority is formulated by calculating the file heat, a disaster recovery data recovery scheme is designed by adopting a multi-objective optimization algorithm, recovery cost is considered while the problem of quick recovery of disaster recovery data is solved, and the method is suitable for service requirements of a complex environment of an actual disaster recovery system.

2. The method for high-value quick and orderly recovery of disaster recovery data based on file heat and multi-objective optimization according to claim 1, wherein the scheme specifically comprises the following steps:

s4: the specific scheme flow is briefly described.

3. The method for sorting the recovery priorities according to claim 2, wherein the metadata attribute group of the disaster recovery data file copy is constructed by counting information such as the disaster recovery data access frequency, the data copy size, the access time, and the like, the heat value of the data file is calculated according to the attribute group, and the method for sorting the recovery priorities of the disaster recovery data is realized based on the descending order of the heat value.

4. The disaster recovery data recovery method based on the multi-objective optimization algorithm according to claim 2, wherein the disaster recovery data recovery method based on the multi-objective optimization algorithm is designed, a disaster recovery data recovery strategy is dynamically prepared according to the average load rate of the server cluster, and when the load rate is smaller than a threshold value, recovery tasks of data copies are sequentially completed according to the file heat value; when the load rate is greater than the threshold, a data recovery scheme is formulated with the minimum load rate, service transmission delay and service data loss as optimization targets.

5. The disaster recovery data recovery method based on multi-objective optimization algorithm according to claim 2, wherein the specific scheme flow comprises an initialization phase, a genetic evolution phase and a termination phase. In the genetic evolution stage, a Chebyshev weight aggregation method is adopted to decompose the multi-objective optimization problem into a plurality of objective quantum problems, the solving process is fast in convergence, the calculation complexity is low, and high-value, rapid and orderly recovery of disaster recovery data can be ensured.