CN113722292B - Disaster response processing method, device, equipment and storage medium of distributed data system - Google Patents

Disaster response processing method, device, equipment and storage medium of distributed data system Download PDF

Info

Publication number
CN113722292B
CN113722292B CN202111007167.7A CN202111007167A CN113722292B CN 113722292 B CN113722292 B CN 113722292B CN 202111007167 A CN202111007167 A CN 202111007167A CN 113722292 B CN113722292 B CN 113722292B
Authority
CN
China
Prior art keywords
distributed data
data system
resource
utilization rate
preset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111007167.7A
Other languages
Chinese (zh)
Other versions
CN113722292A (en
Inventor
梁天恺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Ping An Smart Healthcare Technology Co ltd
Original Assignee
Shenzhen Ping An Smart Healthcare Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Ping An Smart Healthcare Technology Co ltd filed Critical Shenzhen Ping An Smart Healthcare Technology Co ltd
Priority to CN202111007167.7A priority Critical patent/CN113722292B/en
Publication of CN113722292A publication Critical patent/CN113722292A/en
Application granted granted Critical
Publication of CN113722292B publication Critical patent/CN113722292B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor

Abstract

The application is applicable to the technical field of artificial intelligence and system disaster response processing, and provides a disaster response processing method, device, equipment and storage medium of a distributed data system, wherein the method comprises the following steps: acquiring the current resource utilization rate of the distributed data system, and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold; if the resource utilization rate reaches the disaster response starting threshold, solving the consumption of the computing resource which needs to be added in the disaster response of the system by adopting a preset disaster response model; and carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resources which are required to be added, so that the resource utilization rate of the distributed data system is reduced to a reference value representing the safe operation of the system. Based on the method, when the distributed data system has a sudden data disaster event, the annealing algorithm can be utilized to perform data disaster 'pouring-out', so that the resource utilization rate of clusters in the distributed data system is reduced to a safe value, and the normal operation of the system is ensured.

Description

Disaster response processing method, device, equipment and storage medium of distributed data system
Technical Field
The present disclosure relates to the field of artificial intelligence and disaster response processing technologies for systems, and in particular, to a disaster response processing method, apparatus, device, and storage medium for a distributed data system.
Background
In the data age of big data with the rapid growth of data, big data mining and analysis technology is widely applied in various industry fields. However, with the increasing data volume, it is difficult for the conventional data management system to ensure the security of data when there is a sudden data disaster event, and the capability of coping with the data disaster and the sudden data loss is lacking.
Disclosure of Invention
In view of this, the embodiments of the present application provide a disaster response processing method, apparatus, electronic device, and storage medium for a distributed data system, which can effectively improve the capability of the distributed data system to cope with data disaster and sudden data loss while implementing distributed storage of data by fusing Hadoop cluster technology and annealing algorithm, thereby solving the problems of reading efficiency and data security existing in conventional medical large data storage.
A first aspect of an embodiment of the present application provides a disaster response processing method for a distributed data system, including:
acquiring the current resource utilization rate of a distributed data system, and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold;
if the resource utilization rate reaches a disaster response starting threshold, solving the consumption of computing resources required to be added in the disaster response of the system by adopting a preset disaster response model;
and carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resource to be added so as to reduce the resource utilization rate of the distributed data system to a reference value representing the safe operation of the system.
With reference to the first aspect, in a first possible implementation manner of the first aspect, the step of solving, by using a preset disaster response model, a calculation resource usage amount that needs to be added in response to a system disaster, if the resource usage rate reaches a disaster response start threshold includes:
inputting a preset initialized computing resource additional consumption into the preset disaster response model for iterative processing to obtain an iterative computing resource additional consumption;
according to the iterative calculation resource adding amount, simulating and calculating the resource utilization rate of the distributed data system after adding the calculation resource;
judging whether the resource utilization rate of the distributed data system after adding the computing resource is lower than a preset resource utilization rate threshold value, if yes, configuring the iterative computing resource adding amount as the computing resource amount required to be added by the system disaster, otherwise, carrying out iterative processing again until the resource utilization rate of the distributed data system after adding the computing resource is lower than the preset resource utilization rate threshold value.
With reference to the first possible implementation manner of the first aspect, in a second possible implementation manner of the first aspect, the step of inputting a preset initialized additional usage of computing resources into the preset disaster response model to perform iterative processing, and obtaining the iterative additional usage of computing resources includes:
the preset initialized calculation resource adding amount is used as an initial solution of a target loss function in the preset disaster response model to calculate the target loss function, and a first function value is obtained;
carrying out disturbance processing on the initial solution according to preset disturbance conditions to obtain a plurality of disturbance solutions corresponding to the initial solution;
randomly selecting one disturbance solution from the disturbance solutions as a new solution of a target loss function in the preset disaster response model to calculate the target loss function, and obtaining a second function value;
and comparing the first function value with the second function value, if the second function value is smaller than or equal to the first function value, receiving the new solution, and performing next round of iterative processing on the target loss function by taking the new solution as an initial solution until the iterative times reach a preset time threshold value, so as to obtain the calculated resource additional consumption after iteration.
With reference to the second possible implementation manner of the first aspect, in a third possible implementation manner of the first aspect, in the step of performing a perturbation process on the initial solution according to a preset perturbation condition to obtain a plurality of perturbation solutions corresponding to the initial solution, the preset perturbation condition is configured to:
ω 2 -ω’>0 and ω' 2 =2-ω
Where ω is denoted as the initial solution and ω' is denoted as the disturbance solution.
With reference to the second possible implementation manner of the first aspect, in a fourth possible implementation manner of the first aspect, after the step of comparing the first function value and the second function value, if the second function value is greater than the first function value, the method further includes:
calculating a probability value for accepting the new solution according to a Metropolis criterion according to the first function value and the second function value;
and comparing the calculated probability value with a preset probability threshold, if the probability value is larger than the preset probability threshold, accepting the new solution, and carrying out next round of iterative processing on the target loss function by taking the new solution as an initial solution, otherwise, not accepting the new solution.
With reference to the first possible implementation manner of the first aspect, in a fifth possible implementation manner of the first aspect, if the resource usage rate of the distributed data system after adding the computing resource is not lower than a preset resource usage rate threshold, performing iterative processing again until the resource usage rate of the distributed data system after adding the computing resource is lower than the preset resource usage rate threshold, the method includes:
and reducing the resource utilization rate of the distributed data system according to a preset annealing rate and resetting the iteration times so as to carry out iteration processing again based on the annealed resource utilization rate of the distributed data system.
With reference to the first aspect or any one of possible implementation manners of the first aspect, in a sixth possible implementation manner of the first aspect, the performing, by using the computing resource to be added, resource scheduling processing on the distributed data system, so that a resource usage rate of the distributed data system falls to a reference value indicating safe operation of the system, where a manner of performing resource scheduling processing includes internal scheduling processing and external access processing, where: the internal scheduling process is resource scheduling between slave nodes in the distributed data system, and the external access process is to access computing resources from outside of the distributed data system.
A second aspect of an embodiment of the present application provides a disaster response processing device of a distributed data system, where the disaster response processing device of the distributed data system includes:
the judging module is used for acquiring the current resource utilization rate of the distributed data system and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold value or not;
the calculation module is used for solving the consumption of the calculation resources which are needed to be added in the system disaster response by adopting a preset disaster response model if the resource utilization rate reaches a disaster response starting threshold;
and the scheduling module is used for carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resources to be added so as to reduce the resource utilization rate of the distributed data system to a reference value representing the safe operation of the system.
A third aspect of the embodiments of the present application provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the electronic device, where the processor implements the steps of the disaster response processing method of the distributed data system provided in the first aspect when the processor executes the computer program.
A fourth aspect of the embodiments of the present application provides a computer-readable storage medium storing a computer program, which when executed by a processor, implements the steps of the disaster response processing method of the distributed data system provided in the first aspect.
The disaster response processing method, device, electronic equipment and storage medium of the distributed data system have the following beneficial effects:
the method comprises the steps of obtaining the current resource utilization rate of the distributed data system, and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold value or not; if the resource utilization rate reaches the disaster response starting threshold, solving the consumption of the computing resource which needs to be added in the disaster response of the system by adopting a preset disaster response model; and carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resources which are required to be added, so that the resource utilization rate of the distributed data system is reduced to a reference value representing the safe operation of the system. Based on the method, an annealing algorithm is fused in a distributed data system constructed by adopting a Hadoop cluster technology through a preset disaster response model, so that the capacity of the distributed data system for coping with difficult data disaster and sudden data loss is effectively improved while the distributed data system is subjected to distributed storage, and therefore, when the distributed data system has sudden data disaster events, the annealing algorithm is utilized to perform 'pouring and extinguishing' of the data disaster, so that the resource utilization rate of clusters in the distributed data system is reduced to a safe value, and the normal operation of the distributed data system is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are required for the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart of an implementation of a disaster response method for a distributed data system according to an embodiment of the present application;
FIG. 2 is a flow chart of a process when a distributed data system stores data in a disaster response processing method of the distributed data system according to an embodiment of the present application;
FIG. 3 is a schematic flow chart of a method for solving the computing resource consumption needed to be added when a system disaster is solved in the disaster response processing method of the distributed data system according to the embodiment of the present application;
fig. 4 is a schematic flow chart of a method for obtaining an additional usage amount of iterative computing resources through iteration in the disaster response processing method of the distributed data system according to the embodiment of the present application;
FIG. 5 is a flowchart illustrating another method for obtaining an additional usage amount of iterative computing resources by iteration in the disaster response processing method of the distributed data system according to the embodiment of the present application;
FIG. 6 is a basic block diagram of a disaster response device of a distributed data system according to an embodiment of the present application;
fig. 7 is a basic structural block diagram of an electronic device according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the present application.
Referring to fig. 1, fig. 1 is a flowchart illustrating an implementation of a disaster response method for a distributed data system according to an embodiment of the present application. The details are as follows:
s11: and acquiring the current resource utilization rate of the distributed data system, and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold.
In this embodiment, the distributed data system is a system for data storage and management built by using Hadoop (distributed system infrastructure). At least one master node and a plurality of slave nodes are built in the distributed data system, wherein the master node is used for executing work for coping with data disasters, and the slave nodes are used for storing data of different categories. For example, referring to fig. 2 together, fig. 2 is a flowchart illustrating a process when a distributed data system stores data in a disaster response processing method of the distributed data system according to an embodiment of the present application. As shown in fig. 2, the distributed data system provides an online access portal and an offline access portal, wherein the online portal is directly connected with the database, supports reading data directly from the database, the offline portal supports reading a local csv file, reads data from the local csv file, and gathers the data read by the two access portals after reading the data from the two different access portals. Then, according to the read data, data slicing processing is carried out according to the set data types, and the data is divided into a plurality of data blocks. Further, the plurality of data blocks are stored in different slave nodes according to data types.
In this embodiment, a higher resource usage rate of a node indicates that the node has a higher probability of being down, which results in a data disaster in which the data of the node cannot be used. In this embodiment, a disaster response starting threshold for triggering the system to execute disaster response processing may be set in the distributed data system in advance, where the disaster response starting threshold is a resource usage upper limit reference value that characterizes that the system can normally operate. And then monitoring the resource utilization rate of each slave node at the master node of the distributed data system, and summarizing and integrating the resource utilization rate of each slave node to obtain the current resource utilization rate of the distributed data system. And comparing the obtained current resource utilization rate of the distributed data system with a disaster response starting threshold preset in the system, judging whether the obtained current resource utilization rate of the distributed data system is larger than or equal to the disaster response starting threshold preset in the system, and if so, judging whether the current resource utilization rate of the distributed data system reaches the disaster response starting threshold.
S12: and if the resource utilization rate reaches the disaster response starting threshold, solving the consumption of the computing resource which needs to be added in the disaster response of the system by adopting a preset disaster response model.
In this embodiment, a preset disaster response model is constructed by using a simulated annealing algorithm, and in the disaster response model, a target loss function is designed, and the target loss function adopts a cross entropy loss function, which is specifically as follows:
L=-[ylog y^+(1-y)log(1-y^)]
wherein y is represented as the true output of the disaster response model, and y is represented as the predicted output of the disaster response model.
In this embodiment, the input of the disaster response model is the initially set computing resource addition amount, and the output of the disaster response model is the resource utilization rate of the system after adding the computing resource. In the disaster response model, based on the input computing resource additional consumption, iterating the target loss function, and solving the computing resource consumption required to be added in the disaster response of the system by minimizing the target loss function. In this embodiment, when the resource usage rate output by the disaster response model reaches the reference value of the safe operation of the system, it means that the objective loss function is minimized.
S13: and carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resource to be added so as to reduce the resource utilization rate of the distributed data system to a reference value representing the safe operation of the system.
In this embodiment, the resource scheduling processing includes two modes, i.e., internal scheduling processing and external access processing. Wherein: the internal scheduling process is the resource scheduling between the slave nodes in the distributed data system, and the external access process is the external access computing resource of the distributed data system. That is, the resource scheduling process may be performed from an existing computing resource within the distributed data system, or may be performed by adding a new computing resource from the outside. In this embodiment, when the internal scheduling processing manner is used, the computing resources in the slave nodes with sufficient computing resources are scheduled to the slave nodes with insufficient computing resources by specifically calculating the computing resource usage condition of each slave node in the distributed data system, so as to implement internal scheduling. It can be understood that when the internal scheduling process is performed, it is necessary to ensure that the resource utilization rate of the slave node itself with sufficient computing resources can be stabilized below a reference value representing the safe operation of the system, and then schedule the remaining computing resources to other slave nodes.
As can be seen from the above, the disaster response processing method for a distributed data system provided by the present embodiment obtains the current resource usage rate of the distributed data system, and determines whether the current resource usage rate of the distributed data system reaches the disaster response starting threshold; if the resource utilization rate reaches the disaster response starting threshold, solving the consumption of the computing resource which needs to be added in the disaster response of the system by adopting a preset disaster response model; and carrying out resource scheduling processing on the distributed data system according to the consumption of the computing resources which are required to be added, so that the resource utilization rate of the distributed data system is reduced to a reference value representing the safe operation of the system. By means of a preset disaster response model, an annealing algorithm is fused in a distributed data system constructed by adopting a Hadoop cluster technology, the capacity of the distributed data system for coping with difficult data disaster and sudden data loss is effectively improved while the distributed data system is subjected to distributed storage, so that when the distributed data system has sudden data disaster events, the annealing algorithm is utilized to perform 'pouring and extinguishing' of the data disaster, the resource utilization rate of clusters in the distributed data system is reduced to a safe value, and normal operation of the distributed data system is guaranteed.
In some embodiments of the present application, referring to fig. 3, fig. 3 is a flow chart of a method for solving a computing resource consumption needed for system disaster recovery in the disaster recovery processing method of the distributed data system provided in the embodiments of the present application. The details are as follows:
s31: inputting a preset initialized computing resource additional consumption into the preset disaster response model for iterative processing to obtain an iterative computing resource additional consumption;
s32: according to the iterative calculation resource adding amount, simulating and calculating the resource utilization rate of the distributed data system after adding the calculation resource;
s33: judging whether the resource utilization rate of the distributed data system after adding the computing resource is lower than a preset resource utilization rate threshold value, if yes, configuring the iterative computing resource adding amount as the computing resource amount required to be added by the system disaster, otherwise, carrying out iterative processing again until the resource utilization rate of the distributed data system after adding the computing resource is lower than the preset resource utilization rate threshold value.
In this embodiment, the preset initialized additional usage of computing resources is obtained through random setting. In this embodiment, an initialized additional computing resource amount may be input at random, then a disturbance solution corresponding to the initialized additional computing resource amount may be generated by randomly perturbing the initialized additional computing resource amount, the objective loss function may be calculated by using the initialized additional computing resource amount and the disturbance solution corresponding to the initialized additional computing resource amount, so as to obtain two function values respectively corresponding to the two function values, and further by comparing the two function values, if the function value obtained when the objective loss function is calculated by using the disturbance solution corresponding to the initialized additional computing resource amount is smaller than or equal to the function value obtained when the objective loss function is calculated by using the initialized additional computing resource amount, the disturbance solution may be accepted as the additional computing resource amount after the iteration. In this embodiment, an iteration number threshold may be set in the disaster response model, so as to obtain the iterative computing resource additional usage through multiple iterations. It can be understood that when multiple iterations are performed, after the disturbance solution in the objective loss function iteration process is accepted, when the next iteration is performed, the disturbance solution is used as an initial solution to perform random disturbance again to generate a disturbance solution of the next iteration. After the iterative computing resource adding consumption is obtained, simulating and adding the iterative computing resource adding consumption into the distributed data system according to the iterative computing resource adding consumption, so as to simulate and calculate the resource utilization rate of the distributed data system after adding the computing resource. And further, judging whether the resource utilization rate of the distributed data system after adding the computing resource is lower than a preset resource utilization rate threshold value, so as to determine whether the iterative computing resource adding amount can be used as the computing resource amount which needs to be added for system disaster response. In this embodiment, the preset resource usage threshold is characterized as a reference value for safe operation of the system. If the resource utilization rate of the distributed data system after the additional computing resource is lower than the preset resource utilization rate threshold after the additional computing resource is simulated according to the iterative computing resource additional consumption, the iterative computing resource additional consumption is indicated to meet the disaster response requirement of the distributed data system, and at the moment, the iterative computing resource additional consumption can be configured as the computing resource consumption required to be added for the distributed data system disaster response.
In some embodiments of the present application, referring to fig. 4, fig. 4 is a flowchart of a method for obtaining an additional usage amount of iterative computing resources by iteration in the disaster response processing method of the distributed data system provided in the embodiments of the present application, which is described in detail below:
s41: calculating the target loss function by taking the preset initialized calculation resource adding amount as an initial solution of the target loss function in the preset disaster response model to obtain a first function value;
s42: carrying out disturbance processing on the initial solution according to preset disturbance conditions to obtain a plurality of disturbance solutions corresponding to the initial solution;
s43: randomly selecting one disturbance solution from the disturbance solutions as a new solution of a target loss function in the preset disaster response model to calculate the target loss function, and obtaining a second function value;
s44: and comparing the first function value with the second function value, if the second function value is smaller than or equal to the first function value, accepting the new solution, and carrying out iterative processing on the target loss function by taking the new solution as an initial solution until the iterative times reach a preset time threshold value, so as to obtain the iterative calculation resource adding amount.
In this embodiment, the initialized additional usage of computing resources is obtained by random setting. In this embodiment, the initialized additional usage of computing resources is set to be a usage of one computing resource unit, for example, 100m. After the initialized additional usage of the computing resource is obtained, in the disaster response model, the initialized additional usage of the computing resource can be used as an initial solution to calculate a target loss function in the disaster response model, so as to obtain a first function value. For the initialized additional computing resource consumption, a disturbance solution corresponding to the initialized additional computing resource consumption may be obtained by performing a disturbance process according to a preset disturbance condition, where it is understood that the disturbance solution is also characterized as the additional computing resource consumption. For example, assuming that the initialized computing resource additional usage is denoted as ω, the perturbation solution is denoted as ω', the perturbation condition may be set as: omega 2 -ω’>0 and ω' 2 =2- ω, a finite number of perturbation solutions can be generated based on the perturbation condition. In this embodiment, a disturbance solution may be randomly selected from the limited number of disturbance solutions as a new solution of the target loss function, and the target loss function may be calculated to obtain a second function value. After the first function value and the second function value are obtained, the first function value and the second function value are compared, if the second function value is smaller than or equal to the first function value, the disturbance solution is received, and the disturbance solution is used as an initial solution to perform next round of iterative processing on the target loss function. In this embodiment, an iteration number threshold is preset in the disaster response model, so that the number of iterations is multiple, until the number of iterations reaches the preset number threshold, to obtain the additional consumption of the iterative computing resource.
In some embodiments of the present application, referring to fig. 5, fig. 5 is a flowchart of another method for obtaining an additional usage of iterative computing resources by iteration in the disaster response processing method of the distributed data system according to the embodiment of the present application, which is described in detail below:
s51: calculating a probability value for accepting the new solution according to a Metropolis criterion according to the first function value and the second function value;
s52: and comparing the calculated probability value with a preset probability threshold, if the probability value is larger than the preset probability threshold, accepting the new solution, and carrying out next round of iterative processing on the target loss function by taking the new solution as an initial solution, otherwise, not accepting the new solution.
In this embodiment, if the second function value is greater than the first function value, it may be determined whether to accept the perturbation solution according to the Metropolis criterion. Specifically, the metapolis criterion is to calculate the probability of accepting the perturbation solution, i.e., p=exp (- Δt/T), where Δt=second function value-first function value. If the calculated probability is larger than a preset threshold, the disturbance solution is accepted, otherwise, the disturbance solution is not accepted, wherein the preset threshold is a random number in a [0,1 ] interval. In this embodiment, if it is determined according to the metapolis criterion that the disturbance solution selected at this time is not accepted, one disturbance solution is randomly selected again from other disturbance solutions generated based on the disturbance conditions before so as to perform iterative processing on the objective function again. In this embodiment, an iteration number threshold is set for the iteration process of the objective loss function, and after each iteration of the objective loss function and receiving the disturbance solution, it is determined whether the iteration number at that time reaches the iteration number threshold, if so, the disturbance solution received by the iteration is obtained as the calculation resource additional usage after the iteration.
In some embodiments of the present application, if the post-iteration computing resource adding amount still fails to meet the disaster response requirement of the distributed data system, the iterative process may be returned to be performed again until the resource usage rate of the distributed data system after adding the computing resource is lower than the preset resource usage rate threshold. Exemplary, based on a simulated annealing algorithm, reducing the resource utilization rate of the distributed data system according to a preset annealing rate, and resetting the iteration times to zero the iteration times, so that the iteration process is carried out again based on the annealed resource utilization rate of the distributed data system. In this embodiment, the annealing rate λ is set to be a positive number smaller than 1, specifically, a value is between 0.8 and 0.99, and the resource usage rate of the distributed data system is slowly reduced by the annealing rate, where a specific calculation formula is as follows: t (n+1) =λt (n), n=1, 2,3. Where T (n) represents the resource usage of the distributed data system before annealing and T (n+1) represents the resource usage of the distributed data system after annealing. When judging that the resource utilization rate of the distributed data system after adding the computing resource is not lower than a preset resource utilization rate threshold, cooling the distributed data system according to the preset annealing rate, and then returning to iterate again until the resource utilization rate of the distributed data system after adding the computing resource is lower than the preset resource utilization rate threshold, so as to obtain the iterative computing resource adding amount.
It should be understood that, the sequence number of each step in the foregoing embodiment does not mean the execution sequence, and the execution sequence of each process should be determined by the function and the internal logic of each process, and should not limit the implementation process of the embodiment of the present application in any way.
In some embodiments of the present application, referring to fig. 6, fig. 6 is a basic block diagram of a disaster response processing device of a distributed data system according to an embodiment of the present application. The apparatus in this embodiment includes units for performing the steps in the method embodiments described above. Refer to the related description in the above method embodiment. For convenience of explanation, only the portions related to the present embodiment are shown. As shown in fig. 6, the disaster response processing device of the distributed data system includes: a judging module 61, a calculating module 62 and a scheduling module 63. Wherein: the judging module 61 is configured to obtain a current resource usage rate of the distributed data system, and judge whether the current resource usage rate of the distributed data system reaches a disaster response starting threshold. The calculation module 62 is configured to solve, if the resource usage rate reaches the disaster response starting threshold, a preset disaster response model to obtain an amount of calculation resources that need to be added for system disaster response. The scheduling module 63 is configured to perform resource scheduling processing on the distributed data system according to the usage of the computing resource to be added, so that the resource usage rate of the distributed data system is reduced to a reference value representing safe operation of the system.
It should be understood that the disaster response processing device of the distributed data system corresponds to the disaster response processing method of the distributed data system one by one, and will not be described herein.
In some embodiments of the present application, referring to fig. 7, fig. 7 is a basic block diagram of an electronic device according to an embodiment of the present application. As shown in fig. 7, the electronic device 7 of this embodiment includes: a processor 71, a memory 72 and a computer program 73 stored in said memory 72 and executable on said processor 71, for example a program of a disaster response handling method of a distributed data system. Processor 71, when executing computer program 73, implements the steps described above in various embodiments of the disaster recovery processing method for each of the distributed data systems. Alternatively, the processor 71 may implement the functions of each module in the embodiments corresponding to the disaster response processing device of the distributed data system when executing the computer program 73. Please refer to the related description in the embodiments, which is not repeated here.
By way of example, the computer program 73 may be divided into one or more modules (units) that are stored in the memory 72 and executed by the processor 71 to complete the present application. The one or more modules may be a series of computer program instruction segments capable of performing the specified functions for describing the execution of the computer program 73 in the electronic device 7. For example, the computer program 73 may be divided into a judgment module, a calculation module, and a scheduling module, each module having a specific function as described above.
The electronic device may include, but is not limited to, a processor 71, a memory 72. It will be appreciated by those skilled in the art that fig. 7 is merely an example of the electronic device 7 and is not meant to be limiting as the electronic device 7 may include more or fewer components than shown, or may combine certain components, or different components, e.g., the electronic device may further include an input-output device, a network access device, a bus, etc.
The processor 71 may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), off-the-shelf programmable gate arrays (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory 72 may be an internal storage unit of the electronic device 7, such as a hard disk or a memory of the electronic device 7. The memory 72 may be an external storage device of the electronic device 7, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card) or the like, which are provided on the electronic device 7. Further, the memory 72 may also include both an internal storage unit and an external storage device of the electronic device 7. The memory 72 is used to store the computer program as well as other programs and data required by the electronic device. The memory 72 may also be used to temporarily store data that has been output or is to be output.
It should be noted that, because the content of information interaction and execution process between the above devices/units is based on the same concept as the method embodiment of the present application, specific functions and technical effects thereof may be referred to in the method embodiment section, and will not be described herein again.
Embodiments of the present application also provide a computer readable storage medium storing a computer program which, when executed by a processor, implements steps that may implement the various method embodiments described above. In this embodiment, the computer-readable storage medium may be nonvolatile or may be volatile.
Embodiments of the present application provide a computer program product which, when run on a mobile terminal, causes the mobile terminal to perform steps that may be performed in the various method embodiments described above.
It is appreciated that embodiments of the present application may acquire and process relevant data based on artificial intelligence techniques. Among these, artificial intelligence (Artificial Intelligence, AI) is the theory, method, technique and application system that uses a digital computer or a digital computer-controlled machine to simulate, extend and extend human intelligence, sense the environment, acquire knowledge and use knowledge to obtain optimal results. Artificial intelligence infrastructure technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technologies, operation/interaction systems, mechatronics, and the like. The artificial intelligence software technology mainly comprises a computer vision technology, a robot technology, a biological recognition technology, a voice processing technology, a natural language processing technology, machine learning/deep learning and other directions.
It is to be understood that the embodiments of the present application are operational with numerous general purpose or special purpose computer system environments or configurations. For example: personal computers, server computers, hand-held or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, distributed computing environments that include any of the above systems or devices, and the like. Embodiments of the application may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. The application may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-described division of the functional units and modules is illustrated, and in practical application, the above-described functional distribution may be performed by different functional units and modules according to needs, i.e. the internal structure of the apparatus is divided into different functional units or modules to perform all or part of the above-described functions. The functional units and modules in the embodiment may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit, where the integrated units may be implemented in a form of hardware or a form of a software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working process of the units and modules in the above system may refer to the corresponding process in the foregoing method embodiment, which is not described herein again.
The integrated modules/units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the present application may implement all or part of the flow of the method of the above embodiment, or may be implemented by a computer program to instruct related hardware, where the computer program may be stored in a computer readable storage medium, and when the computer program is executed by a processor, the computer program may implement the steps of each method embodiment described above. Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth. It should be noted that the computer readable medium may include content that is subject to appropriate increases and decreases as required by jurisdictions in which such content is subject to legislation and patent practice, such as in certain jurisdictions in which such content is not included as electrical carrier signals and telecommunication signals.
In the foregoing embodiments, the descriptions of the embodiments are emphasized, and in part, not described or illustrated in any particular embodiment, reference is made to the related descriptions of other embodiments.
The above embodiments are only for illustrating the technical solution of the present application, and are not limiting; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application, and are intended to be included in the scope of the present application.

Claims (6)

1. The disaster response processing method for the distributed data system is characterized by comprising the following steps:
acquiring the current resource utilization rate of a distributed data system, and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold;
if the resource utilization rate reaches the disaster response starting threshold, solving the consumption of the computing resource which needs to be added in the disaster response of the system by adopting a preset disaster response model, wherein the method comprises the following steps: the preset initialized calculation resource adding amount is used as an initial solution of a target loss function in the preset disaster response model to calculate the target loss function, and a first function value is obtained; carrying out disturbance processing on the initial solution according to preset disturbance conditions to obtain a plurality of disturbance solutions corresponding to the initial solution; randomly selecting one disturbance solution from the disturbance solutions as a new solution of a target loss function in the preset disaster response model to calculate the target loss function, and obtaining a second function value; comparing the first function value with the second function value, if the second function value is smaller than or equal to the first function value, receiving the new solution and carrying out next round of iteration processing on the target loss function by taking the new solution as an initial solution until the iteration times reach a preset time threshold value, and obtaining the calculated resource additional consumption after iteration; according to the iterative calculation resource adding amount, simulating and calculating the resource utilization rate of the distributed data system after adding the calculation resource; judging whether the resource utilization rate of the distributed data system after adding the computing resource is lower than a preset resource utilization rate threshold value, if yes, configuring the iterative computing resource adding amount as the computing resource amount which needs to be added by the system according to disaster, otherwise, reducing the resource utilization rate of the distributed data system according to a preset annealing rate and resetting the iteration times so as to carry out iterative processing again based on the annealed resource utilization rate of the distributed data system until the resource utilization rate of the distributed data system after adding the computing resource is lower than the preset resource utilization rate threshold value;
performing resource scheduling processing on the distributed data system according to the consumption of the computing resource to be added, so that the resource utilization rate of the distributed data system is reduced to a reference value representing the safe operation of the system, wherein the mode of performing the resource scheduling processing comprises internal scheduling processing and external access processing, and the method comprises the following steps: the internal scheduling process is resource scheduling between slave nodes in the distributed data system, and the external access process is to access computing resources from outside of the distributed data system.
2. The disaster response processing method of a distributed data system according to claim 1, wherein in the step of performing a disturbance process on the initial solution according to a preset disturbance condition to obtain a plurality of disturbance solutions corresponding to the initial solution, the preset disturbance condition is configured to:
and->
Where ω is denoted as the initial solution and ω' is denoted as the disturbance solution.
3. The disaster recovery processing method of claim 1, wherein after the step of comparing the first function value and the second function value, if the second function value is greater than the first function value, further comprising:
calculating a probability value for accepting the new solution according to a Metropolis criterion according to the first function value and the second function value;
and comparing the calculated probability value with a preset probability threshold, if the probability value is larger than the preset probability threshold, accepting the new solution, and carrying out next round of iterative processing on the target loss function by taking the new solution as an initial solution, otherwise, not accepting the new solution.
4. A disaster response processing device for a distributed data system, comprising:
the judging module is used for acquiring the current resource utilization rate of the distributed data system and judging whether the current resource utilization rate of the distributed data system reaches a disaster response starting threshold value or not;
the calculation module is used for solving the consumption of the calculation resources which need to be added in response to the disaster of the system by adopting a preset disaster response model if the utilization rate of the resources reaches a disaster response starting threshold, and comprises the following steps: the preset initialized calculation resource adding amount is used as an initial solution of a target loss function in the preset disaster response model to calculate the target loss function, and a first function value is obtained; carrying out disturbance processing on the initial solution according to preset disturbance conditions to obtain a plurality of disturbance solutions corresponding to the initial solution; randomly selecting one disturbance solution from the disturbance solutions as a new solution of a target loss function in the preset disaster response model to calculate the target loss function, and obtaining a second function value; comparing the first function value with the second function value, if the second function value is smaller than or equal to the first function value, receiving the new solution and carrying out next round of iteration processing on the target loss function by taking the new solution as an initial solution until the iteration times reach a preset time threshold value, and obtaining the calculated resource additional consumption after iteration; according to the iterative calculation resource adding amount, simulating and calculating the resource utilization rate of the distributed data system after adding the calculation resource; judging whether the resource utilization rate of the distributed data system after adding the computing resource is lower than a preset resource utilization rate threshold value, if yes, configuring the iterative computing resource adding amount as the computing resource amount which needs to be added by the system according to disaster, otherwise, reducing the resource utilization rate of the distributed data system according to a preset annealing rate and resetting the iteration times so as to carry out iterative processing again based on the annealed resource utilization rate of the distributed data system until the resource utilization rate of the distributed data system after adding the computing resource is lower than the preset resource utilization rate threshold value;
the scheduling module is used for performing resource scheduling processing on the distributed data system according to the consumption of the computing resources to be added, so that the resource utilization rate of the distributed data system is reduced to a reference value representing the safe operation of the system, and the mode of performing the resource scheduling processing comprises internal scheduling processing and external access processing, wherein: the internal scheduling process is resource scheduling between slave nodes in the distributed data system, and the external access process is to access computing resources from outside of the distributed data system.
5. An electronic device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any one of claims 1 to 3 when the computer program is executed.
6. A computer readable storage medium storing a computer program, characterized in that the computer program when executed by a processor implements the steps of the method according to any one of claims 1 to 3.
CN202111007167.7A 2021-08-30 2021-08-30 Disaster response processing method, device, equipment and storage medium of distributed data system Active CN113722292B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111007167.7A CN113722292B (en) 2021-08-30 2021-08-30 Disaster response processing method, device, equipment and storage medium of distributed data system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111007167.7A CN113722292B (en) 2021-08-30 2021-08-30 Disaster response processing method, device, equipment and storage medium of distributed data system

Publications (2)

Publication Number Publication Date
CN113722292A CN113722292A (en) 2021-11-30
CN113722292B true CN113722292B (en) 2024-02-09

Family

ID=78679384

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111007167.7A Active CN113722292B (en) 2021-08-30 2021-08-30 Disaster response processing method, device, equipment and storage medium of distributed data system

Country Status (1)

Country Link
CN (1) CN113722292B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005263447A (en) * 2004-03-19 2005-09-29 Yaskawa Electric Corp Method for scheduling optimum car allocation, and program storage medium thereof
CN110515704A (en) * 2019-08-30 2019-11-29 广东浪潮大数据研究有限公司 Resource regulating method and device based on Kubernetes system
CN111176753A (en) * 2019-12-20 2020-05-19 贝壳技术有限公司 Cloud resource optimal configuration method and device, electronic equipment and readable storage medium
CN111586762A (en) * 2020-04-29 2020-08-25 重庆邮电大学 Task unloading and resource allocation joint optimization method based on edge cooperation
WO2020169182A1 (en) * 2019-02-19 2020-08-27 Nokia Solutions And Networks Oy Method and apparatus for allocating tasks
CN111611274A (en) * 2020-05-28 2020-09-01 华中科技大学 Database query optimization method and system
CN112965813A (en) * 2021-02-10 2021-06-15 山东英信计算机技术有限公司 AI platform resource regulation and control method, system and medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005263447A (en) * 2004-03-19 2005-09-29 Yaskawa Electric Corp Method for scheduling optimum car allocation, and program storage medium thereof
WO2020169182A1 (en) * 2019-02-19 2020-08-27 Nokia Solutions And Networks Oy Method and apparatus for allocating tasks
CN110515704A (en) * 2019-08-30 2019-11-29 广东浪潮大数据研究有限公司 Resource regulating method and device based on Kubernetes system
CN111176753A (en) * 2019-12-20 2020-05-19 贝壳技术有限公司 Cloud resource optimal configuration method and device, electronic equipment and readable storage medium
CN111586762A (en) * 2020-04-29 2020-08-25 重庆邮电大学 Task unloading and resource allocation joint optimization method based on edge cooperation
CN111611274A (en) * 2020-05-28 2020-09-01 华中科技大学 Database query optimization method and system
CN112965813A (en) * 2021-02-10 2021-06-15 山东英信计算机技术有限公司 AI platform resource regulation and control method, system and medium

Also Published As

Publication number Publication date
CN113722292A (en) 2021-11-30

Similar Documents

Publication Publication Date Title
EP3447642A1 (en) System and method for predicting application performance for large data size on big data cluster
CN111159897B (en) Target optimization method and device based on system modeling application
CN113537337A (en) Training method, abnormality detection method, apparatus, device, and storage medium
CN112783508B (en) File compiling method, device, equipment and storage medium
CN113495825A (en) Line alarm processing method and device, electronic equipment and readable storage medium
CN113722292B (en) Disaster response processing method, device, equipment and storage medium of distributed data system
CN111339072A (en) User behavior based change value analysis method and device, electronic device and medium
CN115495151A (en) Rule engine migration method, device, equipment, storage medium and program product
EP3855316A1 (en) Optimizing breakeven points for enhancing system performance
CN114676272A (en) Information processing method, device and equipment of multimedia resource and storage medium
CN112764957A (en) Application fault delimiting method and device
CN112231292A (en) File processing method and device, storage medium and computer equipment
US20120233224A1 (en) Data processing
CN113343064B (en) Data processing method, apparatus, device, storage medium, and computer program product
CN105229668A (en) Make the search that line pattern is represented that uses gesture
CN109753405B (en) Application resource consumption detection method and device, storage medium and electronic equipment
CN113641670B (en) Data storage and data retrieval method and device, electronic equipment and storage medium
CN117135151B (en) Fault detection method of GPU cluster, electronic equipment and storage medium
CN111815442B (en) Link prediction method and device and electronic equipment
US20240004765A1 (en) Data processing method and apparatus for distributed storage system, device, and storage medium
CN109240827B (en) Method and device for determining resource occupation condition of application, storage medium and equipment
CN115858324A (en) IT equipment fault processing method, device, equipment and medium based on AI
CN116187975A (en) Method and device for detecting running state of equipment, computer equipment and storage medium
CN116018604A (en) User constrained process mining
CN112131320A (en) Abnormal data detection method and device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220928

Address after: Room 2601 (Unit 07), Qianhai Free Trade Building, No. 3048, Xinghai Avenue, Nanshan Street, Qianhai Shenzhen-Hong Kong Cooperation Zone, Shenzhen, Guangdong 518000

Applicant after: Shenzhen Ping An Smart Healthcare Technology Co.,Ltd.

Address before: 1-34 / F, Qianhai free trade building, 3048 Xinghai Avenue, Mawan, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong 518000

Applicant before: Ping An International Smart City Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant