CN114996102A - System availability evaluation method, device, equipment and storage medium - Google Patents


Info

Publication number
CN114996102A
Authority
CN
China
Prior art keywords
availability
information
evaluated
network
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210783526.6A
Other languages
Chinese (zh)
Inventor
冯月玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Property and Casualty Insurance Company of China Ltd
Original Assignee
Ping An Property and Casualty Insurance Company of China Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Property and Casualty Insurance Company of China Ltd
Priority to CN202210783526.6A
Publication of CN114996102A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/30: Monitoring
    • G06F11/3003: Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006: Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/30: Monitoring
    • G06F11/3065: Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • G06F11/3072: Monitoring arrangements determined by the means or processing involved in reporting the monitored data where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00: Arrangements for monitoring or testing data switching networks
    • H04L43/08: Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823: Errors, e.g. transmission errors

Abstract

The invention relates to the field of monitoring and provides a method, an apparatus, a device and a storage medium for evaluating system availability. The method comprises: detecting a network service line and an operating server from a system log of a system to be evaluated; generating network availability according to network operation information of the network service line; generating server availability according to server information of the operating server; generating storage availability according to storage information of the system to be evaluated; generating database availability according to database information of the system to be evaluated; and evaluating the system availability based on the network availability, the server availability, the storage availability, the database availability, middleware availability, application availability and a fault analysis degree, so that the system availability can be evaluated accurately. In addition, the invention relates to blockchain technology: the system availability may be stored in a blockchain.

Description

System availability evaluation method, device, equipment and storage medium
Technical Field
The present invention relates to the field of monitoring technologies, and in particular, to a method, an apparatus, a device, and a storage medium for evaluating system availability.
Background
Currently, schemes for evaluating system availability usually calculate the availability directly from the abnormal period during which a failure occurs. However, such schemes consider only a single cause of failure, such as a failure caused by a code version problem or a failure caused by a network problem, whereas many factors influence the availability of a system. As a result, the availability of the system cannot be evaluated accurately.
Disclosure of Invention
In view of the foregoing, there is a need to provide a method, an apparatus, a device and a storage medium for evaluating system availability, which can solve the technical problem that the availability of the system cannot be accurately evaluated.
In one aspect, the present invention provides a system availability evaluation method, where the system availability evaluation method includes:
identifying a system to be evaluated according to the received system availability evaluation request;
detecting a network service line of the system to be evaluated from a system log of the system to be evaluated;
generating the network availability of the system to be evaluated according to the network operation information of the network service line;
detecting an operating server of the system to be evaluated from the system log;
generating the server availability of the system to be evaluated according to the server information of the operating server;
generating storage availability according to the storage information of the system to be evaluated, and generating database availability according to the database information of the system to be evaluated;
evaluating system availability of the system under evaluation based on the network availability, the server availability, the storage availability, the database availability, middleware availability generated by middleware information of the system under evaluation, application availability generated by application information of the system under evaluation, and a fault analysis degree generated by fault information of the system under evaluation.
According to a preferred embodiment of the present invention, the detecting the network service line of the system under evaluation from the system log of the system under evaluation comprises:
extracting response information received by the system to be evaluated from the system log;
extracting a node address from the response information;
determining a node corresponding to a node address successfully matched with a network address in a preset local area network as a service node;
and determining a line connecting the system to be evaluated and the service node as the network service line.
According to the preferred embodiment of the present invention, the network operation information includes the network maintenance times of the network service line, the line redundancy information of a dedicated line in the network service line, the life cycle of the service node, the emergency plan drilling information of the network service line, and the network monitoring coverage rate, and the generating the network availability of the system to be evaluated according to the network operation information of the network service line includes:
generating the line redundancy information based on the line request of each network service line and a preset line request threshold value;
counting the service duration of the service node according to the life cycle;
generating a drilling coefficient according to the emergency plan drilling information and preset drilling target information;
calculating the network monitoring coverage rate based on the total line area of the network service line and the monitoring area corresponding to the monitoring equipment of the network service line;
and calculating the network availability according to the network maintenance times, the line redundancy information, the service duration, the drilling coefficient and the network monitoring coverage rate.
According to a preferred embodiment of the present invention, the generating the line redundancy information based on the line request of each network service line and a preset line request threshold includes:
identifying a preset time period with the maximum total request amount as a peak time period according to the historical request amount of the preset local area network at a plurality of historical time points;
collecting the requests of each network service line during the peak time period as the line requests;
identifying the dedicated line from among the plurality of network service lines based on a sending address and a receiving address in the line request, and determining the line request corresponding to the dedicated line as a target request;
and generating the line redundancy information based on the request quantity of the target requests and the preset line request threshold value.
According to a preferred embodiment of the present invention, the server information includes a virtualization environment type of the operating server, power redundancy information, high-availability deployment information, and a server monitoring coverage rate, and the generating the server availability of the system to be evaluated according to the server information of the operating server includes:
identifying the power redundancy information according to the power supply configuration of the operating server;
identifying the high-availability deployment information based on the cluster in which the operating server is located;
calculating the server monitoring coverage rate based on the number of monitored operating servers and the total number of operating servers;
quantifying the server availability based on the virtualization environment type, the power redundancy information, the high-availability deployment information, and the server monitoring coverage rate.
According to a preferred embodiment of the present invention, the storage information includes a system available duration, a storage monitoring coverage rate, and a storage drilling result of the system to be evaluated, and the generating of the storage availability according to the storage information of the system to be evaluated includes:
acquiring the system occupied capacity of the system to be evaluated at a plurality of preset moments;
plotting a system capacity trend based on the system occupied capacity;
predicting, based on the trend slope of the system capacity trend, the moment at which the system occupied capacity reaches a capacity threshold value, as a prediction moment;
calculating the difference between the prediction moment and the latest of the preset moments as the system available duration;
calculating the storage monitoring coverage rate based on the number of monitored storage paths in the system to be evaluated and the total number of storage paths;
identifying the storage drilling result based on the abnormal storage drilling information of the system to be evaluated;
and quantifying the storage availability based on the system available duration, the storage monitoring coverage rate and the storage drilling result.
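The capacity-trend steps above can be illustrated with a minimal Python sketch. It assumes the trend is a straight line fitted by ordinary least squares, which the text does not specify; the function names `fit_trend`, `system_available_duration` and `storage_monitoring_coverage` are hypothetical.

```python
from typing import Sequence, Tuple


def fit_trend(moments: Sequence[float], capacities: Sequence[float]) -> Tuple[float, float]:
    """Fit capacity = slope * moment + intercept by least squares (assumed trend model)."""
    n = len(moments)
    mean_t = sum(moments) / n
    mean_c = sum(capacities) / n
    sxx = sum((t - mean_t) ** 2 for t in moments)
    sxy = sum((t - mean_t) * (c - mean_c) for t, c in zip(moments, capacities))
    slope = sxy / sxx
    return slope, mean_c - slope * mean_t


def system_available_duration(moments: Sequence[float],
                              capacities: Sequence[float],
                              capacity_threshold: float) -> float:
    """Prediction moment (capacity reaches the threshold) minus the latest preset moment."""
    slope, intercept = fit_trend(moments, capacities)
    if slope <= 0:
        # Assumption: a flat or shrinking trend never reaches the threshold.
        return float("inf")
    prediction_moment = (capacity_threshold - intercept) / slope
    return prediction_moment - max(moments)


def storage_monitoring_coverage(monitored_paths: int, total_paths: int) -> float:
    """Ratio of monitored storage paths to all storage paths."""
    if total_paths <= 0:
        raise ValueError("total_paths must be positive")
    return monitored_paths / total_paths
```

For occupied capacities of 10, 20, 30 and 40 units sampled at moments 0 to 3 and a threshold of 100 units, the fitted slope is 10 units per moment, so the threshold is predicted at moment 9 and the available duration is 6.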
According to the preferred embodiment of the present invention, the middleware information includes upgrade migration information, a middleware monitoring coverage rate and middleware emergency drilling information of the middleware in the system to be evaluated; the application information includes a circuit-breaking (fusing) and degradation mechanism of an application program in the system to be evaluated, fault repair information of the application program, application drilling information and an application monitoring coverage rate of the application program; and the fault information includes the actual availability ratio, the number of faults, the fault causes and the corresponding processing schemes of the system to be evaluated. The evaluating system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, middleware availability generated by middleware information of the system to be evaluated, application availability generated by application information of the system to be evaluated, and a fault analysis degree generated by fault information of the system to be evaluated comprises:
quantifying the middleware availability based on the upgrade migration information, the middleware monitoring coverage rate and the middleware emergency drilling information;
quantifying the application availability based on the circuit-breaking and degradation mechanism, the fault repair information, the application drilling information, and the application monitoring coverage rate;
calculating the fault analysis degree based on the actual availability ratio, the number of faults, the fault causes and the corresponding processing schemes;
and carrying out weighted sum operation on the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree to obtain the system availability.
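The final weighted-sum step can be sketched in Python as follows. The text does not give the weights or the scale of each score, so the dimension keys, the weights and the assumption that every score lies in [0, 1] are all illustrative.

```python
from typing import Mapping

# The seven dimensions named in the text; the key names are hypothetical.
DIMENSIONS = (
    "network", "server", "storage", "database",
    "middleware", "application", "fault_analysis",
)


def system_availability(scores: Mapping[str, float],
                        weights: Mapping[str, float]) -> float:
    """Weighted sum of the seven per-dimension scores."""
    missing = [d for d in DIMENSIONS if d not in scores or d not in weights]
    if missing:
        raise KeyError(f"missing dimensions: {missing}")
    return sum(weights[d] * scores[d] for d in DIMENSIONS)
```

A caller would typically choose weights that sum to 1 so that the result stays on the same scale as the individual scores.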
In another aspect, the present invention further provides a system availability evaluating apparatus, including:
the identification unit is used for identifying the system to be evaluated according to the received system availability evaluation request;
the detection unit is used for detecting the network service line of the system to be evaluated from the system log of the system to be evaluated;
the generating unit is used for generating the network availability of the system to be evaluated according to the network operation information of the network service line;
the detection unit is further used for detecting the running server of the system to be evaluated from the system log;
the generating unit is further used for generating the server availability of the system to be evaluated according to the server information of the operating server;
the generating unit is further used for generating storage availability according to the storage information of the system to be evaluated and generating database availability according to the database information of the system to be evaluated;
an evaluation unit, configured to evaluate system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, middleware availability generated by middleware information of the system to be evaluated, application availability generated by application information of the system to be evaluated, and a fault analysis degree generated by fault information of the system to be evaluated.
In another aspect, the present invention further provides an electronic device, including:
a memory storing computer readable instructions; and
a processor executing computer readable instructions stored in the memory to implement the system availability assessment method.
In another aspect, the present invention further provides a computer-readable storage medium, in which computer-readable instructions are stored, and the computer-readable instructions are executed by a processor in an electronic device to implement the system availability evaluation method.
According to the above technical scheme, the availability of the system to be evaluated is evaluated across multiple dimensions: the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree. By combining the network operation information of the network service line serving the system to be evaluated, the information of the server running the system to be evaluated, and the characteristic information of the system to be evaluated itself, the availability of the system can be reflected more comprehensively, which improves the accuracy of the evaluation. In addition, the network service line and the operating server can be accurately detected from the system log, which avoids misidentifying them and further improves the accuracy of the evaluation.
Drawings
FIG. 1 is a flow chart of a preferred embodiment of a system availability assessment method of the present invention.
FIG. 2 is a functional block diagram of a preferred embodiment of the system availability evaluating apparatus of the present invention.
Fig. 3 is a schematic structural diagram of an electronic device implementing a method for evaluating system availability according to a preferred embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
Fig. 1 is a flow chart of a method for evaluating the availability of a system according to a preferred embodiment of the present invention. The order of the steps in the flow chart may be changed and some steps may be omitted according to different needs.
The system availability evaluation method can acquire and process related data based on artificial intelligence technology. Artificial intelligence (AI) refers to the theories, methods, technologies and application systems that use a digital computer, or a machine controlled by a digital computer, to simulate, extend and expand human intelligence, sense the environment, acquire knowledge and use that knowledge to obtain the best result.
The artificial intelligence infrastructure generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, operation/interaction systems, mechatronics, and the like. Artificial intelligence software technology mainly comprises computer vision, robotics, biometric recognition, speech processing, natural language processing, machine learning/deep learning and the like.
The system availability evaluation method is applied to one or more electronic devices, which are devices capable of automatically performing numerical calculation and/or information processing according to computer readable instructions set or stored in advance, and the hardware thereof includes but is not limited to a microprocessor, an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like.
The electronic device may be any electronic product capable of performing human-computer interaction with a user, for example, a personal computer, a tablet computer, a smart phone, a Personal Digital Assistant (PDA), a game console, an Internet Protocol Television (IPTV), a smart wearable device, and the like.
The electronic device may include a network device and/or a user device. The network device includes, but is not limited to, a single network electronic device, an electronic device group consisting of a plurality of network electronic devices, or a cloud based on cloud computing consisting of a large number of hosts or network electronic devices.
The network in which the electronic device is located includes, but is not limited to: the Internet, a wide area network, a metropolitan area network, a local area network, a Virtual Private Network (VPN), and the like.
101, identifying a system to be evaluated according to the received system availability evaluation request.
In at least one embodiment of the invention, the system availability evaluation request may be triggered by an operation and maintenance user who maintains the system. The system availability evaluation request carries a system identifier.
The system to be evaluated is the system whose availability the operation and maintenance user needs to evaluate. The system to be evaluated may be any business system.
In at least one embodiment of the present invention, the electronic device, according to the received system availability evaluation request, identifying a system to be evaluated includes:
analyzing a system identifier from a request message of the system availability evaluation request;
and determining the system corresponding to the system identifier as the system to be evaluated.
The system identifier can be acquired quickly by parsing the request message directly, which improves the efficiency of determining the system to be evaluated; because the system identifier is unique, the accuracy of determining the system to be evaluated is improved as well.
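As an illustration of step 101, the following Python sketch parses a system identifier from a request message and resolves it to a system. The JSON message format, the `system_id` field name and the `KNOWN_SYSTEMS` registry are assumptions made for the example; the text does not specify any of them.

```python
import json

# Hypothetical registry mapping unique system identifiers to business systems.
KNOWN_SYSTEMS = {
    "claims-core": "Claims core system",
    "policy-admin": "Policy administration system",
}


def identify_system(request_message: str) -> str:
    """Parse the system identifier from the request message and look up the system."""
    payload = json.loads(request_message)
    system_id = payload["system_id"]  # assumed field name
    if system_id not in KNOWN_SYSTEMS:
        raise KeyError(f"unknown system identifier: {system_id}")
    return KNOWN_SYSTEMS[system_id]
```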
102, detecting a network service line of the system to be evaluated from a system log of the system to be evaluated.
In at least one embodiment of the present invention, the system log records operation information on the system under evaluation.
The network service line refers to a link for the system to be evaluated to perform network communication, and the network service line may include a dedicated line of the system to be evaluated, or may include a general line. Wherein the dedicated line is a communication line dedicated to the system under evaluation.
In at least one embodiment of the present invention, the electronic device detecting the network service line of the system under evaluation from the system log of the system under evaluation includes:
extracting response information received by the system to be evaluated from the system log;
extracting a node address from the response information;
determining a node corresponding to a node address successfully matched with a network address in a preset local area network as a service node;
and determining a line connecting the system to be evaluated and the service node as the network service line.
The preset local area network comprises local area networks corresponding to a plurality of operators.
The service node refers to a device node corresponding to the node address in the preset local area network.
The service node can be accurately identified through the matching relation between the network address in the preset local area network and the node address, so that the accuracy of the network service line is improved.
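The detection of service nodes in step 102 might look like the following Python sketch. The log line format (`RESPONSE from=<address>`), the IPv4-only matching and the example address ranges in `PRESET_LANS` are assumptions; only the overall flow (extract node addresses from responses, keep those that match a preset local area network, pair them with the system) comes from the text.

```python
import ipaddress
import re
from typing import List, Tuple

# Assumed preset local area networks, one per operator (documentation address ranges).
PRESET_LANS = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]

# Assumed log format for response entries.
RESPONSE_PATTERN = re.compile(r"RESPONSE\s+from=(\d{1,3}(?:\.\d{1,3}){3})")


def detect_service_nodes(system_log: str) -> List[str]:
    """Return node addresses from response entries that fall inside a preset LAN."""
    nodes: List[str] = []
    for match in RESPONSE_PATTERN.finditer(system_log):
        try:
            address = ipaddress.ip_address(match.group(1))
        except ValueError:
            continue  # skip malformed addresses such as 999.1.1.1
        if any(address in lan for lan in PRESET_LANS) and str(address) not in nodes:
            nodes.append(str(address))
    return nodes


def network_service_lines(system_id: str, system_log: str) -> List[Tuple[str, str]]:
    """Each line is modelled as a (system, service node) pair."""
    return [(system_id, node) for node in detect_service_nodes(system_log)]
```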
103, generating the network availability of the system to be evaluated according to the network operation information of the network service line.
In at least one embodiment of the present invention, the network operation information includes the network maintenance times of the network service line, line redundancy information of a dedicated line in the network service line, a life cycle of the service node, emergency plan drilling information of the network service line, and a network monitoring coverage rate.
The network maintenance times refer to the number of line upgrade versions of the network service line. The line redundancy information is determined according to the relation between the request quantity of the dedicated line in the peak time period and a preset line request threshold value. The life cycle refers to the whole process of the service node from being put into production to being shut down. The emergency plan drilling information refers to the drilling status of an emergency plan formulated for problems occurring in the network service line. The network monitoring coverage rate refers to the ratio of the monitoring area corresponding to the monitoring equipment of the network service line to the total line area of the network service line.
In at least one embodiment of the present invention, the generating, by the electronic device, the network availability of the system to be evaluated according to the network operation information of the network service line includes:
generating the line redundancy information based on the line request of each network service line and a preset line request threshold value;
counting the service duration of the service node according to the life cycle;
generating a drilling coefficient according to the emergency plan drilling information and preset configured drilling target information;
calculating the network monitoring coverage rate based on the total line area of the network service line and the monitoring area corresponding to the monitoring equipment of the network service line;
and calculating the network availability according to the network maintenance times, the line redundancy information, the service duration, the drilling coefficient and the network monitoring coverage rate.
The preset configured drilling target information refers to a target situation expected to be reached after a corresponding emergency plan is executed for a problem occurring in the network service line.
By combining the network maintenance times, the line redundancy information, the service duration, the drilling coefficient and the network monitoring coverage rate, the network availability can be accurately quantified.
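Three of the intermediate quantities in step 103 can be sketched in Python. The coverage ratio follows the definition given above; the drilling coefficient formula (share of preset drill targets actually met) and the use of days as the unit of service duration are assumptions, since the text does not define them. How the five inputs are finally combined into the network availability is also not specified and is left out.

```python
from datetime import datetime


def network_monitoring_coverage(monitored_area: float, total_line_area: float) -> float:
    """Ratio of the monitored area to the total line area of the network service line."""
    if total_line_area <= 0:
        raise ValueError("total_line_area must be positive")
    return monitored_area / total_line_area


def service_duration_days(in_production: datetime, as_of: datetime) -> float:
    """Time the service node has been in service, counted from its production date."""
    return (as_of - in_production).total_seconds() / 86400


def drilling_coefficient(targets_met: int, targets_total: int) -> float:
    """Assumed definition: share of preset drill targets that the drill achieved."""
    if targets_total <= 0:
        raise ValueError("targets_total must be positive")
    return targets_met / targets_total
```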
Specifically, the generating, by the electronic device, the line redundancy information based on the line request of each network service line and a preset line request threshold includes:
identifying a preset time period with the maximum total request amount as a peak time period according to the historical request amount of the preset local area network at a plurality of historical time points;
collecting the requests of each network service line during the peak time period as the line requests;
identifying the dedicated line from among the plurality of network service lines based on a sending address and a receiving address in the line request, and determining the line request corresponding to the dedicated line as a target request;
and generating the line redundancy information based on the request quantity of the target requests and the preset line request threshold value.
The plurality of historical time points and the preset time period can be set according to actual requirements. For example, the 24 hours of a day may be divided into preset time periods of ten minutes each.
The line request refers to a request for which an execution thread has already been allocated.
The dedicated line refers to a communication line whose sending address and receiving address correspond to the system to be evaluated and a service node in the preset local area network.
Setting the preset time period allows a peak time period of suitable length to be identified; the sending address and the receiving address allow the dedicated line to be identified accurately; and the relation between the target requests of the dedicated line in the peak time period and the preset line request threshold value allows the line redundancy information to be determined accurately.
Specifically, the generating, by the electronic device, the line redundancy information based on the request number of the target request and the preset line request threshold includes:
comparing the request quantity with the preset line request threshold value;
if the request quantity is larger than the preset line request threshold value, determining the line redundancy information as the existence of line redundancy; or
if the request quantity is less than or equal to the preset line request threshold value, obtaining the line configuration information of the dedicated line, and identifying the line redundancy information from the line configuration information.
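The peak-period and threshold logic described above can be sketched in Python as follows. Time is modelled as the minute of the day, requests as `(minute, sending address, receiving address)` tuples, and the redundancy read from the line configuration as a boolean; these representations and all function names are assumptions made for the example.

```python
from collections import Counter
from typing import Iterable, Set, Tuple

PERIOD_MINUTES = 10  # ten-minute preset time periods, as in the example above


def peak_period(history: Iterable[Tuple[int, int]]) -> int:
    """history holds (minute_of_day, request_count); return the busiest period index."""
    totals: Counter = Counter()
    for minute_of_day, request_count in history:
        totals[minute_of_day // PERIOD_MINUTES] += request_count
    return max(totals, key=lambda period: totals[period])


def is_dedicated(src: str, dst: str, system_addr: str, service_nodes: Set[str]) -> bool:
    """A dedicated-line request runs between the system and one of its service nodes."""
    return (src == system_addr and dst in service_nodes) or (
        dst == system_addr and src in service_nodes
    )


def line_redundancy(requests: Iterable[Tuple[int, str, str]], period: int,
                    system_addr: str, service_nodes: Set[str],
                    threshold: int, configured_redundancy: bool) -> bool:
    """Apply the threshold rule to the dedicated-line requests of the peak period."""
    target_requests = [
        (minute, src, dst)
        for minute, src, dst in requests
        if minute // PERIOD_MINUTES == period
        and is_dedicated(src, dst, system_addr, service_nodes)
    ]
    if len(target_requests) > threshold:
        return True  # above the threshold: line redundancy is determined to exist
    return configured_redundancy  # otherwise fall back to the line configuration
```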
104, detecting the running server of the system to be evaluated from the system log.
In at least one embodiment of the invention, the running server provides computing or application services for the system to be evaluated in a network.
In at least one embodiment of the present invention, the electronic device extracts a server address from the system log, and determines a server corresponding to the server address as the running server.
105, generating the server availability of the system to be evaluated according to the server information of the running server.
In at least one embodiment of the invention, the server information comprises a virtualization environment type of the running server, power redundancy information, high-availability deployment information and a server monitoring coverage rate.
The virtualization environment types include a container, a virtual machine, a physical machine and the like. The virtualization environment type is identified as follows: detecting whether a loaded kernel file exists in the running server; if no loaded kernel file exists in the running server, determining that the virtualization environment type is a container; if a loaded kernel file exists in the running server, running the systemd-detect-virt command in the running server to obtain a running result; if the running result is a first preset result, determining that the virtualization environment type is a physical machine; and if the running result is a second preset result, determining that the virtualization environment type is a virtual machine. The first preset result is usually set to none.
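The decision flow for the virtualization environment type can be sketched in Python. How the presence of a loaded kernel file is detected is not specified in the text, so it is passed in as a boolean; treating every result other than "none" as the second preset result is likewise an assumption. The helper `run_detect_virt` simply wraps the `systemd-detect-virt` command, which prints "none" when no virtualization is detected, and only works on hosts where that command is installed.

```python
import subprocess


def run_detect_virt() -> str:
    """Run systemd-detect-virt and return its printed result.

    The command exits with a non-zero status when it prints "none",
    so the exit status is deliberately not checked here.
    """
    result = subprocess.run(["systemd-detect-virt"], capture_output=True, text=True)
    return result.stdout.strip()


def classify_environment(kernel_file_loaded: bool, detect_virt_result: str = "") -> str:
    """Map the two checks described above to a virtualization environment type."""
    if not kernel_file_loaded:
        return "container"
    if detect_virt_result.strip() == "none":  # first preset result
        return "physical machine"
    return "virtual machine"  # assumed: any other result counts as the second preset result
```

On a target host, the two pieces would be combined as `classify_environment(kernel_file_loaded, run_detect_virt())`.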
In at least one embodiment of the present invention, the generating, by the electronic device, the server availability of the system to be evaluated according to the server information of the operating server includes:
identifying the power supply redundancy information according to the power supply configuration of the running server;
identifying the high-availability deployment information based on the cluster in which the running server is located;
calculating the server monitoring coverage rate based on the number of monitored running servers and the total number of running servers;
quantifying the server availability based on the virtualization environment type, the power redundancy information, the high-availability deployment information, and the server monitoring coverage rate.
The power supply configuration may include a 1+1 power supply configuration, which indicates that the running server can work normally with one power supply module; if two power supply modules are configured, one of them is a redundant power supply.
The power redundancy information is determined according to the power supply configuration and the number of power supply modules in the running server. For example, if the power supply configuration is 1+1 and the number of power supply modules is 2, the power redundancy information is determined to be that a redundant power supply exists.
The high-availability deployment information is determined based on the number of servers included in the cluster.
The server monitoring coverage rate refers to the ratio of the number of monitored running servers to the total number of running servers.
The server availability is obtained by quantifying the virtualization environment type, the power redundancy information, the high-availability deployment information and the server monitoring coverage rate and computing their weighted sum.
By combining the virtualization environment type, the power redundancy information, the high-availability deployment information and the server monitoring coverage rate, the server availability can be analyzed from multiple dimensions, which improves the accuracy of the server availability.
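A possible quantification of step 105 is sketched below in Python. The 1+1 power rule and the coverage ratio follow the text; the per-type scores in `ENV_SCORES`, the rule that a cluster of two or more servers counts as a high-availability deployment, and the equal weights are illustrative assumptions only.

```python
# Assumed scores per virtualization environment type (not given in the text).
ENV_SCORES = {"container": 1.0, "virtual machine": 0.8, "physical machine": 0.6}

# Assumed equal weights for the four inputs.
WEIGHTS = {"environment": 0.25, "power": 0.25, "deployment": 0.25, "coverage": 0.25}


def has_redundant_power(power_config: str, module_count: int) -> bool:
    """Text example: a 1+1 configuration with 2 modules means a redundant supply exists."""
    return power_config == "1+1" and module_count >= 2


def is_high_availability(cluster_size: int) -> bool:
    """Assumed rule: at least two servers in the cluster."""
    return cluster_size >= 2


def server_monitoring_coverage(monitored_servers: int, total_servers: int) -> float:
    """Ratio of monitored running servers to all running servers."""
    if total_servers <= 0:
        raise ValueError("total_servers must be positive")
    return monitored_servers / total_servers


def server_availability(env_type: str, power_config: str, module_count: int,
                        cluster_size: int, monitored_servers: int,
                        total_servers: int) -> float:
    """Weighted sum of the four quantified inputs."""
    return (
        WEIGHTS["environment"] * ENV_SCORES[env_type]
        + WEIGHTS["power"] * float(has_redundant_power(power_config, module_count))
        + WEIGHTS["deployment"] * float(is_high_availability(cluster_size))
        + WEIGHTS["coverage"] * server_monitoring_coverage(monitored_servers, total_servers)
    )
```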
106, generating storage availability according to the storage information of the system to be evaluated, and generating database availability according to the database information of the system to be evaluated.
In at least one embodiment of the present invention, the storage information includes a system available duration, a storage monitoring coverage rate and a storage drilling result of the system to be evaluated.
The database information includes the database monitoring coverage rate, the number of database optimization rules, the database disaster recovery standard-reaching information and the database drilling information of a database in the system to be evaluated. The database monitoring coverage rate is generated in a manner similar to the server monitoring coverage rate. The number of database optimization rules can be obtained by directly counting the optimization rules of the database. The database disaster recovery standard-reaching information refers to the difference between the actual disaster recovery information and the target disaster recovery information of the database. The database drilling information is generated in a manner similar to the drilling coefficient, and details are not repeated here.
In at least one embodiment of the present invention, the generating, by the electronic device, the storage availability according to the storage information of the system to be evaluated includes:
acquiring system occupation capacity of the system to be evaluated at a plurality of preset moments;
plotting a system capacity trend based on the system occupancy capacity;
predicting, based on the trend slope of the system capacity trend, the moment at which the system occupied capacity reaches a capacity threshold value, as a predicted moment;
calculating the difference between the predicted moment and the latest of the preset moments as the system available duration;
calculating the storage monitoring coverage rate based on the monitoring number of the storage paths in the system to be evaluated and the path number of the storage paths;
identifying a storage drilling result based on the storage abnormal drilling information of the system to be evaluated;
and quantifying the storage availability based on the system available duration, the storage monitoring coverage rate and the storage drilling result.
Wherein, the capacity threshold refers to the maximum available capacity of the system to be evaluated.
The storage monitoring coverage rate refers to a ratio of the monitored number of the storage paths to the number of the paths.
If the storage abnormal drilling information is the same as the expected storage drilling information, the storage drilling result is normal and is generally represented by 1. If the storage abnormal drilling information differs from the expected storage drilling information, the storage drilling result is abnormal and is generally represented by 0.
The storage availability is obtained by quantifying the system available duration, the storage monitoring coverage rate and the storage drilling result, and then weighting the quantified values.
By combining the system available duration, the storage monitoring coverage rate and the storage drilling result, the storage availability can be accurately quantified.
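The capacity prediction steps above can be illustrated with a short sketch. It assumes that the capacity trend is a least-squares straight line through the sampled points, which is one possible reading of "trend slope"; the function names are hypothetical.

```python
from typing import Sequence, Tuple


def trend_line(times: Sequence[float], usage: Sequence[float]) -> Tuple[float, float]:
    """Fit a least-squares line through (preset moment, occupied capacity) samples."""
    n = len(times)
    mean_t = sum(times) / n
    mean_u = sum(usage) / n
    var_t = sum((t - mean_t) ** 2 for t in times)
    if var_t == 0:
        raise ValueError("at least two distinct preset moments are required")
    cov = sum((t - mean_t) * (u - mean_u) for t, u in zip(times, usage))
    slope = cov / var_t
    return slope, mean_u - slope * mean_t


def system_available_duration(times: Sequence[float],
                              usage: Sequence[float],
                              capacity_threshold: float) -> float:
    slope, intercept = trend_line(times, usage)
    if slope <= 0:
        # Occupied capacity is not growing, so the threshold is never reached.
        return float("inf")
    predicted_moment = (capacity_threshold - intercept) / slope
    # Difference between the predicted moment and the latest preset moment.
    return predicted_moment - max(times)
```

For example, with occupied capacity of 10, 20, 30 and 40 units at moments 0 to 3 and a threshold of 100 units, the fitted slope is 10 units per moment, the predicted moment is 9, and the system available duration is 6.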
In at least one embodiment of the present invention, the generating, by the electronic device, database availability according to the database information of the system to be evaluated includes:
performing a weighted calculation on the database monitoring coverage rate, the database optimization rule quantity, the database disaster recovery standard-reaching information and the database drilling information to obtain the database availability.
107, evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated and the fault analysis degree generated by the fault information of the system to be evaluated.
It should be emphasized that, to further ensure the privacy and security of the system availability, the system availability may also be stored in a node of a blockchain.
In at least one embodiment of the present invention, the middleware information includes upgrade migration information, a middleware monitoring coverage rate and middleware emergency drilling information of the middleware in the system to be evaluated. The application information includes a fusing degradation mechanism of an application in the system to be evaluated, fault repair information of the application, application drilling information of the application and an application monitoring coverage rate. The fault information includes the actual availability ratio, the number of faults, the fault reasons and the corresponding processing schemes of the system to be evaluated.
In at least one embodiment of the present invention, the electronic device evaluating system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated includes:
quantifying the middleware availability based on the upgrade migration information, the middleware monitoring coverage and the middleware emergency drilling information;
quantifying the application availability based on the fusing degradation mechanism, the fault repair information, the application drilling information, and the application monitoring coverage;
calculating the fault analysis degree based on the actual availability ratio, the fault times, the fault reasons and the corresponding processing scheme;
and carrying out weighted sum operation on the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree to obtain the system availability.
Evaluating the system to be evaluated across all of these dimensions improves the accuracy of the system availability.
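The weighted sum operation over the seven dimensions might look like the sketch below. The weight values are hypothetical, since the text does not specify them, and each dimension score is assumed to have already been quantified into the range 0 to 1.

```python
from typing import Mapping

# Hypothetical weights for the seven dimensions; the source gives no values.
DIMENSION_WEIGHTS = {
    "network": 0.20,
    "server": 0.20,
    "storage": 0.15,
    "database": 0.15,
    "middleware": 0.10,
    "application": 0.10,
    "fault_analysis": 0.10,
}


def system_availability(scores: Mapping[str, float],
                        weights: Mapping[str, float] = DIMENSION_WEIGHTS) -> float:
    """Weighted sum of the per-dimension scores."""
    missing = set(weights) - set(scores)
    if missing:
        raise ValueError(f"missing dimension scores: {sorted(missing)}")
    return sum(weights[name] * scores[name] for name in weights)
```

Rejecting an incomplete score set, rather than silently treating a missing dimension as zero, keeps a gap in data collection from being mistaken for poor availability.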
According to the technical scheme, the availability of the system to be evaluated is evaluated by combining multiple dimensions such as the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree, and the availability of the system to be evaluated can be more comprehensively reflected by combining the network operation information of a network service line serving the system to be evaluated, the information of a server operating the system to be evaluated and the characteristic information of the system to be evaluated, so that the evaluation accuracy of the system availability is improved. In addition, the network service line and the running server can be accurately detected through the system log, and the error identification of the network service line and the running server is avoided, so that the evaluation accuracy of the system availability can be further improved.
Fig. 2 is a functional block diagram of a preferred embodiment of the system availability evaluating apparatus according to the present invention. The system availability evaluation apparatus 11 includes a recognition unit 110, a detection unit 111, a generation unit 112, and an evaluation unit 113. A module/unit as referred to herein is a series of computer readable instruction segments capable of being retrieved by the processor 13 and performing a fixed function, and stored in the memory 12. In the present embodiment, the functions of the modules/units will be described in detail in the following embodiments.
The identification unit 110 identifies a system to be evaluated based on the received system availability evaluation request.
In at least one embodiment of the invention, the system availability assessment request may be generated by an operation and maintenance user who maintains the system. The system availability evaluation request carries a system identifier.
The system to be evaluated is the system whose availability the operation and maintenance user needs to evaluate. The system to be evaluated may be any business system.
In at least one embodiment of the present invention, the identifying unit 110, according to the received system availability evaluation request, identifies the system to be evaluated, including:
analyzing a system identifier from a request message of the system availability evaluation request;
and determining the system corresponding to the system identification as the system to be evaluated.
The system identification can be quickly acquired by directly analyzing the request message, so that the determination efficiency of the system to be evaluated is improved, and meanwhile, the determination accuracy of the system to be evaluated can be improved due to the uniqueness of the system identification.
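As a minimal sketch of parsing the system identifier, the snippet below assumes that the request message is a JSON document carrying the identifier in a `system_id` field and that known systems are kept in a lookup table. The message format, the field name and the registry contents are all hypothetical.

```python
import json

# Hypothetical registry mapping system identifiers to business systems.
SYSTEM_REGISTRY = {
    "SYS-001": "claims-settlement",
    "SYS-002": "policy-issuance",
}


def identify_system(request_message: str) -> str:
    """Parse the system identifier from the request message and resolve it."""
    payload = json.loads(request_message)
    system_id = payload["system_id"]  # assumed field name
    if system_id not in SYSTEM_REGISTRY:
        raise KeyError(f"unknown system identifier: {system_id}")
    return SYSTEM_REGISTRY[system_id]
```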
The detection unit 111 detects the network service line of the system to be evaluated from the system log of the system to be evaluated.
In at least one embodiment of the present invention, the system log records operation information on the system under evaluation.
The network service line refers to a link for performing network communication by the system to be evaluated, and the network service line may include a dedicated line of the system to be evaluated, or may include a general line. Wherein the dedicated line is a communication line dedicated to the system under evaluation.
In at least one embodiment of the present invention, the detecting unit 111 detects the network service line of the system to be evaluated from the system log of the system to be evaluated includes:
extracting response information received by the system to be evaluated from the system log;
extracting a node address from the response information;
determining a node corresponding to a node address successfully matched with a network address in a preset local area network as a service node;
and determining a line connected with the system to be evaluated and the service node as the network service line.
The preset local area network comprises local area networks corresponding to a plurality of operators.
The service node refers to a device node corresponding to the node address in the preset local area network.
The service node can be accurately identified through the matching relation between the network address in the preset local area network and the node address, so that the accuracy of the network service line is improved.
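The matching between node addresses and the preset local area network can be sketched with Python's standard `ipaddress` module. The log line format, the regular expression and the network ranges below are assumptions made for illustration only.

```python
import ipaddress
import re
from typing import Iterable, List

# Hypothetical preset local area networks, one per operator.
PRESET_LANS = [
    ipaddress.ip_network("10.1.0.0/16"),
    ipaddress.ip_network("10.2.0.0/16"),
]

# Assumed log format: a response line contains "response from <IPv4 address>".
RESPONSE_PATTERN = re.compile(r"response from (\d{1,3}(?:\.\d{1,3}){3})")


def find_service_nodes(log_lines: Iterable[str]) -> List[str]:
    """Return node addresses from response lines that fall inside a preset LAN."""
    nodes: List[str] = []
    for line in log_lines:
        match = RESPONSE_PATTERN.search(line)
        if not match:
            continue  # not response information
        try:
            address = ipaddress.ip_address(match.group(1))
        except ValueError:
            continue  # digits matched but not a valid address, e.g. 999.1.1.1
        in_lan = any(address in lan for lan in PRESET_LANS)
        if in_lan and str(address) not in nodes:
            nodes.append(str(address))
    return nodes
```

Each returned address identifies a service node, and the line between the system to be evaluated and that node is then treated as a network service line.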
The generating unit 112 generates the network availability of the system to be evaluated according to the network operation information of the network service line.
In at least one embodiment of the present invention, the network operation information includes the network maintenance times of the network service line, line redundancy information of a dedicated line in the network service line, a life cycle of the service node, emergency plan drilling information of the network service line, and a network monitoring coverage rate.
The network maintenance times refer to the number of line upgrade versions of the network service line. The line redundancy information is determined according to the magnitude relation between the request quantity of the dedicated line in the peak period and a preset line request threshold value. The life cycle refers to the whole process of the service node from being put into production to going offline. The emergency plan drilling information refers to the drilling situation of an emergency plan formulated for a problem occurring in the network service line. The network monitoring coverage rate refers to the ratio of the monitoring area corresponding to the monitoring device of the network service line to the total line area of the network service line.
In at least one embodiment of the present invention, the generating unit 112 generates the network availability of the system to be evaluated according to the network operation information of the network service line, including:
generating the line redundancy information based on the line request of each network service line and a preset line request threshold value;
counting the service duration of the service node according to the life cycle;
generating a drilling coefficient according to the emergency plan drilling information and preset configured drilling target information;
calculating the network monitoring coverage rate based on the total line area of the network service line and the monitoring area corresponding to the monitoring equipment of the network service line;
and calculating the network availability according to the network maintenance times, the line redundant information, the service duration, the drilling coefficient and the network monitoring coverage rate.
The preset configured drilling target information refers to a target situation expected to be reached after a corresponding emergency plan is executed for a problem occurring in the network service line.
By combining the network maintenance times, the line redundancy information, the service duration, the drilling coefficient and the network monitoring coverage rate, the network availability can be accurately quantified.
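Two of the inputs above, the drilling coefficient and the network monitoring coverage rate, can be sketched as simple ratios. Treating the drilling coefficient as the share of preset drilling targets that the drill actually met is an assumption; the text says only that the coefficient is generated from the drilling information and the target information.

```python
from typing import Mapping


def drill_coefficient(drill_results: Mapping[str, str],
                      drill_targets: Mapping[str, str]) -> float:
    """Share of preset drilling targets that the emergency plan drill met."""
    if not drill_targets:
        return 0.0
    met = sum(1 for item, target in drill_targets.items()
              if drill_results.get(item) == target)
    return met / len(drill_targets)


def network_monitoring_coverage(monitored_area: float, total_area: float) -> float:
    """Ratio of the monitored area to the total line area, capped at 1."""
    if total_area <= 0:
        raise ValueError("total line area must be positive")
    return min(monitored_area / total_area, 1.0)
```

For instance, a drill that meets one of two targets yields a coefficient of 0.5, and monitoring 30 of 40 area units yields a coverage rate of 0.75.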
Specifically, the generating unit 112 generates the line redundancy information based on the line request of each network service line and a preset line request threshold, including:
identifying a preset time period with the maximum request total amount as a peak period according to the historical request amount of the preset local area network at a plurality of historical time points;
collecting the requests of each network service line in the peak period as the line requests;
identifying the dedicated line from among the plurality of network service lines based on an outgoing address and an incoming address in the line request, and determining a line request corresponding to the dedicated line as a target request;
and generating the line redundancy information based on the request quantity of the target requests and the preset line request threshold value.
The plurality of historical time points and the preset time period can be set according to actual requirements. For example, the 24 hours of a day may be divided into preset time periods of ten minutes each.
The line request refers to a request that has completed the allocation of the execution thread.
The dedicated line refers to a communication line whose outgoing address and incoming address correspond to the system to be evaluated and a service node in the preset local area network.
Setting the preset time period makes it possible to identify a peak period of the required length, the outgoing address and the incoming address allow the dedicated line to be accurately identified, and the line redundancy information can then be accurately determined from the relation between the request quantity of the target requests of the dedicated line in the peak period and the preset line request threshold value.
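Peak period identification can be sketched by bucketing the historical request amounts into ten-minute periods of the day and picking the bucket with the largest total. Aggregating by time of day across different dates is an assumption, as is the shape of the input data.

```python
from collections import Counter
from datetime import datetime
from typing import Iterable, Tuple

PERIOD_MINUTES = 10  # length of each preset time period, per the ten-minute example


def peak_period(history: Iterable[Tuple[datetime, int]]) -> Tuple[int, int]:
    """Return (hour, minute) of the start of the period with the largest total.

    `history` holds (historical time point, request amount) pairs.
    """
    totals: Counter = Counter()
    for timestamp, amount in history:
        bucket_minute = (timestamp.minute // PERIOD_MINUTES) * PERIOD_MINUTES
        totals[(timestamp.hour, bucket_minute)] += amount
    if not totals:
        raise ValueError("no historical request amounts supplied")
    return max(totals, key=totals.get)
```

With samples of 100 and 150 requests at 09:03 and 09:08, 200 requests at 14:25, and 120 requests at 09:01 on the following day, the 09:00 period totals 370 and is selected as the peak period.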
Specifically, the generating unit 112 generates the line redundancy information based on the request number of the target request and the preset line request threshold includes:
comparing the request quantity with the preset line request threshold value;
if the request quantity is larger than the preset line request threshold value, determining the line redundancy information as the existence of line redundancy; or
if the request quantity is less than or equal to the preset line request threshold value, obtaining the line configuration information of the dedicated line, and identifying the line redundancy information from the line configuration information.
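The dedicated line identification and the threshold comparison above can be combined in a brief sketch. The request dictionaries with `outgoing` and `incoming` keys and the `redundant` field in the line configuration are hypothetical data shapes.

```python
from typing import Iterable, Mapping, Set


def is_dedicated_line(request: Mapping[str, str],
                      system_address: str,
                      service_nodes: Set[str]) -> bool:
    """One endpoint is the system to be evaluated, the other a service node."""
    out_addr, in_addr = request["outgoing"], request["incoming"]
    return ((out_addr == system_address and in_addr in service_nodes)
            or (in_addr == system_address and out_addr in service_nodes))


def line_redundancy_exists(line_requests: Iterable[Mapping[str, str]],
                           system_address: str,
                           service_nodes: Set[str],
                           threshold: int,
                           line_config: Mapping[str, bool]) -> bool:
    target_requests = [r for r in line_requests
                       if is_dedicated_line(r, system_address, service_nodes)]
    # Per the comparison step: above the threshold, redundancy is taken to exist.
    if len(target_requests) > threshold:
        return True
    # Otherwise read it from the line configuration ("redundant" is an assumed field).
    return bool(line_config.get("redundant", False))
```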
The detection unit 111 detects the running server of the system to be evaluated from the system log.
In at least one embodiment of the invention, the running server provides computing or application services for the system to be evaluated in a network.
In at least one embodiment of the present invention, the detection unit 111 extracts a server address from the system log, and determines a server corresponding to the server address as the running server.
The generating unit 112 generates the server availability of the system to be evaluated according to the server information of the running server.
In at least one embodiment of the invention, the server information comprises a virtualization environment type of the running server, power redundancy information, high-availability deployment information and a server monitoring coverage rate.
The virtualization environment types include a container, a virtual machine, a physical machine and the like. The virtualization environment type is identified as follows: detecting whether a loaded kernel file exists in the running server; if no loaded kernel file exists in the running server, determining that the virtualization environment type is a container; if a loaded kernel file exists in the running server, running the systemd-detect-virt command in the running server to obtain a running result; if the running result is a first preset result, determining that the virtualization environment type is a physical machine; and if the running result is a second preset result, determining that the virtualization environment type is a virtual machine. The first preset result is usually set to none.
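The identification flow above can be sketched as a small decision function. The `systemd-detect-virt` utility prints `none` when it detects no virtualization. How the loaded kernel file is detected is not specified in the text, so that check is passed in as a boolean, and treating every result other than `none` as the second preset result is a simplifying assumption.

```python
import subprocess


def classify_environment(kernel_file_loaded: bool, detect_virt_output: str) -> str:
    """Map the two checks described above to a virtualization environment type."""
    if not kernel_file_loaded:
        return "container"
    if detect_virt_output.strip() == "none":  # first preset result
        return "physical_machine"
    # Assumption: any other result is treated as the second preset result.
    return "virtual_machine"


def run_detect_virt() -> str:
    """Run systemd-detect-virt and return its standard output.

    The command exits with a non-zero status when it prints "none",
    so the return code is deliberately not checked here.
    """
    result = subprocess.run(["systemd-detect-virt"],
                            capture_output=True, text=True)
    return result.stdout
```

On a host with systemd installed, `classify_environment(True, run_detect_virt())` applies the flow end to end; the classification itself can be exercised without systemd by passing sample output such as `"kvm\n"`.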
In at least one embodiment of the present invention, the generating unit 112 generates the server availability of the system to be evaluated according to the server information of the running server, including:
identifying the power supply redundancy information according to the power supply configuration of the running server;
identifying the high-availability deployment information based on the cluster in which the running server is located;
calculating the server monitoring coverage rate based on the monitoring number of the running servers and the number of the running servers;
quantifying the server availability based on the virtualization environment type, the power redundancy information, the high availability deployment information, and the server monitoring coverage.
The power supply configuration may include a 1+1 power supply configuration, in which a single power supply module is sufficient for the running server to work normally; if two power supply modules are configured, one of them serves as a redundant power supply.
The power supply redundancy information is determined according to the power supply configuration and the number of power supply modules in the running server. For example, if the power supply configuration is 1+1 and the number of power supply modules is 2, the power supply redundancy information is determined to be that a redundant power supply exists.
The high-availability deployment information is determined based on the number of servers included in the cluster.
The server monitoring coverage rate refers to the ratio of the number of monitored running servers to the total number of running servers.
The server availability is obtained by quantifying the virtualization environment type, the power redundancy information, the high-availability deployment information and the server monitoring coverage rate, and then weighting the quantified values.
By combining the virtualization environment type, the power redundancy information, the high-availability deployment information and the server monitoring coverage rate, the server availability can be analyzed from multiple dimensions, and the accuracy of the server availability is improved.
The generating unit 112 generates storage availability according to the storage information of the system to be evaluated, and generates database availability according to the database information of the system to be evaluated.
In at least one embodiment of the present invention, the storage information includes a system available duration, a storage monitoring coverage rate and a storage drilling result of the system to be evaluated.
The database information includes the database monitoring coverage rate, the number of database optimization rules, the database disaster recovery standard-reaching information and the database drilling information of a database in the system to be evaluated. The database monitoring coverage rate is generated in a manner similar to the server monitoring coverage rate. The number of database optimization rules can be obtained by directly counting the optimization rules of the database. The database disaster recovery standard-reaching information refers to the difference between the actual disaster recovery information and the target disaster recovery information of the database. The database drilling information is generated in a manner similar to the drilling coefficient, and details are not repeated here.
In at least one embodiment of the present invention, the generating unit 112 generates storage availability according to the storage information of the system to be evaluated, including:
acquiring system occupation capacity of the system to be evaluated at a plurality of preset moments;
plotting a system capacity trend based on the system occupancy capacity;
predicting, based on the trend slope of the system capacity trend, the moment at which the system occupied capacity reaches a capacity threshold value, as a predicted moment;
calculating the difference between the predicted moment and the latest of the preset moments as the system available duration;
calculating the storage monitoring coverage rate based on the monitoring number of the storage paths in the system to be evaluated and the path number of the storage paths;
identifying a storage drilling result based on the storage abnormal drilling information of the system to be evaluated;
and quantifying the storage availability based on the system available duration, the storage monitoring coverage rate and the storage drilling result.
Wherein, the capacity threshold refers to the maximum available capacity of the system to be evaluated.
The storage monitoring coverage rate refers to a ratio of the monitored number of the storage paths to the number of the paths.
If the storage abnormal drilling information is the same as the expected storage drilling information, the storage drilling result is normal and is generally represented by 1. If the storage abnormal drilling information differs from the expected storage drilling information, the storage drilling result is abnormal and is generally represented by 0.
The storage availability is obtained by quantifying the system available duration, the storage monitoring coverage rate and the storage drilling result, and then weighting the quantified values.
By combining the system available duration, the storage monitoring coverage rate and the storage drilling result, the storage availability can be accurately quantified.
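One way to combine the three storage factors is sketched below. The weights, the use of days as the unit of duration, and the normalization of the duration against a one-year horizon are illustrative assumptions; the text specifies only the 1/0 encoding of the drilling result and the ratio form of the coverage rate.

```python
def storage_monitoring_coverage(monitored_paths: int, total_paths: int) -> float:
    """Ratio of monitored storage paths to all storage paths."""
    return monitored_paths / total_paths if total_paths else 0.0


def storage_drill_result(actual: str, expected: str) -> int:
    """1 when the drilling information matches the expectation, otherwise 0."""
    return 1 if actual == expected else 0


def storage_availability(available_days: float,
                         coverage: float,
                         drill_result: int,
                         horizon_days: float = 365.0,
                         weights=(0.4, 0.3, 0.3)) -> float:
    # Assumption: durations at or beyond the horizon earn the full duration score.
    duration_score = min(max(available_days, 0.0) / horizon_days, 1.0)
    w_duration, w_coverage, w_drill = weights
    return (w_duration * duration_score
            + w_coverage * coverage
            + w_drill * drill_result)
```

Under these assumed weights, a system with two years of remaining capacity, 3 of 4 storage paths monitored and a normal drilling result scores 0.4 + 0.3 × 0.75 + 0.3 = 0.925.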
In at least one embodiment of the present invention, the generating unit 112 generates database availability according to the database information of the system to be evaluated, including:
performing a weighted calculation on the database monitoring coverage rate, the database optimization rule quantity, the database disaster recovery standard-reaching information and the database drilling information to obtain the database availability.
The evaluation unit 113 evaluates the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the failure analysis degree generated by the failure information of the system to be evaluated.
It should be emphasized that, to further ensure the privacy and security of the system availability, the system availability may also be stored in a node of a blockchain.
In at least one embodiment of the present invention, the middleware information includes upgrade migration information, a middleware monitoring coverage rate and middleware emergency drilling information of the middleware in the system to be evaluated. The application information includes a fusing degradation mechanism of an application in the system to be evaluated, fault repair information of the application, application drilling information of the application and an application monitoring coverage rate. The fault information includes the actual availability ratio, the number of faults, the fault reasons and the corresponding processing schemes of the system to be evaluated.
In at least one embodiment of the present invention, the evaluating unit 113 evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated includes:
quantifying the middleware availability based on the upgrade migration information, the middleware monitoring coverage and the middleware emergency drilling information;
quantifying the application availability based on the fusing degradation mechanism, the fault remediation information, the application drilling information, and the application monitoring coverage;
calculating the fault analysis degree based on the actual availability ratio, the fault times, the fault reasons and the corresponding processing scheme;
and carrying out weighted sum operation on the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree to obtain the system availability.
Evaluating the system to be evaluated across all of these dimensions improves the accuracy of the system availability.
According to the technical scheme, the availability of the system to be evaluated is evaluated by combining multiple dimensions such as the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree, the availability of the system to be evaluated can be more comprehensively reflected by combining the network operation information of the network service line serving the system to be evaluated, the information of the server operating the system to be evaluated and the characteristic information of the system to be evaluated, and therefore the evaluation accuracy of the system availability is improved. In addition, the network service line and the running server can be accurately detected through the system log, and the error identification of the network service line and the running server is avoided, so that the evaluation accuracy of the system availability can be further improved.
Fig. 3 is a schematic structural diagram of an electronic device implementing a method for evaluating system availability according to a preferred embodiment of the present invention.
In one embodiment of the present invention, the electronic device 1 includes, but is not limited to, a memory 12, a processor 13, and computer readable instructions, such as a system availability assessment program, stored in the memory 12 and executable on the processor 13.
It will be appreciated by those skilled in the art that the schematic diagram is merely an example of the electronic device 1 and does not constitute a limitation of the electronic device 1. The electronic device 1 may include more or fewer components than those shown, may combine some components, or may use different components; for example, the electronic device 1 may further include an input-output device, a network access device, a bus, etc.
The Processor 13 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component, etc. The processor 13 is an operation core and a control center of the electronic device 1, and is connected to each part of the whole electronic device 1 by various interfaces and lines, and executes an operating system of the electronic device 1 and various installed application programs, program codes, and the like.
Illustratively, the computer readable instructions may be partitioned into one or more modules/units that are stored in the memory 12 and executed by the processor 13 to implement the present invention. The one or more modules/units may be a series of computer readable instruction segments capable of performing specific functions, which are used for describing the execution process of the computer readable instructions in the electronic device 1. For example, the computer readable instructions may be divided into an identification unit 110, a detection unit 111, a generation unit 112 and an evaluation unit 113.
The memory 12 may be used for storing the computer readable instructions and/or modules, and the processor 13 implements various functions of the electronic device 1 by running or executing the computer readable instructions and/or modules stored in the memory 12 and invoking data stored in the memory 12. The memory 12 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the data storage area may store data created according to use of the electronic device, and the like. The memory 12 may include non-volatile and volatile memories, such as: a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other storage device.
The memory 12 may be an external memory and/or an internal memory of the electronic device 1. Further, the memory 12 may be a memory having a physical form, such as a memory stick, a TF card (TransFlash card), or the like.
The integrated modules/units of the electronic device 1 may be stored in a computer-readable storage medium if they are implemented in the form of software functional units and sold or used as separate products. Based on this understanding, all or part of the flow of the method embodiments described above may be implemented by computer readable instructions that instruct the related hardware. The computer readable instructions may be stored in a computer-readable storage medium, and when executed by a processor, they implement the steps of the method embodiments described above.
Wherein the computer readable instructions comprise computer readable instruction code which may be in source code form, object code form, an executable file or some intermediate form, and the like. The computer-readable medium may include: any entity or device capable of carrying said computer readable instruction code, recording medium, U-disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM).
A blockchain is a novel application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanisms and encryption algorithms. A blockchain is essentially a decentralized database: a series of data blocks linked to one another by cryptographic methods, in which each data block contains the information of a batch of network transactions and is used to verify the validity of that information (anti-counterfeiting) and to generate the next block. A blockchain may include a blockchain underlying platform, a platform product service layer, an application service layer, and the like.
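As an illustration of how data blocks can be linked by a cryptographic method, the following minimal sketch chains blocks by storing each predecessor's SHA-256 digest. The block layout, the function names (`block_hash`, `append_block`, `verify_chain`) and the all-zero genesis hash are assumptions made for this example; the present application does not specify a block format or a consensus mechanism.

```python
import hashlib
import json

GENESIS_PREV_HASH = "0" * 64  # assumed placeholder for the first block


def block_hash(index, transactions, prev_hash):
    """Digest of a block's content, including the previous block's hash."""
    payload = json.dumps(
        {"index": index, "transactions": transactions, "prev_hash": prev_hash},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_block(chain, transactions):
    """Append a block holding one batch of transactions to the chain."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS_PREV_HASH
    index = len(chain)
    block = {
        "index": index,
        "transactions": transactions,
        "prev_hash": prev_hash,
        "hash": block_hash(index, transactions, prev_hash),
    }
    chain.append(block)
    return block


def verify_chain(chain):
    """Return True only if every block is intact and linked to its predecessor."""
    prev_hash = GENESIS_PREV_HASH
    for i, block in enumerate(chain):
        if block["index"] != i or block["prev_hash"] != prev_hash:
            return False
        expected = block_hash(block["index"], block["transactions"], block["prev_hash"])
        if block["hash"] != expected:
            return False
        prev_hash = block["hash"]
    return True
```

Because each digest covers the previous block's digest, altering the transactions of any earlier block causes `verify_chain` to fail from that block onward.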
With reference to fig. 1, the memory 12 of the electronic device 1 stores computer-readable instructions for implementing a system availability assessment method, and the processor 13 executes the computer-readable instructions to implement:
identifying a system to be evaluated according to the received system availability evaluation request;
detecting a network service line of the system to be evaluated from a system log of the system to be evaluated;
generating the network availability of the system to be evaluated according to the network operation information of the network service line;
detecting an operating server of the system to be evaluated from the system log;
generating the server availability of the system to be evaluated according to the server information of the operating server;
generating storage availability according to the storage information of the system to be evaluated, and generating database availability according to the database information of the system to be evaluated;
evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated.
For the specific way in which the processor 13 implements the computer-readable instructions, refer to the description of the relevant steps in the embodiment corresponding to fig. 1; the details are not repeated here.
In the embodiments provided in the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative: the division of the modules is only one logical functional division, and other divisions are possible in actual implementation.
The computer readable storage medium has computer readable instructions stored thereon, wherein the computer readable instructions when executed by the processor 13 are configured to implement the steps of:
identifying a system to be evaluated according to the received system availability evaluation request;
detecting a network service line of the system to be evaluated from a system log of the system to be evaluated;
generating the network availability of the system to be evaluated according to the network operation information of the network service line;
detecting an operating server of the system to be evaluated from the system log;
generating the server availability of the system to be evaluated according to the server information of the operating server;
generating storage availability according to the storage information of the system to be evaluated, and generating database availability according to the database information of the system to be evaluated;
evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated.
The modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.
In addition, functional modules in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional module.
The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference signs in the claims shall not be construed as limiting the claim concerned.
Furthermore, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. The plurality of units or devices may also be implemented by one unit or device through software or hardware. The terms first, second, etc. are used to denote names, but not any particular order.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the same, and although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions can be made to the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention.

Claims (10)

1. A system availability evaluation method, the system availability evaluation method comprising:
identifying a system to be evaluated according to the received system availability evaluation request;
detecting a network service line of the system to be evaluated from a system log of the system to be evaluated;
generating the network availability of the system to be evaluated according to the network operation information of the network service line;
detecting an operating server of the system to be evaluated from the system log;
generating the server availability of the system to be evaluated according to the server information of the operating server;
generating storage availability according to the storage information of the system to be evaluated, and generating database availability according to the database information of the system to be evaluated;
evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated.
2. The system availability evaluation method of claim 1, wherein the detecting the network service line of the system under evaluation from the system log of the system under evaluation comprises:
extracting response information received by the system to be evaluated from the system log;
extracting a node address from the response information;
determining a node corresponding to a node address successfully matched with a network address in a preset local area network as a service node;
and determining a line connected with the system to be evaluated and the service node as the network service line.
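The matching step of claim 2 can be sketched as follows. This is only an illustration under stated assumptions: the response information is taken to be already parsed from the system log into dictionaries with a hypothetical `node_address` key, and "matching a network address in a preset local area network" is interpreted as membership in one of a list of CIDR ranges. The function names are invented for the example.

```python
import ipaddress


def find_service_nodes(response_records, lan_cidrs):
    """Return node addresses that fall inside any preset LAN range.

    response_records: dicts parsed from the system log (hypothetical format).
    lan_cidrs: CIDR strings describing the preset local area network.
    """
    networks = [ipaddress.ip_network(cidr) for cidr in lan_cidrs]
    service_nodes = []
    for record in response_records:
        addr_text = record.get("node_address")
        try:
            addr = ipaddress.ip_address(addr_text)
        except ValueError:
            continue  # skip records without a usable address
        if any(addr in net for net in networks) and addr_text not in service_nodes:
            service_nodes.append(addr_text)
    return service_nodes


def network_service_lines(system_id, service_nodes):
    """Model each network service line as a (system, service node) pair."""
    return [(system_id, node) for node in service_nodes]
```

Addresses outside the preset ranges, and records with missing or malformed addresses, are simply ignored, so only nodes inside the local area network become service nodes.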
3. The method as claimed in claim 2, wherein the network operation information includes network maintenance times of the network service lines, line redundancy information of dedicated lines in the network service lines, life cycles of the service nodes, emergency plan exercise information of the network service lines, and network monitoring coverage, and the generating network availability of the system to be evaluated according to the network operation information of the network service lines includes:
generating the line redundancy information based on the line request of each network service line and a preset line request threshold value;
counting the service duration of the service node according to the life cycle;
generating a drilling coefficient according to the emergency plan drilling information and preset drilling target information;
calculating the network monitoring coverage rate based on the total line area of the network service line and the monitoring area corresponding to the monitoring equipment of the network service line;
and calculating the network availability according to the network maintenance times, the line redundancy information, the service duration, the drilling coefficient and the network monitoring coverage rate.
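Three of the intermediate quantities in claim 3 can be sketched as simple ratios and differences. The sketch assumes that the life cycle supplies an in-service start date, that the drilling coefficient is the share of target drills actually completed, and that both ratios are capped at 1.0; none of these details is fixed by the claim. How the quantities are finally combined into the network availability is not stated here, so that step is deliberately omitted.

```python
from datetime import date


def monitoring_coverage(total_line_area, monitored_area):
    """Share of the total line area covered by monitoring equipment (capped at 1.0)."""
    if total_line_area <= 0:
        raise ValueError("total_line_area must be positive")
    return min(monitored_area / total_line_area, 1.0)


def service_duration_days(in_service_since, evaluated_on):
    """Days a service node has been in service (assumed reading of 'life cycle')."""
    return (evaluated_on - in_service_since).days


def drill_coefficient(completed_drills, target_drills):
    """Completed emergency-plan drills relative to the preset target (capped at 1.0)."""
    if target_drills <= 0:
        raise ValueError("target_drills must be positive")
    return min(completed_drills / target_drills, 1.0)
```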
4. The system availability evaluation method of claim 3, wherein the generating the line redundancy information based on the line request and a preset line request threshold for each network service line comprises:
identifying a preset time period with the maximum request total amount as a peak time period according to the historical request amount of the preset local area network at a plurality of historical time points;
collecting the requests of each network service line during the peak time period as the line request;
identifying the dedicated line from among the plurality of network service lines based on an outgoing address and an incoming address in the line request, and determining the line request corresponding to the dedicated line as a target request;
and generating the line redundancy information based on the request quantity of the target requests and the preset line request threshold value.
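The steps of claim 4 can be sketched as below. The sketch assumes that the candidate time periods are given as half-open `(start, end)` intervals, that a dedicated line is recognised by looking up its `(src, dst)` address pair in a known set, and that redundancy is expressed as remaining headroom below the request threshold. The field names and the headroom measure are hypothetical and are not defined by the claim.

```python
def peak_period(history, periods):
    """Pick the preset period with the largest total historical request amount.

    history: list of (time_point, request_count) pairs.
    periods: list of (start, end) half-open intervals on the same time scale.
    """
    if not periods:
        raise ValueError("at least one preset period is required")

    def total(period):
        start, end = period
        return sum(count for ts, count in history if start <= ts < end)

    return max(periods, key=total)


def line_redundancy(line_requests, dedicated_endpoints, request_threshold):
    """Summarise dedicated-line load against the preset line request threshold."""
    if request_threshold <= 0:
        raise ValueError("request_threshold must be positive")
    # Target requests: those whose address pair identifies a dedicated line.
    target = [r for r in line_requests if (r["src"], r["dst"]) in dedicated_endpoints]
    count = len(target)
    return {
        "target_request_count": count,
        "has_headroom": count < request_threshold,
        "headroom_ratio": max(0.0, 1 - count / request_threshold),
    }
```

Here `line_requests` is expected to contain only the requests collected during the period returned by `peak_period`.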
5. The system availability evaluation method according to claim 1, wherein the server information includes virtualization environment type of the running server, power redundancy information, high-availability deployment information, and server monitoring coverage, and the generating the server availability of the system to be evaluated according to the server information of the running server comprises:
identifying the power supply redundancy information according to the power supply configuration of the running server;
identifying the high-availability deployment information based on the cluster in which the running server is located;
calculating the server monitoring coverage rate based on the monitoring number of the running servers and the number of the running servers;
quantifying the server availability based on the virtualization environment type, the power redundancy information, the high-availability deployment information, and the server monitoring coverage.
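The identification and counting steps of claim 5 can be sketched as follows. The server record fields (`monitored`, `power_supplies`, `cluster`) and the rules "two or more power supplies means redundant" and "two or more cluster members means highly available" are assumptions made for illustration; the claim does not define how these properties are identified or how the final score is quantified.

```python
def server_monitoring_coverage(servers):
    """Monitored running servers divided by all running servers."""
    if not servers:
        raise ValueError("no running servers supplied")
    monitored = sum(1 for s in servers if s["monitored"])
    return monitored / len(servers)


def has_power_redundancy(server):
    """Assumed rule: at least two power supplies counts as redundant."""
    return server["power_supplies"] >= 2


def is_highly_available(server, clusters):
    """Assumed rule: the server's cluster has at least two members."""
    return len(clusters.get(server["cluster"], ())) >= 2
```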
6. The method as claimed in claim 1, wherein the storage information includes system availability duration, storage monitoring coverage and storage drilling result of the system under evaluation, and the generating storage availability according to the storage information of the system under evaluation comprises:
acquiring system occupation capacity of the system to be evaluated at a plurality of preset moments;
plotting a system capacity trend based on the system occupancy capacity;
predicting the moment when the system occupied capacity is a capacity threshold value based on the trend slope of the system capacity trend as a prediction moment;
calculating the difference between the prediction moment and the latest of the preset moments as the system availability duration;
calculating the storage monitoring coverage rate based on the monitoring number of the storage paths in the system to be evaluated and the path number of the storage paths;
identifying a storage drilling result based on the storage abnormal drilling information of the system to be evaluated;
and quantifying the storage availability based on the system availability duration, the storage monitoring coverage rate and the storage drilling result.
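The capacity prediction in claim 6 can be sketched with an ordinary least-squares line fitted to the sampled occupancy. Using least squares to obtain the trend slope is an assumption, since the claim only refers to a "trend slope"; times are treated as plain numbers (for example, days), and the function names are hypothetical.

```python
def fit_line(times, capacities):
    """Least-squares slope and intercept of capacity against time."""
    n = len(times)
    if n < 2 or n != len(capacities):
        raise ValueError("need at least two paired samples")
    mean_t = sum(times) / n
    mean_c = sum(capacities) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("sample times must not all be equal")
    sxy = sum((t - mean_t) * (c - mean_c) for t, c in zip(times, capacities))
    slope = sxy / sxx
    return slope, mean_c - slope * mean_t


def system_availability_duration(times, capacities, capacity_threshold):
    """Time from the latest sample until the trend reaches the capacity threshold.

    Returns None when the occupancy is not growing, because the trend then
    never reaches the threshold.
    """
    slope, intercept = fit_line(times, capacities)
    if slope <= 0:
        return None
    prediction_moment = (capacity_threshold - intercept) / slope
    return prediction_moment - max(times)


def storage_monitoring_coverage(monitored_paths, total_paths):
    """Monitored storage paths divided by all storage paths."""
    if total_paths <= 0:
        raise ValueError("total_paths must be positive")
    return monitored_paths / total_paths
```

A negative duration indicates that the fitted trend had already crossed the threshold before the latest sample.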
7. The system availability evaluation method according to claim 1, wherein the middleware information includes upgrade migration information, middleware monitoring coverage and middleware emergency drilling information of middleware in the system to be evaluated, the application information includes a fusing degradation mechanism of an application in the system to be evaluated, fault repair information of the application, application drilling information and application monitoring coverage of the application, the fault information includes an actual availability ratio, a number of faults, a fault reason of the application and a corresponding processing scheme of the system to be evaluated, and the evaluating the system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, the middleware availability generated by the middleware information of the system to be evaluated, the application availability generated by the application information of the system to be evaluated, and the fault analysis degree generated by the fault information of the system to be evaluated comprises:
quantifying the middleware availability based on the upgrade migration information, the middleware monitoring coverage and the middleware emergency drilling information;
quantifying the application availability based on the fusing degradation mechanism, the fault repair information, the application drilling information, and the application monitoring coverage;
calculating the fault analysis degree based on the actual availability ratio, the number of faults, the fault reason and the corresponding processing scheme;
and carrying out weighted sum operation on the network availability, the server availability, the storage availability, the database availability, the middleware availability, the application availability and the fault analysis degree to obtain the system availability.
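The weighted sum operation of claim 7 can be sketched as follows. The component keys and any particular weight values are hypothetical; the claim states only that the seven quantities are combined by a weighted sum and does not give the weights.

```python
COMPONENTS = (
    "network",
    "server",
    "storage",
    "database",
    "middleware",
    "application",
    "fault_analysis",
)


def system_availability(scores, weights):
    """Weighted sum of the seven component scores.

    scores, weights: dicts keyed by the names in COMPONENTS.
    """
    missing = [c for c in COMPONENTS if c not in scores or c not in weights]
    if missing:
        raise ValueError(f"missing components: {missing}")
    return sum(weights[c] * scores[c] for c in COMPONENTS)
```

If the weights are chosen to sum to 1, the result stays on the same scale as the component scores, which makes the overall value easier to interpret.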
8. A system availability evaluation apparatus, the system availability evaluation apparatus comprising:
the identification unit is used for identifying the system to be evaluated according to the received system availability evaluation request;
the detection unit is used for detecting the network service line of the system to be evaluated from the system log of the system to be evaluated;
the generating unit is used for generating the network availability of the system to be evaluated according to the network operation information of the network service line;
the detection unit is further used for detecting the running server of the system to be evaluated from the system log;
the generating unit is further used for generating the server availability of the system to be evaluated according to the server information of the operating server;
the generating unit is further used for generating storage availability according to the storage information of the system to be evaluated and generating database availability according to the database information of the system to be evaluated;
an evaluation unit, configured to evaluate system availability of the system to be evaluated based on the network availability, the server availability, the storage availability, the database availability, middleware availability generated by middleware information of the system to be evaluated, application availability generated by application information of the system to be evaluated, and a failure analysis degree generated by failure information of the system to be evaluated.
9. An electronic device, characterized in that the electronic device comprises:
a memory storing computer readable instructions; and
a processor executing computer readable instructions stored in the memory to implement the system availability assessment method of any one of claims 1 to 7.
10. A computer-readable storage medium characterized by: the computer-readable storage medium stores therein computer-readable instructions which are executed by a processor in an electronic device to implement the system availability evaluation method of any one of claims 1 to 7.
CN202210783526.6A 2022-06-28 2022-06-28 System availability evaluation method, device, equipment and storage medium Pending CN114996102A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210783526.6A CN114996102A (en) 2022-06-28 2022-06-28 System availability evaluation method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114996102A true CN114996102A (en) 2022-09-02

Family

ID=83019162

Country Status (1)

Country Link
CN (1) CN114996102A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination