CN113806171A - Server health assessment method, system, equipment and medium - Google Patents
Server health assessment method, system, equipment and medium Download PDFInfo
- Publication number
- CN113806171A CN113806171A CN202111065323.5A CN202111065323A CN113806171A CN 113806171 A CN113806171 A CN 113806171A CN 202111065323 A CN202111065323 A CN 202111065323A CN 113806171 A CN113806171 A CN 113806171A
- Authority
- CN
- China
- Prior art keywords
- server
- data
- evaluation
- acquiring
- evaluation parameters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000036541 health Effects 0.000 title claims abstract description 110
- 238000000034 method Methods 0.000 title claims abstract description 23
- 238000011156 evaluation Methods 0.000 claims abstract description 56
- 238000013210 evaluation model Methods 0.000 claims abstract description 47
- 238000012549 training Methods 0.000 claims description 34
- 238000012360 testing method Methods 0.000 claims description 30
- 238000004140 cleaning Methods 0.000 claims description 14
- 238000004590 computer program Methods 0.000 claims description 8
- 238000012544 monitoring process Methods 0.000 abstract description 18
- 238000004364 calculation method Methods 0.000 abstract description 6
- 238000002360 preparation method Methods 0.000 abstract description 4
- 230000002159 abnormal effect Effects 0.000 description 20
- 238000003066 decision tree Methods 0.000 description 20
- 230000003862 health status Effects 0.000 description 20
- 230000006870 function Effects 0.000 description 15
- 238000012423 maintenance Methods 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 8
- 238000013480 data collection Methods 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 7
- 238000001514 detection method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000011835 investigation Methods 0.000 description 4
- 230000002688 persistence Effects 0.000 description 4
- 230000002085 persistent effect Effects 0.000 description 4
- 238000001914 filtration Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/3006—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3058—Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Computer Hardware Design (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention discloses a server health assessment method, which comprises the following steps: acquiring a plurality of configured evaluation parameters; acquiring corresponding historical data according to the plurality of evaluation parameters; generating a corresponding evaluation model according to the corresponding historical data; acquiring current real-time data corresponding to the evaluation parameters of the server to be evaluated; and inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated. The invention also discloses a system, a computer device and a readable storage medium. According to the technical scheme provided by the invention, the health state calculation method suitable for a specific user scene is defined through the emphasis configuration of the server health state index items, the health state prediction is carried out by combining the performance monitoring characteristic index item data of the corresponding server, the preparation of the server health state in the specific scene can be effectively improved, and meanwhile, the server equipment with potential fault risk is screened and early warned.
Description
Technical Field
The invention relates to the field of servers, in particular to a server health assessment method, a system, equipment and a storage medium.
Background
With the development of information technology, the equipment scale of a data center is larger and larger, the operation and maintenance difficulty of the equipment is also larger and larger, when the server generates an alarm, the server needs to be checked and maintained aiming at the alarm, but for part of clients, under a specific scene, some alarms do not affect the use of the clients, and some slight alarms are concerned by the clients; in a common monitoring system, the alarms are only divided into different levels, so that whether the alarms are false alarms or not cannot be effectively distinguished, and meanwhile, the related performance monitoring indexes of the server are only used as display type data and are not effectively utilized. In an actual data center scene, there is no accurate definition of the health state of the server, so that whether the server equipment has hidden danger or not cannot be accurately judged.
Disclosure of Invention
In view of the above, in order to overcome at least one aspect of the above problems, an embodiment of the present invention provides a server health assessment method, including:
acquiring a plurality of configured evaluation parameters;
acquiring corresponding historical data according to the plurality of evaluation parameters;
generating a corresponding evaluation model according to the corresponding historical data;
acquiring current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding assessment model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a server health assessment system, including:
the configuration module is configured to acquire a plurality of configured evaluation parameters;
the first acquisition module is configured to acquire corresponding historical data according to a plurality of evaluation parameters;
a generation module configured to generate a corresponding evaluation model from the corresponding historical data;
the acquisition module is configured to acquire the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and the evaluation module is configured to input the real-time data into the evaluation model so as to evaluate the health degree of the server to be evaluated.
In some embodiments, the generation module is further configured to:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, the system further comprises a second obtaining module configured to obtain all the evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising a notification module configured to:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a computer apparatus, including:
at least one processor; and
a memory storing a computer program operable on the processor, wherein the processor executes the program to perform the steps of:
acquiring a plurality of configured evaluation parameters;
acquiring corresponding historical data according to the plurality of evaluation parameters;
generating a corresponding evaluation model according to the corresponding historical data;
acquiring current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding evaluation model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a computer-readable storage medium storing a computer program which, when executed by a processor, performs the steps of:
acquiring a plurality of configured evaluation parameters;
acquiring corresponding historical data according to the plurality of evaluation parameters;
generating a corresponding evaluation model according to the corresponding historical data;
acquiring current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding evaluation model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
The invention has one of the following beneficial technical effects: according to the scheme provided by the invention, the health state calculation method suitable for a specific user scene is defined through the emphasis configuration of the server health state index item, and the health state prediction is carried out by combining the performance monitoring characteristic index item data of the corresponding server, so that the preparation of the server health state in the specific scene can be effectively improved, and meanwhile, the server equipment with potential fault risk is screened and early warned.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other embodiments can be obtained by using the drawings without creative efforts.
FIG. 1 is a schematic flow chart illustrating a server health assessment method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a server health assessment apparatus according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a server health assessment system according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a computer device provided in an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used for distinguishing two entities with the same name but different names or different parameters, and it should be noted that "first" and "second" are merely for convenience of description and should not be construed as limitations of the embodiments of the present invention, and they are not described in any more detail in the following embodiments.
According to an aspect of the present invention, an embodiment of the present invention provides a server health assessment method, as shown in fig. 1, which may include the steps of:
s1, acquiring a plurality of configured evaluation parameters;
s2, acquiring corresponding historical data according to the plurality of evaluation parameters;
s3, generating a corresponding evaluation model according to the corresponding historical data;
s4, acquiring the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and S5, inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding evaluation model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
The technical scheme provided by the invention can be used for automatically configuring the health state characteristic index item of the server according to the requirements of a user on a specific scene by acquiring the data information of the current equipment performance monitoring and other relevant health state characteristics of the server, combining the current fault information, combining the historical performance monitoring information and the fault information, training and constructing a decision tree prediction model according to the historical sample data of the configured characteristic index item, predicting the health state of the current server equipment through the established decision tree prediction model, marking the equipment with abnormal predicted health state, identifying the equipment with abnormal health state, reminding operation and maintenance personnel of which equipment has fault risk, and carrying out early detection, investigation and maintenance on the equipment with the fault risk, thereby reducing the equipment fault rate.
It should be particularly noted that, the steps in the embodiments of the server health assessment method described above may be mutually intersected, replaced, added, or deleted, and therefore, these reasonable permutation and combination transformations should also belong to the scope of the present invention, and should not limit the scope of the present invention to the embodiments.
In some embodiments, as shown in fig. 2, the server health assessment method provided by the present invention may be implemented by a data collection module, a health status configuration module, a decision tree model generation module, a health status analysis module, a marker early warning module, and a feature storage module.
In some embodiments, the feature data collection module includes a data collection and data cleansing function, the data collection is used to collect feature quantities related to the health status of the server, the server health status feature quantities are feature quantities corresponding to node types that can be used as a server health status prediction model based on a decision tree algorithm, and include, but are not limited to, the following performance, monitoring, and alarm data: CPU temperature, CPU utilization rate, memory utilization rate, fan rotation speed, power supply real-time power, hard disk IOPS, network card transceiving rate, voltage, current, Trap alarm and the like. The data cleaning is used for cleaning a large amount of characteristic data and filtering out some abnormal data. The acquisition module is used for acquiring health state characteristic data such as server performance monitoring and the like, and the characteristic storage module is used for storing the server performance data.
Therefore, the characteristic data acquisition module provides a data acquisition function to acquire the characteristic quantity related to the health state of the server, and the characteristic quantity of the health state of the server is the characteristic quantity corresponding to the node type of the server health state prediction model based on the decision tree algorithm.
In some embodiments, the feature storage module may be configured to store feature quantities of server health phases and may provide an efficient feature data query service. The characteristic storage module is a device for information reserve persistence, can be understood as a program with a local cache and a database capable of persisting, or a service with the function, the cache layer can provide efficient query for characteristic data query, and the persistence layer persists the characteristic data and the prediction result.
In some embodiments, the health status configuration module classifies and counts the currently collected data index items and provides the user with a focus on custom configuration index items to adjust the health status calculation. The method includes the steps that currently acquired index items are classified and counted through a feature storage module, the fault rate of each index item when the index items are abnormal is counted, reference is provided for configuration, meanwhile, a health state index item configuration function is provided, and support is provided for a decision tree prediction generation module through matching of feature index items and index item weights influencing the health state of a current specific scene.
The health state configuration module is a health state index management module and can perform classified statistics on health state characteristic value data stored by the characteristic storage module, calculate the fault rate when each index item is abnormal and provide reference for configuration; and meanwhile, an index item configuration function influencing the health state is provided, so that the emphasis adjustment of the index item is performed according to certain specific scenes.
In some embodiments, the decision tree model generation module, in combination with the configured health status indicator, establishes a server health status prediction model based on a decision tree algorithm using historically collected corresponding health status feature indicator data.
In some embodiments, the health state analysis module may call the health state prediction model to obtain a server health state prediction result according to the acquired data as input data of the prediction model, deliver the health state prediction result to the storage module for persistent storage, and input the prediction result to the mark early warning module for subsequent operations.
In some embodiments, the mark early warning module includes an abnormal display function and an early warning notification function, receives the prediction analysis result data of the health state analysis module, is used for performing differential display on the server in the abnormal health state to distinguish server devices in different health states, and can notify and early warn operation and maintenance staff for abnormal information by configuring a notification template.
According to the scheme provided by the embodiment of the invention, the health state calculation method suitable for a specific user scene is defined through the weighted configuration of the server health state index items, the health state prediction is carried out by combining the performance monitoring characteristic index item data of the corresponding server, the preparation of the server health state in the specific scene can be effectively improved, and meanwhile, the server equipment with potential fault risk is screened and early warned.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a server health assessment system 400, as shown in fig. 3, including:
a configuration module 401 configured to obtain a plurality of configured evaluation parameters;
a first obtaining module 402 configured to obtain corresponding historical data according to a plurality of evaluation parameters;
a generating module 403 configured to generate a corresponding evaluation model according to the corresponding historical data;
the acquisition module 404 is configured to acquire the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
an evaluation module 405 configured to input the real-time data into the evaluation model to evaluate the health of the server to be evaluated.
In some embodiments, the generation module 403 is further configured to:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, the system further comprises a second acquisition module configured to
Acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising a notification module configured to:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
The technical scheme provided by the invention can be used for automatically configuring the health state characteristic index item of the server according to the requirements of a user on a specific scene by acquiring the data information of the current equipment performance monitoring and other relevant health state characteristics of the server, combining the current fault information, combining the historical performance monitoring information and the fault information, training and constructing a decision tree prediction model according to the historical sample data of the configured characteristic index item, predicting the health state of the current server equipment through the established decision tree prediction model, marking the equipment with abnormal predicted health state, identifying the equipment with abnormal health state, reminding operation and maintenance personnel of which equipment has fault risk, and carrying out early detection, investigation and maintenance on the equipment with the fault risk, thereby reducing the equipment fault rate.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 4, an embodiment of the present invention further provides a computer apparatus 501, including:
at least one processor 520; and
a memory 510, the memory 510 storing a computer program 511 executable on the processor, the processor 520 executing the program to perform the steps of:
s1, acquiring a plurality of configured evaluation parameters;
s2, acquiring corresponding historical data according to the plurality of evaluation parameters;
s3, generating a corresponding evaluation model according to the corresponding historical data;
s4, acquiring the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and S5, inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding evaluation model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
The technical scheme provided by the invention can be used for automatically configuring the health state characteristic index item of the server according to the requirements of a user on a specific scene by acquiring the data information of the current equipment performance monitoring and other relevant health state characteristics of the server, combining the current fault information, combining the historical performance monitoring information and the fault information, training and constructing a decision tree prediction model according to the historical sample data of the configured characteristic index item, predicting the health state of the current server equipment through the established decision tree prediction model, marking the equipment with abnormal predicted health state, identifying the equipment with abnormal health state, reminding operation and maintenance personnel of which equipment has fault risk, and carrying out early detection, investigation and maintenance on the equipment with the fault risk, thereby reducing the equipment fault rate.
In some embodiments, as shown in fig. 2, the server health assessment method provided by the present invention may be implemented by a data collection module, a health status configuration module, a decision tree model generation module, a health status analysis module, a marker early warning module, and a feature storage module.
In some embodiments, the feature data collection module includes a data collection and data cleansing function, the data collection is used to collect feature quantities related to the health status of the server, the server health status feature quantities are feature quantities corresponding to node types that can be used as a server health status prediction model based on a decision tree algorithm, and include, but are not limited to, the following performance, monitoring, and alarm data: CPU temperature, CPU utilization rate, memory utilization rate, fan rotation speed, power supply real-time power, hard disk IOPS, network card transceiving rate, voltage, current, Trap alarm and the like. The data cleaning is used for cleaning a large amount of characteristic data and filtering out some abnormal data. The acquisition module is used for acquiring health state characteristic data such as server performance monitoring and the like, and the characteristic storage module is used for storing the server performance data.
Therefore, the characteristic data acquisition module provides a data acquisition function to acquire the characteristic quantity related to the health state of the server, and the characteristic quantity of the health state of the server is the characteristic quantity corresponding to the node type of the server health state prediction model based on the decision tree algorithm.
In some embodiments, the feature storage module may be configured to store feature quantities of server health phases and may provide an efficient feature data query service. The characteristic storage module is a device for information reserve persistence, can be understood as a program with a local cache and a database capable of persisting, or a service with the function, the cache layer can provide efficient query for characteristic data query, and the persistence layer persists the characteristic data and the prediction result.
In some embodiments, the health status configuration module classifies and counts the currently collected data index items and provides the user with a focus on custom configuration index items to adjust the health status calculation. The method includes the steps that currently acquired index items are classified and counted through a feature storage module, the fault rate of each index item when the index items are abnormal is counted, reference is provided for configuration, meanwhile, a health state index item configuration function is provided, and support is provided for a decision tree prediction generation module through matching of feature index items and index item weights influencing the health state of a current specific scene.
The health state configuration module is a health state index management module and can perform classified statistics on health state characteristic value data stored by the characteristic storage module, calculate the fault rate when each index item is abnormal and provide reference for configuration; and meanwhile, an index item configuration function influencing the health state is provided, so that the emphasis adjustment of the index item is performed according to certain specific scenes.
In some embodiments, the decision tree model generation module, in combination with the configured health status indicator, establishes a server health status prediction model based on a decision tree algorithm using historically collected corresponding health status feature indicator data.
In some embodiments, the health state analysis module may call the health state prediction model to obtain a server health state prediction result according to the acquired data as input data of the prediction model, deliver the health state prediction result to the storage module for persistent storage, and input the prediction result to the mark early warning module for subsequent operations.
In some embodiments, the mark early warning module includes an abnormal display function and an early warning notification function, receives the prediction analysis result data of the health state analysis module, is used for performing differential display on the server in the abnormal health state to distinguish server devices in different health states, and can notify and early warn operation and maintenance staff for abnormal information by configuring a notification template.
According to the scheme provided by the embodiment of the invention, the health state calculation method suitable for a specific user scene is defined through the weighted configuration of the server health state index items, the health state prediction is carried out by combining the performance monitoring characteristic index item data of the corresponding server, the preparation of the server health state in the specific scene can be effectively improved, and meanwhile, the server equipment with potential fault risk is screened and early warned.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 5, an embodiment of the present invention further provides a computer-readable storage medium 601, where the computer-readable storage medium 601 stores computer program instructions 610, and the computer program instructions 610, when executed by a processor, perform the following steps:
s1, acquiring a plurality of configured evaluation parameters;
s2, acquiring corresponding historical data according to the plurality of evaluation parameters;
s3, generating a corresponding evaluation model according to the corresponding historical data;
s4, acquiring the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and S5, inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
In some embodiments, generating a corresponding evaluation model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
In some embodiments, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
In some embodiments, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
The technical scheme provided by the invention can be used for automatically configuring the health state characteristic index item of the server according to the requirements of a user on a specific scene by acquiring the data information of the current equipment performance monitoring and other relevant health state characteristics of the server, combining the current fault information, combining the historical performance monitoring information and the fault information, training and constructing a decision tree prediction model according to the historical sample data of the configured characteristic index item, predicting the health state of the current server equipment through the established decision tree prediction model, marking the equipment with abnormal predicted health state, identifying the equipment with abnormal health state, reminding operation and maintenance personnel of which equipment has fault risk, and carrying out early detection, investigation and maintenance on the equipment with the fault risk, thereby reducing the equipment fault rate.
Finally, it should be noted that, as will be understood by those skilled in the art, all or part of the processes of the methods of the above embodiments may be implemented by a computer program, which may be stored in a computer-readable storage medium, and when executed, may include the processes of the embodiments of the methods described above.
Further, it should be appreciated that the computer-readable storage media (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments of the present invention.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, and the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, of embodiments of the invention is limited to these examples; within the idea of an embodiment of the invention, also technical features in the above embodiment or in different embodiments may be combined and there are many other variations of the different aspects of the embodiments of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.
Claims (10)
1. A server health assessment method is characterized by comprising the following steps:
acquiring a plurality of configured evaluation parameters;
acquiring corresponding historical data according to the plurality of evaluation parameters;
generating a corresponding evaluation model according to the corresponding historical data;
acquiring current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and inputting the real-time data into the evaluation model to evaluate the health degree of the server to be evaluated.
2. The method of claim 1, wherein generating a corresponding assessment model from the historical data further comprises:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
3. The method of claim 1, further comprising:
acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
4. The method of claim 1, further comprising:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
5. A server health assessment system, comprising:
the configuration module is configured to acquire a plurality of configured evaluation parameters;
the first acquisition module is configured to acquire corresponding historical data according to a plurality of evaluation parameters;
a generation module configured to generate a corresponding evaluation model from the corresponding historical data;
the acquisition module is configured to acquire the current real-time data corresponding to the evaluation parameters of the server to be evaluated;
and the evaluation module is configured to input the real-time data into the evaluation model so as to evaluate the health degree of the server to be evaluated.
6. The system of claim 5, wherein the generation module is further configured to:
dividing the historical data into a training set and a test set;
and training the evaluation model by using the training set and testing the evaluation model by using the testing set.
7. The system of claim 5, further comprising a second acquisition module configured to
Acquiring all evaluation parameters;
acquiring corresponding data according to all the evaluation parameters;
and cleaning the corresponding data and then storing the cleaned data as historical data.
8. The system of claim 5, further comprising a notification module configured to:
and responding to the situation that the health degree of the server to be evaluated is smaller than a threshold value, performing differential display and early warning through a preset way.
9. A computer device, comprising:
at least one processor; and
memory storing a computer program operable on the processor, characterized in that the processor executes the program to perform the steps of the method according to any of claims 1-4.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, is adapted to carry out the steps of the method according to any one of claims 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111065323.5A CN113806171A (en) | 2021-09-12 | 2021-09-12 | Server health assessment method, system, equipment and medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111065323.5A CN113806171A (en) | 2021-09-12 | 2021-09-12 | Server health assessment method, system, equipment and medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113806171A true CN113806171A (en) | 2021-12-17 |
Family
ID=78895085
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111065323.5A Withdrawn CN113806171A (en) | 2021-09-12 | 2021-09-12 | Server health assessment method, system, equipment and medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113806171A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114493116A (en) * | 2021-12-25 | 2022-05-13 | 南京移腾电力技术有限公司 | Power distribution network low-voltage circuit breaker state evaluation method based on cart algorithm |
CN115190039A (en) * | 2022-07-31 | 2022-10-14 | 苏州浪潮智能科技有限公司 | Equipment health evaluation method, system, equipment and storage medium |
CN116070963A (en) * | 2023-03-06 | 2023-05-05 | 华安证券股份有限公司 | Online customer service system health degree detection method based on big data |
WO2023221587A1 (en) * | 2022-05-16 | 2023-11-23 | 深圳市道通合创数字能源有限公司 | Method for determining state of health of power battery of electric vehicle, and server |
CN118190443A (en) * | 2024-02-28 | 2024-06-14 | 武汉万曦智能科技有限公司 | Comprehensive field vehicle detection system and detection method |
-
2021
- 2021-09-12 CN CN202111065323.5A patent/CN113806171A/en not_active Withdrawn
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114493116A (en) * | 2021-12-25 | 2022-05-13 | 南京移腾电力技术有限公司 | Power distribution network low-voltage circuit breaker state evaluation method based on cart algorithm |
WO2023221587A1 (en) * | 2022-05-16 | 2023-11-23 | 深圳市道通合创数字能源有限公司 | Method for determining state of health of power battery of electric vehicle, and server |
CN115190039A (en) * | 2022-07-31 | 2022-10-14 | 苏州浪潮智能科技有限公司 | Equipment health evaluation method, system, equipment and storage medium |
CN115190039B (en) * | 2022-07-31 | 2023-08-08 | 苏州浪潮智能科技有限公司 | Equipment health evaluation method, system, equipment and storage medium |
CN116070963A (en) * | 2023-03-06 | 2023-05-05 | 华安证券股份有限公司 | Online customer service system health degree detection method based on big data |
CN118190443A (en) * | 2024-02-28 | 2024-06-14 | 武汉万曦智能科技有限公司 | Comprehensive field vehicle detection system and detection method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113806171A (en) | Server health assessment method, system, equipment and medium | |
CN108206747B (en) | Alarm generation method and system | |
CN108683530B (en) | Data analysis method and device for multi-dimensional data and storage medium | |
KR20180108446A (en) | System and method for management of ict infra | |
CN113190423B (en) | Method, device and system for monitoring service data | |
CN104011719B (en) | The method and system that message is tracked and checked | |
CN106815125A (en) | A kind of log audit method and platform | |
EP2415209B1 (en) | Network analysis system | |
CN104539471B (en) | Bandwidth measures method, apparatus and computer equipment | |
CN113010374B (en) | Quantum device monitoring method and system based on monitoring platform | |
JP2015095060A (en) | Log analysis device and method | |
CN114238033B (en) | Board card running state early warning method, device, equipment and readable storage medium | |
CN111078513A (en) | Log processing method, device, equipment, storage medium and log alarm system | |
CN110460608B (en) | Situation awareness method and system including correlation analysis | |
CN115454778A (en) | Intelligent monitoring system for abnormal time sequence indexes in large-scale cloud network environment | |
CN114610561A (en) | System monitoring method, device, electronic equipment and computer readable storage medium | |
CN114328107A (en) | Monitoring method and system for optomagnetic fusion storage server cluster and electronic equipment | |
CN108039971A (en) | A kind of alarm method and device | |
CN110991241B (en) | Abnormality recognition method, apparatus, and computer-readable medium | |
CN117251751A (en) | Machine room monitoring method and device, electronic equipment and storage medium | |
CN113992496B (en) | Abnormal alarm method and device based on quartile algorithm and computing equipment | |
CN113626284A (en) | Health management method, system, equipment and medium of management platform | |
CN114443407A (en) | Detection method and system of server, electronic equipment and storage medium | |
CN113407764A (en) | Audio and video equipment state graphical display equipment and method based on physical position | |
CN113835961A (en) | Alarm information monitoring method, device, server and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20211217 |
|
WW01 | Invention patent application withdrawn after publication |