CN117539727A - Computer running state monitoring method and system - Google Patents

Computer running state monitoring method and system Download PDF

Info

Publication number
CN117539727A
CN117539727A CN202410032132.6A CN202410032132A CN117539727A CN 117539727 A CN117539727 A CN 117539727A CN 202410032132 A CN202410032132 A CN 202410032132A CN 117539727 A CN117539727 A CN 117539727A
Authority
CN
China
Prior art keywords
value
computer
network
detection
program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410032132.6A
Other languages
Chinese (zh)
Inventor
丘永健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Nettime Cloud Computing Co ltd
Original Assignee
Shenzhen Nettime Cloud Computing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Nettime Cloud Computing Co ltd filed Critical Shenzhen Nettime Cloud Computing Co ltd
Priority to CN202410032132.6A priority Critical patent/CN117539727A/en
Publication of CN117539727A publication Critical patent/CN117539727A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2205Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/26Functional testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations

Abstract

The invention belongs to the technical field of computer operation supervision, in particular to a computer operation state monitoring method and a computer operation state monitoring system, wherein the computer operation state monitoring system comprises a monitoring platform, a hardware state detection module, a software state detection module, a network detection evaluation module and a computer management end; according to the invention, the hardware state detection module is used for carrying out hardware state evaluation analysis on the computer, the software state detection module is used for detecting and analyzing an application program operated in the computer, the network detection evaluation module is used for carrying out evaluation analysis on the quality of a network connected with the computer, and when a hardware state abnormal signal, a software state abnormal signal or a network quality unqualified signal is generated, the computer management end is enabled to send out early warning, so that the hardware state, the software state and the network state of the computer can be reasonably analyzed and accurately evaluated, the full-scale monitoring of the computer and the accurate judgment of the operation condition of the computer are realized, and the safe, stable and efficient operation of the computer is effectively ensured.

Description

Computer running state monitoring method and system
Technical Field
The invention relates to the technical field of computer operation supervision, in particular to a computer operation state monitoring method and a monitoring system.
Background
The computer is commonly called a computer, is an electronic computing machine for high-speed computing, can perform numerical computation, can perform logic computation, has a memory function, is modern intelligent electronic equipment capable of automatically and high-speed processing mass data according to program operation, and consists of a hardware system and a software system, and can be divided into a super computer, an industrial control computer, a network computer, a personal computer, an embedded computer and the like according to different scales and performances;
at present, when the running state of a computer is monitored, the hardware state, the software state and the network state of the computer cannot be reasonably analyzed and accurately evaluated, the running state of the computer is difficult to be monitored in all aspects and accurately judged, and a manager cannot timely make targeted improvement measures, so that the safe, stable and efficient running of the computer is not guaranteed;
in view of the above technical drawbacks, a solution is now proposed.
Disclosure of Invention
The invention aims to provide a computer running state monitoring method and a computer running state monitoring system, which solve the problems that the prior art cannot reasonably analyze and accurately evaluate the hardware state, the software state and the network state of a computer, is difficult to realize the full-scale monitoring of the computer and accurately judge the running state of the computer, and is not beneficial to ensuring the safe, stable and efficient running of the computer.
In order to achieve the above purpose, the present invention provides the following technical solutions:
a computer running state monitoring system comprises a monitoring platform, a hardware state detection module, a software state detection module, a network detection evaluation module and a computer management end; the hardware state detection module carries out hardware state evaluation analysis on the computer, generates a hardware state abnormal signal or a hardware state normal signal through analysis, and sends the hardware state abnormal signal to the computer management end through the monitoring platform; the software state detection module detects and analyzes an application program running in the computer, generates a software state abnormal signal or a software state normal signal through analysis, and sends the software state abnormal signal to the computer management end through the monitoring platform;
the network detection evaluation module acquires a network connected in the running process of the computer and marks the network as a target network, the running quality of the target network is evaluated and analyzed, a network quality qualified signal or a network quality unqualified signal of the target network is generated through analysis, and the network quality unqualified signal of the target network is sent to the computer management end through the monitoring platform; and the computer management end sends out corresponding early warning when receiving a hardware state abnormal signal, a software state abnormal signal or a network quality unqualified signal.
Further, the specific analysis process of the hardware state evaluation analysis includes:
acquiring the CPU utilization rate, the memory utilization rate and the disk space occupancy rate of the computer, respectively comparing the CPU utilization rate, the memory utilization rate and the disk space occupancy rate with corresponding preset thresholds, and generating a hardware state abnormal signal if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate exceeds the corresponding preset thresholds;
if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate do not exceed the corresponding preset threshold value, receiving a hardware auxiliary measurement judgment symbol FY-1 or FY-2 of the computer from the server, and if the FY-1 is received, generating a hardware state abnormal signal; if FY-2 is received, a hardware state normal signal is generated.
Further, the monitoring platform is in communication connection with the hardware auxiliary measuring module, the hardware auxiliary measuring module collects real-time temperatures of all key components in the computer, the real-time temperatures are compared with preset temperature thresholds of the corresponding key components in a numerical mode, and if the real-time temperatures exceed the corresponding preset temperature thresholds, the corresponding key components are marked as temperature risk components; if the temperature risk component exists in the computer, a hardware auxiliary measurement judgment symbol FY-1 is allocated to the computer;
if no temperature risk component exists in the computer, collecting real-time voltages of a plurality of detection time points of a power supply in the computer in unit time, carrying out average value calculation on all the real-time voltages, marking deviation values of calculation results compared with preset proper voltage standard values as voltage deviation table values, and carrying out variance calculation on all the real-time voltages to obtain voltage fluctuation values; respectively comparing the voltage deviation table value and the voltage fluctuation value with a preset voltage deviation table threshold value and a preset voltage fluctuation threshold value, and if the voltage deviation table value or the voltage fluctuation value exceeds the corresponding preset threshold value, distributing a hardware auxiliary measurement judgment symbol FY-1 to the computer;
if the voltage deviation table value and the voltage fluctuation value do not exceed the corresponding preset thresholds, collecting the total power consumption rate of the computer, carrying out numerical calculation on the total power consumption rate, the voltage deviation table value and the voltage fluctuation value to obtain a power detection value, carrying out numerical comparison on the power detection value and the preset power detection threshold, and if the power detection value exceeds the preset power detection threshold, distributing a hardware auxiliary detection judgment symbol FY-1 to the computer; and if the power detection value does not exceed the preset power detection threshold value, analyzing the running condition of the fan in the computer.
Further, a specific analysis process for analyzing the operation condition of the fan in the computer is as follows:
collecting the rotation speed of the fan, performing difference calculation on the rotation speed compared with a preset standard speed value, and taking an absolute value to obtain a fan rotation deflection value; collecting a noise decibel value and a vibration frequency amplitude value generated by the fan, and carrying out numerical calculation on the fan deflection value, the noise decibel value and the vibration frequency amplitude value to obtain a fan real detection value; comparing the fan actual detection value with a preset fan actual detection threshold value, and marking the corresponding fan actual detection value as a fan abnormal detection value if the fan actual detection value exceeds the preset fan actual detection threshold value;
marking the ratio of the number of the different detection values of the fans to the number of the actual detection values of the fans in unit time as a fan detection table value, and carrying out average value calculation on all the actual detection values of the fans in unit time to obtain a fan detection value; respectively comparing the fan detection table value and the fan detection analysis value with a preset fan detection table threshold value and a preset fan detection analysis threshold value, and if the fan detection table value or the fan detection analysis value exceeds the corresponding preset threshold value, distributing a hardware auxiliary detection judgment symbol FY-1 to the computer; if the fan detection table value and the fan detection value do not exceed the corresponding preset threshold values, a hardware auxiliary detection judgment symbol FY-2 is distributed to the computer.
Further, the specific operation process of the software state detection module comprises the following steps:
acquiring an application program in an operation state in unit time, and marking the corresponding application program as i, wherein i is a natural number which is greater than or equal to 1; judging whether a risk program or a damaged program exists or not through program running detection analysis, and generating a software state abnormal signal if the risk program or the damaged program exists; if the risk program and the damaged program do not exist, a software state normal signal is generated.
Further, the specific analysis process of the program operation detection analysis is as follows:
collecting data of CPU or memory resources occupied by the application program i, and judging that the application program i is in a suspicious running state if the data of the CPU or memory resources occupied by the application program i exceeds a corresponding data threshold value; respectively marking the total duration and the single maximum duration of the application program i in the running suspicious state in unit time as a program suspicious total time value and a program suspicious time amplitude, respectively comparing the program suspicious total time value and the program suspicious time amplitude of the application program i with corresponding preset program suspicious total time threshold and preset program suspicious time amplitude threshold in numerical values, and marking the application program i as a risk program if the program suspicious total time value or the program suspicious time amplitude exceeds the corresponding preset threshold;
if the program suspicious total time value and the program suspicious time amplitude value do not exceed the corresponding preset threshold values, identifying whether the application program i is crashed in unit time, acquiring the crash recovery time length of the application program i if the application program i is crashed, summing all the crash recovery time lengths of the application program i in unit time to obtain a program crash value, and marking the times of the application program i crashing in unit time as the program crash frequency value; and respectively comparing the program burst value and the program burst frequency value of the application program i with corresponding preset program burst time threshold values and preset program burst frequency threshold values, and marking the application program i as a damaged program if the program burst value or the program burst frequency value exceeds the corresponding preset threshold values.
Further, the specific operation process of the network detection and evaluation module comprises the following steps:
collecting the times of disconnection of a computer and a target network in unit time, marking the times as network disconnection frequency analysis values, marking the duration time of each network disconnection as network disconnection time condition values, and summing all the network disconnection time condition values in unit time to obtain the network disconnection time analysis values; respectively comparing the network disconnection frequency analysis value and the network disconnection time analysis value with a preset network disconnection frequency analysis threshold value and a preset network disconnection time analysis threshold value, and generating a network quality disqualification signal if the network disconnection frequency analysis value or the network disconnection time analysis value exceeds the corresponding preset threshold value;
if the network disconnection frequency analysis value and the network disconnection time analysis value do not exceed the corresponding preset threshold values, collecting network speeds of a plurality of detection periods in unit time, and carrying out average value calculation on the network speeds of all the detection periods to obtain a network speed measurement value; the network speed of the corresponding detection time period is compared with a preset network speed threshold value in a numerical mode, and if the network speed does not exceed the preset network speed threshold value, the corresponding detection time period is marked as a network low-speed time period;
marking the ratio of the number of the network low-speed time periods to the number of the detection time periods in unit time as a network low-speed detection value, carrying out numerical calculation on the network low-speed detection value and the network speed analysis value to obtain a network speed table value, carrying out numerical comparison on the network speed table value and a preset network speed table threshold value, and generating a network quality unqualified signal if the network speed table value exceeds the preset network speed table threshold value; and if the network speed table value does not exceed the preset network speed table threshold value, generating a network quality qualified signal.
Furthermore, the invention also provides a computer running state monitoring method, which comprises the following steps:
step one, carrying out hardware state evaluation analysis on a computer, and generating a hardware state abnormal signal or a hardware state normal signal through analysis;
detecting and analyzing an application program running in a computer, and generating a software state abnormal signal or a software state normal signal through analysis;
step three, acquiring a network connected in the running process of a computer, marking the network as a target network, evaluating and analyzing the running quality of the target network, and generating a network quality qualified signal or a network quality unqualified signal of the target network through analysis;
and step four, when generating a hardware state abnormal signal, a software state abnormal signal or a network quality unqualified signal, the computer management end sends out corresponding early warning.
Compared with the prior art, the invention has the beneficial effects that:
1. in the invention, the hardware auxiliary detection module is used for carrying out auxiliary detection on the hardware condition of the computer, so that not only the expansion analysis on the hardware condition of the computer is realized, but also information support is provided for the analysis process of the hardware state detection module, the hardware state evaluation analysis is carried out on the computer through the hardware state detection module, and the hardware state abnormal signal or the hardware state normal signal is generated through analysis, thereby realizing the effective monitoring on the hardware condition of the computer and reasonably judging the risk degree of the hardware state abnormal signal or the hardware state normal signal;
2. according to the invention, the software state detection module is used for detecting and analyzing the application programs running in the computer, so that the software programs running in the computer are effectively monitored and the abnormal conditions of the programs are reasonably judged, the network detection and evaluation module is used for acquiring the network connected with the computer in the running process and marking the network as a target network, and the running quality of the target network is evaluated and analyzed, so that the accurate evaluation of the quality of the network connected with the computer is realized, the hardware state, the software state and the network state of the computer can be reasonably analyzed and accurately evaluated, the overall monitoring of the computer and the accurate judgment of the running conditions of the computer are realized, the management personnel can not timely make targeted improvement measures, and the safe, stable and efficient running of the computer is ensured.
Drawings
For the convenience of those skilled in the art, the present invention will be further described with reference to the accompanying drawings;
FIG. 1 is a system block diagram of a first embodiment of the present invention;
FIG. 2 is a system block diagram of a second embodiment of the present invention;
fig. 3 is a flow chart of a method according to a third embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Embodiment one: as shown in fig. 1, the computer running state monitoring system provided by the invention comprises a monitoring platform, a hardware state detection module, a software state detection module, a network detection evaluation module and a computer management end, wherein the monitoring platform is in communication connection with the hardware state detection module, the software state detection module, the network detection evaluation module and the computer management end;
the hardware state detection module carries out hardware state evaluation analysis on the computer, generates a hardware state abnormal signal or a hardware state normal signal through analysis, and sends the hardware state abnormal signal to the computer management end through the monitoring platform, so that the effective monitoring of the hardware state of the computer is realized, the risk degree of the computer is reasonably judged, and information support is provided for judging the running state of the computer; and, the specific analysis procedure of the hardware state evaluation analysis is as follows:
collecting CPU utilization rate, memory utilization rate and disk space occupation rate of a computer, wherein the memory utilization rate refers to a data value of the degree of currently used memory of the computer, and the disk space occupation rate refers to a data value representing the percentage of the used storage space on a hard disk drive of the computer to the total storage space; the CPU utilization rate refers to the measurement of the number of commands and the utilization rate of the actual running of a central processing unit of a computer in a period of time, and when the CPU utilization rate is too high, the computer is easy to run slowly, get stuck or have other performance problems;
respectively carrying out numerical comparison on the CPU utilization rate, the memory utilization rate and the disk space occupancy rate with a preset CPU utilization rate threshold value, a preset memory utilization rate threshold value and a preset disk space occupancy rate threshold value, and if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate exceeds the corresponding preset threshold value, indicating that the hardware state of the computer is poor, generating a hardware state abnormal signal; if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate do not exceed the corresponding preset threshold value, receiving a hardware auxiliary measurement judgment symbol FY-1 or FY-2 of the computer from the server, and if the FY-1 is received, indicating that the hardware state of the computer is poor, generating a hardware state abnormal signal; if FY-2 is received, which indicates that the hardware state of the computer is good, a hardware state normal signal is generated.
The software state detection module detects and analyzes an application program running in the computer, generates a software state abnormal signal or a software state normal signal through analysis, and sends the software state abnormal signal to the computer management end through the monitoring platform, so that the software program running in the computer is effectively monitored, the abnormal state of the program is reasonably judged, and information support is provided for judging the running state of the computer; the specific operation process of the software state detection module is as follows:
acquiring an application program in an operation state in unit time, and marking the corresponding application program as i, wherein i is a natural number which is greater than or equal to 1; judging whether a risk program or a damaged program exists or not through program running detection analysis, and generating a software state abnormal signal if the risk program or the damaged program exists; if the risk program and the damaged program do not exist, a software state normal signal is generated.
Further, the specific analysis process of the program operation detection analysis is as follows: collecting the data of the CPU or memory resources occupied by the application program i, and judging that the application program i is in a suspicious running state if the data of the CPU or memory resources occupied by the application program i exceeds a corresponding data threshold value (namely, the application program i occupies a large amount of CPU or memory resources);
respectively marking the total duration and the single maximum duration of the application program i in the suspicious state of running in unit time as a program suspicious total time value and a program suspicious time amplitude value, respectively comparing the program suspicious total time value and the program suspicious time amplitude value of the application program i with corresponding preset program suspicious total time threshold values and preset program suspicious time amplitude threshold values, and marking the application program i as a risk program if the program suspicious total time value or the program suspicious time amplitude value exceeds the corresponding preset threshold values, which indicates that the risk brought by the application program i is larger and the adverse effect on the current running performance of the computer is larger;
if the program suspicious total time value and the program suspicious time amplitude value do not exceed the corresponding preset threshold values, identifying whether the application program i crashes in unit time, if so, acquiring the crash recovery time (namely, the interval time between the starting crash time and the recovery time), summing all the crash recovery time of the application program i in unit time to obtain a program crash value, and marking the number of times of crashing the application program i in unit time as the program crash frequency value;
the larger the values of the program burst value and the program burst value are, the worse the running performance of the application program i in unit time is, and the less smooth the running is; and respectively comparing the program burst value and the program burst frequency value of the application program i with corresponding preset program burst time threshold values and preset program burst frequency threshold values, and marking the application program i as a damaged program if the program burst value or the program burst frequency value exceeds the corresponding preset threshold values, which indicates that the running smoothness of the application program i in unit time is poor.
The network detection evaluation module acquires a network connected in the running process of the computer and marks the network as a target network, evaluates and analyzes the running quality of the target network, generates a network quality qualified signal or a network quality unqualified signal of the target network through analysis, and sends the network quality unqualified signal of the target network to a computer management end through a monitoring platform, thereby realizing accurate evaluation of the quality of the network connected with the computer and providing information support for judging the running state of the computer; the specific operation process of the network detection and evaluation module is as follows:
collecting the times of disconnection of a computer and a target network in unit time, marking the times as network disconnection frequency analysis values, marking the duration time of each network disconnection as network disconnection time condition values, and summing all the network disconnection time condition values in unit time to obtain the network disconnection time analysis values; respectively comparing the network disconnection frequency analysis value and the network disconnection time analysis value with a preset network disconnection frequency analysis threshold value and a preset network disconnection time analysis threshold value, and if the network disconnection frequency analysis value or the network disconnection time analysis value exceeds the corresponding preset threshold value, indicating that the worse the network connection condition of the computer is in unit time, the more unfavorable the smooth and smooth use of the computer is ensured, generating a network quality disqualification signal;
if the network disconnection frequency analysis value and the network disconnection time analysis value do not exceed the corresponding preset threshold values, collecting network speeds of a plurality of detection periods in unit time, and carrying out average value calculation on the network speeds of all the detection periods to obtain a network speed measurement value; the network speed of the corresponding detection period is compared with a preset network speed threshold value in a numerical mode, if the network speed does not exceed the preset network speed threshold value, the network speed of the target network of the corresponding detection period is indicated to be lower, and the corresponding detection period is marked as a network low-speed period; marking the ratio of the number of network low-speed time periods to the number of detection time periods in unit time as a network low-speed detection value; it should be noted that, the larger the value of the network speed detection value and the smaller the value of the network low speed detection value, the better the network speed performance condition of the target network in unit time is indicated;
calculating the network low-speed detection value WF and the network speed analysis value WK by a formula WB=a1 xWF/(a2 xWK+1) to obtain a network speed table value WB, wherein a1 and a2 are preset proportionality coefficients, and a1 is more than a2 and more than 0; comparing the network speed table value WB with a preset network speed table threshold value, and if the network speed table value WB exceeds the preset network speed table threshold value, indicating that the network speed of the target network in unit time is lower, generating a network quality disqualification signal; if the network speed table value WB does not exceed the preset network speed table threshold value, the network speed of the target network in unit time is higher, and a network quality qualified signal is generated.
The computer management end sends out corresponding early warning when receiving hardware state abnormal signals, software state abnormal signals or network quality unqualified signals, when receiving the early warning, corresponding supervision personnel timely conduct reason investigation and tracing, comprehensively inspect the computer according to the needs, pertinently make corresponding improvement measures, realize comprehensive monitoring and accurate feedback early warning of the computer, reduce the management difficulty of supervision personnel on the computer, thereby ensuring safe, stable and efficient operation of the computer and having high intelligent degree.
Embodiment two: as shown in fig. 2, the difference between the present embodiment and embodiment 1 is that the monitoring platform is in communication connection with the hardware auxiliary measurement module, and the hardware auxiliary measurement module collects real-time temperatures of all critical components in the computer, where the critical components refer to important components in the computer, such as a CPU, a graphics card, a hard disk, and the like; comparing the real-time temperature with a preset temperature threshold value of the corresponding key component, and marking the corresponding key component as a temperature risk component if the real-time temperature exceeds the corresponding preset temperature threshold value; if the temperature risk component exists in the computer, a hardware auxiliary measurement judgment symbol FY-1 is allocated to the computer;
if no temperature risk component exists in the computer, collecting real-time voltages of a plurality of detection time points of a power supply in the computer in unit time, carrying out average value calculation on all the real-time voltages, marking a deviation value of a calculation result compared with a preset proper voltage standard value as a voltage deviation table value, and carrying out variance calculation on all the real-time voltages to obtain a voltage fluctuation value; the larger the values of the voltage deviation table value and the voltage fluctuation value are, the worse the voltage performance of a power supply in the computer is, and the larger the damage to the computer is;
respectively comparing the voltage deviation table value and the voltage fluctuation value with a preset voltage deviation table threshold value and a preset voltage fluctuation threshold value, and if the voltage deviation table value or the voltage fluctuation value exceeds the corresponding preset threshold value, indicating that the voltage of a power supply in the computer is poor in performance, distributing a hardware auxiliary measurement judgment symbol FY-1 to the computer;
if the voltage deviation meter value and the voltage fluctuation value do not exceed the corresponding preset threshold values, collecting the total power consumption rate of the computer, wherein the total power consumption rate is a data value representing the rate of the electric energy consumed by the computer in unit time; carrying out numerical calculation on the total power consumption rate TK, the voltage deviation table value TY and the voltage fluctuation value TF through a formula TX=ew1+ew2+ew3+TF to obtain a power detection value TX, wherein, the values of the ew1, the ew2 and the ew3 are all larger than zero, and the preset proportionality coefficients are the values of the ew1, the ew2 and the ew 3; and, the larger the value of the power detection value TX, the worse the running power performance condition of the computer is indicated; comparing the power detection value TX with a preset power detection threshold value, and if the power detection value TX exceeds the preset power detection threshold value, indicating that the running power performance condition of the computer is poor, distributing a hardware auxiliary detection judgment symbol FY-1 to the computer;
if the power detection value TX does not exceed the preset power detection threshold value, analyzing the running condition of a fan in the computer; the method comprises the following steps: collecting the rotation speed of the fan, performing difference calculation on the rotation speed compared with a preset standard speed value, and taking an absolute value to obtain a fan rotation deflection value; the larger the value of the fan deflection value is, the more the rotating speed of the fan at the corresponding moment is not in accordance with the preset requirement; the noise decibel value and the vibration frequency amplitude value generated by the fan are collected, wherein the vibration frequency amplitude value is a data value representing the sum of the amplitude and the frequency of vibration generated in the rotation process of the fan;
carrying out numerical calculation on the fan deflection value RY, the noise decibel value RK and the vibration frequency amplitude value RP through a formula RX= (b1+b2+RK+b3) RP)/3 to obtain a fan real detection value RX; wherein b1, b2 and b3 are preset proportionality coefficients, and the values of b1, b2 and b3 are all larger than zero; and the larger the value of the fan real detection value RX is, the worse the running condition of the fan at the corresponding moment is, and the larger the probability of abnormality is; comparing the fan real detection value RX with a preset fan real detection threshold value, and marking the corresponding fan real detection value as a fan abnormal detection value if the fan real detection value RX exceeds the preset fan real detection threshold value;
marking the ratio of the number of the different detection values of the fans to the number of the actual detection values of the fans in unit time as a fan detection table value, and carrying out average value calculation on all the actual detection values of the fans in unit time to obtain a fan detection value; respectively comparing the fan detection table value and the fan detection analysis value with a preset fan detection table threshold value and a preset fan detection analysis threshold value, and if the fan detection table value or the fan detection analysis value exceeds the corresponding preset threshold value, indicating that the running condition of the fan in unit time is poor, distributing a hardware auxiliary detection judgment symbol FY-1 to a computer;
if the fan detection table value and the fan detection value do not exceed the corresponding preset threshold values, indicating that the running condition of the fan in unit time is good, assigning a hardware auxiliary detection judgment symbol FY-2 to the computer; and the hardware auxiliary measurement judgment symbol FY-1 or FY-2 is sent to the hardware state detection module through the monitoring platform, so that not only can the hardware condition of the computer be expanded and analyzed, but also information support can be provided for the analysis process of the hardware state detection module, so that the analysis process is more comprehensive, and the analysis result is more accurate.
Embodiment III: as shown in fig. 3, the difference between the present embodiment and embodiments 1 and 2 is that the method for monitoring the running state of a computer according to the present invention includes the following steps:
step one, carrying out hardware state evaluation analysis on a computer, and generating a hardware state abnormal signal or a hardware state normal signal through analysis;
detecting and analyzing an application program running in a computer, and generating a software state abnormal signal or a software state normal signal through analysis;
step three, acquiring a network connected in the running process of a computer, marking the network as a target network, evaluating and analyzing the running quality of the target network, and generating a network quality qualified signal or a network quality unqualified signal of the target network through analysis;
and step four, when generating a hardware state abnormal signal, a software state abnormal signal or a network quality unqualified signal, the computer management end sends out corresponding early warning.
The working principle of the invention is as follows: when the method is used, the hardware state detection module is used for carrying out hardware state evaluation analysis on the computer, so that the effective monitoring of the hardware state of the computer is realized, the risk degree of the computer is reasonably judged, the software state detection module is used for carrying out detection analysis on an application program operated in the computer, the effective monitoring of the software program operated in the computer is realized, the abnormal state of the program is reasonably judged, the network connected in the operation process of the computer is obtained by the network detection evaluation module and is marked as a target network, the operation quality of the target network is evaluated and analyzed, the accurate evaluation of the quality of the network connected to the computer is realized, the computer management end sends out corresponding early warning when the hardware state abnormal signal, the software state abnormal signal or the network quality unqualified signal is generated, the hardware state, the software state and the network state of the computer can be reasonably analyzed and accurately evaluated, the overall monitoring of the computer is realized, the operation state of the computer is accurately judged, the manager cannot timely make targeted improvement measures, and the safe, stable and efficient operation of the computer is ensured.
The above formulas are all formulas with dimensions removed and numerical values calculated, the formulas are formulas with a large amount of data collected for software simulation to obtain the latest real situation, and preset parameters in the formulas are set by those skilled in the art according to the actual situation. The preferred embodiments of the invention disclosed above are intended only to assist in the explanation of the invention. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise form disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best understand and utilize the invention. The invention is limited only by the claims and the full scope and equivalents thereof.

Claims (10)

1. The computer running state monitoring system is characterized by comprising a monitoring platform, a hardware state detection module, a software state detection module, a network detection evaluation module and a computer management end; the hardware state detection module carries out hardware state evaluation analysis on the computer, generates a hardware state abnormal signal or a hardware state normal signal through analysis, and sends the hardware state abnormal signal to the computer management end through the monitoring platform; the software state detection module detects and analyzes an application program running in the computer, generates a software state abnormal signal or a software state normal signal through analysis, and sends the software state abnormal signal to the computer management end through the monitoring platform;
the network detection evaluation module acquires a network connected in the running process of the computer and marks the network as a target network, the running quality of the target network is evaluated and analyzed, a network quality qualified signal or a network quality unqualified signal of the target network is generated through analysis, and the network quality unqualified signal of the target network is sent to the computer management end through the monitoring platform; the computer management end sends out corresponding early warning when receiving a hardware state abnormal signal, a software state abnormal signal or a network quality unqualified signal;
the specific analysis process of the hardware state evaluation analysis comprises the following steps:
acquiring the CPU utilization rate, the memory utilization rate and the disk space occupancy rate of the computer, respectively comparing the CPU utilization rate, the memory utilization rate and the disk space occupancy rate with corresponding preset thresholds, and generating a hardware state abnormal signal if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate exceeds the corresponding preset thresholds; if the CPU utilization rate, the memory utilization rate or the disk space occupancy rate do not exceed the corresponding preset threshold value, receiving a hardware auxiliary measurement judgment symbol FY-1 or FY-2 of the computer from the server, and if the FY-1 is received, generating a hardware state abnormal signal; if FY-2 is received, a hardware state normal signal is generated.
2. The system for monitoring the running state of a computer according to claim 1, wherein the monitoring platform is in communication connection with the hardware auxiliary measuring module, the hardware auxiliary measuring module collects real-time temperatures of all key components in the computer, the real-time temperatures are compared with preset temperature thresholds of the corresponding key components in a numerical mode, and if the real-time temperatures exceed the corresponding preset temperature thresholds, the corresponding key components are marked as temperature risk components; if the temperature risk component exists in the computer, a hardware auxiliary measurement judgment symbol FY-1 is allocated to the temperature risk component.
3. The system for monitoring the running state of a computer according to claim 2, wherein if no temperature risk component exists in the computer, collecting real-time voltages of a plurality of detection time points of a power supply in the computer in unit time, performing average calculation on all the real-time voltages, marking a deviation value of a calculation result compared with a preset proper voltage standard value as a voltage deviation table value, and performing variance calculation on all the real-time voltages to obtain a voltage fluctuation value; and respectively comparing the voltage deviation table value and the voltage fluctuation value with a preset voltage deviation table threshold value and a preset voltage fluctuation threshold value, and if the voltage deviation table value or the voltage fluctuation value exceeds the corresponding preset threshold value, distributing a hardware auxiliary measurement judgment symbol FY-1 to the computer.
4. The system for monitoring the running state of a computer according to claim 3, wherein if the voltage deviation table value and the voltage fluctuation value do not exceed the corresponding preset thresholds, the total power consumption rate of the computer is collected, the total power consumption rate, the voltage deviation table value and the voltage fluctuation value are calculated to obtain a power detection value, the power detection value is compared with the preset power detection threshold, and if the power detection value exceeds the preset power detection threshold, a hardware auxiliary detection judgment symbol FY-1 is allocated to the computer; and if the power detection value does not exceed the preset power detection threshold value, analyzing the running condition of the fan in the computer.
5. The system of claim 4, wherein the specific analysis process for analyzing the operation condition of the fan in the computer is as follows:
collecting the rotation speed of the fan, performing difference calculation on the rotation speed compared with a preset standard speed value, and taking an absolute value to obtain a fan rotation deflection value; collecting a noise decibel value and a vibration frequency amplitude value generated by the fan, and carrying out numerical calculation on the fan deflection value, the noise decibel value and the vibration frequency amplitude value to obtain a fan real detection value; comparing the fan actual detection value with a preset fan actual detection threshold value, and marking the corresponding fan actual detection value as a fan abnormal detection value if the fan actual detection value exceeds the preset fan actual detection threshold value;
marking the ratio of the number of the different detection values of the fans to the number of the actual detection values of the fans in unit time as a fan detection table value, and carrying out average value calculation on all the actual detection values of the fans in unit time to obtain a fan detection value; respectively comparing the fan detection table value and the fan detection analysis value with a preset fan detection table threshold value and a preset fan detection analysis threshold value, and if the fan detection table value or the fan detection analysis value exceeds the corresponding preset threshold value, distributing a hardware auxiliary detection judgment symbol FY-1 to the computer; if the fan detection table value and the fan detection value do not exceed the corresponding preset threshold values, a hardware auxiliary detection judgment symbol FY-2 is distributed to the computer.
6. The system of claim 1, wherein the software state detection module comprises:
acquiring an application program in an operation state in unit time, and marking the corresponding application program as i, wherein i is a natural number which is greater than or equal to 1; judging whether a risk program or a damaged program exists or not through program running detection analysis, and generating a software state abnormal signal if the risk program or the damaged program exists; if the risk program and the damaged program do not exist, a software state normal signal is generated.
7. The computer running state monitoring system according to claim 6, wherein the specific analysis process of the program running detection analysis is as follows:
collecting data of CPU or memory resources occupied by the application program i, and judging that the application program i is in a suspicious running state if the data of the CPU or memory resources occupied by the application program i exceeds a corresponding data threshold value; respectively marking the total duration and the single maximum duration of the application program i in the running suspicious state in unit time as a program suspicious total time value and a program suspicious time amplitude, respectively comparing the program suspicious total time value and the program suspicious time amplitude of the application program i with corresponding preset program suspicious total time threshold and preset program suspicious time amplitude threshold in numerical values, and marking the application program i as a risk program if the program suspicious total time value or the program suspicious time amplitude exceeds the corresponding preset threshold;
if the program suspicious total time value and the program suspicious time amplitude value do not exceed the corresponding preset threshold values, identifying whether the application program i is crashed in unit time, acquiring the crash recovery time length of the application program i if the application program i is crashed, summing all the crash recovery time lengths of the application program i in unit time to obtain a program crash value, and marking the times of the application program i crashing in unit time as the program crash frequency value; and respectively comparing the program burst value and the program burst frequency value of the application program i with corresponding preset program burst time threshold values and preset program burst frequency threshold values, and marking the application program i as a damaged program if the program burst value or the program burst frequency value exceeds the corresponding preset threshold values.
8. The system of claim 1, wherein the network detection and assessment module comprises:
collecting the times of disconnection of a computer and a target network in unit time, marking the times as network disconnection frequency analysis values, marking the duration time of each network disconnection as network disconnection time condition values, and summing all the network disconnection time condition values in unit time to obtain the network disconnection time analysis values; and if the network disconnection frequency analysis value or the network disconnection time analysis value exceeds the corresponding preset threshold value, generating a network quality disqualification signal.
9. The system for monitoring the running state of a computer according to claim 8, wherein if the network disconnection frequency analysis value and the network disconnection time analysis value do not exceed the corresponding preset threshold values, network speeds of a plurality of detection periods in unit time are acquired, and the average value of the network speeds of all the detection periods is calculated to obtain a network speed measurement value; the network speed of the corresponding detection time period is compared with a preset network speed threshold value in a numerical mode, and if the network speed does not exceed the preset network speed threshold value, the corresponding detection time period is marked as a network low-speed time period;
marking the ratio of the number of the network low-speed time periods to the number of the detection time periods in unit time as a network low-speed detection value, carrying out numerical calculation on the network low-speed detection value and the network speed analysis value to obtain a network speed table value, and generating a network quality unqualified signal if the network speed table value exceeds a preset network speed table threshold value; and if the network speed table value does not exceed the preset network speed table threshold value, generating a network quality qualified signal.
10. A computer operating state monitoring method, characterized in that the computer operating state monitoring method employs the computer operating state monitoring system according to any one of claims 1 to 9.
CN202410032132.6A 2024-01-10 2024-01-10 Computer running state monitoring method and system Pending CN117539727A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410032132.6A CN117539727A (en) 2024-01-10 2024-01-10 Computer running state monitoring method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410032132.6A CN117539727A (en) 2024-01-10 2024-01-10 Computer running state monitoring method and system

Publications (1)

Publication Number Publication Date
CN117539727A true CN117539727A (en) 2024-02-09

Family

ID=89792323

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410032132.6A Pending CN117539727A (en) 2024-01-10 2024-01-10 Computer running state monitoring method and system

Country Status (1)

Country Link
CN (1) CN117539727A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6959265B1 (en) * 2003-10-07 2005-10-25 Serden Technologies, Inc. User-centric measurement of quality of service in a computer network
CN103117879A (en) * 2013-01-30 2013-05-22 昆明理工大学 Network monitoring system for computer hardware processing parameters
US20170235622A1 (en) * 2016-02-14 2017-08-17 Dell Products, Lp System and method to assess information handling system health and resource utilization
CN112564964A (en) * 2020-12-04 2021-03-26 中国石油大学(华东) Fault link detection and recovery method based on software defined network
CN114116380A (en) * 2021-11-12 2022-03-01 成都大学 Computer software fault monitoring system
CN116450463A (en) * 2023-04-27 2023-07-18 宁波云弧科技有限公司 Processing method for monitoring server hardware
CN117319047A (en) * 2023-10-09 2023-12-29 北京易财花科技有限公司 Network path analysis method and system based on network security anomaly detection

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6959265B1 (en) * 2003-10-07 2005-10-25 Serden Technologies, Inc. User-centric measurement of quality of service in a computer network
CN103117879A (en) * 2013-01-30 2013-05-22 昆明理工大学 Network monitoring system for computer hardware processing parameters
US20170235622A1 (en) * 2016-02-14 2017-08-17 Dell Products, Lp System and method to assess information handling system health and resource utilization
CN112564964A (en) * 2020-12-04 2021-03-26 中国石油大学(华东) Fault link detection and recovery method based on software defined network
CN114116380A (en) * 2021-11-12 2022-03-01 成都大学 Computer software fault monitoring system
CN116450463A (en) * 2023-04-27 2023-07-18 宁波云弧科技有限公司 Processing method for monitoring server hardware
CN117319047A (en) * 2023-10-09 2023-12-29 北京易财花科技有限公司 Network path analysis method and system based on network security anomaly detection

Similar Documents

Publication Publication Date Title
CN111459700B (en) Equipment fault diagnosis method, diagnosis device, diagnosis equipment and storage medium
CN117040138B (en) Power distribution cabinet operation dynamic safety evaluation system
CN113239627B (en) Distributed intelligent monitoring method and device
CN116351313B (en) Intelligent control system of food mixer based on computer control
CN116381563B (en) Underground power cable monitoring method and device
CN112994972B (en) Distributed probe monitoring platform
CN110469461B (en) Fracture estimation method and device for fan toothed belt and readable storage medium
CN111144009B (en) Running state evaluation method, device, equipment and storage medium of fan
CN107103425B (en) Intelligent energy evaluation system for power generation equipment running state computer
CN114061770B (en) Distributed prefabricated optical fiber bus temperature measurement system
CN112267979B (en) Early warning method and system for judging failure of yaw bearing
CN110687851A (en) Terminal operation monitoring system and method
CN112529059B (en) Unit electromagnetic vibration diagnosis method, system, computer equipment and storage medium
CN116538092B (en) Compressor on-line monitoring and diagnosing method, device, equipment and storage medium
CN117539727A (en) Computer running state monitoring method and system
CN116578486A (en) Intelligent manufacturing industrial software debugging system based on cloud computing
CN116389304A (en) SG-TMS-based network operation state trend analysis system
CN113884837B (en) Cable partial discharge online monitoring and analyzing system and analyzing method
CN115792634A (en) Battery monomer voltage sampling fault identification method based on cloud online data
CN114689321A (en) Bearing fault diagnosis method and device for wind generating set and electronic equipment
CN209659326U (en) One kind being used for network communication operation management equipment
Wüstrich et al. AC/DCIM: Acoustic Channels for Data Center Infrastructure Monitoring
CN115711206B (en) Wind driven generator blade icing state monitoring system based on clustering weight
CN117667589A (en) Network hardware operation risk monitoring and early warning system
CN115438310A (en) Power determination method and device for new energy station

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination