CN116881096A - Computer fault alarm method and alarm system - Google Patents

Computer fault alarm method and alarm system Download PDF

Info

Publication number
CN116881096A
CN116881096A CN202311147327.7A CN202311147327A CN116881096A CN 116881096 A CN116881096 A CN 116881096A CN 202311147327 A CN202311147327 A CN 202311147327A CN 116881096 A CN116881096 A CN 116881096A
Authority
CN
China
Prior art keywords
state
coefficient
computer equipment
computer
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311147327.7A
Other languages
Chinese (zh)
Inventor
杨丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guizhou Zhengyi Technology Co ltd
Original Assignee
Guizhou Zhengyi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guizhou Zhengyi Technology Co ltd filed Critical Guizhou Zhengyi Technology Co ltd
Priority to CN202311147327.7A priority Critical patent/CN116881096A/en
Publication of CN116881096A publication Critical patent/CN116881096A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/20Cooling means
    • G06F1/206Cooling means comprising thermal management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application discloses a computer fault alarm method and an alarm system, which relate to the technical field of computer equipment, monitor the running state of the computer equipment and acquire a monitoring result, establish a first running state set of the computer equipment and generate a first running coefficient; acquiring the hardware operation and data communication state of the computer equipment, respectively generating an operation coefficient and a communication coefficient of the hardware, acquiring an equipment state coefficient, predicting the change trend of the equipment state coefficient, and sending out second early warning information if the increase ratio of the predicted value is higher than the expected value; the method comprises the steps of enabling computer equipment to continuously run for a delay time, performing self-checking on the computer after adjusting the environment temperature, obtaining current fault characteristics, and matching and outputting corresponding maintenance schemes after a maintenance scheme library is preset. The increasing proportion of the state coefficient predicted value exceeds the expected value, and the running state of the computer equipment is gradually worsened, so that running faults can be generated immediately, and timely prevention is needed.

Description

Computer fault alarm method and alarm system
Technical Field
The application relates to the technical field of computer equipment, in particular to a computer fault alarm method and an alarm system.
Background
Computer security is a series of measures and practices that ensure that computer systems, networks, data, and information related thereto are protected from unauthorized access, use, modification, vandalism, leakage, or interference. The goal of computer security is to protect the integrity, confidentiality, and availability of computer systems, as well as to prevent damage to the systems by malicious activity.
In the Chinese patent application number 202111058633.4, a computer hardware fault alarm system and method are disclosed, comprising a data acquisition, an alarm controller, a local area network and a maintenance terminal, wherein the data acquisition comprises a computer main board detection card, a storage module and computer hardware. The computer main board detection card is used for detecting computer hardware and storing fault detection information into the storage module for backup, meanwhile, the computer main board detection card transmits the fault detection information to the alarm controller for alarm, the alarm controller respectively transmits the fault detection information to the display terminal, the central host and the mobile terminal through GPRS and GSM, and at the moment, the central host can call out the fault information of the computer hardware and the fault resolution database and display the fault detection information on the display terminal.
The technical scheme recorded in the application can solve the problems of long processing period, time waste and work and life delay when computer hardware fails, but the application lacks evaluation on the running state of a computer when alarming the computer failure, has lower comprehensiveness when alarming, and simultaneously, the system is difficult to quickly give a targeted maintenance scheme after alarming, so that the maintenance efficiency of the computer equipment is low, and the normal operation of the computer equipment is not enough to be guaranteed.
Therefore, the application provides a computer fault alarm method and an alarm system.
Disclosure of Invention
Aiming at the defects of the prior art, the application provides a computer fault alarm method and an alarm system, which are characterized in that the operation state of computer equipment is monitored, a monitoring result is obtained, a first operation state set of the computer equipment is established, and a first operation coefficient is generated; acquiring the hardware operation and data communication state of the computer equipment, respectively generating an operation coefficient and a communication coefficient of the hardware, acquiring an equipment state coefficient, predicting the change trend of the equipment state coefficient, and sending out second early warning information if the increase ratio of the predicted value is higher than the expected value; the method comprises the steps of enabling computer equipment to continuously run for a delay time, performing self-checking on the computer after adjusting the environment temperature, obtaining current fault characteristics, and matching and outputting corresponding maintenance schemes after a maintenance scheme library is preset. The increasing proportion of the state coefficient predicted value exceeds the expected value, the running state of the computer equipment is gradually deteriorated, the running fault is possibly about to be generated, the running fault needs to be prevented in time, and the problems in the background technology are solved.
In order to achieve the above purpose, the application is realized by the following technical scheme:
a computer failure alarm method, comprising the following contents:
when the computer equipment is in a continuous running state, monitoring the running state of the computer equipment, acquiring a monitoring result, establishing a first running state set of the computer equipment, further generating a first running coefficient Dex, and if the first running coefficient Dex is higher than a first state threshold, sending out first early warning information;
after receiving the first early warning information, monitoring the hardware working state of the computer equipment, obtaining the hardware running state of the computer equipment, establishing a hardware state set of the computer equipment, further generating a hardware running coefficient Ysx, and sending out first warning information to the outside if the hardware running coefficient Ysx is higher than an running threshold value;
after the first early warning information is received, a monitoring period is set, the data communication state of the computer equipment is monitored, a data communication state set is established according to the monitoring result, a communication coefficient Txs is further generated, and if the acquired communication coefficient Txs is lower than a communication threshold value, second alarm information is sent;
if the first alarm information or the second alarm information is not received, acquiring a device state coefficient Byt by using a communication coefficient Txs and an operation coefficient Ysx, predicting the change trend of the device state coefficient Byt, acquiring a state coefficient predicted value Yt, and if the increase ratio of the state coefficient predicted value Yt is higher than the expected value, sending second early warning information;
after receiving the second early warning information, the computer equipment is enabled to continuously run for a delay time Ct, the computer is subjected to self-checking after the environmental temperature is adjusted, the current fault characteristics are obtained, and after a maintenance scheme library is preset, the corresponding maintenance scheme is matched and output.
Further, when the computer device is in a continuous running state, the running state of the computer device is monitored, and the specific contents are as follows: setting a plurality of detection points outside the computer equipment, monitoring the external temperature of the computer equipment at the detection points, acquiring the equipment temperature at each detection point, and acquiring the case temperature Xt after averaging;
presetting a first temperature threshold, inquiring and acquiring the current rotation speed of a cooling fan of computer equipment to generate a fan rotation speed Zs if the temperature Xt of the case exceeds the first temperature threshold; meanwhile, detecting noise emitted by the computer equipment at each detection point, and generating chassis noise Jz after averaging; and summarizing the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and establishing a first running state set of the computer equipment.
Further, a first operation coefficient Dex of the computer device is generated from the first operation state set in the following specific manner: dimensionless processing is carried out on the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and the method is according to the following formula:
wherein ,is->For changeable constant parameters->,/>And->The user can adjust according to the actual situation; />For correction coefficients greater than 0, the +.A.is determined by software simulation>Specific values of (2);
and if the acquired first operation coefficient Dex is higher than the first state threshold value, sending out first early warning information.
Further, after receiving the first early warning information, the hardware working state of the computer equipment is monitored, and the monitored content is as follows: inquiring and sequentially obtaining the utilization rate of a processor through a task manager of computer equipment to generate the occupancy rate Zy of the processor, and inquiring and obtaining the memory occupancy rate Ny and the hard disk occupancy rate Py of the processor if the occupancy rate Zy of the processor is preset with a threshold value; and summarizing the occupancy rate Zy of the processor, the occupancy rate Ny of the memory and the occupancy rate Py of the hard disk, and establishing a hardware state set of the computer equipment.
Further, generating an operating coefficient Ysx of the hardware of the computer device from the set of hardware states; the method comprises the following steps: after dimensionless processing is carried out on the processor occupancy rate Zy, the memory occupancy rate Ny and the hard disk occupancy rate Py, the following formula is adopted:
wherein ,is->For the changeable constant weight coefficient, +.>,/>The user can adjust according to actual conditions, and if the operation coefficient Ysx of the hardware is higher than the operation threshold value, the first alarm information is sent to the outside.
Further, no provision is made forWhen the first alarm information or the second alarm information is received, a plurality of communication coefficients Txs and operation coefficients Ysx are continuously obtained, and the communication coefficients are used as followsAnd the operational coefficients Ysx generate device state coefficients Byt for the computer device:
the significance of the parameters is:,/>and->,/>The specific value of the weight coefficient can be set by user adjustment or obtained by mathematical analysis software through simulation analysis; and if the acquired communication coefficient Txs is lower than the communication threshold value, sending out second alarm information.
Further, when the first alarm information or the second alarm information is not received, a plurality of communication coefficients Txs and operation coefficients Ysx are continuously obtained, and the communication coefficients are used as followsAnd the operational coefficients Ysx generate device state coefficients Byt for the computer device:
wherein ,,/>and->1, representing the running risk of the computer equipment by using an equipment state coefficient Byt, wherein the representing mode is as follows:
wherein ,for the weight, its specific value can be set by the user adjustment, +.>Is the expected average of the communication coefficients Txs,n is a positive integer greater than 1, which is the expected average of the running coefficients Ysx.
Further, a plurality of device state coefficients Byt are continuously obtained, the function fitting is carried out on the changes of the plurality of device state coefficients Byt, after the K-S test is carried out, a state function of the computer device is generated, the state function predicts the change trend of the device state coefficients Byt, and a first predicted value of the device state coefficients Byt is obtained
Predicting the change trend of the equipment state coefficient Byt by a smooth index prediction method to obtain a second predicted value of the equipment state coefficient BytThe method comprises the steps of carrying out a first treatment on the surface of the Obtaining a second predicted value +.>First predictive value->The two are weighted and averaged according to the proportion of three to seven, and the weighted and averaged is taken as the predicted value of the equipment state coefficient Byt, and is markedThe state coefficient predicted value Yt is recorded;
and acquiring an increase ratio Zb of the state coefficient predicted value Yt relative to the current equipment state coefficient Byt, presetting a proportion threshold value, and if the increase ratio Zb is larger than the proportion threshold value, sending out second early warning information.
Further, after the second early warning information is received, determining the area where the computer equipment is located as an operation area, setting heat dissipation equipment in the operation area, and dissipating heat of the computer equipment in the operation area to enable the continuous operation time of the heat dissipation equipment to at least exceed the hysteresis time Ct;
after the hysteresis time Ct is elapsed, performing self-checking on the computer, and judging whether the computer equipment has an operation fault currently or not; if the operation fault exists, determining the operation fault as a fault characteristic according to the characteristics of the operation fault; the maintenance scheme aiming at the operation faults is obtained through on-line linear retrieval or off-line collection, a maintenance scheme library is built after summarization, and according to the correspondence between the fault characteristics and the maintenance scheme, the corresponding maintenance scheme is matched for the computer equipment after the self-checking is finished, and the maintenance scheme is output.
A computer fault alarm method and alarm system includes:
the first early warning unit monitors the running state of the computer equipment and acquires a monitoring result when the computer equipment is in a continuous running state, a first running state set of the computer equipment is established, a first running coefficient Dex is generated, and if the first running coefficient Dex is higher than a first state threshold value, first early warning information is sent outwards;
the method comprises the steps that after a first alarm unit receives first early warning information, the hardware working state of computer equipment is monitored, the hardware operating state of the computer equipment is obtained, a hardware state set of the computer equipment is established, a hardware operating coefficient Ysx is generated, and if the hardware operating coefficient Ysx is higher than an operating threshold value, the first alarm information is sent to the outside;
the second alarm unit is used for setting a monitoring period after receiving the first early warning information, monitoring the data communication state of the computer equipment, establishing a data communication state set according to the monitoring result, further generating a communication coefficient Txs, and sending out second alarm information if the acquired communication coefficient Txs is lower than a communication threshold value;
the second early warning unit acquires the equipment state coefficient Byt by using the communication coefficient Txs and the operation coefficient Ysx if the first alarm information or the second alarm information is not received, predicts the change trend of the equipment state coefficient Byt, acquires the state coefficient predicted value Yt, and sends out the second early warning information if the increase ratio of the state coefficient predicted value Yt is higher than the expected value;
and after the maintenance unit receives the second early warning information, the computer equipment is enabled to continuously run for a delay time Ct, the computer is subjected to self-checking after the environmental temperature is regulated, the current fault characteristics are obtained, and after a maintenance scheme library is preset, the corresponding maintenance scheme is matched and output.
The application provides a computer fault alarm method and an alarm system, which have the following beneficial effects:
1. the running coefficient Ysx of the hardware of the computer equipment is generated through the hardware state set, the current hardware load of the computer equipment is evaluated, whether the hardware running load of the computer equipment has a running hidden trouble or not is judged, if the running hidden trouble exists, the running hidden trouble needs to be processed in time, and therefore the running state of the computer equipment is guaranteed.
2. The current running state of the computer equipment is monitored from the angle of data communication, the data communication state of the computer equipment is evaluated by a communication coefficient Txs, whether the current running state of the computer equipment can meet the current use requirement is judged, if the current running state of the computer equipment cannot meet the current use requirement, the sent second alarm information timely adjusts the current communication state and the network connection state of the computer equipment, and the running state of the computer equipment is guaranteed.
3. Continuously acquiring a plurality of communication coefficients Txs and operation coefficients Ysx, generating an equipment state coefficient Byt of the computer equipment, predicting the change trend of the equipment state coefficient Byt, generating a state coefficient predicted value Yt, predicting the current operation state of the computer equipment by the state coefficient predicted value Yt, if the increasing proportion of the state coefficient predicted value Yt exceeds the expected value, indicating that the operation state of the computer equipment is gradually worsened, possibly causing an operation fault, possibly causing a certain operation risk of the computer equipment, and needing to be timely prevented.
4. The method comprises the steps of adjusting the running environment of computer equipment, radiating the computer equipment from the outside, reducing the running load of the computer, enabling the computer equipment to continue running for a period of time, performing periodic self-checking on the computer equipment, and if the running fault exists in the current calculation, rapidly matching out a corresponding maintenance scheme, thereby providing the maintenance scheme for the computer equipment, solving the problem in a targeted manner, rapidly giving out the maintenance scheme when the computer equipment generates potential safety hazards, providing the fault solving efficiency, and guaranteeing the normal running of the computer equipment.
Drawings
FIG. 1 is a schematic flow chart of a computer failure alarm method of the present application;
FIG. 2 is a schematic diagram of a computer failure alarm system according to the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Referring to fig. 1, the application provides a computer fault alarm method, which comprises the following steps:
step one, when the computer equipment is in a continuous running state, monitoring the running state of the computer equipment, acquiring a monitoring result, establishing a first running state set of the computer equipment, generating a first running coefficient Dex, and if the first running coefficient Dex is higher than a first state threshold value, sending out first early warning information;
the first step comprises the following steps:
step 101, monitoring the running state of the computer equipment when the computer equipment is in a continuous running state, wherein the specific contents are as follows: setting a plurality of detection points outside the computer equipment, monitoring the external temperature of the computer equipment at the detection points, acquiring the equipment temperature at each detection point, and acquiring the case temperature Xt after averaging;
102, presetting a first temperature threshold, and inquiring and acquiring the current rotation speed of a cooling fan of computer equipment to generate a fan rotation speed Zs if the temperature Xt of the case exceeds the first temperature threshold; meanwhile, detecting noise emitted by the computer equipment at each detection point, and generating chassis noise Jz after averaging;
summarizing the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and establishing a first running state set of the computer equipment;
step 103, generating a first operation coefficient Dex of the computer device from the first operation state set, specifically as follows: dimensionless processing is carried out on the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and the method is according to the following formula:
wherein ,is->For changeable constant parameters->,/>And->The user can adjust according to the actual situation; />For correction coefficients greater than 0, the +.A.is determined by software simulation>Specific values of (2);
a first state threshold value is preset, and if the acquired first operation coefficient Dex is higher than the first state threshold value, first early warning information is sent outwards.
When the computer equipment is in a continuous operation state, for example, a time length threshold of operation is set, if the continuous operation time length of the computer equipment exceeds the time length threshold, at this time, the current operation state of the computer equipment is monitored, a first operation state set of the computer equipment is established, and if the first operation coefficient Dex is set, so that whether the computer equipment is currently at operation risk is judged according to the evaluation of the current operation state of the computer equipment if the first operation coefficient Dex is set, and if the risk is possible, the processing can be timely performed, for example, the heat dissipation capacity of the computer equipment is increased, and the operation load of the computer equipment is reduced.
Step two, after receiving the first early warning information, monitoring the hardware working state of the computer equipment, obtaining the hardware running state of the computer equipment, establishing a hardware state set of the computer equipment, generating a hardware running coefficient Ysx, and if the hardware running coefficient Ysx is higher than an running threshold value, sending out the first warning information to the outside;
the second step comprises the following steps:
step 201, after receiving the first early warning information, monitoring the hardware working state of the computer device, where the monitored content is as follows: inquiring and sequentially obtaining the utilization rate of the processor through a task manager of the computer equipment to generate the processor occupancy rate Zy, and inquiring and obtaining the memory occupancy rate Ny and the hard disk occupancy rate Py if the processor occupancy rate Zy is a preset threshold value, for example, when the processor occupancy rate exceeds 40%; summarizing the occupancy rate Zy of the processor, the occupancy rate Ny of the memory and the occupancy rate Py of the hard disk, and establishing a hardware state set of the computer equipment;
step 202, generating an operation coefficient Ysx of the hardware of the computer equipment from the hardware state set; the method comprises the following steps: after dimensionless processing is carried out on the processor occupancy rate Zy, the memory occupancy rate Ny and the hard disk occupancy rate Py, the following formula is adopted:
wherein ,is->For the changeable constant weight coefficient, +.>,/>The user can adjust according to the actual situation;
presetting an operation threshold of hardware, and if the operation coefficient Ysx of the hardware is higher than the operation threshold, sending first alarm information to the outside to indicate that the computer equipment has operation hidden danger currently or is about to have operation risk, wherein the computer equipment needs to be maintained and overhauled in time at the moment; when the operation coefficient Ysx is not higher than the operation threshold, it can be determined that the computer device is currently in a normal operation state.
In use, the contents of steps 201 and 202 are combined:
after the first early warning information is acquired, an operation coefficient Ysx of the hardware of the computer equipment is generated through the hardware state set, the current hardware load of the computer equipment is evaluated, whether the hardware operation load of the computer equipment has operation hidden danger or not is judged, if the operation hidden danger exists, the operation hidden danger needs to be processed in time, and therefore the operation state of the computer equipment is guaranteed.
Step three, after receiving the first early warning information, setting a monitoring period, monitoring the data communication state of the computer equipment, establishing a data communication state set according to the monitoring result, further generating a communication coefficient Txs, and if the acquired communication coefficient Txs is lower than a communication threshold value, sending out second alarm information;
the third step comprises the following steps:
step 301, after receiving the first early warning information, the computer device is in an operation state and the network is in a normal connection state; setting a monitoring period, for example, taking 5 minutes as a monitoring period, monitoring the communication state of the computer equipment in each monitoring period, acquiring the data exchange amount Jv of each minute in the monitoring period, acquiring the data packet loss rate Dv and the data delay Yv when data are transmitted, and establishing a data communication state set of the computer equipment after summarizing;
step 302, generating a communication coefficient Txs from the data communication state set, which specifically includes the following steps: after dimensionless processing is carried out on the data exchange amount Jv, the data packet loss rate Dv and the data delay Yv, the following formula is adopted:
the significance of the parameters is:,/>and->,/>The specific value of the weight coefficient can be set by user adjustment or obtained by mathematical analysis software through simulation analysis;
if the obtained communication coefficient Txs is lower than the communication threshold, the communication state of the computer device is worse than expected, and the current communication state and the network connection state of the computer device need to be adjusted, so that second alarm information is sent.
In use, the contents of steps 301 and 302 are combined:
after the first alarm information is received, the current running state of the computer equipment is monitored from the data communication angle after the running state of the computer equipment is evaluated from the hardware load angle, the data communication state of the computer equipment is evaluated by the communication coefficient Txs, whether the current running state of the computer equipment can meet the current use requirement is judged, if the current running state of the computer equipment cannot meet the current use requirement, the sent second alarm information is convenient for timely adjusting the current communication state and the network connection state of the computer equipment, and the running state of the computer equipment is guaranteed.
Step four, if the first alarm information or the second alarm information is not received, acquiring a device state coefficient Byt by using a communication coefficient Txs and an operation coefficient Ysx, predicting the change trend of the device state coefficient Byt, acquiring a state coefficient predicted value Yt, and if the increase ratio of the state coefficient predicted value Yt is higher than the expected value, sending out second early warning information;
the fourth step comprises the following steps:
step 401, when the first alarm information or the second alarm information is not received, at least explaining that the computer equipment is in a normal use state in the current hardware state and communication state, and continuously acquiring a plurality of communication coefficients Txs and operation coefficients Ysx;
for example: communication coefficient、/>、/>Up to->、/>Run coefficient->、/>、/>Up to、/>The method comprises the steps of carrying out a first treatment on the surface of the The device state coefficients Byt for the computer device are generated from the communication coefficients Txs and the run coefficients Ysx as follows:
wherein ,,/>and->1, representing the running risk of the computer equipment by using an equipment state coefficient Byt, wherein the representing mode is as follows:
wherein ,for the weight, its specific value can be set by the user adjustment, +.>Is the expected average of the communication coefficients Txs,n is a positive integer greater than 1, which is the expected mean of the running coefficients Ysx;
step 402, continuously obtaining a plurality of device state coefficients Byt at fixed intervals along a time axis, performing function fitting on the changes of the plurality of device state coefficients Byt, generating a state function of the computer device after the K-S test, and performing function fitting on the changes of the device state coefficients Byt by the state functionTrend prediction is carried out, and a first predicted value of the equipment state coefficient Byt is obtained
Predicting the change trend of the equipment state coefficient Byt by a smooth index prediction method to obtain a second predicted value of the equipment state coefficient Byt
Step 403, obtaining a second predicted value of the device state coefficient BytFirst predictive value->The two are weighted and averaged according to the proportion of three to seven, and the weighted and averaged are used as the predicted value of the equipment state coefficient Byt and marked as the state coefficient predicted value Yt;
acquiring an increasing ratio Zb of a state coefficient predicted value Yt relative to a current equipment state coefficient Byt, presetting a proportion threshold value, and if the increasing ratio Zb is larger than the proportion threshold value, sending second early warning information, wherein if the second early warning information is sent, the running state of the computer equipment is gradually deteriorated;
in use, the contents of steps 401 and 403 are combined:
when the first alarm information or the second alarm information is not received, a plurality of communication coefficients Txs and operation coefficients Ysx are continuously acquired, an equipment state coefficient Byt of the computer equipment is generated, the change trend of the equipment state coefficient Byt is predicted, a state coefficient predicted value Yt is generated, the current operation state of the computer equipment is predicted by the state coefficient predicted value Yt, if the increasing proportion of the state coefficient predicted value Yt exceeds the expected value, the operation state of the computer equipment is gradually deteriorated, an operation fault is probably about to occur, a certain operation risk exists in the computer equipment, and timely prevention is needed.
Step five, after receiving the second early warning information, enabling the computer equipment to continuously run for a delay time Ct, performing self-checking on the computer after adjusting the environment temperature to obtain the current fault characteristics, and after presetting a maintenance scheme library, matching and outputting a corresponding maintenance scheme;
the fifth step comprises the following steps:
step 501, after receiving the second early warning information, setting a hysteresis time Ct, for example, 30 minutes, determining an area where the computer equipment is located, determining the area as an operation area, setting a heat dissipation device in the operation area, and dissipating heat of the computer equipment in the operation area, so that the continuous operation time of the heat dissipation device at least exceeds the hysteresis time Ct;
step 502, after the hysteresis time Ct, performing self-checking on the computer, and judging whether the computer equipment has an operation fault at present; if the operation fault exists, determining the operation fault as a fault characteristic according to the characteristics of the operation fault, for example, the occupancy rate of a processor is too high, the packet loss rate of data is too high, and the like;
step 503, obtaining maintenance schemes aiming at operation faults through on-line linear retrieval or off-line collection, building a maintenance scheme library after summarizing, matching corresponding maintenance schemes for computer equipment after self-checking is finished according to the correspondence between fault characteristics and the maintenance schemes, and outputting the maintenance schemes so as to be convenient for timely maintenance.
In use, the contents of steps 501 and 503 are combined:
the method comprises the steps of predicting the running state of the computer equipment, firstly adjusting the running environment of the computer equipment, radiating the computer equipment from the outside, reducing the running load of the computer, enabling the computer equipment to continue running for a period of time, carrying out periodic self-checking on the computer equipment, and if the running fault exists in the current calculation, rapidly matching out a corresponding maintenance scheme, so that the maintenance scheme can be provided for the computer equipment in time, the problem can be solved in a targeted manner, when the potential safety hazard is generated in the computer equipment, the maintenance scheme can be rapidly given out, the failure solving efficiency is provided, and the normal running of the computer equipment is guaranteed.
Referring to fig. 2, the present application provides a computer fault alarm system, comprising:
the first early warning unit monitors the running state of the computer equipment and acquires a monitoring result when the computer equipment is in a continuous running state, a first running state set of the computer equipment is established, a first running coefficient Dex is generated, and if the first running coefficient Dex is higher than a first state threshold value, first early warning information is sent outwards;
the method comprises the steps that after a first alarm unit receives first early warning information, the hardware working state of computer equipment is monitored, the hardware operating state of the computer equipment is obtained, a hardware state set of the computer equipment is established, a hardware operating coefficient Ysx is generated, and if the hardware operating coefficient Ysx is higher than an operating threshold value, the first alarm information is sent to the outside;
the second alarm unit is used for setting a monitoring period after receiving the first early warning information, monitoring the data communication state of the computer equipment, establishing a data communication state set according to the monitoring result, further generating a communication coefficient Txs, and sending out second alarm information if the acquired communication coefficient Txs is lower than a communication threshold value;
the second early warning unit acquires the equipment state coefficient Byt by using the communication coefficient Txs and the operation coefficient Ysx if the first alarm information or the second alarm information is not received, predicts the change trend of the equipment state coefficient Byt, acquires the state coefficient predicted value Yt, and sends out the second early warning information if the increase ratio of the state coefficient predicted value Yt is higher than the expected value;
and after the maintenance unit receives the second early warning information, the computer equipment is enabled to continuously run for a delay time Ct, the computer is subjected to self-checking after the environmental temperature is regulated, the current fault characteristics are obtained, and after a maintenance scheme library is preset, the corresponding maintenance scheme is matched and output.
In the application, the formulas are all dimensionality-removed numerical calculation, the formulas are formulas which are obtained by acquiring a large amount of data and performing software simulation and are closer to the actual situation, and preset parameters in the formulas can be adjusted and set by a person skilled in the art according to the actual application scene.
The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any other combination. When implemented in software, the above-described embodiments may be implemented in whole or in part in the form of a computer program product. Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
The foregoing is merely illustrative of the present application, and the present application is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present application.

Claims (7)

1. A computer fault alarm method is characterized in that: the method comprises the following steps:
when the computer equipment is in a continuous running state, monitoring the running state of the computer equipment, acquiring a monitoring result, establishing a first running state set of the computer equipment, further generating a first running coefficient Dex, and if the first running coefficient Dex is higher than a first state threshold, sending out first early warning information; when the computer equipment is in a continuous running state, the running state of the computer equipment is monitored, and the specific contents are as follows: setting a plurality of detection points outside the computer equipment, monitoring the external temperature of the computer equipment at the detection points, acquiring the equipment temperature at each detection point, and acquiring the case temperature Xt after averaging;
presetting a first temperature threshold, inquiring and acquiring the current rotation speed of a cooling fan of computer equipment to generate a fan rotation speed Zs if the temperature Xt of the case exceeds the first temperature threshold; detecting the noise emitted by the computer equipment at each detection point, and generating chassis noise Jz after averaging; summarizing the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and establishing a first running state set of the computer equipment;
after receiving the first early warning information, monitoring the hardware working state of the computer equipment, obtaining the hardware running state of the computer equipment, establishing a hardware state set of the computer equipment, further generating a hardware running coefficient Ysx, and sending out first warning information to the outside if the hardware running coefficient Ysx is higher than an running threshold value; after receiving the first early warning information, monitoring the hardware working state of the computer equipment, wherein the monitored content is as follows: inquiring and sequentially obtaining the utilization rate of a processor through a task manager of computer equipment to generate the occupancy rate Zy of the processor, and inquiring and obtaining the memory occupancy rate Ny and the hard disk occupancy rate Py of the processor if the occupancy rate Zy of the processor is preset with a threshold value; summarizing the occupancy rate Zy of the processor, the occupancy rate Ny of the memory and the occupancy rate Py of the hard disk, and establishing a hardware state set of the computer equipment;
after the first early warning information is received, a monitoring period is set, the data communication state of the computer equipment is monitored, a data communication state set is established according to the monitoring result, a communication coefficient Txs is further generated, and if the acquired communication coefficient Txs is lower than a communication threshold value, second alarm information is sent; after receiving the first early warning information, monitoring the communication state of the computer equipment in each monitoring period, acquiring the data exchange amount Jv of each minute in the monitoring period, acquiring the data packet loss rate Dv and the data delay Yv when data are transmitted, and establishing a data communication state set of the computer equipment after summarizing;
if the first alarm information or the second alarm information is not received, acquiring a device state coefficient Byt by using a communication coefficient Txs and an operation coefficient Ysx, predicting the change trend of the device state coefficient Byt, acquiring a state coefficient predicted value Yt, and if the increase ratio of the state coefficient predicted value Yt is higher than the expected value, sending second early warning information;
after receiving the second early warning information, the computer equipment is enabled to continuously run for a delay time Ct, the computer is subjected to self-checking after the environmental temperature is adjusted, the current fault characteristics are obtained, and after a maintenance scheme library is preset, the corresponding maintenance scheme is matched and output.
2. A computer failure warning method according to claim 1, characterized in that:
generating a first operation coefficient Dex of the computer equipment from the first operation state set by the following specific modes: dimensionless processing is carried out on the case temperature Xt, the fan rotating speed Zs and the case noise Jz, and the method is according to the following formula:
wherein ,is->For changeable constant parameters->,/>And->The user can adjust according to the actual situation; />For correction coefficients greater than 0, the +.A.is determined by software simulation>Specific values of (2); and if the acquired first operation coefficient Dex is higher than the first state threshold value, sending out first early warning information.
3. A computer failure warning method according to claim 2, characterized in that:
generating an operating coefficient Ysx of the computer device hardware from the set of hardware states; the method comprises the following steps: after dimensionless processing is carried out on the processor occupancy rate Zy, the memory occupancy rate Ny and the hard disk occupancy rate Py, the following formula is adopted:
wherein ,is->For the changeable constant weight coefficient, +.>,/>And if the operation coefficient Ysx of the hardware is higher than the operation threshold value, sending out first alarm information to the outside.
4. A computer failure warning method according to claim 1, characterized in that:
the communication coefficients Txs are generated from the set of data communication states in the following manner: after dimensionless processing is carried out on the data exchange amount Jv, the data packet loss rate Dv and the data delay Yv, the following formula is adopted:
the significance of the parameters is:,/>and->,/>The specific value of the weight coefficient can be set by user adjustment or obtained by mathematical analysis software through simulation analysis;
and if the acquired communication coefficient Txs is lower than the communication threshold value, sending out second alarm information.
5. A computer failure warning method according to claim 1, characterized in that:
when the first alarm information or the second alarm information is not received, a plurality of communication coefficients Txs and operation coefficients Ysx are continuously obtained, and the communication coefficients are used as followsAnd the operational coefficients Ysx generate device state coefficients Byt for the computer device:
wherein ,,/>and->1, representing the running risk of the computer equipment by using an equipment state coefficient Byt, wherein the representing mode is as follows:
wherein ,for the weight, its specific value can be set by the user adjustment, +.>Is the expected mean of the communication coefficients Txs, +.>N is a positive integer greater than 1, which is the expected average of the running coefficients Ysx.
6. A computer failure warning method according to claim 5, characterized in that:
continuously acquiring a plurality of equipment state coefficients Byt, performing function fitting on the changes of the plurality of equipment state coefficients Byt, generating a state function of the computer equipment after K-S inspection, predicting the change trend of the equipment state coefficients Byt by the state function, and acquiring a first predicted value of the equipment state coefficients Byt
Predicting the change trend of the equipment state coefficient Byt by a smooth index prediction method to obtain a second predicted value of the equipment state coefficient BytThe method comprises the steps of carrying out a first treatment on the surface of the Obtaining a second predicted value +.>First predictive value->The two are weighted and averaged according to the proportion of three to seven, and the weighted and averaged are used as the predicted value of the equipment state coefficient Byt and marked as the state coefficient predicted value Yt;
and acquiring an increase ratio Zb of the state coefficient predicted value Yt relative to the current equipment state coefficient Byt, presetting a proportion threshold value, and if the increase ratio Zb is larger than the proportion threshold value, sending out second early warning information.
7. A computer failure warning method according to claim 6, characterized in that:
after the second early warning information is received, determining the area where the computer equipment is located as an operation area, setting heat dissipation equipment in the operation area, and dissipating heat of the computer equipment in the operation area to enable the continuous operation time of the heat dissipation equipment to at least exceed the hysteresis time Ct;
after the hysteresis time Ct is elapsed, performing self-checking on the computer, and judging whether the computer equipment has an operation fault currently or not; if the operation fault exists, determining the operation fault as a fault characteristic according to the characteristics of the operation fault; the maintenance scheme aiming at the operation faults is obtained through on-line linear retrieval or off-line collection, a maintenance scheme library is built after summarization, and according to the correspondence between the fault characteristics and the maintenance scheme, the corresponding maintenance scheme is matched for the computer equipment after the self-checking is finished, and the maintenance scheme is output.
CN202311147327.7A 2023-09-07 2023-09-07 Computer fault alarm method and alarm system Pending CN116881096A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311147327.7A CN116881096A (en) 2023-09-07 2023-09-07 Computer fault alarm method and alarm system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311147327.7A CN116881096A (en) 2023-09-07 2023-09-07 Computer fault alarm method and alarm system

Publications (1)

Publication Number Publication Date
CN116881096A true CN116881096A (en) 2023-10-13

Family

ID=88262631

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311147327.7A Pending CN116881096A (en) 2023-09-07 2023-09-07 Computer fault alarm method and alarm system

Country Status (1)

Country Link
CN (1) CN116881096A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117439687A (en) * 2023-12-20 2024-01-23 江苏华鹏智能仪表科技股份有限公司 Dual-mode communication method, system and device based on HPLC and HRF
CN117992315A (en) * 2024-04-03 2024-05-07 福建时代星云科技有限公司 EMS platform data visualization method and terminal

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117439687A (en) * 2023-12-20 2024-01-23 江苏华鹏智能仪表科技股份有限公司 Dual-mode communication method, system and device based on HPLC and HRF
CN117439687B (en) * 2023-12-20 2024-03-29 江苏华鹏智能仪表科技股份有限公司 Dual-mode communication method, system and device based on HPLC and HRF
CN117992315A (en) * 2024-04-03 2024-05-07 福建时代星云科技有限公司 EMS platform data visualization method and terminal

Similar Documents

Publication Publication Date Title
CN116881096A (en) Computer fault alarm method and alarm system
US20130150985A1 (en) Abnormality Detection for Isolating a Control System
CN114793019A (en) Secondary equipment visual supervision system based on big data analysis
CN116739568B (en) Intelligent network operation and maintenance service detection method, system and medium based on big data
CN103856471A (en) Cross-site scripting attack monitoring system and method
CN116502166B (en) Method, device, equipment and medium for predicting faults of target equipment
CN111556473A (en) Abnormal access behavior detection method and device
CN114387762A (en) Building data management method, device, equipment and storage medium
CN116894247B (en) Method and system for protecting computer system security
US6460005B1 (en) Apparatus and method for monitoring environmental conditions in a computing device
CN113760661B (en) Electricity utilization safety monitoring method and device based on edge server
CN113865649B (en) Lightning protection device degradation monitoring device, system and method
CN117220416B (en) Smart power grid electric power information safety transmission system
Cisar et al. EWMA algorithm in network practice
CN116661399A (en) Chemical workshop safety monitoring method and system
CN116992503A (en) Data security storage method and storage device
CN100446374C (en) A thermal protection method
WO2019036945A1 (en) Computer room static electricity monitoring system based on internet of things
CN116451282B (en) Sensor data tamper-proof system and method for monitoring carbon emission of website
JP3446891B2 (en) Monitoring system
CN114323116A (en) Power system monitoring method and device and computer equipment
KR102606063B1 (en) Building Automation System with failure prediction function
WO2024001666A1 (en) Network risk assessment method and related apparatus
CN211015459U (en) Computer network safety monitoring device
CN117742194A (en) Application monitoring method of system automation technology in power engineering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination