CN113746695B - Fault monitoring adjustment method, device and equipment - Google Patents

Fault monitoring adjustment method, device and equipment Download PDF

Info

Publication number
CN113746695B
CN113746695B CN202110872532.4A CN202110872532A CN113746695B CN 113746695 B CN113746695 B CN 113746695B CN 202110872532 A CN202110872532 A CN 202110872532A CN 113746695 B CN113746695 B CN 113746695B
Authority
CN
China
Prior art keywords
white
monitoring
priority
period
hardware
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110872532.4A
Other languages
Chinese (zh)
Other versions
CN113746695A (en
Inventor
张锡鑫
刘齐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202110872532.4A priority Critical patent/CN113746695B/en
Publication of CN113746695A publication Critical patent/CN113746695A/en
Application granted granted Critical
Publication of CN113746695B publication Critical patent/CN113746695B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/40Constructional details, e.g. power supply, mechanical construction or backplane
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/55Prevention, detection or correction of errors
    • H04L49/555Error detection
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The embodiment of the application discloses a fault monitoring adjustment method, device and equipment, which are used for determining a first period parameter and a first priority parameter matched with a fault monitoring result by acquiring the fault monitoring result of hardware in a white box switch, adjusting the monitoring period of the white box switch by using the first period parameter, and adjusting the monitoring priority of the hardware by using the first priority parameter. Therefore, the monitoring period and the monitoring priority of the hardware can be flexibly adjusted according to the fault monitoring result of the hardware, the fault monitoring efficiency is improved on the basis of more reliable fault monitoring, and the waste of system resources of the white box switch is reduced.

Description

Fault monitoring adjustment method, device and equipment
Technical Field
The present disclosure relates to the field of fault monitoring, and in particular, to a method, an apparatus, and a device for adjusting fault monitoring.
Background
A white-box switch is a switch that decouples the switch hardware and software. The user can only purchase the hardware of the white box switch and then match the corresponding software according to the use requirement. The white box switch has better openness and can also reduce networking cost.
During the use process of the white box switch, fault monitoring needs to be carried out on hardware. For example, hardware such as fans, power supplies, transceivers in the white-box switch are monitored for faults. At present, the fault monitoring efficiency of hardware is low. How to improve the efficiency of fault monitoring is a problem to be solved.
Disclosure of Invention
In view of this, the embodiments of the present application provide a method, an apparatus, and a device for adjusting fault monitoring, which can improve the hardware fault monitoring efficiency of a white-box switch.
Based on this, the technical scheme provided in the embodiment of the application is as follows:
in a first aspect, the present application provides a method for adjusting fault monitoring, where the method is applied to a white-box switch, and the method includes:
obtaining a fault monitoring result of hardware in the white box switch;
determining a first period parameter and a first priority parameter matched with the fault monitoring result;
according to the first period parameter, adjusting a monitoring period of the white-box switch, wherein the monitoring period is a period for monitoring whether hardware in the white-box switch has faults or not;
and adjusting the monitoring priority of the hardware according to the first priority parameter, wherein the monitoring priority is used for determining a monitoring sequence for monitoring whether the hardware has faults or not.
In one possible implementation, the method further includes:
acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches;
determining a second period parameter according to the monitoring period in the monitoring parameters;
adjusting the monitoring period of the white box switch by using the second period parameter;
determining a second priority parameter of the first hardware of the white-box switch according to the monitoring priority of the first hardware of the other white-box switches in the monitoring parameters;
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter.
In a possible implementation manner, the monitoring parameters further include model information of the other white-box switches, and after the obtaining the monitoring parameters of the other white-box switches, the method further includes:
and discarding monitoring parameters of the white box switch with different types from the white box switch according to the model information of the other white box switches.
In a possible implementation manner, the monitoring parameters further include network addresses of the other white-box switches, and after the acquiring the monitoring parameters of the other white-box switches, the method further includes:
Determining a first position distance between the other white-box switches and the white-box switch according to the network addresses of the other white-box switches and the network address of the white-box switch;
determining a first periodic weight and a first priority weight according to the first position distance;
the adjusting the monitoring period of the white-box switch by using the second period parameter includes:
adjusting the monitoring period of the white box switch by using the second period parameter and the first period weight;
the adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter includes:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the first priority weight.
In a possible implementation manner, the monitoring parameters further include a production serial number of the other white-box switch, and after the acquiring the monitoring parameters of the other white-box switch, the method further includes:
determining a production batch gap between the other white box switches and the white box switch according to the production serial numbers of the other white box switches and the production serial numbers of the white box switch;
Determining a second cycle weight and a second priority weight according to the production lot gap;
the adjusting the monitoring period of the white-box switch by using the second period parameter includes:
adjusting the monitoring period of the white box switch by using the second period parameter and the second period weight;
the adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter includes:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the second priority weight.
In one possible implementation, the method further includes:
acquiring hardware fault information through a server, wherein the hardware fault information comprises hardware information of second hardware, and the second hardware is the hardware with the fault;
determining a third period parameter and a third priority parameter according to the hardware fault information;
adjusting the monitoring period of the white box switch by using the third period parameter;
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter.
In one possible implementation manner, the hardware exception information further includes a failure time when the second hardware fails, and the method further includes:
Determining a third period weight and a third priority weight according to the time of the fault;
the adjusting the fault monitoring period of the white-box switch by using the third period parameter includes:
adjusting the monitoring period of the white box switch by using the third period parameter and the third period weight;
the adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter includes:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the third priority weight.
In one possible implementation manner, the hardware fault information further includes a network address of a target white-box switch, where the target white-box switch is a white-box switch that generates the hardware fault information, and the method further includes:
determining a second position distance between the target white-box switch and the own white-box switch according to the network address of the target white-box switch and the network address of the own white-box switch;
determining a fourth period weight and a fourth priority weight according to the second position distance;
the adjusting the monitoring period of the white-box switch by using the third period parameter includes:
Adjusting the monitoring period of the white box switch by using the third period parameter and the fourth period weight;
the adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter includes:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the fourth priority weight.
In a second aspect, the present application provides an adjustment device for fault monitoring, applied to a white-box switch, the device including:
the first acquisition unit is used for acquiring a fault monitoring result of hardware in the white-box switch;
a first determining unit, configured to determine a first cycle parameter and a first priority parameter that are matched with the fault monitoring result;
the first adjusting unit is used for adjusting the monitoring period of the white box switch according to the first period parameter, wherein the monitoring period is a period for monitoring whether hardware in the white box switch has faults or not;
and the second adjusting unit is used for adjusting the monitoring priority of the hardware according to the first priority parameter, and the monitoring priority is used for determining a monitoring sequence for monitoring whether the hardware is in fault or not.
In one possible implementation, the apparatus further includes:
the second acquisition unit is used for acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches;
the second determining unit is used for determining a second period parameter according to the monitoring period in the monitoring parameters;
the third adjusting unit is used for adjusting the monitoring period of the white box switch by utilizing the second period parameter;
a third determining unit, configured to determine a second priority parameter of the first hardware of the present white-box switch according to the monitoring priority of the first hardware of the other white-box switch in the monitoring parameters;
and the fourth adjusting unit is used for adjusting the monitoring priority of the first hardware of the white-box switch by utilizing the second priority parameter.
In a possible implementation manner, the monitoring parameter further includes model information of the other white-box switches, and the apparatus further includes:
and the deleting unit is used for discarding monitoring parameters of the white box switch with different types from the white box switch according to the type information of the other white box switches.
In a possible implementation manner, the monitoring parameter further includes a network address of the other white-box switch, and the apparatus further includes:
a fourth determining unit, configured to determine a first location distance between the other white-box switch and the own white-box switch according to the network address of the other white-box switch and the network address of the own white-box switch;
a fifth determining unit, configured to determine a first cycle weight and a first priority weight according to the first location distance;
the third adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the second period parameter and the first period weight;
the fourth adjusting unit is specifically configured to adjust the monitoring priority of the first hardware of the present white-box switch by using the second priority parameter and the first priority weight.
In a possible implementation manner, the monitoring parameter further includes a production serial number of the other white-box switch, and the apparatus further includes:
a sixth determining unit, configured to determine a production lot gap between the other white-box switch and the own white-box switch according to the production serial numbers of the other white-box switches and the production serial numbers of the own white-box switch;
A seventh determining unit configured to determine a second cycle weight and a second priority weight according to the production lot gap;
the third adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the second period parameter and the second period weight;
the fourth adjusting unit is specifically configured to adjust the monitoring priority of the first hardware of the present white-box switch by using the second priority parameter and the second priority weight.
In one possible implementation, the apparatus further includes:
acquiring hardware fault information through a server, wherein the hardware fault information comprises hardware information of second hardware, and the second hardware is the hardware with the fault;
an eighth determining unit, configured to determine a third cycle parameter and a third priority parameter according to the hardware failure information;
a fifth adjusting unit, configured to adjust a monitoring period of the present white box switch by using the third period parameter;
and a sixth adjusting unit, configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter.
In one possible implementation manner, the hardware exception information further includes a failure time when the second hardware fails, and the apparatus further includes:
A ninth determining unit, configured to determine a third cycle weight and a third priority weight according to the time when the fault occurs;
the fifth adjusting unit is specifically configured to adjust the monitoring period of the white box switch by using the third period parameter and the third period weight;
the sixth adjusting unit is specifically configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter and the third priority weight.
In one possible implementation manner, the hardware fault information further includes a network address of a target white-box switch, where the target white-box switch is a white-box switch that generates the hardware fault information, and the apparatus further includes:
a tenth determining unit, configured to determine a second location distance between the target white-box switch and the own white-box switch according to the network address of the target white-box switch and the network address of the own white-box switch;
an eleventh determining unit configured to determine a fourth period weight and a fourth priority weight according to the second location distance;
the fifth adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the third period parameter and the fourth period weight;
The sixth adjusting unit is specifically configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter and the fourth priority weight.
In a third aspect, the present application provides an adjustment device for fault monitoring, including: a processor, memory, system bus;
the processor and the memory are connected through the system bus;
the memory is configured to store one or more programs, the one or more programs comprising instructions, which when executed by the processor, cause the processor to perform the method of any of the embodiments described above.
In a fourth aspect, the present application provides a computer readable storage medium having instructions stored therein, which when run on a terminal device, cause the terminal device to perform the method according to any one of the embodiments described above.
From this, the embodiment of the application has the following beneficial effects:
according to the method, the device and the equipment for adjusting fault monitoring, the first periodic parameter and the first priority parameter matched with the fault monitoring result are determined by acquiring the fault monitoring result of the hardware in the white box switch, the monitoring period of the white box switch is adjusted by using the first periodic parameter, and the monitoring priority of the hardware is adjusted by using the first priority parameter. Therefore, the monitoring period and the monitoring priority of the hardware can be flexibly adjusted according to the fault monitoring result of the hardware, the fault monitoring efficiency is improved on the basis of more reliable fault monitoring, and the waste of system resources of the white box switch is reduced.
Drawings
Fig. 1 is a flowchart of a fault monitoring adjustment method provided in an embodiment of the present application;
fig. 2 is a schematic structural diagram of an adjusting device for fault monitoring according to an embodiment of the present application.
Detailed Description
In order to facilitate understanding and explanation of the technical solutions provided by the embodiments of the present application, the background art of the present application will be described first.
After researching the fault detection process of the traditional white box switch, it is found that in one case, the existing fault monitoring is to continuously inquire the state of the hardware through a certain uninterrupted program in the operating system, so as to judge whether the hardware can work normally. And if the hardware abnormality is detected by monitoring, corresponding measures are taken according to the abnormality degree. This approach can continue to occupy the system resources of the white-box switch, resulting in increased cost of the white-box switch. In another case, the monitoring of the hardware operating state is triggered by a fixed monitoring period. Too long a monitoring period can lead to reduced reliability, and too short a monitoring period can lead to lower efficiency of system fault monitoring.
Based on this, the embodiments of the present application provide a method, an apparatus, and a device for adjusting fault monitoring, by acquiring a fault monitoring result of hardware in a present white-box switch, when the fault monitoring result is that the hardware is abnormal, determining a first cycle parameter and a first priority parameter that are matched with the degree of abnormality, and adjusting a monitoring cycle of the present white-box switch by using the first cycle parameter, and adjusting a monitoring priority of the hardware by using the first priority parameter. Therefore, the monitoring period and the monitoring priority of the hardware can be adjusted according to the fault monitoring result of the hardware, the fault monitoring efficiency is improved on the basis of reliability, and the waste of system resources of the white box switch is reduced.
In order to facilitate understanding of the technical solution provided by the embodiments of the present application, an adjustment method for fault monitoring provided by the embodiments of the present application is described below with reference to the accompanying drawings. The fault monitoring adjustment method provided by the embodiment of the application is applied to the white box switch.
Referring to fig. 1, a flowchart of a fault monitoring adjustment method according to an embodiment of the present application is shown, and the method includes steps S101 to S104.
S101: and obtaining a fault monitoring result of hardware in the white-box switch.
The white box switch comprises a plurality of hardware, such as fans, power supplies, transceivers, sensors and the like, so as to monitor faults, and repair the faulty hardware in time. The embodiments of the present application do not limit the manner in which hardware is fault detected. In one possible implementation, the hardware may be determined whether it is malfunctioning by collecting output data of the hardware or querying an operating state of the hardware.
And obtaining a fault monitoring result of fault monitoring of hardware in the white-box switch. The fault monitoring result may include a hardware name for performing fault monitoring and a monitoring result corresponding to the hardware. Specifically, according to possible operation states of hardware, monitoring results can be classified into normal and fault-existing types. Further, the existence of faults may also be divided according to the severity of the faults that may occur to the hardware. For example, the presence of faults can also be categorized into warning levels, severity levels, and deadline levels. The warning level may refer to a fault with a lesser degree of fault, among other things. Such as faults that can be repaired automatically, faults that occur temporarily, and so forth. The severity level may be a fault with a slightly heavier degree of fault. For example, hardware fails to function properly and the like. The deadline level may be a fault with a greater degree of failure. Such as a failure that severely affects white-box switch operation. For alert level faults, only the log may be recorded. But the severity level and the deadline level can be recorded, and the alarm can be prompted to even report to the operation and maintenance personnel.
In one possible implementation, S101 and subsequent steps may be timed to ensure the validity of the fault monitoring adjustment.
S102: and determining a first period parameter and a first priority parameter matched with the fault monitoring result.
After the fault monitoring result is obtained, the matched first cycle parameter and first priority parameter may be determined according to the fault monitoring result. Wherein the first period parameter is a parameter for adjusting the monitoring period. The first priority parameter is a parameter for adjusting monitoring priority of specific hardware.
The first period parameter and the first priority parameter may be preset parameters. In one possible implementation manner, the first period parameter and the first priority parameter which are correspondingly matched may be set in advance for each type of fault detection result, and the matching relationship between the fault detection result and the first period parameter and the first priority parameter is stored in the storage unit of the white-box switch.
Specifically, the first period parameter may be set in such a manner that the higher the fault level is, the shorter the monitoring period adjusted based on the first period parameter is, and when no fault occurs, the more the monitoring period adjusted based on the first period parameter is or is longer.
Similarly, the first priority parameter may be set in such a manner that the higher the failure level, the higher the monitoring priority adjusted based on the first priority parameter, and the lower the monitoring priority adjusted based on the first priority parameter, without failure.
S103: and adjusting the monitoring period of the white box switch according to the first period parameter, wherein the monitoring period is a period for monitoring whether hardware in the white box switch has faults or not.
The monitoring period of the white-box switch is used for determining whether the hardware in the white-box switch is monitored at the moment of failure. The white-box switch triggers the monitoring of hardware faults according to the monitoring period.
The monitoring period of the present white-box switch may be a period in which the present white-box switch monitors. The current monitoring period of the white-box switch can be a monitoring period preset by a user or an operation and maintenance person and other management personnel, or can be an adjusted monitoring period.
The monitoring period of the white-box switch may also be the monitoring period of the white-box switch in a specific time. For example, the monitoring period of the white-box switch between 1 pm and 3 pm every day can be adjusted according to the fault monitoring result of the hardware during the period from 1 pm to 3 pm.
And adjusting the monitoring period of the white-box switch according to the determined first period parameter. The method for adjusting the monitoring period of the white-box switch can be related to the meaning of the first period parameter and the data form.
In one possible implementation, the first period parameter may be an adjustment step size. For example, the monitoring period of the white-box switch is 30 minutes, the fault monitoring result is that a fault exists, and the first period parameter matched with the fault monitoring result is-15 minutes. The monitoring period may be added to the first period parameter resulting in an adjusted monitoring period of 15 minutes. For another example, the monitoring period of the white-box switch is 30 minutes, the fault monitoring result is normal, and the first period parameter matched with the fault monitoring result is 15 minutes. The monitoring period may be added to the first period parameter resulting in an adjusted monitoring period of 45 minutes.
In another possible implementation, the first period parameter may also be a ratio. For example, the monitoring period of the white-box switch is 30 minutes, the fault monitoring result is that a fault exists, and the first period parameter matched with the fault monitoring result is 0.5. The monitoring period may be multiplied by the first period parameter resulting in an adjusted monitoring period of 15 minutes. For another example, the monitoring period of the white-box switch is 30 minutes, the fault monitoring result is normal, and the first period parameter matched with the fault monitoring result is 1.5. The monitoring period may be multiplied by the first period parameter resulting in an adjusted monitoring period of 45 minutes.
In addition, in order to prevent the monitoring period of the white-box switch from being excessively adjusted, an upper limit value and a lower limit value of the monitoring period may also be set in advance. When the adjusted monitoring period is higher than the upper limit value, the monitoring period may be determined as the upper limit value. When the adjusted monitoring period is lower than the lower limit value, the monitoring period may be determined as the lower limit value.
S104: and adjusting the monitoring priority of the hardware according to the first priority parameter, wherein the monitoring priority is used for determining a monitoring sequence for monitoring whether the hardware has faults or not.
Different hardware in the white-box switch affects the operation of the white-box switch to different degrees. When monitoring hardware, the more important hardware can be monitored first. To facilitate determining the order in which hardware is monitored, the priority of monitoring of the hardware that is required to be monitored may be predetermined. When monitoring is carried out, the sequence of monitoring hardware is determined according to the monitoring priority of the hardware.
And adjusting the monitoring priority of the hardware corresponding to the fault monitoring result according to the determined first priority parameter matched with the fault monitoring result. The method of adjusting the monitoring priority of the hardware may be related to the meaning of the first priority parameter and the data form.
In one possible implementation, the first priority parameter may be an adjustment step size. For example, the monitoring priority of the fan in the present white-box switch is 5 levels, wherein a smaller number of levels indicates a higher priority. The fault monitoring result is that a fault exists, and the first priority parameter matched with the fault monitoring result is-1. The monitoring priority of the fan may be added to the first priority parameter, and the resulting adjusted monitoring priority of the fan may be 4. For another example, the monitoring priority of the fan in the white box switch is 5. The fault monitoring result is normal, and the first priority parameter matched with the fault monitoring result is 1. The monitoring priority of the fan may be added to the first priority parameter, and the resulting adjusted monitoring priority of the fan may be 6.
In another possible implementation, the first priority parameter may also be a ratio. For example, the monitoring priority of the fan in the present white-box switch corresponds to a value of 5, where a smaller number of stages indicates a higher priority. The fault monitoring result is that a fault exists, and the first priority parameter matched with the fault monitoring result is 0.5. The monitoring priority of the fan may be multiplied by the first priority parameter to obtain an adjusted monitoring priority of 2.5. For another example, the monitoring priority of the fan in the present white-box switch is 5. The fault monitoring result is normal, and the first priority parameter matched with the fault monitoring result is 1.5. The monitoring priority of the fan may be multiplied by the first priority parameter resulting in an adjusted monitoring priority of 7.5.
In addition, similarly, in order to prevent the monitoring priority of the hardware in the white-box switch from being excessively adjusted, an upper limit value and a lower limit value of the monitoring priority may also be set in advance. When the adjusted monitoring priority is higher than the upper limit value, the monitoring priority may be determined as the upper limit value. When the adjusted monitoring priority is lower than the lower limit value, the monitoring priority may be determined as the lower limit value.
Based on the above-mentioned content related to S101-S104, it is known that by obtaining the fault monitoring result of the hardware in the present white-box switch, determining the first period parameter and the first priority parameter that are matched with the fault monitoring result, adjusting the monitoring period of the present white-box switch by using the first period parameter, and adjusting the monitoring priority of the hardware by using the first priority parameter. Therefore, the monitoring period and the monitoring priority of the hardware can be flexibly adjusted according to the fault monitoring result of the hardware, the fault monitoring efficiency is improved on the basis of more reliable fault monitoring, and the waste of system resources of the white box switch is reduced.
In addition, besides adjusting the fault monitoring process according to the fault monitoring result of the present white-box switch, the fault monitoring process of the present white-box switch may also be adjusted with reference to the monitoring parameters of other white-box switches.
In a possible implementation manner, the embodiment of the present application further provides a fault monitoring adjustment method, which includes the following five steps in addition to the above steps:
a1: and acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches.
The embodiment of the application does not limit the manner of acquiring the monitoring parameters of other white-box switches. For example, the other white-box switch may send its own monitoring parameters to the own white-box switch via the management network. In one possible implementation, the monitoring parameters may be transmitted using network messages. The white box switch may add the monitoring parameters in the broadcast transmitted LLDP (Link Layer Discovery Protocol ).
The monitoring parameters may include monitoring periods of other white-box switches and monitoring priorities of first hardware of the other white-box switches. Wherein the first hardware may be one or more of the hardware monitored by the other white-box switches.
A2: and determining a second period parameter according to the monitoring period in the monitoring parameters.
A second cycle parameter is determined based on the monitoring cycles of the other white-box switches. The second period parameter is used for adjusting the monitoring period of the white-box switch. The second period parameter is a parameter that matches a monitoring period among the monitoring parameters. The second period parameter may be a preset parameter. In one possible implementation manner, the second period parameter corresponding to the matching may be set in advance for the monitoring periods of the other white-box switches, and the matching relationship between the monitoring periods of the other white-box switches and the second period parameter may be stored in the storage unit of the white-box switch.
It should be noted that, in one possible implementation, the influence degree of the hardware fault condition of the white-box switch itself on the monitoring period should be greater than the influence degree of the monitoring parameters of other white-box switches on the monitoring period of the white-box switch. The second period parameter may be smaller than the first period parameter. For example, when the second cycle parameter and the first cycle parameter are both ratios, the second cycle parameter is closer to 1 than the first cycle parameter.
A3: and adjusting the monitoring period of the white box switch by using the second period parameter.
And adjusting the monitoring period of the white-box switch by using the determined second period parameter. The method for adjusting the monitoring period of the present white-box switch by using the second period parameter is similar to the method for adjusting the monitoring period of the present white-box switch by using the first period parameter, and will not be described herein.
A4: and determining a second priority parameter of the first hardware of the white-box switch according to the monitoring priority of the first hardware of the other white-box switches in the monitoring parameters.
The second priority parameter is determined based on the monitored priorities of the first hardware of the other white-box switches. The second priority parameter is used to adjust the monitoring priority of the first hardware of the present white-box switch. The second priority parameter may be a preset parameter. In one possible implementation, the second priority parameter corresponding to the match may be set in advance for the monitoring priority of the first hardware of the other white-box switch, and the matching relationship may be stored in the storage unit of the white-box switch.
It should be noted that, in one possible implementation, the degree of influence of the hardware failure condition of the white-box switch itself on the monitoring priority of the first hardware should be greater than the degree of influence of the monitoring priority of the first hardware of the other white-box switches on the monitoring period of the white-box switch. The second priority parameter may be less than the first priority parameter in adjusting the monitoring priority of the first hardware of the present white-box switch. For example, when the second priority parameter and the first priority parameter are both ratios, the second priority parameter is closer to 1 than the first priority parameter.
A5: and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter.
And adjusting the monitoring priority of the first hardware of the white-box switch by using the determined second priority parameter. The manner of adjusting the monitoring priority of the first hardware of the present white-box switch by using the second priority parameter is similar to the manner of adjusting the monitoring priority of the first hardware of the present white-box switch by using the first priority parameter, and is not described herein.
In the embodiment of the application, the fault monitoring of the white box switch can be adjusted according to the monitoring parameters of other white box switches. Therefore, when other white box switches fail, the white box switches can be automatically affected to adjust fault monitoring, and the efficiency of fault monitoring is further improved.
Further, in one possible implementation manner, the monitoring parameters further include model information of the other white-box switches, and after the obtaining the monitoring parameters of the other white-box switches, the method further includes:
and discarding monitoring parameters of the white box switch with different types from the white box switch according to the model information of the other white box switches.
The monitoring parameters of other white-box switches can also be provided with model information of the white-box switch for sending the monitoring parameters. The model information may include a model identification of the white-box switch.
According to the model information of the white box switch, whether the white box switch which sends the monitoring parameters and the white box switch belong to the same model can be determined. The monitoring parameters of the white box switches of the same model have reference significance. The monitoring parameters of the white-box switch with different models can be abandoned, and the monitoring data of the white-box switch with different models are not referred to.
In the embodiment of the application, the monitoring parameters of other white box switches with the same model have reference significance, and the fault monitoring of the white box switch is adjusted based on the monitoring parameters of the other white box switches with the same model, so that the fault monitoring efficiency of the white box switch can be further improved.
In another possible implementation manner, the monitoring parameters further include network addresses of the other white-box switches, and after the acquiring the monitoring parameters of the other white-box switches, the method further includes the following two steps:
b1: and determining a first position distance between the other white-box switches and the white-box switch according to the network addresses of the other white-box switches and the network address of the white-box switch.
The network address of the white-box switch may specifically refer to MAC (Media Access Control Address ) and/or IP (Internet Protocol Address, internet protocol address).
The network address of the white-box switch can reflect the physical location of the white-box switch at the data center. Based on the network addresses of the other white-box switches and the network address of the own white-box switch, a first location distance between the other white-box switches and the own white-box switch can be determined.
B2: and determining a first period weight and a first priority weight according to the first position distance.
The first period weight is used for adjusting the influence degree of the second period parameter on the monitoring period of the white-box switch. The first priority weight adjusts the influence degree of the second priority parameter on the monitoring priority of the first hardware of the white-box switch.
In general, a white box switch with a relatively short distance fails, and the probability of the failure of the white box switch is relatively high. The white box switch with a longer distance breaks down, so that the probability of affecting the white box switch to break down is smaller. Correspondingly, in one possible implementation, the first cycle weight and the first priority weight are higher when the first location distance is smaller. The first cycle weight and the first priority weight are lower when the first location distance is greater.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring period of the white-box switch by using the second period parameter, which comprises the following steps:
and adjusting the monitoring period of the white-box switch by using the second period parameter and the first period weight.
And adjusting the monitoring period of the white-box switch by using the first period weight and the second period parameter. Taking the second period parameter as a ratio as an example, multiplying the monitoring period of the white box switch by the second period parameter, and multiplying the obtained product by the first period weight to obtain the adjusted monitoring period of the white box switch.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter, which comprises the following steps:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the first priority weight.
And adjusting the monitoring priority of the first hardware of the white-box switch by using the first priority weight and the second priority parameter. Taking the second priority parameter as a ratio as an example, multiplying the monitoring priority of the first hardware of the white-box switch by the second priority parameter, and multiplying the obtained product by the first priority weight to obtain the adjusted monitoring priority of the first hardware of the white-box switch.
In the embodiment of the application, the monitoring period of the white box switch and the monitoring priority of the first hardware are further adjusted in consideration of the first position distance between the white box switches, so that the adjusted fault monitoring meets the requirement of fault monitoring.
In another possible implementation manner, the monitoring parameters further include a production serial number of the other white-box switch, and after the obtaining the monitoring parameters of the other white-box switch, the method further includes the following two steps:
c1: and determining the production batch gap between the other white box switches and the white box switch according to the production serial numbers of the other white box switches and the production serial numbers of the white box switch.
In one possible implementation, the white-box switch has different production lots. The monitoring parameters also include a production serial number of the white-box switch that sent the monitoring parameters. The production serial number is used to identify the production lot of the white-box switch. The performance of the white box switch with the nearer production lot is closer.
Based on the production sequence numbers of the other white-box switches and the production sequence numbers of the own white-box switch, the production lot gap between the other white-box switches and the own white-box switch can be determined.
C2: and determining a second cycle weight and a second priority weight according to the production lot gap.
The second period weight is used for adjusting the influence degree of the second period parameter on the monitoring period of the white-box switch. The second priority weight adjusts the influence degree of the second priority parameter on the monitoring priority of the first hardware of the white-box switch.
In one possible implementation, the second cycle weight and the second priority weight are higher when the production lot is closer. The second cycle weight and the second priority weight are lower when the production lot is farther.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring period of the white-box switch by using the second period parameter, which comprises the following steps:
and adjusting the monitoring period of the white box switch by using the second period parameter and the second period weight.
And adjusting the monitoring period of the white-box switch by using the second period weight and the second period parameter. Taking the second period parameter as a ratio as an example, multiplying the monitoring period of the white box switch by the second period parameter, and multiplying the obtained product by the second period weight to obtain the adjusted monitoring period of the white box switch.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter, which comprises the following steps:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the second priority weight.
And adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority weight and the second priority parameter. Taking the second priority parameter as a ratio as an example, multiplying the monitoring priority of the first hardware of the white-box switch by the second priority parameter, and multiplying the obtained product by the second priority weight to obtain the adjusted monitoring priority of the first hardware of the white-box switch.
In the embodiment of the application, the monitoring period of the white box switch and the monitoring priority of the first hardware are further adjusted in consideration of the proximity degree of the production batch of the white box switch, so that the adjusted fault monitoring meets the requirement of fault monitoring.
In a possible implementation manner, the embodiment of the present application further provides a fault monitoring adjustment method, which includes the following four steps in addition to the above steps:
D1: and acquiring hardware fault information through a server, wherein the hardware fault information comprises hardware information of second hardware, and the second hardware is the hardware with the fault.
The white-box switch records the hardware fault in a log after monitoring the hardware fault, and sends the log to a server. In particular, the server may be a log server for receiving and storing logs of the white-box switch.
The embodiment of the application does not limit the mode of triggering the white-box switch to acquire hardware fault information. In one possible implementation, a request may be initiated by a white-box switch to a server, which sends hardware failure information to the white-box switch in response to the white-box switch's request. In another possible implementation, the server may send the hardware failure information to the white-box switch at regular intervals.
The white-box switch can acquire hardware fault information from a server. The hardware failure information includes hardware information of the second hardware. The second hardware is the hardware that fails in the white-box switch. The second hardware may be the same as the first hardware, and the second hardware may also be different from the first hardware.
D2: and determining a third period parameter and a third priority parameter according to the hardware fault information.
Based on the hardware failure information, the frequency of failure of other white-box switches, the failed second hardware, and the frequency of failure of the second hardware may be determined. Specifically, the third cycle parameter may be determined according to a frequency of occurrence of a fault, and the third priority parameter for the second hardware may be determined according to a frequency of occurrence of a fault of the second hardware.
It should be noted that, in one possible implementation, the extent of the influence of the hardware fault condition of the white-box switch itself on the monitoring period should be greater than the extent of the influence of the hardware fault information on the monitoring period of the white-box switch. The adjustment amplitude of the third period parameter to the monitoring period of the present white-box switch may be smaller than the adjustment amplitude of the first period parameter to the monitoring period of the present white-box switch. Further, the influence degree of the monitoring parameters of other white-box switches on the monitoring period should be greater than that of the hardware fault information on the monitoring period of the white-box switch. The adjustment amplitude of the third period parameter to the monitoring period of the present white-box switch may also be smaller than the adjustment amplitude of the second period parameter to the monitoring period of the present white-box switch.
Similarly, the third priority parameter may adjust the monitoring priority of the hardware of the present white-box switch by a smaller amount than the first priority parameter. And, the adjustment amplitude of the third priority parameter to the monitoring priority of the hardware of the present white-box switch may also be smaller than the adjustment amplitude of the second priority parameter to the monitoring priority of the hardware of the present white-box switch.
D3: and adjusting the monitoring period of the white box switch by using the third period parameter.
And adjusting the monitoring period of the white box switch by using the determined third period parameter. The method for adjusting the monitoring period of the present white-box switch by using the third period parameter is similar to the method for adjusting the monitoring period of the present white-box switch by using the first period parameter, and will not be described herein.
D4: and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter.
And adjusting the monitoring priority of the first hardware of the white-box switch by utilizing the determined third priority parameter. The method for adjusting the monitoring priority of the first hardware of the present white-box switch by using the third priority parameter is similar to the method for adjusting the monitoring priority of the first hardware of the present white-box switch by using the first priority parameter, and will not be described herein.
In the embodiment of the application, the white-box switch acquires hardware fault information, and the fault monitoring process of the white-box switch is adjusted according to the hardware with faults, so that the adjusted fault monitoring meets the requirements of fault monitoring.
Further, in one possible implementation manner, the hardware exception information further includes a failure time when the second hardware fails, and the method further includes:
And determining a third period weight and a third priority weight according to the time of the fault.
The third period weight is used for adjusting the influence degree of the third period parameter on the monitoring period of the white-box switch. The third priority weight adjusts the influence degree of the third priority parameter on the monitoring priority of the second hardware of the white-box switch.
In one possible implementation, the third cycle weight and the third priority weight are higher when the time of failure is closer to the current time. The third cycle weight and the third priority weight are lower when the time when the failure occurs is farther from the current time.
The embodiment of the present application provides a specific implementation manner for adjusting the fault monitoring period of the present white-box switch by using the third period parameter, including:
and adjusting the monitoring period of the white box switch by using the third period parameter and the third period weight.
And adjusting the monitoring period of the white-box switch by using the third period weight and the third period parameter. Taking the third period parameter as a ratio as an example, multiplying the monitoring period of the white box switch by the third period parameter, and multiplying the obtained product by the third period weight to obtain the adjusted monitoring period of the white box switch.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter, which comprises the following steps:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the third priority weight.
And adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority weight and the third priority parameter. Taking the third priority parameter as a ratio as an example, multiplying the monitoring priority of the second hardware of the white-box switch by the third priority parameter, and multiplying the obtained product by the third priority weight to obtain the adjusted monitoring priority of the second hardware of the white-box switch.
In the embodiment of the application, considering the time when the second hardware fails, the monitoring period of the white-box switch and the monitoring priority of the first hardware are further adjusted, so that the adjusted failure monitoring meets the requirement of failure monitoring.
In another possible implementation manner, the hardware fault information further includes a network address of a target white-box switch, where the target white-box switch is a white-box switch that generates the hardware fault information, and the method further includes the following two steps:
E1: and determining a second position distance between the target white-box switch and the white-box switch according to the network address of the target white-box switch and the network address of the white-box switch.
The network address of the white-box switch may refer to MAC and/or IP in particular.
The network address of the white-box switch can reflect the physical location of the white-box switch at the data center. Based on the network address of the target white-box switch and the network address of the own white-box switch, a second location distance between the target white-box switch and the own white-box switch can be determined.
E2: and determining a fourth period weight and a fourth priority weight according to the second position distance.
The fourth option is used for adjusting the influence degree of the third period parameter on the monitoring period of the white-box switch. The fourth priority weight adjusts the influence degree of the third priority parameter on the monitoring priority of the second hardware of the white-box switch.
In general, a white box switch with a relatively short distance fails, and the probability of the failure of the white box switch is relatively high. The white box switch with a longer distance breaks down, so that the probability of affecting the white box switch to break down is smaller. Correspondingly, in one possible implementation, the fourth cycle weight and the fourth priority weight are higher when the second location distance is smaller. The fourth cycle weight and the first priority weight are lower when the second location distance is larger.
The embodiment of the application provides a specific implementation manner for adjusting the monitoring period of the white-box switch by using the third period parameter, which comprises the following steps:
and adjusting the monitoring period of the white box switch by using the third period parameter and the fourth period weight.
And adjusting the monitoring period of the white-box switch by using the fourth period weight and the third period parameter. Taking the third period parameter as a ratio as an example, multiplying the monitoring period of the white box switch by the third year period parameter, and multiplying the obtained product by the fourth period weight to obtain the adjusted monitoring period of the white box switch.
The embodiment of the present application provides a specific manner of adjusting the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter, including:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the fourth priority weight.
And adjusting the monitoring priority of the second hardware of the white-box switch by using the fourth priority weight and the third priority parameter. Taking the third priority parameter as a ratio as an example, multiplying the monitoring priority of the second hardware of the white-box switch by the third priority parameter, and multiplying the obtained product by the fourth priority weight to obtain the adjusted monitoring priority of the second hardware of the white-box switch.
In the embodiment of the application, the monitoring period of the white box switch and the monitoring priority of the first hardware are further adjusted in consideration of the first position distance between the white box switches, so that the adjusted fault monitoring meets the requirement of fault monitoring.
Based on the fault monitoring adjustment method provided by the above method embodiment, the present application embodiment further provides a fault monitoring adjustment device, and the fault monitoring adjustment device will be described below with reference to the accompanying drawings.
Referring to fig. 2, the structure of an adjusting device for fault monitoring according to an embodiment of the present application is shown. As shown in fig. 2, the fault monitoring adjustment device includes:
a first obtaining unit 201, configured to obtain a fault monitoring result of hardware in the present white-box switch;
a first determining unit 202, configured to determine a first cycle parameter and a first priority parameter that match the fault monitoring result;
a first adjusting unit 203, configured to adjust a monitoring period of the present white-box switch according to the first period parameter, where the monitoring period is a period for monitoring whether hardware in the present white-box switch has a fault;
and a second adjusting unit 204, configured to adjust a monitoring priority of the hardware according to the first priority parameter, where the monitoring priority is used to determine a monitoring sequence for monitoring whether the hardware is faulty.
In one possible implementation, the apparatus further includes:
the second acquisition unit is used for acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches;
the second determining unit is used for determining a second period parameter according to the monitoring period in the monitoring parameters;
the third adjusting unit is used for adjusting the monitoring period of the white box switch by utilizing the second period parameter;
a third determining unit, configured to determine a second priority parameter of the first hardware of the present white-box switch according to the monitoring priority of the first hardware of the other white-box switch in the monitoring parameters;
and the fourth adjusting unit is used for adjusting the monitoring priority of the first hardware of the white-box switch by utilizing the second priority parameter.
In a possible implementation manner, the monitoring parameter further includes model information of the other white-box switches, and the apparatus further includes:
and the deleting unit is used for discarding monitoring parameters of the white box switch with different types from the white box switch according to the type information of the other white box switches.
In a possible implementation manner, the monitoring parameter further includes a network address of the other white-box switch, and the apparatus further includes:
a fourth determining unit, configured to determine a first location distance between the other white-box switch and the own white-box switch according to the network address of the other white-box switch and the network address of the own white-box switch;
a fifth determining unit, configured to determine a first cycle weight and a first priority weight according to the first location distance;
the third adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the second period parameter and the first period weight;
the fourth adjusting unit is specifically configured to adjust the monitoring priority of the first hardware of the present white-box switch by using the second priority parameter and the first priority weight.
In a possible implementation manner, the monitoring parameter further includes a production serial number of the other white-box switch, and the apparatus further includes:
a sixth determining unit, configured to determine a production lot gap between the other white-box switch and the own white-box switch according to the production serial numbers of the other white-box switches and the production serial numbers of the own white-box switch;
A seventh determining unit configured to determine a second cycle weight and a second priority weight according to the production lot gap;
the third adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the second period parameter and the second period weight;
the fourth adjusting unit is specifically configured to adjust the monitoring priority of the first hardware of the present white-box switch by using the second priority parameter and the second priority weight.
In one possible implementation, the apparatus further includes:
acquiring hardware fault information through a server, wherein the hardware fault information comprises hardware information of second hardware, and the second hardware is the hardware with the fault;
an eighth determining unit, configured to determine a third cycle parameter and a third priority parameter according to the hardware failure information;
a fifth adjusting unit, configured to adjust a monitoring period of the present white box switch by using the third period parameter;
and a sixth adjusting unit, configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter.
In one possible implementation manner, the hardware exception information further includes a failure time when the second hardware fails, and the apparatus further includes:
A ninth determining unit, configured to determine a third cycle weight and a third priority weight according to the time when the fault occurs;
the fifth adjusting unit is specifically configured to adjust the monitoring period of the white box switch by using the third period parameter and the third period weight;
the sixth adjusting unit is specifically configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter and the third priority weight.
In one possible implementation manner, the hardware fault information further includes a network address of a target white-box switch, where the target white-box switch is a white-box switch that generates the hardware fault information, and the apparatus further includes:
a tenth determining unit, configured to determine a second location distance between the target white-box switch and the own white-box switch according to the network address of the target white-box switch and the network address of the own white-box switch;
an eleventh determining unit configured to determine a fourth period weight and a fourth priority weight according to the second location distance;
the fifth adjusting unit is specifically configured to adjust a monitoring period of the present white box switch by using the third period parameter and the fourth period weight;
The sixth adjusting unit is specifically configured to adjust the monitoring priority of the second hardware of the present white-box switch by using the third priority parameter and the fourth priority weight.
Based on the foregoing method embodiment for adjusting fault monitoring, the embodiment of the present application further provides an adjusting device for fault monitoring, including: a processor, memory, system bus;
the processor and the memory are connected through the system bus;
the memory is configured to store one or more programs, the one or more programs comprising instructions, which when executed by the processor, cause the processor to perform the method of any of the embodiments above.
Based on the foregoing method embodiment for adjusting fault monitoring, the present application provides a computer readable storage medium, where instructions are stored in the computer readable storage medium, and when the instructions are executed on a terminal device, the terminal device is caused to execute the method described in any one of the foregoing embodiments.
It should be noted that, in the present description, each embodiment is described in a progressive manner, and each embodiment is mainly described in a different manner from other embodiments, and identical and similar parts between the embodiments are all enough to refer to each other. For the system or device disclosed in the embodiments, since it corresponds to the method disclosed in the embodiments, the description is relatively simple, and the relevant points refer to the description of the method section.
It should be understood that in this application, "at least one" means one or more, and "a plurality" means two or more. "and/or" for describing the association relationship of the association object, the representation may have three relationships, for example, "a and/or B" may represent: only a, only B and both a and B are present, wherein a, B may be singular or plural. The character "/" generally indicates that the context-dependent object is an "or" relationship. "at least one of" or the like means any combination of these items, including any combination of single item(s) or plural items(s). For example, at least one (one) of a, b or c may represent: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", wherein a, b, c may be single or plural.
It is further noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. The software modules may be disposed in Random Access Memory (RAM), memory, read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A method for adjusting fault monitoring, wherein the method is applied to a white-box switch, and the method comprises:
obtaining a fault monitoring result of hardware in the white box switch;
Determining a first period parameter and a first priority parameter matched with the fault monitoring result;
according to the first period parameter, adjusting a monitoring period of the white-box switch, wherein the monitoring period is a period for monitoring whether hardware in the white-box switch has faults or not;
according to the first priority parameter, adjusting the monitoring priority of the hardware, wherein the monitoring priority is used for determining a monitoring sequence for monitoring whether the hardware has faults or not;
the method comprises the steps of respectively adjusting a monitoring period and a monitoring priority of a white box switch based on a first period parameter and a first priority parameter which are correspondingly matched with a fault level in a fault detection result;
and, the method further comprises: acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches; determining a second period parameter according to the monitoring period in the monitoring parameters; adjusting the monitoring period of the white box switch by using the second period parameter; determining a second priority parameter of the first hardware of the white-box switch according to the monitoring priority of the first hardware of the other white-box switches in the monitoring parameters; adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter;
And the method also determines corresponding cycle weight and priority weight according to the first position distance and the production batch difference between other white box switches and the white box switch, and adjusts the monitoring cycle and the monitoring priority of the white box switch based on the cycle weight and the priority weight respectively.
2. The method of claim 1, wherein the monitoring parameters further comprise model information of the other white-box switch, and wherein after the obtaining the monitoring parameters of the other white-box switch, the method further comprises:
and discarding monitoring parameters of the white box switch with different types from the white box switch according to the model information of the other white box switches.
3. The method of claim 1, wherein the monitoring parameters further comprise network addresses of the other white-box switches, and wherein after the obtaining the monitoring parameters of the other white-box switches, the method further comprises:
determining a first position distance between the other white-box switches and the white-box switch according to the network addresses of the other white-box switches and the network address of the white-box switch;
Determining a first periodic weight and a first priority weight according to the first position distance;
the adjusting the monitoring period of the white-box switch by using the second period parameter includes:
adjusting the monitoring period of the white box switch by using the second period parameter and the first period weight;
the adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter includes:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the first priority weight.
4. The method of claim 1, wherein the monitoring parameters further comprise production serial numbers of the other white-box switches, and wherein after the obtaining the monitoring parameters of the other white-box switches, the method further comprises:
determining a production batch gap between the other white box switches and the white box switch according to the production serial numbers of the other white box switches and the production serial numbers of the white box switch;
determining a second cycle weight and a second priority weight according to the production lot gap;
the adjusting the monitoring period of the white-box switch by using the second period parameter includes:
Adjusting the monitoring period of the white box switch by using the second period parameter and the second period weight;
the adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter includes:
and adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter and the second priority weight.
5. The method according to any one of claims 1-4, further comprising:
acquiring hardware fault information through a server, wherein the hardware fault information comprises hardware information of second hardware, and the second hardware is the hardware with the fault;
determining a third period parameter and a third priority parameter according to the hardware fault information;
adjusting the monitoring period of the white box switch by using the third period parameter;
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter.
6. The method of claim 5, wherein the hardware exception information further includes a failure time at which the second hardware failed, the method further comprising:
Determining a third period weight and a third priority weight according to the time of the fault;
the adjusting the fault monitoring period of the white-box switch by using the third period parameter includes:
adjusting the monitoring period of the white box switch by using the third period parameter and the third period weight;
the adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter includes:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the third priority weight.
7. The method of claim 5, wherein the hardware failure information further comprises a network address of a target white-box switch, the target white-box switch being a white-box switch that generated the hardware failure information, the method further comprising:
determining a second position distance between the target white-box switch and the own white-box switch according to the network address of the target white-box switch and the network address of the own white-box switch;
determining a fourth period weight and a fourth priority weight according to the second position distance;
The adjusting the monitoring period of the white-box switch by using the third period parameter includes:
adjusting the monitoring period of the white box switch by using the third period parameter and the fourth period weight;
the adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter includes:
and adjusting the monitoring priority of the second hardware of the white-box switch by using the third priority parameter and the fourth priority weight.
8. An adjustment device for fault monitoring, applied to a white-box switch, the device comprising:
the first acquisition unit is used for acquiring a fault monitoring result of hardware in the white-box switch;
a first determining unit, configured to determine a first cycle parameter and a first priority parameter that are matched with the fault monitoring result;
the first adjusting unit is used for adjusting the monitoring period of the white box switch according to the first period parameter, wherein the monitoring period is a period for monitoring whether hardware in the white box switch has faults or not;
the second adjusting unit is used for adjusting the monitoring priority of the hardware according to the first priority parameter, and the monitoring priority is used for determining a monitoring sequence for monitoring whether the hardware has faults or not;
The method comprises the steps of respectively adjusting a monitoring period and a monitoring priority of a white box switch based on a first period parameter and a first priority parameter which are correspondingly matched with a fault level in a fault detection result;
the device is also used for acquiring monitoring parameters of other white-box switches, wherein the monitoring parameters comprise monitoring periods of the other white-box switches and monitoring priorities of first hardware of the other white-box switches; determining a second period parameter according to the monitoring period in the monitoring parameters; adjusting the monitoring period of the white box switch by using the second period parameter; determining a second priority parameter of the first hardware of the white-box switch according to the monitoring priority of the first hardware of the other white-box switches in the monitoring parameters; adjusting the monitoring priority of the first hardware of the white-box switch by using the second priority parameter;
and the device also determines corresponding cycle weight and priority weight according to the first position distance and the production batch difference between other white box switches and the white box switch, and adjusts the monitoring cycle and the monitoring priority of the white box switch based on the cycle weight and the priority weight respectively.
9. An adjustment device for fault monitoring, comprising: a processor, memory, system bus;
the processor and the memory are connected through the system bus;
the memory is for storing one or more programs, the one or more programs comprising instructions, which when executed by the processor, cause the processor to perform the method of any of claims 1-7.
10. A computer readable storage medium, characterized in that the computer readable storage medium has stored therein instructions, which when run on a terminal device, cause the terminal device to perform the method of any of claims 1-7.
CN202110872532.4A 2021-07-30 2021-07-30 Fault monitoring adjustment method, device and equipment Active CN113746695B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110872532.4A CN113746695B (en) 2021-07-30 2021-07-30 Fault monitoring adjustment method, device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110872532.4A CN113746695B (en) 2021-07-30 2021-07-30 Fault monitoring adjustment method, device and equipment

Publications (2)

Publication Number Publication Date
CN113746695A CN113746695A (en) 2021-12-03
CN113746695B true CN113746695B (en) 2023-07-18

Family

ID=78729571

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110872532.4A Active CN113746695B (en) 2021-07-30 2021-07-30 Fault monitoring adjustment method, device and equipment

Country Status (1)

Country Link
CN (1) CN113746695B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110830289A (en) * 2019-10-21 2020-02-21 华中科技大学 Container abnormity monitoring method and monitoring system
CN112114952A (en) * 2020-09-24 2020-12-22 杭州鲁尔物联科技有限公司 Multi-level threshold triggering sensor and multi-level threshold triggering method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110830289A (en) * 2019-10-21 2020-02-21 华中科技大学 Container abnormity monitoring method and monitoring system
CN112114952A (en) * 2020-09-24 2020-12-22 杭州鲁尔物联科技有限公司 Multi-level threshold triggering sensor and multi-level threshold triggering method

Also Published As

Publication number Publication date
CN113746695A (en) 2021-12-03

Similar Documents

Publication Publication Date Title
CN108491305B (en) Method and system for detecting server fault
CN110224858B (en) Log-based alarm method and related device
US11146457B2 (en) Train network node and CANopen-based train network node monitoring method
CN110968061B (en) Equipment fault early warning method and device, storage medium and computer equipment
CN112636979B (en) Cluster alarm method and related device
CN105450472A (en) Method and device for automatically acquiring states of physical components of servers
CN106789445B (en) Status polling method and system for network equipment in broadcast television network
CN111611254A (en) Equipment energy consumption abnormity monitoring method and device, terminal equipment and storage medium
CN106199261A (en) Electrical equipment Degree of Ageing based on the Internet continues detecting system and detection method
CN110647417B (en) Energy internet abnormal data processing method, device and system
CN109412902B (en) Intelligent monitoring method, storage device, terminal and system of power dispatching data network system
CN108959025A (en) A kind of server alarm method, device and server
CN106385343B (en) Method and device for monitoring client under distributed system and distributed system
CN113487182B (en) Device health state evaluation method, device, computer device and medium
CN113746695B (en) Fault monitoring adjustment method, device and equipment
CN110737565A (en) data monitoring method, device, electronic equipment and storage medium
US10584681B2 (en) Automative system management
CN114793132A (en) Optical module detection method and device, electronic equipment and storage medium
WO2021174684A1 (en) Cutover information processing method, system and apparatus
CN115378841B (en) Method and device for detecting state of equipment accessing cloud platform, storage medium and terminal
CN114301815B (en) Method and device for processing broadcast storm
CN114296979A (en) Method and device for detecting abnormal state of Internet of things equipment
CN113938844A (en) Network connection monitoring method, system, computer device and storage medium
CN113300918A (en) Fault detection method of intelligent lamp pole, terminal device and storage medium
CN115934453A (en) Troubleshooting method, troubleshooting device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant