CN113590434B - Cluster alarm method, system, equipment and medium - Google Patents

Cluster alarm method, system, equipment and medium Download PDF

Info

Publication number
CN113590434B
CN113590434B CN202110682331.8A CN202110682331A CN113590434B CN 113590434 B CN113590434 B CN 113590434B CN 202110682331 A CN202110682331 A CN 202110682331A CN 113590434 B CN113590434 B CN 113590434B
Authority
CN
China
Prior art keywords
cluster
node
reporting
availability
nodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110682331.8A
Other languages
Chinese (zh)
Other versions
CN113590434A (en
Inventor
赵晓青
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Jinan data Technology Co ltd
Original Assignee
Inspur Jinan data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Jinan data Technology Co ltd filed Critical Inspur Jinan data Technology Co ltd
Priority to CN202110682331.8A priority Critical patent/CN113590434B/en
Publication of CN113590434A publication Critical patent/CN113590434A/en
Application granted granted Critical
Publication of CN113590434B publication Critical patent/CN113590434B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a cluster alarm method, which comprises the following steps: setting a plurality of high-availability nodes outside the cluster; carrying out preset reporting configuration on each high-availability node; responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address; and responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability nodes to report the alarm information. The invention also discloses a system, computer equipment and a readable storage medium. The scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.

Description

Cluster alarm method, system, equipment and medium
Technical Field
The present invention relates to the field of clusters, and in particular, to a cluster alarm method, system, device, and storage medium.
Background
With the rapid development of cloud computing and big data technology in the development of modern society, the accumulated production data in production and life also grow exponentially, and mass storage technology becomes an integral part in the development of the internet. In distributed storage systems, however, some critical information or faults often need to be alerted due to the need to monitor and manage the vast amount of data. However, in actual situations, the operation and maintenance personnel cannot be guaranteed to be always present, and in general, once an abnormality or an alarm occurs in a cluster, the operation and maintenance personnel are required to be timely notified in a manner of e-mail, short message, and the like. Therefore, the stability of alarm reporting is greatly required, and once a fault, a network reason and the like are met or a machine of a machine room cannot be connected with an external network, operation and maintenance personnel cannot acquire and process the corresponding alarm in time, so that a serious accident of a cluster is likely to occur, and immeasurable loss is caused.
Disclosure of Invention
In view of this, in order to overcome at least one aspect of the above-mentioned problems, an embodiment of the present invention provides a cluster alarm method, including the following steps:
setting a plurality of high-availability nodes outside the cluster;
carrying out preset reporting configuration on each high-availability node;
responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address;
and responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability nodes to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a cluster alarm system, including:
the setting module is configured to set a plurality of high-availability nodes outside the cluster;
the configuration module is configured to preset reporting configuration for each high-availability node;
the judging module is configured to respond to the detection that the nodes in the cluster generate alarm information and judge whether the nodes in the cluster can establish connection with the reporting address;
and the high availability module is configured to respond to the fact that the internal node cannot establish connection with the reporting address, establish connection with one of the high availability nodes, and call the high availability node to report the alarm information.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a computer apparatus, including:
at least one processor; and
a memory storing a computer program executable on the processor, wherein the processor performs the steps of any of the cluster alert methods described above when the program is executed.
Based on the same inventive concept, according to another aspect of the present invention, there is also provided a computer-readable storage medium storing a computer program which, when executed by a processor, performs the steps of any of the cluster alarm methods described above.
The invention has one of the following beneficial technical effects: the scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are necessary for the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention and that other embodiments may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic flow chart of a cluster alarm method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a cluster alarm system according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a computer device according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention will be described in further detail with reference to the accompanying drawings.
It should be noted that, in the embodiments of the present invention, all the expressions "first" and "second" are used to distinguish two entities with the same name but different entities or different parameters, and it is noted that the "first" and "second" are only used for convenience of expression, and should not be construed as limiting the embodiments of the present invention, and the following embodiments are not described one by one.
According to an aspect of the present invention, an embodiment of the present invention proposes a cluster alarm method, as shown in fig. 1, which may include the steps of:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
The scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
Specifically, a high availability node configuration module may be provided within the cluster. The module provides an entry for configuring the alarm reporting high availability node through which the user can fill in information of the external node to which the present cluster node can be connected and which can be connected to the external network. For example, the user can configure IP information of the high availability node through the portal, and then the nodes inside the cluster can realize connection with the set high availability node through the IP information.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
Specifically, after the user fills out relevant information of the high available node at the entry provided by the high available node configuration module, a tool package can be sent to the corresponding high available node, and the tool package is equivalent to providing a public alarm reporting interface and is used as a medium for sending alarm mail, alarm short messages and the like outwards by the cluster.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
Specifically, after receiving the information, such as IP information, of the high availability node filled by the user, it may first test whether the cluster node can access the high availability node and whether the high availability node can be connected to the report address, if the prompt test is successful, it may be added, and if the test fails, it needs to be refilled. After one success is filled, the information of other high-availability nodes can be continuously added, or the information of the high-availability nodes can be added at one time, then the test is carried out simultaneously, and the node information of the test failure is fed back.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
Specifically, when the cluster generates an alarm, the master node of the cluster can be used for calling an alarm reporting module in the cluster to report the alarm, before reporting, whether the current master node can be connected to a reporting address can be judged, and if the master node in the cluster can be connected with the reporting address, the master node is used for reporting the alarm information to the reporting address.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
Specifically, if the current master node cannot establish connection with the reporting address, the current master node can call other nodes in the cluster to report through an alarm reporting automatic repair logic. When other nodes in the cluster are called for reporting, whether the nodes capable of reporting normally exist or not can be checked and judged, namely whether the nodes capable of being connected to a reporting address exist or not, and if yes, the nodes are used for reporting the alarm.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
Specifically, if all nodes in the cluster cannot be connected with the reporting address to report the alarm information, the master node is required to establish connection with one of the high-availability nodes, the high-availability nodes are used for reporting the alarm information, and a pre-installed reporting module (i.e. a pre-sent tool kit) in the high-availability nodes is called to report the alarm information to a preset address through the reporting module.
The high availability mechanism for reporting the alarm can effectively avoid the situation of mail sending failure caused by the reasons of a machine room network, a local area network and the like. And moreover, a plurality of alarm reporting alternative mechanisms enable the cluster alarm system to be more powerful, and timely report alarms, so that operation and maintenance personnel can identify cluster health conditions in advance to repair problems, the overall stability of the storage system is enhanced, major accidents are avoided, and the maintenance cost is saved.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, there is further provided a cluster alarm system 400, as shown in fig. 2, including:
a setting module 401 configured to set a number of high availability nodes outside the cluster;
a configuration module 402, configured to perform preset reporting configuration on each of the high-availability nodes;
a judging module 403, configured to respond to detecting that the node inside the cluster generates alarm information, and judge whether the node inside can establish connection with the reporting address;
and the high availability module 404 is configured to respond that the internal node cannot establish connection with the reporting address, and establish connection with one of the high availability nodes so as to call the high availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 3, an embodiment of the present invention further provides a computer apparatus 501, including:
at least one processor 520; and
the memory 510, the memory 510 stores a computer program 511 executable on a processor, and the processor 520 executes the program to perform the steps of:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 4, an embodiment of the present invention further provides a computer-readable storage medium 601, the computer-readable storage medium 601 storing computer program instructions 610, the computer program instructions 610 being executed by a processor to:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Finally, it should be noted that, as will be appreciated by those skilled in the art, all or part of the procedures in implementing the methods of the embodiments described above may be implemented by a computer program for instructing relevant hardware, and the program may be stored in a computer readable storage medium, and the program may include the procedures of the embodiments of the methods described above when executed.
Further, it should be appreciated that the computer-readable storage medium (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that as used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The foregoing embodiment of the present invention has been disclosed with reference to the number of embodiments for the purpose of description only, and does not represent the advantages or disadvantages of the embodiments.
It will be appreciated by those of ordinary skill in the art that all or part of the steps of implementing the above embodiments may be implemented by hardware, or may be implemented by a program to instruct related hardware, and the program may be stored in a computer readable storage medium, where the storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will appreciate that: the above discussion of any embodiment is merely exemplary and is not intended to imply that the scope of the disclosure of embodiments of the invention, including the claims, is limited to such examples; combinations of features of the above embodiments or in different embodiments are also possible within the idea of an embodiment of the invention, and many other variations of the different aspects of the embodiments of the invention as described above exist, which are not provided in detail for the sake of brevity. Therefore, any omission, modification, equivalent replacement, improvement, etc. of the embodiments should be included in the protection scope of the embodiments of the present invention.

Claims (8)

1. The cluster alarm method is characterized by comprising the following steps of:
setting a plurality of high-availability nodes outside the cluster;
carrying out preset reporting configuration on each high-availability node;
responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address;
responding to the fact that the internal node cannot establish connection with a reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report alarm information;
setting a plurality of high available nodes outside the cluster, further comprising:
a preset interface is arranged inside the cluster;
filling in the information of each high-availability node through the preset interface;
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
2. The method of claim 1, wherein the preset reporting configuration is performed for each of the high availability nodes, further comprising:
and installing a reporting module for providing reporting functions on each high-availability node.
3. The method of claim 2, wherein in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further comprises:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
4. A method as recited in claim 3, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
5. The method of claim 4, wherein in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high availability nodes to invoke reporting of the alert information by the high availability node, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
6. A cluster alarm system, comprising:
the setting module is configured to set a plurality of high-availability nodes outside the cluster;
the configuration module is configured to preset reporting configuration for each high-availability node;
the judging module is configured to respond to the detection that the nodes in the cluster generate alarm information and judge whether the nodes in the cluster can establish connection with the reporting address;
the high availability module is configured to respond to the fact that the internal node cannot establish connection with a reporting address, establish connection with one of the high availability nodes, and call the high availability node to report alarm information;
the setup module is further configured to:
a preset interface is arranged inside the cluster;
filling in the information of each high-availability node through the preset interface;
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
7. A computer device, comprising:
at least one processor; and
a memory storing a computer program executable on the processor, wherein the processor performs the steps of the method of any one of claims 1-5 when the program is executed.
8. A computer readable storage medium storing a computer program, characterized in that the computer program when executed by a processor performs the steps of the method according to any one of claims 1-5.
CN202110682331.8A 2021-06-20 2021-06-20 Cluster alarm method, system, equipment and medium Active CN113590434B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110682331.8A CN113590434B (en) 2021-06-20 2021-06-20 Cluster alarm method, system, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110682331.8A CN113590434B (en) 2021-06-20 2021-06-20 Cluster alarm method, system, equipment and medium

Publications (2)

Publication Number Publication Date
CN113590434A CN113590434A (en) 2021-11-02
CN113590434B true CN113590434B (en) 2023-12-22

Family

ID=78244202

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110682331.8A Active CN113590434B (en) 2021-06-20 2021-06-20 Cluster alarm method, system, equipment and medium

Country Status (1)

Country Link
CN (1) CN113590434B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117155938B (en) * 2023-10-30 2024-01-12 北京腾达泰源科技有限公司 Cluster node fault reporting method, device, equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015085963A1 (en) * 2013-12-13 2015-06-18 腾讯科技(深圳)有限公司 Distributed system-based monitoring method, device, and system
CN108038043A (en) * 2017-12-22 2018-05-15 郑州云海信息技术有限公司 A kind of distributed storage cluster alarm method, system and equipment
CN109039733A (en) * 2018-07-26 2018-12-18 郑州云海信息技术有限公司 A kind of alarm method, system and electronic equipment and storage medium
CN109714222A (en) * 2017-10-26 2019-05-03 创盛视联数码科技(北京)有限公司 The distributed computer monitoring system and its monitoring method of High Availabitity
CN110516454A (en) * 2019-08-13 2019-11-29 苏州浪潮智能科技有限公司 Exchange method, system, device and the computer readable storage medium of more equipment
CN110535945A (en) * 2019-08-30 2019-12-03 苏州浪潮智能科技有限公司 Test method, device, equipment and the storage medium of storage cluster alarm function
WO2021051582A1 (en) * 2019-09-17 2021-03-25 平安科技(深圳)有限公司 Host performance monitoring method and apparatus for server cluster, device, and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015085963A1 (en) * 2013-12-13 2015-06-18 腾讯科技(深圳)有限公司 Distributed system-based monitoring method, device, and system
CN109714222A (en) * 2017-10-26 2019-05-03 创盛视联数码科技(北京)有限公司 The distributed computer monitoring system and its monitoring method of High Availabitity
CN108038043A (en) * 2017-12-22 2018-05-15 郑州云海信息技术有限公司 A kind of distributed storage cluster alarm method, system and equipment
CN109039733A (en) * 2018-07-26 2018-12-18 郑州云海信息技术有限公司 A kind of alarm method, system and electronic equipment and storage medium
CN110516454A (en) * 2019-08-13 2019-11-29 苏州浪潮智能科技有限公司 Exchange method, system, device and the computer readable storage medium of more equipment
CN110535945A (en) * 2019-08-30 2019-12-03 苏州浪潮智能科技有限公司 Test method, device, equipment and the storage medium of storage cluster alarm function
WO2021051582A1 (en) * 2019-09-17 2021-03-25 平安科技(深圳)有限公司 Host performance monitoring method and apparatus for server cluster, device, and storage medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Shaoguang Liu ; Jun Xie ; Zhicheng Zhao ; Yang Li ; Xin Yang.Extraction Method of Alarm Transaction Based on Morphology Similarity Clustering.IEEE.全文. *
基于Redis和RabbitMQ的GPON告警采集系统;姜秀芳;李鹏飞;胡晶;盛苗;;中国新通信(第17期);全文 *
基于SOA架构的异构系统集成平台的设计;马建辉;赖涛;;数字技术与应用(第02期);全文 *

Also Published As

Publication number Publication date
CN113590434A (en) 2021-11-02

Similar Documents

Publication Publication Date Title
CN110798375B (en) Monitoring method, system and terminal equipment for enhancing high availability of container cluster
US6038288A (en) System and method for maintenance arbitration at a switching node
CN108173911B (en) Micro-service fault detection processing method and device
CN103607297A (en) Fault processing method of computer cluster system
CN108710673A (en) Realize database high availability method, system, computer equipment and storage medium
CN113590434B (en) Cluster alarm method, system, equipment and medium
CN113726556B (en) Edge internet of things proxy node operation and maintenance method, system, storage medium and computing equipment
CN105071968A (en) Method and device for repairing hidden failures of service plane and control plane of communication device
CN114490565A (en) Database fault processing method and device
US8582444B2 (en) Method for detecting hardware faults by determining a ratio of released connections
CN111190761B (en) Log output method and device, storage medium and electronic equipment
CN115842860A (en) Monitoring method, device and system for data link
CN115712521A (en) Cluster node fault processing method, system and medium
CN111786806B (en) Network element exception handling method and network management system
CN113808725A (en) Equipment early warning system and method
CN107528730A (en) Multiple redundancy method, multiple redundancy server and system
CN112181780A (en) Detection and alarm method, device and equipment for containerized platform core component
CN115633197B (en) Service data distribution system, method and device, electronic equipment and medium
CN115174356B (en) Cluster alarm reporting method, device, equipment and medium
US20220128966A1 (en) Context-Sensitive Technical Audit Trail of A Technical System
CN110569056B (en) Rule service information updating method and device
CN113535506B (en) Monitoring method and device of service system, storage medium and computer equipment
CN109617761B (en) Method and device for switching main server and standby server
US20220353200A1 (en) Monitoring a Communication System That is Used for Control and/or Surveillance of an Industrial Process
CN108400894B (en) Server cluster network fault positioning method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant