CN113590434B - Cluster alarm method, system, equipment and medium - Google Patents
Cluster alarm method, system, equipment and medium Download PDFInfo
- Publication number
- CN113590434B CN113590434B CN202110682331.8A CN202110682331A CN113590434B CN 113590434 B CN113590434 B CN 113590434B CN 202110682331 A CN202110682331 A CN 202110682331A CN 113590434 B CN113590434 B CN 113590434B
- Authority
- CN
- China
- Prior art keywords
- cluster
- node
- reporting
- availability
- nodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000001514 detection method Methods 0.000 claims abstract description 8
- 238000012360 testing method Methods 0.000 claims description 19
- 230000004044 response Effects 0.000 claims description 12
- 238000004590 computer program Methods 0.000 claims description 9
- 230000006870 function Effects 0.000 claims description 7
- 230000008439 repair process Effects 0.000 description 6
- 238000012423 maintenance Methods 0.000 description 5
- 238000011161 development Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3089—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
- G06F11/3093—Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention discloses a cluster alarm method, which comprises the following steps: setting a plurality of high-availability nodes outside the cluster; carrying out preset reporting configuration on each high-availability node; responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address; and responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability nodes to report the alarm information. The invention also discloses a system, computer equipment and a readable storage medium. The scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.
Description
Technical Field
The present invention relates to the field of clusters, and in particular, to a cluster alarm method, system, device, and storage medium.
Background
With the rapid development of cloud computing and big data technology in the development of modern society, the accumulated production data in production and life also grow exponentially, and mass storage technology becomes an integral part in the development of the internet. In distributed storage systems, however, some critical information or faults often need to be alerted due to the need to monitor and manage the vast amount of data. However, in actual situations, the operation and maintenance personnel cannot be guaranteed to be always present, and in general, once an abnormality or an alarm occurs in a cluster, the operation and maintenance personnel are required to be timely notified in a manner of e-mail, short message, and the like. Therefore, the stability of alarm reporting is greatly required, and once a fault, a network reason and the like are met or a machine of a machine room cannot be connected with an external network, operation and maintenance personnel cannot acquire and process the corresponding alarm in time, so that a serious accident of a cluster is likely to occur, and immeasurable loss is caused.
Disclosure of Invention
In view of this, in order to overcome at least one aspect of the above-mentioned problems, an embodiment of the present invention provides a cluster alarm method, including the following steps:
setting a plurality of high-availability nodes outside the cluster;
carrying out preset reporting configuration on each high-availability node;
responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address;
and responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability nodes to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a cluster alarm system, including:
the setting module is configured to set a plurality of high-availability nodes outside the cluster;
the configuration module is configured to preset reporting configuration for each high-availability node;
the judging module is configured to respond to the detection that the nodes in the cluster generate alarm information and judge whether the nodes in the cluster can establish connection with the reporting address;
and the high availability module is configured to respond to the fact that the internal node cannot establish connection with the reporting address, establish connection with one of the high availability nodes, and call the high availability node to report the alarm information.
Based on the same inventive concept, according to another aspect of the present invention, an embodiment of the present invention further provides a computer apparatus, including:
at least one processor; and
a memory storing a computer program executable on the processor, wherein the processor performs the steps of any of the cluster alert methods described above when the program is executed.
Based on the same inventive concept, according to another aspect of the present invention, there is also provided a computer-readable storage medium storing a computer program which, when executed by a processor, performs the steps of any of the cluster alarm methods described above.
The invention has one of the following beneficial technical effects: the scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are necessary for the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention and that other embodiments may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic flow chart of a cluster alarm method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a cluster alarm system according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a computer device according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention will be described in further detail with reference to the accompanying drawings.
It should be noted that, in the embodiments of the present invention, all the expressions "first" and "second" are used to distinguish two entities with the same name but different entities or different parameters, and it is noted that the "first" and "second" are only used for convenience of expression, and should not be construed as limiting the embodiments of the present invention, and the following embodiments are not described one by one.
According to an aspect of the present invention, an embodiment of the present invention proposes a cluster alarm method, as shown in fig. 1, which may include the steps of:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
The scheme provided by the invention is to pre-determine a plurality of nodes outside the cluster as high available nodes, when an alarm is generated inside the cluster, the judgment on whether the node can report the alarm is increased, if the current node cannot report, the node can report the alarm through the pre-determined high available nodes, so that the stability of reporting the alarm of the cluster is ensured.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
Specifically, a high availability node configuration module may be provided within the cluster. The module provides an entry for configuring the alarm reporting high availability node through which the user can fill in information of the external node to which the present cluster node can be connected and which can be connected to the external network. For example, the user can configure IP information of the high availability node through the portal, and then the nodes inside the cluster can realize connection with the set high availability node through the IP information.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
Specifically, after the user fills out relevant information of the high available node at the entry provided by the high available node configuration module, a tool package can be sent to the corresponding high available node, and the tool package is equivalent to providing a public alarm reporting interface and is used as a medium for sending alarm mail, alarm short messages and the like outwards by the cluster.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
Specifically, after receiving the information, such as IP information, of the high availability node filled by the user, it may first test whether the cluster node can access the high availability node and whether the high availability node can be connected to the report address, if the prompt test is successful, it may be added, and if the test fails, it needs to be refilled. After one success is filled, the information of other high-availability nodes can be continuously added, or the information of the high-availability nodes can be added at one time, then the test is carried out simultaneously, and the node information of the test failure is fed back.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
Specifically, when the cluster generates an alarm, the master node of the cluster can be used for calling an alarm reporting module in the cluster to report the alarm, before reporting, whether the current master node can be connected to a reporting address can be judged, and if the master node in the cluster can be connected with the reporting address, the master node is used for reporting the alarm information to the reporting address.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
Specifically, if the current master node cannot establish connection with the reporting address, the current master node can call other nodes in the cluster to report through an alarm reporting automatic repair logic. When other nodes in the cluster are called for reporting, whether the nodes capable of reporting normally exist or not can be checked and judged, namely whether the nodes capable of being connected to a reporting address exist or not, and if yes, the nodes are used for reporting the alarm.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
Specifically, if all nodes in the cluster cannot be connected with the reporting address to report the alarm information, the master node is required to establish connection with one of the high-availability nodes, the high-availability nodes are used for reporting the alarm information, and a pre-installed reporting module (i.e. a pre-sent tool kit) in the high-availability nodes is called to report the alarm information to a preset address through the reporting module.
The high availability mechanism for reporting the alarm can effectively avoid the situation of mail sending failure caused by the reasons of a machine room network, a local area network and the like. And moreover, a plurality of alarm reporting alternative mechanisms enable the cluster alarm system to be more powerful, and timely report alarms, so that operation and maintenance personnel can identify cluster health conditions in advance to repair problems, the overall stability of the storage system is enhanced, major accidents are avoided, and the maintenance cost is saved.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, there is further provided a cluster alarm system 400, as shown in fig. 2, including:
a setting module 401 configured to set a number of high availability nodes outside the cluster;
a configuration module 402, configured to perform preset reporting configuration on each of the high-availability nodes;
a judging module 403, configured to respond to detecting that the node inside the cluster generates alarm information, and judge whether the node inside can establish connection with the reporting address;
and the high availability module 404 is configured to respond that the internal node cannot establish connection with the reporting address, and establish connection with one of the high availability nodes so as to call the high availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 3, an embodiment of the present invention further provides a computer apparatus 501, including:
at least one processor 520; and
the memory 510, the memory 510 stores a computer program 511 executable on a processor, and the processor 520 executes the program to perform the steps of:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Based on the same inventive concept, according to another aspect of the present invention, as shown in fig. 4, an embodiment of the present invention further provides a computer-readable storage medium 601, the computer-readable storage medium 601 storing computer program instructions 610, the computer program instructions 610 being executed by a processor to:
s1, arranging a plurality of high-availability nodes outside a cluster;
s2, carrying out preset reporting configuration on each high-availability node;
s3, responding to detection of alarm information generated by nodes in the cluster, and judging whether the nodes in the cluster can be connected with the reporting address;
and S4, responding to the fact that the internal node cannot establish connection with the reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report the alarm information.
In some embodiments, the setting of the plurality of high availability nodes outside the cluster further comprises:
a preset interface is arranged inside the cluster;
and filling in the information of each high available node through the preset interface.
In some embodiments, further comprising:
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
In some embodiments, the preset reporting configuration is performed on each of the high available nodes, and the method further includes:
and installing a reporting module for providing reporting functions on each high-availability node.
In some embodiments, in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further includes:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
In some embodiments, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
In some embodiments, in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high-availability nodes to invoke the high-availability node to report the alarm information, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
The proposal of the invention pre-configures some nodes outside the cluster as high availability nodes which can establish connection with the nodes inside the cluster and can also be connected to an external network. When the cluster reports the alarm, the judgment of whether the node can report the alarm is increased, if the current node cannot report the alarm, the alarm reporting automatic repair logic is called, the alarm reporting is firstly carried out in sequence through other nodes of the cluster, and if the reporting is successful, the subsequent node does not report any more. If the whole cluster cannot report the alarm, such as in an intranet, mail cannot be sent to an external network, etc., the alarm reporting tool kit is called through a high-availability node configured in advance to report the alarm, so that the stability of the alarm reporting of the storage cluster is ensured.
Finally, it should be noted that, as will be appreciated by those skilled in the art, all or part of the procedures in implementing the methods of the embodiments described above may be implemented by a computer program for instructing relevant hardware, and the program may be stored in a computer readable storage medium, and the program may include the procedures of the embodiments of the methods described above when executed.
Further, it should be appreciated that the computer-readable storage medium (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that as used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The foregoing embodiment of the present invention has been disclosed with reference to the number of embodiments for the purpose of description only, and does not represent the advantages or disadvantages of the embodiments.
It will be appreciated by those of ordinary skill in the art that all or part of the steps of implementing the above embodiments may be implemented by hardware, or may be implemented by a program to instruct related hardware, and the program may be stored in a computer readable storage medium, where the storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will appreciate that: the above discussion of any embodiment is merely exemplary and is not intended to imply that the scope of the disclosure of embodiments of the invention, including the claims, is limited to such examples; combinations of features of the above embodiments or in different embodiments are also possible within the idea of an embodiment of the invention, and many other variations of the different aspects of the embodiments of the invention as described above exist, which are not provided in detail for the sake of brevity. Therefore, any omission, modification, equivalent replacement, improvement, etc. of the embodiments should be included in the protection scope of the embodiments of the present invention.
Claims (8)
1. The cluster alarm method is characterized by comprising the following steps of:
setting a plurality of high-availability nodes outside the cluster;
carrying out preset reporting configuration on each high-availability node;
responding to the detection that the nodes in the cluster generate alarm information, and judging whether the nodes in the cluster can be connected with the reporting address;
responding to the fact that the internal node cannot establish connection with a reporting address, and establishing connection with one of the high-availability nodes so as to call the high-availability node to report alarm information;
setting a plurality of high available nodes outside the cluster, further comprising:
a preset interface is arranged inside the cluster;
filling in the information of each high-availability node through the preset interface;
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
2. The method of claim 1, wherein the preset reporting configuration is performed for each of the high availability nodes, further comprising:
and installing a reporting module for providing reporting functions on each high-availability node.
3. The method of claim 2, wherein in response to detecting that a node within a cluster generates alarm information, determining whether the node within the cluster is capable of establishing a connection with a reporting address further comprises:
judging whether a master node in the cluster can establish connection with the reporting address;
and responding to the fact that a master node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the master node.
4. A method as recited in claim 3, further comprising:
responding to the fact that a master node in the cluster cannot establish connection with the reporting address, and judging whether a slave node in the cluster can establish connection with the reporting address;
and responding to the fact that the slave node in the cluster can establish connection with the reporting address, and reporting the alarm information to the reporting address by using the slave node.
5. The method of claim 4, wherein in response to the internal node failing to establish a connection with a reporting address, establishing a connection with one of the high availability nodes to invoke reporting of the alert information by the high availability node, further comprising:
responding to the fact that a slave node in the cluster cannot establish connection with the reporting address, and establishing connection with one of high-availability nodes by using the master node;
and calling a pre-installed reporting module in the high-availability node to report the alarm information to a preset address through the reporting module.
6. A cluster alarm system, comprising:
the setting module is configured to set a plurality of high-availability nodes outside the cluster;
the configuration module is configured to preset reporting configuration for each high-availability node;
the judging module is configured to respond to the detection that the nodes in the cluster generate alarm information and judge whether the nodes in the cluster can establish connection with the reporting address;
the high availability module is configured to respond to the fact that the internal node cannot establish connection with a reporting address, establish connection with one of the high availability nodes, and call the high availability node to report alarm information;
the setup module is further configured to:
a preset interface is arranged inside the cluster;
filling in the information of each high-availability node through the preset interface;
and testing whether the high-availability nodes can be connected to the reported address or not and testing whether the high-availability nodes can be connected to the internal nodes or not according to the information of each high-availability node filled in the preset interface.
7. A computer device, comprising:
at least one processor; and
a memory storing a computer program executable on the processor, wherein the processor performs the steps of the method of any one of claims 1-5 when the program is executed.
8. A computer readable storage medium storing a computer program, characterized in that the computer program when executed by a processor performs the steps of the method according to any one of claims 1-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110682331.8A CN113590434B (en) | 2021-06-20 | 2021-06-20 | Cluster alarm method, system, equipment and medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110682331.8A CN113590434B (en) | 2021-06-20 | 2021-06-20 | Cluster alarm method, system, equipment and medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113590434A CN113590434A (en) | 2021-11-02 |
CN113590434B true CN113590434B (en) | 2023-12-22 |
Family
ID=78244202
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110682331.8A Active CN113590434B (en) | 2021-06-20 | 2021-06-20 | Cluster alarm method, system, equipment and medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113590434B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117155938B (en) * | 2023-10-30 | 2024-01-12 | 北京腾达泰源科技有限公司 | Cluster node fault reporting method, device, equipment and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015085963A1 (en) * | 2013-12-13 | 2015-06-18 | 腾讯科技(深圳)有限公司 | Distributed system-based monitoring method, device, and system |
CN108038043A (en) * | 2017-12-22 | 2018-05-15 | 郑州云海信息技术有限公司 | A kind of distributed storage cluster alarm method, system and equipment |
CN109039733A (en) * | 2018-07-26 | 2018-12-18 | 郑州云海信息技术有限公司 | A kind of alarm method, system and electronic equipment and storage medium |
CN109714222A (en) * | 2017-10-26 | 2019-05-03 | 创盛视联数码科技(北京)有限公司 | The distributed computer monitoring system and its monitoring method of High Availabitity |
CN110516454A (en) * | 2019-08-13 | 2019-11-29 | 苏州浪潮智能科技有限公司 | Exchange method, system, device and the computer readable storage medium of more equipment |
CN110535945A (en) * | 2019-08-30 | 2019-12-03 | 苏州浪潮智能科技有限公司 | Test method, device, equipment and the storage medium of storage cluster alarm function |
WO2021051582A1 (en) * | 2019-09-17 | 2021-03-25 | 平安科技(深圳)有限公司 | Host performance monitoring method and apparatus for server cluster, device, and storage medium |
-
2021
- 2021-06-20 CN CN202110682331.8A patent/CN113590434B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015085963A1 (en) * | 2013-12-13 | 2015-06-18 | 腾讯科技(深圳)有限公司 | Distributed system-based monitoring method, device, and system |
CN109714222A (en) * | 2017-10-26 | 2019-05-03 | 创盛视联数码科技(北京)有限公司 | The distributed computer monitoring system and its monitoring method of High Availabitity |
CN108038043A (en) * | 2017-12-22 | 2018-05-15 | 郑州云海信息技术有限公司 | A kind of distributed storage cluster alarm method, system and equipment |
CN109039733A (en) * | 2018-07-26 | 2018-12-18 | 郑州云海信息技术有限公司 | A kind of alarm method, system and electronic equipment and storage medium |
CN110516454A (en) * | 2019-08-13 | 2019-11-29 | 苏州浪潮智能科技有限公司 | Exchange method, system, device and the computer readable storage medium of more equipment |
CN110535945A (en) * | 2019-08-30 | 2019-12-03 | 苏州浪潮智能科技有限公司 | Test method, device, equipment and the storage medium of storage cluster alarm function |
WO2021051582A1 (en) * | 2019-09-17 | 2021-03-25 | 平安科技(深圳)有限公司 | Host performance monitoring method and apparatus for server cluster, device, and storage medium |
Non-Patent Citations (3)
Title |
---|
Shaoguang Liu ; Jun Xie ; Zhicheng Zhao ; Yang Li ; Xin Yang.Extraction Method of Alarm Transaction Based on Morphology Similarity Clustering.IEEE.全文. * |
基于Redis和RabbitMQ的GPON告警采集系统;姜秀芳;李鹏飞;胡晶;盛苗;;中国新通信(第17期);全文 * |
基于SOA架构的异构系统集成平台的设计;马建辉;赖涛;;数字技术与应用(第02期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN113590434A (en) | 2021-11-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110798375B (en) | Monitoring method, system and terminal equipment for enhancing high availability of container cluster | |
US6038288A (en) | System and method for maintenance arbitration at a switching node | |
CN108173911B (en) | Micro-service fault detection processing method and device | |
CN103607297A (en) | Fault processing method of computer cluster system | |
CN108710673A (en) | Realize database high availability method, system, computer equipment and storage medium | |
CN113590434B (en) | Cluster alarm method, system, equipment and medium | |
CN113726556B (en) | Edge internet of things proxy node operation and maintenance method, system, storage medium and computing equipment | |
CN105071968A (en) | Method and device for repairing hidden failures of service plane and control plane of communication device | |
CN114490565A (en) | Database fault processing method and device | |
US8582444B2 (en) | Method for detecting hardware faults by determining a ratio of released connections | |
CN111190761B (en) | Log output method and device, storage medium and electronic equipment | |
CN115842860A (en) | Monitoring method, device and system for data link | |
CN115712521A (en) | Cluster node fault processing method, system and medium | |
CN111786806B (en) | Network element exception handling method and network management system | |
CN113808725A (en) | Equipment early warning system and method | |
CN107528730A (en) | Multiple redundancy method, multiple redundancy server and system | |
CN112181780A (en) | Detection and alarm method, device and equipment for containerized platform core component | |
CN115633197B (en) | Service data distribution system, method and device, electronic equipment and medium | |
CN115174356B (en) | Cluster alarm reporting method, device, equipment and medium | |
US20220128966A1 (en) | Context-Sensitive Technical Audit Trail of A Technical System | |
CN110569056B (en) | Rule service information updating method and device | |
CN113535506B (en) | Monitoring method and device of service system, storage medium and computer equipment | |
CN109617761B (en) | Method and device for switching main server and standby server | |
US20220353200A1 (en) | Monitoring a Communication System That is Used for Control and/or Surveillance of an Industrial Process | |
CN108400894B (en) | Server cluster network fault positioning method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |