CN113132160A - Method and system for detecting network sub-health state of client node - Google Patents

Info

Publication number
CN113132160A
Authority
CN
China
Prior art keywords
client node
message
client
node
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110375922.0A
Other languages
Chinese (zh)
Other versions
CN113132160B (en
Inventor
张烨
贺计文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Yingxin Computer Technology Co Ltd
Original Assignee
Shandong Yingxin Computer Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Yingxin Computer Technology Co Ltd
Priority to CN202110375922.0A
Publication of CN113132160A
Application granted
Publication of CN113132160B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/02 Capturing of monitoring data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1097 Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method and a system for detecting the network sub-health state of a client node. The method comprises the following steps: the client node sends a state detection message to any server node, the state detection message causing the server node to feed back a reply message; the client node judges from the reply message whether the client node itself is in a network sub-health state; if so, the client node sends a retest message to all server nodes; each server node judges from the retest message whether the client node is in a network sub-health state; and if at least a preset number of the server nodes judge that the client node is in a network sub-health state, the sub-health alarm information of the client node is reported. The technical scheme addresses the problem that prior-art methods for detecting the network sub-health state are inaccurate and prone to false alarms.

Description

Method and system for detecting network sub-health state of client node
Technical Field
The invention relates to the technical field of storage clusters, in particular to a method and a system for detecting network sub-health states of client nodes.
Background
In general, a state in which a network operates normally and recovers quickly after an external impact is referred to as a "healthy" state, and a state in which the network is paralyzed and cannot operate normally is referred to as an "unhealthy" state. The networks of many large and medium-sized enterprises, however, are in a "sub-health" state: they operate normally under ordinary conditions but have very low resistance to risk, are easily paralyzed by a sudden network fault, and then take a long time to recover.
When the network of a storage cluster is in a sub-health state, the service capacity of the storage cluster can drop rapidly. When a client network interface (i.e., a client node) in a storage cluster has a single-digit packet loss rate or a delay of more than 100 ms, the storage service performance of the cluster is reduced by more than 70%, which seriously affects the overall performance of the storage cluster.
Most existing methods for detecting the network sub-health state are based on a client-server (CS) architecture: a single server detects the network state of a client node in order to determine whether the storage cluster is in a network sub-health state. This approach has a high false alarm probability. Once the server itself has a problem, the detection result for the client is affected, the client concludes that it is faulty, and a false alarm occurs.
Disclosure of Invention
The invention provides a method and a system for detecting the network sub-health state of a client node, and aims to solve the problem that prior-art detection methods have a high false alarm probability: once the server has a problem, the detection result for the client is affected and a false alarm occurs.
To achieve the above object, according to a first aspect of the present invention, there is provided a method for detecting a network sub-health state of a client node, including:
the client node sends a state detection message to any server node, the state detection message causing the server node to feed back a reply message;
the client node judges from the reply message whether the client node itself is in a network sub-health state;
if the client node judges that it is in a network sub-health state, the client node sends a retest message to all server nodes;
each server node judges from the retest message whether the client node is in a network sub-health state;
and if at least a preset number of the server nodes judge that the client node is in a network sub-health state, the server nodes report the sub-health alarm information of the client node.
Preferably, the step in which the client node judges from the reply message whether the client node itself is in a network sub-health state includes:
the client node calculates its packet loss rate from the number of data packets fed back in the reply message;
the client node calculates its time delay from the time tag fed back in the reply message;
the client node judges whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold;
and if the packet loss rate is greater than or equal to the preset packet loss rate threshold, or the time delay is greater than or equal to the preset time delay threshold, the client node judges that it is in a network sub-health state.
Preferably, the step in which all server nodes judge from the retest message whether the client node is in a network sub-health state includes:
each server node calculates the packet loss rate and the time delay of the client node from the time tags and the number of data packets contained in the retest message;
and each server node judges whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold.
Preferably, after the step in which the client node judges that it is in a network sub-health state, the method further includes:
the client node sends a retest message to all other client nodes;
each of the other client nodes judges from the retest message whether the client node is in a network sub-health state;
and if at least a preset number of the other client nodes judge that the client node is in a network sub-health state, sub-health alarm information is sent to the client node.
Preferably, after the step of reporting the sub-health alarm information of the client node, the method further includes:
each server node that judged the client node to be in a network sub-health state sends the sub-health alarm information to the client node;
and when the number of sub-health alarm messages received by the client node is greater than or equal to a preset alarm number threshold, the client node closes its client network port.
According to a second aspect of the present invention, there is also provided a system for detecting a network sub-health state of a client node, comprising at least one client node and a plurality of server nodes, wherein,
the client node is configured to send a state detection message to any server node, the state detection message causing the server node to feed back a reply message;
the client node is further configured to judge from the reply message whether the client node itself is in a network sub-health state;
the client node is further configured to send a retest message to all server nodes if it judges that it is in a network sub-health state;
each server node is configured to judge from the retest message whether the client node is in a network sub-health state;
and the server nodes are further configured to report the sub-health alarm information of the client node if at least a preset number of the server nodes judge that the client node is in a network sub-health state.
Preferably, in the detection system, the client node is specifically configured to calculate its packet loss rate from the number of data packets fed back in the reply message;
the client node is further specifically configured to calculate its time delay from the time tag fed back in the reply message;
the client node is further specifically configured to judge whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold;
the client node is further specifically configured to judge that it is in a network sub-health state if the packet loss rate is greater than or equal to the preset packet loss rate threshold or the time delay is greater than or equal to the preset time delay threshold.
Preferably, in the detection system, each server node is specifically configured to calculate the packet loss rate and the time delay of the client node from the time tags and the number of data packets contained in the retest message;
each server node is further specifically configured to judge whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold.
Preferably, in the detection system, the client node is further specifically configured to send a retest message to all other client nodes;
each of the other client nodes is specifically configured to judge from the retest message whether the client node is in a network sub-health state;
the other client nodes are further specifically configured to send sub-health alarm information to the client node if at least a preset number of the other client nodes judge that the client node is in a network sub-health state.
Preferably, in the detection system, each server node is further specifically configured to send the sub-health alarm information to the client node; and the client node is further specifically configured to close its client network port when the number of sub-health alarm messages it has received is greater than or equal to a preset alarm number threshold.
In summary, in the detection scheme provided by the application, the client node first sends a state detection message to any server node and receives the reply message fed back by that server node. The client node then judges from the reply message whether it is in a network sub-health state and, if so, sends a retest message to all server nodes. When at least a preset number of the server nodes judge that the client node is in a network sub-health state, the client node is considered to be genuinely in that state, and its sub-health alarm information is reported. Compared with the approach in the Background, in which a single server detects the network state of the client node, the client node here first detects its own network state, and all server nodes re-check it only after the client node has found a sub-health condition. Whether the client node is in a network sub-health state can therefore be judged more accurately, which improves detection accuracy and reduces the false alarm rate.
Drawings
In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; those skilled in the art can obtain other drawings from the structures shown in them without creative effort.
Fig. 1 is a schematic diagram of an application scenario provided by an embodiment of the present invention;
Fig. 2 is a schematic flowchart of a method for detecting a network sub-health state of a client node according to a first embodiment of the present invention;
Fig. 3 is a schematic flowchart of a first method for judging the network sub-health state in the embodiment shown in Fig. 2;
Fig. 4 is a schematic flowchart of a second method for judging the network sub-health state in the embodiment shown in Fig. 2;
Fig. 5 is a schematic flowchart of a method for detecting a network sub-health state of a client node according to a second embodiment of the present invention;
Fig. 6 is a schematic flowchart of a method for detecting a network sub-health state of a client node according to a third embodiment of the present invention;
Fig. 7 is a schematic structural diagram of a system for detecting a network sub-health state of a client node according to a first embodiment of the present invention;
Fig. 8 is a schematic structural diagram of a system for detecting a network sub-health state of a client node according to a second embodiment of the present invention.
The implementation of the objects, the functional features, and the advantages of the present invention are further explained below with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The main technical problem addressed by the embodiments of the invention is as follows:
Most existing schemes for detecting the network sub-health state of a client node are based on a client-server (CS) architecture: a single server detects the network state of the client node in order to determine whether the storage cluster as a whole is in a network sub-health state. This approach has a high false alarm probability. Once the server itself has a problem, the detection result for the client is affected, the client concludes that it is faulty, and a false alarm occurs.
To solve this problem, refer to Fig. 1, which is a schematic diagram of an application scenario provided by an embodiment of the present invention. As shown in Fig. 1, in this application scenario a client node 1 sends a state detection message to any server node 2 and waits for the reply message fed back by that server node. On receiving the reply message, the client node 1 judges from it whether the client node 1 itself is in a network sub-health state. If so, the client node 1 sends a retest message to all server nodes 2, and each server node 2 judges from the retest message whether the client node 1 is in a network sub-health state. When at least a preset number of the server nodes 2, for example half or more, judge that the client node 1 is in a network sub-health state, the server nodes 2 report the sub-health alarm information of the client node 1. In this scheme the client node 1 first detects its own network state, and only when it finds a sub-health condition do all server nodes 2 judge whether the client node 1 is in a network sub-health state. This improves the accuracy of detecting the network sub-health state of the client node 1 and reduces the false alarm rate.
Referring specifically to Fig. 2, Fig. 2 is a schematic flowchart of a method for detecting a network sub-health state of a client node according to the present invention. As shown in Fig. 2, the method includes:
S110: the client node sends a state detection message to any server node, the state detection message causing the server node to feed back a reply message. The state detection message contains a certain number of data packets. If the network state is normal, the reply message fed back by the server node contains the same number of data packets. The client node also sets a time tag when sending the state detection message and uses the time tag to calculate the delay.
S120: the client node judges from the reply message whether the client node itself is in a network sub-health state. If yes, go to step S130; if not, the client node stops sending the state detection message. The reply message contains a certain number of data packets. When that number equals the number of data packets contained in the state detection message, the client node is judged not to be in a network sub-health state; if the reply message contains fewer data packets than the state detection message, or no reply message is returned, the client node is judged to be in a network sub-health state.
As a preferred embodiment, as shown in Fig. 3, the step in which the client node judges from the reply message whether the client node itself is in a network sub-health state specifically includes:
S121: the client node calculates its packet loss rate from the number of data packets fed back in the reply message;
S122: the client node calculates its time delay from the time tag fed back in the reply message;
S123: the client node judges whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold;
S124: if the packet loss rate is greater than or equal to the preset packet loss rate threshold, or the time delay is greater than or equal to the preset time delay threshold, the client node judges that it is in a network sub-health state.
In the technical scheme provided by this embodiment, the client node calculates its packet loss rate from the number of data packets fed back in the reply message and its time delay from the time tag fed back in the reply message. When the packet loss rate is greater than or equal to the preset packet loss rate threshold, or the time delay is greater than or equal to the preset time delay threshold, the client node judges that it is in a network sub-health state. At that point the network state of the client node needs to be judged further by other nodes.
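The self-check in steps S120 to S124 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `ProbeReply`, `packet_loss_rate` and `is_sub_healthy` and the 1% loss threshold are hypothetical, the 100 ms delay threshold merely echoes the figure mentioned in the Background, and the delay is assumed to be measured on the client's own clock as the time between sending the state detection message and receiving the reply.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical thresholds; the source only says they are "preset".
LOSS_THRESHOLD = 0.01        # 1% packet loss (assumption)
DELAY_THRESHOLD_MS = 100.0   # echoes the 100 ms figure in the Background


@dataclass
class ProbeReply:
    packets_returned: int    # data packets fed back by the server node
    sent_at_ms: float        # time tag set by the client when sending
    received_at_ms: float    # client clock when the reply arrived


def packet_loss_rate(packets_sent: int, packets_returned: int) -> float:
    """S121: fraction of probe packets that were not fed back."""
    if packets_sent <= 0:
        raise ValueError("packets_sent must be positive")
    return (packets_sent - packets_returned) / packets_sent


def is_sub_healthy(packets_sent: int, reply: Optional[ProbeReply]) -> bool:
    """S122 to S124: sub-healthy if either metric reaches its threshold."""
    if reply is None:
        # S120: a missing reply is treated as a sub-health condition.
        return True
    loss = packet_loss_rate(packets_sent, reply.packets_returned)
    delay_ms = reply.received_at_ms - reply.sent_at_ms
    return loss >= LOSS_THRESHOLD or delay_ms >= DELAY_THRESHOLD_MS
```

The two conditions are combined with a logical OR, as in S124, so a node with no packet loss but a slow reply is still flagged for retesting.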
S130: the client node sends a retest message to all server nodes. The retest message contains information such as data packets and time tags, which all server nodes can use to judge whether the client node is in a network sub-health state.
S140: each server node judges from the retest message whether the client node is in a network sub-health state.
As a preferred embodiment, as shown in Fig. 4, the step in which all server nodes judge from the retest message whether the client node is in a network sub-health state specifically includes:
S141: each server node calculates the packet loss rate and the time delay of the client node from the time tags and the number of data packets contained in the retest message.
S142: each server node judges whether the packet loss rate is greater than or equal to a preset packet loss rate threshold and whether the time delay is greater than or equal to a preset time delay threshold.
In the technical scheme provided by this embodiment, every server node evaluates the time delay and the packet loss rate, and whether the client node is in a network sub-health state is determined from these two metrics, so that the network sub-health state of the client node is judged accurately.
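Steps S141 and S142 can be sketched from the point of view of a single server node. This is a hedged illustration only: `RetestPacket`, `Verdict` and `evaluate_retest` are hypothetical names, the default thresholds are placeholders, the delay is taken as the mean one-way delay (the source does not say how per-packet delays are aggregated), and the client and server clocks are assumed to be synchronized.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RetestPacket:
    seq: int               # sequence number within the retest message
    sent_at_ms: float      # time tag written by the client node
    received_at_ms: float  # arrival time recorded by the server node


@dataclass
class Verdict:
    loss_rate: float
    mean_delay_ms: float
    sub_healthy: bool


def evaluate_retest(declared_count: int,
                    received: List[RetestPacket],
                    loss_threshold: float = 0.01,
                    delay_threshold_ms: float = 100.0) -> Verdict:
    """S141/S142 on one server node (thresholds are assumptions)."""
    if declared_count <= 0:
        raise ValueError("declared_count must be positive")
    # Count each sequence number once so duplicates do not hide losses.
    unique = {p.seq: p for p in received}
    loss = (declared_count - len(unique)) / declared_count
    if unique:
        delays = [p.received_at_ms - p.sent_at_ms for p in unique.values()]
        mean_delay = sum(delays) / len(delays)
    else:
        mean_delay = float("inf")  # nothing arrived at all
    flagged = loss >= loss_threshold or mean_delay >= delay_threshold_ms
    return Verdict(loss, mean_delay, flagged)
```

Each server node would produce one such verdict, and only the boolean result needs to be shared for the subsequent counting step.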
S150: if at least a preset number of the server nodes judge that the client node is in a network sub-health state, the server nodes report the sub-health alarm information of the client node.
In summary, in the method for detecting the network sub-health state of a client node provided by this embodiment, the client node first sends a state detection message to any server node and receives the reply message fed back by that server node. The client node then judges from the reply message whether it is in a network sub-health state and, if so, sends a retest message to all server nodes. When at least a preset number of the server nodes judge that the client node is in a network sub-health state, the client node is considered to be genuinely in that state, and its sub-health alarm information is reported. Compared with detection of the client node's network state by a single server, the client node here first detects its own network state, and all server nodes re-check it only after the client node has found a sub-health condition. Whether the client node is in a network sub-health state can therefore be judged more accurately, which improves detection accuracy and reduces the false alarm rate.
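The counting rule in S150 reduces to comparing the number of positive verdicts with the preset number. The sketch below assumes each server node contributes one boolean verdict; `quorum_reached` and `majority_threshold` are hypothetical helper names, and "half or more" is used as one possible choice of the preset number.

```python
from typing import Iterable


def majority_threshold(total_servers: int) -> int:
    """Smallest count that is half or more of the server nodes."""
    return (total_servers + 1) // 2


def quorum_reached(verdicts: Iterable[bool], preset_number: int) -> bool:
    """S150: True when at least preset_number server nodes flag the client."""
    return sum(1 for v in verdicts if v) >= preset_number
```

With four server nodes and two positive verdicts the alarm is reported; with five server nodes and two positive verdicts it is not, because half or more of five requires three.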
In addition, to give a more accurate early warning of the network sub-health state of the client node, the method and the system can also use other client nodes to judge that state. Specifically, as a preferred embodiment, as shown in Fig. 5, after the step in which the client node judges that it is in a network sub-health state, the method further includes:
S210: the client node sends a retest message to all other client nodes;
S220: each of the other client nodes judges from the retest message whether the client node is in a network sub-health state;
S230: if at least a preset number of the other client nodes judge that the client node is in a network sub-health state, sub-health alarm information is sent to the client node.
In the technical scheme provided by this embodiment, the other client nodes judge whether the client node is in a network sub-health state and, when they determine that it is, send sub-health alarm information to the client node. The network state of the client node can thus be judged more accurately.
As a preferred embodiment, as shown in Fig. 6, after the step of reporting the sub-health alarm information of the client node, the method further includes:
S310: each server node that judged the client node to be in a network sub-health state sends the sub-health alarm information to the client node.
S320: when the number of sub-health alarm messages received by the client node is greater than or equal to a preset alarm number threshold, the client node closes its client network port.
In the technical scheme provided by this embodiment, each server node that judged the client node to be in a network sub-health state sends sub-health alarm information to the client node. The client node counts the alarm messages it receives; this count corresponds to the number of server nodes that judged it to be in a network sub-health state. When the count reaches the threshold, the client node closes its network port, which prevents the client node from transmitting information through that port.
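Steps S310 and S320 amount to a counter on the client node. The following sketch is illustrative only: `AlarmCounter` is a hypothetical class, the `close_port` callback stands in for whatever mechanism actually disables the client network port, and counting each server node at most once is an assumption made here so that a repeated alarm from one server does not inflate the count.

```python
from typing import Callable, Set


class AlarmCounter:
    """Client-side bookkeeping for S310/S320 (hypothetical helper)."""

    def __init__(self, alarm_threshold: int,
                 close_port: Callable[[], None]) -> None:
        self._threshold = alarm_threshold
        self._close_port = close_port   # injected; no real port is touched
        self._senders: Set[str] = set()
        self.port_closed = False

    def on_alarm(self, server_id: str) -> None:
        # Assumption: each server node is counted once.
        self._senders.add(server_id)
        if not self.port_closed and len(self._senders) >= self._threshold:
            self._close_port()
            self.port_closed = True
```

Injecting the port-closing action as a callback keeps the counting logic testable without privileged access to a network interface.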
In addition, based on the same concept as the above method embodiments, an embodiment of the present invention further provides a system for detecting a network sub-health state of a client node, which is used to implement the method of the present invention.
Referring to Fig. 7 and Fig. 8, which are schematic structural diagrams of detection systems for the network sub-health state of a client node according to embodiments of the present invention. As shown in Fig. 7 and Fig. 8, the detection system comprises at least one client node 1 and a plurality of server nodes 2, wherein,
the client node 1 is configured to send a state detection message to any server node 2, the state detection message causing the server node 2 to feed back a reply message.
The client node 1 is further configured to judge from the reply message whether the client node 1 itself is in a network sub-health state.
The client node 1 is further configured to send a retest message to all server nodes 2 if the client node 1 judges that it is in a network sub-health state.
Each server node 2 is configured to judge from the retest message whether the client node 1 is in a network sub-health state.
The server nodes 2 are further configured to report the sub-health alarm information of the client node 1 if at least a preset number of the server nodes 2 judge that the client node 1 is in a network sub-health state.
Referring to Fig. 7 and Fig. 8, the detection system is divided into two layers of logic: bottom-layer logic and upper-layer logic. The bottom-layer logic still uses a client-server (CS) architecture: the client node sends a state detection message to a server node, the server node returns it immediately on receipt, and the client node analyzes its own packet loss rate and time delay from the returned message.
Specifically, the client node 1 uses the bottom-layer logic to detect whether it is in a network sub-health state, that is, whether its packet loss rate or time delay is greater than or equal to the respective threshold. If so, it sends to all server nodes 2 a proposal stating that the client node is in a network sub-health state. After receiving the proposal, each server node 2 sends retest messages to the problem client node 1. Based on the retest result, each of these nodes gives its opinion on the proposal, approving or rejecting it. When half or more of all server nodes 2 approve, the problem client node 1 is considered to be in a network sub-health state; sub-health alarm information is then reported to the operating system 3, and the network port corresponding to the problem client node 1 is closed. Otherwise, no sub-health alarm information is reported to the operating system 3.
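The upper-layer logic described above (proposal, per-server retest, approve or reject, then report and close the port) can be sketched as a single function. This is a simplified model under stated assumptions: each server node is represented by a hypothetical callable that performs its own retest and returns `True` to approve the proposal, and `report` and `close_port` are placeholder callbacks standing in for the report to the operating system 3 and the port shutdown.

```python
from typing import Callable, Dict


def run_proposal(client_id: str,
                 servers: Dict[str, Callable[[str], bool]],
                 report: Callable[[str], None],
                 close_port: Callable[[str], None]) -> bool:
    """Collect one vote per server node and act when half or more approve."""
    if not servers:
        raise ValueError("at least one server node is required")
    # Each server retests the problem client and approves or rejects.
    approvals = sum(1 for retest in servers.values() if retest(client_id))
    confirmed = 2 * approvals >= len(servers)   # "half or more"
    if confirmed:
        report(client_id)       # placeholder for the report to the OS
        close_port(client_id)   # placeholder for closing the network port
    return confirmed
```

If fewer than half of the server nodes approve, the function returns `False` and neither callback runs, which mirrors the case where no sub-health alarm information is reported.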
To sum up, in the system for detecting the network sub-health state of the client node 1 provided by the embodiment of the present application, the client node 1 first sends a state detection message to any server node 2 and receives the message reply message fed back by that server node 2. The client node 1 then determines, according to the message reply message, whether it is in the network sub-health state and, if so, sends a message retest message to all the server nodes 2. When a predetermined number or more of the server nodes 2 determine that the client node 1 is in the network sub-health state, the client node 1 is confirmed to be in that state, and its sub-health warning information is reported. Compared with the approach in the background art, in which a single server detects the network state of the client node 1, here the client node 1 first detects its own network state, and all the server nodes 2 then re-detect it once the client node 1 has determined that it is in the network sub-health state. Whether the client node 1 is in the network sub-health state can therefore be judged more accurately, which improves detection accuracy and reduces the false alarm rate for the network sub-health state.
As a preferred embodiment, as shown in FIG. 8, in the system for detecting the network sub-health state, the client node 1 is further configured to calculate its own packet loss rate according to the number of data packets fed back in the message reply message;
the client node 1 is further configured to calculate its own time delay according to the time tag fed back in the message reply message;
the client node 1 is further configured to determine whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold;
the client node 1 is further configured to determine that it is in the network sub-health state if the packet loss rate is greater than or equal to the preset packet loss rate threshold or the time delay is greater than or equal to the preset time delay threshold.
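The self-check decision above reduces to two threshold comparisons joined by a logical OR. A minimal sketch, in which the function name `is_sub_healthy` and the parameter names are hypothetical and no concrete threshold values are implied by the source:

```python
def is_sub_healthy(
    loss_rate: float,
    delay: float,
    loss_threshold: float,
    delay_threshold: float,
) -> bool:
    """Return True if either metric reaches its preset threshold.

    Both comparisons are "greater than or equal to", so a metric that
    sits exactly on its threshold already counts as sub-healthy.
    """
    return loss_rate >= loss_threshold or delay >= delay_threshold
```

Because the conditions are joined by OR, a node with no packet loss but a time delay exactly equal to the time delay threshold is still flagged.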
As a preferred embodiment, as shown in FIG. 8, in the system for detecting the network sub-health state, each server node 2 is further configured to calculate the packet loss rate and the time delay of the client node 1 according to the time tag and the number of data packets contained in the message retest message;
each server node 2 is further configured to determine whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold.
Preferably, in the system, the client node 1 is further configured to send a message retest message to all other client nodes 1;
all the other client nodes 1 are configured to determine, according to the message retest message, whether the client node 1 is in the network sub-health state;
sub-health warning information is sent to the client node 1 if a predetermined number or more of the other client nodes 1 determine that the client node 1 is in the network sub-health state.
As a preferred embodiment, as shown in FIG. 8, in the system for detecting the network sub-health state, the server node 2 is further configured to send the sub-health warning information to the client node 1; the client node 1 is further configured to close its corresponding client network port when the number of sub-health warning messages it has received is greater than or equal to a predetermined warning number threshold.
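The port-closing rule can be sketched as a counter on the client side. This is an illustration under assumptions: the class name `ClientPortGuard`, its attributes, and the placeholder `close_port` method are hypothetical, and how a real system would actually take the network port down is not specified in the source.

```python
class ClientPortGuard:
    """Counts sub-health warnings and closes the port at a threshold."""

    def __init__(self, warning_threshold: int) -> None:
        if warning_threshold < 1:
            raise ValueError("warning_threshold must be at least 1")
        self.warning_threshold = warning_threshold
        self.warnings_received = 0
        self.port_open = True

    def on_warning(self) -> None:
        """Record one sub-health warning received from a server node."""
        self.warnings_received += 1
        if self.port_open and self.warnings_received >= self.warning_threshold:
            self.close_port()

    def close_port(self) -> None:
        # Placeholder: a real client would disable its network port here.
        self.port_open = False
```

With a threshold of 3, the port stays open after the first two warnings and is closed when the third arrives.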
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It should be noted that in the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, etc. does not indicate any ordering. These words may be interpreted as names.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (10)

1. A method for detecting a network sub-health state of a client node, characterized by comprising the following steps:
a client node sends a state detection message to any server node, wherein the state detection message is used to cause the server node to feed back a message reply message;
the client node determines, according to the message reply message, whether the client node itself is in a network sub-health state;
if the client node determines that it is in the network sub-health state, the client node sends a message retest message to all the server nodes;
all the server nodes determine, according to the message retest message, whether the client node is in the network sub-health state;
and if a preset number or more of the server nodes determine that the client node is in the network sub-health state, the server nodes report sub-health warning information of the client node.
2. The method according to claim 1, wherein the step of the client node determining, according to the message reply message, whether the client node itself is in the network sub-health state comprises:
the client node calculates its packet loss rate according to the number of data packets fed back in the message reply message;
the client node calculates its time delay according to the time tag fed back in the message reply message;
the client node determines whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold;
if the client node determines that the packet loss rate is greater than or equal to the preset packet loss rate threshold, or that the time delay is greater than or equal to the preset time delay threshold, the client node determines that it is in the network sub-health state.
3. The method according to claim 1, wherein the step of all the server nodes determining, according to the message retest message, whether the client node is in the network sub-health state comprises:
each server node calculates the packet loss rate and the time delay of the client node according to the time tag and the number of data packets contained in the message retest message;
and each server node determines whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold.
4. The method of claim 1, wherein after the step of the client node determining that the client node itself is in the network sub-health state, the method further comprises:
the client node sends a message retest message to all other client nodes;
all the other client nodes determine, according to the message retest message, whether the client node is in the network sub-health state;
and if a preset number or more of the other client nodes determine that the client node is in the network sub-health state, sub-health warning information is sent to the client node.
5. The method of claim 1, wherein after the step of the server nodes reporting the sub-health warning information of the client node, the method further comprises:
each server node that has determined that the client node is in the network sub-health state sends the sub-health warning information to the client node;
and when the number of sub-health warning messages received by the client node is greater than or equal to a preset warning number threshold, the client node closes its client network port.
6. A system for detecting a network sub-health state of a client node, comprising at least one client node and a plurality of server nodes, wherein
the client node is configured to send a state detection message to any server node, the state detection message being used to cause the server node to feed back a message reply message;
the client node is further configured to determine, according to the message reply message, whether the client node itself is in a network sub-health state;
the client node is further configured to send a message retest message to all the server nodes if it determines that it is in the network sub-health state;
all the server nodes are configured to determine, according to the message retest message, whether the client node is in the network sub-health state;
and the server nodes are further configured to report sub-health warning information of the client node if a preset number or more of the server nodes determine that the client node is in the network sub-health state.
7. The system according to claim 6, wherein the client node is further configured to calculate its packet loss rate according to the number of data packets fed back in the message reply message;
the client node is further configured to calculate its time delay according to the time tag fed back in the message reply message;
the client node is further configured to determine whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold;
the client node is further configured to determine that it is in the network sub-health state if the packet loss rate is greater than or equal to the preset packet loss rate threshold or the time delay is greater than or equal to the preset time delay threshold.
8. The system for detecting the network sub-health state according to claim 6, wherein
each server node is further configured to calculate the packet loss rate and the time delay of the client node according to the time tag and the number of data packets contained in the message retest message;
each server node is further configured to determine whether the packet loss rate is greater than or equal to a preset packet loss rate threshold, and whether the time delay is greater than or equal to a preset time delay threshold.
9. The system for detecting the network sub-health state according to claim 6, wherein
the client node is further configured to send a message retest message to all other client nodes;
all the other client nodes are configured to determine, according to the message retest message, whether the client node is in the network sub-health state;
sub-health warning information is sent to the client node if a predetermined number or more of the other client nodes determine that the client node is in the network sub-health state.
10. The system for detecting the network sub-health state according to claim 6, wherein
the server node is further configured to send the sub-health warning information to the client node;
the client node is further configured to close its corresponding client network port when the number of sub-health warning messages received by the client node is greater than or equal to a predetermined warning number threshold.
CN202110375922.0A 2021-04-08 2021-04-08 Method and system for detecting network sub-health state of client node Active CN113132160B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110375922.0A CN113132160B (en) 2021-04-08 2021-04-08 Method and system for detecting network sub-health state of client node

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110375922.0A CN113132160B (en) 2021-04-08 2021-04-08 Method and system for detecting network sub-health state of client node

Publications (2)

Publication Number Publication Date
CN113132160A true CN113132160A (en) 2021-07-16
CN113132160B CN113132160B (en) 2023-02-28

Family

ID=76775365

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110375922.0A Active CN113132160B (en) 2021-04-08 2021-04-08 Method and system for detecting network sub-health state of client node

Country Status (1)

Country Link
CN (1) CN113132160B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113890816A (en) * 2021-11-19 2022-01-04 深信服科技股份有限公司 Network health state analysis method and device, computer equipment and storage medium
CN115118635A (en) * 2022-06-29 2022-09-27 济南浪潮数据技术有限公司 Time delay detection method, device, equipment and storage medium
WO2024116136A1 (en) * 2022-12-01 2024-06-06 Jio Platforms Limited System and method for generating a customer health card

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105872118A (en) * 2015-11-25 2016-08-17 乐视云计算有限公司 Detection method and device of node servers in DNS (Domain Name System) network
CN110620703A (en) * 2019-09-29 2019-12-27 北京首汽智行科技有限公司 Http-based service node state determination method
CN110677480A (en) * 2019-09-29 2020-01-10 北京浪潮数据技术有限公司 Node health management method and device and computer readable storage medium
CN111142801A (en) * 2019-12-26 2020-05-12 星辰天合(北京)数据科技有限公司 Distributed storage system network sub-health detection method and device
CN111510338A (en) * 2020-03-09 2020-08-07 苏州浪潮智能科技有限公司 Distributed block storage network sub-health test method, device and storage medium
CN111585845A (en) * 2020-05-15 2020-08-25 苏州浪潮智能科技有限公司 Method, device and equipment for detecting network card node performance and readable medium
CN111884870A (en) * 2020-06-28 2020-11-03 苏州浪潮智能科技有限公司 Method, device, terminal and storage medium for predicting network sub-health state

Also Published As

Publication number Publication date
CN113132160B (en) 2023-02-28

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant