CN105490837A - Network monitoring processing method and device - Google Patents

Network monitoring processing method and device Download PDF

Info

Publication number
CN105490837A
CN105490837A CN201510824761.3A CN201510824761A CN105490837A CN 105490837 A CN105490837 A CN 105490837A CN 201510824761 A CN201510824761 A CN 201510824761A CN 105490837 A CN105490837 A CN 105490837A
Authority
CN
China
Prior art keywords
service node
message
service
node
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510824761.3A
Other languages
Chinese (zh)
Other versions
CN105490837B (en
Inventor
巩吉璋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Netstar Information Technology Co., Ltd.
Original Assignee
All Kinds Of Fruits Garden Guangzhou Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by All Kinds Of Fruits Garden Guangzhou Network Technology Co Ltd filed Critical All Kinds Of Fruits Garden Guangzhou Network Technology Co Ltd
Priority to CN201510824761.3A priority Critical patent/CN105490837B/en
Publication of CN105490837A publication Critical patent/CN105490837A/en
Application granted granted Critical
Publication of CN105490837B publication Critical patent/CN105490837B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • H04L41/0823Configuration setting characterised by the purposes of a change of settings, e.g. optimising configuration for enhancing reliability
    • H04L41/0826Configuration setting characterised by the purposes of a change of settings, e.g. optimising configuration for enhancing reliability for reduction of network costs

Abstract

The embodiment of the invention discloses a network monitoring processing method and device. The method comprises: obtaining the message sending state information between an access node and a server node; determining the network state of the service node according to the message delay rate or message failure rate in the message sending state information; when the network state is a normal state, adding service node to a normal service node set; when the network state is an abnormal state, adding service node to a pause service node set, thus only communicating with all service nodes in the normal service node set by the access node. In application of the invention, manual maintenance cost of the service nodes is reduced.

Description

A kind of network monitoring processing method and device
Technical field
The present invention relates to Internet technical field, particularly relate to a kind of network monitoring processing method and device.
Background technology
At current IM (InstantMessaging, instant messaging) in system, first request message from client is the process by access node, and then being forwarded to the service node of rear end, result is returned to client via access node after being disposed to request message by service node again.In order to realize the reliability of system, the service node of multiple equity usually can be disposed at the machine room in different regions.Request message is normally transmitted to service node according to random or polling mode by access node.When certain service node network goes wrong, still have component requests message and be forwarded on this problem service node, thus can impact the request of client.In this case, just need manually to take measures, isolate or stop this problem service node, and when network of computer room residing for this service node is not very stable, frequently problem service node isolated with regard to needs keeper or stop, causing manual maintenance cost to increase.
Summary of the invention
The embodiment of the present invention provides a kind of network monitoring processing method and device, can reduce the manual maintenance cost to service node.
Embodiments provide a kind of network monitoring processing method, comprising:
The message obtained between access node and service node sends state information;
According to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node;
When described network state is normal condition, add described service node to normal service node set;
When described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
Correspondingly, the embodiment of the present invention additionally provides a kind of network monitoring processing unit, comprising:
Acquisition module, sends state information for the message obtained between access node and service node;
Determination module, for according to the message delay rate in described message transmission state information or message failure rate, determines the network state of described service node;
Add module, for when described network state is normal condition, add described service node to normal service node set;
Described interpolation module, also for when described network state is abnormality, adds described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, thus the manual maintenance cost to service node can be reduced.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is the schematic diagram of a kind of network architecture that the embodiment of the present invention provides;
Fig. 2 is the schematic flow sheet of a kind of network monitoring processing method that the embodiment of the present invention provides;
Fig. 3 is the schematic flow sheet of the another kind of network monitoring processing method that the embodiment of the present invention provides;
Fig. 4 is the structural representation of a kind of network monitoring processing unit that the embodiment of the present invention provides;
Fig. 5 is the structural representation of a kind of acquisition module that the embodiment of the present invention provides;
Fig. 6 is a kind of structural representation calculating generation unit that the embodiment of the present invention provides;
Fig. 7 is the structural representation of a kind of determination module that the embodiment of the present invention provides;
Fig. 8 is the structural representation of the another kind of network monitoring processing unit that the embodiment of the present invention provides;
Fig. 9 is the structural representation of another network monitoring processing unit that the embodiment of the present invention provides.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
Referring to Fig. 1, is the schematic diagram of a kind of network architecture that the embodiment of the present invention provides.As shown in Figure 1, this network architecture can comprise network monitoring processing unit, access node and multiple service node.Described network monitoring processing unit can be applied in IM system, is specifically as follows a certain or multiple servers being deployed in IM system.Described network monitoring processing unit can carry out network connection respectively at described access node and described multiple service node, and described access node also can carry out network with described multiple service node and be connected.In the embodiment of the present invention, the message that described network monitoring processing unit can obtain between described access node and each service node sends state information, and according to the message delay rate in each message transmission state information or message failure rate, determine that each message sends the network state of the service node corresponding to state information respectively, described network state can also be that the service node of normal condition adds normal service node set to by described network monitoring processing unit, and be that the service node of abnormality adds node set out of service to by described network state, and control described access node and only all service nodes in described normal service node set are communicated, thus effectively can reduce the impact that the request of problem service node on client cause, but also can automatically isolate problem service node or stop, thus the manual maintenance cost that can reduce service node.Due to the same to the network status monitoring process of each service node, and to the set deterministic process of each service node too, so embodiment corresponding to following Fig. 2 to Fig. 9 is all described for one of them service node, and this service node can be any one service node in this network architecture.
Based on the network architecture shown in Fig. 1, a kind of network monitoring processing method that the embodiment of the invention discloses.Refer to Fig. 2, be the schematic flow sheet of a kind of network monitoring processing method that the embodiment of the present invention provides, described method can comprise:
S201, the message obtained between access node and service node sends state information;
Concrete, the message that network monitoring processing unit can obtain between access node and service node sends state information, and described message sends state information can comprise message delay rate and message failure rate.Wherein, message delay rate can be that in certain period, described access node is sent in all request messages of described service node, belongs to the ratio shared by request message of delay state; Described message failure rate can be that in the time, described access node is sent in all request messages of described service node, belongs to the ratio shared by request message of status of fail.Wherein, described access node is after sending a request message to described service node, and described service node can respond described request message and return acknowledge message.Therefore, when described access node exceedes the first default duration threshold value from certain request message of transmission to the duration receiving corresponding acknowledge message, this request message can be defined as the request message belonging to delay state; When described access node cannot receive acknowledge message corresponding to certain request message, this request message can be defined as the request message belonging to status of fail.Such as, described access node have sent 100 request messages to service node in 1 minute, wherein, the request message belonging to delay state is 20, and the request message belonging to status of fail is 10, so, can calculate message delay rate is 20%, and message failure rate is 10%.
S202, according to the message delay rate in described message transmission state information or message failure rate, determines the network state of described service node;
Concrete, described network monitoring processing unit can judge that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value, if be judged as YES, illustrate that too much request message occurs that time delay or too much request message occur responding unsuccessfully, then can determine that the network state of described service node is abnormality, otherwise, can determine that the network state of described service node is normal condition.Wherein, for judging the proportion threshold value whether described message delay rate is excessive, and for judging that the whether excessive proportion threshold value of described message failure rate can be identical, also can be different.
S203, when described network state is normal condition, adds described service node to normal service node set;
S204, when described network state is abnormality, adds described service node to node set out of service;
Concrete, when determining that described network state is normal condition, described network monitoring processing unit can add described service node to normal service node set; When determining that described network state is abnormality, described network monitoring processing unit can add described service node to node set out of service.Be in all service nodes in described node set out of service all by automatism isolation or stopping, make described access node when follow-up sending a request message, only request message can be sent to all service nodes in described normal service node set, to avoid access node described in subsequent process, request message be sent to problem service node.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, thus the manual maintenance cost to service node can be reduced.
Based on the network architecture shown in Fig. 1, the another kind of network monitoring processing method that the embodiment of the invention discloses.Refer to Fig. 3, be the schematic flow sheet of the another kind of network monitoring processing method that the embodiment of the present invention provides, described method can comprise:
S301, in preset time period, obtains at least one timestamp information between described access node and described service node;
Concrete, in preset time period, network monitoring processing unit can obtain at least one timestamp information between described access node and described service node, and each timestamp information includes transmitting time stamp and time of reception stamp; Described transmitting time stamp in described each timestamp information is timestamp when described access node sends a request message to described service node, and described time of reception stamp is timestamp when described access node receives the acknowledge message that described service node responds described request message.
S302, the described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information;
Concrete, after described network monitoring processing unit gets at least one timestamp information described, the described transmitting time that can calculate respectively in described each timestamp information stabs the time difference between described time of reception stamp, and the timestamp information being greater than described first duration threshold value the described time difference is defined as delayed data, and be that the timestamp information of exceptional value is defined as failure information by the described time difference; Described time of reception stamp corresponding to the time difference of exceptional value is null value.Add up the quantity of described delayed data and the quantity of described failure information again, and calculate message delay rate according to the quantity of described delayed data and the total quantity of described timestamp information, and calculate message failure rate according to the quantity of described failure information and the total quantity of described timestamp information, send state information to generate the message comprising described message delay rate and described message failure rate.Such as, in 1 minute, if described access node have sent 100 request messages to described service node, the all corresponding timestamp information of every bar request message, wherein, the described time of reception stamp in 10 timestamp informations is had to be null value, illustrate that described access node does not receive the acknowledge message corresponding to request message associated with these 10 timestamp informations, failure information is defined as by these 10 timestamp informations, therefore, total quantity (100) the * 100%=10% of quantity (the 10)/timestamp information of message failure rate=failure information can be obtained, in addition, the time difference wherein having 20 timestamp informations corresponding is greater than described first duration threshold value (2 seconds), then these 20 timestamp informations can be defined as delayed data, therefore, total quantity (100) the * 100%=20% of quantity (the 20)/timestamp information of message delay rate=delayed data can be obtained, thus the message transmission state information comprising message delay rate (20%) and message failure rate (10%) can be generated.
S303, judges that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value;
Concrete, described network monitoring processing unit can judge that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value.Wherein, for judging the proportion threshold value whether described message delay rate is excessive, and for judging that the whether excessive proportion threshold value of described message failure rate can be identical, also can be different.
S304, determines that the network state of described service node is normal condition;
Concrete, if S303 judges no, then can determine that the network state of described service node is normal condition.
S305, adds described service node to normal service node set;
Concrete, when the network state that S304 determines described service node is normal condition, can described service node be added to normal service node set.
S306, determines that the network state of described service node is abnormality;
Concrete, if S303 is judged as YES, illustrates that too much request message occurs that time delay or too much request message occur responding unsuccessfully, then can determine that the network state of described service node is abnormality.
S307, adds described service node to node set out of service;
Concrete, when the network state that S306 determines described service node is abnormality, can described service node be added to node set out of service.Be in all service nodes in described node set out of service all by automatism isolation or stopping, make described access node when follow-up sending a request message, only request message can be sent to all service nodes in described normal service node set, to avoid access node described in subsequent process, request message be sent to problem service node.
S308, when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, is transferred to described normal service node set by described service node from described node set out of service;
Concrete, after adding described service node to described node set out of service, timing can be started, and when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node is transferred to described normal service node set from described node set out of service, the service node being isolated or stopping can be recovered automatically, reduce further manual maintenance cost.Further, after described service node transfers to described normal service node set again, again the network state of described service node will be detected, if detect, the network state of described service node is still abnormality, then described service node will be transferred to described node set out of service again, now, described second duration threshold value will become original 2 times, and namely described service node just can be transferred in described normal service node set after needing to stop the time more of a specified duration in described node set out of service.Such as, if described second duration threshold value when described service node first time enters described node set out of service is t, described second duration threshold value when so described service node second time enters described node set out of service will become 2*t, by that analogy, described second duration threshold value when described service node enters described node set out of service for n-th time will become n*t.
Optionally, when the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, all service nodes in described node set out of service are all transferred in described normal service node set, and generate the alarm notification information be associated with described access node;
Concrete, when the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, illustrate that described access node has problem to the link of each service node, namely normally itself network goes wrong, and needs to process access node.
Optionally, described network monitoring processing unit is in preset time period, when obtaining at least one timestamp information between described access node and described service node, in order to avoid request message and acknowledge message are just across the error statistics that two preset time period cause, the timestamp information be in the time range of easily makeing mistakes can be deleted, the timestamp information be namely in the time range of easily makeing mistakes does not participate in statistics.Such as, the timestamp that current preparation is carried out adding up is set to t1, transmitting time stamp in each timestamp information got all is set to t2, then the timestamp information meeting t1-t2<30 second or t1-t2>2 minute can be deleted, namely only have the timestamp information of satisfied 30 seconds≤t1-t2≤2 minute just to participate in statistics, send state information to generate corresponding message.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, and when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node can be transferred to described normal service node set from described node set out of service, the service node being isolated or stopping can being recovered automatically, thus the manual maintenance cost that can reduce service node.
Based on the network architecture shown in Fig. 1, a kind of network monitoring processing unit that the embodiment of the invention discloses.Refer to Fig. 4, be the structural representation of a kind of network monitoring processing unit that the embodiment of the present invention provides, described network monitoring processing unit 1 can comprise: acquisition module 10, determination module 20, interpolation module 30;
Described acquisition module 10, sends state information for the message obtained between access node and service node;
Concrete, the message that described acquisition module 10 can obtain between access node and service node sends state information, and described message sends state information can comprise message delay rate and message failure rate.Wherein, message delay rate can be that in certain period, described access node is sent in all request messages of described service node, belongs to the ratio shared by request message of delay state; Described message failure rate can be that in the time, described access node is sent in all request messages of described service node, belongs to the ratio shared by request message of status of fail.Wherein, described access node is after sending a request message to described service node, and described service node can respond described request message and return acknowledge message.Therefore, when described access node exceedes the first default duration threshold value from certain request message of transmission to the duration receiving corresponding acknowledge message, this request message can be defined as the request message belonging to delay state; When described access node cannot receive acknowledge message corresponding to certain request message, this request message can be defined as the request message belonging to status of fail.Such as, described access node have sent 100 request messages to service node in 1 minute, wherein, the request message belonging to delay state is 20, and the request message belonging to status of fail is 10, so, can calculate message delay rate is 20%, and message failure rate is 10%.
Described determination module 20, for according to the message delay rate in described message transmission state information or message failure rate, determines the network state of described service node;
Concrete, described determination module 20 can judge that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value, if be judged as YES, illustrate that too much request message occurs that time delay or too much request message occur responding unsuccessfully, then described determination module 20 can determine that the network state of described service node is abnormality, otherwise, can determine that the network state of described service node is normal condition.Wherein, for judging the proportion threshold value whether described message delay rate is excessive, and for judging that the whether excessive proportion threshold value of described message failure rate can be identical, also can be different.
Described interpolation module 30, for when described network state is normal condition, adds described service node to normal service node set;
Described interpolation module 30, also for when described network state is abnormality, adds described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
Concrete, when determining that described network state is normal condition, described interpolation module 30 can add described service node to normal service node set; When determining that described network state is abnormality, described interpolation module 30 can add described service node to node set out of service.Be in all service nodes in described node set out of service all by automatism isolation or stopping, make described access node when follow-up sending a request message, only request message can be sent to all service nodes in described normal service node set, to avoid access node described in subsequent process, request message be sent to problem service node.
Further, then refer to Fig. 5, be the structural representation of a kind of acquisition module 10 that the embodiment of the present invention provides, described acquisition module 10 can comprise: timestamp acquiring unit 101, calculating generation unit 102;
Described timestamp acquiring unit 101, in preset time period, obtains at least one timestamp information between described access node and described service node, and each timestamp information includes transmitting time stamp and time of reception stamp; Described transmitting time stamp in described each timestamp information is timestamp when described access node sends a request message to described service node, and described time of reception stamp is timestamp when described access node receives the acknowledge message that described service node responds described request message;
Optionally, described timestamp acquiring unit 101 is in preset time period, when obtaining at least one timestamp information between described access node and described service node, in order to avoid request message and acknowledge message are just across the error statistics that two preset time period cause, the timestamp information be in the time range of easily makeing mistakes can be deleted, the timestamp information be namely in the time range of easily makeing mistakes does not participate in statistics.Such as, the timestamp that current preparation is carried out adding up is set to t1, transmitting time stamp in each timestamp information got all is set to t2, then the timestamp information meeting t1-t2<30 second or t1-t2>2 minute can be deleted, namely only have the timestamp information of satisfied 30 seconds≤t1-t2≤2 minute just to participate in statistics, send state information to generate corresponding message.
Described calculating generation unit 102, the time difference between described time of reception stamp is stabbed for the described transmitting time calculated respectively in described each timestamp information, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information.
Further, refer to Fig. 6 again, be a kind of structural representation calculating generation unit 102 that the embodiment of the present invention provides, described calculating generation unit 102 can comprise: computation subunit 1021, information determination subelement 1022, statistics subelement 1023, generation subelement 1024;
Described computation subunit 1021, stabs the time difference between described time of reception stamp for the described transmitting time calculated respectively in described each timestamp information;
Described information determination subelement 1022, is defined as delayed data for the timestamp information being greater than described first duration threshold value the described time difference, and is that the timestamp information of exceptional value is defined as failure information by the described time difference; Described time of reception stamp corresponding to the time difference of exceptional value is null value;
Described statistics subelement 1023, for the quantity of the quantity and described failure information of adding up described delayed data, and calculate message delay rate according to the quantity of described delayed data and the total quantity of described timestamp information, and calculate message failure rate according to the quantity of described failure information and the total quantity of described timestamp information;
Described generation subelement 1024, sends state information for generating the message comprising described message delay rate and described message failure rate;
Such as, in 1 minute, if described access node have sent 100 request messages to described service node, the all corresponding timestamp information of every bar request message, wherein, the described time of reception stamp in 10 timestamp informations is had to be null value, illustrate that described access node does not receive the acknowledge message corresponding to request message associated with these 10 timestamp informations, namely these 10 timestamp informations can be defined as failure information by described information determination subelement 1022, therefore, described statistics subelement 1023 can obtain total quantity (100) the * 100%=10% of quantity (the 10)/timestamp information of message failure rate=failure information, in addition, the time difference wherein having 20 timestamp informations corresponding is greater than described first duration threshold value (2 seconds), then these 20 timestamp informations can be defined as delayed data by described information determination subelement 1022, therefore, described statistics subelement 1023 can obtain total quantity (100) the * 100%=20% of quantity (the 20)/timestamp information of message delay rate=delayed data, thus described generation subelement 1024 can generate the message transmission state information comprising message delay rate (20%) and message failure rate (10%).
Further, then refer to Fig. 7, be the structural representation of a kind of determination module 20 that the embodiment of the present invention provides, described determination module 20 can comprise: ratio judging unit 201, status determining unit 202;
Described ratio judging unit 201, for judging that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value;
Described status determining unit 202, if be judged as YES for described ratio judging unit 201, then determines that the network state of described service node is abnormality;
Described status determining unit 202, if be also judged as NO for described ratio judging unit 201, then determines that the network state of described service node is normal condition.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, thus the manual maintenance cost to service node can be reduced.
Refer to Fig. 8 again, it is the structural representation of the another kind of network monitoring processing unit that the embodiment of the present invention provides, described network monitoring processing unit 1 can comprise acquisition module 10, determination module 20, interpolation module 30 in the corresponding embodiment of above-mentioned Fig. 4, further, described network monitoring processing unit 1 can also comprise: the first shift module 40, second shift module 50;
Described first shift module 40, for when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node is transferred to described normal service node set from described node set out of service;
Concrete, after adding described service node to described node set out of service, described first shift module 40 can start timing, and when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node is transferred to described normal service node set from described node set out of service, the service node being isolated or stopping can be recovered automatically, reduce further manual maintenance cost.Further, after described service node transfers to described normal service node set again, described network monitoring processing unit 1 will detect the network state of described service node again, if detect, the network state of described service node is still abnormality, then described service node will be transferred to described node set out of service again, now, described second duration threshold value will become original 2 times, and namely described service node just can be transferred in described normal service node set after needing to stop the time more of a specified duration in described node set out of service.Such as, if described second duration threshold value when described service node first time enters described node set out of service is t, described second duration threshold value when so described service node second time enters described node set out of service will become 2*t, by that analogy, described second duration threshold value when described service node enters described node set out of service for n-th time will become n*t.
Described second shift module 50, for when the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, all service nodes in described node set out of service are all transferred in described normal service node set, and generate the alarm notification information be associated with described access node;
Concrete, when the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, illustrate that described access node has problem to the link of each service node, namely normally itself network goes wrong, and needs to process access node.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, and when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node can be transferred to described normal service node set from described node set out of service, the service node being isolated or stopping can being recovered automatically, thus the manual maintenance cost that can reduce service node.
Based on the network architecture shown in Fig. 1, another the network monitoring processing unit that the embodiment of the invention discloses.Referring to Fig. 9, is the structural representation of another network monitoring processing unit that the embodiment of the present invention provides.Described network monitoring processing unit 1000 can comprise processor 1001, communication interface 1002 and memory 1003 (quantity of the processor 1001 in described network monitoring processing unit 1000 can be one or more, for a processor in Fig. 9).In some embodiments of the present invention, processor 1001, communication interface 1002 are connected by communication bus or other modes with memory 1003, and wherein, Fig. 9 is to connect by communication bus.
Wherein, described communication interface 1002, for communicating with service node with access node;
Described memory 1003 is for storage program;
Described processor 1001 for performing described program, to realize
The message obtained between access node and service node sends state information;
According to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node;
When described network state is normal condition, add described service node to normal service node set;
When described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
In one embodiment, when the message of described processor 1001 between execution acquisition access node and service node sends state information, specifically for:
In preset time period, obtain at least one timestamp information between described access node and described service node, each timestamp information includes transmitting time stamp and time of reception stamp; Described transmitting time stamp in described each timestamp information is timestamp when described access node sends a request message to described service node, and described time of reception stamp is timestamp when described access node receives the acknowledge message that described service node responds described request message;
The described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information.
In one embodiment, described processor 1001 stabs the time difference between described time of reception stamp in the described transmitting time that execution calculates in described each timestamp information respectively, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, when the message generated between described access node and described service node sends state information, specifically for:
The described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp;
The timestamp information being greater than described first duration threshold value the described time difference is defined as delayed data, and is that the timestamp information of exceptional value is defined as failure information by the described time difference; Described time of reception stamp corresponding to the time difference of exceptional value is null value;
Add up the quantity of described delayed data and the quantity of described failure information, and calculate message delay rate according to the quantity of described delayed data and the total quantity of described timestamp information, and calculate message failure rate according to the quantity of described failure information and the total quantity of described timestamp information;
Generate the message comprising described message delay rate and described message failure rate and send state information.
In one embodiment, described processor 1001 is performing according to the message delay rate in described message transmission state information or message failure rate, when determining the network state of described service node, specifically for:
Judge that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value;
If be judged as YES, then determine that the network state of described service node is abnormality;
If be judged as NO, then determine that the network state of described service node is normal condition.
In one embodiment, described processor 1001 execution when described network state is abnormality, after adding described service node to node set out of service, also for:
When detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node is transferred to described normal service node set from described node set out of service.
In one embodiment, described processor 1001 also for:
When the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, all service nodes in described node set out of service are all transferred in described normal service node set, and generate the alarm notification information be associated with described access node.
The embodiment of the present invention sends state information by the message obtained between access node and service node, and according to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node, when described network state is normal condition, described service node can be added to normal service node set; Or when described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.Only request message is transmitted to the service node in described normal service node set due to described access node, so the impact that the request of problem service node on client cause effectively can be reduced, and based on described normal service node set and described node set out of service to the division of service node, can automatically isolate problem service node or stop, and when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node can be transferred to described normal service node set from described node set out of service, the service node being isolated or stopping can being recovered automatically, thus the manual maintenance cost that can reduce service node.
One of ordinary skill in the art will appreciate that all or part of flow process realized in above-described embodiment method, that the hardware that can carry out instruction relevant by computer program has come, described program can be stored in a computer read/write memory medium, this program, when performing, can comprise the flow process of the embodiment as above-mentioned each side method.Wherein, described storage medium can be magnetic disc, CD, read-only store-memory body (Read-OnlyMemory, ROM) or random store-memory body (RandomAccessMemory, RAM) etc.
Above disclosedly be only present pre-ferred embodiments, certainly can not limit the interest field of the present invention with this, therefore according to the equivalent variations that the claims in the present invention are done, still belong to the scope that the present invention is contained.

Claims (12)

1. a network monitoring processing method, is characterized in that, comprising:
The message obtained between access node and service node sends state information;
According to the message delay rate in described message transmission state information or message failure rate, determine the network state of described service node;
When described network state is normal condition, add described service node to normal service node set;
When described network state is abnormality, add described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
2. the method for claim 1, is characterized in that, the message between described acquisition access node and service node sends state information, comprising:
In preset time period, obtain at least one timestamp information between described access node and described service node, each timestamp information includes transmitting time stamp and time of reception stamp; Described transmitting time stamp in described each timestamp information is timestamp when described access node sends a request message to described service node, and described time of reception stamp is timestamp when described access node receives the acknowledge message that described service node responds described request message;
The described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information.
3. method as claimed in claim 2, it is characterized in that, the described described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information, comprising:
The described transmitting time calculated respectively in described each timestamp information stabs the time difference between described time of reception stamp;
The timestamp information being greater than described first duration threshold value the described time difference is defined as delayed data, and is that the timestamp information of exceptional value is defined as failure information by the described time difference; Described time of reception stamp corresponding to the time difference of exceptional value is null value;
Add up the quantity of described delayed data and the quantity of described failure information, and calculate message delay rate according to the quantity of described delayed data and the total quantity of described timestamp information, and calculate message failure rate according to the quantity of described failure information and the total quantity of described timestamp information;
Generate the message comprising described message delay rate and described message failure rate and send state information.
4. the method for claim 1, is characterized in that, described according to the message delay rate in described message transmission state information or message failure rate, determines the network state of described service node, comprising:
Judge that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value;
If be judged as YES, then determine that the network state of described service node is abnormality;
If be judged as NO, then determine that the network state of described service node is normal condition.
5. the method for claim 1, is characterized in that, described when described network state is abnormality, after described service node being added to the step of node set out of service, also comprises:
When detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, described service node is transferred to described normal service node set from described node set out of service.
6. the method for claim 1, is characterized in that, also comprises:
When the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, all service nodes in described node set out of service are all transferred in described normal service node set, and generate the alarm notification information be associated with described access node.
7. a network monitoring processing unit, is characterized in that, comprising:
Acquisition module, sends state information for the message obtained between access node and service node;
Determination module, for according to the message delay rate in described message transmission state information or message failure rate, determines the network state of described service node;
Add module, for when described network state is normal condition, add described service node to normal service node set;
Described interpolation module, also for when described network state is abnormality, adds described service node to node set out of service, so that described access node only communicates to all service nodes in described normal service node set.
8. device as claimed in claim 7, it is characterized in that, described acquisition module comprises:
Timestamp acquiring unit, in preset time period, obtains at least one timestamp information between described access node and described service node, and each timestamp information includes transmitting time stamp and time of reception stamp; Described transmitting time stamp in described each timestamp information is timestamp when described access node sends a request message to described service node, and described time of reception stamp is timestamp when described access node receives the acknowledge message that described service node responds described request message;
Calculate generation unit, the time difference between described time of reception stamp is stabbed for the described transmitting time calculated respectively in described each timestamp information, and according to time difference corresponding to described each timestamp information and the first default duration threshold value, the message generated between described access node and described service node sends state information.
9. device as claimed in claim 8, it is characterized in that, described calculating generation unit comprises:
Computation subunit, stabs the time difference between described time of reception stamp for the described transmitting time calculated respectively in described each timestamp information;
Information determination subelement, is defined as delayed data for the timestamp information being greater than described first duration threshold value the described time difference, and is that the timestamp information of exceptional value is defined as failure information by the described time difference; Described time of reception stamp corresponding to the time difference of exceptional value is null value;
Statistics subelement, for the quantity of the quantity and described failure information of adding up described delayed data, and calculate message delay rate according to the quantity of described delayed data and the total quantity of described timestamp information, and calculate message failure rate according to the quantity of described failure information and the total quantity of described timestamp information;
Generating subelement, sending state information for generating the message comprising described message delay rate and described message failure rate.
10. device as claimed in claim 7, it is characterized in that, described determination module comprises:
Ratio judging unit, for judging that described message sends message delay rate in state information or whether message failure rate is greater than default proportion threshold value;
Status determining unit, if be judged as YES for described ratio judging unit, then determines that the network state of described service node is abnormality;
Described status determining unit, if be also judged as NO for described ratio judging unit, then determines that the network state of described service node is normal condition.
11. devices as claimed in claim 7, is characterized in that, also comprise:
First shift module, for when detecting that the described service node duration be in described node set out of service reaches described second duration threshold value, is transferred to described normal service node set by described service node from described node set out of service.
12. devices as claimed in claim 7, is characterized in that, also comprise:
Second shift module, for when the service node quantity that the service node quantity in described node set out of service is greater than in described normal service node set being detected, all service nodes in described node set out of service are all transferred in described normal service node set, and generate the alarm notification information be associated with described access node.
CN201510824761.3A 2015-11-24 2015-11-24 A kind of network monitoring processing method and device Active CN105490837B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510824761.3A CN105490837B (en) 2015-11-24 2015-11-24 A kind of network monitoring processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510824761.3A CN105490837B (en) 2015-11-24 2015-11-24 A kind of network monitoring processing method and device

Publications (2)

Publication Number Publication Date
CN105490837A true CN105490837A (en) 2016-04-13
CN105490837B CN105490837B (en) 2019-07-02

Family

ID=55677581

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510824761.3A Active CN105490837B (en) 2015-11-24 2015-11-24 A kind of network monitoring processing method and device

Country Status (1)

Country Link
CN (1) CN105490837B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106230658A (en) * 2016-07-25 2016-12-14 杭州迪普科技有限公司 A kind of method and apparatus monitoring the network equipment
CN108566331A (en) * 2018-04-03 2018-09-21 北京五八信息技术有限公司 Method of calibration, device, server, terminal and the storage medium of message transmission
CN111064613A (en) * 2019-12-13 2020-04-24 新华三大数据技术有限公司 Network fault detection method and device
CN111629399A (en) * 2019-02-28 2020-09-04 华为技术有限公司 Message processing method and device and terminal

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6374300B2 (en) * 1999-07-15 2002-04-16 F5 Networks, Inc. Method and system for storing load balancing information with an HTTP cookie
CN105049509A (en) * 2015-07-23 2015-11-11 浪潮电子信息产业股份有限公司 Cluster scheduling method, load balancer and clustering system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6374300B2 (en) * 1999-07-15 2002-04-16 F5 Networks, Inc. Method and system for storing load balancing information with an HTTP cookie
CN105049509A (en) * 2015-07-23 2015-11-11 浪潮电子信息产业股份有限公司 Cluster scheduling method, load balancer and clustering system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
罗智慧: "IP网络服务质量监测研究与实现", 《中国优秀硕士论文全文数据库 信息科技辑》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106230658A (en) * 2016-07-25 2016-12-14 杭州迪普科技有限公司 A kind of method and apparatus monitoring the network equipment
CN106230658B (en) * 2016-07-25 2019-05-07 杭州迪普科技股份有限公司 A kind of method and apparatus monitoring the network equipment
CN108566331A (en) * 2018-04-03 2018-09-21 北京五八信息技术有限公司 Method of calibration, device, server, terminal and the storage medium of message transmission
CN108566331B (en) * 2018-04-03 2021-08-10 北京五八信息技术有限公司 Message transmission checking method, device, server, terminal and storage medium
CN111629399A (en) * 2019-02-28 2020-09-04 华为技术有限公司 Message processing method and device and terminal
CN111629399B (en) * 2019-02-28 2022-01-14 华为技术有限公司 Message processing method and device and terminal
CN111064613A (en) * 2019-12-13 2020-04-24 新华三大数据技术有限公司 Network fault detection method and device
CN111064613B (en) * 2019-12-13 2022-03-22 新华三大数据技术有限公司 Network fault detection method and device

Also Published As

Publication number Publication date
CN105490837B (en) 2019-07-02

Similar Documents

Publication Publication Date Title
CN101483544B (en) Fault location device, communication device, and fault location method
KR102167613B1 (en) Message push method and device
KR101575547B1 (en) The error variance detection method of can communication system and the can communication system
CN105490837A (en) Network monitoring processing method and device
CN108737574B (en) Node offline judgment method, device, equipment and readable storage medium
CN107360239A (en) A kind of client connection status detection method and system
CN108418710B (en) Distributed monitoring system, method and device
CN109495530B (en) Real-time traffic data transmission method, transmission device and transmission system
CN102970167A (en) Method for detecting faults of network nodes in cluster system, network node and system
CN104301140A (en) Service request responding method, device and system
CN110740064A (en) Distributed cluster node fault processing method, device, equipment and storage medium
CN110809262A (en) Internet of things equipment operation and maintenance management method based on COAP protocol
CN104954190A (en) Method and device for detecting faults of application server
KR20170092602A (en) Method and apparatus for detecting connectivity of label switched path
US11606282B2 (en) Method and device for detecting network reliability
CN110321261B (en) Monitoring system and monitoring method
CN110601972A (en) Message transmission method and device and intelligent elastic architecture system
JP4836053B2 (en) Failure determination method and program
JP4863984B2 (en) Monitoring processing program, method and apparatus
JPH05225161A (en) Network monitoring system
JPS609246A (en) Monitor system for data circuit network
JP7143609B2 (en) COMMUNICATION DEVICE, COMMUNICATION METHOD, AND PROGRAM
KR102016029B1 (en) Apparatus and method for distributing load of vehicle communication
JP3060076B2 (en) Line monitoring system
CN115276710A (en) Transponder transmission system, transponder positioning compensation method, transponder positioning compensation device, and transponder positioning compensation medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20191106

Address after: 510000 X1301-E6803 (Cluster Address) (JM) No. 106 Fengze East Road, Nansha District, Guangzhou, Guangdong Province

Patentee after: Guangzhou Netstar Information Technology Co., Ltd.

Address before: 511442, Guangdong, Panyu District, Guangzhou Town, two Huambo road 79, Huambo business district, Wanda Commercial Plaza, North District, B-1 building, 25 floor

Patentee before: All kinds of fruits garden, Guangzhou network technology company limited

TR01 Transfer of patent right