CN109981377B - Distributed data center link monitoring method and system - Google Patents

Distributed data center link monitoring method and system Download PDF

Info

Publication number
CN109981377B
CN109981377B CN201910287217.8A CN201910287217A CN109981377B CN 109981377 B CN109981377 B CN 109981377B CN 201910287217 A CN201910287217 A CN 201910287217A CN 109981377 B CN109981377 B CN 109981377B
Authority
CN
China
Prior art keywords
link quality
monitoring server
machine room
data center
quality data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910287217.8A
Other languages
Chinese (zh)
Other versions
CN109981377A (en
Inventor
田兆楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sohu New Media Information Technology Co Ltd
Original Assignee
Beijing Sohu New Media Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sohu New Media Information Technology Co Ltd filed Critical Beijing Sohu New Media Information Technology Co Ltd
Priority to CN201910287217.8A priority Critical patent/CN109981377B/en
Publication of CN109981377A publication Critical patent/CN109981377A/en
Application granted granted Critical
Publication of CN109981377B publication Critical patent/CN109981377B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04Network management architectures or arrangements
    • H04L41/042Network management architectures or arrangements comprising distributed management centres cooperatively managing the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a distributed data center link monitoring method and a system, wherein the method comprises the following steps: deploying a monitoring server in each internet data center machine room, selecting an IP from each subnet of the machine room where the monitoring server is located, and configuring the IPs of all the subnets on a network card of the monitoring server; and acquiring link quality data among different subnetworks of each Internet data center machine room based on the monitoring server, storing the link quality data, and displaying the link quality data based on the link quality matrix. The invention can realize real-time network link monitoring, does not need additional development tools, does not need to analyze network equipment logs, has simple development and almost no need of maintenance, can directly feed back the quality of the network link from a network equipment layer, does not need to find faults from an application side, and ensures that the positioning is timely and accurate.

Description

Distributed data center link monitoring method and system
Technical Field
The invention relates to the technical field of internet data centers, in particular to a distributed data center link monitoring method and system.
Background
The large-scale data center has a huge network scale, and usually, a large number of network devices are provided, and the network devices need to be cascaded in multiple layers, and once a device fails, the failed device needs to be located in time. If the application side finds a fault, the problem link is passively positioned according to the fault phenomenon of the application side, and sufficient information cannot be obtained to position in time. Therefore, the network operation department needs to actively monitor the quality of the data center link in real time, discover faults in time, and isolate and replace faulty equipment.
The traditional network center link monitoring method usually needs to collect and analyze logs or information on network equipment, and has complex configuration and analysis and high maintenance cost. Or the application side reflects the network fault, and the fault cannot be sufficiently and timely positioned.
Therefore, how to effectively monitor the distributed data center link is an urgent problem to be solved.
Disclosure of Invention
In view of this, the present invention provides a distributed data center link monitoring method, which can implement real-time network link monitoring, does not need additional development tools, does not need to analyze network device logs, is simple to develop, hardly needs maintenance, can directly feed back the quality of a network link from a network device layer, does not need to find a fault from an application side, and enables positioning to be timely and accurate.
The invention provides a distributed data center link monitoring method, which comprises the following steps:
deploying a monitoring server in each internet data center machine room;
selecting an IP from each subnet of all subnets of a machine room where a monitoring server is located, and configuring the IPs of all subnets on a network card of the monitoring server;
acquiring link quality data among different subnets of each internet data center machine room based on the monitoring server;
storing the link quality data;
displaying the link quality data based on a link quality matrix.
Preferably, the obtaining, based on the monitoring server, link quality data between different subnets of each internet data center machine room includes:
reading all monitoring server IP sets of the Internet data center machine room recorded on each monitoring server;
and operating the Ping command on each monitoring server, sequentially Ping the IPs of all the Internet data center machine rooms monitoring servers by all the IPs on the monitoring servers, and obtaining link quality data among different subnets of each Internet data center machine room.
Preferably, the Ping command specifies parameters when executed as follows:
10 packets, each 500 bytes in size, are sent, one packet every 6 seconds.
Preferably, the storing the link quality data comprises:
storing the link quality data in real time;
summarizing the link quality data every hour;
aggregating the link quality data every day.
Preferably, the method further comprises:
and when the packet loss rate or the network delay of the link quality data is greater than an alarm threshold, generating alarm information.
A distributed data center link monitoring system, comprising:
the deployment module is used for deploying one monitoring server in each Internet data center machine room;
the configuration module is used for selecting an IP from each subnet of all subnets of a machine room where the monitoring server is located and configuring the IPs of all subnets on a network card of the monitoring server;
the processing module is used for acquiring link quality data among different subnets of each internet data center machine room based on the monitoring server;
a storage module for storing the link quality data;
and the display module is used for displaying the link quality data based on the link quality matrix.
Preferably, the processing module comprises:
the reading unit is used for reading all monitoring server IP sets of the Internet data center machine room recorded on each monitoring server;
and the command operation unit is used for operating the Ping command on each monitoring server, sequentially Ping the IPs of all the Internet data center machine rooms monitoring servers by all the IPs on the monitoring servers, and obtaining the link quality data among different subnets of each Internet data center machine room.
Preferably, the Ping command specifies parameters when executed as follows:
10 packets, each 500 bytes in size, are sent, one packet every 6 seconds.
Preferably, when the storage module executes the storage of the link quality data, the storage module is specifically configured to:
storing the link quality data in real time;
summarizing the link quality data every hour;
aggregating the link quality data every day.
Preferably, the system further comprises:
and the generating module is used for generating alarm information when the packet loss rate or the network delay of the link quality data is greater than an alarm threshold.
In summary, the present invention discloses a distributed data center link monitoring method, which includes: deploying a monitoring server in each internet data center machine room; for all subnets of a machine room where the monitoring server is located, selecting an IP from each subnet, and configuring the IPs of all subnets on a network card of the monitoring server; acquiring link quality data among different subnets of each internet data center machine room based on a monitoring server; and storing the link quality data, and displaying the link quality data based on the link quality matrix. According to the invention, information of network equipment does not need to be collected and analyzed, link quality conditions of all subnets can be monitored only by deploying a simple monitoring service in each Internet data center machine room, and monitoring data is displayed in a link matrix form, so that link states can be visually detected.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of a distributed data center link monitoring method 1 according to an embodiment of the present disclosure;
fig. 2 is a flowchart of a distributed data center link monitoring method according to embodiment 2 of the present disclosure;
FIG. 3 is a schematic diagram of a link quality matrix disclosed in the present invention;
FIG. 4 is a schematic diagram of a link quality matrix displaying values according to the present disclosure;
fig. 5 is a schematic structural diagram of a distributed data center link monitoring system 1 according to an embodiment of the present disclosure;
fig. 6 is a schematic structural diagram of a distributed data center link monitoring system in embodiment 2 disclosed in the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, which is a flowchart of embodiment 1 of a distributed data center link monitoring method disclosed in the present invention, the method may include the following steps:
s101, deploying a monitoring server in each Internet data center machine room;
when the quality of a distributed data Center link needs to be monitored, a monitoring server is deployed in each IDC (Internet data Center) room. The IDC machine room is that the telecommunication department utilizes the existing Internet communication line and bandwidth resources to establish a standardized telecommunication professional machine room environment and provide all-round services in the aspects of server hosting, renting, related value adding and the like for enterprises and governments.
S102, selecting an IP from each subnet of all subnets of a machine room where a monitoring server is located, and configuring the IPs of all subnets on a network card of the monitoring server;
the IP address represents the host on the network by the network number and the host number, and the direct intercommunication can be realized only between computers under one network number, and the intercommunication can be realized only between computers with different network numbers through a gateway. However, such a division is not very flexible in some cases, for which reason IP networks also allow a division into smaller networks, called subnets (Subnet).
And selecting one IP from each subnet of the machine room where the monitoring server is located, and configuring the IPs on the network card of the monitoring server. Therefore, each monitoring server is configured with the IP of all different subnets of the IDC room. The network card configuration on the monitoring server is as follows:
eth10.subnet1:inet IP1netmask mask1broadcast broadcast1
eth10.subnet2:inet IP1netmask mask1broadcast broadcast2
eth10.subnet3:inet IP1netmask mask1broadcast broadcast3
eth10.subnet4:inet IP1netmask mask4broadcast broadcast4
s103, acquiring link quality data among different subnets of each Internet data center machine room based on a monitoring server;
and then, executing a monitoring service program on each monitoring server at regular time to acquire link quality data among different subnets of each IDC machine room.
S104, storing link quality data;
and after the link quality data among different subnets of each IDC machine room is obtained, the obtained link quality data is further stored, so that the link quality data can be conveniently displayed at the front end.
And S105, displaying the link quality data based on the link quality matrix.
Finally, as shown in fig. 3, the link quality matrix is used to display the link conditions between different subnets in a matrix form, and different colors can represent different link qualities.
Further, as shown in fig. 4, a specific value of the link condition may be displayed for each point in the link quality matrix. Wherein, the row and column adopt time delay and packet loss rate curve to display the link condition directly for one IP of all sub-networks of different IDC machine rooms.
In summary, in the above embodiments, a distributed data center link monitoring method includes: deploying a monitoring server in each internet data center machine room; for all subnets of a machine room where the monitoring server is located, selecting an IP from each subnet, and configuring the IPs of all subnets on a network card of the monitoring server; acquiring link quality data among different subnets of each internet data center machine room based on a monitoring server; and storing the link quality data, and displaying the link quality data based on the link quality matrix. According to the invention, information of network equipment does not need to be collected and analyzed, link quality conditions of all subnets can be monitored only by deploying a simple monitoring service in each Internet data center machine room, and monitoring data is displayed in a link matrix form, so that link states can be visually detected.
As shown in fig. 2, which is a flowchart of embodiment 2 of a distributed data center link monitoring method disclosed in the present invention, the method may include the following steps:
s201, deploying a monitoring server in each Internet data center machine room;
when the quality of a distributed data Center link needs to be monitored, a monitoring server is deployed in each IDC (Internet data Center) room. The IDC machine room is that the telecommunication department utilizes the existing Internet communication line and bandwidth resources to establish a standardized telecommunication professional machine room environment and provide all-round services in the aspects of server hosting, renting, related value adding and the like for enterprises and governments.
S202, selecting an IP from each subnet of all subnets of a machine room where a monitoring server is located, and configuring the IPs of all subnets on a network card of the monitoring server;
the IP address represents the host on the network by the network number and the host number, and the direct intercommunication can be realized only between computers under one network number, and the intercommunication can be realized only between computers with different network numbers through a gateway. However, such a division is not very flexible in some cases, for which reason IP networks also allow a division into smaller networks, called subnets (Subnet).
And selecting one IP from each subnet of the machine room where the monitoring server is located, and configuring the IPs on the network card of the monitoring server. Therefore, each monitoring server is configured with the IP of all different subnets of the IDC room. The network card configuration on the monitoring server is as follows:
eth10.subnet1:inet IP1netmask mask1broadcast broadcast1
eth10.subnet2:inet IP1netmask mask1broadcast broadcast2
eth10.subnet3:inet IP1netmask mask1broadcast broadcast3
eth10.subnet4:inet IP1netmask mask4broadcast broadcast4
s203, reading all monitoring server IP sets of the Internet data center machine room recorded on each monitoring server;
and then, executing a monitoring service program on each monitoring server at regular time to acquire link quality data among different subnets of each IDC machine room. Specifically, the IP sets of all the IDC rooms recorded on each monitoring server are read first.
S204, operating a Ping command on each monitoring server, sequentially Ping the IPs of all the monitoring servers of the Internet data center machine rooms by all the IPs on the monitoring servers, and obtaining link quality data among different subnets of each Internet data center machine room;
and then reading the IP of the monitoring node, namely reading all the IPs on the monitoring server, operating the Ping command on the monitoring server, and sequentially using the monitoring server IPs of all the IDC machine rooms of all the IPPINg on the monitoring server to obtain link quality data among different subnets of each Internet data center machine room.
Specifically, in order to ensure the real-time performance of link monitoring, the present invention is expected to limit the execution of Ping commands within one minute, so the specified parameters when executing Ping commands are set to be:
ping–I source_ip-c 10-i 6-l 500destination_ip
where-c 10 indicates sending 10 packets, -l 500 indicates the size of each packet is 500 bytes, -i 6 indicates sending one packet every 6 seconds.
In order to monitor the link condition in real time, the monitoring program is executed for two minutes, the execution time of each program is controlled within two minutes, wherein one minute is used for executing the Ping command, and one minute is used for storing the data.
S205, storing link quality data;
and after the link quality data among different subnets of each IDC machine room is obtained, the obtained link quality data is further stored, so that the link quality data can be conveniently displayed at the front end.
Specifically, to display monitoring data of different time granularities, the data may be stored in three granularities, which are real-time data, hourly summarized data, and daily summarized data, respectively. Wherein the real-time data is updated every two minutes; summarizing data every two minutes every hour, and reserving all data of the last hour; one hour of data was summarized each day, and the last two weeks of data were retained. Wherein the hourly summary and the daily summary data are obtained from the real-time data and stored.
S206, displaying link quality data based on the link quality matrix;
finally, as shown in fig. 3, the link quality matrix is used to display the link conditions between different subnets in a matrix form, and different colors can represent different link qualities.
Further, as shown in fig. 4, a specific value of the link condition may be displayed for each point in the link quality matrix. Wherein, the row and column adopt time delay and packet loss rate curve to display the link condition directly for one IP of all sub-networks of different IDC machine rooms.
And S207, when the packet loss rate of the link quality data or the network delay is greater than an alarm threshold, generating alarm information.
According to practical experience, when the packet loss rate is greater than 10% or the network delay is greater than 5ms, the network link is considered to be in fault, and the range is used as the threshold value of the alarm. And when the packet loss rate of the link quality data or the network delay is greater than an alarm threshold value, generating alarm information, and alarming the alarm information in the form of mails or short messages and the like.
In summary, the conventional network center link monitoring method usually needs to collect and analyze logs or information on network devices, and is complex in configuration and analysis and high in maintenance cost. Or the application side reflects the network fault, and the fault cannot be sufficiently and timely positioned. The invention directly monitors the quality of the link side without collecting and analyzing the information of the network equipment. The link quality conditions of all the subnets can be monitored only by deploying a simple monitoring service in each IDC machine room, a monitoring program can be developed by using Python or shell scripts, the development period is short, maintenance is basically not needed, the acquisition and processing of link quality data can be completed within two minutes by selecting proper Ping command parameters, the purpose of real-time monitoring is achieved, the monitoring data is displayed in a link matrix form, and a network engineer can be helped to visually detect the link state.
As shown in fig. 5, which is a schematic structural diagram of an embodiment 1 of a distributed data center link monitoring system disclosed in the present invention, the system may include:
a deployment module 501, configured to deploy one monitoring server in each internet data center machine room;
when the quality of a distributed data Center link needs to be monitored, a monitoring server is deployed in each IDC (Internet data Center) room. The IDC machine room is that the telecommunication department utilizes the existing Internet communication line and bandwidth resources to establish a standardized telecommunication professional machine room environment and provide all-round services in the aspects of server hosting, renting, related value adding and the like for enterprises and governments.
A configuration module 502, configured to select an IP from each subnet for all subnets of a machine room where the monitoring server is located, and configure the IPs of all subnets on a network card of the monitoring server;
the IP address represents the host on the network by the network number and the host number, and the direct intercommunication can be realized only between computers under one network number, and the intercommunication can be realized only between computers with different network numbers through a gateway. However, such a division is not very flexible in some cases, for which reason IP networks also allow a division into smaller networks, called subnets (Subnet).
And selecting one IP from each subnet of the machine room where the monitoring server is located, and configuring the IPs on the network card of the monitoring server. Therefore, each monitoring server is configured with the IP of all different subnets of the IDC room. The network card configuration on the monitoring server is as follows:
eth10.subnet1:inet IP1netmask mask1broadcast broadcast1
eth10.subnet2:inet IP1netmask mask1broadcast broadcast2
eth10.subnet3:inet IP1netmask mask1broadcast broadcast3
eth10.subnet4:inet IP1netmask mask4broadcast broadcast4
the processing module 503 is configured to obtain link quality data between different subnets of each internet data center machine room based on the monitoring server;
and then, executing a monitoring service program on each monitoring server at regular time to acquire link quality data among different subnets of each IDC machine room.
A storage module 504 for storing link quality data;
and after the link quality data among different subnets of each IDC machine room is obtained, the obtained link quality data is further stored, so that the link quality data can be conveniently displayed at the front end.
A display module 505, configured to display the link quality data based on the link quality matrix.
Finally, as shown in fig. 3, the link quality matrix is used to display the link conditions between different subnets in a matrix form, and different colors can represent different link qualities.
Further, as shown in fig. 4, a specific value of the link condition may be displayed for each point in the link quality matrix. Wherein, the row and column adopt time delay and packet loss rate curve to display the link condition directly for one IP of all sub-networks of different IDC machine rooms.
In summary, in the above embodiments, a distributed data center link monitoring method includes: deploying a monitoring server in each internet data center machine room; for all subnets of a machine room where the monitoring server is located, selecting an IP from each subnet, and configuring the IPs of all subnets on a network card of the monitoring server; acquiring link quality data among different subnets of each internet data center machine room based on a monitoring server; and storing the link quality data, and displaying the link quality data based on the link quality matrix. According to the invention, information of network equipment does not need to be collected and analyzed, link quality conditions of all subnets can be monitored only by deploying a simple monitoring service in each Internet data center machine room, and monitoring data is displayed in a link matrix form, so that link states can be visually detected.
As shown in fig. 6, which is a schematic structural diagram of an embodiment 2 of a distributed data center link monitoring system disclosed in the present invention, the system may include:
the deployment module 601 is used for deploying one monitoring server in each internet data center machine room;
when the quality of a distributed data Center link needs to be monitored, a monitoring server is deployed in each IDC (Internet data Center) room. The IDC machine room is that the telecommunication department utilizes the existing Internet communication line and bandwidth resources to establish a standardized telecommunication professional machine room environment and provide all-round services in the aspects of server hosting, renting, related value adding and the like for enterprises and governments.
A configuration module 602, configured to select an IP from each subnet for all subnets of a machine room where the monitoring server is located, and configure the IPs of all subnets on a network card of the monitoring server;
the IP address represents the host on the network by the network number and the host number, and the direct intercommunication can be realized only between computers under one network number, and the intercommunication can be realized only between computers with different network numbers through a gateway. However, such a division is not very flexible in some cases, for which reason IP networks also allow a division into smaller networks, called subnets (Subnet).
And selecting one IP from each subnet of the machine room where the monitoring server is located, and configuring the IPs on the network card of the monitoring server. Therefore, each monitoring server is configured with the IP of all different subnets of the IDC room. The network card configuration on the monitoring server is as follows:
eth10.subnet1:inet IP1netmask mask1broadcast broadcast1
eth10.subnet2:inet IP1netmask mask1broadcast broadcast2
eth10.subnet3:inet IP1netmask mask1broadcast broadcast3
eth10.subnet4:inet IP1netmask mask4broadcast broadcast4
a reading unit 603, configured to read all monitoring server IP sets of the internet data center machine room recorded on each monitoring server;
and then, executing a monitoring service program on each monitoring server at regular time to acquire link quality data among different subnets of each IDC machine room. Specifically, the IP sets of all the IDC rooms recorded on each monitoring server are read first.
A command running unit 604, configured to run a Ping command on each monitoring server, where all IPs on the monitoring server sequentially Ping IPs of all internet data center machine rooms monitoring servers, and obtain link quality data between different subnets of each internet data center machine room;
and then reading the IP of the monitoring node, namely reading all the IPs on the monitoring server, operating the Ping command on the monitoring server, and sequentially using the monitoring server IPs of all the IDC machine rooms of all the IPPINg on the monitoring server to obtain link quality data among different subnets of each Internet data center machine room.
Specifically, in order to ensure the real-time performance of link monitoring, the present invention is expected to limit the execution of Ping commands within one minute, so the specified parameters when executing Ping commands are set to be:
ping–I source_ip-c 10-i 6-l 500destination_ip
where-c 10 indicates sending 10 packets, -l 500 indicates the size of each packet is 500 bytes, -i 6 indicates sending one packet every 6 seconds.
In order to monitor the link condition in real time, the monitoring program is executed for two minutes, the execution time of each program is controlled within two minutes, wherein one minute is used for executing the Ping command, and one minute is used for storing the data.
A storage module 605 for storing link quality data;
and after the link quality data among different subnets of each IDC machine room is obtained, the obtained link quality data is further stored, so that the link quality data can be conveniently displayed at the front end.
Specifically, to display monitoring data of different time granularities, the data may be stored in three granularities, which are real-time data, hourly summarized data, and daily summarized data, respectively. Wherein the real-time data is updated every two minutes; summarizing data every two minutes every hour, and reserving all data of the last hour; one hour of data was summarized each day, and the last two weeks of data were retained. Wherein the hourly summary and the daily summary data are obtained from the real-time data and stored.
A display module 606 for displaying link quality data based on the link quality matrix;
finally, as shown in fig. 3, the link quality matrix is used to display the link conditions between different subnets in a matrix form, and different colors can represent different link qualities.
Further, as shown in fig. 4, a specific value of the link condition may be displayed for each point in the link quality matrix. Wherein, the row and column adopt time delay and packet loss rate curve to display the link condition directly for one IP of all sub-networks of different IDC machine rooms.
The generating module 607 is configured to generate the alarm information when the packet loss rate of the link quality data or the network delay is greater than the alarm threshold.
According to practical experience, when the packet loss rate is greater than 10% or the network delay is greater than 5ms, the network link is considered to be in fault, and the range is used as the threshold value of the alarm. And when the packet loss rate of the link quality data or the network delay is greater than an alarm threshold value, generating alarm information, and alarming the alarm information in the form of mails or short messages and the like.
In summary, the conventional network center link monitoring method usually needs to collect and analyze logs or information on network devices, and is complex in configuration and analysis and high in maintenance cost. Or the application side reflects the network fault, and the fault cannot be sufficiently and timely positioned. The invention directly monitors the quality of the link side without collecting and analyzing the information of the network equipment. The link quality conditions of all the subnets can be monitored only by deploying a simple monitoring service in each IDC machine room, a monitoring program can be developed by using Python or shell scripts, the development period is short, maintenance is basically not needed, the acquisition and processing of link quality data can be completed within two minutes by selecting proper Ping command parameters, the purpose of real-time monitoring is achieved, the monitoring data is displayed in a link matrix form, and a network engineer can be helped to visually detect the link state.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (8)

1. A distributed data center link monitoring method is characterized by comprising the following steps:
deploying a monitoring server in each internet data center machine room;
selecting an IP from each subnet of all subnets of a machine room where a monitoring server is located, and configuring the IPs of all subnets on a network card of the monitoring server;
acquiring link quality data among different subnets of each internet data center machine room based on the monitoring server;
storing the link quality data;
displaying the link quality data based on a link quality matrix;
the obtaining of the link quality data among different subnets of each internet data center machine room based on the monitoring server comprises:
reading all monitoring server IP sets of the Internet data center machine room recorded on each monitoring server;
and operating the Ping command on each monitoring server, sequentially Ping the IPs of all the Internet data center machine rooms monitoring servers by all the IPs on the monitoring servers, and obtaining link quality data among different subnets of each Internet data center machine room.
2. The method of claim 1, wherein the Ping command when executed specifies parameters to:
10 packets, each 500 bytes in size, are sent, one packet every 6 seconds.
3. The method of claim 1, wherein the storing the link quality data comprises:
storing the link quality data in real time;
summarizing the link quality data every hour;
aggregating the link quality data every day.
4. The method of claim 1, further comprising:
and when the packet loss rate or the network delay of the link quality data is greater than an alarm threshold, generating alarm information.
5. A distributed data center link monitoring system, comprising:
the deployment module is used for deploying one monitoring server in each Internet data center machine room;
the configuration module is used for selecting an IP from each subnet of all subnets of a machine room where the monitoring server is located and configuring the IPs of all subnets on a network card of the monitoring server;
the processing module is used for acquiring link quality data among different subnets of each internet data center machine room based on the monitoring server;
a storage module for storing the link quality data;
a display module for displaying the link quality data based on a link quality matrix;
the processing module comprises:
the reading unit is used for reading all monitoring server IP sets of the Internet data center machine room recorded on each monitoring server;
and the command operation unit is used for operating the Ping command on each monitoring server, sequentially Ping the IPs of all the Internet data center machine rooms monitoring servers by all the IPs on the monitoring servers, and obtaining the link quality data among different subnets of each Internet data center machine room.
6. The system of claim 5, wherein the Ping command when executed specifies parameters to:
10 packets, each 500 bytes in size, are sent, one packet every 6 seconds.
7. The system of claim 5, wherein the storage module, when executing storing the link quality data, is specifically configured to:
storing the link quality data in real time;
summarizing the link quality data every hour;
aggregating the link quality data every day.
8. The system of claim 5, further comprising:
and the generating module is used for generating alarm information when the packet loss rate or the network delay of the link quality data is greater than an alarm threshold.
CN201910287217.8A 2019-04-10 2019-04-10 Distributed data center link monitoring method and system Active CN109981377B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910287217.8A CN109981377B (en) 2019-04-10 2019-04-10 Distributed data center link monitoring method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910287217.8A CN109981377B (en) 2019-04-10 2019-04-10 Distributed data center link monitoring method and system

Publications (2)

Publication Number Publication Date
CN109981377A CN109981377A (en) 2019-07-05
CN109981377B true CN109981377B (en) 2022-03-01

Family

ID=67084034

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910287217.8A Active CN109981377B (en) 2019-04-10 2019-04-10 Distributed data center link monitoring method and system

Country Status (1)

Country Link
CN (1) CN109981377B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112367212B (en) * 2020-08-03 2022-04-05 浪潮云信息技术股份公司 Virtual machine network quality monitoring method and system in cloud environment
CN113395356B (en) * 2021-07-06 2023-04-28 山东电力工程咨询院有限公司 Health monitoring method and system for data center
CN113572644B (en) * 2021-07-26 2024-01-23 武汉众邦银行股份有限公司 Internet cloud dial testing automatic monitoring method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001035601A1 (en) * 1999-11-10 2001-05-17 Rainfinity, Inc. Distributed traffic controlling system and method for network data
CN101883409A (en) * 2010-06-18 2010-11-10 西安电子科技大学 RSSI-and-LQI-based rapid switch method in wireless sensor network
CN102546251A (en) * 2011-12-31 2012-07-04 清华大学 Hybrid-detection-technology-based network topology discovery and automatic topology method and system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104270291B (en) * 2014-10-22 2018-05-01 网宿科技股份有限公司 CDN network quality control method
CN104683182B (en) * 2015-02-15 2018-04-10 上海帝联信息科技股份有限公司 IDC network of computer room service quality detection method and device
US9832128B1 (en) * 2017-03-20 2017-11-28 Engine Media, Llc Dynamic advertisement routing
CN107070726A (en) * 2017-05-22 2017-08-18 郑州云海信息技术有限公司 A kind of integrated management approach based on MDC

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001035601A1 (en) * 1999-11-10 2001-05-17 Rainfinity, Inc. Distributed traffic controlling system and method for network data
CN101883409A (en) * 2010-06-18 2010-11-10 西安电子科技大学 RSSI-and-LQI-based rapid switch method in wireless sensor network
CN102546251A (en) * 2011-12-31 2012-07-04 清华大学 Hybrid-detection-technology-based network topology discovery and automatic topology method and system

Also Published As

Publication number Publication date
CN109981377A (en) 2019-07-05

Similar Documents

Publication Publication Date Title
US10686671B1 (en) Methods, systems, and computer readable media for testing network elements of an in-band network telemetry capable network
US10057136B2 (en) Method and apparatus for visualized network operation and maintenance
CN109981377B (en) Distributed data center link monitoring method and system
US20180091413A1 (en) Network health data aggregation service
CN107508722B (en) Service monitoring method and device
US20030225876A1 (en) Method and apparatus for graphically depicting network performance and connectivity
US20140215077A1 (en) Methods and systems for detecting, locating and remediating a congested resource or flow in a virtual infrastructure
US20130305091A1 (en) Drag and drop network topology editor for generating network test configurations
EP2056559B1 (en) Method and system for network simulation
EP2486698B1 (en) Method and system for reconstructing transactions in a communication network
CN110635965B (en) IPv6 network quality monitoring method, equipment and storage medium
CN112350854B (en) Flow fault positioning method, device, equipment and storage medium
CN109960634A (en) A kind of method for monitoring application program, apparatus and system
CN108390907B (en) Management monitoring system and method based on Hadoop cluster
CN110912751A (en) Network equipment topological graph generation method and related device
CN107181626B (en) Method and system for monitoring network bandwidth of distributed storage cluster system
US8966321B2 (en) Logical port and layer protocol test configuration resource manager
CN108809677A (en) The system of testing characteristics of network
CN110753364A (en) Network monitoring method, system, electronic device and storage medium
CN106209456A (en) A kind of kernel state lower network fault detection method and device
Levin et al. Network Monitoring in Federated Cloud Environment
CN108933707B (en) Safety monitoring system and method for industrial network
CN112468336B (en) Network quality monitoring and early warning method, device, terminal and storage medium
US10904123B2 (en) Trace routing in virtual networks
CN111262728A (en) Flow load monitoring system based on log port flow

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant