CN112367212B - Virtual machine network quality monitoring method and system in cloud environment - Google Patents

Virtual machine network quality monitoring method and system in cloud environment Download PDF

Info

Publication number
CN112367212B
CN112367212B CN202010765628.6A CN202010765628A CN112367212B CN 112367212 B CN112367212 B CN 112367212B CN 202010765628 A CN202010765628 A CN 202010765628A CN 112367212 B CN112367212 B CN 112367212B
Authority
CN
China
Prior art keywords
network quality
server
information
virtual machine
end processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010765628.6A
Other languages
Chinese (zh)
Other versions
CN112367212A (en
Inventor
王东杰
方亚东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Cloud Information Technology Co Ltd
Original Assignee
Inspur Cloud Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Cloud Information Technology Co Ltd filed Critical Inspur Cloud Information Technology Co Ltd
Priority to CN202010765628.6A priority Critical patent/CN112367212B/en
Publication of CN112367212A publication Critical patent/CN112367212A/en
Application granted granted Critical
Publication of CN112367212B publication Critical patent/CN112367212B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04Network management architectures or arrangements
    • H04L41/042Network management architectures or arrangements comprising distributed management centres cooperatively managing the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0695Management of faults, events, alarms or notifications the faulty arrangement being the maintenance, administration or management system
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/22Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45595Network integration; Enabling network access in virtual machine instances

Abstract

The invention discloses a method and a system for monitoring the network quality of a virtual machine in a cloud environment, belongs to the technical field of monitoring the network quality of the virtual machine, and aims to solve the technical problem of how to realize the network quality detection of the virtual machine in the cloud environment. The method comprises the following steps: configuring a front-end processor according to the subnet; configuring a monitoring mode and detection information through a server interface; establishing network communication between a front-end processor and a server; the server side sends the virtual machine information of the virtual machine to be tested to the front-end processor; the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request and acquires the survival ports of the virtual machines; the front-end processor detects the network quality of the virtual machine according to the network quality detection request and the configured monitoring mode, and returns the network quality information and the alarm information to the server side; and carrying out fault processing through the alarm information pushed by the server side, and displaying historical network quality information through the server side. The system comprises: a front-end processor and a server.

Description

Virtual machine network quality monitoring method and system in cloud environment
Technical Field
The invention relates to the technical field of virtual machine network quality monitoring, in particular to a virtual machine network quality monitoring method and a virtual machine network quality monitoring system in a cloud environment.
Background
As cloud computing technology is gradually matured in government affairs and enterprise fields, the scale of a data center is gradually enlarged, and meanwhile, due to the particularity of the fields, different subnets are often required to be divided to carry different services. Dozens of virtual machines are fewer in the service subnet, and thousands of virtual machines are more in the service subnet; in a cloud environment, the network equipment not only relates to a traditional physical machine, a switch, a router and a network gate, but also relates to a virtual router and a virtual switch; the network quality of the virtual machine is important for the quality of cloud products, and monitoring, analysis and warning of the network quality of the virtual machine are called as the direction of research in the industry.
In the face of increasingly complex network environments, the traditional monitoring system simply analyzes the network quality condition through a ping command of an ICMP protocol. Through tests, if network quality detection is carried out only through ping commands, about one hundred thousand subnets with the number of virtual machines alarm by mistake every day, the main reason for generating alarm is network jitter, and meanwhile, the switch bears larger burden and has heavier influence on the network quality due to the fact that an ICMP packet is frequently sent in the whole network. Meanwhile, because the virtual machine exists in the host machine, the network quality problem of the virtual machine is not necessarily real physical equipment, and the problem positioning is challenged.
For the service subnet with smaller scale, obviously, the overhead caused by independently placing the front-end processor for the service subnet with smaller scale is overlarge; in the cloud environment, each subnet can be accessed on a network node, so that a smaller service subnet can be integrated on the monitoring system level and is regarded as a larger service subnet, and only the front-end processor of the service subnet needs to be placed on the network node.
In a TCP/IP protocol cluster, a stable network path is established by a TCP protocol, and reliable data link can be realized in a network with unstable network quality. In the cloud environment, all hosts can be brought into another management network, and the IPMI protocol is used for detecting the physical equipment state of the hosts. In conclusion, TCP can be used to detect port activity to assist in revising ping tests; and acquiring the network quality condition of the host machine, and facilitating further positioning.
Based on the above, how to implement the network quality detection of the virtual machine in the cloud environment is a technical problem to be solved.
Disclosure of Invention
The technical task of the invention is to provide a method and a system for monitoring the network quality of a virtual machine in a cloud environment, aiming at the defects, so as to solve the problem of how to realize the network quality detection of the virtual machine in the cloud environment.
In a first aspect, the present invention provides a method for monitoring network quality of a virtual machine in a cloud environment, including the following steps:
configuring a front-end processor according to a subnet, deploying a detection program for detecting the network quality of the virtual machine in the front-end processor, and deploying a server in a cloud center in a clustering manner, wherein the server performs control interaction with operation and maintenance personnel and tenants in the form of a server interface;
configuring a monitoring mode and detection information through a server interface, wherein the monitoring mode is used for limiting the detection mode of the virtual machine, including but not limited to whether network quality detection is performed or not, whether port detection is performed or not and the number of ping times, the detection information includes but not limited to alarm information, port information and an acquisition cycle, and the acquisition cycle includes a port acquisition cycle and a network quality acquisition cycle;
the front-end processor initiates a registration request to the server, and the server returns alarm queue information to the corresponding front-end processor according to the registration request to establish network communication between the front-end processor and the server;
according to the acquisition period, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
according to the port acquisition period, the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request, acquires the survival ports of the virtual machines and uploads the survival ports to the server side;
according to the network quality acquisition period, the server side initiates a network quality detection request to the front-end processor, the front-end processor performs network quality detection on the virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
and carrying out fault processing through the alarm information pushed by the server side, and displaying historical network quality information through the server side.
Preferably, the sub-networks comprise a service sub-network and an integrated service sub-network formed by integrating a plurality of service sub-networks;
configuring the front-end processor according to the subnet comprises:
deploying at least one front-end processor in each service subnet according to the number of virtual machines in the service subnets;
and integrating the smaller service subnets in the cloud center to obtain integrated service subnets, deploying at least one front-end processor in each integrated service subnet, and deploying the front-end processor in a cloud center network node for the integrated service subnets.
Preferably, the server is deployed in a clustering manner in the cloud center by the following method:
selecting a service subnet, and building a database cluster in a copy mode, wherein at least two databases in the database cluster are responsible for writing, and at least one database is responsible for reading;
building a server cluster in a copy mode, wherein at least two servers in the server cluster are built;
and deploying a server on the server, deploying Nginx service and configuring load balance on the server where the server is located, realizing high availability by using Keeplied, and providing access and limiting flow to the exposed API.
Preferably, the network quality information includes, but is not limited to, packet loss rate, time delay, and port activity.
Preferably, the method for detecting the network quality of the executing virtual machine by the front-end processor to obtain the network quality information and detecting whether to send the alarm and clear the alarm to obtain the alarm information includes the following steps:
acquiring information of a virtual machine to be detected according to the network quality detection request;
detecting the information of the virtual machines according to the needs, and grouping the virtual machines in the corresponding sub-networks;
and for each group of virtual machines, detecting whether the ports are communicated through ping, if so, sending alarm release information, if not, carrying out port activity detection on the virtual machines, if the ports are alive, sending the alarm release information, and if the ports are not alive, sending the alarm information.
Preferably, the monitoring mode further comprises whether to perform physical component activity detection.
Preferably, the fault processing is performed through the alarm information pushed by the server, and the fault processing method includes:
analyzing the abnormal generation content provided by the detection program, further analyzing whether a host corresponding to the virtual machine has a problem, and if so, further analyzing the physical equipment problem of the host;
when the virtual machine is positioned to generate false alarm due to network fluctuation, the next detection period is increased, and alarm information is sent only when a problem is detected in a plurality of detection periods;
if the virtual machine is disconnected or shut down, restarting the virtual machine through the virtualization platform;
if the network quality problem of the corresponding host is analyzed, the physical component state of the host is further analyzed, and if the host is shut down due to the power failure problem, the host is restarted after the fault is removed;
if the host machine physical component information is normal, network and operating system problems are further analyzed.
Preferably, the front-end processor is a selected virtual machine or an added physical host.
In a second aspect, the present invention provides a system for monitoring network quality of a virtual machine in a cloud environment, including:
the system comprises a front-end processor, a plurality of sub-network processors and a plurality of sub-network processors, wherein the front-end processor is internally provided with a detection program for detecting the network quality of the virtual machines, and each front-end processor is arranged in a corresponding sub-network;
the system comprises a server, a monitoring system and a monitoring system, wherein the server performs control interaction with operation and maintenance personnel and tenants in the form of a server interface and is used for configuring a monitoring mode and detection information, the monitoring mode is used for limiting the detection mode of the virtual machine, including but not limited to whether network quality detection is performed or not, whether port detection is performed or not and the number of ping times, the detection information includes but not limited to alarm information, port information and an acquisition cycle, and the acquisition cycle includes a port acquisition cycle and a network quality acquisition cycle;
the server is deployed in a cloud center in a clustered manner, each front-end processor performs data interaction with the server, and the front-end processors and the server are matched for executing the following operations:
the front-end processor initiates a registration request to the server, and the server returns alarm queue information to the corresponding front-end processor according to the registration request to establish network communication between the front-end processor and the server;
according to the acquisition period, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
according to the port acquisition period, the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request, acquires the survival ports of the virtual machines and uploads the survival ports to the server side;
according to the network quality acquisition period, the server side initiates a network quality detection request to the front-end processor, the front-end processor performs network quality detection on the virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
and the server side pushes the alarm information to process the fault and displays the historical network quality information.
Preferably, the server is deployed in the cloud center in a clustering manner in the following manner:
selecting a service subnet, and building a database cluster in a copy mode, wherein at least two databases in the database cluster are responsible for writing, and at least one database is responsible for reading;
building a server cluster in a copy mode, wherein at least two servers in the server cluster are built;
and deploying a server on the server, deploying Nginx service and configuring load balance on the server where the server is located, realizing high availability by using Keeplied, and providing access and limiting flow to the exposed API.
Preferably, the front-end processor is configured to perform network quality detection on the virtual machine according to the network quality detection request and according to the configured monitoring mode to obtain the network quality information, and detect whether to send an alarm or clear the alarm to obtain the alarm information, and includes the following steps:
acquiring information of a virtual machine to be detected according to the network quality detection request;
grouping the virtual machines in the corresponding sub-networks according to the information of the virtual machines to be detected;
and for each group of virtual machines, detecting whether the ports are communicated or not through ping according to a configured monitoring mode, if so, sending alarm removing information, if not, carrying out port activity detection on the virtual machines, if the ports are alive, sending the alarm removing information, and if the ports are not alive, sending the alarm information.
The method and the system for monitoring the network quality of the virtual machine in the cloud environment have the following advantages:
1. configuring a front-end processor on a subnet, configuring a monitoring mode and detection information on a service end in a cloud center cluster manner, detecting a corresponding virtual machine through a detection program built in the front-end processor, pushing alarm information through a service end interface and displaying historical network quality information, wherein the number of false alarms in an unstable network environment can be effectively reduced, and in an actual test, the number of false alarms is reduced by about 99%;
2. in a large-scale service subnet, a certain number of front-end computers are deployed according to the number of virtual machines, port information of the virtual machines is collected, virtual network quality detection of service subnets with different network segments and unconnected with each other is carried out, for a small-scale service subnet, the front-end computers are integrated on an abstract level of a system, a network node of a cloud center is placed on the front-end computers to monitor the integrated subnet, the front-end computers are deployed in service subnets or network nodes isolated by each network, the number of the front-end computers is flexibly allocated in each subnet according to the number of managed virtual machines, each front-end computer has complete functions, all tasks can be independently completed, a plurality of front-end computers are cooperatively matched, the detection efficiency can be transversely expanded, and the waste of front-end computer resources by the small service subnet is avoided;
3. when the network quality is detected, the method not only comprises the commonly used ping detection, but also comprises the detection of the survival of the port of the virtual machine, thereby improving the detection accuracy;
4. deploying a server in a certain service subnet of the cloud center, wherein the server faces operation and maintenance personnel through a server interface, supports the user-defined configuration of the operation and maintenance personnel, supports the display of virtual machine network state information and the pushing of virtual machine network fault alarms, is convenient for finding out faults in time and processing the faults, and realizes the HA high availability characteristic of the server through the clustered deployment of a database, a server and the like; the concurrent processing capacity of the server is improved through load balancing; the rapid deployment is realized through containerization, and the environment dependence is reduced;
5. by obtaining the network topological relation between the virtual machine and the host machine and the physical state information of the host machine, the reason for the network quality of the virtual machine can be quickly positioned, operation and maintenance personnel can be helped to quickly recover the state of the virtual machine, and the service can be continuously and efficiently provided to the outside;
6. the virtual machine is used as the granularity to dynamically adjust the detection strategy, so that no alarm caused by other irrelevant factors such as local network equipment, an operating system and the like can be effectively avoided, and the alarm accuracy is improved;
7. the server can display historical network quality information, namely the server stores the network quality information of the virtual machines, the historical network quality information of a single virtual machine is displayed in a visualized mode through the server interface, the historical information of the network condition of the virtual machine can be traced, fault reasons of other products can be further determined, and responsibility can be cleared.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed for the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.
The invention is further described below with reference to the accompanying drawings.
Fig. 1 is a block diagram of an architecture of a cloud center in which a plurality of service subnets exist between a front-end processor and a server in a method for monitoring network quality of a virtual machine in a cloud environment according to embodiment 1;
fig. 2 is a block diagram of a server in a method for monitoring network quality of a virtual machine in a cloud environment according to embodiment 1;
fig. 3 is a flowchart illustrating a deployment of a virtual machine network quality monitoring system in a method for monitoring virtual machine network quality in a cloud environment according to embodiment 1;
fig. 4 is a timing diagram of a virtual machine network quality monitoring system in a virtual machine network quality monitoring method in a cloud environment according to embodiment 1;
fig. 5 is a flow chart of detecting network quality of a front-end processor in a method for monitoring network quality of a virtual machine in a cloud environment according to embodiment 1.
Detailed Description
The present invention is further described in the following with reference to the drawings and the specific embodiments so that those skilled in the art can better understand the present invention and can implement the present invention, but the embodiments are not to be construed as limiting the present invention, and the embodiments and the technical features of the embodiments can be combined with each other without conflict.
The embodiment of the invention provides a method and a system for monitoring the network quality of a virtual machine in a cloud environment, which are used for solving the technical problem of how to realize the network quality detection of the virtual machine in the cloud environment.
Example 1:
the invention discloses a method for monitoring the network quality of a virtual machine in a cloud environment, which comprises the following steps:
s100, configuring a front-end processor according to a subnet, deploying a detection program for detecting the network quality of the virtual machine in the front-end processor, and deploying a server in a cloud center in a clustering manner, wherein the server performs control interaction with operation and maintenance personnel and tenants in the form of a server interface;
s200, configuring a monitoring mode and detection information through a server interface, wherein the monitoring mode is used for limiting the detection mode of the virtual machine, including but not limited to whether network quality detection is carried out or not, whether port detection is carried out or not and the number of ping times, the detection information includes but not limited to alarm information, port information and an acquisition cycle, and the acquisition cycle includes a port acquisition cycle and a network quality acquisition cycle;
s300, the front-end processor initiates a registration request to the server, the server returns alarm queue information to the corresponding front-end processor according to the registration request, the alarm queue information comprises a returned information uploading API and an alarm pushing API, and network communication between the front-end processor and the server is established;
s400, according to an acquisition cycle, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
s500, according to a port acquisition cycle, the server side initiates a port detection request to the front-end processor, the front-end processor scans the ports of the virtual machines according to the port detection request and acquires the survival ports of the virtual machines, and uploads the survival ports to the server side;
s600, according to a network quality acquisition period, a server side initiates a network quality detection request to a front-end processor, the front-end processor performs network quality detection on a virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
s700, fault processing is carried out through the alarm information pushed by the server side, and historical network quality information is displayed through the server side.
The subnets comprise a service subnet and an integrated service subnet formed by integrating a plurality of service subnets, and at least one front-end processor is arranged in each service subnet according to the number of virtual machines in the service subnet; and integrating the smaller service subnets in the cloud center to obtain integrated service subnets, deploying at least one front-end processor in each integrated service subnet, and deploying the front-end processor in a cloud center network node for the integrated service subnets. Different front-end processors are placed in the large-scale service subnets of the cloud center or the integrated subnets, port information of the virtual machines is collected, and virtual network quality detection of the service subnets which are in different network segments and are not connected with each other is carried out; and fitting the service subnetworks with smaller scale, integrating the service subnetworks at the abstract level of the system, placing the front-end processor in the network node of the cloud center to monitor the integrated subnetworks, and avoiding the waste of the front-end processor resources by the smaller service subnetworks.
The front-end processor can select a virtual machine or a physical host, and if the number of the virtual machines in the service subnet is large, the detection amount is large, and the physical host is preferentially selected.
And providing a plurality of RESTAPI interfaces at a cloud center cluster deployment server for registration of the front-end processor, release of detection information, collection, analysis and display of collected information, and network quality alarm and clearing of the virtual machine. The HA high availability is realized by the cluster deployment of components such as a server, a database, a message queue and the like; the concurrent processing capacity of the server is improved through load balancing; by the aid of the containerized deployment front-end processor, dependence on environment is reduced, and deployment efficiency is improved.
In this embodiment, the server is deployed in a cloud center cluster by the following method: selecting a service subnet, and building a database cluster in a copy mode, wherein three databases are configured in the database cluster, one database is responsible for writing, and the other two databases are responsible for reading; the method comprises the steps of using three servers to deploy a server, deploying Nginx services on the server where the server is located, configuring load balancing, using Keeplied to achieve high availability, providing access to an exposed API, and limiting flow.
After the front-end processor and the server are deployed in the cloud center through the step S100, the step S200 is executed, the operation and maintenance personnel and the tenant can configure through the server interface, the operation and maintenance personnel can customize the monitoring mode of the virtual machine, including the ping times, whether to perform port activity detection and detect the port, whether to perform physical component activity detection, and the like, the tenant can configure detection information, such as alarm information, port information, whether to include the port for management and actually perform activity detection in the acquisition period, and the like, the acquisition period includes the port acquisition period and the network quality acquisition period, and meanwhile, the server interface can display real-time alarm information, provide the network topology relationship between the virtual machine and the host, and further help to locate the network problem.
After the configuration is completed, the network quality of the virtual machine is monitored, and the monitoring time sequence is shown in fig. 4.
Firstly, the front-end processor initiates a registration request to the server, the server returns the API for information uploading and alarm pushing of the server to the corresponding front-end processor according to the registration request, and network communication between the front-end processor and the server is established.
After the server is communicated with the front-end processor, step S400 is executed, the server counts the number of the front-end processors in a subnet, and sends the virtual machine information of the virtual machine to be tested to the front-end processor according to the acquisition period, wherein the virtual machine information comprises the network topology relationship between the virtual machine and the host machine, so as to further help to position the network problem. As shown in fig. 4, the server initiates a port detection message to the front-end processor, and the front-end processor scans the virtual machine ports, including the network port and the service port, according to the port detection message, and then uploads the port information to the server.
The front-end processor not only needs to detect the network quality, but also needs to collect the survival port of the virtual machine, and because in a cloud environment, a client usually needs to operate the virtual machine through a specific management protocol, the realization of the management protocol is generally based on a TCP protocol, and the management port is inevitably exposed, so that the scanning is preferentially performed on the management port. In step S500, according to the port acquisition period, the server initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines and acquires the live ports of the virtual machines according to the port detection request, and uploads the live ports to the server. The front-end processor scans ports for the virtual machine, the ports for providing services by the virtual machine cannot be easily changed, and in a cloud environment, when a cloud operator provides the virtual machine for a tenant, several fixed ports are often used, so that in summary, the scanning period of the service ports can be long, and can be set to 12 hours or 24 hours in actual production; the common management port is preferentially scanned.
After the port is detected, step S600 is executed, according to the network quality acquisition period of the virtual machine customized by the user, the server sends a request for network quality detection to the front-end processor, the front-end processor detects network quality information of the executing virtual machine, such as information of packet loss rate, time delay, port activity, and the like, and detects whether to send an alarm and clear an alarm, and sends all information to the server, thereby completing one detection.
The front-end processor carries out network quality detection aiming at the virtual machine, and the network quality detection comprises conventional ping detection and activity detection of a port, and the reason is that in a cloud environment with large network fluctuation, a large amount of error alarms can be generated only by the ping detection, a large amount of ICMP data packets can appear in the environment, and the accuracy can be improved through the port activity detection on the basis of the ping detection.
The front-end processor performs network quality detection on the virtual machine, and the specific steps are as shown in fig. 5:
firstly, acquiring information of a virtual machine to be detected according to a network quality detection request;
then, detecting the information of the virtual machines according to the needs, and grouping the virtual machines in the corresponding sub-networks;
and finally, for each group of virtual machines, detecting whether the ports are communicated through ping, if so, sending alarm release information, if not, carrying out port activity detection on the virtual machines, if the ports survive, sending the alarm release information, and if the ports do not survive, sending the alarm information.
The ping detection is firstly carried out, and the port activity detection is carried out only under the condition that the ping detection is not successful, so that a large number of ICMP message packets in the network can be effectively avoided, and meanwhile, the operation efficiency cannot be reduced due to long time of the port activity detection.
The network quality information and the alarm information are displayed through the server and are stored in the corresponding server, and operation and maintenance personnel perform fault processing through the alarm information pushed by the server and display historical network quality information through the server.
In this embodiment, the fault processing is performed by the following steps:
(1) analyzing the abnormal generation content provided by the detection program, further analyzing whether a host corresponding to the virtual machine has a problem, and if so, further analyzing the physical equipment problem of the host;
(2) when the virtual machine is positioned to generate false alarm due to network fluctuation, the next detection period is increased, and alarm information is sent only when a problem is detected in a plurality of detection periods;
(3) if the virtual machine is disconnected or shut down, restarting the virtual machine through the virtualization platform;
(4) if the network quality problem of the corresponding host is analyzed, the physical component state of the host is further analyzed, and if the host is shut down due to the power failure problem, the host is restarted after the fault is removed;
(5) if the host machine physical component information is normal, network and operating system problems are further analyzed.
The invention relates to a method for monitoring the network quality of a virtual machine in a cloud environment.A server management cluster is deployed in a cloud center and used for managing front-end computers in different subnets and providing services for operation and maintenance personnel and tenants; operation and maintenance personnel and tenants can manage the network quality detection strategy of each virtual machine through an interface or an interface provided by the server, acquire the network quality condition of the virtual machine, receive and push the network quality alarm of the virtual machine in various modes.
In the execution, a small service subnet of a cloud center is integrated, all virtual machines are divided into a plurality of subnets for detection, for the integrated subnet, a front-end processor is placed at a network node of the cloud center, one or more front-end processors are deployed in the subnet according to the number of the virtual machines and used for detecting the network quality of the virtual machines in the respective network, and the front-end processors scan the network and service ports of all the virtual machines in the whole subnet; the client side of the front-end processor pushes the acquired information to the server side, the server side collects, stores and displays the network quality information of all the subnets of the whole cloud center, and the information of any virtual machine is not stored in the front-end processor, so that the transverse expansion and the upgrading deployment are facilitated.
Example 2:
the invention relates to a virtual machine network quality monitoring system in a cloud environment, which comprises a plurality of front-end computers and a server side, wherein a detection program for detecting the network quality of the virtual machines is deployed in each front-end computer, and each front-end computer is deployed in a corresponding subnet; the server side performs control interaction with operation and maintenance personnel and tenants in the form of a server side interface, and is used for configuring a monitoring mode and detection information, wherein the monitoring mode is used for limiting the detection mode of the virtual machine, including but not limited to whether network quality detection is performed or not, whether port detection is performed or not and the number of ping times, the detection information includes but not limited to alarm information, port information and an acquisition period, and the acquisition period includes a port acquisition period and a network quality acquisition period.
The server is deployed in a cloud center in a clustered manner, each front-end processor performs data interaction with the server, and the front-end processors and the server are matched for executing the following operations:
(1) the front-end processor initiates a registration request to the server, and the server returns alarm queue information to the corresponding front-end processor according to the registration request to establish network communication between the front-end processor and the server;
(2) according to the acquisition period, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
(3) according to the port acquisition period, the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request, acquires the survival ports of the virtual machines and uploads the survival ports to the server side;
(4) according to the network quality acquisition period, the server side initiates a network quality detection request to the front-end processor, the front-end processor performs network quality detection on the virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
(5) and the server side pushes the alarm information to process the fault and displays the historical network quality information.
The subnets comprise a service subnet and an integrated service subnet formed by integrating a plurality of service subnets, and at least one front-end processor is arranged in each service subnet according to the number of virtual machines in the service subnet; and integrating the smaller service subnets in the cloud center to obtain integrated service subnets, deploying at least one front-end processor in each integrated service subnet, and deploying the front-end processor in a cloud center network node for the integrated service subnets. Different front-end processors are placed in the large-scale service subnets of the cloud center or the integrated subnets, port information of the virtual machines is collected, and virtual network quality detection of the service subnets which are in different network segments and are not connected with each other is carried out; and fitting the service subnetworks with smaller scale, integrating the service subnetworks at the abstract level of the system, placing the front-end processor in the network node of the cloud center to monitor the integrated subnetworks, and avoiding the waste of the front-end processor resources by the smaller service subnetworks.
The front-end processor can select a virtual machine or a physical host, and if the number of the virtual machines in the service subnet is large, the detection amount is large, and the physical host is preferentially selected.
And providing a plurality of RESTAPI interfaces at a cloud center cluster deployment server for registration of the front-end processor, release of detection information, collection, analysis and display of collected information, and network quality alarm and clearing of the virtual machine. The HA high availability is realized by the cluster deployment of components such as a server, a database, a message queue and the like; the concurrent processing capacity of the server is improved through load balancing; by the aid of the containerized deployment front-end processor, dependence on environment is reduced, and deployment efficiency is improved.
In this embodiment, the server is deployed in a cloud center cluster by the following method: selecting a service subnet, and building a database cluster in a copy mode, wherein three databases are configured in the database cluster, one database is responsible for writing, and the other two databases are responsible for reading; the method comprises the steps of using three servers to deploy a server, deploying Nginx services on the server where the server is located, configuring load balancing, using Keeplied to achieve high availability, providing access to an exposed API, and limiting flow.
The front-end processor is used for detecting the network quality of the virtual machine according to the network quality detection request and the configured monitoring mode to obtain the network quality information and detecting whether to send an alarm or clear the alarm to obtain the alarm information, and the method comprises the following steps:
(1) acquiring information of a virtual machine to be detected according to the network quality detection request;
(2) grouping the virtual machines in the corresponding sub-networks according to the information of the virtual machines to be detected;
(3) and for each group of virtual machines, detecting whether the ports are communicated or not through ping according to a configured monitoring mode, if so, sending alarm removing information, if not, carrying out port activity detection on the virtual machines, if the ports are alive, sending the alarm removing information, and if the ports are not alive, sending the alarm information.
The virtual machine network quality monitoring system in the cloud environment can execute the virtual machine network quality monitoring method in the cloud environment disclosed in embodiment 1.
It should be noted that not all steps and modules in the above flows and system structure diagrams are necessary, and some steps or modules may be omitted according to actual needs. The execution order of the steps is not fixed and can be adjusted as required. The system structure described in the above embodiments may be a physical structure or a logical structure, that is, some modules may be implemented by the same physical entity, or some modules may be implemented by a plurality of physical entities, or some components in a plurality of independent devices may be implemented together.
While the invention has been shown and described in detail in the drawings and in the preferred embodiments, it is not intended to limit the invention to the embodiments disclosed, and it will be apparent to those skilled in the art that various combinations of the code auditing means in the various embodiments described above may be used to obtain further embodiments of the invention, which are also within the scope of the invention.

Claims (10)

1. The method for monitoring the network quality of the virtual machine in the cloud environment is characterized by comprising the following steps:
configuring a front-end processor according to a subnet, deploying a detection program for detecting the network quality of the virtual machine in the front-end processor, and deploying a server in a cloud center in a clustering manner, wherein the server performs control interaction with operation and maintenance personnel and tenants in the form of a server interface;
configuring a monitoring mode and detection information through a server side interface, wherein the monitoring mode is used for limiting the detection mode of the virtual machine, including whether network quality detection is carried out or not, whether port detection is carried out or not and the number of ping times, the detection information comprises alarm information, port information and an acquisition cycle, and the acquisition cycle comprises a port acquisition cycle and a network quality acquisition cycle;
the front-end processor initiates a registration request to the server, and the server returns alarm queue information to the corresponding front-end processor according to the registration request to establish network communication between the front-end processor and the server;
according to the acquisition period, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
according to the port acquisition period, the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request, acquires the survival ports of the virtual machines and uploads the survival ports to the server side;
according to the network quality acquisition period, the server side initiates a network quality detection request to the front-end processor, the front-end processor performs network quality detection on the virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
and carrying out fault processing through the alarm information pushed by the server side, and displaying historical network quality information through the server side.
2. The method for monitoring the network quality of the virtual machine in the cloud environment according to claim 1, wherein the sub-networks include a service sub-network and an integrated service sub-network formed by integrating a plurality of service sub-networks;
configuring the front-end processor according to the subnet comprises:
deploying at least one front-end processor in each service subnet according to the number of virtual machines in the service subnets;
and integrating the service subnetworks meeting the threshold value in the cloud center to obtain integrated service subnetworks, deploying at least one front-end processor in each integrated service subnet, and deploying the front-end processor in a cloud center network node for the integrated service subnetworks.
3. The method for monitoring the network quality of the virtual machine in the cloud environment according to claim 1, wherein the server is deployed in a cloud center cluster manner by the following method:
selecting a service subnet, and building a database cluster in a copy mode, wherein at least two databases in the database cluster are responsible for writing, and at least one database is responsible for reading;
building a server cluster in a copy mode, wherein at least two servers in the server cluster are built;
and deploying a server on the server, deploying Nginx service and configuring load balance on the server where the server is located, realizing high availability by using Keeplied, and providing access and limiting flow to the exposed API.
4. The method for monitoring the network quality of the virtual machine in the cloud environment according to claim 1 or 2, wherein the network quality information includes a packet loss rate, a time delay and a port activity;
the front-end processor detects the network quality of the executing virtual machine to obtain the network quality information, and detects whether to send an alarm and clear the alarm to obtain the alarm information, and the method comprises the following steps:
acquiring information of a virtual machine to be detected according to the network quality detection request;
detecting the information of the virtual machines according to the needs, and grouping the virtual machines in the corresponding sub-networks;
and for each group of virtual machines, detecting whether the ports are communicated through ping, if so, sending alarm release information, if not, carrying out port activity detection on the virtual machines, if the ports are alive, sending the alarm release information, and if the ports are not alive, sending the alarm information.
5. The method for monitoring the network quality of the virtual machine in the cloud environment according to claim 4, wherein the monitoring mode further includes whether to perform activity detection on the physical components.
6. The method for monitoring the network quality of the virtual machine in the cloud environment according to claim 5, wherein the fault processing is performed through the alarm information pushed by the server, and the method comprises the following steps:
analyzing the abnormal generation content provided by the detection program, further analyzing whether a host corresponding to the virtual machine has a problem, and if so, further analyzing the physical equipment problem of the host;
when the virtual machine is positioned to generate false alarm due to network fluctuation, the next detection period is increased, and alarm information is sent only when a problem is detected in a plurality of detection periods;
if the virtual machine is disconnected or shut down, restarting the virtual machine through the virtualization platform;
if the network quality problem of the corresponding host is analyzed, the physical component state of the host is further analyzed, and if the host is shut down due to the power failure problem, the host is restarted after the fault is removed;
if the host machine physical component information is normal, network and operating system problems are further analyzed.
7. The method for monitoring the network quality of the virtual machines in the cloud environment according to claim 1 or 2, wherein the front-end processor is a selected virtual machine or an added physical host.
8. Virtual machine network quality monitored control system under cloud environment, its characterized in that includes:
the system comprises a front-end processor, a plurality of sub-network processors and a plurality of sub-network processors, wherein the front-end processor is internally provided with a detection program for detecting the network quality of the virtual machines, and each front-end processor is arranged in a corresponding sub-network;
the system comprises a server, a monitoring system and a monitoring system, wherein the server performs control interaction with operation and maintenance personnel and tenants in a server interface mode and is used for configuring a monitoring mode and detection information, the monitoring mode is used for limiting a detection mode of a virtual machine, and comprises whether network quality detection is performed or not, whether port detection is performed or not and the number of ping times, the detection information comprises alarm information, port information and an acquisition cycle, and the acquisition cycle comprises a port acquisition cycle and a network quality acquisition cycle;
the server is deployed in a cloud center in a clustered manner, each front-end processor performs data interaction with the server, and the front-end processors and the server are matched for executing the following operations:
the front-end processor initiates a registration request to the server, and the server returns alarm queue information to the corresponding front-end processor according to the registration request to establish network communication between the front-end processor and the server;
according to the acquisition period, the server side sends virtual machine information of a virtual machine to be detected to the front-end processor, wherein the virtual machine information comprises a network topology relation between the virtual machine and a host machine;
according to the port acquisition period, the server side initiates a port detection request to the front-end processor, and the front-end processor scans the ports of the virtual machines according to the port detection request, acquires the survival ports of the virtual machines and uploads the survival ports to the server side;
according to the network quality acquisition period, the server side initiates a network quality detection request to the front-end processor, the front-end processor performs network quality detection on the virtual machine according to the network quality detection request and a configured monitoring mode to obtain network quality information, detects whether to send an alarm or clear the alarm to obtain alarm information, and returns the network quality information and the alarm information to the server side;
and the server side pushes the alarm information to process the fault and displays the historical network quality information.
9. The system for monitoring the network quality of the virtual machine in the cloud environment according to claim 8, wherein the server is deployed in the cloud center in a clustering manner in the following manner:
selecting a service subnet, and building a database cluster in a copy mode, wherein at least two databases in the database cluster are responsible for writing, and at least one database is responsible for reading;
building a server cluster in a copy mode, wherein at least two servers in the server cluster are built;
and deploying a server on the server, deploying Nginx service and configuring load balance on the server where the server is located, realizing high availability by using Keeplied, and providing access and limiting flow to the exposed API.
10. The system for monitoring the network quality of the virtual machines in the cloud environment according to claim 8 or 9, wherein the front-end processor is configured to perform network quality detection on the virtual machines according to the network quality detection request and the configured monitoring method to obtain network quality information, and detect whether to send an alarm or clear the alarm to obtain alarm information, and the system comprises the following steps:
acquiring information of a virtual machine to be detected according to the network quality detection request;
grouping the virtual machines in the corresponding sub-networks according to the information of the virtual machines to be detected;
and for each group of virtual machines, detecting whether the ports are communicated or not through ping according to a configured monitoring mode, if so, sending alarm removing information, if not, carrying out port activity detection on the virtual machines, if the ports are alive, sending the alarm removing information, and if the ports are not alive, sending the alarm information.
CN202010765628.6A 2020-08-03 2020-08-03 Virtual machine network quality monitoring method and system in cloud environment Active CN112367212B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010765628.6A CN112367212B (en) 2020-08-03 2020-08-03 Virtual machine network quality monitoring method and system in cloud environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010765628.6A CN112367212B (en) 2020-08-03 2020-08-03 Virtual machine network quality monitoring method and system in cloud environment

Publications (2)

Publication Number Publication Date
CN112367212A CN112367212A (en) 2021-02-12
CN112367212B true CN112367212B (en) 2022-04-05

Family

ID=74516588

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010765628.6A Active CN112367212B (en) 2020-08-03 2020-08-03 Virtual machine network quality monitoring method and system in cloud environment

Country Status (1)

Country Link
CN (1) CN112367212B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115348194A (en) * 2022-08-12 2022-11-15 中国工商银行股份有限公司 Monitoring method and device based on business simulation, electronic equipment and storage medium
CN116974857B (en) * 2023-09-21 2024-01-23 中国西安卫星测控中心 Automatic deployment and update method and system for monitoring agent

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109274557A (en) * 2018-11-14 2019-01-25 江苏鸿信系统集成有限公司 Intelligent CMDB management and cloud host monitor method under a kind of cloud environment
CN109981377A (en) * 2019-04-10 2019-07-05 北京搜狐新媒体信息技术有限公司 A kind of distributive data center link monitoring method and system
CN110311818A (en) * 2019-06-29 2019-10-08 浪潮卓数大数据产业发展有限公司 A kind of abnormal data alarm method based on netty

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8966074B1 (en) * 2013-09-13 2015-02-24 Network Kinetix, LLC System and method for real-time analysis of network traffic
US10362098B2 (en) * 2016-06-21 2019-07-23 Facebook, Inc. Load balancing back-end application services utilizing derivative-based cluster metrics

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109274557A (en) * 2018-11-14 2019-01-25 江苏鸿信系统集成有限公司 Intelligent CMDB management and cloud host monitor method under a kind of cloud environment
CN109981377A (en) * 2019-04-10 2019-07-05 北京搜狐新媒体信息技术有限公司 A kind of distributive data center link monitoring method and system
CN110311818A (en) * 2019-06-29 2019-10-08 浪潮卓数大数据产业发展有限公司 A kind of abnormal data alarm method based on netty

Also Published As

Publication number Publication date
CN112367212A (en) 2021-02-12

Similar Documents

Publication Publication Date Title
US11128550B2 (en) Logical network traffic analysis
US10389596B2 (en) Discovering application topologies
US7203173B2 (en) Distributed packet capture and aggregation
JP6835444B2 (en) Software-defined data center and service cluster scheduling method and traffic monitoring method for that purpose
US7096264B2 (en) Network analyzer having distributed packet replay and triggering
US10956832B2 (en) Training a data center hardware instance network
CN112367212B (en) Virtual machine network quality monitoring method and system in cloud environment
CN110855493B (en) Application topological graph drawing device for mixed environment
CN111970146B (en) Monitoring platform and monitoring method for SRDC whole cabinet nodes
CN112714013B (en) Application fault positioning method in cloud environment
US20070258476A1 (en) Apparatus and method for locating trouble occurrence position in communication network
JP5287402B2 (en) Network monitoring and control device
CN105165054A (en) Method for processing network service faults, service management system and system management module
CN108449210B (en) Network routing fault monitoring system
CN108737163B (en) SDN controller application performance analysis method based on OpenFlow protocol
CN112350854A (en) Flow fault positioning method, device, equipment and storage medium
CN101252477B (en) Determining method and analyzing apparatus of network fault root
US20040158780A1 (en) Method and system for presenting neighbors of a device in a network via a graphical user interface
CN111786830B (en) Network quality simulation method in cloud computing environment
CN107888438B (en) Flow table technology-based system for automatically sensing and adapting to cloud environment change
KR100500836B1 (en) Fault management system of metro ethernet network and method thereof
CN112884176B (en) Management system and method
CN112910726A (en) Cloud environment flow monitoring method, device and system
EP3190472A2 (en) System for analyzing an industrial control network
WO2013001655A1 (en) Route search program and information processing device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant