CN110995868A

CN110995868A - Load balancing method and device

Info

Publication number: CN110995868A
Application number: CN201911330103.3A
Authority: CN
Inventors: 杜志阔
Original assignee: Unicloud Nanjing Digital Technology Co Ltd
Current assignee: Unicloud Nanjing Digital Technology Co Ltd
Priority date: 2019-12-20
Filing date: 2019-12-20
Publication date: 2020-04-10

Abstract

The invention discloses a load balancing method and a load balancing system.A performance monitoring module is added in a Pool module, and the performance monitoring module is respectively connected with a main group and a standby group in the Pool module and is simultaneously connected with a scheduling module in the Pool module. The method comprises the steps of monitoring the operation indexes of the back-end servers in the main group, calculating the current load state of each back-end server according to the monitored operation indexes of the back-end servers in the main group, monitoring the load condition of the back-end servers in the main group, and switching to the standby group when the back-end servers in the main group are fully loaded or unavailable, so that the complex application scene with high performance and high reliability can be supported.

Description

Load balancing method and device

Technical Field

The invention relates to the technical field of computer networks, in particular to a load balancing method and a load balancing device.

Background

With the rapid development of the Internet and the continuous improvement of the business volume, the data access flow based on the network is rapidly increased; meanwhile, the server website provides more and more abundant contents and information for the visitor by means of application programs such as HTTP, FTP, SMTP and the like, and the server is gradually submerged by data; in addition, most web sites (especially those for e-commerce) need to provide uninterrupted 24-hour service, and any service interruption or loss of critical data in the communication results in a direct business loss. All this puts demands on high performance and high reliability of the application services.

Under the condition of multiple AZs in a cloud computing scene, due to the fact that indexes such as time delay of servers of different AZs are different, in order to better improve user experience of a terminal user, some customers propose that load balancing is expected to preferentially use the servers in the same AZ; and when the back-end server in the AZ fails or the AZ fails and is unavailable, the back-end server of the standby AZ is started.

The demand that a part of servers with good service indexes are preferentially used is met with certain universality when the servers with good service indexes are fully loaded or have faults and some standby servers are reused. For example, in a hybrid cloud scenario, a part of servers are deployed on physical machines, a part of machines are deployed on the cloud, and when traffic is not very large, some customers propose to preferentially use the servers deployed on the physical machines. When the physical machine server is fully loaded or fails, the server on the cloud is started, so that the terminal user experience is preferentially ensured, and the requirement of high-availability deployment of services can be met.

As shown in fig. 1, the existing load balancing system mainly includes the following scheduling algorithms of a scheduling module in the prior art:

round Robin Scheduling algorithm: and the requests are distributed to different servers or links in sequence, so that each real server or link shares the connection request of the user on average.

Weighted Round Robin Scheduling (Weighted Round Robin Scheduling) algorithm: and distributing the requests to different back-end servers in sequence according to the weight, wherein the requests with large weights are distributed more, and the requests with small weights are distributed less.

Hash (Source IP Hashing scheduling) algorithm based on source IP: requests from the same source IP are mapped onto a server or link by a Hash function.

The current technical solutions are all directed to a common network mode, such as using a nginx web server to perform load balancing or using a Haproxy to implement load balancing, and although these load balancing services also provide a richer scheduling algorithm, they cannot meet this complex demand scenario, for example, a physical server mentioned by a certain client is preferentially accessed to the local of the client, and when the physical server is fully loaded or fails, the physical server is accessed to a server deployed in the cloud.

Disclosure of Invention

The purpose of the invention is as follows: aiming at the defects in the prior art, the invention provides a load balancing system and a method thereof.

The technical scheme is as follows:

a method of load balancing, comprising the steps of:

(1) adding a performance monitoring module in the Pool module, wherein the performance monitoring module is respectively connected with a main group and a standby group in the Pool module and is simultaneously connected with a scheduling module in the Pool module;

(2) the performance monitoring module acquires the operation indexes of the back-end servers in the main group at preset time intervals, and then acquires the operation indexes of the back-end servers in the main group according to the acquired operation indexes, including CPU utilization rate and memory utilization rate; calculating the current load state of each rear-end server, and sending the current load state to the scheduling module; wherein, the load state comprises three states of full load, normal and unavailable;

(3) and a scheduling module in the Pool module receives the current load state of each back-end server in the main group sent by the performance monitoring module, and allocates the main group and the standby group according to the load state and actual requirements of the back-end servers in the main group.

In the step (2), the current load state of each back-end server is calculated as follows:

(21) define the status monitor Value: if the performance monitoring module cannot obtain the feedback of the back-end server, Value is-1, and the back-end server is unavailable at this time; if the performance monitoring module can obtain the feedback of the back-end server, Value ═ is (CPU utilization rate < set threshold) & (memory utilization rate < set threshold);

(22) calculating the Value of each backend server Value, and if the Value is-1, the backend server is unavailable; if Value is 1, the back-end server is normal; if Value is 0, the backend server is already fully loaded.

The allocation of the active set and the standby set according to the actual requirement is as follows:

(31) if the scheduling mode of the current Pool module is set to be a mixed mode of the standby group and the main group, the standby group does not need to be specially configured;

(32) if the current Pool module scheduling mode is set to preferentially use the main group mode, at this time:

when the state monitoring Value of each back-end server in the main group is-1, starting the standby server group;

and when the state monitoring Value of each back-end server in the main group is 0, starting the standby server group.

In the step (21), the threshold is set as follows: the CPU utilization rate is 85%; the threshold is set as follows: and 85% of memory.

A load balancing system comprises a message receiving module, a monitor module, a Pool selecting module and a plurality of Pool modules; the message receiving module, the listener module and the Pool selecting module are sequentially connected, and the Pool selecting module is respectively connected with all the Pool modules;

the Pool module comprises a scheduling module, a performance monitoring module, a main group and a standby group, wherein the main group and the standby group are respectively provided with a plurality of back-end servers; the scheduling module is respectively connected with the main group and the standby group, the performance monitoring module is respectively connected with the main group and the standby group, and the performance monitoring module is connected with the scheduling module;

the performance monitoring module acquires operation indexes of a back-end server in the main group at preset time intervals, wherein the operation indexes comprise CPU utilization rate and memory utilization rate; calculating the current load state of each back-end server according to the acquired operation indexes of the back-end servers in the main group, and sending the current load state to the scheduling module; wherein, the load state comprises three states of full load, normal and unavailable;

the scheduling module receives the current load state of each back-end server in the main group sent by the performance monitoring module, and allocates the main group and the standby group according to the load state and the actual demand of the back-end servers in the main group.

The performance monitoring module calculates the current load state of each back-end server according to the acquired operation index of the back-end server in the main group, specifically as follows:

obtaining the current load state of each back-end server by calculating a state monitoring Value, wherein if the performance detection module cannot obtain the feedback of the back-end server, the back-end server is unavailable when the state monitoring Value is-1; if the detection program can obtain the feedback of the back-end server, Value ═ is (CPU utilization rate < set threshold) & (memory utilization rate < set threshold); if Value is 1, the back-end server is normal; if Value is 0, the backend server is already fully loaded.

if the scheduling mode of the current Pool module is set to be a mixed mode of the standby group and the main group, the standby group does not need to be specially configured;

if the current Pool module scheduling mode is set to preferentially use the main group mode, at this time:

when the state monitoring Value of each back-end server in the main group is-1, starting the standby server group; and when the state monitoring Value of each back-end server in the main group is 0, starting the standby server group.

The set threshold in the status monitoring Value is as follows: the CPU utilization rate is 85%; the threshold is set as follows: and 85% of memory.

Has the advantages that: the invention adds a performance monitoring module in the Pool module to monitor the operation index of the back-end server in the main group, calculates the current load state of each back-end server according to the monitored operation index of the back-end server in the main group, further monitors the load condition of the back-end server in the main group, and switches to the standby group when the back-end server in the main group is fully loaded or unavailable, thereby being capable of supporting the complex application scene with high performance and high reliability.

Drawings

Fig. 1 is a diagram illustrating internal logic processing of a load balancing system in the prior art.

Fig. 2 is a processing diagram of the internal logic of the load balancing system of the present invention.

Detailed Description

The invention is further elucidated with reference to the drawings and the embodiments.

Fig. 2 is a processing diagram of the internal logic of the load balancing system of the present invention. As shown in fig. 2, the load balancing method of the present invention includes the steps of:

(2) the performance monitoring module acquires the operation indexes of the back-end servers in the main group at preset time intervals, calculates the current load state of each back-end server according to the acquired operation indexes of the back-end servers in the main group, and sends the current load state to the scheduling module; the indexes obtained from each back-end server in the main group comprise CPU utilization rate and memory utilization rate;

the specific calculation of the current load state of each back-end server is as follows:

(21) define the status monitor Value: if the performance monitoring module cannot obtain the feedback of the back-end server, Value is-1; (the backend server is considered unavailable); if the detection program can obtain the feedback of the back-end server, Value ═ is (CPU utilization rate < set threshold) & (memory utilization rate < set threshold);

(22) if the Value of each backend server Value is calculated, if the Value is-1, the backend server is unavailable; if Value is 1, the back-end server is normal; if Value is 0, the backend server is fully loaded;

(3) a scheduling module in the Pool module receives the current load state of each back-end server in the main group sent by the performance monitoring module, and allocates the main group and the standby group according to actual requirements;

the specific distribution is as follows:

The invention also provides a load balancing system, which comprises a message receiving module, a monitor module, a Pool selecting module and a plurality of Pool modules; the message receiving module, the listener module and the Pool selecting module are sequentially connected, and the Pool selecting module is respectively connected with all the Pool modules.

The Pool module comprises a scheduling module, a performance monitoring module, a main group and a standby group, wherein the main group and the standby group are respectively provided with a plurality of back-end servers which can be physical machines or virtual machines; the scheduling module is respectively connected with the main group and the standby group, the performance monitoring module is respectively connected with the main group and the standby group, and the performance monitoring module is connected with the scheduling module.

The performance monitoring module acquires the operation indexes of the back-end servers in the main group at preset time intervals, calculates the current load state of each back-end server according to the acquired operation indexes of the back-end servers in the main group, and sends the current load state to the scheduling module; in the present invention, the preset time interval is set according to specific requirements, and in a specific embodiment of the present invention, the preset time interval is set to 1 s.

The indexes required to be obtained from each physical server comprise CPU utilization rate and memory utilization rate;

if the server is not available: value ═ 1; when Value is 1, the physical server is considered normal, and when Value is 0, the server is considered full. In the present invention, the threshold is set as follows: the CPU utilization rate is 85%; the threshold is set as follows: and 85% of memory.

The scheduling module receives the current load state of each back-end server in the main group sent by the performance monitoring module, and allocates the main group and the standby group according to actual requirements;

the triggering condition for the standby group activation can be configured as follows:

(1) the standby group and the main group are used in a mixed mode: under such usage, the primary and standby groupings have no substantial significance;

(2) when the main server group is unavailable or the whole server group fails, starting the standby server group, wherein the condition to be configured is that the state monitoring Value of each physical server is-1;

or when the main server group is fully loaded, starting the standby server group, and configuring the condition that the state monitoring Value of each physical server in the main server group is 0. The invention adds a performance monitoring module in the Pool module to monitor the operation index of the back-end server in the main group, calculates the current load state of each back-end server according to the monitored operation index of the back-end server in the main group, further monitors the load condition of the back-end server in the main group, and switches to the standby group when the back-end server in the main group is fully loaded or unavailable, thereby being capable of supporting the complex application scene with high performance and high reliability.

Although the preferred embodiments of the present invention have been described in detail, the present invention is not limited to the details of the foregoing embodiments, and various equivalent changes (such as number, shape, position, etc.) may be made to the technical solution of the present invention within the technical spirit of the present invention, and the equivalents are protected by the present invention.

Claims

1. A method of load balancing, characterized by: the method comprises the following steps:

2. The load balancing method according to claim 1, wherein: in the step (2), the current load state of each back-end server is calculated as follows:

3. The load balancing method according to claim 2, wherein: in the step (3), the allocation of the active set and the standby set according to the actual requirement is as follows:

4. The load balancing method according to claim 2, wherein: in the step (21), the threshold is set as follows: the CPU utilization rate is 85%; the threshold is set as follows: and 85% of memory.

5. A load balancing system, characterized by: the system comprises a message receiving module, a listener module, a Pool selecting module and a plurality of Pool modules; the message receiving module, the listener module and the Pool selecting module are sequentially connected, and the Pool selecting module is respectively connected with all the Pool modules;

6. The load balancing system of claim 5, wherein: the performance monitoring module calculates the current load state of each back-end server according to the acquired operation index of the back-end server in the main group, specifically as follows:

7. The load balancing system of claim 6, wherein: the allocation of the active set and the standby set according to the actual requirement is as follows:

8. The load balancing system of claim 6, wherein: the set threshold in the status monitoring Value is as follows: the CPU utilization rate is 85%; the threshold is set as follows: and 85% of memory.