CN110543366A

CN110543366A - Service module capacity tuning method and device for service cluster and server

Info

Publication number: CN110543366A
Application number: CN201910799917.5A
Authority: CN
Inventors: 赵子青; 吴峰; 郭伟
Original assignee: Shanghai Yidianshikong Network Co Ltd
Current assignee: Shanghai Yidianshikong Network Co Ltd
Priority date: 2019-08-27
Filing date: 2019-08-27
Publication date: 2019-12-06

Abstract

the application discloses a method and a device for adjusting and optimizing the capacity of a service module of a service cluster, and a server. The method comprises the steps that when a server calling request under the same service module is received in a service cluster, whether module capacity among the servers has difference or not is judged; if there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers. The method and the device solve the technical problem that a capacity optimizing method for the business module with unhealthy capacity is lacked. By the method and the system, the unhealthy service module is optimized, the utilization rate of servers with different performances under the module can be maximized, and the capacity among the servers can be balanced.

Description

Service module capacity tuning method and device for service cluster and server

Technical Field

The present application relates to the field of service cluster processing, and in particular, to a method, an apparatus, and a server for tuning service module capacity for a service cluster.

Background

the service requests initiate load balancing access, the number of the requests can basically and uniformly fall on the servers under the called modules, and the capacity difference can be caused due to the performance difference among the servers.

The inventor finds that the lack of capacity tuning schemes for the service module with unhealthy capacity is not beneficial to make the capacity of the service module in a healthy state.

Aiming at the problem that a capacity optimizing method for a business module with unhealthy capacity is lacked in the related technology, an effective solution is not provided at present.

Disclosure of Invention

the present application mainly aims to provide a method, an apparatus, and a server for adjusting and optimizing the capacity of a service module for a service cluster, so as to solve the problem of lacking a method for adjusting and optimizing the capacity of a service module with unhealthy capacity.

in order to achieve the above object, according to an aspect of the present application, a method for tuning capacity of a service module for a service cluster is provided.

The method for adjusting and optimizing the capacity of the service module of the service cluster comprises the following steps: when a server calling request under the same service module is received in a service cluster, judging whether module capacities among the servers have differences or not; if there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers.

further, if there is a difference in module capacity, adjusting the called weights of the servers to balance the capacity among the servers comprises:

Judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules or not in the service peak period;

And if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.

Further, when a server call request for the same service module is received in the service cluster, determining whether module capacities between servers have differences includes:

When a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;

And judging whether the judgment indexes between the servers have differences according to preset standards.

further, if there is a difference in capacity, adjusting the called weights of the servers to balance module capacity among the servers comprises:

Determining the called quantity and the called weight when a server calling request under the same service module is received;

when the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;

The capacity benchmark is a safety threshold with the highest capacity, and the single support capacity refers to the server capacity support capacity.

Further, the module capacity is selected from the CPU utilization rate.

In order to achieve the above object, according to another aspect of the present application, a service module capacity tuning apparatus for a service cluster is provided.

The service module capacity tuning device for the service cluster according to the application comprises: the judging module is used for judging whether the module capacities among the servers have differences or not when the server calling requests under the same service module are received in the service cluster; and the adjusting and optimizing module is used for adjusting the called weight of the servers when the module capacity is different so as to balance the capacity among the servers.

Further, the tuning module is used for

further, the determining module is configured to determine whether the detected signal is a signal

in order to achieve the above object, according to still another aspect of the present application, there is provided a server including: the capacity adjusting and optimizing device of the service module.

in the method, the device and the server for adjusting the capacity of the service module of the service cluster, when a server call request for the same service module is received in the service cluster, whether the module capacity between the servers is different or not is judged, so that the aim of adjusting the called weight of the servers to balance the capacity between the servers is fulfilled if the module capacity is different, the technical effect of adjusting the optimization processing of unhealthy service modules is achieved, and the technical problem that a capacity adjusting party for the service module with unhealthy capacity is lacked is solved.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this application, serve to provide a further understanding of the application and to enable other features, objects, and advantages of the application to be more apparent. The drawings and their description illustrate the embodiments of the invention and do not limit it. In the drawings:

Fig. 1 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a first embodiment of the present application;

Fig. 2 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a second embodiment of the present application;

fig. 3 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a third embodiment of the present application;

Fig. 4 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a fourth embodiment of the present application;

Fig. 5 is a schematic structural diagram of a service module capacity tuning device for a service cluster according to an embodiment of the present application.

Detailed Description

in order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be used. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

In this application, the terms "upper", "lower", "left", "right", "front", "rear", "top", "bottom", "inner", "outer", "middle", "vertical", "horizontal", "lateral", "longitudinal", and the like indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings. These terms are used primarily to better describe the present application and its embodiments, and are not used to limit the indicated devices, elements or components to a particular orientation or to be constructed and operated in a particular orientation.

Moreover, some of the above terms may be used to indicate other meanings besides the orientation or positional relationship, for example, the term "on" may also be used to indicate some kind of attachment or connection relationship in some cases. The specific meaning of these terms in this application will be understood by those of ordinary skill in the art as appropriate.

furthermore, the terms "mounted," "disposed," "provided," "connected," and "sleeved" are to be construed broadly. For example, it may be a fixed connection, a removable connection, or a unitary construction; can be a mechanical connection, or an electrical connection; may be directly connected, or indirectly connected through intervening media, or may be in internal communication between two devices, elements or components. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art as appropriate.

It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.

In the application, when each server faces the same number of called requests under the same module and the capacity between the servers is greatly different, the called weight of the server under the module needs to be optimized and adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.

As shown in fig. 1, the method includes steps S102 to S104 as follows:

Step S102, when a server calling request under the same service module is received in a service cluster, judging whether module capacity between servers has difference;

When a server call request for the same service module is received in the service cluster, it is necessary to determine whether module capacities between servers are different. For example, it is determined whether there is a large difference in module capacity between servers, or whether the difference in module capacity between servers exceeds a threshold.

Specifically, the module capacity refers to selecting one index from multiple index dimensions of the module according to the function of the module as a unique core standard for measuring the module capacity.

It should be noted that, as the indices: CPU usage, memory usage, I/O usage, service return code on module program, etc. of the servers in the module.

For example, in the access type service, the utilization rate of the CPU is generally selected as the only core index of the support performance of the module, and the utilization rate of the CPU is 90%, which indicates that the module has reached the maximum support bottleneck of the performance.

For another example, the data storage module generally selects the disk I/O read/write performance as the only core index.

and step S104, if the module capacities are different, adjusting the called weight of the server to balance the capacities among the servers.

And if the difference of the module capacity is known according to the judgment result, adjusting the called weight of the server to balance the capacity among the servers so as to enable the module capacity to be a healthy capacity module.

It should be noted that, since the service requests initiate load-balanced access, the number of requests will fall substantially uniformly on the servers below the called module, and differences in capacity will occur due to performance differences between the servers. Specifically, there are some modules under which the capacity difference between servers is not large, and within an acceptable range, the capacity health module is defined. In addition, the capacity difference between servers under other modules is large, and the capacity is defined as an unhealthy module.

it should also be noted that, various methods for determining whether the module capacity belongs to the healthy capacity may be adopted, and the method is not specifically limited in the embodiment of the present application, and a person skilled in the art may select the module capacity according to the actual use situation.

The service cluster is a set that a plurality of service modules are combined to realize a complete service function or a certain core function.

the service module refers to a collection in which a certain service function point is deployed on a single server or a group of servers.

From the above description, it can be seen that the following technical effects are achieved by the present application:

in the embodiment of the application, when a server calling request under the same service module is received in a service cluster, the purpose of adjusting the called weight of the server to balance the capacity among the servers is achieved by judging whether the module capacity among the servers is different or not, so that the technical effect of carrying out tuning processing on unhealthy service modules is realized, and the technical problem that a capacity tuning party for the service modules with unhealthy capacity is lacked is solved.

according to the embodiment of the present application, as a preference in the embodiment, as shown in fig. 2, if there is a difference in module capacity, adjusting the called weight of the servers to balance the capacity among the servers includes:

Step S202, judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules at the service peak;

Step S204, if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, the called weight of the servers is adjusted.

Preferably, the module capacity is selected from CPU utilization.

and judging whether the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules or not in the service peak period, and if the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.

Specifically, when the same number of called requests of each server in the same module are met and the capacity difference between the servers is large, the called weight of the server in the module needs to be optimally adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.

The server utilization rate of different performances under the module can be maximized by adjusting the called weight of the server under the called module, the capacity between the servers can be balanced, chain collapse caused by short capacity of a certain server is avoided, the service is more stable in operation, and the server cost is optimal.

According to the embodiment of the present application, as a preferred embodiment in the present embodiment, as shown in fig. 3, when a server call request for the same service module is received in a service cluster, determining whether module capacities between servers have a difference includes:

Step S302, when a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;

Step S304, judging whether the judgment indexes between the servers have differences according to preset standards.

Preferably, the module capacity is selected from CPU utilization.

when a server call request under the same service module is received in the service cluster, a judgment index is selected as a preset standard for measuring the capacity of the service module according to the service function of the service module. And judging whether the judgment indexes between the servers have differences according to preset standards.

according to the embodiment of the present application, as a preference in the embodiment, as shown in fig. 4, if there is a difference in capacity, adjusting the called weight of the servers to balance the module capacity among the servers includes:

Step S402, determining the called quantity and the called weight when a server calling request under the same service module is received;

Step S404, when there is a difference in module capacity between servers, adjusting the called weight of the server to obtain a new weight (single support capability capacity)/called amount;

The method comprises the steps of determining the called quantity and the called weight when a server calling request under the same service module is received, and then adjusting the called weight of the server to obtain a new weight (the single support capability capacity) per the called quantity when the module capacity of the server is different.

the capacity benchmark is a safety threshold with the highest capacity, and the single-station supporting capacity refers to the server capacity supporting capacity.

According to the tuning method, the called weight of the server under the called module is adjusted, so that the utilization rate of the servers with different performances under the module can be maximized, the capacity among the servers can be balanced, chain running caused by the short capacity of a certain server is avoided, the service is more stable in operation, and the cost of the server is optimal.

Preferably, if the one-time tuning result cannot meet the requirement, namely the capacity standard value cannot be reached, multiple times of tuning are needed.

It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.

according to an embodiment of the present application, there is further provided a service module capacity tuning apparatus for a service cluster, for implementing the foregoing method, as shown in fig. 5, the apparatus includes: the judging module 10 is configured to judge whether module capacities between servers are different when a server call request for the same service module is received in a service cluster; and the tuning module 20 is used for adjusting the called weight of the servers to balance the capacity among the servers when the module capacity is different.

In the determining module 10 in the embodiment of the present application, when a server call request for the same service module is received in a service cluster, it is necessary to determine whether module capacities between servers are different. For example, it is determined whether there is a large difference in module capacity between servers, or whether the difference in module capacity between servers exceeds a threshold.

In the tuning module 20 according to the embodiment of the present application, if the module capacity is known to be different according to the determination result, the called weight of the server is adjusted to balance the capacity between the servers, so that the module capacity is a healthy capacity module.

According to the embodiment of the present application, as a preferred option in the embodiment, the tuning module 20 is configured to determine whether a module capacity difference between the servers reaches a capacity difference of an unhealthy service module during a service peak period; and if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.

Preferably, the module capacity is selected from CPU utilization.

According to the embodiment of the present application, as an optimization in the embodiment, the determining module 10 is configured to select a determining index as a preset standard for measuring the capacity of a service module according to a service function of the service module when a server call request for the same service module is received in a service cluster; and judging whether the judgment indexes between the servers have differences according to preset standards.

Preferably, the module capacity is selected from CPU utilization.

according to the embodiment of the present application, as a preferred choice in the embodiment, the determining module 10 is configured to determine a called amount and a called weight when a server call request for the same service module is received; when the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;

The optimization module adjusts the called weight of the server under the called module, maximizes the utilization rate of the servers with different performances under the module, balances the capacity among the servers, avoids chain rushing caused by a certain server capacity short plate, and ensures that the service is more stable in operation and the server cost is optimal.

In another embodiment of the present application, there is also provided a server including: the capacity adjusting and optimizing device of the service module. The implementation principle and the beneficial effect of the service module capacity tuning device are as described above, and are not described herein again.

The realization principle of the tuning method can be realized by referring to the following modes:

In the following, the module a, the capacity core index CPU usage, and the module 7 servers, for example, the amount of access/call to each server N is 2000 times/second during the peak period, and the call weight of each server q is 100.

the method comprises the following steps: the capacity unhealthy module records the following data during service use peak periods:

	Server 1	Server 2	server 3	Server 4	Server 5	Server 6	server 7
								Called amount	2000 times/second	2000 times/second	2000 times/second	2000 times/second	2000 times/second	2000 times/second	2000 times/second
Called weight	100	100	100	100	100	100	100
								CPU utilization (%)	70	60	55	81	72	48	51

Step two: calculating the capacity supporting capacity of a single server, wherein the capacity supporting capacity of the single server is (called quantity is called weight)/CPU, the capacity supporting capacity of the single server is rounded, and data are calculated:

(called amount of weight of called) ═ 2000 × 100 ═ 200000

server 1

Server 2

server 3

Server 4

server 5

server 6

Server 7

Calculation process

＝20000/70

＝20000/60

＝20000/55

＝20000/81

＝20000/72

＝20000/48

＝20000/51

Single stand support capability

2857

3333

3636

2469

2777

4166

3921

step three: the established maximum safe value of the capacity is as follows: for example, the CPU utilization in the table is 85%, the server with the highest capacity under the module is found, and it is determined whether the highest capacity is within the safety value range, such as the server 4 with the highest capacity in the table above, which is 81, and within the highest range value.

step four: under the condition that the called quantity is not changed, the highest capacity 81 is used as a reference value, weight promotion is carried out on the modules with the capacity not reaching 81, and the capacity of the modules reaches 81, wherein the calculation method comprises the following steps:

New weight value (single support capacity) per called amount;

then, new weight calculation is carried out according to the capacity of other servers, the result is rounded, and the data is as follows:

Step five: the current network adjusts the weight, and then observes and records the data of the capacity of each server under the peak period module.

The net adjustment weights may be specified in the following table. The theory should be consistent because there may be a certain difference between the theoretical calculation value and the adjusted server real capacity value. For example, the value of x1.. x7 in the following table should be 81, but there may be some deviation in practice, and some servers may not have the same performance due to hardware.

So as the request increases, the magnitude of the load rise will be higher than expected, using x1... x7 as an indicator. For example, the value in x1... x7 is not 81, and the absolute value of the subtraction of 81 is greater than 5, and when the real value x1 of the capacity of the server 1 after tuning is 86, then the capacity of the server 1 needs to be tuned once again. Calculating theoretical value of capacity in peak period after adjustment:

	Server 1	Server 2	Server 3	Server 4	Server 5	Server 6	Server 7
								Adjusted weights	116	135	147	100	112	169	159
New capacity value after weight adjustment	81	81	81	81	81	81	81

Adjusted peak capacity true value:

Server 1

server 2

server 3

Server 4

Server 5

Server 6

server 7

Adjusted weights

116

135

147

100

112

169

159

New capacity value after weight adjustment

x1

x2

x3

x4

x5

x6

x7

it will be apparent to those skilled in the art that the modules or steps of the present application described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and they may alternatively be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, or fabricated separately as individual integrated circuit modules, or fabricated as a single integrated circuit module from multiple modules or steps. Thus, the present application is not limited to any specific combination of hardware and software.

The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims

1. A method for adjusting and optimizing the capacity of a service module of a service cluster is characterized by comprising the following steps:

When a server calling request under the same service module is received in a service cluster, judging whether module capacities among the servers have differences or not;

If there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers.

2. The traffic module capacity tuning method of claim 1, wherein if there is a difference in module capacity, adjusting the called weights of the servers to equalize the capacity among the servers comprises:

3. The method of claim 1, wherein when a server call request for the same service module is received in a service cluster, determining whether the module capacities between servers differ comprises:

4. The traffic module capacity tuning method of claim 1, wherein if there is a difference in capacity, adjusting the called weights of the servers to equalize the module capacity among the servers comprises:

5. The method of claim 1, wherein the module capacity is CPU utilization.

6. A service module capacity tuning apparatus for a service cluster, comprising:

The judging module is used for judging whether the module capacities among the servers have differences or not when the server calling requests under the same service module are received in the service cluster;

And the adjusting and optimizing module is used for adjusting the called weight of the servers when the module capacity is different so as to balance the capacity among the servers.

7. the service module capacity tuning device of claim 6, wherein the tuning module is configured to tune the service module capacity of the service cluster

8. the service module capacity tuning apparatus for service cluster according to claim 6, wherein the determining module is configured to determine the capacity of the service module according to the capacity tuning apparatus

9. The service module capacity tuning apparatus for service cluster according to claim 6, wherein the determining module is configured to determine the capacity of the service module according to the capacity tuning apparatus

10. A server, comprising: the traffic module capacity tuning apparatus according to any one of claims 6 to 9.