CN110543366A - Service module capacity tuning method and device for service cluster and server - Google Patents

Service module capacity tuning method and device for service cluster and server Download PDF

Info

Publication number
CN110543366A
CN110543366A CN201910799917.5A CN201910799917A CN110543366A CN 110543366 A CN110543366 A CN 110543366A CN 201910799917 A CN201910799917 A CN 201910799917A CN 110543366 A CN110543366 A CN 110543366A
Authority
CN
China
Prior art keywords
capacity
module
service
servers
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910799917.5A
Other languages
Chinese (zh)
Inventor
赵子青
吴峰
郭伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Yidianshikong Network Co Ltd
Original Assignee
Shanghai Yidianshikong Network Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Yidianshikong Network Co Ltd filed Critical Shanghai Yidianshikong Network Co Ltd
Priority to CN201910799917.5A priority Critical patent/CN110543366A/en
Publication of CN110543366A publication Critical patent/CN110543366A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5013Request control
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/508Monitor

Abstract

the application discloses a method and a device for adjusting and optimizing the capacity of a service module of a service cluster, and a server. The method comprises the steps that when a server calling request under the same service module is received in a service cluster, whether module capacity among the servers has difference or not is judged; if there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers. The method and the device solve the technical problem that a capacity optimizing method for the business module with unhealthy capacity is lacked. By the method and the system, the unhealthy service module is optimized, the utilization rate of servers with different performances under the module can be maximized, and the capacity among the servers can be balanced.

Description

Service module capacity tuning method and device for service cluster and server
Technical Field
The present application relates to the field of service cluster processing, and in particular, to a method, an apparatus, and a server for tuning service module capacity for a service cluster.
Background
the service requests initiate load balancing access, the number of the requests can basically and uniformly fall on the servers under the called modules, and the capacity difference can be caused due to the performance difference among the servers.
The inventor finds that the lack of capacity tuning schemes for the service module with unhealthy capacity is not beneficial to make the capacity of the service module in a healthy state.
Aiming at the problem that a capacity optimizing method for a business module with unhealthy capacity is lacked in the related technology, an effective solution is not provided at present.
Disclosure of Invention
the present application mainly aims to provide a method, an apparatus, and a server for adjusting and optimizing the capacity of a service module for a service cluster, so as to solve the problem of lacking a method for adjusting and optimizing the capacity of a service module with unhealthy capacity.
in order to achieve the above object, according to an aspect of the present application, a method for tuning capacity of a service module for a service cluster is provided.
The method for adjusting and optimizing the capacity of the service module of the service cluster comprises the following steps: when a server calling request under the same service module is received in a service cluster, judging whether module capacities among the servers have differences or not; if there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers.
further, if there is a difference in module capacity, adjusting the called weights of the servers to balance the capacity among the servers comprises:
Judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules or not in the service peak period;
And if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
Further, when a server call request for the same service module is received in the service cluster, determining whether module capacities between servers have differences includes:
When a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;
And judging whether the judgment indexes between the servers have differences according to preset standards.
further, if there is a difference in capacity, adjusting the called weights of the servers to balance module capacity among the servers comprises:
Determining the called quantity and the called weight when a server calling request under the same service module is received;
when the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;
The capacity benchmark is a safety threshold with the highest capacity, and the single support capacity refers to the server capacity support capacity.
Further, the module capacity is selected from the CPU utilization rate.
In order to achieve the above object, according to another aspect of the present application, a service module capacity tuning apparatus for a service cluster is provided.
The service module capacity tuning device for the service cluster according to the application comprises: the judging module is used for judging whether the module capacities among the servers have differences or not when the server calling requests under the same service module are received in the service cluster; and the adjusting and optimizing module is used for adjusting the called weight of the servers when the module capacity is different so as to balance the capacity among the servers.
Further, the tuning module is used for
Judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules or not in the service peak period;
And if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
further, the determining module is configured to determine whether the detected signal is a signal
When a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;
and judging whether the judgment indexes between the servers have differences according to preset standards.
Further, the determining module is configured to determine whether the detected signal is a signal
Determining the called quantity and the called weight when a server calling request under the same service module is received;
When the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;
the capacity benchmark is a safety threshold with the highest capacity, and the single support capacity refers to the server capacity support capacity.
in order to achieve the above object, according to still another aspect of the present application, there is provided a server including: the capacity adjusting and optimizing device of the service module.
in the method, the device and the server for adjusting the capacity of the service module of the service cluster, when a server call request for the same service module is received in the service cluster, whether the module capacity between the servers is different or not is judged, so that the aim of adjusting the called weight of the servers to balance the capacity between the servers is fulfilled if the module capacity is different, the technical effect of adjusting the optimization processing of unhealthy service modules is achieved, and the technical problem that a capacity adjusting party for the service module with unhealthy capacity is lacked is solved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, serve to provide a further understanding of the application and to enable other features, objects, and advantages of the application to be more apparent. The drawings and their description illustrate the embodiments of the invention and do not limit it. In the drawings:
Fig. 1 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a first embodiment of the present application;
Fig. 2 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a second embodiment of the present application;
fig. 3 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a third embodiment of the present application;
Fig. 4 is a schematic flow chart of a service module capacity tuning method for a service cluster according to a fourth embodiment of the present application;
Fig. 5 is a schematic structural diagram of a service module capacity tuning device for a service cluster according to an embodiment of the present application.
Detailed Description
in order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be used. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In this application, the terms "upper", "lower", "left", "right", "front", "rear", "top", "bottom", "inner", "outer", "middle", "vertical", "horizontal", "lateral", "longitudinal", and the like indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings. These terms are used primarily to better describe the present application and its embodiments, and are not used to limit the indicated devices, elements or components to a particular orientation or to be constructed and operated in a particular orientation.
Moreover, some of the above terms may be used to indicate other meanings besides the orientation or positional relationship, for example, the term "on" may also be used to indicate some kind of attachment or connection relationship in some cases. The specific meaning of these terms in this application will be understood by those of ordinary skill in the art as appropriate.
furthermore, the terms "mounted," "disposed," "provided," "connected," and "sleeved" are to be construed broadly. For example, it may be a fixed connection, a removable connection, or a unitary construction; can be a mechanical connection, or an electrical connection; may be directly connected, or indirectly connected through intervening media, or may be in internal communication between two devices, elements or components. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art as appropriate.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
In the application, when each server faces the same number of called requests under the same module and the capacity between the servers is greatly different, the called weight of the server under the module needs to be optimized and adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.
As shown in fig. 1, the method includes steps S102 to S104 as follows:
Step S102, when a server calling request under the same service module is received in a service cluster, judging whether module capacity between servers has difference;
When a server call request for the same service module is received in the service cluster, it is necessary to determine whether module capacities between servers are different. For example, it is determined whether there is a large difference in module capacity between servers, or whether the difference in module capacity between servers exceeds a threshold.
Specifically, the module capacity refers to selecting one index from multiple index dimensions of the module according to the function of the module as a unique core standard for measuring the module capacity.
It should be noted that, as the indices: CPU usage, memory usage, I/O usage, service return code on module program, etc. of the servers in the module.
For example, in the access type service, the utilization rate of the CPU is generally selected as the only core index of the support performance of the module, and the utilization rate of the CPU is 90%, which indicates that the module has reached the maximum support bottleneck of the performance.
For another example, the data storage module generally selects the disk I/O read/write performance as the only core index.
and step S104, if the module capacities are different, adjusting the called weight of the server to balance the capacities among the servers.
And if the difference of the module capacity is known according to the judgment result, adjusting the called weight of the server to balance the capacity among the servers so as to enable the module capacity to be a healthy capacity module.
It should be noted that, since the service requests initiate load-balanced access, the number of requests will fall substantially uniformly on the servers below the called module, and differences in capacity will occur due to performance differences between the servers. Specifically, there are some modules under which the capacity difference between servers is not large, and within an acceptable range, the capacity health module is defined. In addition, the capacity difference between servers under other modules is large, and the capacity is defined as an unhealthy module.
it should also be noted that, various methods for determining whether the module capacity belongs to the healthy capacity may be adopted, and the method is not specifically limited in the embodiment of the present application, and a person skilled in the art may select the module capacity according to the actual use situation.
The service cluster is a set that a plurality of service modules are combined to realize a complete service function or a certain core function.
the service module refers to a collection in which a certain service function point is deployed on a single server or a group of servers.
From the above description, it can be seen that the following technical effects are achieved by the present application:
in the embodiment of the application, when a server calling request under the same service module is received in a service cluster, the purpose of adjusting the called weight of the server to balance the capacity among the servers is achieved by judging whether the module capacity among the servers is different or not, so that the technical effect of carrying out tuning processing on unhealthy service modules is realized, and the technical problem that a capacity tuning party for the service modules with unhealthy capacity is lacked is solved.
according to the embodiment of the present application, as a preference in the embodiment, as shown in fig. 2, if there is a difference in module capacity, adjusting the called weight of the servers to balance the capacity among the servers includes:
Step S202, judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules at the service peak;
Step S204, if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, the called weight of the servers is adjusted.
Preferably, the module capacity is selected from CPU utilization.
and judging whether the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules or not in the service peak period, and if the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
Specifically, when the same number of called requests of each server in the same module are met and the capacity difference between the servers is large, the called weight of the server in the module needs to be optimally adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.
The server utilization rate of different performances under the module can be maximized by adjusting the called weight of the server under the called module, the capacity between the servers can be balanced, chain collapse caused by short capacity of a certain server is avoided, the service is more stable in operation, and the server cost is optimal.
According to the embodiment of the present application, as a preferred embodiment in the present embodiment, as shown in fig. 3, when a server call request for the same service module is received in a service cluster, determining whether module capacities between servers have a difference includes:
Step S302, when a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;
Step S304, judging whether the judgment indexes between the servers have differences according to preset standards.
Preferably, the module capacity is selected from CPU utilization.
when a server call request under the same service module is received in the service cluster, a judgment index is selected as a preset standard for measuring the capacity of the service module according to the service function of the service module. And judging whether the judgment indexes between the servers have differences according to preset standards.
specifically, when the same number of called requests of each server in the same module are met and the capacity difference between the servers is large, the called weight of the server in the module needs to be optimally adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.
The server utilization rate of different performances under the module can be maximized by adjusting the called weight of the server under the called module, the capacity between the servers can be balanced, chain collapse caused by short capacity of a certain server is avoided, the service is more stable in operation, and the server cost is optimal.
according to the embodiment of the present application, as a preference in the embodiment, as shown in fig. 4, if there is a difference in capacity, adjusting the called weight of the servers to balance the module capacity among the servers includes:
Step S402, determining the called quantity and the called weight when a server calling request under the same service module is received;
Step S404, when there is a difference in module capacity between servers, adjusting the called weight of the server to obtain a new weight (single support capability capacity)/called amount;
The method comprises the steps of determining the called quantity and the called weight when a server calling request under the same service module is received, and then adjusting the called weight of the server to obtain a new weight (the single support capability capacity) per the called quantity when the module capacity of the server is different.
the capacity benchmark is a safety threshold with the highest capacity, and the single-station supporting capacity refers to the server capacity supporting capacity.
According to the tuning method, the called weight of the server under the called module is adjusted, so that the utilization rate of the servers with different performances under the module can be maximized, the capacity among the servers can be balanced, chain running caused by the short capacity of a certain server is avoided, the service is more stable in operation, and the cost of the server is optimal.
Preferably, if the one-time tuning result cannot meet the requirement, namely the capacity standard value cannot be reached, multiple times of tuning are needed.
It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.
according to an embodiment of the present application, there is further provided a service module capacity tuning apparatus for a service cluster, for implementing the foregoing method, as shown in fig. 5, the apparatus includes: the judging module 10 is configured to judge whether module capacities between servers are different when a server call request for the same service module is received in a service cluster; and the tuning module 20 is used for adjusting the called weight of the servers to balance the capacity among the servers when the module capacity is different.
In the determining module 10 in the embodiment of the present application, when a server call request for the same service module is received in a service cluster, it is necessary to determine whether module capacities between servers are different. For example, it is determined whether there is a large difference in module capacity between servers, or whether the difference in module capacity between servers exceeds a threshold.
Specifically, the module capacity refers to selecting one index from multiple index dimensions of the module according to the function of the module as a unique core standard for measuring the module capacity.
It should be noted that, as the indices: CPU usage, memory usage, I/O usage, service return code on module program, etc. of the servers in the module.
for example, in the access type service, the utilization rate of the CPU is generally selected as the only core index of the support performance of the module, and the utilization rate of the CPU is 90%, which indicates that the module has reached the maximum support bottleneck of the performance.
For another example, the data storage module generally selects the disk I/O read/write performance as the only core index.
In the tuning module 20 according to the embodiment of the present application, if the module capacity is known to be different according to the determination result, the called weight of the server is adjusted to balance the capacity between the servers, so that the module capacity is a healthy capacity module.
It should be noted that, since the service requests initiate load-balanced access, the number of requests will fall substantially uniformly on the servers below the called module, and differences in capacity will occur due to performance differences between the servers. Specifically, there are some modules under which the capacity difference between servers is not large, and within an acceptable range, the capacity health module is defined. In addition, the capacity difference between servers under other modules is large, and the capacity is defined as an unhealthy module.
According to the embodiment of the present application, as a preferred option in the embodiment, the tuning module 20 is configured to determine whether a module capacity difference between the servers reaches a capacity difference of an unhealthy service module during a service peak period; and if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
Preferably, the module capacity is selected from CPU utilization.
And judging whether the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules or not in the service peak period, and if the module capacity difference between the servers reaches the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
Specifically, when the same number of called requests of each server in the same module are met and the capacity difference between the servers is large, the called weight of the server in the module needs to be optimally adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.
The server utilization rate of different performances under the module can be maximized by adjusting the called weight of the server under the called module, the capacity between the servers can be balanced, chain collapse caused by short capacity of a certain server is avoided, the service is more stable in operation, and the server cost is optimal.
According to the embodiment of the present application, as an optimization in the embodiment, the determining module 10 is configured to select a determining index as a preset standard for measuring the capacity of a service module according to a service function of the service module when a server call request for the same service module is received in a service cluster; and judging whether the judgment indexes between the servers have differences according to preset standards.
Preferably, the module capacity is selected from CPU utilization.
When a server call request under the same service module is received in the service cluster, a judgment index is selected as a preset standard for measuring the capacity of the service module according to the service function of the service module. And judging whether the judgment indexes between the servers have differences according to preset standards.
Specifically, when the same number of called requests of each server in the same module are met and the capacity difference between the servers is large, the called weight of the server in the module needs to be optimally adjusted, and the capacity between the servers is concentrated in an acceptable capacity interval as much as possible by adjusting the weight of the server, so that the capacity of the service module becomes a capacity health service module.
The server utilization rate of different performances under the module can be maximized by adjusting the called weight of the server under the called module, the capacity between the servers can be balanced, chain collapse caused by short capacity of a certain server is avoided, the service is more stable in operation, and the server cost is optimal.
according to the embodiment of the present application, as a preferred choice in the embodiment, the determining module 10 is configured to determine a called amount and a called weight when a server call request for the same service module is received; when the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;
The method comprises the steps of determining the called quantity and the called weight when a server calling request under the same service module is received, and then adjusting the called weight of the server to obtain a new weight (the single support capability capacity) per the called quantity when the module capacity of the server is different.
The capacity benchmark is a safety threshold with the highest capacity, and the single-station supporting capacity refers to the server capacity supporting capacity.
The optimization module adjusts the called weight of the server under the called module, maximizes the utilization rate of the servers with different performances under the module, balances the capacity among the servers, avoids chain rushing caused by a certain server capacity short plate, and ensures that the service is more stable in operation and the server cost is optimal.
In another embodiment of the present application, there is also provided a server including: the capacity adjusting and optimizing device of the service module. The implementation principle and the beneficial effect of the service module capacity tuning device are as described above, and are not described herein again.
The realization principle of the tuning method can be realized by referring to the following modes:
In the following, the module a, the capacity core index CPU usage, and the module 7 servers, for example, the amount of access/call to each server N is 2000 times/second during the peak period, and the call weight of each server q is 100.
the method comprises the following steps: the capacity unhealthy module records the following data during service use peak periods:
Server 1 Server 2 server 3 Server 4 Server 5 Server 6 server 7
Called amount 2000 times/second 2000 times/second 2000 times/second 2000 times/second 2000 times/second 2000 times/second 2000 times/second
Called weight 100 100 100 100 100 100 100
CPU utilization (%) 70 60 55 81 72 48 51
Step two: calculating the capacity supporting capacity of a single server, wherein the capacity supporting capacity of the single server is (called quantity is called weight)/CPU, the capacity supporting capacity of the single server is rounded, and data are calculated:
(called amount of weight of called) ═ 2000 × 100 ═ 200000
server 1 Server 2 server 3 Server 4 server 5 server 6 Server 7
Calculation process =20000/70 =20000/60 =20000/55 =20000/81 =20000/72 =20000/48 =20000/51
Single stand support capability 2857 3333 3636 2469 2777 4166 3921
step three: the established maximum safe value of the capacity is as follows: for example, the CPU utilization in the table is 85%, the server with the highest capacity under the module is found, and it is determined whether the highest capacity is within the safety value range, such as the server 4 with the highest capacity in the table above, which is 81, and within the highest range value.
step four: under the condition that the called quantity is not changed, the highest capacity 81 is used as a reference value, weight promotion is carried out on the modules with the capacity not reaching 81, and the capacity of the modules reaches 81, wherein the calculation method comprises the following steps:
New weight value (single support capacity) per called amount;
then, new weight calculation is carried out according to the capacity of other servers, the result is rounded, and the data is as follows:
Step five: the current network adjusts the weight, and then observes and records the data of the capacity of each server under the peak period module.
The net adjustment weights may be specified in the following table. The theory should be consistent because there may be a certain difference between the theoretical calculation value and the adjusted server real capacity value. For example, the value of x1.. x7 in the following table should be 81, but there may be some deviation in practice, and some servers may not have the same performance due to hardware.
So as the request increases, the magnitude of the load rise will be higher than expected, using x1... x7 as an indicator. For example, the value in x1... x7 is not 81, and the absolute value of the subtraction of 81 is greater than 5, and when the real value x1 of the capacity of the server 1 after tuning is 86, then the capacity of the server 1 needs to be tuned once again. Calculating theoretical value of capacity in peak period after adjustment:
Server 1 Server 2 Server 3 Server 4 Server 5 Server 6 Server 7
Adjusted weights 116 135 147 100 112 169 159
New capacity value after weight adjustment 81 81 81 81 81 81 81
Adjusted peak capacity true value:
Server 1 server 2 server 3 Server 4 Server 5 Server 6 server 7
Adjusted weights 116 135 147 100 112 169 159
New capacity value after weight adjustment x1 x2 x3 x4 x5 x6 x7
it will be apparent to those skilled in the art that the modules or steps of the present application described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and they may alternatively be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, or fabricated separately as individual integrated circuit modules, or fabricated as a single integrated circuit module from multiple modules or steps. Thus, the present application is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (10)

1. A method for adjusting and optimizing the capacity of a service module of a service cluster is characterized by comprising the following steps:
When a server calling request under the same service module is received in a service cluster, judging whether module capacities among the servers have differences or not;
If there is a difference in module capacity, the called weights of the servers are adjusted to balance the capacity among the servers.
2. The traffic module capacity tuning method of claim 1, wherein if there is a difference in module capacity, adjusting the called weights of the servers to equalize the capacity among the servers comprises:
judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules or not in the service peak period;
And if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
3. The method of claim 1, wherein when a server call request for the same service module is received in a service cluster, determining whether the module capacities between servers differ comprises:
When a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;
And judging whether the judgment indexes between the servers have differences according to preset standards.
4. The traffic module capacity tuning method of claim 1, wherein if there is a difference in capacity, adjusting the called weights of the servers to equalize the module capacity among the servers comprises:
Determining the called quantity and the called weight when a server calling request under the same service module is received;
When the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;
The capacity benchmark is a safety threshold with the highest capacity, and the single support capacity refers to the server capacity support capacity.
5. The method of claim 1, wherein the module capacity is CPU utilization.
6. A service module capacity tuning apparatus for a service cluster, comprising:
The judging module is used for judging whether the module capacities among the servers have differences or not when the server calling requests under the same service module are received in the service cluster;
And the adjusting and optimizing module is used for adjusting the called weight of the servers when the module capacity is different so as to balance the capacity among the servers.
7. the service module capacity tuning device of claim 6, wherein the tuning module is configured to tune the service module capacity of the service cluster
Judging whether the module capacity difference between the servers reaches the capacity difference of unhealthy service modules or not in the service peak period;
And if the module capacity difference between the servers is judged to reach the capacity difference of the unhealthy service modules in the service peak period, adjusting the called weight of the servers.
8. the service module capacity tuning apparatus for service cluster according to claim 6, wherein the determining module is configured to determine the capacity of the service module according to the capacity tuning apparatus
when a server call request under the same service module is received in a service cluster, selecting a judgment index as a preset standard for measuring the capacity of the service module according to the service function of the service module;
and judging whether the judgment indexes between the servers have differences according to preset standards.
9. The service module capacity tuning apparatus for service cluster according to claim 6, wherein the determining module is configured to determine the capacity of the service module according to the capacity tuning apparatus
Determining the called quantity and the called weight when a server calling request under the same service module is received;
when the module capacity between the servers is different, adjusting the called weight of the servers to obtain a new weight (the single support capacity) and a new called amount;
The capacity benchmark is a safety threshold with the highest capacity, and the single support capacity refers to the server capacity support capacity.
10. A server, comprising: the traffic module capacity tuning apparatus according to any one of claims 6 to 9.
CN201910799917.5A 2019-08-27 2019-08-27 Service module capacity tuning method and device for service cluster and server Pending CN110543366A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910799917.5A CN110543366A (en) 2019-08-27 2019-08-27 Service module capacity tuning method and device for service cluster and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910799917.5A CN110543366A (en) 2019-08-27 2019-08-27 Service module capacity tuning method and device for service cluster and server

Publications (1)

Publication Number Publication Date
CN110543366A true CN110543366A (en) 2019-12-06

Family

ID=68712181

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910799917.5A Pending CN110543366A (en) 2019-08-27 2019-08-27 Service module capacity tuning method and device for service cluster and server

Country Status (1)

Country Link
CN (1) CN110543366A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112583922A (en) * 2020-12-16 2021-03-30 罗普特科技集团股份有限公司 Intelligent scheduling system for video monitoring service

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070143460A1 (en) * 2005-12-19 2007-06-21 International Business Machines Corporation Load-balancing metrics for adaptive dispatching of long asynchronous network requests
CN101753444A (en) * 2009-12-31 2010-06-23 卓望数码技术(深圳)有限公司 Method and device for load balancing
CN102611735A (en) * 2011-12-21 2012-07-25 奇智软件(北京)有限公司 Load balancing method and system of application services
CN105049536A (en) * 2015-09-08 2015-11-11 南京大学 Load balancing system and load balancing method in IaaS (Infrastructure As A Service) cloud environment
CN107124472A (en) * 2017-06-26 2017-09-01 杭州迪普科技股份有限公司 Load-balancing method and device, computer-readable recording medium
CN108572873A (en) * 2018-04-24 2018-09-25 中国科学院重庆绿色智能技术研究院 A kind of load-balancing method and device solving the problems, such as Spark data skews
CN110134513A (en) * 2019-04-17 2019-08-16 平安科技(深圳)有限公司 Load-balancing method, device, computer equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070143460A1 (en) * 2005-12-19 2007-06-21 International Business Machines Corporation Load-balancing metrics for adaptive dispatching of long asynchronous network requests
CN101753444A (en) * 2009-12-31 2010-06-23 卓望数码技术(深圳)有限公司 Method and device for load balancing
CN102611735A (en) * 2011-12-21 2012-07-25 奇智软件(北京)有限公司 Load balancing method and system of application services
CN105049536A (en) * 2015-09-08 2015-11-11 南京大学 Load balancing system and load balancing method in IaaS (Infrastructure As A Service) cloud environment
CN107124472A (en) * 2017-06-26 2017-09-01 杭州迪普科技股份有限公司 Load-balancing method and device, computer-readable recording medium
CN108572873A (en) * 2018-04-24 2018-09-25 中国科学院重庆绿色智能技术研究院 A kind of load-balancing method and device solving the problems, such as Spark data skews
CN110134513A (en) * 2019-04-17 2019-08-16 平安科技(深圳)有限公司 Load-balancing method, device, computer equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112583922A (en) * 2020-12-16 2021-03-30 罗普特科技集团股份有限公司 Intelligent scheduling system for video monitoring service
CN112583922B (en) * 2020-12-16 2022-09-20 罗普特科技集团股份有限公司 Intelligent scheduling system for video monitoring service

Similar Documents

Publication Publication Date Title
CN106453125B (en) A kind of remote service calling SiteServer LBS based on real time load rate
CN111694663B (en) Load balancing method, device and system for server cluster
CN110795203B (en) Resource scheduling method, device, system and computing equipment
KR102047900B1 (en) Method and apparatus for leveling load in a distributed database
CN102111813B (en) Method and equipment for balancing loads of carriers
CN101699901A (en) Method and device for optimizing search space of user equipment
CN101557344A (en) Dynamic load balancing method based on spatial geographical locations
CN105516347A (en) Method and device for load balance allocation of streaming media server
CN111045808B (en) Distributed network task scheduling method and device
CN110233866B (en) Load balancing method and load balancer
CN103746934A (en) CDN bandwidth balancing method, CDN control center and system
CN108055701B (en) Resource scheduling method and base station
US11889354B2 (en) Method for load imbalance optimization under same network coverage, apparatus, device, and storage medium
CN102724105B (en) A kind of load-balancing method and device
CN110213351A (en) A kind of dynamic self-adapting I/O load equalization methods towards wide area high-performance computing environment
KR101611817B1 (en) Apparatus and method for supporting idle mode load balancing
WO2014194704A1 (en) A grouping processing method and system
CN106998340B (en) Load balancing method and device for board resources
CN107220108A (en) A kind of method and system for realizing cloud data center load balancing
CN110543366A (en) Service module capacity tuning method and device for service cluster and server
CN109982375A (en) A kind of the load balancing method of adjustment and device of serving cell
CN110554920A (en) Method and device for processing capacity unhealthy module
CN104899072B (en) Fine granularity resource scheduling system based on virtual platform and method
CN101730147B (en) Carrier frequency allocating method and device
CN116684098A (en) Block chain-based data processing method, device, equipment, medium and product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191206

RJ01 Rejection of invention patent application after publication