CN105024842A - Method and device for capacity expansion of server - Google Patents


Info

Publication number
CN105024842A
Authority
CN
China
Prior art keywords
server
load
load value
value
dilatation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410173228.0A
Other languages
Chinese (zh)
Inventor
鲍文平
何志敏
陈忠湘
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tencent Computer Systems Co Ltd
Original Assignee
Shenzhen Tencent Computer Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Tencent Computer Systems Co Ltd
Priority to CN201410173228.0A
Publication of CN105024842A

Landscapes

  • Computer And Data Communications (AREA)

Abstract

The invention discloses a method and a device for capacity expansion of a server, belonging to the field of information technology. The method comprises: calculating the load values of each first server from the operation data of that first server; if a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion, together with its operating environment, according to the load values of each first server; applying for the back-end service permission of the second server; and assigning services to the second server. Because the second server and its operating environment are configured, its back-end service permission is applied for, and services are assigned to it whenever a high load value exists, capacity expansion of the server is automated. The expansion is also not restricted by the user, which widens the range of application of server capacity expansion.

Description

Method and device for capacity expansion of a server
Technical field
The present invention relates to the field of information technology, and in particular to a method and a device for capacity expansion of a server.
Background art
With the development of information technology, service providers offer an increasing variety and number of services. Services run on servers, and the number of servers must be adjusted promptly according to the types and number of services in order to meet their demands. In practice, a sudden campaign may have to go live urgently, or a service promotion may increase the request volume. In such cases the number of servers must be increased, that is, the server capacity must be expanded.
The prior art provides a server capacity expansion scheme for small-business services and ordinary-user services. Specifically, the usage of each server is monitored, and the virtual machines to run on the servers, together with their operating environments, are configured according to the usage and the usage threshold of each server. After the back-end service permission of a virtual machine has been applied for manually, services are assigned to the configured virtual machine.
In the course of making the present invention, the inventors found that the prior art has at least the following problems:
Because the scheme targets only small-business services and ordinary-user services, the range of application of server capacity expansion is restricted. In addition, the back-end service permission of the virtual machine must be applied for manually, so the expansion is not sufficiently automated.
Summary of the invention
To solve the problems of the prior art, embodiments of the present invention provide a method and a device for capacity expansion of a server. The technical solution is as follows:
In a first aspect, a method for capacity expansion of a server is provided, the method comprising:
collecting the operation data of each first server, and calculating the load values of each first server from the operation data of that first server;
if a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion, together with the operating environment of the second server, according to the load values of each first server;
applying for the back-end service permission of the second server, and assigning services to the second server.
In a second aspect, a device for capacity expansion of a server is provided, the device comprising:
an acquisition module, configured to collect the operation data of each first server;
a calculation module, configured to calculate the load values of each first server from the operation data of that first server;
a configuration module, configured to configure, when a high load value exists among the load values of all first servers, at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server;
an application module, configured to apply for the back-end service permission of the second server;
an assignment module, configured to assign services to the second server.
The technical solution provided by the embodiments of the present invention brings the following beneficial effects:
When a high load value exists among the load values of the first servers, at least one second server for capacity expansion and its operating environment are configured according to the load values of each first server, the back-end service permission of the second server is applied for, and services are then assigned to the second server. Capacity expansion of the server is thereby automated, and because the expansion is not restricted by service or user, the range of application of server capacity expansion is widened.
Brief description of the drawings
To illustrate the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of the server capacity expansion method provided by an embodiment of the present invention;
Fig. 2 is a flow chart of the server capacity expansion method provided by another embodiment of the present invention;
Fig. 3 is an architecture diagram of the server capacity expansion system provided by another embodiment of the present invention;
Fig. 4 is a flow chart of the load-balancing algorithm provided by another embodiment of the present invention;
Fig. 5 is an architecture diagram of the expansion instruction issuing and execution system provided by another embodiment of the present invention;
Fig. 6 is a structural diagram of a first server capacity expansion device provided by another embodiment of the present invention;
Fig. 7 is a structural diagram of the acquisition module provided by another embodiment of the present invention;
Fig. 8 is a structural diagram of a second server capacity expansion device provided by another embodiment of the present invention;
Fig. 9 is a structural diagram of the configuration module provided by another embodiment of the present invention;
Fig. 10 is a structural diagram of the determining unit provided by another embodiment of the present invention;
Fig. 11 is a structural diagram of a first application module provided by another embodiment of the present invention;
Fig. 12 is a structural diagram of a second application module provided by another embodiment of the present invention;
Fig. 13 is a structural diagram of the server provided by another embodiment of the present invention.
Detailed description of the embodiments
To make the objects, technical solutions and advantages of the present invention clearer, the embodiments of the present invention are described in further detail below with reference to the drawings.
As the types and number of services grow, the demand for server capacity expansion grows with them. Existing schemes provide server capacity expansion for small-business services and ordinary-user services, but they cannot apply for server permissions automatically. In view of this, an embodiment of the present invention provides a method for capacity expansion of a server that is suitable for large-scale services with complex architectures and multiple back-end authentication systems, and that expands server capacity automatically. Referring to Fig. 1, the method flow comprises:
101: collecting the operation data of each first server, and calculating the load values of each first server from the operation data of that first server;
In one embodiment, collecting the operation data of each first server comprises:
obtaining a sampling period, and collecting the operation data of each first server according to the sampling period.
In one embodiment, after the load values of each first server are calculated from the operation data of that first server, the method further comprises:
judging whether each load value of each first server is greater than the load threshold corresponding to that load value;
if any load value of any first server is greater than its corresponding load threshold, determining that a high load value exists among the load values of all first servers.
102: if a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server;
In one embodiment, configuring at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server comprises:
judging whether the cooldown time has elapsed at the current time;
if the cooldown time has elapsed, determining a server capacity expansion strategy according to the load values of each first server, and configuring at least one second server for capacity expansion and the operating environment of the second server according to that strategy.
In one embodiment, determining the server capacity expansion strategy according to the load values of each first server comprises:
determining whether the loads of the first servers are balanced;
if the loads of the first servers are balanced, calculating the number of servers to add according to the load values of each first server, and taking that number as the server capacity expansion strategy.
In one embodiment, determining whether the loads of the first servers are balanced comprises:
calculating, from the load values of all first servers, the load mean corresponding to each load value, and comparing each load value with its corresponding load mean;
if the difference between every load value and its corresponding load mean is no greater than a preset value, determining that the loads of the first servers are balanced.
103: applying for the back-end service permission of the second server, and assigning services to the second server.
In one embodiment, applying for the back-end service permission of the second server comprises:
judging whether the permission system authorizes automatic application for the back-end service permission;
if the permission system authorizes automatic application, applying directly for the back-end service permission of the second server.
In one embodiment, after judging whether the permission system authorizes automatic application for the back-end service permission, the method further comprises:
if the permission system does not authorize automatic application, applying to a proxy for the back-end service permission of the second server.
In the method provided by this embodiment, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and its operating environment are configured according to the load values of each first server, the back-end service permission of the second server is applied for, and services are then assigned to the second server. Capacity expansion of the server is thereby automated, and because the expansion is not restricted by service or user, the range of application of server capacity expansion is widened.
An embodiment of the present invention provides a method for capacity expansion of a server. Building on the preceding embodiment and referring to Fig. 2, the method flow provided by this embodiment comprises:
201: collecting the operation data of each first server, and calculating the load values of each first server from the operation data of that first server;
In one embodiment, collecting the operation data of each first server includes, but is not limited to:
obtaining a sampling period, and collecting the operation data of each first server according to the sampling period.
Referring to the architecture diagram of the server capacity expansion system in Fig. 3, the system consists of a traffic control system, in-operation servers, servers to be put into operation, a monitoring server (monitor), an AC (Alter Controller, a change control unit), a KM (Keeper Master, a cluster manager), an RM (Resource Master, a service resource server), a Transparent Proxy or PA (Permission Assignment, a permission server), and the service back-end permission granting system.
The in-operation servers correspond to the first servers. A keeper (a background program) is deployed on each in-operation server and is responsible for data collection, that is, for collecting the operation data of each first server.
Optionally, the keeper receives the sampling period issued by the monitor, which is how the sampling period is obtained. In each sampling period the keeper collects the operation data of its server by reading the proc file system on that server, which is how the operation data of each first server is collected according to the sampling period.
The keeper reports the collected operation data of each server, together with the collection time, to the monitor. After collecting the operation data of each server, the monitor calculates the load value corresponding to each load index of each server from the operation data, that is, it calculates the load values of each first server from the operation data of that first server. The load indices of a server include, but are not limited to, those shown in the following table, which also gives the calculation method of each load index.
Here, cpuTotal denotes the total CPU capacity and cpuUsed the CPU capacity in use. The following counters are all accumulated from system start to the current time: user is the time spent running in user mode, excluding processes with a negative nice (priority) value; nice is the CPU time consumed by processes with a negative nice value; system is the time spent running in kernel mode; idle is the waiting time other than IO (Input/Output) wait; iowait is the IO wait time; irq is the hard interrupt time; and softirq is the soft interrupt time. MemTotal denotes the total memory size; MemFree the free memory size; Buffers the size of the disk buffer for block devices; Cached the size of the disk cache for files; pgpgin the number of bytes paged in to memory from disk or SWAP (the swap partition); and pgpgout the number of bytes paged out from memory to disk or SWAP.
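The table of calculation methods is not reproduced in this text, so the following Python sketch is only an illustration of how load values could be derived from the counters defined above. The formulas are assumptions, not the patent's own: CPU load is taken as the non-idle, non-iowait share of the counter deltas between two samples, and memory load as the share of memory that is neither free, buffered nor cached. The names `CpuTimes`, `parse_cpu_line`, `cpu_load` and `memory_load` are hypothetical.

```python
from dataclasses import dataclass, astuple


@dataclass(frozen=True)
class CpuTimes:
    """Cumulative CPU counters, in the order of the aggregate 'cpu' line of /proc/stat."""
    user: int
    nice: int
    system: int
    idle: int
    iowait: int
    irq: int
    softirq: int


def parse_cpu_line(line: str) -> CpuTimes:
    """Parse the aggregate 'cpu' line; only the first seven counters are used."""
    fields = line.split()
    if not fields or fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    return CpuTimes(*(int(v) for v in fields[1:8]))


def cpu_load(prev: CpuTimes, curr: CpuTimes) -> float:
    """Assumed formula: cpuUsed / cpuTotal over one sampling period."""
    deltas = CpuTimes(*(c - p for p, c in zip(astuple(prev), astuple(curr))))
    cpu_total = sum(astuple(deltas))
    if cpu_total <= 0:
        return 0.0
    # Assumption: idle and iowait time do not count as used CPU.
    cpu_used = cpu_total - deltas.idle - deltas.iowait
    return cpu_used / cpu_total


def memory_load(mem_total: int, mem_free: int, buffers: int, cached: int) -> float:
    """Assumed formula: memory not free, buffered or cached, divided by MemTotal."""
    if mem_total <= 0:
        raise ValueError("mem_total must be positive")
    return (mem_total - mem_free - buffers - cached) / mem_total
```

Working on deltas between two samples rather than on the raw counters matches the sampling-period design described above, since every counter is cumulative from system start.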
202: judging whether each load value of each first server is greater than the load threshold corresponding to that load value;
This embodiment does not limit the size of the load threshold corresponding to each load value. In a specific implementation, a load threshold can be set separately for each load index. Optionally, load thresholds are divided into global high-load thresholds and module high-load thresholds. Normally the global high-load threshold is the default threshold and the module high-load threshold is configurable. When no module high-load threshold is set, the global high-load threshold is used.
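The fallback from a module high-load threshold to the global high-load threshold can be sketched as follows. This is a minimal illustration; the function name `resolve_threshold`, the index names and the numeric thresholds are hypothetical and not taken from the patent.

```python
from typing import Mapping, Optional

# Hypothetical global defaults; the patent does not fix any threshold values.
GLOBAL_HIGH_LOAD_THRESHOLDS = {"cpu": 0.80, "memory": 0.85}


def resolve_threshold(
    index: str,
    module_thresholds: Optional[Mapping[str, float]] = None,
    global_thresholds: Mapping[str, float] = GLOBAL_HIGH_LOAD_THRESHOLDS,
) -> float:
    """Return the module threshold for a load index if one is set, else the global one."""
    if module_thresholds is not None and index in module_thresholds:
        return module_thresholds[index]
    return global_thresholds[index]
```

For example, a module that sets only a CPU threshold would still be checked against the global memory threshold.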
Referring to the architecture diagram of Fig. 3, the monitor judges whether a high load value exists among the load values of all in-operation servers.
203: if any load value of any first server is greater than its corresponding load threshold, determining that a high load value exists among the load values of all first servers;
When any load value of any first server is greater than its corresponding load threshold, that load value is determined to be a high load value, that is, a high load value is determined to exist among the load values of all first servers, and step 204 is performed. Referring to the architecture diagram of Fig. 3, when the monitor determines that a high load value exists among the load values of all in-operation servers, it triggers a high-load event and notifies the AC of that event.
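Steps 202 and 203 amount to a scan over every (server, load index) pair. The sketch below assumes load values are held in a nested dictionary keyed by server and load index; that layout and the function name `find_high_loads` are illustrative choices, not part of the patent.

```python
from typing import Dict, List, Mapping, Tuple


def find_high_loads(
    loads: Mapping[str, Mapping[str, float]],
    thresholds: Mapping[str, float],
) -> List[Tuple[str, str, float]]:
    """Return every (server, load index, value) whose value exceeds its threshold."""
    high = []
    for server, values in loads.items():
        for index, value in values.items():
            if value > thresholds[index]:
                high.append((server, index, value))
    return high


def has_high_load(loads, thresholds) -> bool:
    """A single exceeding value on any server is enough to report high load."""
    return bool(find_high_loads(loads, thresholds))
```

Returning the offending pairs, rather than only a boolean, keeps enough detail for a component such as the AC to aggregate high-load events later.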
204: configuring at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server;
In one embodiment, configuring at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server includes, but is not limited to:
judging whether the cooldown time has elapsed at the current time;
if the cooldown time has elapsed, determining a server capacity expansion strategy according to the load values of each first server, and configuring at least one second server for capacity expansion and the operating environment of the second server according to that strategy.
The cooldown time can be set according to actual needs, and this embodiment does not limit its length. During the cooldown time, even if a high load value is determined to exist among the load values of all first servers, the expansion strategy is not determined and no second server or operating environment is configured. There are two reasons. First, load spikes may still occur shortly after an automatic expansion. Second, services may not yet have been assigned to the newly added servers, so expanding again immediately would waste server resources. Setting a cooldown time therefore avoids, as far as possible, a further expansion before services have been assigned to the newly added servers. It also limits, as far as possible, the consumption of idle servers by sustained high load caused by malicious external attacks, and thus protects the idle servers.
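The cooldown check can be sketched as a small gate that remembers when the last expansion happened. This is a minimal illustration under the assumption that time is passed in as a number of seconds; the class name `CooldownGate` and its methods are hypothetical.

```python
from typing import Optional


class CooldownGate:
    """Blocks a new expansion until the cooldown after the previous one has elapsed."""

    def __init__(self, cooldown_seconds: float) -> None:
        if cooldown_seconds < 0:
            raise ValueError("cooldown_seconds must be non-negative")
        self.cooldown_seconds = cooldown_seconds
        self.last_expansion: Optional[float] = None

    def may_expand(self, now: float) -> bool:
        """True if no expansion has happened yet or the cooldown has elapsed."""
        if self.last_expansion is None:
            return True
        return now - self.last_expansion >= self.cooldown_seconds

    def record_expansion(self, now: float) -> None:
        """Start a new cooldown window at the given time."""
        self.last_expansion = now
```

Passing `now` explicitly, for instance from `time.monotonic()`, keeps the gate easy to test and independent of wall-clock adjustments.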
In one embodiment, determining the server capacity expansion strategy according to the load values of each first server includes, but is not limited to:
determining whether the loads of the first servers are balanced;
if the loads of the first servers are balanced, calculating the number of servers to add according to the load values of each first server, and taking that number as the server capacity expansion strategy.
In the method provided by this embodiment, automatic expansion is allowed when the loads of the first servers are determined to be balanced, that is, the step of determining the expansion strategy is then performed. Automatic expansion can also be configured according to the characteristics of different services.
In one embodiment, determining whether the loads of the first servers are balanced includes, but is not limited to:
calculating, from the load values of all first servers, the load mean corresponding to each load value, and comparing each load value with its corresponding load mean;
if the difference between every load value and its corresponding load mean is no greater than a preset value, determining that the loads of the first servers are balanced.
The preset value can be set differently for different services, and this embodiment does not limit its size. For example, the preset value can be 30% of the load mean.
In addition, referring to the flow chart of the load-balancing algorithm in Fig. 4, the process of determining whether the loads of the first servers are balanced corresponds to the load-balancing algorithm of Fig. 4. There, the average avg corresponds to the load mean, and avg multiplied by the service load fluctuation percentage f gives the preset value. When the load of any first server is determined to be unbalanced, automatic expansion is not allowed and the expansion ends.
For ease of understanding, take as an example first servers 1, 2 and 3 and load indices A, B and C. The load values of server 1 for indices A, B and C are 1A, 1B and 1C; those of server 2 are 2A, 2B and 2C; those of server 3 are 3A, 3B and 3C; and the preset value is 30% of the load mean. From the load values of all first servers, the load mean for index A is (1A+2A+3A)/3, the load mean for index B is (1B+2B+3B)/3, and the load mean for index C is (1C+2C+3C)/3. Load values 1A, 2A and 3A are each compared with (1A+2A+3A)/3, load values 1B, 2B and 3B with (1B+2B+3B)/3, and load values 1C, 2C and 3C with (1C+2C+3C)/3. If the difference between each of 1A, 2A and 3A and the mean (1A+2A+3A)/3 is no greater than 30% of that mean, the difference between each of 1B, 2B and 3B and the mean (1B+2B+3B)/3 is no greater than 30% of that mean, and the difference between each of 1C, 2C and 3C and the mean (1C+2C+3C)/3 is no greater than 30% of that mean, then the loads of the first servers are determined to be balanced.
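The balance check in the example above can be sketched in Python as follows. The nested-dictionary layout, the function name `is_load_balanced` and the use of the absolute difference are illustrative assumptions; the 30% default mirrors the example value of the fluctuation percentage f.

```python
from typing import Mapping


def is_load_balanced(
    loads: Mapping[str, Mapping[str, float]],
    fluctuation: float = 0.30,
) -> bool:
    """True if, for every load index, each server's value is within avg * fluctuation of avg."""
    if not loads:
        raise ValueError("at least one server is required")
    # Assumes every server reports the same set of load indices.
    indices = next(iter(loads.values())).keys()
    for index in indices:
        values = [server_values[index] for server_values in loads.values()]
        avg = sum(values) / len(values)
        preset = avg * fluctuation  # preset value = avg * f
        if any(abs(value - avg) > preset for value in values):
            return False
    return True
```

With three servers whose index A values are 50, 60 and 55, the mean is 55 and the preset value 16.5, so every difference (5, 5, 0) passes. Values of 10, 50 and 90 give a mean of 50 and a preset value of 15, so the check fails and expansion would end.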
In step 203 above, when any load value of any first server is greater than its corresponding load threshold, that load value is determined to be a high load value. When the loads of the first servers are balanced, the number of servers to add can be calculated according to the following formula:
where n denotes the number of servers on which a high load value exists.
In addition, the relation between the number of servers to add and the number of idle servers, and the relation between the number of servers to add and the number of running servers, may also affect the number of servers to add. The number of servers to add can therefore be set according to the actual situation.
After the number of servers to add has been taken as the server capacity expansion strategy, at least one second server for capacity expansion and the operating environment of the second server can be configured according to that strategy.
Referring to the architecture diagram of Fig. 3, the AC receives the high-load events reported by the monitor, aggregates them, computes and formulates the server capacity expansion strategy, and issues the strategy to the KM in the form of an expansion instruction.
In addition, refer to the architecture diagram of the expansion instruction issuing and execution system in Fig. 5. This system consists of the traffic control system, the servers to be put into operation, the AC (Alter Controller, a change control unit), the KM (Keeper Master, a cluster manager), the RM (Resource Master, a service resource server), the Transparent Proxy or PA (Permission Assignment, a permission server), and the service back-end permission granting system. The servers to be put into operation correspond to the second servers for capacity expansion.
Configuring at least one second server for capacity expansion and its operating environment according to the expansion strategy corresponds to issuing and executing the expansion instruction. The AC selects servers, according to the expansion strategy, from the buffer pool formed by all servers to be put into operation, and issues the expansion instruction to the KM. The KM distributes the instruction to the keepers of the servers selected by the AC. Each keeper obtains the service resource file from the RM (pulls the service resources) and installs the operating environment, thereby completing the expansion instruction. The service resource file indicates the path information of the service.
After completing the expansion instruction, each keeper reports its execution result to the KM. Once the KM has received the execution results reported by all keepers, it aggregates them and reports the aggregated result to the AC. Based on the aggregated result, the AC chooses either to start the permission application or to stop the expansion. When the AC chooses to start the permission application, step 205 is performed.
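The issue, execute and aggregate cycle described above can be sketched as follows. The patent does not state the criterion by which the AC chooses between starting the permission application and stopping the expansion, so this sketch assumes the strictest policy: proceed only if every selected server succeeded. The names `ExpansionSummary`, `run_expansion` and `execute_on_keeper` are hypothetical, and the callable stands in for the keeper pulling resources from the RM and installing the operating environment.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class ExpansionSummary:
    succeeded: List[str]
    failed: List[str]
    decision: str  # "apply_permissions" or "stop_expansion"


def run_expansion(
    buffer_pool: Sequence[str],
    count: int,
    execute_on_keeper: Callable[[str], bool],
) -> ExpansionSummary:
    """Select servers from the buffer pool, run the instruction on each, aggregate results."""
    if count < 1:
        raise ValueError("count must be at least 1")
    if count > len(buffer_pool):
        raise LookupError("not enough servers in the buffer pool")
    selected = list(buffer_pool[:count])  # AC selects servers from the pool
    results = {server: bool(execute_on_keeper(server)) for server in selected}
    succeeded = [s for s, ok in results.items() if ok]
    failed = [s for s, ok in results.items() if not ok]
    # Assumed policy: any failure stops the expansion.
    decision = "apply_permissions" if not failed else "stop_expansion"
    return ExpansionSummary(succeeded, failed, decision)
```

A more lenient policy, such as continuing with only the servers that succeeded, would change just the line that computes `decision`.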
It should be noted that the second server for capacity expansion can be a virtual image server or a physical server, and this embodiment places no specific limitation on this. The type of server can thus be chosen according to need in practice, which makes the capacity expansion more flexible.
205: applying for the back-end service permission of the second server, and assigning services to the second server.
In one embodiment, applying for the back-end service permission of the second server includes, but is not limited to:
judging whether automatic application for the back-end service permission is authorized;
if automatic application is authorized, applying directly for the back-end service permission of the second server.
In one embodiment, after judging whether automatic application for the back-end service permission is authorized, the method further includes, but is not limited to:
if automatic application is not authorized, applying to a proxy for the back-end service permission of the second server.
Referring to the expansion instruction issuing and execution system in Fig. 5, when automatic application for the back-end service permission is authorized, the PA can connect directly to the service back-end permission granting system and apply in real time for the back-end service permission of the configured server, that is, apply for IP (Internet Protocol) authorization. When automatic application is not authorized, the back-end service permission of the second server can be applied for through the proxy server, because the proxy server has already obtained access rights to the service back-end permission granting system. Specifically, an iptables forwarding rule is sent to the configured server so that request packets, matched by destination IP and port, are forwarded through the proxy server. The service back-end permission granting system is thus accessed by way of the proxy server, which solves the permission problem during capacity expansion.
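The choice between the two application paths can be sketched as a small dispatcher. The two callables are placeholders for the PA's direct application and for installing the forwarding rule that routes requests through the proxy server; their names and signatures are hypothetical and no real permission-system or iptables interface is implied.

```python
from typing import Callable


def apply_backend_permission(
    server_ip: str,
    auto_apply_authorized: bool,
    apply_direct: Callable[[str], None],
    apply_via_proxy: Callable[[str], None],
) -> str:
    """Apply for the back-end service permission of one server and report the path used."""
    if auto_apply_authorized:
        # PA connects directly to the permission granting system (IP authorization).
        apply_direct(server_ip)
        return "direct"
    # Otherwise requests are forwarded through the already-authorized proxy server.
    apply_via_proxy(server_ip)
    return "proxy"
```

Keeping both paths behind one function means the caller, for instance the AC, does not need to know which back-end authentication system a given service uses.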
With these two ways of applying for the back-end service permission of the configured server, the permission is applied for automatically and without response delay, which improves the efficiency of the application.
Referring to the expansion instruction issuing and execution system in Fig. 5, once the back-end service permission of the second server has been applied for and the Transparent Proxy or PA has reported the success of the application to the AC, the AC notifies the traffic control system, which directs service traffic to the configured server through nginx or a unified access-layer gateway. This completes the capacity expansion of the server and balances the load.
In the method provided by this embodiment, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and its operating environment are configured according to the load values of each first server, the back-end service permission of the second server is applied for, and services are then assigned to the second server. Capacity expansion of the server is thereby automated, and because the expansion is not restricted by service or user, the range of application of server capacity expansion is widened.
Referring to Fig. 6, an embodiment of the present invention provides a device for capacity expansion of a server, the device being configured to perform the server capacity expansion method provided by any of the above embodiments. The device comprises:
an acquisition module 601, configured to collect the operation data of each first server;
a calculation module 602, configured to calculate the load values of each first server from the operation data of that first server;
a configuration module 603, configured to configure, when a high load value exists among the load values of all first servers, at least one second server for capacity expansion and the operating environment of the second server according to the load values of each first server;
an application module 604, configured to apply for the back-end service permission of the second server;
an assignment module 605, configured to assign services to the second server.
In one embodiment, referring to Fig. 7, the acquisition module 601 comprises:
an obtaining unit 6011, configured to obtain the sampling period;
a collecting unit 6012, configured to collect the operation data of each first server according to the sampling period.
In one embodiment, referring to Fig. 8, the device further comprises:
a first judging module 606, configured to judge whether each load value of each first server is greater than the load threshold corresponding to that load value;
a second judging module 607, configured to determine, when any load value of any first server is greater than its corresponding load threshold, that a high load value exists among the load values of all first servers.
As one embodiment, referring to Fig. 9, the configuration module 603 further comprises:
A first judging unit 6031, configured to judge whether the current time has reached the cooling time;
A determining unit 6032, configured to determine, when the current time has reached the cooling time, a server capacity expansion strategy according to the load values of each first server;
A configuration unit 6033, configured to configure at least one second server for capacity expansion and the running environment of the second server according to the server capacity expansion strategy.
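The cooling-time judgment of the first judging unit might look like the following Python sketch. This passage does not define how the cooling time is measured; the sketch assumes it is a minimum interval that must elapse after the previous expansion, and the parameter names are hypothetical.

```python
from typing import Optional


def cooling_time_reached(
    now_s: float,
    last_expansion_s: Optional[float],
    cooling_time_s: float,
) -> bool:
    """Return True if a new expansion may be considered.

    Assumption of this sketch: the cooling time is an interval counted from
    the previous expansion. With no previous expansion, nothing has to cool
    down, so the check passes.
    """
    if last_expansion_s is None:
        return True
    return now_s - last_expansion_s >= cooling_time_s
```

Gating on a cooling time keeps one sustained load spike from triggering several expansions before the newly configured servers have started to take load.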
As one embodiment, referring to Figure 10, the determining unit 6032 comprises:
A determining subunit 60321, configured to determine whether the load of each first server is balanced;
A calculating subunit 60322, configured to calculate, when the load of each first server is balanced, the number of servers to be added according to the load values of each first server;
A processing subunit 60323, configured to use the number of servers to be added as the server capacity expansion strategy.
As one embodiment, the determining subunit 60321 is configured to calculate, according to the load values of all first servers, the load mean corresponding to each load value, and to compare each load value with its corresponding load mean; when the difference between every load value and its corresponding load mean is not greater than a preset value, the load of each first server is determined to be balanced.
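The balance determination can be sketched in Python as below. Two points are assumptions of the sketch rather than statements of the embodiment: the "difference" is taken as an absolute difference, and the rule in `servers_to_add` (scale the server count so that the mean load falls to a hypothetical per-item target) is only one plausible way to calculate the number of servers to add, since this passage gives no formula.

```python
import math
from typing import Dict

Loads = Dict[str, Dict[str, float]]  # server name -> load item -> value


def is_load_balanced(load_values: Loads, preset_value: float) -> bool:
    """Compare every load value with the mean of its item across all servers."""
    servers = list(load_values.values())
    for item in servers[0]:  # assumes every server reports the same items
        values = [per_server[item] for per_server in servers]
        mean = sum(values) / len(values)
        if any(abs(v - mean) > preset_value for v in values):
            return False
    return True


def servers_to_add(load_values: Loads, target_load: Dict[str, float]) -> int:
    """Hypothetical rule: add enough servers to bring each mean down to target."""
    n = len(load_values)
    needed = n
    for item, target in target_load.items():
        mean = sum(s[item] for s in load_values.values()) / n
        needed = max(needed, math.ceil(n * mean / target))
    return needed - n
```

Checking balance first matters for a rule like this one: the count is derived from mean load, and a mean says little about the cluster when individual servers deviate widely from it.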
As one embodiment, referring to Figure 11, the application module 604 comprises:
A second judging unit 6041, configured to judge whether the permission system authorizes automatic application for the background service authority;
A first application unit 6042, configured to apply directly for the background service authority of the second server when the permission system authorizes automatic application for the background service authority.
As one embodiment, referring to Figure 12, the application module 604 further comprises:
A second application unit 6043, configured to apply to an agent for the background service authority of the second server when the permission system does not authorize automatic application for the background service authority.
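The two application paths can be illustrated with the following Python sketch. The callbacks `apply_directly` and `apply_via_agent` are hypothetical stand-ins for whatever interface the permission system and the agent actually expose; neither is described in the embodiment.

```python
from typing import Callable


def apply_for_authority(
    server: str,
    auto_apply_authorized: bool,
    apply_directly: Callable[[str], None],
    apply_via_agent: Callable[[str], None],
) -> str:
    """Apply for the background service authority of one second server.

    Returns which path was taken, "direct" or "agent".
    """
    if auto_apply_authorized:
        # The permission system authorizes automatic application.
        apply_directly(server)
        return "direct"
    # Otherwise the application is routed to an agent.
    apply_via_agent(server)
    return "agent"
```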
With the device provided by this embodiment of the present invention, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and the running environment of the second server are configured according to the load values of each first server, the background service authority of the second server is applied for, and the service is then assigned to the second server. Automatic capacity expansion of the server is thereby achieved, and because the expansion is not restricted by the service or the user, the range of application of server capacity expansion is broadened.
Figure 13 is a schematic structural diagram of a server in an embodiment of the present invention. The server 1300 may vary considerably depending on its configuration or performance, and may include one or more central processing units (CPUs) 1322 (for example, one or more processors), a memory 1332, and one or more storage media 1330 (for example, one or more mass storage devices) that store application programs 1342 or data 1344. The memory 1332 and the storage medium 1330 may provide transient storage or persistent storage. The program stored in the storage medium 1330 may include one or more modules (not marked in the figure), and each module may include a series of instruction operations for the server 1300:
Collecting the operating data of each first server, and calculating the load values of each first server according to the operating data of each first server;
If a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server;
Applying for the background service authority of the second server, and assigning the service to the second server.
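The three instruction operations above can be tied together in a short Python sketch. Every step is passed in as a callable because the embodiment does not fix how operating data is collected, how servers are configured, or how authority is applied for; all names here are hypothetical.

```python
from typing import Any, Callable, Dict, List

Loads = Dict[str, Dict[str, float]]  # server name -> load item -> value


def expand_if_needed(
    collect: Callable[[], Any],
    compute_loads: Callable[[Any], Loads],
    thresholds: Dict[str, float],
    configure: Callable[[Loads], List[str]],
    apply_authority: Callable[[str], None],
    assign_service: Callable[[str], None],
) -> List[str]:
    """Run one pass of the expansion flow; return the second servers added."""
    operating_data = collect()
    loads = compute_loads(operating_data)
    high = any(
        value > thresholds[item]
        for per_server in loads.values()
        for item, value in per_server.items()
    )
    if not high:
        return []  # no high load value, so nothing is expanded
    second_servers = configure(loads)  # servers plus running environment
    for server in second_servers:
        apply_authority(server)  # background service authority first
        assign_service(server)   # then assign the service
    return second_servers
```

In the sketch, authority is applied for before any service is assigned, following the order in which the operations are listed.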
In another embodiment, the following instructions are further included:
Collecting the operating data of each first server comprises:
Obtaining a sampling period, and collecting the operating data of each first server according to the sampling period.
In another embodiment, the following instructions are further included:
After calculating the load values of each first server according to the operating data of each first server, the operations further comprise:
Judging whether each load value of each first server is greater than the load threshold corresponding to that load value;
If any load value of any first server is greater than the load threshold corresponding to that load value, determining that a high load value exists among the load values of all first servers.
In another embodiment, the following instructions are further included:
Configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server comprises:
Judging whether the current time has reached the cooling time;
If the current time has reached the cooling time, determining a server capacity expansion strategy according to the load values of each first server, and configuring at least one second server for capacity expansion and the running environment of the second server according to the server capacity expansion strategy.
In another embodiment, the following instructions are further included:
Determining the server capacity expansion strategy according to the load values of each first server comprises:
Determining whether the load of each first server is balanced;
If the load of each first server is balanced, calculating the number of servers to be added according to the load values of each first server, and using the number of servers to be added as the server capacity expansion strategy.
In another embodiment, the following instructions are further included:
Determining whether the load of each first server is balanced comprises:
Calculating, according to the load values of all first servers, the load mean corresponding to each load value, and comparing each load value with its corresponding load mean;
If the difference between every load value and its corresponding load mean is not greater than a preset value, determining that the load of each first server is balanced.
In another embodiment, the following instructions are further included:
Applying for the background service authority of the second server comprises:
Judging whether the permission system authorizes automatic application for the background service authority;
If the permission system authorizes automatic application for the background service authority, applying directly for the background service authority of the second server.
In another embodiment, the following instructions are further included:
After judging whether the permission system authorizes automatic application for the background service authority, the operations further comprise:
If the permission system does not authorize automatic application for the background service authority, applying to an agent for the background service authority of the second server.
Further, the central processing unit 1322 may be configured to communicate with the storage medium 1330 and to execute, on the server 1300, the series of instruction operations in the storage medium 1330.
The server 1300 may also include one or more power supplies 1326, one or more wired or wireless network interfaces 1350, one or more input/output interfaces 1358, and/or one or more operating systems 1341, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
With the server provided by this embodiment of the present invention, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and the running environment of the second server are configured according to the load values of each first server, the background service authority of the second server is applied for, and the service is then assigned to the second server. Automatic capacity expansion of the server is thereby achieved, and because the expansion is not restricted by the service or the user, the range of application of server capacity expansion is broadened.
An embodiment of the present invention further provides a computer-readable storage medium. The computer-readable storage medium may be the one included in the memory of the above embodiment, or it may exist separately without being assembled into a terminal. The computer-readable storage medium stores one or more programs, and the one or more programs are used by one or more processors to perform a capacity expansion method for a server, the method comprising:
Collecting the operating data of each first server, and calculating the load values of each first server according to the operating data of each first server;
If a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server;
Applying for the background service authority of the second server, and assigning the service to the second server.
Assuming the above is the first possible implementation, then in a second possible implementation provided on the basis of the first possible implementation, the memory of the terminal further contains instructions for performing the following operations:
Collecting the operating data of each first server comprises:
Obtaining a sampling period, and collecting the operating data of each first server according to the sampling period.
In a third possible implementation provided on the basis of the first possible implementation, the memory of the terminal further contains instructions for performing the following operations:
After calculating the load values of each first server according to the operating data of each first server, the operations further comprise:
Judging whether each load value of each first server is greater than the load threshold corresponding to that load value;
If any load value of any first server is greater than the load threshold corresponding to that load value, determining that a high load value exists among the load values of all first servers.
In a fourth possible implementation provided on the basis of the first possible implementation, the memory of the terminal further contains instructions for performing the following operations:
Configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server comprises:
Judging whether the current time has reached the cooling time;
If the current time has reached the cooling time, determining a server capacity expansion strategy according to the load values of each first server, and configuring at least one second server for capacity expansion and the running environment of the second server according to the server capacity expansion strategy.
In a fifth possible implementation provided on the basis of the fourth possible implementation, the memory of the terminal further contains instructions for performing the following operations:
Determining the server capacity expansion strategy according to the load values of each first server comprises:
Determining whether the load of each first server is balanced;
If the load of each first server is balanced, calculating the number of servers to be added according to the load values of each first server, and using the number of servers to be added as the server capacity expansion strategy.
In a sixth possible implementation provided on the basis of the fifth possible implementation, the memory of the terminal further contains instructions for performing the following operations:
Determining whether the load of each first server is balanced comprises:
Calculating, according to the load values of all first servers, the load mean corresponding to each load value, and comparing each load value with its corresponding load mean;
If the difference between every load value and its corresponding load mean is not greater than a preset value, determining that the load of each first server is balanced.
In a seventh possible implementation provided on the basis of the first possible implementation, the memory of the terminal further contains instructions for performing the following operations:
Applying for the background service authority of the second server comprises:
Judging whether the permission system authorizes automatic application for the background service authority;
If the permission system authorizes automatic application for the background service authority, applying directly for the background service authority of the second server.
In an eighth possible implementation provided on the basis of the seventh possible implementation, the memory of the terminal further contains instructions for performing the following operations:
After judging whether the permission system authorizes automatic application for the background service authority, the operations further comprise:
If the permission system does not authorize automatic application for the background service authority, applying to an agent for the background service authority of the second server.
With the computer-readable storage medium provided by this embodiment of the present invention, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and the running environment of the second server are configured according to the load values of each first server, the background service authority of the second server is applied for, and the service is then assigned to the second server. Automatic capacity expansion of the server is thereby achieved, and because the expansion is not restricted by the service or the user, the range of application of server capacity expansion is broadened.
An embodiment of the present invention provides a graphical user interface. The graphical user interface is used on a terminal, and the terminal comprises a touch-screen display, a memory, and one or more processors for executing one or more programs. The graphical user interface comprises:
Collecting the operating data of each first server, and calculating the load values of each first server according to the operating data of each first server;
If a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server;
Applying for the background service authority of the second server, and assigning the service to the second server.
With the graphical user interface provided by this embodiment of the present invention, when a high load value exists among the load values of the first servers, at least one second server for capacity expansion and the running environment of the second server are configured according to the load values of each first server, the background service authority of the second server is applied for, and the service is then assigned to the second server. Automatic capacity expansion of the server is thereby achieved, and because the expansion is not restricted by the service or the user, the range of application of server capacity expansion is broadened.
It should be noted that, when the server capacity expansion device provided by the above embodiments performs server capacity expansion, the division into the functional modules described above is used only as an example. In practical applications, the above functions may be assigned to different functional modules as required; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above. In addition, the server capacity expansion device provided by the above embodiments and the embodiments of the server capacity expansion method belong to the same concept; for the specific implementation process, refer to the method embodiments, which are not repeated here.
The sequence numbers of the above embodiments of the present invention are for description only and do not indicate the relative merits of the embodiments.
A person of ordinary skill in the art will understand that all or part of the steps of the above embodiments may be implemented by hardware, or by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, and the storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the protection scope of the present invention.

Claims (16)

1. A capacity expansion method for a server, characterized in that the method comprises:
Collecting the operating data of each first server, and calculating the load values of each first server according to the operating data of each first server;
If a high load value exists among the load values of all first servers, configuring at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server;
Applying for the background service authority of the second server, and assigning the service to the second server.
2. The method according to claim 1, characterized in that the collecting of the operating data of each first server comprises:
Obtaining a sampling period, and collecting the operating data of each first server according to the sampling period.
3. The method according to claim 1, characterized in that, after the calculating of the load values of each first server according to the operating data of each first server, the method further comprises:
Judging whether each load value of each first server is greater than the load threshold corresponding to that load value;
If any load value of any first server is greater than the load threshold corresponding to that load value, determining that a high load value exists among the load values of all first servers.
4. The method according to claim 1, characterized in that the configuring of at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server comprises:
Judging whether the current time has reached the cooling time;
If the current time has reached the cooling time, determining a server capacity expansion strategy according to the load values of each first server, and configuring at least one second server for capacity expansion and the running environment of the second server according to the server capacity expansion strategy.
5. The method according to claim 4, characterized in that the determining of the server capacity expansion strategy according to the load values of each first server comprises:
Determining whether the load of each first server is balanced;
If the load of each first server is balanced, calculating the number of servers to be added according to the load values of each first server, and using the number of servers to be added as the server capacity expansion strategy.
6. The method according to claim 5, characterized in that the determining of whether the load of each first server is balanced comprises:
Calculating, according to the load values of all first servers, the load mean corresponding to each load value, and comparing each load value with its corresponding load mean;
If the difference between every load value and its corresponding load mean is not greater than a preset value, determining that the load of each first server is balanced.
7. The method according to claim 1, characterized in that the applying for the background service authority of the second server comprises:
Judging whether the permission system authorizes automatic application for the background service authority;
If the permission system authorizes automatic application for the background service authority, applying directly for the background service authority of the second server.
8. The method according to claim 7, characterized in that, after the judging of whether the permission system authorizes automatic application for the background service authority, the method further comprises:
If the permission system does not authorize automatic application for the background service authority, applying to an agent for the background service authority of the second server.
9. A capacity expansion device for a server, characterized in that the device comprises:
An acquisition module, configured to collect the operating data of each first server;
A computing module, configured to calculate the load values of each first server according to the operating data of each first server;
A configuration module, configured to, when a high load value exists among the load values of all first servers, configure at least one second server for capacity expansion and the running environment of the second server according to the load values of each first server;
An application module, configured to apply for the background service authority of the second server;
A distribution module, configured to assign the service to the second server.
10. The device according to claim 9, characterized in that the acquisition module comprises:
An acquiring unit, configured to obtain a sampling period;
A collecting unit, configured to collect the operating data of each first server according to the sampling period.
11. The device according to claim 9, characterized in that the device further comprises:
A first judging module, configured to judge whether each load value of each first server is greater than the load threshold corresponding to that load value;
A second judging module, configured to determine, when any load value of any first server is greater than the load threshold corresponding to that load value, that a high load value exists among the load values of all first servers.
12. The device according to claim 9, characterized in that the configuration module comprises:
A first judging unit, configured to judge whether the current time has reached the cooling time;
A determining unit, configured to determine, when the current time has reached the cooling time, a server capacity expansion strategy according to the load values of each first server;
A configuration unit, configured to configure at least one second server for capacity expansion and the running environment of the second server according to the server capacity expansion strategy.
13. The device according to claim 12, characterized in that the determining unit comprises:
A determining subunit, configured to determine whether the load of each first server is balanced;
A calculating subunit, configured to calculate, when the load of each first server is balanced, the number of servers to be added according to the load values of each first server;
A processing subunit, configured to use the number of servers to be added as the server capacity expansion strategy.
14. The device according to claim 13, characterized in that the determining subunit is configured to calculate, according to the load values of all first servers, the load mean corresponding to each load value, and to compare each load value with its corresponding load mean; and, when the difference between every load value and its corresponding load mean is not greater than a preset value, to determine that the load of each first server is balanced.
15. The device according to claim 9, characterized in that the application module comprises:
A second judging unit, configured to judge whether the permission system authorizes automatic application for the background service authority;
A first application unit, configured to apply directly for the background service authority of the second server when the permission system authorizes automatic application for the background service authority.
16. The device according to claim 15, characterized in that the application module further comprises:
A second application unit, configured to apply to an agent for the background service authority of the second server when the permission system does not authorize automatic application for the background service authority.
CN201410173228.0A 2014-04-25 2014-04-25 Method and device for capacity expansion of server Pending CN105024842A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410173228.0A CN105024842A (en) 2014-04-25 2014-04-25 Method and device for capacity expansion of server

Publications (1)

Publication Number Publication Date
CN105024842A true CN105024842A (en) 2015-11-04

Family

ID=54414572

Country Status (1)

Country Link
CN (1) CN105024842A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090113055A1 (en) * 2007-10-11 2009-04-30 Vonage Holdings Corp. Method and apparatus for fulfilling information requests in a networked environment
CN102508693A (en) * 2011-09-29 2012-06-20 华中科技大学 Web server capacity expansion system based on virtual machine
CN103248622A (en) * 2013-04-09 2013-08-14 中国科学院计算技术研究所 Method and system for guaranteeing service quality of automatic retractable online video

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017162034A1 (en) * 2016-03-22 2017-09-28 阿里巴巴集团控股有限公司 Loading method and system
WO2017181830A1 (en) * 2016-04-19 2017-10-26 中兴通讯股份有限公司 Synchronous capacity enlargement method and device for server, and storage medium
CN106254470B (en) * 2016-08-08 2019-06-14 广州品唯软件有限公司 Distributed job fragment distribution method and device
CN106254470A (en) * 2016-08-08 2016-12-21 广州唯品会信息科技有限公司 Distributed job burst distribution method and device
CN106453641A (en) * 2016-11-24 2017-02-22 深圳市小满科技有限公司 Dynamic enterprise cloud service platform expansion method, device and system
CN106453641B (en) * 2016-11-24 2018-05-22 深圳市小满科技有限公司 Enterprise's cloud service platform dynamic capacity-expanding method, apparatus and system
CN106598699A (en) * 2016-11-30 2017-04-26 华为技术有限公司 Virtual machine management method and device
CN106598699B (en) * 2016-11-30 2019-11-29 华为技术有限公司 A kind of management method and device of virtual machine
WO2018157446A1 (en) * 2017-03-02 2018-09-07 深圳市科脉技术股份有限公司 O2o multi-channel communication method and system
CN106936903A (en) * 2017-03-02 2017-07-07 深圳市科脉技术股份有限公司 The communication means and system of O2O multichannels
CN111386676A (en) * 2018-03-21 2020-07-07 华为技术有限公司 Control method of application programming interface API gateway cluster and API gateway cluster
US11362952B2 (en) 2018-03-21 2022-06-14 Huawei Cloud Computing Technologies Co., Ltd. Application programing interface API gateway cluster control method and API gateway cluster
US11743187B2 (en) 2018-03-21 2023-08-29 Huawei Cloud Computing Technolgoies Co., Ltd. Application programing interface (API) gateway cluster control method and API gateway cluster
CN108768877A (en) * 2018-07-20 2018-11-06 网宿科技股份有限公司 A kind of distribution method of burst flow, device and proxy server
CN109669758A (en) * 2018-09-11 2019-04-23 深圳平安财富宝投资咨询有限公司 Concocting method, device, equipment and the storage medium of server resource
WO2020134786A1 (en) * 2018-12-26 2020-07-02 中兴通讯股份有限公司 Server expansion method and device, server and storage medium
CN111625195A (en) * 2020-05-26 2020-09-04 北京百度网讯科技有限公司 Method and device for server capacity expansion
CN111625195B (en) * 2020-05-26 2023-11-07 北京百度网讯科技有限公司 Method and device for server capacity expansion
CN111782147A (en) * 2020-06-30 2020-10-16 北京百度网讯科技有限公司 Method and apparatus for cluster scale-up
CN112199251A (en) * 2020-09-25 2021-01-08 同程网络科技股份有限公司 Method, system and device for realizing dynamic increase and decrease of servers through timing tasks
CN112671570A (en) * 2020-12-16 2021-04-16 微梦创科网络科技(中国)有限公司 Method and system for automatically expanding and contracting capacity

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20151104