CN108881432A - Cloud computing cluster load dispatching method based on GA algorithm - Google Patents

Cloud computing cluster load dispatching method based on GA algorithm Download PDF

Info

Publication number
CN108881432A
CN108881432A CN201810622604.8A CN201810622604A CN108881432A CN 108881432 A CN108881432 A CN 108881432A CN 201810622604 A CN201810622604 A CN 201810622604A CN 108881432 A CN108881432 A CN 108881432A
Authority
CN
China
Prior art keywords
task
load
algorithm
time
node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810622604.8A
Other languages
Chinese (zh)
Inventor
阮浩德
陈静
吴晓生
马星
马力
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Urban & Rural Planning And Design Institute
Original Assignee
Guangdong Urban & Rural Planning And Design Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Urban & Rural Planning And Design Institute filed Critical Guangdong Urban & Rural Planning And Design Institute
Priority to CN201810622604.8A priority Critical patent/CN108881432A/en
Publication of CN108881432A publication Critical patent/CN108881432A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L67/61Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources taking into account QoS or priority requirements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L67/62Establishing a time schedule for servicing the requests
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L67/63Routing a service request depending on the request content or context

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Physiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computer Hardware Design (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Cloud computing cluster load dispatching method based on GA algorithm, including:S1:Task queue is read, the operating parameter of each task is found out from " service role execution time model table ";S2:Operating parameter is encoded, operating parameter set is converted into chromosome bit string set, is produced initial population Group (g), g=0;S3:Calculate the fitness value of each individual in initial population;S4:Judge termination condition g>=Gmax (Gmax is maximum reproductive order of generation), if condition is unsatisfactory for, executes S5, otherwise goes to S8:S5:It carries out selection operation and forms next-generation population Group (g), g+=1, using the selection algorithm of simulated annealing;S6:With probability PcCarry out crossover operation;S7:With probability PmMutation operation is carried out, S3 is gone to;S8:Algorithm terminates, and exports current optimal scheduling scheme.The request task corresponding time is greatly shortened in the present invention, and GIS service quality can be also obviously improved in the case where user volume increases, and improves users' satisfaction degree.

Description

Cloud computing cluster load dispatching method based on GA algorithm
Technical field
The present invention relates to network load scheduling fields, and in particular to a kind of cloud computing cluster load dispatch based on GA algorithm Method.
Background technique
The rapid development of cloud computing application, to its performance and service quality, more stringent requirements are proposed, multi-core processor with And the rise of virtualization technology, it is the further development of cloud computing cluster, common load balance dispatching is common square in the prior art Method is scheduled, and load balance scheduling algorithm is a key technology of cluster, oneself is through there is numerous dispatching algorithms in difference Effective use has been obtained under scene.In the Web system of multi-user's high concurrent, the quality of load-balancing algorithm is directly affected QoS is a hot spot of computer and related fields research all the time.In general, the algorithm partition is static load balancing Algorithm and Dynamic Load-balancing Algorithm.
1. static load balancing algorithm
The algorithm mainly considers from pending task angle, it is believed that system oneself known that the operation of task is special Property, such as the type of task, the time cost spent required for running.Then load-balancing algorithm is according to so known task Characteristic calculates the optimal task schedule method of salary distribution.How as far as possible accurately obtain task operation characteristic, and how root Go out optimal scheduling scheme according to these information plannings, is the key problem to be paid close attention to of the algorithm.
But static scheduling algorithm does not consider cluster operating status, and the operating status of cluster is continually changing, static state Algorithm will appear the uneven situation of distribution unavoidably, waste resource utilization ratio.
2. Dynamic Load-balancing Algorithm
One important algorithm is to need load balancer timely to obtain from the viewpoint of the operating status of cluster The use state of cluster Current resource such as CPU, memory, network etc. and the operating status of task.Algorithm is being scheduled it Before, each node load situation can be evaluated according to system running state, current task generally can all be distributed to the smallest section of load Point, to realize the scheduling of task.It is how accurate and timely to obtain cluster operating status and make load evaluation be the algorithm Key problem.
Relative to static load balancing algorithm, dynamic load-balancing algorithm considers the situation of change of group system, and Targetedly distribution is adjusted.The problem of load distribution, has generality, it may be considered that introduces from other field some new Method new approaches solve the problems, such as.
Related science man has done comparison to the geometric space operation based on spatial database and has been concluded that the overwhelming majority Spatial operation requires consumption great number CPU and a large amount of I/O, the complexity of spatial data determine spatial operation be a kind of CPU and I/O intensity calculating task, and the difference of the complexity of different types of spatial operation is also very big.Space querying not only wants root Ask the index that condition calculates spatial object according to Check, also the object inquired be read out and is modeled with objectification, be it is a kind of with Task based on I/O, supplemented by CPU calculating;Spatial overlay analysis is a kind of binary operation, not only sets mass data from storage Standby graftabl, it is also necessary to carry out complex space operation, complexity will often be higher by space querying one or more quantity Grade.In a cloud with the different complexities such as space querying, dynamic cartography, spatial data editing, spatial analysis, network analysis On the one hand computing cluster system, the pressure faced are that user volume is big, be on the other hand because of the greatest differences between request task And the laod unbalance between the clustered node amplified.
Simple load-balancing algorithm such as polling method, be for the cluster that load requests differ greatly it is unsuitable, it is some Increasingly complex Dynamic Load-balancing Algorithm can cope with such demand, but the complexity of itself and to system-computed and logical The occupancy of news resource can introduce some new problems, affect the development process of cloud computing cluster.
Summary of the invention
In view of the deficiencies of the prior art, the purpose of the present invention is intended to provide a kind of virtualization cloud computing cluster load balance calculation Method can also be obviously improved GIS service quality in the case where user volume increases so that the request task corresponding time is most short, improve Users' satisfaction degree.
To achieve the above object, the present invention adopts the following technical scheme that:
Cloud computing cluster load dispatching method based on GA algorithm, including:
S1:Task queue is read, the operating parameter of each task is found out from " service role execution time model table ";
S2:Operating parameter is encoded, operating parameter set is converted into chromosome bit string set, produces initial population Group (g), g=0;
S3:Calculate the fitness value of each individual in initial population;
S4:Judge termination condition g>=Gmax (Gmax is maximum reproductive order of generation), if condition is unsatisfactory for, executes S5, no Then go to S8:
S5:It carries out selection operation and forms next-generation population Group (g), g+=1, using the selection algorithm of simulated annealing; S6:With probability PcCarry out crossover operation;
S7:With probability PmMutation operation is carried out, S3 is gone to;
S8:Algorithm terminates, and exports current optimal scheduling scheme.
Further, a gene represents a task to be allocated in the chromosome bit string, and a gene representation is (t, l, p, n), wherein t indicates mission number, and l indicates task scale, i.e. the expected of task executes the time, and p indicates that task is preferential Grade, p is a Boolean, indicates high priority with 1 (True), can be prioritized in intra-node task execution, 0 (False) it indicates low priority, execution can be postponed, n indicates the clustered node server number that the task is assigned to, n's Value takes random manner to generate, i.e., from the value range of n 0<n<Positive integer is randomly selected in Ns and gives n assignment, forms a dye Colour solid;One chromosome is made of the task that one group of band distributes, the corresponding load dispatch scheme of a chromosome, a dyeing The expression way of body be (t, l, p, n) | t>C,l>0, p=0 | 1,0<n<Ns }, wherein C is that load strategy converts threshold values (C=K* Ns), 1 is positive integer, and Ns is clustered node quantity.
Further, the fitness value is indicated by fitness function, and the fitness function is positioned asIts The node load amount deviation of middle reflection clustered node load balance degreeClustered node average load Amount isNode load total amount isWherein clustered node loads Measure (Li) it is current distribution load value (Lc) and expected (algorithm distribution) load value (Le) the sum of i.e. Li=Lc+Le, node currently bears Load number of tasks is Tc, n-th of task is expected to execute the time as ctn, it is contemplated that loading commissions number is Te, when n-th of task is expected to execute Between be etn
Further, the current distribution load value (Lc) and node present load number of tasks TcIt is current to be referred to as clustered node Load information, clustered node current load information are obtained by the way of load capacity snapshot.
Further, the mode of the load capacity snapshot refers to when executing scheduling scheme, and task distribution condition is synchronous It being recorded in task execution real-time status snapshot table, each node also feeds back execution status of task into load balancer, and When more new state record;And when executing fitness evaluation, task real-time status snapshot table can be generated, it is then real-time to task State snapshot table is calculated and is analyzed, and obtains the value of present load amount.
Further, the task real-time status snapshot table record task ID, task priority, it is expected execute the time, Time started and state.
Further, described expected when executing the time by average execution in " GIS service task execution time model table " Between obtain, Jian time of beginning is time for distribute to node of task, and state value includes " 0 " and " 1 ", and " 0 " expression task execution fails, " 1 " expression task is allocated successfully, if task execution success, deletes the scheduler task from real-time task table, is only protected in table Stay the task record for being not carried out or executing failure, the executing failure of the task also can process periodically, reattempt to execution or to Client reflects task execution failure.
Further, the selection algorithm of simulated annealing is expressed as in the step S5p(ai) it is select probability, n is population G={ a1,a2,...,anNumber, wherein Individual aiAdaptive value be f (ai), T is annealing temperature.
Further, with probability P in the step S6cCarrying out crossover operation includes:
S61. it strikes a bargain in current population by the high group of individuals of one group of adaptive value of certain rule selection and matches pond, from mating pond Randomly select a pair of of individual;
S62. one or more positions are randomly selected from chromosome bit string as position to be intersected;
S63. according to crossover probability PcIntersected, wherein crossover probability PcIt is set as 0.4~0.9, the individual of pairing is existed At crossover location, respective bit string content is exchanged, forms two new individuals.
Further, the mutation probability P in the step S7mIt is 0.001~0.1.
The beneficial effects of the present invention are:
According to the particularity and complexity of GIS operation and application, establishes task based access control priority and execute the GIS of time Task model divides priority to request task as genetic algorithm gene coding basis, in scheduling, allow the response time it is short, The operation being commonly used preferentially is executed, to improve user satisfaction;The genetic algorithm for load balancing is devised, including Fitness function, genetic operator and control parameter, and form algorithm flow so that the hardware resource in cluster obtain it is optimal The utilization of change avoids the occurrence of certain excessively high and some nodes of node resource utilization rate and the relatively low or even idle feelings of utilization rate occurs Condition.
Detailed description of the invention
Fig. 1 is that the present invention is based on the step schematic diagrames of the cloud computing cluster load dispatching method of GA algorithm;
Fig. 2 is mixed scheduling strategy flow chart of the present invention;
Fig. 3 is service role execution time model table of the present invention;
Fig. 4 is initial population schematic diagram of the present invention;
Fig. 5 is load information securing mechanism figure of the present invention;
Fig. 6 is task execution real-time status snapshot table of the present invention;
Fig. 7 is 20 concurrent user's average response time comparison diagrams of the invention;
Fig. 8 is 50 concurrent user's average response time comparison diagrams of the invention.
Specific embodiment
In the following, being described further in conjunction with attached drawing and specific embodiment to the present invention:
" GA algorithm " is expressed as genetic algorithm, as shown in Figure 1, the cloud computing cluster load dispatching method based on GA algorithm, Including:
S1:Task queue is read, the operating parameter of each task is found out from " service role execution time model table ".
The case where above-mentioned " service role execution time model table ", is as follows:
As shown in figure 3, recite mission number, GIS service function, data mark in " service role execution time model table " Knowledge, average performance times and priority, each task are expressed with t (l, p), are respectively indicated mission number, task scale and are appointed Business priority.The task scale that l is indicated is reached using the expection implementation schedule of task here.P is a Boolean, with 1 (True) it indicates high priority, can be prioritized in intra-node task execution, 0 (False) indicates low priority, can quilt Postpone execution." service role execution time model table " illustrates the example model of task presentation, the average performance times in table The average value that will be tested according to cluster single node.
At the same time, selection Genetic Strategies are also needed, the cluster that the present invention constructs is a symmetrical cluster, i.e., each node As computing capability with service type is, in addition, starting more complicated Dynamic Load-balancing Algorithm and can expend and certain be System resource, leads to some delays, the invention proposes a kind of by poll and the scheduling plan combined based on genetic algorithm approach Slightly, as shown in Figure 2, it is desired to be able to adapt to various cluster pressure scenes, improve overall performance.
1. requesting density Dr=Nr/Tapan
Indicate the GIS number of requests in the unit time.Wherein TapanThe sampling time section of request, NrFor in the period Request number.
2. load strategy converts threshold values C=K*Ns
Indicate the critical value of switching load strategy.Wherein Ns is clustered node quantity, and it is one just whole that K, which is to adjust constant, Numerical value.
The strategy is when cluster load pressure is smaller, using static robin manner.As request density DrReach one Determine threshold values C when, enable the load balance scheduling strategy based on GA algorithm.
S2:Operating parameter is encoded, operating parameter set is converted into chromosome bit string set, produces initial population Group (g), g=0.
A gene represents a task to be allocated in above-mentioned chromosome bit string, and a gene representation is (t, l, p, n), Wherein t indicates mission number, and l indicates task scale, i.e. the expected of task executes the time, and p indicates task priority, and p is one Boolean indicates high priority with 1 (True), can be prioritized in intra-node task execution that 0 (False) indicates low Priority, can be postponed execution, and n indicates that the clustered node server number that the task is assigned to, the value of n take random side Formula generates, i.e., from the value range of n 0<n<Positive integer is randomly selected in Ns and gives n assignment, forms a chromosome;One chromosome It is made of the task that one group of band distributes, the corresponding load dispatch scheme of a chromosome, the expression way of a chromosome is {(t,l,p,n)|t>C,l>0, p=0 | 1,0<n<Ns }, wherein C is that load strategy converts threshold values (C=K*Ns), and 1 is positive integer, Ns is clustered node quantity.
It wherein needs to determine population scale before initialization of population, and population scale meaning is as follows:
Population scale, i.e. chromosome number have a significant impact to performance of genetic algorithms, and the population that a scale is n is lost The complexity for passing the mode that operator is therefrom generated and examined is O (n3).Population scale is bigger, and group more has diversity, algorithm A possibility that falling into local convergence is with regard to smaller, but the expansion of population scale also brings along dramatically increasing for calculation amount, to reduce Convergence rate.Population scale is too small, then will limit the search space of algorithm, it is likely that the case where Premature Convergence occur.General feelings Scholar suggests that population scale is between 20 to 200 under condition.Task execution time prediction model itself in view of the present embodiment Non-precision, furthermore the such scheduling system real time of cluster is stronger, needs quickly to make scheduling decision, therefore receive from improving It holds back speed to set out, population scale takes 20 here.
The individual of initial population is generated by random fashion here.T, l and p may be used in chromosome four-tuple (t, l, p, n) With determination, random manner is taken to generate the value of n here, i.e., from the value range of n 0<n<Positive integer is randomly selected in Ns to n Assignment forms a chromosome.Initial population schematic diagram is as shown in Figure 4.
S3:Calculate the fitness value of each individual in initial population.
Above-mentioned fitness function is assessed the chromosome adaptability of population, and executing the survival of the fittest is existence rule, Directly determine Evolutionary direction and the behavior of group.The chromosome for having higher adaptive value is considered to have preferable bit string knot A possibility that structure has preferable survival ability, generates outstanding offspring is bigger.For the present invention, fitness value is higher Load dispatch scheme is easier to be retained in scheme pond (population), and more easily output optimization scheme.
Since forementioned gene coding can not reflect the sequence of scheduling, the present invention by optimal scheduling conceptual internal according to The mode of priority ranking realizes the target of user satisfaction, i.e., in clustered node internal task queue, priority is high to appoint Business first carries out.Here fitness function is considered from the optimization aim of cluster server state, on each node server Utilization rate CPU is positively related in number of tasks and task execution time and server, can reflect the load state of CPU, in turn Reflect node overall load situation.The scheduling scheme that dispatching algorithm obtains is a desired value of node present load, uses task The summation of time is executed to quantify the value.
The fitness value is indicated that the fitness function is positioned as by fitness functionWherein reflection collection The node load amount deviation of group node load balance degreeClustered node average load amount isNode load total amount isWherein clustered node load capacity (Li) it is current distribution load value (Lc) and expected (algorithm distribution) load value (Le) the sum of i.e. Li=Lc+Le, node present load Number of tasks is Tc, n-th of task is expected to execute the time as ctn, it is contemplated that loading commissions number is Te, n-th of task is expected to execute the time For etn.Under normal circumstances, a fitness value always nonnegative value, and the bigger adaptability of numerical value is better.And node load amount deviation It is smaller, illustrate that allocation plan is more outstanding, chromosome fitness is higher, therefore, expresses here using the inverse of load deviation suitable Angle value is answered, numerical value is bigger, and load distribution scheme is better.
Above-mentioned current distribution load value (Lc) and node present load number of tasks TcIt is referred to as clustered node current load information, Clustered node current load information is obtained by the way of load capacity snapshot.
Node present load amount is also the factor for having to consider, is obtained used here as the mode of load capacity snapshot current Load information.The mechanism that load information obtains is executing scheduling scheme as shown in figure 5, the mode of the load capacity snapshot refers to When, by task distribution condition synchronous recording into task execution real-time status snapshot table, each node is also anti-by execution status of task It is fed in load balancer, with the record for the state of timely updating;And when executing fitness evaluation, task real-time status can be generated Then snapshot table is calculated and is analyzed to task real-time status snapshot table, obtains the value of present load amount.
As shown in fig. 6, the task real-time status snapshot table record task ID, task priority, it is expected execute the time, Time started and state.The expected execution time is by the average performance times in " GIS service task execution time model table " It obtaining, Jian time beginning is the time that task is distributed to node, and state value includes " 0 " and " 1 ", and " 0 " indicates task execution failure, " 1 " expression task is allocated successfully, if task execution success, deletes the scheduler task from real-time task table, is only protected in table Stay the task record for being not carried out or executing failure, the executing failure of the task also can process periodically, reattempt to execution or to Client reflects task execution failure.
S4:Judge termination condition g>=Gmax (Gmax is maximum reproductive order of generation), if condition is unsatisfactory for, executes S5, into Otherwise row genetic manipulation goes to S8, terminate algorithm:
S5:It carries out selection operation and forms next-generation population Group (g), g+=1, using the selection algorithm of simulated annealing;Institute The selection algorithm for stating simulated annealing in step S5 is expressed asp(ai) it is select probability, n For population G={ a1,a2,...,anNumber, wherein individual aiAdaptive value be f (ai), T is annealing temperature.The algorithm is stronger Local search ability the characteristics of allow to from falling into locally optimal solution in search process, and in the difference of genetic evolution Annealing temperature can be used to control different pressure in stage.
S6:With probability PcCarry out crossover operation;Operation simulation nature evolve in sexual reproduction process, realize The recombination of gene, on one side by original outstanding gene genetic to the next generation.The process of intersection includes following steps:
S61. it strikes a bargain in current population by the high group of individuals of one group of adaptive value of certain rule selection and matches pond, from mating pond Randomly select a pair of of individual;
S62. one or more positions are randomly selected from chromosome bit string as position to be intersected;
S63. according to crossover probability PcIntersected, wherein crossover probability PcIt is set as 0.4~0.9, the individual of pairing is existed At crossover location, respective bit string content is exchanged, forms two new individuals.
Crossover operator include a bit, two o'clock and multiple point crossover, consistent intersection etc..Here using the algorithm unanimously intersected, i.e., Each in chromosome bit string is all pressed into equal probabilities and carries out random uniform crossover.The general value of crossover probability be 0.4~ 0.9, higher probability can accelerate convergence rate, but also will increase the probability of " precocity ", and crossover probability takes median 0.6 here.
S7:With probability PmMutation operation is carried out, S3 is gone to;Operation simulation nature evolve in gene mutation it is existing As, mutation operator often can be to avoiding " precocity " from generating good preventive effect, on the one hand, mutation operator to evolve in lose The allele of mistake is restored, and to keep population diversity, while the variation of appropriateness, also can be improved the local search of algorithm Efficiency.For the stability for guaranteeing population, so that variation individual and its male parent are unlikely to generate excessive difference, mutation probability value It is typically small, usually 0.001~0.1.
Here mutation operation is reduced to, the variation to chromosome (t, l, p, n), only need to make a variation to the value of n, i.e., from 0, which randomly selects a value into Ns, replaces original value.
S8:Algorithm terminates, and exports current optimal scheduling scheme.
Inventive algorithm is tested:Algorithm of the invention is inserted into cloud computing cluster load balance module, with before Polling type, Smallest connection number algorithm be compared, the advance of algorithm is verified.The cluster of experiment is 5 physics Server, 4 are used as GIS server, two virtual machines of every server disposition, 8 dummy nodes formed altogether, every node deployment One GIS service process;1 is used as Web server, disposes load balance scheduling module.The hardware configuration of server such as following table:
Server software configuration such as following table:
Experimental data is:
Tile data:Wuhan topographic map, 1-9 grades.
Vector data:1 to 5000 topographic map, point, line, area's figure layer each one, element sum is about 2500.
Test scene:The operation of map view, attribute query and spatial analysis, operation time are carried out to WebGIS system at random Number such as following table,
Map view Hear to tile browsing Attribute query Cutting pattern analysis Buffer area buffer area
Number 5 5 5 1 1
Foundation is recorded in this, as LoadRunner test scene.Test is directed to 20 users, 50 users, test run respectively Time is 10 minutes.
Can most reflect performance indicator in the performance test results is average response time, and test result is aggregated, as schemed institute Show, by Fig. 7, Fig. 8 as it can be seen that based on heredity dispatching algorithm can effectively shorten the task under multi-user concurrent situation complete when Between, hence it is evident that improve the quality of GIS service.When user concurrent amount is little, effect is not particularly evident, but when user volume after Continuous when increasing, improvement is with regard to obvious.
It will be apparent to those skilled in the art that can make various other according to the above description of the technical scheme and ideas Corresponding change and deformation, and all these changes and deformation all should belong to the protection scope of the claims in the present invention Within.

Claims (10)

1. the cloud computing cluster load dispatching method based on GA algorithm, which is characterized in that including:
S1:Task queue is read, the operating parameter of each task is found out from " service role execution time model table ";
S2:Operating parameter is encoded, operating parameter set is converted into chromosome bit string set, produces initial population Group (g), g=0;
S3:Calculate the fitness value of each individual in initial population;
S4:Judge termination condition g>=Gmax (Gmax is maximum reproductive order of generation), if condition is unsatisfactory for, executes S5, otherwise turns To S8:
S5:It carries out selection operation and forms next-generation population Group (g), g+=1, using the selection algorithm of simulated annealing;
S6:With probability PcCarry out crossover operation;
S7:With probability PmMutation operation is carried out, S3 is gone to;
S8:Algorithm terminates, and exports current optimal scheduling scheme.
2. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that the dye A gene represents a task to be allocated in colour solid bit string, and a gene representation is (t, l, p, n), and wherein t indicates task Number, l indicate task scale, i.e. the expected of task executes the time, and p indicates task priority, and p is a Boolean, with 1 (True) it indicates high priority, can be prioritized in intra-node task execution, 0 (False) indicates low priority, can quilt Postpone execution, n indicates that the clustered node server number that the task is assigned to, the value of n take random manner to generate, i.e., from The value range 0 of n<n<Positive integer is randomly selected in Ns and gives n assignment, forms a chromosome;One chromosome is by one group of band point The matching of the task is constituted, the corresponding load dispatch scheme of a chromosome, the expression way of a chromosome be (t, l, p, n) | t>C,l>0, p=0 | 1,0<n<Ns }, wherein C is that load strategy converts threshold values (C=K*Ns), and 1 is positive integer, and Ns is cluster section Point quantity.
3. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that described suitable Angle value is answered to be indicated by fitness function, the fitness function is positioned asWherein reflect clustered node load balancing The node load amount deviation of degreeClustered node average load amount is Node load total amount isWherein clustered node load capacity (Li) it is current distribution load value (Lc) and expected (algorithm distribution) load value (Le) the sum of i.e. Li=Lc+Le, node present load number of tasks is Tc, n-th of task It is expected that executing the time is ctn, it is contemplated that loading commissions number is Te, n-th of task is expected to execute the time as etn
4. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that described to work as Preceding distribution load value (Lc) and node present load number of tasks TcIt is referred to as clustered node current load information, clustered node is currently born Information carrying breath is obtained by the way of load capacity snapshot.
5. the cloud computing cluster load dispatching method according to claim 4 based on GA algorithm, which is characterized in that described negative The mode of carrying capacity snapshot refers to when executing scheduling scheme, and task distribution condition synchronous recording is fast to task execution real-time status According in table, each node is also by execution status of task feedback into load balancer, with the record for the state of timely updating;And it is executing When fitness evaluation, task real-time status snapshot table can be generated, then task real-time status snapshot table is calculated and analyzed, Obtain the value of present load amount.
6. the cloud computing cluster load dispatching method according to claim 5 based on GA algorithm, which is characterized in that described State snapshot table records task ID, task priority, expected execution time, time started and state when pragmatic.
7. the cloud computing cluster load dispatching method according to claim 6 based on GA algorithm, which is characterized in that described pre- Phase execute the time by " GIS service task execution time model table " average performance times obtain, Jian begin the time be task to The time of node distribution, state value include " 0 " and " 1 ", and " 0 " indicates task execution failure, and " 1 " expression task is allocated successfully, if Task execution success, then delete the scheduler task from real-time task table, only retains in table and is not carried out or executes appointing for failure Business record, the executing failure of the task also can be processed periodically, reattempted to execution or failed to client reflection task execution.
8. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that the step The selection algorithm of simulated annealing is expressed as in rapid S5p(ai) it is select probability, n is kind Group G={ a1,a2,...,anNumber, wherein individual aiAdaptive value be f (ai), T is annealing temperature.
9. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that the step With probability P in rapid S6cCarrying out crossover operation includes:
S1. it strikes a bargain in current population by the high group of individuals of one group of adaptive value of certain rule selection and matches pond, it is random from mating pond Choose a pair of of individual;
S2. one or more positions are randomly selected from chromosome bit string as position to be intersected;
S3. according to crossover probability PcIntersected, wherein crossover probability PcIt is set as 0.4~0.9, the individual of pairing is being intersected At position, respective bit string content is exchanged, forms two new individuals.
10. the cloud computing cluster load dispatching method according to claim 1 based on GA algorithm, which is characterized in that described Mutation probability P in step S7mIt is 0.001~0.1.
CN201810622604.8A 2018-06-15 2018-06-15 Cloud computing cluster load dispatching method based on GA algorithm Pending CN108881432A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810622604.8A CN108881432A (en) 2018-06-15 2018-06-15 Cloud computing cluster load dispatching method based on GA algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810622604.8A CN108881432A (en) 2018-06-15 2018-06-15 Cloud computing cluster load dispatching method based on GA algorithm

Publications (1)

Publication Number Publication Date
CN108881432A true CN108881432A (en) 2018-11-23

Family

ID=64339471

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810622604.8A Pending CN108881432A (en) 2018-06-15 2018-06-15 Cloud computing cluster load dispatching method based on GA algorithm

Country Status (1)

Country Link
CN (1) CN108881432A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109726001A (en) * 2018-12-29 2019-05-07 中山大学 A kind of genetic algorithm for heterogeneous system
CN110109753A (en) * 2019-04-25 2019-08-09 成都信息工程大学 Resource regulating method and system based on various dimensions constraint genetic algorithm
CN110389819A (en) * 2019-06-24 2019-10-29 华中科技大学 A kind of dispatching method and system of computation-intensive batch processing task
CN110888719A (en) * 2019-09-18 2020-03-17 广州市巨硅信息科技有限公司 Distributed task scheduling system and method based on web service
CN111683132A (en) * 2020-06-04 2020-09-18 重庆英楼维兴信息科技有限公司 Business distribution method based on micro-service architecture and related device
CN112765463A (en) * 2021-01-12 2021-05-07 陈漩 Data management method for big data and user requirements and cloud computing server
CN113361833A (en) * 2020-03-02 2021-09-07 联芯集成电路制造(厦门)有限公司 Chemical mechanical polishing system and related dispatching management method
CN113411369A (en) * 2020-03-26 2021-09-17 山东管理学院 Cloud service resource collaborative optimization scheduling method, system, medium and equipment
CN117272838A (en) * 2023-11-17 2023-12-22 恒海云技术集团有限公司 Government affair big data platform data acquisition optimization method
CN117688102A (en) * 2023-12-28 2024-03-12 江苏北联宝德计算机系统有限公司 Operation data management system and method based on artificial intelligence

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102063339A (en) * 2010-12-21 2011-05-18 北京高森明晨信息科技有限公司 Resource load balancing method and equipment based on cloud computing system
CN103024048A (en) * 2012-12-17 2013-04-03 南京邮电大学 Resources scheduling method under cloud environment
CN103365727A (en) * 2013-07-09 2013-10-23 南京大学 Host load forecasting method in cloud computing environment
CN103902375A (en) * 2014-04-11 2014-07-02 北京工业大学 Cloud task scheduling method based on improved genetic algorithm
CN104811491A (en) * 2015-04-17 2015-07-29 华南理工大学 Cloud computing resource scheduling method based on genetic algorithm
CN104932938A (en) * 2015-06-16 2015-09-23 中电科软件信息服务有限公司 Cloud resource scheduling method based on genetic algorithm
CN105740051A (en) * 2016-01-27 2016-07-06 北京工业大学 Cloud computing resource scheduling realization method based on improved genetic algorithm

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102063339A (en) * 2010-12-21 2011-05-18 北京高森明晨信息科技有限公司 Resource load balancing method and equipment based on cloud computing system
CN103024048A (en) * 2012-12-17 2013-04-03 南京邮电大学 Resources scheduling method under cloud environment
CN103365727A (en) * 2013-07-09 2013-10-23 南京大学 Host load forecasting method in cloud computing environment
CN103902375A (en) * 2014-04-11 2014-07-02 北京工业大学 Cloud task scheduling method based on improved genetic algorithm
CN104811491A (en) * 2015-04-17 2015-07-29 华南理工大学 Cloud computing resource scheduling method based on genetic algorithm
CN104932938A (en) * 2015-06-16 2015-09-23 中电科软件信息服务有限公司 Cloud resource scheduling method based on genetic algorithm
CN105740051A (en) * 2016-01-27 2016-07-06 北京工业大学 Cloud computing resource scheduling realization method based on improved genetic algorithm

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李源林: "基于服务器虚拟化的网络GIS集群关键技术研究", 《中国博士学位论文全文数据库 基础科学辑》 *
胡艳华;唐新来: "基于改进遗传算法的云计算任务调度算法", 《计算机技术与发展》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109726001A (en) * 2018-12-29 2019-05-07 中山大学 A kind of genetic algorithm for heterogeneous system
CN110109753A (en) * 2019-04-25 2019-08-09 成都信息工程大学 Resource regulating method and system based on various dimensions constraint genetic algorithm
CN110389819A (en) * 2019-06-24 2019-10-29 华中科技大学 A kind of dispatching method and system of computation-intensive batch processing task
CN110888719A (en) * 2019-09-18 2020-03-17 广州市巨硅信息科技有限公司 Distributed task scheduling system and method based on web service
CN113361833A (en) * 2020-03-02 2021-09-07 联芯集成电路制造(厦门)有限公司 Chemical mechanical polishing system and related dispatching management method
US11397425B2 (en) 2020-03-02 2022-07-26 United Semiconductor (Xiamen) Co., Ltd. CMP polishing system and associated pilot management system
CN113411369A (en) * 2020-03-26 2021-09-17 山东管理学院 Cloud service resource collaborative optimization scheduling method, system, medium and equipment
CN111683132A (en) * 2020-06-04 2020-09-18 重庆英楼维兴信息科技有限公司 Business distribution method based on micro-service architecture and related device
CN111683132B (en) * 2020-06-04 2022-07-19 重庆金窝窝网络科技有限公司 Business distribution method based on micro-service architecture and related device
CN112765463A (en) * 2021-01-12 2021-05-07 陈漩 Data management method for big data and user requirements and cloud computing server
CN117272838A (en) * 2023-11-17 2023-12-22 恒海云技术集团有限公司 Government affair big data platform data acquisition optimization method
CN117272838B (en) * 2023-11-17 2024-02-02 恒海云技术集团有限公司 Government affair big data platform data acquisition optimization method
CN117688102A (en) * 2023-12-28 2024-03-12 江苏北联宝德计算机系统有限公司 Operation data management system and method based on artificial intelligence
CN117688102B (en) * 2023-12-28 2024-06-07 江苏北联宝德计算机系统有限公司 Operation data management system and method based on artificial intelligence

Similar Documents

Publication Publication Date Title
CN108881432A (en) Cloud computing cluster load dispatching method based on GA algorithm
CN111027736B (en) Micro-service combined deployment and scheduling method under multi-objective optimization
Wang et al. Load balancing task scheduling based on genetic algorithm in cloud computing
CN110134495B (en) Container cross-host online migration method, storage medium and terminal equipment
CN105704255B (en) A kind of server load balancing method based on genetic algorithm
CN103631657B (en) A kind of method for scheduling task based on MapReduce
CN110231976B (en) Load prediction-based edge computing platform container deployment method and system
CN108170530B (en) Hadoop load balancing task scheduling method based on mixed element heuristic algorithm
CN105471985A (en) Load balance method, cloud platform computing method and cloud platform
CN105718364A (en) Dynamic assessment method for ability of computation resource in cloud computing platform
CN104461739B (en) A kind of virtual machine batch dispositions method based on cloudsim platforms
CN108845886B (en) Cloud computing energy consumption optimization method and system based on phase space
Zhou et al. Strategy optimization of resource scheduling based on cluster rendering
Shukla et al. FAT-ETO: Fuzzy-AHP-TOPSIS-Based efficient task offloading algorithm for scientific workflows in heterogeneous fog–cloud environment
Song et al. A deep reinforcement learning-based task scheduling algorithm for energy efficiency in data centers
Li Optimization of task offloading problem based on simulated annealing algorithm in MEC
Tang Load balancing optimization in cloud computing based on task scheduling
CN114201306B (en) Multi-dimensional geographic space entity distribution method and system based on load balancing technology
Khattar et al. Multi-criteria-based energy-efficient framework for VM placement in cloud data centers
Mohd. Zin et al. Replication techniques in data grid environments
Mao et al. A fine-grained and dynamic MapReduce task scheduling scheme for the heterogeneous cloud environment
CN109040283A (en) A kind of modified load-balancing algorithm based on difference reaction type
Zhu et al. Load balancing algorithm for web server based on weighted minimal connections
Park et al. Reinforcement Learning based load balancing in a distributed heterogeneous storage system
Jarvis et al. Performance-based middleware services for grid computing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181123