CN111026533A - Workflow execution optimization method based on distributed estimation algorithm in cloud computing environment

Info

Publication number: CN111026533A
Application number: CN201911259945.4A
Authority: CN
Inventors: 谢毅; 桂奉献; 孙鹤
Original assignee: Zhejiang Gongshang University
Current assignee: Zhejiang Gongshang University
Priority date: 2019-12-10
Filing date: 2019-12-10
Publication date: 2020-04-17

Abstract

The invention discloses a workflow execution optimization method based on a distributed estimation algorithm (more commonly called an estimation of distribution algorithm, EDA) in a cloud computing environment, comprising the following steps: acquiring the information required for execution optimization; calculating the level value of each task; initializing the contemporary population; decoding and improving the contemporary population, calculating the fitness values of its individuals, and storing the optimal individual; and constructing an elite population, updating the probability model, and sampling the probability model to generate a new contemporary population, repeating until a termination condition is met and then outputting the execution optimization result. Compared with traditional methods, the invention adopts coding based on topological sorting and continuous biased coding, initial individual generation based on level and benefit ratio, serial individual decoding based on the insertion mode, individual improvement based on forward and backward decoding, new individual generation based on sampling, optimal individual storage and similar strategies, which improve the optimization capability and search efficiency of the algorithm.

Description

Workflow execution optimization method based on distributed estimation algorithm in cloud computing environment

Technical Field

The invention relates to the fields of computer technology, information technology and systems engineering, in particular to cloud workflow execution optimization, and more particularly to a workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment.

Background

A workflow in a cloud computing environment, "cloud workflow" for short, integrates cloud computing with workflow technology. It has broad application prospects in scientific computing and cross-organization business cooperation that require high computing performance and large-scale storage, in fields such as e-commerce, emergency management, supply chain management and health care. A cloud workflow involves multiple types of computing resources and multiple tasks with timing constraints between them; during execution, a virtual machine, as the minimum allocation unit of computing resources, receives and processes these tasks. Cloud workflow execution (or scheduling) optimization is the problem of configuring virtual machines, allocating workflow tasks to suitable virtual machines, and determining the execution order of the tasks so as to optimize performance indexes such as execution cost and response time, subject to the task timing constraints and user requirements. Execution optimization directly determines the performance of the whole cloud workflow system. With the rapid growth of process automation requirements in cloud computing environments, particularly for large-scale collaboration and for distributed e-commerce and scientific computing applications, cloud workflow execution optimization has become an important research topic.

Existing cloud workflow execution optimization is generally performed from the perspective of either resource configuration or task scheduling alone; research on the collaborative optimization of the two is lacking. In fact, resource configuration and task scheduling are two interacting optimization stages that jointly determine the performance indexes of cloud workflow execution, and optimizing the two stages collaboratively can effectively improve execution performance.

Therefore, as cloud workflows and their application requirements grow more complex, a more efficient integrated collaborative optimization method is urgently needed to solve the resource configuration and task scheduling optimization problems of cloud workflows and to improve the performance of cloud workflow execution.

Disclosure of Invention

Existing cloud workflow execution optimization is usually performed from the perspective of either resource configuration or task scheduling alone, lacks an integrated collaborative optimization method for the two, and consequently yields low execution performance. To address these defects, the invention provides a workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment, which effectively improves cloud workflow execution performance.

The technical scheme adopted by the invention for solving the technical problems is as follows: a workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment comprises the following steps:

step 1: acquiring information required by executing optimization of the cloud workflow;

acquire the task set T = {t₁, …, t_I}, where t_i denotes task i, i.e. the task numbered i, and I is the number of tasks to be scheduled;

acquire the timing relations between tasks: the parent task set PR_i and the subtask set SC_i of task i, where i = 1, …, I;

acquire the task-related parameters: the length len_i of task i, i.e. the number of instructions consumed when task i is processed by a virtual machine; the input file list IFL_i required when task i is processed; the output file list OFL_i generated after task i is processed; and the sizes of the files in these lists, where i = 1, …, I; task i is a parent task of task i⁺ if and only if there is a file that is both an output file of task i and an input file of task i⁺, namely:

file ∈ OFL_i ∧ file ∈ IFL_{i⁺};

acquire the set of virtual machine types in the cloud computing environment VM = {vm₁, vm₂, …, vm_J}, where J is the number of virtual machine types and vm_j denotes a class-j virtual machine;

acquire the virtual-machine-related parameters: the computing power ps_j, the bandwidth bw_j, the cost per unit time vc_j, the fixed lease-start cost fc_j, the minimum billing time unit ut_j and the minimum lease time ft_j of a class-j virtual machine; the cost of renting a class-j virtual machine is calculated as follows:

where lt is the lease time and j = 1, 2, …, J;

acquire the cost constraint Budget and the time constraint Deadline for workflow execution in the cloud computing environment; if there is no cost constraint, set Budget = MBV, and if there is no time constraint, set Deadline = MDV, where MBV is the upper cost limit and MDV is the upper time limit;

step 2: calculating a level value of the task;

for a start task i that has no parent task, the level value is:

lvl_i = 1 (1)

the level values of the other tasks are calculated using the following recursive formula:
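The recursive formula is not reproduced in the text above. The following minimal Python sketch assumes the common convention that a task's level is one more than the largest level among its parent tasks; that recursion, and the names `task_levels` and `example_parents`, are illustrative assumptions rather than the formula of the invention.

```python
from functools import lru_cache

def task_levels(parents):
    """Return {task: level} for a DAG given as {task: set of parent tasks}.

    Start tasks (no parents) get level 1, as in formula (1). For all other
    tasks this sketch ASSUMES level = 1 + max(level of each parent).
    """
    @lru_cache(maxsize=None)
    def level(task):
        ps = parents.get(task, ())
        if not ps:
            return 1
        return 1 + max(level(p) for p in ps)

    return {t: level(t) for t in parents}

# Hypothetical 5-task workflow: tasks 1 and 2 are start tasks.
example_parents = {1: set(), 2: set(), 3: {1}, 4: {1, 2}, 5: {3, 4}}
levels = task_levels(example_parents)
```

Under this assumption every task has a strictly larger level than each of its parents, which is what later allows an ordering by level to respect the timing constraints.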

step 3: initialize the contemporary population and let BtCh = Null;

generate 1 individual based on the level and the benefit ratio, and sample the initial probability model N−1 times to generate N−1 individuals, which together form the initial contemporary population, where N is the population size;

the individual encoding is ch = {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I}, where {gr₁,…,gr_I} is the task scheduling order list, a topological ordering of the task numbers; {gs₁,…,gs_I} is the virtual machine allocation list, in which gs_i is the number of the virtual machine instance assigned to the i-th scheduled task, with gs₁ = 1 and gs_i ≤ max{gs₁,…,gs_{i−1}} + 1; and {gt₁,…,gt_I} is the virtual machine type list, in which gt_i is the type of the virtual machine instance numbered i and each of gt₁,…,gt_I is an integer between 1 and J;
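The three constraints of this encoding can be checked mechanically. The sketch below is illustrative only: the function name `is_valid_individual` and the small workflow `dag` are hypothetical, and lists are 0-indexed in Python while the text numbers genes from 1.

```python
def is_valid_individual(gr, gs, gt, parents, num_types):
    """Check the three-part integer encoding described above.

    gr: scheduling order (task numbers); must be a topological order.
    gs: gs[i] is the instance number of the (i+1)-th scheduled task.
    gt: gt[k-1] is the type of the instance numbered k (1..num_types).
    parents: {task: set of parent tasks}.
    """
    n = len(gr)
    if not (len(gs) == len(gt) == n) or sorted(gr) != sorted(parents):
        return False
    # Topological order: every parent is scheduled before its child.
    position = {task: idx for idx, task in enumerate(gr)}
    for task, ps in parents.items():
        if any(position[p] > position[task] for p in ps):
            return False
    # Continuous biased coding: gs_1 = 1 and gs_i <= max(gs_1..gs_{i-1}) + 1.
    highest = 0
    for k in gs:
        if k < 1 or k > highest + 1:
            return False
        highest = max(highest, k)
    # Every type gene is an integer between 1 and J.
    return all(1 <= t <= num_types for t in gt)

# Hypothetical 3-task workflow in which task 1 is the parent of tasks 2 and 3.
dag = {1: set(), 2: {1}, 3: {1}}
ok = is_valid_individual([1, 3, 2], [1, 2, 1], [2, 1, 3], dag, num_types=3)
```

The bound on gs_i means instance numbers are introduced in increasing order, so two individuals that differ only in how instances are numbered map to the same code.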

the step of generating 1 individual based on the hierarchy and the benefit ratio comprises the following steps:

step A1: arrange the tasks in ascending order of level value, placing tasks with the same level value in random order, to form the individual's task scheduling order list {gr₁,…,gr_I};

step A2: generate the individual's virtual machine allocation list {gs₁,…,gs_I} and virtual machine type list {gt₁,…,gt_I} based on the benefit ratio, and obtain the execution time et_i and completion time f_i of every task, i = 1,…,I;

step A3: output the individual ch₁ = {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I} together with the execution time et_i and completion time f_i of every task, i = 1, 2, …, I, calculate its workflow response time rs₁, and end the operation;
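Step A1 can be sketched in a few lines of Python: shuffle first, then apply a stable sort by level, so that tasks with equal level keep their random relative order. The name `level_order` and the sample level values are hypothetical.

```python
import random

def level_order(levels, rng=random):
    """Step A1 sketch: sort tasks by level value, breaking ties randomly."""
    tasks = list(levels)
    rng.shuffle(tasks)                    # random order among equal levels
    tasks.sort(key=lambda t: levels[t])   # stable sort preserves that order
    return tasks

# Hypothetical level values for a 5-task workflow.
sample_levels = {1: 1, 2: 1, 3: 2, 4: 2, 5: 3}
order = level_order(sample_levels, random.Random(0))
```

If every task has a larger level value than each of its parents, any order produced this way is a topological order of the tasks.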

the probability model comprises a task scheduling sequence probability model PMS (g), a virtual machine allocation probability model PMA (g) and a virtual machine type probability model PMT (g);

where β_{i,i′}(g) is the probability that the i′-th scheduled task in generation g is t_i;

where α_{i,k}(g) is the probability that the virtual machine instance numbered k is assigned to the i-th scheduled task in generation g;

where δ_{k,j}(g) is the probability that the virtual machine instance numbered k has type j in generation g;

k＝1,…,I；

the probability model of the initial task scheduling sequence is as follows:

where STS_ρ = {t_i | ξ_i < ρ ≤ I − ζ_i} is the set of tasks that can be scheduled at the ρ-th position, ζ_i is the number of descendant tasks of task i, and ξ_i is the number of ancestor tasks of task i;

descendant tasks and ancestor tasks are defined as follows: if there is a task sequence t_{i₁}, t_{i₂}, …, t_{i_n} such that t_{i_k} is a parent task of t_{i_{k+1}} for every 1 ≤ k < n, then t_{i₁} is an ancestor task of t_{i_n}, and t_{i_n} is a descendant task of t_{i₁};
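The candidate sets STS_ρ follow directly from the ancestor and descendant counts. The sketch below computes them for a hypothetical five-task workflow; the function names and the example data are illustrative assumptions.

```python
def ancestors(parents):
    """Return {task: set of all ancestor tasks} from {task: set of parents}."""
    memo = {}

    def visit(task):
        if task not in memo:
            found = set()
            for p in parents[task]:
                found.add(p)
                found |= visit(p)
            memo[task] = found
        return memo[task]

    for t in parents:
        visit(t)
    return memo

def schedulable_sets(parents):
    """Return {rho: STS_rho} with STS_rho = {t_i | xi_i < rho <= I - zeta_i}."""
    anc = ancestors(parents)
    n = len(parents)
    xi = {t: len(a) for t, a in anc.items()}                 # ancestor counts
    zeta = {t: sum(t in a for a in anc.values()) for t in parents}  # descendant counts
    return {rho: {t for t in parents if xi[t] < rho <= n - zeta[t]}
            for rho in range(1, n + 1)}

# Hypothetical workflow: 1 -> 3, {1, 2} -> 4, {3, 4} -> 5.
example_parents = {1: set(), 2: set(), 3: {1}, 4: {1, 2}, 5: {3, 4}}
sts = schedulable_sets(example_parents)
```

A task with ξ_i ancestors cannot be scheduled before position ξ_i + 1, and a task with ζ_i descendants cannot be scheduled after position I − ζ_i, which is exactly the window that the set definition encodes.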

the initial virtual machine distribution probability model is as follows:

the initial virtual machine type probability model is as follows:

where J is the number of virtual machine types;

sampling the probability models PMS(g), PMA(g) and PMT(g) once to generate 1 individual comprises the following steps:

step B1: sample the virtual machine types:

step B1.1: let the variable k = 1;

step B1.2: obtain the probability that the virtual machine instance numbered k has type j, A_{k,j} = δ_{k,j}(g), j = 1,…,J, and calculate the cumulative probabilities:

step B1.3: generate a random number λ ∈ [0,1); if

then select type j and let gt_k = j;

step B1.4: let k = k + 1; if k ≤ I, go to step B1.2; otherwise the virtual machine type list has been obtained, go to step B2;

step B2: initializing a system state:

step B2.1: let the available time period list of every virtual machine instance be vatl_k = {[0, ∞]}, k = 1,…,I;

step B2.2: let the ready time of every task be rt_i = 0 and let the task set P(t_i) = PR_i, i = 1,…,I; let the task set RT = ∅ and the task set UT = T;

step B2.3: move every t_i in UT with P(t_i) = ∅ to RT; let the variable q = 1 and the variable MI = 1;

step B3: according to [β_{1,q}(g) … β_{I,q}(g)]^T, randomly select a task from RT by roulette and denote it t_i; let gr_q = i;

step B4: according to [α_{q,1}(g) … α_{q,I}(g)], randomly select a virtual machine instance number in [1, MI] by roulette, denote it k, and let gs_q = k; if k = MI, let MI = MI + 1;

step B5: assign t_i to the virtual machine instance numbered k:

step B5.1: calculate the execution time of t_i

step B5.2: in vatl_k, search from earliest to latest for an idle time period [ν_k, υ_k] satisfying υ_k − ν_k ≥ et_i and υ_k − et_i ≥ rt_i;

step B5.3: the start time of t_i is s_i = max{ν_k, rt_i} and its end time is f_i = s_i + et_i;

step B5.4: update the ready times of the subtasks of t_i

step B5.5: in the available time period list vatl_k, delete [ν_k, υ_k] and insert those of the intervals [ν_k, s_i] and [f_i, υ_k] whose length is greater than 0;

step B5.6: delete t_i from every set P(·) that contains it, and delete t_i from RT;

step B5.7: move every t_i in UT with P(t_i) = ∅ to RT;
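The insertion-mode placement of steps B5.2 to B5.5 can be sketched as a single function over one instance's list of idle periods. The name `insert_task` and the sample values are hypothetical; the execution time is taken as given, because its formula is not reproduced above.

```python
import math

def insert_task(vatl, ready_time, exec_time):
    """Place a task in the earliest idle period that can hold it.

    vatl: sorted list of idle (start, end) periods of one VM instance.
    Returns (start, finish, new_vatl).
    """
    for idx, (lo, hi) in enumerate(vatl):
        # The period must be long enough and must end late enough
        # for the task to fit after its ready time (step B5.2).
        if hi - lo >= exec_time and hi - exec_time >= ready_time:
            start = max(lo, ready_time)                 # step B5.3
            finish = start + exec_time
            # Keep only the leftover pieces of positive length (step B5.5).
            pieces = [p for p in ((lo, start), (finish, hi)) if p[1] - p[0] > 0]
            return start, finish, vatl[:idx] + pieces + vatl[idx + 1:]
    raise ValueError("no idle period fits the task")

slots = [(0.0, math.inf)]
s1, f1, slots = insert_task(slots, ready_time=5.0, exec_time=3.0)
s2, f2, slots = insert_task(slots, ready_time=0.0, exec_time=4.0)
```

In this example the second task is placed in the idle period left in front of the first one, which is the effect the insertion mode is designed to achieve.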

step B6: if RT is not empty, let q = q + 1 and go to step B3; otherwise go to step B7;

step B7: obtain the individual ch_n = {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I} together with the execution time et_i and completion time f_i of every task, i = 1, 2, …, I, calculate its workflow response time rs_n, n ∈ {2,…,N}, and end the operation;

step 4: decode and improve each individual in the contemporary population using FBI&D to obtain the workflow execution cost and response time of each individual, then calculate the relative fitness values of all infeasible individuals and the absolute fitness values of all feasible individuals; if BtCh = Null or the best individual in the contemporary population is better than the individual stored in BtCh, replace the content of BtCh with that best individual;

for each individual ch_n = {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I} in the population, n = 1,…,N, FBI&D comprises the following steps:

step C1: form the reverse individual of ch_n:

step C1.1: rearrange the task scheduling order list {gr₁,…,gr_I} in descending order of task completion time f_i, i.e. set the i-th gene value of the list to the number of the i-th-from-last completed task, i = 1,…,I, forming the reverse task scheduling order list;

step C1.2: to keep the original resource configuration scheme and the encoding valid, adjust the virtual machine allocation list {gs₁,…,gs_I} and the virtual machine type list {gt₁,…,gt_I}, forming the reverse virtual machine allocation list and the reverse virtual machine type list:

step C1.2.1: let the variable ε = 1 and the variable δ = 1; let the flag values flg₁ = … = flg_I = 0; let

k＝max{gs₁,…,gs_I}+1,…,I；

step C1.2.2: if flg_ε = 0, go to step C1.2.3; otherwise go to step C1.2.5;

step C1.2.3: find the scheduling position, in {gr₁,…,gr_I}, of the ε-th task of the reverse task scheduling order list; in ch_n, find the set ST of the numbers of all tasks that use the same virtual machine instance as that task; in the reverse task scheduling order list, find the set SI of scheduling positions of the tasks in ST;

step C1.2.4: for all i ∈ SI, set the i-th gene of the reverse virtual machine allocation list to δ and let flg_i = 1; let

then let δ = δ + 1;

step C1.2.5: let ε = ε + 1; if ε ≤ I, go to step C1.2.2; otherwise go to step C2;

step C2: decode the reverse individual using the serial reverse individual decoding method based on the insertion mode, obtaining the reverse completion times of all tasks and the workflow reverse response time; if the workflow reverse response time is less than rs_n, go to step C3; otherwise go to step C5;

step C3: form the forward individual ch_n = {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I}:

step C3.1: rearrange the reverse task scheduling order list in descending order of task reverse completion time, i.e. set the i-th gene value of the task scheduling order list to the number of the i-th-from-last completed task in the reverse schedule, i = 1,…,I, forming {gr₁,…,gr_I};

step C3.2: to keep the original resource configuration scheme and the encoding valid, adjust the reverse virtual machine allocation list and the reverse virtual machine type list, forming {gs₁,…,gs_I} and {gt₁,…,gt_I}:

step C3.2.1: let the variable ε = 1 and the variable δ = 1; let the flag values flg₁ = … = flg_I = 0; let

step C3.2.2: if flg_ε = 0, go to step C3.2.3; otherwise go to step C3.2.5;

step C3.2.3: find the scheduling position of task gr_ε in the reverse task scheduling order list; in the reverse individual, find the set ST of the numbers of all tasks that use the same virtual machine instance as that task; in {gr₁,…,gr_I}, find the set of scheduling positions of the tasks in ST, SI = {i | gr_i ∈ ST};

step C3.2.4: for all i ∈ SI, let gs_i = δ and flg_i = 1; let

then let δ = δ + 1;

step C3.2.5: let ε = ε + 1; if ε ≤ I, go to step C3.2.2; otherwise go to step C4;

step C4: decode the forward individual ch_n using the serial forward individual decoding method based on the insertion mode, obtaining the completion times f₁,…,f_I of all tasks and the workflow response time rs_n; if rs_n is less than the workflow reverse response time, go to step C1; otherwise go to step C5;

step C5: output the forward individual ch_n and its workflow response time rs_n, calculate its workflow execution cost ct_n, and end the operation;

the serial reverse individual decoding method based on the insertion mode decodes the reverse individual through the following steps:

step D1: let the reverse ready time of every task be

where

is the set of output files that the task exports to the shared database, i.e.

i = 1,…,I; let the available time period list of every virtual machine instance be vatl_k = {[0, ∞]},

let the variable ε = 1;

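step D2: select the ε-th task of the reverse task scheduling order list and denote its number i;

step D3: assign task i to the virtual machine instance given by the ε-th gene of the reverse virtual machine allocation list, and denote its number k: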

step D3.1: in vatl_k, search from earliest to latest for an idle time period [ν_k, υ_k] satisfying υ_k − ν_k ≥ et_i and

step D3.2: calculate the reverse start time of task i

and its reverse completion time

step D3.3: update the reverse ready times of the parent tasks of task i

step D3.4: in the available time period list vatl_k, delete [ν_k, υ_k] and insert those of the following intervals whose length is greater than 0:

and

step D4: let ε = ε + 1; if ε ≤ I, go to step D2; otherwise go to step D5;

step D5: obtain the reverse completion times of all tasks

i = 1,…,I, and the workflow reverse response time

end the operation;

the serial forward individual decoding method based on the insertion mode decodes the forward individual ch_n through the following steps:

step E1: let the ready times of all tasks be rt_i = 0, i = 1,…,I; let the variable ε = 1; let the available time period list of every virtual machine instance be vatl_k = {[0, ∞]}, k = 1,…,max{gs₁,…,gs_I};

step E2: select the task numbered i = gr_ε;

step E3: assign task i to the virtual machine instance numbered k = gs_ε based on the insertion mode:

step E3.1: in vatl_k, search from earliest to latest for an idle time period [ν_k, υ_k] satisfying υ_k − ν_k ≥ et_i and υ_k − et_i ≥ rt_i;

step E3.2: calculate the start time s_i = max{ν_k, rt_i} and the completion time f_i = s_i + et_i of task i;

step E3.3: update the ready times of the subtasks of task i

step E3.4: in the available time period list vatl_k, delete [ν_k, υ_k] and insert those of the intervals [ν_k, s_i] and [f_i, υ_k] whose length is greater than 0;

step E4: let ε = ε + 1; if ε ≤ I, go to step E2; otherwise go to step E5;

step E5: obtain the end times f_i of all tasks, i = 1,…,I, calculate the workflow response time rs_n, and end the operation;
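Steps E1 to E5 can be sketched as one loop over the scheduling order. The execution-time, ready-time and response-time formulas are not reproduced above, so this sketch makes three simplifying assumptions: execution times are supplied as input, a subtask becomes ready once all its parents have finished, and the response time is the latest completion time. The name `decode_forward` and the example data are hypothetical.

```python
import math

def decode_forward(gr, gs, exec_time, children):
    """Serial forward decoding sketch (steps E1-E5) with insertion mode.

    ASSUMPTIONS: exec_time[i] is already known, a subtask is ready when all
    of its parents have finished, and the response time is the latest
    completion time (file transfer times are ignored).
    """
    ready = {task: 0.0 for task in gr}                       # step E1
    vatl = {k: [(0.0, math.inf)] for k in set(gs)}
    finish = {}
    for task, k in zip(gr, gs):                              # steps E2 to E4
        et = exec_time[task]
        # Step E3.1: the last period is unbounded, so a fit is always found.
        for idx, (lo, hi) in enumerate(vatl[k]):
            if hi - lo >= et and hi - et >= ready[task]:
                break
        start = max(lo, ready[task])                         # step E3.2
        finish[task] = start + et
        for child in children.get(task, ()):                 # step E3.3
            ready[child] = max(ready[child], finish[task])
        pieces = [p for p in ((lo, start), (finish[task], hi)) if p[1] > p[0]]
        vatl[k][idx:idx + 1] = pieces                        # step E3.4
    return finish, max(finish.values())                      # step E5

# Hypothetical individual: five tasks on two virtual machine instances.
finish, rs = decode_forward(
    gr=[1, 2, 3, 4, 5],
    gs=[1, 2, 1, 2, 1],
    exec_time={1: 2.0, 2: 3.0, 3: 4.0, 4: 1.0, 5: 1.0},
    children={1: {3}, 2: {3, 4}},
)
```

In the example, task 5 is scheduled last but is inserted into the idle period that instance 1 has between tasks 1 and 3, so it does not lengthen the response time.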

step 5: if the termination condition is not met, go to step 6; otherwise go to step 8;

the termination condition is that the iteration has reached the designated generation TG, or that the best individual stored in BtCh has not improved for GG consecutive generations;
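The termination test reduces to two counters. A minimal sketch, in which the function name and the counter arguments are hypothetical while TG and GG are the parameters named in the text:

```python
def should_stop(generation, stagnant_generations, TG, GG):
    """Stop at generation TG, or after GG consecutive generations in which
    the best individual stored in BtCh did not improve."""
    return generation >= TG or stagnant_generations >= GG
```

A caller would reset `stagnant_generations` to 0 whenever BtCh is replaced in step 4 and increase it by 1 otherwise.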

step 6: construct the elite population and update the probability model;

from the contemporary population, select in order from best to worst

individuals as the contemporary elite population POP_e, where N_e is the elite population size and r_e ∈ (0,1) is the elite rate;

the probability model is updated as follows:

where the marker value

indicates whether individual ch_n of that generation assigns the i-th scheduled task to the virtual machine instance numbered k; the marker value

indicates whether individual ch_n of that generation schedules task i at the i′-th position; the marker value

indicates whether the virtual machine instance numbered k has type j in individual ch_n of that generation;

are the update rates of the virtual machine allocation probability model, the task scheduling order probability model and the virtual machine type probability model, respectively;
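The update formulas are not reproduced in the text above. The sketch below shows the incremental learning rule that estimation of distribution algorithms commonly use, in which each probability moves toward the frequency observed in the elite population at a given update rate. Treating this as the form of the update is an assumption, and `update_row` is a hypothetical name.

```python
def update_row(old_probs, elite_genes, rate):
    """One ASSUMED update: new = (1 - rate) * old + rate * elite frequency.

    old_probs: {value: probability} for one gene position.
    elite_genes: the value each elite individual has at that position
                 (these play the role of the marker values).
    """
    n_e = len(elite_genes)
    return {
        value: (1 - rate) * p + rate * sum(g == value for g in elite_genes) / n_e
        for value, p in old_probs.items()
    }

# Three of four elite individuals use type 1 at this position.
new_probs = update_row({1: 0.5, 2: 0.5}, elite_genes=[1, 1, 2, 1], rate=0.2)
```

With this rule the probabilities at a position still sum to 1 after the update, provided the elite values all appear among the keys of `old_probs`.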

step 7: sample the current probability models PMS(g), PMA(g) and PMT(g) N times to generate N individuals, which form the new contemporary population; go to step 4;

step 8: if a feasible individual is stored in BtCh, output the corresponding execution scheme as the optimization result; otherwise, no feasible execution scheme exists.

Further, MBV and MDV are calculated as follows:

where

is the maximum execution time of t_i.

Further, in step A2, the individual's virtual machine allocation list {gs₁,…,gs_I} and virtual machine type list {gt₁,…,gt_I} are generated based on the benefit ratio through the following steps:

step A2.1: let the virtual machine instance set INS = ∅; let gs₁ = gs₂ = … = gs_I = 0 and gt₁ = gt₂ = … = gt_I = 0; let the ready times of all tasks rt₁ = rt₂ = … = rt_I = 0; let the variable ε = 1;

step A2.2: let the variable i = gr_ε, the variable K = |INS|, the variable k = 1 and the variable η = 1; calculate the comprehensive benefit ratio obtained when t_i is allocated to each potential virtual machine instance:

step A2.2.1: if k ≤ K, go to step A2.2.2; otherwise go to step A2.2.6;

step A2.2.2: calculate the execution time of t_i after it is allocated to the virtual machine instance numbered k

where

is the time needed to process t_i on the virtual machine instance numbered k,

is the processing power of the virtual machine instance numbered k;

is the file transfer time needed to obtain input files from other virtual machines when t_i is processed by the virtual machine instance numbered k,

k⁻ is the number of the virtual machine instance that processes

and

are the bandwidths of the virtual machine instances numbered k and k⁻;

is the file transfer time needed to obtain input files from the shared database when t_i is processed by the virtual machine instance numbered k,

step A2.2.3: in vatl_k, search from earliest to latest for an idle time period [ν_k, υ_k] satisfying υ_k − ν_k ≥ et_{i,k} and υ_k − et_{i,k} ≥ rt_i;

step A2.2.4: calculate the start time s_{i,k} = max{ν_k, rt_i} and the completion time f_{i,k} = s_{i,k} + et_{i,k} of t_i after it is allocated to the virtual machine instance numbered k;

step A2.2.5: calculate the comprehensive benefit ratio ξ_{i,k}:

where θ ∈ [0,1] is a weight coefficient, μ > 0 is the coordination coefficient between cost and time,

is the lease time of the virtual machine instance numbered k after t_i is allocated to it, lt′_k = Rnt′_k − Hrt′_k is the lease time of the virtual machine instance numbered k before t_i is allocated to it,

is the return time of the virtual machine instance numbered k before t_i is allocated to it, and

is the start-lease time of the virtual machine instance numbered k before t_i is allocated to it; let k = k + 1 and go to step A2.2.1;

step A2.2.6: if η ≤ J, go to step A2.2.7; otherwise go to step A2.3;

step A2.2.7: calculate the execution time of t_i after it is allocated to a new virtual machine instance of type η

where ω_{i,η} = len_i/ps_η is the time needed by a virtual machine instance of type η to process t_i;

is the file transfer time needed to obtain input files from other virtual machines when t_i is processed by the new virtual machine instance of type η,

k⁻ is the number of the virtual machine instance that processes

τ_{i,η} is the file transfer time needed to obtain input files from the shared database when t_i is processed by the new virtual machine instance of type η,

step A2.2.8: calculate the start time s_{i,K+η} = rt_i and the completion time f_{i,K+η} = s_{i,K+η} + et_{i,K+η} of t_i after it is allocated to the new virtual machine instance of type η;

step A2.2.9: calculate the comprehensive benefit ratio ξ_{i,K+η}:

where

is the lease time of the new virtual machine instance of type η after t_i is allocated to it; let η = η + 1 and go to step A2.2.6;

step A2.3: find the minimum among ξ_{i,1},…,ξ_{i,K+J} and denote it

if the subscript value

then let

otherwise let gs_ε = K + 1,

and add a virtual machine instance ins_{K+1} numbered K + 1, i.e. INS = INS ∪ {ins_{K+1}}, vatl_{K+1} = {[0, ∞]};

step A2.4: assign task i = gr_ε to the virtual machine instance k = gs_ε:

step A2.4.1: calculate the start time of the task

the end time

and the execution time et_i = f_i − s_i;

step A2.4.2: update the ready times of the subtasks of task i

step A2.4.3: in vatl_k, search from earliest to latest for an idle time period [ν_k, υ_k] satisfying υ_k − ν_k ≥ et_i and υ_k − et_i ≥ rt_i;

step A2.4.4: in vatl_k, delete [ν_k, υ_k] and insert those of the intervals [ν_k, s_i] and [f_i, υ_k] whose length is greater than 0;

step A2.5: let ε = ε + 1; if ε ≤ I, go to step A2.2; otherwise go to step A2.6;

step A2.6: let K = |INS|; if K < I, generate I − K random integers between 1 and J, denoted π_{K+1}, …, π_I, and let gt_{K+1} = π_{K+1}, …, gt_I = π_I;

step A2.7: obtain the individual {gr₁,…,gr_I; gs₁,…,gs_I; gt₁,…,gt_I} together with the execution time et_i and completion time f_i of every task, i = 1, 2, …, I, and end the operation.

Further, the workflow response time rs_n and the execution cost ct_n of individual ch_n are calculated as follows:

where

is the response time of task gr_i,

where

is the fixed lease-start cost of the virtual machine instance numbered k,

is the cost per unit time of the virtual machine instance numbered k,

is the minimum billing time unit of the virtual machine instance numbered k,

is the minimum lease time of the virtual machine instance numbered k,

is the bandwidth of the virtual machine instance numbered k; lt_k = Rnt_k − Hrt_k is the lease time of the virtual machine instance numbered k, Hrt_k is its start-lease time and Rnt_k is its return time;

is the maximum time needed to output the files of a completed task to the corresponding recipients: if the task outputs no files to the shared database, the recipients are the virtual machines that process its subtasks; if the task has no subtasks, the recipient is the shared database; otherwise, if the task both outputs files to the shared database and has subtasks, the recipients are the virtual machines that process its subtasks together with the shared database.

Further, for each individual ch_n in the population, n = 1, 2, …, N: if ct_n ≤ Budget ∧ rs_n ≤ Deadline, ch_n is a feasible individual; otherwise ch_n is an infeasible individual;

the specific calculation method of the relative fitness value of the infeasible individual is as follows:

the absolute fitness value of a feasible individual is calculated as follows: afit_n = θ×μ×ct_n + (1−θ)×rs_n;

where θ ∈ [0,1] is a weight coefficient and μ is the coordination coefficient between cost and time;

when individuals are compared, a feasible individual is better than an infeasible one; among feasible individuals, the smaller the absolute fitness value, the better the individual; among infeasible individuals, the smaller the relative fitness value, the better the individual.
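The comparison rule can be expressed as a sort key in which feasibility comes first. In this sketch the function names and the example population are hypothetical, an individual is assumed to be feasible only when it meets both the cost and the time constraint, and the relative fitness of an infeasible individual is passed in precomputed because its formula is not reproduced above.

```python
def absolute_fitness(ct, rs, theta, mu):
    """afit = theta * mu * ct + (1 - theta) * rs for a feasible individual."""
    return theta * mu * ct + (1 - theta) * rs

def is_feasible(ct, rs, budget, deadline):
    """ASSUMPTION: feasible means meeting both constraints."""
    return ct <= budget and rs <= deadline

def sort_key(ind, budget, deadline, theta, mu):
    """Smaller key = better individual.

    ind is a dict with 'ct' and 'rs' and, for infeasible individuals,
    a precomputed 'rfit' (relative fitness value).
    """
    if is_feasible(ind["ct"], ind["rs"], budget, deadline):
        return (0, absolute_fitness(ind["ct"], ind["rs"], theta, mu))
    return (1, ind["rfit"])

# Hypothetical population: two feasible individuals and one over budget.
population = [
    {"ct": 10, "rs": 50},
    {"ct": 8, "rs": 40},
    {"ct": 30, "rs": 20, "rfit": 0.5},
]
best = min(population, key=lambda ind: sort_key(ind, 20, 60, 0.5, 2.0))
```

Because the first element of the key separates feasible from infeasible individuals, absolute and relative fitness values are never compared with each other.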

Further, in step B3, selecting a task from RT by roulette according to [β_{1,q}(g) … β_{I,q}(g)]^T comprises the following steps:

step B3.1: calculate the probability that each t_i in RT is selected

step B3.2: calculate the cumulative probabilities:

step B3.3: generate a random number λ ∈ [0,1); if

then select t_i and end the task selection operation.

Further, in step B4, randomly selecting a virtual machine instance number in [1, MI] by roulette according to [α_{q,1}(g) … α_{q,I}(g)] comprises the following steps:

step B4.1: calculate the probability that each possible number is selected: A_k = α_{q,k}(g), k = 1,…,MI−1;

step B4.2: calculate the cumulative probabilities:

step B4.3: generate a random number λ ∈ [0,1); if

then select number k and end the virtual machine instance number selection operation.
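The cumulative-probability formulas and selection conditions are not reproduced above. The sketch below shows ordinary roulette-wheel selection, which picks the first index whose cumulative weight exceeds the random number, and applies it to the virtual machine type sampling of step B1. Normalising by the total weight is an assumption made so that the same helper also works on a restricted candidate set; `roulette`, `sample_vm_types` and the sample matrix are hypothetical names.

```python
import random
from itertools import accumulate

def roulette(probabilities, rng=random):
    """Return the 0-based index chosen by roulette-wheel selection.

    The weights need not sum to 1; they are normalised by their total.
    """
    cumulative = list(accumulate(probabilities))
    lam = rng.random() * cumulative[-1]       # lambda in [0, total)
    for index, bound in enumerate(cumulative):
        if lam < bound:
            return index
    return len(cumulative) - 1                # guard against rounding

def sample_vm_types(delta, rng=random):
    """Step B1 sketch: delta[k][j] is the probability that instance k+1
    has type j+1; returns the 1-based type list gt."""
    return [roulette(row, rng) + 1 for row in delta]

# Degenerate probabilities make the outcome deterministic.
gt = sample_vm_types([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
```

Steps B3 and B4 could reuse `roulette` by passing only the probabilities of the tasks currently in RT, or of the instance numbers currently allowed.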

The invention has the beneficial effects that:

(1) Compared with existing workflow execution or scheduling optimization methods in cloud computing environments, the invention considers resource configuration and task scheduling simultaneously and thus achieves integrated collaborative optimization of the two.

(2) Compared with heuristic methods, semi-intelligent computing methods combined with heuristics, and existing intelligent computing methods based on hierarchical coding, the invention adopts integer coding in which every execution scheme has a corresponding individual, so the search space is complete and global search is possible.

(3) Compared with common priority-based coding, the task scheduling order list uses integer coding based on topological sorting, which takes the timing relations between tasks into account; decoding is therefore simpler and more efficient, which improves the overall efficiency of the algorithm.

(4) Compared with general full-sequence unbiased coding, the virtual machine allocation list uses continuous biased coding in which instance numbers grow from small to large along the task scheduling order; this shrinks the coding search space while keeping it complete, further improving the overall efficiency of the algorithm.

(5) Compared with traditional non-insertion and parallel decoding methods, the insertion-mode serial individual decoding method, which schedules each task as early as possible, can generally find a better scheduling scheme for a given individual.

(6) Compared with common one-way decoding, the forward and backward individual decoding and improvement strategy FBI&D strengthens the neighborhood search capability around each individual, improving the optimization capability and search efficiency of the whole algorithm.

(7) The optimal individual storage strategy ensures that the best individual found is never destroyed, so the algorithm converges monotonically.

(8) Compared with traditional intelligent computing methods such as the genetic algorithm (GA), the algorithm generates new individuals by sampling rather than by genetic operators, which makes it more concise.

(9) Compared with traditional random initialization, the invention seeds the initial population with one individual generated by the level and benefit ratio method, so that the search starts from a better point, which improves search efficiency and shortens search time.

(10) The invention provides a new method of calculating individual fitness values and comparing individuals for optimizing workflow execution cost and response time in a cloud computing environment, so that individuals can be compared quickly and conveniently.

(11) The invention provides a general virtual machine rental charging model that suits the charging rules of the various IaaS platforms, such as Amazon EC2, Microsoft Azure, Google Cloud, Alibaba Cloud and Tencent Cloud.

Drawings

FIG. 1 is a schematic flow chart of a workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment according to the present invention.

FIG. 2 is a timing diagram of tasks of a Montage workflow according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to FIG. 1, FIG. 2 and an example, but the present invention is not limited to the example.

Suppose that a cloud computing service provider, i.e., a cloud computing environment, offers 5 virtual machine types numbered 1 to 5: vm₁, vm₂, vm₃, vm₄, vm₅. The computing power, bandwidth, unit time cost, fixed lease initiation cost, minimum billing time unit and minimum lease initiation time of each virtual machine type are shown in Table 1. The timing relationship among the tasks of a Montage workflow is shown in FIG. 2; the workflow consists of 15 tasks numbered 1 to 15: t₁, t₂, …, t₁₅. The execution length of each task, the input files required for processing, and the names and sizes of the output files produced are shown in Table 2.

TABLE 1

TABLE 2

For the above case, as shown in FIG. 1, a workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment includes the following implementation steps:

Step 1 is executed: acquire the information required for cloud workflow execution optimization;

Obtain the task set T = {t₁, t₂, t₃, t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅};

Obtain the timing relationships between tasks, namely the parent task set PR_i and the child task set SC_i of each task i:

PR₄＝{t₁}，PR₅＝{t₁,t₂}，PR₆＝{t₁,t₃}，PR₇＝{t₄,t₅,t₆}，PR₈＝{t₇}，PR₉＝{t₁,t₈}，PR₁₀＝{t₂,t₈}，PR₁₁＝{t₃,t₈}，PR₁₂＝{t₉,t₁₀,t₁₁}，PR₁₃＝{t₁₂}，PR₁₄＝{t₁₃}，PR₁₅＝{t₁₄}；

SC₁＝{t₄,t₅,t₆,t₉}，SC₂＝{t₅,t₁₀}，SC₃＝{t₆,t₁₁}，SC₄＝{t₇}，SC₅＝{t₇}，SC₆＝{t₇}，SC₇＝{t₈}，SC₈＝{t₉,t₁₀,t₁₁}，SC₉＝{t₁₂}，SC₁₀＝{t₁₂}，SC₁₁＝{t₁₂}，SC₁₂＝{t₁₃}，SC₁₃＝{t₁₄}，SC₁₄＝{t₁₅}，

Obtain the relevant parameters of the tasks: len₁ = 42×10⁷ MI, IFL₁ = {f_d1, f_d2}, OFL₁ = {f_1-1, f_1-2};

len₂＝36×10⁷MI，IFL₂＝{f_d1,f_d3}，OFL₂＝{f_2-1,f_2-2}；len₃＝63×10⁷MI，IFL₃＝{f_d1,f_d4}，OFL₃＝{f_3-1,f_3-2}；len₄＝46×10⁷MI，IFL₄＝{f_d1,f_1-1,f_1-2}，OFL₄＝{f_4-1,f_4-2}；……；

len₁₅＝48×10⁷MI，IFL₁₅＝{f_14-1}，OFL₁₅＝{f_15-1}；f_d1.size＝36000MB，f_d2.size＝43200MB，f_1-1.size＝39600MB，f_1-2.size＝39600MB，……，f_14-1.size＝356000MB，f_15-1.size＝42000MB；

Obtain the set of virtual machine types in the cloud computing environment: VM = {vm₁, vm₂, vm₃, vm₄, vm₅};

Obtain the relevant parameters of the virtual machine types:
ps₁ = 10000 MI/s, bw₁ = 200 Mbit/s, vc₁ = 0.5 yuan, fc₁ = 3 yuan, ut₁ = 600 s, ft₁ = 7200 s;
ps₂ = 20000 MI/s, bw₂ = 200 Mbit/s, vc₂ = 0.8 yuan, fc₂ = 4 yuan, ut₂ = 600 s, ft₂ = 7200 s;
ps₃ = 40000 MI/s, bw₃ = 300 Mbit/s, vc₃ = 1.4 yuan, fc₃ = 6 yuan, ut₃ = 600 s, ft₃ = 7200 s;
ps₄ = 40000 MI/s, bw₄ = 400 Mbit/s, vc₄ = 1.7 yuan, fc₄ = 7 yuan, ut₄ = 600 s, ft₄ = 7200 s;
ps₅ = 60000 MI/s, bw₅ = 400 Mbit/s, vc₅ = 2.1 yuan, fc₅ = 8 yuan, ut₅ = 600 s, ft₅ = 7200 s;

Obtain the cost constraint Budget and the time constraint Deadline of workflow execution in the cloud computing environment: Budget = 1100 yuan, Deadline = 140000 s;

Step 2 is executed: calculate the level value of each task;

Since task 1, task 2 and task 3 have no parent task, lvl₁ = lvl₂ = lvl₃ = 1;

Task 4 has only one parent task, task 1, so lvl₄ = lvl₁ + 1 = 2;

Similarly, the level values of the other tasks are obtained: lvl₅ = lvl₆ = 2; lvl₇ = 3; lvl₈ = 4; lvl₉ = lvl₁₀ = lvl₁₁ = 5; lvl₁₂ = 6; lvl₁₃ = 7; lvl₁₄ = 8; lvl₁₅ = 9.
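The level calculation of step 2 can be sketched as follows. This is a minimal illustration, assuming that the level of a task is one more than the largest level among its parent tasks (tasks without parents get level 1), which is consistent with the values listed above; the names `PARENTS` and `task_levels` are hypothetical.

```python
# Parent task sets PR_i from the example; tasks 1 to 3 have no parent task.
PARENTS = {
    1: [], 2: [], 3: [],
    4: [1], 5: [1, 2], 6: [1, 3], 7: [4, 5, 6], 8: [7],
    9: [1, 8], 10: [2, 8], 11: [3, 8], 12: [9, 10, 11],
    13: [12], 14: [13], 15: [14],
}


def task_levels(parents):
    """Return {task: level}; assumed rule: level = 1 + max level of the parents."""
    levels = {}

    def level_of(task):
        if task not in levels:
            levels[task] = 1 + max((level_of(p) for p in parents[task]), default=0)
        return levels[task]

    for task in parents:
        level_of(task)
    return levels


LEVELS = task_levels(PARENTS)
print(LEVELS)
```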

Step 3 is executed: initialize the contemporary population and let BtCh = Null;

Take the population size N = 10, the weight coefficient θ = 0.4, and the cost-time coordination coefficient μ = 250;

Generate 1 individual based on the hierarchy and benefit ratio, then sample the initial probability model 9 times to generate 9 individuals, forming the initial contemporary population;

The specific process of generating 1 individual based on the hierarchy and benefit ratio is as follows:

Step A1 is executed: arrange the tasks in ascending order of level value, ordering tasks with the same level value randomly. The tasks with level value 1 are randomly arranged as 1, 3, 2; the tasks with level value 2 are randomly arranged as 6, 5, 4; the only task with level value 3 is 7; the only task with level value 4 is 8; the tasks with level value 5 are randomly arranged as 9, 11, 10; the only task with level value 6 is 12; the only task with level value 7 is 13; the only task with level value 8 is 14; the only task with level value 9 is 15. Concatenating these arrangements in ascending order of level value forms the individual's task scheduling order list {1,3,2,6,5,4,7,8,9,11,10,12,13,14,15};
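Step A1 can be sketched as below: tasks are shuffled inside each level and the levels are concatenated in ascending order. The function name and the use of a seeded `random.Random` are illustrative assumptions; a different seed yields a different, equally valid list such as the {1,3,2,6,5,4,…} of the example.

```python
import random

# Level values of the 15 tasks, as obtained in step 2 of the example.
LEVELS = {1: 1, 2: 1, 3: 1, 4: 2, 5: 2, 6: 2, 7: 3, 8: 4,
          9: 5, 10: 5, 11: 5, 12: 6, 13: 7, 14: 8, 15: 9}


def level_ordered_schedule(levels, rng):
    """Shuffle the tasks inside each level, then join the levels in ascending order."""
    order = []
    for lvl in sorted(set(levels.values())):
        group = [task for task, value in levels.items() if value == lvl]
        rng.shuffle(group)
        order.extend(group)
    return order


schedule = level_ordered_schedule(LEVELS, random.Random(0))
print(schedule)
```

Because every parent has a strictly smaller level value than its children, any list built this way respects the timing relationships between tasks.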

Step A2 is executed: based on the benefit ratio, generate the individual's virtual machine allocation list and virtual machine type list, which are {1,2,3,2,1,3,1,1,1,4,4,4,4,4,4} and {5,5,5,5,3,4,5,4,2,2,1,4,3,2,3} respectively, and obtain the execution time and completion time of all tasks: et_i, f_i, i = 1, …, 15;

That is, step A2.1 is executed: let the virtual machine instance set INS = ∅; gs₁ = … = gs₁₅ = 0; gt₁ = … = gt₁₅ = 0; the ready times of all tasks rt₁ = … = rt₁₅ = 0; ε = 1;

Step A2.2 is executed: i = gr_ε = gr₁ = 1, K = |INS| = 0, k = 1, η = 1; calculate the comprehensive benefit ratio of t₁ after it is assigned to each potential virtual machine instance. That is, step A2.2.1 is executed: since k = 1 > K = 0, go to step A2.2.6. Step A2.2.6 is executed: since η = 1 ≤ J = 5, go to step A2.2.7. Step A2.2.7 is executed: calculate the execution time of t₁ after it is assigned to a new virtual machine instance of type 1: et_{1,1} = 45168;

Step A2.2.8 is executed: calculate the start time of t₁ after it is assigned to this new type-1 virtual machine instance, s_{1,1} = rt₁ = 0, and the completion time f_{1,1} = s_{1,1} + et_{1,1} = 0 + 45168 = 45168. Step A2.2.9 is executed: calculate the comprehensive benefit ratio ξ_{1,1}, where lt″₁ = 45168 + 8×(39600+39600)/200 − 0 = 48336; η = 1 + 1 = 2, go to step A2.2.6;

Step A2.2.6 is executed: since η = 2 ≤ J = 5, go to step A2.2.7. Step A2.2.7 is executed: calculate the execution time of t₁ after it is assigned to a new virtual machine instance of type 2: et_{1,2} = 24168;

Step A2.2.8 is executed: calculate the start time of t₁ after it is assigned to this new type-2 virtual machine instance, s_{1,2} = rt₁ = 0, and the completion time f_{1,2} = s_{1,2} + et_{1,2} = 24168. Step A2.2.9 is executed: calculate the comprehensive benefit ratio ξ_{1,2} = 20481.6; η = 2 + 1 = 3, go to step A2.2.6; ……; steps A2.2.6 to A2.2.9 are repeated until η = 6 > J = 5, giving the comprehensive benefit ratios ξ_{1,3} = 12934.4, ξ_{1,4} = 12810.8, ξ_{1,5} = 10470.8;

Step A2.3 is executed: find the minimum among ξ_{1,1}, …, ξ_{1,5}, which is ξ_{1,5}; since δ = 5 > K = 0, gs₁ = 0 + 1 = 1, gt₁ = 5 − 0 = 5; add a virtual machine instance ins₁ numbered 1, i.e., INS = INS ∪ {ins₁} = {ins₁}, vatl₁ = {[0,∞]};
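The figures of steps A2.2.7 to A2.3 for t₁ can be reproduced with the sketch below. It rests on assumptions inferred from the numbers in this example rather than on formulas stated here: the execution time is taken as len/ps plus the input transfer time, the lease time additionally covers the output transfer, the rental cost is fc + vc × ⌈max(lt, ft)/ut⌉, and the comprehensive benefit ratio of a new instance is θ·μ·cost + (1 − θ)·lt. All function names are hypothetical.

```python
import math

# Per-type parameters from the example: (ps MI/s, bw Mbit/s, vc yuan, fc yuan, ut s, ft s)
VM_TYPES = {
    1: (10000, 200, 0.5, 3, 600, 7200),
    2: (20000, 200, 0.8, 4, 600, 7200),
    3: (40000, 300, 1.4, 6, 600, 7200),
    4: (40000, 400, 1.7, 7, 600, 7200),
    5: (60000, 400, 2.1, 8, 600, 7200),
}
THETA, MU = 0.4, 250  # weight and coordination coefficients of the example


def exec_time(length_mi, input_mb, vm_type):
    """Assumed model: computation time plus time to transfer the input files."""
    ps, bw = VM_TYPES[vm_type][:2]
    return length_mi / ps + 8 * input_mb / bw


def rental_cost(lease_time, vm_type):
    """Assumed charging rule: fixed cost plus unit cost per started billing unit."""
    _, _, vc, fc, ut, ft = VM_TYPES[vm_type]
    return fc + vc * math.ceil(max(lease_time, ft) / ut)


def benefit_ratio_new_instance(length_mi, input_mb, output_mb, ready, vm_type):
    """Comprehensive benefit ratio of placing a task on a fresh instance (assumed form)."""
    bw = VM_TYPES[vm_type][1]
    finish = ready + exec_time(length_mi, input_mb, vm_type)
    lease = finish + 8 * output_mb / bw - ready
    return THETA * MU * rental_cost(lease, vm_type) + (1 - THETA) * lease


# Task t1: len = 42e7 MI, inputs f_d1 + f_d2, outputs f_1-1 + f_1-2 (sizes in MB).
T1 = (42 * 10**7, 36000 + 43200, 39600 + 39600, 0)
ratios = {k: benefit_ratio_new_instance(*T1, k) for k in VM_TYPES}
best_type = min(ratios, key=ratios.get)
print(ratios, best_type)
```

Under these assumptions the smallest ratio is obtained for type 5, matching the choice gt₁ = 5 made in step A2.3.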

Step A2.4 is executed: assign task 1 to virtual machine instance k = gs₁ = 1. Step A2.4.1 is executed: calculate the task's start time s₁ = s_{1,5} = 0, end time f₁ = f_{1,5} = 8584, and execution time et₁ = f₁ − s₁ = 8584. Step A2.4.2 is executed: update the ready times of the child tasks of task 1: rt₄ = max{rt₄, f₁} = max{0, 8584} = 8584, rt₅ = 8584, rt₆ = 8584, rt₉ = 8584. Step A2.4.3 is executed: searching vatl₁ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ et₁ = 8584 and ∞ − 8584 ≥ rt₁ = 0. Step A2.4.4 is executed: delete [0,∞] from vatl₁ and insert the remaining interval of length greater than 0, [8584,∞], so that vatl₁ = {[8584,∞]};
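The insertion-mode bookkeeping of steps A2.4.3 and A2.4.4 can be sketched as follows. The available-period list is modelled as a sorted list of (start, end) tuples with `math.inf` for ∞; the function name `insert_task` is hypothetical. A period [a, b] is accepted when b − a ≥ et and b − et ≥ rt, which is the same as requiring max(a, rt) + et ≤ b.

```python
import math


def insert_task(vatl, ready, duration):
    """Place a task in the earliest idle period that can hold it.

    vatl is a sorted list of (start, end) idle periods.
    Returns (start, finish, updated_vatl).
    """
    for idx, (a, b) in enumerate(vatl):
        start = max(a, ready)
        if start + duration <= b:
            finish = start + duration
            # Keep only the leftover pieces whose length is greater than 0.
            leftovers = [p for p in ((a, start), (finish, b)) if p[1] - p[0] > 0]
            return start, finish, vatl[:idx] + leftovers + vatl[idx + 1:]
    raise ValueError("no idle period can hold the task")


# Task 1 on instance 1: et = 8584, ready time 0, initial list {[0, inf]}.
s1, f1, vatl1 = insert_task([(0, math.inf)], ready=0, duration=8584)
print(s1, f1, vatl1)
```

Because the search runs from the earliest period onward, a later task can be inserted into a gap left before an already scheduled task, which is what distinguishes the insertion mode from simply appending at the end.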

Step A2.5 is executed: ε = 1 + 1 = 2; since ε = 2 ≤ I = 15, go to step A2.2;

Step A2.2 is executed: i = gr₂ = 3, K = |INS| = 1, k = 1, η = 1; calculate the comprehensive benefit ratio of t₃ after it is assigned to each potential virtual machine instance. That is, step A2.2.1 is executed: since k = 1 ≤ K = 1, go to step A2.2.2. Step A2.2.2 is executed: calculate the execution time of t₃ after it is assigned to virtual machine instance number 1: et_{3,1} = 12084;

Step A2.2.3 is executed: searching vatl₁ from earliest to latest, find an idle period [8584,∞] that satisfies ∞ − 8584 ≥ et_{3,1} = 12084 and ∞ − 12084 ≥ rt₃ = 0. Step A2.2.4 is executed: calculate the start time of t₃ after it is assigned to virtual machine instance number 1, s_{3,1} = max{8584, 0} = 8584, and the completion time f_{3,1} = s_{3,1} + et_{3,1} = 20668. Step A2.2.5 is executed: calculate the comprehensive benefit ratio: Hrt′₁ = min{s₁} = 0, Rnt′₁ = max{8584 + 8×(39600+39600)/400} = 10168; lt′₁ = Rnt′₁ − Hrt′₁ = 10168;

This gives: ξ_{3,1} = … + 0.6 × (20668 + 8×(38400+38400)/400 − 0) = 4410 + 13322.4 = 17732.4; k = 1 + 1 = 2, go to step A2.2.1;

Step A2.2.1 is executed: since k = 2 > K = 1, go to step A2.2.6. Step A2.2.6 is executed: since η = 1 ≤ J = 5, go to step A2.2.7. Step A2.2.7 is executed: calculate the execution time of t₃ after it is assigned to a new virtual machine instance of type 1: et_{3,2} = 66168;

Step A2.2.8 is executed: calculate the start time of t₃ after it is assigned to this new type-1 virtual machine instance, s_{3,2} = rt₃ = 0, and the completion time f_{3,2} = s_{3,2} + et_{3,2} = 66168. Step A2.2.9 is executed: calculate the comprehensive benefit ratio ξ_{3,2} = 47644; η = 1 + 1 = 2, go to step A2.2.6; ……; steps A2.2.6 to A2.2.9 are repeated, giving the comprehensive benefit ratios ξ_{3,3} = 28084, ξ_{3,4} = 17306, ξ_{3,5} = 17462, ξ_{3,6} = 13802;

Step A2.3 is executed: find the minimum among ξ_{3,1}, …, ξ_{3,6}, which is ξ_{3,6}; since δ = 6 > K = 1, gs₂ = 1 + 1 = 2, gt₂ = 6 − 1 = 5; add a virtual machine instance ins₂ numbered 2, i.e., INS = INS ∪ {ins₂} = {ins₁, ins₂}, vatl₂ = {[0,∞]};

Step A2.4 is executed: assign task 3 to virtual machine instance k = gs₂ = 2. Step A2.4.1 is executed: calculate the task's start time s₃ = s_{3,6} = 0, end time f₃ = f_{3,6} = 12084, and execution time et₃ = 12084. Step A2.4.2 is executed: update the ready times of the child tasks of task 3: rt₆ = 12084, rt₁₁ = 12084. Step A2.4.3 is executed: searching vatl₂ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ 12084 and ∞ − 12084 ≥ 0. Step A2.4.4 is executed: delete [0,∞] from vatl₂ and insert the remaining interval of length greater than 0, [12084,∞];

Step A2.5 is executed: ε = 2 + 1 = 3; since ε = 3 ≤ I = 15, go to step A2.2;

…

Steps A2.2 to A2.5 are executed repeatedly in this way until ε = 16 > I = 15, at which point INS = {ins₁, ins₂, ins₃, ins₄}, and the virtual machine allocation list and the virtual machine type list are {1,2,3,2,1,3,1,1,1,4,4,4,4,4,4} and {5,5,5,5,0,0,0,0,0,0,0,0,0,0,0} respectively; go to step A2.6;

Step A2.6 is executed: since K = |INS| = 4 < I = 15, generate 11 random integers between 1 and 5, π₅, π₆, …, π₁₅, which are 3, 4, 5, 4, 2, 2, 1, 4, 3, 2, 3; let gt₅ = π₅ = 3, …, gt₁₅ = π₁₅ = 3;

Step A2.7 is executed: obtain an individual {1,3,2,6,5,4,7,8,9,11,10,12,13,14,15; 1,2,3,2,1,3,1,1,1,4,4,4,4,4,4; 5,5,5,5,3,4,5,4,2,2,1,4,3,2,3} and the execution times and completion times of all its tasks: et₁ = 8584, et₂ = 7584, et₃ = 12084, et₄ = 9970.67, et₅ = 10818.67, et₆ = 10870.67, et₇ = 16186.67, et₈ = 10626.67, et₉ = 13566.67, et₁₀ = 7445.33, et₁₁ = 8816, et₁₂ = 24866.67, et₁₃ = 8720, et₁₄ = 11666.67, et₁₅ = 8000; f₁ = 8584, f₂ = 7584, f₃ = 12084, f₄ = 18554.67, f₅ = 19402.67, f₆ = 22954.67, f₇ = 39141.33, f₈ = 49768, f₉ = 63334.67, f₁₀ = 66029.33, f₁₁ = 58584, f₁₂ = 90896, f₁₃ = 99616, f₁₄ = 111282.67, f₁₅ = 119282.67; the operation ends.

Step A3 is executed: output the individual ch₁ = {1,3,2,6,5,4,7,8,9,11,10,12,13,14,15; 1,2,3,2,1,3,1,1,1,4,4,4,4,4,4; 5,5,5,5,3,4,5,4,2,2,1,4,3,2,3} and the execution times and completion times of all its tasks: et₁ = 8584, et₂ = 7584, et₃ = 12084, et₄ = 9970.67, et₅ = 10818.67, et₆ = 10870.67, et₇ = 16186.67, et₈ = 10626.67, et₉ = 13566.67, et₁₀ = 7445.33, et₁₁ = 8816, et₁₂ = 24866.67, et₁₃ = 8720, et₁₄ = 11666.67, et₁₅ = 8000; f₁ = 8584, f₂ = 7584, f₃ = 12084, f₄ = 18554.67, f₅ = 19402.67, f₆ = 22954.67, f₇ = 39141.33, f₈ = 49768, f₉ = 63334.67, f₁₀ = 66029.33, f₁₁ = 58584, f₁₂ = 90896, f₁₃ = 99616, f₁₄ = 111282.67, f₁₅ = 119282.67; calculate the workflow response time: with SFL₁₅ = {f_15-1}, this gives rs₁ = 120122.67;

The operation ends;
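The response time of ch₁ can be checked with the short sketch below. It assumes, based on the figures of this example, that the response time equals the completion time of the exit task t₁₅ plus the time needed to transfer its output file f_15-1 (42000 MB) over the bandwidth of the instance that ran it; in ch₁ task 15 runs on instance 4, whose type is 5 (bw₅ = 400 Mbit/s). The function name is hypothetical.

```python
def response_time(finish_time, output_sizes_mb, bandwidth_mbit_s):
    """Completion time of the exit task plus the transfer time of its output files."""
    return finish_time + 8 * sum(output_sizes_mb) / bandwidth_mbit_s


# ch1: f15 = 119282.67 s, output f_15-1 of 42000 MB, type-5 instance at 400 Mbit/s.
rs1 = response_time(119282.67, [42000], 400)
print(round(rs1, 2))
```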

The task scheduling order probability model PMS(1) is initialized as follows:

From the timing relationships among tasks: t₁ has no ancestor task, and its descendant tasks are t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅, so ξ₁ = 0, ζ₁ = 12; similarly, ξ₂ = 0, ζ₂ = 10; ξ₃ = 0, ζ₃ = 10; ξ₄ = 1, ζ₄ = 9; ξ₅ = 2, ζ₅ = 9; ξ₆ = 2, ζ₆ = 9; ξ₇ = 6, ζ₇ = 8; ξ₈ = 7, ζ₈ = 7; ξ₉ = 8, ζ₉ = 4; ξ₁₀ = 8, ζ₁₀ = 4; ξ₁₁ = 8, ζ₁₁ = 4; ξ₁₂ = 11, ζ₁₂ = 3; ξ₁₃ = 12, ζ₁₃ = 2; ξ₁₄ = 13, ζ₁₄ = 1; ξ₁₅ = 14, ζ₁₅ = 0;

From ξ_i and ζ_i, STS₁ = {t₁, t₂, t₃} is obtained, so β_{1,1}(1) = γ_{1,1}/|STS₁| = 1/3 = 0.33, β_{2,1}(1) = γ_{2,1}/|STS₁| = 1/3 = 0.33, β_{3,1}(1) = 1/3 = 0.33, β_{4,1}(1) = γ_{4,1}/|STS₁| = 0/3 = 0.00, β_{5,1}(1) = 0/3, ……;

From ξ_i and ζ_i, STS₂ = {t₁, t₂, t₃, t₄} is obtained, so β_{1,2}(1) = γ_{1,2}/|STS₂| = 1/4 = 0.25, β_{2,2}(1) = γ_{2,2}/|STS₂| = 1/4 = 0.25, β_{3,2}(1) = 0.25, β_{4,2}(1) = 0.25, β_{5,2}(1) = 0.00, β_{6,2}(1) = 0.00, ……;

From ξ_i and ζ_i, STS₃ = {t₁, t₂, t₃, t₄, t₅, t₆} is obtained, so β_{1,3}(1) = γ_{1,3}/|STS₃| = 1/6 = 0.17, β_{2,3}(1) = 1/6 = 0.17, β_{3,3}(1) = 1/6 = 0.17, β_{4,3}(1) = 0.17, β_{5,3}(1) = 0.17, β_{6,3}(1) = 0.17, β_{7,3}(1) = 0.00, β_{8,3}(1) = 0.00, ……;

Similarly, the remaining β_{i,i′}(1), i = 1, …, 15, i′ = 4, …, 15, can be obtained, which completes the model PMS(1).
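The initialization of PMS(1) can be sketched as follows. The ancestor counts ξ_i and descendant counts ζ_i are computed from the parent sets; the membership rule for STS_{i′}, namely that task i may occupy position i′ when ξ_i < i′ ≤ I − ζ_i, is an assumption that reproduces STS₁, STS₂ and STS₃ above. All names are hypothetical.

```python
PARENTS = {
    1: [], 2: [], 3: [],
    4: [1], 5: [1, 2], 6: [1, 3], 7: [4, 5, 6], 8: [7],
    9: [1, 8], 10: [2, 8], 11: [3, 8], 12: [9, 10, 11],
    13: [12], 14: [13], 15: [14],
}
N_TASKS = len(PARENTS)
CHILDREN = {t: [c for c in PARENTS if t in PARENTS[c]] for t in PARENTS}


def closure(direct):
    """For each task, collect every task reachable through the `direct` relation."""
    memo = {}

    def reach(task):
        if task not in memo:
            found = set()
            for nxt in direct[task]:
                found |= {nxt} | reach(nxt)
            memo[task] = found
        return memo[task]

    return {task: reach(task) for task in direct}


xi = {t: len(s) for t, s in closure(PARENTS).items()}     # number of ancestors
zeta = {t: len(s) for t, s in closure(CHILDREN).items()}  # number of descendants


def sts(position):
    """Tasks that may be scheduled at the given position (assumed rule)."""
    return [t for t in PARENTS if xi[t] < position <= N_TASKS - zeta[t]]


def beta_column(position):
    """Initial probabilities beta_{i,position}(1): uniform over STS_position."""
    candidates = sts(position)
    return {t: (1 / len(candidates) if t in candidates else 0.0) for t in PARENTS}


print(sts(1), sts(2), sts(3))
print(beta_column(3))
```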

The virtual machine assignment probability model PMA(1) is initialized as follows:

The virtual machine type probability model PMT(1) is initialized as follows:

δ_{1,1}(1) = 1/5 = 0.2, δ_{1,2}(1) = 1/5 = 0.2, …, δ_{1,5}(1) = 1/5 = 0.2;

δ_{2,1}(1) = 1/5 = 0.2, δ_{2,2}(1) = 1/5 = 0.2, …, δ_{2,5}(1) = 1/5 = 0.2;

The remaining δ_{i,k}(1), i = 3, …, 15, k = 1, …, 5, are obtained in the same way; each equals 1/5 = 0.2.

The specific process of generating 1 individual by sampling the probability models PMS(1), PMA(1) and PMT(1) once is as follows:

Step B1 is executed: sample the virtual machine types. That is, step B1.1 is executed: let k = 1. Step B1.2 is executed: obtain the probabilities that the type of the virtual machine instance numbered k = 1 is 1, 2, 3, 4, 5 respectively: A_{1,1} = δ_{1,1}(1) = 0.2, A_{1,2} = 0.2, A_{1,3} = 0.2, A_{1,4} = 0.2, A_{1,5} = 0.2; calculate the cumulative probabilities, which are 0.2, 0.4, 0.6, 0.8 and 1.0;

Step B1.3 is executed: generate 1 random number, λ = 0.69; since λ lies between the cumulative probabilities 0.6 and 0.8, type 4 is selected, i.e., let gt₁ = 4. Step B1.4 is executed: k = 1 + 1 = 2; since k = 2 ≤ I = 15, go to step B1.2. Step B1.2 is executed: obtain the probabilities that the type of the virtual machine instance numbered k = 2 is 1, 2, 3, 4, 5 respectively: A_{2,1} = δ_{2,1}(1) = 0.2, A_{2,2} = 0.2, A_{2,3} = 0.2, A_{2,4} = 0.2, A_{2,5} = 0.2; calculate the cumulative probabilities, which are again 0.2, 0.4, 0.6, 0.8 and 1.0;

Step B1.3 is executed: generate 1 random number, λ = 0.56; since λ lies between the cumulative probabilities 0.4 and 0.6, type 3 is selected, i.e., let gt₂ = 3. Step B1.4 is executed: k = 2 + 1 = 3; since k = 3 ≤ I = 15, go to step B1.2; ……; steps B1.2 to B1.4 are repeated in this way until k = 16 > I = 15, obtaining the virtual machine type list {4,3,1,3,5,4,5,1,3,5,1,1,5,3,3}; go to step B2;
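The roulette selection used in step B1.3 can be sketched as below: the cumulative probabilities are scanned in order and the first one that reaches the random number λ determines the choice. Whether the boundary comparison is strict or not is an assumption; it does not affect the two draws of the example. The function name is hypothetical.

```python
from itertools import accumulate


def roulette(probabilities, lam):
    """Return the 1-based index of the first cumulative probability that reaches lam."""
    for index, cumulative in enumerate(accumulate(probabilities), start=1):
        if lam <= cumulative:
            return index
    return len(probabilities)  # guard against rounding when lam is very close to 1


type_of_instance_1 = roulette([0.2] * 5, 0.69)
type_of_instance_2 = roulette([0.2] * 5, 0.56)
print(type_of_instance_1, type_of_instance_2)
```

The same routine applies to any discrete distribution, so it can also serve for drawing a task from the ready set or an instance number, provided the probabilities passed in are normalized first.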

Step B2 is executed: initialize the system state. That is, step B2.1 is executed: let the available-period lists of all virtual machines be vatl₁ = {[0,∞]}, vatl₂ = {[0,∞]}, ……, vatl₁₅ = {[0,∞]}. Step B2.2 is executed: let the ready times of all tasks be rt₁ = 0, rt₂ = 0, ……, rt₁₅ = 0;

P(t₄)＝{t₁}，P(t₅)＝{t₁,t₂}，P(t₆)＝{t₁,t₃}，P(t₇)＝{t₄,t₅,t₆}，P(t₈)＝{t₇}，P(t₉)＝{t₁,t₈}，P(t₁₀)＝{t₂,t₈}，P(t₁₁)＝{t₃,t₈}，P(t₁₂)＝{t₉,t₁₀,t₁₁}，P(t₁₃)＝{t₁₂}，P(t₁₄)＝{t₁₃}，P(t₁₅)＝{t₁₄}；

UT = T = {t₁, t₂, t₃, t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅}. Step B2.3 is executed: move t₁, t₂ and t₃, the tasks in UT whose parent sets are empty, to RT, so that RT = {t₁, t₂, t₃}, UT = {t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅}; q = 1, MI = 1;

Step B3 is executed: according to [β_{1,1}(1) … β_{15,1}(1)]^T, use roulette selection to randomly take one task out of RT = {t₁, t₂, t₃}. That is, step B3.1 is executed: calculate the probability of each task in RT being selected; the probability of t₁ being selected is A₁ = 0.33, and similarly A₂ = 0.33, A₃ = 0.33. Step B3.2 is executed: calculate the cumulative probabilities. Step B3.3 is executed: generate 1 random number, λ = 0.84; since λ exceeds the cumulative probability of t₁ and t₂ (about 0.67) and does not exceed that of t₁, t₂ and t₃, t₃ is selected and the task selection operation ends; let gr₁ = 3;

Step B4 is executed: according to [α_{1,1}(1) … α_{1,15}(1)], use roulette selection to randomly select a virtual machine instance number in [1, 1], which is k = 1; let gs_q = gs₁ = k = 1; since k = 1 = MI, MI = MI + 1 = 2;

Step B5 is executed: assign t₃ to the virtual machine instance numbered 1. That is, step B5.1 is executed: calculate the execution time of t₃, et₃ = 17334. Step B5.2 is executed: searching vatl₁ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ et₃ = 17334 and ∞ − 17334 ≥ rt₃ = 0. Step B5.3 is executed: the start time of t₃ is s₃ = max{0, 0} = 0 and its end time is f₃ = s₃ + et₃ = 17334. Step B5.4 is executed: update the ready times of the child tasks of t₃: rt₆ = max{rt₆, f₃} = 17334, rt₁₁ = 17334. Step B5.5 is executed: delete [0,∞] from the available-period list vatl₁ and insert the remaining interval of length greater than 0, [17334,∞]. Step B5.6 is executed: delete t₃ from P(t₆) and P(t₁₁), so that P(t₆) = {t₁}, P(t₁₁) = {t₈}; delete t₃ from RT, so that RT = {t₁, t₂}. Step B5.7 is executed: since UT = {t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅} contains no task whose parent set is empty, RT and UT are unchanged;

Step B6 is executed: since RT = {t₁, t₂} is not empty, q = 1 + 1 = 2; go to step B3;

Step B3 is executed: according to [β_{1,2}(1) … β_{15,2}(1)]^T, use roulette selection to randomly take one task out of RT = {t₁, t₂}, which is t₂; let gr₂ = 2;

Step B4 is executed: according to [α_{2,1}(1) … α_{2,15}(1)], use roulette selection to randomly select a virtual machine instance number in [1, 2]. That is, step B4.1 is executed: calculate the probability of each possible number being selected: A₁ = α_{2,1}(1) = 0.07, A₂ = 1 − A₁ = 0.93. Step B4.2 is executed: calculate the cumulative probabilities, which are 0.07 and 1. Step B4.3 is executed: generate 1 random number, λ = 0.24; since λ lies between the cumulative probabilities 0.07 and 1, number 2 is selected and the virtual machine instance number selection operation ends; let gs₂ = 2; since 2 = MI, MI = MI + 1 = 3;

Step B5 is executed: assign t₂ to the virtual machine instance numbered 2. That is, step B5.1 is executed: calculate the execution time of t₂, et₂ = 11112. Step B5.2 is executed: searching vatl₂ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ et₂ = 11112 and ∞ − 11112 ≥ rt₂ = 0. Step B5.3 is executed: the start time of t₂ is s₂ = 0 and its end time is f₂ = s₂ + et₂ = 11112. Step B5.4 is executed: update the ready times of the child tasks of t₂: rt₅ = 11112, rt₁₀ = 11112. Step B5.5 is executed: delete [0,∞] from the available-period list vatl₂ and insert the remaining interval of length greater than 0, [11112,∞]. Step B5.6 is executed: delete t₂ from P(t₅) and P(t₁₀); delete t₂ from RT, so that RT = {t₁}. Step B5.7 is executed: since UT = {t₄, t₅, t₆, t₇, t₈, t₉, t₁₀, t₁₁, t₁₂, t₁₃, t₁₄, t₁₅} contains no task whose parent set is empty, RT and UT are unchanged;

Step B6 is executed: since RT = {t₁} is not empty, q = 2 + 1 = 3; go to step B3;

……

the steps B3 to B6 are repeatedly executed until RT is empty, and then the step B7 is executed;

Step B7 is executed: obtain an individual ch₂ = {3,2,1,5,4,6,7,8,11,10,9,12,13,14,15; 1,2,3,4,5,5,2,2,4,1,2,4,2,5,1; 4,3,1,3,5,4,5,1,3,5,1,1,5,3,3} and the execution times and completion times of all its tasks: et₁ = 45168, et₂ = 11112, et₃ = 17334, et₄ = 11554.67, et₅ = 19004, et₆ = 13990.67, et₇ = 36160, et₈ = 15780, et₉ = 23518, et₁₀ = 10816, et₁₁ = 12888, et₁₂ = 43140, et₁₃ = 15520, et₁₄ = 30866.67, et₁₅ = 19120; f₁ = 45168, f₂ = 11112, f₃ = 17334, f₄ = 56722.67, f₅ = 64172, f₆ = 70713.33, f₇ = 106873.33, f₈ = 122653.33, f₉ = 146171.33, f₁₀ = 133469.33, f₁₁ = 135541.33, f₁₂ = 189311.33, f₁₃ = 204831.33, f₁₄ = 235698, f₁₅ = 254818; calculate its workflow response time rs₂ = 255658; the operation ends;

Similarly, the other individuals in the population are generated by sampling the initial probability model:

ch₃＝{3,1,4,2,5,6,7,8,10,9,11,12,13,14,15；1,1,1,1,2,2,2,3,2,4,3,4,5,2,5；4,2,4,2,2,4,3,3,5,1,2,5,3,1,4}

ch₄＝{2,3,1,6,4,5,7,8,10,11,9,12,13,14,15；1,2,3,1,4,5,6,7,8,9,9,2,4,1,10；5,5,2,1,5,3,4,1,4,2,5,3,1,2,1}

ch₅＝{1,2,3,5,4,6,7,8,9,11,10,12,13,14,15；1,2,3,2,4,5,3,6,1,7,2,2,8,3,6；4,4,2,4,5,2,3,5,4,2,4,4,1,5,1}

ch₆＝{1,2,4,3,5,6,7,8,9,10,11,12,13,14,15；1,1,1,1,2,2,2,1,3,4,3,5,2,6,2；5,4,2,1,3,3,5,3,2,4,5,4,1,1,3}

ch₇＝{1,2,5,4,3,6,7,8,10,9,11,12,13,14,15；1,2,3,4,5,6,3,7,6,8,8,9,2,9,2；2,3,4,3,4,2,2,4,3,1,1,5,2,2,2}

ch₈＝{2,3,1,6,4,5,7,8,10,9,11,12,13,14,15；1,2,3,4,5,6,7,8,9,10,4,5,5,2,7；4,3,5,2,4,3,5,2,2,2,5,3,2,1,4}

ch₉＝{3,2,1,5,4,6,7,8,9,11,10,12,13,14,15；1,2,3,4,2,5,3,5,6,7,7,1,6,8,8；4,4,2,4,1,3,1,3,3,4,1,2,5,1,3}

ch₁₀＝{3,1,4,2,6,5,7,8,10,11,9,12,13,14,15；1,2,3,4,5,2,3,6,5,7,8,9,10,7,6；5,5,2,1,5,4,4,2,5,1,2,4,4,2,1}

Their workflow response times are respectively: rs₃ = 419882, rs₄ = 341793.33, rs₅ = 321114, rs₆ = 293148.67, rs₇ = 284975.33, rs₈ = 296870, rs₉ = 414570.67, rs₁₀ = 340553.33;

Step 4 is executed: decode and improve each individual in the contemporary population using FBI&D to obtain its workflow execution cost and response time, then calculate the relative fitness values of all infeasible individuals and the absolute fitness values of the feasible individuals; if BtCh = Null or the best individual in the contemporary population is better than the individual stored in BtCh, replace the content stored in BtCh with that best individual;

The individuals in the contemporary population are improved with the FBI&D method. Take, for example, ch₃ = {3,1,4,2,5,6,7,8,10,9,11,12,13,14,15; 1,1,1,1,2,2,2,3,2,4,3,4,5,2,5; 4,2,4,2,2,4,3,3,5,1,2,5,3,1,4} in the generation-1 population.

The execution times of all its tasks were obtained during the sampling process: et₁ = 12084, et₂ = 10584, et₃ = 17334, et₄ = 12220, et₅ = 33272, et₆ = 33380, et₇ = 34880, et₈ = 17860, et₉ = 44828, et₁₀ = 20224, et₁₁ = 11736, et₁₂ = 70440, et₁₃ = 29280, et₁₄ = 63800, et₁₅ = 38240; as were the completion times: f₁ = 29418, f₂ = 52222, f₃ = 17334, f₄ = 41638, f₅ = 85494, f₆ = 118874, f₇ = 153754, f₈ = 171614, f₉ = 216442, f₁₀ = 191838, f₁₁ = 183350, f₁₂ = 286882, f₁₃ = 316162, f₁₄ = 379962, f₁₅ = 418202. The improvement process using the FBI&D method is as follows:

Step C1 is executed: form the reverse individual.

That is, step C1.1 is executed: rearrange the task scheduling order list {3,1,4,2,5,6,7,8,10,9,11,12,13,14,15} in descending order of task completion time f_i, i.e., set the i-th gene value in the task scheduling order list to the number of the i-th-from-last completed task, i = 1, …, 15; this forms {15,14,13,12,9,10,11,8,7,6,5,2,4,1,3};
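Step C1.1 amounts to sorting the task numbers by completion time in descending order, as the following sketch shows with the completion times of ch₃; the variable names are hypothetical.

```python
# Completion times f_i of ch3 obtained during sampling.
FINISH = {1: 29418, 2: 52222, 3: 17334, 4: 41638, 5: 85494, 6: 118874,
          7: 153754, 8: 171614, 9: 216442, 10: 191838, 11: 183350,
          12: 286882, 13: 316162, 14: 379962, 15: 418202}

# The task that finished last comes first in the reverse scheduling order.
reverse_order = sorted(FINISH, key=FINISH.get, reverse=True)
print(reverse_order)
```

Since a parent task always completes before its children start, the resulting list places every child ahead of its parents, which is the ordering the reverse decoding requires.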

Step C1.2 is executed: to preserve the original resource configuration scheme and the validity of the encoding, adjust the virtual machine instance list and the virtual machine type list. Step C1.2.1 is executed: ε = 1; δ = 1; flg₁ = … = flg₁₅ = 0; k = 6, …, 15. Step C1.2.2 is executed: since flg_ε = flg₁ = 0, go to step C1.2.3. Step C1.2.3 is executed: find the scheduling position of task 15 in {gr₁, …, gr₁₅}, which is 15; in ch₃, find the set of task numbers that use virtual machine instance gs₁₅ = 5, ST = {13, 15}; in the reverse task scheduling order list, find the set of scheduling positions of the tasks in ST, SI = {1, 3}. Step C1.2.4 is executed: assign the instance number δ = 1 to positions 1 and 3 of the reverse individual and let flg₁ = 1, flg₃ = 1; δ = 1 + 1 = 2. Step C1.2.5 is executed: ε = 1 + 1 = 2; since ε = 2 ≤ I = 15, go to step C1.2.2.

Step C1.2.2 is executed: since flg_ε = flg₂ = 0, go to step C1.2.3. Step C1.2.3 is executed: find the scheduling position of task 14 in {gr₁, …, gr₁₅}, which is 14; in ch₃, find the set of task numbers that use virtual machine instance gs₁₄ = 2, ST = {5, 6, 7, 10, 14}; in the reverse task scheduling order list, find the set of scheduling positions of the tasks in ST, SI = {2, 6, 9, 10, 11}. Step C1.2.4 is executed: assign the instance number δ = 2 to positions 2, 6, 9, 10 and 11 of the reverse individual and let flg₂ = 1, flg₆ = 1, flg₉ = 1, flg₁₀ = 1, flg₁₁ = 1; δ = 2 + 1 = 3. Step C1.2.5 is executed: ε = 2 + 1 = 3; since ε = 3 ≤ I = 15, go to step C1.2.2; ……; steps C1.2.2 to C1.2.5 are repeated until ε = 16 > I = 15, obtaining the adjusted virtual machine instance list {1,2,1,3,3,2,4,4,2,2,2,5,5,5,5} and virtual machine type list {2,2,2,4,4,4,3,3,5,1,2,5,3,1,4}, which together with the reversed task scheduling order list form the reverse individual.

Go to step C2;
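The adjustment of step C1.2 can be read as renumbering the virtual machine instances in order of first use along the new task order, carrying each instance's type to its new number. The sketch below implements that reading; it is an interpretation checked against the lists of this example, not a transcription of the stated sub-steps, and the function name is hypothetical.

```python
def renumber_instances(old_order, old_instances, old_types, new_order):
    """Renumber instances by first appearance along new_order and move their types."""
    instance_of_task = dict(zip(old_order, old_instances))
    new_number = {}  # old instance number -> new instance number
    new_instances = []
    for task in new_order:
        old = instance_of_task[task]
        if old not in new_number:
            new_number[old] = len(new_number) + 1
        new_instances.append(new_number[old])
    # Type entries beyond the instances actually used keep their old values.
    new_types = list(old_types)
    for old, new in new_number.items():
        new_types[new - 1] = old_types[old - 1]
    return new_instances, new_types


# ch3 before improvement and the reverse task order formed in step C1.1.
order = [3, 1, 4, 2, 5, 6, 7, 8, 10, 9, 11, 12, 13, 14, 15]
instances = [1, 1, 1, 1, 2, 2, 2, 3, 2, 4, 3, 4, 5, 2, 5]
types = [4, 2, 4, 2, 2, 4, 3, 3, 5, 1, 2, 5, 3, 1, 4]
reverse_order = [15, 14, 13, 12, 9, 10, 11, 8, 7, 6, 5, 2, 4, 1, 3]

rev_instances, rev_types = renumber_instances(order, instances, types, reverse_order)
print(rev_instances)
print(rev_types)
```

Tasks that shared an instance before the adjustment still share one afterwards, so the resource configuration is unchanged while the encoding again numbers instances in increasing order of first use.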

Step C2 is executed: decode the reverse individual using the insertion-based serial reverse individual decoding method, obtaining the reverse completion times of all tasks and the workflow reverse response time; since the workflow reverse response time is less than rs₃ = 419882, go to step C3;

Step C3 is executed: form the forward individual ch₃:

Step C3.1 is executed: rearrange the task scheduling order list {15,14,13,12,9,10,11,8,7,6,5,2,4,1,3} in descending order of task reverse completion time, i.e., set the i-th gene value in the task scheduling order list to the number of the i-th-from-last completed task in the reverse schedule, i = 1, …, 15; this forms {1,2,5,3,6,4,7,8,9,10,11,12,13,14,15};

Step C3.2 is executed: to preserve the original resource configuration scheme and the validity of the encoding, adjust the virtual machine instance list and the virtual machine type list. Step C3.2.1 is executed: ε = 1; δ = 1; flg₁ = … = flg_I = 0; k = 6, …, 15. Step C3.2.2 is executed: since flg_ε = flg₁ = 0, go to step C3.2.3. Step C3.2.3 is executed: find the scheduling position of task gr₁ = 1 in the task scheduling order list of the reverse individual, which is 14; in the reverse individual, find the set of task numbers that use the same virtual machine instance as that position, ST = {1, 2, 3, 4}; in {gr₁, …, gr₁₅} = {1,2,5,3,6,4,7,8,9,10,11,12,13,14,15}, find the set of scheduling positions of the tasks in ST, SI = {1, 2, 4, 6}. Step C3.2.4 is executed: let gs₁ = 1, gs₂ = 1, gs₄ = 1, gs₆ = 1 and flg₁ = 1, flg₂ = 1, flg₄ = 1, flg₆ = 1; δ = 1 + 1 = 2. Step C3.2.5 is executed: ε = 1 + 1 = 2; since ε = 2 ≤ I = 15, go to step C3.2.2.

Step C3.2.2 is executed: since flg_ε = flg₂ = 1, go to step C3.2.5. Step C3.2.5 is executed: ε = 2 + 1 = 3; since ε = 3 ≤ I = 15, go to step C3.2.2. Step C3.2.2 is executed: since flg_ε = flg₃ = 0, go to step C3.2.3. Step C3.2.3 is executed: find the scheduling position of task gr₃ = 5 in the task scheduling order list of the reverse individual, which is 11; in the reverse individual, find the set of task numbers that use the same virtual machine instance as that position, ST = {5, 6, 7, 10, 14}; in {gr₁, …, gr₁₅} = {1,2,5,3,6,4,7,8,9,10,11,12,13,14,15}, find the set of scheduling positions of the tasks in ST, SI = {3, 5, 7, 10, 14}. Step C3.2.4 is executed: let gs₃ = δ = 2, gs₅ = 2, gs₇ = 2, gs₁₀ = 2, gs₁₄ = 2 and flg₃ = 1, flg₅ = 1, flg₇ = 1, flg₁₀ = 1, flg₁₄ = 1; δ = 2 + 1 = 3. Step C3.2.5 is executed: ε = 3 + 1 = 4; since ε = 4 ≤ I = 15, go to step C3.2.2; ……; steps C3.2.2 to C3.2.5 are repeated until ε = 16 > I = 15, obtaining the adjusted virtual machine instance list {1,1,2,1,2,1,2,3,4,2,3,4,5,2,5} and virtual machine type list {4,2,4,2,2,4,3,3,5,1,2,5,3,1,4}, which finally form the forward individual ch₃ = {1,2,5,3,6,4,7,8,9,10,11,12,13,14,15; 1,1,2,1,2,1,2,3,4,2,3,4,5,2,5; 4,2,4,2,2,4,3,3,5,1,2,5,3,1,4}; go to step C4;

Step C4 is executed: decode the forward individual ch₃ using the insertion-based serial individual decoding method, obtaining the completion times of all tasks: f₁ = 12084, f₂ = 22668, f₃ = 40002, f₄ = 52222, f₅ = 55940, f₆ = 89320, f₇ = 124200, f₈ = 142060, f₉ = 186888, f₁₀ = 162284, f₁₁ = 153796, f₁₂ = 257328, f₁₃ = 286608, f₁₄ = 350408, f₁₅ = 388648, and its workflow response time rs₃ = 390328; since rs₃ = 390328 equals the workflow reverse response time, go to step C5;

Step C5 is executed: output the forward individual ch₃ = {1,2,5,3,6,4,7,8,9,10,11,12,13,14,15; 1,1,2,1,2,1,2,3,4,2,3,4,5,2,5; 4,2,4,2,2,4,3,3,5,1,2,5,3,1,4} and its workflow response time rs₃ = 390328, and calculate its workflow execution cost. The completion times are f₁ = 12084, f₂ = 22668, f₃ = 40002, f₄ = 52222, f₅ = 55940, f₆ = 89320, f₇ = 124200, f₈ = 142060, f₉ = 186888, f₁₀ = 162284, f₁₁ = 153796, f₁₂ = 257328, f₁₃ = 286608, f₁₄ = 350408, f₁₅ = 388648, and the start times are s₁ = 0, s₂ = 12084, s₃ = 22668, s₄ = 40002, s₅ = 22668, s₆ = 55940, s₇ = 89320, s₈ = 124200, s₉ = 142060, s₁₀ = 142060, s₁₁ = 142060, s₁₂ = 186888, s₁₃ = 257328, s₁₄ = 286608, s₁₅ = 350408;

Therefore tf₁ = 146188; similarly, tf₂ = 145324, tf₃ = 146092, tf₄ = 89320, tf₅ = 89320, tf₆ = 89320, tf₇ = 126600, tf₈ = 145324, tf₉ = 230888, tf₁₀ = 230888, tf₁₁ = 230888, tf₁₂ = 261168, tf₁₃ = 286608, tf₁₄ = 364648;

Hence Rnt₁ = max{tf₁, tf₂, tf₃, tf₄} = 146188, Hrt₁ = min{s₁, s₂, s₃, s₄} = 0, lt₁ = Rnt₁ − Hrt₁ = 146188; similarly, lt₂ = 341980, lt₃ = 106688, lt₄ = 119108, lt₅ = 133000;

The operation ends;
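The execution cost that follows from the lease times lt₁ to lt₅ can be checked with the sketch below. It assumes the charging rule fc + vc × ⌈max(lt, ft)/ut⌉ per instance, which is inferred from the numbers of this example and reproduces the value ct₃ = 1536.2 reported for the improved ch₃; the names are hypothetical.

```python
import math

# (vc yuan per billing unit, fc yuan, ut s, ft s) of the two types used by ch3.
PRICING = {2: (0.8, 4, 600, 7200), 4: (1.7, 7, 600, 7200)}


def rental_cost(lease_time, vm_type):
    """Assumed rule: fixed cost plus unit cost for every started billing unit."""
    vc, fc, ut, ft = PRICING[vm_type]
    return fc + vc * math.ceil(max(lease_time, ft) / ut)


lease_times = [146188, 341980, 106688, 119108, 133000]  # lt_1 .. lt_5
instance_types = [4, 2, 4, 2, 2]                        # types of instances 1 .. 5
ct3 = sum(rental_cost(lt, t) for lt, t in zip(lease_times, instance_types))
print(round(ct3, 1))
```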

Taking the above reverse individual as an example, the insertion-based serial reverse individual decoding method is implemented as follows:

Step D1 is executed: with SFL₁₅ = {f_15-1}, the reverse ready times of the tasks are obtained; ε = 1; let the available-period lists of the virtual machines be vatl₁ = {[0,∞]}, vatl₂ = {[0,∞]}, ……, vatl₅ = {[0,∞]};

Step D2 is executed: select the task numbered 15, the first task in the reverse scheduling order;

Step D3 is executed: assign task 15 to the virtual machine instance numbered 1. That is, step D3.1 is executed: searching vatl₁ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ et₁₅ = 38240 and the reverse ready time condition. Step D3.2 is executed: calculate the reverse start time of task 15, which is 1680, and its reverse completion time, which is 1680 + 38240 = 39920. Step D3.3 is executed: update the reverse ready time of the parent task of task 15. Step D3.4 is executed: delete [0,∞] from the available-period list vatl₁ and insert the remaining intervals of length greater than 0, [0,1680] and [39920,∞];

Step D4 is executed: ε = 1 + 1 = 2; since ε = 2 ≤ I = 15, go to step D2;

Step D2 is executed: select the task numbered 14, the second task in the reverse scheduling order;

Step D3 is executed: assign task 14 to the virtual machine instance numbered 2. That is, step D3.1 is executed: searching vatl₂ from earliest to latest, find an idle period [0,∞] that satisfies ∞ − 0 ≥ et₁₄ = 63800 and the reverse ready time condition. Step D3.2 is executed: calculate the reverse start time of task 14, which is 39920, and its reverse completion time, which is 39920 + 63800 = 103720. Step D3.3 is executed: update the reverse ready time of the parent task of task 14. Step D3.4 is executed: delete [0,∞] from the available-period list vatl₂ and insert the remaining intervals of length greater than 0, [0,39920] and [103720,∞];

Step D4 is executed: ε = 2 + 1 = 3; since ε = 3 ≤ I = 15, go to step D2;

……

Steps D2 to D4 are repeated until ε = 16 > I = 15, and then the process goes to step D5;

Step D5 is executed: obtain the reverse completion times of all tasks and the workflow reverse response time;

The operation ends;

in the above-mentioned positive direction

ch

₃1,2,5,3,6,4,7,8,9,10,11,12,13,14, 15; 1,1,2,1,2,1,2,3,4,2,3,4,5,2, 5; 4,2,4,2,2,4,3,3,5,1,2,5,3,1,4} decoding as an example, the serial forward individual decoding method based on the insertion mode is implemented as follows:

Step E1 is executed: let the ready times of all tasks be rt₁＝0，rt₂＝0，……，rt₁₅＝0; let ε＝1; let the available time slot lists of all virtual machine instances be vatl₁＝{[0,∞]}，vatl₂＝{[0,∞]}，……，vatl₅＝{[0,∞]}；

Step E2 is executed: select the task numbered i＝gr₁＝1;

Step E3 is executed: based on the insertion mode, assign task 1 to the virtual machine instance numbered k＝gs₁＝1; that is, Step E3.1 is executed: searching vatl₁ from earliest to latest, find an idle time slot [0,∞] satisfying ∞−0≥et₁＝12084 and ∞−12084≥rt₁＝0; Step E3.2 is executed: calculate the start time of task 1, s₁＝max{0,0}＝0, and its completion time, f₁＝s₁+et₁＝12084; Step E3.3 is executed: update the ready times of the subtasks of task 1: rt₄＝max{rt₄,f₁}＝12084，rt₅＝12084，rt₆＝12084，rt₉＝12084; Step E3.4 is executed: in the available time slot list vatl₁, delete [0,∞] and insert the interval [12084,∞], whose length is greater than 0;

Step E4 is executed: let ε＝1+1＝2; since ε＝2≤I＝15, go to Step E2;

Step E2 is executed: select the task numbered i＝gr₂＝2;

Step E3 is executed: based on the insertion mode, assign task 2 to the virtual machine instance numbered k＝gs₂＝1; that is, Step E3.1 is executed: searching vatl₁ from earliest to latest, find an idle time slot [12084,∞] satisfying ∞−12084≥et₂＝10584 and ∞−10584≥rt₂＝0; Step E3.2 is executed: calculate the start time of task 2, s₂＝max{12084,0}＝12084, and its completion time, f₂＝s₂+et₂＝22668; Step E3.3 is executed: update the ready times of the subtasks of task 2: rt₅＝max{rt₅,f₂}＝max{12084,22668}＝22668，rt₁₀＝22668; Step E3.4 is executed: in the available time slot list vatl₁, delete [12084,∞] and insert the interval [22668,∞], whose length is greater than 0;

Step E4 is executed: let ε＝2+1＝3; since ε＝3≤I＝15, go to Step E2;

……

Steps E2 to E4 are repeated until ε＝16＞I＝15, and then the process goes to Step E5;

Step E5 is executed: obtain the completion times of all tasks: f₁＝12084，f₂＝22668，f₃＝40002，f₄＝52222，f₅＝55940，f₆＝89320，f₇＝124200，f₈＝142060，f₉＝186888，f₁₀＝162284，f₁₁＝153796，f₁₂＝257328，f₁₃＝286608，f₁₄＝350408，f₁₅＝388648, and calculate the workflow response time rs₃＝390328; the operation ends;
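The decoding walk-through above can be sketched in Python as follows. This is a minimal illustration rather than the patented implementation: the function name `decode_forward` and its arguments are hypothetical, the execution times et_i are assumed to be given in advance (in the method they depend on the virtual machine type and on file transfers), and the response-time calculation is omitted.

```python
import math

def decode_forward(order, vm_of, exec_time, children, n_vms):
    """Simplified insertion-based serial forward decoding (Steps E1 to E5).

    order     -- scheduling order list gr (task numbers)
    vm_of     -- list gs: vm_of[pos] is the instance number of the pos-th scheduled task
    exec_time -- assumed precomputed execution time et_i of each task
    children  -- mapping task -> list of subtasks
    Returns the completion time f_i of every scheduled task.
    """
    ready = {t: 0.0 for t in order}                               # E1: rt_i = 0
    slots = {k: [(0.0, math.inf)] for k in range(1, n_vms + 1)}   # E1: vatl_k = {[0, inf]}
    finish = {}
    for pos, task in enumerate(order):                            # E2 and E4: loop over epsilon
        k, et = vm_of[pos], exec_time[task]
        for idx, (a, b) in enumerate(slots[k]):                   # E3.1: earliest slot first
            if b - a >= et and b - et >= ready[task]:
                start = max(a, ready[task])                       # E3.2: start and completion
                end = start + et
                finish[task] = end
                for child in children.get(task, ()):              # E3.3: subtask ready times
                    ready[child] = max(ready.get(child, 0.0), end)
                # E3.4: replace the slot by its leftover pieces of positive length
                pieces = [(a, start), (end, b)]
                slots[k][idx:idx + 1] = [p for p in pieces if p[1] - p[0] > 0]
                break
    return finish

# First two decoding steps of the example: tasks 1 and 2 on virtual machine instance 1.
finish = decode_forward(
    order=[1, 2], vm_of=[1, 1],
    exec_time={1: 12084, 2: 10584},
    children={1: [4, 5, 6, 9], 2: [5, 10]},
    n_vms=5,
)
```

Because the last slot of every list is unbounded, a fitting slot always exists, so the inner loop always assigns the task.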

After improvement by the FBI&D method, the individuals in the contemporary population become:

ch₁＝{1,3,2,6,5,4,7,8,9,11,10,12,13,14,15；1,2,3,2,1,3,1,1,1,4,4,4,4,4,4；5,5,5,5,3,4,5,4,2,2,1,4,3,2,3}

ch₂＝{3,2,1,5,4,6,7,8,11,10,9,12,13,14,15；1,2,3,4,5,5,2,2,4,1,2,4,2,5,1；4,3,1,3,5,4,5,1,3,5,1,1,5,3,3}

ch₃＝{1,2,5,3,6,4,7,8,9,10,11,12,13,14,15；1,1,2,1,2,1,2,3,4,2,3,4,5,2,5；4,2,4,2,2,4,3,3,5,1,2,5,3,1,4}

ch₆＝{1,2,5,3,6,4,7,8,9,10,11,12,13,14,15；1,1,2,1,2,1,2,1,3,4,3,5,2,6,2；5,4,2,1,3,3,5,3,2,4,5,4,1,1,3}

the individual workflow response times are respectively: rs₁＝120122.67，rs₂＝255658，rs₃＝390328，rs₄＝341793.33，rs₅＝321114，rs₆＝289314，rs₇＝284975.33，rs₈＝296870，rs₉＝414570.67，rs₁₀＝340553.33；

The individual workflow execution costs are respectively: ct₁＝985.4，ct₂＝2422.3，ct₃＝1536.2，ct₄＝2927.1，ct₅＝2318，ct₆＝1714.6，ct₇＝2328.1，ct₈＝3649，ct₉＝3114.3，ct₁₀＝3418.8；

Calculating the relative fitness value of all the infeasible individuals and the absolute fitness value of the feasible individuals in the population:

Because the cost constraint is Budget＝1100 and the time constraint is Deadline＝140000, ch₁ is a feasible individual, and ch₂、ch₃、ch₄、ch₅、ch₆、ch₇、ch₈、ch₉、ch₁₀ are infeasible individuals;

For the infeasible individuals ch₂、ch₃、ch₄、ch₅、ch₆、ch₇、ch₈、ch₉、ch₁₀, calculate the relative fitness values:

Similarly, rfit₃＝4.18、rfit₄＝5.1、rfit₅＝4.4、rfit₆＝3.63、rfit₇＝4.15、rfit₈＝5.44、rfit₉＝5.79、rfit₁₀＝5.54；

For feasible individual ch₁Calculating an absolute fitness value:

afit₁＝0.4×250×985.4+0.6×120122.67＝170613.60
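The fitness values listed above can be reproduced with the short Python sketch below. The relative fitness formula is not legible in this passage; the form ct/Budget + rs/Deadline is an assumption inferred from the listed rfit values, and the constants 0.4 and 250 are read off the afit₁ calculation. All function names are hypothetical.

```python
BUDGET = 1100.0       # cost constraint of the example
DEADLINE = 140000.0   # time constraint of the example
THETA = 0.4           # weight coefficient, read off the afit_1 calculation
MU = 250.0            # cost/time coordination coefficient, read off the afit_1 calculation

def is_feasible(rs, ct):
    """An individual is feasible when it meets both the budget and the deadline."""
    return ct <= BUDGET and rs <= DEADLINE

def relative_fitness(rs, ct):
    """Assumed form for infeasible individuals, inferred from the listed rfit values."""
    return ct / BUDGET + rs / DEADLINE

def absolute_fitness(rs, ct):
    """Weighted sum of cost and response time for feasible individuals."""
    return THETA * MU * ct + (1.0 - THETA) * rs

rfit_3 = relative_fitness(390328, 1536.2)       # individual ch3 of the example
afit_1 = absolute_fitness(120122.67, 985.4)     # individual ch1 of the example
```

Rounded to two decimals, `rfit_3` and `afit_1` agree with the values rfit₃＝4.18 and afit₁＝170613.60 given in the text.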

Since BtCh＝Null and the best individual in the contemporary population is ch₁＝{1,3,2,6,5,4,7,8,9,11,10,12,13,14,15；1,2,3,2,1,3,1,1,1,4,4,4,4,4,4；5,5,5,5,3,4,5,4,2,2,1,4,3,2,3}, let BtCh＝ch₁；

Step 5 is executed: if the termination condition is not met, go to Step 6; otherwise go to Step 8;

The termination condition is that the iteration reaches the specified number of generations TG＝200; the current iteration is generation 1, so the termination condition is not met and the process goes to Step 6;

Step 6 is executed: construct the elite population and update the probability models;

Take the elite rate r_e＝0.2，

Select, from best to worst, N_e＝2 individuals from the contemporary population, ch₁＝{1,3,2,6,5,4,7,8,9,11,10,12,13,14,15；1,2,3,2,1,3,1,1,1,4,4,4,4,4,4；5,5,5,5,3,4,5,4,2,2,1,4,3,2,3} and ch₆＝{1,2,5,3,6,4,7,8,9,10,11,12,13,14,15；1,1,2,1,2,1,2,1,3,4,3,5,2,6,2；5,4,2,1,3,3,5,3,2,4,5,4,1,1,3}, as the contemporary elite population POP_e＝{ch₁,ch₆}；
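The elite selection of this example can be sketched as follows. The ranking rule (feasible individuals first, by ascending absolute fitness; then infeasible individuals, by ascending relative fitness) is an assumption that is consistent with the result POP_e＝{ch₁,ch₆}; the rounding used for N_e and the name `select_elite` are likewise hypothetical.

```python
def select_elite(population, elite_rate):
    """population: list of (label, feasible, fitness); a smaller fitness is better.

    Feasible individuals are ranked ahead of infeasible ones (assumed rule).
    """
    ranked = sorted(population, key=lambda ind: (not ind[1], ind[2]))
    n_elite = max(1, round(elite_rate * len(population)))   # assumed rounding of r_e * N
    return [label for label, _, _ in ranked[:n_elite]]

generation_1 = [
    ("ch1", True, 170613.60),
    # rfit_2 is not listed in the text; it is computed here with the assumed
    # form ct/Budget + rs/Deadline from ct_2 and rs_2.
    ("ch2", False, 2422.3 / 1100 + 255658 / 140000),
    ("ch3", False, 4.18), ("ch4", False, 5.1), ("ch5", False, 4.4),
    ("ch6", False, 3.63), ("ch7", False, 4.15), ("ch8", False, 5.44),
    ("ch9", False, 5.79), ("ch10", False, 5.54),
]
elite = select_elite(generation_1, elite_rate=0.2)
```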

Take the update rate of the virtual machine allocation probability model

The virtual machine allocation probability model is updated as follows:

In the contemporary elite population POP_e, the 1st scheduled task is assigned to the virtual machine instance numbered 1 a total of 2 times, so:

Then, according to equation (9):

Similarly, the other α_i,k(2), i＝2,…,15, k＝1,…,15, can be obtained, and the finally updated virtual machine allocation probability model is:

Take the update rate of the task scheduling order probability model

The task scheduling order probability model is updated as follows:

In the contemporary elite population POP_e, the 1st scheduled task is task 1 a total of 2 times, so:

Then, according to equation (10):

Similarly, the other β_i,i′(2), i＝1,…,15, i′＝2,…,15, can be obtained, and the finally updated task scheduling order probability model is:

Take the update rate of the virtual machine type probability model

The virtual machine type probability model is updated as follows:

In the contemporary elite population POP_e, the type of the virtual machine instance numbered 1 is 5 in every individual, a total of 2 times, so:

Then, according to equation (11):

Similarly, the other δ_k,j(2), k＝2,…,15, j＝1,…,5, can be obtained, and the finally updated virtual machine type probability model is:
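The frequency counts used in the three updates above can be checked with the Python sketch below. Equations (9) to (11) and the update rates are not legible in this passage, so `pbil_update` only shows the convex-combination rule commonly used in estimation-of-distribution algorithms; it is an assumption, not the patent's formula, and the rate 0.1 in the last line is a made-up illustrative value.

```python
from collections import Counter

def position_counts(elite, part, position):
    """Count the values at a 1-based position of one list ("gr", "gs" or "gt")."""
    return Counter(ind[part][position - 1] for ind in elite)

def pbil_update(old_prob, count, n_elite, rate):
    """Assumed update rule: blend the old probability with the elite frequency."""
    return (1.0 - rate) * old_prob + rate * count / n_elite

ch1 = {"gr": [1, 3, 2, 6, 5, 4, 7, 8, 9, 11, 10, 12, 13, 14, 15],
       "gs": [1, 2, 3, 2, 1, 3, 1, 1, 1, 4, 4, 4, 4, 4, 4],
       "gt": [5, 5, 5, 5, 3, 4, 5, 4, 2, 2, 1, 4, 3, 2, 3]}
ch6 = {"gr": [1, 2, 5, 3, 6, 4, 7, 8, 9, 10, 11, 12, 13, 14, 15],
       "gs": [1, 1, 2, 1, 2, 1, 2, 1, 3, 4, 3, 5, 2, 6, 2],
       "gt": [5, 4, 2, 1, 3, 3, 5, 3, 2, 4, 5, 4, 1, 1, 3]}
elite = [ch1, ch6]

vm_count = position_counts(elite, "gs", 1)[1]     # 1st scheduled task on instance 1
task_count = position_counts(elite, "gr", 1)[1]   # 1st scheduled task is task 1
type_count = position_counts(elite, "gt", 1)[5]   # instance 1 has type 5
new_prob = pbil_update(old_prob=0.5, count=vm_count, n_elite=len(elite), rate=0.1)
```

All three counts equal 2, matching the counts stated in the text.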

Step 7 is executed: sample the current probability models PMS(g), PMA(g) and PMT(g) N times to generate N individuals, which form a new population; let the new population be the contemporary population; go to Step 4;

The current probability models PMS(2), PMA(2) and PMT(2) are sampled 10 times, generating the following 10 individuals:

ch′₁＝{2,3,1,5,4,6,7,8,10,11,9,12,13,14,15；1,2,3,4,5,6,3,1,6,2,7,8,1,9,2；4,4,2,1,2,1,3,2,2,4,5,2,3,1,5}

ch′₂＝{1,3,6,2,5,4,7,8,9,11,10,12,13,14,15；1,2,3,4,5,6,2,7,8,9,10,4,7,4,7；1,5,1,4,4,2,1,3,5,2,5,3,1,1,3}

ch′₃＝{3,1,2,5,6,4,7,8,11,10,9,12,13,14,15；1,2,3,4,1,3,5,6,6,7,1,4,2,4,1；1,1,4,3,4,1,2,4,2,2,5,4,1,4,3}

ch′₄＝{1,2,5,3,6,4,7,8,10,11,9,12,13,14,15；1,2,1,3,3,1,4,1,2,5,6,7,4,8,9；3,4,1,5,3,1,5,1,1,5,5,4,5,5,4}

ch′₅＝{1,4,3,6,2,5,7,8,9,11,10,12,13,14,15；1,2,3,4,1,1,1,4,5,6,3,5,4,7,4；4,3,2,4,2,4,1,1,4,4,3,4,5,2,2}

ch′₆＝{1,3,2,5,6,4,7,8,10,9,11,12,13,14,15；1,2,3,4,5,1,5,6,7,4,7,2,8,2,8；5,1,4,3,2,2,4,1,4,3,1,5,2,3,3}

ch′₇＝{3,2,1,6,4,5,7,8,9,10,11,12,13,14,15；1,2,1,3,2,4,5,1,6,7,4,6,3,7,8；3,4,3,1,3,4,4,4,5,1,5,3,1,2,1}

ch′₈＝{3,2,1,5,4,6,7,8,10,11,9,12,13,14,15；1,2,3,4,5,5,6,1,3,6,6,7,5,8,1；3,5,5,1,3,3,5,1,4,5,5,4,1,3,3}

ch′₉＝{1,3,6,4,2,5,7,8,9,10,11,12,13,14,15；1,2,3,2,4,1,5,6,3,7,8,4,2,9,4；4,4,1,5,3,3,5,1,5,3,1,2,2,1,3}

ch′₁₀＝{1,2,5,4,3,6,7,8,9,11,10,12,13,14,15；1,2,3,1,4,5,6,1,7,8,4,9,7,10,8；4,1,4,1,3,5,4,1,2,4,5,2,1,3,3}

Let the new population formed by these 10 individuals be the contemporary population; go to Step 4;

After FBI&D decoding and improvement, the individuals in the contemporary population become:

ch₁＝{2,3,1,5,4,6,7,8,10,11,9,12,13,14,15；1,2,3,4,5,6,3,1,6,2,7,8,1,9,2；4,4,2,1,2,1,3,2,2,4,5,2,3,1,5}

ch₂＝{1,3,6,2,5,4,7,8,9,11,10,12,13,14,15；1,2,3,4,5,6,2,7,8,9,10,4,7,4,7；1,5,1,4,4,2,1,3,5,2,5,3,1,1,3}

ch₃＝{3,1,2,5,6,4,7,8,11,10,9,12,13,14,15；1,2,3,4,1,3,5,6,6,7,1,4,2,4,1；1,1,4,3,4,1,2,4,2,2,5,4,1,4,3}

ch₄＝{1,2,5,3,6,4,7,8,10,11,9,12,13,14,15；1,2,1,3,3,1,4,1,2,5,6,7,4,8,9；3,4,1,5,3,1,5,1,1,5,5,4,5,5,4}

ch₅＝{1,4,3,6,2,5,7,8,9,11,10,12,13,14,15；1,2,3,4,1,1,1,4,5,6,3,5,4,7,4；4,3,2,4,2,4,1,1,4,4,3,4,5,2,2}

ch₆＝{1,3,2,5,6,4,7,8,10,9,11,12,13,14,15；1,2,3,4,5,1,5,6,7,4,7,2,8,2,8；5,1,4,3,2,2,4,1,4,3,1,5,2,3,3}

ch₇＝{1,2,5,3,6,4,7,8,11,9,10,12,13,14,15；1,2,3,1,4,2,5,1,3,6,7,6,4,7,8；3,4,1,3,3,4,4,4,5,1,5,3,1,2,1}

ch₈＝{3,2,1,5,4,6,7,8,10,11,9,12,13,14,15；1,2,3,4,5,5,6,1,3,6,6,7,5,8,1；3,5,5,1,3,3,5,1,4,5,5,4,1,3,3}

ch₉＝{1,3,6,4,2,5,7,8,9,10,11,12,13,14,15；1,2,3,2,4,1,5,6,3,7,8,4,2,9,4；4,4,1,5,3,3,5,1,5,3,1,2,2,1,3}

ch₁₀＝{1,2,5,4,3,6,7,8,9,11,10,12,13,14,15；1,2,3,1,4,5,6,1,7,8,4,9,7,10,8；4,1,4,1,3,5,4,1,2,4,5,2,1,3,3}

the individual workflow response times are respectively: rs₁＝396332，rs₂＝439099.33，rs₃＝538904，rs₄＝481070.67，rs₅＝349182，rs₆＝560492，rs₇＝290336，rs₈＝347732.67，rs₉＝328615.33，rs₁₀＝375500.67；

The individual workflow execution costs are respectively: ct₁＝3109，ct₂＝2955.2，ct₃＝2994.1，ct₄＝2861.6，ct₅＝2087.5，ct₆＝2584.3，ct₇＝2238.9，ct₈＝3123.6，ct₉＝3366.7，ct₁₀＝2116.8；

All individuals were infeasible and their relative fitness values were: rfit₁＝5.66，rfit₂＝5.82，rfit₃＝6.57，rfit₄＝6.04，rfit₅＝4.39，rfit₆＝6.35，rfit₇＝4.11，rfit₈＝5.32，rfit₉＝5.41，rfit₁₀＝4.61；

Since BtCh stores a feasible individual whose absolute fitness value is afit_BtCh＝170613.6, the best individual in the contemporary population is not better than the individual stored in BtCh, and therefore the content of BtCh is not replaced;

The termination condition is that the iteration reaches the specified number of generations TG＝200; the current iteration is generation 2, so the termination condition is not met and the process goes to Step 6;

……

Steps 6, 7, 4 and 5 are executed repeatedly until the specified number of generations TG＝200 is reached, at which point the contemporary population has become:

ch₁＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₂＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₃＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₄＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,3,1,2}

ch₅＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₆＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₇＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

ch₈＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,3,1,2}

ch₉＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,3,1,2}

ch₁₀＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,1,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}

the individual workflow response times are respectively: rs₁＝125977.33，rs₂＝125977.33，rs₃＝125977.33，rs₄＝125977.33，rs₅＝125977.33，rs₆＝125977.33，rs₇＝125977.33，rs₈＝125977.33，rs₉＝125977.33，rs₁₀＝140377.33；

The individual workflow execution costs are respectively: ct₁＝756.5，ct₂＝756.5，ct₃＝756.5，ct₄＝756.5，ct₅＝756.5，ct₆＝756.5，ct₇＝756.5，ct₈＝756.5，ct₉＝756.5，ct₁₀＝1008.5；

The relative fitness values of the infeasible individuals are respectively as follows: rfit₁₀＝1.92；

The absolute fitness values of the feasible individuals are respectively: afit₁＝151236.4，afit₂＝151236.4，afit₃＝151236.4，afit₄＝151236.4，afit₅＝151236.4，afit₆＝151236.4，afit₇＝151236.4，afit₈＝151236.4，afit₉＝151236.4；

BtCh＝{3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2}; its workflow response time is rs_BtCh＝125977.33, its execution cost is ct_BtCh＝756.5, and its absolute fitness value is afit_BtCh＝151236.4。

Step 8 is executed: if a feasible individual is stored in BtCh, output its corresponding execution scheme as the optimization scheme; otherwise, there is no feasible execution scheme.

Since {3,1,4,2,5,6,7,8,11,9,10,12,13,14,15；1,2,3,2,2,1,2,2,2,2,2,2,4,4,4；5,5,5,5,4,3,4,2,3,5,1,3,2,1,2} is a feasible individual, its corresponding execution scheme is output as the optimization scheme, as shown in Table 3.

Execution order	Task number	Instance number	Start time	Execution time	End time	Virtual machine type number
1	3	1	0	12084	12084	5
2	1	2	0	8584	8584	5
3	4	3	8584	9970.67	18554.67	5
4	2	2	8584	7584	16168	5
5	5	2	16168	9186.67	25354.67	5
6	6	1	12084	10870.67	22954.67	5
7	7	2	25354.67	16186.67	41541.33	5
8	8	2	41541.33	10626.67	52168	5
9	11	2	52168	8336	60504	5
10	9	2	60504	13566.67	74070.67	5
11	10	2	74070.67	5333.33	79404	5
12	12	2	79404	15426.67	94830.67	5
13	13	4	94830.67	10640	105470.67	5
14	14	4	105470.67	11666.67	117137.33	5
15	15	4	117137.33	8000.00	125137.33	5

TABLE 3

The above embodiments are only preferred embodiments of the present invention and are not intended to limit its technical solutions; any technical solution that can be realized on the basis of the above embodiments without creative effort shall be considered to fall within the protection scope of this patent.

Claims

1. A workflow execution optimization method based on a distributed estimation algorithm in a cloud computing environment, characterized by comprising the following steps:

acquire the task set T＝{t₁,...,t_I}, where t_i denotes task i, i.e. the task numbered i, and I is the number of tasks to be scheduled;

the output file list OFL_i generated after task i is processed、

acquire the relevant parameters of the virtual machines: the computing power ps_j of type-j virtual machines, the bandwidth bw_j of type-j virtual machines, the unit-time cost vc_j of type-j virtual machines, the fixed lease-start cost fc_j of type-j virtual machines, the minimum billing time unit ut_j of type-j virtual machines, and the minimum lease time ft_j of type-j virtual machines; the cost of renting a type-j virtual machine is calculated as follows:

where lt is the lease time and j＝1,2,…,J;

step 2: calculating a level value of the task;

for a start task i, which has no parent task, the level value is:

lvl_i＝1 (1)

the individual encoding method is as follows: ch＝{gr₁,…,gr_I；gs₁,…,gs_I；gt₁,…,gt_I}, where {gr₁,…,gr_I} is the scheduling order list, a topological ordering of the task numbers; {gs₁,…,gs_I} is the virtual machine allocation list, in which gs_i denotes the number of the virtual machine instance assigned to the i-th scheduled task, with gs₁＝1 and gs_i≤max{gs₁,…,gs_i-1}+1; {gt₁,…,gt_I} is the virtual machine type list, in which gt_i denotes the type of the virtual machine instance numbered i, and gt₁,…,gt_I are integers between 1 and J;
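The three constraints of this encoding can be checked with a short Python sketch. The function name `is_valid_individual` is hypothetical, and the `parents` mapping used in the example is only the fragment of the precedence relation that can be read off the decoding walk-through (tasks 4, 5, 6 and 9 follow task 1; tasks 5 and 10 follow task 2), not the full workflow.

```python
def is_valid_individual(gr, gs, gt, parents, n_types):
    """Check the encoding constraints on ch = {gr; gs; gt}."""
    n = len(gr)
    if not (len(gs) == len(gt) == n) or sorted(gr) != list(range(1, n + 1)):
        return False
    seen = set()
    for task in gr:                       # gr must be a topological order
        if any(p not in seen for p in parents.get(task, ())):
            return False
        seen.add(task)
    highest = 0
    for k in gs:                          # gs_1 = 1 and gs_i <= max(gs_1..gs_{i-1}) + 1
        if k < 1 or k > highest + 1:
            return False
        highest = max(highest, k)
    return all(1 <= t <= n_types for t in gt)   # every type lies in 1..J

# Partial precedence relation, assumed from the decoding example only.
parents = {4: [1], 5: [1, 2], 6: [1], 9: [1], 10: [2]}
ch3_ok = is_valid_individual(
    gr=[1, 2, 5, 3, 6, 4, 7, 8, 9, 10, 11, 12, 13, 14, 15],
    gs=[1, 1, 2, 1, 2, 1, 2, 3, 4, 2, 3, 4, 5, 2, 5],
    gt=[4, 2, 4, 2, 2, 4, 3, 3, 5, 1, 2, 5, 3, 1, 4],
    parents=parents, n_types=5,
)
```

The constraint on gs means that a new instance number may only be introduced as the next unused number, so two allocation lists that differ only in how the instances are numbered are not encoded twice.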

where α_i,k(g) denotes the probability, at generation g, that the i-th scheduled task is assigned to the virtual machine instance numbered k,

the probability model of the initial task scheduling sequence is as follows:

marking value

Satisfy the requirement of

Is that

where 1≤k＜n, then

Is that

The task of the ancestor of (c),

is that

The descendant task of (2);

the initial virtual machine allocation probability model is as follows:

the initial virtual machine type probability model is as follows:

where J is the number of virtual machine types;

step B1: sampling of virtual machine types:

step B1.1: let variable k be 1;

step B1.3: generating 1 random number λ ∈ [0,1) if

Then select type j, let gt_k＝j；

step B2: initializing a system state:

let the task set UT＝T;

Step B2.3: for every t_i in UT satisfying

move t_i to RT; let the variable q＝1 and the variable MI＝1;

Step B3: according to [β_1,q(g)…β_I,q(g)]^T, randomly select a task from RT by roulette; without loss of generality, let it be t_i; let gr_q＝i；

Step B4: according to [α_q,1(g)…α_q,I(g)], randomly select by roulette a virtual machine instance number in [1, MI]; let it be k and let gs_q＝k; if k＝MI, let MI＝MI+1;

Step B5: assign t_i to the virtual machine instance numbered k:

Step B5.1: calculate the execution time of t_i

Step B5.4: update the ready times of the subtasks of t_i

Step B5.6: from every

delete t_i, and delete t_i from RT；

Step B5.7: for every t_i in UT satisfying

move t_i to RT;
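Steps B2 to B5 can be sketched in Python as below, restricted to the parts that build the scheduling order list and the virtual machine allocation list; the time calculations of Step B5 and the type sampling of Step B1 are left out. The names are hypothetical, the ready set RT is assumed to hold the unscheduled tasks whose parents have all been scheduled, and the standard-library call `random.Random.choices` stands in for the roulette selection.

```python
import random

def sample_order_and_vms(parents, beta, alpha, rng):
    """Sample gr and gs from the probability models.

    beta[i-1][q-1]  -- weight of task i being the q-th scheduled task
    alpha[q-1][k-1] -- weight of instance k for the q-th scheduled task
    """
    n = len(beta)
    unscheduled = set(range(1, n + 1))            # UT
    done = set()
    gr, gs, mi = [], [], 1                        # MI: smallest unused instance number
    for q in range(n):
        # RT: unscheduled tasks whose parents are all scheduled (assumed meaning)
        ready = sorted(t for t in unscheduled
                       if all(p in done for p in parents.get(t, ())))
        task = rng.choices(ready, weights=[beta[t - 1][q] for t in ready])[0]        # B3
        k = rng.choices(list(range(1, mi + 1)), weights=alpha[q][:mi])[0]            # B4
        if k == mi:                               # a new instance was opened
            mi += 1
        gr.append(task)
        gs.append(k)
        unscheduled.remove(task)                  # B5.6 / B5.7: refresh UT and RT
        done.add(task)
    return gr, gs

# Hypothetical 4-task workflow with uniform models.
parents = {3: [1], 4: [1, 2]}
uniform = [[1.0] * 4 for _ in range(4)]
gr, gs = sample_order_and_vms(parents, uniform, uniform, random.Random(0))
```

Restricting the task draw to RT guarantees a topological order, and restricting the instance draw to [1, MI] guarantees gs₁＝1 and gs_i≤max{gs₁,…,gs_i-1}+1. The weights passed to each draw must have a positive sum.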

for each individual ch_n＝{gr₁,…,gr_I；gs₁,…,gs_I；gt₁,…,gt_I}, n＝1,…,N, in the population, the FBI&D comprises the following steps:

Step C1: form the reverse individual

k＝max{gs₁,…,gs_I}+1,…,I；

Step C1.2.2: if flg_ε＝0, go to Step C1.2.3; otherwise go to Step C1.2.5;

Step C1.2.3: find the scheduling order of task

in {gr₁,…,gr_I}; without loss of generality, let it be

in ch_n, find, for the virtual machine instance numbered

the set of numbers of the tasks that use it

in

find the set of scheduling orders of the tasks in ST

Step C1.2.4: for all i∈SI, let

flg_i＝1; let

let δ＝δ+1;

decode to obtain the reverse completion times of all tasks

and the workflow reverse response time

if

is less than rs_n, go to Step C3; otherwise go to Step C5;

Step C3.1: according to the task reverse completion times

rearrange the task scheduling order list from largest to smallest

and the virtual machine type list

form {gs₁,…,gs_I}、{gt₁,…,gt_I}：

Step C3.2.1: let the variable ε＝1 and the variable δ＝1; let the flag values flg₁＝...＝flg_I＝0; let

Step C3.2.2: if flg_ε＝0, go to Step C3.2.3; otherwise go to Step C3.2.5;

Step C3.2.3: find the scheduling order of task gr_ε in

without loss of generality, let it be

in

find, for the virtual machine instance numbered

the set of numbers of the tasks that use it

let δ＝δ+1;

Go to step C1, otherwise, go to step C5;

in the serial reverse individual decoding method based on the insertion mode, decoding the reverse individual

comprises the following steps:

Step D1: let the reverse ready times of all tasks be

is, for task

the set of output files exported to the shared database, i.e.

let the available time slot lists of the virtual machines be

let the variable ε＝1;

Step D2: take the number

and select the task with that number;

Step D3: take the number

and assign task i to the virtual machine instance with that number:

Step D3.2: calculate the reverse start time of task i

and its reverse completion time

Step D3.3: update the reverse ready times of the parent tasks of task i

and

Step D5: obtain the reverse completion times of all tasks

and the workflow reverse response time

the operation ends;

Step E2: select the task numbered i＝gr_ε;

Step E3: based on the insertion mode, assign task i to the virtual machine instance numbered k＝gs_ε;

Step E3.3: update the ready times of the subtasks of task i

Step E4: let ε＝ε+1; if ε≤I, go to Step E2; otherwise go to Step E5;

step 6: constructing an elite population and updating a probability model;

select, from best to worst, from the contemporary population

the method for updating the probability model comprises the following steps:

marking value

Marking value

Marking value

2. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein: one specific calculation method for the MBV and MDV is as follows:

wherein:

is the maximum execution time of t_i.

3. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein: in Step A2, the specific steps of generating an individual's virtual machine allocation list {gs₁,…,gs_I} and virtual machine type list {gt₁,…,gt_I} based on the benefit ratio are as follows:

Step A2.1: let the virtual machine instance set

where:

is the processing power of the virtual machine instance numbered k;

k⁻ is the number of the virtual machine instance that processes t_i⁻,

and

are the bandwidths of the virtual machine instances numbered k and k⁻;

Step A2.2.5: calculate the comprehensive benefit ratio ξ_i,k：

where θ∈[0,1] is the weight coefficient, μ＞0 is the coordination coefficient between cost and time,

is the lease time of the virtual machine instance numbered k after t_i is assigned to it, and lt_k′＝Rnt_k′-Hrt_k′ is the lease time of the virtual machine instance numbered k before t_i is assigned to it,

Step A2.2.6: if η≤J, go to Step A2.2.7; otherwise go to Step A2.3;

k⁻ is the number of the virtual machine instance that processes t_i⁻; τ_i,η is the file transfer time required to obtain the input files from the shared database when t_i is assigned to a type-η virtual machine instance,

Step A2.2.8: calculate the start time s_i,K+η＝rt_i and the completion time f_i,K+η＝s_i,K+η+et_i,K+η of t_i after it is assigned to this new type-η virtual machine instance;

Step A2.2.9: calculating the comprehensive benefit ratio:

wherein

Step A2.3: find the minimum among ξ_i,1,…,ξ_i,K+J; without loss of generality, let it be

if the subscript value

then let

otherwise, let gs_ε＝K+1、

Step A2.4.1: calculate the start time of the task

its end time

and its execution time et_i＝f_i-s_i；

Step A2.4.2: update the ready times of the subtasks of task i

4. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein: for an individual ch_n, the workflow response time rs_n and the execution cost ct_n are calculated as follows:

where:

is the response time of task gr_i,

wherein:

is the fixed lease start cost for the virtual machine instance numbered k,

is the cost per unit time of the virtual machine instance numbered k,

is the minimum billing time unit for the virtual machine instance numbered k,

is the minimum lease-on time of a virtual machine instance numbered k，

is to complete

has no file output to the shared database,

then the corresponding recipients are the virtual machines processing the subtasks of

if

has no subtasks, i.e.

the corresponding recipient is the shared database; otherwise, if

has both file output to the shared database and subtasks,

then the corresponding recipients are the virtual machines processing the subtasks of

together with the shared database,

5. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein: for an individual ch_n, n＝1,2,…,N, in the population, if ct_n≤Budget∧rs_n≤Deadline, then ch_n is a feasible individual; otherwise ch_n is an infeasible individual;

6. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein the specific steps in Step B3 of selecting a task from RT by roulette according to [β_1,q(g)…β_I,q(g)]^T are as follows:

Step B3.1: calculate the probability of each t_i in RT being selected

Step B3.2: calculate the cumulative probability:

Step B3.3: generate 1 random number λ∈[0,1); if

then t_i is selected, and the task selection operation ends.
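Steps B3.1 to B3.3 follow the usual roulette-wheel pattern, sketched below in Python. The formulas of these steps are not legible in this passage, so normalising the β values over the tasks in RT is an assumption; the function name and arguments are hypothetical, and the β values of the ready tasks are assumed to have a positive sum.

```python
import random
from itertools import accumulate

def roulette_select(ready_tasks, beta_column, rng=random):
    """Select one task from RT by roulette.

    ready_tasks -- task numbers currently in RT
    beta_column -- beta_column[i-1] is the weight of task i at the current position q
    """
    weights = [beta_column[t - 1] for t in ready_tasks]
    total = sum(weights)
    probs = [w / total for w in weights]          # B3.1: selection probability (assumed normalisation)
    cumulative = list(accumulate(probs))          # B3.2: cumulative probability
    lam = rng.random()                            # B3.3: lambda in [0, 1)
    for task, upper in zip(ready_tasks, cumulative):
        if lam < upper:                           # first task whose cumulative value exceeds lambda
            return task
    return ready_tasks[-1]                        # guard against floating-point rounding

chosen = roulette_select([1, 2, 3], [0.2, 0.5, 0.3, 0.0], random.Random(1))
```

The same pattern applies to choosing a virtual machine instance number in [1, MI], with the corresponding α values as weights.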

7. The workflow execution optimization method based on the distributed estimation algorithm in the cloud computing environment according to claim 1, wherein the specific steps in Step B4 of randomly selecting a virtual machine instance number in [1, MI] by roulette according to [α_q,1(g)…α_q,I(g)] are as follows:

Step B4.2: calculating the cumulative probability:

step B4.3: generating 1 random number λ ∈ [0,1) if