CN112948058A

CN112948058A - Response time optimization method for fair deployment after centralized service decoupling

Info

Publication number: CN112948058A
Application number: CN202110271251.3A
Authority: CN
Inventors: 魏同权; 段媛; 张润泽
Original assignee: East China Normal University
Current assignee: East China Normal University
Priority date: 2021-03-12
Filing date: 2021-03-12
Publication date: 2021-06-11
Anticipated expiration: 2041-03-12
Also published as: CN112948058B

Abstract

The invention discloses a response time optimization method for fair deployment after centralized service decoupling, which is characterized in that a service decoupling method and an MOEA/D algorithm-based sub-service are adopted for deployment, and the decoupled sub-service and a low-coupling-degree service are deployed on an edge server and subjected to multi-objective optimization, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server. Compared with the prior art, the method has the advantages of ensuring the shortest overall response time under the condition of guaranteeing the load fairness of the edge server, effectively improving the expected response time of the whole system, greatly reducing the occupation of network bandwidth resources, reducing the service communication cost by about 40 percent, along with simple method, convenient use and high efficiency.

Description

Response time optimization method for fair deployment after centralized service decoupling

Technical Field

The invention relates to the technical field of clustering and multi-objective optimization, in particular to a response time optimization method for fair deployment after centralized service decoupling in edge computing.

Background

At present, the influence and complexity between the centralized services of the cloud are high, the development efficiency is low, and the development of software application is difficult to meet. With the advent of small-scale clouds and server clusters in the edge of the network, deployment of services onto edge servers is supported, which can reduce the response time of the system and save network bandwidth. Compared to a traditional Content Delivery Network (CDN) that solves latency and bandwidth problems by deploying a cache close to a request, deployment to an edge server enables sensitive or critical business data of an enterprise to be better protected, because a CDN grid is typically located at a data center of an Internet Service Provider (ISP) and belongs outside the enterprise.

In the service deployment strategy in the prior art, the position of the edge server and the deployment cost are considered, but the fairness of the working time between the edge servers is ignored, so that the overall response time of the load of the edge servers is longer, a large amount of network bandwidth resources are wasted or occupied, and the service communication cost is high.

Disclosure of Invention

The invention aims to design a response time optimization method for fair deployment after centralized service decoupling aiming at the defects of the prior art, which adopts a service decoupling method and an MOEA/D algorithm-based sub-service deployment, deploys the decoupled sub-service and a low-coupling-degree service on an edge server and performs multi-objective optimization, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server, the expected response time of the whole system is effectively improved, the occupation of network bandwidth resources is greatly reduced, the service communication cost is reduced by about 40%, and the method is simple, convenient to use and high in efficiency.

The purpose of the invention is realized as follows: a response time optimization method for fair deployment after decoupling of centralized services is characterized in that a service decoupling method and an MOEA/D algorithm-based sub-service deployment are adopted, the decoupled sub-service and a low-coupling-degree service are deployed on an edge server and subjected to multi-objective optimization, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server, the specific optimization comprises the construction of a coupling degree model, a transmission service communication overhead model and a communication delay model, the coupling degree calculation is carried out on all the services to obtain the coupling degree value of the service, the coupling degree of the service is used for judging whether the coupling degree of the service is higher than a given coupling degree threshold value or not, if the coupling degree of the service is lower than the coupling degree threshold value, the deployment of the sub-service is directly carried out, and if the coupling degree of the service is higher. The method comprises the following steps of decoupling two stages of high coupling services, firstly decoupling in an off-line stage, extracting characteristics based on service fields according to the subscription condition of each edge server to the high coupling services, clustering the high coupling services by using a k-means based method, decoupling in an on-line stage, taking the clustering result of the off-line decoupling as the initial input of the on-line clustering, clustering services with fields added or deleted by using a streaming k-means based method in real time, proposing a sub-service deployment model for the sub-services comprising the services after low coupling services and high coupling services are decoupled, and deploying the sub-services by using a MOEA/D based method, wherein the response time optimization of fair deployment specifically comprises the following steps:

step 1: and constructing a system architecture and coupling degree model of the edge cloud, a transmission service communication overhead model and a communication delay model.

Step 2: calculating the coupling degree of all the services, judging the coupling degree value of the obtained service and a given coupling degree threshold value, and if the coupling degree value is lower than the coupling degree threshold value, directly deploying the sub-services; if the coupling degree threshold value is higher, the service is decoupled.

And step 3: and (3) performing two-stage decoupling on the high coupling service in the step (2), firstly performing off-line stage decoupling, performing service field-based feature extraction according to the subscription condition of each edge server to the high coupling service, and clustering the high coupling service by using a k-means-based method. And then performing on-line stage decoupling, taking a clustering result of off-line decoupling as initial input of on-line clustering, and performing real-time clustering on services with added or deleted fields by using a streaming k-means based method.

And 4, step 4: and deploying sub-services based on an MOEA/D algorithm according to the constructed coupling degree model, the transmission service communication overhead model and the communication delay model, wherein the sub-services comprise the low-coupling service in the step 2 and the clustering output result of the two stages in the step 3.

And 5: and after deployment is finished, obtaining an optimal solution of multiple targets, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server.

The implementation process of the step 1 specifically comprises the following steps:

step A1: construction of degree of coupling model

Establishing cloud server to maintain N_sA service, each service S_i(1≤i≤N_s) By

Field composition for edge server N_esEach edge server E_j(1≤j≤N_es) Subscribing to several fields in one or more services according to its own needs.

Is provided with

Representing edge servers E_jAnd service S_iThe subscription relationship of the middle field k, if E_jSubscribed to service S_iK in (1), then

Is 1, otherwise is 0, respectively introduces the influence of different operations of the fields expressed by the following formulas a to c on the service

And

thus, the edge server E_jSubscription service S_iThe influence of the middle field k on the middle field is represented by the following formula d

Wherein: w is a_jAre the influence coefficients of different operations.

Thus, edge server E_jSubscription service S_iThe influence of (a) is represented by the following formula e

Wherein: p is a radical of_kIs the modification frequency of field k.

To sum up, service S_iDegree of coupling F_iCan be calculated by the following formula f:

step A2: construction of transport service communication overhead model

Recording service S_iOffload from cloud Server to edge Server E_jHas an additional information size of A (A > 0), and serves S_iSize B_iThen transmitting service S_iOverhead of communication

Can be calculated from the following formula g:

will S_iIs divided into

Sub-services, i.e.

Partitioned sub-services

A size of

Transmits its sub-service

Overhead of communication

Can be calculated by the following equation h:

wherein:

satisfy the requirement of

Edge server E_kSubscribed to sub-services

Fields of China, i.e. considered as subscribed to a sub-service

Its edge server E_kWhether or not sub-services are subscribed to

Is represented by the following formula i

Then, the edge server E_kAccepting sub-services offloaded from cloud servers

The required communication overhead can be calculated by the following equation j:

then, the edge server E_kAccepting service S offloaded from cloud server_iThe required communication overhead can be calculated by the following k:

therefore, all edge servers receive the service S in the cloud server through a given service dividing method_iThe communication overhead of (c) is then calculated by the following equation m:

and satisfies the following constraints:

A＞0，

step A3: construction of communication delay model

Note the book

Two edge servers E represented by the following n formula_iAnd E_jInter-transmission delay:

wherein: b (E)_i，E_j) Is the bandwidth; w (E)_i，E_j) Is represented by the following r formulaIs shown at edge server E_iTo the edge server E_jSub-service set S transmitted between_nbThe size of (2):

note the book

Is the propagation delay expressed by the following equation:

wherein: d (E)_i，E_j) Is the distance between two edge servers; theta is the propagation speed of the electric signal in the optical cable; if there is no route between the two edge servers, then

Will be set to infinity;

in summary, the wired communication delay between two edge servers is calculated by the following equation:

wherein:

for terminal i to edge server E_jThe delay of the wireless transmission distance between the two is represented by the following u formula:

wherein: w (i, E)_j) Representing terminal i to edge server E_jThe size of the transmission task, and the maximum data transmission rate tr (i, E)_j) From v as followsCalculating the formula:

wherein: b (i, E)_j) For terminal i to edge server E_jBandwidth of the wireless communication link to transmit data;

is the signal-to-noise ratio; p_iIs the transmission power of terminal i; g (i, E)_j) For terminal i and edge server E_jA signal gain in between; n is a radical of_pIs the in-channel noise power.

Thus, the terminal i goes to the edge server E_jIs calculated by the following equation w:

wherein: e_iIs located next to the base station closest to terminal i.

The step 2 specifically comprises: and calculating the coupling degrees of all services by using a coupling degree model formula. And judging whether the coupling degree of the service is higher than a given coupling degree threshold value or not, and if the coupling degree of the service is lower than the coupling degree threshold value M, the service is called as low-coupling service, and the service can be directly deployed. If the coupling degree is higher than the threshold value, the service is called as a high coupling service, decoupling is needed to be performed on the service, and then the service after decoupling, namely deployment of the sub-service, is performed.

The implementation process of the step 3 specifically comprises the following steps:

step B1: memory two-dimensional matrix

For service S_i(1≤i≤N_s) The feature matrix, which stores the field information of all edge servers subscribing the service, is represented by the following formula 1:

note the book

Representing a service S for the following formula 2_iThe feature vector of field k:

then, a two-dimensional matrix

Feature matrix represented by the following formula 3

Wherein,

for service S_iThe number of fields contained in;

according to the characteristic matrix

Performing k-means clustering on the obtained data, determining k value, and initializing clustering center by using k-means +

The minimum objective function is calculated by the following formula 4:

step B2: taking the clustering result of the step B1 as the input of the online clustering of the step, i.e. giving the t batch numberAccording to the clustering center after arrival and the batch data

Wherein: o is_tWhen the t +1 th batch of data arrives, the updated clustering center is expressed by the following formulas 5-6:

wherein: α is an attenuation factor, when increasing the field, ± select +, when decreasing the field, ± select-;

is the kth clustering center when the t-th batch processing arrives;

a number indicating the kth cluster;

the cluster center of the new field data of the cluster is represented by the following formula 7:

wherein:

is a certain field data in the t-th batch of data; i.e. i_vTo represent

Belong to item i_vClustering;

the number of new data assigned to the cluster is represented by equation 8 below:

in summary, the new cluster center point is represented by the following formula 9:

the implementation process of the step 4 specifically comprises the following steps:

step C1: definition of S_subFor clustered partitioned sub-service and low coupling service sets, S_i(1≤i≤N_s) Represented by the following formula 10

The sub-services:

the divided fields in each sub-service are not repeated, and the union of the two sub-services is null according to the following formula 11:

then S_subCan be represented by the following formula 12:

recording vis (i, s)_j)(1≤i≤N_t，s_j∈S_sub) Whether or not terminal i represented by the following formula 13 subscribes to sub-service s_j：

Note the book

Indicates whether the condition i-C is satisfied, if so, the method

Otherwise

Wherein: c is

A constant value; i is a variable represented by the following formula 14:

input(s)_j) Representing sub-services s_jWhether or not to deploy to edge server E_i(1≤i≤N_es) Up, if deployed, put(s)_j) Can be represented by the following formula 15:

put(s_j)＝i(15)；

wherein: put(s)_j)∈[0，N-1]And is

In summary, the objective function for optimizing the response time is calculated by the following equation 16:

wherein: i S_subIs the sub-service set S_subThe number of services; e is N_esA set of edge servers; n is a radical of_tRefers to the number of terminals.

Note t (E)_j，s_k) Is composed ofEdge server E represented by the following formula 17_jHandling sub-service jobs_kRequired time period of (c):

wherein:

serving a child s_kThe size of (d);

as an edge server E_jComputing power of (1), note t_max，t_minThe method comprises the following steps of respectively representing the longest working time and the shortest working time of the edge server by the following formulas 18-19:

in summary, the objective function for optimizing the edge server work balance can be represented by the following equation 20:

I₂＝min(t_max-t_min) (20)；

and the following constraint conditions are satisfied:

vis(i，s_k)∈{0，1}；

g(i，E_j)＞0；

N_p＞0；

s_k∈S_sub；

1≤i≤N_t，1≤j≤N_es，1≤k≤|S_sub|。

step C2: converting the multi-objective optimization problem into a plurality of single-objective optimization problems represented by the following formula 21 by using a MOEA/D algorithm:

s.t.x∈Ω

wherein: λ ═ λ₁，λ₂，...，λ_m) As weight vector, for each lambda_iEach of (i ═ 1, 2,. m) has λ_iIs not less than 0 and

is a reference vector in which the value of each reference point is the minimum value of each dimension of the objective function, i.e.

Inputting a multi-objective optimization problem, a condition for stopping iteration and a population size N_popWeight vector and number of neighbors of weight vector T_nbAnd finally outputting a set PS of pareto optimal points, wherein the specific algorithm is as follows: firstly, initializing PS to be a non-empty set, and setting an initial reference point

Calculating the distance between each weight vector and other vectors, and calculating the T nearest to the current individual_nbIndividual weight vectors by randomly initializing population individuals

Calculating a target value FVⁱ＝F(xⁱ) (ii) a For each population individual, randomly selecting two from neighbors, using a genetic operator to generate a solution y, performing Gaussian rounding on the generated new solution to obtain y', and then updating a reference point

If the updated reference point z_jThe value being less than that of the objective function, i.e. z_j＜f_j(y'), updating the reference point to z_j＝f_j(y'); updating the neighborhood, and for each neighbor j, if the constraint value corresponding to the new solution is less than or equal to the original individual of the neighborhood, namely g (y' | lambda)^j，z)≤g(x^j|λ^jZ), then the original individual is replaced with the new solution, i.e. x^jThe target value FV is updated simultaneously when y is equal to y^jF (y'); and finally, updating the PS value, judging whether a stopping condition is met, and stopping iteration and outputting the PS if the stopping condition is met.

Compared with the prior art, the method has the advantages of ensuring the shortest overall response time under the condition of guaranteeing the load fairness of the edge server, effectively improving the expected response time of the whole system, greatly reducing the occupation of network bandwidth resources, reducing the service communication cost by about 40 percent, along with simple method, convenient use and high efficiency.

Drawings

FIG. 1 is a bar graph of the degree of coupling of example 1;

FIG. 2 is a graph of communication overhead before/after service decoupling according to example 1;

FIG. 3 is a time histogram of the solution service add field operation of the online and offline algorithm of example 1;

FIG. 4 is a time histogram of the solution of the service reduction field operation of the online and offline algorithm of embodiment 1;

FIG. 5 is a graph of the average waiting time of the online and offline algorithms of different numbers of terminals in embodiment 1;

fig. 6 is a histogram of the optimal solution of service deployment obtained by three algorithms for the number of edge servers of 3 in embodiment 1;

FIG. 7 is a histogram of the optimal solution of service deployment obtained by three algorithms for the number of edge servers of 4 in example 1;

fig. 8 is a histogram of the optimal solution of service deployment obtained by three algorithms for the number of edge servers of 5 in embodiment 1.

Detailed Description

The method comprises the steps of constructing a system architecture and a coupling degree model, a transmission service communication overhead model and a communication delay model, calculating the coupling degree of all services, judging the coupling degree value of the obtained service and a given coupling degree threshold value, and directly deploying the sub-services if the coupling degree value of the service is lower than the coupling degree threshold value. Decoupling the service if the coupling value of the service is above the coupling threshold. The high coupling service is decoupled in two stages, firstly, the off-line stage decoupling is carried out, the feature extraction based on the service field is carried out according to the subscription condition of each edge server to the high coupling service, and the high coupling service is clustered by using a method based on k-means. And then performing on-line stage decoupling, taking a clustering result of off-line decoupling as initial input of on-line clustering, and performing real-time clustering on services with added or deleted fields by using a streaming k-means based method. For services after the sub-services comprise low coupling services and high coupling services after decoupling, a sub-service deployment model is provided, and the sub-services are deployed by using a MOEA/D (mobility agent architecture/data encryption) based method, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server, and the specific optimization comprises the following steps:

step 1: system architecture and coupling degree model for constructing edge cloud, transmission service communication overhead model and communication delay model

A1: degree of coupling model

And (4) forming fields. For edge server N_esIn other words, each edge server E_j(1≤j≤N_es) Subscribing to several fields in one or more services according to its own needs. Is provided with

Is 1, otherwise is 0. Introducing the influence of different operations on the service represented by the following a-c expressions

And

Wherein: w is a_lAre the influence coefficients of different operations.

Wherein: p is a radical of_kIs the modification frequency of field k.

To sum up, service S_iThe calculation formula of the coupling degree is expressed by the following formula f:

a2: construction of transport service communication overhead model

Represented by the following formula g:

will S_iIs divided into

Sub-services, i.e.

Partitioned sub-services

A size of

Transmits its sub-service

Overhead of communication

Represented by the following formula h:

wherein:

satisfy the requirement of

Edge server Ek subscribes to sub-services

Fields of China, i.e. considered as subscribed to a sub-service

Its edge server E_kWhether or not sub-services are subscribed to

Then it is represented by the following formula i

Then, the edge server E_kAccepting sub-services offloaded from cloud servers

The required communication overhead is represented by the following j formulaComprises the following steps:

then, the edge server E_kAccepting service S offloaded from cloud server_iThe required communication overhead is represented by the following k:

therefore, all edge servers receive the service S in the cloud server through a given service dividing method_iThe communication overhead is expressed by the following equation m:

and satisfies the following constraints:

A＞0，

a3: construction of communication delay model

Note the book

wherein: b (E)_i，E_j) Is the bandwidth; w (E)_i，E_j) Is represented by the following r formula in an edge server E_iTo the edge server E_jSub-service set S transmitted between_nbThe size of (2):

note the book

Is the propagation delay expressed by the following equation:

Will be set to infinity.

In summary, the wired communication delay between two edge servers is represented by the following equation:

wherein:

wherein: w (i, E)_j) Representing terminal i to edge server E_jThe size of the transmission task, and the maximum data transmission rate tr (i, E)_j) Represented by the following formula v:

signal-to-noise ratio; p_iIs the transmission power of terminal i; g (i, E)_j) For terminal i and edge server E_jA signal gain in between; n is a radical of_pIs the in-channel noise power.

Thus, the terminal i goes to the edge server E_jIs represented by the following w formula:

wherein: e_iIs located next to the base station closest to terminal i.

Step 2: and calculating the coupling degrees of all services by using a coupling degree model formula. And judging whether the coupling degree of the service is higher than a given coupling degree threshold value or not, and if the coupling degree of the service is lower than the coupling degree threshold value M, the service is called as low-coupling service, and the service can be directly deployed. If the coupling degree is higher than the threshold value, the service is called as a high coupling service, decoupling is needed to be performed on the service, and then the service after decoupling, namely deployment of the sub-service, is performed.

And step 3: the method comprises the following steps of performing off-line stage decoupling and on-line stage decoupling on the coupling service higher than a coupling threshold, wherein the implementation process specifically comprises the following steps:

b1: memory two-dimensional matrix

note the book

then, a two-dimensional matrix

Feature matrix represented by the following formula 3

Wherein,

for service S_iThe number of fields contained in (a).

According to the characteristic matrix

The minimization objective function is expressed by the following formula 4:

b2: taking the clustering result of the step B1 as the input of the online clustering of the step, namely, the clustering center after the given t-th batch data arrives, and the batch data

is the kth clustering center when the t-th batch processing arrives;

a number indicating the kth cluster;

wherein:

is a certain field data in the t-th batch of data; i.e. i_vTo represent

Belong to item i_vClustering;

and 4, step 4: deploying the sub-services based on an MOEA/D algorithm according to the constructed coupling degree model, the transmission service communication overhead model and the communication delay model, wherein the sub-services comprise the low coupling service in the step 2 and the clustering output result in the two stages in the step 3, and the implementation process specifically comprises the following steps:

c1: definition of S_subFor clustered partitioned sub-service and low coupling service sets, S_i(1≤i≤N_s) Represented by the following formula 10

The sub-services:

then S_subCan be represented by the following formula 12:

Note the book

Indicates whether the condition i-C is satisfied, if so, the method

Otherwise

Wherein: c is

A constant value; i is a variable represented by the following formula 14:

put(s_j)＝i (15)；

wherein: put(s)_j)∈[0，N-1]And is

In summary, the objective function for optimizing the response time is expressed by the following equation 16:

Note t (E)_j，s_k) An edge server E represented by the following formula 17_jHandling sub-service jobs_kRequired time period of (c):

wherein:

serving a child s_kThe size of (d);

I₂＝min(t_max-t_min) (20)；

and the following constraint conditions are satisfied:

vis(i，s_k)∈{0，1}；

g(i，E_j)＞0；

N_p＞0；

s_k∈S_sub；

1≤i≤N_t，1≤j≤N_es，1≤k≤|S_sub|。

c2: converting the multi-objective optimization problem into a plurality of single-objective optimization problems represented by the following formula 21 by using a MOEA/D algorithm:

s.t.x∈Ω

If the updated reference point z_jThe value being less than that of the objective function, i.e. z_j＜f_j(y'), updating the reference point to z_j＝f_j(y'); updating the neighborhood, and for each neighbor j, if the constraint value corresponding to the new solution is less than or equal to the original individual of the neighborhood, namely g (y' | lambda)^j，z)≤g(x^j|λ^jZ), then the original individual is replaced with the new solution, i.e. x^jThe target value FV is updated simultaneously when y is equal to y^jF (y'); finally, updating the PS value, judging whether a stop condition is met, if so, stopping iteration and outputtingPS。

And 5: after deployment is finished, a multi-target optimal solution can be obtained, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server.

The following takes the specific implementation of the simulation experiment as an example to verify the effectiveness of the coupling degree model, the two-stage method for decoupling the high-coupling service and the sub-service deployment strategy, and further details the present invention.

Example 1

(I) Experimental setup

1-1 degree of coupling model

To validate the degree of coupling model proposed by the present invention, a data set provided by CFETS information technology (shanghai) ltd was used. The data set contains field data for 20 systems subscribed to each service. Through this data set, 15 service-based system subscription states are consolidated out. The system is then treated as an edge server and the coupling calculation is performed on the 15 services. After the coupling values are obtained, the high coupling threshold is selected by a quartile method, i.e., the top 25%, 50%, and 75% of the services are considered high coupling services, in order from high to low. Wherein,

1-2 high-coupling service decoupling model

To verify the effectiveness of the two stages proposed by the present invention on the high coupling service decoupling method, the offline method and the online method were compared and analyzed and used two data sets, one provided by CFETS information technology (shanghai) ltd, and the other being the american census data (1990) data set, which contained 68 attributes for clustering. For the first data set, 15 services were clustered by k-means to verify the effectiveness of the offline decoupling algorithm. For the add-field based online decoupling algorithm, the first 200,000 pieces of data were selected from the second dataset and divided into 10 groups of 20,000 data each. For each set of data, the first 10,000 data were used for offline clustering. The last 10,000 data were used for online clustering, with an average of 5 groups. 4 batches of data are set and the last 2,000 data will be discarded to prevent data corruption and unusable cases. The benchmark algorithm is implemented using only an offline decoupling algorithm for each set of 18,000 data.

In the online decoupling algorithm, the parameter α is set to 0.9. Meanwhile, to verify the validity of the delete fragment based on-line decoupling algorithm, for the first dataset, the batch is set to 2, and each batch will be (n)_min5% +3) as the number of fragments to be deleted. Wherein the deleted fragments are randomly generated; n is_minIs the minimum number of each cluster. For the second data set, the first 200,000 pieces of data were selected from the second data set and divided into 10 groups of 20,000 data each. For each set of data, the first 10,000 data are used, while the last 10,000 data are discarded. Set 4 batches of data, each batch to be (n)_min5% +3) as the number of segments to be deleted, wherein the segments to be deleted will be generated randomly. The benchmark algorithm simply uses an offline decoupling algorithm for each set of remaining data. Since no new data is introduced in the deleted fields, there is no need to make the model sensitive to the deleted data, so the parameter α is 1. The offline and online algorithms are compared to the three indicators "bandwidth savings", "average latency" and "algorithm run time".

For the bandwidth indicator saved, the communication overhead is calculated according to the aforementioned f formula

Set A to 20 bytes and set each field size to 200 bytes, which contains 150 bytes of metadata and 50 bytes of data. If sub-service S_iIncluded

Segment, size of sub-service is obtained

For the average latency indicator, note t₀Is an initial time, N_bIs the number of batches. Recording the ith batch of data to start transmissionThe time of input is t_2i-1And for the ith batch of data,

represents the transmission from the cloud server to the terminal k (1 ≦ k ≦ N_t) Each field having a size b and a transmission rate of

Is the moment when terminal k starts transmission. Therefore, it can be concluded that the transmission of the ith batch of data is completed at time t_2iNamely:

in summary, the average latency of the offline decoupling algorithm is:

wherein: t is t_ntTime for next offline decoupling clustering; a is a specific time point in the time period.

The average latency of the on-line decoupling algorithm is:

setting an initial time to t₀Is 0, t_2i-1In the interval [24h, 72h]In (1),

set to 2000. Each field size b is 200 bytes.

Will be taken randomly from 0-2 h. a is set to 0.5. Wherein t is_ntIs equal to

Since the offline algorithm will execute immediately after the last batch of data arrives.

1-3 sub-service deployment policies

In order to verify the effectiveness of the MOEA/D adopted by the invention, the MOEA/D is compared and analyzed by using three reference algorithms, namely RAND, GA and PSO. The data sets used are the data sets of the base stations of the Shanghai telecom and the data sets of the trajectories of the Shanghai taxis. The data set of base stations in the shanghai telecommunications contains the exact location information of 3,233 base stations and the information of mobile subscribers connected to these base stations. This dataset is used to model the location information of base stations and cloud servers, some edge servers being randomly configured on some base stations. The shanghai taxi track dataset contains tracks of 4,328 taxis from shanghai on 2 months and 20 days of 2007. A specific time period is randomly selected from the data set, then a certain number of taxis are randomly selected during the time period, and the position of the taxis is simulated as the position of the terminal.

Since the position information in the data set is composed of latitude and longitude, given longitude and latitude are calculated as the following expression 1-3 between two points A (x)₁，y₁) And B (x)₂，y₂) Distance Len of_AB：

Wherein: r is 6371 kilometer of the radius of the earth; a ═ x₁-x₂；b＝y₁-y₂。

In the wired communication model, two edge servers E_i、E_jBandwidth B (E) in between_i，E_j) In [10,20 ]]Evenly distributed in Mhz. In the wireless communication model, the wireless channel gain g (i, E)_j) Is 127+30 × logd. Where d is the terminal i to the edge server E_jThe distance between them; terminal i and edge server E_jB (i, E) of the channel bandwidth in between_j) Randomly generated in 10Mhz to 20 Mhz; noise power is set to 2 × 10^-13W, the transmission power of each terminal is 0.5W; the sub-service size is set between 300MB and 600 MB; the storage space of the edge server is in the interval of [10, 16 ]]GB; the computational resources are 25Ghz and the computational intensity is [500,1000 ]]Cycle/bit. Thus, the computing power of the edge server

Is randomly generated between 25-50 bit/s.

(II) results of the experiment

2-1 degree of coupling model

The coupling values of the 15 services in the first data set are calculated according to the coupling definition and sorted from low to high according to the coupling values, and the calculation results are detailed in the service coupling values of table 1 below:

table 1: service coupling degree value

The services in table 1 above are the top 25%, 50% and 75%, respectively, and are considered to be highly coupled services.

Referring to fig. 1, services with a degree of coupling exceeding 25% are shown, sorted according to the degree of coupling value.

2-2 high coupling service decoupling model

Referring to fig. 2, it can be seen that the communication cost of most services after decoupling is reduced by about 40% compared to that before decoupling, and only service 8 after decoupling has a higher communication cost than that before decoupling. This is because after decoupling the service 8, all edge servers subscribing to the service 8 still fully subscribe to all sub-services divided by the service 8. The additional overhead will be accumulated after decoupling, so the communication cost of service 8 after decoupling is higher than before decoupling.

Referring to fig. 3-4, the results of the algorithm runtime for 10 data sets in the second data set are shown, respectively, using offline and online algorithms to solve the problem of adding (or deleting) fields by the service. It can be seen from the figure that the running time required by the online algorithm is less than that required by the offline algorithm, whether the segment is added or deleted, without reducing the accuracy of the algorithm. When the field adding service is considered, the online algorithm can save 27% of the time required for completing the offline algorithm on average; the online algorithm can save on average 8% of the time required to complete the offline algorithm when considering the service of deleting segments. This is because online algorithms require only one round to divide all data, while offline algorithms typically require several rounds of algorithms to converge.

Referring to fig. 5, the average wait time for offline and online algorithms with different numbers of terminals is shown. It can be seen from the figure that the average latency of the online algorithm with six different sets of terminal numbers is much lower than the average latency of the offline algorithm. The average latency of the online algorithm is substantially 4 s. It depends on the end of transmission time of the last edge server, which is usually determined by the start transmission time and the transmission time of the edge server. From the point of view of average latency, online algorithms are more conducive to maintaining a lower degree of coupling for real-time maintenance services.

2-3 sub-service deployment model

Referring to FIGS. 6-8, the number of edge servers is shown as 3, 4, and 5, respectively; when the number of the services is 5, 10,20 and 30, and the number of the terminals is 5, 10,20 and 30, the optimal solution of service deployment is obtained through three algorithms of MOEA/D, PSO and GA. Function value of mu₁I₁+μ₂I₂Wherein: mu.s₁＝1，μ ₂50. It can be seen from the figure that the optimal solutions obtained by the three algorithms are consistent when the magnitude is small. When the amount of data is gradually increased, the optimal solution based on the MOEA/D algorithm is better than the other two algorithms. This is because the GA algorithm may be trapped in the local optimal solution, and in the case of the PSO algorithm being in a convergent state, since the particles all move toward the optimal solution, the late convergence speed is slow and the accuracy is not high.

Referring to fig. 6 to 8, it is shown that when the number of edge servers is 3, 4 and 5, respectively, where (fig. a), (fig. b) (fig. c) and (fig. D) are 5, 10,20 and 30 for the number of services, respectively, the optimal solution of service deployment is obtained through three algorithms, MOEA/D, PSO and GA, with the number of terminals according to 5, 10,20 and 30.

The invention adopts the coupling degree calculation of the centralized service and then selects the service with high coupling degree to carry out two-stage decoupling. Then, under the condition that the response time is minimized, the MOEA/D multi-objective optimization method is used for deploying the decoupled sub-services and the low-coupling-degree services on the edge server in a fair manner, the deployment result is closer to the optimal solution, the problem of response time optimization of fair deployment after centralized service decoupling in edge computing is solved well, the service communication cost can be reduced by 40%, the decoupling algorithm comprises an off-line stage and an on-line stage, and the algorithm completion time of the on-line stage can be reduced by 27% maximally compared with the algorithm completion time of the off-line stage.

The above examples are only for further illustration of the present invention and are not intended to limit the present invention, and all equivalent implementations of the present invention should be included within the scope of the claims of the present invention.

Claims

1. A response time optimization method for fair deployment after centralized service decoupling is characterized in that a service decoupling method and an MOEA/D algorithm-based sub-service are adopted for deployment, the decoupled sub-service and a low-coupling-degree service are deployed on an edge server and multi-objective optimization is carried out, so that the overall response time is shortest under the condition of ensuring the load fairness of the edge server, and the specific optimization comprises the following steps:

step l: constructing a system architecture and coupling degree model, a transmission service communication overhead model and a communication delay model of the edge cloud; step 2: calculating the coupling degree of all the services, judging the calculated coupling degree and a set coupling degree threshold value,

if the coupling degree is lower than the threshold value of the coupling degree, the deployment of the sub-services is directly carried out; if the coupling degree threshold value is higher, the service is decoupled.

And step 3: performing off-line stage decoupling and on-line stage decoupling on the coupling service higher than the coupling degree threshold, wherein the off-line stage decoupling is to perform service field-based feature extraction according to the subscription condition of each edge server to the high coupling service, and clustering the high coupling service by using a k-means-based method; the on-line stage decoupling is to use the off-line decoupled clustering result as the initial input of on-line clustering, and to use a streaming k-means based method to perform real-time clustering on the services with added or deleted fields;

and 4, step 4: deploying sub-services by using an MOEA/D algorithm according to the constructed coupling degree model, the transmission service communication overhead model and the communication delay model, wherein the sub-services comprise the low coupling service in the step 2 and the clustering output result of the two stages in the step 3;

2. The method for optimizing response time for fair deployment after centralized service decoupling according to claim l, wherein the implementation process of step l specifically includes the following steps:

step A1: construction of degree of coupling model

Field composition for edge server N_esEach edge server E_j(1≤j≤N_es) Subscribing a plurality of fields in one or more services according to the requirements of the users;

is provided with

If the value is l, otherwise, the value is 0, and the influence of different operations of the fields represented by the following formulas a to c on the service is respectively introduced

And

Wherein: w is a_lImpact coefficients for different operations;

Wherein: p is a radical of_kIs the modification frequency of field k;

step A2: construction of transport service communication overhead model

Represented by the following formula g:

will S_iIs divided into

Sub-services, i.e.

Partitioned sub-services

A size of

Transmits its sub-service

Overhead of communication

Represented by the following formula h:

wherein:

satisfy the requirement of

Edge server E_kSubscribed to sub-services

Fields of China, i.e. considered as subscribed to a sub-service

Its edge server E_kWhether or not sub-services are subscribed to

Then it is represented by the following formula i

Then, the edge server E_kAccepting sub-services offloaded from cloud servers

The required communication overhead is represented by the following j equation:

and satisfies the following constraints:

A＞0，

step A3: construction of communication delay model

Note the book

note the book

Is the propagation delay expressed by the following equation:

Will be set to infinity;

wherein:

signal-to-noise ratio; p_iIs the transmission power of terminal i; g (i, E)_j) For terminal i and edge server E_jA signal gain in between; n is a radical of_pIs the in-channel noise power;

wherein: e_iIs located next to the base station closest to terminal i.

3. The method for optimizing response time for fair deployment after centralized service decoupling according to claim l, wherein the implementation process of step 2 specifically includes: calculating the coupling degrees of all services by using a coupling degree model formula; judging whether the coupling degree of the service is higher than a given coupling degree threshold value or not, and if the coupling degree of the service is lower than the coupling degree threshold value M, namely low-coupling service, directly deploying the service; if the coupling degree is higher than the threshold value, the service is called as a high coupling service, the service needs to be decoupled, and then the service after decoupling, namely the deployment of the sub-service, is carried out.

4. The response time optimization method for fair deployment after centralized service decoupling according to claim 1, wherein the implementation process of step 3 specifically includes the following steps:

step B1: memory two-dimensional matrix

For service S_i(1≤i≤N_s) And the characteristic matrix stores field information of all edge servers subscribing the service represented by the following formula:

note the book

then, a two-dimensional matrix

Feature matrix represented by the following formula 3

Wherein,

for service S_iThe number of fields contained in;

according to the characteristic matrix

And the minimization objective function is expressed by the following equation 4:

step B2: taking the clustering result of the step Bl as the input of the online clustering of the step, namely, giving the clustering center after the t-th batch of data arrives, and batch of data

Wherein: o is_tWhen the t + l data arrives, the updated clustering center is expressed by the following formulas 5-6:

is the kth clustering center when the t-th batch processing arrives;

a number indicating the kth cluster;

wherein:

is a certain field data in the t-th batch of data; i.e. i_vTo represent

Belong to item i_vClustering;

5. the response time optimization method for fair deployment after centralized service decoupling according to claim l, wherein the implementation process of step 4 specifically includes the following steps:

step Cl: definition of S_subFor clustered partitioned sub-service and low coupling service sets, S_i(1≤i≤N_s) Represented by the following formula 10

The sub-services:

each divided sub-service is expressed by the following formula 11, and the union of the two sub-services is null:

then S_subCan be represented by the following formula 12:

Note the book

Indicates whether the condition i-C is satisfied, if so, the method

Otherwise

Wherein: c is a constant; i is a variable represented by the following formula 14:

put(s_j)＝i(15)；

wherein: put(s)_j)∈[0，N-1]And is

wherein: i S_subIs the sub-service set S_subThe number of services; e is N_esA set of edge servers; n is a radical of_tRefers to the number of terminals;

note t (E)_j，s_k) Is an edge represented by the following formula 17Edge server E_jHandling sub-service jobs_kRequired time period of (c):

wherein:

serving a child s_kThe size of (d);

in summary, the objective function for optimizing the operation balance of the edge server is represented by the following equation 20:

I₂＝min(t_max-t_min) (20)；

and the following constraint conditions are satisfied:

vis(i，s_k)∈{0，1}；

g(i，E_j)＞0；

N_p＞0；

s_k∈S_sub；

1≤i≤N_t，1≤j≤N_es，1≤k≤|S_sub|；

step C2: converting the multi-objective optimization problem into a plurality of single-objective optimization problems expressed by the following 2l formula by using a MOEA/D algorithm:

s.t.x∈Ω